PyNNDescent

A Python implementation of Nearest Neighbor Descent for k-neighbor-graph construction and approximate nearest neighbor (ANN) search. It targets 80%-100% accuracy with fast performance and supports a wide variety of distance metrics. It is an open-source library.

Overview

PyNNDescent provides a Python implementation of Nearest Neighbor Descent for k-neighbor-graph construction and approximate nearest neighbor search. It is based on a 2011 ACM paper by Wei Dong, Moses Charikar, and Kai Li and focuses on high-accuracy ANN search.

Key Features

Fast Performance: Among the fastest ANN libraries
Easy Installation: pip and conda installable, no platform issues
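Flexible: Supports a wide variety of distance metrics
High Accuracy: Targets 80%-100% accuracy rate
Scikit-learn Integration: Provides PyNNDescentTransformer, a drop-in replacement for scikit-learn's KNeighborsTransformer
Pure Python: No compilation step at install time (performance-critical code is JIT-compiled by Numba at runtime)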

Performance Characteristics

Performs solidly among the top-performing libraries in ann-benchmarks
Fast approximate nearest neighbor queries
Efficient k-neighbor-graph construction
Good accuracy/speed trade-off

Technical Approach

Nearest Neighbor Descent

Core algorithm from the 2011 ACM paper by Wei Dong, Moses Charikar, and Kai Li
Efficient graph construction
Iterative refinement
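
The iterative refinement idea can be sketched in plain Python. This is a simplified illustration and not PyNNDescent's actual implementation: it starts from a random graph, omits sampling and other optimizations, and the names `sq_dist`, `try_insert`, and `nn_descent` are hypothetical helpers introduced here. Each round lets any two points that share a neighbor (forward or reverse) test each other as candidates, and stops once no neighbor list improves.

```python
import bisect
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def try_insert(neighbors, dist, j, k):
    """Insert (dist, j) into a sorted neighbor list; return 1 if the list changed."""
    if any(idx == j for _, idx in neighbors):
        return 0
    if len(neighbors) >= k and dist >= neighbors[-1][0]:
        return 0
    bisect.insort(neighbors, (dist, j))
    del neighbors[k:]  # keep only the k closest entries
    return 1


def nn_descent(points, k, max_iters=10, seed=0):
    """Simplified NN-descent: returns, per point, a sorted list of (dist, index)."""
    rng = random.Random(seed)
    n = len(points)
    # Start from a random k-neighbor graph (assumption: no tree-based init here).
    graph = []
    for i in range(n):
        others = rng.sample([j for j in range(n) if j != i], k)
        graph.append(sorted((sq_dist(points[i], points[j]), j) for j in others))

    for _ in range(max_iters):
        # Snapshot forward and reverse neighbors of every point.
        candidates = [set(j for _, j in graph[i]) for i in range(n)]
        for i in range(n):
            for _, j in graph[i]:
                candidates[j].add(i)
        updates = 0
        # Local join: points sharing a neighbor try each other as neighbors.
        for local in candidates:
            members = sorted(local)
            for pos, a in enumerate(members):
                for b in members[pos + 1:]:
                    d = sq_dist(points[a], points[b])
                    updates += try_insert(graph[a], d, b, k)
                    updates += try_insert(graph[b], d, a, k)
        if updates == 0:  # converged: no list improved this round
            break
    return graph
```

Calling `nn_descent(points, k=5)` on a list of coordinate tuples returns one sorted neighbor list per point. The result is approximate: nothing guarantees every true neighbor is found.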

Enhancements

Random Projection Trees: Used for initialization
Graph Diversification: Prunes the longest edge of any triangle in the graph
Optimized Search: Efficient query algorithms
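
As a rough illustration of the triangle-pruning idea behind graph diversification, the sketch below keeps a neighbor only if no already-kept, closer neighbor is nearer to it than the point itself is. It is a deterministic simplification under assumed Euclidean distance; the function name `diversify` and the dict-based graph layout are hypothetical and do not mirror the library's internals.

```python
import math


def diversify(points, graph):
    """Prune edges that are the longest side of a triangle.

    graph maps a point index to a list of neighbor indices.
    """
    pruned = {}
    for i, neighbors in graph.items():
        # Visit neighbors from nearest to farthest.
        ordered = sorted(neighbors, key=lambda j: math.dist(points[i], points[j]))
        kept = []
        for j in ordered:
            d_ij = math.dist(points[i], points[j])
            # Edge i-j is the longest side of triangle (i, c, j) when some kept
            # neighbor c is closer to j than i is; drop the edge in that case.
            if all(math.dist(points[c], points[j]) >= d_ij for c in kept):
                kept.append(j)
        pruned[i] = kept
    return pruned
```

For example, with points on a line at 0, 1, and 2, the edge from 0 to 2 is dropped because 2 is still reachable through 1. This is the reason the pruned graph remains useful for search.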

Distance Metrics

Supports an extensive list of metrics:

Euclidean, Manhattan, Chebyshev
Minkowski, Hamming, Cosine
Correlation, Jaccard, Dice
And many more specialized metrics
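
For reference, the standard textbook definitions of a few of the metrics named above can be written in plain Python as follows. These are illustrative definitions only, not the library's optimized implementations. In PyNNDescent itself the metric is normally chosen by name through the `metric` keyword of `NNDescent`.

```python
import math


def euclidean(x, y):
    """Straight-line (L2) distance."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def manhattan(x, y):
    """Sum of absolute coordinate differences (L1)."""
    return sum(abs(a - b) for a, b in zip(x, y))


def chebyshev(x, y):
    """Largest absolute coordinate difference (L-infinity)."""
    return max(abs(a - b) for a, b in zip(x, y))


def cosine(x, y):
    """Cosine distance: 1 minus the cosine of the angle between x and y."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return 1.0 - dot / (norm_x * norm_y)


def jaccard(x, y):
    """Jaccard distance between binary vectors (nonzero means 'present')."""
    both = sum(1 for a, b in zip(x, y) if a and b)
    either = sum(1 for a, b in zip(x, y) if a or b)
    return 0.0 if either == 0 else 1.0 - both / either
```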

Installation

PyPI

pip install pynndescent

Conda

conda install pynndescent

Scikit-learn Integration

KNeighborsTransformer support via PyNNDescentTransformer
Compatible with sklearn pipelines
Fits into existing ML workflows
Drop-in replacement for sklearn's KNeighborsTransformer

Use Cases

High-accuracy ANN search (80%+ recall)
K-neighbor graph construction
Dimensionality reduction
Clustering preprocessing
Manifold learning
Similarity search
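
The recall figure quoted above is usually measured by comparing the approximate neighbors with exact (brute-force) neighbors for the same queries. A small sketch of that calculation follows; the function name `recall_at_k` is a hypothetical helper, not part of PyNNDescent.

```python
def recall_at_k(approx_ids, exact_ids):
    """Fraction of true neighbors that the approximate search also returned.

    Both arguments are sequences of neighbor-index lists, one list per query.
    """
    hits = sum(len(set(a) & set(e)) for a, e in zip(approx_ids, exact_ids))
    total = sum(len(e) for e in exact_ids)
    return hits / total
```

For instance, if the approximate result matches 5 of the 6 true neighbors across two queries, the recall is 5/6, or about 83%.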

API

Simple Python interface:

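from pynndescent import NNDescent

# data and query_data are 2-D arrays of shape (n_samples, n_features)
index = NNDescent(data)  # builds the approximate k-neighbor graph index
# query returns (indices, distances) for the 10 approximate nearest neighbors
neighbors, distances = index.query(query_data, k=10)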

Comparison to Alternatives

Advantages

Easy installation (pure Python)
No platform-specific issues

Information

Website: github.com

Published: Mar 6, 2026

Tags

#open-source #python #ann

Similar Products

Sentence Transformers v3.0

Major update to the Sentence Transformers library introducing a new SentenceTransformerTrainer for easier fine-tuning, multi-GPU support, improved loss logging, and access to 15,000+ pre-trained models on HuggingFace.

PUFFINN

Parameterless and Universal Fast Finding of Nearest Neighbors - an LSH-based library for approximate nearest neighbor search with probabilistic guarantees. Features a parameterless design requiring only memory budget and result quality specifications.

FLANN

Fast Library for Approximate Nearest Neighbors containing a collection of algorithms optimized for nearest neighbor search in high dimensional spaces with automatic algorithm and parameter selection.

PageANN

Disk-based approximate nearest neighbor search framework with page-aligned graph structure. Achieves 1.85x-10.83x higher throughput than state-of-the-art methods through optimized SSD utilization.

PipeANN

Low-latency, billion-scale updatable graph-based vector store on SSD. Achieves <1ms search latency with 10x less memory than in-memory indexes through alignment of best-first search with SSD characteristics.

Gensim

Gensim is a Python library for topic modeling and vector space modeling, providing tools to generate high-dimensional vector embeddings from text data. These embeddings can be stored and efficiently searched in vector databases, making Gensim directly relevant to vector search use cases.
