



SDKs for Python, Rust, Go, and JS for fast embedding generation via ONNX Runtime, with batch embedding of list inputs and async multiprocessing support. Suited to application integration and LangChain embedding modules; claims a 12x CPU speedup over PyTorch-based libraries, with no GPU or external API dependency.
FastEmbed is Qdrant's lightweight Python library for generating embeddings efficiently without GPU requirements or heavy dependencies.
Lightweight: Few external dependencies, no PyTorch (uses ONNX Runtime)
Fast: ONNX Runtime inference runs faster on CPU than PyTorch-based libraries
Accurate: Its Flag Embedding models are reported to outperform OpenAI Ada-002
Serverless-Friendly: Small footprint ideal for AWS Lambda
pip install fastembed
# With GPU support
pip install fastembed-gpu
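
Once the package is installed, batch embedding of a list of strings might look like the sketch below. It assumes the `TextEmbedding` class and its `embed()` method, including the `batch_size` and `parallel` parameters, behave as in recent fastembed releases. The wrapper name `embed_documents` is hypothetical and exists only for illustration. The import is deferred so the helper can also be exercised with a stand-in model.

```python
from typing import Any, List, Optional, Sequence


def embed_documents(
    documents: Sequence[str],
    model: Optional[Any] = None,
    batch_size: int = 256,
    parallel: Optional[int] = None,
) -> List[List[float]]:
    """Embed a list of strings and return one plain-float vector per string.

    `embed_documents` is a hypothetical helper, not part of fastembed.
    """
    if model is None:
        # Deferred import: fastembed is only needed when no model is injected.
        from fastembed import TextEmbedding

        # Assumption: the no-argument constructor loads the library's default
        # model, downloading it on first use.
        model = TextEmbedding()

    # embed() yields vectors lazily; batch_size controls how many strings are
    # encoded per batch, and parallel (when set) requests data-parallel
    # encoding across worker processes.
    vectors = model.embed(list(documents), batch_size=batch_size, parallel=parallel)
    return [[float(x) for x in vec] for vec in vectors]
```

Calling `embed_documents(["first text", "second text"])` would return two vectors whose length depends on the chosen model. Passing `parallel=0` is assumed to use all available cores, which is mainly worthwhile for large offline batches.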
Open-source on GitHub: qdrant/fastembed
Supports LangChain integration
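
As a rough illustration of the LangChain integration, the sketch below ranks documents against a query by cosine similarity. It assumes the `FastEmbedEmbeddings` class from `langchain_community` (installed separately) and relies only on the standard LangChain embedding methods `embed_documents` and `embed_query`. The helper names `cosine` and `rank_documents` are hypothetical.

```python
import math
from typing import Any, List, Optional, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_documents(
    query: str,
    documents: Sequence[str],
    embedder: Optional[Any] = None,
) -> List[Tuple[str, float]]:
    """Return (document, score) pairs sorted by similarity to the query.

    `rank_documents` is a hypothetical helper written for this example.
    """
    if embedder is None:
        # Deferred import; assumes langchain-community and fastembed are installed.
        from langchain_community.embeddings.fastembed import FastEmbedEmbeddings

        embedder = FastEmbedEmbeddings()

    doc_vectors = embedder.embed_documents(list(documents))
    query_vector = embedder.embed_query(query)
    scored = [(doc, cosine(query_vector, vec)) for doc, vec in zip(documents, doc_vectors)]
    # Highest similarity first.
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

Because the helper only depends on the two embedding methods, any LangChain-compatible embedder could be passed in place of the FastEmbed one. In a real application a vector store would normally replace the in-memory sort.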