• Home
  • Categories
  • Pricing
  • Submit
    Built with
    Ever Works
    Ever Works

    Connect with us

    Stay Updated

    Get the latest updates and exclusive content delivered to your inbox.

    Product

    • Categories
    • Pricing
    • Help

    Clients

    • Sign In
    • Register
    • Forgot password?

    Company

    • About Us
    • Admin
    • Sitemap

    Resources

    • Blog
    • Submit
    • API Documentation
    All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
    Copyright © 2025 Awesome Vector Databases. All rights reserved.·Terms of Service·Privacy Policy·Cookies
    Decorative pattern
    Decorative pattern
    1. Home
    2. Sdks Libraries
    3. Apache Lucene

    Apache Lucene

    Apache Lucene is a high-performance, full-featured open-source text search engine library written in Java. It provides approximate nearest neighbor (ANN) vector search capabilities using Hierarchical Navigable Small World (HNSW) graphs, enabling semantic search on high-dimensional embedding vectors up to 1024 dimensions by default (extendable via custom codecs).

    Overview

    Apache Lucene is an open-source information retrieval software library originally written in Java. It is widely used as the foundation for search engines like Apache Solr and Elasticsearch. Lucene supports both traditional lexical (keyword) search and modern vector-based semantic search.

    Vector Search Capabilities

    • HNSW Algorithm: Implements Hierarchical Navigable Small World Graphs for approximate nearest neighbor (ANN) search
    • KnnFloatVectorField: Field type for indexing floating-point embedding vectors with configurable similarity functions (e.g., cosine similarity)
    • KnnFloatVectorQuery: Query type for performing KNN searches on indexed vector fields
    • Default dimension limit: 1024 dimensions (can be extended via custom KnnVectorsFormat codecs, e.g., HighDimensionKnnVectorsFormat for 1536+ dimensions)
    • Similarity functions: Supports cosine similarity and other distance metrics via VectorSimilarityFunction
    • Codec customization: Lucene95Codec with Mode.BEST_SPEED for optimized vector indexing performance

    Index Storage

    • ByteBuffersDirectory: Keeps index files in heap memory for fast access
    • FSDirectory: Stores index files on the file system for persistence
    • IndexWriter: Handles document indexing with configurable analyzers and codecs

    Use Cases

    • Semantic search over unstructured data (text, images, video)
    • Embedding-based document retrieval with deep learning models (e.g., OpenAI embeddings)
    • High-dimensional vector similarity search for AI/ML applications
    • Hybrid search combining lexical and semantic relevance

    Pricing

    Free and open-source under the Apache License 2.0.

    Surveys

    Loading more......

    Information

    Websitelucene.apache.org
    PublishedApr 4, 2026

    Categories

    1 Item
    Sdks Libraries

    Tags

    3 Items
    #open-source#vector-search#full-text-search

    Similar Products

    6 result(s)

    Tantivy

    Tantivy is a full-text search engine library inspired by Apache Lucene, offering fast and scalable similarity search capabilities. While primarily focused on text, it supports efficient vector-based similarity searches, making it useful for vector search tasks.

    Typesense

    Typesense is an open-source search engine that supports hybrid search, including vector search capabilities, providing an alternative to proprietary vector search solutions.

    Deep Searcher

    Deep Searcher is a local open-source deep research solution that integrates Milvus and LangChain to provide advanced vector search and retrieval capabilities using open-source models.

    RediSearch

    RediSearch is a Redis module that provides high-performance vector search and similarity search capabilities on top of Redis, enabling advanced search and retrieval features for AI and data applications.

    arroy

    Arroy is an open-source library for efficient similarity search and management of vector embeddings, useful in vector database systems.

    Bleve

    Bleve is an open-source search library with experimental support for vector search, enabling hybrid search and retrieval in applications.