NucliaDB
NucliaDB is a commercial vector database that enables semantic and vector search across unstructured data, supporting advanced AI and ML-powered applications.
About this tool
NucliaDB
NucliaDB is an open-source AI-powered search database designed for Retrieval-Augmented Generation (RAG) and advanced semantic/vector search across unstructured data. It supports hybrid search capabilities and is tailored for modern AI and ML-powered applications.
Features
- Hybrid Search: Supports vector, full-text, and graph search indexes.
- Unstructured Data Support: Designed for storing and searching unstructured data such as text, files, and more.
- Multi-Tenant: Built to handle indexing and searching across large, multi-tenant datasets.
- Data Types: Store text, files, vectors, labels, annotations, links, conversations, and metadata.
- Semantic Search: Perform semantic searches using vectors and NLP, allowing for similarity search beyond exact keyword matches.
- Text Search: Traditional keyword and fuzzy text search.
- Data Export: Export data in formats compatible with NLP pipelines (e.g., HuggingFace, PyTorch datasets).
- Field-Level Indexing: Index fields, paragraphs, and semantic sentences.
- Cloud Integration: Cloud data and insight extraction using the Nuclia Understanding API™.
- ML Model Training: Connect to Nuclia Learning API™ for ML model training.
- Role-Based Security: Role-based access with upstream proxy authentication validation.
- Storage Backends: Uses PostgreSQL for the storage layer; supports blob storage with S3-compatible API, Google Cloud Storage, and Azure Blob Storage.
- Replication & Distributed Search: Replication of index storage and distributed search capabilities.
- Cloud-Native: Designed for cloud deployment and scalability.
- API Access: Exposes an API for integration into applications.
- Open Source: Licensed under AGPLv3.
- Written in: Rust and Python.
Pricing
- Open Source: NucliaDB is open-source under the AGPLv3 license.
- Commercial Offerings: The company offers NucliaDB as a service (Nuclia Cloud) and APIs for data normalization and enrichment (Nuclia Learning API, Nuclia Understanding API). Details on pricing for these services are not specified in the provided content.
Links
Loading more......
Information
Categories
Tags
Similar Products
6 result(s)Meilisearch offers vector search capabilities as part of its search engine, enabling hybrid and semantic search for AI applications.
Vectara is a commercial vector database and search platform that enables semantic and hybrid AI-powered search using vector embeddings.
Marqo is an open-source neural search engine that leverages vector representations to enable semantic search over textual data. It abstracts vector database complexity and provides a high-level interface for building advanced search applications.
Denser Retriever is a vector-based retrieval system designed for efficient similarity search and information access in AI and ML workloads.
Infinity is an AI-native database built for LLM applications, offering fast hybrid search of dense vectors, sparse vectors, tensors, and full-text data.
AstraDB (also known as Astra DB by DataStax) is a cloud-native vector database built on Apache Cassandra, supporting real-time AI applications with scalable vector search. It is designed for large-scale deployments and features a user-friendly Data API, robust vector capabilities, and automation for AI-powered applications.