Search Results
12 results for “rag”
Chroma
An open-source vector database designed for embedding-based applications, optimised for developer ergonomics and increasingly for large-scale serverless retrieval through a 2025 Rust-core rewrite.
Cohere
Cohere is a Canadian AI company specialising in enterprise large language models, offering Command, Embed, and Rerank model families alongside secure deployment infrastructure designed for regulated industries.
LangChain
LangChain is an open-source framework for building applications powered by large language models, providing composable abstractions for chaining LLM calls with tools, memory, and data retrieval in Python and JavaScript.
LlamaIndex
LlamaIndex is an open-source Python and TypeScript framework for building retrieval-augmented and agentic AI applications over private data sources.
Parameter-Efficient Fine-Tuning
A family of techniques that adapts a pretrained language or vision model to a downstream task by training only a small fraction of its parameters, dramatically reducing compute, memory, and storage requirements compared to full fine-tuning.
Pinecone
Pinecone is a managed, cloud-native vector database designed for storing high-dimensional embeddings and serving low-latency similarity search for retrieval-augmented AI applications.
Qdrant
An open-source, Rust-based vector database and similarity search engine designed for high-performance storage and retrieval of high-dimensional embeddings, with support for hybrid search and multitenant deployments.
Retrieval-Augmented Generation
A technique that enhances large language model outputs by retrieving relevant documents from an external knowledge base at inference time, grounding responses in up-to-date and domain-specific information.
Semantic Search
Semantic search is a search paradigm that retrieves results based on the meaning and intent of a query rather than exact keyword matches, using vector embeddings to measure conceptual similarity between text.
Vector Database
A specialised database system that stores data as high-dimensional numerical vectors and enables fast approximate nearest-neighbour search, forming the retrieval backbone of semantic search and RAG systems.
Weaviate
An open-source, cloud-native vector database that combines vector similarity search with structured filtering, GraphQL APIs, and built-in vectorisation for AI applications.
Zero-Shot Learning
Zero-shot learning is a machine learning paradigm in which a model makes accurate predictions on categories it has never seen during training by leveraging semantic descriptions or attribute representations.