AIWiki
Malaysia

Search Results

12 results for RAG

Companies & Tools

Chroma

An open-source vector database designed for embedding-based applications, optimised for developer ergonomics and increasingly for large-scale serverless retrieval through a 2025 Rust-core rewrite.

4 min readUpdated May 2026
Companies & Tools

Cohere

Cohere is a Canadian AI company specialising in enterprise large language models, offering Command, Embed, and Rerank model families alongside secure deployment infrastructure designed for regulated industries.

6 min readUpdated May 2026
Infrastructure

LangChain

LangChain is an open-source framework for building applications powered by large language models, providing composable abstractions for chaining LLM calls with tools, memory, and data retrieval in Python and JavaScript.

6 min readUpdated May 2026
Companies & Tools

LlamaIndex

LlamaIndex is an open-source Python and TypeScript framework for building retrieval-augmented and agentic AI applications over private data sources.

6 min readUpdated May 2026
Infrastructure

Parameter-Efficient Fine-Tuning

A family of techniques that adapts a pretrained language or vision model to a downstream task by training only a small fraction of its parameters, dramatically reducing compute, memory, and storage requirements compared to full fine-tuning.

5 min readUpdated May 2026
Companies & Tools

Pinecone

Pinecone is a managed, cloud-native vector database designed for storing high-dimensional embeddings and serving low-latency similarity search for retrieval-augmented AI applications.

5 min readUpdated May 2026
Companies & Tools

Qdrant

An open-source, Rust-based vector database and similarity search engine designed for high-performance storage and retrieval of high-dimensional embeddings, with support for hybrid search and multitenant deployments.

5 min readUpdated May 2026
Applications

Retrieval-Augmented Generation

A technique that enhances large language model outputs by retrieving relevant documents from an external knowledge base at inference time, grounding responses in up-to-date and domain-specific information.

6 min readUpdated May 2026
Applications

Semantic Search

Semantic search is a search paradigm that retrieves results based on the meaning and intent of a query rather than exact keyword matches, using vector embeddings to measure conceptual similarity between text.

6 min readUpdated May 2026
Infrastructure

Vector Database

A specialised database system that stores data as high-dimensional numerical vectors and enables fast approximate nearest-neighbour search, forming the retrieval backbone of semantic search and RAG systems.

7 min readUpdated May 2026
Companies & Tools

Weaviate

An open-source, cloud-native vector database that combines vector similarity search with structured filtering, GraphQL APIs, and built-in vectorisation for AI applications.

5 min readUpdated May 2026
Foundations

Zero-Shot Learning

Zero-shot learning is a machine learning paradigm in which a model makes accurate predictions on categories it has never seen during training by leveraging semantic descriptions or attribute representations.

6 min readUpdated May 2026