Vector Database

A specialised database system that stores data as high-dimensional numerical vectors and enables fast approximate nearest-neighbour search, forming the retrieval backbone of semantic search and RAG systems.

7 min read | Last updated May 2026 | Infrastructure

A vector database is a data management system designed to store, index, and query high-dimensional vectors — numerical representations of data objects such as text, images, audio, or code — with a primary operation of finding the vectors most similar to a given query vector. Unlike relational databases that retrieve rows matching exact field values, or full-text search engines that rank documents by keyword overlap, vector databases retrieve items by semantic or structural similarity in a continuous embedding space. They are a foundational component of modern AI infrastructure, enabling applications such as retrieval-augmented generation (RAG), semantic search, recommendation systems, anomaly detection, and multimodal retrieval.[^1]

Embeddings and Vector Representations

The utility of a vector database depends on the quality of the embeddings stored within it. An embedding is a dense numerical vector produced by a neural network — typically a transformer encoder — that maps an object (a sentence, a product description, a medical image, a piece of code) to a point in a high-dimensional space. Objects that are semantically or perceptually similar are mapped to nearby points; dissimilar objects are placed far apart.

Embedding dimensionality varies by model. OpenAI's text-embedding-3-large model produces 3,072-dimensional vectors by default. Sentence-BERT variants commonly produce 384- or 768-dimensional vectors. Image embeddings from CLIP models are typically 512 or 768 dimensions, with some variants producing 1,024. The choice of embedding model determines the quality of the similarity relationships encoded in the vector space, and the dimensionality directly affects storage and memory cost.
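To make the cost of dimensionality concrete, the sketch below estimates the space taken by the raw vectors alone. It assumes 32-bit floats (4 bytes per value) and ignores index overhead and metadata; the function name `raw_vector_bytes` is illustrative only.

```python
def raw_vector_bytes(num_vectors: int, dim: int, bytes_per_value: int = 4) -> int:
    """Bytes needed to hold the raw vectors, excluding any index structures."""
    return num_vectors * dim * bytes_per_value


# Compare the dimensionalities mentioned above for one million vectors.
for dim in (384, 768, 3072):
    gigabytes = raw_vector_bytes(1_000_000, dim) / 1e9
    print(f"{dim} dimensions: {gigabytes:.2f} GB per million vectors")
```

Under these assumptions, one million 384-dimensional vectors need about 1.5 GB, while one million 3,072-dimensional vectors need about 12.3 GB.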

Indexing and Search Algorithms

Searching for the nearest neighbour to a query vector in a high-dimensional space is computationally challenging. An exact brute-force search comparing the query against every stored vector is accurate but has linear time complexity, becoming impractical at the scale of millions or billions of vectors.
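A minimal sketch of exact brute-force search, using Euclidean distance and a small in-memory dictionary as a stand-in for the stored vectors. The ids and values are made up for illustration.

```python
import heapq
import math


def exact_nearest(query, vectors, k=1):
    """Return the ids of the k stored vectors closest to `query`.

    Every stored vector is compared against the query, so the work
    grows linearly with the number of vectors.
    """
    return heapq.nsmallest(k, vectors, key=lambda vid: math.dist(query, vectors[vid]))


# Hypothetical two-dimensional "embeddings".
store = {"a": [0.0, 0.0], "b": [1.0, 1.0], "c": [5.0, 5.0]}
print(exact_nearest([0.9, 1.2], store, k=2))
```

The result is always correct, but each query touches every stored vector, which is the cost that ANN indexes are designed to avoid.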

Vector databases address this using Approximate Nearest Neighbour (ANN) algorithms, which trade a small reduction in recall for dramatic gains in query speed.
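Recall in this context is the fraction of the true nearest neighbours that the approximate search actually returns. The sketch below shows the calculation with hypothetical document ids; in practice the exact result would come from a brute-force search over the same data.

```python
def recall_at_k(approx_ids, exact_ids):
    """Fraction of the exact top-k results that also appear in the approximate results."""
    exact = set(exact_ids)
    if not exact:
        raise ValueError("exact_ids must not be empty")
    return len(exact & set(approx_ids)) / len(exact)


exact_top5 = ["d1", "d2", "d3", "d4", "d5"]   # ground truth from an exact search
approx_top5 = ["d1", "d2", "d4", "d5", "d9"]  # what an ANN index returned
print(recall_at_k(approx_top5, exact_top5))
```

Here the approximate search found four of the five true neighbours, giving a recall of 0.8.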

HNSW (Hierarchical Navigable Small World) is the most widely deployed ANN algorithm. It builds a multi-layer proximity graph: the bottom layer contains every vector, and each higher layer contains a progressively sparser subset connected by longer-range links. Search starts at the top layer, moves greedily toward the query, and descends layer by layer, narrowing the search region at each level. Query latency grows roughly logarithmically with dataset size, and a well-tuned index typically delivers high recall (95–99%) for most workloads.[^2]
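The layered descent can be illustrated with a deliberately simplified sketch. It is not a full HNSW implementation: the graph is hand-built rather than constructed by insertion, and each layer uses plain greedy routing with a single candidate, whereas real HNSW keeps a candidate list and applies neighbour-selection heuristics. All data and names are illustrative.

```python
import math


def greedy_search(query, vectors, neighbours, entry):
    """Walk one graph layer from `entry`, always moving to the neighbour
    closest to `query`; stop when no neighbour is closer than the current node."""
    current = entry
    current_d = math.dist(query, vectors[current])
    while True:
        best, best_d = current, current_d
        for node in neighbours[current]:
            d = math.dist(query, vectors[node])
            if d < best_d:
                best, best_d = node, d
        if best == current:
            return current
        current, current_d = best, best_d


def layered_search(query, vectors, layers, entry):
    """Search each layer in turn, from sparse to dense, reusing the
    result of one layer as the entry point of the next."""
    node = entry
    for neighbours in layers:
        node = greedy_search(query, vectors, neighbours, node)
    return node


# Five one-dimensional points; ids double as positions for readability.
vectors = {0: [0.0], 1: [1.0], 2: [2.0], 3: [3.0], 4: [4.0]}
layers = [
    {0: [4], 4: [0]},                                      # sparse top layer with a long-range link
    {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]},     # bottom layer with every vector
]
print(layered_search([3.2], vectors, layers, entry=0))
```

In this toy example the top layer jumps from node 0 straight to node 4, and the bottom layer then needs only one more step to reach node 3, the true nearest neighbour.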

IVF (Inverted File Index) divides the vector space into clusters, assigns each vector to its nearest cluster centroid, and searches only the most relevant clusters at query time. IVF variants such as IVF-PQ (with product quantisation for compression) are commonly used for very large-scale deployments where memory efficiency is critical.
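A toy sketch of the IVF idea follows. It assumes the cluster centroids are already known (in practice they are usually learned with k-means) and omits product quantisation. The parameter name `nprobe`, meaning the number of clusters searched per query, follows common usage in libraries such as FAISS; everything else is illustrative.

```python
import math


def nearest_centroids(vec, centroids, n):
    """Indices of the n centroids closest to `vec`, nearest first."""
    order = sorted(range(len(centroids)), key=lambda i: math.dist(vec, centroids[i]))
    return order[:n]


def build_ivf(vectors, centroids):
    """Assign every vector id to the inverted list of its nearest centroid."""
    lists = {i: [] for i in range(len(centroids))}
    for vid, vec in vectors.items():
        lists[nearest_centroids(vec, centroids, 1)[0]].append(vid)
    return lists


def ivf_search(query, vectors, centroids, lists, k=1, nprobe=1):
    """Scan only the `nprobe` clusters whose centroids are closest to the query."""
    candidates = [vid
                  for c in nearest_centroids(query, centroids, nprobe)
                  for vid in lists[c]]
    candidates.sort(key=lambda vid: math.dist(query, vectors[vid]))
    return candidates[:k]


centroids = [[0.0, 0.0], [10.0, 10.0]]  # assumed to be precomputed
vectors = {
    "a": [1.0, 0.0], "b": [0.0, 2.0],
    "c": [9.0, 10.0], "d": [11.0, 9.0],
    "e": [4.9, 4.9],  # sits near the boundary between the two clusters
}
lists = build_ivf(vectors, centroids)
query = [5.2, 5.2]
print(ivf_search(query, vectors, centroids, lists, nprobe=1))
print(ivf_search(query, vectors, centroids, lists, nprobe=2))
```

With `nprobe=1` the search scans only the second cluster and misses vector `e`, the true nearest neighbour, which lies just across the cluster boundary; with `nprobe=2` it is found. Raising `nprobe` improves recall at the cost of scanning more vectors.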

FAISS (Facebook AI Similarity Search) is an open-source library from Meta that implements multiple ANN algorithms and serves as the retrieval backend for many vector database products.

Similarity Metrics

Vector databases support multiple distance metrics to quantify similarity:

| Metric | Formula | Best for |
|--------|---------|----------|
| Cosine similarity | Normalised dot product | Text embeddings, direction matters |
| Euclidean distance | L2 norm of the difference | Spatial data, magnitude matters |
| Dot product | Raw dot product | Unit-normalised embeddings (e.g., OpenAI text-embedding models) |
| Hamming distance | Count of differing bits (bit-level XOR) | Binary embeddings |

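Most NLP applications use cosine similarity because it measures directional alignment in embedding space regardless of vector magnitude, so results are unaffected by differences in vector length. When vectors are already normalised to unit length, the dot product produces the same ranking as cosine similarity.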
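The four metrics can be written out in a few lines of plain Python. This is an illustrative sketch rather than how a database computes them internally; binary embeddings are represented here as Python integers, which is an assumption made for brevity.

```python
import math


def dot_product(a, b):
    return sum(x * y for x, y in zip(a, b))


def cosine_similarity(a, b):
    """Dot product divided by the product of the two vector lengths."""
    return dot_product(a, b) / (math.sqrt(dot_product(a, a)) * math.sqrt(dot_product(b, b)))


def euclidean_distance(a, b):
    """L2 norm of the difference between the two vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def hamming_distance(a: int, b: int) -> int:
    """Number of bit positions at which two binary codes differ."""
    return bin(a ^ b).count("1")


a, b = [3.0, 4.0], [6.0, 8.0]      # same direction, different magnitude
print(cosine_similarity(a, b))     # direction only
print(euclidean_distance(a, b))    # sensitive to magnitude
print(dot_product(a, b))           # grows with magnitude
print(hamming_distance(0b1011, 0b0010))
```

The two example vectors point in the same direction, so their cosine similarity is 1 even though they are 5 units apart in Euclidean terms.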

Leading Vector Database Systems

The vector database market expanded rapidly from 2022 onward, producing a diverse set of specialised systems.

Pinecone is a fully managed cloud-native vector database. It abstracts all infrastructure management, offering a simple API for upsert and query operations. Pinecone introduced Dedicated Read Nodes in public preview in late 2025 for predictable performance at high query volumes, and its Pinecone Assistant product (generally available January 2025) bundles chunking, embedding, retrieval, and answer generation behind a single endpoint.[^3]

Weaviate is an open-source vector database with native hybrid search (combining vector similarity and BM25 keyword search), support for multimodal data types, and a modular architecture for plugging in embedding models directly. It scales horizontally by distributing shards across a cluster.

Qdrant is an open-source system written in Rust, optimised for filtered vector search, meaning queries that combine an ANN search with structured metadata filters. It is available as a self-hosted deployment or via Qdrant Cloud. Benchmarks from 2025 report Qdrant achieving 20 ms p95 latency at 15,000 queries per second on billion-vector datasets, although such figures depend on hardware, index configuration, and filter selectivity.
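The idea of filtered vector search can be sketched without any particular engine. The example below is not Qdrant's API: it applies the metadata filter first and then ranks the survivors with an exact scan, whereas production systems apply the filter during index traversal. The field names (`lang`, `year`) and the `must` argument are hypothetical.

```python
import math


def filtered_search(query, points, must, k=3):
    """Return ids of the k nearest points whose payload matches every
    field/value pair in `must`."""
    matches = [p for p in points
               if all(p["payload"].get(field) == value for field, value in must.items())]
    matches.sort(key=lambda p: math.dist(query, p["vector"]))
    return [p["id"] for p in matches[:k]]


points = [
    {"id": 1, "vector": [0.10, 0.90], "payload": {"lang": "ms", "year": 2024}},
    {"id": 2, "vector": [0.20, 0.80], "payload": {"lang": "en", "year": 2024}},
    {"id": 3, "vector": [0.90, 0.10], "payload": {"lang": "ms", "year": 2023}},
    {"id": 4, "vector": [0.15, 0.85], "payload": {"lang": "ms", "year": 2023}},
]
query = [0.2, 0.8]
print(filtered_search(query, points, must={"lang": "ms"}, k=2))
print(filtered_search(query, points, must={"lang": "ms", "year": 2024}))
```

Point 2 is the closest vector overall, but it is excluded because its payload does not satisfy the filter.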

pgvector is a PostgreSQL extension that adds vector storage and ANN search to an existing relational database. It allows teams already using PostgreSQL to add vector search without introducing a separate data store, at the cost of some performance relative to purpose-built systems.

Chroma is an open-source, lightweight vector database commonly used for local development and small-scale RAG prototypes. Its simple Python-first API makes it popular in research and educational settings.

Milvus is an open-source vector database developed by Zilliz, designed for large-scale enterprise deployments with heterogeneous index types and tiered storage.

Hybrid Search

Pure vector search excels at semantic similarity but may miss documents containing precise technical terms, named entities, or numeric identifiers that appear verbatim in the query. Hybrid search combines vector similarity scores with traditional keyword (BM25) scores, then fuses and reranks the merged result set to produce retrieval that is both semantically aware and precise. Weaviate, Qdrant, and Elasticsearch all offer native hybrid search capabilities. Hybrid retrieval has become a common default architecture for production RAG pipelines.
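One simple way to merge the two result sets is a weighted sum of normalised scores, sketched below. This is an assumption made for illustration, not the method of any specific product; rank-based schemes such as reciprocal rank fusion are a common alternative. The weight `alpha` and the document ids are hypothetical.

```python
def min_max(scores):
    """Rescale scores to the range [0, 1] so the two score types are comparable."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {doc: 1.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def hybrid_rank(vector_scores, keyword_scores, alpha=0.5):
    """Rank documents by alpha * vector score + (1 - alpha) * keyword score.

    A document missing from one result set contributes 0 for that component.
    """
    v, kw = min_max(vector_scores), min_max(keyword_scores)
    fused = {doc: alpha * v.get(doc, 0.0) + (1 - alpha) * kw.get(doc, 0.0)
             for doc in set(v) | set(kw)}
    # Sort by fused score, highest first; break ties by id for stable output.
    return sorted(fused, key=lambda doc: (-fused[doc], doc))


vector_scores = {"doc_a": 0.92, "doc_b": 0.88, "doc_c": 0.40}   # cosine similarities
keyword_scores = {"doc_b": 7.5, "doc_d": 12.0, "doc_c": 1.5}    # BM25 scores
print(hybrid_rank(vector_scores, keyword_scores, alpha=0.6))
```

In this example `doc_b` ranks first because it scores well on both signals, ahead of `doc_a` (strong semantic match only) and `doc_d` (strong keyword match only).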

Malaysian Context — Vector Database Adoption in Enterprise AI

Vector databases have entered Malaysian enterprise technology stacks primarily as components of retrieval-augmented generation deployments and semantic search implementations. The adoption follows the broader AI investment surge driven by Malaysia's AI Roadmap and the Madani Digital initiative.

In the banking sector, Maybank and CIMB have implemented vector database-backed semantic search for internal knowledge management, enabling staff to find relevant regulatory circulars from Bank Negara Malaysia (BNM) and internal policies using natural language queries. These systems typically use pgvector integrated into existing PostgreSQL infrastructure or managed services such as Azure Cosmos DB for MongoDB's vector search feature, accessed through their existing Microsoft Azure agreements.

Telecommunications companies, including Telekom Malaysia (TM) and Maxis, have deployed vector databases for network operations knowledge retrieval, enabling automated diagnostic tools to find similar past fault cases among millions of incident records. The semantic search capability is particularly valuable given the volume and variety of technical documentation involved.

Malaysian e-commerce platforms and marketplaces, such as Lazada Malaysia, have adopted vector databases for product recommendation and visual similarity search, indexing millions of product images and descriptions for retrieval by both text and image query.

The open-source vector database ecosystem — particularly Qdrant and Chroma — has seen adoption among Malaysian AI startups and university research groups that lack the budget for managed cloud services. The Malaysia Digital Hub at Cyberjaya and MRANTI Technology Park host several AI startups that have built RAG-based SaaS products on open-source vector database infrastructure.

HRD Corp-approved training providers have introduced courses covering vector database concepts and implementation, reflecting employer demand for engineers capable of building and maintaining AI search infrastructure.

References

1. DataCamp. (2025). The Top 5 Vector Databases. DataCamp Blog. https://www.datacamp.com/blog/the-top-5-vector-databases
2. Malkov, Y. A., & Yashunin, D. A. (2018). Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(4).
3. InfoQ. (2025). Pinecone Introduces Dedicated Read Nodes in Public Preview for Predictable Vector Workloads. https://www.infoq.com/news/2025/12/pinecone-drn-vector-workloads/
4. Johnson, J., Douze, M., & Jégou, H. (2019). Billion-Scale Similarity Search with GPUs. IEEE Transactions on Big Data, 7(3).

Tags: vector database, embedding, semantic search, Pinecone, Weaviate, Qdrant

Type	Specialised database system
Core operation	Approximate nearest-neighbour (ANN) search
Key use	Semantic search, RAG pipelines, recommendation systems
Leading systems	Pinecone, Weaviate, Qdrant, Chroma, pgvector, Milvus
Related	Embedding, retrieval-augmented generation, semantic search
