pgvector

pgvector is an open-source PostgreSQL extension that adds a vector data type and similarity-search operators, allowing embeddings to be stored and queried directly inside a relational database.

4 min read · Last updated June 2026 · Companies & Tools

pgvector is an open-source extension for PostgreSQL that adds support for storing and searching vector embeddings directly within a relational database. It introduces a dedicated vector column type along with operators and index structures for similarity search, allowing developers to keep embeddings alongside their existing relational data rather than running a separate, specialised vector database. This makes pgvector a popular choice for teams that already use PostgreSQL and want to add semantic capabilities without adopting new infrastructure.

How it works

pgvector adds a vector column type that holds an array of floating-point numbers representing an embedding. It provides distance operators for the three most common similarity measures: cosine distance, Euclidean (L2) distance, and inner product. A typical query selects rows ordered by the distance between a stored vector and a query vector, returning the nearest neighbours. Because this all happens within standard SQL, vector similarity can be combined naturally with ordinary filters, joins, and aggregations, producing context-aware results in a single query.
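As a rough illustration of the three distance measures described above, the following pure-Python sketch computes them by hand and uses them to rank a few toy rows. The operator names in the comments (`<->`, `<=>`, `<#>`) are pgvector's SQL operators; the helper names, the `rows` data, and the `nearest` function are assumptions made for this example and are not part of pgvector.

```python
import math


def inner_product(a, b):
    """Plain dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def l2_distance(a, b):
    """Euclidean distance; pgvector exposes this as the <-> operator."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine_distance(a, b):
    """1 minus cosine similarity; pgvector exposes this as the <=> operator."""
    norm_a = math.sqrt(inner_product(a, a))
    norm_b = math.sqrt(inner_product(b, b))
    return 1.0 - inner_product(a, b) / (norm_a * norm_b)


def negative_inner_product(a, b):
    """pgvector's <#> operator returns the inner product negated,
    so that a smaller value still means a closer match."""
    return -inner_product(a, b)


def nearest(rows, query, distance, k=1):
    """Toy equivalent of ORDER BY embedding <op> query LIMIT k."""
    return sorted(rows, key=lambda row: distance(row[1], query))[:k]


# Hypothetical (id, embedding) rows, for illustration only.
rows = [("a", [1.0, 0.0]), ("b", [0.0, 1.0]), ("c", [3.0, 0.5])]
query = [1.0, 0.1]

by_l2 = nearest(rows, query, l2_distance)
by_cosine = nearest(rows, query, cosine_distance)
print(by_l2[0][0], by_cosine[0][0])
```

On this toy data the two measures disagree: L2 distance prefers the row closest in position, while cosine distance prefers the row closest in direction regardless of length. That is one reason to pick the measure that matches how the embedding model was trained.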

For example, an application can retrieve the most similar documents to a query embedding while simultaneously filtering on a category column, a date range, or a user identifier, joining the result against other tables as needed. This tight integration with relational features is the principal advantage of pgvector over standalone vector stores for many applications.
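The combination of filtering and similarity ordering described above can be sketched as follows. The SQL string shows roughly what such a query might look like; the `documents` table, its columns, and the parameter names are hypothetical, and the SQL is not executed here. The Python function below it is an in-memory stand-in that mirrors the same logic so the behaviour can be checked without a database.

```python
import math
from datetime import date

# Hypothetical query shape (assumed table and column names, not run here).
# The <=> operator is pgvector's cosine distance.
FILTERED_SEARCH_SQL = """
SELECT id, title
FROM documents
WHERE category = %(category)s
  AND published_on >= %(since)s
ORDER BY embedding <=> %(query_embedding)s::vector
LIMIT %(k)s;
"""


def cosine_distance(a, b):
    """1 minus cosine similarity of two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def filtered_search(documents, query_embedding, category, since, k):
    """In-memory mirror of the SQL above: filter first, then rank by distance."""
    candidates = [
        d for d in documents
        if d["category"] == category and d["published_on"] >= since
    ]
    candidates.sort(key=lambda d: cosine_distance(d["embedding"], query_embedding))
    return [(d["id"], d["title"]) for d in candidates[:k]]


# Toy rows, invented for illustration.
documents = [
    {"id": 1, "title": "PDPA overview", "category": "regulation",
     "published_on": date(2025, 3, 1), "embedding": [0.9, 0.1]},
    {"id": 2, "title": "Old PDPA note", "category": "regulation",
     "published_on": date(2019, 1, 1), "embedding": [1.0, 0.0]},
    {"id": 3, "title": "HNSW tuning", "category": "tools",
     "published_on": date(2025, 6, 1), "embedding": [1.0, 0.0]},
    {"id": 4, "title": "AI governance", "category": "regulation",
     "published_on": date(2024, 8, 15), "embedding": [0.2, 0.8]},
]

results = filtered_search(documents, [1.0, 0.0], "regulation", date(2024, 1, 1), 2)
print(results)
```

Rows 2 and 3 have the closest embeddings to the query but are excluded by the date and category filters, which is the behaviour the relational predicates are meant to provide.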

Indexing and performance

To make search fast at scale, pgvector supports two approximate-nearest-neighbour index types, both of which trade a small amount of recall for speed. IVFFlat partitions vectors into lists and searches only the most relevant partitions, while HNSW builds a navigable graph that generally offers better recall and query speed at the cost of more memory and slower index construction. With HNSW indexing, pgvector can handle millions of vectors, and published benchmarks report query times under roughly 20 milliseconds at one million vectors with recall above 95 percent, which is sufficient for many production workloads. Actual figures depend on hardware, vector dimensionality, and index parameters.
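The partitioning idea behind IVFFlat can be illustrated with a small pure-Python sketch. It is a simplification under stated assumptions, not pgvector's implementation: the centroids are fixed by hand here, whereas a real index derives them from the data, and the function names and toy vectors are invented for this example. In pgvector itself, the number of partitions is set with the `lists` option when the index is created, and the number of partitions searched per query is controlled by the `ivfflat.probes` setting.

```python
import math


def l2(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def build_ivf(vectors, centroids):
    """Assign each vector id to the list of its nearest centroid."""
    lists = {i: [] for i in range(len(centroids))}
    for vec_id, vec in vectors.items():
        closest = min(range(len(centroids)), key=lambda i: l2(vec, centroids[i]))
        lists[closest].append(vec_id)
    return lists


def ivf_search(vectors, centroids, lists, query, k, probes):
    """Scan only the `probes` lists whose centroids are closest to the query."""
    probe_ids = sorted(range(len(centroids)), key=lambda i: l2(query, centroids[i]))[:probes]
    candidates = [vec_id for i in probe_ids for vec_id in lists[i]]
    candidates.sort(key=lambda vec_id: l2(vectors[vec_id], query))
    return candidates[:k]


def exact_search(vectors, query, k):
    """Brute-force baseline that scans every vector."""
    return sorted(vectors, key=lambda vec_id: l2(vectors[vec_id], query))[:k]


# Toy data; centroids are hand-picked for the example (an assumption).
vectors = {
    "a": [1.0, 1.0],
    "b": [0.0, 2.0],
    "c": [9.0, 9.0],
    "d": [11.0, 10.0],
    "e": [4.9, 4.9],
}
centroids = [[0.0, 0.0], [10.0, 10.0]]
lists = build_ivf(vectors, centroids)

query = [5.2, 5.2]
one_probe = ivf_search(vectors, centroids, lists, query, k=1, probes=1)
two_probes = ivf_search(vectors, centroids, lists, query, k=1, probes=2)
exact = exact_search(vectors, query, k=1)
print(one_probe, two_probes, exact)
```

The query sits near a partition boundary, so searching a single list misses the true nearest neighbour, while searching both lists recovers it. This is the recall-versus-speed trade-off that the probe count controls.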

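The 0.7.0 release expanded pgvector's capabilities with additional vector representations, including half-precision (halfvec) and sparse (sparsevec) vector types and indexing of binary (bit) vectors, together with further distance functions. Subsequent versions have continued to improve indexing, reflecting steady development driven by the surge in demand for embedding storage.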

Use cases and ecosystem

pgvector enables similarity and semantic search, retrieval-augmented generation, image search, recommendation systems, and other natural-language and computer-vision applications. It has been widely adopted because PostgreSQL is one of the most common databases in production, and major managed platforms support the extension. Supabase, Microsoft Azure Database for PostgreSQL, Amazon RDS and Aurora, and Google Cloud SQL all offer pgvector, lowering the barrier to adding vector search to existing systems.

A frequently cited argument in favour of pgvector is operational simplicity. For applications whose vector counts reach the millions rather than the billions, keeping embeddings in the same database as the rest of the data avoids the cost and complexity of synchronising a separate system, which is why some practitioners argue that many teams do not need a dedicated vector database at all. For the largest or most demanding workloads, purpose-built systems such as Milvus or Qdrant may still be preferable.

Malaysian Context — Pragmatic Vector Search on Existing Databases

PostgreSQL is widely used across Malaysian technology teams, from startups to enterprises and government systems, which makes pgvector an especially practical entry point for adding AI-powered search to existing applications. Rather than provisioning new infrastructure, Malaysian developers can enable the extension on databases they already run, reducing both cost and operational risk.

This pragmatism matters for organisations subject to the Personal Data Protection Act (PDPA), because keeping embeddings inside an existing, already-governed PostgreSQL instance simplifies data governance and residency compared with sending data to an external vector service. Public-sector agencies and regulated firms such as banks (Maybank, CIMB) and insurers can adopt vector search within infrastructure that already meets their compliance requirements.

Malaysian teams have several deployment paths: local cloud and hosting providers support pgvector, as do the major managed-database services available from Malaysian data centres in Johor and Cyberjaya and from regional ones. For the talent ecosystem promoted by MDEC and HRD Corp, pgvector is an accessible way for engineers already fluent in SQL to learn embedding-based retrieval without mastering an entirely new database, supporting the digital-skills goals of the MyDIGITAL blueprint.

References

pgvector. (2026). pgvector GitHub Repository. https://github.com/pgvector/pgvector
PostgreSQL. (2024). pgvector 0.7.0 Released. https://www.postgresql.org/about/news/pgvector-070-released-2852/
Supabase. (2026). pgvector: Embeddings and vector similarity.
Encore. (2025). pgvector Guide: Vector Search and RAG in PostgreSQL.

Tags: pgvector, postgresql, vector-database, embedding, similarity-search

Type	PostgreSQL extension
Purpose	Vector storage and similarity search
Indexes	HNSW, IVFFlat
Distances	Cosine, L2, inner product
Licence	Open-source (PostgreSQL licence)
Key use	RAG, semantic search, recommendations