What is AIWiki Malaysia?

AIWiki Malaysia is a free, open AI knowledge base covering artificial intelligence concepts, tools, models, and use cases — written specifically for Malaysian professionals and students. It is maintained by AITG Sdn Bhd, an AI company based in Penang.

Who maintains AIWiki Malaysia?

AIWiki Malaysia is maintained by AITG Sdn Bhd (Registration: 202601016521 (1678618-W)), an AI company headquartered in George Town, Penang, Malaysia. The editorial team continuously updates and expands the knowledge base.

What topics does AIWiki Malaysia cover?

AIWiki Malaysia covers a wide range of AI topics including large language models (LLMs), AI agents, machine learning fundamentals, prompt engineering, AI automation, generative AI tools, Malaysian AI regulations, local vendor landscape, and real-world AI use cases relevant to the Malaysian market.

How do I search for AI topics on AIWiki Malaysia?

You can use the search bar at the top of the site to find articles by keyword or topic. Articles are also organised by category, so you can browse by subject area such as Models, Tools, Concepts, or Use Cases.

Is AIWiki Malaysia available in Bahasa Malaysia?

Yes. AIWiki Malaysia publishes content in both English and Bahasa Malaysia to serve the full breadth of the Malaysian professional and student community. Language availability is indicated on each article page.

How can I submit a topic or suggest an article?

You can suggest topics or submit article ideas by contacting the AIWiki Malaysia team at admin@aiteragrid.com. AITG Sdn Bhd reviews all submissions and publishes content that meets editorial accuracy standards.

Qdrant

An open-source, Rust-based vector database and similarity search engine designed for high-performance storage and retrieval of high-dimensional embeddings, with support for hybrid search and multitenant deployments.

5 min readLast updated May 2026Companies & Tools

Qdrant is an open-source vector database and similarity search engine, designed for storing and retrieving high-dimensional vectors at scale. Written in Rust and released under the Apache 2.0 licence, Qdrant is among the most widely deployed vector databases for retrieval-augmented generation, semantic search, recommendation systems, and other AI workloads that depend on embedding similarity. The project is developed by Qdrant Solutions GmbH, a Berlin-based company that also operates the managed Qdrant Cloud service on Amazon Web Services, Google Cloud Platform, and Microsoft Azure.

Architecture

Qdrant is built around a custom storage and indexing engine implemented in Rust, which gives it high throughput and predictable latency under load. The core indexing data structure is a Hierarchical Navigable Small World (HNSW) graph, an approximate nearest neighbour algorithm that provides good recall and query speed across millions to billions of vectors. Qdrant supports configurable distance metrics including cosine similarity, dot product, Euclidean (L2), and Manhattan (L1) distance.

The system organises data into named collections, each of which holds points consisting of an identifier, one or more vectors, and an arbitrary JSON payload. Collections can be sharded across nodes and replicated for fault tolerance, and Qdrant supports both single-node and distributed cluster deployments.

Key features

Qdrant supports dense vectors for semantic similarity, sparse vectors that approximate keyword behaviour, and multivector search for objects with several embeddings — useful for late-interaction models such as ColBERT. Hybrid search combines dense and sparse vectors in a single query and merges results using configurable fusion strategies such as Reciprocal Rank Fusion (RRF) and Distribution-Based Score Fusion (DBSF).

Filtered search is a particular strength of Qdrant. Each point carries a JSON payload, and queries can combine vector similarity with rich filter conditions — keyword matching, full-text search, numeric range, geo-bounding box, datetime range, and boolean must, should, and must_not clauses. Filters are integrated into the HNSW traversal rather than applied as a post-processing step, which preserves recall under aggressive filtering.

Built-in quantisation, including scalar, product, and binary quantisation, can reduce memory consumption by up to 97% with a configurable trade-off between speed and precision. Snapshots, point-in-time backups, and storage tiering enable production operations.

Multitenancy and enterprise features

In 2025, Qdrant added a Tiered Multitenancy capability that lets operators dedicate a shard of the underlying vector database to a specific tenant or workload, simplifying isolation in software-as-a-service deployments. Qdrant Cloud has added simplified cluster management, cloud role-based access controls, granular database API keys, advanced monitoring and observability dashboards, and cloud automation APIs aimed at enterprise users.

Ecosystem and integrations

Qdrant integrates broadly with the modern AI stack. Embedding providers including OpenAI, Cohere, Voyage AI, Mistral, and Hugging Face produce vectors that Qdrant stores. Application frameworks such as LangChain, LlamaIndex, Haystack, Semantic Kernel, and Spring AI provide native Qdrant connectors. Observability platforms such as Langfuse, Helicone, and Arize AI offer tracing for retrieval calls into Qdrant.

| Aspect | Qdrant | |--------|--------| | Language | Rust | | Indexing | HNSW | | Hybrid search | Native (dense + sparse) | | Filtering | Rich JSON payload filters | | Quantisation | Scalar, product, binary | | Deployment | Self-hosted, Qdrant Cloud |

Performance and benchmarks

Independent benchmarks published by ANN-Benchmarks and Qdrant's own benchmark suite have consistently ranked Qdrant among the fastest open-source vector databases on common workloads. The Rust implementation and asynchronous I/O architecture allow Qdrant to scale across CPU cores and handle high concurrency without the garbage-collection pauses common to Java-based alternatives.

Malaysian Context — Self-Hosted Vector Search and Data Sovereignty

Qdrant is widely used by Malaysian AI teams that need self-hosted vector search for retrieval-augmented generation applications. Its Apache 2.0 licence and Docker-based deployment model make it practical to run on-premise or within a Malaysian cloud region, which is attractive for organisations subject to the Personal Data Protection Act 2010 (PDPA), Bank Negara Malaysia's Risk Management in Technology (RMiT) policy, and Ministry of Health data-residency expectations.

Banks regulated by BNM — including Maybank, CIMB, Public Bank, and RHB — typically prefer self-hosted or sovereign-region vector stores for chatbot, customer-service, and compliance retrieval workloads. Qdrant's role-based access control, audit logging, and quantisation features fit those constraints. Government modernisation work driven by MDEC and the Ministry of Digital, including pilots within Jabatan Perdana Menteri and several state-government digital-transformation programmes, also uses Qdrant where data sovereignty is a hard requirement.

Local AI service providers in Cyberjaya, Penang, and Bangsar South — including AITG Sdn Bhd's Teragrid Ai Platform — frequently deploy Qdrant as the retrieval layer in production RAG systems serving Malaysian clients. Universiti Malaya, Universiti Sains Malaysia, and other public universities use Qdrant in NLP research, particularly for Bahasa Malaysia embedding work and Malay-English code-mixed retrieval. Cloud deployment options include Qdrant Cloud's Singapore region, which is the closest fully managed offering to Malaysia, while local cloud providers such as TM ONE and CelcomDigi Cloud have hosted Qdrant for clients requiring in-country residency.

References

Qdrant Solutions GmbH. (2024–2026). Qdrant Documentation. qdrant.tech.
Malkov, Y., and Yashunin, D. (2018). Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs. IEEE TPAMI.
Qdrant. (2025). Tiered Multitenancy and Enterprise Cloud Features. Qdrant Blog.
ANN-Benchmarks. (2024). Open-Source Vector Database Benchmark Results.
Bank Negara Malaysia. (2023). Risk Management in Technology (RMiT) Policy Document. BNM.

Tags:qdrant vector-database embedding rust similarity-search

Type	Vector database and search engine
Initial release	2021
Written in	Rust
Licence	Apache 2.0
Company HQ	Berlin, Germany
Cloud	Qdrant Cloud (AWS, GCP, Azure)

Architecture

Key features

Multitenancy and enterprise features

Ecosystem and integrations

Performance and benchmarks

See Also

References

References