What is AIWiki Malaysia?

AIWiki Malaysia is a free, open AI knowledge base covering artificial intelligence concepts, tools, models, and use cases — written specifically for Malaysian professionals and students. It is maintained by AITG Sdn Bhd, an AI company based in Penang.

Who maintains AIWiki Malaysia?

AIWiki Malaysia is maintained by AITG Sdn Bhd (Registration: 202601016521 (1678618-W)), an AI company headquartered in George Town, Penang, Malaysia. The editorial team continuously updates and expands the knowledge base.

What topics does AIWiki Malaysia cover?

AIWiki Malaysia covers a wide range of AI topics including large language models (LLMs), AI agents, machine learning fundamentals, prompt engineering, AI automation, generative AI tools, Malaysian AI regulations, local vendor landscape, and real-world AI use cases relevant to the Malaysian market.

How do I search for AI topics on AIWiki Malaysia?

You can use the search bar at the top of the site to find articles by keyword or topic. Articles are also organised by category, so you can browse by subject area such as Models, Tools, Concepts, or Use Cases.

Is AIWiki Malaysia available in Bahasa Malaysia?

Yes. AIWiki Malaysia publishes content in both English and Bahasa Malaysia to serve the full breadth of the Malaysian professional and student community. Language availability is indicated on each article page.

How can I submit a topic or suggest an article?

You can suggest topics or submit article ideas by contacting the AIWiki Malaysia team at admin@aiteragrid.com. AITG Sdn Bhd reviews all submissions and publishes content that meets editorial accuracy standards.

Weaviate

An open-source, cloud-native vector database that combines vector similarity search with structured filtering, GraphQL APIs, and built-in vectorisation for AI applications.

5 min readLast updated May 2026Companies & Tools

Weaviate is an open-source, cloud-native vector database developed by the Dutch company Weaviate B.V. (originally SeMI Technologies). First released in 2019, it has become one of the most widely adopted vector databases for retrieval-augmented generation (RAG), semantic search, and other AI applications that rely on high-dimensional embeddings. Weaviate distinguishes itself from pure vector search libraries by combining nearest-neighbour search with structured filtering, an integrated module system for automatic vectorisation, and a knowledge-graph data model that allows objects to reference one another.

Architecture

A Weaviate instance organises data into collections (formerly called classes). Each collection has a schema with typed properties — for example, a Document collection might have title, body, author, and createdAt fields. When an object is inserted, Weaviate stores both its scalar properties and its vector representation. Vectors can be supplied directly by the client or produced automatically through a configured vectoriser module that calls an embedding model such as OpenAI's text-embedding-3-small, Cohere's embed-multilingual-v3, or a self-hosted Hugging Face model.

For indexing, Weaviate primarily uses Hierarchical Navigable Small World (HNSW) graphs, with optional product quantisation, scalar quantisation, or binary quantisation to reduce memory footprint. A flat index option is available for small collections, and a dynamic index can switch between flat and HNSW as a collection grows.

Weaviate supports hybrid search that combines BM25 keyword scoring with vector similarity, fused via reciprocal rank fusion or a tunable alpha parameter. Filtering is first-class — queries can combine structured predicates (such as createdAt > 2025-01-01) with vector similarity in a single request.

APIs

Three protocols are exposed. REST is the original interface and is convenient for simple operations. gRPC, added in v1.23, is the recommended high-throughput interface for production workloads and is the default in the official Python client v4. GraphQL is offered as a query language particularly suited to cross-reference traversal between collections.

The Python, TypeScript, Java, and Go clients are officially supported, with several community-maintained clients in other languages.

Modules and ecosystem

Weaviate's module system handles vectorisation, reranking, generative answer construction, and other operations as pluggable components. Modules exist for OpenAI, Cohere, Hugging Face, Google Vertex AI, AWS Bedrock, Anthropic, Voyage, Jina, and several locally hosted alternatives such as text2vec-transformers and Ollama. Reranker modules (Cohere Rerank, Hugging Face cross-encoders) and generator modules (GPT, Claude, Gemini) allow Weaviate to act as an end-to-end RAG backend without external orchestration code.

Deployment

Weaviate can be self-hosted on Docker Compose, Kubernetes (with an official Helm chart and operator), or bare metal. The company also operates Weaviate Cloud, a managed multi-region service, and offers a serverless tier and a bring-your-own-cloud option for customers with data residency requirements. Production deployments rely on sharding for horizontal scaling, replication for high availability, multi-tenancy for SaaS architectures, and role-based access control (RBAC) for security.

Comparison with alternatives

| Database | Type | Strengths | |---|---|---| | Weaviate | Open source, managed | Hybrid search, modules, multi-tenancy | | Pinecone | Managed only | Serverless scaling, simple API | | Qdrant | Open source, managed | Strong filtering, payload-aware indexing | | Milvus | Open source, managed | Large-scale GPU acceleration | | Chroma | Open source, embedded | Developer ergonomics, lightweight | | pgvector | PostgreSQL extension | Familiar SQL, transactional |

Weaviate is commonly chosen when teams want a fully open-source option with first-class hybrid search, modular integration with multiple embedding providers, and a managed-cloud upgrade path.

Use cases

Typical Weaviate deployments include retrieval-augmented generation backends for enterprise chatbots, semantic search across document corpora, product and content recommendation, image and multimodal search, and knowledge graph-style applications that combine vector similarity with structured relationships.

Malaysian Context — Weaviate in Malaysian RAG deployments

Vector databases including Weaviate have seen rapid adoption in Malaysian organisations building RAG systems for internal knowledge bases, customer support, and regulatory document search. Malaysian banks, telecommunications providers, and government agencies frequently choose Weaviate over fully managed alternatives because it can be self-hosted inside data centres located in Cyberjaya, Kuala Lumpur, or Johor, satisfying Personal Data Protection Act 2010 (PDPA) considerations and any sector-specific data residency expectations from Bank Negara Malaysia (BNM) or the Securities Commission Malaysia (SC).

Maybank, CIMB, RHB, and Hong Leong Bank operate internal AI assistants that retrieve from product manuals, compliance documents, and customer correspondence. Telekom Malaysia, Maxis, and CelcomDigi have published technology-team material describing RAG architectures combining Weaviate (or comparable systems) with locally hosted embedding models and Amazon Bedrock, Vertex AI, or Azure OpenAI hosted in regional zones.

Government and GLC pilots — including those coordinated through MDEC, the National AI Office Malaysia, and the Public Service Department's digital transformation programmes — increasingly require vector databases that can be deployed on premises or in the MyGovCloud private cloud. Weaviate's BSD-3 license and on-premises deployability make it a frequent shortlist candidate for these tenders.

AITG SDN BHD and other AWS Partner Network members deploy Weaviate alongside Amazon Bedrock-hosted embedding models for Malaysian customers building Teragrid Agent and similar agentic AI solutions. HRD Corp-funded training programmes in applied AI now routinely include modules on vector databases, embedding selection, and RAG evaluation. Universities including Universiti Malaya, Universiti Teknologi PETRONAS, and Multimedia University have featured Weaviate in graduate-level AI engineering courses since 2024.

Limitations

Vector databases including Weaviate carry operational complexity that should not be underestimated. Schema design, embedding choice, chunking strategy, and reranker selection have outsized effects on retrieval quality and are not solved by the database itself. Memory cost for very large collections can be significant unless quantisation is configured. As with any open-source platform, self-hosting requires capacity for upgrades, monitoring, and security patching that some teams prefer to outsource to managed services.

References

Weaviate B.V. (2025). Weaviate Documentation. docs.weaviate.io.
Malkov, Y. A., & Yashunin, D. A. (2018). Efficient and Robust Approximate Nearest Neighbor Search Using HNSW Graphs. IEEE TPAMI.
Weaviate B.V. (2024). Weaviate 1.27 Release Notes. weaviate.io/blog.
Zilliz. (2025). Top Open Source Vector Databases in 2025. zilliz.com.

Tags:weaviate vector-database semantic-search rag open-source

Type	Open-source vector database
Initial release	2019
Developer	Weaviate B.V. (Netherlands)
Written in	Go
License	BSD-3-Clause
Indexing	HNSW, flat, dynamic; supports binary, scalar, and product quantisation
APIs	REST, gRPC, GraphQL