What is AIWiki Malaysia?

AIWiki Malaysia is a free, open AI knowledge base covering artificial intelligence concepts, tools, models, and use cases — written specifically for Malaysian professionals and students. It is maintained by AITG Sdn Bhd, an AI company based in Penang.

Who maintains AIWiki Malaysia?

AIWiki Malaysia is maintained by AITG Sdn Bhd (Registration: 202601016521 (1678618-W)), an AI company headquartered in George Town, Penang, Malaysia. The editorial team continuously updates and expands the knowledge base.

What topics does AIWiki Malaysia cover?

AIWiki Malaysia covers a wide range of AI topics including large language models (LLMs), AI agents, machine learning fundamentals, prompt engineering, AI automation, generative AI tools, Malaysian AI regulations, local vendor landscape, and real-world AI use cases relevant to the Malaysian market.

How do I search for AI topics on AIWiki Malaysia?

You can use the search bar at the top of the site to find articles by keyword or topic. Articles are also organised by category, so you can browse by subject area such as Models, Tools, Concepts, or Use Cases.

Is AIWiki Malaysia available in Bahasa Malaysia?

Yes. AIWiki Malaysia publishes content in both English and Bahasa Malaysia to serve the full breadth of the Malaysian professional and student community. Language availability is indicated on each article page.

How can I submit a topic or suggest an article?

You can suggest topics or submit article ideas by contacting the AIWiki Malaysia team at admin@aiteragrid.com. AITG Sdn Bhd reviews all submissions and publishes content that meets editorial accuracy standards.

Knowledge Graph

A structured knowledge representation that encodes entities and their relationships as a directed labelled graph, enabling machines to reason over interconnected facts across diverse domains.

6 min readLast updated June 2026Foundations

A knowledge graph (KG) is a data structure that represents knowledge as a collection of entities (real-world objects, concepts, or events) and the typed relationships that connect them. It is formalised as a directed labelled graph in which nodes represent entities and edges represent relations, with each edge carrying a label specifying the type of relationship. The fundamental unit is the triple: (subject, predicate, object) — for example, (Kuala Lumpur, capital_of, Malaysia) or (Maybank, founded_in, 1960).

Knowledge graphs combine the flexibility of a graph data model with formal semantic definitions drawn from ontologies, enabling machines to perform structured reasoning, inference, and complex query answering over large collections of interconnected facts.

Historical Context

The intellectual roots of knowledge graphs lie in symbolic AI and the Semantic Web initiative. Tim Berners-Lee's vision of a machine-readable web of linked data, formalised through the Resource Description Framework (RDF) and the Web Ontology Language (OWL) standards, established the triple-store paradigm in the early 2000s. The term "knowledge graph" was popularised by Google in 2012 when the company announced its Knowledge Graph feature, which enriched search results with structured information about entities sourced from Wikipedia, Freebase, and other curated databases.

Large-scale public knowledge graphs include Wikidata (a collaborative knowledge base maintained by the Wikimedia Foundation with over 100 million entities as of 2025), DBpedia (extracted from Wikipedia), and YAGO. Proprietary knowledge graphs maintained by technology companies are substantially larger; Microsoft's Satori, Google's Knowledge Graph, and LinkedIn's Economic Graph each contain billions of entities and triples.

Data Model

Knowledge graphs typically use one of two complementary data models. The RDF model, standardised by the World Wide Web Consortium (W3C), represents every fact as a subject-predicate-object triple stored in triple stores queryable via SPARQL. RDF graphs are well-suited to linked open data and interoperability across systems.

Property graphs, supported by databases such as Neo4j, Amazon Neptune, and TigerGraph, allow both nodes and edges to carry arbitrary key-value properties, providing a more expressive and developer-friendly model for application-level knowledge management. Cypher (Neo4j) and Gremlin are the dominant query languages for property graphs.

Knowledge Graph Completion

Real-world knowledge graphs are inevitably incomplete — Wikidata contains millions of missing facts that can be inferred from existing information. Knowledge graph completion (KGC) is the task of predicting missing links or entity attributes. Embedding-based methods such as TransE, DistMult, ComplEx, and RotatE learn low-dimensional vector representations for entities and relations such that the geometry of the embedding space reflects the graph's relational structure, enabling missing triples to be scored by geometric operations.

More recent approaches combine graph neural networks with knowledge graph embeddings, or use large language models to perform KGC by framing it as a text generation or ranking task.

Integration with Large Language Models

A significant development in 2024-2025 was the integration of knowledge graphs with large language models to address the hallucination problem. LLMs trained on text alone may generate plausible but factually incorrect statements. Grounding LLM outputs in structured knowledge graphs provides a verifiable factual backbone. In GraphRAG (Microsoft, 2024), a knowledge graph is constructed from a document corpus and used to augment retrieval-augmented generation, enabling more accurate responses to multi-hop questions that require traversing multiple relationships.

Knowledge graphs also improve the explainability of AI outputs: because each fact is traceable to a named source triple, systems can cite specific graph paths as justification for their answers. PingCAP's TiKV and similar graph-augmented databases reported up to 300 percent accuracy improvements on complex multi-hop queries when knowledge graph integration was applied.

Applications

Knowledge graphs power Google's featured snippets and entity panels in search results. In healthcare, graphs such as the Human Disease Ontology and DrugBank link symptoms, diagnoses, genes, proteins, and pharmaceutical compounds, enabling hypothesis generation and adverse drug interaction detection. In finance, knowledge graphs model corporate ownership structures, supply chain relationships, and transaction networks for risk and compliance. In e-commerce, product knowledge graphs connect items, attributes, brands, and user preferences to improve search and recommendation. In manufacturing, KGs encode bill-of-materials hierarchies, supplier relationships, and process parameters to support root-cause analysis.

Malaysian Context — Knowledge Graphs in Government and Finance

Malaysia's public sector has begun exploring knowledge graphs as part of the country's data governance strategy under the MyDigital Blueprint. The National Digital Identity initiative aims to link citizen records across agencies — a natural use case for a government knowledge graph that connects identities, entitlements, land records, and licences while respecting data privacy under the Personal Data Protection Act 2010 (PDPA).

Bank Negara Malaysia (BNM) has signalled interest in graph-based approaches for systemic risk monitoring, where a knowledge graph modelling interbank exposures, corporate group ownership, and cross-border capital flows would enable supervisors to trace contagion pathways across the financial system. CIMB and Maybank have separately explored property graph databases for correspondent banking due diligence, mapping beneficial ownership chains to comply with anti-money laundering (AML) regulations.

In the healthcare space, the Ministry of Health Malaysia maintains linked administrative health data across its hospital information systems. Researchers at Universiti Malaya Medical Centre (UMMC) and Hospital Kuala Lumpur have investigated RDF-based clinical knowledge graphs to connect diagnoses, drug prescriptions, and laboratory results, supporting clinical decision support and pharmacovigilance.

Malaysia's oil and gas sector, led by Petronas, uses asset knowledge graphs to manage complex engineering information across offshore platforms, linking equipment, maintenance histories, inspection records, and supply chain components. This is part of a broader industrial IoT strategy, where structured knowledge graphs help contextualise sensor data.

The government's open data portal (data.gov.my) publishes datasets that could be used to build linked open data graphs covering demographics, economics, and geography. Civil society organisations have called for these to be published as structured RDF linked data to enable cross-dataset querying and support evidence-based policy research.

References

Hogan, A., et al. (2021). Knowledge graphs. ACM Computing Surveys, 54(4), 1-37.
Bordes, A., et al. (2013). Translating embeddings for modeling multi-relational data. NeurIPS 2013.
Edge, D., et al. (2024). From local to global: A graph RAG approach to query-focused summarization. arXiv:2404.16130. Microsoft Research.
W3C. (2004). Resource description framework (RDF): Concepts and abstract syntax. World Wide Web Consortium.
PingCAP. (2025). How knowledge graphs transform machine learning in 2025. pingcap.com.

Tags:knowledge graph ontology RDF entity semantic web

Type	Knowledge representation structure
Origins	Semantic Web, RDF (W3C, 2004); Google Knowledge Graph (2012)
Key technologies	RDF, OWL, SPARQL, Property Graphs, Neo4j
Key use	Search enrichment, question answering, drug discovery, fraud detection
Related	Graph neural network, semantic search, RAG, ontology, embedding

Historical Context

Data Model

Knowledge Graph Completion

Integration with Large Language Models

Applications

See Also

References

References