What is AIWiki Malaysia?

AIWiki Malaysia is a free, open AI knowledge base covering artificial intelligence concepts, tools, models, and use cases — written specifically for Malaysian professionals and students. It is maintained by AITG Sdn Bhd, an AI company based in Penang.

Who maintains AIWiki Malaysia?

AIWiki Malaysia is maintained by AITG Sdn Bhd (Registration: 202601016521 (1678618-W)), an AI company headquartered in George Town, Penang, Malaysia. The editorial team continuously updates and expands the knowledge base.

What topics does AIWiki Malaysia cover?

AIWiki Malaysia covers a wide range of AI topics including large language models (LLMs), AI agents, machine learning fundamentals, prompt engineering, AI automation, generative AI tools, Malaysian AI regulations, local vendor landscape, and real-world AI use cases relevant to the Malaysian market.

How do I search for AI topics on AIWiki Malaysia?

You can use the search bar at the top of the site to find articles by keyword or topic. Articles are also organised by category, so you can browse by subject area such as Models, Tools, Concepts, or Use Cases.

Is AIWiki Malaysia available in Bahasa Malaysia?

Yes. AIWiki Malaysia publishes content in both English and Bahasa Malaysia to serve the full breadth of the Malaysian professional and student community. Language availability is indicated on each article page.

How can I submit a topic or suggest an article?

You can suggest topics or submit article ideas by contacting the AIWiki Malaysia team at admin@aiteragrid.com. AITG Sdn Bhd reviews all submissions and publishes content that meets editorial accuracy standards.

LlamaIndex

LlamaIndex is an open-source Python and TypeScript framework for building retrieval-augmented and agentic AI applications over private data sources.

6 min readLast updated May 2026Companies & Tools

LlamaIndex is an open-source framework for building retrieval-augmented generation (RAG) and agentic applications that combine large language models with external data. Originally released in November 2022 as GPT Index by Jerry Liu, the project was renamed and incorporated as LlamaIndex, Inc. in 2023. The framework provides Python and TypeScript libraries for ingesting documents, parsing complex file formats, building indices, performing retrieval, orchestrating workflows, and evaluating outputs. LlamaIndex is widely used alongside or as an alternative to [[langchain]] for production RAG systems.

Background

The release of GPT-3.5 and ChatGPT in late 2022 created widespread demand for connecting language models to external knowledge bases. Two complementary approaches emerged. Tool and chain orchestration frameworks such as LangChain focused on composing arbitrary calls to LLMs, retrievers, and tools, while LlamaIndex initially focused on the data layer — indexing structured and unstructured content for efficient retrieval. Over time both projects expanded into overlapping territory, but LlamaIndex retains a clear emphasis on parsing fidelity, retrieval flexibility, and production observability.

Architecture

A typical LlamaIndex application progresses through several stages. Documents are loaded from sources such as files, databases, SaaS APIs, or web pages via reader connectors. Loaded content is parsed into nodes by chunkers that respect document structure. Nodes are embedded with a model of the developer's choice and stored in a vector store such as [[pinecone]], Weaviate, Qdrant, Chroma, pgvector, or Elasticsearch. At query time, a retriever fetches candidate nodes, an optional reranker reorders them, and a query engine assembles a prompt for the LLM. Tools and agents extend this pattern into multi-step workflows that can call functions, query SQL, or invoke other agents.

Core modules

| Module | Role | | --- | --- | | Readers and connectors | Load documents from files, APIs, cloud storage, and SaaS apps | | LlamaParse | Managed document parser with strong handling of PDFs and tables | | Node parsers | Chunk documents by structure, semantics, or fixed size | | Embeddings | Wrappers around OpenAI, Cohere, Voyage, Hugging Face, local models | | Indices | Vector, summary, knowledge graph, keyword, and composite indices | | Retrievers and rerankers | Configurable retrieval pipelines including hybrid search | | Query engines | Compose prompts and synthesise final answers | | Workflows and agents | Multi-step orchestration with tool use and memory | | Evaluators | Faithfulness, relevancy, and correctness metrics |

LlamaIndex integrates with model providers including OpenAI, [[anthropic]], Google, [[mistral-ai]], and local runners such as Ollama and vLLM, and with deployment backends such as AWS Bedrock and Google Vertex AI.

LlamaParse and document understanding

A distinguishing capability of LlamaIndex is LlamaParse, a managed parser focused on complex PDFs, slide decks, and spreadsheets. LlamaParse handles tables, figures, multi-column layouts, and skewed scans, and has been a focus of significant investment by the company. In 2025, LlamaIndex reported improvements in retrieval accuracy and document parsing fidelity, including new models inside LlamaParse for difficult layouts. The combination of structured parsing and configurable retrieval pipelines has positioned LlamaIndex strongly for document-heavy applications such as legal research, financial analysis, technical documentation, and regulatory work.

Agents and workflows

In addition to RAG, LlamaIndex supports the construction of agentic systems through a Workflow abstraction. Workflows are event-driven graphs of steps that can invoke LLMs, tools, sub-agents, and human-in-the-loop checkpoints. The framework includes prebuilt patterns for ReAct-style agents, function-calling agents, and multi-agent collaboration, and provides instrumentation hooks for observability platforms such as LangSmith, Arize, Langfuse, and OpenTelemetry-compatible backends.

LangChain and LlamaIndex are the two most widely used open-source frameworks for LLM applications. LangChain typically emphasises broad orchestration primitives and integrations, while LlamaIndex emphasises data ingestion, parsing, and retrieval. Both projects can be combined, and many production systems use them together. Other relevant frameworks include Haystack from Deepset, Semantic Kernel from Microsoft, Microsoft AutoGen for multi-agent systems, CrewAI, and DSPy from Stanford NLP for programmatic prompt and module optimisation. The choice between frameworks usually depends on integration requirements, retrieval needs, evaluation tooling, and the developer's preferred mental model.

Commercial offering

LlamaIndex, Inc. offers a managed cloud platform called LlamaCloud that hosts LlamaParse, document indices, evaluation pipelines, and observability. The company has raised venture funding from investors including Greylock and Notable Capital, and it generates revenue through consumption-based pricing of cloud services and enterprise contracts. The open-source library remains permissively licensed under the MIT License.

Malaysian Context — LlamaIndex in Local AI Projects

LlamaIndex is used by Malaysian developers and companies to build internal RAG copilots, customer-support assistants, and document-search tools. Banks such as Maybank, CIMB, RHB, and Public Bank have explored LlamaIndex and similar frameworks for internal knowledge-management tools that respect Bank Negara Malaysia (BNM) data-residency and risk-management expectations. Telekom Malaysia, Maxis, and Digi build retrieval-augmented assistants for customer care, while logistics and e-commerce companies such as Pos Malaysia, Shopee Malaysia, and Lazada Malaysia use RAG patterns for internal operations and seller support.

Government and public-sector adoption is coordinated through MAMPU, the Malaysia Digital Economy Corporation (MDEC), and the National AI Office (NAIO), with several pilots focused on policy search and citizen-services chatbots that combine LlamaIndex with Malay-language embedding models. Universities including UM, USM, UKM, and MMU teach LlamaIndex patterns inside data engineering and applied AI courses, and HRD Corp subsidises industry training in RAG and agent frameworks for Malaysian engineers.

Penang's growing AI ecosystem, including firms such as AITG Sdn Bhd, has used LlamaIndex and its workflow engine in agentic products such as Teragrid Agent, often integrating with local document stores and Bahasa Malaysia embedding pipelines.

Limitations and considerations

LlamaIndex's breadth of features creates a steep learning curve, and rapid iteration on APIs has occasionally led to breaking changes across versions. Production deployments need to manage cost (embeddings, LLM calls, parser usage), evaluate retrieval quality, monitor drift, and govern data access. The framework provides primitives for each of these concerns, but choosing the right combination requires engineering judgment.

References

Liu, J. (2022). GPT Index: A Project to Connect LLMs with External Data. GitHub.
LlamaIndex, Inc. (2024). LlamaIndex Documentation. https://docs.llamaindex.ai
LlamaIndex, Inc. (2025). LlamaParse and LlamaCloud Release Notes. LlamaIndex Blog.
Lewis, P. et al. (2020). Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. NeurIPS.

Tags:llamaindex RAG framework LLM

Type	RAG and agent framework
Developed by	LlamaIndex, Inc. (formerly GPT Index)
First released	November 2022
Languages	Python, TypeScript
Key use	Document ingestion, retrieval, agents
Related	RAG, LangChain, vector database

Background

Architecture

Core modules

LlamaParse and document understanding

Agents and workflows

Comparison with related frameworks

Commercial offering

Limitations and considerations

References