What is AIWiki Malaysia?

AIWiki Malaysia is a free, open AI knowledge base covering artificial intelligence concepts, tools, models, and use cases — written specifically for Malaysian professionals and students. It is maintained by AITG Sdn Bhd, an AI company based in Penang.

Who maintains AIWiki Malaysia?

AIWiki Malaysia is maintained by AITG Sdn Bhd (Registration: 202601016521 (1678618-W)), an AI company headquartered in George Town, Penang, Malaysia. The editorial team continuously updates and expands the knowledge base.

What topics does AIWiki Malaysia cover?

AIWiki Malaysia covers a wide range of AI topics including large language models (LLMs), AI agents, machine learning fundamentals, prompt engineering, AI automation, generative AI tools, Malaysian AI regulations, local vendor landscape, and real-world AI use cases relevant to the Malaysian market.

How do I search for AI topics on AIWiki Malaysia?

You can use the search bar at the top of the site to find articles by keyword or topic. Articles are also organised by category, so you can browse by subject area such as Models, Tools, Concepts, or Use Cases.

Is AIWiki Malaysia available in Bahasa Malaysia?

Yes. AIWiki Malaysia publishes content in both English and Bahasa Malaysia to serve the full breadth of the Malaysian professional and student community. Language availability is indicated on each article page.

How can I submit a topic or suggest an article?

You can suggest topics or submit article ideas by contacting the AIWiki Malaysia team at admin@aiteragrid.com. AITG Sdn Bhd reviews all submissions and publishes content that meets editorial accuracy standards.

AI Agents

Autonomous AI systems that perceive their environment, reason over goals, select and execute actions using external tools, and operate across multi-step tasks with minimal human intervention.

7 min readLast updated May 2026Applications

AI agents are software systems that use artificial intelligence — most commonly a large language model (LLM) — as a cognitive controller to perceive inputs, reason about goals, plan action sequences, and execute actions through external tools or APIs, iterating over multiple steps to complete complex tasks without requiring a human to approve each individual action. Unlike a conversational AI system that produces a single response and waits, an agent takes initiative: it queries databases, writes and executes code, navigates web pages, calls APIs, sends messages, and revises its approach based on the results it receives. The period from 2025 onward has been widely described in the AI industry as "the era of AI agents," reflecting the rapid maturation of the architectures and infrastructure needed to deploy agents reliably.[^1]

Architecture

An AI agent built on an LLM typically has four functional components.

Perception is the agent's ability to receive and interpret diverse inputs — text, structured data, images, audio, code, and the outputs of tool calls. Modern multimodal LLMs can process all of these natively, allowing agents to work with heterogeneous information sources without preprocessing pipelines.

Reasoning and planning is handled by the LLM itself. Given a goal and the current state of the world (including conversation history and tool outputs), the LLM generates a plan: which tools to invoke, in which order, and how to handle intermediate results. Chain-of-thought reasoning, where the model explicitly articulates intermediate steps before acting, is a common prompt design pattern that improves plan quality for complex tasks.

Memory allows an agent to retain information beyond a single context window. Short-term memory is held in the active context window. Long-term memory is implemented by writing facts, summaries, or retrieved documents to an external store (a database or vector database) and retrieving them when relevant. Memory management — deciding what to store, when to retrieve, and how to compress long histories — is an active research problem.

Tool use (also called function calling) is the mechanism by which an agent affects the world beyond text generation. The LLM selects a tool from a predefined set (a web search API, a code interpreter, a file system, a calendar, a CRM system), generates a call with appropriate arguments, receives the tool's output, and incorporates it into its reasoning. Tool results are returned to the LLM as new context, continuing the reasoning loop.

Agentic Design Patterns

Several recurring design patterns organise agent behaviour:

ReAct (Reason + Act) interleaves reasoning steps ("Thought") with tool invocations ("Action") and their results ("Observation"), allowing the agent to adjust its approach based on tool feedback before each next step.[^2]

Plan-and-Execute separates planning from execution. A planning module generates a complete multi-step plan before any tools are invoked; an execution module carries out the steps, potentially feeding results back to the planner for re-planning if steps fail.

Reflection and self-critique patterns have the agent evaluate its own outputs against the original goal or a set of criteria, producing a critique that guides revision. This reduces errors and improves output quality for tasks with clear correctness criteria.

Multi-agent systems decompose complex tasks across multiple specialised agents — a researcher, a writer, a fact-checker — that communicate and coordinate through a shared message bus or orchestrator. Frameworks such as Microsoft AutoGen and CrewAI implement multi-agent orchestration patterns.

Tool Ecosystems

The Model Context Protocol (MCP), introduced by Anthropic in 2024 and adopted broadly in 2025, defines a standardised interface for connecting LLM-based agents to external tools and data sources. MCP enables tool developers to publish connectors that any compatible agent can use, reducing the integration burden for both agent builders and tool providers. By mid-2025, thousands of MCP servers had been published for services including databases, developer tools, communication platforms, and enterprise software.

Safety and Reliability

Autonomous agents introduce safety risks distinct from those of conversational AI. An agent that can execute code, send emails, or modify files can cause irreversible real-world harm if it misinterprets instructions or encounters edge cases not anticipated in its design.

Key mitigation strategies include minimal footprint principles (granting agents only the permissions needed for a specific task), confirmation checkpoints where the agent pauses for human approval before high-stakes actions, sandboxed execution environments for code generated by agents, and output auditing. The concept of a "human in the loop" — a checkpoint at which a human reviews the agent's proposed action before it executes — is a standard design consideration for production deployments.

Reliability challenges include context length limitations that prevent agents from maintaining full task history over long workflows, error propagation (where early mistakes compound into larger failures), and the tendency of LLMs to hallucinate tool names or arguments that don't match the actual tool specification.[^3]

Malaysian Context — AI Agents in Financial Services and Enterprise

AI agents have attracted significant interest in Malaysia's financial services sector, which is among the most technologically advanced in Southeast Asia. BNM's Regulatory Sandbox and the Joint Committee on Financial Innovation (JCFI) have provided frameworks for piloting agentic AI in financial applications under controlled conditions.

Ryt Bank, Malaysia's first AI-powered digital bank launched in August 2025, built its core banking experience around a multi-agent architecture. Customer instructions — such as fund transfers, bill payments, and account queries — are processed by specialised agents that verify intent, enforce regulatory guardrails under the Financial Services Act, and execute approved transactions. The bank's agentic design reduces reliance on traditional mobile banking UX in favour of conversational task completion.

Hong Leong Bank has reported productivity gains of 10–25% in software development workflows by deploying AI coding agents — models that autonomously generate, test, and document code — integrated into its engineering pipelines. Similarly, Maybank's technology division has evaluated agentic AI for automating internal audit procedures, where agents retrieve, analyse, and summarise regulatory filings.

Malaysian enterprises have also deployed AI agents for accounts payable automation, where agents extract data from invoices, validate against purchase orders, reconcile discrepancies, and route exceptions to human reviewers. This use case is particularly active in the manufacturing sector, including companies in the Penang and Johor industrial corridors.

The National AI Office (NAIO), established in December 2024 under the Prime Minister's Department, has identified agentic AI governance as a priority area, noting that existing regulatory frameworks — including the PDPA, the Computer Crimes Act, and BNM's RMIT guidelines — require interpretation and potentially new guidance to adequately address the risks introduced by autonomous AI action. The Malaysia AI Governance Framework is being updated to address agentic scenarios, including accountability for agent-initiated actions, audit trail requirements, and data minimisation obligations under PDPA when agents access personal data.

CelcomDigi, Malaysia's largest telecommunications company by subscriber base, has deployed AI agents for network operations center (NOC) automation, where agents monitor telemetry, diagnose fault patterns, and initiate remediation actions in network equipment — escalating to human engineers only for novel fault categories.

References

MIT Sloan Management Review. (2025). Agentic AI, explained. MIT Sloan School of Management. https://mitsloan.mit.edu/ideas-made-to-matter/agentic-ai-explained
Yao, S., Zhao, J., Yu, D., Du, N., Shafran, I., Narasimhan, K., & Cao, Y. (2023). ReAct: Synergizing Reasoning and Acting in Language Models. ICLR 2023.
AWS. (2025). Evaluating AI agents: Real-world lessons from building agentic systems at Amazon. AWS Machine Learning Blog. https://aws.amazon.com/blogs/machine-learning/evaluating-ai-agents-real-world-lessons/
Anthropic. (2025). Claude Agent SDK: Building autonomous AI workflows. Anthropic Documentation.
BNM. (2024). Risk Management in Technology (RMIT) Policy Document. Bank Negara Malaysia.

Tags:AI agents autonomous agents agentic AI tool use LLM multi-agent

Type	Autonomous AI system
Core components	Perception, reasoning, planning, tool use, memory
Based on	Large language models (primarily)
Key frameworks	LangChain, LlamaIndex, AutoGen, CrewAI, Claude Agent SDK
Related	Multi-agent systems, prompt engineering, RAG, function calling

Architecture

Agentic Design Patterns

Tool Ecosystems

Safety and Reliability

See Also

References

References