What is AIWiki Malaysia?

AIWiki Malaysia is a free, open AI knowledge base covering artificial intelligence concepts, tools, models, and use cases — written specifically for Malaysian professionals and students. It is maintained by AITG Sdn Bhd, an AI company based in Penang.

Who maintains AIWiki Malaysia?

AIWiki Malaysia is maintained by AITG Sdn Bhd (Registration: 202601016521 (1678618-W)), an AI company headquartered in George Town, Penang, Malaysia. The editorial team continuously updates and expands the knowledge base.

What topics does AIWiki Malaysia cover?

AIWiki Malaysia covers a wide range of AI topics including large language models (LLMs), AI agents, machine learning fundamentals, prompt engineering, AI automation, generative AI tools, Malaysian AI regulations, local vendor landscape, and real-world AI use cases relevant to the Malaysian market.

How do I search for AI topics on AIWiki Malaysia?

You can use the search bar at the top of the site to find articles by keyword or topic. Articles are also organised by category, so you can browse by subject area such as Models, Tools, Concepts, or Use Cases.

Is AIWiki Malaysia available in Bahasa Malaysia?

Yes. AIWiki Malaysia publishes content in both English and Bahasa Malaysia to serve the full breadth of the Malaysian professional and student community. Language availability is indicated on each article page.

How can I submit a topic or suggest an article?

You can suggest topics or submit article ideas by contacting the AIWiki Malaysia team at admin@aiteragrid.com. AITG Sdn Bhd reviews all submissions and publishes content that meets editorial accuracy standards.

AI Planning

AI planning is the discipline of automatically generating a sequence of actions that an intelligent agent can execute to move from an initial state to a goal, increasingly used inside LLM-based agents to decompose and reason about complex tasks.

5 min readLast updated June 2026Foundations

Planning in artificial intelligence is the problem of automatically producing a sequence of actions that, when executed by an agent in some environment, transforms an initial state into a state that satisfies a stated goal. Planning is one of the founding subfields of AI: it underpins game-playing, robotics, logistics, autonomous vehicles, and, more recently, the orchestration of multi-step behaviour inside large language model (LLM) agents.

Classical planning

Classical planning assumes a fully observable, deterministic environment with a finite state space and instantaneous actions. A planning problem is specified by an initial state, a goal description, and a set of action schemas with preconditions and effects. The dominant representation languages are STRIPS (Stanford Research Institute Problem Solver), introduced in 1971, and its successor PDDL (Planning Domain Definition Language), which has been the lingua franca of the International Planning Competition since 1998. Solvers explore the state space using heuristic search algorithms such as A*, with heuristics derived from problem relaxations (FF, LAMA, Fast Downward).

Extensions cover non-classical settings: temporal planning with durations and concurrency, probabilistic planning modelled as Markov decision processes, partially observable planning as POMDPs, multi-agent planning, and hierarchical task network (HTN) planning, where high-level tasks are decomposed into lower-level subtasks.

Planning with large language models

The rise of LLMs has produced a new generation of planners that treat natural-language task descriptions as planning problems. Several patterns recur:

| Pattern | Idea | |---|---| | Chain-of-thought | Prompt the model to reason step by step before answering | | Plan-and-Solve | Generate a full plan first, then execute step by step | | ReAct | Interleave reasoning thoughts with tool actions and observations | | Tree of Thoughts | Explore multiple candidate plans as a tree with self-evaluation | | Graph of Thoughts | Allow merging and revisiting of partial plans in a graph | | Reflexion | Reflect on past failures and revise the plan in the next attempt | | LLM-as-planner with verifier | Use an LLM to generate plans and a separate verifier (symbolic or neural) to check feasibility | | LLM + PDDL | Translate natural-language tasks into PDDL and use classical solvers |

These approaches differ in their commitment to single-shot versus stepwise planning, and in whether they require an external symbolic component. Empirically, stepwise approaches handle dynamic environments and tool failures better, while one-shot approaches are cheaper when the task is well structured.

Task decomposition

A practical concern in agentic systems is task decomposition: how to break a high-level goal into subtasks small enough for reliable execution. Recent work has shown that aggressive decomposition combined with per-step verification can scale agent reliability dramatically, with some published systems reporting near-zero error rates across millions of reasoning steps when each step is small and locally checkable. Hierarchical decomposition also fits naturally with tool-using agents that delegate subtasks to specialised tools, smaller models, or human reviewers.

Evaluation

Planning systems are evaluated on success rate, plan length or cost, generalisation across problem instances, robustness to perturbations, and computational efficiency. Modern LLM agent benchmarks (AgentBench, GAIA, WebArena, SWE-bench, OSWorld) all stress planning behaviour in addition to single-step reasoning.

Limitations and open problems

Even capable LLM planners struggle with long-horizon planning, irreversible actions, partial observability, and adversarial environments. Hallucinated steps, infinite loops, and brittle recovery from tool errors remain common failure modes. Research directions include neuro-symbolic hybrids that combine LLMs with classical planners or constraint solvers, world-model learning to simulate consequences before acting, and self-improvement through replay of past trajectories.

Malaysian Context — Operational Agents in Regulated Workflows

Planning capabilities are central to the agentic AI systems being deployed by Malaysian banks, government agencies, telcos, logistics providers, and e-commerce platforms. These deployments typically operate inside regulated workflows where each step must be auditable and bounded — exactly the use cases where careful task decomposition and verification matter most.

In financial services, agents that triage customer queries, prepare loan files, or perform regulatory reporting for Bank Negara Malaysia (BNM) and the Securities Commission (SC) need planners that respect approval thresholds, segregation of duties, and the BNM Risk Management in Technology (RMiT) framework. Maybank, CIMB, RHB, Public Bank, Hong Leong, and digital banks (GXBank, AEON Bank, Boost Bank, Ryt Bank) are progressively introducing such workflows under internal AI governance committees.

In government and the public sector, the MyDigital Blueprint, MAMPU's data sharing initiatives, and the National AI Office's emerging governance framework anticipate agentic automations in service delivery — for example, document checking, eligibility assessment, and case routing — that must follow legally defined procedures rather than open-ended exploration. Planning approaches that emit explicit, inspectable plans are therefore preferred over opaque one-shot generation.

In logistics and e-commerce, Pos Malaysia, GDex, Ninja Van Malaysia, Lalamove, J&T Express, Shopee Malaysia, Lazada Malaysia, and Foodpanda use route, fulfilment, and customer-service agents whose planning must integrate with classical operations research solvers for vehicle routing and warehouse picking. In manufacturing, especially the Penang and Kulim electronics cluster, agentic systems plan inspection and maintenance steps that connect to existing MES and CMMS systems. Local vendors under AITG SDN BHD, including the Teragrid Agent product, build agentic workflows that incorporate planning, tool use, and memory for these regulated environments.

References

Fikes, R. and Nilsson, N. (1971). STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving. Artificial Intelligence.
Ghallab, M., Nau, D., and Traverso, P. (2004). Automated Planning: Theory and Practice. Morgan Kaufmann.
Yao, S. et al. (2023). ReAct: Synergizing Reasoning and Acting in Language Models. ICLR.
Yao, S. et al. (2023). Tree of Thoughts: Deliberate Problem Solving with Large Language Models. NeurIPS.
Cognizant AI Lab. (2025). MAKER Achieves Million-Step, Zero-Error LLM Reasoning.

Tags:agents reasoning task-decomposition llm

Type	AI reasoning discipline
Classical languages	STRIPS, PDDL, HTN
Modern variants	LLM planners, ReAct, Plan-and-Solve, Tree of Thoughts
Key concepts	State, action, goal, plan
Common metrics	Plan length, success rate, cost
Related	AI agents, tool use, reasoning, search

Classical planning

Planning with large language models

Task decomposition

Evaluation

Limitations and open problems

See Also

References

References