What is AIWiki Malaysia?

AIWiki Malaysia is a free, open AI knowledge base covering artificial intelligence concepts, tools, models, and use cases — written specifically for Malaysian professionals and students. It is maintained by AITG Sdn Bhd, an AI company based in Penang.

Who maintains AIWiki Malaysia?

AIWiki Malaysia is maintained by AITG Sdn Bhd (Registration: 202601016521 (1678618-W)), an AI company headquartered in George Town, Penang, Malaysia. The editorial team continuously updates and expands the knowledge base.

What topics does AIWiki Malaysia cover?

AIWiki Malaysia covers a wide range of AI topics including large language models (LLMs), AI agents, machine learning fundamentals, prompt engineering, AI automation, generative AI tools, Malaysian AI regulations, local vendor landscape, and real-world AI use cases relevant to the Malaysian market.

How do I search for AI topics on AIWiki Malaysia?

You can use the search bar at the top of the site to find articles by keyword or topic. Articles are also organised by category, so you can browse by subject area such as Models, Tools, Concepts, or Use Cases.

Is AIWiki Malaysia available in Bahasa Malaysia?

Yes. AIWiki Malaysia publishes content in both English and Bahasa Malaysia to serve the full breadth of the Malaysian professional and student community. Language availability is indicated on each article page.

How can I submit a topic or suggest an article?

You can suggest topics or submit article ideas by contacting the AIWiki Malaysia team at admin@aiteragrid.com. AITG Sdn Bhd reviews all submissions and publishes content that meets editorial accuracy standards.

ReAct (Reasoning and Acting)

ReAct is a prompting framework that interleaves reasoning traces with task actions, letting a language model plan, call external tools, and incorporate observations to solve problems more reliably.

5 min readLast updated June 2026Applications

ReAct, short for Reasoning and Acting, is a prompting framework that combines a language model's reasoning with its ability to take actions in an external environment. Introduced in a paper led by Shunyu Yao and colleagues and published at ICLR 2023, ReAct interleaves reasoning traces with task-specific actions so that the model can plan, gather information from external sources, and adjust its approach based on what it observes. It is one of the foundational techniques behind modern tool-using AI agents.

Core idea

Earlier work had explored two capabilities separately: chain-of-thought prompting, which improves a model's internal reasoning, and action generation, which lets a model call tools or interact with an environment. ReAct asks what happens when these are combined, augmenting the action space of an agent with an internal reasoning space. The central insight is that reasoning and acting reinforce each other. Reasoning traces help the model decompose a task, track progress, form and update plans, and handle exceptions, while actions let the model retrieve external information that grounds its reasoning in reality rather than relying solely on what it has memorised.

How it works

A ReAct prompt typically provides a few worked examples, each consisting of a trajectory that alternates between three kinds of step: a thought, in which the model reasons in natural language about what to do next; an action, in which it issues a command such as a search query or an API call; and an observation, which is the result returned by the environment. The model continues this thought, action, observation loop until it reaches a final answer.

For instance, when answering a factual question, a ReAct agent might reason that it needs to look up a name, issue a search action against a knowledge source, read the returned observation, reason about what is still missing, and issue a further query before concluding. This interleaving lets the model break out of purely internal reasoning, which is prone to fabricating facts, by checking its assumptions against an external source at each step.

Results and impact

In the original experiments, ReAct was tested on both knowledge-intensive and interactive tasks. On question answering using the HotpotQA benchmark and on fact verification using Fever, connecting the model to a simple Wikipedia interface allowed ReAct to overcome the hallucination and error-propagation problems that affect pure chain-of-thought reasoning, because the model could verify intermediate facts rather than inventing them. On interactive decision-making benchmarks such as ALFWorld and WebShop, ReAct substantially outperformed imitation-learning and reinforcement-learning baselines, with large absolute gains in success rate.

Beyond the specific benchmarks, ReAct established a general pattern that has become standard in agent design. The thought, action, observation loop underpins many agent frameworks and is closely tied to the rise of tool use and function calling in large language models. Frameworks such as LangChain and LangGraph implement ReAct-style agents, and the approach is a conceptual ancestor of the broader agentic AI systems now used for research, coding, and automation.

Strengths and limitations

ReAct improves reliability and interpretability, because the visible reasoning trace explains why the agent took each action, and grounding actions in external tools reduces hallucination. Its limitations include sensitivity to the quality of the tools and observations, the risk of the model getting stuck in unproductive loops, and higher latency and cost from multiple model calls and tool invocations. These trade-offs have motivated later refinements, including reflection mechanisms that let agents critique and revise their own trajectories.

Malaysian Context — Tool-Using Agents for Local Applications

ReAct is a building block for the AI agents that Malaysian organisations are beginning to deploy. As banks, telecommunications providers, and government agencies experiment with assistants that can look up information, query internal systems, and take actions, the reasoning-and-acting pattern is central to making those systems dependable. Grounding an agent's actions in authoritative internal sources is particularly important in regulated sectors overseen by Bank Negara Malaysia (BNM) and the Securities Commission, where fabricated answers carry real consequences.

For the developer and talent ecosystem supported by MDEC and HRD Corp, familiarity with agent frameworks such as LangChain and LangGraph, which implement ReAct-style loops, is an increasingly sought-after skill. Malaysian startups building customer-service automation, document processing, and workflow tools rely on these patterns to connect language models to real data and systems.

The transparency of ReAct, where each action is preceded by a visible reasoning step, also supports the accountability goals of the Malaysia AI Governance Framework and the National AI Office. Being able to inspect why an agent took an action helps organisations audit AI behaviour, which matters for compliance under the Personal Data Protection Act (PDPA) and for public trust. At the same time, the additional cost of multiple tool calls is a practical consideration for Malaysian deployments operating under tight budgets.

References

Yao, S., Zhao, J., Yu, D., Du, N., Shafran, I., Narasimhan, K., & Cao, Y. (2022). ReAct: Synergizing Reasoning and Acting in Language Models. arXiv:2210.03629 (ICLR 2023).
Google Research. (2022). ReAct: Synergizing Reasoning and Acting in Language Models.
ReAct project page. (2023). https://react-lm.github.io/

Tags:react ai-agents reasoning tool-use prompting

Type	Agent prompting framework
Full name	Reasoning and Acting
Introduced	2022 (ICLR 2023)
Lead author	Shunyu Yao et al.
Mechanism	Interleaved thought, action, observation
Key use	Tool-using language agents