What is AIWiki Malaysia?

AIWiki Malaysia is a free, open AI knowledge base covering artificial intelligence concepts, tools, models, and use cases — written specifically for Malaysian professionals and students. It is maintained by AITG Sdn Bhd, an AI company based in Penang.

Who maintains AIWiki Malaysia?

AIWiki Malaysia is maintained by AITG Sdn Bhd (Registration: 202601016521 (1678618-W)), an AI company headquartered in George Town, Penang, Malaysia. The editorial team continuously updates and expands the knowledge base.

What topics does AIWiki Malaysia cover?

AIWiki Malaysia covers a wide range of AI topics including large language models (LLMs), AI agents, machine learning fundamentals, prompt engineering, AI automation, generative AI tools, Malaysian AI regulations, local vendor landscape, and real-world AI use cases relevant to the Malaysian market.

How do I search for AI topics on AIWiki Malaysia?

You can use the search bar at the top of the site to find articles by keyword or topic. Articles are also organised by category, so you can browse by subject area such as Models, Tools, Concepts, or Use Cases.

Is AIWiki Malaysia available in Bahasa Malaysia?

Yes. AIWiki Malaysia publishes content in both English and Bahasa Malaysia to serve the full breadth of the Malaysian professional and student community. Language availability is indicated on each article page.

How can I submit a topic or suggest an article?

You can suggest topics or submit article ideas by contacting the AIWiki Malaysia team at admin@aiteragrid.com. AITG Sdn Bhd reviews all submissions and publishes content that meets editorial accuracy standards.

AI Memory

AI memory refers to the mechanisms that allow artificial intelligence agents to retain, retrieve, and use information across interactions, extending capability beyond a single context window.

5 min readLast updated June 2026Applications

AI memory is the umbrella term for the mechanisms by which artificial intelligence systems, especially LLM-based agents, retain information beyond the boundaries of a single prompt and use that retained information to inform later behaviour. Without explicit memory, a model is stateless: each request is evaluated only against its current context window and built-in parameters, so personalisation, long-running tasks, and continuity across sessions are impossible. Memory mechanisms add the missing state by writing, retrieving, summarising, and sometimes forgetting information across interactions.

Why memory matters for agents

Static LLMs are constrained by their fixed context window, which even at hundreds of thousands of tokens cannot hold a user's complete history, a large codebase, or a multi-week project record. Agents that operate over long horizons must therefore externalise state. Memory enables personalisation (remembering a user's preferences), tool reliability (recalling which tools have failed), planning (referring to earlier plans and reflections), and cross-session continuity (resuming a conversation).

Taxonomy of memory types

Researchers and practitioners increasingly distinguish four functional categories, loosely analogous to human cognitive systems:

| Type | Stored content | Typical implementation | |---|---|---| | Short-term (working) memory | Current conversation, scratchpad | The model's active context window | | Episodic memory | Specific past events and interactions | Vector store of interaction logs | | Semantic memory | General facts, preferences, world knowledge | Knowledge graph or summarised notes | | Procedural memory | Learned skills, tool-use patterns | Fine-tuned weights, cached plans |

A 2025 wave of research papers argues that the older "short-term vs long-term" split is too coarse and that production agents need explicit episodic and semantic stores with different write, retrieval, and decay policies.

How memory is implemented

Most production agent stacks implement memory as a layered system around an LLM call. On every turn the agent (1) writes salient new information to one or more stores, (2) retrieves relevant prior information by similarity or keyword search, (3) composes the retrieved memory into the prompt alongside the user message, and (4) optionally reflects by summarising or consolidating older entries to keep storage bounded.

Common storage substrates include vector databases (Pinecone, Weaviate, Qdrant, Chroma, pgvector), key-value or document stores (Redis, MongoDB, DynamoDB), and graph databases (Neo4j, ArangoDB, TigerGraph) for relational facts. Frameworks such as LangChain, LangGraph, LlamaIndex, MemGPT, Letta, Mem0, and Zep provide higher-level memory primitives.

Retrieval and forgetting

Memory systems must solve two opposing problems: ensuring that relevant information is recalled when needed and ensuring that the store does not grow unbounded or dilute retrieval quality with stale entries. Retrieval typically combines dense vector similarity with metadata filters, recency boosts, and importance scores; forgetting is handled by time-based decay, summarisation into higher-level notes, or explicit user-controlled deletion. Designing the forgetting policy is often as important as designing the writing policy, since retaining everything degrades retrieval relevance and raises privacy risk.

Privacy and governance

Persistent memory raises distinctive governance questions: which categories of personal data may be retained, for how long, with what user controls, and under what jurisdiction. Memory stores that learn from user behaviour are subject to the same privacy laws as any other personal data system, including the EU GDPR, Singapore PDPA, and Malaysia's PDPA, and may attract specific obligations under emerging AI regulations regarding data subject rights of access and erasure.

Malaysian Context — Personalisation, Privacy, and Local Use Cases

Malaysian organisations deploying agentic AI must reconcile the personalisation benefits of persistent memory with the requirements of the Personal Data Protection Act 2010 (PDPA), enforced by the Personal Data Protection Department (JPDP). Recent PDPA amendments raised penalties, introduced data breach notification, and clarified data portability rights, which directly affect any agent that retains user interaction history.

Banks and digital banks licensed by Bank Negara Malaysia (BNM) — including Maybank, CIMB, RHB, GXBank, AEON Bank, Boost Bank, and Ryt Bank — that experiment with AI advisers and copilots typically implement strict retention windows, role-based access to memory contents, and audit trails of memory reads and writes. BNM's discussion paper on the use of AI by financial institutions encourages traceable, auditable AI systems, which extends naturally to memory governance.

Telcos (Maxis, Celcom Digi, U Mobile, TM) and super-apps (Grab Malaysia, Touch 'n Go, Shopee Malaysia, Lazada Malaysia, Foodpanda) are using memory-enabled assistants to remember customer preferences, order history, and support context. PDPA cross-border transfer rules and the BNM Risk Management in Technology (RMiT) framework encourage on-shore storage of memory data, which has driven adoption of regionally hosted vector databases and local cloud regions in AWS Asia Pacific (Kuala Lumpur), Azure Malaysia Central, Google Cloud Kuala Lumpur, and Tencent Cloud Malaysia.

Local AI vendors under the AITG SDN BHD umbrella and within the Cyberjaya and Penang ecosystems are integrating memory layers into customer-service, sales, and operations agents, including the Teragrid Agent product family. MDEC, MOSTI, and the National AI Office are expected to publish more specific guidance on long-term memory and personalisation under the Malaysian AI Governance Framework.

References

Park, J. et al. (2023). Generative Agents: Interactive Simulacra of Human Behavior. UIST.
Packer, C. et al. (2023). MemGPT: Towards LLMs as Operating Systems. arXiv.
Liu, S. et al. (2025). Memory in the Age of AI Agents: A Survey. arXiv.
Kim, J. et al. (2022). A Machine with Short-Term, Episodic, and Semantic Memory Systems. arXiv:2212.02098.
Bank Negara Malaysia. (2024). Discussion Paper on the Use of AI by Financial Institutions.

Tags:agents llm vector-database episodic-memory

Type	Agentic AI architectural pattern
Common storage	Vector databases, key-value stores, knowledge graphs
Memory types	Short-term, episodic, semantic, procedural
Key challenge	Retrieval relevance and forgetting
Related	AI agents, RAG, context window, vector database

Why memory matters for agents

Taxonomy of memory types

How memory is implemented

Retrieval and forgetting

Privacy and governance

See Also

References

References