What is AIWiki Malaysia?

AIWiki Malaysia is a free, open AI knowledge base covering artificial intelligence concepts, tools, models, and use cases — written specifically for Malaysian professionals and students. It is maintained by AITG Sdn Bhd, an AI company based in Penang.

Who maintains AIWiki Malaysia?

AIWiki Malaysia is maintained by AITG Sdn Bhd (Registration: 202601016521 (1678618-W)), an AI company headquartered in George Town, Penang, Malaysia. The editorial team continuously updates and expands the knowledge base.

What topics does AIWiki Malaysia cover?

AIWiki Malaysia covers a wide range of AI topics including large language models (LLMs), AI agents, machine learning fundamentals, prompt engineering, AI automation, generative AI tools, Malaysian AI regulations, local vendor landscape, and real-world AI use cases relevant to the Malaysian market.

How do I search for AI topics on AIWiki Malaysia?

You can use the search bar at the top of the site to find articles by keyword or topic. Articles are also organised by category, so you can browse by subject area such as Models, Tools, Concepts, or Use Cases.

Is AIWiki Malaysia available in Bahasa Malaysia?

Yes. AIWiki Malaysia publishes content in both English and Bahasa Malaysia to serve the full breadth of the Malaysian professional and student community. Language availability is indicated on each article page.

How can I submit a topic or suggest an article?

You can suggest topics or submit article ideas by contacting the AIWiki Malaysia team at admin@aiteragrid.com. AITG Sdn Bhd reviews all submissions and publishes content that meets editorial accuracy standards.

Text Summarisation

Text summarisation is the natural language processing task of producing a shorter version of a document that preserves its key information, using extractive or abstractive techniques.

4 min readLast updated May 2026Applications

Text summarisation is the natural language processing task of producing a shorter version of one or more source documents while preserving the most important information. It is one of the oldest problems in NLP, with research dating to Luhn's 1958 work on automatic abstracting at IBM, and remains a benchmark capability for modern large language models.

Extractive summarisation

Extractive summarisation selects existing sentences or spans from the source document and concatenates them into a summary. Classical approaches score sentences using frequency-based heuristics such as TF-IDF, graph-based methods such as TextRank and LexRank, or supervised classifiers trained to predict whether each sentence should be included. Modern extractive systems use transformer encoders such as BERT to score sentences in context, often combined with sequence labelling or pointer networks. Because all output text comes verbatim from the input, extractive summaries are unlikely to hallucinate, but they often read as choppy and miss high-level themes.

Abstractive summarisation

Abstractive summarisation generates novel text that paraphrases and restructures the source. Early neural approaches used encoder-decoder recurrent neural networks with attention mechanisms. Pretrained encoder-decoder transformers such as BART, T5, and Pegasus, fine-tuned on summarisation datasets, set the state of the art in the late 2010s and early 2020s. Pegasus introduced gap-sentence pretraining specifically designed for summarisation. Large general-purpose language models including GPT-4, Claude, Gemini, and Llama now perform abstractive summarisation in zero-shot or few-shot settings with quality often matching or exceeding fine-tuned specialised models, particularly for long documents handled through long-context architectures or retrieval-augmented generation.

Hybrid and structured approaches

Hybrid systems combine extractive selection with abstractive rewriting, either through pipeline architectures or end-to-end models such as bottom-up summarisers. Structured approaches produce summaries aligned to a schema — for example, news bullet points, executive summaries, medical discharge notes, or legal briefs — improving downstream usability and supporting evaluation against templates.

Evaluation

ROUGE (Recall-Oriented Understudy for Gisting Evaluation) remains the most widely reported metric, measuring n-gram overlap between system and reference summaries. ROUGE-1, ROUGE-2, and ROUGE-L variants capture different overlap granularities. Newer metrics including BERTScore, BLEURT, and learned-reward metrics correlate better with human judgement. Faithfulness — whether a summary's claims are supported by the source — is increasingly evaluated using natural language inference models and dedicated factuality benchmarks such as FactCC and SummaC. Human evaluation along fluency, informativeness, and faithfulness axes remains the gold standard for production systems.

Common challenges

Hallucination, in which a generated summary asserts facts not present in the source, is the central challenge for abstractive systems. Long-document summarisation strains context windows and dilutes attention. Multi-document summarisation must reconcile conflicting information across sources. Domain-specific summarisation — for example, medical literature or legal opinions — requires terminology coverage and respect for safety-critical accuracy. Low-resource languages and code-mixed text present additional difficulties addressed through cross-lingual transfer and multilingual pretraining.

Applications

Text summarisation is deployed in news aggregators, search snippets, meeting transcription tools, legal e-discovery, scientific literature search, contact-centre call summarisation, and clinical note generation. It is also a core component of retrieval-augmented generation pipelines, where retrieved passages are summarised or compressed before being passed to a downstream model.

Malaysian Context — Summarisation for Bahasa Melayu and Multilingual Workflows

Text summarisation systems used in Malaysia must handle Bahasa Melayu, English, Mandarin, Tamil, and frequent code-mixing. Public-sector deployments include the Malaysia Digital Economy Corporation's (MDEC) automated policy briefing summarisers, the Ministry of Digital's news monitoring tools, and the National Audit Department's pilot use of large language models to summarise procurement records. Government communications often rely on summarisation pipelines tuned for Bahasa Melayu through fine-tuning runs on Malay news corpora such as Berita Harian and Utusan.

In financial services, Maybank, CIMB, and RHB use summarisation models for credit memo generation, customer-call transcription summarisation, and regulatory filings. Bank Negara Malaysia issued guidance in 2024 reminding licensed institutions to validate the faithfulness of summarisation outputs and to retain source records for audit. The Securities Commission Malaysia has flagged hallucination risk in research note generation for capital markets participants.

Academic and industry research on Malay-language summarisation is led by Universiti Kebangsaan Malaysia, Universiti Sains Malaysia, and the Malaysian Institute of Microelectronic Systems (MIMOS). Open-source contributions include Malay BART and SEA-LION variants. Local startups including AwanTunai, Naluri, and ServisHero use summarisation in customer-experience products, often through commercial APIs from OpenAI, Anthropic, and Google deployed via Malaysian or Singaporean cloud regions to comply with Personal Data Protection Act 2010 obligations.

References

Luhn, H. P. (1958). The Automatic Creation of Literature Abstracts. IBM Journal of Research and Development.
Lewis, M. et al. (2020). BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation. ACL.
Zhang, J. et al. (2020). PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization. ICML.
Lin, C.-Y. (2004). ROUGE: A Package for Automatic Evaluation of Summaries. ACL Workshop.

Tags:NLP summarisation language models extractive abstractive

Type	NLP task
Main approaches	Extractive, abstractive, hybrid
Common datasets	CNN/Daily Mail, XSum, MultiNews
Common metrics	ROUGE, BERTScore, BLEU
Key models	BART, T5, Pegasus, GPT-4, Claude