What is AIWiki Malaysia?

AIWiki Malaysia is a free, open AI knowledge base covering artificial intelligence concepts, tools, models, and use cases — written specifically for Malaysian professionals and students. It is maintained by AITG Sdn Bhd, an AI company based in Penang.

Who maintains AIWiki Malaysia?

AIWiki Malaysia is maintained by AITG Sdn Bhd (Registration: 202601016521 (1678618-W)), an AI company headquartered in George Town, Penang, Malaysia. The editorial team continuously updates and expands the knowledge base.

What topics does AIWiki Malaysia cover?

AIWiki Malaysia covers a wide range of AI topics including large language models (LLMs), AI agents, machine learning fundamentals, prompt engineering, AI automation, generative AI tools, Malaysian AI regulations, local vendor landscape, and real-world AI use cases relevant to the Malaysian market.

How do I search for AI topics on AIWiki Malaysia?

You can use the search bar at the top of the site to find articles by keyword or topic. Articles are also organised by category, so you can browse by subject area such as Models, Tools, Concepts, or Use Cases.

Is AIWiki Malaysia available in Bahasa Malaysia?

Yes. AIWiki Malaysia publishes content in both English and Bahasa Malaysia to serve the full breadth of the Malaysian professional and student community. Language availability is indicated on each article page.

How can I submit a topic or suggest an article?

You can suggest topics or submit article ideas by contacting the AIWiki Malaysia team at admin@aiteragrid.com. AITG Sdn Bhd reviews all submissions and publishes content that meets editorial accuracy standards.

Zero-Shot Learning

Zero-shot learning is a machine learning paradigm in which a model makes accurate predictions on categories it has never seen during training by leveraging semantic descriptions or attribute representations.

6 min readLast updated May 2026Foundations

Zero-shot learning (ZSL) is a machine learning setting in which a model is expected to correctly classify or process instances belonging to categories that it was never explicitly trained on. Rather than requiring labelled examples for every class, ZSL systems rely on an auxiliary source of knowledge — such as semantic attribute vectors, class descriptions in natural language, or knowledge graph embeddings — to bridge the gap between seen and unseen categories. The term "zero-shot" reflects the fact that zero labelled training examples are available for the target classes at inference time.

Motivation

Standard supervised learning assumes that the categories present at test time are the same as those seen during training. This closed-world assumption breaks down in many real-world scenarios where new classes emerge after a model is deployed, collecting labelled training data is expensive or impossible, or a model must operate across a large number of categories where exhaustive labelling is impractical. Zero-shot learning addresses these situations by enabling generalisation to novel categories without additional training.

How It Works

The central idea of ZSL is to transfer knowledge from seen classes to unseen classes through a shared semantic embedding space. In a typical ZSL pipeline:

Feature extraction: A neural network maps each input instance to a high-dimensional feature vector.
Semantic space: Each class — both seen and unseen — is described by a semantic vector. This may be hand-annotated attribute vectors (e.g., "has stripes", "can fly"), word embeddings of the class name, or sentence embeddings of a natural language description.
Compatibility function: A model learns to align instance features with semantic class representations. At inference time, an unseen image is matched to the semantic representation of the unseen class that scores highest.

The key challenge is the hubness problem — in high-dimensional spaces, a small number of "hub" points tend to appear as nearest neighbours for many query points, degrading ZSL accuracy. Several methods address this through normalisation, calibration, or generative approaches.

Generalised Zero-Shot Learning

Practical deployments typically operate under Generalised Zero-Shot Learning (GZSL), where the model must classify instances from both seen and unseen classes simultaneously. GZSL is significantly harder than standard ZSL because models trained on seen classes tend to be biased towards predicting seen-class labels. Addressing this bias — through output calibration, generative data augmentation, or auxiliary classifiers — is an active research area.

Relationship to Large Language Models

The popularisation of large language models such as GPT-3, GPT-4, and their successors gave the term "zero-shot" an additional meaning in NLP. In the LLM context, "zero-shot prompting" refers to asking a model to perform a task — translation, classification, summarisation — by providing only an instruction and no examples. This is distinct from classical ZSL in computer vision, though both involve generalising without per-task training examples.

CLIP (Contrastive Language-Image Pre-training), released by OpenAI in 2021, bridged the two traditions by enabling zero-shot image classification purely via natural language descriptions, without any dedicated visual training for the target classes. A user can query CLIP with a text description of any category and it will identify matching images, even categories absent from standard visual training benchmarks.

Applications

Zero-shot learning has practical impact in settings where labelled data is scarce or expensive. In medical imaging, rare diseases can be classified from imaging data using clinical description text as the semantic anchor, without requiring enough cases to train a conventional classifier. In e-commerce, product images can be matched to catalogue categories that did not exist when the visual model was trained, using textual product metadata as the bridge. In cybersecurity, previously unseen malware families can be detected based on behavioural descriptions, without retraining the detection model.

In natural language processing, zero-shot and cross-lingual transfer are especially valuable. LLMs trained primarily on English can perform tasks in dozens of other languages without language-specific fine-tuning, using cross-lingual embeddings to bridge the gap between language representations.

Recent Advances

Research presented at ICLR 2025 demonstrated that combining diffusion model-based data augmentation with supervised contrastive learning — an approach called ZeroDiff — achieved 76.3% accuracy on standard ZSL benchmarks with 90% less training data than prior methods. These results underscore the continued relevance of zero-shot learning research even in an era dominated by foundation models.

Malaysian Context — Zero-Shot Learning and Low-Resource Languages

Zero-shot and few-shot generalisation are particularly significant in Malaysia's multilingual environment. Malaysia's population speaks Bahasa Malaysia, English, Mandarin, Tamil, and numerous regional and indigenous languages. For many of these — especially indigenous languages such as Iban, Kadazan-Dusun, and Bidayuh — virtually no annotated NLP training data exists. Zero-shot learning methods that transfer knowledge from resource-rich languages offer a viable path to building useful language tools for these communities.

Researchers at Universiti Malaya and Universiti Sains Malaysia have explored cross-lingual transfer for Bahasa Malaysia NLP tasks, leveraging multilingual pretrained models such as mBERT and XLM-RoBERTa in zero-shot or few-shot regimes. The AI Malaysia initiative and MDEC's AI in Education programme have identified low-resource language technology as a priority area under the National AI Roadmap.

In the computer vision domain, Malaysian companies in agricultural technology — including those working with Felda and palm oil producers — face the challenge of identifying novel pest species or crop diseases without extensive labelled datasets. Zero-shot learning is one approach being explored to extend existing plant disease classifiers to newly observed conditions, reducing the time and cost required to respond to emerging agricultural threats.

MIMOS Berhad, Malaysia's national applied ICT R&D centre under MITI, has conducted research into transfer learning and generalisation methods applicable to Malaysian context data, including research relevant to Bahasa Malaysia speech and text understanding. The broader question of how AI systems can serve Malaysia's linguistic diversity without requiring enormous labelled datasets for each language is central to the country's AI inclusion agenda.

References

Larochelle, H., et al. (2008). Zero-data learning of new tasks. Proceedings of the 23rd AAAI Conference on Artificial Intelligence.
Lampert, C. H., Nickisch, H., and Harmeling, S. (2009). Learning to detect unseen object classes by between-class attribute transfer. CVPR 2009.
Xian, Y., et al. (2018). Zero-Shot Learning — A Comprehensive Evaluation of the Good, the Bad and the Ugly. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(9), 2251-2265.
Radford, A., et al. (2021). Learning Transferable Visual Models From Natural Language Supervision (CLIP). OpenAI / arXiv:2103.00020.

Tags:zero-shot-learning transfer-learning generalisation few-shot

Type	Machine learning paradigm
Sub-field	Transfer learning, generalisation
First proposed	circa 2009
Key use	Classification without per-class training examples
Related	Few-shot learning, Meta-learning, Transfer learning