What is AIWiki Malaysia?

AIWiki Malaysia is a free, open AI knowledge base covering artificial intelligence concepts, tools, models, and use cases — written specifically for Malaysian professionals and students. It is maintained by AITG Sdn Bhd, an AI company based in Penang.

Who maintains AIWiki Malaysia?

AIWiki Malaysia is maintained by AITG Sdn Bhd (Registration: 202601016521 (1678618-W)), an AI company headquartered in George Town, Penang, Malaysia. The editorial team continuously updates and expands the knowledge base.

What topics does AIWiki Malaysia cover?

AIWiki Malaysia covers a wide range of AI topics including large language models (LLMs), AI agents, machine learning fundamentals, prompt engineering, AI automation, generative AI tools, Malaysian AI regulations, local vendor landscape, and real-world AI use cases relevant to the Malaysian market.

How do I search for AI topics on AIWiki Malaysia?

You can use the search bar at the top of the site to find articles by keyword or topic. Articles are also organised by category, so you can browse by subject area such as Models, Tools, Concepts, or Use Cases.

Is AIWiki Malaysia available in Bahasa Malaysia?

Yes. AIWiki Malaysia publishes content in both English and Bahasa Malaysia to serve the full breadth of the Malaysian professional and student community. Language availability is indicated on each article page.

How can I submit a topic or suggest an article?

You can suggest topics or submit article ideas by contacting the AIWiki Malaysia team at admin@aiteragrid.com. AITG Sdn Bhd reviews all submissions and publishes content that meets editorial accuracy standards.

Sentiment Analysis

Sentiment analysis is a natural language processing technique that automatically identifies and classifies the emotional tone of text as positive, negative, or neutral, and is widely used in customer feedback, social media monitoring, and financial analysis.

6 min readLast updated May 2026Applications

Sentiment analysis, also known as opinion mining, is a branch of natural language processing (NLP) concerned with the computational identification of subjective information in text — principally the emotional tone, attitude, or opinion expressed by a writer toward a subject. The most basic form of sentiment analysis classifies text into positive, negative, or neutral categories. More advanced systems perform fine-grained analysis, detecting specific emotions such as anger, joy, fear, or surprise, or identifying the sentiment directed toward particular entities or aspects within a text (aspect-based sentiment analysis).

The field emerged as a formal research area in the early 2000s following the growth of online reviews and forums, and has since become one of the most commercially applied areas of NLP, used across industries from consumer goods and media to finance and human resources.

Task Formulation

Sentiment analysis encompasses several related but distinct tasks. Document-level sentiment analysis assigns a single sentiment label to an entire document, such as a product review. Sentence-level analysis evaluates the sentiment of individual sentences within a document. Aspect-based sentiment analysis (ABSA) identifies both the specific aspect being discussed (for example, battery life in a smartphone review) and the sentiment expressed toward that aspect. This finer-grained approach provides more actionable intelligence for businesses than document-level classification.

Entity-level sentiment analysis identifies named entities mentioned in a text and determines the sentiment expressed toward each, enabling, for instance, the tracking of public opinion about individual companies or political figures across news articles and social media.

Methods and Algorithms

Lexicon-Based Methods

The earliest and most interpretable sentiment analysis systems use sentiment lexicons: dictionaries in which words are associated with pre-assigned sentiment scores or polarity labels. Well-known English lexicons include SentiWordNet and VADER (Valence Aware Dictionary and sEntiment Reasoner). The sentiment of a text is computed by aggregating the scores of its constituent words, with adjustments for negation (not good becomes negative) and intensifiers (very good increases the positive score).

Lexicon-based methods are fast, transparent, and require no training data, making them useful for domains where labelled data is unavailable. Their main limitation is sensitivity to domain shift: a word that is positive in one context may be negative in another (for example, unpredictable can be positive when describing a film plot but negative when describing software behaviour).

Machine Learning Methods

Supervised machine learning approaches frame sentiment analysis as a text classification problem. Features are extracted from text (bag-of-words representations, n-grams, term frequency-inverse document frequency (TF-IDF) vectors) and fed to classifiers such as Naive Bayes, Support Vector Machines (SVMs), or logistic regression. These methods require labelled training data but generalise better across nuanced expressions than purely lexicon-based approaches.

Deep Learning Methods

Recurrent neural networks (RNNs), long short-term memory (LSTM) networks, and convolutional neural networks (CNNs) improved sentiment analysis performance significantly through the 2010s by capturing sequential dependencies and local n-gram patterns in text. Pre-trained word embeddings such as Word2Vec and GloVe provided better initial feature representations than bag-of-words approaches.

The introduction of transformer-based pre-trained models, beginning with BERT (Bidirectional Encoder Representations from Transformers) in 2018, produced a step change in performance. Fine-tuning BERT on labelled sentiment datasets achieved state-of-the-art results across standard benchmarks. Subsequent models including RoBERTa, DistilBERT, and XLM-RoBERTa extended this approach to multiple languages.

Large language models including GPT-4, Claude, and Gemini can perform sentiment analysis through zero-shot and few-shot prompting, making high-quality sentiment classification accessible without labelled training data for the target domain.

Applications

The most widespread commercial application is customer feedback analysis. Businesses collect reviews, support tickets, and survey responses and use sentiment analysis to summarise the overall sentiment, track sentiment trends over time, and identify specific product features or service aspects that drive satisfaction or dissatisfaction.

Social media monitoring tools use sentiment analysis to track public opinion about brands, products, campaigns, and public figures across platforms such as X (formerly Twitter), Facebook, and Instagram. Financial services firms apply sentiment analysis to news articles, earnings call transcripts, and analyst reports to extract signals correlated with market movements — a field known as alternative data or textual analysis in finance.

In human resources, companies analyse employee engagement survey responses and internal communication to gauge workforce sentiment and identify early signals of attrition risk. Healthcare providers analyse patient feedback and clinical notes to monitor patient experience and identify concerns.

Challenges

Sentiment analysis faces several persistent challenges. Sarcasm and irony are difficult to detect without contextual understanding. Implicit sentiment, where the emotional tone is conveyed through factual statements rather than explicit sentiment words, is problematic for lexicon and n-gram based methods. Multilingual sentiment analysis requires labelled data and lexicons for each target language, and low-resource languages remain underserved.

Domain adaptation is another challenge: a model trained on film reviews may perform poorly on financial news because the vocabulary and sentiment conventions differ substantially. Cross-domain and domain-adaptive sentiment models are an active research area.

Malaysian Context — Sentiment Analysis for Local Languages and Industries

Sentiment analysis in Malaysia presents unique challenges and opportunities due to the country's multilingual character. Malaysian social media and customer communications frequently contain Bahasa Malaysia, English, Mandarin Chinese, Tamil, and code-mixed language (Manglish — a blend of Malay and English). Building effective sentiment models for these mixed-language contexts requires specialised training data and multilingual model architectures.

Researchers at Universiti Malaya, Universiti Kebangsaan Malaysia (UKM), and Universiti Sains Malaysia (USM) have published work on Malay-language sentiment analysis, developing annotated corpora of Bahasa Malaysia text from social media, news comments, and product reviews. The availability of multilingual pre-trained models such as XLM-RoBERTa and mBERT has improved the feasibility of multilingual Malaysian sentiment classifiers.

Malaysian banks and financial institutions, including Maybank, CIMB, Public Bank, and RHB, have deployed sentiment analysis tools to process customer feedback from mobile banking apps, contact centres, and social media. Analysing this feedback at scale helps identify service pain points, monitor customer reactions to product launches, and prioritise improvements.

E-commerce platforms operating in Malaysia, including Shopee and Lazada, use sentiment analysis to process product reviews and seller feedback. Malaysia's large and active e-commerce market generates substantial review data in multiple languages, and sentiment analysis enables platforms to surface useful reviews, detect fraudulent or incentivised reviews, and identify product quality trends.

The Securities Commission Malaysia (SC) and Bank Negara Malaysia (BNM) have explored NLP-based regulatory intelligence tools, including sentiment analysis of financial disclosures and news, as part of their broader supervisory technology (SupTech) initiatives. Tracking sentiment around listed companies and financial instruments supports market surveillance and investor protection objectives.

References

Pang, B., and Lee, L. (2008). Opinion Mining and Sentiment Analysis. Foundations and Trends in Information Retrieval, 2(1-2), 1-135.
Devlin, J. et al. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of NAACL-HLT 2019.
Hutto, C., and Gilbert, E. (2014). VADER: A Parsimonious Rule-based Model for Sentiment Analysis of Social Media Text. Proceedings of ICWSM 2014.
Pontiki, M. et al. (2016). SemEval-2016 Task 5: Aspect Based Sentiment Analysis. Proceedings of SemEval 2016.
Abdullah, M.T. et al. (2022). Sentiment Analysis for Bahasa Malaysia Social Media Text: A Survey. IEEE Access, 10, 58637-58659.

Tags:sentiment analysis NLP opinion mining text classification natural language processing

Also known as	Opinion mining, emotion AI
Type	Natural language processing application
Output	Positive, negative, neutral (or fine-grained emotions)
Key methods	Lexicon-based, machine learning, deep learning
Applications	Customer feedback, social listening, finance, HR