AIWiki
Malaysia

Search Results

51 results for LLM

Applications

AI Agents

Autonomous AI systems that perceive their environment, reason over goals, select and execute actions using external tools, and operate across multi-step tasks with minimal human intervention.

7 min read · Updated May 2026
Infrastructure

AI Benchmarking

The systematic evaluation of AI systems using standardised datasets, tasks, and metrics to measure capability, compare models, and track progress across research and deployment contexts.

6 min read · Updated June 2026
Infrastructure

AI Guardrails

AI guardrails are runtime safety mechanisms that validate, filter, and enforce policies on large language model inputs and outputs in production systems, preventing harmful content, data leakage, prompt injection, and off-topic behaviour.

6 min read · Updated June 2026
Applications

AI Memory

AI memory refers to the mechanisms that allow artificial intelligence agents to retain, retrieve, and use information across interactions, extending capability beyond a single context window.

5 min read · Updated June 2026
Foundations

AI Planning

AI planning is the discipline of automatically generating a sequence of actions that an intelligent agent can execute to move from an initial state to a goal, increasingly used inside LLM-based agents to decompose and reason about complex tasks.

5 min read · Updated June 2026
Companies & Tools

Arize AI

Arize AI is an American ML observability and LLM evaluation platform that helps teams monitor, debug, and improve artificial intelligence models in production, offering both open-source and enterprise-grade tooling.

5 min read · Updated June 2026
Infrastructure

AutoGen

AutoGen is an open-source multi-agent conversation framework developed by Microsoft Research that enables developers to build LLM applications where multiple AI agents communicate with each other to accomplish complex tasks collaboratively.

6 min read · Updated June 2026
Applications

Autonomous Agents

Autonomous AI agents are software systems that use large language models as a reasoning core, enabling them to plan multi-step tasks, use external tools, maintain memory, and take actions to achieve goals with minimal human intervention.

6 min read · Updated May 2026
Applications

Chain-of-Thought Prompting

A prompt engineering technique that improves large language model reasoning on complex tasks by instructing the model to generate explicit intermediate reasoning steps before arriving at a final answer.

5 min read · Updated May 2026
Applications

Chatbot

A chatbot is a software application designed to simulate human conversation through text or voice, ranging from simple rule-based systems to sophisticated AI assistants powered by large language models.

3 min read · Updated May 2026
Models

ChatGLM

A family of open-source bilingual (Chinese-English) large language models developed by Zhipu AI and Tsinghua University, known for strong reasoning capabilities, large context windows, and enterprise-grade open-weight releases under MIT licensing.

5 min read · Updated June 2026
Models

Claude (Language Model)

A family of large language models developed by Anthropic, designed with a focus on safety, helpfulness, and Constitutional AI training methods for enterprise and consumer use.

5 min read · Updated May 2026
Applications

Code Generation

AI code generation is the use of large language models to automatically produce, complete, or transform source code from natural language descriptions, enabling assisted and autonomous software development.

6 min read · Updated May 2026
Companies & Tools

Cohere

Cohere is a Canadian AI company specialising in enterprise large language models, offering Command, Embed, and Rerank model families alongside secure deployment infrastructure designed for regulated industries.

6 min read · Updated May 2026
Companies & Tools

Comet ML

Comet ML is a cloud-based MLOps platform for tracking machine learning experiments, managing model versions, monitoring production models, and evaluating large language model applications.

5 min read · Updated June 2026
Foundations

Context Window

The maximum number of tokens, including the prompt, prior conversation, retrieved documents, and the model's own output, that a large language model can attend to at one time.

5 min read · Updated May 2026
Companies & Tools

DeepSeek

A Chinese artificial intelligence company founded in 2023, known for developing open-source large language models including DeepSeek-R1 and DeepSeek-V3 that achieved performance competitive with leading Western AI systems.

5 min read · Updated May 2026
Models

Doubao

A suite of large language models and a consumer AI assistant developed by ByteDance, the parent company of TikTok, reaching 159 million monthly active users and embedded across ByteDance's content, social, and device ecosystems.

4 min read · Updated June 2026
Models

ERNIE Bot

A large language model and conversational AI assistant developed by Baidu, built on the ERNIE (Enhanced Representation through Knowledge Integration) foundation model series and integrated across Baidu's search, cloud, and enterprise platforms.

5 min read · Updated June 2026
Models

Falcon LLM

A family of open-weight large language models developed by the Technology Innovation Institute (TII) in Abu Dhabi, released under permissive licenses and used widely across enterprise and research applications.

6 min read · Updated June 2026
Applications

Fine-Tuning

The process of further training a pre-trained machine learning model on a smaller, task-specific dataset to adapt its weights for a particular domain, task, or desired behaviour.

6 min read · Updated May 2026
Applications

Function Calling

Function calling is the structured mechanism by which a large language model returns a JSON-formatted invocation of a named function with typed arguments, enabling reliable integration of LLMs with external systems.

6 min read · Updated May 2026
Applications

Generative AI

Generative AI refers to artificial intelligence systems capable of producing new content — text, images, audio, video, or code — by learning the underlying distribution of training data.

4 min read · Updated May 2026
Foundations

Hallucination (AI)

A phenomenon in which an artificial intelligence system generates output that is factually incorrect, fabricated, or unsupported by its input, while presenting it with apparent confidence.

6 min read · Updated May 2026
Companies & Tools

Helicone

Helicone is an open-source LLM observability and gateway platform that enables developers to monitor, debug, and optimise large language model applications in production with minimal integration effort.

5 min read · Updated June 2026
Models

Hunyuan

A family of large language models developed by Tencent, integrated across WeChat, QQ, and Tencent Cloud, offering multimodal capabilities including text, image, video, voice, and 3D generation through a unified omni-modal architecture.

5 min read · Updated June 2026
Applications

In-Context Learning

In-context learning is the ability of large language models to perform new tasks by conditioning on examples or instructions provided within the input prompt, without updating model weights.

5 min read · Updated June 2026
Models

Kimi

A conversational AI assistant and long-context large language model developed by Moonshot AI, a Beijing startup, known for its industry-leading context window lengths and strong performance on agentic reasoning tasks.

4 min read · Updated June 2026
Infrastructure

LangChain

LangChain is an open-source framework for building applications powered by large language models, providing composable abstractions for chaining LLM calls with tools, memory, and data retrieval in Python and JavaScript.

6 min read · Updated May 2026
Infrastructure

Langfuse

Langfuse is an open-source LLM engineering platform that provides observability, tracing, prompt management, evaluation, and dataset tooling for teams building applications on top of large language models.

6 min read · Updated June 2026
Companies & Tools

LangSmith

LangSmith is an observability, tracing, and evaluation platform from LangChain for debugging, monitoring, and continuously improving large language model and AI agent applications in production.

4 min read · Updated June 2026
Foundations

Large Language Models

Large language models (LLMs) are AI systems trained on vast corpora of text to predict and generate natural language. They underpin modern chatbots, code assistants, and generative AI applications.

5 min read · Updated May 2026
Companies & Tools

LlamaIndex

LlamaIndex is an open-source Python and TypeScript framework for building retrieval-augmented and agentic AI applications over private data sources.

6 min read · Updated May 2026
Applications

LoRA (Low-Rank Adaptation)

LoRA is a parameter-efficient fine-tuning technique that adapts large pre-trained models by injecting small trainable low-rank matrices into transformer layers, drastically reducing the number of trainable parameters, typically with little or no loss in task performance.

6 min read · Updated May 2026
Models

MaLLaM (Malaysia Large Language Model)

MaLLaM is a family of large language models developed by Malaysian startup Mesolitica, pretrained from scratch on Malay-language data to understand Malaysian dialects, colloquialisms, and regional languages.

5 min read · Updated June 2026
Foundations

Markov Decision Process

A Markov decision process is a mathematical framework for modelling sequential decision-making in which outcomes are partly random and partly under the control of a decision-maker.

4 min read · Updated May 2026
Companies & Tools

MiniMax

A Chinese AI company and model developer known for the MiniMax-M1 and M2 large language models featuring ultra-long context windows of up to 4 million tokens, strong agentic performance, and open MIT-licensed releases.

5 min read · Updated June 2026
Companies & Tools

Mistral AI

Mistral AI is a French artificial intelligence company founded in 2023 that develops and releases open-weight and proprietary large language models, notable for its competitive performance-to-efficiency ratio and commitment to open-source distribution.

6 min read · Updated May 2026
Infrastructure

Ollama

Ollama is an open-source runtime that enables developers and researchers to download, run, and manage large language models locally on consumer hardware without cloud API dependencies.

6 min read · Updated June 2026
Companies & Tools

Perplexity AI

Perplexity AI is an American AI company that operates an answer engine combining real-time web search with large language model synthesis, providing cited, conversational responses to user queries.

5 min read · Updated May 2026
Infrastructure

Prompt Caching

Prompt caching is an inference optimisation technique that stores precomputed key-value representations of repeated prompt prefixes, reducing latency and token processing costs for applications with stable system prompts or long shared contexts.

6 min read · Updated June 2026
Applications

Prompt Engineering

The practice of designing and optimising input instructions given to large language models to elicit accurate, relevant, and well-structured outputs for a given task or application.

7 min read · Updated May 2026
Infrastructure

Prompt Injection

Prompt injection is a security vulnerability affecting large language model applications in which an attacker embeds adversarial instructions in model inputs to override the system's intended behaviour, bypass safety controls, or exfiltrate sensitive information.

7 min read · Updated June 2026
Applications

Retrieval-Augmented Generation

A technique that enhances large language model outputs by retrieving relevant documents from an external knowledge base at inference time, grounding responses in up-to-date and domain-specific information.

6 min read · Updated May 2026
Models

Spark

A large language model developed by iFlyTek, a Chinese AI company specialising in speech recognition and natural language processing, notable for its multilingual capabilities covering over 130 languages including Malay and other ASEAN languages.

5 min read · Updated June 2026
Foundations

Sparse Autoencoder

A sparse autoencoder is a type of autoencoder trained with a sparsity constraint that forces most neurons in the hidden layer to be inactive for any given input, with the aim of producing a more disentangled, interpretable feature decomposition.

7 min read · Updated June 2026
Infrastructure

Speculative Decoding

Speculative decoding is an inference acceleration technique that uses a small draft model to propose multiple candidate tokens that a larger target model then verifies in parallel, with commonly reported speedups of roughly 2-4x and no change to the target model's output distribution.

5 min read · Updated June 2026
Applications

Tool Use

Tool use in AI refers to the capability of language models to invoke external functions, APIs, or services to retrieve information, perform actions, or extend their abilities beyond text generation.

6 min read · Updated May 2026
Applications

Vibe Coding

Vibe coding is an AI-assisted software development practice in which a developer describes intent in natural language and a large language model generates the code, with the human guiding and testing rather than writing it directly.

5 min read · Updated June 2026
Infrastructure

vLLM

vLLM is an open-source library for fast and memory-efficient large language model inference and serving, built around the PagedAttention algorithm for optimised GPU memory management.

6 min read · Updated June 2026
Models

Yi

A family of open-source bilingual large language models developed by 01.AI, the Beijing-based AI startup founded by Kai-Fu Lee, achieving competitive performance against Llama 2 and Falcon with strong capability in both Chinese and English.

4 min read · Updated June 2026