AIWiki
Malaysia

Search Results

61 results for "language model"

Infrastructure

AI Guardrails

AI guardrails are runtime safety mechanisms that validate, filter, and enforce policies on large language model inputs and outputs in production systems, preventing harmful content, data leakage, prompt injection, and off-topic behaviour.

6 min read · Updated June 2026

Companies & Tools

AI21 Labs

An Israeli artificial intelligence company founded in 2017 that develops large language models, including the Jurassic and Jamba families, and enterprise agentic platforms such as Maestro.

4 min read · Updated May 2026

Companies & Tools

Anthropic

Anthropic is an American AI safety company and large language model developer founded in 2021 by former OpenAI researchers, best known for developing the Claude family of AI assistants and the Constitutional AI alignment technique.

7 min read · Updated May 2026

Applications

Autonomous Agents

Autonomous AI agents are software systems that use large language models as a reasoning core, enabling them to plan multi-step tasks, use external tools, maintain memory, and take actions to achieve goals with minimal human intervention.

6 min read · Updated May 2026

Companies & Tools

Azure AI

Azure AI is Microsoft's integrated portfolio of artificial intelligence services hosted on the Azure cloud platform, encompassing pre-built cognitive APIs, a managed machine learning platform, large language model access, and enterprise AI development tools.

6 min read · Updated May 2026

Applications

Chain-of-Thought Prompting

A prompt engineering technique that improves large language model reasoning on complex tasks by instructing the model to generate explicit intermediate reasoning steps before arriving at a final answer.

5 min read · Updated May 2026

Applications

Chatbot

A chatbot is a software application designed to simulate human conversation through text or voice, ranging from simple rule-based systems to sophisticated AI assistants powered by large language models.

3 min read · Updated May 2026

Models

ChatGLM

A family of open-source bilingual (Chinese-English) large language models developed by Zhipu AI and Tsinghua University, known for strong reasoning capabilities, large context windows, and enterprise-grade open-weight releases under MIT licensing.

5 min read · Updated June 2026

Models

Claude (Language Model)

A family of large language models developed by Anthropic for enterprise and consumer use, designed with a focus on safety and helpfulness and trained using Constitutional AI methods.

5 min read · Updated May 2026

Applications

Code Generation

AI code generation is the use of large language models to automatically produce, complete, or transform source code from natural language descriptions, enabling assisted and autonomous software development.

6 min read · Updated May 2026

Companies & Tools

Cohere

Cohere is a Canadian AI company specialising in enterprise large language models, offering Command, Embed, and Rerank model families alongside secure deployment infrastructure designed for regulated industries.

6 min read · Updated May 2026

Companies & Tools

Comet ML

Comet ML is a cloud-based MLOps platform for tracking machine learning experiments, managing model versions, monitoring production models, and evaluating large language model applications.

5 min read · Updated June 2026

Foundations

Constitutional AI

Constitutional AI is an alignment method developed by Anthropic that trains language models to follow a set of written ethical principles by using the model itself to critique and revise its own outputs, reducing dependence on human feedback for harmlessness.

6 min read · Updated May 2026

Foundations

Context Window

The maximum number of tokens, including the prompt, prior conversation, retrieved documents, and the model's own output, that a large language model can attend to at once.

5 min read · Updated May 2026

Companies & Tools

DeepSeek

A Chinese artificial intelligence company founded in 2023, known for developing open-source large language models including DeepSeek-R1 and DeepSeek-V3 that achieved performance competitive with leading Western AI systems.

5 min read · Updated May 2026

Foundations

Direct Preference Optimization

Direct Preference Optimization (DPO) is a stable, computationally efficient algorithm for aligning large language models with human preferences by directly optimising a policy from comparison data, without training a separate reward model or using reinforcement learning.

6 min read · Updated June 2026

Models

Doubao

A suite of large language models and a consumer AI assistant developed by ByteDance, the parent company of TikTok, reaching 159 million monthly active users and embedded across ByteDance's content, social, and device ecosystems.

4 min read · Updated June 2026

Models

ERNIE Bot

A large language model and conversational AI assistant developed by Baidu, built on the ERNIE (Enhanced Representation through Knowledge Integration) foundation model series and integrated across Baidu's search, cloud, and enterprise platforms.

5 min read · Updated June 2026

Models

Falcon LLM

A family of open-weight large language models developed by the Technology Innovation Institute (TII) in Abu Dhabi, released under permissive licences and used widely across enterprise and research applications.

6 min read · Updated June 2026

Models

Foundation Model

A large-scale AI model pretrained on broad, diverse datasets and designed to be adapted to a wide range of downstream tasks through fine-tuning, prompting, or retrieval augmentation.

6 min read · Updated June 2026

Applications

Function Calling

Function calling is the structured mechanism by which a large language model returns a JSON-formatted invocation of a named function with typed arguments, enabling reliable integration of LLMs with external systems.

6 min read · Updated May 2026

Models

Gemini

Gemini is a family of multimodal large language models developed by Google DeepMind, designed to natively process and generate text, code, images, audio, and video across a range of model sizes.

6 min read · Updated May 2026

Models

Gemma

Gemma is a family of open-weight large language models developed by Google DeepMind, built on similar technology to the Gemini series and available for deployment on hardware ranging from laptops to cloud infrastructure.

5 min read · Updated June 2026

Models

GPT-4

GPT-4 is a large multimodal language model developed by OpenAI, released in March 2023, that accepts both image and text inputs and demonstrates human-level performance on numerous professional and academic benchmarks.

6 min read · Updated May 2026

Models

Grok

Grok is a series of large language models developed by xAI, Elon Musk's AI company, featuring real-time web integration, advanced reasoning modes, and deep tool-use capabilities.

6 min read · Updated May 2026

Companies & Tools

Groq

Groq is an American AI inference company that developed the Language Processing Unit (LPU), a custom silicon architecture optimised for high-throughput, low-latency inference of large language models using on-chip SRAM rather than external DRAM.

5 min read · Updated June 2026

Companies & Tools

Helicone

Helicone is an open-source LLM observability and gateway platform that enables developers to monitor, debug, and optimise large language model applications in production with minimal integration effort.

5 min read · Updated June 2026

Models

Hunyuan

A family of large language models developed by Tencent, integrated across WeChat, QQ, and Tencent Cloud, offering multimodal capabilities including text, image, video, voice, and 3D generation through a unified omni-modal architecture.

5 min read · Updated June 2026

Applications

In-Context Learning

In-context learning is the ability of large language models to perform new tasks by conditioning on examples or instructions provided within the input prompt, without updating model weights.

5 min read · Updated June 2026

Foundations

Instruction Tuning

Instruction tuning is a supervised fine-tuning technique that trains large language models on datasets of instruction-response pairs, enabling models to follow natural language directions and generalise to unseen tasks in a zero-shot or few-shot setting.

7 min read · Updated June 2026

Models

Kimi

A conversational AI assistant and long-context large language model developed by Moonshot AI, a Beijing startup, known for its industry-leading context window lengths and strong performance on agentic reasoning tasks.

4 min read · Updated June 2026

Infrastructure

LangChain

LangChain is an open-source framework for building applications powered by large language models, providing composable abstractions for chaining LLM calls with tools, memory, and data retrieval in Python and JavaScript.

6 min read · Updated May 2026

Infrastructure

Langfuse

Langfuse is an open-source LLM engineering platform that provides observability, tracing, prompt management, evaluation, and dataset tooling for teams building applications on top of large language models.

6 min read · Updated June 2026

Companies & Tools

LangSmith

LangSmith is an observability, tracing, and evaluation platform from LangChain for debugging, monitoring, and continuously improving large language model and AI agent applications in production.

4 min read · Updated June 2026

Foundations

Large Language Models

Large language models (LLMs) are AI systems trained on vast corpora of text to predict and generate natural language. They underpin modern chatbots, code assistants, and generative AI applications.

5 min read · Updated May 2026

Models

Llama

Llama is a family of open-weight large language models developed by Meta AI, released under Meta's community licence, which allows researchers and developers to download, fine-tune, and deploy the models for research and commercial use subject to its terms.

6 min read · Updated May 2026

Companies & Tools

Meta AI

Meta AI is the artificial intelligence research division and product brand of Meta Platforms, responsible for the Llama family of open-weight language models and integrated AI assistants across Facebook, Instagram, WhatsApp, and Messenger.

6 min read · Updated May 2026

Companies & Tools

Microsoft Copilot

Microsoft Copilot is an AI-powered assistant integrated across Microsoft's product ecosystem — including Windows, Microsoft 365, Edge, and Azure — using large language models to assist with writing, coding, data analysis, and task automation.

5 min read · Updated June 2026

Companies & Tools

MiniMax

A Chinese AI company and model developer known for the MiniMax-M1 and M2 large language models featuring ultra-long context windows of up to 4 million tokens, strong agentic performance, and open MIT-licensed releases.

5 min read · Updated June 2026

Companies & Tools

Mistral AI

Mistral AI is a French artificial intelligence company founded in 2023 that develops and releases open-weight and proprietary large language models, notable for its competitive performance-to-efficiency ratio and commitment to open-source distribution.

6 min read · Updated May 2026

Models

Mixtral

Mixtral is a family of open-weight sparse mixture-of-experts large language models developed by Mistral AI, comprising Mixtral 8x7B and Mixtral 8x22B, released under the Apache 2.0 licence.

5 min read · Updated May 2026

Infrastructure

Model Context Protocol

The Model Context Protocol (MCP) is an open standard introduced by Anthropic in 2024 that defines a universal interface for connecting large language models to external tools, data sources, and services.

6 min read · Updated June 2026

Companies & Tools

OpenAI

An American artificial intelligence research organisation and technology company, founded in 2015, known for developing the GPT series of language models and the ChatGPT conversational AI platform.

5 min read · Updated May 2026

Companies & Tools

Perplexity AI

Perplexity AI is an American AI company that operates an answer engine combining real-time web search with large language model synthesis, providing cited, conversational responses to user queries.

5 min read · Updated May 2026

Models

Phi (Language Model)

A family of small language models developed by Microsoft Research that demonstrate strong reasoning and instruction-following at parameter counts an order of magnitude smaller than typical frontier models.

4 min read · Updated May 2026

Applications

Prompt Engineering

The practice of designing and optimising input instructions given to large language models to elicit accurate, relevant, and well-structured outputs for a given task or application.

7 min read · Updated May 2026

Infrastructure

Prompt Injection

Prompt injection is a security vulnerability affecting large language model applications in which an attacker embeds adversarial instructions in model inputs to override the system's intended behaviour, bypass safety controls, or exfiltrate sensitive information.

7 min read · Updated June 2026

Applications

Question Answering

Question answering is the natural language processing task of producing accurate answers to questions posed in natural language, often using information retrieval, reading comprehension, or large language models.

5 min read · Updated May 2026

Models

Qwen

Qwen is a family of large language models developed by Alibaba Cloud, ranging from small open-weight dense models to trillion-parameter mixture-of-experts systems, with strong multilingual and reasoning capabilities.

4 min read · Updated May 2026

Models

Reasoning Models

Reasoning models are large language models trained to generate extended internal deliberation before producing a final answer, using test-time compute to improve accuracy on complex tasks such as mathematics, coding, and multi-step logic.

6 min read · Updated June 2026

Foundations

Reinforcement Learning from Human Feedback

A machine learning technique that trains a reward model from human preference data and uses it to align large language models with human values, safety requirements, and intended behaviour through reinforcement learning.

7 min read · Updated May 2026

Applications

Retrieval-Augmented Generation

A technique that enhances large language model outputs by retrieving relevant documents from an external knowledge base at inference time, grounding responses in up-to-date and domain-specific information.

6 min read · Updated May 2026

Models

Spark

A large language model developed by iFlyTek, a Chinese AI company specialising in speech recognition and natural language processing, notable for its multilingual capabilities covering over 130 languages including Malay and other ASEAN languages.

5 min read · Updated June 2026

Applications

Speech Recognition

Speech recognition, or automatic speech recognition (ASR), is the technology that enables computers to identify and transcribe spoken language into text using acoustic models, language models, and deep learning architectures.

6 min read · Updated May 2026

Applications

Text Summarisation

Text summarisation is the natural language processing task of producing a shorter version of a document that preserves its key information, using extractive or abstractive techniques.

4 min read · Updated May 2026

Foundations

Token

A token is the smallest unit of text processed by a large language model, typically a word, subword, or character, and serves as the fundamental input and output element during training and inference.

6 min read · Updated June 2026

Foundations

Tokenisation

Tokenisation is the process of breaking text into discrete units called tokens — which may represent words, subwords, characters, or symbols — that serve as the fundamental input units for language models and other natural language processing systems.

6 min read · Updated May 2026

Applications

Tool Use

Tool use in AI refers to the capability of language models to invoke external functions, APIs, or services to retrieve information, perform actions, or extend their abilities beyond text generation.

6 min read · Updated May 2026

Foundations

Transformer Architecture

A neural network architecture introduced in 2017 that uses self-attention mechanisms to process sequential data in parallel, forming the foundation of modern large language models and multimodal AI systems.

7 min read · Updated May 2026

Models

Vision-Language Model

A multimodal AI system that jointly processes and generates information from both images and text, extending large language models with visual perception capabilities through cross-modal alignment.

5 min read · Updated June 2026

Models

Yi

A family of open-source bilingual (Chinese-English) large language models developed by 01.AI, the Beijing-based AI startup founded by Kai-Fu Lee, reported to perform competitively against Llama 2 and Falcon.

4 min read · Updated June 2026