Search Results
51 results for “LLM”
AI Agents
Autonomous AI systems that perceive their environment, reason over goals, select and execute actions using external tools, and operate across multi-step tasks with minimal human intervention.
AI Benchmarking
The systematic evaluation of AI systems using standardised datasets, tasks, and metrics to measure capability, compare models, and track progress across research and deployment contexts.
AI Guardrails
AI guardrails are runtime safety mechanisms that validate, filter, and enforce policies on large language model inputs and outputs in production systems, preventing harmful content, data leakage, prompt injection, and off-topic behaviour.
AI Memory
AI memory refers to the mechanisms that allow artificial intelligence agents to retain, retrieve, and use information across interactions, extending capability beyond a single context window.
AI Planning
AI planning is the discipline of automatically generating a sequence of actions that an intelligent agent can execute to move from an initial state to a goal, increasingly used inside LLM-based agents to decompose and reason about complex tasks.
Arize AI
Arize AI is an American ML observability and LLM evaluation platform that helps teams monitor, debug, and improve artificial intelligence models in production, offering both open-source and enterprise-grade tooling.
AutoGen
AutoGen is an open-source multi-agent conversation framework developed by Microsoft Research that enables developers to build LLM applications where multiple AI agents communicate with each other to accomplish complex tasks collaboratively.
Autonomous Agents
Autonomous AI agents are software systems that use large language models as a reasoning core, enabling them to plan multi-step tasks, use external tools, maintain memory, and take actions to achieve goals with minimal human intervention.
Chain-of-Thought Prompting
A prompt engineering technique that improves large language model reasoning on complex tasks by instructing the model to generate explicit intermediate reasoning steps before arriving at a final answer.
Chatbot
A chatbot is a software application designed to simulate human conversation through text or voice, ranging from simple rule-based systems to sophisticated AI assistants powered by large language models.
ChatGLM
A family of open-source bilingual (Chinese-English) large language models developed by Zhipu AI and Tsinghua University, known for strong reasoning capabilities, large context windows, and enterprise-grade open-weight releases under MIT licensing.
Claude (Language Model)
A family of large language models developed by Anthropic, designed with a focus on safety, helpfulness, and Constitutional AI training methods for enterprise and consumer use.
Code Generation
AI code generation is the use of large language models to automatically produce, complete, or transform source code from natural language descriptions, enabling assisted and autonomous software development.
Cohere
Cohere is a Canadian AI company specialising in enterprise large language models, offering Command, Embed, and Rerank model families alongside secure deployment infrastructure designed for regulated industries.
Comet ML
Comet ML is a cloud-based MLOps platform for tracking machine learning experiments, managing model versions, monitoring production models, and evaluating large language model applications.
Context Window
The maximum number of tokens (including the prompt, prior conversation, retrieved documents, and the model's own output) that a large language model can attend to at once when generating a response.
DeepSeek
A Chinese artificial intelligence company founded in 2023, known for developing open-source large language models including DeepSeek-R1 and DeepSeek-V3 that achieved performance competitive with leading Western AI systems.
Doubao
A suite of large language models and a consumer AI assistant developed by ByteDance, the parent company of TikTok, reaching 159 million monthly active users and embedded across ByteDance's content, social, and device ecosystems.
ERNIE Bot
A large language model and conversational AI assistant developed by Baidu, built on the ERNIE (Enhanced Representation through Knowledge Integration) foundation model series and integrated across Baidu's search, cloud, and enterprise platforms.
Falcon LLM
A family of open-weight large language models developed by the Technology Innovation Institute (TII) in Abu Dhabi, released under permissive licenses and used widely across enterprise and research applications.
Fine-Tuning
The process of further training a pre-trained machine learning model on a smaller, task-specific dataset to adapt its weights for a particular domain, task, or desired behaviour.
Function Calling
Function calling is the structured mechanism by which a large language model returns a JSON-formatted invocation of a named function with typed arguments, enabling reliable integration of LLMs with external systems.
Generative AI
Generative AI refers to artificial intelligence systems capable of producing new content — text, images, audio, video, or code — by learning the underlying distribution of training data.
Hallucination (AI)
A phenomenon in which an artificial intelligence system generates output that is factually incorrect, fabricated, or unsupported by its input, while presenting it with apparent confidence.
Helicone
Helicone is an open-source LLM observability and gateway platform that enables developers to monitor, debug, and optimise large language model applications in production with minimal integration effort.
Hunyuan
A family of large language models developed by Tencent, integrated across WeChat, QQ, and Tencent Cloud, offering multimodal capabilities including text, image, video, voice, and 3D generation through a unified omni-modal architecture.
In-Context Learning
In-context learning is the ability of large language models to perform new tasks by conditioning on examples or instructions provided within the input prompt, without updating model weights.
Kimi
A conversational AI assistant and long-context large language model developed by Moonshot AI, a Beijing startup, known for its industry-leading context window lengths and strong performance on agentic reasoning tasks.
LangChain
LangChain is an open-source framework for building applications powered by large language models, providing composable abstractions for chaining LLM calls with tools, memory, and data retrieval in Python and JavaScript.
Langfuse
Langfuse is an open-source LLM engineering platform that provides observability, tracing, prompt management, evaluation, and dataset tooling for teams building applications on top of large language models.
LangSmith
LangSmith is an observability, tracing, and evaluation platform from LangChain for debugging, monitoring, and continuously improving large language model and AI agent applications in production.
Large Language Models
Large language models (LLMs) are AI systems trained on vast corpora of text to predict and generate natural language. They underpin modern chatbots, code assistants, and generative AI applications.
LlamaIndex
LlamaIndex is an open-source Python and TypeScript framework for building retrieval-augmented and agentic AI applications over private data sources.
LoRA (Low-Rank Adaptation)
LoRA is a parameter-efficient fine-tuning technique that adapts large pre-trained models by injecting small trainable low-rank matrices into transformer layers, drastically reducing the number of trainable parameters while typically matching or closely approaching the performance of full fine-tuning.
MaLLaM (Malaysia Large Language Model)
MaLLaM is a family of large language models developed by Malaysian startup Mesolitica, pretrained from scratch on Malay-language data to understand Malaysian dialects, colloquialisms, and regional languages.
Markov Decision Process
A Markov decision process is a mathematical framework for modelling sequential decision-making in which outcomes are partly random and partly under the control of a decision-maker.
MiniMax
A Chinese AI company and model developer known for the MiniMax-M1 and M2 large language models featuring ultra-long context windows of up to 4 million tokens, strong agentic performance, and open MIT-licensed releases.
Mistral AI
Mistral AI is a French artificial intelligence company founded in 2023 that develops and releases open-weight and proprietary large language models, notable for its competitive performance-to-efficiency ratio and commitment to open-source distribution.
Ollama
Ollama is an open-source runtime that enables developers and researchers to download, run, and manage large language models locally on consumer hardware without cloud API dependencies.
Perplexity AI
Perplexity AI is an American AI company that operates an answer engine combining real-time web search with large language model synthesis, providing cited, conversational responses to user queries.
Prompt Caching
Prompt caching is an inference optimisation technique that stores precomputed key-value representations of repeated prompt prefixes, reducing latency and token processing costs for applications with stable system prompts or long shared contexts.
Prompt Engineering
The practice of designing and optimising input instructions given to large language models to elicit accurate, relevant, and well-structured outputs for a given task or application.
Prompt Injection
Prompt injection is a security vulnerability affecting large language model applications in which an attacker embeds adversarial instructions in model inputs to override the system's intended behaviour, bypass safety controls, or exfiltrate sensitive information.
Retrieval-Augmented Generation
A technique that enhances large language model outputs by retrieving relevant documents from an external knowledge base at inference time, grounding responses in up-to-date and domain-specific information.
Spark
A large language model developed by iFlyTek, a Chinese AI company specialising in speech recognition and natural language processing, notable for its multilingual capabilities covering over 130 languages including Malay and other ASEAN languages.
Sparse Autoencoder
A sparse autoencoder is a type of autoencoder trained with a sparsity constraint that forces most neurons in the hidden layer to be inactive for any given input, which tends to produce a more disentangled, interpretable feature decomposition.
Speculative Decoding
Speculative decoding is an inference acceleration technique that uses a small draft model to propose multiple candidate tokens that a larger target model then verifies in parallel, with reported speedups of roughly 2-4x and no change to the target model's output distribution.
Tool Use
Tool use in AI refers to the capability of language models to invoke external functions, APIs, or services to retrieve information, perform actions, or extend their abilities beyond text generation.
Vibe Coding
Vibe coding is an AI-assisted software development practice in which a developer describes intent in natural language and a large language model generates the code, with the human guiding and testing rather than writing it directly.
vLLM
vLLM is an open-source library for fast and memory-efficient large language model inference and serving, built around the PagedAttention algorithm for optimised GPU memory management.
Yi
A family of open-source bilingual large language models developed by 01.AI, the Beijing-based AI startup founded by Kai-Fu Lee, achieving competitive performance against Llama 2 and Falcon with strong capability in both Chinese and English.