Search Results
61 results for “language model”
AI Guardrails
AI guardrails are runtime safety mechanisms that validate, filter, and enforce policies on large language model inputs and outputs in production systems, preventing harmful content, data leakage, prompt injection, and off-topic behaviour.
AI21 Labs
An Israeli artificial intelligence company founded in 2017 that develops large language models, including the Jurassic and Jamba families, and enterprise agentic platforms such as Maestro.
Anthropic
Anthropic is an American AI safety company and large language model developer founded in 2021 by former OpenAI researchers, best known for developing the Claude family of AI assistants and the Constitutional AI alignment technique.
Autonomous Agents
Autonomous AI agents are software systems that use large language models as a reasoning core, enabling them to plan multi-step tasks, use external tools, maintain memory, and take actions to achieve goals with minimal human intervention.
Azure AI
Azure AI is Microsoft's integrated portfolio of artificial intelligence services hosted on the Azure cloud platform, encompassing pre-built cognitive APIs, a managed machine learning platform, large language model access, and enterprise AI development tools.
Chain-of-Thought Prompting
A prompt engineering technique that improves large language model reasoning on complex tasks by instructing the model to generate explicit intermediate reasoning steps before arriving at a final answer.
Chatbot
A chatbot is a software application designed to simulate human conversation through text or voice, ranging from simple rule-based systems to sophisticated AI assistants powered by large language models.
ChatGLM
A family of open-source bilingual (Chinese-English) large language models developed by Zhipu AI and Tsinghua University, known for strong reasoning capabilities, large context windows, and enterprise-grade open-weight releases under MIT licensing.
Claude (Language Model)
A family of large language models developed by Anthropic, designed with a focus on safety, helpfulness, and Constitutional AI training methods for enterprise and consumer use.
Code Generation
AI code generation is the use of large language models to automatically produce, complete, or transform source code from natural language descriptions, enabling assisted and autonomous software development.
Cohere
Cohere is a Canadian AI company specialising in enterprise large language models, offering Command, Embed, and Rerank model families alongside secure deployment infrastructure designed for regulated industries.
Comet ML
Comet ML is a cloud-based MLOps platform for tracking machine learning experiments, managing model versions, monitoring production models, and evaluating large language model applications.
Constitutional AI
Constitutional AI is an alignment method developed by Anthropic that trains language models to follow a set of written ethical principles by using the model itself to critique and revise its own outputs, reducing dependence on human feedback for harmlessness.
Context Window
The maximum number of tokens — including the prompt, prior conversation, retrieved documents, and the model's own output — that a large language model can process in a single forward pass.
DeepSeek
A Chinese artificial intelligence company founded in 2023, known for developing open-source large language models including DeepSeek-R1 and DeepSeek-V3 that achieved performance competitive with leading Western AI systems.
Direct Preference Optimization
Direct Preference Optimization (DPO) is a stable, computationally efficient algorithm for aligning large language models with human preferences by directly optimising a policy from comparison data, without training a separate reward model or using reinforcement learning.
Doubao
A suite of large language models and consumer AI assistant developed by ByteDance, the parent company of TikTok, reaching 159 million monthly active users and embedded across ByteDance's content, social, and device ecosystems.
ERNIE Bot
A large language model and conversational AI assistant developed by Baidu, built on the ERNIE (Enhanced Representation through Knowledge Integration) foundation model series and integrated across Baidu's search, cloud, and enterprise platforms.
Falcon LLM
A family of open-weight large language models developed by the Technology Innovation Institute (TII) in Abu Dhabi, released under permissive licenses and used widely across enterprise and research applications.
Foundation Model
A large-scale AI model pretrained on broad, diverse datasets and designed to be adapted to a wide range of downstream tasks through fine-tuning, prompting, or retrieval augmentation.
Function Calling
Function calling is the structured mechanism by which a large language model returns a JSON-formatted invocation of a named function with typed arguments, enabling reliable integration of LLMs with external systems.
Gemini
Gemini is a family of multimodal large language models developed by Google DeepMind, designed to natively process and generate text, code, images, audio, and video across a range of model sizes.
Gemma
Gemma is a family of open-weight large language models developed by Google DeepMind, built on similar technology to the Gemini series and available for deployment on hardware ranging from laptops to cloud infrastructure.
GPT-4
GPT-4 is a large multimodal language model developed by OpenAI, released in March 2023, that accepts both image and text inputs and demonstrates human-level performance on numerous professional and academic benchmarks.
Grok
Grok is a series of large language models developed by xAI, Elon Musk's AI company, featuring real-time web integration, advanced reasoning modes, and deep tool-use capabilities.
Groq
Groq is an American AI inference company that developed the Language Processing Unit (LPU), a custom silicon architecture optimised for high-throughput, low-latency inference of large language models using on-chip SRAM rather than external DRAM.
Helicone
Helicone is an open-source LLM observability and gateway platform that enables developers to monitor, debug, and optimise large language model applications in production with minimal integration effort.
Hunyuan
A family of large language models developed by Tencent, integrated across WeChat, QQ, and Tencent Cloud, offering multimodal capabilities including text, image, video, voice, and 3D generation through a unified omni-modal architecture.
In-Context Learning
In-context learning is the ability of large language models to perform new tasks by conditioning on examples or instructions provided within the input prompt, without updating model weights.
Instruction Tuning
Instruction tuning is a supervised fine-tuning technique that trains large language models on datasets of instruction-response pairs, enabling models to follow natural language directions and generalise to unseen tasks in a zero-shot or few-shot setting.
Kimi
A conversational AI assistant and long-context large language model developed by Moonshot AI, a Beijing startup, known for its industry-leading context window lengths and strong performance on agentic reasoning tasks.
LangChain
LangChain is an open-source framework for building applications powered by large language models, providing composable abstractions for chaining LLM calls with tools, memory, and data retrieval in Python and JavaScript.
Langfuse
Langfuse is an open-source LLM engineering platform that provides observability, tracing, prompt management, evaluation, and dataset tooling for teams building applications on top of large language models.
LangSmith
LangSmith is an observability, tracing, and evaluation platform from LangChain for debugging, monitoring, and continuously improving large language model and AI agent applications in production.
Large Language Models
Large language models (LLMs) are AI systems trained on vast corpora of text to predict and generate natural language. They underpin modern chatbots, code assistants, and generative AI applications.
Llama
Llama is a family of open-weight large language models developed by Meta AI, released under a permissive licence that allows researchers and developers to freely download, fine-tune, and deploy the models for both research and commercial use.
Meta AI
Meta AI is the artificial intelligence research division and product brand of Meta Platforms, responsible for the Llama family of open-weight language models and integrated AI assistants across Facebook, Instagram, WhatsApp, and Messenger.
Microsoft Copilot
Microsoft Copilot is an AI-powered assistant integrated across Microsoft's product ecosystem — including Windows, Microsoft 365, Edge, and Azure — using large language models to assist with writing, coding, data analysis, and task automation.
MiniMax
A Chinese AI company and model developer known for the MiniMax-M1 and M2 large language models featuring ultra-long context windows of up to 4 million tokens, strong agentic performance, and open MIT-licensed releases.
Mistral AI
Mistral AI is a French artificial intelligence company founded in 2023 that develops and releases open-weight and proprietary large language models, notable for its competitive performance-to-efficiency ratio and commitment to open-source distribution.
Mixtral
Mixtral is a family of open-weight sparse mixture-of-experts large language models developed by Mistral AI, comprising Mixtral 8x7B and Mixtral 8x22B, released under the Apache 2.0 licence.
Model Context Protocol
The Model Context Protocol (MCP) is an open standard introduced by Anthropic in 2024 that defines a universal interface for connecting large language models to external tools, data sources, and services.
OpenAI
An American artificial intelligence research organisation and technology company, founded in 2015, known for developing the GPT series of language models and the ChatGPT conversational AI platform.
Perplexity AI
Perplexity AI is an American AI company that operates an answer engine combining real-time web search with large language model synthesis, providing cited, conversational responses to user queries.
Phi (Language Model)
A family of small language models developed by Microsoft Research that demonstrate strong reasoning and instruction-following at parameter counts an order of magnitude smaller than typical frontier models.
Prompt Engineering
The practice of designing and optimising input instructions given to large language models to elicit accurate, relevant, and well-structured outputs for a given task or application.
Prompt Injection
Prompt injection is a security vulnerability affecting large language model applications in which an attacker embeds adversarial instructions in model inputs to override the system's intended behaviour, bypass safety controls, or exfiltrate sensitive information.
Question Answering
Question answering is the natural language processing task of producing accurate answers to questions posed in natural language, often using information retrieval, reading comprehension, or large language models.
Qwen
Qwen is a family of large language models developed by Alibaba Cloud, ranging from small open-weight dense models to trillion-parameter mixture-of-experts systems, with strong multilingual and reasoning capabilities.
Reasoning Models
Reasoning models are large language models trained to generate extended internal deliberation before producing a final answer, using test-time compute to improve accuracy on complex tasks such as mathematics, coding, and multi-step logic.
Reinforcement Learning from Human Feedback
A machine learning technique that trains a reward model from human preference data and uses it to align large language models with human values, safety requirements, and intended behaviour through reinforcement learning.
Retrieval-Augmented Generation
A technique that enhances large language model outputs by retrieving relevant documents from an external knowledge base at inference time, grounding responses in up-to-date and domain-specific information.
Spark
A large language model developed by iFlyTek, a Chinese AI company specialising in speech recognition and natural language processing, notable for its multilingual capabilities covering over 130 languages including Malay and other ASEAN languages.
Speech Recognition
Speech recognition, or automatic speech recognition (ASR), is the technology that enables computers to identify and transcribe spoken language into text using acoustic models, language models, and deep learning architectures.
Text Summarisation
Text summarisation is the natural language processing task of producing a shorter version of a document that preserves its key information, using extractive or abstractive techniques.
Token
A token is the smallest unit of text processed by a large language model, typically representing a word, subword, or character used as the fundamental input and output element during inference.
Tokenisation
Tokenisation is the process of breaking text into discrete units called tokens — which may represent words, subwords, characters, or symbols — that serve as the fundamental input units for language models and other natural language processing systems.
Tool Use
Tool use in AI refers to the capability of language models to invoke external functions, APIs, or services to retrieve information, perform actions, or extend their abilities beyond text generation.
Transformer Architecture
A neural network architecture introduced in 2017 that uses self-attention mechanisms to process sequential data in parallel, forming the foundation of modern large language models and multimodal AI systems.
Vision-Language Model
A multimodal AI system that jointly processes and generates information from both images and text, extending large language models with visual perception capabilities through cross-modal alignment.
Yi
A family of open-source bilingual large language models developed by 01.AI, the Beijing-based AI startup founded by Kai-Fu Lee, achieving competitive performance against Llama 2 and Falcon with strong Chinese and English bilingual capability.