Search Results
30 results for “language models”
AI21 Labs
An Israeli artificial intelligence company founded in 2017 that develops large language models, including the Jurassic and Jamba families, and enterprise agentic platforms such as Maestro.
Anthropic
Anthropic is an American AI safety company and large language model developer founded in 2021 by former OpenAI researchers, best known for developing the Claude family of AI assistants and the Constitutional AI alignment technique.
Autonomous Agents
Autonomous AI agents are software systems that use large language models as a reasoning core, enabling them to plan multi-step tasks, use external tools, maintain memory, and take actions to achieve goals with minimal human intervention.
Chatbot
A chatbot is a software application designed to simulate human conversation through text or voice, ranging from simple rule-based systems to sophisticated AI assistants powered by large language models.
Claude (Language Model)
A family of large language models developed by Anthropic, designed with a focus on safety, helpfulness, and Constitutional AI training methods for enterprise and consumer use.
Code Generation
AI code generation is the use of large language models to automatically produce, complete, or transform source code from natural language descriptions, enabling assisted and autonomous software development.
Cohere
Cohere is a Canadian AI company specialising in enterprise large language models, offering Command, Embed, and Rerank model families alongside secure deployment infrastructure designed for regulated industries.
Constitutional AI
Constitutional AI is an alignment method developed by Anthropic that trains language models to follow a set of written ethical principles by using the model itself to critique and revise its own outputs, reducing dependence on human feedback for harmlessness.
DeepSeek
A Chinese artificial intelligence company founded in 2023, known for developing open-source large language models including DeepSeek-R1 and DeepSeek-V3 that achieved performance competitive with leading Western AI systems.
Falcon LLM
A family of open-weight large language models developed by the Technology Innovation Institute (TII) in Abu Dhabi, released under permissive licenses and used widely across enterprise and research applications.
Gemini
Gemini is a family of multimodal large language models developed by Google DeepMind, designed to natively process and generate text, code, images, audio, and video across a range of model sizes.
Grok
Grok is a series of large language models developed by xAI, Elon Musk's AI company, featuring real-time web integration, advanced reasoning modes, and deep tool-use capabilities.
LangChain
LangChain is an open-source framework for building applications powered by large language models, providing composable abstractions for chaining LLM calls with tools, memory, and data retrieval in Python and JavaScript.
Langfuse
Langfuse is an open-source LLM engineering platform that provides observability, tracing, prompt management, evaluation, and dataset tooling for teams building applications on top of large language models.
Large Language Models
Large language models (LLMs) are AI systems trained on vast corpora of text to predict and generate natural language. They underpin modern chatbots, code assistants, and generative AI applications.
Llama
Llama is a family of open-weight large language models developed by Meta AI, released under a permissive licence that allows researchers and developers to freely download, fine-tune, and deploy the models for both research and commercial use.
Meta AI
Meta AI is the artificial intelligence research division and product brand of Meta Platforms, responsible for the Llama family of open-weight language models and integrated AI assistants across Facebook, Instagram, WhatsApp, and Messenger.
Mistral AI
Mistral AI is a French artificial intelligence company founded in 2023 that develops and releases open-weight and proprietary large language models, notable for its competitive performance-to-efficiency ratio and commitment to open-source distribution.
Mixtral
Mixtral is a family of open-weight sparse mixture-of-experts large language models developed by Mistral AI, comprising Mixtral 8x7B and Mixtral 8x22B, released under the Apache 2.0 licence.
OpenAI
An American artificial intelligence research organisation and technology company, founded in 2015, known for developing the GPT series of language models and the ChatGPT conversational AI platform.
Phi (Language Model)
A family of small language models developed by Microsoft Research that demonstrate strong reasoning and instruction-following at parameter counts an order of magnitude smaller than typical frontier models.
Prompt Engineering
The practice of designing and optimising input instructions given to large language models to elicit accurate, relevant, and well-structured outputs for a given task or application.
Question Answering
Question answering is the natural language processing task of producing accurate answers to questions posed in natural language, often using information retrieval, reading comprehension, or large language models.
Qwen
Qwen is a family of large language models developed by Alibaba Cloud, ranging from small open-weight dense models to trillion-parameter mixture-of-experts systems, with strong multilingual and reasoning capabilities.
Reinforcement Learning from Human Feedback
A machine learning technique that trains a reward model from human preference data and uses it to align large language models with human values, safety requirements, and intended behaviour through reinforcement learning.
Speech Recognition
Speech recognition, or automatic speech recognition (ASR), is the technology that enables computers to identify and transcribe spoken language into text using acoustic models, language models, and deep learning architectures.
Text Summarisation
Text summarisation is the natural language processing task of producing a shorter version of a document that preserves its key information, using extractive or abstractive techniques.
Tokenisation
Tokenisation is the process of breaking text into discrete units called tokens — which may represent words, subwords, characters, or symbols — that serve as the fundamental input units for language models and other natural language processing systems.
Tool Use
Tool use in AI refers to the capability of language models to invoke external functions, APIs, or services to retrieve information, perform actions, or extend their abilities beyond text generation.
Transformer Architecture
A neural network architecture introduced in 2017 that uses self-attention mechanisms to process sequential data in parallel, forming the foundation of modern large language models and multimodal AI systems.