AIWiki

Search Results

30 results for language models

Companies & Tools

AI21 Labs

An Israeli artificial intelligence company founded in 2017 that develops large language models, including the Jurassic and Jamba families, and enterprise agentic platforms such as Maestro.

4 min read · Updated May 2026
Companies & Tools

Anthropic

Anthropic is an American AI safety company and large language model developer founded in 2021 by former OpenAI researchers, best known for developing the Claude family of AI assistants and the Constitutional AI alignment technique.

7 min read · Updated May 2026
Applications

Autonomous Agents

Autonomous AI agents are software systems that use large language models as a reasoning core, enabling them to plan multi-step tasks, use external tools, maintain memory, and take actions to achieve goals with minimal human intervention.

6 min read · Updated May 2026
Applications

Chatbot

A chatbot is a software application designed to simulate human conversation through text or voice, ranging from simple rule-based systems to sophisticated AI assistants powered by large language models.

3 min read · Updated May 2026
Models

Claude (Language Model)

A family of large language models developed by Anthropic, designed with a focus on safety, helpfulness, and Constitutional AI training methods for enterprise and consumer use.

5 min read · Updated May 2026
Applications

Code Generation

AI code generation is the use of large language models to automatically produce, complete, or transform source code from natural language descriptions, enabling assisted and autonomous software development.

6 min read · Updated May 2026
Companies & Tools

Cohere

Cohere is a Canadian AI company specialising in enterprise large language models, offering Command, Embed, and Rerank model families alongside secure deployment infrastructure designed for regulated industries.

6 min read · Updated May 2026
Foundations

Constitutional AI

Constitutional AI is an alignment method developed by Anthropic that trains language models to follow a set of written ethical principles by using the model itself to critique and revise its own outputs, reducing dependence on human feedback for harmlessness.

6 min read · Updated May 2026
Companies & Tools

DeepSeek

A Chinese artificial intelligence company founded in 2023, known for developing open-source large language models including DeepSeek-R1 and DeepSeek-V3 that achieved performance competitive with leading Western AI systems.

5 min read · Updated May 2026
Models

Falcon LLM

A family of open-weight large language models developed by the Technology Innovation Institute (TII) in Abu Dhabi, released under permissive licences and used widely across enterprise and research applications.

6 min read · Updated June 2026
Models

Gemini

Gemini is a family of multimodal large language models developed by Google DeepMind, designed to natively process and generate text, code, images, audio, and video across a range of model sizes.

6 min read · Updated May 2026
Models

Grok

Grok is a series of large language models developed by xAI, Elon Musk's AI company, featuring real-time web integration, advanced reasoning modes, and deep tool-use capabilities.

6 min read · Updated May 2026
Infrastructure

LangChain

LangChain is an open-source framework for building applications powered by large language models, providing composable abstractions for chaining LLM calls with tools, memory, and data retrieval in Python and JavaScript.

6 min read · Updated May 2026
Infrastructure

Langfuse

Langfuse is an open-source LLM engineering platform that provides observability, tracing, prompt management, evaluation, and dataset tooling for teams building applications on top of large language models.

6 min read · Updated June 2026
Foundations

Large Language Models

Large language models (LLMs) are AI systems trained on vast corpora of text to predict and generate natural language. They underpin modern chatbots, code assistants, and generative AI applications.

5 min read · Updated May 2026
Models

Llama

Llama is a family of open-weight large language models developed by Meta AI, released under Meta's own community licence, which lets researchers and developers download, fine-tune, and deploy the models for research and most commercial uses, subject to the licence's conditions.

6 min read · Updated May 2026
Companies & Tools

Meta AI

Meta AI is the artificial intelligence research division and product brand of Meta Platforms, responsible for the Llama family of open-weight language models and integrated AI assistants across Facebook, Instagram, WhatsApp, and Messenger.

6 min read · Updated May 2026
Companies & Tools

Mistral AI

Mistral AI is a French artificial intelligence company founded in 2023 that develops and releases open-weight and proprietary large language models, notable for its competitive performance-to-efficiency ratio and commitment to open-source distribution.

6 min read · Updated May 2026
Models

Mixtral

Mixtral is a family of open-weight sparse mixture-of-experts large language models developed by Mistral AI, comprising Mixtral 8x7B and Mixtral 8x22B, released under the Apache 2.0 licence.

5 min read · Updated May 2026
Companies & Tools

OpenAI

An American artificial intelligence research organisation and technology company, founded in 2015, known for developing the GPT series of language models and the ChatGPT conversational AI platform.

5 min read · Updated May 2026
Models

Phi (Language Model)

A family of small language models developed by Microsoft Research that demonstrate strong reasoning and instruction-following at parameter counts an order of magnitude smaller than typical frontier models.

4 min read · Updated May 2026
Applications

Prompt Engineering

The practice of designing and optimising input instructions given to large language models to elicit accurate, relevant, and well-structured outputs for a given task or application.

7 min read · Updated May 2026
Applications

Question Answering

Question answering is the natural language processing task of producing accurate answers to questions posed in natural language, often using information retrieval, reading comprehension, or large language models.

5 min read · Updated May 2026
Models

Qwen

Qwen is a family of large language models developed by Alibaba Cloud, ranging from small open-weight dense models to trillion-parameter mixture-of-experts systems, with strong multilingual and reasoning capabilities.

4 min read · Updated May 2026
Foundations

Reinforcement Learning from Human Feedback

A machine learning technique that trains a reward model from human preference data and uses it to align large language models with human values, safety requirements, and intended behaviour through reinforcement learning.

7 min read · Updated May 2026
Applications

Speech Recognition

Speech recognition, or automatic speech recognition (ASR), is the technology that enables computers to identify and transcribe spoken language into text using acoustic models, language models, and deep learning architectures.

6 min read · Updated May 2026
Applications

Text Summarisation

Text summarisation is the natural language processing task of producing a shorter version of a document that preserves its key information, using extractive or abstractive techniques.

4 min read · Updated May 2026
Foundations

Tokenisation

Tokenisation is the process of breaking text into discrete units called tokens — which may represent words, subwords, characters, or symbols — that serve as the fundamental input units for language models and other natural language processing systems.

6 min read · Updated May 2026
Applications

Tool Use

Tool use in AI refers to the capability of language models to invoke external functions, APIs, or services to retrieve information, perform actions, or extend their abilities beyond text generation.

6 min read · Updated May 2026
Foundations

Transformer Architecture

A neural network architecture introduced in 2017 that uses self-attention mechanisms to process sequential data in parallel, forming the foundation of modern large language models and multimodal AI systems.

7 min read · Updated May 2026