Search Results
9 results for “language-model”
BERT
BERT (Bidirectional Encoder Representations from Transformers) is a pre-trained transformer-based language model developed by Google that reads text bidirectionally to understand word context in natural language tasks.
Claude (Language Model)
A family of large language models developed by Anthropic, trained with Constitutional AI methods and designed with a focus on safety and helpfulness for enterprise and consumer use.
Falcon LLM
A family of open-weight large language models developed by the Technology Innovation Institute (TII) in Abu Dhabi, released under permissive licenses and used widely across enterprise and research applications.
GPT-4
GPT-4 is a large multimodal language model developed by OpenAI, released in March 2023, that accepts both image and text inputs and demonstrates human-level performance on numerous professional and academic benchmarks.
Grok
Grok is a series of large language models developed by xAI, Elon Musk's AI company, featuring real-time web integration, advanced reasoning modes, and deep tool-use capabilities.
Llama
Llama is a family of open-weight large language models developed by Meta AI, released under Meta's community licence, which lets researchers and developers download, fine-tune, and deploy the models for research and commercial use, subject to the licence's usage conditions.
Mixtral
Mixtral is a family of open-weight sparse mixture-of-experts large language models developed by Mistral AI, comprising Mixtral 8x7B and Mixtral 8x22B, released under the Apache 2.0 licence.
Qwen
Qwen is a family of large language models developed by Alibaba Cloud, ranging from small open-weight dense models to trillion-parameter mixture-of-experts systems, with strong multilingual and reasoning capabilities.
Tokenisation
Tokenisation is the process of breaking text into discrete units called tokens, which may represent words, subwords, characters, or symbols and which serve as the fundamental input units for language models and other natural language processing systems.
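The three granularities named in the Tokenisation entry can be illustrated with a minimal Python sketch. It is illustrative only: the function names and the toy vocabulary are hypothetical, and the greedy longest-match rule used for subwords is a simplification, not the algorithm of any model listed here.

```python
import re

def word_tokens(text):
    # Word-level: runs of word characters, plus each punctuation symbol on its own.
    return re.findall(r"\w+|[^\w\s]", text)

def char_tokens(text):
    # Character-level: every character (including spaces) is a token.
    return list(text)

def subword_tokens(word, vocab):
    # Subword-level: greedy longest-match against a toy vocabulary (assumed).
    # A single character is emitted as a fallback when nothing longer matches.
    tokens = []
    start = 0
    while start < len(word):
        end = len(word)
        while end > start + 1 and word[start:end] not in vocab:
            end -= 1
        tokens.append(word[start:end])
        start = end
    return tokens

toy_vocab = {"token", "isation", "is"}  # hypothetical vocabulary
print(word_tokens("Tokens, please!"))
print(char_tokens("ab"))
print(subword_tokens("tokenisation", toy_vocab))
```

Production tokenisers typically learn their subword vocabulary from data rather than using a hand-written set as assumed above.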