Search Results
15 results for “performance”
A/B Testing (ML)
A/B testing in machine learning is a controlled experiment method that compares two or more model variants in production to determine which delivers superior performance on real-world business metrics.
Canary Deployment
Canary deployment is a progressive model release strategy in which a new version is exposed to a small subset of production traffic, allowing teams to validate performance and catch failures before a full rollout.
Deep Learning
Deep learning is a subfield of machine learning that uses multi-layered artificial neural networks to learn hierarchical representations from data, enabling state-of-the-art performance across vision, language, and speech tasks.
DeepSeek
A Chinese artificial intelligence company founded in 2023, known for developing open-source large language models including DeepSeek-R1 and DeepSeek-V3 that achieved performance competitive with leading Western AI systems.
GPT-4
GPT-4 is a large multimodal language model developed by OpenAI, released in March 2023, that accepts both image and text inputs and demonstrates human-level performance on numerous professional and academic benchmarks.
Kimi
A conversational AI assistant and long-context large language model developed by Moonshot AI, a Beijing startup, known for its industry-leading context window lengths and strong performance on agentic reasoning tasks.
Knowledge Distillation
Knowledge distillation is a model compression technique in which a smaller student neural network is trained to replicate the behaviour of a larger, more capable teacher model, enabling deployment of efficient models that approximate teacher-level performance.
LoRA (Low-Rank Adaptation)
LoRA is a parameter-efficient fine-tuning technique that adapts large pre-trained models by injecting small trainable low-rank matrices into transformer layers, drastically reducing the number of trainable parameters without sacrificing performance.
Machine Learning
Machine learning is a subfield of artificial intelligence in which systems improve their performance on tasks through experience — by automatically learning patterns from data rather than following explicitly programmed rules.
MiniMax
A Chinese AI company and model developer known for the MiniMax-M1 and M2 large language models featuring ultra-long context windows of up to 4 million tokens, strong agentic performance, and open MIT-licensed releases.
Mistral AI
Mistral AI is a French artificial intelligence company founded in 2023 that develops and releases open-weight and proprietary large language models, notable for its competitive performance-to-efficiency ratio and commitment to open-source distribution.
Qdrant
An open-source, Rust-based vector database and similarity search engine designed for high-performance storage and retrieval of high-dimensional embeddings, with support for hybrid search and multitenant deployments.
Speculative Decoding
Speculative decoding is an inference acceleration technique that uses a small draft model to propose multiple candidate tokens that a larger target model then verifies in parallel, achieving 2-4x throughput gains without changing output quality.
Transfer Learning
Transfer learning is a machine learning technique in which a model pre-trained on one task or dataset is adapted for a different but related task, enabling high performance with significantly less data and compute than training from scratch.
Yi
A family of open-source bilingual large language models developed by 01.AI, the Beijing-based AI startup founded by Kai-Fu Lee, achieving competitive performance against Llama 2 and Falcon with strong Chinese and English bilingual capability.