AIWiki

Search Results

15 results for "performance"

Infrastructure

A/B Testing (ML)

A/B testing in machine learning is a controlled experiment method that compares two or more model variants in production to determine which delivers superior performance on real-world business metrics.

6 min read · Updated June 2026
Infrastructure

Canary Deployment

Canary deployment is a progressive model release strategy in which a new version is exposed to a small subset of production traffic, allowing teams to validate performance and catch failures before a full rollout.

6 min read · Updated June 2026
Foundations

Deep Learning

Deep learning is a subfield of machine learning that uses multi-layered artificial neural networks to learn hierarchical representations from data, enabling state-of-the-art performance across vision, language, and speech tasks.

7 min read · Updated May 2026
Companies & Tools

DeepSeek

A Chinese artificial intelligence company founded in 2023, known for developing open-source large language models including DeepSeek-R1 and DeepSeek-V3 that achieved performance competitive with leading Western AI systems.

5 min read · Updated May 2026
Models

GPT-4

GPT-4 is a large multimodal language model developed by OpenAI, released in March 2023, that accepts both image and text inputs and demonstrates human-level performance on numerous professional and academic benchmarks.

6 min read · Updated May 2026
Models

Kimi

A conversational AI assistant and long-context large language model developed by Moonshot AI, a Beijing startup, known for its industry-leading context window lengths and strong performance on agentic reasoning tasks.

4 min read · Updated June 2026
Infrastructure

Knowledge Distillation

Knowledge distillation is a model compression technique in which a smaller student neural network is trained to replicate the behaviour of a larger, more capable teacher model, enabling deployment of efficient models that approximate teacher-level performance.

6 min read · Updated May 2026
Applications

LoRA (Low-Rank Adaptation)

LoRA is a parameter-efficient fine-tuning technique that adapts large pre-trained models by injecting small trainable low-rank matrices into transformer layers while keeping the original weights frozen, drastically reducing the number of trainable parameters with little or no loss in task performance.

6 min read · Updated May 2026
Foundations

Machine Learning

Machine learning is a subfield of artificial intelligence in which systems improve their performance on tasks through experience — by automatically learning patterns from data rather than following explicitly programmed rules.

4 min read · Updated May 2026
Companies & Tools

MiniMax

A Chinese AI company and model developer known for the MiniMax-M1 and M2 large language models featuring ultra-long context windows of up to 4 million tokens, strong agentic performance, and open MIT-licensed releases.

5 min read · Updated June 2026
Companies & Tools

Mistral AI

Mistral AI is a French artificial intelligence company founded in 2023 that develops and releases open-weight and proprietary large language models, notable for its competitive performance-to-efficiency ratio and commitment to open-source distribution.

6 min read · Updated May 2026
Companies & Tools

Qdrant

An open-source, Rust-based vector database and similarity search engine designed for high-performance storage and retrieval of high-dimensional embeddings, with support for hybrid search and multitenant deployments.

5 min read · Updated May 2026
Infrastructure

Speculative Decoding

Speculative decoding is an inference acceleration technique that uses a small draft model to propose multiple candidate tokens that a larger target model then verifies in parallel, with commonly reported speedups of roughly 2-4x and no change to the target model's output distribution.

5 min read · Updated June 2026
Foundations

Transfer Learning

Transfer learning is a machine learning technique in which a model pre-trained on one task or dataset is adapted for a different but related task, enabling high performance with significantly less data and compute than training from scratch.

6 min read · Updated May 2026
Models

Yi

A family of open-source bilingual large language models developed by 01.AI, the Beijing-based AI startup founded by Kai-Fu Lee, achieving competitive performance against Llama 2 and Falcon with strong Chinese and English bilingual capability.

4 min read · Updated June 2026