Search Results
13 results for “cognition”
AI PC
A personal computer equipped with a dedicated Neural Processing Unit (NPU) designed to accelerate on-device artificial intelligence workloads locally, without requiring cloud connectivity, for tasks such as image generation, speech recognition, and language model inference.
Artificial General Intelligence (AGI)
Artificial general intelligence is a hypothetical form of AI that can understand, learn, and apply knowledge across the full range of tasks a human can perform, rather than being limited to narrow domains.
Computer Vision
Computer vision is the field of artificial intelligence that enables machines to interpret and act upon visual information from the world — including images, video, and depth data.
Face Recognition
Face recognition is a biometric technology that identifies or verifies individuals by analysing facial features from images or video, widely used in security, banking, and immigration.
Hidden Markov Model
A statistical model that represents systems with unobservable (hidden) states that emit observable outputs, used widely in speech recognition, bioinformatics, and time-series analysis.
Named Entity Recognition
Named entity recognition (NER) is a natural language processing task that identifies and classifies named entities in text — such as people, organisations, locations, and dates — into predefined categories.
Neuro-symbolic AI
Neuro-symbolic AI is a hybrid artificial intelligence paradigm that combines neural network-based learning with symbolic reasoning, integrating the pattern recognition strengths of deep learning with the structured reasoning and interpretability of symbolic methods.
Object Detection
Object detection is a computer vision task that involves identifying the location and category of one or more objects within an image or video frame, producing bounding boxes and class labels for each detected instance.
Optical Character Recognition
A computer vision technology that converts images of typed, handwritten, or printed text into machine-readable digital text, increasingly powered by deep learning and transformer-based vision models.
Spark
A large language model developed by iFlyTek, a Chinese AI company specialising in speech recognition and natural language processing, notable for its multilingual capabilities covering over 130 languages including Malay and other ASEAN languages.
Speech Recognition
Speech recognition, or automatic speech recognition (ASR), is the technology that enables computers to identify and transcribe spoken language into text using acoustic models, language models, and deep learning architectures.
Vision Transformer
The Vision Transformer (ViT) is a deep learning model that applies the transformer architecture originally designed for NLP directly to sequences of image patches, achieving state-of-the-art results on visual recognition tasks.
Whisper
Whisper is an open-source automatic speech recognition system developed by OpenAI, trained on 680,000 hours of multilingual audio data and capable of transcription, translation, and language identification across nearly 100 languages.