1 result for “GPT-4V”
A multimodal AI system that jointly processes and generates information from both images and text, extending large language models with visual perception capabilities through cross-modal alignment.