Multimodal AI learns to weigh text and images more evenly

October 14, 2025 TechXplore.com Artificial Intelligence

Just as human eyes tend to focus on pictures before reading accompanying text, multimodal artificial intelligence (AI)—which processes multiple types of sensory data at once—also tends to depend more heavily on certain types of data. KAIST researchers have now developed a new multimodal AI training technology that enables models to recognize both text and images evenly, enabling far more accurate predictions.

This post was originally published on this site