MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific
Comprehension
The rapid advancement of Large Language Models (LLMs) and Large Multimodal Models (LMMs) has heightened the demand for AI-based scientific assistants capable of understanding scientific articles and figures. Despite progress, there remains a significant gap in evaluating models' comprehension of professional, graduate-level, and even PhD-level scientific content. Current datasets and …