Ask a Question

Prefer a chat interface with context about you and your work?

OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Image-text interleaved data, consisting of multiple images and texts arranged in a natural document format, aligns with the presentation paradigm of internet data and closely resembles human reading habits. Recent studies have shown that such data aids multimodal in-context learning and maintains the capabilities of large language models during multimodal …