Gemini 1.5: Unlocking multimodal understanding across millions of tokens
of context
Gemini 1.5: Unlocking multimodal understanding across millions of tokens
of context
In this report, we present the latest model of the Gemini family, Gemini 1.5 Pro, a highly compute-efficient multimodal mixture-of-experts model capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. Gemini 1.5 Pro achieves near-perfect …