Projects
Reading
People
Chat

SU\G(𝔸)/K·U

Projects
Reading
People
Chat

Sign Up

Ask a Question

Prefer a chat interface with context about you and your work?

Your Question

Related Paper

Scene Text Visual Question Answering

Scene Text Visual Question Answering

Current visual question answering datasets do not consider the rich semantic information conveyed by text within an image. In this work, we present a new dataset, ST-VQA, that aims to highlight the importance of exploiting high-level semantic information present in images as textual cues in the Visual Question Answering process. …

AI Backends

Gemini 2 Flash

GPT-4o

o3-mini

o1-mini

o1

Gemini 2 Pro

Sky-T1

DeepSeek R1

Claude 3 Opus

Claude 3.5 Sonnet

Claude 3.5 Haiku

Sugaku, Inc. Copyright 2024

Privacy Policy, Cookie Policy, Terms and Conditions