E-ANT: A Large-Scale Dataset for Efficient Automatic GUI NavigaTion

Online GUI navigation on mobile devices has drawn considerable attention in recent years because it contributes to many real-world applications. With the rapid development of large language models (LLMs), multimodal large language models (MLLMs) have tremendous potential for this task. However, existing MLLMs need high-quality data to improve …