Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation

In-image machine translation (IIMT) aims to translate an image containing text in a source language into an image containing its translation in a target language. Conventional cascaded methods suffer from error propagation, large parameter counts, and difficulties in deployment and in retaining the visual characteristics of the input image. Thus, …