Ask a Question

Prefer a chat interface with context about you and your work?

LOCR: Location-Guided Transformer for Optical Character Recognition

LOCR: Location-Guided Transformer for Optical Character Recognition

Academic documents are packed with texts, equations, tables, and figures, requiring comprehensive understanding for accurate Optical Character Recognition (OCR). While end-to-end OCR methods offer improved accuracy over layout-based approaches, they often grapple with significant repetition issues, especially with complex layouts in Out-Of-Domain (OOD) documents.To tackle this issue, we propose LOCR, …