Towards Unified Scene Text Spotting Based on Sequence Generation
Towards Unified Scene Text Spotting Based on Sequence Generation
Sequence generation models have recently made significant progress in unifying various vision tasks. Although some auto-regressive models have demonstrated promising results in end-to-end text spotting, they use specific detection formats while ignoring various text shapes and are limited in the maximum number of text instances that can be detected. To …