Portmanteauing Features for Scene Text Recognition

Yew Lee Tan, Ernest Yu Kai Chew, Adams Wai‐Kin Kong, Jung‐Jae Kim, Joo‐Hwee Lim

Type: Article

Publication Date: 2022-08-21

Citations: 0

DOI: https://doi.org/10.1109/icpr56361.2022.9956468

Abstract

Scene text images have different shapes and are subjected to various distortions, e.g. perspective distortions. To handle these challenges, the state-of-the-art methods rely on a rectification network, which is connected to the text recognition network. They form a linear pipeline which uses text rectification on all input images, even for images that can be recognized without it. Undoubtedly, the rectification network improves the overall text recognition performance. However, in some cases, the rectification network generates unnecessary distortions on images, resulting in incorrect predictions in images that would have otherwise been correct without it. In order to alleviate the unnecessary distortions, the portmanteauing of features is proposed. The portmanteau feature, inspired by the portmanteau word, is a feature containing information from both the original text image and the rectified image. To generate the portmanteau feature, a non-linear input pipeline with a block matrix initialization is presented. In this work, the transformer is chosen as the recognition network due to its utilization of attention and inherent parallelism, which can effectively handle the portmanteau feature. The proposed method is examined on 6 benchmarks and compared with 13 state-of-the-art methods. The experimental results show that the proposed method outperforms the state-of-the-art methods on various of the benchmarks.

Locations

arXiv (Cornell University) - View - PDF
2022 26th International Conference on Pattern Recognition (ICPR) - View

Similar Works

Action	Title	Year	Authors
+	Portmanteauing Features for Scene Text Recognition	2022	Yew Lee Tan Ernest Yu Kai Chew Adams Wai‐Kin Kong Jung‐Jae Kim Joo‐Hwee Lim
+	Scene Text Recognition With Finer Grid Rectification	2020	Gang Wang
+	Scene Text Recognition via Transformer	2020	Xinjie Feng Hongxun Yao Yuankai Yi Jun Zhang Shengping Zhang
+	Robust Scene Text Recognition with Automatic Rectification	2016	Baoguang Shi Xinggang Wang Pengyuan Lyu Cong Yao Xiang Bai
+ PDF Chat	Robust Scene Text Recognition with Automatic Rectification	2016	Baoguang Shi Xinggang Wang Pengyuan Lyu Cong Yao Xiang Bai
+	A Multi-Object Rectified Attention Network for Scene Text Recognition	2019	Canjie Luo Lianwen Jin Zenghui Sun
+ PDF Chat	Focus-Enhanced Scene Text Recognition with Deformable Convolutions	2019	Linjie Deng Yanxiang Gong Xinchen Lu Xin Yi Zheng Ma Mei Xie
+	Focus-Enhanced Scene Text Recognition with Deformable Convolutions	2019	Linjie Deng Yanxiang Gong Xinchen Lu Xin Yi Zheng Ma Mei Xie
+	Representation and Correlation Enhanced Encoder-Decoder Framework for Scene Text Recognition	2021	Mengmeng Cui Wei Wang Jinjin Zhang Liang Wang
+	ESIR: End-to-end Scene Text Recognition via Iterative Image Rectification	2018	Fangneng Zhan Shijian Lu
+ PDF Chat	Symmetry-Constrained Rectification Network for Scene Text Recognition	2019	Mingkun Yang Yushuo Guan Minghui Liao Xin He Kaigui Bian Song Bai Cong Yao Xiang Bai
+	Symmetry-constrained Rectification Network for Scene Text Recognition	2019	Mingkun Yang Yushuo Guan Minghui Liao Xin He Kaigui Bian Song Bai Cong Yao Xiang Bai
+ PDF Chat	SAFL: A Self-Attention Scene Text Recognizer with Focal Loss	2020	Bao Hieu Tran Thanh Le-Cong Huu Manh Nguyen Duc Anh Le Thanh Hung Nguyen Phi Le Nguyen
+ PDF Chat	ESIR: End-To-End Scene Text Recognition via Iterative Image Rectification	2019	Fangneng Zhan Shijian Lu
+ PDF Chat	Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition	2024	Zuan Gao Yuxin Wang Yadong Qu Boqiang Zhang Zixiao Wang Jianjun Xu Hongtao Xie
+	NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text Recognition	2018	Fenfen Sheng Zhineng Chen Bo Xu
+ PDF Chat	NRTR: A No-Recurrence Sequence-to-Sequence Model for Scene Text Recognition	2019	Fenfen Sheng Zhineng Chen Bo Xu
+ PDF Chat	Platypus: A Generalized Specialist Model for Reading Text in Various Forms	2024	Peng Wang Zhaohai Li Jun Tang Humen Zhong Fei Huang Zhibo Yang Cong Yao
+ PDF Chat	Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition	2024	Zuan Gao Yuxin Wang Yadong Qu Boqiang Zhang Zixiao Wang Jianjun Xu Hongtao Xie
+	Revisiting Classification Perspective on Scene Text Recognition	2021	Hongxiang Cai Jun Sun Yichao Xiong

Works That Cite This (0)

Action	Title	Year	Authors

Works Cited by This (21)

Action	Title	Year	Authors
+	Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition	2014	Max Jaderberg Karen Simonyan Andrea Vedaldi Andrew Zisserman
+ PDF Chat	An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition	2016	Baoguang Shi Xiang Bai Cong Yao
+ PDF Chat	Recursive Recurrent Nets with Attention Modeling for OCR in the Wild	2016	Chen‐Yu Lee Simon Osindero
+ PDF Chat	Synthetic Data for Text Localisation in Natural Images	2016	Ankush Gupta Andrea Vedaldi Andrew Zisserman
+ PDF Chat	Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering	2018	Peter Anderson Xiaodong He Chris Buehler Damien Teney Mark Johnson Stephen Jay Gould Lei Zhang
+ PDF Chat	ESIR: End-To-End Scene Text Recognition via Iterative Image Rectification	2019	Fangneng Zhan Shijian Lu
+ PDF Chat	Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition	2019	Hui Li Peng Wang Chunhua Shen Guyu Zhang
+ PDF Chat	MASTER: Multi-aspect non-local network for scene text recognition	2021	Ning Lü Wenwen Yu Xianbiao Qi Yihao Chen Ping Gong Rong Xiao Xiang Bai
+ PDF Chat	TextScanner: Reading Characters in Order for Robust Scene Text Recognition	2020	Zhaoyi Wan Minghang He Hao Chen Xiang Bai Cong Yao
+ PDF Chat	GTC: Guided Training of CTC towards Efficient and Accurate Scene Text Recognition	2020	Wenyang Hu Xiaocong Cai Jun Hou Shuai Yi Zhiping Lin