Text regions in natural scenes have complex and variable shape.Directly use contour coordinates to describe text regions will make the modeling inadequate and lead to low accuracy of text detection.To address the problem of irregular text regions in natural scenes,an arbitrary-shaped text detection model based on Deformable DETR is proposed.The model differs from the traditional method of directly predicting contour points by using B-Spline to make the text contour smoother and more accurate and reduces the number of predictable parameters at the same time.The proposed text detection model eliminates the need to manually design components such as anchor and region proposal.The model greatly simplifies the design and makes it more generalizable.The proposed model achieves F-measure of 85.4%and 85.0%on CTW1500 and Total-Text,which demonstrate the effectiveness of the model.
关键词
计算机视觉/自然场景文本检测/Deformable/DETR/B-样条
Key words
computer vision/natural scene text detection/Deformable DETR/B-Spline