Arbitrary-shaped Text Detection Based on Deformable DETR
Text regions in natural scenes have complex and variable shape.Directly use contour coordinates to describe text regions will make the modeling inadequate and lead to low accuracy of text detection.To address the problem of irregular text regions in natural scenes,an arbitrary-shaped text detection model based on Deformable DETR is proposed.The model differs from the traditional method of directly predicting contour points by using B-Spline to make the text contour smoother and more accurate and reduces the number of predictable parameters at the same time.The proposed text detection model eliminates the need to manually design components such as anchor and region proposal.The model greatly simplifies the design and makes it more generalizable.The proposed model achieves F-measure of 85.4%and 85.0%on CTW1500 and Total-Text,which demonstrate the effectiveness of the model.
computer visionnatural scene text detectionDeformable DETRB-Spline