Contract Review and Labeling Method Based on Deep Learning and OCR Recognition Technology
A contract review and labeling method based on deep learning and OCR recognition technology is proposed to address the problem of inaccurate recognition of contract content and long labeling time in current contract review methods.Based on OCR recognition technology,the contract text recognition model was constructed,and the OCR recognition engine was employed to convert the Chinese characters of paper documents into black and white images.Binary preprocessing of contract text images was conducted and the similarity between black and white images was calculated.Based on the similarity gradient,the stand-ard deviation of the image was compared and assigned,the foreground of the characters and the back-ground of the page were segmented,and the contract review was completed.The target labeling model based on deep learning was constructed to determine the feature vectors of each contract paragraph,and the feature vector classification of text paragraphs was transformed into a quadratic function optimization problem,and the feature classification optimization of paragraph images was carried out.The regression theory was introduced,the loss function of the labeling model was modified,the error between the output and the prediction results of the contract segment labeling model was reduced,and the contract labeling was completed.From the case analysis results,it can be seen that the proposed method can obtain a detailed list of differences by comparing the final document and the printed document,and reach faster speed in the contract labeling as well as higher correct recognition rate of the contract text.
deep learningOCR recognition technologycontract reviewcontract labeling