large multimodal modelOCRtext recognitionscene text-centric VQAdocument-oriented VQAkey information extractionhandwritten mathematical expression recognition
large multimodal model OCR text recognition scene text-centric VQA document-oriented VQA key information extraction handwritten mathematical expression recognition
2024