Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
AI & ML interests: Deep Learning Framework
Papers
GraphNet: A Large-Scale Computational Graph Dataset for Tensor Compiler Research
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese.
- PP-OCRv5 Online Demo
  76 • Universal-Scene Text Recognition Model with High Accuracy
- PaddlePaddle/PP-OCRv5_mobile_det
  Image-to-Text • Updated • 40.6k • 18
- PaddlePaddle/PP-OCRv5_mobile_rec
  Image-to-Text • Updated • 7.83k • 8
- PaddlePaddle/PP-OCRv5_server_det
  Image-to-Text • Updated • 313k • 51
- PaddlePaddle/arabic_PP-OCRv3_mobile_rec
  Image-to-Text • Updated • 486 • 1
- PaddlePaddle/chinese_cht_PP-OCRv3_mobile_rec
  Image-to-Text • Updated • 36
- PaddlePaddle/cyrillic_PP-OCRv3_mobile_rec
  Image-to-Text • Updated • 187
- PaddlePaddle/devanagari_PP-OCRv3_mobile_rec
  Image-to-Text • Updated • 170
PP-StructureV3 is a state-of-the-art document parsing solution on OmniDocBench, supporting the conversion of PDFs and document images to Markdown and JSON.