#
got-ocr20
Here are 2 public repositories matching this topic...
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
image-to-textcliptext-to-imageditmultimodalsoratext-to-videoaigcstable-diffusioncontrolnetllavasd-xlppdiffuserseva-clipstablevideodiffusionminicpm-vinternvl2qwen2-vlgot-ocr20deepseek-vl
- Updated
Mar 18, 2025 - Python
Leverage GOT-OCR2's optical character recognition capabilities using LitServe.
pythondeep-learningtransformerspytorchartificial-intelligenceoptical-character-recognitionfastapilightning-ailitservegot-ocr20
- Updated
Feb 14, 2025 - Python
Improve this page
Add a description, image, and links to thegot-ocr20 topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thegot-ocr20 topic, visit your repo's landing page and select "manage topics."