Slatt
Jason-slatt07
AI & ML interests
VR
Recent Activity
reacted
to
prithivMLmods's
post
with 👍
9 days ago
Introducing the Super-OCRs Demo, a comparison of state-of-the-art multimodal OCR VLMs, including HunyuanOCR, DeepSeekOCR, Dots, and Nanonets in one space for performing OCR, rendering LaTeX and Markdown, and visual grounding (layout). Find the related Spaces and models below.🤗🔥
✨Super-OCRs[Demo]: https://huggingface.co/spaces/prithivMLmods/Super-OCRs-Demo
✨Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
✨GitHub: https://github.com/PRITHIVSAKTHIUR/Super-OCRs-Demo
⭐ Models Used:
✦ HunyuanOCR: https://huggingface.co/tencent/HunyuanOCR
✦ DeepSeek-OCR: (-) https://huggingface.co/deepseek-ai/DeepSeek-OCR (+) https://huggingface.co/prithivMLmods/DeepSeek-OCR-Latest-BF16.I64
✦ Dots.OCR: (-) https://huggingface.co/rednote-hilab/dots.ocr (+) https://huggingface.co/prithivMLmods/Dots.OCR-Latest-BF16
✦ Nanonets-OCR2-3B: https://huggingface.co/nanonets/Nanonets-OCR2-3B
⭐ Some Other Relevant Apps:
✦ Qwen3-VL-HF-Demo: https://huggingface.co/spaces/prithivMLmods/Qwen3-VL-HF-Demo
✦ Qwen3-VL-Outpost: https://huggingface.co/spaces/prithivMLmods/Qwen3-VL-Outpost
✦ Multimodal-OCR: https://huggingface.co/spaces/prithivMLmods/Multimodal-OCR
✦ Multimodal-OCR2: https://huggingface.co/spaces/prithivMLmods/Multimodal-OCR2
✦ Multimodal-OCR3: https://huggingface.co/spaces/prithivMLmods/Multimodal-OCR3
✦ DeepSeek-OCR-experimental: https://huggingface.co/spaces/prithivMLmods/DeepSeek-OCR-experimental
To know more about it, visit the app page or the respective model page!
Organizations
None yet