OCR, VQA, Thinking and Object Detection.
Ask OCR questions about uploaded images
Convert invoices to structured JSON
Extract text from images and PDFs
Ask questions about images or PDFs
Ask questions about images using Moondream2 or SmolVLM