Article DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models 3 days ago • 31
WildDet3D Collection This is the collection of WildDet3D artifacts, including demos, model checkpoints and data. https://github.com/allenai/WildDet3D • 8 items • Updated 11 days ago • 17
Article How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs 17 days ago • 59
Falcon Perception Collection Falcon-Perception and Falcon-OCR model: early-fusion, natively multimodal, dense Autoregressive Transformer models. • 5 items • Updated 18 days ago • 14
Article SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation Mar 23 • 16
Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face Feb 11, 2025 • 118
Article Welcome Gemma 4: Frontier multimodal intelligence on device 23 days ago • 877
SigLino: Vision Foundation Models (SigLIP2 + DINOv3) Collection Vision encoders distilled from DINOv3 and SigLIP2 (MoE & Dense). CVPR 2026. • 6 items • Updated 14 days ago • 17
Article Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline Mar 13 • 40