JayRay5/DIVE-Doc-FRD
Text Generation • 3B • Updated • 441
Contains the three models presented in the paper "DIVE-Doc: Downscaling foundational Image Visual Encoder into hierarchical architecture for DocVQA".