bryanzhou008/vit-mae-base-finetuned-eurosat Image Classification • 85.8M • Updated Oct 21, 2024 • 2 • 1
bryanzhou008/swin-tiny-patch4-window7-224-finetuned-eurosat Image Classification • 27.6M • Updated Oct 30, 2024 • 1 • 1
bryanzhou008/vit-base-patch16-224-in21k-finetuned-eurosat Image Classification • 85.8M • Updated Oct 30, 2024 • 2 • 1
bryanzhou008/vit-base-patch16-224-in21k-finetuned-inaturalist Image Classification • 85.8M • Updated Aug 19, 2025 • 462 • 2
ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction Paper • 2511.20937 • Published Nov 26, 2025 • 15
Running on Zero Featured 100 SAM3 Video Segmentation 🐠 100 Track and label objects in videos using text prompts or clicks
DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation Paper • 2510.14949 • Published Oct 16, 2025 • 5
DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation Paper • 2510.14949 • Published Oct 16, 2025 • 5
bryanzhou008/vit-base-patch16-224-in21k-finetuned-inaturalist Image Classification • 85.8M • Updated Aug 19, 2025 • 462 • 2