segmond's Collections
vision models
Updated Oct 1
bartowski/UI-TARS-7B-DPO-GGUF • Image-Text-to-Text • 8B • Updated Jan 23 • 1.23k • 9
bartowski/UI-TARS-72B-SFT-GGUF • Image-Text-to-Text • 73B • Updated Jan 24 • 548
bartowski/UI-TARS-7B-SFT-GGUF • Image-Text-to-Text • 8B • Updated Jan 24 • 1.14k • 3
bartowski/UI-TARS-72B-DPO-GGUF • Image-Text-to-Text • 73B • Updated Jan 23 • 809 • 3
bartowski/allenai_olmOCR-7B-0225-preview-GGUF • Image-Text-to-Text • 8B • Updated Feb 25 • 1.17k • 7
microsoft/Phi-4-multimodal-instruct • Automatic Speech Recognition • 6B • Updated May 1 • 397k • 1.54k
ggml-org/ultravox-v0_5-llama-3_2-1b-GGUF • Audio-Text-to-Text • 1B • Updated May 25 • 1.2k • 5
mradermacher/Qwen2-Audio-7B-Instruct-GGUF • Audio-Text-to-Text • 8B • Updated Jul 31 • 730
city96/FLUX.1-dev-gguf • Text-to-Image • 12B • Updated Aug 18, 2024 • 75.3k • 1.25k
openbmb/MiniCPM-V-4_5 • Image-Text-to-Text • 9B • Updated Oct 10 • 49.7k • 1.02k
Qwen/Qwen-Image-Edit • Image-to-Image • Updated Aug 25 • 93.3k • 2.17k