Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
openbmb
/
MiniCPM-o-2_6
like
1.27k
Follow
OpenBMB
2.28k
Any-to-Any
Transformers
Safetensors
openbmb/RLAIF-V-Dataset
multilingual
minicpmo
feature-extraction
minicpm-o
omni
vision
ocr
multi-image
video
custom_code
audio
speech
voice cloning
live Streaming
realtime speech conversation
asr
tts
arxiv:
2405.17220
arxiv:
2408.01800
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
56
Deploy
Use this model
refs/pr/37
MiniCPM-o-2_6
17.4 GB
14 contributors
History:
67 commits
heyitsys
Added in missing imports causing errors when encoding video.
845b071
verified
10 months ago
assets
Delete assets/qa.wav
10 months ago
.gitattributes
1.74 kB
add omni case for inference
11 months ago
README.md
50 kB
Added in missing imports causing errors when encoding video.
10 months ago
added_tokens.json
1.41 kB
init
11 months ago
config.json
3.44 kB
init
11 months ago
configuration_minicpm.py
7.55 kB
update
11 months ago
image_processing_minicpmv.py
16.7 kB
init
11 months ago
merges.txt
1.67 MB
init
11 months ago
model-00001-of-00004.safetensors
4.88 GB
xet
add ckpt
11 months ago
model-00002-of-00004.safetensors
4.93 GB
xet
add ckpt
11 months ago
model-00003-of-00004.safetensors
4.33 GB
xet
add ckpt
11 months ago
model-00004-of-00004.safetensors
3.21 GB
xet
add ckpt
11 months ago
model.safetensors.index.json
133 kB
add ckpt
11 months ago
modeling_minicpmo.py
141 kB
Update modeling_minicpmo.py
10 months ago
modeling_navit_siglip.py
42.1 kB
update
11 months ago
preprocessor_config.json
714 Bytes
init
11 months ago
processing_minicpmo.py
20 kB
Release `get_audio_placeholder` interface in processing (#24)
11 months ago
resampler.py
35.6 kB
init
11 months ago
special_tokens_map.json
5.35 kB
init
11 months ago
tokenization_minicpmo_fast.py
3.04 kB
init
11 months ago
tokenizer.json
7.04 MB
init
11 months ago
tokenizer_config.json
14.1 kB
init
11 months ago
utils.py
7.24 kB
update
11 months ago
vocab.json
2.78 MB
init
11 months ago