Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Hsu Shihyueh's picture
9 2 6

Hsu Shihyueh

AIR-hl
li11111's profile picture John6666's profile picture
·
  • AIR-hl

AI & ML interests

Nothing

Recent Activity

new activity 5 days ago
deepseek-ai/DeepSeek-V3.2:miss chat template file?
new activity 5 days ago
deepseek-ai/DeepSeek-V3.2:Humble request for a stable vLLM/SGLang deployment setup for DeepSeek-V3.2
new activity 4 months ago
miromind-ai/MiroVerse-v0.1:Where is the data file?
View all activity

Organizations

None yet

AIR-hl 's models 11

AIR-hl/Qwen2.5-1.5B-LD-DPO

Updated Jun 2

AIR-hl/DeepSeek-R1-Distill-Qwen-7B-AIMO

Updated Mar 9 • 3

AIR-hl/Mistral-7B-Base-WPO-bf16

Text Generation • 7B • Updated Jan 12 • 5

AIR-hl/Llama-3.2-3B-WPO

Text Generation • 4B • Updated Jan 7 • 6

AIR-hl/Llama-3.2-3B-DPO

Text Generation • 4B • Updated Jan 5 • 5 • 2

AIR-hl/Qwen2.5-1.5B-SimPO

Text Generation • 2B • Updated Jan 3 • 5

AIR-hl/Qwen2.5-1.5B-WPO

Text Generation • 2B • Updated Jan 3 • 5

AIR-hl/Qwen2.5-1.5B-DPO

Text Generation • 2B • Updated Jan 2 • 11

AIR-hl/Llama-3.2-1B-DPO

Text Generation • 1B • Updated Dec 24, 2024 • 10 •

AIR-hl/Llama-3.2-1B-ultrachat200k

Text Generation • 1B • Updated Nov 21, 2024 • 13 •

AIR-hl/Qwen2.5-1.5B-ultrachat200k

Text Generation • 2B • Updated Nov 20, 2024 • 20
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs