ZizhengZhan

Anditty

AI & ML interests

None yet

Recent Activity


upvoted 2 papers 1 day ago

SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models

Paper • 2511.05459 • Published Nov 7, 2025 • 5

SWE-Explore: Benchmarking How Coding Agents Explore Repositories

Paper • 2606.07297 • Published 6 days ago • 105

upvoted a paper about 2 months ago

WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models

Paper • 2604.18224 • Published Apr 20 • 22

reacted to AdinaY's post with 👍 10 months ago

Post

2705

KAT-V1 🔥 an LLM that tackles overthinking by switching between reasoning and direct answers, by Kuaishou.

Kwaipilot/KAT-V1-40B

✨ 40B
✨ Step-SRPO: smarter reasoning control via RL
✨ MTP + Distillation: efficient training, lower cost

upvoted a paper 11 months ago

KAT-V1: Kwai-AutoThink Technical Report

Paper • 2507.08297 • Published Jul 11, 2025 • 8

liked 2 models about 1 year ago

Kwaipilot/OASIS-code-embedding-1.5B

Kwaipilot/KwaiCoder-23B-A4B-v1

Text Generation • 23B • Updated Jan 24, 2025 • 15 • 16

updated 2 models about 1 year ago

Kwaipilot/OASIS-code-embedding-1.5B

Kwaipilot/OASIS-code-1.3B

liked 2 models over 1 year ago

Kwaipilot/OASIS-code-1.3B

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 5.35M • 13.4k

New activity in Kwaipilot/KwaiCoder-23B-A4B-v1 over 1 year ago

prompt format in code completion

#1 opened over 1 year ago by LeiLeier

liked a model over 2 years ago

Qwen/Qwen-72B-Chat-Int4

Text Generation • 72B • Updated Jan 4, 2024 • 287 • 47

New activity in codellama/codellama-playground almost 3 years ago

Should add <bos_id> token with infilling

#11 opened almost 3 years ago by Anditty

New activity in bigcode/starcoder almost 3 years ago

starcoder uses Megatron-LM?

#27 opened about 3 years ago by senxiangms