NLP Group in SUSTech

university

https://ghchen.me

Activity Feed Request to join this org

AI & ML interests

Multilingual and multimodal LLM, data synthesis, complex reasoning with LLMs

Recent Activity

X1AOX1A authored a paper 6 days ago

Anchored Policy Optimization: Mitigating Exploration Collapse Via Support-Constrained Rectification

X1AOX1A submitted a paper 25 days ago

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks

X1AOX1A authored a paper 27 days ago

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks

View all activity

Papers

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks

From Word to World: Can Large Language Models be Implicit Text-based World Models?

View all Papers

SUSTech-NLP 's datasets 4

SUSTech-NLP/UniRRM-RL

Viewer • Updated 2 days ago • 32.8k

SUSTech-NLP/UniRRM-SFT

Viewer • Updated 2 days ago • 35.7k

SUSTech-NLP/MixReward

Viewer • Updated 2 days ago • 64.5k

SUSTech-NLP/JudgeBench-Pro

Viewer • Updated Feb 18 • 1.18k • 89 • 3