Seungjuhan (Seungju Han)

upvoted a paper 3 months ago

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7, 2025 • 106

upvoted a paper 6 months ago

The Invisible Leash: Why RLVR May Not Escape Its Origin

Paper • 2507.14843 • Published Jul 20, 2025 • 85

upvoted a collection 6 months ago

AI2 Safety Toolkit

Collection

Safety data, moderation tools and safe LLMs. • 6 items • Updated 18 days ago • 8

upvoted 3 papers 7 months ago

Speaking Beyond Language: A Large-Scale Multimodal Dataset for Learning Nonverbal Cues from Video-Grounded Dialogues

Paper • 2506.00958 • Published Jun 1, 2025 • 20

Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation

Paper • 2505.18842 • Published May 24, 2025 • 36

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30, 2025 • 143

upvoted a paper 8 months ago

NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning

Paper • 2504.13941 • Published Apr 15, 2025 • 11

upvoted a collection 9 months ago

Nemotron-H

Collection

Mamba-Transformer hybrid models • 10 items • Updated 18 days ago • 31

upvoted 2 papers over 1 year ago

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

Paper • 2406.18510 • Published Jun 26, 2024 • 10

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Paper • 2406.18495 • Published Jun 26, 2024 • 13

upvoted a paper about 2 years ago

CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos

Paper • 2303.09713 • Published Mar 17, 2023 • 1

Seungju Han

AI & ML interests

Organizations