Reasoning - a jacklanda Collection

jacklanda 's Collections

Reasoning

updated 26 days ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published 28 days ago • 75
RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

Paper • 2506.08672 • Published Jun 10, 2025 • 30
ReflectEvo: Improving Meta Introspection of Small LLMs by Learning Self-Reflection

Paper • 2505.16475 • Published May 22, 2025 • 3