-
MADD: Multi-Agent Drug Discovery Orchestra
Paper • 2511.08217 • Published • 55 -
The Station: An Open-World Environment for AI-Driven Discovery
Paper • 2511.06309 • Published • 35 -
An AI system to help scientists write expert-level empirical software
Paper • 2509.06503 • Published • 6 -
The Era of Agentic Organization: Learning to Organize with Language Models
Paper • 2510.26658 • Published • 26
Collections
Discover the best community collections!
Collections including paper arxiv:2510.26658
-
BroRL: Scaling Reinforcement Learning via Broadened Exploration
Paper • 2510.01180 • Published • 18 -
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information
Paper • 2510.03632 • Published • 42 -
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation
Paper • 2509.25849 • Published • 47 -
Beyond the Exploration-Exploitation Trade-off: A Hidden State Approach for LLM Reasoning in RLVR
Paper • 2509.23808 • Published • 47
-
The Era of Agentic Organization: Learning to Organize with Language Models
Paper • 2510.26658 • Published • 26 -
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
Paper • 2510.25992 • Published • 44 -
The End of Manual Decoding: Towards Truly End-to-End Language Models
Paper • 2510.26697 • Published • 115
-
Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations
Paper • 2508.09789 • Published • 5 -
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents
Paper • 2508.13186 • Published • 18 -
ZARA: Zero-shot Motion Time-Series Analysis via Knowledge and Retrieval Driven LLM Agents
Paper • 2508.04038 • Published • 1 -
Prompt Orchestration Markup Language
Paper • 2508.13948 • Published • 48
-
AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios
Paper • 2505.16944 • Published • 8 -
DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research
Paper • 2505.19253 • Published • 32 -
The Era of Agentic Organization: Learning to Organize with Language Models
Paper • 2510.26658 • Published • 26 -
Tongyi DeepResearch Technical Report
Paper • 2510.24701 • Published • 97
-
MADD: Multi-Agent Drug Discovery Orchestra
Paper • 2511.08217 • Published • 55 -
The Station: An Open-World Environment for AI-Driven Discovery
Paper • 2511.06309 • Published • 35 -
An AI system to help scientists write expert-level empirical software
Paper • 2509.06503 • Published • 6 -
The Era of Agentic Organization: Learning to Organize with Language Models
Paper • 2510.26658 • Published • 26
-
The Era of Agentic Organization: Learning to Organize with Language Models
Paper • 2510.26658 • Published • 26 -
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
Paper • 2510.25992 • Published • 44 -
The End of Manual Decoding: Towards Truly End-to-End Language Models
Paper • 2510.26697 • Published • 115
-
BroRL: Scaling Reinforcement Learning via Broadened Exploration
Paper • 2510.01180 • Published • 18 -
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information
Paper • 2510.03632 • Published • 42 -
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation
Paper • 2509.25849 • Published • 47 -
Beyond the Exploration-Exploitation Trade-off: A Hidden State Approach for LLM Reasoning in RLVR
Paper • 2509.23808 • Published • 47
-
Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations
Paper • 2508.09789 • Published • 5 -
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents
Paper • 2508.13186 • Published • 18 -
ZARA: Zero-shot Motion Time-Series Analysis via Knowledge and Retrieval Driven LLM Agents
Paper • 2508.04038 • Published • 1 -
Prompt Orchestration Markup Language
Paper • 2508.13948 • Published • 48
-
AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios
Paper • 2505.16944 • Published • 8 -
DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research
Paper • 2505.19253 • Published • 32 -
The Era of Agentic Organization: Learning to Organize with Language Models
Paper • 2510.26658 • Published • 26 -
Tongyi DeepResearch Technical Report
Paper • 2510.24701 • Published • 97