Collections

Discover the best community collections!

Collections including paper arxiv:2510.26658

MADD: Multi-Agent Drug Discovery Orchestra

Paper • 2511.08217 • Published 27 days ago • 55
The Station: An Open-World Environment for AI-Driven Discovery

Paper • 2511.06309 • Published 29 days ago • 35
An AI system to help scientists write expert-level empirical software

Paper • 2509.06503 • Published Sep 8 • 6
The Era of Agentic Organization: Learning to Organize with Language Models

Paper • 2510.26658 • Published Oct 30 • 26

BroRL: Scaling Reinforcement Learning via Broadened Exploration

Paper • 2510.01180 • Published Oct 1 • 18
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information

Paper • 2510.03632 • Published Oct 4 • 42
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

Paper • 2509.25849 • Published Sep 30 • 47
Beyond the Exploration-Exploitation Trade-off: A Hidden State Approach for LLM Reasoning in RLVR

Paper • 2509.23808 • Published Sep 28 • 47

AGENTIC AI vs AI AGENTS

AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges

Paper • 2505.10468 • Published May 15 • 9
The Era of Agentic Organization: Learning to Organize with Language Models

Paper • 2510.26658 • Published Oct 30 • 26

The Era of Agentic Organization: Learning to Organize with Language Models

Paper • 2510.26658 • Published Oct 30 • 26
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published Oct 29 • 44
The End of Manual Decoding: Towards Truly End-to-End Language Models

Paper • 2510.26697 • Published Oct 30 • 115

Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations

Paper • 2508.09789 • Published Aug 13 • 5
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents

Paper • 2508.13186 • Published Aug 14 • 18
ZARA: Zero-shot Motion Time-Series Analysis via Knowledge and Retrieval Driven LLM Agents

Paper • 2508.04038 • Published Aug 6 • 1
Prompt Orchestration Markup Language

Paper • 2508.13948 • Published Aug 19 • 48

AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios

Paper • 2505.16944 • Published May 22 • 8
DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research

Paper • 2505.19253 • Published May 25 • 32
The Era of Agentic Organization: Learning to Organize with Language Models

Paper • 2510.26658 • Published Oct 30 • 26
Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28 • 97
