-
CharacterEval: A Chinese Benchmark for Role-Playing Conversational Agent Evaluation
Paper • 2401.01275 • Published • 1 -
Evaluating Very Long-Term Conversational Memory of LLM Agents
Paper • 2402.17753 • Published • 20 -
PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering
Paper • 2402.16288 • Published • 1 -
From RAG to Memory: Non-Parametric Continual Learning for Large Language Models
Paper • 2502.14802 • Published • 15
Soulter
Soulter
·
AI & ML interests
None yet
Recent Activity
liked a dataset 1 day ago
lmsys/lmsys-chat-1m liked a dataset 1 day ago
GAIR/lima liked a dataset 2 days ago
HuggingFaceFW/fineweb-edu