CoSER: Coordinating LLM-Based Persona Simulation of Established Roles
Abstract
Role-playing language agents (RPLAs) have emerged as promising applications of large language models (LLMs). However, simulating established characters remains challenging for RPLAs, due to the lack of authentic character datasets and of nuanced evaluation methods that use such data. In this paper, we present CoSER, a collection comprising a high-quality dataset, open models, and an evaluation protocol for effective RPLAs of established characters. The CoSER dataset covers 17,966 characters from 771 renowned books. It provides authentic dialogues with real-world intricacies, as well as diverse data types such as conversation setups, character experiences, and internal thoughts. Drawing on acting methodology, we introduce given-circumstance acting for training and evaluating role-playing LLMs, in which LLMs sequentially portray multiple characters in book scenes. Using our dataset, we develop CoSER 8B and CoSER 70B, advanced open role-playing LLMs built on LLaMA-3.1 models. Extensive experiments demonstrate the value of the CoSER dataset for RPLA training, evaluation, and retrieval. Moreover, CoSER 70B exhibits state-of-the-art performance, surpassing or matching GPT-4o on our evaluation and three existing benchmarks, e.g., achieving 75.80% and 93.47% accuracy on the InCharacter and LifeChoice benchmarks, respectively.
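To make the given-circumstance acting (GCA) idea concrete, here is a minimal sketch of the simulation loop the abstract describes: LLM-backed actors sequentially portray the characters in a book scene, each conditioned on the scene setup and the shared transcript so far. The names (`Scene`, `Actor`, `generate_line`, `run_gca`) are illustrative assumptions, not the paper's actual API.

```python
# Hypothetical sketch of given-circumstance acting: one actor per character
# takes turns speaking in a scene defined by a conversation setup.
from dataclasses import dataclass, field

@dataclass
class Scene:
    setup: str                                       # scene context: plot, location, motivation
    characters: list                                 # names of characters in the scene
    transcript: list = field(default_factory=list)   # (speaker, utterance) pairs

def generate_line(character: str, scene: Scene) -> str:
    # Placeholder for an LLM call: in practice the model would be prompted
    # with the setup, the character's profile, and the transcript so far.
    return f"[{character} responds in character, given: {scene.setup}]"

def run_gca(scene: Scene, turns: int) -> list:
    # Characters speak in sequence; each utterance is appended to the
    # shared transcript that conditions every subsequent turn.
    for t in range(turns):
        speaker = scene.characters[t % len(scene.characters)]
        scene.transcript.append((speaker, generate_line(speaker, scene)))
    return scene.transcript
```

In evaluation, the resulting transcript could then be compared against the authentic dialogue from the book; the comparison step is omitted here.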
Community
Introducing CoSER: Advancing AI Character Role-Playing with High-Quality Data from the Best Books Ever List
We're excited to present CoSER (Coordinating LLM-Based Persona Simulation of Established Roles), a collection comprising a high-quality dataset, open models, and a novel evaluation protocol for more authentic AI character role-playing!
Key Features:
- 17,966 characters and 29,798 authentic conversations from 771 renowned books (top-rated on the Best Books Ever list)
- Comprehensive data of other types, such as conversation setups, character experiences, and internal thoughts
- State-of-the-art role-playing models: CoSER 8B and CoSER 70B (built upon LLaMA 3.1)
- A novel methodology for training and evaluating role-playing LLMs: given-circumstance acting
Paper: https://arxiv.org/pdf/2502.09082
Source code: https://github.com/Neph0s/COSER
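Based on the data types listed above (conversation setups, character experiences, internal thoughts), one CoSER conversation record might look roughly like the following. The field names are assumptions for illustration, not the dataset's actual schema.

```python
# Illustrative shape of a single CoSER conversation record, inferred from
# the feature list above. Keys are hypothetical; consult the released
# dataset for the real schema. Elided values are left as placeholders.
record = {
    "book": "…",                       # source novel title
    "setup": "…",                      # conversation setup: where, when, why
    "characters": [
        {
            "name": "…",
            "experiences": ["…"],      # prior events shaping the character
        }
    ],
    "dialogue": [
        {
            "speaker": "…",
            "thought": "…",            # internal thought preceding the line
            "utterance": "…",          # the spoken line itself
        }
    ],
}
```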
The following similar papers were recommended by the Semantic Scholar API:
- OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas (2025)
- RPGBENCH: Evaluating Large Language Models as Role-Playing Game Engines (2025)
- Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues (2025)
- PokerBench: Training Large Language Models to become Professional Poker Players (2025)
- MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments (2025)
- ACEBench: Who Wins the Match Point in Tool Usage? (2025)
- PsyPlay: Personality-Infused Role-Playing Conversational Agents (2025)