Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs

Update: 2025-10-09

Description

This paper assesses how well Large Language Models (LLMs) can infer, remember, and follow user preferences in long, multi-session conversations. The evaluation of 10 different LLMs using this benchmark revealed that current state-of-the-art models exhibit significant difficulty proactively following user preferences, with accuracy dropping below 10% in zero-shot settings within a short number of turns. The researchers conclude that while fine-tuning on PrefEval can improve results, the benchmark demonstrates LLMs still face challenges in personalized conversational abilities.

Comments

In Channel

Provably Learning from Language Feedback

2025-10-2119:55

In-Context Learning for Pure Exploration

2025-10-2116:30

On the Role of Preference Variance in Preference Optimization

2025-10-2014:42

Training LLM Agents to Empower Humans

2025-10-2013:38

Richard Sutton Declares LLMs a Dead End

2025-10-2013:20

Demystifying Reinforcement Learning in Agentic Reasoning

2025-10-1915:21

Emergent coordination in multi-agent language models

2025-10-1913:57

Learning-to-measure: in-context active feature acquisition

2025-10-1916:02

Andrej Karpathy's insights: AGI, Intelligence, and Evolution

2025-10-1916:11

Front-Loading Reasoning: The Synergy between Pretraining and Post-Training Data

2025-10-1812:48

Representation-Based Exploration for Language Models: From Test-Time to Post-Training

2025-10-1817:02

The attacker moves second: stronger adaptive attacks bypass defenses against LLM jail- Breaks and prompt injections

2025-10-1816:08

When can in-context learning generalize out of task distribution?

2025-10-1619:44

The Art of Scaling Reinforcement Learning Compute for LLMs

2025-10-1613:41

A small number of samples can poison LLMs of any size

2025-10-1613:58

Dual Goal Representations

2025-10-1417:11

Welcome to the Era of Experience

2025-10-1416:42

Value Flows: Flow-Based Distributional Reinforcement Learning

2025-10-1415:42

Self-Adapting Language Models

2025-10-1216:42

The Markovian Thinker

2025-10-1214:15

00:00

Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs

#box-pro-ellipsis-176110381147798{-webkit-line-clamp:2;}Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs

Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs

Enoch H. Kang

Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs