Science Cast

Recalling Too Well: Sycophancy Evaluation and Mitigation in Memory-Augmented Models

librarianJune 10, 2026 1:37am

Views (49)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

Recalling Too Well: Sycophancy Evaluation and Mitigation in Memory-Augmented Models

arXivPDFJune 9, 2026 12:00am

Authors

Shelly Bensal, Axel Magnuson, Aparna Balagopalan, Daniel M. Bikel

Abstract

Persistent memory systems promise to make LLMs more helpful by storing user beliefs over time. We show they also make models less correct by systematically amplifying sycophancy, wherein models prioritize agreement with users over accuracy. We conduct the first systematic evaluation of this effect, introducing MIST: a benchmark of synthetically generated multi-turn conversations where users express plausible misconceptions in scientific, medical, and moral reasoning domains. Testing across three state-of-the-art memory systems and five model families reveals that memory amplifies sycophantic behavior across all conditions, with up to 25x higher sycophancy rates than in-context baselines. Error analyses suggest memory extraction as the primary culprit: lossy compression into discrete snippets encodes user misconceptions while discarding corrective context. Based on these results, we propose two lightweight mitigations that substantially reduce sycophancy while matching or exceeding memory systems at factual recall.

TwitterandLinkedIn

0 comments

Add comment

Recalling Too Well: Sycophancy Evaluation and Mitigation in Memory-Augmented Models

Recalling Too Well: Sycophancy Evaluation and Mitigation in Memory-Augmented Models

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments