Daily Shaarli

All links of one day in a single page.

June 4, 2025

[2505.23836] Large Language Models Often Know When They Are Being Evaluated

If AI models can detect when they are being evaluated, the effectiveness of evaluations might be compromised. For example, models could have systematically different behavior during evaluations,...