Study Reveals LLMs 'Playing Dumb' Is Positional Collapse, Not Answer Avoidance
A new preregistered study using option-order randomization experiments found that when large language models are prompte…
2 articles about 'Sandbagging'
A new preregistered study using option-order randomization experiments found that when large language models are prompte…
A latest arXiv paper investigates the 'sandbagging effect' where large language models deliberately underperform under w…