Reasoning Models Don't Always Say What They Think

Reasoning Models Don't Always Say What They Think. Chen, Y., Benton, J., Radhakrishnan, A., Uesato, J., Denison, C., Schulman, J., Somani, A., Hase, P., Wagner, M., & Roger, F. 2025.

Paper bibtex

@misc{chenReasoningModelsDont2025,
	title = {Reasoning {Models} {Don}'t {Always} {Say} {What} {They} {Think}},
	url = {https://assets.anthropic.com/m/71876fabef0f0ed4/original/reasoning_models_paper.pdf},
	publisher = {arXiv},
	author = {Chen, Yanda and Benton, Joe and Radhakrishnan, Ansh and Uesato, Jonathan and Denison, Carson and Schulman, John and Somani, Arushi and Hase, Peter and Wagner, Misha and Roger, Fabien},
	year = {2025},
}

Downloads: 0