It’s no surprise that training an LLM on material that demonstrates sound reasoning results in better performance, but I’m fascinated by this identification of “cognitive” behaviors (verification, backtracking, subgoal setting, backward chaining). They sound obvious but I also don’t recognize them from any study of human reasoning: https://nitter.net/gandhikanishk/status/1896988028893323675
I posted this in March 2025 during week 2661.
For more, you should follow me on the fediverse: @hans@gerwitz.com