We set out to test LLM reasoning capabilities using Einstein's puzzle, a complex logic problem involving 5 houses with different characteristics and 15 clues to determine who owns a fish. Our initial ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results