We set out to test LLM reasoning capabilities using Einstein's puzzle, a complex logic problem in which 15 clues about 5 houses with different characteristics must be combined to determine who owns a fish. Our initial ...