← back
arXivLeizhen Zhang, Shuhan Chen, Sheng ChenWed, May 27, 2026, 8:18 AM PDT
score 16.4

New test reveals how well AI models solve logic puzzles

Original: Satisfiability Solving with LLMs: A Matched-Pair Evaluation of Reasoning Capability

Source: arxiv.org

Writing ELI5 summary…