🔔 The automatic evaluation on CodaLab are under construction. The MathVista dataset is derived from three newly collected datasets: IQTest, FunctionQA, and Paper, as well as 28 other source datasets.
Math anxiety grows from stress, culture, and experience, not ability. By changing how we teach, test, and talk about math, we ...
Overview of the Thought Leap phenomenon and our bridging approach. (a) Thought Leaps in CoT; (b) Negative impact on training; (c) Bridging leaps improves reasoning performance. In this work, we ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results