gabrielchua
's Collections
math-visual-benchmarks
updated
Viewer
•
Updated
•
4.73k
•
1.84k
•
48
Viewer
•
Updated
•
3.34k
•
8.24k
•
47
Viewer
•
Updated
•
6.14k
•
8.43k
•
131
Viewer
•
Updated
•
11.3k
•
619
•
15
Viewer
•
Updated
•
1.1k
•
359
•
18
Viewer
•
Updated
•
1.74k
•
244
•
17
Viewer
•
Updated
•
11.6k
•
41.9k
•
233
Viewer
•
Updated
•
5.19k
•
5.44k
•
22
Viewer
•
Updated
•
5.01k
•
576
•
5
DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical
Reasoning Robustness of Vision Language Models
Paper
•
2411.00836
•
Published
•
15
We-Math: Does Your Large Multimodal Model Achieve Human-like
Mathematical Reasoning?
Paper
•
2407.01284
•
Published
•
78
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual
Math Problems?
Paper
•
2403.14624
•
Published
•
52