README / README.md
Ray2333's picture
Update README.md
c2eb5be verified
metadata
title: README
emoji: 🏢
colorFrom: purple
colorTo: blue
sdk: static
pinned: false

Welcome to the official repository for DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision-Language Models. This repository contains the code, resources, and documentation supporting our paper, which introduces DynaMath: a benchmark designed to rigorously evaluate mathematical reasoning across various vision-language models (VLMs).

For further details, including the benchmark leaderboard, please visit our project website and our preprint paper.

image/png