Konstantin Chernyshev
k4black
Recent Activity
Authored a paper 9 days ago: U-MATH: A University-Level Benchmark for Evaluating Mathematical Skills in LLMs
Upvoted a paper 9 days ago: U-MATH: A University-Level Benchmark for Evaluating Mathematical Skills in LLMs
Liked a dataset 9 days ago: toloka/mu-math
k4black's activity
Upload codebleu.py (1): #2 opened 11 months ago by fasterinnerlooper
Problem calling this using Huggingface Evaluate (1): #1 opened 11 months ago by fasterinnerlooper