Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
12
17
1
Yi Cui
PRO
onekq
Follow
ermac1987's profile picture
Getcho's profile picture
Csplk's profile picture
16 followers
ยท
22 following
https://onekq.ai
onekq_ai
onekq
yicui
AI & ML interests
Benchmark, Code Generation Model
Recent Activity
updated
a Space
20 days ago
onekq-ai/WebApp1K-models-leaderboard
posted
an
update
about 1 month ago
October version of Claude 3.5 lifts SOTA (set by its June version) by 7 points. https://huggingface.co/spaces/onekq-ai/WebApp1K-models-leaderboard Closed sourced models are widening the gap again. Note: Our frontier leaderboard now uses double test scenarios because the single-scenario test suit has been saturated.
New activity
about 1 month ago
onekq-ai/WebApp1K-models-leaderboard:
All the clickable links are not accessible...
View all activity
Articles
Does Daily Software Engineering Work Need Reasoning Models?
Sep 24
โข
5
All LLMs Write Great Code, But Some Make (A Lot) Fewer Mistakes
Sep 12
โข
4
Organizations
onekq
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a Space
about 2 months ago
Running
15
๐ฆ
Quant Request