Spaces:
Sleeping
Sleeping
XufengDuan
commited on
Commit
•
842fa95
1
Parent(s):
616e12e
update scripts
Browse files- src/display/about.py +2 -2
src/display/about.py
CHANGED
@@ -36,12 +36,12 @@ class Tasks(Enum):
|
|
36 |
|
37 |
|
38 |
# Your leaderboard name
|
39 |
-
TITLE = """<h1 align="center" id="space-title">
|
40 |
|
41 |
# What does your leaderboard evaluate?
|
42 |
INTRODUCTION_TEXT = """
|
43 |
Welcome to the Humanlikeness Leaderboard, curated by [Xufeng Duan](https://xufengduan.github.io/). This platform rigorously evaluates the alignment between human and model responses across five key aspects of language: sounds, words, syntax, meaning, and discourse, to quantify a model's humanlikeness.<br><br>
|
44 |
-
|
45 |
The congruence between human and model responses provides a precise measure of the model's humanlikeness, offering critical insights into how closely these models mirror human cognitive processes.<br>
|
46 |
"""
|
47 |
|
|
|
36 |
|
37 |
|
38 |
# Your leaderboard name
|
39 |
+
TITLE = """<h1 align="center" id="space-title">Humanlikeness(HL) leaderboard</h1>"""
|
40 |
|
41 |
# What does your leaderboard evaluate?
|
42 |
INTRODUCTION_TEXT = """
|
43 |
Welcome to the Humanlikeness Leaderboard, curated by [Xufeng Duan](https://xufengduan.github.io/). This platform rigorously evaluates the alignment between human and model responses across five key aspects of language: sounds, words, syntax, meaning, and discourse, to quantify a model's humanlikeness.<br><br>
|
44 |
+
Leveraging a large-scale dataset comprising responses from 2000 human participants, we have created detailed response distributions that capture natural human behavior. These extensive datasets are then compared against response distributions generated by LLMs when presented with identical stimuli.<br><br>
|
45 |
The congruence between human and model responses provides a precise measure of the model's humanlikeness, offering critical insights into how closely these models mirror human cognitive processes.<br>
|
46 |
"""
|
47 |
|