Yury Tokpanov
yury-zyphra
AI & ML interests
None yet
Organizations
yury-zyphra's activity
[bot] Conversion to Parquet
#4 opened 4 months ago
by
parquet-converter
Dataset Viewer issue: JobManagerCrashedError
#6 opened 4 months ago
by
yury-zyphra
Seems like WARC metadata is missing from this version?
1
#4 opened about 2 months ago
by
yury-zyphra
Error when loading the dataset directly using datasets.load_dataset()
13
#8 opened 3 months ago
by
yury-zyphra
How many rows are supposed to be in the train split?
4
#13 opened 6 months ago
by
yury-zyphra
Were the documents shuffled before the dataset was split into shards?
3
#5 opened 3 months ago
by
yury-zyphra
How many rows are there in the dataset?
1
#4 opened 3 months ago
by
yury-zyphra
Missing files
3
#2 opened 3 months ago
by
pengyuan
Create Core
3
#5 opened 4 months ago
by
DrChamyoung
How many documents are actually in Dolma v1.7?
1
#42 opened 4 months ago
by
yury-zyphra
Workaround for duplicated ID's issue
#41 opened 4 months ago
by
yury-zyphra
Update dolma.py
1
#39 opened 4 months ago
by
kshinoda
Recommended wget snippet does not create folders
#40 opened 4 months ago
by
yury-zyphra
Rearrange folders
#3 opened 4 months ago
by
yury-zyphra
Rename folders
#2 opened 4 months ago
by
yury-zyphra
Update README.md
#1 opened 4 months ago
by
yury-zyphra
Add "not fine-tuned" for chat disclaimer.
#2 opened 4 months ago
by
yury-zyphra
Duplicate Key when Loading Dataset
4
#18 opened 8 months ago
by
Hhhhhao97
Error Loading The Model
1
#3 opened 5 months ago
by
jonathanjordan21
Streaming access to the dataset raises an error
3
#26 opened 6 months ago
by
LorMolf
Update README.md
#1 opened 7 months ago
by
yury-zyphra
Update README.md
#2 opened 8 months ago
by
qanthony-z
Update README.md
#1 opened 8 months ago
by
yury-zyphra
Update README.md
#1 opened 8 months ago
by
yury-zyphra