Will Held PRO
WillHeld
AI & ML interests
Machine Learning and Natural Language Processing for low-resource languages and language variants
WillHeld's activity
What is the total # tokens after sampling proportion? 1.7T or 1.65T
3
#36 opened 4 months ago by ivanzhouyq
Training Code
1
#2 opened about 1 month ago by setianke
Llama vs. OLMo token counts
#43 opened about 2 months ago by WillHeld
Common Crawl Dataset Partitioning method?
2
#37 opened 4 months ago by AlexFanWei
Storage Footprint of Dolma 1.7 (on disk)
2
#31 opened 5 months ago by skaramcheti
inference example
8
#1 opened 2 months ago by eschmidbauer
Inscrutable Issues w/ Torch and Zero-GPU
8
#93 opened 2 months ago by WillHeld
[bot] Conversion to Parquet
#1 opened 11 months ago by parquet-converter
Chain Of Thought Zero-Shot Prompting
#30 opened almost 2 years ago by WillHeld