Zheng Zian(Andy)
OrionZheng
AI & ML interests
LLM, Mixture-of-Experts, Data-Centric AI
Recent Activity
liked
a dataset
about 1 month ago
MixEval/MixEval-X
authored
a paper
about 1 month ago
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
authored
a paper
about 1 month ago
MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures
Organizations
None yet
OrionZheng's activity
model_type "llama"
1
#1 opened 4 months ago
by
Phando
Update config.json
#1 opened 4 months ago
by
OrionZheng
Update ada_vocab_factory.py
#1 opened 5 months ago
by
OrionZheng
convert t5x into pytorch model
1
#1 opened 11 months ago
by
Siddharth63
Fixed some data in bad format
1
#6 opened about 1 year ago
by
OrionZheng
Is there any overlap between peS2o dataset and the arxiv subset from Redpajama?
#2 opened over 1 year ago
by
OrionZheng
Add snippet to locate errors in the data files
1
#3 opened over 1 year ago
by
OrionZheng
How to obtain the original git-commits dataset?
1
#5 opened over 1 year ago
by
OrionZheng
Confusion and Discrepancy Regarding Deduplication Versions and Dataset Sizes
1
#26 opened over 1 year ago
by
OrionZheng
Cannot run the inference on the playground
2
#1 opened over 1 year ago
by
OrionZheng
Error while loading "alt-parallel" config: TypeError: Couldn't cast array
2
#3 opened over 1 year ago
by
albertvillanova
🚩 Report
2
#2 opened over 1 year ago
by
OrionZheng