arxiv:2410.13754
Zheng Zian (Andy)
OrionZheng
AI & ML interests
LLM, Mixture-of-Experts, Data-Centric AI
Recent Activity
liked a dataset (about 1 month ago): MixEval/MixEval-X
authored a paper (about 1 month ago): OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
authored a paper (about 1 month ago): MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures
Organizations
None yet
Papers: 2
Models: 11
OrionZheng/openmoe-34b-200B • Text Generation • Updated • 15 • 11
OrionZheng/openmoe-8b-chat • Text Generation • Updated • 16 • 8
OrionZheng/openmoe-8b • Text Generation • Updated • 13 • 3
OrionZheng/openmoe-8b-1T • Text Generation • Updated • 90 • 2
OrionZheng/openmoe-8b-800B • Text Generation • Updated • 11 • 1
OrionZheng/openmoe-8b-600B • Text Generation • Updated • 7
OrionZheng/openmoe-8b-400B • Text Generation • Updated • 16
OrionZheng/openmoe-8b-200B • Text Generation • Updated • 13 • 2
OrionZheng/openmoe-base • Text Generation • Updated • 861 • 4
OrionZheng/openmoe-8b-890B • Text Generation • Updated • 6
Datasets
None public yet