arxiv:2407.21018
Zhanming (Allan) Jie
allanjie
AI & ML interests
NLP, semantic parsing, named entity recognition
Organizations
Models: 11
allanjie/agent_reft_warmup_ep5 • Text Generation
allanjie/agent_reft_warmup_ep4 • Text Generation
allanjie/agent_reft_warmup_ep3 • Text Generation
allanjie/agent_reft_warmup_ep2 • Text Generation
allanjie/agent_reft_warmup_ep1 • Text Generation
allanjie/chat_robot_qwen • Text Generation • 7
allanjie/chat_robot • Text Generation
allanjie/ppo-LunarLander-v2 • Reinforcement Learning • 1
allanjie/ppo-LunarLander-v2-test
allanjie/math23k_train_test_roberta-base • 4 • 1