-
Personalised Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation
Paper • 2310.18628 • Published • 7 -
ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation
Paper • 2311.00272 • Published • 9 -
Magicoder: Source Code Is All You Need
Paper • 2312.02120 • Published • 79 -
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
Paper • 2312.04474 • Published • 29
Abdel-Dayane Marcos
admarcosai
AI & ML interests
Natural Language Processing, Graph Neural Networks, Reinforcement Learning
Organizations
None yet
Collections
52
-
Personalised Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation
Paper • 2310.18628 • Published • 7 -
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise
Paper • 2310.19019 • Published • 9 -
Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs
Paper • 2311.02262 • Published • 10 -
Thread of Thought Unraveling Chaotic Contexts
Paper • 2311.08734 • Published • 6
models
16
admarcosai/ppo-Pyramids1
Reinforcement Learning
•
Updated
•
54
admarcosai/ppo-SnowballTarget2
Reinforcement Learning
•
Updated
•
85
admarcosai/ppo-SnowballTarget1
Reinforcement Learning
•
Updated
•
55
admarcosai/esm2_t12_35M_UR50D-finetuned-localization
Text Classification
•
Updated
•
13
admarcosai/sd-class-butterflies-32
Unconditional Image Generation
•
Updated
•
20
admarcosai/taxi-v3-qlearning_500000
Reinforcement Learning
•
Updated
admarcosai/taxi-v3-qlearning_200
Reinforcement Learning
•
Updated
admarcosai/taxi-v3-qlearning
Reinforcement Learning
•
Updated
admarcosai/q-FrozenLake-v1-4x4-Slippery
Reinforcement Learning
•
Updated
admarcosai/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
datasets
None public yet