-
Personalised Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation
Paper • 2310.18628 • Published • 7 -
ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation
Paper • 2311.00272 • Published • 9 -
Magicoder: Source Code Is All You Need
Paper • 2312.02120 • Published • 79 -
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
Paper • 2312.04474 • Published • 29
Abdel-Dayane Marcos
admarcosai
AI & ML interests
Natural Language Processing, Graph Neural Networks, Reinforcement Learning
Recent Activity
liked
a model
1 day ago
jinaai/text-seg-lm-qwen2-0.5b-cot-topic-chunking
liked
a model
13 days ago
allenai/Molmo-7B-D-0924
liked
a model
about 2 months ago
mistralai/Mistral-7B-Instruct-v0.3
Organizations
None yet
Collections
52
-
Personalised Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation
Paper • 2310.18628 • Published • 7 -
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise
Paper • 2310.19019 • Published • 9 -
Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs
Paper • 2311.02262 • Published • 10 -
Thread of Thought Unraveling Chaotic Contexts
Paper • 2311.08734 • Published • 6
models
16
admarcosai/ppo-Pyramids1
Reinforcement Learning
•
Updated
•
30
admarcosai/ppo-SnowballTarget2
Reinforcement Learning
•
Updated
•
40
admarcosai/ppo-SnowballTarget1
Reinforcement Learning
•
Updated
•
20
admarcosai/esm2_t12_35M_UR50D-finetuned-localization
Text Classification
•
Updated
•
11
admarcosai/sd-class-butterflies-32
Unconditional Image Generation
•
Updated
•
8
admarcosai/taxi-v3-qlearning_500000
Reinforcement Learning
•
Updated
admarcosai/taxi-v3-qlearning_200
Reinforcement Learning
•
Updated
admarcosai/taxi-v3-qlearning
Reinforcement Learning
•
Updated
admarcosai/q-FrozenLake-v1-4x4-Slippery
Reinforcement Learning
•
Updated
admarcosai/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
datasets
None public yet