Abdel-Dayane Marcos's picture

Abdel-Dayane Marcos

admarcosai

·

AI & ML interests

Natural Language Processing, Graph Neural Networks, Reinforcement Learning

Recent Activity

liked a model 5 days ago

jinaai/text-seg-lm-qwen2-0.5b-cot-topic-chunking

liked a model 16 days ago

allenai/Molmo-7B-D-0924

liked a model about 2 months ago

mistralai/Mistral-7B-Instruct-v0.3

View all activity

Organizations

None yet

admarcosai's activity

commented 4 papers 9 months ago

Training Transformers with 4-bit Integers

Paper • 2306.11987 • Published Jun 21, 2023 • 22 •

Training Transformers with 4-bit Integers

Paper • 2306.11987 • Published Jun 21, 2023 • 22 •

OneBit: Towards Extremely Low-bit Large Language Models

Paper • 2402.11295 • Published Feb 17 • 23 •

3D Gaussian Splatting for Real-Time Radiance Field Rendering

Paper • 2308.04079 • Published Aug 8, 2023 • 171 •

commented 6 papers 10 months ago

A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

Paper • 2402.09727 • Published Feb 15 • 36 •

Computing Power and the Governance of Artificial Intelligence

Paper • 2402.08797 • Published Feb 13 • 12 •

Efficiently Programming Large Language Models using SGLang

Paper • 2312.07104 • Published Dec 12, 2023 • 7 •

Scaling Laws for Downstream Task Performance of Large Language Models

Paper • 2402.04177 • Published Feb 6 • 17 •

Secrets of RLHF in Large Language Models Part II: Reward Modeling

Paper • 2401.06080 • Published Jan 11 • 26 •

Transformer-Based Models Are Not Yet Perfect At Learning to Emulate Structural Recursion

Paper • 2401.12947 • Published Jan 23 • 2 •

New activity in tongyx361/MathInstruct-Core-DifficultyAware 10 months ago

Meaning of err_rate in the dataset

#2 opened 10 months ago by

commented a paper 12 months ago

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 138 •

New activity in deepseek-ai/deepseek-coder-6.7b-instruct 12 months ago

Trained on Code Search Net

#5 opened 12 months ago by

commented a paper 12 months ago

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

Paper • 2310.08659 • Published Oct 12, 2023 • 22 •

New activity in google-research-datasets/natural_questions about 2 years ago

Natural Questions is not streamable

#1 opened about 2 years ago by

New activity in community-datasets/wiki_snippets about 2 years ago

why does loading load_dataset('wiki_snippets', name='wiki40b_en_100_0') takes 3 hours when it only generates 12GB of data

#1 opened about 2 years ago by