4 2 14

Christopher PRO

chkla

https://linktr.ee/chkla

AI & ML interests

🚀 NLP and Computational Social Science

Recent Activity

liked a Space about 1 month ago

CohereForAI/aya_expanse

liked a model about 1 month ago

CohereForAI/aya-expanse-8b

liked a model about 1 month ago

CohereForAI/aya-expanse-32b

View all activity

Organizations

chkla's activity

liked a Space about 1 month ago

Running on T4

222

🌍

CohereForAI/aya-expanse-8b

Text Generation • Updated Oct 30 • 47k • 297

CohereForAI/aya-expanse-32b

Text Generation • Updated Nov 1 • 34.2k • 179

updated a Space about 2 months ago

Sleeping

✍

chkla/quiz-politics-germany-education

Viewer • Updated Sep 1 • 981 • 55

chkla/polsci-exams-mcq

Viewer • Updated Aug 31 • 200 • 10

liked a Space 5 months ago

Running

🔥

partypress/partypress-monolingual-germany

Text Classification • Updated Nov 9, 2023 • 26 • 3

updated a Space 6 months ago

Runtime error

🏃

allenai/WildChat

Viewer • Updated Oct 17 • 529k • 1.41k • 124

upvoted an article 7 months ago

Article

Energy Scores for AI Models

•

May 9

• 30

updated a model 8 months ago

chkla/parlbert-topic-german

Text Classification • Updated Apr 8 • 253 • 11

reacted to thomwolf's post with ❤️ 8 months ago

Post

4978

A Little guide to building Large Language Models in 2024

This is a post-recording of a 75min lecture I gave two weeks ago on how to train a LLM from scratch in 2024. I tried to keep it short and comprehensive – focusing on concepts that are crucial for training good LLM but often hidden in tech reports.

In the lecture, I introduce the students to all the important concepts/tools/techniques for training good performance LLM:
* finding, preparing and evaluating web scale data
* understanding model parallelism and efficient training
* fine-tuning/aligning models
* fast inference

There is of course many things and details missing and that I should have added to it, don't hesitate to tell me you're most frustrating omission and I'll add it in a future part. In particular I think I'll add more focus on how to filter topics well and extensively and maybe more practical anecdotes and details.

Now that I recorded it I've been thinking this could be part 1 of a two-parts series with a 2nd fully hands-on video on how to run all these steps with some libraries and recipes we've released recently at HF around LLM training (and could be easily adapted to your other framework anyway):
*datatrove for all things web-scale data preparation: https://github.com/huggingface/datatrove
*nanotron for lightweight 4D parallelism LLM training: https://github.com/huggingface/nanotron
*lighteval for in-training fast parallel LLM evaluations: https://github.com/huggingface/lighteval

Here is the link to watch the lecture on Youtube: https://www.youtube.com/watch?v=2-SPH9hIKT8
And here is the link to the Google slides: https://docs.google.com/presentation/d/1IkzESdOwdmwvPxIELYJi8--K3EZ98_cL6c5ZcLKSyVg/edit#slide=id.p

Enjoy and happy to hear feedback on it and what to add, correct, extend in a second part.

2 replies

liked a model 9 months ago

CohereForAI/c4ai-command-r-v01

Text Generation • Updated Sep 27 • 7.33k • 1.07k

updated 2 Spaces 9 months ago

Sleeping

🏢

AnnotationPromptCards

Sleeping

🏃

PromptCardsPlayground

updated a model 10 months ago

chkla/parlbert-german-v1

Fill-Mask • Updated Feb 22 • 41

reacted to dvilasuero's post with 🤗 10 months ago

Post

🤗 Data is better together!

Data is essential for training good AI systems. We believe that the amazing community built around open machine learning can also work on developing amazing datasets together.

To explore how this can be done, Argilla and Hugging Face are thrilled to announce a collaborative project where we’re asking Hugging Face community members to build a dataset consisting of LLM prompts collectively.

What are we doing?
Using an instance of Argilla — a powerful open-source data collaboration tool — hosted on the Hugging Face Hub, we are collecting ratings of prompts based on their quality.

How Can You Contribute?
It’s super simple to start contributing:

1. Sign up if you don’t have a Hugging Face account

2. Go to this Argilla Space and sign in: https://huggingface.co/spaces/DIBT/prompt-collective

3. Read the guidelines and start rating prompts!

You can also join the #data-is-better-together channel in the Hugging Face Discord.

Finally, to track the community progress we'll be updating this Gradio dashboard:

https://huggingface.co/spaces/DIBT/prompt-collective-dashboard

5 replies

liked a model 10 months ago

allenai/OLMo-1B

Text Generation • Updated Jul 16 • 2.42k • 105

reacted to victor's post with 🤗 10 months ago

Post

🔥 New on HuggingChat: Assistants!

Today we are releasing Assistants on HuggingChat!
Assistants are a fun way to package your prompts and share them with the world - powered by Open source Models of course!

Learn more about Assistants here: huggingchat/chat-ui#357
Browse Assistants here: https://huggingface.co/chat/assistants

11 replies

Christopher PRO

AI & ML interests

Recent Activity

Organizations

chkla's activity

Aya Expanse

CohereForAI/aya-expanse-8b

CohereForAI/aya-expanse-32b

Narratives Annotation

chkla/quiz-politics-germany-education

chkla/polsci-exams-mcq

Mmlu Translation Progress

partypress/partypress-monolingual-germany

Parlbert Topic German Test

allenai/WildChat

Energy Scores for AI Models

chkla/parlbert-topic-german

CohereForAI/c4ai-command-r-v01

AnnotationPromptCards

PromptCardsPlayground

chkla/parlbert-german-v1

allenai/OLMo-1B