Qiying Yu

qiying

AI & ML interests

None yet

Recent Activity

Organizations

qiying's activity

New activity in CausalLM/miniG 3 months ago

About the Data Generation Method

2
#4 opened 3 months ago by qiying
upvoted an article 5 months ago
view article
Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

โ€ข 67
Reacted to merve's post with โค๏ธ 6 months ago
view post
Post
EVA-CLIP ๐Ÿฆ– is the CLIP scaled to the moon! ๐Ÿ”ฅ
The new SotA CLIP-like model ๐Ÿ†
Highlights โœจ
- Performs better in linear probing
- Outperforms in Zero-Shot Image-Text Retrieval
- Higher zero-shot accuracy in IN-1K

As usual, try it with the notebook I built for you https://colab.research.google.com/drive/1K7DdCORC3x4qyhwhuB4fT4wcfJ_BQLKw?usp=sharing#scrollTo=0ZS_lJ7SK6Ys
I also built a Space for you to compare the output probabilities to CLIP, seems that EVACLIP is more "sure" of it's results ๐Ÿ˜Š merve/EVACLIP
The authors have shared 8B checkpoints open with Apache 2.0 license ๐Ÿ’œ and it's built on top of transformers, super easy to use! BAAI/EVA-CLIP-8B
Read the paper EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters (2402.04252) ๐Ÿ“„
New activity in NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO 9 months ago

Training Hyperparameters

#6 opened 9 months ago by qiying