The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12 • 112
Qwen2-Audio Collection Audio-language model series based on Qwen2 • 4 items • Updated Aug 9 • 37
Parler-TTS: fully open-source high-quality TTS Collection If you want to find out more about how these models were trained and even fine-tune them yourself, check-out the Parler-TTS repository on GitHub. • 7 items • Updated Aug 8 • 40
LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection Paper • 2408.04284 • Published Aug 8 • 20
Transformer Explainer: Interactive Learning of Text-Generative Models Paper • 2408.04619 • Published Aug 8 • 152
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI Paper • 2408.03361 • Published Aug 6 • 85
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models Paper • 2408.02085 • Published Aug 4 • 17
MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization Paper • 2408.02555 • Published Aug 5 • 27
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining Paper • 2408.02657 • Published Aug 5 • 32
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation Paper • 2408.02545 • Published Aug 5 • 32
💻 CodeLlama Collection Llama and CodeLlama models trained to improve the performance in terms of code generation. • 4 items • Updated 27 days ago • 3
🔮 Mixture of Experts Collection MoE done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y • 13 items • Updated 27 days ago • 22
👑 Monarch Collection Family of 7B models that combine excellent reasoning and conversational abilities. • 7 items • Updated 27 days ago • 11
CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis Paper • 2407.13301 • Published Jul 18 • 55
Scalify: scale propagation for efficient low-precision LLM training Paper • 2407.17353 • Published Jul 24 • 11
MOMAland: A Set of Benchmarks for Multi-Objective Multi-Agent Reinforcement Learning Paper • 2407.16312 • Published Jul 23 • 12
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper • 2407.16741 • Published Jul 23 • 67
Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data? Paper • 2407.16607 • Published Jul 23 • 21
Very Large-Scale Multi-Agent Simulation in AgentScope Paper • 2407.17789 • Published Jul 25 • 30
Llama 3.1 Collection This collection hosts the transformers and original repos of the Meta Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Aug 2 • 549
MobileCLIP Models + DataCompDR Data Collection MobileCLIP: Mobile-friendly image-text models with SOTA zero-shot capabilities. DataCompDR: Improved datasets for training image-text SOTA models. • 22 items • Updated Jun 20 • 20
view article Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare Apr 19 • 97
view article Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Apr 22 • 78
view article Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints May 1 • 60
view article Article Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens tokens and 11 languages May 24 • 24
view article Article CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models May 24 • 21
view article Article Ethics and Society Newsletter #6: Building Better AI: The Importance of Data Quality Jun 24 • 30
Top LLM Collection Collection of TOP Open Source LLM, Sort by Best on top • 6 items • Updated Jul 26 • 9
view article Article Introducing Ghost 8B Beta: A Game-Changing Language Model By lamhieu • Jul 17 • 7
view article Article Introducing HelpingAI-15B: Emotionally Intelligent Conversational AI By Abhaykoul • Jul 12 • 3
view article Article Mixture of Agents Model (MAM): An AI-Driven Full-Stack Development Team By dnnsdunca • Jul 15 • 1
YOLOv10 Collection This collection hosts the YOLOv10 model releases • 16 items • Updated Jun 3 • 16
C4AI Aya 23 Collection Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 4 items • Updated Aug 6 • 45
Recent highlights Collection Some recent models worth checking out • 14 items • Updated 6 days ago • 22
🎠Avatars Collection The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 65 items • Updated 8 days ago • 74
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 111
FP8 LLMs for vLLM Collection Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 37 items • Updated 17 days ago • 50
🚀GGUF Collection Llama.cpp compatible models, can be used on CPUs and GPUs! • 692 items • Updated 5 days ago • 30
Korean-Adapted Model Series Collection Korean-adapted Language Model Series • 13 items • Updated May 17 • 24