Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+ Apr 26 • 10
Revolutionizing Video Transcription: Unveiling Gemma-2b-it and Langchain in the Era of Transformers Mar 12 • 3
Uniting Forces: Integrating Hugging Face with Langchain for Enhanced Natural Language Processing Dec 18, 2023 • 4
Hearing is Believing: Revolutionizing AI with Audio Classification via Computer Vision Oct 22, 2023 • 1
Ankush Collection Transformer Articles
DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention Paper • 2309.14327 • Published Sep 25, 2023 • 21
MambaVision: A Hybrid Mamba-Transformer Vision Backbone Paper • 2407.08083 • Published Jul 10 • 27
Memory^3: Language Modeling with Explicit Memory Paper • 2407.01178 • Published Jul 1 • 3
Teaching Transformers Causal Reasoning through Axiomatic Training Paper • 2407.07612 • Published Jul 10 • 2
RAG articles
This collection is meant for RAG articles
GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models Paper • 2406.14550 • Published Jun 20 • 3
Mixture-of-Agents Enhances Large Language Model Capabilities Paper • 2406.04692 • Published Jun 7 • 54
Meta Prompting for AGI Systems Paper • 2311.11482 • Published Nov 20, 2023 • 3
Symbolic Learning Enables Self-Evolving Agents Paper • 2406.18532 • Published Jun 26 • 10
Andyrasika/vit-base-patch16-224-in21k-finetuned-lora-food101 Image Classification • Updated Mar 7 • 14 • 2