Interés - a JuanRafap Collection

updated 1 day ago

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published Nov 4 • 35
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published Nov 7 • 49
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5 • 60
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

Paper • 2410.08815 • Published Oct 11 • 43
Game-theoretic LLM: Agent Workflow for Negotiation Games

Paper • 2411.05990 • Published Nov 8 • 7
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Paper • 2411.10640 • Published 27 days ago • 44
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs

Paper • 2411.19146 • Published 15 days ago • 13
Snowflake/snowflake-arctic-embed-m-v2.0

Sentence Similarity • Updated 8 days ago • 3.22k • 31
Snowflake/snowflake-arctic-embed-l-v2.0

Sentence Similarity • Updated 8 days ago • 8.22k • 54
EXAONE 3.5: Series of Large Language Models for Real-world Use Cases

Paper • 2412.04862 • Published 7 days ago • 42
ruliad/deepthought-8b-llama-v0.01-alpha

Text Generation • Updated 6 days ago • 34.8k • 106
Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published 14 days ago • 48
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation

Paper • 2412.02592 • Published 10 days ago • 18
RL Zero: Zero-Shot Language to Behaviors without any Supervision

Paper • 2412.05718 • Published 6 days ago • 3