Anthony W Figueroa's picture

Anthony W Figueroa

THEFIG

·

AI & ML interests

None yet

Organizations

None yet

THEFIG's activity

upvoted an article 24 days ago

Article

Our Transformers Code Agent beats the GAIA benchmark!

Jul 1

• 45

upvoted 2 collections 3 months ago

Models Used in HackerNoon Publishing System

HackerNoon.com’s content management system empowers a small team to manage tens of thousands of writers, advertisers, & millions of readers 🙏 🤖 🙏🤖 • 14 items • Updated 12 days ago • 21

OpenCodeInterpreter

18 items • Updated Mar 3 • 82

upvoted an article 3 months ago

Article

Train custom AI models with the trainer API and adapt them to 🤗

By

•

Jun 29

• 33

upvoted a collection 3 months ago

Gemma 2 Release

15 items • Updated 26 days ago • 177

upvoted a paper 5 months ago

Imp: Highly Capable Large Multimodal Models for Mobile Devices

Paper • 2405.12107 • Published May 20 • 25

upvoted a collection 7 months ago

Common Corpus

The largest public domain dataset for training LLMs. • 27 items • Updated Jul 17 • 111

upvoted 3 papers 7 months ago

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14 • 124

Design2Code: How Far Are We From Automating Front-End Engineering?

Paper • 2403.03163 • Published Mar 5 • 93

Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters

Paper • 2403.02677 • Published Mar 5 • 16

upvoted 3 papers 8 months ago

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

Paper • 2401.10774 • Published Jan 19 • 53

BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation

Paper • 2401.17053 • Published Jan 30 • 30

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

Paper • 2401.14196 • Published Jan 25 • 46

upvoted a paper 9 months ago

MobileVLM : A Fast, Reproducible and Strong Vision Language Assistant for Mobile Devices

Paper • 2312.16886 • Published Dec 28, 2023 • 19

upvoted a paper 10 months ago

TinyGSM: achieving >80% on GSM8k with small language models

Paper • 2312.09241 • Published Dec 14, 2023 • 36

upvoted a paper 11 months ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 120

upvoted 14 papers 12 months ago

Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models

Paper • 2310.13671 • Published Oct 20, 2023 • 18

4K4D: Real-Time 4D View Synthesis at 4K Resolution

Paper • 2310.11448 • Published Oct 17, 2023 • 36

Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

Paper • 2310.11441 • Published Oct 17, 2023 • 26

BitNet: Scaling 1-bit Transformers for Large Language Models

Paper • 2310.11453 • Published Oct 17, 2023 • 96

ObjectComposer: Consistent Generation of Multiple Objects Without Fine-tuning

Paper • 2310.06968 • Published Oct 10, 2023 • 1

OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation

Paper • 2310.07749 • Published Oct 11, 2023 • 5

WorldSmith: Iterative and Expressive Prompting for World Building with a Generative AI

Paper • 2308.13355 • Published Aug 25, 2023 • 2

Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models

Paper • 2310.07653 • Published Oct 11, 2023 • 2

Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation

Paper • 2310.08541 • Published Oct 12, 2023 • 17

Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

Paper • 2310.08491 • Published Oct 12, 2023 • 53

MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback

Paper • 2309.10691 • Published Sep 19, 2023 • 4

Lemur: Harmonizing Natural Language and Code for Language Agents

Paper • 2310.06830 • Published Oct 10, 2023 • 30

SALMON: Self-Alignment with Principle-Following Reward Models

Paper • 2310.05910 • Published Oct 9, 2023 • 3

Octopus: Embodied Vision-Language Programmer from Environmental Feedback

Paper • 2310.08588 • Published Oct 12, 2023 • 34

upvoted 7 papers about 1 year ago

CodePlan: Repository-level Coding using LLMs and Planning

Paper • 2309.12499 • Published Sep 21, 2023 • 73

MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities

Paper • 2308.02490 • Published Aug 4, 2023 • 16

Retentive Network: A Successor to Transformer for Large Language Models

Paper • 2307.08621 • Published Jul 17, 2023 • 170

DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations

Paper • 2307.07047 • Published Jul 13, 2023 • 15

Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts

Paper • 2307.07218 • Published Jul 14, 2023 • 26

Learning to Retrieve In-Context Examples for Large Language Models

Paper • 2307.07164 • Published Jul 14, 2023 • 21

Copy Is All You Need

Paper • 2307.06962 • Published Jul 13, 2023 • 33

upvoted a paper over 1 year ago

ChessGPT: Bridging Policy Learning and Language Modeling

Paper • 2306.09200 • Published Jun 15, 2023 • 9