ClayFace (Clayface)

upvoted a paper 7 days ago

MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling

Paper • 2409.16160 • Published 8 days ago • 28

upvoted a paper 18 days ago

StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation

Paper • 2312.12491 • Published Dec 19, 2023 • 69

upvoted a paper 19 days ago

DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors

Paper • 2409.08278 • Published 20 days ago • 10

upvoted a paper 22 days ago

GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers

Paper • 2409.04196 • Published 26 days ago • 11

upvoted an article about 2 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29 • 206

upvoted an article 3 months ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24 • 168

upvoted 3 papers 4 months ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10 • 64

Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control

Paper • 2405.12970 • Published May 21 • 22

FIFO-Diffusion: Generating Infinite Videos from Text without Training

Paper • 2405.11473 • Published May 19 • 53

upvoted an article 5 months ago

Article

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

May 1 • 63

upvoted a paper 5 months ago

Stylus: Automatic Adapter Selection for Diffusion Models

Paper • 2404.18928 • Published Apr 29 • 14

upvoted an article 5 months ago

Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

Apr 29 • 71

upvoted a collection 5 months ago

LLaVA-Llama-3-8B

Collection

8 items • Updated Apr 28 • 18

upvoted 3 papers 5 months ago

upvoted a collection 6 months ago

WizardLM

Collection

0 items • Updated Jul 11 • 103

upvoted 4 papers 6 months ago

RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion

Paper • 2404.07199 • Published Apr 10 • 25

SpatialTracker: Tracking Any 2D Pixels in 3D Space

Paper • 2404.04319 • Published Apr 5 • 23

SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions

Paper • 2403.16627 • Published Mar 25 • 20

DepthFM: Fast Monocular Depth Estimation with Flow Matching

Paper • 2403.13788 • Published Mar 20 • 16

upvoted 9 papers 7 months ago

mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding

Paper • 2403.12895 • Published Mar 19 • 29

StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control

Paper • 2403.09055 • Published Mar 14 • 24

LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation

Paper • 2403.12019 • Published Mar 18 • 8

VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models

Paper • 2403.12034 • Published Mar 18 • 5

MoAI: Mixture of All Intelligence for Large Language and Vision Models

Paper • 2403.07508 • Published Mar 12 • 75

V3D: Video Diffusion Models are Effective 3D Generators

Paper • 2403.06738 • Published Mar 11 • 28

Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU

Paper • 2403.06504 • Published Mar 11 • 53

3D Diffusion Policy

Paper • 2403.03954 • Published Mar 6 • 11

Divide-or-Conquer? Which Part Should You Distill Your LLM?

Paper • 2402.15000 • Published Feb 22 • 22

upvoted a collection 7 months ago

MiquMaid

Collection

6 items • Updated Jul 26 • 12

upvoted a paper 7 months ago

FlashTex: Fast Relightable Mesh Texturing with LightControlNet

Paper • 2402.13251 • Published Feb 20 • 13

upvoted 6 papers 8 months ago

More Agents Is All You Need

Paper • 2402.05120 • Published Feb 3 • 51

StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback

Paper • 2402.01391 • Published Feb 2 • 41

Training-Free Consistent Text-to-Image Generation

Paper • 2402.03286 • Published Feb 5 • 64

Agile But Safe: Learning Collision-Free High-Speed Legged Locomotion

Paper • 2401.17583 • Published Jan 31 • 25

Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance

Paper • 2401.15687 • Published Jan 28 • 21

Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI

Paper • 2401.14019 • Published Jan 25 • 19

upvoted 16 papers 9 months ago

TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion

Paper • 2401.09416 • Published Jan 17 • 9

En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data

Paper • 2401.01173 • Published Jan 2 • 11

Understanding LLMs: A Comprehensive Overview from Training to Inference

Paper • 2401.02038 • Published Jan 4 • 61

LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Paper • 2401.01325 • Published Jan 2 • 26

DocLLM: A layout-aware generative language model for multimodal document understanding

Paper • 2401.00908 • Published Dec 31, 2023 • 178

VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM

Paper • 2401.01256 • Published Jan 2 • 19

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 79

FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis

Paper • 2312.17681 • Published Dec 29, 2023 • 18

LARP: Language-Agent Role Play for Open-World Games

Paper • 2312.17653 • Published Dec 24, 2023 • 29

Compact Neural Graphics Primitives with Learned Hash Probing

Paper • 2312.17241 • Published Dec 28, 2023 • 6

DreamGaussian4D: Generative 4D Gaussian Splatting

Paper • 2312.17142 • Published Dec 28, 2023 • 18

Make-A-Character: High Quality Text-to-3D Character Generation within Minutes

Paper • 2312.15430 • Published Dec 24, 2023 • 28

Human101: Training 100+FPS Human Gaussians in 100s from 1 View

Paper • 2312.15258 • Published Dec 23, 2023 • 7

MagicScroll: Nontypical Aspect-Ratio Image Generation for Visual Storytelling via Multi-Layered Semantic-Aware Denoising

Paper • 2312.10899 • Published Dec 18, 2023 • 14

VCoder: Versatile Vision Encoders for Multimodal Large Language Models

Paper • 2312.14233 • Published Dec 21, 2023 • 15

Neural feels with neural fields: Visuo-tactile perception for in-hand manipulation

Paper • 2312.13469 • Published Dec 20, 2023 • 10

upvoted 6 papers 10 months ago

ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors

Paper • 2312.13324 • Published Dec 20, 2023 • 9

Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model

Paper • 2312.13252 • Published Dec 20, 2023 • 27

UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections

Paper • 2312.13285 • Published Dec 20, 2023 • 5

Tracking Any Object Amodally

Paper • 2312.12433 • Published Dec 19, 2023 • 11

Self-Evaluation Improves Selective Generation in Large Language Models

Paper • 2312.09300 • Published Dec 14, 2023 • 14

Weight subcloning: direct initialization of transformers using larger pretrained ones

Paper • 2312.09299 • Published Dec 14, 2023 • 17
