Zhipeng Yang

svjack

https://github.com/svjack

AI & ML interests

NLP, Search Engine, Dialogue System, Question Answering System, Knowledge Base, Stable Diffusion, CV

svjack's activity

upvoted 3 papers 11 days ago

Training-free Regional Prompting for Diffusion Transformers

Paper • 2411.02395 • Published 14 days ago • 23

SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation

Paper • 2411.04989 • Published 11 days ago • 13

TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation

Paper • 2411.04709 • Published 13 days ago • 25

upvoted a paper 22 days ago

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Paper • 2410.19355 • Published 25 days ago • 22

upvoted 3 papers 25 days ago

CCI3.0-HQ: a large-scale Chinese dataset of high quality designed for pre-training large language models

Paper • 2410.18505 • Published 26 days ago • 8

Framer: Interactive Frame Interpolation

Paper • 2410.18978 • Published 26 days ago • 36

Improve Vision Language Model Chain-of-thought Reasoning

Paper • 2410.16198 • Published 29 days ago • 17

upvoted 8 papers about 1 month ago

T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design

Paper • 2410.05677 • Published Oct 8 • 14

Story-Adapter: A Training-free Iterative Framework for Long Story Visualization

Paper • 2410.06244 • Published Oct 8 • 19

GLEE: A Unified Framework and Benchmark for Language-based Economic Environments

Paper • 2410.05254 • Published Oct 7 • 80

Agent S: An Open Agentic Framework that Uses Computers Like a Human

Paper • 2410.08164 • Published Oct 10 • 24

EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models

Paper • 2410.07133 • Published Oct 9 • 18

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published Oct 10 • 49

Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention

Paper • 2410.10774 • Published Oct 14 • 24

Animate-X: Universal Character Image Animation with Enhanced Motion Representation

Paper • 2410.10306 • Published Oct 14 • 52

upvoted a collection about 2 months ago

IMG/VIDEO upscale

Collection

6 items • Updated Jul 10 • 3

upvoted an article 2 months ago

Article

Introduction to ggml

Aug 13 • 112

upvoted a collection 6 months ago

Hugging Face community’s Wikimedia datasets

Collection

Wikimedia datasets created by the Hugging Face community, not Wikimedia. Sorted by Wikimedia project. • 17 items • Updated Jun 7 • 10

upvoted an article 6 months ago

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

May 7 • 39

upvoted a collection 7 months ago

llama3-zh

Collection

Portfolio of LLAMA3 fine-tuned models • 51 items • Updated Aug 19 • 7