vtesting2 / comfyui /README.md
meepmoo's picture
Upload folder using huggingface_hub
0dcccdd verified
|
raw
history blame
5.88 kB

ComfyUI CogVideoX-Fun

Easily use CogVideoX-Fun inside ComfyUI!

1. Installation

Option 1: Install via ComfyUI Manager

TBD

Option 2: Install manually

The CogVideoX-Fun repository needs to be placed at ComfyUI/custom_nodes/CogVideoX-Fun/.

cd ComfyUI/custom_nodes/

# Git clone the cogvideox_fun itself
git clone https://github.com/aigc-apps/CogVideoX-Fun.git

# Git clone the video outout node
git clone https://github.com/Kosinkadink/ComfyUI-VideoHelperSuite.git

cd CogVideoX-Fun/
python install.py

2. Download models into ComfyUI/models/CogVideoX_Fun/

V1.1:

名称 存储空间 Hugging Face Model Scope 描述
CogVideoX-Fun-V1.1-2b-InP.tar.gz Before extraction:9.7 GB / After extraction: 13.0 GB 🤗Link 😄Link Our official graph-generated video model is capable of predicting videos at multiple resolutions (512, 768, 1024, 1280) and has been trained on 49 frames at a rate of 8 frames per second. Noise has been added to the reference image, and the amplitude of motion is greater compared to V1.0.
CogVideoX-Fun-V1.1-5b-InP.tar.gz Before extraction:16.0 GB / After extraction: 20.0 GB 🤗Link 😄Link Our official graph-generated video model is capable of predicting videos at multiple resolutions (512, 768, 1024, 1280) and has been trained on 49 frames at a rate of 8 frames per second. Noise has been added to the reference image, and the amplitude of motion is greater compared to V1.0.
CogVideoX-Fun-V1.1-2b-Pose.tar.gz Before extraction:9.7 GB / After extraction: 13.0 GB 🤗Link 😄Link Our official pose-control video model is capable of predicting videos at multiple resolutions (512, 768, 1024, 1280) and has been trained on 49 frames at a rate of 8 frames per second.
CogVideoX-Fun-V1.1-5b-Pose.tar.gz Before extraction:16.0 GB / After extraction: 20.0 GB 🤗Link 😄Link Our official pose-control video model is capable of predicting videos at multiple resolutions (512, 768, 1024, 1280) and has been trained on 49 frames at a rate of 8 frames per second.

V1.0:

Name Storage Space Hugging Face Model Scope Description
CogVideoX-Fun-2b-InP.tar.gz Before extraction:9.7 GB / After extraction: 13.0 GB 🤗Link 😄Link Our official graph-generated video model is capable of predicting videos at multiple resolutions (512, 768, 1024, 1280) and has been trained on 49 frames at a rate of 8 frames per second.
CogVideoX-Fun-5b-InP.tar.gz Before extraction:16.0 GB / After extraction: 20.0 GB 🤗Link 😄Link Our official graph-generated video model is capable of predicting videos at multiple resolutions (512, 768, 1024, 1280) and has been trained on 49 frames at a rate of 8 frames per second.

Node types

  • LoadCogVideoX_Fun_Model
    • Loads the CogVideoX-Fun model
  • CogVideoX_FUN_TextBox
    • Write the prompt for CogVideoX-Fun model
  • CogVideoX_Fun_I2VSampler
    • CogVideoX-Fun Sampler for Image to Video
  • CogVideoX_Fun_T2VSampler
    • CogVideoX-Fun Sampler for Text to Video
  • CogVideoX_Fun_V2VSampler
    • CogVideoX-Fun Sampler for Video to Video

Example workflows

Video to video generation

Our ui is shown as follow, this is the download link of the json: workflow graph

You can run the demo using following video: demo video

Control video generation

Our ui is shown as follow, this is the download link of the json: workflow graph

You can run the demo using following video: demo video

Image to video generation

Our ui is shown as follow, this is the download link of the json: workflow graph

You can run the demo using following photo: demo image

Text to video generation

Our ui is shown as follow, this is the download link of the json: workflow graph