This is a base model and new version of meta-llama: text-to-video using: openai/MMMLU & HuggingFace / finevideo datasets.

Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment