Open
CogVideoX-5B
Tsinghua / Zhipu AI
Lightweight entry point: runs on a 12GB GPU under Apache 2.0. The most accessible option for experimentation.
Pick CogVideoX-5B if…
You want budget prototyping, research, or motion-heavy short clips.
Pick HunyuanVideo 1.5 if…
You want avatar generation, custom characters, or research.
Specifications

| Specification | CogVideoX-5B | HunyuanVideo 1.5 |
| --- | --- | --- |
| Maker | Tsinghua / Zhipu AI | Tencent |
| Source Type | Open Source | Open Source |
| License | Apache 2.0 | Tencent Open (check terms) |
| Architecture | Expert Transformer | DiT + 3D Causal VAE |
| Parameters | 5B | 8.3B |
| Max Resolution | 720p (1360x768) | 720p |
| Max Duration | 6s | 6-10s |
| FPS | 8 | 24 |
| Native Audio | No | No |
| ComfyUI Support | Yes | Yes |
| Fine-tunable | Yes | Yes |
| Min VRAM | 12GB | 24GB (RTX 4090) |
| Cost / Second | Self-host | $0.06 |
| Inputs | T2V | T2V, I2V, Avatar |
| On Floyo | No | Yes |
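The figures in the specifications can be turned into quick back-of-the-envelope numbers. The following is a minimal sketch, not an official tool: the `ModelSpec` class and helper names are hypothetical, the frame count is simply duration times FPS (real pipelines may emit a slightly different number of frames), and it assumes "Cost / Second" means dollars per second of generated video, which the page does not state explicitly.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class ModelSpec:
    """Hypothetical container for a few values from the specifications."""
    name: str
    fps: int
    max_duration_s: int
    min_vram_gb: int
    # None means self-hosted with no listed per-second price.
    cost_per_second_usd: Optional[float]


SPECS = [
    ModelSpec("CogVideoX-5B", fps=8, max_duration_s=6,
              min_vram_gb=12, cost_per_second_usd=None),
    ModelSpec("HunyuanVideo 1.5", fps=24, max_duration_s=10,
              min_vram_gb=24, cost_per_second_usd=0.06),
]


def nominal_frames(spec: ModelSpec, duration_s: int) -> int:
    """Nominal frame count for a clip: duration times frame rate."""
    if duration_s > spec.max_duration_s:
        raise ValueError(f"{spec.name} is listed at {spec.max_duration_s}s max")
    return spec.fps * duration_s


def clip_cost_usd(spec: ModelSpec, duration_s: int) -> Optional[float]:
    """Listed cost for one clip, or None when the model is self-hosted.

    Assumes the listed price is per second of output video.
    """
    if spec.cost_per_second_usd is None:
        return None
    return round(spec.cost_per_second_usd * duration_s, 2)


def models_that_fit(vram_gb: int) -> List[str]:
    """Names of models whose listed minimum VRAM fits the given GPU."""
    return [s.name for s in SPECS if s.min_vram_gb <= vram_gb]


if __name__ == "__main__":
    for spec in SPECS:
        print(spec.name, nominal_frames(spec, 6), clip_cost_usd(spec, 6))
    print(models_that_fit(16))
```

Under these assumptions, a 6s clip works out to three times as many frames on HunyuanVideo 1.5 as on CogVideoX-5B, which is the practical meaning of the 8 vs 24 FPS row.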
Strengths & Trade-offs
CogVideoX-5B
Strengths
- Lightweight: runs on 12GB of VRAM
- Apache 2.0 license
- LoRA and DDIM Inverse support
- ModelScope integration
Trade-offs
- Low frame rate (8 fps)
- 6s maximum duration
- Older architecture
Best For
- Budget prototyping
- Research
- Motion-heavy short clips
HunyuanVideo 1.5
Strengths
- Efficient at 8.3B parameters
- ~75s generation time on an RTX 4090
- Avatar and Custom variants
Trade-offs
- Short output (6-10s)
- 720p maximum resolution
- No native audio
Best For
- Avatar generation
- Custom characters
- Research
- Consumer GPUs
Run these models on Floyo
Browser-based ComfyUI. No setup, no GPU required.
- HunyuanVideo Foley (Lifelike Audio) (Audio/Foley, HunyuanVideo 1.5)
- HunyuanVideo 1.5 Image to Video (I2V, HunyuanVideo 1.5)