COpen

CogVideoX-5B

by Tsinghua / Zhipu AI  ·  2025

Lightweight entry point. 12GB GPU, Apache 2.0. Most accessible for experimentation.

Specifications

Max Resolution

720p (1360x768)

Max Duration

6s

FPS

8

Native Audio

No

ComfyUI Support

Yes

Fine-tunable

Yes

Min VRAM

12GB

Cost / Second

Self-host

Architecture

Expert Transformer

Parameters

5B

Inputs

T2V

License

Apache 2.0

Strengths & Trade-offs

Strengths

  • Lightweight 12GB
  • Apache 2.0
  • LoRA + DDIM Inverse
  • ModelScope integration

Weaknesses

  • Low FPS (8fps)
  • 6s max
  • older architecture

Best For

  • Budget prototyping
  • research
  • motion-heavy short clips

Scores

Quality
5.5
Motion
6
Speed
8
Control
5
Audio
-
Value
9

Workflows on Floyo

Coming soon to Floyo

We are working on bringing CogVideoX-5B workflows to the platform.

Get notified →

Compare with other models

AOpen
Wan 2.2

Alibaba (Tongyi Lab)

Quality
7
Motion
7.5
Speed
9
Compare →
SOpen
LTX-2.3

Lightricks

Quality
8.5
Motion
8
Speed
8
Compare →
BOpen
HunyuanVideo 1.5

Tencent

Quality
7
Motion
7
Speed
8
Compare →