[Dev] Add VAE Inference Example#38

Open
themistbeforedawn wants to merge 1 commit into
SandAI-org:mainfrom
themistbeforedawn:dev/inference-example
@themistbeforedawn

Collaborator

🗂️ PR Category

  • ✨ New Feature
  • 🚀 Optimization (performance, memory, etc.)
  • 💥 Breaking Change
  • 🐛 Bug Fix
  • 🛠️ Development / Refactoring
  • 📚 Documentation
  • 🧹 Chore (Dependencies, CI/CD, Configuration, etc.)
  • 🧪 Testing

📝 Description

This PR adds only the WAN 2.2 VAE inference benchmark/example, which measures the performance impact of the optimization passes introduced in previous PRs.

The numbers below are CUDA hardware time sums, averaged over the profiled iterations, on the WAN 2.2 VAE 540p workload, measured on an NVIDIA H100 80G HBM3 GPU. Values in parentheses show the MagiCompiler speedup over the corresponding baseline (baseline time divided by MAGI_COMPILE time).

Decode

| Shape mode | MAGI_COMPILE | TORCH_COMPILE | EAGER |
| --- | --- | --- | --- |
| Static H/W | 457.943 ms | 526.973 ms (1.15x) | 855.131 ms (1.87x) |
| Dynamic H/W | 553.543 ms | 768.700 ms (1.39x) | 855.131 ms (1.54x) |

Encode

| Shape mode | MAGI_COMPILE | TORCH_COMPILE | EAGER |
| --- | --- | --- | --- |
| Static H/W | 134.444 ms | 151.183 ms (1.12x) | 269.702 ms (2.01x) |
| Dynamic H/W | 179.025 ms | 289.522 ms (1.62x) | 269.702 ms (1.51x) |
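
The parenthesized speedups can be cross-checked from the latencies in the Decode and Encode tables. The sketch below assumes the speedup is the baseline time divided by the MAGI_COMPILE time, rounded to two decimals. The names `LATENCY_MS`, `speedup`, and `speedup_table` are illustrative and are not part of this PR.

```python
# Latencies in milliseconds, copied from the Decode and Encode tables above.
LATENCY_MS = {
    ("decode", "static"): {"MAGI_COMPILE": 457.943, "TORCH_COMPILE": 526.973, "EAGER": 855.131},
    ("decode", "dynamic"): {"MAGI_COMPILE": 553.543, "TORCH_COMPILE": 768.700, "EAGER": 855.131},
    ("encode", "static"): {"MAGI_COMPILE": 134.444, "TORCH_COMPILE": 151.183, "EAGER": 269.702},
    ("encode", "dynamic"): {"MAGI_COMPILE": 179.025, "TORCH_COMPILE": 289.522, "EAGER": 269.702},
}


def speedup(baseline_ms: float, magi_ms: float) -> float:
    """Assumed definition: baseline latency divided by MAGI_COMPILE latency."""
    if magi_ms <= 0:
        raise ValueError("magi_ms must be positive")
    return baseline_ms / magi_ms


def speedup_table(latency_ms: dict) -> dict:
    """Return the two-decimal speedup of MAGI_COMPILE over every other mode."""
    table = {}
    for key, row in latency_ms.items():
        magi_ms = row["MAGI_COMPILE"]
        table[key] = {
            mode: round(speedup(ms, magi_ms), 2)
            for mode, ms in row.items()
            if mode != "MAGI_COMPILE"
        }
    return table


if __name__ == "__main__":
    for (stage, shape), row in speedup_table(LATENCY_MS).items():
        cells = ", ".join(f"{mode}: {value:.2f}x" for mode, value in row.items())
        print(f"{stage:6s} {shape:7s} {cells}")
```

Under that assumption, the computed ratios match the parenthesized values in both tables.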
