JianweiYu 2025-08-26 18:07:28 -07:00
parent d38c23e1e1
commit f507425ee7

@@ -25,7 +25,7 @@ The model can synthesize speech up to **90 minutes** long with up to **4 distinc
### 🔥 News
-- **[2025-08-26] 🎉 We Opensource the VibeVoice-7B-Preview model weights!**
+- **[2025-08-26] 🎉 We Opensource the [VibeVoice-7B-Preview](https://huggingface.co/WestZhang/VibeVoice-Large-pt) model weights!**
### 📋 TODO
@@ -66,7 +66,7 @@ Try your own samples at [Demo](https://aka.ms/VibeVoice-Demo).
|-------|----------------|----------|----------|
| VibeVoice-0.5B-Streaming | - | - | On the way |
| VibeVoice-1.5B | 64K | ~90 min | [HF link](https://huggingface.co/microsoft/VibeVoice-1.5B) |
-| VibeVoice-7B-Preview| 32K | ~45 min | [HF link](https://huggingface.co/microsoft/VibeVoice-7B-Preview) |
+| VibeVoice-7B-Preview| 32K | ~45 min | [HF link](https://huggingface.co/WestZhang/VibeVoice-Large-pt) |
## Installation
We recommend to use NVIDIA Deep Learning Container to manage the CUDA environment.