diff --git a/README.md b/README.md index 331dec3..6f762a1 100644 --- a/README.md +++ b/README.md @@ -17,6 +17,13 @@ A core innovation of VibeVoice is its use of continuous speech tokenizers (Acous The model can synthesize speech up to **90 minutes** long with up to **4 distinct speakers**, surpassing the typical 1-2 speaker limits of many prior models. + +

+ MOS Preference Results + VibeVoice Overview +

+ + ### 🎵 Demo Examples **Cross-Lingual** @@ -29,11 +36,6 @@ https://github.com/user-attachments/assets/6f27a8a5-0c60-4f57-87f3-7dea2e11c730 For more examples, try it out via [Demo](https://aka.ms/VibeVoice-Demo). -

- MOS Preference Results - VibeVoice Overview -

- ## Models | Model | Context Length | Generation Length | Weight | |-------|----------------|----------|----------|