From b711571f9f3dc734c430760d31769f3aeabbebc0 Mon Sep 17 00:00:00 2001 From: JianweiYu Date: Mon, 25 Aug 2025 09:32:14 -0700 Subject: [PATCH] update --- README.md | 16 +++++++++------- 1 file changed, 9 insertions(+), 7 deletions(-) diff --git a/README.md b/README.md index 828ace6..331dec3 100644 --- a/README.md +++ b/README.md @@ -17,15 +17,17 @@ A core innovation of VibeVoice is its use of continuous speech tokenizers (Acous The model can synthesize speech up to **90 minutes** long with up to **4 distinct speakers**, surpassing the typical 1-2 speaker limits of many prior models. -### 🎵 Demo Example -Listen to a sample of VibeVoice generating multi-speaker conversational audio: +### 🎵 Demo Examples - +**Cross-Lingual** -Try it out via [Demo](https://aka.ms/VibeVoice-Demo). +https://github.com/user-attachments/assets/838d8ad9-a201-4dde-bb45-8cd3f59ce722 + +**Spontaneous Singing** + +https://github.com/user-attachments/assets/6f27a8a5-0c60-4f57-87f3-7dea2e11c730 + +For more examples, try it out via [Demo](https://aka.ms/VibeVoice-Demo).

MOS Preference Results