Commit graph

93 commits

Author SHA1 Message Date
YaoyaoChang
6065c5224e update Colab: add download audio 2025-09-02 02:08:19 -07:00
pengzhiliang
21c35b4701 fix bug: from_pretrained of VibeVoiceProcessor 2025-09-01 11:39:59 +00:00
YaoyaoChang
5d09c31021 update readme 2025-09-01 18:02:28 +08:00
gregory-fanous
6dd49cb720
feat: add Apple MPS device support (dtype + attention handling) to demos (#67)
* Enhance model loading with device support and error handling

Updated device handling for model loading and added support for MPS. Improved error handling and fallback mechanisms for attention implementations.

* Improve device handling and model loading logic

Updated device argument handling to support MPS and added validation for MPS availability. Enhanced model loading logic based on the selected device type.

* fallback only when flash_attention_2 and add some comments back

---------

Co-authored-by: YaoyaoChang <cyy574006791@qq.com>
2025-09-01 17:16:01 +08:00
YaoyaoChang
3074f898ca support CPU 2025-09-01 16:32:50 +08:00
jonathanagustin
086cd6aa6c refactor(colab): remove cd and use absolute paths for idempotent runs
- avoid stateful working-directory changes in notebook cells
- use explicit /content/... paths for clone/install/inference
- improves readability and re-run reliability
2025-09-01 15:47:52 +08:00
jonathanagustin
12493567c4 perf(colab): simplify setup & speed up downloads (uv, hf-xet)
- use `uv pip -e` for faster, deterministic installs
- enable hf-xet (HF_XET_HIGH_PERFORMANCE=1) for faster transfers
- download model with `hf download` to /content/models/VibeVoice-1.5B (quiet, idempotent)
- simplify flow: absolute paths, minimal cells, remove redundant steps
- add binary T4 GPU check + clear runtime switch instructions
- add “Open in Colab” badge; keep Risks & Limitations visible
2025-09-01 15:47:52 +08:00
pengzhiliang
01a3005bd9 update README 2025-09-01 06:27:15 +00:00
pengzhiliang
29433e0f27 update README 2025-09-01 06:23:52 +00:00
YaoyaoChang
174e53fc04 gradio support sdpa 2025-08-30 08:22:32 -07:00
Rafael Milewski
999020e7c4 add dark mode version 2025-08-29 17:25:43 +08:00
Z
c0194a7a00 Specifically reference newer llvmlite and numba dependencies
llvmlite is trying to build an old version that creates a dependency chain issue when installing with uv:
```
  help: `llvmlite` (v0.36.0) was included because `vibevoice` (v0.0.1) depends on `librosa` (v0.11.0) which
        depends on `numba` (v0.53.1) which depends on `llvmlite`
```

pinning these dependencies on newer versions forces it to resolve and shouldn't have any downsides.
2025-08-29 16:19:01 +08:00
Sangjee Dondrub
4d5d9779c8 Remove redundant librosa package in the depdencies. 2025-08-29 15:28:31 +08:00
Before Crash
ccdb19f004
update requires-python = '>=3.9' (#48) 2025-08-29 15:25:33 +08:00
JianweiYu
f3e35d0c2c update 2025-08-28 21:58:31 -07:00
JianweiYu
6210e6ee68 update 2025-08-28 21:50:31 -07:00
JianweiYu
1a75187097 update 2025-08-28 18:40:51 -07:00
JianweiYu
cd78c5b6e6 update 2025-08-28 18:29:18 -07:00
Li Dong
b815902059
Merge pull request #28 from DevMello/patch-1
fix: remove typos in README.md
2025-08-29 00:01:37 +08:00
Li Dong
174fbabbf2
Merge pull request #45 from RichardAtCT/patch-1
docs: fix spelling errors in README.md
2025-08-29 00:01:11 +08:00
Richard A
acba35e2d0
docs: fix spelling errors in README
- Change "Opensource" to "Open Source" (News section)
- Fix "Spekaer" typo to "Speaker" (FAQ Q2)
- Correct "instable" to "unstable" (FAQ Q6)
2025-08-28 19:48:45 +04:00
YaoyaoChang
c72bc0c59b add Risks and Limitations in Gradio 2025-08-28 02:26:17 -07:00
YaoyaoChang
1f3768c974 add Risk and Limitations for Colab 2025-08-28 02:15:42 -07:00
YaoyaoChang
fa72b89dcd update risks 2025-08-28 02:12:43 -07:00
YaoyaoChang
a3131cecf9 add Q6 for FAQ 2025-08-28 02:10:31 -07:00
YaoyaoChang
4b8b6f7700 update 2025-08-28 01:28:27 -07:00
YaoyaoChang
9537bd96a6 update 2025-08-28 01:23:57 -07:00
NR
84710c4d2b Add immediate UI update for Generate and Stop buttons (#18)
### Summary
The Stop button previously appeared only after the backend generation job actually started, leaving users with no cancel affordance while a request sat in the queue. This PR makes the Stop button visible immediately after clicking **Generate Podcast**.

### Change
Adds a single non-queued `.then` step in the click event chain
that hides the Generate button and shows the Stop button instantly, before the queued job begins processing.
2025-08-28 15:24:43 +08:00
_
e611deafac FIX: adjust quote type in inference_from_file (#33)
adds
full_script = full_script.replace("’", "'")
to the data preparation
2025-08-28 15:24:42 +08:00
YaoyaoChang
f57a9bffb1 speaker index from 1 2025-08-28 15:24:42 +08:00
YaoyaoChang
54fbbce40e add colab 2025-08-27 18:57:33 -07:00
YaoyaoChang
ac0104c65e try sdpa if error in flash_attention_2 2025-08-27 18:46:15 -07:00
YaoyaoChang
64241e29ca fix typo 2025-08-27 18:25:34 -07:00
YaoyaoChang
8ed05ba7de update to sdpa 2025-08-27 18:21:30 -07:00
JianweiYu
2b75b745a4 update 2025-08-27 13:34:54 -07:00
JianweiYu
560870cbe1 update 2025-08-27 12:51:57 -07:00
Pranav Yerramaneni
437cd1f87f
Fix typo in README regarding speaker voice 2025-08-27 09:07:09 -07:00
JianweiYu
3dd8605797 Update 2025-08-27 00:32:26 -07:00
JianweiYu
06f56bff02 Update 2025-08-26 23:56:36 -07:00
JianweiYu
13b72fec79 update 2025-08-26 23:48:05 -07:00
YaoyaoChang
056bb5b0fa add args to use_eager 2025-08-26 19:44:34 -07:00
YaoyaoChang
f3df4ae1b7 update report link to arxiv 2025-08-26 19:16:14 -07:00
YaoyaoChang
f5b5a648bb update FAQ 2025-08-26 18:48:26 -07:00
JianweiYu
f507425ee7 update 2025-08-26 18:07:28 -07:00
JianweiYu
d38c23e1e1 update 2025-08-26 18:06:02 -07:00
JianweiYu
afe790e781 update 2025-08-26 16:28:32 -07:00
JianweiYu
60dd5689fe update 2025-08-26 16:27:56 -07:00
JianweiYu
d2681c19d0 tomasyu 2025-08-26 16:27:56 -07:00
pengzhiliang
5161d63e76 update FAQ 2025-08-26 16:28:29 +00:00
pengzhiliang
5e614f7190 update 2025-08-26 15:16:54 +00:00