07672aaf2b
refactor: remove unneeded torch.no_grad()
2025-09-03 17:06:11 +02:00
1cb2a50ce5
feat: load to XPU
2025-09-03 09:02:23 +02:00
aeb26374df
feat: use same config as cuda for xpu
2025-09-02 23:26:22 +02:00
90e1cd1486
feat: add ipex-llm + pytorch env vars initialization
2025-09-02 23:23:25 +02:00
66b32b2e4e
feat: use xpu as default if it exists
2025-09-02 23:13:54 +02:00
YaoyaoChang
6065c5224e
update Colab: add download audio
2025-09-02 02:08:19 -07:00
pengzhiliang
21c35b4701
fix bug: from_pretrained of VibeVoiceProcessor
2025-09-01 11:39:59 +00:00
YaoyaoChang
5d09c31021
update readme
2025-09-01 18:02:28 +08:00
gregory-fanous
6dd49cb720
feat: add Apple MPS device support (dtype + attention handling) to demos ( #67 )
...
* Enhance model loading with device support and error handling
Updated device handling for model loading and added support for MPS. Improved error handling and fallback mechanisms for attention implementations.
* Improve device handling and model loading logic
Updated device argument handling to support MPS and added validation for MPS availability. Enhanced model loading logic based on the selected device type.
* fallback only when flash_attention_2 and add some comments back
---------
Co-authored-by: YaoyaoChang <cyy574006791@qq.com>
2025-09-01 17:16:01 +08:00
YaoyaoChang
3074f898ca
support CPU
2025-09-01 16:32:50 +08:00
jonathanagustin
086cd6aa6c
refactor(colab): remove cd and use absolute paths for idempotent runs
...
- avoid stateful working-directory changes in notebook cells
- use explicit /content/... paths for clone/install/inference
- improves readability and re-run reliability
2025-09-01 15:47:52 +08:00
jonathanagustin
12493567c4
perf(colab): simplify setup & speed up downloads (uv, hf-xet)
...
- use `uv pip -e` for faster, deterministic installs
- enable hf-xet (HF_XET_HIGH_PERFORMANCE=1) for faster transfers
- download model with `hf download` to /content/models/VibeVoice-1.5B (quiet, idempotent)
- simplify flow: absolute paths, minimal cells, remove redundant steps
- add binary T4 GPU check + clear runtime switch instructions
- add “Open in Colab” badge; keep Risks & Limitations visible
2025-09-01 15:47:52 +08:00
pengzhiliang
01a3005bd9
update README
2025-09-01 06:27:15 +00:00
pengzhiliang
29433e0f27
update README
2025-09-01 06:23:52 +00:00
YaoyaoChang
174e53fc04
gradio support sdpa
2025-08-30 08:22:32 -07:00
Rafael Milewski
999020e7c4
add dark mode version
2025-08-29 17:25:43 +08:00
Z
c0194a7a00
Specifically reference newer llvmlite and numba dependencies
...
llvmlite is trying to build an old version that creates a dependency chain issue when installing with uv:
```
help: `llvmlite` (v0.36.0) was included because `vibevoice` (v0.0.1) depends on `librosa` (v0.11.0) which
depends on `numba` (v0.53.1) which depends on `llvmlite`
```
pinning these dependencies on newer versions forces it to resolve and shouldn't have any downsides.
2025-08-29 16:19:01 +08:00
Sangjee Dondrub
4d5d9779c8
Remove redundant librosa
package in the depdencies.
2025-08-29 15:28:31 +08:00
Before Crash
ccdb19f004
update requires-python = '>=3.9' ( #48 )
2025-08-29 15:25:33 +08:00
JianweiYu
f3e35d0c2c
update
2025-08-28 21:58:31 -07:00
JianweiYu
6210e6ee68
update
2025-08-28 21:50:31 -07:00
JianweiYu
1a75187097
update
2025-08-28 18:40:51 -07:00
JianweiYu
cd78c5b6e6
update
2025-08-28 18:29:18 -07:00
Li Dong
b815902059
Merge pull request #28 from DevMello/patch-1
...
fix: remove typos in README.md
2025-08-29 00:01:37 +08:00
Li Dong
174fbabbf2
Merge pull request #45 from RichardAtCT/patch-1
...
docs: fix spelling errors in README.md
2025-08-29 00:01:11 +08:00
Richard A
acba35e2d0
docs: fix spelling errors in README
...
- Change "Opensource" to "Open Source" (News section)
- Fix "Spekaer" typo to "Speaker" (FAQ Q2)
- Correct "instable" to "unstable" (FAQ Q6)
2025-08-28 19:48:45 +04:00
YaoyaoChang
c72bc0c59b
add Risks and Limitations in Gradio
2025-08-28 02:26:17 -07:00
YaoyaoChang
1f3768c974
add Risk and Limitations for Colab
2025-08-28 02:15:42 -07:00
YaoyaoChang
fa72b89dcd
update risks
2025-08-28 02:12:43 -07:00
YaoyaoChang
a3131cecf9
add Q6 for FAQ
2025-08-28 02:10:31 -07:00
YaoyaoChang
4b8b6f7700
update
2025-08-28 01:28:27 -07:00
YaoyaoChang
9537bd96a6
update
2025-08-28 01:23:57 -07:00
NR
84710c4d2b
Add immediate UI update for Generate and Stop buttons ( #18 )
...
### Summary
The Stop button previously appeared only after the backend generation job actually started, leaving users with no cancel affordance while a request sat in the queue. This PR makes the Stop button visible immediately after clicking **Generate Podcast**.
### Change
Adds a single non-queued `.then` step in the click event chain
that hides the Generate button and shows the Stop button instantly, before the queued job begins processing.
2025-08-28 15:24:43 +08:00
_
e611deafac
FIX: adjust quote type in inference_from_file ( #33 )
...
adds
full_script = full_script.replace("’", "'")
to the data preparation
2025-08-28 15:24:42 +08:00
YaoyaoChang
f57a9bffb1
speaker index from 1
2025-08-28 15:24:42 +08:00
YaoyaoChang
54fbbce40e
add colab
2025-08-27 18:57:33 -07:00
YaoyaoChang
ac0104c65e
try sdpa if error in flash_attention_2
2025-08-27 18:46:15 -07:00
YaoyaoChang
64241e29ca
fix typo
2025-08-27 18:25:34 -07:00
YaoyaoChang
8ed05ba7de
update to sdpa
2025-08-27 18:21:30 -07:00
JianweiYu
2b75b745a4
update
2025-08-27 13:34:54 -07:00
JianweiYu
560870cbe1
update
2025-08-27 12:51:57 -07:00
Pranav Yerramaneni
437cd1f87f
Fix typo in README regarding speaker voice
2025-08-27 09:07:09 -07:00
JianweiYu
3dd8605797
Update
2025-08-27 00:32:26 -07:00
JianweiYu
06f56bff02
Update
2025-08-26 23:56:36 -07:00
JianweiYu
13b72fec79
update
2025-08-26 23:48:05 -07:00
YaoyaoChang
056bb5b0fa
add args to use_eager
2025-08-26 19:44:34 -07:00
YaoyaoChang
f3df4ae1b7
update report link to arxiv
2025-08-26 19:16:14 -07:00
YaoyaoChang
f5b5a648bb
update FAQ
2025-08-26 18:48:26 -07:00
JianweiYu
f507425ee7
update
2025-08-26 18:07:28 -07:00
JianweiYu
d38c23e1e1
update
2025-08-26 18:06:02 -07:00