What's Changed
- Fix speaker embedding extraction in Qwen3-TTS model by @Blaizzy in #390
- Fix Qwen3-TTS tail artifacts by @Blaizzy in #391
- Fix Qwen3-TTS Base Voice Cloning by @Blaizzy in #394
- Add Vibevoice ASR by @Blaizzy in #389
- Qwen3 speaker embedding tests by @Blaizzy in #396
- Update TTS commands in README to include language code option by @rudolfolah in #401
- Unify Mimi implementation for Pocket TTS by @lucasnewman in #403
- Fix issue of ref_audio not loading prior to inference with server. by @BuffMcBigHuge in #406
- Enhance README with installation and usage examples by @rahimnathwani in #404
- Upgrade GitHub Actions for Node 24 compatibility by @salmanmkc in #418
- Upgrade GitHub Actions to latest versions by @salmanmkc in #419
- [VibeVoice-ASR] Fix Metal kernel crash and optimize memory for long audio by @Blaizzy in #417
- fix: Allowing quantization of Qwen3-TTS! Adding model_quant_predicate to Qwen3-TTS to exclude embedding layers by @kyr0 in #398
- Fix qwen3 tts quants (silence in VC and word precision) by @Blaizzy in #407
- Fix stt array io by @Blaizzy in #426
- Update MANIFEST.in to remove leading dot from requirements.txt path by @Blaizzy in #428
- Move audio path/format prints under verbose flag by @wladpaiva in #429
- Update pyproject.toml and GitHub Actions workflow for package publishing by @Blaizzy in #431
New Contributors
- @rudolfolah made their first contribution in #401
- @BuffMcBigHuge made their first contribution in #406
- @rahimnathwani made their first contribution in #404
- @salmanmkc made their first contribution in #418
- @kyr0 made their first contribution in #398
- @wladpaiva made their first contribution in #429
Full Changelog: v0.2.10...v0.3.0