Skip to content

v0.3.0

Latest

Choose a tag to compare

@Blaizzy Blaizzy released this 25 Jan 21:43
· 7 commits to main since this release
02ada37

What's Changed

  • Fix speaker embedding extraction in Qwen3-TTS model by @Blaizzy in #390
  • Fix Qwen3-TTS tail artifacts by @Blaizzy in #391
  • Fix Qwen3-TTS Base Voice Cloning by @Blaizzy in #394
  • Add Vibevoice ASR by @Blaizzy in #389
  • Qwen3 speaker embedding tests by @Blaizzy in #396
  • Update TTS commands in README to include language code option by @rudolfolah in #401
  • Unify Mimi implementation for Pocket TTS by @lucasnewman in #403
  • Fix issue of ref_audio not loading prior to inference with server. by @BuffMcBigHuge in #406
  • Enhance README with installation and usage examples by @rahimnathwani in #404
  • Upgrade GitHub Actions for Node 24 compatibility by @salmanmkc in #418
  • Upgrade GitHub Actions to latest versions by @salmanmkc in #419
  • [VibeVoice-ASR] Fix Metal kernel crash and optimize memory for long audio by @Blaizzy in #417
  • fix: Allowing quantization of Qwen3-TTS! Adding model_quant_predicate to Qwen3-TTS to exclude embedding layers by @kyr0 in #398
  • Fix qwen3 tts quants (silence in VC and word precision) by @Blaizzy in #407
  • Fix stt array io by @Blaizzy in #426
  • Update MANIFEST.in to remove leading dot from requirements.txt path by @Blaizzy in #428
  • Move audio path/format prints under verbose flag by @wladpaiva in #429
  • Update pyproject.toml and GitHub Actions workflow for package publishing by @Blaizzy in #431

New Contributors

Full Changelog: v0.2.10...v0.3.0