Release v0.3.0 · Blaizzy/mlx-audio

What's Changed

Fix speaker embedding extraction in Qwen3-TTS model by @Blaizzy in #390
Fix Qwen3-TTS tail artifacts by @Blaizzy in #391
Fix Qwen3-TTS Base Voice Cloning by @Blaizzy in #394
Add Vibevoice ASR by @Blaizzy in #389
Qwen3 speaker embedding tests by @Blaizzy in #396
Update TTS commands in README to include language code option by @rudolfolah in #401
Unify Mimi implementation for Pocket TTS by @lucasnewman in #403
Fix issue of ref_audio not loading prior to inference with server. by @BuffMcBigHuge in #406
Enhance README with installation and usage examples by @rahimnathwani in #404
Upgrade GitHub Actions for Node 24 compatibility by @salmanmkc in #418
Upgrade GitHub Actions to latest versions by @salmanmkc in #419
[VibeVoice-ASR] Fix Metal kernel crash and optimize memory for long audio by @Blaizzy in #417
fix: Allowing quantization of Qwen3-TTS! Adding model_quant_predicate to Qwen3-TTS to exclude embedding layers by @kyr0 in #398
Fix qwen3 tts quants (silence in VC and word precision) by @Blaizzy in #407
Fix stt array io by @Blaizzy in #426
Update MANIFEST.in to remove leading dot from requirements.txt path by @Blaizzy in #428
Move audio path/format prints under verbose flag by @wladpaiva in #429
Update pyproject.toml and GitHub Actions workflow for package publishing by @Blaizzy in #431

Full Changelog: v0.2.10...v0.3.0