ASR/STT subtitle generator using Qwen3-ASR, a local LLM, Whisper, and TEN-VAD. Noise-robust for JAV.
Updated Mar 13, 2026 (HTML)
ComfyUI nodes for Qwen3-ASR (0.6B/1.7B) and ForcedAligner. Supports high-accuracy ASR and language identification for 52 languages/dialects, including 22 Chinese dialects and various English accents. Features word-level timestamps, long audio transcription, and VRAM-optimized inference.
MLX Local Serving (MLS) - Unified ASR, TTS, and Translation on Apple Silicon
On-device voice transcription, grammar correction, and text-to-speech for macOS. Runs on MLX.
Easily convert speech to timed SRT subtitles and translated captions (Colab-ready)
A free, open source, and extensible speech-to-text application that works completely offline.
🎙️ Fast, dependency-free C inference for Qwen3-ASR speech-to-text models, with efficient streaming on modest hardware.