WhisperLiveKit/whisperlivekit
Quentin Fuxa 4b2377c243 fix: correct false auto-detect claim, median bug, RTF inflation
- BENCHMARK.md: whisper also supports --language auto, voxtral is not
  the only one. Fixed mlx-whisper speed comparison (LA is actually
  faster than SS for mlx-whisper, not comparable).
- metrics.py: median calculation was wrong for even-length lists
  (took upper middle instead of averaging the two middle values).
- metrics_collector.py: RTF was inflated because log_summary() used
  wall-clock elapsed time instead of sum of actual ASR call durations.
- README.md: clarified that whisper also supports auto language
  detection, voxtral just does it better.
- Added 2 new median tests (even + odd length).
2026-02-22 23:38:04 +01:00
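The two code fixes described in this commit can be sketched as follows. This is a minimal illustration, not the actual contents of metrics.py or metrics_collector.py: the function names `median` and `real_time_factor` and their parameters are hypothetical, chosen only to show the intended behavior (averaging the two middle values for even-length lists, and dividing the summed ASR call durations by the audio duration rather than using wall-clock elapsed time).

```python
def median(values):
    """Median that averages the two middle values for even-length input.

    Hypothetical helper; the real function in metrics.py may differ.
    """
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("median of an empty sequence is undefined")
    mid = n // 2
    if n % 2 == 1:
        # Odd length: the single middle element.
        return ordered[mid]
    # Even length: mean of the two middle elements
    # (the bug described above returned only ordered[mid]).
    return (ordered[mid - 1] + ordered[mid]) / 2


def real_time_factor(asr_call_durations, audio_duration):
    """RTF computed from time actually spent in ASR calls.

    Hypothetical helper; assumes both arguments are in seconds.
    Using wall-clock elapsed time instead would also count idle time
    between calls and inflate the result.
    """
    if audio_duration <= 0:
        raise ValueError("audio_duration must be positive")
    return sum(asr_call_durations) / audio_duration
```

For example, under these assumptions `median([1, 2, 3, 4])` gives 2.5 rather than 3, and three quarters of a second of ASR work over 3 seconds of audio gives an RTF of 0.25 regardless of how long the session ran.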
diarization add insert_audio_chunk to DiartDiarization 2026-02-11 22:10:00 +01:00
local_agreement fix --direct-english-translation not setting task=translate for localagreement backends 2026-02-11 22:10:00 +01:00
silero_vad_models fixes silence detected but never reported by silero 2025-11-23 11:20:00 +01:00
simul_whisper fix: handle numpy object_ dtype from ctranslate2 encoder (#337) 2026-02-20 20:48:28 +01:00
voxtral_mlx feat: add voxtral-mlx native backend for Apple Silicon 2026-02-22 23:28:10 +01:00
web isort 2025-11-23 11:20:00 +01:00
whisper simulstreaming mlx & torch dedup of common base 2025-02-15 23:52:00 +01:00
__init__.py isort 2025-11-23 11:20:00 +01:00
audio_processor.py fix: silence double-counting bug, add metrics module and runtime instrumentation 2026-02-22 23:27:12 +01:00
backend_support.py mixstral hf v0 2026-02-20 20:49:57 +01:00
basic_server.py simulstreaming mlx & torch dedup of common base 2025-02-15 23:52:00 +01:00
config.py simulstreaming mlx & torch dedup of common base 2025-02-15 23:52:00 +01:00
core.py docs: update README with voxtral backend, benchmarks, testing sections 2026-02-22 23:27:57 +01:00
ffmpeg_manager.py isort 2025-11-23 11:20:00 +01:00
metrics.py fix: correct false auto-detect claim, median bug, RTF inflation 2026-02-22 23:38:04 +01:00
metrics_collector.py fix: correct false auto-detect claim, median bug, RTF inflation 2026-02-22 23:38:04 +01:00
model_mapping.py simulstreaming mlx & torch dedup of common base 2025-02-15 23:52:00 +01:00
model_paths.py Fixes #294. improve model path backend detection and file extraction 2025-11-27 23:14:00 +01:00
parse_args.py docs: update README with voxtral backend, benchmarks, testing sections 2026-02-22 23:27:57 +01:00
silero_vad_iterator.py simulstreaming mlx & torch dedup of common base 2025-02-15 23:52:00 +01:00
thread_safety.py Fix critical thread safety issues 2026-01-09 11:23:19 -05:00
timed_objects.py add probability field to ASRToken 2026-02-11 22:10:00 +01:00
tokens_alignment.py fix NoneType concatenation in add_translation 2026-02-11 22:10:00 +01:00
voxtral_hf_streaming.py correct processor attributes mixtral 2026-02-22 21:13:21 +01:00
voxtral_mlx_asr.py feat: add voxtral-mlx native backend for Apple Silicon 2026-02-22 23:28:10 +01:00
warmup.py isort 2025-11-23 11:20:00 +01:00