WhisperLiveKit

History

Chingning Chen b63f54e838 fix(whisper/tokenizer): prevent IndexError from crashing multilingual streams This fix addresses a critical bug in the Whisper tokenizer that causes the transcription server to crash with an `IndexError: string index out of range` when streaming audio in languages utilizing multi-byte UTF-8 characters (e.g., Cantonese, Japanese, Mandarin). When a 3-byte character is cut off at the boundary of an audio chunk, incomplete bytes are decoded into a single Unicode replacement character (`\ufffd`), artificially shortening the string and breaking the offset mapping assumed by `split_tokens_on_unicode`. This ports the upstream fix from SYSTRAN/faster-whisper (PR #111) to add a strict bounds check before accessing the string index, allowing incomplete bytes to be safely caught and handled in the next chunk.		2026-03-02 15:31:43 +08:00
..
diarization	add insert_audio_chunk to DiartDiarization	2026-02-11 22:10:00 +01:00
local_agreement	fix --direct-english-translation not setting task=translate for localagreement backends	2026-02-11 22:10:00 +01:00
silero_vad_models	fixes silence detected but never reported by silero	2025-11-23 11:20:00 +01:00
simul_whisper	fix: handle numpy object_ dtype from ctranslate2 encoder (#337 )	2026-02-20 20:48:28 +01:00
voxtral_mlx	feat: add voxtral-mlx native backend for Apple Silicon	2026-02-22 23:28:10 +01:00
web	isort	2025-11-23 11:20:00 +01:00
whisper	fix(whisper/tokenizer): prevent IndexError from crashing multilingual streams	2026-03-02 15:31:43 +08:00
__init__.py	isort	2025-11-23 11:20:00 +01:00
audio_processor.py	fix: silence double-counting bug, add metrics module and runtime instrumentation	2026-02-22 23:27:12 +01:00
backend_support.py	mixstral hf v0	2026-02-20 20:49:57 +01:00
basic_server.py	simulstreaming mlx & torch dedup of common base	2025-02-15 23:52:00 +01:00
config.py	simulstreaming mlx & torch dedup of common base	2025-02-15 23:52:00 +01:00
core.py	docs: update README with voxtral backend, benchmarks, testing sections	2026-02-22 23:27:57 +01:00
ffmpeg_manager.py	isort	2025-11-23 11:20:00 +01:00
metrics.py	fix: correct false auto-detect claim, median bug, RTF inflation	2026-02-22 23:38:04 +01:00
metrics_collector.py	fix: correct false auto-detect claim, median bug, RTF inflation	2026-02-22 23:38:04 +01:00
model_mapping.py	simulstreaming mlx & torch dedup of common base	2025-02-15 23:52:00 +01:00
model_paths.py	Fixes #294 . improve model path backend detection and file extraction	2025-11-27 23:14:00 +01:00
parse_args.py	docs: update README with voxtral backend, benchmarks, testing sections	2026-02-22 23:27:57 +01:00
silero_vad_iterator.py	simulstreaming mlx & torch dedup of common base	2025-02-15 23:52:00 +01:00
thread_safety.py	Fix critical thread safety issues	2026-01-09 11:23:19 -05:00
timed_objects.py	add probability field to ASRToken	2026-02-11 22:10:00 +01:00
tokens_alignment.py	fix NoneType concatenation in add_translation	2026-02-11 22:10:00 +01:00
voxtral_hf_streaming.py	correct processor attributes mixtral	2026-02-22 21:13:21 +01:00
voxtral_mlx_asr.py	feat: add voxtral-mlx native backend for Apple Silicon	2026-02-22 23:28:10 +01:00
warmup.py	isort	2025-11-23 11:20:00 +01:00