WhisperLiveKit/whisperlivekit
Quentin Fuxa 8c799fa4d1 fix simulstreaming vram leak: cap cross-attn accumulation + token budget
fixes #283, fixes #275

- accumulated_cross_attns grew without bound during the decoding loop,
  using up to ~5GB in repetition loops. now capped to a rolling window of 16
- max_tokens_per_chunk used TOKENS_PER_SECOND (mel frame rate = 50)
  instead of the actual text token rate (~15/s), allowing 10-40x too many
  decoding steps
- removed unused torch.cat on early return path
- removed dead self.committed/last_result_tokens lists (never read)
- same fixes applied to mlx variant
2026-02-11 22:10:00 +01:00
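
The two main fixes in the commit message above can be illustrated with a minimal sketch. This is not the repository's code: `decode_chunk`, `max_tokens_for_chunk`, `step_fn`, and the constant names are hypothetical, and only the numbers (window of 16, ~15 text tokens/s, mel frame rate of 50) come from the commit message. It assumes a bounded `collections.deque` is an acceptable stand-in for the rolling window and that the token budget scales linearly with chunk duration.

```python
import math
from collections import deque

CROSS_ATTN_WINDOW = 16       # rolling-window size named in the commit message
TEXT_TOKENS_PER_SECOND = 15  # approximate text token rate from the commit message
MEL_FRAMES_PER_SECOND = 50   # the rate the commit message says was used by mistake


def max_tokens_for_chunk(chunk_seconds, tokens_per_second=TEXT_TOKENS_PER_SECOND):
    """Upper bound on decoding steps for one audio chunk (hypothetical helper)."""
    return max(1, math.ceil(chunk_seconds * tokens_per_second))


def decode_chunk(step_fn, chunk_seconds):
    """Decode with a bounded attention history and a per-chunk token budget.

    step_fn(i) stands in for one decoder step and must return
    (token, cross_attn, done). It is an assumption of this sketch.
    """
    # deque(maxlen=N) silently drops the oldest entry once N items are held,
    # so memory stays bounded even if the decoder gets stuck repeating itself.
    recent_cross_attns = deque(maxlen=CROSS_ATTN_WINDOW)
    tokens = []
    for i in range(max_tokens_for_chunk(chunk_seconds)):
        token, cross_attn, done = step_fn(i)
        tokens.append(token)
        recent_cross_attns.append(cross_attn)
        if done:
            break
    return tokens, list(recent_cross_attns)


# Simulate a repetition loop: the decoder never signals completion.
tokens, attns = decode_chunk(lambda i: (i, [i], False), chunk_seconds=2.0)
```

With a 2-second chunk the loop stops at the token budget rather than running indefinitely, and only the most recent 16 attention entries are retained, regardless of how many steps were taken.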
diarization add insert_audio_chunk to DiartDiarization 2026-02-11 22:10:00 +01:00
local_agreement fix --direct-english-translation not setting task=translate for localagreement backends 2026-02-11 22:10:00 +01:00
silero_vad_models fixes silence detected but never reported by silero 2025-11-23 11:20:00 +01:00
simul_whisper fix simulstreaming vram leak: cap cross-attn accumulation + token budget 2026-02-11 22:10:00 +01:00
web isort 2025-11-23 11:20:00 +01:00
whisper fixes #299 2025-12-05 17:54:14 +01:00
__init__.py isort 2025-11-23 11:20:00 +01:00
audio_processor.py update diarization lag after stream analysed 2025-12-18 14:13:28 +01:00
backend_support.py Refactor backend handling 2025-11-15 19:51:41 +01:00
basic_server.py isort 2025-11-23 11:20:00 +01:00
core.py Fix critical thread safety issues 2026-01-09 11:23:19 -05:00
ffmpeg_manager.py isort 2025-11-23 11:20:00 +01:00
model_paths.py Fixes #294. improve model path backend detection and file extraction 2025-11-27 23:14:00 +01:00
parse_args.py LoRa path v0 - functional 2025-11-29 17:21:10 +01:00
silero_vad_iterator.py session parameter required in OnnxWrapper 2025-12-05 15:37:18 +01:00
thread_safety.py Fix critical thread safety issues 2026-01-09 11:23:19 -05:00
timed_objects.py add probability field to ASRToken 2026-02-11 22:10:00 +01:00
tokens_alignment.py fix NoneType concatenation in add_translation 2026-02-11 22:10:00 +01:00
warmup.py isort 2025-11-23 11:20:00 +01:00