Quentin Fuxa
dd48997674
qwen3: reuse encoder kv cache
2026-03-15 22:31:39 +01:00
Quentin Fuxa
f24481dc29
update archi
2026-03-15 11:36:45 +01:00
Quentin Fuxa
ed76f40ee5
Merge branch 'main' of https://github.com/QuentinFuxa/WhisperLiveKit
2026-03-15 11:16:38 +01:00
Quentin Fuxa
5330b3fac5
update benchmark part
2026-03-15 11:16:26 +01:00
Quentin Fuxa
0c73a73aa3
update benchmark results and procedure
2026-03-15 11:16:15 +01:00
Quentin Fuxa
2d6bc4f572
Add '*.c' to .dockerignore
2026-03-14 00:18:10 +01:00
Quentin Fuxa
dfd5bf417c
voxtral mlx : improved chunking
2026-03-14 00:13:29 +01:00
Quentin Fuxa
9d8db7ab38
add qwen3 simul in tests
2026-03-14 00:13:09 +01:00
Quentin Fuxa
fa15115163
qwen3 alignment heads
2026-03-14 00:12:50 +01:00
Quentin Fuxa
8dc7b77071
Bump version to 0.2.20
2026-03-08 16:02:00 +01:00
Quentin Fuxa
10d85ff65f
Update docs, CI, and architecture diagram
2026-03-08 15:14:00 +01:00
Quentin Fuxa
e7e3441ca4
Add Qwen3 ASR backend
2026-03-07 11:48:00 +01:00
Quentin Fuxa
9abe26a996
Add CLI with serve, transcribe, listen, pull, diagnose
2026-03-01 13:37:00 +01:00
Quentin Fuxa
c8e7c216ed
Replace mock tests with real pipeline tests
2026-02-28 10:05:00 +01:00
Quentin Fuxa
586540ae36
Add test harness and test client
2026-02-22 16:19:00 +01:00
Quentin Fuxa
cd8df8e1aa
Update package setup and exports
2026-02-21 11:33:00 +01:00
Quentin Fuxa
e30f9a2573
Improve diarization backends
2026-02-15 14:55:00 +01:00
Quentin Fuxa
32de7b1276
Fix frontend buffer rendering for slow backends
2026-02-14 09:28:00 +01:00
Quentin Fuxa
9ac7c26a0b
Add OpenAI REST API and Deepgram WebSocket
2026-02-08 15:42:00 +01:00
Quentin Fuxa
c0e2600993
Add snapshot-then-diff WebSocket protocol
2026-02-07 10:17:00 +01:00
Quentin Fuxa
e0db3a98f9
Add per-session language proxy
2026-02-01 17:03:00 +01:00
Quentin Fuxa
2fe34427ef
Fix voxtral streaming drain and silence flush
2026-01-31 11:12:00 +01:00
Quentin Fuxa
d58365421f
Refactor audio processor async pipeline
2026-01-25 13:48:00 +01:00
Quentin Fuxa
a282cbe75f
Improve tokens alignment and silence handling
2026-01-24 10:55:00 +01:00
Quentin Fuxa
6e85c16614
Refactor TranscriptionEngine singleton
2026-01-18 15:27:00 +01:00
Quentin Fuxa
e1823dd99c
Improve online ASR processor
2026-01-17 09:35:00 +01:00
Quentin Fuxa
e144abbbc7
Refactor timed objects and data structures
2026-01-11 16:08:00 +01:00
Quentin Fuxa
83362c89c4
Clean up config and model paths
2026-01-10 11:42:00 +01:00
Quentin Fuxa
74c4dc791d
Lint scripts and tests
2026-01-04 14:15:00 +01:00
Quentin Fuxa
cf6c49f502
Ruff lint cleanup
2026-01-03 10:23:00 +01:00
Quentin Fuxa
451535d48f
Fix ctranslate2 encoder conversion ( #345 ) and memory leak in TokensAlignment ( #344 )
...
- Add fallback chain for StorageView to numpy conversion
- Prune old tokens/segments after 5min to bound memory
2026-03-10 22:37:00 +01:00
Quentin Fuxa
8bc0937c46
Update README section on powered research
2026-03-06 18:46:07 +01:00
Quentin Fuxa
929cf7a26b
add link to AlignAtt interactive playground
2026-03-06 18:43:25 +01:00
Quentin Fuxa
abfaf06203
Merge branch 'main' of https://github.com/QuentinFuxa/WhisperLiveKit
2026-03-04 18:17:23 +01:00
Quentin Fuxa
d1fe932241
Apply DRY method v0 - to try to catch and resolve infinite loops such as in #338
2026-03-03 22:52:00 +01:00
Quentin Fuxa
c112ceffb6
Merge pull request #342 from mnicnc404/fix/whisper-tokenizer-index-error
...
fix(whisper/tokenizer): prevent IndexError from crashing multilingual…
2026-03-02 20:36:58 +01:00
Quentin Fuxa
4917406e06
Merge pull request #341 from AymurAI/feat/uv-deps-resolution
...
deps/docker: align python support, deterministic deps resolution & docker images releases
2026-03-02 20:34:49 +01:00
Chingning Chen
b63f54e838
fix(whisper/tokenizer): prevent IndexError from crashing multilingual streams
...
This fix addresses a critical bug in the Whisper tokenizer that causes
the transcription server to crash with an `IndexError: string index out
of range` when streaming audio in languages utilizing multi-byte UTF-8
characters (e.g., Cantonese, Japanese, Mandarin).
When a 3-byte character is cut off at the boundary of an audio chunk,
incomplete bytes are decoded into a single Unicode replacement character
(`\ufffd`), artificially shortening the string and breaking the offset
mapping assumed by `split_tokens_on_unicode`.
This ports the upstream fix from SYSTRAN/faster-whisper (PR #111 ) to add
a strict bounds check before accessing the string index, allowing
incomplete bytes to be safely caught and handled in the next chunk.
2026-03-02 15:31:43 +08:00
jedzill4
c56a53fbf4
deps(mlx-groups): add optional dependencies for Apple Silicon MLX backends
2026-03-01 20:05:52 -03:00
Quentin Fuxa
66e58624b9
disable MLXAlignAtt which fails on special characters
2026-03-01 11:52:00 +01:00
jedzill4
9366e067f9
deps(pyproject): add torch and torchaudio to main dependencies
2026-02-27 19:19:18 -03:00
jedzill4
866c25670c
deps(docker): change CUDA base image to runtime version
2026-02-27 19:16:29 -03:00
jedzill4
2553ef283e
deps(docker): fix dependency group for cu129 image
...
- Changed the extras for cu129-diarization-sortformer from gpu-cu129 to cu129.
- This aligns the dependency with the correct naming convention for consistency.
2026-02-25 21:49:08 -03:00
jedzill4
73e7fafc48
feat(tests): python matrix support test
...
- Introduced a new argument for selecting the diarization backend in the engine creation.
- Enhanced the `create_engine` function to accept and utilize the specified diarization backend.
- Updated the test runner to accommodate the new backend option for improved flexibility.
2026-02-25 21:35:41 -03:00
jedzill4
bbcebcb1fe
deps(sortformer): adjust nemo-toolkit version constraints
...
- Updated the version constraint for `diarization-sortformer` to restrict it to Python 3.10 and below.
2026-02-25 21:33:00 -03:00
jedzill4
4bb58dc7aa
deps(diart): improve diart dependency tree. rename gpu-cu129 dependency group to cu129
2026-02-25 20:27:26 -03:00
jedzill4
27ca028479
ci(github): add GitHub Actions workflows for Docker image publishing and support matrix
...
- Introduced a workflow to publish Docker images on tag push and manual triggers.
- Added a support matrix workflow to test across multiple OS and Python versions.
2026-02-25 14:27:51 -03:00
jedzill4
d24805cc18
🚀 chore (docker): update docker images improving caching and using uv as python package manager
2026-02-25 14:22:43 -03:00
jedzill4
994ce21365
📌 chore(deps): pin dependences to python 3.11 to 3.13 due dependency resolution matrix
2026-02-25 14:21:19 -03:00
jedzill4
132823dc09
deps: improve deps dependency resolution (wip)
2026-02-24 20:15:53 -03:00