Commit graph

764 commits

Author SHA1 Message Date
Dominik Macháček
949304ab05 Merge branch 'opeanai-api2' into opeanai-api 2024-02-19 13:51:26 +01:00
Tijs Zwinkels
9fcd403439 Use automatic language detection by default (instead of English) 2024-02-15 22:24:43 +01:00
Tijs Zwinkels
922ad18ebc Make OpenAI backend work with language autodetect 2024-02-14 17:29:45 +01:00
Tijs Zwinkels
f0a24cd5e1 Make --vad work with --backend openai-api 2024-02-14 17:01:29 +01:00
Tijs Zwinkels
3696fef2b1 Use OpenAI api word-level timestamps 2024-02-14 17:01:29 +01:00
Tijs Zwinkels
531418ad07 Interpolate word timestamps based on word character length 2024-02-14 17:01:29 +01:00
Dominik Macháček
2270014219 fixes 2024-02-14 17:01:29 +01:00
Dominik Macháček
f8b2ae07b8 missing features in openai-api, PR #52 2024-02-14 17:01:29 +01:00
Tijs Zwinkels
6ec1f65fe2 Update documentation to include openai-api backend 2024-02-14 17:01:29 +01:00
Tijs Zwinkels
f412812082 OpenAI Whisper API backend 2024-02-14 17:01:29 +01:00
Dominik Macháček
c8123344c6 increasing timestamps fixed
but the code needs to be simplified and cleaned before merging
2024-02-06 16:38:57 +01:00
Dominik Macháček
6b968c6e29 Merge branch 'main' into vad-streaming 2024-02-06 14:34:54 +01:00
Dominik Macháček
b66c61cf7a README update auto language detection 2024-02-06 14:31:24 +01:00
Dominik Macháček
cd221a3198 auto language detection #56 2024-02-06 14:29:30 +01:00
Dominik Macháček
d65fd8a649 fixes 2024-01-25 17:53:07 +01:00
Dominik Macháček
50f1b94856 missing features in openai-api, PR #52 2024-01-25 16:50:02 +01:00
Tijs Zwinkels
ab27bfb361 Update documentation to include openai-api backend 2024-01-25 10:21:42 +01:00
Tijs Zwinkels
c30969fe27 OpenAI Whisper API backend 2024-01-25 10:21:33 +01:00
Dominik Macháček
6fa008080a VAC
- performance tests pending
- TODO: timestamps after refresh are decreasing
2024-01-03 17:55:33 +01:00
Dominik Macháček
d543411bbd VAC controller integrated
it works. Reproducing #39
2024-01-03 15:47:30 +01:00
Dominik Macháček
b2e4e9f727 Merge remote-tracking branch 'rodrigo/main' into vad-streaming 2024-01-03 13:11:04 +01:00
Dominik Macháček
1f2352fa1d README typo and one more simulation option is not shared 2024-01-03 12:52:44 +01:00
Dominik Macháček
bfbe83d792 Samples should be an integer, not seconds
- Merge pull request #49 from skripnik/patch-1
- tested performance --  ESIC dev2, 27 docs, on En, De, Cs ASR, Nvidia A40, min chunk 1s, VAD => it has lower WER and latency with "segment" buffer trimming with various thresholds
2024-01-03 10:37:32 +01:00
Aleksei Scripnic
234ac8f5e8 Samples should be an integer, not seconds
I believe it's just a typo
2024-01-02 14:40:22 +00:00
Dominik Macháček
aa51e39de4 buffer trimming option, sent. segmenter not required anymore
- both for whisper_online + server
- removed argparse code repetition
- README updated
2024-01-02 14:56:30 +01:00
Dominik Macháček
ef08538697 buffer trimming options + most recommendable default
evaluated on ESIC dev2, 27 docs
2024-01-02 12:06:29 +01:00
Dominik Macháček
99aef35958 Merge pull request #36 from luweigen/bug-chunk_completed_sentence
fix bug of completed sentence chunking. tested on faster-whisper in e…
2023-12-19 13:39:37 +01:00
Rodrigo
324dee03e7 vad 2023-12-09 17:12:43 -03:00
Rodrigo
fe4207edca Merge remote-tracking branch 'upstream/main' 2023-12-09 17:02:35 -03:00
Dominik Macháček
ff794b4d32 Merge pull request #40 from lifefeel/main
Fix: Omitting the last chunk problem in comp_unaware mode
2023-12-07 13:31:47 +01:00
J.P Lee
2b98af7b19 Fix: Omitting the last chunk problem in comp_unaware mode 2023-12-07 17:00:38 +09:00
Rodrigo
ea2a9ca2e6 use of silero model instead of silero VadIterator 2023-12-06 12:52:29 -03:00
Rodrigo
c8c786af4f use of silero model instead of silero VadIterator 2023-12-06 12:17:55 -03:00
Rodrigo
3fad8133b4 delete unused var 2023-12-01 18:08:43 -03:00
Rodrigo
9556d07484 vad 2023-12-01 17:33:46 -03:00
Dominik Macháček
64c445f073 proceedings link 2023-11-29 10:16:44 +01:00
Dominik Macháček
256ec31d21 bibtex and proceedings link 2023-11-29 10:14:30 +01:00
Wei Lu
a60c64c831 fix bug of completed sentence chunking. tested on faster-whisper in en language 2023-11-28 18:51:36 +02:00
Dominik Macháček
8f32dea5ca logfile reviewed, whisper_timestamped loading module and vad
PR #10, issues #9, #30
2023-11-28 12:16:20 +01:00
Dominik Macháček
bd0d848e7f Merge branch 'main' into TIAGo-WE-COBOT 2023-11-28 11:03:58 +01:00
Dominik Macháček
878f11cdb7 create_tokenizer in documentation
#25
2023-11-26 16:11:42 +01:00
Dominik Macháček
483badf85d Update README.md
so many "issues" with question about this :(
2023-11-23 07:41:08 +01:00
Luca
18c1434f77 backend import in child load_model method and expose logfile arg 2023-11-03 11:33:03 +01:00
Luca
f97a253273 Merge branch 'ufal:main' into main 2023-11-03 11:03:54 +01:00
Dominik Macháček
62425111e6 Update README.md
slides from oral presentation
2023-11-01 10:30:14 +08:00
Dominik Macháček
4a51e13199 segmenters for all Whisper languages 2023-09-27 23:29:50 +02:00
Luca
6e6b619257 add option to save log to file 2023-09-06 15:19:12 +02:00
Luca
c0dd2e2db9 import backend from __init__ 2023-09-06 12:39:26 +02:00
Dominik Macháček
2249846d01 Update README.md
paper link
2023-08-02 11:24:50 +02:00
Dominik Macháček
fc74626ff4 demo video on Update README.md 2023-06-28 15:20:56 +02:00