Tags · lhotse-speech/lhotse

v1.28.0

minor fix (#1418)

Nov 19, 2024
1880fc1
zip
tar.gz
Notes

v1.27.0

Fix to fixed batch size bucketing and audio loading network connectio… (

#1387)

* Fix to fixed batch size bucketing and audio loading network connection resets

* Fix tests and add more 'paranoia' tests

Aug 22, 2024
170046f
zip
tar.gz
Notes

v1.26.0

Refactor bucket selection for customization (#1377)

* Refactor bucket selection to allow customization

* Extend the API further

* Prune imports

Jul 24, 2024
21b102c
zip
tar.gz
Notes

v1.25.0

augmentation/torchaudio: add Phone effect (mulaw, lpc10 codecs) (#1348)

* augmentation/torchaudio: add Phone effect (mulaw, lpc10 codecs)

* restore_orig_sr option

---------

Co-authored-by: Piotr Żelasko <[email protected]>

Jul 18, 2024
18436e9
zip
tar.gz
Notes

v1.24.2

Releease 1.24.2

Jun 25, 2024
e76dc3c
zip
tar.gz
Notes

v1.24.1

Support for reading data from AIStore using Python SDK (#1354)

* Support for reading data from AIStore using Python SDK

* More AIStore related docs

Jun 10, 2024
866e4a8
zip
tar.gz
Notes

v1.24

Add new sampler: weighted sampler (#1344)

* add file

* add a weighted data source to enable sampling based on per-sample weight; do not allow duplicated sample within the same epoch

* add a weighted sampler; do not allow lazy mode; do not allow duplicated cut in the same batch

* modify init file accordingly

* add more documentations

* use numpy for sampling; pre-compute the indexes in __iter__ to save time

* add more documentation

* minor changes to the arguments

* remove unused file

* add test

* add more docs

* fix isort

* inherit from SimpleCutSampler; remove duplicated code

* minor fix

* Add changes requested in code review

---------

Co-authored-by: Piotr Żelasko <[email protected]>

Jun 5, 2024
4d57d53
zip
tar.gz
Notes

v1.23

In CommonVoice corpus, use .tsv headers to parse and not column index (…

…#1328)

* Fix for cv corpus

* Fix for cv corpus x2

* Debug serialization problem

* Debug serialization problem

* Undo

* Handle quote polution in CV dataset

Apr 29, 2024
b2dce78
zip
tar.gz
Notes

v1.22

Bump dev version to 1.23.0 (#1301)

Mar 7, 2024
d26d476
zip
tar.gz
Notes

v1.21

`AudioBackend` specific `save_audio` and `info`, managing missing SoX…

… in torchaudio, Python 3.12 / PyTorch 2.2 support, using `libsndfile` as preferred audio backend (#1288)

* AudioBackend supports save_audio() [code cleanup]

* Fix for Path handling; skip some tests on older torchaudio

* Move info() implementation to each AudioBackend

* Update CI configurations, fix more tests

* Conditionally build kaldifeat, fix some more tests

* Fix more tests, remove dead code, bump version

* Remaining fixes, legacy OPUS reading mode env var

* Fixes for torchaudio==2.0.0

* Fixes for save_audio/save_audios

* Skip backends for audio saving that are not applicable (e.g. torchaudio backends when it's not installed)

* Prefer LibsndfileBackend as the default lhotse backend (except for some special cases) + fix CutSet.copy_data()

Feb 13, 2024
769c273
zip
tar.gz
Notes

PreviousNext

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v1.28.0

v1.27.0

v1.26.0

v1.25.0

v1.24.2

v1.24.1

v1.24

v1.23

v1.22

v1.21

Tags: lhotse-speech/lhotse