Tags: lhotse-speech/lhotse
Tags
Add new sampler: weighted sampler (#1344) * add file * add a weighted data source to enable sampling based on per-sample weight; do not allow duplicated sample within the same epoch * add a weighted sampler; do not allow lazy mode; do not allow duplicated cut in the same batch * modify init file accordingly * add more documentations * use numpy for sampling; pre-compute the indexes in __iter__ to save time * add more documentation * minor changes to the arguments * remove unused file * add test * add more docs * fix isort * inherit from SimpleCutSampler; remove duplicated code * minor fix * Add changes requested in code review --------- Co-authored-by: Piotr Żelasko <[email protected]>
`AudioBackend` specific `save_audio` and `info`, managing missing SoX… … in torchaudio, Python 3.12 / PyTorch 2.2 support, using `libsndfile` as preferred audio backend (#1288) * AudioBackend supports save_audio() [code cleanup] * Fix for Path handling; skip some tests on older torchaudio * Move info() implementation to each AudioBackend * Update CI configurations, fix more tests * Conditionally build kaldifeat, fix some more tests * Fix more tests, remove dead code, bump version * Remaining fixes, legacy OPUS reading mode env var * Fixes for torchaudio==2.0.0 * Fixes for save_audio/save_audios * Skip backends for audio saving that are not applicable (e.g. torchaudio backends when it's not installed) * Prefer LibsndfileBackend as the default lhotse backend (except for some special cases) + fix CutSet.copy_data()
PreviousNext