Tags: stevenhillis/audio
Tags
Annotate MelSpectrogram power as float only (pytorch#1572) Mel scale frequency is in general defined/used on power spectrogram and as far as we know, it is not defined for raw (complex value) spectrogram. When `power=None` (when intermediate spectrogram is complex-valued), `MelSpectrogram` fails because of extra dimension (when using pseudo complex) or type mismatch between `torch.float` and torch.cfloat` (when using native complex). This commit changes type annotation so that power is only float.
Add dtype argument for kernel caching precision (pytorch#1556) Since 0.9.0-RC1, `T.Resample` precomputes and caches resampling kernel for performance improvement. (10x improvement). The implementation from 0.8.0 computed the kernel on-the-fly on the same `device`/`dtype` as the input Tensor, but in the newer version, the kernel is precomputed at the construction time and is cached with `float32` first. This causes degradation if one wants to perform resampling on `float64`, because `sinc` values computed on `float32`s are not good enough for resampling in `float64`. The reason why we decided to use `float32` for initial caching is to keep the UX disruption minimum, and there were no way to make it work for `float64`. This PR adds `dtype` argument, that can be used for overwriting the cache precision.
Ignore return_complex when returning real-valued tensor in spectrogra… …m. (pytorch#1551)
[v0.8.1] .circleci: Bump versions for pytorch 1.8.1 (pytorch#1404) Signed-off-by: Eli Uriegas <[email protected]>
[v0.8.1] .circleci: Bump versions for pytorch 1.8.1 (pytorch#1404) Signed-off-by: Eli Uriegas <[email protected]>
PreviousNext