Fix MP3 resampling when a dataset's audio files have different sampling rates #3665

lhoestq · 2022-02-02T10:31:45Z

The resampler needs to be updated if the orig_freq doesn't match the audio file sampling rate

anton-l

LGTM, thanks for the fix!

anton-l · 2022-02-02T10:38:00Z

src/datasets/features/audio.py

@@ -174,7 +174,7 @@ def _decode_mp3(self, path_or_file):

        array, sampling_rate = torchaudio.load(path_or_file, format="mp3")
        if self.sampling_rate and self.sampling_rate != sampling_rate:
-            if not hasattr(self, "_resampler"):
+            if not hasattr(self, "_resampler") or self._resampler.orig_freq != sampling_rate:


Nice workaround, this should minimize the number of required _resampler replacements 👍

lhoestq added 3 commits February 2, 2022 11:14

fix mp3 resampling

d9e7618

add test

e8ebbaa

style

13a3cee

lhoestq requested a review from anton-l February 2, 2022 10:36

anton-l reviewed Feb 2, 2022

View reviewed changes

fix test

b357de7

lhoestq merged commit 9ee6d90 into master Feb 2, 2022

lhoestq deleted the fix-mp3-resampling branch February 2, 2022 10:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix MP3 resampling when a dataset's audio files have different sampling rates #3665

Fix MP3 resampling when a dataset's audio files have different sampling rates #3665

lhoestq commented Feb 2, 2022

anton-l left a comment

anton-l Feb 2, 2022

Fix MP3 resampling when a dataset's audio files have different sampling rates #3665

Fix MP3 resampling when a dataset's audio files have different sampling rates #3665

Conversation

lhoestq commented Feb 2, 2022

anton-l left a comment

Choose a reason for hiding this comment

anton-l Feb 2, 2022

Choose a reason for hiding this comment