Move sample rate and sample format conversion utils into `FFMPEGCommon.cpp` #629

NicolasHug · 2025-04-09T11:01:41Z

This PR moves the sample rate and sample format conversions utils from SingleStreamDecoder into FFMPEGCommon. Sample format conversion is needed for encoding too, so we need to make them common.

Specifically:

SingleStreamDecoder::createSwrContext is removed and its logic is not part of FFMPEGCommon's allocateSwrContext, which was renamed into createSwrContext
SingleStreamDecoder::convertAudioAVFrameSampleFormatAndSampleRate is moved too

Inputs are slightly modified to account for the fact that there's no streamInfo_ anymore. Other than that, this is just copy/pasting code.

…oder

scotts · 2025-04-11T14:29:57Z

src/torchcodec/_core/FFMPEGCommon.cpp

  return swrContext;
 }

+UniqueAVFrame convertAudioAVFrameSampleFormatAndSampleRate(
+    const UniqueSwrContext& swrContext,
+    const UniqueAVFrame& srcAVFrame,


Nit: we should be consistent about src and source. I have a preference for src, as it's a universal abbreviation, particularly when paired with dst. But if we say source in a lot of other places, we should stick with that.

Double nit, and I recognize this name already existed: convertAudioAVFrameSampleFormatAndSampleRate() is very long, and I feel like we're encoding parameter names that modify the operation into the name. I feel like it's clearer as just convertAudioAVFrame().

Sounds good, I'll merge as-is and follow-up with a PR to address these

NicolasHug added 11 commits April 7, 2025 16:15

Disable FFmpeg logs for encoder

7921558

Merge branch 'main' of github.com:pytorch/torchcodec into loglevelenc…

2a19014

…oder

Use c++ strings

73bdc85

Merge branch 'main' of github.com:pytorch/torchcodec into loglevelenc…

54f5543

…oder

Account for frame_size being 0

24842b6

Merge branch 'main' of github.com:pytorch/torchcodec into encoding_wav

c3ac80a

WIP

5b39c8f

Move createSwrContext in ffmpeg file

1f9f904

WIP

f525848

Move convertAudioAVFrameSampleFormatAndSampleRate in ffmpeg file

9150137

revert stuff

cf8dd11

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Apr 9, 2025

NicolasHug mentioned this pull request Apr 10, 2025

Support encoding into a bytes tensor #635

Merged

scotts reviewed Apr 11, 2025

View reviewed changes

scotts approved these changes Apr 11, 2025

View reviewed changes

NicolasHug merged commit 12cdaa8 into pytorch:main Apr 11, 2025
46 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move sample rate and sample format conversion utils into `FFMPEGCommon.cpp` #629

Move sample rate and sample format conversion utils into `FFMPEGCommon.cpp` #629

NicolasHug commented Apr 9, 2025 •

edited

Loading

scotts Apr 11, 2025

scotts Apr 11, 2025

NicolasHug Apr 11, 2025

Move sample rate and sample format conversion utils into FFMPEGCommon.cpp #629

Move sample rate and sample format conversion utils into FFMPEGCommon.cpp #629

Conversation

NicolasHug commented Apr 9, 2025 • edited Loading

scotts Apr 11, 2025

Choose a reason for hiding this comment

scotts Apr 11, 2025

Choose a reason for hiding this comment

NicolasHug Apr 11, 2025

Choose a reason for hiding this comment

Move sample rate and sample format conversion utils into `FFMPEGCommon.cpp` #629

Move sample rate and sample format conversion utils into `FFMPEGCommon.cpp` #629

NicolasHug commented Apr 9, 2025 •

edited

Loading