[Hackathon 7th No.55] Add `fft_conv1d` to `PaddleSpeech` -part #3947

DrRyanHuang · 2024-12-11T04:41:54Z

PR types

New features

PR changes

APIs

Describe

fft_conv1d is much faster than conv1d when kernel_size >= 128

Implementation:

Standard conv1D applies convolutional kernels to the input signal using a sliding window approach.
fft_conv1d leverages the Fast Fourier Transform (FFT) to accelerate the convolution operation, ideally reducing the computational complexity from $$O(N \cdot K)$$ to $$O(N \log N)$$, where $$N$$ denotes the signal length and $$K$$ represents the kernel length.

Performance:

Standard conv1D: Typically performs well with small kernels or short input signals.
Simple to implement and easy to debug.
fft_conv1d: Generally offers performance improvements when the kernel size is large (e.g., above 256) and the stride is 1. However, the introduction of FFT and IFFT incurs additional memory overhead.

Result:

paddle-bot · 2024-12-11T04:41:58Z

Thanks for your contribution!

zxcd · 2024-12-18T09:42:04Z

paddlespeech/t2s/modules/fftconv1d.py

+]
+
+
+def __unfold(_input, kernel_size: int, stride: int):


What does this api do?

paddlespeech/t2s/modules/fftconv1d.py

zxcd · 2024-12-18T09:57:47Z

paddlespeech/t2s/modules/fftconv1d.py

+
+
+# Currently, the API unfold in Paddle is extremely slow, so __unfold is implemented 
+# using the `.strides` and `.as_strided` APIs. However, these are only supported in 


paddle do not have .strides release api. Also for as_strided support >2.6, but can be combined with view and reshape .

paddlespeech/t2s/modules/fftconv1d.py

zxcd · 2024-12-18T09:59:46Z

paddlespeech/t2s/modules/fftconv1d.py

+# Paddle version 2.6 and above, so F.conv1d and Conv1D are used as replacements.
+version = paddle.__version__
+
+if version < '2.6':


v2.6 and above cannot be used?

tests/unit/tts/test_fftconv1d.py

zxcd

LGTM

add fft_conv1d

8b56087

paddle-bot bot added the contributor label Dec 11, 2024

mergify bot added T2S Test labels Dec 11, 2024

DrRyanHuang mentioned this pull request Dec 11, 2024

【Hackathon 7th No.55】在 PaddleSpeech 中实现 audiotools PaddlePaddle/community#1017

Merged

luotao1 mentioned this pull request Dec 11, 2024

【Hackathon 7th】开源贡献个人挑战赛 PaddlePaddle/Paddle#68244

Open

DrRyanHuang added 6 commits December 11, 2024 15:14

add unitest 2 shell

759b0e7

fix paddle version

b11ea3e

rename

fdd102a

add comment

2c424e3

bias -> bias_attr

bb8704a

fix unitest

136f7c5

zxcd reviewed Dec 18, 2024

View reviewed changes

tests/unit/tts/test_fftconv1d.py Outdated Show resolved Hide resolved

fix sth

f3c0063

zxcd approved these changes Dec 24, 2024

View reviewed changes

zxcd merged commit ee4f158 into PaddlePaddle:develop Dec 24, 2024
5 checks passed

DrRyanHuang deleted the fft_conv1d branch December 30, 2024 05:47

luotao1 changed the title ~~[Hackathon 7th No.55] Add fft_conv1d to PaddleSpeech~~ [Hackathon 7th No.55] Add fft_conv1d to PaddleSpeech -part Jan 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Hackathon 7th No.55] Add `fft_conv1d` to `PaddleSpeech` -part #3947

[Hackathon 7th No.55] Add `fft_conv1d` to `PaddleSpeech` -part #3947

DrRyanHuang commented Dec 11, 2024 •

edited

Loading

paddle-bot bot commented Dec 11, 2024

zxcd Dec 18, 2024

zxcd Dec 18, 2024

zxcd Dec 18, 2024

zxcd left a comment



		# Currently, the API unfold in Paddle is extremely slow, so __unfold is implemented
		# using the `.strides` and `.as_strided` APIs. However, these are only supported in

[Hackathon 7th No.55] Add fft_conv1d to PaddleSpeech -part #3947

[Hackathon 7th No.55] Add fft_conv1d to PaddleSpeech -part #3947

Conversation

DrRyanHuang commented Dec 11, 2024 • edited Loading

PR types

PR changes

Describe

Implementation:

Performance:

Result:

paddle-bot bot commented Dec 11, 2024

zxcd Dec 18, 2024

Choose a reason for hiding this comment

zxcd Dec 18, 2024

Choose a reason for hiding this comment

zxcd Dec 18, 2024

Choose a reason for hiding this comment

zxcd left a comment

Choose a reason for hiding this comment

[Hackathon 7th No.55] Add `fft_conv1d` to `PaddleSpeech` -part #3947

[Hackathon 7th No.55] Add `fft_conv1d` to `PaddleSpeech` -part #3947

DrRyanHuang commented Dec 11, 2024 •

edited

Loading