Right now, due to my inexperience with templates, I have implemented the numpy-like interface functions using a number of (inline) functions that call a template instance, e.g.:
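A minimal sketch of that pattern, with hypothetical signatures (the wrapper overloads, the `_fft_` template parameters and the container types here are illustrative, not the library's actual code):

```cpp
#include <complex>
#include <xtensor/xarray.hpp>

// Stub for illustration; the real _fft_ template lives in the library.
template <typename in_t, typename out_t>
xt::xarray<out_t> _fft_(const xt::xarray<in_t>& input);

// One inline wrapper per precision, each hard-coding the output type and
// forwarding to a specific instance of the _fft_ template.
inline xt::xarray<std::complex<float>> fft(const xt::xarray<float>& input) {
  return _fft_<float, std::complex<float>>(input);
}

inline xt::xarray<std::complex<double>> fft(const xt::xarray<double>& input) {
  return _fft_<double, std::complex<double>>(input);
}

// ...and similarly for ifft, rfft, irfft, hfft, ihfft and their
// multidimensional variants.
```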
I'm not sure, but it seems likely that the compiler will simply compile all these functions when a user includes this header. This makes the resulting binary a lot larger than it needs to be and increases compile time.
I think I now know how to rewrite the `_fft_` and `_ifft_` templates in such a way that I can write `using` template aliases for the three families (`fft`, `rfft` and `hfft`) that depend only on input precision. This will involve the following modifications to `_fft_` and `_ifft_`:
- Putting the FFTW plan creation functions (`constexpr static`) in templates with parameters for precision, dimensionality, "c2c vs r2c" and direction.
- Using the FFTW `execute` and `destroy_plan` functions that I put in the precision-dependent `fftw_t` template as well (these are also `constexpr static`).
- Putting the output type in trailing `->` notation, using `decltype` or `declval` to determine the type based on template parameters for input precision, "c2c vs r2c" and direction.
That last point was the crucial knowledge gap that led me to write the functions the way I did: I couldn't get automatic type deduction working for the output type.
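A sketch of how the deduced return type could look, assuming an illustrative `fft_output` trait and `dir` enum (neither is the library's actual API):

```cpp
#include <complex>
#include <xtensor/xarray.hpp>

// Illustrative names (dir, fft_output); not the library's actual API.
enum class dir { forward, backward };

// Map precision, "c2c vs r2c" and direction to an output value type:
// c2c transforms stay complex, the backward r2c transform yields real.
template <typename prec_t, bool r2c, dir d>
struct fft_output { using type = std::complex<prec_t>; };

template <typename prec_t>
struct fft_output<prec_t, true, dir::backward> { using type = prec_t; };

// A single generic transform; the trailing return type is computed from
// the template parameters instead of being spelled out per overload.
// (The input type would vary in the same way; kept generic here.)
template <typename prec_t, bool r2c, dir d, typename input_t>
auto _fft_(const input_t& input)
    -> xt::xarray<typename fft_output<prec_t, r2c, d>::type>;

// The numpy-like entry points then become thin forwards that depend only
// on input precision:
template <typename prec_t>
auto fft(const xt::xarray<prec_t>& input) {
  return _fft_<prec_t, false, dir::forward>(input);
}
```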
Suggestion by @LourensVeen: also rewrite them so that they can take in xexpressions or any other xtensor container that supports the right operators. Look at examples in the other xtensor libs.
This would be preferable, as the library currently requires `xarray` etc. I believe the philosophy of xtensor is that the algorithms should be agnostic to the storage type used. In my case I unfortunately need to expose these to Python, so there the `pytensor` class is best, as it avoids the need to copy.
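A rough sketch of the `xexpression`-based signature used elsewhere in the xtensor ecosystem (the body and the evaluation step are illustrative, not a definitive implementation):

```cpp
#include <xtensor/xarray.hpp>
#include <xtensor/xeval.hpp>
#include <xtensor/xexpression.hpp>

// Sketch only: accept any xtensor expression (xarray, xtensor, pytensor,
// views, lazy expressions, ...) rather than one concrete container type.
template <class E>
auto fft(const xt::xexpression<E>& e) {
  // Evaluate the (possibly lazy) expression into a concrete container
  // before handing its buffer to FFTW, which needs contiguous memory.
  auto input = xt::eval(e.derived_cast());
  // ... create the plan, execute it, fill and return the output ...
  return input;  // placeholder; the real function returns the transform
}
```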
> I'm not sure, but it seems likely that the compiler will simply compile all these functions when a user includes this header.
FYI, a template itself is only compiled when it is instantiated. The inline functions will be parsed and checked, but no code should be emitted for them unless they are actually used; the template is not fully processed by the compiler until it is used with concrete arguments. If the binary is very large, it is possible that the implementation is also getting inlined. I haven't looked into the details yet.
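A self-contained illustration of that compilation model (names are hypothetical):

```cpp
// A function template produces no object code by itself; twice<double>
// only comes into existence once something uses it.
template <typename T>
T twice(T x) { return x + x; }

// An inline function is parsed and checked in every translation unit that
// includes it, but the compiler need not emit code for it unless it is
// actually called (and calls are often inlined away entirely).
inline double twice_d(double x) { return twice(x); }

int main() {
  return static_cast<int>(twice_d(2.0));  // the one use that triggers emission
}
```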