~ubuntu-branches/ubuntu/trusty/libsoxr/trusty

« back to all changes in this revision

Viewing changes to src/pffft.h

Committer: Package Import Robot
Author(s): Benjamin Drung
Date: 2013-01-19 13:59:15 UTC
Revision ID: package-import@ubuntu.com-20130119135915-ig85015j5zwtf0rp

Tags: upstream-0.1.0

Import upstream version 0.1.0

files added:

AUTHORS

CMakeLists.txt

COPYING.LGPL

INSTALL

LICENCE

NEWS

README

TODO

cmake

cmake/Modules

cmake/Modules/FindLibAVCodec.cmake

cmake/Modules/FindOpenMP.cmake

cmake/Modules/FindSIMD.cmake

cmake/Modules/TestBigEndian.cmake

deinstall.cmake.in

examples

examples/1-single-block.c

examples/1a-lsr.c

examples/2-stream.C

examples/3-options-input-fn.c

examples/4-split-channels.c

examples/5-variable-rate.c

examples/CMakeLists.txt

examples/README

examples/examples-common.h

go.bat

inst-check

inst-check-soxr

inst-check-soxr-lsr

msvc

msvc/README

msvc/libsoxr.vcproj

msvc/soxr-config.h

soxr-config.h.in

src/CMakeLists.txt

src/aliases.h

src/avfft32.c

src/avfft32s.c

src/ccrw2.h

src/data-io.c

src/data-io.h

src/dbesi0.c

src/fft4g.c

src/fft4g.h

src/fft4g32.c

src/fft4g32s.c

src/fft4g64.c

src/fft4g_cache.h

src/fifo.h

src/filter.c

src/filter.h

src/filters.h

src/half-fir.h

src/half_coefs.h

src/internal.h

src/libsoxr-dev.src.in

src/libsoxr.src.in

src/lsr.c

src/pffft.c

src/pffft.h

src/pffft32.c

src/pffft32s.c

src/poly-fir.h

src/poly-fir0.h

src/rate.h

src/rate32.c

src/rate32s.c

src/rate64.c

src/rdft.h

src/rint-clip.h

src/rint.h

src/samplerate.h

src/simd-dev.h

src/simd.c

src/simd.h

src/soxr-lsr.h

src/soxr-lsr.pc.in

src/soxr.c

src/soxr.h

src/soxr.pc.in

src/vr32.c

tests

tests/CMakeLists.txt

tests/README

tests/cmp-test.cmake

tests/eg-test

tests/io-test

tests/large-ratio

tests/vector-cmp.c

tests/vector-gen.c

Show diffs side-by-side

added added

removed removed

src/pffft.h

Based on original fortran 77 code from FFTPACKv4 from NETLIB,

authored by Dr Paul Swarztrauber of NCAR, in 1985.

As confirmed by the NCAR fftpack software curators, the following

FFTPACKv5 license applies to FFTPACKv4 sources. My changes are

released under the same terms.

FFTPACK license:

http://www.cisl.ucar.edu/css/software/fftpack5/ftpk.html

Computational and Information Systems Laboratory, UCAR,

www.cisl.ucar.edu.

Redistribution and use of the Software in source and binary forms,

with or without modification, is permitted provided that the

following conditions are met:

- Neither the names of NCAR's Computational and Information Systems

Laboratory, the University Corporation for Atmospheric Research,

nor the names of its sponsors or contributors may be used to

endorse or promote products derived from this Software without

specific prior written permission.

- Redistributions of source code must retain the above copyright

notices, this list of conditions, and the disclaimer below.

- Redistributions in binary form must reproduce the above copyright

notice, this list of conditions, and the disclaimer below in the

documentation and/or other materials provided with the

distribution.

THIS SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,

EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO THE WARRANTIES OF

MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND

NONINFRINGEMENT. IN NO EVENT SHALL THE CONTRIBUTORS OR COPYRIGHT

HOLDERS BE LIABLE FOR ANY CLAIM, INDIRECT, INCIDENTAL, SPECIAL,

EXEMPLARY, OR CONSEQUENTIAL DAMAGES OR OTHER LIABILITY, WHETHER IN AN

ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN

CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS WITH THE

SOFTWARE.

PFFFT : a Pretty Fast FFT.

This is basically an adaptation of the single precision fftpack

(v4) as found on netlib taking advantage of SIMD instruction found

on cpus such as intel x86 (SSE1), powerpc (Altivec), and arm (NEON).

For architectures where no SIMD instruction is available, the code

falls back to a scalar version.

Restrictions:

- 1D transforms only, with 32-bit single precision.

- supports only transforms for inputs of length N of the form

N=(2^a)*(3^b), a >= 5 and b >=0 (32, 48, 64, 96, 128, 144 etc

are all acceptable lengths). Performance is best for 128<=N<=8192.

- all (float*) pointers in the functions below are expected to

have an "simd-compatible" alignment, that is 16 bytes on x86 and

powerpc CPUs.

You can allocate such buffers with the functions

pffft_aligned_malloc / pffft_aligned_free (or with stuff like

posix_memalign..)

#ifndef PFFFT_H

#define PFFFT_H

#include <stddef.h>

#ifdef __cplusplus

extern "C" {

#endif

/* opaque struct holding internal stuff (precomputed twiddle factors)

this struct can be shared by many threads as it contains only

read-only data.

typedef struct PFFFT_Setup PFFFT_Setup;

/* direction of the transform */

typedef enum { PFFFT_FORWARD, PFFFT_BACKWARD } pffft_direction_t;

/* type of transform */

typedef enum { PFFFT_REAL, PFFFT_COMPLEX } pffft_transform_t;

prepare for performing transforms of size N -- the returned

PFFFT_Setup structure is read-only so it can safely be shared by

100

multiple concurrent threads.

101

102

static PFFFT_Setup *pffft_new_setup(int N, pffft_transform_t transform);

103

static void pffft_destroy_setup(PFFFT_Setup *);

104

105

Perform a Fourier transform , The z-domain data is stored in the

106

most efficient order for transforming it back, or using it for

107

convolution. If you need to have its content sorted in the

108

"usual" way, that is as an array of interleaved complex numbers,

109

either use pffft_transform_ordered , or call pffft_zreorder after

110

the forward fft, and before the backward fft.

111

112

Transforms are not scaled: PFFFT_BACKWARD(PFFFT_FORWARD(x)) = N*x.

113

Typically you will want to scale the backward transform by 1/N.

114

115

The 'work' pointer should point to an area of N (2*N for complex

116

fft) floats, properly aligned. [del]If 'work' is NULL, then stack will

117

be used instead (this is probably the beest strategy for small

118

FFTs, say for N < 16384).[/del]

119

120

input and output may alias.

121

122

static void pffft_transform(PFFFT_Setup *setup, const float *input, float *output, float *work, pffft_direction_t direction);

123

124

125

Similar to pffft_transform, but makes sure that the output is

126

ordered as expected (interleaved complex numbers). This is

127

similar to calling pffft_transform and then pffft_zreorder.

128

129

input and output may alias.

130

131

static void pffft_transform_ordered(PFFFT_Setup *setup, const float *input, float *output, float *work, pffft_direction_t direction);

132

133

134

call pffft_zreorder(.., PFFFT_FORWARD) after pffft_transform(...,

135

PFFFT_FORWARD) if you want to have the frequency components in

136

the correct "canonical" order, as interleaved complex numbers.

137

138

(for real transforms, both 0-frequency and half frequency

139

components, which are real, are assembled in the first entry as

140

F(0)+i*F(n/2+1). Note that the original fftpack did place

141

F(n/2+1) at the end of the arrays).

142

143

input and output should not alias.

144

145

static void pffft_zreorder(PFFFT_Setup *setup, const float *input, float *output, pffft_direction_t direction);

146

147

148

Perform a multiplication of the frequency components of dft_a and

149

dft_b and accumulate them into dft_ab. The arrays should have

150

been obtained with pffft_transform(.., PFFFT_FORWARD) and should

151

*not* have been reordered with pffft_zreorder (otherwise just

152

perform the operation yourself as the dft coefs are stored as

153

interleaved complex numbers).

154

155

the operation performed is: dft_ab += (dft_a * fdt_b)*scaling

156

157

The dft_a, dft_b and dft_ab pointers may alias.

158

void pffft_zconvolve_accumulate(PFFFT_Setup *setup, const float *dft_a, const float *dft_b, float *dft_ab, float scaling);

159

160

161

162

the operation performed is: dft_ab = (dft_a * fdt_b)

163

164

The dft_a, dft_b and dft_ab pointers may alias.

165

166

static void pffft_zconvolve(PFFFT_Setup *setup, const float *dft_a, const float *dft_b, float *dft_ab);

167

168

/* return 4 or 1 wether support SSE/Altivec instructions was enable when building pffft.c */

169

int pffft_simd_size(void);

170

171

static void pffft_reorder_back(int length, void * setup, float * data, float * work);

172

173

#ifdef __cplusplus

174

}

175

#endif

176

177

#endif

Older »