~ubuntu-branches/ubuntu/raring/fftw3/raring-proposed

Viewing changes to doc/FAQ/fftw-faq.ascii

Committer: Bazaar Package Importer
Author(s): Paul Brossier
Date: 2006-05-31 13:44:05 UTC
mfrom: (1.1.1 upstream)
Revision ID: james.westby@ubuntu.com-20060531134405-ol9hrbg6bh81sg0c

Tags: 3.1.1-1

http://bugs.debian.org/350327

http://bugs.debian.org/338487

http://bugs.debian.org/338501

* New upstream release (closes: #350327, #338487, #338501)
* Add --enable-portable-binary to use -mtune instead of -march
* Use --with-gcc-arch=G5 / pentium4 on powerpc / i386
* Updated Standards-Version
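
The two configure-related entries above can be sketched as a small shell helper. This is only an illustration, not the packaging code from this revision: the function name `fftw_configure_flags` is hypothetical, and it assumes that `--enable-portable-binary` is passed on every architecture while `--with-gcc-arch` is added only on powerpc and i386, as the changelog wording suggests.

```shell
#!/bin/sh
# Hypothetical helper (not taken from debian/rules): map a Debian
# architecture name to the configure flags named in the changelog.
fftw_configure_flags() {
    # Assumption: the portable-binary flag applies to all architectures.
    flags="--enable-portable-binary"
    case "$1" in
        powerpc) flags="$flags --with-gcc-arch=G5" ;;
        i386)    flags="$flags --with-gcc-arch=pentium4" ;;
    esac
    printf '%s\n' "$flags"
}
```

A build script could then call something like `./configure $(fftw_configure_flags i386)`; other architectures would get only the portable-binary flag under the assumption stated above.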

files added:
api/malloc.c

dft/bluestein.c

dft/codelets/standard/n1_32.c

dft/codelets/standard/n1_64.c

dft/codelets/standard/q1_2.c

dft/codelets/standard/q1_3.c

dft/codelets/standard/q1_4.c

dft/codelets/standard/q1_5.c

dft/codelets/standard/q1_6.c

dft/codelets/standard/q1_8.c

dft/ctsq.c

dft/dftw-direct.c

dft/dftw-generic.c

dft/dftw-genericbuf.c

dft/indirect-transpose.c

dft/simd/codelets/n1bv_32.c

dft/simd/codelets/n1bv_64.c

dft/simd/codelets/n1fv_32.c

dft/simd/codelets/n1fv_64.c

dft/simd/codelets/n2bv_32.c

dft/simd/codelets/n2bv_64.c

dft/simd/codelets/n2fv_32.c

dft/simd/codelets/n2fv_64.c

dft/simd/codelets/n2sv_16.c

dft/simd/codelets/n2sv_32.c

dft/simd/codelets/n2sv_4.c

dft/simd/codelets/n2sv_64.c

dft/simd/codelets/n2sv_8.c

dft/simd/codelets/t1sv_16.c

dft/simd/codelets/t1sv_2.c

dft/simd/codelets/t1sv_32.c

dft/simd/codelets/t1sv_4.c

dft/simd/codelets/t1sv_8.c

dft/simd/codelets/t2bv_16.c

dft/simd/codelets/t2bv_2.c

dft/simd/codelets/t2bv_32.c

dft/simd/codelets/t2bv_4.c

dft/simd/codelets/t2bv_64.c

dft/simd/codelets/t2bv_8.c

dft/simd/codelets/t2fv_16.c

dft/simd/codelets/t2fv_2.c

dft/simd/codelets/t2fv_32.c

dft/simd/codelets/t2fv_4.c

dft/simd/codelets/t2fv_64.c

dft/simd/codelets/t2fv_8.c

dft/simd/codelets/t2sv_16.c

dft/simd/codelets/t2sv_32.c

dft/simd/codelets/t2sv_4.c

dft/simd/codelets/t2sv_8.c

dft/simd/codelets/t3bv_16.c

dft/simd/codelets/t3bv_32.c

dft/simd/codelets/t3bv_4.c

dft/simd/codelets/t3bv_8.c

dft/simd/codelets/t3fv_16.c

dft/simd/codelets/t3fv_32.c

dft/simd/codelets/t3fv_4.c

dft/simd/codelets/t3fv_8.c

dft/simd/n2s.c

dft/simd/n2s.h

dft/simd/t.c

dft/simd/t2b.h

dft/simd/t2f.h

dft/simd/t3b.h

dft/simd/t3f.h

dft/simd/ts.c

dft/simd/ts.h

doc/html/1d-Discrete-Hartley-Transforms-_0028DHTs_0029.html

doc/html/1d-Real_002deven-DFTs-_0028DCTs_0029.html

doc/html/1d-Real_002dodd-DFTs-_0028DSTs_0029.html

doc/html/Advanced-Real_002ddata-DFTs.html

doc/html/Advanced-Real_002dto_002dreal-Transforms.html

doc/html/Column_002dmajor-Format.html

doc/html/Complex-Multi_002dDimensional-DFTs.html

doc/html/Complex-One_002dDimensional-DFTs.html

doc/html/Dynamic-Arrays-in-C_002dThe-Wrong-Way.html

doc/html/Fixed_002dsize-Arrays-in-C.html

doc/html/Fortran_002dinterface-routines.html

doc/html/Guru-Real_002ddata-DFTs.html

doc/html/Guru-Real_002dto_002dreal-Transforms.html

doc/html/How-Many-Threads-to-Use_003f.html

doc/html/Installation-and-Supported-Hardware_002fSoftware.html

doc/html/Installation-on-non_002dUnix-systems.html

doc/html/Multi_002dDimensional-DFTs-of-Real-Data.html

doc/html/Multi_002ddimensional-Array-Format.html

doc/html/Multi_002ddimensional-Transforms.html

doc/html/Multi_002dthreaded-FFTW.html

doc/html/One_002dDimensional-DFTs-of-Real-Data.html

doc/html/Real-even_002fodd-DFTs-_0028cosine_002fsine-transforms_0029.html

doc/html/Real_002ddata-DFT-Array-Format.html

doc/html/Real_002ddata-DFTs.html

doc/html/Real_002dto_002dReal-Transform-Kinds.html

doc/html/Real_002dto_002dReal-Transforms.html

doc/html/Row_002dmajor-Format.html

doc/html/SIMD-alignment-and-fftw_005fmalloc.html

doc/html/The-1d-Discrete-Fourier-Transform-_0028DFT_0029.html

doc/html/The-1d-Real_002ddata-DFT.html

doc/html/The-Halfcomplex_002dformat-DFT.html

doc/html/Usage-of-Multi_002dthreaded-FFTW.html

doc/html/Wisdom-of-Fortran_003f.html

doc/html/Words-of-Wisdom_002dSaving-Plans.html

genfft/gen_mdct.ml

kernel/cpy1d.c

kernel/cpy2d-pair.c

kernel/cpy2d.c

kernel/kalloc.c

kernel/tile2d.c

libbench2/my-getopt.c

libbench2/my-getopt.h

libbench2/timer2.c

m4

m4/acx_pthread.m4

m4/amx_prog_as.m4

m4/ax_cc_maxopt.m4

m4/ax_check_compiler_flags.m4

m4/ax_compiler_vendor.m4

m4/ax_gcc_aligns_stack.m4

m4/ax_gcc_archflag.m4

m4/ax_gcc_version.m4

m4/ax_gcc_x86_cpuid.m4

m4/ax_openmp.m4

m4/ocaml.m4

rdft/codelets/hc2r/hc2rIII_64.c

rdft/codelets/hc2r/hc2r_128.c

rdft/codelets/hc2r/hc2r_64.c

rdft/codelets/r2hc/r2hcII_64.c

rdft/codelets/r2hc/r2hc_128.c

rdft/codelets/r2hc/r2hc_64.c

rdft/hc2hc-common.c

rdft/hc2hc-direct.c

rdft/hc2hc-directbuf.c

rdft/hc2hc-generic.c

rdft/khc2hc.c

rdft/vrank3-transpose.c

reodft/reodft00e-splitradix.c

simd/nonportable

simd/nonportable/Makefile.am

simd/nonportable/Makefile.in

simd/nonportable/sse.c

simd/nonportable/sse2.c

simd/x86-cpuid.h

support/twovers.sh

threads/ct.c

threads/hc2hc.c

files removed:
acinclude.m4

acx_pthread.m4

dft/codelets/inplace

dft/codelets/inplace/Makefile.am

dft/codelets/inplace/Makefile.in

dft/codelets/inplace/codlist.c

dft/codelets/inplace/q1_2.c

dft/codelets/inplace/q1_3.c

dft/codelets/inplace/q1_4.c

dft/codelets/inplace/q1_5.c

dft/codelets/inplace/q1_6.c

dft/codelets/inplace/q1_8.c

dft/codelets/standard/m1_16.c

dft/codelets/standard/m1_32.c

dft/codelets/standard/m1_64.c

dft/ct-dif.c

dft/ct-dit.c

dft/ct-ditbuf.c

dft/ct-ditf.c

dft/rader-omega.c

dft/rank0.c

dft/simd/codelets/m1bv_16.c

dft/simd/codelets/m1bv_32.c

dft/simd/codelets/m1bv_64.c

dft/simd/codelets/m1fv_16.c

dft/simd/codelets/m1fv_32.c

dft/simd/codelets/m1fv_64.c

dft/simd/codelets/m2bv_16.c

dft/simd/codelets/m2bv_32.c

dft/simd/codelets/m2bv_64.c

dft/simd/codelets/m2fv_16.c

dft/simd/codelets/m2fv_32.c

dft/simd/codelets/m2fv_64.c

dft/simd/codelets/n2bv_11.c

dft/simd/codelets/n2bv_13.c

dft/simd/codelets/n2bv_15.c

dft/simd/codelets/n2bv_3.c

dft/simd/codelets/n2bv_5.c

dft/simd/codelets/n2bv_7.c

dft/simd/codelets/n2bv_9.c

dft/simd/codelets/n2fv_11.c

dft/simd/codelets/n2fv_13.c

dft/simd/codelets/n2fv_15.c

dft/simd/codelets/n2fv_3.c

dft/simd/codelets/n2fv_5.c

dft/simd/codelets/n2fv_7.c

dft/simd/codelets/n2fv_9.c

dft/simd/t1b.c

dft/simd/t1f.c

dft/vrank2-transpose.c

dft/vrank3-transpose.c

doc/fftw3.info-1

doc/fftw3.info-2

doc/fftw3.info-3

doc/fftw3.info-4

doc/fftw3.info-5

doc/html/1d-Discrete-Hartley-Transforms--DHTs-.html

doc/html/1d-Real-even-DFTs--DCTs-.html

doc/html/1d-Real-odd-DFTs--DSTs-.html

doc/html/Advanced-Real-data-DFTs.html

doc/html/Advanced-Real-to-real-Transforms.html

doc/html/Column-major-Format.html

doc/html/Complex-Multi-Dimensional-DFTs.html

doc/html/Complex-One-Dimensional-DFTs.html

doc/html/Dynamic-Arrays-in-C-The-Wrong-Way.html

doc/html/Fortran-interface-routines.html

doc/html/Guru-Real-data-DFTs.html

doc/html/Guru-Real-to-real-Transforms.html

doc/html/How-Many-Threads-to-Use-.html

doc/html/Installation-and-Supported-Hardware-Software.html

doc/html/Installation-on-non-Unix-systems.html

doc/html/Multi-Dimensional-DFTs-of-Real-Data.html

doc/html/Multi-dimensional-Array-Format.html

doc/html/Multi-dimensional-Transforms.html

doc/html/Multi-threaded-FFTW.html

doc/html/One-Dimensional-DFTs-of-Real-Data.html

doc/html/Real-data-DFT-Array-Format.html

doc/html/Real-data-DFTs.html

doc/html/Real-even-odd-DFTs--cosine-sine-transforms-.html

doc/html/Real-to-Real-Transform-Kinds.html

doc/html/Real-to-Real-Transforms.html

doc/html/Row-major-Format.html

doc/html/SIMD-alignment-and-fftw_malloc.html

doc/html/Static-Arrays-in-C.html

doc/html/The-1d-Discrete-Fourier-Transform--DFT-.html

doc/html/The-1d-Real-data-DFT.html

doc/html/The-Halfcomplex-format-DFT.html

doc/html/Usage-of-Multi-threaded-FFTW.html

doc/html/Wisdom-of-Fortran-.html

doc/html/Words-of-Wisdom-Saving-Plans.html

genfft/gen_hc2r_noinline.ml

genfft/gen_notw_noinline.ml

genfft/gen_notw_noinline_c.ml

genfft/gen_r2hc_noinline.ml

kernel/square.c

kernel/trig1.c

libbench2/getopt-utils.c

libbench2/getopt.c

libbench2/getopt.h

libbench2/getopt1.c

mkinstalldirs

rdft/codelets/hc2r/mhc2rIII_32.c

rdft/codelets/hc2r/mhc2rIII_64.c

rdft/codelets/hc2r/mhc2r_128.c

rdft/codelets/hc2r/mhc2r_32.c

rdft/codelets/hc2r/mhc2r_64.c

rdft/codelets/r2hc/mr2hcII_32.c

rdft/codelets/r2hc/mr2hcII_64.c

rdft/codelets/r2hc/mr2hc_128.c

rdft/codelets/r2hc/mr2hc_32.c

rdft/codelets/r2hc/mr2hc_64.c

rdft/hc2hc-buf.c

rdft/hc2hc-dif.c

rdft/hc2hc-dit.c

rdft/khc2hc-dif.c

rdft/khc2hc-dit.c

rdft/rader-hc2hc.c

simd/3dnow.c

simd/simd-3dnow.h

simd/sse-aux.c

simd/sse2-aux.c

threads/ct-dit.c

threads/hc2hc-dif.c

threads/hc2hc-dit.c

files modified:
COPYING

COPYRIGHT

ChangeLog

Makefile.am

Makefile.in

NEWS

README

TODO

aclocal.m4

api/Makefile.am

api/Makefile.in

api/api.h

api/apiplan.c

api/configure.c

api/execute-dft-c2r.c

api/execute-dft-r2c.c

api/execute-dft.c

api/execute-r2r.c

api/execute-split-dft-c2r.c

api/execute-split-dft-r2c.c

api/execute-split-dft.c

api/execute.c

api/export-wisdom-to-file.c

api/export-wisdom-to-string.c

api/export-wisdom.c

api/extract-reim.c

api/f77api.c

api/f77funcs.h

api/fftw3.f

api/fftw3.h

api/flops.c

api/forget-wisdom.c

api/import-system-wisdom.c

api/import-wisdom-from-file.c

api/import-wisdom-from-string.c

api/import-wisdom.c

api/map-r2r-kind.c

api/mapflags.c

api/mkprinter-file.c

api/mktensor-iodims.c

api/mktensor-rowmajor.c

api/plan-dft-1d.c

api/plan-dft-2d.c

api/plan-dft-3d.c

api/plan-dft-c2r-1d.c

api/plan-dft-c2r-2d.c

api/plan-dft-c2r-3d.c

api/plan-dft-c2r.c

api/plan-dft-r2c-1d.c

api/plan-dft-r2c-2d.c

api/plan-dft-r2c-3d.c

api/plan-dft-r2c.c

api/plan-dft.c

api/plan-guru-dft-c2r.c

api/plan-guru-dft-r2c.c

api/plan-guru-dft.c

api/plan-guru-r2r.c

api/plan-guru-split-dft-c2r.c

api/plan-guru-split-dft-r2c.c

api/plan-guru-split-dft.c

api/plan-many-dft-c2r.c

api/plan-many-dft-r2c.c

api/plan-many-dft.c

api/plan-many-r2r.c

api/plan-r2r-1d.c

api/plan-r2r-2d.c

api/plan-r2r-3d.c

api/plan-r2r.c

api/print-plan.c

api/rdft2-pad.c

api/the-planner.c

api/version.c

api/x77.h

bootstrap.sh

config.guess

config.h.in

config.sub

configure

configure.ac

debian/changelog

debian/control

debian/fftw3-doc.doc-base.fftw3-faq

debian/fftw3-doc.doc-base.manual

debian/rules

dft/Makefile.am

dft/Makefile.in

dft/buffered.c

dft/codelet-dft.h

dft/codelets/Makefile.am

dft/codelets/Makefile.in

dft/codelets/n.c

dft/codelets/n.h

dft/codelets/standard/Makefile.am

dft/codelets/standard/Makefile.in

dft/codelets/standard/codlist.c

dft/codelets/standard/n1_10.c

dft/codelets/standard/n1_11.c

dft/codelets/standard/n1_12.c

dft/codelets/standard/n1_13.c

dft/codelets/standard/n1_14.c

dft/codelets/standard/n1_15.c

dft/codelets/standard/n1_16.c

dft/codelets/standard/n1_2.c

dft/codelets/standard/n1_3.c

dft/codelets/standard/n1_4.c

dft/codelets/standard/n1_5.c

dft/codelets/standard/n1_6.c

dft/codelets/standard/n1_7.c

dft/codelets/standard/n1_8.c

dft/codelets/standard/n1_9.c

dft/codelets/standard/t1_10.c

dft/codelets/standard/t1_12.c

dft/codelets/standard/t1_15.c

dft/codelets/standard/t1_16.c

dft/codelets/standard/t1_2.c

dft/codelets/standard/t1_3.c

dft/codelets/standard/t1_32.c

dft/codelets/standard/t1_4.c

dft/codelets/standard/t1_5.c

dft/codelets/standard/t1_6.c

dft/codelets/standard/t1_64.c

dft/codelets/standard/t1_7.c

dft/codelets/standard/t1_8.c

dft/codelets/standard/t1_9.c

dft/codelets/standard/t2_16.c

dft/codelets/standard/t2_32.c

dft/codelets/standard/t2_4.c

dft/codelets/standard/t2_64.c

dft/codelets/standard/t2_8.c

dft/codelets/t.c

dft/codelets/t.h

dft/conf.c

dft/ct.c

dft/ct.h

dft/dft.h

dft/direct.c

dft/generic.c

dft/indirect.c

dft/k7/Makefile.in

dft/k7/codelets/Makefile.in

dft/k7/codelets/codlist.c

dft/k7/codelets/f1k7_16.S

dft/k7/codelets/f1k7_2.S

dft/k7/codelets/f1k7_32.S

dft/k7/codelets/f1k7_4.S

dft/k7/codelets/f1k7_64.S

dft/k7/codelets/f1k7_8.S

dft/k7/codelets/f1k7i_16.S

dft/k7/codelets/f1k7i_2.S

dft/k7/codelets/f1k7i_32.S

dft/k7/codelets/f1k7i_4.S

dft/k7/codelets/f1k7i_64.S

dft/k7/codelets/f1k7i_8.S

dft/k7/codelets/n1k7_10.S

dft/k7/codelets/n1k7_11.S

dft/k7/codelets/n1k7_12.S

dft/k7/codelets/n1k7_128.S

dft/k7/codelets/n1k7_13.S

dft/k7/codelets/n1k7_14.S

dft/k7/codelets/n1k7_15.S

dft/k7/codelets/n1k7_16.S

dft/k7/codelets/n1k7_2.S

dft/k7/codelets/n1k7_3.S

dft/k7/codelets/n1k7_32.S

dft/k7/codelets/n1k7_4.S

dft/k7/codelets/n1k7_5.S

dft/k7/codelets/n1k7_6.S

dft/k7/codelets/n1k7_64.S

dft/k7/codelets/n1k7_7.S

dft/k7/codelets/n1k7_8.S

dft/k7/codelets/n1k7_9.S

dft/k7/codelets/n1k7i_10.S

dft/k7/codelets/n1k7i_11.S

dft/k7/codelets/n1k7i_12.S

dft/k7/codelets/n1k7i_128.S

dft/k7/codelets/n1k7i_13.S

dft/k7/codelets/n1k7i_14.S

dft/k7/codelets/n1k7i_15.S

dft/k7/codelets/n1k7i_16.S

dft/k7/codelets/n1k7i_2.S

dft/k7/codelets/n1k7i_3.S

dft/k7/codelets/n1k7i_32.S

dft/k7/codelets/n1k7i_4.S

dft/k7/codelets/n1k7i_5.S

dft/k7/codelets/n1k7i_6.S

dft/k7/codelets/n1k7i_64.S

dft/k7/codelets/n1k7i_7.S

dft/k7/codelets/n1k7i_8.S

dft/k7/codelets/n1k7i_9.S

dft/k7/codelets/t1k7_10.S

dft/k7/codelets/t1k7_12.S

dft/k7/codelets/t1k7_15.S

dft/k7/codelets/t1k7_16.S

dft/k7/codelets/t1k7_2.S

dft/k7/codelets/t1k7_3.S

dft/k7/codelets/t1k7_32.S

dft/k7/codelets/t1k7_4.S

dft/k7/codelets/t1k7_5.S

dft/k7/codelets/t1k7_6.S

dft/k7/codelets/t1k7_64.S

dft/k7/codelets/t1k7_7.S

dft/k7/codelets/t1k7_8.S

dft/k7/codelets/t1k7_9.S

dft/k7/codelets/t1k7i_10.S

dft/k7/codelets/t1k7i_12.S

dft/k7/codelets/t1k7i_15.S

dft/k7/codelets/t1k7i_16.S

dft/k7/codelets/t1k7i_2.S

dft/k7/codelets/t1k7i_3.S

dft/k7/codelets/t1k7i_32.S

dft/k7/codelets/t1k7i_4.S

dft/k7/codelets/t1k7i_5.S

dft/k7/codelets/t1k7i_6.S

dft/k7/codelets/t1k7i_64.S

dft/k7/codelets/t1k7i_7.S

dft/k7/codelets/t1k7i_8.S

dft/k7/codelets/t1k7i_9.S

dft/k7/k7.c

dft/kdft-dif.c

dft/kdft-difsq.c

dft/kdft-dit.c

dft/kdft.c

dft/nop.c

dft/plan.c

dft/problem.c

dft/rader.c

dft/rank-geq2.c

dft/simd/Makefile.am

dft/simd/Makefile.in

dft/simd/codelets/Makefile.am

dft/simd/codelets/Makefile.in

dft/simd/codelets/codlist.c

dft/simd/codelets/n1bv_10.c

dft/simd/codelets/n1bv_11.c

dft/simd/codelets/n1bv_12.c

dft/simd/codelets/n1bv_13.c

dft/simd/codelets/n1bv_14.c

dft/simd/codelets/n1bv_15.c

dft/simd/codelets/n1bv_16.c

dft/simd/codelets/n1bv_2.c

dft/simd/codelets/n1bv_3.c

dft/simd/codelets/n1bv_4.c

dft/simd/codelets/n1bv_5.c

dft/simd/codelets/n1bv_6.c

dft/simd/codelets/n1bv_7.c

dft/simd/codelets/n1bv_8.c

dft/simd/codelets/n1bv_9.c

dft/simd/codelets/n1fv_10.c

dft/simd/codelets/n1fv_11.c

dft/simd/codelets/n1fv_12.c

dft/simd/codelets/n1fv_13.c

dft/simd/codelets/n1fv_14.c

dft/simd/codelets/n1fv_15.c

dft/simd/codelets/n1fv_16.c

dft/simd/codelets/n1fv_2.c

dft/simd/codelets/n1fv_3.c

dft/simd/codelets/n1fv_4.c

dft/simd/codelets/n1fv_5.c

dft/simd/codelets/n1fv_6.c

dft/simd/codelets/n1fv_7.c

dft/simd/codelets/n1fv_8.c

dft/simd/codelets/n1fv_9.c

dft/simd/codelets/n2bv_10.c

dft/simd/codelets/n2bv_12.c

dft/simd/codelets/n2bv_14.c

dft/simd/codelets/n2bv_16.c

dft/simd/codelets/n2bv_2.c

dft/simd/codelets/n2bv_4.c

dft/simd/codelets/n2bv_6.c

dft/simd/codelets/n2bv_8.c

dft/simd/codelets/n2fv_10.c

dft/simd/codelets/n2fv_12.c

dft/simd/codelets/n2fv_14.c

dft/simd/codelets/n2fv_16.c

dft/simd/codelets/n2fv_2.c

dft/simd/codelets/n2fv_4.c

dft/simd/codelets/n2fv_6.c

dft/simd/codelets/n2fv_8.c

dft/simd/codelets/q1bv_2.c

dft/simd/codelets/q1bv_4.c

dft/simd/codelets/q1bv_8.c

dft/simd/codelets/q1fv_2.c

dft/simd/codelets/q1fv_4.c

dft/simd/codelets/q1fv_8.c

dft/simd/codelets/t1bv_10.c

dft/simd/codelets/t1bv_12.c

dft/simd/codelets/t1bv_15.c

dft/simd/codelets/t1bv_16.c

dft/simd/codelets/t1bv_2.c

dft/simd/codelets/t1bv_3.c

dft/simd/codelets/t1bv_32.c

dft/simd/codelets/t1bv_4.c

dft/simd/codelets/t1bv_5.c

dft/simd/codelets/t1bv_6.c

dft/simd/codelets/t1bv_64.c

dft/simd/codelets/t1bv_7.c

dft/simd/codelets/t1bv_8.c

dft/simd/codelets/t1bv_9.c

dft/simd/codelets/t1fv_10.c

dft/simd/codelets/t1fv_12.c

dft/simd/codelets/t1fv_15.c

dft/simd/codelets/t1fv_16.c

dft/simd/codelets/t1fv_2.c

dft/simd/codelets/t1fv_3.c

dft/simd/codelets/t1fv_32.c

dft/simd/codelets/t1fv_4.c

dft/simd/codelets/t1fv_5.c

dft/simd/codelets/t1fv_6.c

dft/simd/codelets/t1fv_64.c

dft/simd/codelets/t1fv_7.c

dft/simd/codelets/t1fv_8.c

dft/simd/codelets/t1fv_9.c

dft/simd/n1b.c

dft/simd/n1b.h

dft/simd/n1f.c

dft/simd/n1f.h

dft/simd/n2b.c

dft/simd/n2b.h

dft/simd/n2f.c

dft/simd/n2f.h

dft/simd/q1b.c

dft/simd/q1b.h

dft/simd/q1f.c

dft/simd/q1f.h

dft/simd/t1b.h

dft/simd/t1f.h

dft/solve.c

dft/vrank-geq1.c

dft/zero.c

doc/FAQ/Makefile.am

doc/FAQ/Makefile.in

doc/FAQ/fftw-faq.ascii

doc/FAQ/fftw-faq.bfnn

doc/FAQ/fftw-faq.html/index.html

doc/FAQ/fftw-faq.html/section1.html

doc/FAQ/fftw-faq.html/section2.html

doc/FAQ/fftw-faq.html/section3.html

doc/FAQ/fftw-faq.html/section4.html

doc/FAQ/fftw-faq.html/section5.html

doc/FAQ/html.refs

doc/Makefile.in

doc/f77_wisdom.f

doc/fftw3.info

doc/fftw3.pdf

doc/fftw3.texi

doc/html/Acknowledgments.html

doc/html/Advanced-Complex-DFTs.html

doc/html/Advanced-Interface.html

doc/html/Basic-Interface.html

doc/html/Calling-FFTW-from-Fortran.html

doc/html/Caveats-in-Using-Wisdom.html

doc/html/Complex-DFTs.html

doc/html/Complex-numbers.html

doc/html/Concept-Index.html

doc/html/Cycle-Counters.html

doc/html/Data-Alignment.html

doc/html/Data-Types-and-Files.html

doc/html/Dynamic-Arrays-in-C.html

doc/html/FFTW-Constants-in-Fortran.html

doc/html/FFTW-Reference.html

doc/html/Forgetting-Wisdom.html

doc/html/Fortran-Examples.html

doc/html/Generating-your-own-code.html

doc/html/Guru-Complex-DFTs.html

doc/html/Guru-Execution-of-Plans.html

doc/html/Guru-Interface.html

doc/html/Guru-vector-and-transform-sizes.html

doc/html/Installation-and-Customization.html

doc/html/Installation-on-Unix.html

doc/html/Interleaved-and-split-arrays.html

doc/html/Introduction.html

doc/html/Library-Index.html

doc/html/License-and-Copyright.html

doc/html/Memory-Allocation.html

doc/html/More-DFTs-of-Real-Data.html

doc/html/Other-Important-Topics.html

doc/html/Parallel-FFTW.html

doc/html/Planner-Flags.html

doc/html/Precision.html

doc/html/Stack-alignment-on-x86.html

doc/html/The-Discrete-Hartley-Transform.html

doc/html/Thread-safety.html

doc/html/Tutorial.html

doc/html/Upgrading-from-FFTW-version-2.html

doc/html/Using-Plans.html

doc/html/What-FFTW-Really-Computes.html

doc/html/Wisdom-Export.html

doc/html/Wisdom-Import.html

doc/html/Wisdom-Utilities.html

doc/html/Wisdom.html

doc/html/index.html

doc/html/rfftwnd.png

doc/mdate-sh

doc/rfftwnd.eps

doc/stamp-vti

doc/texinfo.tex

doc/version.texi

genfft-k7/.depend

genfft-k7/Makefile.in

genfft-k7/algsimp.ml

genfft-k7/algsimp.mli

genfft-k7/assoctable.ml

genfft-k7/assoctable.mli

genfft-k7/complex.ml

genfft-k7/complex.mli

genfft-k7/expr.ml

genfft-k7/expr.mli

genfft-k7/fft.ml

genfft-k7/gen_notw.ml

genfft-k7/gen_twiddle.ml

genfft-k7/littlesimp.ml

genfft-k7/littlesimp.mli

genfft-k7/monads.ml

genfft-k7/number.ml

genfft-k7/number.mli

genfft-k7/oracle.ml

genfft-k7/oracle.mli

genfft-k7/to_alist.ml

genfft-k7/to_alist.mli

genfft-k7/twiddle.ml

genfft-k7/twiddle.mli

genfft-k7/vScheduler.mli

genfft/.depend

genfft/Makefile.am

genfft/Makefile.in

genfft/algsimp.ml

genfft/algsimp.mli

genfft/annotate.ml

genfft/annotate.mli

genfft/assoctable.ml

genfft/assoctable.mli

genfft/c.ml

genfft/c.mli

genfft/complex.ml

genfft/complex.mli

genfft/conv.ml

genfft/conv.mli

genfft/dag.ml

genfft/dag.mli

genfft/expr.ml

genfft/expr.mli

genfft/fft.ml

genfft/fft.mli

genfft/gen_athnotw.ml

genfft/gen_athtw.ml

genfft/gen_conv.ml

genfft/gen_hc2hc.ml

genfft/gen_hc2r.ml

genfft/gen_notw.ml

genfft/gen_notw_c.ml

genfft/gen_r2hc.ml

genfft/gen_r2r.ml

genfft/gen_twiddle.ml

genfft/gen_twiddle_c.ml

genfft/gen_twidsq.ml

genfft/gen_twidsq_c.ml

genfft/genutil.ml

genfft/littlesimp.ml

genfft/littlesimp.mli

genfft/magic.ml

genfft/monads.ml

genfft/number.ml

genfft/number.mli

genfft/oracle.ml

genfft/oracle.mli

genfft/schedule.ml

genfft/schedule.mli

genfft/simd.ml

genfft/simd.mli

genfft/simdmagic.ml

genfft/to_alist.ml

genfft/to_alist.mli

genfft/trig.ml

genfft/trig.mli

genfft/twiddle.ml

genfft/twiddle.mli

genfft/unique.ml

genfft/unique.mli

genfft/util.ml

genfft/util.mli

genfft/variable.ml

genfft/variable.mli

kernel/Makefile.am

kernel/Makefile.in

kernel/align.c

kernel/alloc.c

kernel/assert.c

kernel/awake.c

kernel/buffered.c

kernel/ct.c

kernel/cycle.h

kernel/debug.c

kernel/hash.c

kernel/iabs.c

kernel/ifftw.h

kernel/md5-1.c

kernel/md5.c

kernel/minmax.c

kernel/ops.c

kernel/pickdim.c

kernel/plan.c

kernel/planner.c

kernel/primes.c

kernel/print.c

kernel/problem.c

kernel/rader.c

kernel/scan.c

kernel/solver.c

kernel/solvtab.c

kernel/stride.c

kernel/tensor.c

kernel/tensor1.c

kernel/tensor2.c

kernel/tensor4.c

kernel/tensor5.c

kernel/tensor7.c

kernel/tensor8.c

kernel/tensor9.c

kernel/timer.c

kernel/transpose.c

kernel/trig.c

kernel/twiddle.c

libbench2/Makefile.am

libbench2/Makefile.in

libbench2/aligned-main.c

libbench2/allocate.c

libbench2/bench-main.c

libbench2/bench-user.h

libbench2/bench.h

libbench2/can-do.c

libbench2/dotens2.c

libbench2/info.c

libbench2/main.c

libbench2/mflops.c

libbench2/mp.c

libbench2/ovtpvt.c

libbench2/problem.c

libbench2/report.c

libbench2/speed.c

libbench2/tensor.c

libbench2/timer.c

libbench2/useropt.c

libbench2/verify-dft.c

libbench2/verify-lib.c

libbench2/verify-r2r.c

libbench2/verify-rdft2.c

libbench2/verify.c

libbench2/verify.h

libbench2/zero.c

ltmain.sh

rdft/Makefile.am

rdft/Makefile.in

rdft/buffered.c

rdft/buffered2.c

rdft/codelet-rdft.h

rdft/codelets/Makefile.in

rdft/codelets/hb.h

rdft/codelets/hc2r.c

rdft/codelets/hc2r.h

rdft/codelets/hc2r/Makefile.am

rdft/codelets/hc2r/Makefile.in

rdft/codelets/hc2r/codlist.c

rdft/codelets/hc2r/hb_10.c

rdft/codelets/hc2r/hb_12.c

rdft/codelets/hc2r/hb_15.c

rdft/codelets/hc2r/hb_16.c

rdft/codelets/hc2r/hb_2.c

rdft/codelets/hc2r/hb_3.c

rdft/codelets/hc2r/hb_32.c

rdft/codelets/hc2r/hb_4.c

rdft/codelets/hc2r/hb_5.c

rdft/codelets/hc2r/hb_6.c

rdft/codelets/hc2r/hb_64.c

rdft/codelets/hc2r/hb_7.c

rdft/codelets/hc2r/hb_8.c

rdft/codelets/hc2r/hb_9.c

rdft/codelets/hc2r/hc2rIII_10.c

rdft/codelets/hc2r/hc2rIII_12.c

rdft/codelets/hc2r/hc2rIII_15.c

rdft/codelets/hc2r/hc2rIII_16.c

rdft/codelets/hc2r/hc2rIII_2.c

rdft/codelets/hc2r/hc2rIII_3.c

rdft/codelets/hc2r/hc2rIII_32.c

rdft/codelets/hc2r/hc2rIII_4.c

rdft/codelets/hc2r/hc2rIII_5.c

rdft/codelets/hc2r/hc2rIII_6.c

rdft/codelets/hc2r/hc2rIII_7.c

rdft/codelets/hc2r/hc2rIII_8.c

rdft/codelets/hc2r/hc2rIII_9.c

rdft/codelets/hc2r/hc2r_10.c

rdft/codelets/hc2r/hc2r_11.c

rdft/codelets/hc2r/hc2r_12.c

rdft/codelets/hc2r/hc2r_13.c

rdft/codelets/hc2r/hc2r_14.c

rdft/codelets/hc2r/hc2r_15.c

rdft/codelets/hc2r/hc2r_16.c

rdft/codelets/hc2r/hc2r_3.c

rdft/codelets/hc2r/hc2r_32.c

rdft/codelets/hc2r/hc2r_4.c

rdft/codelets/hc2r/hc2r_5.c

rdft/codelets/hc2r/hc2r_6.c

rdft/codelets/hc2r/hc2r_7.c

rdft/codelets/hc2r/hc2r_8.c

rdft/codelets/hc2r/hc2r_9.c

rdft/codelets/hc2rIII.h

rdft/codelets/hf.h

rdft/codelets/hfb.c

rdft/codelets/r2hc.c

rdft/codelets/r2hc.h

rdft/codelets/r2hc/Makefile.am

rdft/codelets/r2hc/Makefile.in

rdft/codelets/r2hc/codlist.c

rdft/codelets/r2hc/hf2_16.c

rdft/codelets/r2hc/hf2_32.c

rdft/codelets/r2hc/hf2_4.c

rdft/codelets/r2hc/hf2_64.c

rdft/codelets/r2hc/hf2_8.c

rdft/codelets/r2hc/hf_10.c

rdft/codelets/r2hc/hf_12.c

rdft/codelets/r2hc/hf_15.c

rdft/codelets/r2hc/hf_16.c

rdft/codelets/r2hc/hf_2.c

rdft/codelets/r2hc/hf_3.c

rdft/codelets/r2hc/hf_32.c

rdft/codelets/r2hc/hf_4.c

rdft/codelets/r2hc/hf_5.c

rdft/codelets/r2hc/hf_6.c

rdft/codelets/r2hc/hf_64.c

rdft/codelets/r2hc/hf_7.c

rdft/codelets/r2hc/hf_8.c

rdft/codelets/r2hc/hf_9.c

rdft/codelets/r2hc/r2hcII_10.c

rdft/codelets/r2hc/r2hcII_12.c

rdft/codelets/r2hc/r2hcII_15.c

rdft/codelets/r2hc/r2hcII_16.c

rdft/codelets/r2hc/r2hcII_2.c

rdft/codelets/r2hc/r2hcII_3.c

rdft/codelets/r2hc/r2hcII_32.c

rdft/codelets/r2hc/r2hcII_4.c

rdft/codelets/r2hc/r2hcII_5.c

rdft/codelets/r2hc/r2hcII_6.c

rdft/codelets/r2hc/r2hcII_7.c

rdft/codelets/r2hc/r2hcII_8.c

rdft/codelets/r2hc/r2hcII_9.c

rdft/codelets/r2hc/r2hc_10.c

rdft/codelets/r2hc/r2hc_11.c

rdft/codelets/r2hc/r2hc_12.c

rdft/codelets/r2hc/r2hc_13.c

rdft/codelets/r2hc/r2hc_14.c

rdft/codelets/r2hc/r2hc_15.c

rdft/codelets/r2hc/r2hc_16.c

rdft/codelets/r2hc/r2hc_2.c

rdft/codelets/r2hc/r2hc_3.c

rdft/codelets/r2hc/r2hc_32.c

rdft/codelets/r2hc/r2hc_4.c

rdft/codelets/r2hc/r2hc_5.c

rdft/codelets/r2hc/r2hc_6.c

rdft/codelets/r2hc/r2hc_7.c

rdft/codelets/r2hc/r2hc_8.c

rdft/codelets/r2hc/r2hc_9.c

rdft/codelets/r2hcII.h

rdft/codelets/r2r.c

rdft/codelets/r2r.h

rdft/codelets/r2r/Makefile.am

rdft/codelets/r2r/Makefile.in

rdft/codelets/r2r/e01_8.c

rdft/codelets/r2r/e10_8.c

rdft/conf.c

rdft/dft-r2hc.c

rdft/dht-r2hc.c

rdft/dht-rader.c

rdft/direct.c

rdft/direct2.c

rdft/generic.c

rdft/hc2hc.c

rdft/hc2hc.h

rdft/indirect.c

rdft/khc2r.c

rdft/kr2hc.c

rdft/kr2r.c

rdft/nop.c

rdft/nop2.c

rdft/plan.c

rdft/plan2.c

rdft/problem.c

rdft/problem2.c

rdft/rank-geq2-rdft2.c

rdft/rank-geq2.c

rdft/rank0-rdft2.c

rdft/rank0.c

rdft/rdft-dht.c

rdft/rdft.h

rdft/rdft2-inplace-strides.c

rdft/rdft2-radix2.c

rdft/rdft2-strides.c

rdft/rdft2-tensor-max-index.c

rdft/solve.c

rdft/solve2.c

rdft/vrank-geq1-rdft2.c

rdft/vrank-geq1.c

reodft/Makefile.am

reodft/Makefile.in

reodft/conf.c

reodft/redft00e-r2hc-pad.c

reodft/redft00e-r2hc.c

reodft/reodft.h

reodft/reodft010e-r2hc.c

reodft/reodft11e-r2hc-odd.c

reodft/reodft11e-r2hc.c

reodft/reodft11e-radix2.c

reodft/rodft00e-r2hc-pad.c

reodft/rodft00e-r2hc.c

simd/Makefile.am

simd/Makefile.in

simd/altivec.c

simd/simd-altivec.h

simd/simd-sse.h

simd/simd-sse2.h

simd/simd.h

simd/sse.c

simd/sse2.c

simd/taint.c

support/Makefile.am

support/Makefile.codelets

support/Makefile.in

tests/Makefile.am

tests/Makefile.in

tests/bench.c

tests/check.pl

tests/hook.c

threads/Makefile.am

threads/Makefile.in

threads/api.c

threads/conf.c

threads/dft-vrank-geq1.c

threads/f77api.c

threads/f77funcs.h

threads/rdft-vrank-geq1.c

threads/threads.c

threads/threads.h

threads/vrank-geq1-rdft2.c

tools/Makefile.am

tools/Makefile.in

tools/fftw-wisdom-to-conf.1

tools/fftw-wisdom-to-conf.in

tools/fftw-wisdom.c

tools/fftw_wisdom.1.in

tools/fftwf-wisdom.1

doc/FAQ/fftw-faq.ascii

  1   1   FFTW FREQUENTLY ASKED QUESTIONS WITH ANSWERS
  2     - 05 Jul 2003
      2 + 07 Mar 2006
  3   3   Matteo Frigo
  4   4   Steven G. Johnson
  5   5   <fftw@fftw.org>

 12  12
 13  13   Index
 14  14
     15 + Section 1. Introduction and General Information
     16 + Q1.1 What is FFTW?
     17 + Q1.2 How do I obtain FFTW?
     18 + Q1.3 Is FFTW free software?
     19 + Q1.4 What is this about non-free licenses?
     20 + Q1.5 In the West? I thought MIT was in the East?
     21 +
     22 + Section 2. Installing FFTW
     23 + Q2.1 Which systems does FFTW run on?
     24 + Q2.2 Does FFTW run on Windows?
     25 + Q2.3 My compiler has trouble with FFTW.
     26 + Q2.4 FFTW does not compile on Solaris, complaining about const.
     27 + Q2.5 What's the difference between --enable-3dnow and --enable-k7?
     28 + Q2.6 What's the difference between the fma and the non-fma versions?
     29 + Q2.7 Which language is FFTW written in?
     30 + Q2.8 Can I call FFTW from Fortran?
     31 + Q2.9 Can I call FFTW from C++?
     32 + Q2.10 Why isn't FFTW written in Fortran/C++?
     33 + Q2.11 How do I compile FFTW to run in single precision?
     34 +
     35 + Section 3. Using FFTW
     36 + Q3.1 Why not support the FFTW 2 interface in FFTW 3?
     37 + Q3.2 Why do FFTW 3 plans encapsulate the input/output arrays and not ju
     38 + Q3.3 FFTW seems really slow.
     39 + Q3.4 FFTW slows down after repeated calls.
     40 + Q3.5 An FFTW routine is crashing when I call it.
     41 + Q3.6 My Fortran program crashes when calling FFTW.
     42 + Q3.7 FFTW gives results different from my old FFT.
     43 + Q3.8 Can I save FFTW's plans?
     44 + Q3.9 Why does your inverse transform return a scaled result?
     45 + Q3.10 How can I make FFTW put the origin (zero frequency) at the center
     46 + Q3.11 How do I FFT an image/audio file in *foobar* format?
     47 + Q3.12 My program does not link (on Unix).
     48 + Q3.13 I included your header, but linking still fails.
     49 + Q3.14 My program crashes, complaining about stack space.
     50 + Q3.15 FFTW seems to have a memory leak.
     51 + Q3.16 The output of FFTW's transform is all zeros.
     52 + Q3.17 How do I call FFTW from the Microsoft language du jour?
     53 + Q3.18 Can I compute only a subset of the DFT outputs?
     54 +
     55 + Section 4. Internals of FFTW
     56 + Q4.1 How does FFTW work?
     57 + Q4.2 Why is FFTW so fast?
     58 +
     59 + Section 5. Known bugs
     60 + Q5.1 FFTW 1.1 crashes in rfftwnd on Linux.
     61 + Q5.2 The MPI transforms in FFTW 1.2 give incorrect results/leak memory.
     62 + Q5.3 The test programs in FFTW 1.2.1 fail when I change FFTW to use sin
     63 + Q5.4 The test program in FFTW 1.2.1 fails for n > 46340.
     64 + Q5.5 The threaded code fails on Linux Redhat 5.0
     65 + Q5.6 FFTW 2.0's rfftwnd fails for rank > 1 transforms with a final dime
     66 + Q5.7 FFTW 2.0's complex transforms give the wrong results with prime fa
     67 + Q5.8 FFTW 2.1.1's MPI test programs crash with MPICH.
     68 + Q5.9 FFTW 2.1.2's multi-threaded transforms don't work on AIX.
     69 + Q5.10 FFTW 2.1.2's complex transforms give incorrect results for large p
     70 + Q5.11 FFTW 2.1.3's multi-threaded transforms don't give any speedup on S
     71 + Q5.12 FFTW 2.1.3 crashes on AIX.
 15  72
 16  73   ===============================================================================
 17  74
 18  75   Section 1. Introduction and General Information
 19  76
     77 + Q1.1 What is FFTW?
     78 + Q1.2 How do I obtain FFTW?
     79 + Q1.3 Is FFTW free software?
     80 + Q1.4 What is this about non-free licenses?
     81 + Q1.5 In the West? I thought MIT was in the East?
 20  82
 21  83   -------------------------------------------------------------------------------
 22  84

 67 129   would neither affect their licensing revenue nor irritate existing
 68 130   licensees.
 69 131
    132 + -------------------------------------------------------------------------------
    133 +
    134 + Question 1.5. In the West? I thought MIT was in the East?
    135 +
    136 + Not to an Italian. You could say that we're a Spaghetti Western (with
    137 + apologies to Sergio Leone).
    138 +
 70 139   ===============================================================================
 71 140
 72 141   Section 2. Installing FFTW
 73 142
    143 + Q2.1 Which systems does FFTW run on?
    144 + Q2.2 Does FFTW run on Windows?
    145 + Q2.3 My compiler has trouble with FFTW.
    146 + Q2.4 FFTW does not compile on Solaris, complaining about const.
    147 + Q2.5 What's the difference between --enable-3dnow and --enable-k7?
    148 + Q2.6 What's the difference between the fma and the non-fma versions?
    149 + Q2.7 Which language is FFTW written in?
    150 + Q2.8 Can I call FFTW from Fortran?
    151 + Q2.9 Can I call FFTW from C++?
    152 + Q2.10 Why isn't FFTW written in Fortran/C++?
    153 + Q2.11 How do I compile FFTW to run in single precision?
 74 154
 75 155   -------------------------------------------------------------------------------
 76 156
 77 157   Question 2.1. Which systems does FFTW run on?
 78 158
 79 159   FFTW is written in ANSI C, and should work on any system with a decent C
 80     - compiler. (See also pageref:runOnWindows::' and
 81     - pageref:compilerCrashes::'.) FFTW can also take advantage of certain
    160 + compiler. (See also Q2.2 `Does FFTW run on Windows?', Q2.3 `My compiler
    161 + has trouble with FFTW.'.) FFTW can also take advantage of certain
 82 162   hardware-specific features, such as cycle counters and SIMD instructions,
 83 163   but this is optional.
 84 164

86

166

87

167

Question 2.2. Does FFTW run on Windows?

88

168

89

It should. FFTW was not developed on Windows, but the source code is

90

essentially straight ANSI C. Many users have reported using FFTW 2 in the

91

past on Windows with various compilers; we are currently awaiting reports

92

for FFTW 3. See also the FFTW Windows installation notes and

93

pageref:compilerCrashes::'

169

Yes, many people have reported successfully using FFTW on Windows with

170

various compilers. FFTW was not developed on Windows, but the source code

171

is essentially straight ANSI C. See also the FFTW Windows installation

172

notes, Q2.3 `My compiler has trouble with FFTW.', and Q3.17 `How do I call

173

FFTW from the Microsoft language du jour?'.

94

174

95

175

-------------------------------------------------------------------------------

96

176

98

178

99

179

Complain fiercely to the vendor of the compiler.

100

180

101

FFTW is likely to push compilers to their limits. We have successfully

102

used gcc 3.2.x on x86 and PPC, a recent Compaq C compiler for Alpha,

103

version 6 of IBM's xlc compiler for AIX, Intel's icc versions 5-7, and Sun

104

WorkShop cc version 6. Several compiler bugs have been exposed by FFTW,

105

however. A partial list follows.

181

We have successfully used gcc 3.2.x on x86 and PPC, a recent Compaq C

182

compiler for Alpha, version 6 of IBM's xlc compiler for AIX, Intel's icc

183

versions 5-7, and Sun WorkShop cc version 6.

184

185

FFTW is likely to push compilers to their limits, however, and several

186

compiler bugs have been exposed by FFTW. A partial list follows.

106

187

107

188

gcc 2.95.x for Solaris/SPARC produces incorrect code for the test program

108

189

(workaround: recompile the libbench2 directory with -O2).

110

191

NetBSD/macppc 1.6 comes with a gcc version that also miscompiles the test

111

192

program. (Please report a workaround if you know one.)

112

193

113

gcc 3.2.3 for ARM reportedly crashes during compilation. (Please report a

114

workaround if you know one.)

194

gcc 3.2.3 for ARM reportedly crashes during compilation. This bug is

195

reportedly fixed in later versions of gcc.

115

196

116

Intel's icc-7.1 compiler build 20030402Z appears to produce incorrect

197

Versions 8.0 and 8.1 of Intel's icc falsely claim to be gcc, so you should

198

specify CC="icc -no-gcc"; this is automatic in FFTW 3.1. icc-8.0.066

199

reportedly produces incorrect code for FFTW 2.1.5, but is fixed in version

200

8.1. icc-7.1 compiler build 20030402Z appears to produce incorrect

117

201

dependencies, causing the compilation to fail. icc-7.1 build 20030307Z

118

202

appears to work fine. (Use icc -V to check which build you have.) As of

119

203

2003/04/18, build 20030402Z appears not to be available any longer on

127

211

If support for SIMD instructions is enabled in FFTW, further compiler

128

212

problems may appear:

129

213

214

gcc 3.4.[0123] for x86 produces incorrect SSE2 code for FFTW when -O2 (the

215

best choice for FFTW) is used, causing FFTW to crash (make check crashes).

216

This bug is fixed in gcc 3.4.4.

217

130

218

gcc-3.2 for x86 produces incorrect SIMD code if -O3 is used. The same

131

219

compiler produces incorrect SIMD code if no optimization is used, too.

132

220

When using gcc-3.2, it is a good idea not to change the default CFLAGS

134

222

135

223

Some 3.0.x and 3.1.x versions of gcc on x86 may crash. gcc so-called 2.96

136

224

shipping with RedHat 7.3 crashes when compiling SIMD code. In both cases,

137

please upgrade to gcc-3.2.

138

139

Intel's icc 6.0 misaligns SSE constants, but FFTW has a workaround.

225

please upgrade to gcc-3.2 or later.

226

227

Intel's icc 6.0 misaligns SSE constants, but FFTW has a workaround. icc

228

8.x fails to compile FFTW 3.0.x because it falsely claims to be gcc; we

229

believe this to be a bug in icc, but FFTW 3.1 has a workaround.

230

231

Visual C++ 2003 reportedly produces incorrect code for SSE/SSE2 when

232

compiling FFTW. This bug was reportedly fixed in VC++ 2005;

233

alternatively, you could switch to the Intel compiler. VC++ 6.0 also

234

reportedly produces incorrect code for the file reodft11e-r2hc-odd.c

235

unless optimizations are disabled for that file.

140

236

141

237

gcc 2.95 on MacOS X miscompiles AltiVec code (fixed in later versions).

142

238

gcc 3.2.x miscompiles AltiVec permutations, but FFTW has a workaround.

180

276

181

277

--enable-3dnow enables generic 3DNow! support using gcc builtin functions.

182

278

This works on earlier AMD processors, but it is not as fast as our special

183

assembly routines.

279

assembly routines. As of fftw-3.1, --enable-3dnow is no longer supported.

184

280

185

281

-------------------------------------------------------------------------------

186

282

188

284

189

285

The fma version tries to exploit the fused multiply-add instructions

190

286

implemented in many processors such as PowerPC, ia-64, and MIPS. The two

191

FFTW packages are otherwise identical.

192

193

Definitely use the fma version if you have a PowerPC-based system with

194

gcc. This includes all GNU/Linux systems for PowerPC and all MacOS X

195

systems.

287

FFTW packages are otherwise identical. In FFTW 3.1, the fma and non-fma

288

versions were merged together into a single package, and the configure

289

script attempts to automatically guess which version to use.

290

291

The FFTW 3.1 configure script enables fma by default on PowerPC, Itanium,

292

and PA-RISC, and disables it otherwise. You can force one or the other by

293

using the --enable-fma or --disable-fma flag for configure.

294

295

Definitely use fma if you have a PowerPC-based system with gcc (or IBM

296

xlc). This includes all GNU/Linux systems for PowerPC and all MacOS X

297

systems. Also use it on PA-RISC and Itanium with the HP/UX compiler.

196

298

197

299

Definitely do not use the fma version if you have an ia-32 processor

198

300

(Intel, AMD, etcetera).

199

301

200

On other architectures, the situation is not so clear. For example, ia-64

201

has the fma instruction, but gcc-3.2 appears not to exploit it correctly.

202

Other compilers may do the right thing, but we have not tried them.

203

Please send us your feedback so that we can update this FAQ entry.

302

For other architectures/compilers, the situation is not so clear. For

303

example, ia-64 has the fma instruction, but gcc-3.2 appears not to exploit

304

it correctly. Other compilers may do the right thing, but we have not

305

tried them. Please send us your feedback so that we can update this FAQ

306

entry.

204

307

205

308

-------------------------------------------------------------------------------

206

309

260

363

261

364

Section 3. Using FFTW

262

365

366

Q3.1 Why not support the FFTW 2 interface in FFTW 3?

367

Q3.2 Why do FFTW 3 plans encapsulate the input/output arrays and not ju

368

Q3.3 FFTW seems really slow.

369

Q3.4 FFTW slows down after repeated calls.

370

Q3.5 An FFTW routine is crashing when I call it.

371

Q3.6 My Fortran program crashes when calling FFTW.

372

Q3.7 FFTW gives results different from my old FFT.

373

Q3.8 Can I save FFTW's plans?

374

Q3.9 Why does your inverse transform return a scaled result?

375

Q3.10 How can I make FFTW put the origin (zero frequency) at the center

376

Q3.11 How do I FFT an image/audio file in *foobar* format?

377

Q3.12 My program does not link (on Unix).

378

Q3.13 I included your header, but linking still fails.

379

Q3.14 My program crashes, complaining about stack space.

380

Q3.15 FFTW seems to have a memory leak.

381

Q3.16 The output of FFTW's transform is all zeros.

382

Q3.17 How do I call FFTW from the Microsoft language du jour?

383

Q3.18 Can I compute only a subset of the DFT outputs?

263

384

264

385

-------------------------------------------------------------------------------

265

386

308

429

is significant, you have two options. First, you can use the

309

430

FFTW_ESTIMATE option in the planner, which uses heuristics instead of

310

431

runtime measurements and produces a good plan in a short time. Second,

311

you can use the wisdom feature to precompute the plan; see

312

pageref:savePlans::'

432

you can use the wisdom feature to precompute the plan; see Q3.8 `Can I

433

save FFTW's plans?'

313

434

314

435

-------------------------------------------------------------------------------

315

436

329

450

own code. For example, you could be passing invalid arguments (such as

330

451

wrongly-sized arrays) to FFTW, or you could simply have memory corruption

331

452

elsewhere in your program that causes random crashes later on. Please

332

don't complain to us unless you can come up with a minimal program

333

(preferably under 30 lines) that illustrates the problem.

453

don't complain to us unless you can come up with a minimal self-contained

454

program (preferably under 30 lines) that illustrates the problem.

334

455

335

456

-------------------------------------------------------------------------------

336

457

352

473

353

474

You should also know that we compute an unnormalized transform. In

354

475

contrast, Matlab is an example of a program that computes a normalized

355

transform. See pageref:whyscaled::'.

476

transform. See Q3.9 `Why does your inverse transform return a scaled

477

result?'.
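As a rough illustration of what "unnormalized" means here, the sketch
below uses a naive DFT written in Python rather than FFTW itself; the
names dft and round_trip are made up for this example. A forward
transform followed by a backward transform returns the input multiplied
by n, so the caller has to divide by n to recover the original data.

```python
import cmath

def dft(x, sign):
    """Naive O(n^2) unnormalized DFT: sign=-1 is forward, sign=+1 is backward."""
    n = len(x)
    return [sum(x[j] * cmath.exp(sign * 2j * cmath.pi * j * k / n)
                for j in range(n))
            for k in range(n)]

def round_trip(x):
    """Forward transform followed by backward transform, with no scaling."""
    return dft(dft(x, -1), +1)

if __name__ == "__main__":
    x = [1.0, 2.0, 0.5, -3.0]
    y = round_trip(x)
    # Each y[i] is approximately len(x) * x[i]; dividing by len(x)
    # recovers the input up to floating-point rounding.
    print([v / len(x) for v in y])
```

A program that normalizes its inverse transform performs that division
for you, which is one common reason two FFT packages appear to disagree
by a constant factor.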

356

478

357

479

Finally, note that floating-point arithmetic is not exact, so different

358

480

FFT algorithms will give slightly different results (on the order of the

430

552

431

553

-------------------------------------------------------------------------------

432

554

433

Question 3.15. FFTW seems to have a memory leak

555

Question 3.15. FFTW seems to have a memory leak.

434

556

435

557

After you create a plan, FFTW caches the information required to quickly

436

recreate the plan. (See pageref:savePlans::') It also maintains a small

437

amount of other persistent memory. You can deallocate all of FFTW's

438

internally allocated memory, if you wish, by calling fftw_cleanup(), as

439

documented in the manual.

558

recreate the plan. (See Q3.8 `Can I save FFTW's plans?') It also

559

maintains a small amount of other persistent memory. You can deallocate

560

all of FFTW's internally allocated memory, if you wish, by calling

561

fftw_cleanup(), as documented in the manual.

562

563

-------------------------------------------------------------------------------

564

565

Question 3.16. The output of FFTW's transform is all zeros.

566

567

You should initialize your input array *after* creating the plan, unless

568

you use FFTW_ESTIMATE: planning with FFTW_MEASURE or FFTW_PATIENT

569

overwrites the input/output arrays, as described in the manual.

570

571

-------------------------------------------------------------------------------

572

573

Question 3.17. How do I call FFTW from the Microsoft language du jour?

574

575

Please *do not* ask us Windows-specific questions. We do not use Windows.

576

We know nothing about Visual Basic, Visual C++, or .NET. Please find the

577

appropriate Usenet discussion group and ask your question there. See also

578

Q2.2 `Does FFTW run on Windows?'.

579

580

-------------------------------------------------------------------------------

581

582

Question 3.18. Can I compute only a subset of the DFT outputs?

583

584

In general, no, an FFT intrinsically computes all outputs from all inputs.

585

In principle, there is something called a *pruned FFT* that can do what

586

you want, but to compute K outputs out of N the complexity is in general

587

O(N log K) instead of O(N log N), thus saving only a small additive factor

588

in the log. (The same argument holds if you instead have only K nonzero

589

inputs.)

590

591

There are some specific cases in which you can get the O(N log K)

592

performance benefits easily, however, by combining a few ordinary FFTs.

593

In particular, the case where you want the first K outputs, where K

594

divides N, can be handled by performing N/K transforms of size K and then

595

summing the outputs multiplied by appropriate phase factors. For more

596

details, see pruned FFTs with FFTW.
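The decomposition described above can be sketched as follows. This is
only an illustration in Python, with a naive DFT standing in for the
size-K FFTs that FFTW would compute; the names dft and first_k_outputs
are hypothetical. Writing M = N/K and splitting the input index as
n = M*m + r, the first K outputs are a sum over r of the size-K
transform of the subsequence x[r], x[r+M], x[r+2M], ..., each multiplied
by the phase factor exp(-2*pi*i*r*k/N).

```python
import cmath

def dft(x):
    """Naive unnormalized forward DFT; stands in for an FFT of size len(x)."""
    n = len(x)
    return [sum(x[j] * cmath.exp(-2j * cmath.pi * j * k / n)
                for j in range(n))
            for k in range(n)]

def first_k_outputs(x, k_out):
    """First k_out outputs of the DFT of x, where k_out divides len(x)."""
    n = len(x)
    if n % k_out != 0:
        raise ValueError("k_out must divide len(x)")
    m = n // k_out                      # number of size-k_out transforms
    out = [0j] * k_out
    for r in range(m):
        sub = dft(x[r::m])              # transform of every m-th sample
        for k in range(k_out):
            # Phase factor that accounts for the offset r of this subsequence.
            out[k] += cmath.exp(-2j * cmath.pi * r * k / n) * sub[k]
    return out

if __name__ == "__main__":
    x = [float(i * i % 7) for i in range(12)]
    print(first_k_outputs(x, 4))
```

With real FFTs of size K in place of dft, this costs N/K transforms of
O(K log K) each plus O(N) work for the phase factors, which is where the
O(N log K) figure comes from.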

597

598

There are also some algorithms that compute pruned transforms

599

*approximately*, but they are beyond the scope of this FAQ.

440

600

441

601

===============================================================================

442

602

443

603

Section 4. Internals of FFTW

444

604

605

Q4.1 How does FFTW work?

606

Q4.2 Why is FFTW so fast?

445

607

446

608

-------------------------------------------------------------------------------

447

609

469

631

FFTW's speed.

470

632

471

633

* FFTW uses a variety of FFT algorithms and implementation styles that

472

can be arbitrarily composed to adapt itself to a machine. See

473

pageref:howworks::'.

634

can be arbitrarily composed to adapt itself to a machine. See Q4.1 `How

635

does FFTW work?'.

474

636

* FFTW uses a code generator to produce highly-optimized routines for

475

637

computing small transforms.

476

638

* FFTW uses explicit divide-and-conquer to take advantage of the memory

485

647

486

648

Section 5. Known bugs

487

649

488

489

-------------------------------------------------------------------------------

490

491

Question 5.1. FFTW 1.1 crashes in rfftwnd on Linux.

492

493

This bug was fixed in FFTW 1.2. There was a bug in rfftwnd causing an

494

incorrect amount of memory to be allocated. The bug showed up in Linux

495

with libc-5.3.12 (and nowhere else that we know of).

496

497

-------------------------------------------------------------------------------

498

499

Question 5.2. The MPI transforms in FFTW 1.2 give incorrect results/leak memory.

500

501

These bugs were corrected in FFTW 1.2.1. The MPI transforms (really, just

502

the transpose routines) in FFTW 1.2 had bugs that could cause errors in

503

some situations.

504

505

-------------------------------------------------------------------------------

506

507

Question 5.3. The test programs in FFTW 1.2.1 fail when I change FFTW to use single precision.

508

509

This bug was fixed in FFTW 1.3. (Older versions of FFTW did work in

510

single precision, but the test programs didn't--the error tolerances in

511

the tests were set for double precision.)

512

513

-------------------------------------------------------------------------------

514

515

Question 5.4. The test program in FFTW 1.2.1 fails for n > 46340.

516

517

This bug was fixed in FFTW 1.3. FFTW 1.2.1 produced the right answer, but

518

the test program was wrong. For large n, n*n in the naive transform that

519

we used for comparison overflows 32 bit integer precision, breaking the

520

test.
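The threshold of 46340 follows directly from the size of a signed 32-bit
integer, as the short Python sketch below shows. It emulates the usual
two's-complement wraparound with ctypes; in C itself signed overflow is
formally undefined, so the wrapped value is what typically happens
rather than a guarantee. The helper name square_as_int32 is made up for
this example.

```python
import ctypes
import math

INT32_MAX = 2**31 - 1   # largest value of a signed 32-bit integer

def square_as_int32(n):
    """Return n*n reduced to a signed 32-bit value (two's-complement wraparound)."""
    return ctypes.c_int32(n * n).value

if __name__ == "__main__":
    # Largest n whose square still fits in a signed 32-bit integer.
    print(math.isqrt(INT32_MAX))
    # 46340 squares exactly; 46341 wraps around to a negative number.
    print(square_as_int32(46340), square_as_int32(46341))
```

Computing the product in a wider integer type or in floating point
avoids the problem.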

521

522

-------------------------------------------------------------------------------

523

524

Question 5.5. The threaded code fails on Linux Redhat 5.0

525

526

We had problems with glibc-2.0.5. The code should work with glibc-2.0.7.

527

528

-------------------------------------------------------------------------------

529

530

Question 5.6. FFTW 2.0's rfftwnd fails for rank > 1 transforms with a final dimension >= 65536.

531

532

This bug was fixed in FFTW 2.0.1. (There was a 32-bit integer overflow

533

due to a poorly-parenthesized expression.)

534

535

-------------------------------------------------------------------------------

536

537

Question 5.7. FFTW 2.0's complex transforms give the wrong results with prime factors 17 to 97.

538

539

There was a bug in the complex transforms that could cause incorrect

540

results under (hopefully rare) circumstances for lengths with

541

intermediate-size prime factors (17-97). This bug was fixed in FFTW

542

2.1.1.

543

544

-------------------------------------------------------------------------------

545

546

Question 5.8. FFTW 2.1.1's MPI test programs crash with MPICH.

547

548

This bug was fixed in FFTW 2.1.2. The 2.1/2.1.1 MPI test programs crashed

549

when using the MPICH implementation of MPI with the ch_p4 device (TCP/IP);

550

the transforms themselves worked fine.

551

552

-------------------------------------------------------------------------------

553

554

Question 5.9. FFTW 2.1.2's multi-threaded transforms don't work on AIX.

555

556

This bug was fixed in FFTW 2.1.3. The multi-threaded transforms in

557

previous versions didn't work with AIX's pthreads implementation, which

558

idiosyncratically creates threads in detached (non-joinable) mode by

559

default.

560

561

-------------------------------------------------------------------------------

562

563

Question 5.10. FFTW 2.1.2's complex transforms give incorrect results for large prime sizes.

564

565

This bug was fixed in FFTW 2.1.3. FFTW's complex-transform algorithm for

566

prime sizes (in versions 2.0 to 2.1.2) had an integer overflow problem

567

that caused incorrect results for many primes greater than 32768 (on

568

32-bit machines). (Sizes without large prime factors are not affected.)

569

570

-------------------------------------------------------------------------------

571

572

Question 5.11. FFTW 2.1.3's multi-threaded transforms don't give any speedup on Solaris.

573

574

This bug was fixed in FFTW 2.1.4. (By default, Solaris creates threads

575

that do not parallelize over multiple processors, so one has to request

576

the proper behavior specifically.)

577

578

-------------------------------------------------------------------------------

579

580

Question 5.12. FFTW 2.1.3 crashes on AIX.

581

582

The FFTW 2.1.3 configure script picked incorrect compiler flags for the

583

xlc compiler on newer IBM processors. This is fixed in FFTW 2.1.4.

584

585

-conquer to take advantage of the memory

586

hierarchy.

587

588

For more details (albeit somewhat outdated), see the paper "FFTW: An

589

Adaptive Software Architecture for the FFT", by M. Frigo and S. G.

590

Johnson, *Proc. ICASSP* 3, 1381 (1998), available along with other

591

references at the FFTW web page.

592

593

===============================================================================

594

595

Section 5. Known bugs

596

597

650

Q5.1 FFTW 1.1 crashes in rfftwnd on Linux.

598

651

Q5.2 The MPI transforms in FFTW 1.2 give incorrect results/leak memory.

599

652

Q5.3 The test programs in FFTW 1.2.1 fail when I change FFTW to use sin
