CS 662 Theory of Parallel Algorithms
Numerical Method part 4 FFT
[To Lecture Notes Index]
San Diego State University -- This page last updated May 7, 1996
data:image/s3,"s3://crabby-images/c25ae/c25ae2836336a4139f89a57f1bca8103b581e7e5" alt="----------"
Who is Fourier?How did he get so Fast?Who cares?
Bases for 3-dimension points: (x, y, z)
Bases for functions:
- Polynomials
-
-
-
- Fourier bases
-
k = 0, 1, ...
-
-
k = 0, 1, ...
-
- Then the Fourier transform of f(x) is:
-
Example
- f(x) = | x | for
-
- The Fourier transform of f(x) is:
-
-
Complex numbers, X-Y plane and all that
Let
represent the unit vector in the y-axis
Represent points in the plane as x +
y
So (1/2, 1) is 1/2 +
data:image/s3,"s3://crabby-images/d0fff/d0fff5ada10360de616a43c68d29aacd5ac8b71e" alt=""
Thus (a +
b)(c
+
d)
= ac+
bd
+
(b+d)
but
,
so
(a +
b)(c
+
d)
= ac-bd +
(bc+ad)
Roots of Unity
Define
as:
-
data:image/s3,"s3://crabby-images/2e7c2/2e7c28d679be6fc21437fb32064573d1affcc63e" alt=""
where
data:image/s3,"s3://crabby-images/23fcb/23fcb9102f36cb3751a930542400c975679805ec" alt=""
then
-
The n'th roots of unity are:
-
data:image/s3,"s3://crabby-images/e3e26/e3e2670ee8b11dd8016915464bd8d347f4474966" alt=""
Example
Let n = 4 then
data:image/s3,"s3://crabby-images/57280/57280b248762988e8667b79b3a7821c1e5066e66" alt=""
Example
Let n = 8 then
data:image/s3,"s3://crabby-images/81fbd/81fbd6fa79bfefbfc3cd2b6a90f87bd8a151c1e8" alt=""
Discrete vs. Continuous (DFT vs FT)
Let
data:image/s3,"s3://crabby-images/0875d/0875dec87b82ce99a16314dde54c2dd69f739cce" alt=""
Define
for j,k = 0, 1, ..., n-1
If n = 4 then
data:image/s3,"s3://crabby-images/711ad/711ad779f2fedc226360850e4106cc02ef8ab35f" alt=""
Let x be an n-dimensional vector, the discrete Fourier transform (DFT) of x is
the vector:
-
data:image/s3,"s3://crabby-images/be865/be86551937d08dd7d55c6733fac409e8f48780fe" alt=""
Then
-
data:image/s3,"s3://crabby-images/1bb06/1bb06df668057db902199307d02512a7e2a780aa" alt=""
Note
defines
We have
Now the Fast Part
Assume n is a power of 2 and compute
data:image/s3,"s3://crabby-images/e8135/e8135b5353c8fd8b36deffa7dac8079e4bff206b" alt=""
Since
we have
-
data:image/s3,"s3://crabby-images/ea50a/ea50acd009473d6a94204a33602191f6a2c6ab9c" alt=""
But this is the 2p'th component of the DFT of the vector:
-
data:image/s3,"s3://crabby-images/43411/434118c2a3f5d52e4dee582d77f11676e9570db0" alt=""
Similarly we get:
-
data:image/s3,"s3://crabby-images/2629c/2629cec388a6fa6ba8025ba7f97f58f1316fc910" alt=""
But this is the 2p+1'th component of the DFT of the vector:
-
data:image/s3,"s3://crabby-images/806f1/806f1d21224358bdea7548241b82cb4bbe334f48" alt=""
This means to compute the DFT of an n vector we can compute the DFT of two n/2
vectors.
Computing DFT
Theorem
- The above processes will compute the DFT of an n-dimensional vector in
O(log(n)) time using total of O(nlog(n)) operations
- The inverse DFT of an n-dimensional vector can be computed in O(log(n))
time using total of O(nlog(n)) operations
Example Use of FFT
Polynomial Multiplication
Let
and
data:image/s3,"s3://crabby-images/7a671/7a6718ebf97bdc0e74331ae2c02ff44e8d9435dd" alt=""
Let
where
data:image/s3,"s3://crabby-images/ef1c6/ef1c66a6fb71ed525ccefff3262f2f8cb47e5d70" alt=""
So can compute
in parallel using log(n+m) time and performing O(mn) operations using scan and
n*n processors
Using FFT
Let
and
data:image/s3,"s3://crabby-images/7c25b/7c25b3b88cd4dabdc176ddcf728e4b837dd62515" alt=""
Compute
and
where
-
-
-
-
data:image/s3,"s3://crabby-images/1b365/1b3658edea1c7426ca000427000e86202acffe95" alt=""
Compute
data:image/s3,"s3://crabby-images/c902b/c902bff87f53010153cb15f92d82aff0ee00227a" alt=""
Compute
where
-
-
data:image/s3,"s3://crabby-images/0d275/0d2757a555f665e5e7ecb227aa8fd3c28b83a07f" alt=""
Then
for k = 0, 1, ..., n+m-1
So
can be computed in O( log(n+m) ) time and O( (n+m)log(n+m) ) operations using
O(n) processors