CS 662 Theory of Parallel Algorithms
Numerical Methods 1
[To Lecture Notes Index]
San Diego State University -- This page last updated April 25, 1996
data:image/s3,"s3://crabby-images/c25ae/c25ae2836336a4139f89a57f1bca8103b581e7e5" alt="----------"
Contents of Numerical Methods 1 Lecture
- Numerical Algorithms
- Matrix Algorithms
- Transposition
- Matrix Multiplication
Basic Issue - Digital computers don't do math (like math books)
-
may not the same as
-
-
-
may not be the same as
-
-
data:image/s3,"s3://crabby-images/4b88e/4b88e0224ab0e6e9efb6e58561b8d9ad50fbf2e3" alt=""
Problem 1. Errors
Let c = a =
and let b=
data:image/s3,"s3://crabby-images/85111/85111dfc15550f8876182cbf32c9619d4c9feef9" alt=""
What is
?
What is
?
Problem 2. Computational Complexity
Computing
requires n multiplications
So computing
requires
- 2n multiplications and n additions
But
requires
- n multiplications and n additions
Compute
Method 1
data:image/s3,"s3://crabby-images/409b3/409b3442228173d4a036cf50c5444f42f6b3b4d1" alt=""
Requires n multiplications
Method 2
Let
data:image/s3,"s3://crabby-images/25d82/25d8253ab6191e2cf0d1ad56d0c109b4e1a4f92b" alt=""
Algorithm
- Result = x
- for k = 1 to r do
- Result = Result * Result
Requires log(n) multiplications
Let n be any value
Let
be the binary representation of n
Then
data:image/s3,"s3://crabby-images/caa83/caa836d8a9e7fbc51a305f699cebe0878614c416" alt=""
So
-
data:image/s3,"s3://crabby-images/a64ee/a64ee6287e50a9ec76d4dbb021984e04f5fd1cf7" alt=""
but in log(n) multiplications we can compute all the terms
-
data:image/s3,"s3://crabby-images/877ec/877ec229e6c52b818f4a749a2e569f6824e3bc0a" alt=""
It takes another log(n) multiplications to multiply them together
Problem 3
It can take more human effort and some deep mathematics to perform numerical
analysis
Let A be an n*n matrix, for example
-
data:image/s3,"s3://crabby-images/a29b0/a29b020eb6e38f91f8dcd099ef2b20e69ed89dc1" alt=""
The transpose,
,
of A is obtained by mapping
for all j and k
-
data:image/s3,"s3://crabby-images/f92a4/f92a44dfa02c268ad7af909850697e0aba5b403d" alt=""
Computing
from A sequentially by:
for j = 2 to n do
for k = 1 to j - 1 do
temp = A[ j, k ]
A[ j, k ] = A[ k, j ]
A[ k, j ] = temp
end for
end for
Time Complexity = O(n*n)
EREW Transpose
Use (n*n - n)/2 processors
for j = 2 to n do in parallel
for k = 1 to j - 1 do in parallel
Processor P(j, k)
temp = A[ j, k ]
A[ j, k ] = A[ k, j ]
A[ k, j ] = temp
end for
end for
Time Complexity = O( 1 )
Cost = n*n
Big Deal!What about Networks?
Transpose on Shuffle-Exchange
Let n =
and use shuffle-exchange network with
processors
Map elements of A to the processors by:
-
where
for j, k = 1 to n
Note:
- (j-1) and (k-1) range from 0 to n-1
-
- (j-1) and (k-1) take r bits to store
So
can be obtained by writing the bits of ( j - 1) then following them with the
bits of (k-1).
Algorithm
Shuffle r times
Example 1
Let r = 1 so n = 2
We have
and
data:image/s3,"s3://crabby-images/9c418/9c4188b554e53c0f2550fbda06d0424523b7f0fe" alt=""
Map to the 4 node shuffle-exchange network
Now perform 1 shuffle on all nodes
Example 2
Let r = 2 so n = 4
We have
,
data:image/s3,"s3://crabby-images/bd296/bd296fe39819a2af9da20743975fa7fca7905050" alt=""
map to the 16 node shuffle exchange network
now perform 2 shuffles
Let A be an n*n matrix, and x be a vector with n elements
Compute y = Ax
-
We have
data:image/s3,"s3://crabby-images/fb1df/fb1df887b8ed28824546518dd9baaeb100e2bbe1" alt=""
Matrix-Vector Multiplication on Linear Array
Matrix Matrix Multiplication
Let A and B be a n*n matrices
Compute C = A*B
-
data:image/s3,"s3://crabby-images/2e222/2e2222c81136d1541afecb18f8454d4f26e82809" alt=""
We have
-
data:image/s3,"s3://crabby-images/ebca6/ebca691f5a5ad5a7f96cd5cf6ca08883cca76dc4" alt=""
This requires n multiplications and n-1 additions per element of C
So it takes
multiplications and
additions to compute C
Matrix-Matrix Multiplication on Linear Array