CS660 Combinatorial Algorithms
Fall Semester, 1996
Splay Trees
[To Lecture Notes Index]
San Diego State University -- This page last updated Nov 7, 1996
data:image/s3,"s3://crabby-images/c25ae/c25ae2836336a4139f89a57f1bca8103b581e7e5" alt="----------"
Contents of Splay Trees
- Self-Organizing BST
- Splay Trees
- Performance of Splay Tree
- Splay Operations
- Top-Down Splay
Basic Rotation
Simple Exchange (Attempt 1)
When we access a node, apply a rotation to move the node one level closer to
root
If each node is accessed with probability of 1/n the average search time is:
data:image/s3,"s3://crabby-images/551e7/551e74689de6206377f42f809b9f3720a686a342" alt=""
Move-to-root (Attempt 2)
When we access a node, apply series of rotations to make the node the root
We have a list of n items: a1, a2, ..., an
Probability of accessing item ak is P(ak)
The average search cost is[1]
data:image/s3,"s3://crabby-images/b4d72/b4d7246ef6389381855019159f0d8b89177c0b4b" alt=""
Where H(P(a1), ....P(an)) is the entropy of the
distribution
If P(ak) = 1/n then
data:image/s3,"s3://crabby-images/19adc/19adcdb1a7f0f7ef5b18632abbece4b94f00f9b6" alt=""
Move-to-root example
data:image/s3,"s3://crabby-images/befff/befffd5c24e67b3034a2a9d58eed2669aa3f4ad6" alt=""
Splaying
Splay step at x
let p(x) = parent of node x
case 1a (zig) p(x) = root of the tree
case 2a (zig-zig) p(x) is not the root and x and p(x) are both left
children
data:image/s3,"s3://crabby-images/37c15/37c15d4d2a6453bbbe1a357d4ce690eb059bfad9" alt=""
case 3a (zig-zag) p(x) is not the root and x is a left child and p(x)
is a right child
To Splay a node X, repeat the splay step on X until it is the root
Splay B
data:image/s3,"s3://crabby-images/92bc6/92bc62bf8f0f731f6f78fe5b1445df9b23cc8c47" alt=""
Splay vs. Move-to-root
Case 1
Case 2
data:image/s3,"s3://crabby-images/6068f/6068fef3936cbb6245a281be1d33d7e0af8e5807" alt=""
Splay vs. Move-to-root
Case 3
data:image/s3,"s3://crabby-images/34173/34173839fc4298dcb48b249259bb86ae1783156e" alt=""
Move-to-root A
data:image/s3,"s3://crabby-images/2e5de/2e5ded9d9915ca7002800fdee5a9a77610534c59" alt=""
Splay A
data:image/s3,"s3://crabby-images/d4299/d42999399dc9dc8f5ea0b24f6c7cf5d00dfb5432" alt=""
Splaying at a node of depth d takes [[Theta]](d) time
ck= actual cost of operation k
=
amortized cost of operation k
Dk = the state of the data structure after applying k'th operation
to Dk
= potential associated with Dk
data:image/s3,"s3://crabby-images/19a1f/19a1f48235e35d73aba4783399191eb4246e14f8" alt=""
so we get:
-
-
-
The actual amount of work required is given by:
-
data:image/s3,"s3://crabby-images/2d7de/2d7de3c0202ae7251d66135cac549e513f66ebfe" alt=""
So need the total amortized work and difference in potential
Potential for Splay Trees
Let:
- w(x) = weight of node x, a fixed but arbitrary value
-
denotes the tree rooted at x
-
means y is a node in the tree
data:image/s3,"s3://crabby-images/93807/938079952e3b9eaae40210579479af0d6c6c77db" alt=""
- size(x) =
data:image/s3,"s3://crabby-images/8dc46/8dc46708c72b97d7b2c36a802daa397c621f137c" alt=""
- rank(x) = lg(size(x))
-
Example
Let w(x) = 1/n where n is the number of nodes in the tree
data:image/s3,"s3://crabby-images/532e6/532e63108e29660d5d0875ccff97f256c3e45b88" alt=""
Lemma The amortized time to splay node x in a tree with root at t is
at most 3(r(t) - r(x)) + 1 = O(lg(s(t)/s(x)))
- Let s, r denote the size, rank functions before a splay
-
- Let s', r' denote the size, rank functions after a splay
-
- Count rotations
Case 1 (zig) One rotation
Amortized time of this step is:
data:image/s3,"s3://crabby-images/5cea2/5cea26cf75c40f1fb3b4c40dc0b9dc54319b1efa" alt=""
- = 1 + [r'(x) + r'(y)] - r(x) - r(y)
- only x and y change rank
- <= 1 + r'(x) - r(x)
- r(y) >= r'(y)
- <= 1 + 3(r'(x) - r(x))
- r'(x) >= r(x)
Case 2 (zig-zig) Two rotations
Amortized time of this step is:
- = 2 + r'(x) + r'(y) + r'(z) - r(x) - r(y) - r(z)
-
- only x, y ,z change rank
-
- = 2 + r'(y) + r'(z) - r(x) - r(y)
- r'(x) = r(z)
-
- <= 2 + r'(x) + r'(z) - 2r(x)
- r'(x) >= r'(y) and
-
- r(y) >= r(x)
Assume that 2 <= 2r'(x) - r(x) - r'(z), then
2 + r'(x) + r'(z) - 2r(x) <= 2r'(x) - r(x) - r'(z) + r'(x) + r'(z) -
2r(x)
-
- = 3r'(x) - 3r(x)
-
-
Need to show 2r'(x) - r(x) - r'(z) >= 2
Claim 1
data:image/s3,"s3://crabby-images/061b2/061b2b8bf9b784409ba7c8c9d6f09356a69ea4f0" alt=""
- Set b = 1-a
-
- We have
-
- Setting this to 0 to find extreme value we get
-
-
-
- so
-
-
-
- that is a = 1/2 and b = 1/2
-
- but lg(1/2)+lg(1/2)= -2
End claim 1
Claim 2 2r'(x) - r(x) - r'(z) >= 2
- Recall that:
-
-
- We have:
- r(x) + r'(z) - 2r'(x) = lg(s(x)) + lg(s'(z)) - 2lg(s'(x))
-
-
- = lg(s(x)/s'(x)) + lg(s'(z)/s'(x))
-
- Now s(x) + s'(z) <= s'(x)
- (Why?)
-
- so
- 0<= s(x)/s'(x) + s'(z)/s'(x) <= 1
-
- Set s(x)/s'(x) = a and s'(z)/s'(x) =b in claim 1 to get
-
-
- lg(s(x)/s'(x)) + lg(s'(z)/s'(x)) <= -2
-
Thus r(x) + r'(z) - 2r'(x) <= -2 or 2r'(x) - r(x) - r'(z) >= 2
Case 3 (zig-zag)
Amortized time of this step is:
data:image/s3,"s3://crabby-images/bb4d1/bb4d1a3a41c9797818d64cceb68e1a1d866e1f70" alt=""
- = 2 + r'(x) + r'(w) + r'(z) - r(x) - r(w) - r(z)
- <= 2 + r'(w) + r'(z) - 2r(x)
- r'(x) = r(z) and
-
- r(w) >= r(x)
Assume that 2 <= 2r'(x) - r'(w) - r'(z)
2 + r'(w) + r'(z) - 2r(x) <= [2r'(x) - r'(w) - r'(z)] + r'(w) + r'(z) -
2r(x)
= 2r'(x) - 2r(x) <= 3 * ( r'(x) - r(x) )
Claim 3: 2r'(x) - r'(w) - r'(z) >= 2
- Proof: see claims 1 & 2
Putting it All together
Lemma The amortized time to splay node x in a tree with root at t is at
most 3(r(t) - r(x)) + 1 = O(lg(s(t)/s(x)))
Splay at B
Cost of Step 1 <= 3* ( r'(B) - r(B) ) case 3
Cost of Step 2 <= 3 * ( r''(B) - r'(B) ) case 2
Total cost = 3* ( r'(B) - r(B) ) + 3 * ( r''(B) - r'(B) )
- = 3 * ( r''(B) - r(B) )
-
- = 3 * ( r(E) - r(B) )
Case 1 only happens when splaying a child of the root
This happens at most once per splay
Amortized Cost of M Splay Operations
on Tree with N nodes
Let node i be accessed q(i) times.
Then
data:image/s3,"s3://crabby-images/a6143/a61437142c600379ec942c34814354f2346fe840" alt=""
Theorem (Balance Theorem) The total access time is
- O( (m + n) * lg (n + m) )
Theorem (Static Optimality) If every item is accessed at least once,
then the total access time is:
-
Example
Let q( i ) = 1 then we have
data:image/s3,"s3://crabby-images/43f2a/43f2a479e799d39651411e126e46ac6b030285cc" alt=""
proof of Static Optimality:
Recall that:
- size(x) =
-
- rank(x) = lg(size(x))
-
-
data:image/s3,"s3://crabby-images/8ab1a/8ab1ab579a51ee8fb859d8102590188beb2cf154" alt=""
let w(i) = q(i)/m then
-
data:image/s3,"s3://crabby-images/bc840/bc8401b5c541ffb0a0ea7cd2574f5e417c2828f5" alt=""
We wish to compute the actual cost of m operations.
Recall:
-
-
So we need to compute the change in potential and the amortized cost over m
operations.
First the change in potential.
The biggest change in potential comes when a node moves from root to a leaf.
Assume that all nodes start at the root and end up as a leaf.
This will give us an upper bound on the change in potential.
We have:
- rank of the root = lg(W)
-
- rank of node i as a leaf = lg( w( i ) )
So change in rank of node i is at most
- lg(W) - lg( w( i ) ) = lg( W/w(i) )
so the upper bound on the net decrease in potential over m operations is:
-
The amortized access time of item i is:
-
Amortized access of all times is
-
data:image/s3,"s3://crabby-images/9c422/9c42261997fe8a44ce47b90ec2b1e6c3308413fe" alt=""
So the total cost is bounded by:
-
data:image/s3,"s3://crabby-images/0cc3e/0cc3e45edef6385be5c66426ef4fd0bcfd2464b9" alt=""
access(i, t): if i is in tree t return pointer to i, otherwise return null
pointer
- Find i, then splay tree t at i.
- If i is not in tree t, then splay last node accessed looking for i
join (a, b): Return tree formed by combining tree "a", and tree "b". Assumes
that every item in "a" has key less then every item in "b"
- Splay largest item in "a", then add "b" as a right child of root of "a"
split (i, t): Split tree t, containing item i, into two trees: "a",
containing all items with key less or equal to "i"; and "b", containing all
items with key greater than "i"
- Perform access(i, t) then split tree at root
insert(i, t): insert i in tree t
- Perform split (i, t) then make i the root of the two trees returned by
split
delete(i, t): delete i from tree t
- Perform access(i, t) then perform join on t's subtrees
Case 1 X is the node we are splaying
data:image/s3,"s3://crabby-images/4d213/4d2135510ae614e91d756b149638a7556da2cef7" alt=""
Case 2 (zig-zig) The node we are splaying is in the subtree rooted at
X
data:image/s3,"s3://crabby-images/fbed6/fbed68c9d14bc78734e41d5d471ea7d1737e9df6" alt=""
Case 3 (zig-zag) The node we are splaying is in the subtree rooted at
X
Case 4 - The last step: X is the node we wish to splay
-
Example Splay at B
data:image/s3,"s3://crabby-images/66783/667834feb3a909b60974850ac95776fea48a2503" alt=""
Example Splay at B
data:image/s3,"s3://crabby-images/97870/97870438bb245ffbf63c84ab9891d658af3385ae" alt=""