In computer science, an order statistic tree is a variant of the binary search tree (or more generally, a B-tree) that supports two additional operations beyond insertion, lookup and deletion:
Contents
Both operations can be performed in O(log n) worst case time when a self-balancing tree is used as the base data structure.
Augmented search tree implementation
To turn a regular search tree into an order statistic tree, the nodes of the tree need to store one additional value, which is the size of the subtree rooted at that node (i.e., the number of nodes below it). All operations that modify the tree must adjust this information to preserve the invariant that
size[x] = size[left[x]] + size[right[x]] + 1where size[nil] = 0
by definition. Select can then be implemented as
Rank can be implemented as
function Rank(T, x) // Returns the position of x (one-indexed) in the linear sorted list of elements of the tree T r ← size[left[x]] + 1 y ← x while y ≠ T.root if y = right[y.p] r ← r + size[left[y.p]] + 1 y ← y.p return rOrder-statistic trees can be further amended with bookkeeping information to maintain balance (e.g., tree height can be added to get an order statistic AVL tree, or a color bit to get a red-black order statistic tree). Alternatively, the size field can be used in conjunction with a weight-balancing scheme at no additional storage cost.
Other implementations
Another way to implement an order statistic tree is an implicit data structure derived from the min-max heap.