Movatterモバイル変換

[0]ホーム

Jump to content

Self-balancing binary search tree

Edit links

From Wikipedia, the free encyclopedia

Any node-based binary search tree that automatically keeps its height the same

This articleneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources. Unsourced material may be challenged and removed.
Find sources: "Self-balancing binary search tree" – news ·newspapers ·books ·scholar ·JSTOR(November 2010) (Learn how and when to remove this message)

An example of anunbalanced tree; following the path from the root to a node takes an average of 3.27 node accesses

The same tree after being height-balanced; the average path effort decreased to 3.00 node accesses

Incomputer science, aself-balancing binary search tree (BST) is anynode-basedbinary search tree that automatically keeps its height (maximal number of levels below the root) small in the face of arbitrary item insertions and deletions.^[1]These operations when designed for a self-balancing binary search tree, contain precautionary measures against boundlessly increasing tree height, so that theseabstract data structures receive the attribute "self-balancing".

Forheight-balanced binary trees, the height is defined to be logarithmic $O(\log n)$ in the number $n {\displaystyle n}$ of items. This is the case for many binary search trees, such asAVL trees andred–black trees.Splay trees andtreaps are self-balancing but not height-balanced, as their height is not guaranteed to be logarithmic in the number of items.

Self-balancing binary search trees provide efficient implementations for mutable orderedlists, and can be used for other abstract data structures such asassociative arrays,priority queues andsets.

Overview

[edit]

Tree rotations are very common internal operations on self-balancing binary trees to keep perfect or near-to-perfect balance.

Most operations on a binary search tree (BST) take time directly proportional to the height of the tree, so it is desirable to keep the height small. A binary tree with heighth can contain at most2⁰+2¹+···+2^h = 2^h+1−1 nodes. It follows that for any tree withn nodes and heighth:

n\leq 2^{h+1}-1

And that implies:

h\geq \lceil \log _{2}(n+1)-1\rceil \geq \lfloor \log _{2}n\rfloor

In other words, the minimum height of a binary tree withn nodes islog₂(n),rounded down; that is, $\lfloor \log _{2}n\rfloor$ .^[1]

However, the simplest algorithms for BST item insertion may yield a tree with heightn in rather common situations. For example, when the items are inserted in sortedkey order, the tree degenerates into alinked list withn nodes. The difference in performance between the two situations may be enormous: for example, whenn = 1,000,000, the minimum height is $\lfloor \log _{2}(1,000,000)\rfloor =19$ .

If the data items are known ahead of time, the height can be kept small, in the average sense, by adding values in a random order, resulting in arandom binary search tree. However, there are many situations (such asonline algorithms) where thisrandomization is not viable.

Self-balancing binary trees solve this problem by performing transformations on the tree (such astree rotations) at key insertion times, in order to keep the height proportional tolog₂(n). Although a certainoverhead is involved, it is not bigger than the always necessary lookup cost and may be justified by ensuring fast execution of all operations.

While it is possible to maintain a BST with minimum height with expected $O(\log n)$ time operations (lookup/insertion/removal), the additional space requirements required to maintain such a structure tend to outweigh the decrease in search time. For comparison, anAVL tree is guaranteed to be within a factor of 1.44 of the optimal height while requiring only two additional bits of storage in a naive implementation.^[1] Therefore, most self-balancing BST algorithms keep the height within a constant factor of this lower bound.

In theasymptotic ("Big-O") sense, a self-balancing BST structure containingn items allows the lookup, insertion, and removal of an item in $O(\log n)$ worst-case time, andordered enumeration of all items in $O(n)$ time. For some implementations these are per-operation time bounds, while for others they areamortized bounds over a sequence of operations. These times are asymptotically optimal among all data structures that manipulate the key only through comparisons.

Implementations

[edit]

Data structures implementing this type of tree include:

Applications

[edit]

Self-balancing binary search trees can be used in a natural way to construct and maintain ordered lists, such aspriority queues. They can also be used forassociative arrays; key-value pairs are simply inserted with an ordering based on the key alone. In this capacity, self-balancing BSTs havea number of advantages and disadvantages over their main competitor,hash tables. One advantage of self-balancing BSTs is that they allow fast (indeed, asymptotically optimal) enumeration of the itemsin key order, which hash tables do not provide. One disadvantage is that their lookup algorithms get more complicated when there may be multiple items with the same key. Self-balancing BSTs have better worst-case lookup performance than most^[2] hash tables ( $O(\log n)$ compared to $O(n)$ ), but have worse average-case performance ( $O(\log n)$ compared to $O(1)$ ).

Self-balancing BSTs can be used to implement any algorithm that requires mutable ordered lists, to achieve optimal worst-case asymptotic performance. For example, ifbinary tree sort is implemented with a self-balancing BST, we have a very simple-to-describe yetasymptotically optimal $O(n\log n)$ sorting algorithm. Similarly, many algorithms incomputational geometry exploit variations on self-balancing BSTs to solve problems such as theline segment intersection problem and thepoint location problem efficiently. (For average-case performance, however, self-balancing BSTs may be less efficient than other solutions. Binary tree sort, in particular, is likely to be slower thanmerge sort,quicksort, orheapsort, because of the tree-balancing overhead as well ascache access patterns.)

Self-balancing BSTs are flexible data structures, in that it's easy to extend them to efficiently record additional information or perform new operations. For example, one can record the number of nodes in each subtree having a certain property, allowing one to count the number of nodes in a certain key range with that property in $O(\log n)$ time. These extensions can be used, for example, to optimize database queries or other list-processing algorithms.

References

[edit]

^^a ^b ^cDonald Knuth.The Art of Computer Programming, Volume 3:Sorting and Searching, Second Edition. Addison-Wesley, 1998.ISBN 0-201-89685-0. Section 6.2.3: Balanced Trees, pp.458–481.
^Cuckoo hashing provides worst-case lookup performance of $O(1)$ .

External links

[edit]

Dictionary of Algorithms and Data Structures: Height-balanced binary search tree
GNU libavl, a LGPL-licensed library of binary tree implementations in C, with documentation

v t e Tree data structures
Search trees (dynamic sets, associative arrays)	2–3 2–3–4 AA (a,b) AVL B K-Dimensional B+ B* B^x Binary search Optimal Self-balancing Dancing HTree Interval Order statistic Palindrome (Left-leaning) Red–black Scapegoat Splay T Treap UB Weight-balanced
Heaps	Binary Binomial Brodal d-ary Fibonacci Leftist Pairing Skew binomial Skew van Emde Boas Weak
Tries	Ctrie C-trie (compressed ADT) Hash Radix Suffix Ternary search X-fast Y-fast
Spatial data partitioning trees	Ball BK BSP Cartesian Hilbert R k-d (implicitk-d) M Metric MVP Octree PH Priority R Quad R R+ R* Segment VP X
Other trees	Cover Exponential Fenwick Finger Fractal index Fusion Hash calendar iDistance K-ary Left-child right-sibling Link/cut Log-structured merge Merkle PQ Range SPQR Top

v t e Data structures
Types	Collection Container
Abstract	Associative array Multimap Retrieval Data Structure List Stack Queue Double-ended queue Priority queue Double-ended priority queue Set Multiset Disjoint-set
Arrays	Bit array Circular buffer Dynamic array Hash table Hashed array tree Sparse matrix
Linked	Association list Linked list Skip list Unrolled linked list XOR linked list
Trees	B-tree Binary search tree AA tree AVL tree Red–black tree Self-balancing tree Splay tree Heap Binary heap Binomial heap Fibonacci heap R-tree R* tree R+ tree Hilbert R-tree Rope Trie Hash tree
Graphs	Binary decision diagram Directed acyclic graph Directed acyclic word graph
List of data structures

Retrieved from "https://en.wikipedia.org/w/index.php?title=Self-balancing_binary_search_tree&oldid=1321531660"

Categories:

Hidden categories:

[8]ページ先頭

Movatterモバイル変換

Overview

Implementations

Applications

See also

References

External links