Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Heap (data structure)

From Wikipedia, the free encyclopedia
Computer science data structure
For the memory heap in low-level computer programming, seeC dynamic memory allocation.
Example of abinary max-heap with node keys being integers between 1 and 100

Incomputer science, aheap is atree-baseddata structure that satisfies theheap property: In amax heap, for any givennode C, if P is the parent node of C, then thekey (thevalue) of P is greater than or equal to the key of C. In amin heap, the key of P is less than or equal to the key of C.[1] The node at the "top" of the heap (with no parents) is called theroot node.

The heap is one maximally efficient implementation of anabstract data type called apriority queue, and in fact, priority queues are often referred to as "heaps", regardless of how they may be implemented. In a heap, the highest (or lowest) priority element is always stored at the root. However, a heap is not a sorted structure; it can be regarded as being partially ordered. A heap is a useful data structure when it is necessary to repeatedly remove the object with the highest (or lowest) priority, or when insertions need to be interspersed with removals of the root node.

A common implementation of a heap is thebinary heap, in which the tree is acomplete[2] binary tree (see figure). The heap data structure, specifically the binary heap, was introduced byJ. W. J. Williams in 1964, as a data structure for theheapsort sorting algorithm.[3] Heaps are also crucial in several efficientgraph algorithms such asDijkstra's algorithm. When a heap is a complete binary tree, it has the smallest possible height—a heap withN nodes anda branches for each node always has logaN height.

Note that, as shown in the graphic, there is no implied ordering between siblings or cousins and no implied sequence for anin-order traversal (as there would be in, e.g., abinary search tree). The heap relation mentioned above applies only between nodes and their parents, grandparents. The maximum number of children each node can have depends on the type of heap.

Heaps are typically constructed in-place in the same array where the elements are stored, with their structure being implicit in the access pattern of the operations. Heaps differ in this way from other data structures with similar or in some cases better theoretic bounds such asradix trees in that they require no additional memory beyond that used for storing the keys.

Operations

[edit]

The common operations involving heaps are:

Basic
  • find-max (orfind-min): find a maximum item of a max-heap, or a minimum item of a min-heap, respectively (a.k.a.peek)
  • insert: adding a new key to the heap (a.k.a.,push[4])
  • extract-max (orextract-min): returns the node of maximum value from a max heap [or minimum value from a min heap] after removing it from the heap (a.k.a.,pop[5])
  • delete-max (ordelete-min): removing the root node of a max heap (or min heap), respectively
  • replace: pop root and push a new key. This is more efficient than a pop followed by a push, since it only needs to balance once, not twice, and is appropriate for fixed-size heaps.[6]
Creation
  • create-heap: create an empty heap
  • heapify: create a heap out of given array of elements
  • merge (union): joining two heaps to form a valid new heap containing all the elements of both, preserving the original heaps.
  • meld: joining two heaps to form a valid new heap containing all the elements of both, destroying the original heaps.
Inspection
  • size: return the number of items in the heap.
  • is-empty: return true if the heap is empty, false otherwise.
Internal
  • increase-key ordecrease-key: updating a key within a max- or min-heap, respectively
  • delete: delete an arbitrary node (followed by moving last node and sifting to maintain heap)
  • sift-up: move a node up in the tree, as long as needed; used to restore heap condition after insertion. Called "sift" because node moves up the tree until it reaches the correct level, as in asieve.
  • sift-down: move a node down in the tree, similar to sift-up; used to restore heap condition after deletion or replacement.

Implementation using arrays

[edit]

Heaps are usually implemented with anarray, as follows:

  • Each element in the array represents a node of the heap, and
  • The parent / child relationship isdefined implicitly by the elements' indices in the array.
Example of a complete binary max-heap with node keys being integers from 1 to 100 and how it would be stored in an array.

For abinary heap, in the array, the first index contains the root element. The next two indices of the array contain the root's children. The next four indices contain the four children of the root's two child nodes, and so on. Therefore, given a node at indexi, its children are at indices2i+1{\displaystyle 2i+1} and2i+2{\displaystyle 2i+2}, and its parent is at index⌊(i−1)/2⌋. This simple indexing scheme makes it efficient to move "up" or "down" the tree.

Balancing a heap is done by sift-up or sift-down operations (swapping elements which are out of order). As we can build a heap from an array without requiring extra memory (for the nodes, for example),heapsort can be used to sort an array in-place.

After an element is inserted into or deleted from a heap, the heap property may be violated, and the heap must be re-balanced by swapping elements within the array.

Although different types of heaps implement the operations differently, the most common way is as follows:

  • Insertion: Add the new element at the end of the heap, in the first available free space. If this will violate the heap property, sift up the new element (swim operation) until the heap property has been reestablished.
  • Extraction: Remove the root and insert the last element of the heap in the root. If this will violate the heap property, sift down the new root (sink operation) to reestablish the heap property.
  • Replacement: Remove the root and put thenew element in the root and sift down. When compared to extraction followed by insertion, this avoids a sift up step.

Construction of a binary (ord-ary) heap out of a given array of elements may be performed in linear time using the classicFloyd algorithm, with the worst-case number of comparisons equal to 2N − 2s2(N) −e2(N) (for a binary heap), wheres2(N) is the sum of all digits of the binary representation ofN ande2(N) is the exponent of 2 in the prime factorization ofN.[7] This is faster than a sequence of consecutive insertions into an originally empty heap, which is log-linear.[a]

Variants

[edit]

Comparison of theoretic bounds for variants

[edit]

Here aretime complexities[8] of various heap data structures. The abbreviationam. indicates that the given complexity is amortized, otherwise it is a worst-case complexity. For the meaning of "O(f)" and "Θ(f)" seeBig O notation. Names of operations assume a max-heap.

Operationfind-maxdelete-maxincrease-keyinsertmeldmake-heap[b]
Binary[8]Θ(1)Θ(log n)Θ(log n)Θ(log n)Θ(n)Θ(n)
Skew[9]Θ(1)O(log n)am.O(log n)am.O(log n)am.O(log n)am.Θ(n)am.
Leftist[10]Θ(1)Θ(log n)Θ(log n)Θ(log n)Θ(log n)Θ(n)
Binomial[8][12]Θ(1)Θ(log n)Θ(log n)Θ(1)am.Θ(log n)[c]Θ(n)
Skew binomial[13]Θ(1)Θ(log n)Θ(log n)Θ(1)Θ(log n)[c]Θ(n)
2–3 heap[15]Θ(1)O(log n)am.Θ(1)Θ(1)am.O(log n)[c]Θ(n)
Bottom-up skew[9]Θ(1)O(log n)am.O(log n)am.Θ(1)am.Θ(1)am.Θ(n)am.
Pairing[16]Θ(1)O(log n)am.o(log n)am.[d]Θ(1)Θ(1)Θ(n)
Rank-pairing[19]Θ(1)O(log n)am.Θ(1)am.Θ(1)Θ(1)Θ(n)
Fibonacci[8][20]Θ(1)O(log n)am.Θ(1)am.Θ(1)Θ(1)Θ(n)
Strict Fibonacci[21][e]Θ(1)Θ(log n)Θ(1)Θ(1)Θ(1)Θ(n)
Brodal[22][e]Θ(1)Θ(log n)Θ(1)Θ(1)Θ(1)Θ(n)[23]
  1. ^Each insertion takes O(log(k)) in the existing size of the heap, thusk=1nO(logk){\displaystyle \sum _{k=1}^{n}O(\log k)}. Sincelogn/2=(logn)1{\displaystyle \log n/2=(\log n)-1}, a constant factor (half) of these insertions are within a constant factor of the maximum, so asymptotically we can assumek=n{\displaystyle k=n}; formally the time isnO(logn)O(n)=O(nlogn){\displaystyle nO(\log n)-O(n)=O(n\log n)}. This can also be readily seen fromStirling's approximation.
  2. ^make-heap is the operation of building a heap from a sequence ofn unsorted elements. It can be done inΘ(n) time whenevermeld runs inO(log n) time (where both complexities can be amortized).[9][10] Another algorithm achievesΘ(n) for binary heaps.[11]
  3. ^abcForpersistent heaps (not supportingincrease-key), a generic transformation reduces the cost ofmeld to that ofinsert, while the new cost ofdelete-max is the sum of the old costs ofdelete-max andmeld.[14] Here, it makesmeld run inΘ(1) time (amortized, if the cost ofinsert is) whiledelete-max still runs inO(log n). Applied to skew binomial heaps, it yields Brodal-Okasaki queues, persistent heaps with optimal worst-case complexities.[13]
  4. ^Lower bound ofΩ(loglogn),{\displaystyle \Omega (\log \log n),}[17] upper bound ofO(22loglogn).{\displaystyle O(2^{2{\sqrt {\log \log n}}}).}[18]
  5. ^abBrodal queues and strict Fibonacci heaps achieve optimal worst-case complexities for heaps. They were first described as imperative data structures. The Brodal-Okasaki queue is apersistent data structure achieving the same optimum, except thatincrease-key is not supported.

Applications

[edit]

The heap data structure has many applications.

  • Heapsort: One of the best sorting methods being in-place and with no quadratic worst-case scenarios.
  • Selection algorithms: A heap allows access to the min or max element in constant time, and other selections (such as median or kth-element) can be done in sub-linear time on data that is in a heap.[24]
  • Graph algorithms: By using heaps as internal traversal data structures, run time will be reduced by polynomial order. Examples of such problems arePrim's minimal-spanning-tree algorithm andDijkstra's shortest-path algorithm.
  • Priority queue: A priority queue is an abstract concept like "a list" or "a map"; just as a list can be implemented with a linked list or an array, a priority queue can be implemented with a heap or a variety of other methods.
  • K-way merge: A heap data structure is useful to merge many already-sorted input streams into a single sorted output stream. Examples of the need for merging include external sorting and streaming results from distributed data such as a log structured merge tree. The inner loop is obtaining the min element, replacing with the next element for the corresponding input stream, then doing a sift-down heap operation. (Alternatively the replace function.) (Using extract-max and insert functions of a priority queue are much less efficient.)

Programming language implementations

[edit]
  • TheC++ Standard Library provides themake_heap,push_heap andpop_heap algorithms for heaps (usually implemented as binary heaps), which operate on arbitrary random accessiterators. It treats the iterators as a reference to an array, and uses the array-to-heap conversion. It also provides the classstd::priority_queue, which wraps these facilities in a container-like class. However, there is no standard support for the replace, sift-up/sift-down, or decrease/increase-key operations.
  • TheBoost C++ libraries include a heaps library. Unlike the STL, it supports decrease and increase operations, and supports additional types of heap: specifically, it supportsd-ary, binomial, Fibonacci, pairing and skew heaps.
  • There is ageneric heap implementation forC andC++ withD-ary heap andB-heap support. It provides an STL-like API.
  • The standard library of theD programming language includesstd.container.BinaryHeap, which is implemented in terms of D'sranges. Instances can be constructed from anyrandom-access range.BinaryHeap exposes aninput range interface that allows iteration with D's built-inforeach statements and integration with the range-based API of thestd.algorithm package.
  • ForHaskell there is theData.Heap module.
  • TheJava platform (since version 1.5) provides a binary heap implementation with the classjava.util.PriorityQueue in theJava Collections Framework. This class implements by default a min-heap; to implement a max-heap, programmer should write a custom comparator. There is no support for the replace, sift-up/sift-down, or decrease/increase-key operations.
  • Python has aheapq module that implements a priority queue using a binary heap. The library exposes a heapreplace function to support k-way merging.
  • PHP has both max-heap (SplMaxHeap) and min-heap (SplMinHeap) as of version 5.3 in the Standard PHP Library.
  • Perl has implementations of binary, binomial, and Fibonacci heaps in theHeap distribution available onCPAN.
  • TheGo language contains aheap package with heap algorithms that operate on an arbitrary type that satisfies a given interface. That package does not support the replace, sift-up/sift-down, or decrease/increase-key operations.
  • Apple'sCore Foundation library contains aCFBinaryHeap structure.
  • Pharo has an implementation of a heap in the Collections-Sequenceable package along with a set of test cases. A heap is used in the implementation of the timer event loop.
  • TheRust programming language has a binary max-heap implementation,BinaryHeap, in thecollections module of its standard library.
  • .NET hasPriorityQueue class which uses quaternary (d-ary) min-heap implementation. It is available from .NET 6.

See also

[edit]

References

[edit]
  1. ^Black (ed.), Paul E. (2004-12-14). Entry forheap inDictionary of Algorithms and Data Structures. Online version. U.S.National Institute of Standards and Technology, 14 December 2004. Retrieved on 2017-10-08 fromhttps://xlinux.nist.gov/dads/HTML/heap.html.
  2. ^CORMEN, THOMAS H. (2009).INTRODUCTION TO ALGORITHMS. United States of America: The MIT Press Cambridge, Massachusetts London, England. pp. 151–152.ISBN 978-0-262-03384-8.
  3. ^Williams, J. W. J. (1964), "Algorithm 232 - Heapsort",Communications of the ACM,7 (6):347–348,doi:10.1145/512274.512284
  4. ^The Python Standard Library, 8.4. heapq — Heap queue algorithm,heapq.heappush
  5. ^The Python Standard Library, 8.4. heapq — Heap queue algorithm,heapq.heappop
  6. ^The Python Standard Library, 8.4. heapq — Heap queue algorithm,heapq.heapreplace
  7. ^Suchenek, Marek A. (2012), "Elementary Yet Precise Worst-Case Analysis of Floyd's Heap-Construction Program",Fundamenta Informaticae,120 (1), IOS Press:75–92,doi:10.3233/FI-2012-751.
  8. ^abcdCormen, Thomas H.;Leiserson, Charles E.;Rivest, Ronald L. (1990).Introduction to Algorithms (1st ed.). MIT Press and McGraw-Hill.ISBN 0-262-03141-8.
  9. ^abcSleator, Daniel Dominic;Tarjan, Robert Endre (February 1986)."Self-Adjusting Heaps".SIAM Journal on Computing.15 (1):52–69.CiteSeerX 10.1.1.93.6678.doi:10.1137/0215004.ISSN 0097-5397.
  10. ^abTarjan, Robert (1983). "3.3. Leftist heaps".Data Structures and Network Algorithms. pp. 38–42.doi:10.1137/1.9781611970265.ISBN 978-0-89871-187-5.
  11. ^Hayward, Ryan; McDiarmid, Colin (1991)."Average Case Analysis of Heap Building by Repeated Insertion"(PDF).J. Algorithms.12:126–153.CiteSeerX 10.1.1.353.7888.doi:10.1016/0196-6774(91)90027-v. Archived fromthe original(PDF) on 2016-02-05. Retrieved2016-01-28.
  12. ^"Binomial Heap | Brilliant Math & Science Wiki".brilliant.org. Retrieved2019-09-30.
  13. ^abBrodal, Gerth Stølting; Okasaki, Chris (November 1996), "Optimal purely functional priority queues",Journal of Functional Programming,6 (6):839–857,doi:10.1017/s095679680000201x
  14. ^Okasaki, Chris (1998). "10.2. Structural Abstraction".Purely Functional Data Structures (1st ed.). pp. 158–162.ISBN 9780521631242.
  15. ^Takaoka, Tadao (1999),Theory of 2–3 Heaps(PDF), p. 12
  16. ^Iacono, John (2000), "Improved upper bounds for pairing heaps",Proc. 7th Scandinavian Workshop on Algorithm Theory(PDF), Lecture Notes in Computer Science, vol. 1851, Springer-Verlag, pp. 63–77,arXiv:1110.4428,CiteSeerX 10.1.1.748.7812,doi:10.1007/3-540-44985-X_5,ISBN 3-540-67690-2
  17. ^Fredman, Michael Lawrence (July 1999)."On the Efficiency of Pairing Heaps and Related Data Structures"(PDF).Journal of the Association for Computing Machinery.46 (4):473–501.doi:10.1145/320211.320214.
  18. ^Pettie, Seth (2005).Towards a Final Analysis of Pairing Heaps(PDF). FOCS '05 Proceedings of the 46th Annual IEEE Symposium on Foundations of Computer Science. pp. 174–183.CiteSeerX 10.1.1.549.471.doi:10.1109/SFCS.2005.75.ISBN 0-7695-2468-0.
  19. ^Haeupler, Bernhard; Sen, Siddhartha;Tarjan, Robert E. (November 2011)."Rank-pairing heaps"(PDF).SIAM J. Computing.40 (6):1463–1485.doi:10.1137/100785351.
  20. ^Fredman, Michael Lawrence;Tarjan, Robert E. (July 1987)."Fibonacci heaps and their uses in improved network optimization algorithms"(PDF).Journal of the Association for Computing Machinery.34 (3):596–615.CiteSeerX 10.1.1.309.8927.doi:10.1145/28869.28874.
  21. ^Brodal, Gerth Stølting; Lagogiannis, George;Tarjan, Robert E. (2012).Strict Fibonacci heaps(PDF). Proceedings of the 44th symposium on Theory of Computing - STOC '12. pp. 1177–1184.CiteSeerX 10.1.1.233.1740.doi:10.1145/2213977.2214082.ISBN 978-1-4503-1245-5.
  22. ^Brodal, Gerth S. (1996),"Worst-Case Efficient Priority Queues"(PDF),Proc. 7th Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 52–58
  23. ^Goodrich, Michael T.;Tamassia, Roberto (2004). "7.3.6. Bottom-Up Heap Construction".Data Structures and Algorithms in Java (3rd ed.). pp. 338–341.ISBN 0-471-46983-1.
  24. ^Frederickson, Greg N. (1993), "An Optimal Algorithm for Selection in a Min-Heap",Information and Computation(PDF), vol. 104, Academic Press, pp. 197–214,doi:10.1006/inco.1993.1030, archived fromthe original(PDF) on 2012-12-03, retrieved2010-10-31

External links

[edit]
Wikimedia Commons has media related toHeap data structures.
The WikibookData Structures has a page on the topic of:Min and Max Heaps
  • Heap at Wolfram MathWorld
  • Explanation of how the basic heap algorithms work
  • Bentley, Jon Louis (2000).Programming Pearls (2nd ed.). Addison Wesley. pp. 147–162.ISBN 0201657880.
Search trees
(dynamic sets,
associative arrays)
Heaps
Tries
Spatial data
partitioning trees
Other trees
Types
Abstract
Arrays
Linked
Trees
Graphs
Retrieved from "https://en.wikipedia.org/w/index.php?title=Heap_(data_structure)&oldid=1323368889"
Category:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp