Movatterモバイル変換

[0]ホーム

Jump to content

Segment tree

Edit links

From Wikipedia, the free encyclopedia

Not to be confused withinterval tree.

Computer science data structure

This article has multiple issues. Please helpimprove it or discuss these issues on thetalk page.(Learn how and when to remove these messages)

This articlerelies largely or entirely on asingle source. Relevant discussion may be found on thetalk page. Please helpimprove this article byintroducing citations to additional sources.
Find sources: "Segment tree" – news ·newspapers ·books ·scholar ·JSTOR(November 2007)

This articlemay be too technical for most readers to understand. Pleasehelp improve it tomake it understandable to non-experts, without removing the technical details.(October 2020) (Learn how and when to remove this message)

(Learn how and when to remove this message)

Graphic example of the structure of the segment tree. This instance is built for the segments shown at the bottom.

Incomputer science, thesegment tree is adata structure used for storing information aboutintervals or segments. It allows querying which of the stored segments contain a given point. A similar data structure is theinterval tree.

A segment tree for a setI ofn intervals usesO(n logn) storage and can be built inO(n logn) time. Segment trees support searching for all the intervals that contain a query point in timeO(logn +k),k being the number of retrieved intervals or segments.^[1]

Applications of the segment tree are in the areas ofcomputational geometry,geographic information systems andmachine learning.

The segment tree can be generalized to higherdimension spaces.

Definition

[edit]

Description

[edit]

LetI be a set of intervals, or segments. Letp₁,p₂, ...,p_m be the list of distinct interval endpoints, sorted from left to right. Consider the partitioning of the real line induced by those points. The regions of this partitioning are calledelementary intervals. Thus, the elementary intervals are, from left to right:

(-\infty ,p_{1}),[p_{1},p_{1}],(p_{1},p_{2}),[p_{2},p_{2}],\dots ,(p_{m-1},p_{m}),[p_{m},p_{m}],(p_{m},+\infty )

That is, the list of elementary intervals consists of open intervals between two consecutive endpointsp_i andp_i+1, alternated with closed intervals consisting of a single endpoint. Single points are treated themselves as intervals because the answer to a query is not necessarily the same at the interior of an elementary interval and its endpoints.^[2]

Given a setI of intervals, or segments, a segment treeT forI is structured as follows:

T is abinary tree.
Itsleaves correspond to the elementary intervals induced by the endpoints inI, in an ordered way: the leftmost leaf corresponds to the leftmost interval, and so on. The elementary interval corresponding to a leafv is denoted Int(v).
Theinternal nodes ofT correspond to intervals that are theunion of elementary intervals: the interval Int(N) corresponding to nodeN is the union of the intervals corresponding to the leaves of the tree rooted atN. That implies that Int(N) is the union of the intervals of its two children.
Each node or leafv inT stores the interval Int(v) and a set of intervals, in some data structure. This canonical subset of nodev contains the intervals [x,x′] fromI such that [x,x′] contains Int(v) and does not contain Int(parent(v)). That is, each node inT stores the segments that span through its interval, but do not span through the interval of its parent.^[3]

Construction

[edit]

A segment tree from the set of segmentsI, can be built as follows. First, the endpoints of the intervals inI are sorted. The elementary intervals are obtained from that. Then, a balanced binary tree is built on the elementary intervals, and for each nodev it is determined the interval Int(v) it represents. It remains to compute the canonical subsets for the nodes. To achieve this, the intervals inI are inserted one by one into the segment tree. An intervalX = [x,x′] can be inserted in a subtree rooted atT, using the following procedure:^[4]

If Int(T) is contained inX then storeX atT, and finish.
Else:
- IfX intersects the interval of the left child ofT, then insertX in that child, recursively.
- IfX intersects the interval of the right child ofT, then insertX in that child, recursively.

The complete construction operation takesO(n logn) time,n being the number of segments inI.

Proof

Sorting the endpoints takesO(n logn). Building a balanced binary tree from the sorted endpoints, takes linear time onn.

The insertion of an intervalX = [x,x′] into the tree, costs O(logn).

Proof

Visiting every node takes constant time (assuming that canonical subsets are stored in a simple data structure like alinked list). When we visit nodev, we either storeX atv, or Int(v) contains an endpoint ofX. As proved above, an interval is stored at most twice at each level of the tree. There is also at most one node at every level whose corresponding interval containsx, and one node whose interval containsx′. So, at most four nodes per level are visited. Since there areO(logn) levels, the total cost of the insertion isO(logn).^[1]

Query

[edit]

A query for a segment tree receives a pointq_x(should be one of the leaves of tree), and retrieves a list of all the segments stored which contain the pointq_x.

Formally stated; given a node (subtree)v and a query pointq_x, the query can be done using the following algorithm:

Report all the intervals inI(v).
Ifv is not a leaf:
- Ifq_x is in Int(left child ofv) then
  - Perform a query in the left child ofv.
- Ifq_x is in Int(right child ofv) then
  - Perform a query in the right child ofv.

In a segment tree that containsn intervals, those containing a given query point can be reported inO(logn +k) time, wherek is the number of reported intervals.

Proof

The query algorithm visits one node per level of the tree, soO(logn) nodes in total. On the other hand, at a nodev, the segments inI are reported inO(1 +k_v) time, wherek_v is the number of intervals at nodev, reported. The sum of all thek_v for all nodesv visited, isk, the number of reported segments.^[5]

Storage requirements

[edit]

A segment treeT on a setI ofn intervals usesO(n logn) storage.

Lemma— Any interval [x,x′] ofI is stored in the canonical set for at most two nodes at the same depth.

Proof

Letv₁,v₂,v₃ be the three nodes at the same depth, numbered from left to right; and let p(v) be the parent node of any given nodev. Suppose [x,x′] is stored atv₁ andv₃. This means that [x,x′] spans the whole interval from the left endpoint of Int(v₁) to the right endpoint of Int(v₃). Note that all segments at a particular level are non-overlapping and ordered from left to right: this is true by construction for the level containing the leaves, and the property is not lost when moving from any level to the one above it by combining pairs of adjacent segments. Now either parent(v₂) = parent(v₁), or the former is to the right of the latter (edges in the tree do not cross). In the first case, Int(parent(v₂))'s leftmost point is the same as Int(v₁)'s leftmost point; in the second case, Int(parent(v₂))'s leftmost point is to the right of Int(parent(v₁))'s rightmost point, and therefore also to the right of Int(v₁)'s rightmost point. In both cases, Int(parent(v₂)) begins at or to the right of Int(v₁)'s leftmost point. Similar reasoning shows that Int(parent(v₂)) ends at or to the left of Int(v₃)'s rightmost point. Int(parent(v₂)) must therefore be contained in [x,x′]; hence, [x,x′] will not be stored atv₂.

The setI has at most 4n + 1 elementary intervals. BecauseT is a binary balanced tree with at most 4n + 1 leaves, its height is O(logn). Since any interval is stored at most twice at a given depth of the tree, that the total amount of storage isO(n logn).^[5]

Generalization for higher dimensions

[edit]

The segment tree can be generalized to higher dimension spaces, in the form of multi-level segment trees. In higher dimensional versions, the segment tree stores a collection of axis-parallel (hyper-)rectangles, and can retrieve the rectangles that contain a given query point. The structure usesO(n log^dn) storage, and answers queries inO(log^dn) time.

The use offractional cascading lowers the query time bound by a logarithmic factor. The use of theinterval tree on the deepest level of associated structures lowers the storage bound by a logarithmic factor.^[6]

Notes

[edit]

A query that asks for all the intervals containing a given point is often referred as astabbing query.^[7]

The segment tree is less efficient than theinterval tree for range queries in one dimension, due to its higher storage requirement:O(n logn) against the O(n) of the interval tree. The importance of the segment tree is that the segments within each node’s canonical subset can be stored in any arbitrary manner.^[7]

Forn intervals whose endpoints are in a small integer range (e.g., in the range [1,...,O(n)]), optimal data structures^[which?] exist with a linear preprocessing time and query timeO(1 +k) for reporting allk intervals containing a given query point.

Another advantage of the segment tree is that it can easily be adapted to counting queries; that is, to report the number of segments containing a given point, instead of reporting the segments themselves. Instead of storing the intervals in the canonical subsets, it can simply store the number of them. Such a segment tree uses linear storage, and requires anO(logn) query time, so it is optimal.^[8]

Higher dimensional versions of the interval tree and thepriority search tree do not exist; that is, there is no clear extension of these structures that solves the analogous problem in higher dimensions. But the structures can be used as associated structure of segment trees.^[6]

History

[edit]

This sectionneeds expansion. You can help byadding missing information.(November 2007)

The segment tree was invented byJon Bentley in 1977; in "Solutions to Klee’s rectangle problems".^[7]

References

[edit]

^^a ^b(de Berg et al. 2000, p. 227)
^(de Berg et al. 2000, p. 224)
^(de Berg et al. 2000, pp. 225–226)
^(de Berg et al. 2000, pp. 226–227)
^^a ^b(de Berg et al. 2000, p. 226)
^^a ^b(de Berg et al. 2000, p. 230)
^^a ^b ^c(de Berg et al. 2000, p. 229)
^(de Berg et al. 2000, pp. 229–230)

Sources cited

[edit]

de Berg, Mark; van Kreveld, Marc; Overmars, Mark; Schwarzkopf, Otfried (2000). "More Geometric Data Structures".Computational Geometry: algorithms and applications (2nd ed.). Springer-Verlag Berlin Heidelberg New York.doi:10.1007/978-3-540-77974-2.ISBN 3-540-65620-0.
http://www.cs.nthu.edu.tw/~wkhon/ds/ds10/tutorial/tutorial6.pdf

External links

[edit]

Segment Tree – CP-Algorithms

v t e Tree data structures
Search trees (dynamic sets, associative arrays)	2–3 2–3–4 AA (a,b) AVL B K-Dimensional B+ B* B^x Binary search Optimal Self-balancing Dancing HTree Interval Order statistic Palindrome (Left-leaning) Red–black Scapegoat Splay T Treap UB Weight-balanced
Heaps	Binary Binomial Brodal d-ary Fibonacci Leftist Pairing Skew binomial Skew van Emde Boas Weak
Tries	Ctrie C-trie (compressed ADT) Hash Radix Suffix Ternary search X-fast Y-fast
Spatial data partitioning trees	Ball BK BSP Cartesian Hilbert R k-d (implicitk-d) M Metric MVP Octree PH Priority R Quad R R+ R* Segment VP X
Other trees	Cover Exponential Fenwick Finger Fractal index Fusion Hash calendar iDistance K-ary Left-child right-sibling Link/cut Log-structured merge Merkle PQ Range SPQR Top

Retrieved from "https://en.wikipedia.org/w/index.php?title=Segment_tree&oldid=1228553579"

Categories:

Hidden categories:

[8]ページ先頭