Inmathematics andcomputer science,graph theory is the study ofgraphs, which aremathematical structures used to model pairwise relations between objects. A graph in this context is made up ofvertices (also callednodes orpoints) which are connected byedges (also calledarcs,links orlines). A distinction is made betweenundirected graphs, where edges link two vertices symmetrically, anddirected graphs, where edges link two vertices asymmetrically. Graphs are one of the principal objects of study indiscrete mathematics.
, aset ofedges (also calledlinks orlines), which areunordered pairs of vertices (that is, an edge is associated with two distinct vertices).
To avoid ambiguity, this type of object may be called anundirected simple graph.
In the edge, the vertices and are called theendpoints of the edge. The edge is said tojoin and and to beincident on and on. A vertex may exist in a graph and not belong to an edge. Under this definition,multiple edges, in which two or more edges connect the same vertices, are not allowed.
Example of an undirected multigraph with 3 vertices, 3 edges and 4 loops.
For vertices A,B,C and D, the degrees are respectively 4,4,5,1
For vertices U,V,W and X, the degrees are 2,2,3 and 1 respectively.
Examples of finding the degree of vertices.
In one more general sense of the term allowing multiple edges,[3][4] agraph is an ordered triple comprising:
, anincidence function mapping every edge to anunordered pair of vertices (that is, an edge is associated with two distinct vertices).
To avoid ambiguity, this type of object may be called anundirectedmultigraph.
Aloop is an edge that joins a vertex to itself. Graphs as defined in the two definitions above cannot have loops, because a loop joining a vertex to itself is the edge (for an undirected simple graph) or is incident on (for an undirected multigraph) which is not in. To allow loops, the definitions must be expanded. For undirected simple graphs, the definition of should be modified to. For undirected multigraphs, the definition of should be modified to. To avoid ambiguity, these types of objects may be calledundirected simple graph permitting loops andundirected multigraph permitting loops (sometimes alsoundirectedpseudograph), respectively.
and are usually taken to be finite, and many of the well-known results are not true (or are rather different) for infinite graphs because many of the arguments fail in theinfinite case. Moreover, is often assumed to be non-empty, but is allowed to be the empty set. Theorder of a graph is, its number of vertices. Thesize of a graph is, its number of edges. Thedegree orvalency of a vertex is the number of edges that are incident to it, where a loop is counted twice. Thedegree of a graph is the maximum of the degrees of its vertices.
In an undirected simple graph of ordern, the maximum degree of each vertex isn − 1 and the maximum size of the graph isn(n − 1)/2.
The edges of an undirected simple graph permitting loops induce a symmetrichomogeneous relation on the vertices of that is called theadjacency relation of. Specifically, for each edge, its endpoints and are said to beadjacent to one another, which is denoted.
, aset ofedges (also calleddirected edges,directed links,directed lines,arrows orarcs) which areordered pairs of vertices (that is, an edge is associated with two distinct vertices).
To avoid ambiguity, this type of object may be called adirected simple graph. In set theory and graph theory, denotes the set ofn-tuples of elements of that is, ordered sequences of elements that are not necessarily distinct.
In the edge directed from to, the vertices and are called theendpoints of the edge, thetail of the edge and thehead of the edge. The edge is said tojoin and and to beincident on and on. A vertex may exist in a graph and not belong to an edge. The edge is called theinverted edge of.Multiple edges, not allowed under the definition above, are two or more edges with both the same tail and the same head.
In one more general sense of the term allowing multiple edges,[5] adirected graph is an ordered triple comprising:
, aset ofedges (also calleddirected edges,directed links,directed lines,arrows orarcs);
, anincidence function mapping every edge to anordered pair of vertices (that is, an edge is associated with two distinct vertices).
To avoid ambiguity, this type of object may be called adirected multigraph.
Aloop is an edge that joins a vertex to itself. Directed graphs as defined in the two definitions above cannot have loops, because a loop joining a vertex to itself is the edge (for a directed simple graph) or is incident on (for a directed multigraph) which is not in. So to allow loops the definitions must be expanded. For directed simple graphs, the definition of should be modified to. For directed multigraphs, the definition of should be modified to. To avoid ambiguity, these types of objects may be called precisely adirected simple graph permitting loops and adirected multigraph permitting loops (or aquiver) respectively.
The edges of a directed simple graph permitting loops is ahomogeneous relation ~ on the vertices of that is called theadjacency relation of. Specifically, for each edge, its endpoints and are said to beadjacent to one another, which is denoted ~.
The network graph formed by Wikipedia editors (edges) contributing to different Wikipedia language versions (vertices) during one month in summer 2013.[6]
Graphs can be used to model many types of relations and processes in physical, biological,[7][8] social and information systems.[9] Many practical problems can be represented by graphs. Emphasizing their application to real-world systems, the termnetwork is sometimes defined to mean a graph in which attributes (e.g. names) are associated with the vertices and edges, and the subject that expresses and understands real-world systems as a network is callednetwork science.
Withincomputer science, 'causal' and 'non-causal' linked structures are graphs that are used to represent networks of communication, data organization, computational devices, the flow of computation, etc. For instance, the link structure of awebsite can be represented by a directed graph, in which the vertices(nodes) represent web pages and directed edges representlinks from one page to another. A similar approach can be taken to problems in social media,[10] travel, biology, computer chip design, mapping the progression of neuro-degenerative diseases,[11][12] and many other fields. The development ofalgorithms tohandle graphs is therefore of major interest in computer science. Thetransformation of graphs is often formalized and represented bygraph rewrite systems. Complementary tograph transformation systems focusing on rule-based in-memory manipulation of graphs aregraph databases geared towardstransaction-safe,persistent storing and querying ofgraph-structured data.
Graph-theoretic methods, in various forms, have proven particularly useful inlinguistics, since natural language often lends itself well to discrete structure. Traditionally,syntax and compositional semantics follow tree-based structures, whose expressive power lies in theprinciple of compositionality, modeled in a hierarchical graph. More contemporary approaches such ashead-driven phrase structure grammar model the syntax of natural language usingtyped feature structures, which aredirected acyclic graphs. Withinlexical semantics, especially as applied to computers, modeling word meaning is easier when a given word is understood in terms of related words;semantic networks are therefore important incomputational linguistics. Still, other methods in phonology (e.g.optimality theory, which useslattice graphs) and morphology (e.g. finite-state morphology, usingfinite-state transducers) are common in the analysis of language as a graph. Indeed, the usefulness of this area of mathematics to linguistics has borne organizations such asTextGraphs, as well as various 'Net' projects, such asWordNet,VerbNet, and others.
Graph theory is also used to study molecules inchemistry andphysics. Incondensed matter physics, the three-dimensional structure of complicated simulated atomic structures can be studied quantitatively by gathering statistics on graph-theoretic properties related to the topology of the atoms. Also, "theFeynman graphs and rules of calculation summarizequantum field theory in a form in close contact with the experimental numbers one wants to understand."[13] In chemistry a graph makes a natural model for a molecule, where vertices representatoms and edgesbonds. This approach is especially used in computer processing of molecular structures, ranging fromchemical editors to database searching. Instatistical physics, graphs can represent local connections between interacting parts of a system, as well as the dynamics of a physical process on suchsystems. Similarly, incomputational neuroscience graphs can be used to represent functional connections between brain areas that interact to give rise to various cognitive processes, where the vertices represent different areas of the brain and the edges represent the connections between those areas. Graph theory plays an important role in electrical modeling of electrical networks, here, weights are associated with resistance of the wire segments to obtain electrical properties of network structures.[14] Graphs are also used to represent the micro-scale channels ofporous media, in which the vertices represent the pores and the edges represent the smaller channels connecting the pores.Chemical graph theory uses themolecular graph as a means to model molecules.Graphs and networks are excellent models to study and understand phase transitions and critical phenomena.Removal of nodes or edges leads to a critical transition where the network breaks into small clusters which is studied as a phase transition. This breakdown is studied viapercolation theory.[15]
Graph theory is also widely used insociology as a way, for example, tomeasure actors' prestige or to explorerumor spreading, notably through the use ofsocial network analysis software. Under the umbrella of social networks are many different types of graphs.[17] Acquaintanceship and friendship graphs describe whether people know each other. Influence graphs model whether certain people can influence the behavior of others. Finally, collaboration graphs model whether two people work together in a particular way, such as acting in a movie together.
Likewise, graph theory is useful inbiology and conservation efforts where a vertex can represent regions where certain species exist (or inhabit) and the edges represent migration paths or movement between the regions. This information is important when looking at breeding patterns or tracking the spread of disease, parasites or how changes to the movement can affect other species.
Graphs are also commonly used inmolecular biology andgenomics to model and analyse datasets with complex relationships. For example, graph-based methods are often used to 'cluster' cells together into cell-types insingle-cell transcriptome analysis. Another use is to model genes or proteins in apathway and study the relationships between them, such as metabolic pathways and gene regulatory networks.[18] Evolutionary trees, ecological networks, and hierarchical clustering of gene expression patterns are also represented as graph structures.
Graph theory is also used inconnectomics;[19] nervous systems can be seen as a graph, where the nodes are neurons and the edges are the connections between them.
In mathematics, graphs are useful in geometry and certain parts oftopology such asknot theory.Algebraic graph theory has close links withgroup theory. Algebraic graph theory has been applied to many areas including dynamic systems and complexity.
A graph structure can be extended by assigning a weight to each edge of the graph. Graphs with weights, orweighted graphs, are used to represent structures in which pairwise connections have some numerical values. For example, if a graph represents a road network, the weights could represent the length of each road. There may be several weights associated with each edge, including distance (as in the previous example), travel time, or monetary cost. Such weighted graphs are commonly used to program GPS's, and travel-planning search engines that compare flight times and costs.
The paper written byLeonhard Euler on theSeven Bridges of Königsberg and published in 1736 is regarded as the first paper in the history of graph theory.[20] This paper, as well as the one written byVandermonde on theknight problem, carried on with theanalysis situs initiated byLeibniz. Euler's formula relating the number of edges, vertices, and faces of a convex polyhedron was studied and generalized byCauchy[21] andL'Huilier,[22] and represents the beginning of the branch of mathematics known astopology.
More than one century after Euler's paper on the bridges ofKönigsberg and whileListing was introducing the concept of topology,Cayley was led by an interest in particular analytical forms arising fromdifferential calculus to study a particular class of graphs, thetrees.[23] This study had many implications for theoreticalchemistry. The techniques he used mainly concern theenumeration of graphs with particular properties. Enumerative graph theory then arose from the results of Cayley and the fundamental results published byPólya between 1935 and 1937. These were generalized byDe Bruijn in 1959. Cayley linked his results on trees with contemporary studies of chemical composition.[24] The fusion of ideas from mathematics with those from chemistry began what has become part of the standard terminology of graph theory.
In particular, the term "graph" was introduced bySylvester in a paper published in 1878 inNature, where he draws an analogy between "quantic invariants" and "co-variants" of algebra and molecular diagrams:[25]
"[…] Every invariant and co-variant thus becomes expressible by agraph precisely identical with aKekuléan diagram or chemicograph. […] I give a rule for the geometrical multiplication of graphs,i.e. for constructing agraph to the product of in- or co-variants whose separate graphs are given. […]" (italics as in the original).
The first textbook on graph theory was written byDénes Kőnig, and published in 1936.[26] Another book byFrank Harary, published in 1969, was "considered the world over to be the definitive textbook on the subject",[27] and enabled mathematicians, chemists, electrical engineers and social scientists to talk to each other. Harary donated all of the royalties to fund thePólya Prize.[28]
One of the most famous and stimulating problems in graph theory is thefour color problem: "Is it true that any map drawn in the plane may have its regions colored with four colors, in such a way that any two regions having a common border have different colors?" This problem was first posed byFrancis Guthrie in 1852 and its first written record is in a letter ofDe Morgan addressed toHamilton the same year. Many incorrect proofs have been proposed, including those by Cayley,Kempe, and others. The study and the generalization of this problem byTait,Heawood,Ramsey andHadwiger led to the study of the colorings of the graphs embedded on surfaces with arbitrarygenus. Tait's reformulation generated a new class of problems, thefactorization problems, particularly studied byPetersen andKőnig. The works of Ramsey on colorations and more specially the results obtained byTurán in 1941 was at the origin of another branch of graph theory,extremal graph theory.
The four color problem remained unsolved for more than a century. In 1969Heinrich Heesch published a method for solving the problem using computers.[29] A computer-aided proof produced in 1976 byKenneth Appel andWolfgang Haken makes fundamental use of the notion of "discharging" developed by Heesch.[30][31] The proof involved checking the properties of 1,936 configurations by computer, and was not fully accepted at the time due to its complexity. A simpler proof considering only 633 configurations was given twenty years later byRobertson,Seymour,Sanders andThomas.[32]
The autonomous development of topology from 1860 and 1930 fertilized graph theory back through the works ofJordan,Kuratowski andWhitney. Another important factor of common development of graph theory andtopology came from the use of the techniques of modern algebra. The first example of such a use comes from the work of the physicistGustav Kirchhoff, who published in 1845 hisKirchhoff's circuit laws for calculating thevoltage andcurrent inelectric circuits.
The introduction of probabilistic methods in graph theory, especially in the study ofErdős andRényi of the asymptotic probability of graph connectivity, gave rise to yet another branch, known asrandom graph theory, which has been a fruitful source of graph-theoretic results.
A graph is an abstraction of relationships that emerge in nature; hence, it cannot be coupled to a certain representation. The way it is represented depends on the degree of convenience such representation provides for a certain application. The most common representations are the visual, in which, usually, vertices are drawn and connected by edges, and the tabular, in which rows of a table provide information about the relationships between the vertices within the graph.
Graphs are usually represented visually by drawing a point or circle for every vertex, and drawing a line between two vertices if they are connected by an edge. If the graph is directed, the direction is indicated by drawing an arrow. If the graph is weighted, the weight is added on the arrow.
A graph drawing should not be confused with the graph itself (the abstract, non-visual structure) as there are several ways to structure the graph drawing. All that matters is which vertices are connected to which others by how many edges and not the exact layout. In practice, it is often difficult to decide if two drawings represent the same graph. Depending on the problem domain some layouts may be better suited and easier to understand than others.
The pioneering work ofW. T. Tutte was very influential on the subject of graph drawing. Among other achievements, he introduced the use of linear algebraic methods to obtain graph drawings.
Graph drawing also can be said to encompass problems that deal with thecrossing number and its various generalizations. The crossing number of a graph is the minimum number of intersections between edges that a drawing of the graph in the plane must contain. For aplanar graph, the crossing number is zero by definition. Drawings on surfaces other than the plane are also studied.
The tabular representation lends itself well to computational applications. There are different ways to store graphs in a computer system. Thedata structure used depends on both the graph structure and thealgorithm used for manipulating the graph. Theoretically one can distinguish between list and matrix structures but in concrete applications the best structure is often a combination of both. List structures are often preferred forsparse graphs as they have smaller memory requirements.Matrix structures on the other hand provide faster access for some applications but can consume huge amounts of memory. Implementations of sparse matrix structures that are efficient on modern parallel computer architectures are an object of current investigation.[33]
List structures include theedge list, an array of pairs of vertices, and theadjacency list, which separately lists the neighbors of each vertex: Much like the edge list, each vertex has a list of which vertices it is adjacent to.
Matrix structures include theincidence matrix, a matrix of 0's and 1's whose rows represent vertices and whose columns represent edges, and theadjacency matrix, in which both the rows and columns are indexed by vertices. In both cases a 1 indicates two adjacent objects and a 0 indicates two non-adjacent objects. Thedegree matrix indicates the degree of vertices. TheLaplacian matrix is a modified form of the adjacency matrix that incorporates information about thedegrees of the vertices, and is useful in some calculations such asKirchhoff's theorem on the number ofspanning trees of a graph.Thedistance matrix, like the adjacency matrix, has both its rows and columns indexed by vertices, but rather than containing a 0 or a 1 in each cell it contains the length of ashortest path between two vertices.
There is a large literature ongraphical enumeration: the problem of counting graphs meeting specified conditions. Some of this work is found in Harary and Palmer (1973).
A common problem, called thesubgraph isomorphism problem, is finding a fixed graph as asubgraph in a given graph. One reason to be interested in such a question is that manygraph properties arehereditary for subgraphs, which means that a graph has the property if and only if all subgraphs have it too.Finding maximal subgraphs of a certain kind is often anNP-complete problem. For example:
Finding the largest complete subgraph is called theclique problem (NP-complete).
One special case of subgraph isomorphism is thegraph isomorphism problem. It asks whether two graphs are isomorphic. It is not known whether this problem is NP-complete, nor whether it can be solved in polynomial time.
A similar problem is findinginduced subgraphs in a given graph. Again, some important graph properties are hereditary with respect to induced subgraphs, which means that a graph has a property if and only if all induced subgraphs also have it. Finding maximal induced subgraphs of a certain kind is also often NP-complete. For example:
Still another such problem, the minor containment problem, is to find a fixed graph as a minor of a given graph. Aminor or subcontraction of a graph is any graph obtained by taking a subgraph and contracting some (or no) edges. Many graph properties are hereditary for minors, which means that a graph has a property if and only if all minors have it too. For example,Wagner's Theorem states:
A similar problem, the subdivision containment problem, is to find a fixed graph as asubdivision of a given graph. Asubdivision orhomeomorphism of a graph is any graph obtained by subdividing some (or no) edges. Subdivision containment is related to graph properties such asplanarity. For example,Kuratowski's Theorem states:
Another class of problems has to do with the extent to which various species and generalizations of graphs are determined by theirpoint-deleted subgraphs. For example:
Many problems and theorems in graph theory have to do with various ways of coloring graphs. Typically, one is interested in coloring a graph so that no two adjacent vertices have the same color, or with other similar restrictions. One may also consider coloring edges (possibly so that no two coincident edges are the same color), or other variations. Among the famous results and conjectures concerning graph coloring are the following:
Constraint modeling theories concern families of directed graphs related by apartial order. In these applications, graphs are ordered by specificity, meaning that more constrained graphs—which are more specific and thus contain a greater amount of information—are subsumed by those that are more general. Operations between graphs include evaluating the direction of a subsumption relationship between two graphs, if any, and computing graph unification. The unification of two argument graphs is defined as the most general graph (or the computation thereof) that is consistent with (i.e. contains all of the information in) the inputs, if such a graph exists; efficient unification algorithms are known.
Decomposition, defined as partitioning the edge set of a graph (with as many vertices as necessary accompanying the edges of each part of the partition), has a wide variety of questions. Often, the problem is to decompose a graph into subgraphs isomorphic to a fixed graph; for instance, decomposing a complete graph into Hamiltonian cycles. Other problems specify a family of graphs into which a given graph should be decomposed, for instance, a family of cycles, or decomposing a complete graphKn inton − 1 specified trees having, respectively, 1, 2, 3, ...,n − 1 edges.
Some specific decomposition problems and similar problems that have been studied include:
Arboricity, a decomposition into as few forests as possible
Cycle double cover, a collection of cycles covering each edge exactly twice
^Adali, Tulay; Ortega, Antonio (May 2018). "Applications of Graph Theory [Scanning the Issue]".Proceedings of the IEEE.106 (5):784–786.doi:10.1109/JPROC.2018.2820300.ISSN0018-9219.
^Vecchio, F (2017). ""Small World" architecture in brain connectivity and hippocampal volume in Alzheimer's disease: a study via graph theory from EEG data".Brain Imaging and Behavior.11 (2):473–485.doi:10.1007/s11682-016-9528-3.PMID26960946.S2CID3987492.
^Kumar, Ankush; Kulkarni, G. U. (2016-01-04). "Evaluating conducting network based transparent electrodes from geometrical considerations".Journal of Applied Physics.119 (1): 015102.Bibcode:2016JAP...119a5102K.doi:10.1063/1.4939280.ISSN0021-8979.
^Appel, K.; Haken, W. (1977), "Every planar map is four colorable. Part II. Reducibility",Illinois J. Math.,21 (3):491–567,doi:10.1215/ijm/1256049012.
^Robertson, N.; Sanders, D.; Seymour, P.; Thomas, R. (1997), "The four color theorem",Journal of Combinatorial Theory, Series B,70:2–44,doi:10.1006/jctb.1997.1750.
Lowell W. Beineke; Bjarne Toft; and Robin J. Wilson:Milestones in Graph Theory: A Century of Progress, AMS/MAA, (SPECTRUM, v.108), ISBN 978-1-4704-6431-8 (2025).
Berge, Claude (1958).Théorie des graphes et ses applications. Paris: Dunod. English edition, Wiley 1961; Methuen & Co, New York 1962; Russian, Moscow 1961; Spanish, Mexico 1962; Roumanian, Bucharest 1969; Chinese, Shanghai 1963; Second printing of the 1962 first English edition, Dover, New York 2001.
Biggs, N.; Lloyd, E.; Wilson, R. (1986).Graph Theory, 1736–1936. Oxford University Press.
Bollobás, Béla; Riordan, O. M. (2003).Mathematical results on scale-free random graphs in "Handbook of Graphs and Networks" (S. Bornholdt and H.G. Schuster (eds)) (1st ed.). Weinheim: Wiley VCH.