Movatterモバイル変換

[0]ホーム

Jump to content

P versus NP problem

Edit links

From Wikipedia, the free encyclopedia

Unsolved problem in computer science

If the solution to a problem can be checked in polynomial time, must the problem be solvable in polynomial time?

More unsolved problems in computer science

Millennium Prize Problems
Birch and Swinnerton-Dyer conjecture Hodge conjecture Navier–Stokes existence and smoothness P versus NP problem Poincaré conjecture (solved) Riemann hypothesis Yang–Mills existence and mass gap
v t e

TheP versus NP problem is a majorunsolved problem intheoretical computer science. Informally, it asks whether every problem whose solution can be quickly verified can also be quickly solved.

Here, "quickly" means an algorithm exists that solves the task and runs inpolynomial time (as opposed to, say,exponential time), meaning the task completion time isbounded above by apolynomial function on the size of the input to the algorithm. The general class of questions that somealgorithm can answer in polynomial time is "P" or "class P". For some questions, there is no known way to find an answer quickly, but if provided with an answer, it can be verified quickly. The class of questions where an answer can beverified in polynomial time is "NP", standing for "nondeterministic polynomial time".^{[Note 1]}

An answer to the P versus NP question would determine whether problems that can be verified in polynomial time can also be solved in polynomial time. If P ≠ NP, which is widely believed, it would mean that there are problems in NP that are harder to compute than to verify: they could not be solved in polynomial time, but the answer could be verified in polynomial time.

The problem has been called the most important open problem incomputer science.^[1] Aside from being an important problem incomputational theory, a proof either way would have profound implications for mathematics,cryptography, algorithm research,artificial intelligence,game theory, multimedia processing,philosophy,economics and many other fields.^[2]

It is one of the sevenMillennium Prize Problems selected by theClay Mathematics Institute, each of which carries a US$1,000,000 prize for the first correct solution.

Example

Consider the following yes/no problem: given an incompleteSudoku grid of size $n^{2}\times n^{2}$ , is there at least one legal solution where every row, column, and $n\times n$ square contains the integers 1 through $n^{2}$ ? It is straightforward to verify "yes" instances of this generalized Sudoku problem given a candidate solution. However, it is not known whether there is a polynomial-time algorithm that can correctly answer "yes" or "no" to all instances of this problem. Therefore, generalized Sudoku is in NP (quickly verifiable), but may or may not be in P (quickly solvable). (It is necessary to consider a generalized version of Sudoku, as any fixed size Sudoku has only a finite number of possible grids. In this case the problem is in P, as the answer can be found by table lookup.)

History

The precise statement of the P versus NP problem was introduced in 1971 byStephen Cook in his seminal paper "The complexity of theorem proving procedures",^[3] and independently byLeonid Levin in 1973.^[4]

Although the P versus NP problem was formally defined in 1971, there were previous inklings of the underlying problems involved. In 1955, mathematicianJohn Nash wrote a letter to theNational Security Agency, speculating that the time required to crack a sufficiently complex code would increase exponentially with the length of the key.^[5] If proved (and Nash was skeptical), this would imply what is now called P ≠ NP, since a proposed key can be verified in polynomial time. In a 1956 letter written byKurt Gödel toJohn von Neumann, Gödel asked whether theorem-proving (now known to beco-NP-complete) could be solved inquadratic orlinear time,^[6] and posited that if so, then the discovery of mathematical proofs could be automated.

Context

The relation between thecomplexity classes P and NP is studied incomputational complexity theory, the part of thetheory of computation dealing with the resources required during computation to solve a given problem. The most common resources are time (how many steps it takes to solve a problem) and space (how much memory it takes to solve a problem).

In such analysis, a model of the computer for which time must be analyzed is required. Typically such models assume that the computer isdeterministic (given the computer's present state and any inputs, there is only one possible action that the computer might take) andsequential (it performs actions one after the other).

In this theory, the class P consists of alldecision problems (definedbelow) solvable on a deterministic sequential machine in a durationpolynomial in the size of the input; the classNP consists of all decision problems whose positive solutions are verifiable inpolynomial time given the right information, or equivalently, whose solution can be found in polynomial time on anon-deterministic machine.^[7] Clearly, P ⊆ NP. Arguably, the biggest open question intheoretical computer science concerns the relationship between those two classes:

Is P equal to NP?

Since 2002,William Gasarch has conducted three polls of researchers concerning this and related questions.^[8]^[9]^[10] Confidence that P ≠ NP has been increasing – in 2002, 61% believed P ≠ NP, as opposed to 83% in 2012 and 88% in 2019. When restricted to experts, the 2019 answers became 99% believed P ≠ NP.^[10] These polls do not imply whether P = NP, Gasarch himself stated: "This does not bring us any closer to solving P=?NP or to knowing when it will be solved, but it attempts to be an objective report on the subjective opinion of this era."

NP-completeness

Euler diagram forP,NP, NP-complete, and NP-hard set of problems (excluding the empty language and its complement, which belong to P but are not NP-complete)

Main article:NP-completeness

To attack the P = NP question, the concept of NP-completeness is very useful. NP-complete problems are problems that any other NP problem is reducible to in polynomial time and whose solution is still verifiable in polynomial time. That is, any NP problem can be transformed into any NP-complete problem. Informally, an NP-complete problem is an NP problem that is at least as "tough" as any other problem in NP.

NP-hard problems are those at least as hard as NP problems; i.e., all NP problems can be reduced (in polynomial time) to them. NP-hard problems need not be in NP; i.e., they need not have solutions verifiable in polynomial time.

For instance, theBoolean satisfiability problem is NP-complete by theCook–Levin theorem, soany instance ofany problem in NP can be transformed mechanically into a Boolean satisfiability problem in polynomial time. The Boolean satisfiability problem is one of many NP-complete problems. If any NP-complete problem is in P, then it would follow that P = NP. However, many important problems are NP-complete, and no fast algorithm for any of them is known.

From the definition alone it is unintuitive that NP-complete problems exist; however, a trivial NP-complete problem can be formulated as follows: given aTuring machineM guaranteed to halt in polynomial time, does a polynomial-size input thatM will accept exist?^[11] It is in NP because (given an input) it is simple to check whetherM accepts the input by simulatingM; it is NP-complete because the verifier for any particular instance of a problem in NP can be encoded as a polynomial-time machineM that takes the solution to be verified as input. Then the question of whether the instance is a yes or no instance is determined by whether a valid input exists.

The first natural problem proven to be NP-complete was the Boolean satisfiability problem, also known as SAT. As noted above, this is the Cook–Levin theorem; its proof that satisfiability is NP-complete contains technical details about Turing machines as they relate to the definition of NP. However, after this problem was proved to be NP-complete,proof by reduction provided a simpler way to show that many other problems are also NP-complete, including the game Sudoku discussed earlier. In this case, the proof shows that a solution of Sudoku in polynomial time could also be used to completeLatin squares in polynomial time.^[12] This in turn gives a solution to the problem of partitioningtri-partite graphs into triangles,^[13] which could then be used to find solutions for the special case of SAT known as 3-SAT,^[14] which then provides a solution for general Boolean satisfiability. So a polynomial-time solution to Sudoku leads, by a series of mechanical transformations, to a polynomial time solution of satisfiability, which in turn can be used to solve any other NP-problem in polynomial time. Using transformations like this, a vast class of seemingly unrelated problems are all reducible to one another, and are in a sense "the same problem".

Harder problems

Problems in NP not known to be in P or NP-complete

Main article:NP-intermediate

In 1975,Richard E. Ladner showed that if P ≠ NP, then there exist problems in NP that are neither in P nor NP-complete.^[19] Such problems are called NP-intermediate problems. Thegraph isomorphism problem, thediscrete logarithm problem, and theinteger factorization problem are examples of problems believed to be NP-intermediate. They are some of the very few NP problems not known to be in P or to be NP-complete.

The graph isomorphism problem is the computational problem of determining whether two finitegraphs areisomorphic. An important unsolved problem in complexity theory is whether the graph isomorphism problem is in P, NP-complete, or NP-intermediate. The answer is not known, but it is believed that the problem is at least not NP-complete.^[20] If graph isomorphism is NP-complete, thepolynomial time hierarchy collapses to its second level.^[21] Since it is widely believed that the polynomial hierarchy does not collapse to any finite level, it is believed that graph isomorphism is not NP-complete. The best algorithm for this problem, due toLászló Babai, runs inquasi-polynomial time.^[22]

The integer factorization problem is the computational problem of determining theprime factorization of a given integer. Phrased as a decision problem, it is the problem of deciding whether the input has a factor less thank. No efficient integer factorization algorithm is known, and this fact forms the basis of several modern cryptographic systems, such as theRSA algorithm. The integer factorization problem is in NP and inco-NP (and even inUP and co-UP^[23]). If the problem is NP-complete, the polynomial time hierarchy will collapse to its first level (i.e., NP = co-NP). The mostefficient known algorithm for integer factorization is thegeneral number field sieve, which takes expected time

O\left(\exp \left(\left({\tfrac {64n}{9}}\log(2)\right)^{\frac {1}{3}}\left(\log(n\log(2))\right)^{\frac {2}{3}}\right)\right)

to factor ann-bit integer. The best knownquantum algorithm for this problem,Shor's algorithm, runs in polynomial time, although this does not indicate where the problem lies with respect to non-quantum complexity classes.

Comparison of P with "easy" problems

The graph shows the running time vs. problem size for aknapsack problem of a state-of-the-art, specialized algorithm. Thequadratic fit suggests that the algorithmic complexity of the problem is O((log(n))²).^[24]

All of the above discussion has assumed that P means "easy" and "not in P" means "difficult", an assumption known asCobham's thesis. It is a common assumption in complexity theory; but there are caveats.

First, it can be false in practice. A theoretical polynomial algorithm may have extremely large constant factors or exponents, rendering it impractical. For example, the problem ofdeciding whether a graphG containsH as aminor, whereH is fixed, can be solved in a running time ofO(n²),^[25] wheren is the number of vertices inG. However, thebig O notation hides a constant that depends superexponentially onH. The constant is greater than $2\uparrow \uparrow (2\uparrow \uparrow (2\uparrow \uparrow (h/2)))$ (usingKnuth's up-arrow notation), and whereh is the number of vertices inH.^[26]

On the other hand, even if a problem is shown to be NP-complete, and even if P ≠ NP, there may still be effective approaches to the problem in practice. There are algorithms for many NP-complete problems, such as theknapsack problem, thetraveling salesman problem, and theBoolean satisfiability problem, that can solve to optimality many real-world instances in reasonable time. The empiricalaverage-case complexity (time vs. problem size) of such algorithms can be surprisingly low. An example is thesimplex algorithm inlinear programming, which works surprisingly well in practice; despite having exponential worst-casetime complexity, it runs on par with the best known polynomial-time algorithms.^[27]

Finally, there are types of computations which do not conform to the Turing machine model on which P and NP are defined, such asquantum computation andrandomized algorithms.

Reasons to believe P ≠ NP or P = NP

Cook provides a restatement of the problem inThe P Versus NP Problem as "Does P = NP?"^[28] According to polls,^[8]^[29] most computer scientists believe that P ≠ NP. A key reason for this belief is that after decades of studying these problems no one has been able to find a polynomial-time algorithm for any of more than 3,000 important known NP-complete problems (seeList of NP-complete problems). These algorithms were sought long before the concept of NP-completeness was even defined (Karp's 21 NP-complete problems, among the first found, were all well-known existing problems at the time they were shown to be NP-complete). Furthermore, the result P = NP would imply many other startling results that are currently believed to be false, such as NP = co-NP and P = PH.

It is also intuitively argued that the existence of problems that are hard to solve but whose solutions are easy to verify matches real-world experience.^[30]

If P = NP, then the world would be a profoundly different place than we usually assume it to be. There would be no special value in "creative leaps", no fundamental gap between solving a problem and recognizing the solution once it's found.

— Scott Aaronson,UT Austin

On the other hand, some researchers believe that it is overconfident to believe P ≠ NP and that researchers should also explore proofs of P = NP. For example, in 2002 these statements were made:^[8]

The main argument in favor of P ≠ NP is the total lack of fundamental progress in the area of exhaustive search. This is, in my opinion, a very weak argument. The space of algorithms is very large and we are only at the beginning of its exploration. [...] The resolution ofFermat's Last Theorem also shows that very simple questions may be settled only by very deep theories.

— Moshe Y. Vardi,Rice University

Being attached to a speculation is not a good guide to research planning. One should always try both directions of every problem. Prejudice has caused famous mathematicians to fail to solve famous problems whose solution was opposite to their expectations, even though they had developed all the methods required.

— Anil Nerode,Cornell University

DLIN vs NLIN

When one substitutes "linear time on a multitape Turing machine" for "polynomial time" in the definitions of P and NP, one obtains the classesDLIN andNLIN.It is known^[31] that DLIN ≠ NLIN.

Consequences of solution

One of the reasons the problem attracts so much attention is the consequences of the possible answers. Either direction of resolution would advance theory enormously, and perhaps have huge practical consequences as well.

P = NP

A proof that P = NP could have stunning practical consequences if the proof leads to efficient methods for solving some of the important problems in NP. The potential consequences, both positive and negative, arise since various NP-complete problems are fundamental in many fields.

It is also very possible that a proof wouldnot lead to practical algorithms for NP-complete problems. The formulation of the problem does not require that the bounding polynomial be small or even specifically known. Anon-constructive proof might show a solution exists without specifying either an algorithm to obtain it or a specific bound. Even if the proof is constructive, showing an explicit bounding polynomial and algorithmic details, if the polynomial is not very low-order the algorithm might not be sufficiently efficient in practice. In this case the initial proof would be mainly of interest to theoreticians, but the knowledge that polynomial time solutions are possible would surely spur research into better (and possibly practical) methods to achieve them.

A solution showing P = NP could upend the field ofcryptography, which relies on certain problems being difficult. A constructive and efficient solution^{[Note 2]} to an NP-complete problem such as3-SAT would break most existing cryptosystems including:

Existing implementations ofpublic-key cryptography,^[32] a foundation for many modern security applications such as secure financial transactions over the Internet.
Symmetric ciphers such asAES or3DES,^[33] used for the encryption of communications data.
Cryptographic hashing, which underliesblockchain cryptocurrencies such asBitcoin, and is used to authenticate software updates. For these applications, finding a pre-image that hashes to a given value must be difficult, ideally taking exponential time. If P = NP, then this can take polynomial time, through reduction to SAT.^[34]

These would need modification or replacement withinformation-theoretically secure solutions that do not assume P ≠ NP.

There are also enormous benefits that would follow from rendering tractable many currently mathematically intractable problems. For instance, many problems inoperations research are NP-complete, such as types ofinteger programming and thetravelling salesman problem. Efficient solutions to these problems would have enormous implications for logistics. Many other important problems, such as some problems inprotein structure prediction, are also NP-complete;^[35] making these problems efficiently solvable could considerably advance life sciences and biotechnology.

These changes could be insignificant compared to the revolution that efficiently solving NP-complete problems would cause in mathematics itself. Gödel, in his early thoughts on computational complexity, noted that a mechanical method that could solve any problem would revolutionize mathematics:^[36]^[37]

If there really were a machine with φ(n) ∼k⋅n (or even ∼k⋅n²), this would have consequences of the greatest importance. Namely, it would obviously mean that in spite of the undecidability of theEntscheidungsproblem, the mental work of a mathematician concerning Yes-or-No questions could be completely replaced by a machine. After all, one would simply have to choose the natural numbern so large that when the machine does not deliver a result, it makes no sense to think more about the problem.

Similarly,Stephen Cook (assuming not only a proof, but a practically efficient algorithm) says:^[28]

... it would transform mathematics by allowing a computer to find a formal proof of any theorem which has a proof of a reasonable length, since formal proofs can easily be recognized in polynomial time. Example problems may well include all of theCMI prize problems.

Research mathematicians spend their careers trying to prove theorems, and some proofs have taken decades or even centuries to find after problems have been stated—for instance,Fermat's Last Theorem took over three centuries to prove. A method guaranteed to find a proof if a "reasonable" size proof exists, would essentially end this struggle.

Donald Knuth has stated that he has come to believe that P = NP, but is reserved about the impact of a possible proof:^[38]

[...] if you imagine a numberM that's finite but incredibly large—like say the number 10↑↑↑↑3 discussed in my paper on "coping with finiteness"—then there's a humongous number of possible algorithms that don^M bitwise or addition or shift operations onn given bits, and it's really hard to believe that all of those algorithms fail.My main point, however, is that I don't believe that the equality P = NP will turn out to be helpful even if it is proved, because such a proof will almost surely be nonconstructive.

Diagram of complexity classes provided that P ≠ NP. The existence of problems within NP but outside both P and NP-complete, under that assumption, was established byLadner's theorem.^[19]

P ≠ NP

A proof of P ≠ NP would lack the practical computational benefits of a proof that P = NP, but would represent a great advance in computational complexity theory and guide future research. It would demonstrate that many common problems cannot be solved efficiently, so that the attention of researchers can be focused on partial solutions or solutions to other problems. Due to widespread belief in P ≠ NP, much of this focusing of research has already taken place.^[39]

P ≠ NP still leaves open theaverage-case complexity of hard problems in NP. For example, it is possible that SAT requires exponential time in the worst case, but that almost all randomly selected instances of it are efficiently solvable.Russell Impagliazzo has described five hypothetical "worlds" that could result from different possible resolutions to the average-case complexity question.^[40] These range from "Algorithmica", where P = NP and problems like SAT can be solved efficiently in all instances, to "Cryptomania", where P ≠ NP and generating hard instances of problems outside P is easy, with three intermediate possibilities reflecting different possible distributions of difficulty over instances of NP-hard problems. The "world" where P ≠ NP but all problems in NP are tractable in the average case is called "Heuristica" in the paper. APrinceton University workshop in 2009 studied the status of the five worlds.^[41]

Results about difficulty of proof

Although the P = NP problem itself remains open despite a million-dollar prize and a huge amount of dedicated research, efforts to solve the problem have led to several new techniques. In particular, some of the most fruitful research related to the P = NP problem has been in showing that existing proof techniques are insufficient for answering the question, suggesting novel technical approaches are required.

As additional evidence for the difficulty of the problem, essentially all known proof techniques incomputational complexity theory fall into one of the following classifications, all insufficient to prove P ≠ NP:

Classification	Definition
Relativizing proofs	Imagine a world where every algorithm is allowed to make queries to some fixed subroutine called anoracle (which can answer a fixed set of questions in constant time, such as an oracle that solves any traveling salesman problem in 1 step), and the running time of the oracle is not counted against the running time of the algorithm. Most proofs (especially classical ones) apply uniformly in a world with oracles regardless of what the oracle does. These proofs are calledrelativizing. In 1975, Baker, Gill, andSolovay showed that P = NP with respect to some oracles, while P ≠ NP for other oracles.^[42] As relativizing proofs can only prove statements that are true for all possible oracles, these techniques cannot resolve P = NP.
Natural proofs	In 1993,Alexander Razborov andSteven Rudich defined a general class of proof techniques for circuit complexity lower bounds, callednatural proofs.^[43] At the time, all previously known circuit lower bounds were natural, and circuit complexity was considered a very promising approach for resolving P = NP. However, Razborov and Rudich showed that ifone-way functions exist, P and NP are indistinguishable to natural proof methods. Although the existence of one-way functions is unproven, most mathematicians believe that they exist, and a proof of their existence would be a much stronger statement than P ≠ NP. Thus, it is unlikely that natural proofs alone can resolve P = NP.
Algebrizing proofs	After the Baker–Gill–Solovay result, new non-relativizing proof techniques were successfully used to prove thatIP = PSPACE. However, in 2008,Scott Aaronson andAvi Wigderson showed that the main technical tool used in the IP = PSPACE proof, known asarithmetization, was also insufficient to resolve P = NP.^[44] Arithmetization converts the operations of an algorithm to algebraic and basicarithmetic symbols and then uses those to analyze the workings. In theIP = PSPACE proof, they convert theblack box and the Boolean circuits to an algebraic problem.^[44] As mentioned previously, it has been proven that this method is not viable to solve P = NP and othertime complexity problems.

These barriers are another reason why NP-complete problems are useful: if a polynomial-time algorithm can be demonstrated for an NP-complete problem, this would solve the P = NP problem in a way not excluded by the above results.

These barriers lead some computer scientists to suggest the P versus NP problem may beindependent of standard axiom systems likeZFC (cannot be proved or disproved within them). An independence result could imply that either P ≠ NP and this is unprovable in (e.g.) ZFC, or that P = NP but it is unprovable in ZFC that any polynomial-time algorithms are correct.^[45] However, if the problem is undecidable even with much weaker assumptions extending thePeano axioms for integer arithmetic, then nearly polynomial-time algorithms exist for all NP problems.^[46] Therefore, assuming (as most complexity theorists do) some NP problems don't have efficient algorithms, proofs of independence with those techniques are impossible. This also implies proving independence from PA or ZFC with current techniques is no easier than proving all NP problems have efficient algorithms.

Logical characterizations

The P = NP problem can be restated as certain classes of logical statements, as a result of work indescriptive complexity.

Consider all languages of finite structures with a fixedsignature including alinear order relation. Then, all such languages in P are expressible infirst-order logic with the addition of a suitable leastfixed-point combinator. Recursive functions can be defined with this and the order relation. As long as the signature contains at least one predicate or function in addition to the distinguished order relation, so that the amount of space taken to store such finite structures is actually polynomial in the number of elements in the structure, this precisely characterizes P.

Similarly, NP is the set of languages expressible in existentialsecond-order logic—that is, second-order logic restricted to excludeuniversal quantification over relations, functions, and subsets. The languages in thepolynomial hierarchy,PH, correspond to all of second-order logic. Thus, the question "is P a proper subset of NP" can be reformulated as "is existential second-order logic able to describe languages (of finite linearly ordered structures with nontrivial signature) that first-order logic with least fixed point cannot?".^[47] The word "existential" can even be dropped from the previous characterization, since P = NP if and only if P = PH (as the former would establish that NP = co-NP, which in turn implies that NP = PH).

Polynomial-time algorithms

No known algorithm for a NP-complete problem runs in polynomial time. However, there are algorithms known for NP-complete problems that if P = NP, the algorithm runs in polynomial time on accepting instances (although with enormous constants, making the algorithm impractical). However, these algorithms do not qualify as polynomial time because their running time on rejecting instances are not polynomial. The following algorithm, due toLevin (without any citation), is such an example below. It correctly accepts the NP-complete languageSUBSET-SUM. It runs in polynomial time on inputs that are in SUBSET-SUM if and only if P = NP:

// Algorithm that accepts the NP-complete language SUBSET-SUM.//// this is a polynomial-time algorithm if and only if P = NP.//// "Polynomial-time" means it returns "yes" in polynomial time when// the answer should be "yes", and runs forever when it is "no".//// Input: S = a finite set of integers// Output: "yes" if any subset of S adds up to 0.// Runs forever with no output otherwise.// Note: "Program number M" is the program obtained by// writing the integer M in binary, then// considering that string of bits to be a// program. Every possible program can be// generated this way, though most do nothing// because of syntax errors.FOR K = 1...∞  FOR M = 1...K    Run program number M for K steps with input S    IF the program outputs a list of distinct integers      AND the integers are all in S      AND the integers sum to 0    THEN      OUTPUT "yes" and HALT

This is a polynomial-time algorithm accepting an NP-complete language only if P = NP. "Accepting" means it gives "yes" answers in polynomial time, but is allowed to run forever when the answer is "no" (also known as asemi-algorithm).

This algorithm is enormously impractical, even if P = NP. If the shortest program that can solve SUBSET-SUM in polynomial time isb bits long, the above algorithm will try at least2^b − 1 other programs first.

Formal definitions

P and NP

Adecision problem is a problem that takes as input somestringw over an alphabet Σ, and outputs "yes" or "no". If there is analgorithm (say aTuring machine, or acomputer program with unbounded memory) that produces the correct answer for any input string of lengthn in at mostcn^k steps, wherek andc are constants independent of the input string, then we say that the problem can be solved inpolynomial time and we place it in the class P. Formally, P is the set of languages that can be decided by a deterministic polynomial-time Turing machine. Meaning,

{\mathsf {P}}=\{L:L=L(M){\text{ for some deterministic polynomial-time Turing machine }}M\}

where

L(M)=\{w\in \Sigma ^{*}:M{\text{ accepts }}w\}

and a deterministic polynomial-time Turing machine is a deterministic Turing machineM that satisfies two conditions:

M halts on all inputsw and
there exists $k\in N$ such that $T_{M}(n)\in O(n^{k})$ , whereO refers to thebig O notation and

T_{M}(n)=\max\{t_{M}(w):w\in \Sigma ^{*},|w|=n\}

t_{M}(w)={\text{ number of steps }}M{\text{ takes to halt on input }}w.

NP can be defined similarly using nondeterministic Turing machines (the traditional way). However, a modern approach uses the concept ofcertificate andverifier. Formally, NP is the set of languages with a finite alphabet and verifier that runs in polynomial time. The following defines a "verifier":

LetL be a language over a finite alphabet, Σ.

L ∈ NP if, and only if, there exists a binary relation $R\subset \Sigma ^{*}\times \Sigma ^{*}$ and a positive integerk such that the following two conditions are satisfied:

For all $x\in \Sigma ^{*}$ , $x\in L\Leftrightarrow \exists y\in \Sigma ^{*}$ such that (x,y) ∈R and $|y|\in O(|x|^{k})$ ; and
the language $L_{R}=\{x\#y:(x,y)\in R\}$ over $\Sigma \cup \{\#\}$ is decidable by a deterministic Turing machine in polynomial time.

A Turing machine that decidesL_R is called averifier forL and ay such that (x,y) ∈R is called acertificate of membership ofx inL.

Not all verifiers must be polynomial-time. However, forL to be in NP, there must be a verifier that runs in polynomial time.

Example

Let

\mathrm {COMPOSITE} =\left\{x\in \mathbb {N} \mid x=pq{\text{ for integers }}p,q>1\right\}

R=\left\{(x,y)\in \mathbb {N} \times \mathbb {N} \mid 1<y\leq {\sqrt {x}}{\text{ and }}y{\text{ divides }}x\right\}.

Whether a value ofx iscomposite is equivalent to of whetherx is a member of COMPOSITE. It can be shown that COMPOSITE ∈ NP by verifying that it satisfies the above definition (if we identify natural numbers with their binary representations).

COMPOSITE also happens to be in P, a fact demonstrated by the invention of theAKS primality test.^[48]

NP-completeness

Main article:NP-completeness

There are many equivalent ways of describing NP-completeness.

LetL be a language over a finite alphabet Σ.

L is NP-complete if, and only if, the following two conditions are satisfied:

L ∈ NP; and
anyL′ in NP is polynomial-time-reducible toL (written as $L'\leq _{p}L$ ), where $L'\leq _{p}L$ if, and only if, the following two conditions are satisfied:
1. There existsf : Σ* → Σ* such that for allw in Σ* we have: $(w\in L'\Leftrightarrow f(w)\in L)$ ; and
2. there exists a polynomial-time Turing machine that halts withf(w) on its tape on any inputw.

Alternatively, ifL ∈ NP, and there is another NP-complete problem that can be polynomial-time reduced toL, thenL is NP-complete. This is a common way of proving some new problem is NP-complete.

Claimed solutions

While the P versus NP problem is generally considered unsolved,^[49] many amateur and some professional researchers have claimed solutions.Gerhard J. Woeginger compiled a list of 116 purported proofs from 1986 to 2016, of which 61 were proofs of P = NP, 49 were proofs of P ≠ NP, and 6 proved other results, e.g. that the problem is undecidable.^[50] Some attempts at resolving P versus NP have received brief media attention,^[51] though these attempts have been refuted.

Popular culture

The filmTravelling Salesman, by director Timothy Lanzone, is the story of four mathematicians hired by the US government to solve the P versus NP problem.^[52]

In the sixth episode ofThe Simpsons' seventh season "Treehouse of Horror VI", the equation P = NP is seen shortly after Homer accidentally stumbles into the "third dimension".^[53]^[54]

In the second episode of season 2 ofElementary,"Solve for X" Holmes and Watson investigate the murders of mathematicians who were attempting to solve P versus NP.^[55]^[56]

Notes

^Anondeterministic Turing machine can move to a state that is not determined by the previous state. Such a machine could solve an NP problem in polynomial time by falling into the correct answer state (by luck), then conventionally verifying it. Such machines are not practical for solving realistic problems but can be used as theoretical models.
^Exactly how efficient a solution must be to pose a threat to cryptography depends on the details. A solution of $O(N^{2})$ with a reasonable constant term would be disastrous. On the other hand, a solution that is $\Omega (N^{4})$ in almost all cases would not pose an immediate practical danger.

References

^Fortnow, Lance (2009)."The status of the P versus NP problem"(PDF).Communications of the ACM.52 (9):78–86.CiteSeerX 10.1.1.156.767.doi:10.1145/1562164.1562186.S2CID 5969255. Archived fromthe original(PDF) on 24 February 2011. Retrieved26 January 2010.
^Fortnow, Lance (2013).The Golden Ticket: P, NP, and the Search for the Impossible. Princeton, NJ: Princeton University Press.ISBN 9780691156491.
^Cook, Stephen (1971)."The complexity of theorem proving procedures".Proceedings of the Third Annual ACM Symposium on Theory of Computing. pp. 151–158.doi:10.1145/800157.805047.ISBN 9781450374644.S2CID 7573663.
^Levin, L. A. (1973).Универсальные задачи перебора [Problems of Information Transmission].Пробл. Передачи Информ (in Russian).9 (3):115–116.
^NSA (2012)."Letters from John Nash"(PDF).Archived(PDF) from the original on 9 November 2018.
^Hartmanis, Juris."Gödel, von Neumann, and the P = NP problem"(PDF).Bulletin of the European Association for Theoretical Computer Science.38:101–107.
^Sipser, Michael:Introduction to the Theory of Computation, Second Edition, International Edition, page 270. Thomson Course Technology, 2006. Definition 7.19 and Theorem 7.20.
^^a ^b ^cGasarch, William I. (June 2002)."The P=?NP poll"(PDF).SIGACT News.33 (2):34–47.CiteSeerX 10.1.1.172.1005.doi:10.1145/564585.564599.S2CID 36828694.Archived(PDF) from the original on 15 June 2007.
^Gasarch, William I."The Second P=?NP poll"(PDF).SIGACT News.74.Archived(PDF) from the original on 24 January 2014.
^^a ^b"Guest Column: The Third P =? NP Poll1"(PDF).Archived(PDF) from the original on 31 March 2019. Retrieved25 May 2020.
^Aaronson, Scott."PHYS771 Lecture 6: P, NP, and Friends". Retrieved27 August 2007.
^"MSc course: Foundations of Computer Science".www.cs.ox.ac.uk. Retrieved25 May 2020.
^Colbourn, Charles J. (1984)."The complexity of completing partial Latin squares".Discrete Applied Mathematics.8 (1):25–30.doi:10.1016/0166-218X(84)90075-1.
^Holyer, I. (1981). "The NP-completeness of some edge-partition problems".SIAM J. Comput.10 (4):713–717.doi:10.1137/0210054.
^Fraenkel, Aviezri; Lichtenstein, D. (1981). "Computing a perfect strategy forn ×n chess requires time exponential inn".Journal of Combinatorial Theory. Series A.31 (2):199–214.doi:10.1016/0097-3165(81)90016-9.
^Eppstein, David."Computational Complexity of Games and Puzzles".
^Fischer, Michael J.;Rabin, Michael O. (1974)."Super-Exponential Complexity of Presburger Arithmetic".Proceedings of the SIAM-AMS Symposium in Applied Mathematics.7:27–41. Archived fromthe original on 15 September 2006. Retrieved15 October 2017.
^Valiant, Leslie G. (1979). "The complexity of enumeration and reliability problems".SIAM Journal on Computing.8 (3):410–421.doi:10.1137/0208032.
^^a ^bLadner, R.E. (1975)."On the structure of polynomial time reducibility".Journal of the ACM.22: 151–171 See Corollary 1.1.doi:10.1145/321864.321877.S2CID 14352974.
^Arvind, Vikraman; Kurur, Piyush P. (2006). "Graph isomorphism is in SPP".Information and Computation.204 (5):835–852.doi:10.1016/j.ic.2006.02.002.
^Schöning, Uwe (1988). "Graph isomorphism is in the low hierarchy".Journal of Computer and System Sciences.37 (3):312–323.doi:10.1016/0022-0000(88)90010-4.
^Babai, László (2018). "Group, graphs, algorithms: the graph isomorphism problem".Proceedings of the International Congress of Mathematicians—Rio de Janeiro 2018. Vol. IV. Invited lectures. World Sci. Publ., Hackensack, NJ. pp. 3319–3336.MR 3966534.
^Lance Fortnow. Computational Complexity Blog:Complexity Class of the Week: Factoring. 13 September 2002.
^Pisinger, D. 2003. "Where are the hard knapsack problems?" Technical Report 2003/08, Department of Computer Science, University of Copenhagen, Copenhagen, Denmark
^Kawarabayashi, Ken-ichi; Kobayashi, Yusuke;Reed, Bruce (2012)."The disjoint paths problem in quadratic time".Journal of Combinatorial Theory. Series B.102 (2):424–435.doi:10.1016/j.jctb.2011.07.004.
^Johnson, David S. (1987). "The NP-completeness column: An ongoing guide (edition 19)".Journal of Algorithms.8 (2):285–303.CiteSeerX 10.1.1.114.3864.doi:10.1016/0196-6774(87)90043-5.
^Gondzio, Jacek; Terlaky, Tamás (1996)."3 A computational view of interior point methods". In J. E. Beasley (ed.).Advances in linear and integer programming. Oxford Lecture Series in Mathematics and its Applications. Vol. 4. New York: Oxford University Press. pp. 103–144.MR 1438311.Postscript file at website of Gondzio andat McMaster University website of Terlaky.
^^a ^bCook, Stephen (April 2000)."The P versus NP Problem"(PDF).Clay Mathematics Institute.Archived(PDF) from the original on 16 December 2013. Retrieved18 October 2006.
^Rosenberger, Jack (May 2012)."P vs. NP poll results".Communications of the ACM.55 (5): 10.
^Aaronson, Scott (4 September 2006)."Reasons to believe"., point 9.
^Balcazar, Jose Luis; Diaz, Josep; Gabarro, Joaquim (1990).Structural Complexity II. Springer Verlag.ISBN 3-540-52079-1., Theorem 3.9
^SeeHorie, S.; Watanabe, O. (1997). "Hard instance generation for SAT".Algorithms and Computation. Lecture Notes in Computer Science. Vol. 1350. Springer. pp. 22–31.arXiv:cs/9809117.Bibcode:1998cs........9117H.doi:10.1007/3-540-63890-3_4.ISBN 978-3-540-63890-2. for a reduction of factoring to SAT. A 512-bit factoring problem (8400 MIPS-years when factored) translates to a SAT problem of 63,652 variables and 406,860 clauses.
^See, for example,Massacci, F.; Marraro, L. (2000). "Logical cryptanalysis as a SAT problem".Journal of Automated Reasoning.24 (1):165–203.CiteSeerX 10.1.1.104.962.doi:10.1023/A:1006326723002.S2CID 3114247. in which an instance of DES is encoded as a SAT problem with 10336 variables and 61935 clauses. A 3DES problem instance would be about 3 times this size.
^De, Debapratim; Kumarasubramanian, Abishek; Venkatesan, Ramarathnam (2007). "Inversion attacks on secure hash functions using SAT solvers".Theory and Applications of Satisfiability Testing – SAT 2007. International Conference on Theory and Applications of Satisfiability Testing. Springer. pp. 377–382.doi:10.1007/978-3-540-72788-0_36.
^Berger, B.;Leighton, T. (1998). "Protein folding in the hydrophobic-hydrophilic (HP) model is NP-complete".Journal of Computational Biology.5 (1):27–40.CiteSeerX 10.1.1.139.5547.doi:10.1089/cmb.1998.5.27.PMID 9541869.
^History of this letter and its translation fromSipser, Michael."The History and Status of the P versus NP question"(PDF).Archived(PDF) from the original on 2 February 2014.
^Johnson, David S. (August 2012). "A Brief History of NP-Completeness, 1954–2012". InGrötschel, M. (ed.).Optimization Stories(PDF). Documenta Mathematica. pp. 359–376.ISBN 978-3-936609-58-5.ISSN 1431-0643.
^Knuth, Donald E. (20 May 2014).Twenty Questions for Donald Knuth.InformIT. Retrieved20 July 2014.
^Foulds, L. R. (October 1983). "The Heuristic Problem-Solving Approach".Journal of the Operational Research Society.34 (10):927–934.doi:10.2307/2580891.JSTOR 2580891.
^R. Impagliazzo,"A personal view of average-case complexity", p. 134, 10th Annual Structure in Complexity Theory Conference (SCT'95), 1995.
^"Tentative program for the workshop on "Complexity and Cryptography: Status of Impagliazzo's Worlds"". Archived fromthe original on 15 November 2013.
^Baker, T. P.; Gill, J.; Solovay, R. (1975). "Relativizations of the P =? NP Question".SIAM Journal on Computing.4 (4):431–442.doi:10.1137/0204037.
^Razborov, Alexander A.; Steven Rudich (1997)."Natural proofs".Journal of Computer and System Sciences.55 (1):24–35.doi:10.1006/jcss.1997.1494.
^^a ^bAaronson, S.; Wigderson, A. (2008).Algebrization: A New Barrier in Complexity Theory(PDF). Proceedings of ACM STOC'2008. pp. 731–740.doi:10.1145/1374376.1374481.Archived(PDF) from the original on 21 February 2008.
^Aaronson, Scott."Is P Versus NP Formally Independent?"(PDF).Archived(PDF) from the original on 16 January 2017..
^Ben-David, Shai; Halevi, Shai (1992).On the independence of P versus NP.Technion (Technical report). Vol. 714. Archived fromthe original(GZIP) on 2 March 2012..
^Elvira Mayordomo."P versus NP"Archived 16 February 2012 at theWayback MachineMonografías de la Real Academia de Ciencias de Zaragoza 26: 57–68 (2004).
^Agrawal, Manindra; Kayal, Neeraj; Saxena, Nitin (2004)."PRIMES is in P"(PDF).Annals of Mathematics.160 (2):781–793.doi:10.4007/annals.2004.160.781.JSTOR 3597229.Archived(PDF) from the original on 26 September 2006.
^Markoff, John (8 October 2009)."Prizes Aside, the P-NP Puzzler Has Consequences".The New York Times.
^Gerhard J. Woeginger."The P-versus-NP page". Retrieved24 June 2018.
^Markoff, John (16 August 2010)."Step 1: Post Elusive Proof. Step 2: Watch Fireworks".The New York Times. Retrieved20 September 2010.
^Geere, Duncan (26 April 2012)."'Travelling Salesman' movie considers the repercussions if P equals NP".Wired UK. Retrieved26 April 2012.
^Hardesty, Larry (29 October 2009)."Explained: P vs. NP".
^Shadia, Ajam (13 September 2013)."What is the P vs. NP problem? Why is it important?".
^Gasarch, William (7 October 2013)."P vs NP is Elementary? No— P vs NP is ON Elementary".blog.computationalcomplexity.org. Retrieved6 July 2018.
^Kirkpatrick, Noel (4 October 2013)."Elementary Solve for X Review: Sines of Murder".TV.com. Retrieved6 July 2018.
^^a ^bWigderson, Avi (2019).Mathematics and Computation: A Theory Revolutionizing Technology and Science. Princeton University Press.ISBN 978-0-691-18913-0.
^L. G. Valiant.Completeness classes in algebra. In Proceedings of 11th ACM STOC, pp. 249–261, 1979.

Sources

Rachel Crowell (28 May 2021)."The Top Unsolved Questions in Mathematics Remain Mostly Mysterious Just one of the seven Millennium Prize Problems named 21 years ago has been solved".www.scientificamerican.com. Retrieved21 June 2021.This problem concerns the issue of whether questions that are easy to verify (a class of queries called NP) also have solutions that are easy to find (a class called P).
Hosch, William L (11 August 2009)."P versus NP problem mathematics".Encyclopædia Britannica. Retrieved20 June 2021.
"P vs NP Problem".www.claymath.org (Cook, Levin). Archived fromthe original on 18 June 2021. Retrieved20 June 2021.Suppose that you are organizing housing accommodations for a group of four hundred university students. Space is limited and only one hundred of the students will receive places in the dormitory. To complicate matters, the Dean has provided you with a list of pairs of incompatible students, and requested that no pair from this list appear in your final choice. This is an example of what computer scientists call an NP-problem...

External links

Fortnow, L.; Gasarch, W."Computational complexity".
Aviad Rubinstein'sHardness of Approximation Between P and NP, winner of theACM's2017 Doctoral Dissertation Award.
"P vs. NP and the Computational Complexity Zoo". 26 August 2014.Archived from the original on 24 November 2021 – viaYouTube.

v t e Complexity classes
Considered feasible	DLOGTIME AC⁰ ACC⁰ TC TC⁰ L SL RL FL NL NL-complete NC SC CC P P-complete ZPP RP BPP BQP APX FP
Suspected infeasible	UP NP NP-complete NP-hard co-NP co-NP-complete TFNP FNP AM QMA PH ⊕P PP #P #P-complete IP PSPACE PSPACE-complete
Considered infeasible	EXPTIME NEXPTIME EXPSPACE 2-EXPTIME ELEMENTARY NONELEMENTARY PR R RE ALL
Other complexity classes	polyL QP
Class hierarchies	Polynomial hierarchy Exponential hierarchy Grzegorczyk hierarchy Arithmetical hierarchy Boolean hierarchy
Families of classes	DTIME NTIME DSPACE NSPACE Probabilistically checkable proof Interactive proof system
List of complexity classes