While logical inference and mathematical proof had existed previously, in 1931 Kurt Gödel proved with his incompleteness theorem that there are fundamental limitations on what statements can be proved or disproved.
Modern theoretical computer science research builds on these foundational developments, but encompasses many other mathematical and interdisciplinary problems, as outlined below:
An algorithm is an effective method expressed as a finite list[3] of well-defined instructions[4] for calculating a function.[5] Starting from an initial state and initial input (perhaps empty),[6] the instructions describe a computation that, when executed, proceeds through a finite[7] number of well-defined successive states, eventually producing "output"[8] and terminating at a final ending state. The transition from one state to the next is not necessarily deterministic; some algorithms, known as randomized algorithms, incorporate random input.[9]
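For instance, Euclid's algorithm for the greatest common divisor is a finite list of well-defined instructions that carries an initial state through successive states to a terminating output. A minimal sketch in Python (the function name and test values are illustrative):

```python
def gcd(a: int, b: int) -> int:
    """Euclid's algorithm: a finite sequence of well-defined steps.

    Each loop iteration is a transition to a new, well-defined state
    (the pair (a, b)); the computation terminates because b strictly
    decreases toward zero.
    """
    while b != 0:
        a, b = b, a % b  # successive state transition
    return a             # final state: the output

print(gcd(48, 18))  # -> 6
```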
Automata theory is the study of abstract, self-operating machines, used to reason about the process of transforming input into output, with or without intermediate stages of computation.
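As a concrete illustration, one of the simplest machines studied in automata theory, a deterministic finite automaton, can be simulated in a few lines. The sketch below uses the standard textbook automaton that accepts binary strings containing an even number of 1s:

```python
# A deterministic finite automaton (DFA) accepting binary strings
# with an even number of 1s. States: 'even' (accepting) and 'odd'.
TRANSITIONS = {
    ('even', '0'): 'even', ('even', '1'): 'odd',
    ('odd', '0'): 'odd',   ('odd', '1'): 'even',
}

def accepts(word: str) -> bool:
    state = 'even'                            # initial state
    for symbol in word:
        state = TRANSITIONS[(state, symbol)]  # one transition per input symbol
    return state == 'even'                    # accept iff the final state is accepting

print(accepts('1011'))  # False (three 1s)
print(accepts('1001'))  # True  (two 1s)
```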
Computational complexity theory is a branch of the theory of computation that focuses on classifying computational problems according to their inherent difficulty, and relating those classes to each other. A computational problem is understood to be a task that is in principle amenable to being solved by a computer, which is equivalent to stating that the problem may be solved by mechanical application of mathematical steps, such as an algorithm.
A problem is regarded as inherently difficult if its solution requires significant resources, whatever the algorithm used. The theory formalizes this intuition by introducing mathematical models of computation to study these problems and quantifying the amount of resources needed to solve them, such as time and storage. Other complexity measures are also used, such as the amount of communication (used in communication complexity), the number of gates in a circuit (used in circuit complexity) and the number of processors (used in parallel computing). One of the roles of computational complexity theory is to determine the practical limits on what computers can and cannot do.
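To make the idea of resource usage concrete, here is a sketch contrasting two algorithms for the same search problem: linear search inspects up to n elements, while binary search on sorted input needs only about log2(n) comparisons (the data set is illustrative):

```python
from bisect import bisect_left

def linear_search(xs, target):
    """O(n) time: may inspect every element."""
    for i, x in enumerate(xs):
        if x == target:
            return i
    return -1

def binary_search(sorted_xs, target):
    """O(log n) time: halves the search space at each step (input must be sorted)."""
    i = bisect_left(sorted_xs, target)
    return i if i < len(sorted_xs) and sorted_xs[i] == target else -1

data = list(range(1_000_000))
print(linear_search(data, 999_999))  # up to 10^6 comparisons
print(binary_search(data, 999_999))  # about 20 comparisons
```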
Computational geometry is a branch of computer science devoted to the study of algorithms that can be stated in terms of geometry. Some purely geometrical problems arise out of the study of computational geometric algorithms, and such problems are also considered to be part of computational geometry.
The main impetus for the development of computational geometry as a discipline was progress in computer graphics and computer-aided design and manufacturing (CAD/CAM), but many problems in computational geometry are classical in nature, and may come from mathematical visualization.
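A classic primitive in this field is the orientation test, which decides whether three points in the plane make a left turn, a right turn, or are collinear; it underlies many geometric algorithms, such as convex-hull construction. A minimal sketch:

```python
def orientation(p, q, r):
    """Sign of the cross product (q - p) x (r - p):
    > 0: counter-clockwise (left turn)
    < 0: clockwise (right turn)
    = 0: the three points are collinear
    """
    return (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0])

print(orientation((0, 0), (1, 0), (1, 1)))  # 1 -> left turn
print(orientation((0, 0), (1, 0), (2, 0)))  # 0 -> collinear
```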
Theoretical results in machine learning mainly deal with a type of inductive learning called supervised learning. In supervised learning, an algorithm is given samples that are labeled in some useful way. For example, the samples might be descriptions of mushrooms, and the labels could be whether or not the mushrooms are edible. The algorithm takes these previously labeled samples and uses them to induce a classifier. This classifier is a function that assigns labels to samples, including samples that the algorithm has never previously seen. The goal of the supervised learning algorithm is to optimize some measure of performance, such as minimizing the number of mistakes made on new samples.
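A minimal sketch of this setting, using a deliberately simple 1-nearest-neighbour classifier (the feature vectors and labels are invented for illustration, echoing the mushroom example):

```python
# 1-nearest-neighbour: label a new sample with the label of its
# closest previously labelled sample.
samples = [((1.0, 0.2), 'edible'), ((0.9, 0.3), 'edible'),
           ((0.1, 0.9), 'poisonous'), ((0.2, 0.8), 'poisonous')]

def classify(x):
    def dist2(sample):
        features, _ = sample
        return (features[0] - x[0]) ** 2 + (features[1] - x[1]) ** 2
    _, label = min(samples, key=dist2)  # nearest labelled sample wins
    return label

print(classify((0.95, 0.25)))  # 'edible': closest to the edible cluster
print(classify((0.15, 0.85)))  # 'poisonous'
```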
Modern cryptography is heavily based on mathematical theory and computer science practice; cryptographic algorithms are designed around computational hardness assumptions, making such algorithms hard to break in practice by any adversary. It is theoretically possible to break such a system, but it is infeasible to do so by any known practical means. These schemes are therefore termed computationally secure; theoretical advances, e.g., improvements in integer factorization algorithms, and faster computing technology require these solutions to be continually adapted. There exist information-theoretically secure schemes that provably cannot be broken even with unlimited computing power (an example is the one-time pad), but these schemes are more difficult to implement than the best theoretically breakable but computationally secure mechanisms.
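The one-time pad mentioned above is simple enough to sketch directly: encryption XORs the message with a truly random, single-use key of the same length, and decryption applies the same XOR again (the message below is illustrative):

```python
import os

def xor_bytes(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

message = b"ATTACK AT DAWN"
key = os.urandom(len(message))  # truly random, as long as the message,
                                # used once, and kept secret

ciphertext = xor_bytes(message, key)    # encrypt
recovered = xor_bytes(ciphertext, key)  # decrypt: XOR with the same key
assert recovered == message
```

Information-theoretic security holds only under the conditions in the comment: reusing or shortening the key breaks the scheme.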
Different kinds of data structures are suited to different kinds of applications, and some are highly specialized to specific tasks. For example, databases use B-tree indexes to retrieve small percentages of their data, while compilers and databases use dynamic hash tables as lookup tables.
Data structures provide a means to manage large amounts of data efficiently for uses such as large databases and internet indexing services. Usually, efficient data structures are key to designing efficient algorithms. Some formal design methods and programming languages emphasize data structures, rather than algorithms, as the key organizing factor in software design. Storing and retrieving can be carried out on data stored in both main memory and secondary memory.
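A minimal sketch of the lookup-table pattern described above, using Python's built-in dict, which is implemented as a dynamic hash table (the symbol-table entries are invented for illustration):

```python
# A dynamic hash table as a lookup table: expected O(1) insertion
# and retrieval, regardless of how many entries are stored.
symbol_table = {}                # e.g. a compiler's symbol table
symbol_table['x'] = ('int', 0)   # insert: name -> (type, address)
symbol_table['y'] = ('float', 4)

print(symbol_table['x'])         # retrieve in expected constant time
print('z' in symbol_table)       # membership test: False
```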
Distributed computing studies distributed systems. A distributed system is a software system in which components located on networked computers communicate and coordinate their actions by passing messages.[15] The components interact with each other in order to achieve a common goal. Three significant characteristics of distributed systems are: concurrency of components, lack of a global clock, and independent failure of components.[15] Examples of distributed systems vary from SOA-based systems to massively multiplayer online games to peer-to-peer applications, and blockchain networks like Bitcoin.
A computer program that runs in a distributed system is called a distributed program, and distributed programming is the process of writing such programs.[16] There are many alternatives for the message passing mechanism, including RPC-like connectors and message queues. An important goal and challenge of distributed systems is location transparency.
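A minimal sketch of components coordinating purely by passing messages, using a thread and an in-process queue as a stand-in for a real network transport:

```python
import threading, queue

mailbox = queue.Queue()  # message queue between two components

def worker():
    while True:
        msg = mailbox.get()   # receive a message
        if msg is None:       # sentinel: shut down
            break
        print('worker received:', msg)

t = threading.Thread(target=worker)
t.start()
mailbox.put('compute block 42')  # coordinate by sending messages,
mailbox.put(None)                # not by sharing mutable state
t.join()
```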
Information-based complexity (IBC) studies optimal algorithms and computational complexity for continuous problems. IBC has studied such continuous problems as path integration, partial differential equations, systems of ordinary differential equations, nonlinear equations, integral equations, fixed points, and very-high-dimensional integration.
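High-dimensional integration illustrates why such problems are studied: simple Monte Carlo sampling attains an error rate of O(1/sqrt(n)) independent of the dimension, whereas grid-based methods need exponentially many points. A sketch (the integrand is illustrative):

```python
import random

def monte_carlo_integral(f, dim, n_samples=100_000):
    """Estimate the integral of f over the unit cube [0,1]^dim.

    The error decreases like O(1/sqrt(n_samples)) regardless of dim,
    which is why Monte Carlo methods matter in very high dimensions.
    """
    total = 0.0
    for _ in range(n_samples):
        x = [random.random() for _ in range(dim)]
        total += f(x)
    return total / n_samples

# The integral of sum(x_i) over [0,1]^100 is exactly 50.
print(monte_carlo_integral(lambda x: sum(x), dim=100))
```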
Formal methods are a particular kind of mathematics-based techniques for the specification, development and verification of software and hardware systems.[17] The use of formal methods for software and hardware design is motivated by the expectation that, as in other engineering disciplines, performing appropriate mathematical analysis can contribute to the reliability and robustness of a design.[18]
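Real formal-methods tools reason about mathematical models rather than test runs, but the flavour of "specification plus verification" can be suggested with executable contracts. The sketch below states a pre- and postcondition for a hypothetical function and checks them exhaustively over a small finite domain, a stand-in for a genuine proof:

```python
def integer_sqrt(n: int) -> int:
    """Return the floor of the square root of n.

    Precondition:  n >= 0
    Postcondition: r*r <= n < (r+1)*(r+1)
    """
    assert n >= 0
    r = 0
    while (r + 1) * (r + 1) <= n:
        r += 1
    return r

# Exhaustively verify the specification on a small finite domain;
# a formal-methods tool would instead prove it for all n.
for n in range(10_000):
    r = integer_sqrt(n)
    assert r * r <= n < (r + 1) * (r + 1)
```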
Machine learning is a scientific discipline that deals with the construction and study of algorithms that can learn from data.[27] Such algorithms operate by building a model based on inputs[28]: 2 and using that to make predictions or decisions, rather than following only explicitly programmed instructions.
Dually, one can view processes occurring in nature as information processing. Such processes include self-assembly, developmental processes, gene regulation networks, protein–protein interaction networks, biological transport (active transport, passive transport) networks, and gene assembly in unicellular organisms. Efforts to understand biological systems also include engineering of semi-synthetic organisms, and understanding the universe itself from the point of view of information processing. Indeed, the idea has even been advanced that information is more fundamental than matter or energy. The Zuse-Fredkin thesis, dating back to the 1960s, states that the entire universe is a huge cellular automaton which continuously updates its rules.[34][35] Recently it has been suggested that the whole universe is a quantum computer that computes its own behaviour.[36] The universe/nature as a computational mechanism is addressed in,[37] which explores nature with the help of the ideas of computability, and in,[38] which studies natural processes as computations (information processing).
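The kind of cellular automaton invoked by the Zuse-Fredkin thesis can be sketched in a few lines; the example below runs the well-known elementary cellular automaton Rule 110, in which each cell's next state is a fixed function of its three-cell neighbourhood:

```python
RULE = 110  # elementary cellular automaton rule number

def step(cells):
    """One synchronous update: each cell looks at (left, self, right)."""
    n = len(cells)
    return [
        (RULE >> (cells[(i - 1) % n] * 4 + cells[i] * 2 + cells[(i + 1) % n])) & 1
        for i in range(n)
    ]

cells = [0] * 31 + [1] + [0] * 31  # single live cell in the middle
for _ in range(16):
    print(''.join('#' if c else '.' for c in cells))
    cells = step(cells)
```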
Parallel computing is a form of computation in which many calculations are carried out simultaneously,[40] operating on the principle that large problems can often be divided into smaller ones, which are then solved "in parallel". There are several different forms of parallel computing: bit-level, instruction-level, data, and task parallelism. Parallelism has been employed for many years, mainly in high-performance computing, but interest in it has grown lately due to the physical constraints preventing frequency scaling.[41] As power consumption (and consequently heat generation) by computers has become a concern in recent years,[42] parallel computing has become the dominant paradigm in computer architecture, mainly in the form of multi-core processors.[43]
Parallel computer programs are more difficult to write than sequential ones,[44] because concurrency introduces several new classes of potential software bugs, of which race conditions are the most common. Communication and synchronization between the different subtasks are typically some of the greatest obstacles to getting good parallel program performance.
The maximum possible speed-up of a single program as a result of parallelization is given by Amdahl's law.
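Amdahl's law states that if a fraction p of a program's work can be parallelized across s processors while the rest stays serial, the overall speed-up is at most 1 / ((1 - p) + p / s). A small sketch evaluating this bound:

```python
def amdahl_speedup(p: float, s: int) -> float:
    """Amdahl's law: overall speed-up when a fraction p of the work
    is parallelized across s processors (the rest remains serial)."""
    return 1.0 / ((1.0 - p) + p / s)

# With 95% of the work parallelizable, even unlimited processors
# cannot beat a 20x speed-up: the serial 5% dominates.
for s in (2, 8, 64, 1024):
    print(s, round(amdahl_speedup(0.95, s), 2))
```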
Programming language theory is a branch of computer science that deals with the design, implementation, analysis, characterization, and classification of programming languages and their individual features. It falls within the discipline of theoretical computer science, both depending on and affecting mathematics, software engineering, and linguistics. It is an active research area, with numerous dedicated academic journals.
In programming language theory, semantics is the field concerned with the rigorous mathematical study of the meaning of programming languages. It does so by evaluating the meaning of syntactically legal strings defined by a specific programming language, showing the computation involved; syntactically illegal strings have no assigned meaning and yield no computation. Semantics describes the processes a computer follows when executing a program in that specific language. This can be done by describing the relationship between the input and output of a program, or by explaining how the program will execute on a certain platform, hence creating a model of computation.
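A taste of this can be given by a tiny evaluator that maps each syntactically legal expression of a toy arithmetic language to the value it denotes (the tuple-based representation is an illustrative choice):

```python
# A toy language of arithmetic expressions, represented as nested tuples:
# an expression is an int, or ('add', e1, e2), or ('mul', e1, e2).
def meaning(expr):
    """Map each syntactically legal expression to the number it denotes."""
    if isinstance(expr, int):
        return expr
    op, e1, e2 = expr
    if op == 'add':
        return meaning(e1) + meaning(e2)
    if op == 'mul':
        return meaning(e1) * meaning(e2)
    raise ValueError(f'syntactically illegal expression: {expr!r}')

print(meaning(('add', 1, ('mul', 2, 3))))  # 1 + 2*3 -> 7
```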
Experiments have been carried out in which quantum computational operations were executed on a very small number of qubits.[50] Both practical and theoretical research continues, and many national governments and military funding agencies support quantum computing research to develop quantum computers for both civilian and national security purposes, such as cryptanalysis.[51]
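Such small systems can be simulated exactly on a classical computer, which is one way experiments like these are designed and checked. A sketch of a single simulated qubit passing through a Hadamard gate and being measured (no quantum hardware or external libraries assumed):

```python
import math, random

# One qubit as a pair of complex amplitudes (alpha, beta) with
# |alpha|^2 + |beta|^2 = 1. Start in the basis state |0>.
alpha, beta = 1.0, 0.0

# Apply a Hadamard gate: puts the qubit into an equal superposition.
h = 1 / math.sqrt(2)
alpha, beta = h * (alpha + beta), h * (alpha - beta)

# Measure: outcome 0 with probability |alpha|^2, else 1.
outcome = 0 if random.random() < abs(alpha) ** 2 else 1
print('amplitudes:', (alpha, beta), 'measured:', outcome)
```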
Computer algebra, also called symbolic computation or algebraic computation, is a scientific area that refers to the study and development of algorithms and software for manipulating mathematical expressions and other mathematical objects. Although, properly speaking, computer algebra should be a subfield of scientific computing, they are generally considered as distinct fields because scientific computing is usually based on numerical computation with approximate floating point numbers, while symbolic computation emphasizes exact computation with expressions containing variables that have no given value and are thus manipulated as symbols (hence the name symbolic computation).
Software applications that perform symbolic calculations are called computer algebra systems, with the term system alluding to the complexity of the main applications, which include, at least, a method to represent mathematical data in a computer, a user programming language (usually different from the language used for the implementation), a dedicated memory manager, a user interface for the input/output of mathematical expressions, and a large set of routines to perform usual operations, like simplification of expressions, differentiation using the chain rule, polynomial factorization, indefinite integration, etc.
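These operations can be illustrated with a computer algebra library; the sketch below uses SymPy for Python, assuming it is installed:

```python
import sympy

x = sympy.symbols('x')

# Differentiation using the chain rule:
print(sympy.diff(sympy.sin(x**2), x))        # 2*x*cos(x**2)

# Polynomial factorization:
print(sympy.factor(x**2 - 5*x + 6))          # (x - 2)*(x - 3)

# Indefinite integration:
print(sympy.integrate(sympy.cos(x), x))      # sin(x)

# Simplification of expressions:
print(sympy.simplify((x**2 - 1) / (x - 1)))  # x + 1
```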
^ Cook, Stephen A. (1971). "The complexity of theorem-proving procedures". Proceedings of the Third Annual ACM Symposium on Theory of Computing – STOC '71. pp. 151–158. doi:10.1145/800157.805047. ISBN 978-1-4503-7464-4.
^ "Any classical mathematical algorithm, for example, can be described in a finite number of English words". Rogers, Hartley Jr. (1967). Theory of Recursive Functions and Effective Computability. McGraw-Hill. p. 2.
^ Well defined with respect to the agent that executes the algorithm: "There is a computing agent, usually human, which can react to the instructions and carry out the computations" (Rogers 1967, p. 2).
^ "an algorithm is a procedure for computing a function (with respect to some chosen notation for integers) ... this limitation (to numerical functions) results in no loss of generality" (Rogers 1967, p. 1).
^ "An algorithm has zero or more inputs, i.e., quantities which are given to it initially before the algorithm begins" (Knuth 1973:5).
^ "A procedure which has all the characteristics of an algorithm except that it possibly lacks finiteness may be called a 'computational method'" (Knuth 1973:5).
^ "An algorithm has one or more outputs, i.e. quantities which have a specified relation to the inputs" (Knuth 1973:5).
^ Whether or not a process with random interior processes (not including the input) is an algorithm is debatable. Rogers opines that: "a computation is carried out in a discrete stepwise fashion, without the use of continuous methods or analog devices ... carried forward deterministically, without resort to random methods or devices, e.g., dice" (Rogers 1967, p. 2).
^ Rivest, Ronald L. (1990). "Cryptology". In J. Van Leeuwen (ed.). Handbook of Theoretical Computer Science. Vol. 1. Elsevier.
^ Bellare, Mihir; Rogaway, Phillip (21 September 2005). "Introduction". Introduction to Modern Cryptography. p. 10.
^ Coulouris, George; Dollimore, Jean; Kindberg, Tim; Blair, Gordon (2011). Distributed Systems: Concepts and Design (5th ed.). Boston: Addison-Wesley. ISBN 978-0-132-14301-1.
^ Ghosh, Sukumar (2007). Distributed Systems – An Algorithmic Approach. Chapman & Hall/CRC. p. 10. ISBN 978-1-58488-564-1.
^ Allikmets, Rando; Wasserman, Wyeth W.; Hutchinson, Amy; Smallwood, Philip; Nathans, Jeremy; Rogan, Peter K.; Schneider, Thomas D.; Dean, Michael (1998). "Organization of the ABCR gene: analysis of promoter and splice junction sequences". Gene 215:1, 111–122.
^ Burnham, K. P.; Anderson, D. R. (2002). Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach (2nd ed.). New York: Springer Science. ISBN 978-0-387-95364-9.
^ Wernick, Yang, Brankov, Yourganov and Strother (July 2010). "Machine Learning in Medical Imaging". IEEE Signal Processing Magazine. Vol. 27, no. 4, pp. 25–38.
^ Mannila, Heikki (1996). Data mining: machine learning, statistics, and databases. Int'l Conf. Scientific and Statistical Database Management. IEEE Computer Society.
^ Friedman, Jerome H. (1998). "Data Mining and Statistics: What's the connection?". Computing Science and Statistics. 29 (1): 3–9.
^ S. V. Adve et al. (November 2008). "Parallel Computing Research at Illinois: The UPCRC Agenda" Archived 2008-12-09 at the Wayback Machine (PDF). Parallel@Illinois, University of Illinois at Urbana-Champaign. "The main techniques for these performance benefits – increased clock frequency and smarter but increasingly complex architectures – are now hitting the so-called power wall. The computer industry has accepted that future performance increases must largely come from increasing the number of processors (or cores) on a die, rather than making a single core go faster."
^ Asanovic et al. Old [conventional wisdom]: Power is free, but transistors are expensive. New [conventional wisdom] is [that] power is expensive, but transistors are "free".
^ Asanovic, Krste et al. (December 18, 2006). "The Landscape of Parallel Computing Research: A View from Berkeley" (PDF). University of California, Berkeley. Technical Report No. UCB/EECS-2006-183. "Old [conventional wisdom]: Increasing clock frequency is the primary method of improving processor performance. New [conventional wisdom]: Increasing parallelism is the primary method of improving processor performance ... Even representatives from Intel, a company generally associated with the 'higher clock-speed is better' position, warned that traditional approaches to maximizing performance through maximizing clock speed have been pushed to their limit."
^ Manin, Yu. I. (1980). Vychislimoe i nevychislimoe [Computable and Noncomputable] (in Russian). Sov. Radio. pp. 13–15. Archived from the original on 10 May 2013. Retrieved 4 March 2013.
^ Finkelstein, David (1968). "Space-Time Structure in High Energy Interactions". In Gudehus, T.; Kaiser, G. (eds.). Fundamental Interactions at High Energy. New York: Gordon & Breach.