Data processing is thecollection and manipulation of digital data to produce meaningful information.[1] Data processing is a form ofinformation processing, which is the modification (processing) of information in any manner detectable by an observer.[note 1]
Data processing may involve various processes, including:
TheUnited States Census Bureau history illustrates the evolution of data processing from manual through electronic procedures.
Although widespread use of the termdata processing dates only from the 1950s,[2] data processing functions have been performed manually for millennia. For example,bookkeeping involves functions such as posting transactions and producing reports like thebalance sheet and thecash flow statement. Completely manual methods were augmented by the application ofmechanical or electroniccalculators. A person whose job was to perform calculations manually or using a calculator was called a "computer."
The1890 United States census schedule was the first to gather data by individual rather thanhousehold. A number of questions could be answered by making a check in the appropriate box on the form. From 1850 to 1880 the Census Bureau employed "a system of tallying, which, by reason of the increasing number of combinations of classifications required, became increasingly complex. Only a limited number of combinations could be recorded in one tally, so it was necessary to handle the schedules 5 or 6 times, for as many independent tallies."[3] "It took over 7 years to publish the results of the 1880 census"[4] using manual processing methods.
The termautomatic data processing was applied to operations performed by means ofunit record equipment, such asHerman Hollerith's application ofpunched card equipment for the1890 United States census. "Using Hollerith's punchcard equipment, the Census Office was able to complete tabulating most of the 1890 census data in 2 to 3 years, compared with 7 to 8 years for the 1880 census. It is estimated that using Hollerith's system saved some $5 million in processing costs"[4] in 1890 dollars even though there were twice as many questions as in 1880.
Computerized data processing, orelectronic data processing represents a later development, with a computer used instead of several independent pieces of equipment. The Census Bureau first made limited use ofelectronic computers for the1950 United States census, using aUNIVAC I system,[3] delivered in 1952.
The termdata processing has mostly been subsumed by the more general terminformation technology (IT).[5] The older term "data processing" is suggestive of older technologies. For example, in 1996 theData Processing Management Association (DPMA) changed its name to theAssociation of Information Technology Professionals. Nevertheless, the terms are approximately synonymous.
Commercial data processing involves a large volume of input data, relatively few computational operations, and a large volume of output. For example, an insurance company needs to keep records on tens or hundreds of thousands of policies, print and mail bills, and receive and post payments.
In science and engineering, the termsdata processing andinformation systems are considered too broad, and the termdata processing is typically used for the initial stage followed by adata analysis in the second stage of the overall data handling.
Data analysis uses specializedalgorithms andstatistical calculations that are less often observed in a typical general business environment. For data analysis, software suites likeSPSS orSAS, or their free counterparts such asDAP,gretl, orPSPP are often used. These tools are usually helpful for processing various huge data sets, as they are able to handle enormous amount of statistical analysis.[6]
Adata processing system is a combination ofmachines, people, and processes that for a set ofinputs produces a defined set ofoutputs. The inputs and outputs are interpreted asdata,facts,information etc. depending on the interpreter's relation to the system.
A term commonly used synonymously withdata or storage (codes) processing system isinformation system.[7] With regard particularly toelectronic data processing, the corresponding concept is referred to aselectronic data processing system.
A very simple example of a data processing system is the process of maintaining a check register. Transactions— checks and deposits— are recorded as they occur and the transactions are summarized to determine a current balance. Monthly the data recorded in the register is reconciled with a hopefully identical list of transactions processed by the bank.
A more sophisticated record keeping system might further identify the transactions— for example deposits by source or checks by type, such as charitable contributions. This information might be used to obtain information like the total of all contributions for the year.
The important thing about this example is that it is asystem, in which, all transactions are recorded consistently, and the same method of bank reconciliation is used each time.
This is aflowchart of a data processing system combining manual and computerized processing to handleaccounts receivable, billing, andgeneral ledger