Michael Ralph Stonebraker (born October 11, 1943[6]) is an Americancomputer scientist specializing indatabase systems. Through a series of academic prototypes and commercial startups, Stonebraker's research and products are central to manyrelational databases. He is also the founder of many database companies, includingIngres Corporation,Illustra, Paradigm4,StreamBase Systems,Tamr,Vertica andVoltDB, and served aschief technical officer ofInformix. For his contributions to database research, Stonebraker received the 2014Turing Award, often described as "the Nobel Prize for computing."[7]
Michael Stonebraker | |
---|---|
![]() Michael Stonebraker giving the 2015 Turing lecture | |
Born | (1943-10-11)October 11, 1943 (age 81) |
Alma mater | Princeton University (BSE) University of Michigan (MS,PhD) |
Known for | Ingres,Postgres,Vertica,Streambase,Illustra,VoltDB,SciDB |
Spouse | Beth |
Awards | IEEE John von Neumann Medal(2005) ACM Turing Award(2014) |
Scientific career | |
Fields | Computer science |
Institutions | University of California, Berkeley University of Michigan Massachusetts Institute of Technology |
Thesis | The Reduction of Large Scale Markov Models for Random Chains |
Doctoral advisor | Arch Waugh Naylor |
Notable students | Joseph M. Hellerstein Clifford A. Lynch[1] Margo Seltzer[1] Dale Skeen[2] Marti Hearst[3] Leilani Battle[4] |
Website | csail |
Stonebraker's career can be broadly divided into two phases: his time atUniversity of California, Berkeley when he focused onrelational database management systems such asIngres andPostgres, and, starting in 2001, atMassachusetts Institute of Technology (MIT) where he developed more novel data management techniques such asC-Store,H-Store,SciDB andDBOS.[8] Stonebraker is currently a professor emeritus at UC Berkeley and an adjunct professor at MIT's Computer Science and Artificial Intelligence Laboratory.[9][10] He is also known as an editor for the bookReadings in Database Systems.
Life
editStonebraker grew up inMilton, New Hampshire.[11] He earned his B.S.E. inelectrical engineering fromPrinceton University in 1965, and hisM.S. andPh.D. from theUniversity of Michigan in 1967 and 1971[12] respectively. His awards include theIEEE John von Neumann Medal and the firstSIGMODEdgar F. Codd Innovations Award. In 1994 he was inducted as aFellow of theAssociation for Computing Machinery.[13] In 1997, he was elected a member of theNational Academy of Engineering for the development and commercialization of relational and object-relational database systems. In March 2015 it was announced he won the 2014ACM Turing Award.[7] In September 2015, he won the 2015 Commonwealth Award, chosen by council members of MassTLC.[14]
The Berkeley years (1971–2000)
editStonebraker joinedUniversity of California, Berkeley, as an assistant professor in 1971, and taught in the computer science department for twenty-nine years. It was there that he did his early pioneering work on relational databases.
Ingres
editIn 1973, Stonebraker and his colleagueEugene Wong started researching relational database systems after reading a series of seminal papers published byEdgar F. Codd on therelational data model.[15]
Their project, known asIngres (Interactive Graphics and Retrieval System),[16] was one of the first systems (along withSystem R fromIBM) todemonstrate that it was possible to build a practical and efficient implementation of the relational model. A number of key ideas from INGRES are still widely used in relational systems, including the use ofB-trees, primary-copy replication, the query rewrite approach to views andintegrity constraints, and the idea of rules/triggers for integrity checking in an RDBMS. Additionally, much experimental work was done that provided insights into how to build a locking system that could provide satisfactory transaction performance.[17]
By the mid-1970s, Stonebraker's team produced, using a rotating team of student programmers, a usable relational database system. At the time Ingres was considered "low end" compared to IBM's System R, as it ran onUnix-basedDigital Equipment Corporation machines as opposed to the "big iron"IBM mainframes.[citation needed]
By the early 1980s, however, the performance and capabilities of these low-end machines were seriously threatening IBM's mainframe market, and with the threat came the ability of Ingres to become a viable, "real" product for a large number of applications. Ingres used a variation of theBSD license for a nominal fee, and soon a number of companies took advantage of this to create commercial versions of Ingres.[citation needed]
These included Stonebraker, who with fellow Berkeley professors Larry Rowe and Eugene Wong helped foundRelational Technology, Inc., later called Ingres Corporation. Subsequently, sold toComputer Associates, Ingres was re-established as an independent company in 2005, and later renamedActian. Other startups based on Ingres includeSybase, founded by Robert Epstein, a student on the project, andBritton Lee, Inc. Sybase's code was later used as a basis forMicrosoft SQL Server.[18]
Postgres
editAfter founding Relational Technology, Stonebraker and Rowe began a "post-Ingres" effort, to address the limitations of the relational model. The new project was namedPOSTGRES (POST inGRES),[19] and was designed to add support for complex data types to database systems and improve end-to-end performance of data-intensive applications. Postgres provided anobject relational programming model in which fields could be complex datatypes, and where users could register new types as well as scalar and aggregate functions over those types. Postgres was extensible in a number of other ways, making it easy for programmers to modify or add to the optimizer, query language, runtime, and indexing frameworks. These features improved both database programmability and performance, and made it possible to push large portions of a number of applications inside the database, includinggeographic information systems andtime series processing. This had the effect of substantially broadening the commercial database market.[citation needed]
Postgres was also offered using a BSD-like license, and the code forms the basis of thefree software,PostgreSQL. Stonebraker also led an effort to commercialize the code, creatingIllustra which was purchased byInformix. PostgreSQL has been used as the basis for a number of other startup companies, includingAster Data Systems,EnterpriseDB, andGreenplum.[citation needed]
Informix acquired Illustra in 1996, and Stonebraker became Informix's CTO, a position he held until September 2000. Informix integrated Illustra's O–R mapping andDataBlades into the 7.x OnLine product, resulting in Informix Universal Server (IUS), or more generally, Version 9.[citation needed]
Mariposa and Cohera
editAfter the Postgres project, Stonebraker initiated the Mariposa[20] project which became the basis of Cohera Corporation. Mariposa built afederated database over an economic model of resource trading, in which data distributed across multiple organizations could be integrated and queried from a single relational interface, governed by site-specific policies that would charge for data processing and storage. These economic policies allowed traditional ideas inquery optimization to be carried out over competing sites, and also served as the basis for data storage, replication and movement within a federation.
Cohera's initial mission was to commercialize Mariposa, but eventually focused on a business-to-business catalog management application on the core federated data integration engine. Cohera's intellectual property was purchased byPeopleSoft in 2001,and used as the basis of PeopleSoft's Enterprise Catalog Management. PeopleSoft was in turn purchased byOracle Corporation in 2004.[citation needed]
The MIT years (2001–present)
editThis section of abiography of a living persondoes notinclude anyreferences or sources. Please help by addingreliable sources. Contentious material about living people that is unsourced or poorly sourcedmust be removed immediately. Find sources: "Michael Stonebraker" – news ·newspapers ·books ·scholar ·JSTOR(December 2023) (Learn how and when to remove this message) |
Stonebraker became an adjunct professor atMIT in 2001, where he began another series of research projects and founded a number of companies.
Aurora and StreamBase
editIn the Aurora Project, Stonebraker, along with colleagues fromBrandeis University,Brown University, and MIT, focused on data management for streaming data, using a new data model and query language. Unlike relational systems, which "pull" data and process it a record at a time, in Aurora, data is "pushed", arriving asynchronously from external data sources (such as stock ticks, news feeds, or sensors.) The output is itself a stream of results (such as windowed averages) that are sent to users.[21]
Stonebraker co-foundedStreamBase Systems in 2003 to commercialize the technology behind Aurora.
C-Store and Vertica
editIn theC-Store project, started in 2005, Stonebraker, along with colleagues from Brandeis, Brown, MIT, andUniversity of Massachusetts Boston, developed a parallel,shared-nothingcolumn-oriented DBMS for data warehousing. By dividing and storing data in columns, C-Store is able to perform less I/O and get better compression ratios than conventional database systems that store data in rows.[22]
Stonebraker explained that it's because similar data items are side-by-side: Name,Name,Name,Name vs. Name,Address,Zip,Phone#. In 2005, Stonebraker co-foundedVertica to commercialize the technology behind C-Store.[23]
Morpheus and Goby
editIn 2006, Stonebraker started the Morpheus project, along with researchers from the University of Florida. Morpheus is adata integration system which relies on a collection of "transforms" tomediate between data sources. Each transform provides a queryable interface to particular web site or service, and Morpheus makes it possible to search for and compose multiple transforms to provide a new service or a unified view of several services.
In 2009, Stonebraker co-founded Goby,[24] alocal search company based on ideas from Morpheus, for people to explore new things to do in free time.
H-Store and VoltDB
editThis section of abiography of a living persondoes notinclude anyreferences or sources. Please help by addingreliable sources. Contentious material about living people that is unsourced or poorly sourcedmust be removed immediately. Find sources: "Michael Stonebraker" – news ·newspapers ·books ·scholar ·JSTOR(December 2023) (Learn how and when to remove this message) |
In 2007, with researchers fromBrown University,MIT, andYale University, Stonebraker started theH-Store project. H-Store is a distributed main-memoryonline transaction processing (OLTP) system designed to provide very high throughput on transaction processing workloads.
In 2009, Stonebraker co-founded, and then served as an adviser to,VoltDB a commercial startup based on ideas from the H-Store project.
SciDB
editIn 2008, along withDavid DeWitt and researchers from Brown, MIT,Portland State University,SLAC, theUniversity of Washington, and theUniversity of Wisconsin–Madison, Stonebraker startedSciDB[25][26] an open-source DBMS specially designed for scientific research applications.[27]
He founded Paradigm4 with Marilyn Matz, who became CEO. Paradigm4 developed SciDB, used mostly by life sciences and financial markets.Novartis,Foundation Medicine, and theNational Institutes of Health are some of the company's clients.[14][28]
NoSQL
editIn 2010 and 2011, Stonebraker criticized theNoSQL movement.[29][30][31]
Notable students
editThis section of abiography of a living persondoes notinclude anyreferences or sources. Please help by addingreliable sources. Contentious material about living people that is unsourced or poorly sourcedmust be removed immediately. Find sources: "Michael Stonebraker" – news ·newspapers ·books ·scholar ·JSTOR(December 2023) (Learn how and when to remove this message) |
Stonebraker trained more than 30 students,[2] including:
- Daniel Abadi, co-founder of Hadapt (acquired byTeradata)
- Michael J. Carey, professor at UC Irvine
- Paula Hawthorn, co-founder of Britton Lee
- Marti Hearst, professor at UC Berkeley
- Joseph M. Hellerstein, professor at UC Berkeley
- Clifford A. Lynch, executive director of theCoalition for Networked Information
- Margo Seltzer, professor at theUniversity of British Columbia, founder and former CTO ofSleepycat Software
- Dale Skeen, founder and CEO of Vitria
- Sunita Sarawagi, professor at IIT Bombay
Selected works
edit- Joseph M. Hellerstein; Michael Stonebraker (2015).Readings in Database Systems (5th ed.). MIT Press.
- Michael Stonebraker;Randy Katz,David Patterson,John Ousterhout (1988)."THE DESIGN OF XPRS"(PDF).VLDB:318–330. Retrieved25 March 2015.
{{cite journal}}
: CS1 maint: multiple names: authors list (link)
References
edit- ^ab"Ph.D. Dissertations | EECS at UC Berkeley".www2.eecs.berkeley.edu.
- ^abMichael Stonebraker at theMathematics Genealogy Project
- ^"Nice: or what it was like to be Mike's student"(PDF).
- ^Battle, Leilani Marie (2017).Behavior-driven optimization techniques for scalable data exploration (Thesis). Massachusetts Institute of Technology.hdl:1721.1/111853. Retrieved2023-12-27.
- ^"Michael Stonebraker - A.M. Turing Award Winner". Retrieved2018-02-06.
- ^"Contributors".IEEE Transactions on Systems, Man, and Cybernetics (4):562–564. Sep 1972.doi:10.1109/TSMC.1972.4309174.
- ^abConner-Simons, Adam (March 25, 2015)."Michael Stonebraker wins $1 million Turing Award".MIT News. Massachusetts Institute of Technology. RetrievedMarch 25, 2015.
- ^"Postgres pioneer Michael Stonebraker promises to upend the database once more".www.theregister.com. Retrieved2023-12-27.
- ^"Michael Stonebraker".www2.eecs.berkeley.edu. Retrieved2018-03-16.
- ^"Michael Stonebraker | MIT CSAIL".www.csail.mit.edu. Retrieved2018-03-16.
- ^Oral History of Michael Stonebraker; 2012-08-23 Retrieved 2018-08-26.
- ^Stonebraker, Michael Ralph (1971).The Reduction of Large Scale Markov Models for Random Chains (PhD thesis).University of Michigan.OCLC 634008426.ProQuest 302585708.
- ^"Michael Ralph Stonebraker - ACM author profile page". Retrieved2011-07-27.
- ^abGeller, Jessica."PTC Chief Heppelman named CEO of the year by Mass. tech council." betaBoston. The Boston Globe. Sept. 16, 2015Archived 2016-01-07 at theWayback Machine
- ^Codd, E. F. (1970)."A relational model of data for large shared data banks"(PDF).Communications of the ACM.13 (6):377–387.doi:10.1145/362384.362685.S2CID 207549016.
- ^Stonebraker, M.; Held, G.; Wong, E.; Kreps, P. (1976). "The design and implementation of INGRES".ACM Transactions on Database Systems.1 (3): 189.CiteSeerX 10.1.1.109.957.doi:10.1145/320473.320476.S2CID 1514658.
- ^"Relational Roots". Joseph Hellerstein. 1998. Retrieved2009-11-24.
- ^"Motivation & DBMS Architecture Overview". Joseph Hellerstein. 1998. Retrieved2009-11-24.
- ^Stonebraker, M.; Rowe, L. A. (1986)."The design of POSTGRES".ACM SIGMOD Record.15 (2): 340.doi:10.1145/16856.16888.
- ^Stonebraker, M.; Aoki, P. M.; Litwin, W.; Pfeffer, A.; Sah, A.; Sidell, J.; Staelin, C.; Yu, A. (1996). "Mariposa: A wide-area distributed database system".The VLDB Journal the International Journal on Very Large Data Bases.5:48–63.CiteSeerX 10.1.1.68.5480.doi:10.1007/s007780050015.S2CID 5062284.
- ^Abadi, D. J.; Carney, D.; Etintemel, U.; Cherniack, M.; Convey, C.; Lee, S.;Stonebraker, M.; Tatbul, N.; Zdonik, S. (2003). "Aurora: A new model and architecture for data stream management".The VLDB Journal the International Journal on Very Large Data Bases.12 (2): 120.CiteSeerX 10.1.1.6.1187.doi:10.1007/s00778-003-0095-z.S2CID 8101432.
- ^(Print edition title: Database Pioneer Rethinks How Data is Organized.Charles Babcock (February 21, 2008)."Database Pioneer Rethinks The Best Way To Organize Data".InformationWeek.
- ^"The Vertica Analytic Database: C-Store 7 Years Later" (PDF)"(PDF).VLDB.org. August 28, 2012.
- ^Goby.
- ^Brown, P. G. (2010). "Overview of sciDB".Proceedings of the 2010 international conference on Management of data - SIGMOD '10. pp. 963–968.doi:10.1145/1807167.1807271.ISBN 9781450300322.S2CID 14544985.
- ^Stonebraker, M.; Brown, P.; Poliakov, A.; Raman, S. (2011). "The Architecture of SciDB".Scientific and Statistical Database Management. Lecture Notes in Computer Science. Vol. 6809. pp. 1–16.doi:10.1007/978-3-642-22351-8_1.ISBN 978-3-642-22350-1.
- ^"SciDB: Relational daddy answers Google, Hadoop, NoSQL".The Register. 2010-09-13. Retrieved2012-01-11.
- ^Alspach, Kyle."New Money: MassChallenge Alum Gets Dorm Room Fund Investment; Drone Co. Raises Seed Round." BostInno. Nov. 30, 2015Archived 2016-02-07 at theWayback Machine
- ^Stonebraker, M. (2010). "SQL databases v. NoSQL databases".Communications of the ACM.53 (4):10–11.doi:10.1145/1721654.1721659.S2CID 13959501.
- ^Stonebraker, M. (2011). "Stonebraker on NoSQL and enterprises".Communications of the ACM.54 (8):10–11.doi:10.1145/1978542.1978546.S2CID 36572502.
- ^Stonebraker, M.; Abadi, D.; Dewitt, D. J.; Madden, S.; Paulson, E.; Pavlo, A.; Rasin, A. (2010). "MapReduce and parallel DBMSs".Communications of the ACM.53:64–71.doi:10.1145/1629175.1629197.S2CID 61484899.
External links
edit- "Michael Stonebraker". Berkeley, CA, USA: University of California. 1995-12-23. Retrieved2009-07-14.
- "Michael Stonebraker".User Profiles. MA, USA: MIT. 2009-07-09. Retrieved2009-07-14.
- Monash, Curt,"Michael Stonebraker",DBMS2, a series of recent interviews and comments about and by Stonebraker.
- "Morpheus: A Data Integration Toolkit".CSAIL Research Abstracts. MA, USA: MIT. Archived fromthe original on 2010-06-16. Retrieved2009-11-22.
- Goby, a new search engine to find fun things to do in one's free time (co-founded by Stonebraker)
- Database pioneer Stonebraker rocks $1M "Nobel Prize in Computing", 2015-03-25