American computer scientist (born 1966)
Sanjay Ghemawat (born 1966 inWest Lafayette, Indiana )[ 1] is anIndian American [ 2] computer scientist andsoftware engineer . He is currently a SeniorFellow atGoogle in the Systems Infrastructure Group.[ 3] [ 4] Ghemawat's work at Google, much of it in close collaboration withJeff Dean ,[ 5] has includedbig data processing modelMapReduce , theGoogle File System , and databasesBigtable andSpanner .Wired has described him as one of the "most important software engineers of the internet age".[ 5]
Ghemawat was elected as a member into theNational Academy of Engineering in 2009 for contributions to the science and engineering of large-scale distributed computer systems.
Education and early career [ edit ] Ghemawat was born inWest Lafayette, Indiana and grew up inKota, Rajasthan .[ 6] He studied atCornell University and theMassachusetts Institute of Technology (MIT).[ 3] He obtained a PhD from MIT in 1995, with a dissertation titled,The Modified Object Buffer: A Storage Management Technique for Object-Oriented Databases. Hisadvisors wereBarbara Liskov andFrans Kaashoek .[ 7]
Before joining Google, Ghemawat worked at theDEC Systems Research Center . There he began his long-time collaboration withJeff Dean , who worked at anotherDEC research lab nearby. Their work at DEC included aJava compiler and asystem profiling tool .[ 5]
After DEC was acquired byCompaq , many of its researchers left the company. Dean took a position at the newly founded search-engine companyGoogle , and was joined by Ghemawat in 1999. The two began working on Google's core infrastructure, making improvements to cope with the search engine's rapid growth in users in the early 2000s.[ 5]
Ghemawat's work at Google includes:
Original design ofProtocol Buffers , an open-source data interchange format. MapReduce , a system for large-scale data processing applications.Google File System , is a proprietary distributed file system developed to provide efficient, reliable access to data using large clusters of commodity hardware.Spanner , a scalable, multi-version, globally distributed, and synchronously replicated database.Bigtable , a large-scale semi-structured storage system.LevelDB , an open-source on-disk key-value store.TensorFlow , an open-source machine-learning software library.Service Weaver, an open-source framework for writing distributed applications. Ghemawat was elected to theNational Academy of Engineering in 2009,[ 2] and to theAmerican Academy of Arts and Sciences in 2016.[ 8] In 2012, he and Dean received theACM Prize in Computing for their work on internet infrastructure,[ 2] [ 3] [ 4] and the ACMSIGOPS Mark Weiser Award .[ 9]
Selected publications [ edit ] Ghemawat, Sanjay; Gobioff, Howard; Leung, Shun-Tak (2003). "The Google file system".Proceedings of the nineteenth ACM symposium on Operating systems principles . SOSP '03. New York, NY, USA: ACM. pp. 29– 43.doi :10.1145/945445.945450 .ISBN 1581137575 .S2CID 221261373 . Dean, Jeffrey; Ghemawat, Sanjay (January 2008)."MapReduce: Simplified Data Processing on Large Clusters" .Commun. ACM .51 (1):107– 113.doi :10.1145/1327452.1327492 .ISSN 0001-0782 .S2CID 221021701 . Chang, Fay; Dean, Jeffrey; Ghemawat, Sanjay; Hsieh, Wilson C.; Wallach, Deborah A.; Burrows, Mike; Chandra, Tushar; Fikes, Andrew; Gruber, Robert E. (June 2008). "Bigtable: A Distributed Storage System for Structured Data".ACM Trans. Comput. Syst .26 (2): 4:1–4:26.doi :10.1145/1365815.1365816 .ISSN 0734-2071 .S2CID 214769387 . Dean, Jeffrey; Ghemawat, Sanjay (January 2010)."MapReduce: A Flexible Data Processing Tool" .Commun. ACM .53 (1):72– 77.doi :10.1145/1629175.1629198 .ISSN 0001-0782 . Corbett, James C.; Dean, Jeffrey; Epstein, Michael; Fikes, Andrew; Frost, Christopher; Furman, J. J.; Ghemawat, Sanjay; Gubarev, Andrey; Heiser, Christopher (August 2013)."Spanner: Google's Globally Distributed Database" .ACM Trans. Comput. Syst .31 (3): 8:1–8:22.doi :10.1145/2518037.2491245 .ISSN 0734-2071 . Abadi, Martín; Agarwal, Ashish; Barham, Paul; Brevdo, Eugene; Chen, Zhifeng; Citro, Craig; Corrado, Greg S.; Davis, Andy; Dean, Jeffrey (March 14, 2016). "TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems".arXiv :1603.04467 [cs.DC ]. ^ "The Friendship That Made Google Huge" .New Yorker . December 10, 2018. RetrievedDecember 10, 2018 .^a b c "Google's Sanjay Ghemawat Co-Winner of Computer Award" .India West . April 9, 2013. RetrievedDecember 16, 2017 .^a b c "Sanjay Ghemawat – ACM Prize in Computing" .Award Winners .Association for Computing Machinery . RetrievedDecember 16, 2017 .^a b "ACM And Infosys Foundation Honor Google Developers For Innovations That Transformed Internet-Scale Computing" .Infosys . RetrievedDecember 16, 2017 .^a b c d Metz, Cade (August 8, 2012)."If Xerox PARC Invented the PC, Google Invented the Internet" .WIRED .Archived from the original on December 7, 2017. RetrievedDecember 16, 2017 . ^ Somers, James (December 3, 2018)."The Friendship That Made Google Huge" .The New Yorker .ISSN 0028-792X . RetrievedAugust 21, 2024 . ^ "Sanjay Ghemawat" .The Mathematics Genealogy Project . Department of Mathematics, North Dakota State University. RetrievedDecember 16, 2017 .^ "Membership" .American Academy of Arts and Sciences . RetrievedDecember 16, 2017 .^ "The Mark Weiser Award" . ACM SIGOPS. RetrievedJuly 5, 2019 .
International National Academics