| Apache Sqoop | |
|---|---|
| Developer | Apache Software Foundation |
| Initial release | 1 June 2009; 16 years ago (2009-06-01) |
| Final release | 1.4.7 / December 6, 2017; 8 years ago (2017-12-06) |
| Written in | Java |
| Operating system | Cross-platform |
| Type | Data management |
| License | Apache License 2.0 |
| Website | sqoop |
| Repository | Sqoop Repository |
Sqoop is acommand-line interface application for transferring data betweenrelational databases andHadoop.[1]
The Apache Sqoop project was retired in June 2021 and moved to the Apache Attic.[2]
Sqoop supports incremental loads of a single table or a free formSQL query as well as saved jobs which can be run multiple times to import updates made to a database since the last import. Imports can also be used to populate tables inHive orHBase.[3] Exports can be used to put data from Hadoop into a relational database. Sqoop got the name from "SQL-to-Hadoop".[4]Sqoop became a top-levelApache project in March 2012.[5]
Informatica provides a Sqoop-basedconnector from version 10.1.Pentaho providesopen-source Sqoop based connector steps,Sqoop Import[6] andSqoop Export,[7] in theirETL suitePentaho Data Integration since version 4.5 of the software.[8]Microsoft uses a Sqoop-based connector to help transfer data fromMicrosoft SQL Server databases to Hadoop.[9]Couchbase, Inc. also provides aCouchbase Server-Hadoop connector by means of Sqoop.[10]
The Sqoop Import job allows you to import data from a relational database into the Hadoop Distributed File System (HDFS) using Apache Sqoop.
The Sqoop Export job allows you to export data from Hadoop into an RDBMS using Apache Sqoop.
Pentaho's Business Analytics 4.5 is now certified on Cloudera's latest releases, Cloudera Enterprise 4.0 and CDH4. Pentaho also announced that its visual design studio capabilities have been extended to the Sqoop and Oozie components of Hadoop.