sqoop-import
Here are 19 public repositories matching this topic...
A prototype big data platform; the source code for the book Big Data Platform Architecture and Prototype
- Updated
Aug 12, 2020 - Java
Cloudera_Material: Study material to help people preparing for the Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.
- Updated
Apr 21, 2020
Our first project using Apache Spark. We downloaded loan, customer credit card, and transaction datasets from Kaggle and cleaned the data. Then, using new tools and technologies…
- Updated
Oct 14, 2021 - Jupyter Notebook
Big Data Engineering Capstone Project. Tech stack: Linux, MySQL, Sqoop, HDFS, Hive, Impala, SparkSQL, SparkML, Git
- Updated
Aug 25, 2022 - Python
Apache Sqoop tutorial
- Updated
Jun 7, 2019
MapReduce job development, RDD programming, medical data management, sales analysis, and efficient data integration for big data analysis. Spark: big data processing, Sqoop integration, and Spark Structured Streaming for real-time data.
- Updated
Jun 7, 2023 - Java
- Updated
Sep 5, 2019 - Shell
Built a data pipeline by creating tables in a MySQL database, ingesting them into Hadoop for data warehousing, and building HiveQL views. The Hive views in a Linux VM were connected to Power BI on Windows to create visualizations.
- Updated
Dec 5, 2023
A Bash utility that imports data from traditional databases into HDFS using Sqoop
- Updated
Jun 11, 2019 - Shell
- Updated
Mar 13, 2020
Real-Time & Batch Data Processing Pipeline
- Updated
Jan 12, 2020 - Python
Import data into Hive using Sqoop.
- Updated
Oct 26, 2021
[Innopolis University] Big Data Course 2023. Final Project
- Updated
May 11, 2023 - HiveQL
This repository consists of the source code and the screenshots of the output. This project uses Hive, SQL, and Sqoop to perform analysis.
- Updated
Dec 5, 2021
Heart data analysis
- Updated
Feb 20, 2024 - Python
ETL pipeline for Spar Nord Bank to analyze the refill frequency of ATMs across Europe
- Updated
May 7, 2024 - Jupyter Notebook
A Python package that lets you import data from an RDBMS into HDFS/Hive/HBase using Sqoop
- Updated
Apr 18, 2020 - Python
Build a data pipeline (using Hadoop HDFS, Sqoop, and HiveQL) for data analysis from ambiguous and incomplete instructions.
- Updated
Mar 16, 2023
A query system for a hypothetical bank scenario
- Updated
Feb 9, 2022 - HiveQL