yuhexiong/oracle-data-pipeline-spark-python
(A Traditional Chinese version of this document is also provided: README-CH.md.)
A data pipeline written with Spark that transfers data from Oracle to Doris.
- Language: Python
- Data Processing Framework: Spark v3.5.1
Edit the filename in docker-compose.yaml, then run:
docker compose up -d
Replace newline characters with an empty string.
Fill NULL values in the RATING column with 0.
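The two transformations above can be illustrated at the level of a single row. This is a minimal plain-Python sketch, not the repository's code: the function name `clean_row` and the sample row are hypothetical, and treating `\r` as a newline character is an assumption. In PySpark the same effect is usually achieved with `pyspark.sql.functions.regexp_replace` on string columns and `DataFrame.fillna(0, subset=["RATING"])`.

```python
from typing import Any, Dict


def clean_row(row: Dict[str, Any]) -> Dict[str, Any]:
    """Return a cleaned copy of one row (hypothetical helper for illustration)."""
    cleaned: Dict[str, Any] = {}
    for column, value in row.items():
        if isinstance(value, str):
            # Replace newline characters with an empty string.
            # Assumption: carriage returns are stripped as well.
            value = value.replace("\r", "").replace("\n", "")
        cleaned[column] = value

    # Fill a NULL (None) RATING with 0; rows without a RATING key are left alone.
    if "RATING" in cleaned and cleaned["RATING"] is None:
        cleaned["RATING"] = 0
    return cleaned


if __name__ == "__main__":
    # Made-up sample row, used only to exercise the function.
    sample = {"BOOKID": 1, "TITLE": "Du\nne", "RATING": None}
    print(clean_row(sample))
```

The function returns a new dictionary instead of mutating its input, which mirrors how Spark transformations produce a new DataFrame.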
For the code, refer to:
(1) oracle_to_doris.py
(2) Schema defined in YAML: oracle_to_doris_yaml.py and oracle_to_doris.yaml
- Oracle Table
| BOOKID | TITLE               | AUTHOR              | PUBLICATIONYEAR | GENRE           | RATING | STATUS      |
|--------|---------------------|---------------------|-----------------|-----------------|--------|-------------|
| 1      | Dune                | Frank Herbert       | 1965            | SCIENCE FICTION | 4.5    | AVAILABLE   |
| 2      | 1984                | George Orwell       | 1949            | DYSTOPIAN       | 4.7    | CHECKED OUT |
| 3      | Pride and Prejudice | Jane Austen         | 1813            | ROMANCE         | 4.6    | AVAILABLE   |
| 4      | The Great Gatsby    | F. Scott Fitzgerald | 1925            | CLASSIC         | 4.4    | AVAILABLE   |
| 5      | The Hobbit          | J.R.R. Tolkien      | 1937            | FANTASY         | 4.8    | CHECKED OUT |
- Doris Table
| book_id | title               | author              | publication_year | genre           | rating | status      |
|---------|---------------------|---------------------|------------------|-----------------|--------|-------------|
| 1       | Dune                | Frank Herbert       | 1965             | SCIENCE FICTION | 4.5    | AVAILABLE   |
| 2       | 1984                | George Orwell       | 1949             | DYSTOPIAN       | 4.7    | CHECKED OUT |
| 3       | Pride and Prejudice | Jane Austen         | 1813             | ROMANCE         | 4.6    | AVAILABLE   |
| 4       | The Great Gatsby    | F. Scott Fitzgerald | 1925             | CLASSIC         | 4.4    | AVAILABLE   |
| 5       | The Hobbit          | J.R.R. Tolkien      | 1937             | FANTASY         | 4.8    | CHECKED OUT |
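
Comparing the two tables, the Oracle column names (upper case, no separators) map to snake_case names in Doris. Names such as PUBLICATIONYEAR cannot be split automatically, so an explicit mapping is needed. The sketch below is illustrative only: `COLUMN_MAPPING` and `rename_columns` are hypothetical names, the mapping is inferred from the tables above, and how oracle_to_doris.yaml actually expresses the schema is not shown here. In PySpark, `DataFrame.withColumnRenamed` performs the equivalent rename one column at a time.

```python
from typing import Any, Dict

# Oracle column name -> Doris column name, inferred from the two tables above.
COLUMN_MAPPING: Dict[str, str] = {
    "BOOKID": "book_id",
    "TITLE": "title",
    "AUTHOR": "author",
    "PUBLICATIONYEAR": "publication_year",
    "GENRE": "genre",
    "RATING": "rating",
    "STATUS": "status",
}


def rename_columns(row: Dict[str, Any]) -> Dict[str, Any]:
    """Rename the keys of one row; raises KeyError for an unmapped column."""
    return {COLUMN_MAPPING[name]: value for name, value in row.items()}


if __name__ == "__main__":
    # First row of the Oracle table above.
    oracle_row = {
        "BOOKID": 1,
        "TITLE": "Dune",
        "AUTHOR": "Frank Herbert",
        "PUBLICATIONYEAR": 1965,
        "GENRE": "SCIENCE FICTION",
        "RATING": 4.5,
        "STATUS": "AVAILABLE",
    }
    print(rename_columns(oracle_row))
```

Failing loudly on an unmapped column is a deliberate choice in this sketch, so a schema change in Oracle is noticed rather than silently dropped.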