Scala to Python - sparkSql folder #2


Merged
jleetutorial merged 6 commits into master from pedromb-scala_to_python
Sep 30, 2017


pedromb
Collaborator

  • Small change to the regex in commons/Utils.py

  • Converted the Scala files in the sparkSql folder to Python

  • Ignored sparkSql/TypedDataset.scala

  • Changed sparkSql/RddDatasetConversion to sparkSql/RddDataframeConversion

@@ -1,5 +1,5 @@
import re

class Utils():

    COMMA_DELIMITER = re.compile(''',(?=(?:[^'"]|'[^']*'|"[^"]*")*$)''')

jleetutorial
Owner
Sep 30, 2017 (edited)

Why do we need to make this small change?

pedromb
Collaborator, Author

I changed the original Scala regex so that it would run in Python, because the two languages parse regexes differently. Among those changes I mistakenly made the new regex also skip commas inside single quotes, not only commas inside double quotes. That caused problems when processing the Stack Overflow survey, so I removed it.
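
To illustrate the kind of problem described in this reply, here is a minimal sketch. The first pattern is the one shown in the diff hunk. The second pattern, which only treats double quotes as quoting, and the sample line are assumptions made up for illustration; they are not taken from the merged code or from the survey data.

```python
import re

# Pattern as shown in the diff hunk: '...' and "..." both count as quoted spans.
SINGLE_AND_DOUBLE = re.compile(''',(?=(?:[^'"]|'[^']*'|"[^"]*")*$)''')

# Hypothetical alternative (an assumption, not confirmed by the PR):
# only "..." counts as a quoted span, so apostrophes are ordinary characters.
DOUBLE_ONLY = re.compile(r',(?=(?:[^"]*"[^"]*")*[^"]*$)')

# Made-up CSV line with an apostrophe in a free-text field.
line = "Bob,don't know,yes"

# The lone apostrophe looks like an unclosed single quote, so the first
# comma is not treated as a delimiter and two fields are merged.
print(SINGLE_AND_DOUBLE.split(line))  # ["Bob,don't know", 'yes']

# With double quotes only, the apostrophe is harmless.
print(DOUBLE_ONLY.split(line))        # ['Bob', "don't know", 'yes']

# Both patterns still keep a comma inside double quotes intact.
quoted = 'a,"b,c",d'
print(SINGLE_AND_DOUBLE.split(quoted))  # ['a', '"b,c"', 'd']
print(DOUBLE_ONLY.split(quoted))        # ['a', '"b,c"', 'd']
```

Free-text survey answers often contain apostrophes, which is one plausible way a single-quote-aware delimiter could misalign columns.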

jleetutorial merged commit 9fe453e into master on Sep 30, 2017
evross pushed a commit to evross/python-spark-tutorial that referenced this pull request on Dec 3, 2018

  • Add env/ and .vscode/ to .gitignore
  • Add IAM requirements.txt
  • Python Service account keys (jleetutorial#5): Add IAM service account keys snippets
  • Python Quickstart (jleetutorial#3): Add IAM quickstart
  • Python Service accounts (jleetutorial#4): Add IAM service account snippets
  • Add query grantable roles and tests (jleetutorial#2)
  • Add whitespace and correct string formatting
  • Add extra CR
  • Lint
Reviewers

jleetutorial approved these changes


2 participants
@pedromb, @jleetutorial
