Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

This repository contains my solutions to various SQL problems from LeetCode, implemented using PySpark DataFrame API and Spark SQL. The goal is to provide alternative solutions and insights for SQL enthusiasts who want to explore the power of PySpark and Spark SQL.

NotificationsYou must be signed in to change notification settings

sangwanamit621/sql-solutions-in-pyspark-dataframe-api-and-spark-sql

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Welcome to my repository featuring PySpark DataFrame API and Spark SQL solutions for LeetCode SQL questions! If you're passionate about solving SQL problems and working with PySpark, you're in the right place.

Introduction

This repository contains my solutions to various SQL problems from LeetCode, implemented using PySpark DataFrame API and Spark SQL. The goal is to provide alternative solutions and insights for SQL enthusiasts who want to explore the power of PySpark and Spark SQL.

Why PySpark and Spark SQL?

  • Scalability: Leverage the distributed computing capabilities of Apache Spark to handle large datasets.
  • Flexibility: PySpark allows you to seamlessly integrate SQL operations with Python, providing a powerful combination for data manipulation.
  • Performance: Spark SQL optimization and caching mechanisms contribute to improved query performance.

Databricks Community Edition

All solutions in this repository were developed and tested using Databricks Community Edition. If you're not already using Databricks, you cansign up for a free account to practice and run these PySpark and Spark SQL solutions in a collaborative environment.

How to Use

To run locally:

  1. Clone the repository:
    git clone https://github.com/your-username/sql-leetcode-pyspark.git```
  2. Install dependencies:
    pip3 install pyspark```
  3. Open the Jupyter notebooks and run the codes to explore the solutions.

Feel free to explore more in the solution-notebooks directory!

Contributing

Found a bug? Have a suggestion? Contributions are welcome! Fork the repository, make your changes, and submit a pull request.

About

This repository contains my solutions to various SQL problems from LeetCode, implemented using PySpark DataFrame API and Spark SQL. The goal is to provide alternative solutions and insights for SQL enthusiasts who want to explore the power of PySpark and Spark SQL.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

[8]ページ先頭

©2009-2025 Movatter.jp