milaan9/10_Python_Pandas_ModulePublic

NotificationsYou must be signed in to change notification settings
Fork227
Star245

Pandas is a high-level data manipulation tool developed by Wes McKinney. It is built on the Numpy package and its key data structure is called the DataFrame. DataFrames allow you to store and manipulate tabular data in rows of observations and columns of variables.

License

MIT license

245 stars 227 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
001_Python_Pandas_Methods		001_Python_Pandas_Methods
img		img
001_Python_Pandas_DataFrame.ipynb		001_Python_Pandas_DataFrame.ipynb
002_Python_Pandas_Exercise_1.ipynb		002_Python_Pandas_Exercise_1.ipynb
003_Python_Pandas_Exercise_2.ipynb		003_Python_Pandas_Exercise_2.ipynb
LICENSE		LICENSE
Pandas Cheat Sheet Data Wrangling in Python.pdf		Pandas Cheat Sheet Data Wrangling in Python.pdf
Pandas Cheat Sheet for Data Science in Python.pdf		Pandas Cheat Sheet for Data Science in Python.pdf
README.md		README.md
automobile_data.csv		automobile_data.csv
pokemon_data.csv		pokemon_data.csv
stockprice_data.csv		stockprice_data.csv

Repository files navigation

10_Python_Pandas_Module

Introduction 👋

What is Pandas in Python?

is the most famous python library providing fast, flexible, and expressive data structures designed to make working with "relational" or "labeled" data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical,real world data analysis in Python. Additionally, it has the broader goal of becomingthe most powerful and flexible open source data analysis / manipulation tool available in any language. It is already well on its way towards this goal.

In Pandas, the data is usually utilized to support the statistical analysis inSciPy, plotting functions fromMatplotlib, and machine learning algorithms inScikit-learn.

Main Features

Here are just a few of the things that pandas does well:

Easy handling ofmissing data (represented asNaN) in floating point as well as non-floating point data
Size mutability: columns can beinserted and deleted from DataFrame and higher dimensional objects
Automatic and explicitdata alignment: objects can be explicitly aligned to a set of labels, or the user can simplyignore the labels and letSeries,DataFrame, etc. automatically align the data for you in computations
Powerful, flexiblegroup by functionality to perform split-apply-combine operations on data sets, for both aggregatingand transforming data
Make iteasy to convert ragged, differently-indexed data in other Python and NumPy data structuresinto DataFrame objects
Intelligent label-basedslicing,fancy indexing, andsubsetting oflarge data sets
Intuitivemerging andjoining datasets
Flexiblereshaping andpivoting of datasets
Hierarchical labeling of axes (possible to have multiple labels per tick)
Robust IO tools for loading data fromflat files (CSV and delimited),Excel files,databases,and saving/loading data from the ultrafastHDF5 format
Time series-specific functionality: date range generation and frequency conversion, moving window statistics,moving window linear regressions, date shifting and lagging, etc.

Core Components of Pandas Data Structure

Pandas have two core data structure components, and all operations are based on those two objects. Organizing data in a particular way is known as a data structure. Here are the two pandas data structures:

Series
DataFrame

Table of contents 📋

No.	Name
01	Python_Pandas_DataFrame
	1.1001_Python_Pandas_DataFrame_from_Dictionary
	1.2Python_Pandas_DataFrame_from_List
	1.3Python_Pandas_DataFrame_head()_and_tail()
	1.4004_Python_Pandas_DataFrame_drop_columns
	1.5Python_Pandas_DataFrame_drop_duplicates
	1.6Python_Pandas_DataFrame_drop_columns_with_NA
	1.7Python_Pandas_DataFrame_rename_columns
	1.8Python_Pandas_DataFrame_to_Python_dictionary
	1.9Python_Pandas_DataFrame_set_index
	1.10Python_Pandas_DataFrame_reset_index
02	Python_Pandas_Exercise_1
03	Python_Pandas_Exercise_2
	automobile_data.csv
	pokemon_data.csv
04	Pandas Cheat Sheet Data Wrangling in Python.pdf
05	Pandas Cheat Sheet for Data Science in Python.pdf

These are onlineread-only versions. However you canRun ▶ all the codesonline by clicking here ➞

Install Pandas Module:

Open your Prompt and type and run the following command (individually):

```
  pip install pandas
```

Once Installed now we can import it inside our python code.

Frequently asked questions ❔

How can I thank you for writing and sharing this tutorial? 🌷

You can and Starring and Forking is free for you, but it tells me and other people that it was helpful and you like this tutorial.

Gohere if you aren't here already and click ➞✰ Star andⵖ Fork button in the top right corner. You'll be asked to create a GitHub account if you don't already have one.

How can I read this tutorial without an Internet connection?

Gohere and click the big green ➞Code button in the top right of the page, then click ➞Download ZIP.
Extract the ZIP and open it. Unfortunately I don't have any more specific instructions because how exactly this is done depends on which operating system you run.
Launch ipython notebook from the folder which contains the notebooks. Open each one of them
Kernel > Restart & Clear Output

This will clear all the outputs and now you can understand each statement and learn interactively.

If you have git and you know how to use it, you can also clone the repository instead of downloading a zip and extracting it. An advantage with doing it this way is that you don't need to download the whole tutorial again to get the latest version of it, all you need to do is to pull with git and run ipython notebook again.

Authors ✍️

I'm Dr. Milaan Parmar and I have written this tutorial. If you think you can add/correct/edit and enhance this tutorial you are most welcome🙏

Seegithub's contributors page for details.

If you have trouble with this tutorial please tell me about it byCreate an issue on GitHub. and I'll make this tutorial better. This is probably the best choice if you had trouble following the tutorial, and something in it should be explained better. You will be asked to create a GitHub account if you don't already have one.

If you like this tutorial, pleasegive it a ⭐ star.

Licence 📜

You may use this tutorial freely at your own risk. SeeLICENSE.

About

Releases

No releases published

Packages

No packages published

Languages

Jupyter Notebook100.0%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

10_Python_Pandas_Module

Introduction 👋

What is Pandas in Python?

Main Features

Core Components of Pandas Data Structure

Table of contents 📋

Install Pandas Module:

Frequently asked questions ❔

How can I thank you for writing and sharing this tutorial? 🌷

How can I read this tutorial without an Internet connection?

Authors ✍️

Licence 📜

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Languages

Movatterモバイル変換

License

milaan9/10_Python_Pandas_Module

Folders and files

Latest commit

History

Repository files navigation

10_Python_Pandas_Module

Introduction 👋

What is Pandas in Python?

Main Features

Core Components of Pandas Data Structure

Table of contents 📋

Install Pandas Module:

Frequently asked questions ❔

How can I thank you for writing and sharing this tutorial? 🌷

How can I read this tutorial without an Internet connection?

Authors ✍️

Licence 📜

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Languages

Packages