Shop top categories that ship internationally
Buy new:
$79.00
DeliveryFriday, April 11
Ships from: Amazon
Sold by: Wholesale Books USA
$79.00
FREE International Returns
No Import Charges & $10.65 Shipping to Japan Details

Shipping & Fee Details

Price $79.00
AmazonGlobal Shipping $10.65
Estimated Import Charges $0.00
Total $89.65

DeliveryFriday, April 11. Order within18 hrs 22 mins
Only 1 left in stock - order soon.
$$79.00 () Includes selected options. Includes initial monthly payment and selected options. Details
Price
Subtotal
$$79.00
Subtotal
Initial payment breakdown
Shipping cost, delivery date, and order total (including tax) shown at checkout.
Ships from
Amazon
Amazon
Ships from
Amazon
Returns
30-day refund/replacement
30-day refund/replacement
This item can be returned in its original condition for a full refund or replacement within 30 days of receipt.
Payment
Secure transaction
Your transaction is secure
We work hard to protect your security and privacy. Our payment security system encrypts your information during transmission. We don’t share your credit card details with third-party sellers, and we don’t sell your information to others.Learn more
Support
Product support included
What's Product Support?
In the event your product doesn't work as expected or you need help using it, Amazon offers free product support options such as live phone/chat with an Amazon associate, manufacturer contact information, step-by-step troubleshooting guides, and help videos. By solving product issues, we help the planet by extending the life of products. Availability of support options differ by product and country.Learn more
$24.99
FREE International Returns
Used book in good and clean conditions. Pages and cover are intact. Limited notes marks and highlighting may be present. May show signs of normal shelf wear and bends on edges. Item may be missing CDs or access codes. May include library marks. Ships directly from Amazon. Used book in good and clean conditions. Pages and cover are intact. Limited notes marks and highlighting may be present. May show signs of normal shelf wear and bends on edges. Item may be missing CDs or access codes. May include library marks. Ships directly from Amazon. See less
DeliveryApril 17 - 30
Or fastest deliveryApril 15 - 25
$$79.00 () Includes selected options. Includes initial monthly payment and selected options. Details
Price
Subtotal
$$79.00
Subtotal
Initial payment breakdown
Shipping cost, delivery date, and order total (including tax) shown at checkout.
Access codes and supplements are not guaranteed with used items.
Enhancements you chose aren't available for this seller. Details
To add the following enhancements to your purchase, choose a different seller.
%cardName%
${cardName} not available for the seller you chose
${cardName} unavailable for quantities greater than ${maxQuantity}.
Added to
Unable to add item to List. Please try again.

Sorry, there was a problem.

There was an error retrieving your Wish Lists. Please try again.

Sorry, there was a problem.

List unavailable.
Other sellers on Amazon
Kindle app logo image

Download the free Kindle app and start reading Kindle books instantly on your smartphone, tablet, or computer -no Kindle device required.

Read instantly on your browser with Kindle for Web.

Using your mobile phone camera - scan the code below and download the Kindle app.

QR code to download the Kindle App

Follow the author

Something went wrong. Please try your request again later.

Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython 2nd Edition

byWes McKinney(Author)
Sorry, there was a problem loading this page. Try again.

{"desktop_buybox_group_1":[{"displayPrice":"$79.00","priceAmount":79.00,"currencySymbol":"$","integerValue":"79","decimalSeparator":".","fractionalValue":"00","symbolPosition":"left","hasSpace":false,"showFractionalPartIfEmpty":true,"offerListingId":"otJ5qPdRIrpkIjczSv2UyIXAUABXSr%2BPdU87gL5QhUHstTHcivTSdWkwgQ2a7q7P0Gk2DCK%2BCIiL5GQTg65oCh0SV2IuIXdRv3dY2WgJuxuK5uThI847K0dCljcL7SqOVp1g1zIwoBkHta72Nc%2BdTyN6OIqyi%2FFD5E519JDQkppaT%2BVlvGJFeyE%2BcsE4peMq","locale":"en-US","buyingOptionType":"NEW","aapiBuyingOptionIndex":0}, {"displayPrice":"$24.99","priceAmount":24.99,"currencySymbol":"$","integerValue":"24","decimalSeparator":".","fractionalValue":"99","symbolPosition":"left","hasSpace":false,"showFractionalPartIfEmpty":true,"offerListingId":"otJ5qPdRIrpkIjczSv2UyIXAUABXSr%2BPFbEhB507cfdiD19Ui0GuuozuWVe4nEjYSagFn71Mqhn15EVA1N%2Bz65ZWOxe%2BjHxNH7WDcY18%2F1kq53yHW68EBZKiGIrB1G1X6zV6vt%2FRxCBktWBNQ0y4Qp3JXI3Y0SYhCn%2B1VCtY2i7GMhS5g%2B8SOKG1MkTY1YjK","locale":"en-US","buyingOptionType":"USED","aapiBuyingOptionIndex":1}]}

Purchase options and add-ons

Get complete instructions for manipulating, processing, cleaning, and crunching datasets in Python. Updated for Python 3.6, the second edition of this hands-on guide is packed with practical case studies that show you how to solve a broad set of data analysis problems effectively. You’ll learn the latest versions of pandas, NumPy, IPython, and Jupyter in the process.

Written by Wes McKinney, the creator of the Python pandas project, this book is a practical, modern introduction to data science tools in Python. It’s ideal for analysts new to Python and for Python programmers new to data science and scientific computing. Data files and related material are available on GitHub.

  • Use the IPython shell and Jupyter notebook for exploratory computing
  • Learn basic and advanced features in NumPy (Numerical Python)
  • Get started with data analysis tools in the pandas library
  • Use flexible tools to load, clean, transform, merge, and reshape data
  • Create informative visualizations with matplotlib
  • Apply the pandas groupby facility to slice, dice, and summarize datasets
  • Analyze and manipulate regular and irregular time series data
  • Learn how to solve real-world data analysis problems with thorough, detailed examples
  1. ISBN-10
    1491957662
  2. ISBN-13
    978-1491957660
  3. Edition
    2nd
  4. Publisher
    O'Reilly Media
  5. Publication date
    October 24, 2017
  6. Language
    English
  7. Dimensions
    7 x 1.11 x 9.19 inches
  8. Print length
    550 pages

There is a newer edition of this item:

Frequently bought together

Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython
This item: Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython
$79.00
Get it as soon asFriday, Apr 11
Only 1 left in stock - order soon.
Sold by Wholesale Books USA and ships from Amazon Fulfillment.
+
Data Science from Scratch: First Principles with Python
$36.64
In Stock
Ships from and sold by Amazon.com.
+
Python for Data Analysis: Data Wrangling with pandas, NumPy, and Jupyter
$43.99
In Stock
Ships from and sold by Amazon.com.
Total price: $00
To see our price, add these items to your cart.
Try again!
Details
Added to Cart
spCSRF_Treatment
Some of these items ship sooner than the others.
Choose items to buy together.

From the brand

oreilly
  1. Data Science from Scratch: First Principles with Python
    Python Data Science Handbook: Essential Tools for Working with Data
    Essential Math for Data Science: Take Control of Your Data with Fundamental Linear Algebra, Proba...
    Data Science: The Hard Parts: Techniques for Excelling at Data Science

    Explore more Data Science

    Visit the Store

  2. Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python
    R for Data Science: Import, Tidy, Transform, Visualize, and Model Data
    Software Engineering for Data Scientists: From Notebooks to Scalable Systems
    Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking

    Start learning with O'Reilly

    Visit the Store

  3. Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniqu...
    Fundamentals of Software Architecture: A Modern Engineering Approach
    Fundamentals of Data Engineering: Plan and Build Robust Data Systems
    Prompt Engineering for Generative AI: Future-Proof Inputs for Reliable AI Outputs

    More From O'Reilly

    Visit the Store

  4. Sharing the knowledge of experts

    O'Reilly's mission is to change the world by sharing the knowledge of innovators. For over 40 years, we've inspired companies and individuals to do new things (and do them better) by providing the skills and understanding that are necessary for success.

    Our customers are hungry to build the innovations that propel the world forward. And we help them do just that.


From the Publisher

What Is This Book About?

This book is concerned with the nuts and bolts of manipulating, processing, cleaning, and crunching data in Python. My goal is to offer a guide to the parts of the Python programming language and its data-oriented library ecosystem and tools that will equip you to become an effective data analyst. While 'data analysis' is in the title of the book, the focus is specifically on Python programming, libraries, and tools as opposed to data analysis methodology. This is the Python programming you need for data analysis.

New for the Second Edition

The first edition of this book was published in 2012, during a time when open source data analysis libraries for Python (such as pandas) were very new and developing rapidly. In this updated and expanded second edition, I have overhauled the chapters to account both for incompatible changes and deprecations as well as new features that have occurred in the last five years.

I’ve also added fresh content to introduce tools that either did not exist in 2012 or had not matured enough to make the first cut. Finally, I have tried to avoid writing about new or cutting-edge open source projects that may not have had a chance to mature. I would like readers of this edition to find that the content is still almost as relevant in 2020 or 2021 as it is in 2017.

The major updates in this second edition include:

  • All code, including the Python tutorial, updated for Python 3.6 (the first edition used Python 2.7)
  • Updated Python install instructions for the Anaconda Python Distribution & other Python packages
  • Updates for the latest versions of the pandas library in 2017
  • A new chapter on some more advanced pandas tools, and some other usage tips
  • A brief introduction to using statsmodels and scikit-learn
  • Reorganized since from the first edition to make the book more accessible to newcomers.

Editorial Reviews

About the Author

Wes McKinney is a New York?based software developer and entrepreneur. After finishing his undergraduate degree in mathematics at MIT in 2007, he went on to do quantitative finance work at AQR Capital Management in Greenwich, CT. Frustrated by cumbersome data analysis tools, he learned Python and started building what would later become the pandas project. He's now an active member of the Python data community and is an advocate for the use of Python in data analysis, finance, and statistical computing applications.



Wes was later the co-founder and CEO of DataPad, whose technology assets and team were acquired by Cloudera in 2014. He has since become involved in big data technology, joining the Project Management Committees for the Apache Arrow and Apache Parquet projects in the Apache Software Foundation. In 2016, he joined Two Sigma Investments in New York City, where he continues working to make data analysis faster and easier through open source software.

Product details

  • Publisher ‏ : ‎O'Reilly Media; 2nd edition (October 24, 2017)
  • Language ‏ : ‎English
  • Paperback ‏ : ‎550 pages
  • ISBN-10 ‏ : ‎1491957662
  • ISBN-13 ‏ : ‎978-1491957660
  • Item Weight ‏ : ‎1.91 pounds
  • Dimensions ‏ : ‎7 x 1.11 x 9.19 inches
Brief content visible, double tap to read full content.
Full content visible, double tap to read brief content.

Videos

Help others learn more about this product by uploading a video!
Upload your video

About the author

Follow authors to get new release updates, plus improved recommendations.
Wes McKinney
Brief content visible, double tap to read full content.
Full content visible, double tap to read brief content.

Since 2007, I have been creating fast, easy-to-use data wrangling and statistical computing tools, mostly in the Python programming language. I am best known for creating the pandas project and writing the book Python for Data Analysis. I am also a contributor to the Apache Arrow, Kudu, and Parquet projects within the Apache Software Foundation. I am currently the CTO and Co-founder of Voltron Data, which builds accelerated computing technologies powered by Apache Arrow. I previously worked for Ursa Labs (within RStudio / Posit), Two Sigma, Cloudera, DataPad, and AQR Capital Management.

Customer reviews

4.6 out of 5 stars
1,807 global ratings

Customer Reviews, including Product Star Ratings help customers to learn more about the product and decide whether it is the right product for them.

To calculate the overall star rating and percentage breakdown by star, we don’t use a simple average. Instead, our system considers things like how recent a review is and if the reviewer bought the item on Amazon. It also analyzed reviews to verify trustworthiness.

Learn more how customers reviews work on Amazon

Review this product

Share your thoughts with other customers

Customers say

Customers find this book to be an excellent introduction to Python for data analysis, with comprehensive content and practical examples that work as advertised. They appreciate its solid material coverage and value for money. The writing style receives mixed feedback, with some customers praising it while others find it difficult to follow.

AI-generated from the text of customer reviews

72 customers mention "Introduction"56 positive16 negative

Customers find the book provides a great introduction to Python for data analysis, with one customer noting it serves as excellent preparation for Machine Learning.

"...and because while McKinney is not fun to read, he doespack the book with useful information and it is (mostly) well organized...."Read more

"...This book primarilyfocuses on the pandas Python library, which is awesome at processing and organizing data..."Read more

"This bookcovers all of the basics that you would want to know to get started in programming in Python for data analysis, as the title implies, but..."Read more

"This book gave me my first job. And I am still learning it.It is simple, talks some general idea why functions design like this, and introduces..."Read more

29 customers mention "Value for money"24 positive5 negative

Customers find the book excellent and worth buying, with one mentioning it's worth 60 box.

"...This book has beenwell worth the hours spent in it. For context, I previously relied on Excel, SQL, and some AutoHotKey...."Read more

"Great book, though a bit dry and slow...."Read more

"...There are plenty of code examples. Soworth the purchase. Only negative I wish there were mini projects to learn from."Read more

"Overall,I liked the book. Between the first and second edition, however, the author reorganized the book that made it harder for me to absorb...."Read more

18 customers mention "Use"18 positive0 negative

Customers find the book very useful, with examples that work as advertised and advanced functionality. One customer specifically mentions how it helps apply Python's tools to data science, while another notes its value for engine performance engineers.

"...This book has significantlyimproved how I work. Thanks, Wes and team."Read more

"...some general idea why functions design like this, andintroduces some practical functions...."Read more

"...of this book, and I think it accomplished it's goal of being agood general resource for people beginning their career/learning with Python and data..."Read more

"...Its agreat book to have as a reference and learning data analysis techniques. There are plenty of code examples. So worth the purchase...."Read more

10 customers mention "Content"7 positive3 negative

Customers find the content of the book great, with one customer particularly noting the good chapters on handling time series.

"So far, this book has been aninspiring reading. It contains a huge number of data cleansing, transformation, analysis & etc. code snippets...."Read more

"Great content. Five star content. But, pages started coming off the binding one day after I got this in the mail...."Read more

"Thecontent is too generic, hope it can be more technological."Read more

"Excellent step-by-step instructions.Interesting examples."Read more

7 customers mention "Material quality"7 positive0 negative

Customers find the book solid, with good material coverage, and one customer specifically mentions its high-quality paper.

"...copious use of code snippets to illustrate his points makes thematerial very usable...."Read more

"This product arrived fast. Thebook was in great shape. Couldn't have asked for a better buying experience"Read more

"I think thisbook is solid but it was a bit beyond my level...."Read more

"...details about the Pandas library for Python, the author alsoincludes solid sections about the python language and NumPy...."Read more

16 customers mention "Writing style"11 positive5 negative

Customers have mixed opinions about the writing style of the book, with some finding it great while others note that it reads like a dictionary with artificial datasets.

"Well written by the creator of Pandas. The author's copious use of code snippets to illustrate his points makes the material very usable...."Read more

"...and rewarding in its use of example datasets, itsmore personable writing style, and its outlining of good practices for data science."Read more

"...andwrites like an impatient person who would rather be doing something else...."Read more

"...bit as detailed as I hoped it would be with a great introduction,great examples, and great coverage of fundamental, basic, and advanced..."Read more

Customer image
5.0 out of 5 stars

Images in this review
Poor quality binding but great content.
3 out of 5 stars
Poor quality binding but great content.
Great content. Five star content. But, pages started coming off the binding one day after I got this in the mail. Loved the first edition and bought this just for the updates, so I can definitely recommend the content. However I can’t recommend the book itself.
Thank you for your feedback
Sorry, there was an error
Sorry we couldn't load the review

Top reviews from the United States

There was a problem filtering reviews. Please reload the page.

  • Reviewed in the United States on November 24, 2021
    I got this book when I was transitioning to doing data science with Python and was struggling to become familiar with standard tools. It's written by the creator of Pandas, and follows the style of the Pandas documentation: dense, telegraphic, peppered with examples.

    It's hard work because Wes McKinney often does not articulate why you would need to do something (assuming you are already knowledgeable on the underlying process), and writes like an impatient person who would rather be doing something else. Additionally examples often suffer from being both too long and too short - too long in that almost every example is on a toy dataset created from scratch, too short in that most of those datasets have only 5 or 10 elements and do not always showcase complex operations. Other examples (particularly involving time series) have an overabundance of data that make the critical results hard to spot. Frankly, my first month with Pandas was a miserable one.

    But I give the book 5 stars both because I came to love Pandas as I got more familiar with it, and because while McKinney is not fun to read, he does pack the book with useful information and it is (mostly) well organized. If anything it would benefit from being longer and with a more patient treatment of larger and more concrete datasets (eg the Titanic passenger dataset used in the Pandas documentation). The initial chapter on the basics of using Python could go - if you need this book, then you don't want to be trying to learn the rudiments of Python from it. If you can accept that you'll need a lot of bookmarks or margin notes to get through a rather steep learning curve, it will reward your persistence.
    11 people found this helpful
    Report
  • Reviewed in the United States on April 5, 2019
    This book has been my foundation of using python as a data analyst.

    This book primarily focuses on the pandas Python library, which is awesome at processing and organizing data (Python pandas is like MS Excel times 100. This is not an exaggeration). It also introduces the reader into numpy (lower level number crunching and arrays), matplotlib (data visualizations), scikitlearn (machine learning), and other useful data science libraries. The book contains other book recommendations for continuing education.

    Although this would be a challenging book for a brand new Python user, I would still recommend it, especially if you are currently doing a lot of work in MS Excel and/ or exporting data from databases. I had a few false starts learning Python, and my biggest stumbling block was lack of application in what I was learning. This book puts practical tools in the reader's hands very quickly. I personally don't have time to make goofy games etc. that other books have used as practice examples. Despite other reviews criticizing the use of random data throughout the book, I found the examples easy to follow and useful. I would also argue that learning how to generate random data is useful in itself (thus the purpose of the numpy random library), and that there are practical examples throughout the book. Chapter 14 devoted to real-world data analysis examples.

    I am almost finished with my second time through the book, this time working through every example. This book has been well worth the hours spent in it. For context, I previously relied on Excel, SQL, and some AutoHotKey. This book has significantly improved how I work.

    Thanks, Wes and team.
    36 people found this helpful
    Report
  • Reviewed in the United States on January 26, 2019
    This book covers all of the basics that you would want to know to get started in programming in Python for data analysis, as the title implies, but it doesn't really offer compelling real-world examples. The data seem to be made up and the analyses don't go into enough detail to help you really learn how pandas and numpy work. Overall this is a decent starter book but you will have to bookmark the python and pandas documentation online if you want to have a reference to all of the functionality those tools have, and there are many places online where you can get better examples to learn from. If you haven't made your mind up about which tool to use for data analysis, I highly recommend checking out dplyr in R, which has an excellent free book online (R for data science, hadley wickham). I find it very easy to learn and it is much easier to set up R and RStudio than it is to set up Python, even though I love Python and Pandas.
    13 people found this helpful
    Report
  • Reviewed in the United States on December 6, 2017
    This book gave me my first job. And I am still learning it. It is simple, talks some general idea why functions design like this, and introduces some practical functions. Because in real life real job you always need to look up documentation or to google certain functions, I think the idea why Wes makes functions/variables like this, and what he wants to develop in the future is very important. anyway, I think this book is for data analysis beginner and some intermediate users. I learned Python first so I recommend beginners who want to use Python for Data Analyst/Scientist to learn Python Programming first/simultaneously. At least understand lambda and python expressions, otherwise, you can't feel the full magic.
    30 people found this helpful
    Report
  • Reviewed in the United States on February 14, 2022
    Well written by the creator of Pandas. The author's copious use of code snippets to illustrate his points makes the material very usable. The snippets are short enough to type by hand so you get the frequent opportunity to play with the code and really understand the tools being presented. And Pandas is awesome!
    One person found this helpful
    Report
  • Reviewed in the United States on July 21, 2019
    So far, this book has been an inspiring reading. It contains a huge number of data cleansing, transformation, analysis & etc. code snippets. The code is very clean and - for the most part - self-explaining (at least, for a seasoned software developer). The book step by step displays the motivations behind the design and functionality of center-piece Python modules - and you would not expect anything less from the original designer of Pandas. I feel this wonderful book being a natural extension of ageless Practical CS classics by Niklaus Wirth, Kernighan-Ritchie, and B. Stroustrup for Data Science Age.
  • Reviewed in the United States on August 16, 2024
    This product arrived fast. The book was in great shape. Couldn't have asked for a better buying experience

Top reviews from other countries

  • abhishek patil
    5.0 out of 5 starsGood packing and perfect book for fresher data analytics profesional
    Reviewed in India on April 8, 2021
    Good packing from the party .I just loved the overall structure of the book and its content.I will say must for all fresher wishing to get into data analytics and data science!!
  • Jovial GBA-GOMBO
    5.0 out of 5 starsRapide et sûr
    Reviewed in France on May 20, 2022
    Acquisition pour un perfectionnement en tant que Data Analyst
  • Tamer
    5.0 out of 5 starsBest pandas reference book
    Reviewed in the United Arab Emirates on December 18, 2019
    This is the best reference I use for dealing with python, numpy and mainly pandas. Must have for anyone learning or using pandas. The author (who actually wrote pandas)style is into the point, clear and with simple examples that demonstrate the usage in real world.
    Also this book has all the info to help you prepare data for sci-kit learn and tf .
  • Conrad T.
    4.0 out of 5 starsOh lovely dictionaries and tuple
    Reviewed in Australia on December 27, 2018
    Great book for anyone needing common tools used in Python and indeed in data science. My favourite chapters were 2, 3 and 4. Basically, if you are a noob like me, you should start from the chapter one and read through till chapter 4. List, dictionary, tuple comprehensions are quiet powerful as well as array with panadas and scikitlearn.

    If you are looking for more in depth graphical representation of plots using pandas and skitlearn then maybe look at another book as this one is more of a back ground tools kind-a-thing.
    Customer image
    Conrad T.
    4.0 out of 5 stars
    Oh lovely dictionaries and tuple

    Reviewed in Australia on December 27, 2018
    Great book for anyone needing common tools used in Python and indeed in data science. My favourite chapters were 2, 3 and 4. Basically, if you are a noob like me, you should start from the chapter one and read through till chapter 4. List, dictionary, tuple comprehensions are quiet powerful as well as array with panadas and scikitlearn.

    If you are looking for more in depth graphical representation of plots using pandas and skitlearn then maybe look at another book as this one is more of a back ground tools kind-a-thing.
    Images in this review
    Customer image
  • Pedro Dias
    5.0 out of 5 starsMust have
    Reviewed in Spain on January 8, 2021
    You must have this book if you want to learn Pandas and Data Science.