a powerful python library for data analysis by badri prudhvi badri prudhvi

9
A Powerful Python Library for Data Analysis BY BADRI PRUDHVI.

Upload: wilfrid-armstrong

Post on 04-Jan-2016

227 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: A Powerful Python Library for Data Analysis BY BADRI PRUDHVI BADRI PRUDHVI

A Powerful Python Library for Data Analysis

BYBADRI PRUDHVI.

Page 2: A Powerful Python Library for Data Analysis BY BADRI PRUDHVI BADRI PRUDHVI

Contents: Introduction Key Features Core Operations Sample Dataset Analysis in

Pandas

Page 3: A Powerful Python Library for Data Analysis BY BADRI PRUDHVI BADRI PRUDHVI

Introduction

Why Learn Pandas? If you like Python…It’s a better Python. It’s a smoother path than raw numpy Very easy to do Data Analysis

Why do you need pandas? When working with tabular or structured data (like R data frame, SQL

table, Excel spreadsheet, ...): Import data Clean up messy data Explore data, gain insight into data Process and prepare your data for analysis Analyze your data (together with statsmodels, ...)

Page 4: A Powerful Python Library for Data Analysis BY BADRI PRUDHVI BADRI PRUDHVI

Introduction

http://pandas.pydata.org Software library written for the Python programming language. Mainly used for data manipulation and analysis. Core Data Structures:

Series Data Frames.

Offers data structures and operations for manipulating : Numerical tables Time series.

Page 5: A Powerful Python Library for Data Analysis BY BADRI PRUDHVI BADRI PRUDHVI

Key features

Data Frame object for data manipulation with integrated indexing Fast, easy and flexible input/output for a lot of different data formats Merging and joining (concat, join) Powerful time series manipulation (resampling, timezones, ..) Easy plotting Data alignment integrated handling of missing data Reshaping and pivoting of data sets Label-based slicing, fancy indexing, and sub setting of large data sets

Page 6: A Powerful Python Library for Data Analysis BY BADRI PRUDHVI BADRI PRUDHVI

Key features (Contd..) Data structure column insertion and deletion Grouping : groupby functionality Hierarchical axis indexing to work with high-dimensional data

in a lower-dimensional data structure Time series-functionality :

Date range generation, Frequency conversion, Moving window statistics, Moving window linear regressions, Date shifting and lagging

Page 7: A Powerful Python Library for Data Analysis BY BADRI PRUDHVI BADRI PRUDHVI

Core Operations

Create

Select

Insert

Map

Join

Sort

Clean

Bin

View

Update

Filter

Append

Group

Summarize

Conform

Rotate

Page 8: A Powerful Python Library for Data Analysis BY BADRI PRUDHVI BADRI PRUDHVI

Let’s see some Cool stuff !!!

Analyzing Sample Data using Pandas in Python

Page 9: A Powerful Python Library for Data Analysis BY BADRI PRUDHVI BADRI PRUDHVI

Thanks for listening….

Any Questions?