data manipulation - eth z · data manipulation the process of data transformation, formatting &...

7
Data Manipulation Evangelos Pournaras, Izabela Moise Evangelos Pournaras, Izabela Moise 1

Upload: others

Post on 18-Mar-2020

10 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Data Manipulation - ETH Z · Data Manipulation The process of data transformation, formatting & structuring. Examples updating, adding/removing, sorting, selection, merging, shifting,

Data ManipulationEvangelos Pournaras, Izabela Moise

Evangelos Pournaras, Izabela Moise 1

Page 2: Data Manipulation - ETH Z · Data Manipulation The process of data transformation, formatting & structuring. Examples updating, adding/removing, sorting, selection, merging, shifting,

Data Manipulation

The process of data transformation, formatting & structuring.

Examplesupdating, adding/removing, sorting, selection, merging, shifting,aggregation, etc.

TipIn Data Science, data come with collection & science starts withmanipulation!

Evangelos Pournaras, Izabela Moise 2

Page 3: Data Manipulation - ETH Z · Data Manipulation The process of data transformation, formatting & structuring. Examples updating, adding/removing, sorting, selection, merging, shifting,

A "Dirty Job"

Evangelos Pournaras, Izabela Moise 3

Page 4: Data Manipulation - ETH Z · Data Manipulation The process of data transformation, formatting & structuring. Examples updating, adding/removing, sorting, selection, merging, shifting,

Do you really need it?

• Big Data and Internet of Things result in large amount ofunstructured data.

• New data collection opportunities require advanced datamanipulation techniques.

• Involvement with data manipulation becomes more likely &required nowadays.

Evangelos Pournaras, Izabela Moise 4

Page 5: Data Manipulation - ETH Z · Data Manipulation The process of data transformation, formatting & structuring. Examples updating, adding/removing, sorting, selection, merging, shifting,

Data Format & Manipulation

Select a data manipulation approach based on how data are storedand managed:

1. Text files

2. Databases

3. Big Data

Evangelos Pournaras, Izabela Moise 5

Page 6: Data Manipulation - ETH Z · Data Manipulation The process of data transformation, formatting & structuring. Examples updating, adding/removing, sorting, selection, merging, shifting,

How to manipulate data

Most programming languages and several software tools canmanipulate data:

• Java, Python, C, C++, etc.

• Matlab, R, Excel, etc.

Criteria for selection:

• Ease of use

• Library support

• Portability

• Performance

• Data format

Evangelos Pournaras, Izabela Moise 6

Page 7: Data Manipulation - ETH Z · Data Manipulation The process of data transformation, formatting & structuring. Examples updating, adding/removing, sorting, selection, merging, shifting,

What is next?

• Data manipulation with AWK

Evangelos Pournaras, Izabela Moise 7