data manipulation - eth z · data manipulation the process of data transformation, formatting &...

Post on 18-Mar-2020

10 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Data ManipulationEvangelos Pournaras, Izabela Moise

Evangelos Pournaras, Izabela Moise 1

Data Manipulation

The process of data transformation, formatting & structuring.

Examplesupdating, adding/removing, sorting, selection, merging, shifting,aggregation, etc.

TipIn Data Science, data come with collection & science starts withmanipulation!

Evangelos Pournaras, Izabela Moise 2

A "Dirty Job"

Evangelos Pournaras, Izabela Moise 3

Do you really need it?

• Big Data and Internet of Things result in large amount ofunstructured data.

• New data collection opportunities require advanced datamanipulation techniques.

• Involvement with data manipulation becomes more likely &required nowadays.

Evangelos Pournaras, Izabela Moise 4

Data Format & Manipulation

Select a data manipulation approach based on how data are storedand managed:

1. Text files

2. Databases

3. Big Data

Evangelos Pournaras, Izabela Moise 5

How to manipulate data

Most programming languages and several software tools canmanipulate data:

• Java, Python, C, C++, etc.

• Matlab, R, Excel, etc.

Criteria for selection:

• Ease of use

• Library support

• Portability

• Performance

• Data format

Evangelos Pournaras, Izabela Moise 6

What is next?

• Data manipulation with AWK

Evangelos Pournaras, Izabela Moise 7

top related