data science and visualization - ihub

Post on 11-Feb-2022


Data Science and Visualization

@iHubResearch

Data in Africa

Common challenges – Accessibility/Availability and Quality

How can we discover data and surface information?

– How do Facebook and Amazon suggest friends you may know and recommend items to purchase?

– How can we solve the never-ending traffic problems in the city?

– How does NYC determine the frequency of subway trains all day and all weekend?

Photo courtesy of #kenya365

What is Data Science?

The process of using data to surface information and tell stories.

Data Science includes collecting data, cleaning and managing the data, making it tell its story, and presenting that story to others.

An ideal Data Scientist is:
– 1/3 part Mathematician
– 1/3 part Computer Scientist
– 1/3 part Artist

How can we innovate in each of these processes?

Data Collection

•  Survey management processes:
– Mobile and web data collection tools
– Open Data Portals
– Crowdsourcing tools

KODI

Data Storage

•  Data Storage/Warehousing:
–  MySQL; NoSQL; SPARQL; Linked Data; Azure; Amazon Cloud Services; Dropbox
–  What formats is the data stored in?
–  Data cleaning processes: do we need tools such as Google Refine?
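The data cleaning step above can be sketched in a few lines of Python. This is only an illustration, not a tool from the slides: the survey rows, the field names (`county`, `respondents`) and the helper `clean_rows` are all hypothetical. It shows three typical cleaning actions: normalizing text, skipping rows with missing values, and removing duplicates.

```python
# Hypothetical survey rows; field names and values are invented for illustration.
raw_rows = [
    {"county": " Nairobi ", "respondents": "120"},
    {"county": "nairobi", "respondents": "120"},  # duplicate once normalized
    {"county": "Mombasa", "respondents": ""},      # missing value
    {"county": "KISUMU", "respondents": "85"},
]


def clean_rows(rows):
    """Normalize text, skip rows with missing counts, and remove duplicates."""
    seen = set()
    cleaned = []
    for row in rows:
        county = row["county"].strip().title()
        count_text = row["respondents"].strip()
        if not count_text.isdigit():
            continue  # count is missing or not a whole number
        record = (county, int(count_text))
        if record in seen:
            continue  # same county and count already kept
        seen.add(record)
        cleaned.append({"county": county, "respondents": record[1]})
    return cleaned


cleaned = clean_rows(raw_rows)
print(cleaned)
```

Interactive tools such as Google Refine apply similar steps through a user interface; a script like this is mainly useful when the same cleaning has to be repeated on new data.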

Data Analysis

Analytical Tools: Excel; SPSS; S; R; Python; Stata; Pivot Tables

Data Mining Processes: Hadoop; Weka
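Pivot tables, listed among the analytical tools above, summarize one value across two grouping fields. A minimal sketch in plain Python follows, assuming made-up records of (county, year, count); the data and the helper `pivot_sum` are hypothetical and stand in for what Excel or a statistics package would do.

```python
from collections import defaultdict

# Hypothetical records: (county, year, count). Values are invented for illustration.
records = [
    ("Nairobi", 2011, 40),
    ("Nairobi", 2012, 55),
    ("Mombasa", 2011, 8),
    ("Mombasa", 2012, 12),
    ("Nairobi", 2012, 5),
]


def pivot_sum(rows):
    """Sum the value for each (row key, column key) pair."""
    table = defaultdict(lambda: defaultdict(int))
    for row_key, col_key, value in rows:
        table[row_key][col_key] += value
    # Convert nested defaultdicts to plain dicts for easier inspection.
    return {row_key: dict(cols) for row_key, cols in table.items()}


pivot = pivot_sum(records)
for county in sorted(pivot):
    print(county, pivot[county])
```

Here rows are counties and columns are years; swapping the first two fields of each record would pivot the table the other way.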

Data Visualization

•  Data Visualization: Charts; graphs; pictures; maps

•  Infographic tools: Illustrator; Infogr.am; ManyEyes; GIS mapping; Tableau

•  HOW NOT TO VISUALIZE YOUR DATA!!!

iHub Research: Data Science and Visualization Lab

Data Visualization

We are surfacing new data: the latest statistics on the African tech sector.


We are providing a melting pot for different industries/sectors to utilize technology to discover and use data for decision making.


We are setting local industry standards on the use of data and influencing critical data-focused policies: data protection laws; freedom of information (FOI) laws; privacy and security.


We are developing new ways to effectively discover and use data in our local settings.

GSMA

On-going Projects

•  Umati II – automation of the hate-speech monitoring process

•  3Vs Crowdsourcing Framework – viability, validation and verification

•  Investment Research – Mapping the Tech Investment Landscape in Kenya

•  Infographics – Africa infographics; tech statistics and data

•  Data Warehousing solutions – Tools to archive and discover our own research data and information

How can you be part of this?

•  Training
•  Business Support
•  Consultancy
•  Use Cases for iHub Cluster
•  Data Challenges

Example of sites developed by Code4Kenya
