machine learning for wireless networks @bestcom2016

37
Machine learning for improving wireless network performance Merima Kulin, Eli De Poorter, Dirk Deschrijver, Tom Dhaene and Ingrid Moerman [email protected] Internet Based Communication Networks and Services research group (IBCN)- IDLab Department of Information Technology (INTEC) Ghent University - imec BESTCOM, 21.10.2016., Louv

Upload: merima-kulin

Post on 11-Jan-2017

55 views

Category:

Technology


1 download

TRANSCRIPT

Page 1: Machine learning for wireless networks @Bestcom2016

Machine learning for improving wireless network performance

Merima Kulin, Eli De Poorter, Dirk Deschrijver, Tom Dhaene and Ingrid [email protected] Internet Based Communication Networks and Services research group (IBCN)- IDLabDepartment of Information Technology (INTEC)Ghent University - imec BESTCOM, 21.10.2016., Louvain

Page 2: Machine learning for wireless networks @Bestcom2016

Machine learning for wireless networks• Introduction• Data-driven design: examples• Data science in wireless networks: a

tutorial• Conclusion

2

Page 3: Machine learning for wireless networks @Bestcom2016

Why machine learning?• Gartner's Hype Cycle for Emerging Technologies

2015.

2016.

Page 4: Machine learning for wireless networks @Bestcom2016

Computationalpower

Massive amounts of data

Unprecedented advances

in ML

Why machine learning?

Data is the new oil!

Page 5: Machine learning for wireless networks @Bestcom2016

What kind of data are generating wireless networks?

IoTNetwork

monitoring

Cognitive radio

Wireless networksas data sources

Page 6: Machine learning for wireless networks @Bestcom2016

Data Science

What is data science?

Machine learning

Data mining Data analysis

ML algorithm selection

Model Evaluation

Pre-processing

Kulin, Merima, Carolina Fortuna, Eli De Poorter, Dirk Deschrijver, and Ingrid Moerman. "Data-Driven Design of Intelligent Wireless Networks: An Overview and Tutorial." Sensors 16, no. 6 (2016): 790.

• Machine learning vs.

• Data mining vs.

• Data science

“Data science is the study of generalizable extraction of knowledge from data”.

Page 7: Machine learning for wireless networks @Bestcom2016

Machine learning in wireless networks• Introduction• Data driven design: examples• Data science in wireless networks: a

tutorial• Conclusion

7

Page 8: Machine learning for wireless networks @Bestcom2016

Data mining/Machine learning approaches8

Regression Classification

Clustering Anomaly detection

Page 9: Machine learning for wireless networks @Bestcom2016

Regression9

Regression

RegressionX Y

Page 10: Machine learning for wireless networks @Bestcom2016

Regression - example10

Application area: Localization

Vanheel, F.; Verhaevert, J.; Laermans, E.; Moerman, I.; Demeester, P. Automated linear regression tools improve rssi wsn localization in multipath indoor environment. EURASIP J. Wirel. Commun. Netw. 2011, 2011, 1–27.

RegressionRSSI distance

Page 11: Machine learning for wireless networks @Bestcom2016

Classification11

ClassifierX Y

Classification

C1C2

C3

C1

C2

C3

Page 12: Machine learning for wireless networks @Bestcom2016

Classification: example12

ClassifierRSSIZigbee

WiFi

Bluetooth

Application area: System recognition

Zheng, Xiaolong, et al. "ZiSense: towards interference resilient duty cycling in wireless sensor networks." Proceedings of the 12th ACM Conference on Embedded Network Sensor Systems. ACM, 2014.

Microwave

-

-

--

Page 13: Machine learning for wireless networks @Bestcom2016

Clustering13

ClusteringX ?C1

C2

C3

Clustering

Page 14: Machine learning for wireless networks @Bestcom2016

Clustering: example14

Application area: System identification

Shetty, N.; Pollin, S.; Pawełczak, P. Identifying spectrum usage by unknown systems using experiments in machine learning. In Proceedings of the 2009 IEEE Wireless Communications and Networking Conference, Budapest, Hungary, 5–8 April 2009

ClusteringX ?Zigbee

WiFi

Noise

Page 15: Machine learning for wireless networks @Bestcom2016

Anomaly detection15

AnomalyX Y/?C1

C2

C3

Anomaly detection

Page 16: Machine learning for wireless networks @Bestcom2016

Which DM/ML method can you use?16

X Y

Uluagac, A. Selcuk, et al. "A passive technique for fingerprinting wireless deviceswith wired-side observations." Communications and Network Security (CNS), 2013IEEE Conference on. IEEE, 2013

Application area: ?

Measurements from a device Device type

?Classification

Page 17: Machine learning for wireless networks @Bestcom2016

Machine learning in wireless networks• Introduction• Data driven design: examples• Data science in wireless networks: a

tutorial• Conclusion

17

Page 18: Machine learning for wireless networks @Bestcom2016

The knowledge discovery process

Page 19: Machine learning for wireless networks @Bestcom2016

The knowledge discovery processStep 1: Understanding the problem domain

Problem formulation Fingerprinting wireless devices Identify devices and device types classification problem

Goal A new solution for

Network Access Control to enhance network security

Assumptions Packet generation is

influenced by hardware architecture (CPU, DMA, L1/L2 cache, ..)

Hypothesis Identify devices and/or

device types based on statistical properties of their traffic flows

Data collection Analyze inter-arrival times

(IATs) from several devices

Page 20: Machine learning for wireless networks @Bestcom2016

The knowledge discovery process

Collected data• IAT traces

Validate the data Is the selected data a

representative sample for solving the formulated problem?

Validate the hypothesis Is the stated hypothesis true

and the selected data mining task is likely to prove it?

Step 2: Understanding the data

Page 21: Machine learning for wireless networks @Bestcom2016

Device fingerprinting• Data collection

• Data repositories (e.g. CRAWDAD)• Run experiments on testbed facilities• Collect data in situ

Overall 94 files Total ~137 mil. Mean ~1.46 mil. Std ~1.3 mil

Dell

iPad

iPhone

Nokia

Page 22: Machine learning for wireless networks @Bestcom2016

Device fingerprintingStep 2: Understanding the data

22

Visual techniques Computational techniques

Five number summary Standard deviation, variance skewness Coefficient of determination (R2) Coefficient of correlation …

boxplot

PDF and CDF

Scatter plots

Histograms

Page 23: Machine learning for wireless networks @Bestcom2016

Device fingerprintingStep 2: Understanding the data

• Visual techniques• PDF, time-series, histograms…

23

Page 24: Machine learning for wireless networks @Bestcom2016

Device fingerprintingStep 2: Understanding the data

• Computational techniques• 5-num summary

24

Page 25: Machine learning for wireless networks @Bestcom2016

Device fingerprintingStep 2: Understanding the data

• Computational• Coefficient of determination (R2)• Analysis for device identification

25

How much can data from one Dell Notebook tell about the data from other Dell Notebooks?

DN2

DN

3

Page 26: Machine learning for wireless networks @Bestcom2016

The knowledge discovery processStep 3: Data pre-processing

Raw data• Traces of IAT

data points

Training data Features extraction Feature vectors Training examples

Page 27: Machine learning for wireless networks @Bestcom2016

Device fingerprintingData pre-processing

27

Page 28: Machine learning for wireless networks @Bestcom2016

Device fingerprintingData pre-processing

28

Page 29: Machine learning for wireless networks @Bestcom2016

Device fingerprintingData pre-processing

• Features extraction

29

Page 30: Machine learning for wireless networks @Bestcom2016

The knowledge discovery processStep 4: Data mining

Training data• Feature vectors

of histogram bins

Model Neural network

HL=6, α=0.1, learned weights

k-Nearest Neighbors K=1

Decision trees Logistic regression …

ML

Page 31: Machine learning for wireless networks @Bestcom2016

31

Device fingerprintingStep 5: Performance evaluation

Test data• Test set of

feature vectorsPerformance indication RMSE, MAE, R2… Precision, Recall, Confusion matrix …

Page 32: Machine learning for wireless networks @Bestcom2016

• Algorithm selection: k-fold cross validation

32

Device fingerprintingStep 5: Performance evaluation

Page 33: Machine learning for wireless networks @Bestcom2016

• Performance evaluation• Confusion matrix• Accuracy, Precision, Recall, accuracy, F1-score

33

Device fingerprintingStep 5: Performance evaluation

Page 34: Machine learning for wireless networks @Bestcom2016

34

Device fingerprintingStep 5: Performance evaluation

• Device type classification results

Page 35: Machine learning for wireless networks @Bestcom2016

35

Device fingerprintingStep 5: Performance evaluation

• Model tuning: neural networks

Kulin, Merima, Carolina Fortuna, Eli De Poorter, Dirk Deschrijver, and Ingrid Moerman. "Data-Driven Design of Intelligent Wireless Networks: An Overview and Tutorial." Sensors 16, no. 6 (2016): 790.

More details about how to tune your algorithm can be found:

Page 36: Machine learning for wireless networks @Bestcom2016

Conclusion36

• Data-driven network design can be used for• Failure detection• Systems recognition• Performance optimization…

• Data traces are valuable• Considering releasing data traces after use

• Need for increased collaboration• Network experts, testbed experts, data mining

experts, statisticians, wireless communication, etc.

Page 37: Machine learning for wireless networks @Bestcom2016