databin data’drivensecure- businessintelligence€¦ · majorchallenges...
TRANSCRIPT
DataBINData-‐Driven Secure Business Intelligence
Devdatt DubhashiDavid Sands
Major Challenges
• How do we automatically extract meaningful info from unstructured text, images, video …
• How do we structure the information for better data analytics?
• How do we scale to very Big Data?• How do we ensure privacy when mining info?
• Graph kernels for network structured data, ICML 2014, NIPS 2015, KDD 2015, CIKM Weighted Theta Functions, NIPS 2015
• Large scale optimization: clustering, domain adaptation, ICML 2017…
• Explanatory AI/ML: Causal and Counterfactual inference, ICML 2016, ICML 2017
• Explanatory AI/ML: Disentangled representations in deep nets.
• Deep Learning for NLP: char based RNNs.
• Differential Privacy: JMLR 2017, AAAI 2017
1Disciplinary research published at top-‐tier conferences
Demonstrators implemented and integrated into the tools of our industrial partners
2
Dissemination“AI is the New Electricity”
3
Swedish Symposium Deep Learning 2018
Competence Intelligence
Innovation
Privacy in the Age of Big Data
“Two recent surveys reveal that consumers’ concerns about online privacy are at an all-‐time high.” June 2014
“Big data might be big business, but overzealous data mining can seriously destroy your brand…”
Nov 2013
Research on Privacy in Data-‐Intensive Systems Differential Privacy
Location Privacy
Social Network Privacy
A Flavour of Differential Privacy
A personal question…
13
14
Answer YES
15
Answer YES
Answer NO
16
Answer YES
Answer NO
Answer TRUTHFULLY
Differential Privacy
Emerging mathematical definition of privacy
Essence: the participation of any one individual won’t change the result of the survey in a noticeable way
Consequence: a robust definition with good properties
17
Results in the DataBIN Project
• Programming framework that achieve privacy by construction– no need to trust the programmer
• A Framework for Local Differential Privacy– no need to trust the analyst
• Machine Learning with Differential Privacy
DataBIN PhDs
Olof MogrenDeep Learning NLP
Hamid EbadiDifferential Privacy Raul Pardo (INRIA Lyon)
Privacy in Social Networks
Fredrik Johansson (MIT)Machine Learning, Causal Inference
See Posters