generating password challenge questions chuong ngo

30
Generating Password Challenge Questions Chuong Ngo

Upload: wilfred-bruce

Post on 31-Dec-2015

216 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Generating Password Challenge Questions Chuong Ngo

Generating Password Challenge Questions

Chuong Ngo

Page 2: Generating Password Challenge Questions Chuong Ngo

Online Services and the Problem of Account Security

E-commerce, banking, e-mail, etc� Average: 26 diffeerent online accounts

5 unique passwords 25 to 30: 40+ accounts

2012 online fraud cases: 3x 2010 case count 90% of accounts require user id and password

Passwords need to be strong and unique

Page 3: Generating Password Challenge Questions Chuong Ngo

Passwords: So secure you can't remember it?

Memorability vs security - negative correlation

Password recovery systems a must SMS E-mail Snail mail Challenge questions

Page 4: Generating Password Challenge Questions Chuong Ngo

Why Challenge Question?

User must answer agreed upon questions to validate identity

Most commonly used systemAdvantages Disadvantages

Resilient to opportunistic attacks Quality question pool

Can be automated/little training required Susceptible to targeted attacks

On-demand/quick turn-around time Invasion of privacy

False rejection

Page 5: Generating Password Challenge Questions Chuong Ngo

Just How Safe and Secure?

System is weak & exploitable Answers easy to obtain/public domain

Social media 12% answerable with social media info

Applicability & repeatability

Page 6: Generating Password Challenge Questions Chuong Ngo

Can It be Salvaged?

Treat challenge questions like passwords Must value memorability Avoid too many “easy” answers

Large pool of challenge questions What if the questions were targeted and

personal?

Page 7: Generating Password Challenge Questions Chuong Ngo

Targeted Challenge Questions

Applicability and repeatability negligible More personal, more secure & memorable Greater answer variety from long-form answers

Make a system the uses or generates challenge questions that target the user's strong, personal memories.

Page 8: Generating Password Challenge Questions Chuong Ngo

System Concept

Current System (Simplified) Concept System

Prompt user to select questions Prompt user with general question

Capture user responses Capture user responses (long-form)

Store user responses Run responses through NLP engine

Store responses and extracted entities

Current System (Simplified) Concept System

Query for challenge question Query for stored response or web of data

Capture user response Modify stored response/generate challenge question

Compare response against stored responses

Capture user response

Run response through NLP engine

Compare response to stored response

Data Ingest

Data Retrieval

Page 9: Generating Password Challenge Questions Chuong Ngo

The Natural Language Processing Engineat the Heart of it All

Page 10: Generating Password Challenge Questions Chuong Ngo

The NLP Engine

Uses Stanford CoreNLP Pipeline includes:

Tokenizer Sentence Splitteer PoS Tagger Morpha Annotator NER Parsing Coreferencer

Page 11: Generating Password Challenge Questions Chuong Ngo

Notable Pipeline Absences

No sentiment analyzer Requires training for individuals No real advantage

No relationship analyzer Beyond scope

Limited use of the coreferencer and dependency tree. Focused on named noun entities (NN) to simplify

implementation.

Page 12: Generating Password Challenge Questions Chuong Ngo

Fill-in-the-Blanks (FitB) ApproachA First Step

Page 13: Generating Password Challenge Questions Chuong Ngo

FitB Approach Overview

Challenge question is open-ended and general. User provides a long-form response.

Presents user with the modifieed answer to the challenge question. User must “fiell-in-the-blanks”/correct the mistakes.

Authentication done by comparing user's responses to the missing entities. Match must meet or exceed a threshold.

Page 14: Generating Password Challenge Questions Chuong Ngo

An Example

Bob is a great uncle. He loved to fiesh and would do so as ofteen as he could near his home in Minnesota. He taught me to fiesh over the summer that I stayed with him. Everyday, we would go to a nearby stream. Thee stream would later feed into the Mississippi River.

[Blank] is a great uncle. He loved to fiesh and would do so as ofteen as he could near his home in [Blank]. He taught me to fiesh over the summer that I stayed with him. Everyday, we would go to a nearby stream. Thee stream would later feed into the [Blank] River.

Page 15: Generating Password Challenge Questions Chuong Ngo

Why does it work?

It is a single story. Multiple NNs related to the same idea.

It is memorable. Prompt helps to kick start memory.

Simple and fast Does not overly burden the user. Avoids the problem of question generation. Easily extensible

No web of knowledge – preserves privacy.

Page 16: Generating Password Challenge Questions Chuong Ngo

Where does it fall short?

Potentially low entropy in question pool. Queestion is not generated.

No web of knowledge – no context. Unable to correlate multiple stored user responses.

Dependent on large number of NNs. Needs clean, non-noisy input. Token matches does not tolerate much deviation. Some private information may be leaked. Unable to be integrated into other sources of information. Signifiecant setup time.

Page 17: Generating Password Challenge Questions Chuong Ngo

Future Work

Diffeerent user interfaces Example: pictures

Incorporate additional processors Example: relationship analyzer

Increase the number of data points to match.

Page 18: Generating Password Challenge Questions Chuong Ngo

Document Retrieval ApproachA Slight Twist

Page 19: Generating Password Challenge Questions Chuong Ngo

Document Retrieval Approach

Similar to FitB approach. User is prompted to answer the same challenge

question they originally wrote an answer for. User's answers run through NLP engine, NNs

extracted. NNs used to search through all registered answer

documents, matching via bag-of-words count. Authenticated if match is above a specifieed

threshold.

Page 20: Generating Password Challenge Questions Chuong Ngo

Not Quite Right...

Cannot use regular bag-of-words approach. Source document and user-provided

answer document may diffeer too much.

Not backed by web of knowledge. Does not reveal private information.

Page 21: Generating Password Challenge Questions Chuong Ngo

Future Work

May benefiet from existing search engine technologies (ex. Lucene).

May benefiet from more data points to match.

Page 22: Generating Password Challenge Questions Chuong Ngo

Generating Questions from a Web of Knowledge (WOK) Approach

Now I Understand Why This is Still Unsolved

Page 23: Generating Password Challenge Questions Chuong Ngo

WOK Approach Overview

NLP engine extracts the NNs from the user's initial response.

User is prompted to provide more information for the NNs. Information stored in WOK.

Challenge questions generated from WOK. Answers compared to the information in

the WOK.

Page 24: Generating Password Challenge Questions Chuong Ngo

Making the WOK

Utilized Protege Popular java

library for OWL and RDF.

Information stored as OWL data models.

Page 25: Generating Password Challenge Questions Chuong Ngo

Generating the Questions

A random class is chosen from the WOK.

Queestion is generated using a property's label id and a template question.

User's response is matched against the property's value.

Page 26: Generating Password Challenge Questions Chuong Ngo

An Example

Property Value

#type Person

#fName Bob

#lName Ngo

#livesIn Minnesota

#relation Uncle

#name Bob Ngo

What is the [Blank] of your [Blank]?

What is the livesIn of your Bob Ngo?

What is the name of your Uncle?

What is the relation of your Minnesota?

Property Value

#type Location

#name Minnesota

#relation Bob's home

Page 27: Generating Password Challenge Questions Chuong Ngo

Why doesn't it work?

Queestion generation algorithm needs to be less naive. Generated questions are very impersonal.

Not really an improvement over current method. Creation of WOK is not automatic/semi-automatic. Expected answer must be an exact match. Greater invasion of privacy – has WOK. Signifiecant setup time.

Page 28: Generating Password Challenge Questions Chuong Ngo

Future Work

Queestion generation algorithm must be improved.

Incorporation of additional NLP technologies for a smarter WOK.

Ontology is the wrong technology?

Page 29: Generating Password Challenge Questions Chuong Ngo

Conclusion

FitB approach is the most ready for deployment.

Document retrieval approach evaluation incomplete.

WOK approach needs a lot more work.

Page 30: Generating Password Challenge Questions Chuong Ngo

Questions?