cmsc434 week 13 | lecture 25 | nov 26, 2013 evaluation ii · week 13 | lecture 25 | nov 26, 2013...

Human Computer Interaction Laboratory

@jonfroehlich Assistant Professor Computer Science

CMSC434 Introduction to Human-Computer Interaction

Week 13 | Lecture 25 | Nov 26, 2013

Evaluation II

WORK WITH

Professor Jordan Graber

Quiz Bowls!

Robots!

Research!

Oh My!

jbg@umiacs.umd.edu

Hall of Fame Hall of Shame

Source: http://en.flossmanuals.net/firefox/ch036_firefox-security-features/

1. Jordan

2. Schedule

3. Evaluation II

4. In-Class Activity (if time)

5. Give back quizzes

Genres of Assessment

[Nielsen, J., and Molich, R. (1990). Heuristic evaluation of user interfaces, CHI'90;

Nielsen, J. (1994). Heuristic evaluation. In Nielsen, J., and Mack, R.L. (Eds.), Usability Inspection Methods;

http://www.useit.com/papers/heuristic/]

Inspection-Based Methods

Based on the skills and

experience of evaluators

These are sometimes also

called “Expert Reviews”

Automated Methods

Usability measures computed

by software

Automated Methods

by software

Formal Methods

Models and formulas to

calculate and predict

measures semi-automatically

Automated Methods

by software

Formal Methods

Empirical Methods

Evaluation assessed by

testing with real users

experience of evaluators:

Automated Methods

by software

Formal Methods

Empirical Methods

1. Heuristic Evaluation

2. Walkthroughs

Discount Usability Techniques

Heuristic Evaluation

Nielsen, J. (1994). Heuristic evaluation. In Nielsen, J., and Mack, R.L. (Eds.), Usability Inspection Methods.]

Heuristic evaluation involves having a small set of

evaluators examine the interface and judge its

compliance with recognized usability principles

(the "heuristics").

Heuristic evaluation involves having a small set of

evaluators examine the interface and judge its

compliance with recognized usability principles

(the "heuristics").

JakobNielsen, Ph.D. "The Guru of Web Page Usability" (NYT)

Inventor of Heuristic Evaluation

Nielsen’s 10 Heuristics

1. Visibility of

System Status System should

always keep users

informed, through

appropriate

feedback at

reasonable times.

2. Match System

& Real World The system should

speak the user’s

language, with

familiar words.

Information should

appear in natural

and logical order.

3. User Control

& Freedom Users often choose

functions by

mistake and need a

clearly marked

“emergency exit.”

Support undo and

4. Consistency

& Standards. Users should not

have to wonder

whether different

words/actions mean

the same thing.

Follow platform

conventions.

5. Error

Prevention Even better than

good error

messages is a

careful design that

prevents the

problem in the 1st

place.

6. Recognition

Over Recall Minimize the user’s

memory load by

making/actions,

options visible. The

user shouldn’t have

to remember from

one dialog to next.

7. Flexibility &

Efficiency Accelerators

(unseen by novice

users) often speed

up interaction for

expert users. Allow

users to tailor

frequent actions.

8. Aesthetic &

Minimalism Interfaces shouldn’t

contain irrelevant

information. Every

unit of info comp-

etes for attention &

diminishes relative

visibility.

9. Help Users

Recognize,

Diagnose, &

Recover from

Errors. Error msgs in plain

language, precisely

indicate problem,

suggest solution.

10. Help &

Documentation Best to not need

documentation but

when necessary,

should be easy to

search, focused on

user tasks, and list

concrete steps.

Densest Slide of Year Award!

Phases of Heuristic Evaluation 1. Pre-evaluation training: Give evaluators needed domain

knowledge & information on the scenario

2. Evaluation: For ~1-2 hours, independently inspect the

product using heuristics for guidance. Each expert should

take more than one pass through the interface.

3. Severity rating: Determine how severe each problem is

4. Aggregation: Group meets & aggregates problems (with

ratings)

5. Debriefing: Discuss the outcome with design team

27 [Slide from Professor Leah Findlater]

Severity Ratings

0 – don't agree that this is a usability problem

1 - cosmetic problem

2 - minor usability problem

3 - major usability problem; important to fix

4 - usability catastrophe; imperative to fix

[H4 Consistency] [Severity 3] [Fix 0]

The interface used the string "Save" on the first screen for saving the

user's file, but used the string "Write file" on the second screen. Users

may be confused by this different terminology for the same function.

(fairly severe, but easy to fix)

[Slide from Professor Leah Findlater]

How Many Evaluators?

In principle, individual evaluators can perform a heuristic

evaluation of a user interface on their own but…

Usability Problem (ordered from easiest to

find to hardest to find)

Hard to Find

Usability Problem

Easy to Find

Usability Problem

10. Help &

documentation but

when necessary,

should be easy to

search, focused on

concrete steps.

Each row

represents a

usability problem

Hard to Find

Usability Problem

Evaluator (ordered from least successful

evaluator to most successful) Each column is

an individual

evaluator

10. Help &

documentation but

when necessary,

should be easy to

search, focused on

concrete steps.

Each row

represents a

usability problem

Easy to Find

Usability Problem

Hard to Find

Usability Problem

Evaluator (ordered from least successful

evaluator to most successful)

“Worst” evaluator only found

3 usability problems (and they

were the easiest to find)

10. Help &

documentation but

when necessary,

should be easy to

search, focused on

concrete steps.

Easy to Find

Usability Problem

Automated Methods

by software

“Best” evaluator found 10

usability problems (but not

the two “hardest”)

Empirical Methods

Well, then, how many evaluators should we use?

Nielsen recommends ~5

evaluators (at least 3), which

balances cost/benefit.

Single evaluators

found, on average,

~35% of usability

problems.

Heuristic Evaluation Critiques

Shortly after heuristic evaluation was developed, several

independent studies compared heuristic evaluation

with other methods (e.g., user testing.) They found that

different approaches identified different problems; some-

times heuristic evaluation missed severe problems.

[Rogers et al., Interaction Design, Chapter 15, 2011]

Another problem concerns experts reporting problems

that don’t exist.

Another problem concerns experts reporting problems

that don’t exist. A study by Bailey (2001) found that

33% of problems were real usability problems; 21% of

problems were missed; and 43% of problems identified

by experts were not problems at all.

[Said at UPA2009 panel as quoted by Jeff Sauro: http://www.measuringusability.com/blog/he.php]

Heuristic evaluations are 99% bad.

RolfMolich Co-Inventor of Heuristic Evaluation

[Said at UPA2009 panel as quoted by Jeff Sauro: http://www.measuringusability.com/blog/he.php]

Heuristic evaluations are 99% bad.

RolfMolich Co-Inventor of Heuristic Evaluation

Rogers et al., Interaction Design, Chapter 15, 2011]

Heuristic Evaluation asdf

o Tends to uncover many low

severity problems; severe

problems can be missed

o Can be expensive and

difficult to find 3-5 usability

professionals (sometimes

more are needed!)

o Sometimes experts are

o No special facilities needed

o No participants required;

no user testing

o Is quick and dirty (a

discount usability method)

Heuristic Evaluation Heuristic Evaluation

Automated Methods

by software

Formal Methods

Empirical Methods

2. Walkthroughs

Automated Methods

by software

Formal Methods

Empirical Methods

2. Walkthroughs

Walkthroughs

Walkthroughs are an alternative approach to heuristic

evaluation for predicting users’ problems without

doing user testing. They involve walking through a

task with an interface/product and noting problematic

usability features.

Cognitive Walkthroughs

[Rogers et al., Interaction Design, Chapter 15, 2011; http://en.wikipedia.org/wiki/Cognitive_walkthrough]

One type of walkthrough that involves simulating a

user’s problem-solving process at each step of

interaction with an interface.

Whereas heuristic evaluation takes a holistic view to

catch problems, cognitive walkthroughs are task

specific.

Cognitive Walkthroughs

The defining feature of [cognitive walkthroughs] is that

they focus on evaluating designs for ease of

learning—a focus that is motivated by observations that

users learn by exploration.

Pre-study Step: characteristics of typical users are

identified; sample tasks are created; a clear sequence of the

actions needed to accomplish task are documented

Walkthrough Step: Designer and one or more evaluators

come together to perform analysis; evaluators walk through

each step and try to answer these questions:

Performing Cognitive Walkthroughs

1. Will the user know what to do to achieve the task?

2. Will the user notice that the correct action is available?

3. Will the user interpret the response from action correctly?

Information Recording: As the walkthrough occurs, critical

information is compiled about: assumptions, problems, etc.

Design Revision: The recorded information is analyzed,

design improvement suggestions are made, and design is

iterated upon

Rogers et al., Interaction Design, Chapter 15, 2011]

Heuristic Evaluation asdf

o Time-consuming and

laborious

o Evaluators do not always

have a good understanding

of users

o Only a limited number of

tasks/scenarios can be

explored

o Strong focus on tasks

o Compared with HE, more

detail on moving through

an interaction w/system

o Perhaps most useful for

applications involving

complex operations

Walkthroughs Walkthroughs

Automated Methods

by software

Formal Methods

Empirical Methods

Automated Methods

by software

Formal Methods

Empirical Methods

1. GOMS Model

2. Keystroke Level Model (KLM)

Formal Methods

Similar to inspection methods and analytics, predictive

models (formal methods) evaluate a system without

users being present.

Formal Methods

Similar to inspection methods and analytics, predictive

models (formal methods) evaluate a system without

users being present. Rather than involving expert

evaluators or tracking usage, predictive models use

formulas to derive various measures of performance.

Automated Methods

by software

Formal Methods

Empirical Methods

1. GOMS Model

GOMS Model

A GOMS model, as proposed by Card, Moran, and

Newell (1983), is a description of the knowledge that a

user must have in order to carry out tasks on a device

or system; it is a representation of the "how to do it"

knowledge that is required by a system in order to get

the intended tasks accomplished.

[Kieras, A Guide to GOMS Analysis, 1994; Card et al., The Psychology of Human-Computer Interaction, 1983]

GOMS Model

[Rogers et al., Interaction Design, Chapter 15, 2011; Card et al., The Psychology of HCI, 1986]

An attempt to model the

knowledge and cognitive

processes involved when a

user interacts with a system

Goals refers to a particular state the

user wants to achieve

Operators refers to the cognitive

processes and physical actions that

need to be performed to achieve those

Methods are learned procedures for

accomplishing the goals

Selection rules are used to determine

which method to select when there is

more than one available.

GOMS Model Example 1

Recall that word to be deleted has to be highlighted

Recall that command is ‘cut’

Recall that command ‘cut’ is in edit menu

Accomplish goal of selecting and executing the ‘cut’ command

Return with goal accomplished

Goal: Delete a word in a sentence in Microsoft Word

Method 1: Using menus

Recall where to position cursor in relation to word to be deleted

Recall which key is backspace key

Press backspace key to delete each letter

Method 2: Using backspace key

cmsc434 week 13 | lecture 25 | nov 26, 2013 evaluation ii · week 13 | lecture 25 | nov 26, 2013...

Documents

powerpoint presentation...human computer interaction...

business week nov 2013

cmsc434 week 02 | lecture 03 | sept 09, 2014 design … ·...

ceg handbook 11 mon 28 oct – fri 1 nov 2013 week 12 mon 4...

week 2 updated on nov. 2

nov. 27 this week at fbc

xed ca newsletter week nov 01- nov 07

cmsc434 week 12 | lecture 23 | nov 19, 2013 visual design...

cmsc434 week 01 | lecture 01 | sept 02, 2014 design ... ›...

week 1 oct 7 week 2 oct 14 week 3 oct 28 week 4 nov 4 week 5...

nov. 19 this week at fbc

week 11--nov. 14

fb content images week 1 nov

cmsc434 week 07 | lecture 13 | oct 14, 2014 design...

kamloops this week nov 4, 2014

cmsc434 week 08 | lecture 16 | oct 23, 2014 · 1. android:...

nov 2nd week

cmsc434 week 03 | lecture 06 | sept 18, 2014 understanding...

cse week of nov. 15

cmsc434 week 02 | lecture 04 | sept 11, 2014 -...