interpretability, trust and moralityin...

11
INTERPRETABILITY , TRUST , AND MORALITY IN A UTONOMY D AVID D ANKS PHILOSOPHY & PSYCHOLOGY CARNEGIE MELLON

Upload: others

Post on 11-Sep-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: INTERPRETABILITY, TRUST AND MORALITYIN AUTONOMYqav.cs.ox.ac.uk/.../img/DanksMoralityTrustWorkshop17.pdf · 2017. 7. 20. · Trust & interpretability ¨Routes to interpretability &

INTERPRETABILITY, TRUST, AND

MORALITY IN AUTONOMY

DAVID DANKSPHILOSOPHY & PSYCHOLOGYCARNEGIE MELLON

Page 2: INTERPRETABILITY, TRUST AND MORALITYIN AUTONOMYqav.cs.ox.ac.uk/.../img/DanksMoralityTrustWorkshop17.pdf · 2017. 7. 20. · Trust & interpretability ¨Routes to interpretability &

¨ When can someone ethically use/deploy an autonomous system?

Page 3: INTERPRETABILITY, TRUST AND MORALITYIN AUTONOMYqav.cs.ox.ac.uk/.../img/DanksMoralityTrustWorkshop17.pdf · 2017. 7. 20. · Trust & interpretability ¨Routes to interpretability &

What do I mean by ‘autonomy’?

¨ Characterize ‘autonomy’ in terms of capabilities(= context-specific abilities)

¤ Planning (routes, action sequences, …)¤ Learning (environmental statistics, adaptation, …)¤ Deciding (action selection, classification, …)¤ …

¨ Richer & more useful than ‘levels’¤ Though obviously also more complicated…

Page 4: INTERPRETABILITY, TRUST AND MORALITYIN AUTONOMYqav.cs.ox.ac.uk/.../img/DanksMoralityTrustWorkshop17.pdf · 2017. 7. 20. · Trust & interpretability ¨Routes to interpretability &

Ethical deployment

¨ Necessary condition for ethical deployment: Reasonable belief that the “system” will behave (approximately) as the user intends¤ Agnostic about whether “system” is human or artificial¤ User’s intentions can be quite high-level (“drive safely”)

¨ Alternate formulation: User must trust the “system”¤ In relevant respects, not necessarily all of them

Page 5: INTERPRETABILITY, TRUST AND MORALITYIN AUTONOMYqav.cs.ox.ac.uk/.../img/DanksMoralityTrustWorkshop17.pdf · 2017. 7. 20. · Trust & interpretability ¨Routes to interpretability &

Varieties of (psychological) trust

1. Reliability/Predictability of Trustee¤ “Behavior”-focused¤ Knowing what the trustee will do¤ Coordination & prediction in known circumstances

2. Understanding the Trustee¤ Belief/value-focused¤ Knowing why the trustee will do¤ Coordination & prediction in novel circumstances

Page 6: INTERPRETABILITY, TRUST AND MORALITYIN AUTONOMYqav.cs.ox.ac.uk/.../img/DanksMoralityTrustWorkshop17.pdf · 2017. 7. 20. · Trust & interpretability ¨Routes to interpretability &

Trust & ethical deployment

¨ ⇒ If system will use autonomous capabilities, then deployer must have “understanding trust”

Specify desired system behavior before deployment?

“Reliability trust” is sufficient

Yes “Understanding trust” is required

No

Page 7: INTERPRETABILITY, TRUST AND MORALITYIN AUTONOMYqav.cs.ox.ac.uk/.../img/DanksMoralityTrustWorkshop17.pdf · 2017. 7. 20. · Trust & interpretability ¨Routes to interpretability &

Trust & interpretability

¨ Interpretability is key for “understanding trust”

¨ ⇒ Necessary condition for ethical deployment is: “System behavior for Goal is interpretable by User”¤ Note: Interpretability is not a property of System alone

Page 8: INTERPRETABILITY, TRUST AND MORALITYIN AUTONOMYqav.cs.ox.ac.uk/.../img/DanksMoralityTrustWorkshop17.pdf · 2017. 7. 20. · Trust & interpretability ¨Routes to interpretability &

Trust & interpretability

¨ Routes to interpretability & “understanding trust”1. Explicit requirement that System plan/learn/decide

similarly to humans2. Have deployment decisions made in collaboration

with (informed) developers3. Extended user experience in many different contexts

Page 9: INTERPRETABILITY, TRUST AND MORALITYIN AUTONOMYqav.cs.ox.ac.uk/.../img/DanksMoralityTrustWorkshop17.pdf · 2017. 7. 20. · Trust & interpretability ¨Routes to interpretability &

What about system morality?

¨ Observation: Humans typically interpret others’ unethical behavior using “internal” features¤ Conjecture: Non-developers will usually interpret

unethical system behavior as due to an unethical nature

¨ Conjecture: If the developer cares about having an ethical system, then any u-trustworthy system will (mostly) act ethically

Page 10: INTERPRETABILITY, TRUST AND MORALITYIN AUTONOMYqav.cs.ox.ac.uk/.../img/DanksMoralityTrustWorkshop17.pdf · 2017. 7. 20. · Trust & interpretability ¨Routes to interpretability &

Conclusions

¨ Ethical deployment requires trust (in system)¨ If a system employs autonomous capabilities, then

understanding-trust is required¨ Interpretability is necessary for understanding-trust

¤ Reminder: ‘interpretability’ is a three-argument relation

¨ ⇒ Interpretability is nec. for ethical deployment¤ Though many routes to this type of interpretability

¨ Conjecture: Alternate route to try to achieve ethical system behavior?

Page 11: INTERPRETABILITY, TRUST AND MORALITYIN AUTONOMYqav.cs.ox.ac.uk/.../img/DanksMoralityTrustWorkshop17.pdf · 2017. 7. 20. · Trust & interpretability ¨Routes to interpretability &

Thanks!