recognizing the impact of ai · ai and media metadata management advances in computer vision...

27
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. David Pearson, AWS AI Services May 2017 Recognizing the Impact of AI Media and Entertainment – in –

Upload: others

Post on 25-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

David Pearson, AWS AI Services

May 2017

Recognizing the Impact of AI

Media and Entertainment

– in –

Page 2: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

Media Metadata Management

Audience Engagement

Lifelike Speech

Recognizing the Impact of AI…

Page 3: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

AI and Media Metadata Management

Advances in computer vision enables:

• Detection of objects, scenes, and concepts in images

• Estimation of age range, gender and emotion in faces

• Recognition of individuals in images and video

Page 4: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

Using AI to Extract Metadata from Visual Content

objects, scenes, facial attributes, people

rich media

index

Page 5: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

Deer 98.8%

Wildlife 95.1%

Conifer 95.1%

Spruce 95.1%

Wood 78.3%

Tree 63.5%

Forest 63.5%

Vegetation 61.9%

Pine 60.6%

Outdoors 54.0%

Flower 53.9%

Plant 52.9%

Nature 50.7%

Field 50.7%

Grass 50.7%

Page 6: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

smart cropping

& ad overlays

demographic &

sentiment analysis

face editing

& pixelation

Age Range 38-59

Beard: False 84.3%

Emotion: Happy 86.5%

Eyeglasses: False 99.6%

Eyes Open: True 99.9%

Gender: Male 99.9%

Mouth Open: False 86.2%

Mustache: False 98.4%

Smile: True 95.9%

Sunglasses: False 99.8%

Landmarks

EyeLeftEyeRightNoseMouthLeftMouthRightLeftPupilRightPupilLeftEyeBrowLeftLeftEyeBrowRightLeftEyeBrowUp

:

Page 7: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

Audience Analysis

• Touchless data gathering via cameras facing the audience

• Anonymous, high volume demographic and sentiment capture

• Analysis produces usable feedback trends and patterns

AUDIENCE CAMERA

Page 8: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age
Page 9: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

Facial Matching and Recognition

Page 10: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

How AI Analyzes Faces

Face Detection Landmark Feature Extraction Identification/Recognition

Attributes Verification/Comparison

Index/SearchEstimated age range,

gender, and emotion;

facial hair, smiling++

Face comparison,

match, index and

search

Page 11: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

C-SPAN’s Index of

Public Figures

Page 12: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

AI and Active Audience Engagement

Advances in chatbot technologies enable:

• Fan exchanges with character bots via social

media, mobile and web apps

• Employee conversations with internal support bots

for help desk assistance

• Spoken interactions between executives and

enterprise information

Page 13: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

I’d like to book a flight to London

Sure! Do you want to fly to Heathrow or Gatwick?

Conversational Chatbots

Page 14: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

Heathrow, pleaseDestination:

LHR

Conversational Chatbots

I’d like to book a flight to London

Sure! Do you want to fly to Heathrow or Gatwick?

Page 15: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

When would you like to fly?

Next WednesdayDeparture:

5/31/2017

Conversational Chatbots

Heathrow, pleaseDestination:

LHR

I’d like to book a flight to London

Sure! Do you want to fly to Heathrow or Gatwick?

Page 16: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

Origin

Destination

Departure Date

Flight Booking

“I’d like to book a flight

to London”

Automatic

Speech RecognitionNatural Language

Understanding

Book Flight

London

Utterances

Flight booking

London Heathrow

LHR

LocationLocation

LAX

Prompt

“When would you like to fly?”

“When would you

like to fly?”

Text To

Speech

Intent /Slot model

UserPreferences

Page 17: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

Origin

Destination

Departure Date

Flight Booking

“Next Wednesday”Automatic

Speech Recognition

Next Wednesday

Natural Language

Understanding

Flight booking

05 / 31 / 2017

LHR

LAX

05/31/2017

Confirmation

“Your flight is booked for next Wednesday”

“Your flight is booked

for next Wednesday”

Fulfilment

Utterances

Intent /Slot model

Text To

Speech

Page 18: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age
Page 19: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

AI and Lifelike Speech

Advances in speech to text technologies enable:

• Computer-generated natural speech

• Automatic, accurate text processing

• Intelligible and easy to understand

• Semantic additions to text

• Customized pronunciation

Page 20: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

Text To Speech Quality

Natural sounding speech• A subjective measure of how close is TTS output to human speech

Accurate text processing• Ability of the system to interpret common text formats such as

abbreviations, numerical sequences, homographs etc.

Today in Las Vegas, NV it's 90°F.

"We live for the music", live from the Madison Square Garden.

Highly intelligibile• A measure of how comprehensible speech is.

“Peter Piper picked a peck of pickled peppers.”

Page 21: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

Improving Text to Speech with SSML

Speech Synthesis Markup Language

• W3C recommendation, an XML-based markup

language for speech synthesis applications

<speak>

My name is Kuklinski. It is spelled

<prosody rate='x-slow'>

<say-as interpret-as="characters">Kuklinski</say-as>

</prosody>

</speak>

Page 22: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

Custom Pronunciation with Lexicons

Enables developers to customize the pronunciation of

words or phrases

My daughter’s name is Kaja.

<lexeme>

<grapheme>Kaja</grapheme>

<grapheme>kaja</grapheme>

<grapheme>KAJA</grapheme>

<phoneme>"kaI.@</phoneme>

</lexeme>

Page 23: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

Speech Synchronization

Synchronize speech with visual content for more lifelike

speech behavior from characters & avatars

• Request an additional stream of TTS metadata

containing sentence word timings

• Use the metadata stream alongside the synthesized

speech audio stream to sync audio and visual

Page 24: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age
Page 25: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

Amazon AI

Intelligent Services Powered By Deep Learning

https://aws.amazon.com/blogs/ai/

https://aws.amazon.com/amazon-ai/

Page 26: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

“The future is here,

it’s just not evenly distributed yet”

William Gibson

Page 27: Recognizing the Impact of AI · AI and Media Metadata Management Advances in computer vision enables: • Detection of objects, scenes, and concepts in images • Estimation of age

Thank You!

[email protected]