Unicode Emoji
M. Davis
Unicode Consortium
Enable everybody,
speaking every language on the Earth,
to be able to use their language
on computers and smartphones.
Unicode Consortium Projects
Internationalization Code    ICU
Locale Data                  CLDR
Characters
Char Props & Algorithms      UTC
Unicode Characters
1999: Japan
2007: Unicode expands scope
2008: Gmail / iPhone / …
2010: Unicode emoji
from “Instagram Emojineering”
2016
unicode.org/emoji/charts-beta
Properties
General: Name, Name_Alias, Block, Age, General_Category, Script, Script_Extensions, White_Space, Alphabetic, Hangul_Syllable_Type, Noncharacter_Code_Point, Default_Ignorable_Code_Point, Deprecated, Logical_Order_Exception, Variation_Selector
Identifiers: ID_Continue, ID_Start, XID_Continue, XID_Start, Pattern_Syntax, Pattern_White_Space, ...
Case: Uppercase, Lowercase, Lowercase_Mapping, Titlecase_Mapping, Uppercase_Mapping, Case_Folding, Simple_Lowercase_Mapping, Simple_Titlecase_Mapping, Simple_Uppercase_Mapping, Simple_Case_Folding, Soft_Dotted, Cased, Case_Ignorable, Changes_When_Lowercased, Changes_When_Uppercased, Changes_When_Titlecased, Changes_When_Casefolded, Changes_When_Casemapped
Numeric: Numeric_Value, Numeric_Type, Hex_Digit, ASCII_Hex_Digit
Normalization: Canonical_Combining_Class, Decomposition_Mapping, Composition_Exclusion, Full_Composition_Exclusion, Decomposition_Type, NFC_Quick_Check, NFKC_Quick_Check, NFD_Quick_Check, NFKD_Quick_Check, NFKC_Casefold, Changes_When_NFKC_Casefolded
Miscellaneous: Math, Quotation_Mark, Dash, Sentence_Terminal, Terminal_Punctuation, Diacritic, Extender, Grapheme_Base, Grapheme_Extend, Indic_Positional_Category, Indic_Syllabic_Category
Shaping and Rendering: Join_Control, Joining_Group, Joining_Type, Line_Break, Grapheme_Cluster_Break, Sentence_Break, Word_Break, East_Asian_Width, Prepended_Concatenation_Mark
Bidirectional: Bidi_Class, Bidi_Control, Bidi_Mirrored, Bidi_Mirroring_Glyph, Bidi_Paired_Bracket, Bidi_Paired_Bracket_Type
CJK: Ideographic, Unified_Ideograph, Radical, IDS_Binary_Operator, IDS_Trinary_Operator, Unicode_Radical_Stroke
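A few of these properties can be inspected directly from Python's standard `unicodedata` module. The sketch below is only an illustration: the helper name `describe` is made up here, the module covers a small subset of the properties listed, and its `decomposition()` result combines Decomposition_Type and Decomposition_Mapping in one string.

```python
import unicodedata


def describe(ch: str) -> dict:
    """Collect the handful of properties that unicodedata exposes for one character."""
    return {
        "Name": unicodedata.name(ch, "<unnamed>"),
        "General_Category": unicodedata.category(ch),
        "Canonical_Combining_Class": unicodedata.combining(ch),
        "Bidi_Class": unicodedata.bidirectional(ch),
        "Bidi_Mirrored": bool(unicodedata.mirrored(ch)),
        "East_Asian_Width": unicodedata.east_asian_width(ch),
        # Decomposition_Type (if any) and Decomposition_Mapping, as one string
        "Decomposition": unicodedata.decomposition(ch),
    }


if __name__ == "__main__":
    # A letter, a precomposed letter, a combining mark, and a mirrored bracket
    for ch in ("A", "\u00e9", "\u0301", "("):
        print(f"U+{ord(ch):04X}", describe(ch))
```

The emoji properties on the next slide are not available through `unicodedata`; they would have to come from the emoji data files or a library such as ICU.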
Emoji Properties
Property               Characters that …
Emoji                  are emoji
Emoji_Presentation     have emoji presentation by default
Emoji_Modifier         are emoji modifiers (skin tones)
Emoji_Modifier_Base    are bases for emoji modifiers
Keycaps
#
Flags
AC
Skin Tones
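The three sequence types named on these slides can be sketched as plain string concatenation. This is a minimal illustration, not a validator: the function names are hypothetical, `flag` does not check that the region code is one of the valid ones, and `with_skin_tone` does not check that the base has the Emoji_Modifier_Base property.

```python
VS16 = "\uFE0F"          # VARIATION SELECTOR-16, requests emoji presentation
KEYCAP = "\u20E3"        # COMBINING ENCLOSING KEYCAP
KEYCAP_BASES = "#*0123456789"
RI_A = 0x1F1E6           # REGIONAL INDICATOR SYMBOL LETTER A
SKIN_TONES = [chr(cp) for cp in range(0x1F3FB, 0x1F400)]  # the five modifiers


def keycap(base: str) -> str:
    """Keycap sequence: base character + VS16 + enclosing keycap."""
    if len(base) != 1 or base not in KEYCAP_BASES:
        raise ValueError(f"not a keycap base: {base!r}")
    return base + VS16 + KEYCAP


def flag(region: str) -> str:
    """Flag sequence: one regional indicator per ASCII letter of the region code."""
    region = region.upper()
    if len(region) != 2 or not all("A" <= c <= "Z" for c in region):
        raise ValueError(f"expected a two-letter region code: {region!r}")
    return "".join(chr(RI_A + ord(c) - ord("A")) for c in region)


def with_skin_tone(base: str, tone_index: int) -> str:
    """Modifier sequence: base emoji followed by one of the five modifiers."""
    return base + SKIN_TONES[tone_index]


def codepoints(s: str) -> str:
    """Render a string as a list of U+XXXX code points."""
    return " ".join(f"U+{ord(c):04X}" for c in s)


if __name__ == "__main__":
    print(codepoints(keycap("#")))                       # U+0023 U+FE0F U+20E3
    print(codepoints(flag("AC")))                        # U+1F1E6 U+1F1E8
    print(codepoints(with_skin_tone("\U0001F44B", 0)))   # U+1F44B U+1F3FB
```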
How many? (Emoji 5.0β)
Type                                       Count   Subtotal
Singletons                                 1,126      1,126
Incomplete* Singletons                       -38      1,088
Emoji_Combining_Sequence: Keycaps             12      1,100
Emoji_Flag_Sequence: Valid RI sequences      258      1,358
Emoji_Modifier_Sequence: Skin tones          465      1,823
Emoji ZWJ Sequence
ZWJ
ZWJ
Gendered w/ Object
ZWJ
Gendered w/ Sign
ZWJ
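A ZWJ sequence is a run of emoji joined by U+200D ZERO WIDTH JOINER. The sketch below builds one example of each kind shown so far; the specific emoji are chosen here for illustration (the images from the slides did not survive), and `zwj_sequence` is a hypothetical helper name.

```python
ZWJ = "\u200D"            # ZERO WIDTH JOINER
VS16 = "\uFE0F"           # VARIATION SELECTOR-16

MAN = "\U0001F468"
WOMAN = "\U0001F469"
GIRL = "\U0001F467"
ROCKET = "\U0001F680"
RUNNER = "\U0001F3C3"
FEMALE_SIGN = "\u2640"


def zwj_sequence(*parts: str) -> str:
    """Join emoji (or emoji + VS16) with ZWJ into one sequence."""
    return ZWJ.join(parts)


# Family: man + woman + girl
family = zwj_sequence(MAN, WOMAN, GIRL)
# Gendered role, with object: woman + rocket (woman astronaut)
astronaut = zwj_sequence(WOMAN, ROCKET)
# Gendered activity, with sign: runner + female sign (woman running)
woman_running = zwj_sequence(RUNNER, FEMALE_SIGN + VS16)

if __name__ == "__main__":
    for name, seq in [("family", family), ("astronaut", astronaut),
                      ("woman running", woman_running)]:
        print(name, " ".join(f"U+{ord(c):04X}" for c in seq))
```

On a platform without a glyph for the joined sequence, the parts fall back to displaying as separate emoji.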
UI Actions: Backspace Example
ZWJ
Segmentation
ZWJ
Don’t break!
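The backspace and segmentation slides come down to one rule: UI actions should treat a whole emoji sequence as a single unit. The sketch below is a deliberately simplified approximation written for this transcript, not the full Unicode text segmentation algorithm: it glues ZWJ, VS16, the enclosing keycap, skin-tone modifiers, and tag characters onto the preceding cluster, glues anything that follows a ZWJ, and pairs regional indicators. The names `clusters` and `backspace` are hypothetical.

```python
ZWJ = 0x200D
VS16 = 0xFE0F
KEYCAP = 0x20E3


def is_extender(cp: int) -> bool:
    """Code points that never start a new cluster in this simplified model."""
    return (cp in (ZWJ, VS16, KEYCAP)
            or 0x1F3FB <= cp <= 0x1F3FF      # skin-tone modifiers
            or 0xE0020 <= cp <= 0xE007F)     # tag characters and CANCEL TAG


def is_ri(cp: int) -> bool:
    """Regional indicator symbols A..Z."""
    return 0x1F1E6 <= cp <= 0x1F1FF


def clusters(text: str) -> list:
    """Split text into user-perceived units (simplified emoji-aware rule)."""
    out = []
    for ch in text:
        cp = ord(ch)
        glue = False
        if out:
            prev = out[-1]
            if is_extender(cp):
                glue = True                               # don't break before an extender
            elif ord(prev[-1]) == ZWJ:
                glue = True                               # don't break after ZWJ
            elif is_ri(cp) and is_ri(ord(prev[-1])):
                # Pair regional indicators two at a time
                glue = sum(is_ri(ord(c)) for c in prev) % 2 == 1
        if glue:
            out[-1] += ch
        else:
            out.append(ch)
    return out


def backspace(text: str) -> str:
    """Delete the last cluster rather than the last code point."""
    return "".join(clusters(text)[:-1])


if __name__ == "__main__":
    astronaut = "\U0001F469\u200D\U0001F680"
    flag_ac = "\U0001F1E6\U0001F1E8"
    wave = "\U0001F44B\U0001F3FB"
    text = "a" + astronaut + flag_ac + wave
    print(len(text), "code points,", len(clusters(text)), "clusters")
    print(backspace(text) == "a" + astronaut + flag_ac)
```

A production implementation would normally rely on a library that implements the standard segmentation rules instead of a hand-written rule like this one.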
ZWJ Sequences (Emoji 5.0β)
Type                            Count   Subtotal
Family                             31      1,854
Gendered Role, with object        192      2,046
Gendered Role, with sign           72      2,118
Gendered Activity, with sign      216      2,334
Gendered Gestures, with sign      108      2,442
Other                               2      2,444
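The Subtotal column in both count tables is a running sum of the Count column, carried over from the singletons table into the ZWJ table. A short check, using only the figures from the slides (the variable names are illustrative):

```python
from itertools import accumulate

# (type, count) rows from the two "Emoji 5.0β" count slides, in order
rows = [
    ("Singletons", 1126),
    ("Incomplete singletons", -38),
    ("Keycaps", 12),
    ("Flags (valid RI sequences)", 258),
    ("Skin tones", 465),
    ("Family", 31),
    ("Gendered role, with object", 192),
    ("Gendered role, with sign", 72),
    ("Gendered activity, with sign", 216),
    ("Gendered gestures, with sign", 108),
    ("Other", 2),
]

# Running subtotal after each row
subtotals = list(accumulate(count for _, count in rows))

if __name__ == "__main__":
    for (name, count), subtotal in zip(rows, subtotals):
        print(f"{name:<30} {count:>6,} {subtotal:>7,}")
```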
CLDR Names, Keywords
unicode.org/cldr/charts/30/annotations
Name                Keywords
nerd face           face | geek | nerd
nørdansigt          ansigt | nørd
nerderig gezicht    geek | gezicht | nerd
Nerd-Smiley         Gesicht | Nerd
nördaandlit         andlit | lúði | nörd
Birth of an emoji character
< Oct.
Nov.
Mar.
June
> July
β
http://unicode.org/emoji/selection.html
Benefiting Digitally Disadvantaged Languages (DDLs)
1. Emoji pushed products to improve their Unicode handling!
2. And led us to launch a fundraising program for DDLs:
Old slides
Future: Flags?
F g b s c t ✦
Tag Key
Region = United Kingdom (Country, …)
Tag Value
Tag Base
Subdivision = Scotland (Province, State, Canton, ...)
Tag Term
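The layout on this slide (tag base, then one tag character per letter of the region and subdivision, then a terminator) can be sketched as follows. The concrete code points are assumptions not stated on the slide: U+1F3F4 WAVING BLACK FLAG as the base, tag characters at U+E0000 plus the ASCII value, and U+E007F CANCEL TAG as the terminator. `subdivision_flag` is a hypothetical helper name.

```python
TAG_BASE = "\U0001F3F4"     # assumed tag base: WAVING BLACK FLAG
TAG_OFFSET = 0xE0000        # tag character = TAG_OFFSET + ASCII code point
TAG_TERM = "\U000E007F"     # assumed tag terminator: CANCEL TAG


def subdivision_flag(code: str) -> str:
    """Build a tag sequence from a code such as "GB-SCT" (region + subdivision)."""
    key = code.lower().replace("-", "")
    if not key or not (key.isascii() and key.isalnum()):
        raise ValueError(f"unexpected subdivision code: {code!r}")
    tags = "".join(chr(TAG_OFFSET + ord(c)) for c in key)
    return TAG_BASE + tags + TAG_TERM


if __name__ == "__main__":
    scotland = subdivision_flag("GB-SCT")   # g b s c t, as on the slide
    print(" ".join(f"U+{ord(c):04X}" for c in scotland))
```

The sketch does not check that the code names a real subdivision, and whether a given sequence renders as a flag depends on platform support.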
Gendered Activity/Gesture, with sign
ZWJ