survey on sl recognition

Upload: kashif-aziz-awan

Post on 06-Apr-2018

222 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/3/2019 Survey on SL Recognition

    1/38

    Presented By:Sumaira Kausar

  • 8/3/2019 Survey on SL Recognition

    2/38

    IntroductionIn daily life, human beings communicate with eachother and interact with computers using gestures. As a

    kind of gesture, Sign Language (SL) is the primarycommunication media for deaf people.

  • 8/3/2019 Survey on SL Recognition

    3/38

    IntroductionSign Language:

    In taxonomies of communicative hand/arm

    gestures, Sign Language (SL) is often considered asthe most structured form of gesture, while gesturesthat accompany verbal discourse are described asthe least standardized. As non-SL gestures oftenconsist of small limited vocabularies, they are not a

    useful benchmark to evaluate gesture recognitionsystems. However, SL on the other hand can offer a good benchmark to evaluate different gesturerecognition systems because it consists of large andwell-defined vocabularies.

  • 8/3/2019 Survey on SL Recognition

    4/38

    Relevant Research in Sign

    Language RecognitionResearch can be grouped in two broad groups on thebasis of signs to be recognized.

    y Static Signs

    y Dynamic/Continuous Signs

  • 8/3/2019 Survey on SL Recognition

    5/38

    Static Signs

    OmerRashid & et. al. [1]UsedSupport Vector Machine

    to recognize static one hand finger spellings ofAmerican Sign Language (ASL) and taken geometricaldescriptions of fingers as features along with Humoment andZernike moment for recognition.

    y Only static alphabets and numbers in ASL are

    recognized.y Features selected are invariant in terms of rotation,

    scaling and translation.

    y Accuracy rate is 96.2% to 98.6%.

  • 8/3/2019 Survey on SL Recognition

    6/38

    Static Signs

    Ali Karami and et. al [7] have recognized thirty two

    static alphabets of Persian Sign Language. Multilayered perceptron Neural Networkfor classification.

    y Instead of applying segmentation technique, manuallycropped image is used.

    y Recognition is performed on bare handsy Accuracy rate is 94.06%.

  • 8/3/2019 Survey on SL Recognition

    7/38

    Static Signs

    Ravikiran and et. al [8] have usedboundary tracing and

    finger tip detection to recognize one handed staticfinger spellings of American Sign Language.

    y No input gloves or colored markers are used.

    y Subset of ASL alphabets is recognized.

    y Classification is based on open fingers in a gesture.

    y Technique does not require perfect alignment of handto the camera.

  • 8/3/2019 Survey on SL Recognition

    8/38

    Static Signs

    Kelly and et. al [9] have recognized static one handed postures using Support Vector Machine using acombination ofeigen space function andHu moments

    features.

    y Technique is robust and independent of particularsigners.

    y Two different data sets are used for this purpose.

  • 8/3/2019 Survey on SL Recognition

    9/38

  • 8/3/2019 Survey on SL Recognition

    10/38

    Dynamic/ Continuous Signs

    Hong Li and et. al [2] have been able to recognize

    dynamic single hand and two arms signs using contourbased features.

    y The main focus of the research is end pointlocalization.

    y Dynamic programming is used for this purpose.y Attained recognition rate is about 88% to 96%.

  • 8/3/2019 Survey on SL Recognition

    11/38

    Dynamic/ Continuous Signs

    Dynamic programming is being used by Yang and et. al.

    [5] to recognize dynamic signs of American SignLanguage

    y Controlled environment is used.

    y Special dressing scheme and background requirements

    are used for accurate recognition of signs.y Results are compared with ConditionalRandom Field

    CRF and latent dynamic CRF and an improvement is40% in terms of frame labeling rate.

  • 8/3/2019 Survey on SL Recognition

    12/38

    Dynamic/ Continuous Signs

    Dynamic Indian Sign Language of two handed signs is

    recognized by Anup Nandy and et. al [6]. They haveused direction histograms, Euclidean distance and k-nearest neighbor metric for classification.

    y The technique is illumination and orientation

    invariant.

  • 8/3/2019 Survey on SL Recognition

    13/38

    Dynamic/ Continuous Signs

    Eight two handed gestures are detected and recognized

    from videos of continuous signs, by Kelly and et. al [10]using movement epenthesis. Hidden Markov Model isused by the researchers for recognition process.

    y The detection accuracy rate attained is 95.6 %

    y Recognition accuracy rate is 93.2%.

  • 8/3/2019 Survey on SL Recognition

    14/38

    Dynamic/ Continuous Signs

    Combination of Confidence Measure (CM) obtained

    from aligning forward sign models (the conventionalapproach) to the input data with the CM's obtained

    from aligning reversed sign models to the same input,is used by Zafrullah and et. al [11] for the verification of

    continuous phrases of American Sign Language.y Test results shown that this combination of CMs yields

    improvement in verification accuracy as compared tothe traditional approach.

  • 8/3/2019 Survey on SL Recognition

    15/38

    Dynamic/ Continuous Signs

    Quantification of facial expressions along with

    tracking of hand shapes, based on skin color, isperformed in a research conducted by Justus and et. al[12] for continuous sign language recognition.

  • 8/3/2019 Survey on SL Recognition

    16/38

    Dynamic/ Continuous Signs

    Zahoor Zafrullah and et. al [13] performed real-time

    American Sign Language (ASL) phrase verificationusing Hidden Markov Model for an educational game,CopyCat, which is designed to improve deaf children'ssigning skills.

    y

    They achieved a phrase verification accuracy of 83% .

  • 8/3/2019 Survey on SL Recognition

    17/38

    Dynamic/ Continuous Signs

    Huang and et. al [14] have usedHidden Markov Model

    for recognizing Taiwanese dynamic signs, and FiniteState Machine for verification of these signs.

  • 8/3/2019 Survey on SL Recognition

    18/38

    Dynamic/ Continuous Signs

    Kelly and et. al [16] implementedConditionalRandom

    Field (CRF) based system and compared this to a HMMbased system and got improved results with CRF, whenrecognizing motion gestures and identifying inter

    gesture transitions.

  • 8/3/2019 Survey on SL Recognition

    19/38

    Dynamic/Continuous Signs

    Nguyen and et. al [17]proposed to use a 2-layer

    Conditional Random Field model for recognizingcontinuously signed grammatical markers in ASL.

    y Grammatical markers have facial feature movementsand head movements.

    y Accuracy rate of the proposed scheme is about 93.76% .

  • 8/3/2019 Survey on SL Recognition

    20/38

    Analysis of Existing ResearchCritical analysis of the existing research have shown thatthere are many shortcomings and inherent challenges of thearea. These shortcomings are:

    y

    Segmentationy Unrestricted environmenty Size of dictionaryy Invariancey Variety of Gesturesy generalityy Start/end identification of gesture sequencesy Feature selectiony Feature extraction

  • 8/3/2019 Survey on SL Recognition

    21/38

    Segmentation Problemsy External Aid

    y Data gloves

    y

    Color gloves

    y Skin Color Based

    y Illumination

    y Background colors

    y Area of Interest segregation

    y Wide range of skin colors

  • 8/3/2019 Survey on SL Recognition

    22/38

    Unrestricted Environment

    y Signers Background

    y Signers Clothing

    y Preprocessed images

  • 8/3/2019 Survey on SL Recognition

    23/38

    Size of dictionary

    Size of dictionary used in existing researches is usually

    very small. Majority researches have used onlyalphabets of some language that are usually static onehanded signs. In case of continuous sign eight or ten

    phrases are used for this purpose.

  • 8/3/2019 Survey on SL Recognition

    24/38

    Invariance

    y Rotation

    y Translationy Scaling

    y Illumination

  • 8/3/2019 Survey on SL Recognition

    25/38

    Varietyof Gesturesy One handed

    y Two handed

    y Armsy Facial expressions

    y Eye gaze

    y Head and body postures

  • 8/3/2019 Survey on SL Recognition

    26/38

    Generality

    y Feature selection is based on particular SL and selectedsubset of the dictionary

  • 8/3/2019 Survey on SL Recognition

    27/38

    Motion Gestures

    y Start/ end identification of gesture sequences

    y Motion identification

    y Isolated sign recognition

  • 8/3/2019 Survey on SL Recognition

    28/38

    Feature Selection

    y Minimizing Feature set

    y Robust feature Selection

    y Features based on local information

  • 8/3/2019 Survey on SL Recognition

    29/38

    Feature Extraction

    y Complex background

    y Colored and data gloves

  • 8/3/2019 Survey on SL Recognition

    30/38

    Guidelines for future Researchy It is a better approach to not use any external aid for

    segmentation, rather segmentation should be vision Based.For coping up problems with skin color based

    segmentation, it is recommended to use a combination ofskin color based features along with geometrical features.This can lead to the formulation of a better segmentationtechnique.

    y Techniques for classification and segmentation should beadopted which are not putting much restrictions on the

    environment of the signer. e.g. if segmentation is usingboth geometrical and skin color based features, thenrestrictions on background colors and signer's clothes canbe minimized.

  • 8/3/2019 Survey on SL Recognition

    31/38

    Guidelines for future Researchy Dictionary used for future researches should be larger. For

    this purpose more focus is required on the feature selectionprocess. Such features should be used which are general

    and can be applicable for a broader set of signs. So a largerset of dictionary of SL should be accommodated by thefuture researches.

    y Transformation and illumination invariance is a required

    feature for the future researches in the field of signlanguage recognition systems. Incorporating classificationtechnique with some invariant methodologies such asgeometrical moments can be helpful in this respect.

  • 8/3/2019 Survey on SL Recognition

    32/38

    Guidelines for future ResearchVariety of gestures should be accommodated in SL

    recognition systems. As this variety of gestures is an

    important aspect that should be catered for in thefuture work in the field. For this purpose special focusand extensive work is required on feature selection andfeature extraction. Multilayered and multidimensionalfeature set is required to be developed. Specialdimensionality reduction techniques then required toapply an efficient classification methodology.

  • 8/3/2019 Survey on SL Recognition

    33/38

    Guidelines for future ResearchFor an efficient classification solution, it is required to

    minimize the feature set while accommodatingmaximum possible set of sign dictionary andmaximum types of gestures. It is also desirable yetchallenging to select features which are robust to covera larger set of dictionary and type of gestures. It is alsorequired for a robust set of features that it relies on

    local information of signer without relying on someexternal cues or markers. So the feature sets defined in

    future techniques should accommodate thesechallenges.

  • 8/3/2019 Survey on SL Recognition

    34/38

    Guidelines for future ResearchFeature extraction challenges should be addressed

    appropriately for the future applications. Such features

    should be selected that are not disturbed by signersbackground so these features can be correctly

    extracted.

  • 8/3/2019 Survey on SL Recognition

    35/38

  • 8/3/2019 Survey on SL Recognition

    36/38

    References1]Omer Rashid, Ayoub alHamadi,Bernd Michelis,Utilizing invariant descriptors for finger spelling American SignLanguage using SVM, ISVC 10, Proceedings of 6th international conference on advances of visual computing,Volume part 1.

    [2] Hong Li and Michael Greenspan, Model-based segmentation and recognition of dynamic gestures incontinuous video streams , Science direct pattern recognition corrected proof 2010.

    [3] Trmal Jan and Marek Hrz : Evaluation of Feature Space Transforms for Czech Sign-Language Recognition .MetaCentrumYearbook 2010, p. 145-150, 2010.

    [4]Aleem Khalid Alvi, M. Yousuf Bin Azhar, Mehmood Usman, Suleman Mumtaz, Sameer Rafiq, RaziUr Rehman,Israr Ahmed, Pakistan Sign Language Recognition Using Statistical Template Matching, World Academy ofScience, Engineering and Technology 3 2005.

    [5] Yang R., Sarkar S., Loeding B., Handling Movement Epenthesis and Hand Segmentation Ambiguities inContinuous Sign Language Recognition Using Nested Dynamic Programming ,Pattern Analysis and MachineIntelligence, IEEE Transactions on March 2010,volume 32 Issue:3 pp 462 477.

    [6]Anup Nandy, Jay Shankar Prasad, Soumik Mondal, Pavan Chakraborty and G. C. Nandi, Recognition ofIsolated Indian Sign Language Gesture in Real Time ,Information Processing and ManagementCommunications in Computer and Information Science, 2010, Volume 70, 102-107, DOI: 10.1007/978-3-642-12214-9_18

    [7]Ali Karami, ahman Zanj and Azadeh kiani, Persian sign language (PSL) recognition using wavelet transformand neural networks ,Expert Systems with Applications

    Volume 38, Issue 3, March 2011, Pages 2661-2667

  • 8/3/2019 Survey on SL Recognition

    37/38

    References[8] J. Ravikiran, Kavi Mahesh, Suhas Mahishi, R. Dheeraj, S. Sudheender and Nitin V. Pujari, AutomaticRecognition of Sign Language Images ,Intelligent Automation and Computer Engineering LectureNotes in Electrical Engineering, 2010, Volume 52, 321-332, DOI: 10.1007/978-90-481-3517-2_25Computer Science, 2011, Volume 6495/2011, 665-676,

    [9] KellyD., McDonald J., Markham C., A person independent system for recognition of hand posturesused in sign language, Pattern Recognition Letters Volume 31, Issue 11, 1 August 2010, Pages 1359-1368

    [10] KellyD., McDonald J., Markham C., Continuous recognition of motion based gestures in signlanguage , Computer Vision Workshops (ICCV Workshops), 2009 IEEE 12th InternationalConference on Sept. 27 2009-Oct. 4 2009, Kyoto,pp 1073 1080.

    [11] Zafrulla Z., Brashear H., Hamilton H., Starner T.,"A novel approach to American Sign Language(ASL) phrase verification using reversed signing", Computer Vision and Pattern RecognitionWorkshops (CVPRW), 2010 IEEE Computer Society Conference San Francisco, CA,, 13-18 June 2010 ,pp 48 55.

    [12] Justus Piater, Thomas Hoyoux, Wei Du, Video Analysis for Continuous Sign LanguageRecognition,4th Workshop on the Representation and Processing of Sign Languages: Corpora andSign Language Technologies, 2010 (Workshop at the 7th International Conference on LanguageResources and Evaluation (LREC), Malta)

  • 8/3/2019 Survey on SL Recognition

    38/38

    References[13] Zahoor Zafrulla, Helene Brashear, Pei Yin , Peter Presti, Thad Starner, HarleyHamilton, American Sign Language Phrase Verification in an EducationalGame for Deaf Children ,20th International Conference on PatternRecognition, Istanbul, Turkey ,August 23-August 26 2010.

    [14] Chung-Lin Huang; Bo-Lin Tsai; , A Vision-Based Taiwanese Sign LanguageRecognition, Pattern Recognition (ICPR), 2010 20th International

    Conference,Istanbul ,Turkey , 23-26 Aug. 2010, PP 3683 3686.[16] KellyD., McDonald J., Markham C., Evaluation of threshold model HMMS

    and Conditional Random Fields for recognition of spatiotemporal gestures insign language ,Computer Vision Workshops (ICCV Workshops), 2009 IEEE12th International Conference on Sept. 27 2009-Oct. 4 2009, Kyoto, pp 490 497.

    [17] Tan Dat Nguyen and Surendra Ranganath, Recognizing Continuous

    Grammatical Marker Facial Gestures in Sign Language Video ,ComputerVision ACCV 2010 Lecture Notes in Computer Science, 2011, Volume6495/2011, 665-676

    [18] Sumaira Kausar, M. Younus,Javed, Shaleeza,Sohail: Recognition of gesturesin Pakistani sign language using fuzzy classifier , 8th conference on Signalprocessing, computational geometry and artificial vision 2008, Rhodes, Greece,Pages 101-105