speakers identification system for core networks using hadoop

Upload: tahreem-khalid

Post on 05-Apr-2018

214 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    1/31

    SPEAKERS IDENTIFICATION

    SYSTEM FOR CORE NETWORKSUSING HADOOP CLUSTERS

    PROJECT SUPERVISOR:

    DR.SHOAB A.KHAN

    GROUP MEMBERS:

    AMSAL NAEEM

    TAHREEM KHALID

    RAHEEL MUMTAZ

    DE 30 (CE)

    COLLEGE OF E&ME, NUST

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    2/31

    OUTLINEO Motivation & problem definition

    O System level design

    O AlgorithmsO Implementation

    O Results

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    3/31

    MOTIVATIONO Security

    O Terrorism

    O Keep a check on important peopleO Political issues

    O Cricket

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    4/31

    Terrorist Incidents

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    5/31

    Match Fixing in Cricket

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    6/31

    PROBLEM SOLUTIONDesign a system that can

    O Process large number of calls at a time

    O Process large data setsO Identify the person an a particular call

    O Monitor the communication without

    interruption

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    7/31

    SYSTEM LEVEL DESIGNO Components

    O GUI explanation

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    8/31

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    9/31

    Recoding FeatureExtraction

    Matching

    Database

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    10/31

    Graphical User Interface

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    11/31

    Graphical User Interface

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    12/31

    Design And ImplementationO Training

    O Testing

    O Hadoop architectureO Hbase architecture

    O Speaker identification

    O Mfcc

    O Vector quantization

    O Matching (distance measurement)

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    13/31

    IN N

    IN 1 IN 2

    OUTPUT

    HADOOP MASTER

    MASTERTaskTracker

    SLAVE 1TaskTracker

    SLAVE 2TaskTracker

    SLAVE 3TaskTracker

    SLAVE 4TaskTracker

    MAP MAP MAP MAP MAP

    SORT SORT SORT SORT SORT

    MERGE

    REDUCE

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    14/31

    TRAINING

    O The voice samples are

    recorded in Matlab

    O MFCC Features are

    extracted from the voice

    input.

    O Vector Quantization

    using K-Means is done

    to reduce feature

    vectors.

    O Insert above feature

    vectors to HBase table.

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    15/31

    TESTINGO Input recordedO MFCC Features are

    extracted

    O Features input to

    HADOOP Cluster.

    O Distributed inputs to allmachines

    O MapReduce tasks on

    each TaskTracker.

    O Euclidean distance is

    measured

    O The output consists of

    the most likely matched

    speaker.

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    16/31

    HADOOP ARCHITECTURE

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    17/31

    MAPREDUCE WORKING

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    18/31

    HBASE ARCHITECTURE

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    19/31

    Speak

    er 1

    Speak

    er 2

    Speak

    er N

    UnknownSpeaker

    FEATUREEXTRACTION

    SPEAKER

    MODELLING

    MATCHING

    FEATURE

    EXTRACTION

    DATABASESPEAKER

    IDENTIFIED

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    20/31

    Mel Frequency Cepstral

    CoefficientsSamplin

    g

    Log

    Mel-Frequenc

    yWarping

    InverseDFT

    Framing &Windowin

    g

    DiscreteFourierTransfor

    m

    Mel

    Cepstru

    m

    Voice

    Signal

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    21/31

    MFCCO

    Hamming Window

    where 0 n N-1

    N=length of frame

    O Discrete Fourier Transform

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    22/31

    MFCC

    O Mel-Frequency warping

    O Inverse Discrete Fourier Transform

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    23/31

    VECTOR QUANTIZATION

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    24/31

    NO

    K-Means Clustering

    Select K objects randomly from M data objectsto take as initial clustering centers

    Assign all data object to its nearest cluster center

    Update each center by averaging all of the points

    that have been assigned to it

    Stop

    Have centroids

    changed?

    YES

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    25/31

    MATCHINGEuclidean Distance

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    26/31

    RESULTSO Graphs

    O Comparisons

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    27/31

    COMPARISON

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    28/31

    COMPARISON

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    29/31

    COMPARISON

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    30/31

    RESULTS

  • 7/31/2019 Speakers Identification System for Core Networks Using Hadoop

    31/31

    CONCLUSIONO Summary of project

    O Summary of results

    O Future extension