low-power audio classification for hearing-aids · 2011. 8. 6. · 010010001000...

18
010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson Assistant Professor Georgia Institute of Technology [email protected] SPRP497

Upload: others

Post on 24-May-2021

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

01001000100001001000100000110000001000001100

Low-Power Audio Classification For Hearing-Aids

David V. Anderson Assistant ProfessorGeorgia Institute of [email protected]

SPRP497

Page 2: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

2

Project Objective

Automatic tuning of electro-acoustic response of hearing-aids to suit the audio environment

We perform classification of the auditory environment to enable tuning of the hearing-aid

Requirements Small SizeLow-powerHigh accuracyLow false alarms

Page 3: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

3

Desirable QualitiesInter-class variability

Features should provide good inter-class discrimination but still maintain intra-class cohesion

Features must be robust to noise

Granularity IssueTrade-off between complexity of system and granularity of classes

Real-time response Computationally efficient classification structures and feature extraction algorithms

Perfo

rman

ce Response

Complexity

Page 4: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

4

Some Current Classification Approaches

Easy to implement but accuracy in adverse conditions may not be very

good.

LowFairHeuristics

Computationally expensive. Is essentially a binary classifier.

HighVery GoodSVM

Computationally expensive. Usually use GMMs for probability estimates.

HighGoodHMM

Does not handle high dimensional data well

ModerateGoodGMM

CommentsComplexityAccuracyMethods

Page 5: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

5

Proposed Approach

Robust feature extractionBased on an advanced model of the human auditory system.

Very efficient algorithm for classification based on AdaBoost

Final classifier can be implemented using MAC and a comparator.

Page 6: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

6

Problems with Conventional Features

Work well in noise free case but performance degrades in presence of noiseAccuracy is reduced greatly when different classes are presented simultaneously

Humans do an extremely good job of classifying soundsPhysiologically inspired perceptual features are

Highly discriminative Robust to noise

Why auditory modeling?

Page 7: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

7

Auditory Modeling Based on Modulation Spectrum Theory

Input h(t;s) ∫T∂t ∂sg( ) w(t) HWRv(s)

Cochlea Hair cell stage Cochlear nucleus

Page 8: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

8

Response Fields of Neurons in the Cortex*

*Shamma et al.

Page 9: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

9

Hardware Implementation

Spatial-Temporal Filtering

BPF BPFBPFBPF ...

∑ ∑ ∑ ∑

HWR HWRHWRHWR ...

Input

LPF LPFLPFLPF ...

- - -

4-D Feature Space

Page 10: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

10

AdaBoost Classifier

Given examples (x1, y1),….,(x2, y2) where yi = 0,1 for negative and positive examples respectively.Initialize weights w1,i = 1/(2m), 1/(2l) for yi = 0,1 respectively, where m and l are the number of negatives and positives respectively.For t = 1 to T

1. Normalize weights,

2. Train hj ; error, 3. Choose classifier , with the least 4. Update weights:

ei = 0 if xi if classified correctly, 1 otherwise

)/( ,, jtjit www Σ=|)(| iijit yxh −∑=ε

th tε)1(

,,1 )( ietitit ww −

+ = β

)1/( ttt εεβ −=

Page 11: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

11

where, αt = log(1/βt)

1)( =xh ∑∑==

≥T

tttt

T

txh

11)2/1()( αα

The final strong classifier is:The final strong classifier is:

if

= 0 else

The AdaBoost classifiertries to find the decision boundaryfor the classification task by combining the multiple hypothesisbased on single features.

Page 12: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

12

AuditoryModel

ConventionalFeatures

FeaturePool

LearningAlgorithm

ClassifiedCategory

ClassifierInput

OfflineTraining

Overall Structure of Proposed System

During offline training, the weights (alpha's) needed to combine the features to form the decision function are learned. The multi-class problem is structured as a combination of binary classification problems and the results are combined by majority voting.

Page 13: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

13

Results - Matlab

95.8 %87.7 %78.85 %Overall

Version 2(30 sec

data)

Version 1(1 sec

data)

Phonak(30 sec data)

95.5 %92.7 % Overall

AdaBoostGMM

Phonak Database (Music, Speech, Noise, Speech in Noise)

Tel-03 Database (Animal Vocalizations, Speech, Music, Noise)

Phonak

AdaB

oost

AdaB

oost

30 se

cs

AdaB

oost

1 sec

30 se

csAdaB

oost

GM

M

Page 14: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

14

Hardware Implementation

In order to reduce the complexity of the feature extraction and to enable ease of implementation, some modifications were incorporated

The frontend bandpass filters were replaced by an FFT and a mel-cepstra like processing was implemented to extract the auditory spectrum.

Page 15: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

15

DSP Implementation

FFT

. . . .

Spatio-Temporal Filtering

Amplitude Compression

Features

Page 16: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

16

Simulation results with the new feature set showed no overall degradation in classification accuracy compared to the original feature set.

78.02 %88.51 %Noisy Speech

86.82 %85.96 %Overall

90.52 %79.74 %Speech

79.02 %77.03 %Noise

99.71 %97.84 %Music

New Features(after modification for

hardware implementation)

Original featuresCategory

Note: Drop in noisy speech performance is due to the use of FFT and mel-scale grouping.

Page 17: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

17

C5510 Specifications

Sampling rate: 8 kHzFor feature extraction (for 1 second segment):

Size of data: 13 k wordsSize of our code: 4 k wordsSize of entire code: 16 k wordsMIPS ?

For Classification85 coefficients85 MACs, 85 additions and 1 compare

Page 18: Low-Power Audio Classification For Hearing-Aids · 2011. 8. 6. · 010010001000 01001000100000110000001000001100 Low-Power Audio Classification For Hearing-Aids David V. Anderson

18

Low-Power Audio Classification For Hearing-Aids

David V. Anderson

Assistant ProfessorGeorgia [email protected]