binaural approach in acoustic scene simulations in audio forensics

Upload: francesca-ortolani

Post on 03-Jun-2018

226 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/12/2019 Binaural Approach in Acoustic Scene Simulations in Audio Forensics

    1/14

    Francesca Ortolani - Binaural approach in acoustic scene simulations in audio forensics

    Radio Shack School of Audio Technologies 2014

    1

    Binaural approach in acoustic scene simulations in audio forensics

    Francesca OrtolaniIronbridge Electronics

    [email protected]

    Concept

    Gunshots, steps, door noises, vehicles, barking dogs each sound could be a critical proof on thecrime scene. Along with acoustic measurements (such as reverberation time, wall isolation,

    outdoor noise), binaural miking in acoustic simulations is indeed a powerful instrument fortesting a witness capabilities in hearing sound evidences and check for their direction of arrival onthe crime scene. This article is a short introduction to binaural audio and reverberation issues.

    1. Introduction and background

    !inaural recording is a "# audio techni$ue ai ed at the %la&back through head%hones or a %air ofs%eakers. 't should not be istaken for stereo%honic sound( )hich rather refers to a

    bidi ensional soundsca%e. !inaural iking ai s at rendering the e*act %osition of the sourcesrelati+e to the listener in a "# en+iron ent( that is( sound is all around the listener )earinghead%hones or being in front of the s%eakers. !inaural audio through s%eakers is uch oredifficult to i %le ent due to cross-talk bet)een the channels.

    There e*ist " %rinci%al techni$ues in order to get a binaural rendering, The use of a dumm head, an artificial hu an-like head e$ui%%ed )ith icro%hones in the

    ears( %laced at eardru s the du & head substitutes for the hu an head and itscharacteristic shado)( de%ending on the direction of arri+al of the i %inging front.

    The si ulation of a hu an head and the co %utation of a RT/ !ead "elated Transfer#unction .

    *%eri ents )ith real hu an heads and real %eo%le )earing icro%hone ca%sules in theears.

    The first tests in binaural audio go back to 1331. A %air of coal icro%hones )ere %laced in front ofthe stage of the %era in 5aris and s%aced like hu an ears. The acoustic signals )ere transducedand sent to destination b& tele%hone.Se+eral &ears later( in 16"1-"2( the researchers at !ell Tele%hone 7aboratories in 8e) 9erse& :SA

    founded so e of the ost i %ortant bases of electroacoustics. ar+e& /letcher( ;/ather ofstereo%honic sound

  • 8/12/2019 Binaural Approach in Acoustic Scene Simulations in Audio Forensics

    2/14

    Francesca Ortolani - Binaural approach in acoustic scene simulations in audio forensics

    Radio Shack School of Audio Technologies 2014

    2

    cur+es started in+estigating about the nature of sound( s%eech and listening and %atented thefirst edical acoustic hearing aid de+ice.

    The accurate descri%tion of each %art of an acoustic s&ste ( in a %articular )a& the s%eakingsource( the icro%hone( the electric trans ission line( the recei+er head%hones( louds%eaker orlistener soon beca e essential.

    The first binaural du & head )as called ; scar

  • 8/12/2019 Binaural Approach in Acoustic Scene Simulations in Audio Forensics

    3/14

    Francesca Ortolani - Binaural approach in acoustic scene simulations in audio forensics

    Radio Shack School of Audio Technologies 2014

    "

    The '7# e*hibits instead greater +ariabilit& changing the listener and it de%ends $uite criticall& onfre$uenc&. ThatHs )h& the '7# turns out to be ore useful to source localisation.

    Referring to /igure 1( the for ula for calculating the 'T# can be Iood)orth for ula e*tended forfre$uencies belo) 1.= k ? E"F,

    ( ) ( )sin

    , cosa

    ITD c

    += 1.1

    )here a is the radius of the head su%%osing it is s%herical ( c is the s%eed of sound about "44 Jsin air ( K is the a?i uth and * is ele+ation.

    Iood)orthHs original for ulation did not consider that the 'T# di inishes outdistancing thesource on the listenerHs hori?ontal %lane. The factor cos reckons )ith that.

    At fre$uencies abo+e 1.= k ? Iood)orthHs for ula gets less accurate.

    onsider another sensor& cue, the '5# ( nteraural +hase ifference) . Ihen the '5# is greater ore$ual to 130L( the source %osition is a biguous. This a biguit& is induced b& the %eriodic natureof %hase.

    Since the )a+elength for higher fre$uencies is shorter( it a& ha%%en that the distance bet)eenthe ears is greater than a )a+elength for ade$uatel& high fre$uencies M1.=k ? .This gi+es rise to spatial aliasing ( as entioned abo+e( and the %ercei+ed 'T# %ro+es to bea biguous( as the ear is res%onsi+e onl& to the %hase and not to ti e differences.

    Figure 1 Duple binaural model

  • 8/12/2019 Binaural Approach in Acoustic Scene Simulations in Audio Forensics

    4/14

    Francesca Ortolani - Binaural approach in acoustic scene simulations in audio forensics

    Radio Shack School of Audio Technologies 2014

    4

    So( for such fre$uencies the '7# su%%orting infor ation is necessar&. 'n other )ords( a highfre$uenc& signal M1.= k ? %roduces an 'T# greater than the signal %eriod( )hile a lo) fre$uenc&

    signal is such that( because of the 'T# ranging in a %eriod of the signal itself( the %hase difference%ercei+ed at the ear allo)s the listener to e+aluate the 'T# una biguousl&.

    The hu an listener takes ad+antage of the 'T# and '7# cues in order to decode the direction ofarri+al of the sound. 'n addition( the %resence of the %inna in the ear %roduces further ore acoloration of the sound( de%ending on the direction of arri+al of the sound )a+e./igure 2 sho)s the hu an ear anato &.'n general( localisation %erfor ance is e*cellent in the front stage of the hori?ontal %lane( good onthe rear and uch )orse on the +ertical %lane.

    ". #$%F ead-$elated %ransfer Function'

    'n linear s&ste s anal&sis the transfer function is defined as the co %le* ratio bet)een the out%utsignal s%ectru and the in%ut signal s%ectru . Then( the transfer function !( f ) of a linear ti e-in+ariant s&ste is defined as,

    ( ) ( )

    ( )Y f

    H f X f

    = 1.2

    )here ( f ) is the out%ut signal fre$uenc& s%ectru and -( f ) is the in%ut signal s%ectru .

    A transfer function or( e$ui+alentl& in ti e do ain( the impulse response characteri?es anddescribes the channel bet)een the source and the recei+er.The RT/ is the head related transfer function( a function of 4 +ariables (f, r, K( N ( )here f is thefre$uenc&( r is the distance of the source fro the listener( K and N are res%ecti+el& the a?i uthand the ele+ation.So( )e ha+e a left RT/ and a right RT/ and these describe the channel bet)een the ear and thesource relati+el& to the left and right ear.

    A RT/ atches onl& one configuration of source and recei+er and the left and right RT/s doe bed the 'T# and '7# infor ation.

    'n order to e*tract the 'T# and '7# fro the RT/s E4F )e co %ute the odulus of the interauraltransfer function ( )hich is defined as,

    ( ) ( )

    ( )

    = , ,

    , ,

    R

    r

    INT L

    r

    H H

    H 1."

    )here / 01f is the angular fre$uenc& and the su%erscri%ts " (& denote res%ecti+el& the right andleft channel. The odulus e*%ressed in d! of ( ) INT H is,

  • 8/12/2019 Binaural Approach in Acoustic Scene Simulations in Audio Forensics

    5/14

    Francesca Ortolani - Binaural approach in acoustic scene simulations in audio forensics

    Radio Shack School of Audio Technologies 2014

    =

    ( ) ( ) ( ) ( ) = = 10 , , , ,20log R L

    INT INT r r dB dB dB H H H H 1.4

    and it is an e*%ression of the '7# )ith +ar&ing fre$uenc&.The grou% dela& is defined as,

    ( ) ( ) ( )( )

    =

    R LINT d d

    d d 1.=

    )here ( ) ( ) ( ) ( ) ( ) ( ) = = = , , , ,, ,R L

    INT R r L r H H H re%resents the 'T# )ith

    +ar&ing fre$uenc&.

    The RT/ ethod does not re%lace the #u%le* odel( but su%%orts the latter in order to obtain abetter esti ation of the source %osition in the listening en+iron ent. Research brought to oreco %licated hu an odels and the !RT/ (2od $"elated Transfer #unction) )ere in+ented.These ne) transfer functions describe hu an bod& +ibrations and their trans ission to theauditor& s&ste thus contributing to the o+erall sound %erce%tion.

    (. $e)erberation effects on source localisation

    As a first effect re+erberation reduces the efficac& of the infor ation used b& the listener. 'n%resence of reflections( in fact( the auditor& s&ste a& decode the source %osition erroneousl&due to high intensit& echoes interfering the direct sound and su%eri %osing the sel+es to it.

    Tests in anechoic cha ber and re+erberant cha ber brought so e significant results through the&ears.So e of the are listed belo),

    a )ide band stationar& noise is localised ore scarcel& in a re+erberant field E=F a )ide band noise is localised easier in a re+erberant field than single s%arse tones( that is(

    localisation i %ro+es )ith the s%ectral densit& of the source high attack transient sounds are better localised than others inde%endentl& on

    re+erberation deca& ti e .

    Thanks to the precedence effect the listener is able to )eight )a+efronts in different )a&s.The first front on arri+al direct sound is gi+en ore )eight.Stee% attack transients trigger the %recedence effect and fa+our localisation( )hile stationar&

    sources are decoded )ith a greater difficult&.

  • 8/12/2019 Binaural Approach in Acoustic Scene Simulations in Audio Forensics

    6/14

    Francesca Ortolani - Binaural approach in acoustic scene simulations in audio forensics

    Radio Shack School of Audio Technologies 2014

    >

    Studies about se%aration of concurrent sources E>F re+ealed that the ca%abilit& of grou%ing soundsaccording to their har onicit& de%ends on the e*tent of their funda ental fre$uenc& f 0 range.

    'n free s%ace the listener identifies )ith ease those sounds ha+ing a %ro%er %itch difference.The situation changes in a re+erberant en+iron ent, direct sound and echoes )ith the sa efunda ental fre$uencies relate to each other on the basis of har onic ratios. A flutter in f 0 causesinhar onicit& bet)een direct sound and reflections. /or this reason( in a re+erberant field( the

    echanis of grou%ing sounds according to har onicit& loses its efficac&.

    'n a confined s%ace distance %erce%tion de%ends on 2 factors EOF, the ratio bet)een direct soundenerg& and reflected energ& and the ti e dela& bet)een direct sound and reflected sound.Roo reflections also contribute to the general s%atial i %ression charactering the listeningen+iron ent and the sound source.

    So e %ara eters are introduced here, ASI (Apparent 3ource 4idth) ( )hich is the %ercei+edlength of a source( and 7 P (&istener 5nvelopment) ( the %erce%tion of a surrounding sound.ASI is deter ined %rinci%all& b& the intensit& le+el of the earl& lateral reflections )ithin the first30 s fro the arri+al of the direct sound at the listenerHs %osition.7 P is the subDecti+e i ersi+e sound sensation %roduced b& re+erb and de%ends on the nature oflate reflections( in %articular on their distribution o+er ti e( their le+el and direction of arri+al.

    'n E3F !radle& e*%lains ho) ASI and 7 P are caused b& the %recedence effect( during )hich thedirect sound and the earl& reflections are te %oraril& erged together.

    ASI finds Dustification in the circu stance that the identification of the direction of arri+al isdistorted b& the enhance ent of the erged e+ent %roduced b& earl& lateral reflections.

    The re+erberation tail instead is not erged )ith direct sound and a%%ears as a sort of diffusedhalo around the direct sound thus gi+ing the listener a sense of i ersion $uantifiable )ith 7 P.

    *. $e)erberation effects on sound perception

    Ihen the sound is e itted in a re+erberant en+iron ent( reflections are su%eri %osed to directsound( causing the ;s%reading< of it in ti e and filling u% )ith energ& those s%aces in ti e that)ould ha+e re ained e %t& in an anechoic a bient( that is( a reflectionless cha ber.Iith i ediate effect( the listener encounters a reduction in signal intelligibilit&./igure 2 a b co %ares the s%ectrogra 1 of the +ocal recording ;Thank &ou for tra+elling on thecentral line< b& a fe ale s%eaker in the case a of a oderate re+erberant a bient and the case

    b )ith a 1-second deca&-ti e re+erb.

    /igure 2 b highlights re+erberation filling u% those darker ?ones )ith lo)er energ&( )hich areuch shar%er in /igure 2 a .

    The attack %hone es are less susce%tible to the effect of re+erberation than the final ;releasec n nn:

    d 3 a= 1.O

    )here : is a directi+it& %ara eter 1: = for an o nidirectional source ( n3 is the n-th surface

    area ha+ing the absor%tion coefficient na .

    'f the absor%tion coefficients are unkno)n( an alternati+e for ula for cd is the follo)ing,

    >0100

    c

    : ; d

    "T p= 1.3

    )here ; is the roo +olu e "m and

    >0"T is the esti ated reverberation time 00( RT"0( RT20( RT10 , t&%icall& defined as the ti e inter+al sound energ& takes todeca& >0( "0( 20( 10 d!( after the e*citation in the roo has ceased.

    Figure * %he reflection order is the number of impacts the sound carries out before reaching the listener recei)er 01"

  • 8/12/2019 Binaural Approach in Acoustic Scene Simulations in Audio Forensics

    13/14

    Francesca Ortolani - Binaural approach in acoustic scene simulations in audio forensics

    Radio Shack School of Audio Technologies 2014

    1"

    nne e B/ %he human ear

    The %inna( the out ost %art of the ear( is a cartilaginous structure )ith the ai of con+e&ing thesound( through the auditor& channel( to)ards the eardru e brane. The sound is trans ittedto the ossicles alleus( an+il and stirru% ( )hich for a s&ste of le+ers beha+ing like ana %lifier. So( the force i %inging on the o+al )indo) is ade tri%le co %ared to that i %ingingon the alleus and the acoustic %ressure is 2 orders of agnitude greater relati+e to the %ressureon the eardru E4F. The cochlea is the center of the auditor& %rocess. 't has the sha%e of a snailand itHs about "= long )ith a dia eter of 2 at its e*ternal entrance and graduall& shrinking.'nside the cochlea there is the basilar e brane co+ered in ciliated cells. Ihen the e brane+ibrates( the cilias bend and this o+e ent causes the ner+e ter inations of the relati+e cells totrans it i %ulses to the brain( )hose nu ber de%ends on the fre$uenc& of the sound )a+e. 'n

    other )ords( the cochlea beha+es like a filter bank E4F( o%erating a s%ectru anal&sis andtransduces a fre$uenc& infor ation into an electric signal to)ards the brain( according to the%osition of the in+ol+ed ;resonator

  • 8/12/2019 Binaural Approach in Acoustic Scene Simulations in Audio Forensics

    14/14

    Francesca Ortolani - Binaural approach in acoustic scene simulations in audio forensics

    Radio Shack School of Audio Technologies 2014

    14

    References:1. R. . Stern( B. 9. !ro)n e #. Iang( 2inaural 3ound &ocali>ation ( 'n 'omputational Auditor

    3cene Anal sis? +rinciples, Algorithms, and Applications ( Iile&-' 5ress( 200>

    2. /. 7. Iight an e #. 9. istler( #actors affecting the relative salience of sound locali>ationcues ( in 2inaural and 3patial !earing in "eal and ;irtual 5nvironments ( 1-2"( 7a)rence

    rlbau Associates( ah)ah( 89( 166O

    ". !rian art& e Pictor 7a??arini( 2inaural !"T# based spatialisation? new approaches and implementation ( 12th 'nt. onference on #igital Audio ffects #A/*-06 ( o o( 'tal&(Se%te ber 1-4( 2006

    4. A. :ncini( Audio igitale ( ",30-31( cBra)- ill( 200>

    =. I. . art ann( &ocali>ation of sound in rooms ( 9ournal of the Acoustical Societ& ofA erica( O4 = ,1"30-1"61(163"

    >. 9. /. ulling( . Su erfield e #. . arshall( 5ffects of simulated reverberation on the useof binaural cues and fundamental fre8uenc differences for separating concurrent vowels (S%eech o unication( 14,O1-6=( 1664

    O. B.9. !ro)n e . 5alo Uki( "everberation ( 'n 'omputational Auditor 3cene Anal sis?

    +rinciples, Algorithms, and Applications ( Iile&-' 5ress( 200>

    3. 9. S. !radle&( +redictors of speech intelligibilit in rooms ( 9ournal of the Acoustical Societ& ofA erica( 21 > ,=OO-=30( 1646

    6. R. . !olt e A. #. ac#onald( Theor of speech masking b reverberation ( 9ournal of theAcoustical Societ& of A erica( 21 > ,=OO-=30(1646

    10. !. 7ibbe& e 5. . Rogers( The effects of overlap$masking on binaural reverberant wordintelligibilit ( 9ournal of the Acoustical Societ& of A erica( 11=,"141-"1=1( 2004

    11. A erican 8ational Standards 'nstitute( American national ps choacoustical terminolog ,S".20( 16O"( A erican Standards Association

    12. 9. 5. A. 7ochner e 9. /. !urger( The influence of reflections on auditorium acoustics (9ournal of Sound and Pibration( 1 4,42>-4=4( 16>4

    1". /. A. +erest( The %aster !andbook of Acoustics ( cBra)- ill 2006