white paper phonetic search tech
Post on 04-Apr-2018
7/30/2019 White Paper Phonetic Search Tech
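© 2008 Nexidia Inc. All rights reserved. All trademarks are the property of their respective owners. Nexidia products are protected by copyrights
and one or more of the following United States patents: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and other patents pending.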
WHITE PAPER
Phonetic Search Technology
A Whitepaper by Nexidia, Inc.
Copyright Notice
Copyright 2004-2009, Nexidia Inc. All rights reserved.
This manual and any software described herein, in whole or in part, may not
be reproduced, translated or modified in any manner without the prior written
approval of Nexidia Inc. Any documentation that is made available by Nexidia Inc.
is the copyrighted work of Nexidia Inc. or its licensors and is owned by Nexidia Inc.
or its licensors. This document contains information that may be protected by one
or more U.S. patents, foreign patents or pending applications.
TRADEMARKS
Nexidia, Enterprise Speech Intelligence, Nexidia ESI, the Nexidia logo, and
combinations thereof are trademarks of Nexidia Inc. in the United States and other
countries. Other product names and brands mentioned in this manual may be the
trademarks or registered trademarks of their respective companies and are hereby
acknowledged.
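Introduction
From call centers to broadcast news programs, the quantity of digital files being created is
growing quickly and shows no signs of slowing. While valuable information may exist in the
audio of these files, there has historically been no effective means to organize, search and
analyze the data in an efficient manner.
Consequently, much of this data was unavailable for analysis, such as the millions of hours of call center calls recorded every
year that are archived for legal reasons. Using a more traditional approach, a very small amount of audio may be listened to,
but in an ad-hoc manner, such as random audits by call center managers, or listening to various broadcasts. Targeted searching,
however, is difficult. If this audio data were easily searchable, many applications would be possible, such as: reviewing only
calls that meet set criteria, performing trend analysis across thousands of hours of customer calls, searching an entire newscast
to find the exact location where a certain topic is discussed and many other uses.
The difficulty in accessing the information in most audio today is that unlike some broadcast media, closed captioning is not
available. Further, man-made transcripts are expensive to generate, and limited in their description. Audio search based on
speech-to-text technology is not scalable, depends on highly trained dictionaries and generates a prohibitive total cost
of ownership. What is needed is an alternate approach.
In this paper, we summarize prior work in searching audio and examine characteristics of the various methods. We then
introduce and describe a better approach known as phonetic-based search, developed by the research group at Nexidia in
conjunction with the Georgia Institute of Technology. Phonetic-based search is designed for extremely fast searching through
vast amounts of media, allowing search for words, phrases, jargon, slang and other words not readily found in a speech-to-text
dictionary. Below we provide a description of Nexidia's technology and then discuss the accuracy of phonetic search and
finally present current applications of the technology itself.
Contact centers/enterprise, rich media, legal/audio discovery and government applications are areas where Nexidia has been
successfully applied.
Prior Work in Audio Search
Retrieval of information from audio and speech has been a goal of many researchers over the past ten years. The simplest
solution to this problem would be to use Large Vocabulary Continuous Speech Recognition (LVCSR), perform time alignment,
and produce an index of text content along with time stamps. LVCSR is sufficiently mature that toolboxes are publicly available
such as HTK (from Cambridge University, England), ISIP (Mississippi State University, USA), and Sphinx (Carnegie Mellon
University, USA) as well as a host of commercial offerings. Much of the improved performance demonstrated in current LVCSR
systems comes from better linguistic modeling [Jurafsky] to eliminate sequences of words that are not allowed within the
language. Unfortunately, the word error rates are seldom zero.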
The need for better automatic retrieval of audio data has prompted formulation of databases specifically to test this capability
[Graff]. Also, a separate track has been established for spoken document retrieval within the annual TREC (Text Retrieval
Conference) event [Garofolo]. An example can be seen in [Johnson]. In this research, a transcription from LVCSR was produced
on the NIST Hub-4 Broadcast News corpus. Whole sentence queries are posed, and the transcription is searched using intelligent
text-based information extraction methods. Some interesting data from this report shows that word error rates range from 64%
to 20%, depending on the LVCSR system used, and closed captioning error rates are roughly 12%. While speech recognition has
improved since these results, the improvement has been incremental.
Ng and Zue [Ng] recognized the need for phonetic searching by using subword units for information retrieval. Although
the phonetic error rates were high (37%) and performance of the retrieval task was low compared to LVCSR methods, great promise
was anticipated by the authors.
In the LVCSR approach, the recognizer tries to transcribe all input speech as a chain of words in its vocabulary. Keyword
spotting is a different technique for searching audio for specific words and phrases. In this approach, the recognizer is only
concerned with occurrences of one keyword or phrase. Since the score of the single word must be computed (instead
of the entire vocabulary), much less computation is required. This was very important for early real-time applications such as
surveillance and automation of operator-assisted calls [Wilpon] [Wohlford].
Another advantage of keyword spotting is the potential for an open vocabulary at search time, making this technique useful in
archive retrieval. This technique, however, is inadequate for real-time execution. When searching through tens or hundreds
of thousands of hours of archived audio data, scanning must be executed many thousands of times faster than real-time.
To achieve this goal, a new class of keyword spotters has been developed that performs separate indexing and searching
stages. In doing so, search speeds that are several thousand times faster than real time have been successfully achieved.
The two predominant approaches have been fast-indexing methods [Sarukkai] and phonetic lattice methods [James] and
combinations [Yu]. In the first approach, speed is achieved by generating a description of the speech signal using
a subset of sub-word descriptors. These descriptors are used to narrow the search space at retrieval time. In the second
approach, the speech is indexed to produce a lattice of likely phonemes that can be searched quickly for any given phoneme
sequence. In all of these methods, accuracy has been sacrificed for speed.

INDUSTRY                      BENEFITS FROM PHONETIC SEARCH
Contact Centers/Enterprise    > Improved customer interactions
                              > Deeper business intelligence
                              > Operational efficiencies
Rich Media                    > Large amounts of long-form content are searchable
                              > Automated categorization and filtering
                              > Synchronize stories with videos
                              > Ad targeting
                              > Easily monetized content
Legal/Audio Discovery         > Corporate compliance
                              > Litigation support
                              > Fast and accurate audio discovery
Government                    > Audio search
                              > Public safety
                              > Standards compliance

Figure 1: Nexidia High-Speed Phonetic Search Architecture
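The Nexidia High-Speed Phonetic Search Engine
We now introduce another approach to phonetic searching, illustrated in Figure 1. This high-speed algorithm [Clements
et al. 2001a; Clements et al. 2001b; Clements et al. 2007; U.S. patents 7,231,351; 7,263,484; 7,313,521; 7,324,939;
7,406,415] comprises two phases: indexing and searching. The first phase indexes the input speech to produce a phonetic
search track and is performed only once. The second phase, performed whenever a search is needed for a word or phrase,
is searching the phonetic search track. Once the indexing is completed, this search stage can be repeated for any number of
queries. Since the search is phonetic, search queries do not need to be in any pre-defined dictionary, thus allowing searches for
proper names, new words, misspelled words, jargon etc. Note that once indexing has been completed, the original media are
not involved at all during searching. The search track could therefore be generated on the highest-quality media available for improved
accuracy (for example: µ-law audio for telephony), and the audio could then be replaced by a compressed representation for
storage and subsequent playback (for example: GSM) afterwards.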
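Indexing and the acoustic model
The indexing phase begins with format conversion of the input media (whose format might be MP3, ADPCM, QuickTime, etc.)
into a standard audio representation for subsequent handling (PCM). Then, using an acoustic model, the indexing engine scans
the input speech and produces the corresponding phonetic search track. An acoustic model jointly represents characteristics
of both an acoustic channel (an environment in which the speech was uttered and a transducer through which it was recorded)
and a natural language (in which human beings expressed the input speech). Audio channel characteristics include: frequency
response, background noise and reverberation. Characteristics of a natural language include gender, dialect and accent of the
speaker.
Nexidia typically produces two acoustic models for each language:
a model for media with higher sampling rates, good signal-to-noise ratios, and more formal, rehearsed speech; and
a model for media from a commercial telephony network, either landline or cellular handsets,
optimized for the more spontaneous, conversational speech of telephone calls.
Nexidia supports more than 30 languages including:
Dutch
English (North American, UK, and Australian)
French (European, Canadian)
Hindi
German
Japanese
Korean
Mandarin
Russian
Spanish (Latin American, Castilian)
Thai
Additional languages are constantly in development. It typically takes less than a few weeks to develop a language pack
for a new language.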
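Phonetic search track
The end result of phonetic indexing of an audio file is the creation of a Phonetic Audio Track (PAT file), a highly compressed
representation of the phonetic content of the input speech. Unlike LVCSR, whose essential purpose is to make irreversible
(and possibly incorrect) bindings between speech sounds and specific words, phonetic indexing merely infers the likelihood of
potential phonetic content as a reduced lattice, deferring decisions about word bindings to the subsequent searching phase.
PAT files are simple files that can be treated as metadata, associated and distributed with the originating media segments,
produced in one environment, stored in data bases, transmitted via networks, and searched in another environment. The PAT
file grows in size proportional to the length in time of the source media file, at around 3.7 MB per hour, equivalent to a bit rate of
8.6 kbps, i.e., 2/3 that of GSM telephony audio (13 kbps) or 1/15 that of a typical MP3 (128 kbps).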
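The storage figures above can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes "MB" means 2**20 bytes and "kbps" means 1,000 bits per second (the paper does not say which convention it uses), and the function names are hypothetical.

```python
# Worked arithmetic for the PAT file size figures quoted in the text.
# Assumptions (not stated in the paper): 1 MB = 2**20 bytes, 1 kbps = 1000 bit/s.
PAT_MB_PER_HOUR = 3.7   # PAT growth rate quoted in the text
GSM_KBPS = 13.0         # GSM telephony audio bit rate quoted in the text
MP3_KBPS = 128.0        # typical MP3 bit rate quoted in the text


def pat_bitrate_kbps(mb_per_hour: float = PAT_MB_PER_HOUR) -> float:
    """Convert a storage rate in MB per media hour into kilobits per second."""
    bits_per_hour = mb_per_hour * 2**20 * 8
    return bits_per_hour / 3600 / 1000


def pat_storage_gb(hours_of_media: float, mb_per_hour: float = PAT_MB_PER_HOUR) -> float:
    """Estimate total PAT storage (in GB, 1 GB = 1024 MB) for an archive."""
    return hours_of_media * mb_per_hour / 1024


rate = pat_bitrate_kbps()
print(f"PAT bit rate: {rate:.1f} kbps")
print(f"Fraction of GSM bit rate: {rate / GSM_KBPS:.2f}")
print(f"MP3 bit rate is about {MP3_KBPS / rate:.0f}x the PAT bit rate")
print(f"PAT storage for 10,000 hours: {pat_storage_gb(10_000):.1f} GB")
```

Under these assumptions the computed rate agrees with the 8.6 kbps, 2/3 and 1/15 figures in the text.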
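KEYWORD PARSING
The searching phase begins with parsing the query string, which is specified as text containing one or more:
words or phrases (e.g., "President" or "Supreme Court Justice"),
phonetic strings (e.g., "_B _IY _T _UW _B _IY", six phonemes representing the acronym "B2B"),
temporal operators (e.g., "brain cancer &15 cell phone", representing two phrases spoken within 15 seconds of each other).
A phonetic dictionary is referenced for each word within the query term to accommodate unusual words (whose pronunciations
must be handled specially for the given natural language) as well as very common words (for which performance optimization is
worthwhile). Any word not found in the dictionary is then processed by consulting a spelling-to-sound converter that generates
likely phonetic representations given the word's orthography.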
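To make the three kinds of query content concrete, here is a minimal parsing sketch. It is not Nexidia's parser: `QueryPart` and `parse_query` are hypothetical names, and the only syntax assumed is what the examples above show (phoneme symbols prefixed with `_`, and `&N` meaning "within N seconds of the previous part"). Dictionary lookup and spelling-to-sound conversion are left out.

```python
import re
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class QueryPart:
    kind: str                             # "phrase" or "phonetic"
    tokens: List[str]                     # words, or phoneme symbols such as "_B"
    within_seconds: Optional[int] = None  # time window linking this part to the previous one


# A temporal operator is assumed to look like "&15" (ampersand followed by seconds).
TEMPORAL = re.compile(r"^&(\d+)$")


def parse_query(query: str) -> List[QueryPart]:
    """Split a query string into phrase/phonetic parts joined by temporal operators."""
    parts: List[QueryPart] = []
    current: List[str] = []
    pending_window: Optional[int] = None

    def flush() -> None:
        nonlocal current, pending_window
        if current:
            # A part made only of "_X" symbols is treated as a phonetic string.
            kind = "phonetic" if all(t.startswith("_") for t in current) else "phrase"
            parts.append(QueryPart(kind, current, pending_window))
            current, pending_window = [], None

    for token in query.split():
        match = TEMPORAL.match(token)
        if match:
            flush()                              # close the part before the operator
            pending_window = int(match.group(1)) # applies to the part that follows
        else:
            current.append(token)
    flush()
    return parts


for example in ("brain cancer &15 cell phone", "_B _IY _T _UW _B _IY"):
    print(example, "->", parse_query(example))
```

In a fuller implementation each phrase part would then be expanded into phoneme sequences, first through the phonetic dictionary and otherwise through the spelling-to-sound converter.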
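SEARCH AND RESULTS LISTS
After words, phrases, phonetic strings and temporal operators within the query term are parsed, actual searching commences.
Multiple PAT files can be scanned at high speed during a single search for likely phonetic sequences (possibly separated by
offsets specified by temporal operators) that closely match corresponding strings of phonemes in the query term. Recall that
PAT files encode potential sets of phonemes, not irreversible bindings to sounds. Thus, the matching algorithm is probabilistic
and returns multiple results, each as a 4-tuple:
PAT File (to identify the media segment associated with the putative hit)
Start Time Offset (beginning of the query term within the media segment, accurate to one hundredth of a second)
End Time Offset (approximate time offset for the end of the query term)
Confidence Level (that the query term occurs as indicated, between 0.0 and 1.0)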
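Even during searching, irreversible decisions are postponed. Results are simply enumerated, sorted by confidence level, with the
most likely candidates listed first. Postprocessing of the results list can be automated. Example strategies include hard thresholds
(e.g., ignore results below 90% confidence), occurrence counting (e.g., a media segment gets a better score for every additional
instance of the query term) and natural language processing (patterns of nearby words and phrases denoting semantics).
Typical web search engines strive to return multiple results on the first page so that the user can quickly identify one of the
results as their desired choice. Similarly, an efficient user interface can be devised to sequence rapidly through a phonetic
search results list, to listen briefly to each entry, to determine relevance and finally to select one or more utterances that meet
specific criteria. Depending on available time and importance of the retrieval, the list can be perused as deeply as necessary.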
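The results list and two of the postprocessing strategies described above (hard thresholds and occurrence counting) can be sketched as follows. The `Hit` type mirrors the 4-tuple from the text; the field names, helper functions and sample values are made up for illustration and are not part of any Nexidia API.

```python
from collections import Counter
from typing import List, NamedTuple


class Hit(NamedTuple):
    pat_file: str      # identifies the media segment
    start: float       # start time offset in seconds
    end: float         # approximate end time offset in seconds
    confidence: float  # between 0.0 and 1.0


def rank(hits: List[Hit]) -> List[Hit]:
    """Sort putative hits so the most likely candidates come first."""
    return sorted(hits, key=lambda h: h.confidence, reverse=True)


def apply_threshold(hits: List[Hit], minimum: float = 0.90) -> List[Hit]:
    """Hard threshold: ignore results below the given confidence."""
    return [h for h in hits if h.confidence >= minimum]


def count_by_segment(hits: List[Hit]) -> Counter:
    """Occurrence counting: number of hits per media segment."""
    return Counter(h.pat_file for h in hits)


# Made-up sample results for one query term.
hits = [
    Hit("call_001.pat", 12.34, 12.90, 0.97),
    Hit("call_002.pat", 3.10, 3.65, 0.55),
    Hit("call_001.pat", 48.00, 48.52, 0.91),
]

ranked = rank(hits)
strong = apply_threshold(ranked, minimum=0.90)
print([h.confidence for h in ranked])
print(count_by_segment(strong))
```

Because the threshold is applied after the search, the same results list can be re-read with a lower threshold later without searching again.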
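STRUCTURED QUERIES
In addition to ad-hoc searches, Nexidia provides a more sophisticated technology to assist with contextual searches: a structured
query. A structured query is similar to a finite state grammar that would be produced for an automatic speech recognition
system. Examples of operators are AND, OR, and ANDNOT. Due to the special domain of search, several helpful extensions are
also provided, such as attaching time windows to operators. Similar to Nexidia's standard ad-hoc phonetic search, both scores
and time offsets are returned. By constructing complex queries, customers are able to easily generate document classifiers in
addition to just detecting word or phrase occurrences. An example might be to identify how many calls in a call center's archive
discuss problems with a rebate. Structured queries are simple to write and have the expressive power to capture complex
Boolean and temporal relationships, as shown in the following example:
Coupon = OR("coupon", "certificate", "rebate")
Research = BEFORE_3("let me", OR("check", "do some research"))
Problem = OR("I'm afraid", "unfortunately", Research)
QUERY = AND_10(Coupon, Problem)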
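One way to read the example query is as set operations over time-stamped hits within a single media segment. The sketch below is an interpretation, not Nexidia's implementation. It assumes that `AND_10` means "both operands start within 10 seconds of each other", that `BEFORE_3` means "the first operand ends at most 3 seconds before the second starts", and that a combined hit takes the lower of the two scores. The per-term hits are invented sample data.

```python
from typing import Dict, List, NamedTuple


class Hit(NamedTuple):
    start: float  # seconds from the beginning of the media segment
    end: float
    score: float  # confidence between 0.0 and 1.0


def OR(*operands: List[Hit]) -> List[Hit]:
    """Union of all operand hits, ordered by start time."""
    merged = [hit for operand in operands for hit in operand]
    return sorted(merged, key=lambda h: h.start)


def AND(window: float, left: List[Hit], right: List[Hit]) -> List[Hit]:
    """Pairs of hits whose start times lie within `window` seconds (assumed semantics)."""
    out = []
    for a in left:
        for b in right:
            if abs(a.start - b.start) <= window:
                out.append(Hit(min(a.start, b.start), max(a.end, b.end), min(a.score, b.score)))
    return out


def BEFORE(window: float, first: List[Hit], second: List[Hit]) -> List[Hit]:
    """`first` ends no more than `window` seconds before `second` starts (assumed semantics)."""
    out = []
    for a in first:
        for b in second:
            gap = b.start - a.end
            if 0 <= gap <= window:
                out.append(Hit(a.start, b.end, min(a.score, b.score)))
    return out


# Invented per-term hits for one call recording.
term: Dict[str, List[Hit]] = {
    "coupon": [Hit(12.0, 12.5, 0.92)],
    "certificate": [],
    "rebate": [Hit(40.0, 40.4, 0.70)],
    "let me": [Hit(15.0, 15.4, 0.88)],
    "check": [Hit(15.6, 15.9, 0.81)],
    "do some research": [],
    "I'm afraid": [],
    "unfortunately": [Hit(90.0, 90.7, 0.95)],
}

coupon = OR(term["coupon"], term["certificate"], term["rebate"])
research = BEFORE(3, term["let me"], OR(term["check"], term["do some research"]))
problem = OR(term["I'm afraid"], term["unfortunately"], research)
query = AND(10, coupon, problem)
print(query)
```

With this sample data the only match pairs the "coupon" hit with the "let me ... check" sequence a few seconds later, which is the kind of call the example query is meant to flag.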
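ADVANTAGES OF PHONETIC SEARCHING
The basic architecture of phonetic searching offers several key advantages over LVCSR and conventional word spotting:
Speed, accuracy, scalability. The indexing phase devotes its limited time allotment only to categorizing input speech sounds
into potential sets of phonemes, rather than making irreversible decisions about words. This approach preserves the possibility for
high accuracy so that the searching phase can make better decisions when presented with specific query terms. Also, the
architecture separates indexing and searching so that the indexing needs to be performed only once (typically during media
ingest) and the relatively fast operation (searching) can be performed as often as necessary.
Open vocabulary. LVCSR systems can only recognize words found in their lexicons. Many common query terms (such as
specialized terminology and names of people, places and organizations) are typically omitted from these lexicons (partly to keep
them small enough that LVCSRs can be executed cost effectively in real-time, and also because these kinds of query terms are
notably unstable as new terminology and names are constantly evolving). Phonetic indexing is unconcerned about such linguistic
issues, maintaining completely open vocabulary (or, perhaps more accurately, no vocabulary at all).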
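Low penalty for new words. LVCSR lexicons can be updated with new terminology, names, and other words. However, this
exacts a serious penalty in terms of cost of ownership, because the entire media archive must then be reprocessed through
LVCSR to recognize the new words (an operation that typically executes only slightly faster than real time at best). Also,
probabilities need to be assigned to the new words, either by guessing their frequency or context or by retraining a language
model that includes the new words. The dictionary within the phonetic searching architecture, on the other hand, is consulted
only during the searching phase, which is relatively fast compared to indexing. Adding new words incurs only another search,
and it is often unnecessary to add words, since the spelling-to-sound engine can handle most cases automatically, or users
can simply enter sound-it-out versions of words.
Phonetic and inexact spelling. Proper names are particularly useful query terms, but also particularly difficult for LVCSR,
not only because they may not occur in the lexicon as described above, but also because they often have multiple spellings
(and any variant may be specified at search time). With phonetic searching, exact spelling is not required. For example, a
mountainous region in NW Czechoslovakia can indeed be located by specifying "Sudetenland," but "Sue Dayton Land" will
work as well. This advantage becomes clear with a name that can be spelled "Qaddafi," "Khaddafi," "Quadafi," "Kaddafi," or
"Khadafy," any of which could be located by phonetic searching.
User-determined depth of search. If a particular word or phrase is not spoken clearly, or if background noise interferes at
that moment, then LVCSR will likely not recognize the sounds correctly. Once that decision is made, the correct interpretation
is hopelessly lost to subsequent searches. Phonetic searching, however, returns multiple results, sorted by confidence level.
The sounds at issue may not be first (they may not even be in the top ten or 100), but they are very likely in the results list
somewhere, particularly if some portion of the word or phrase is relatively unimpeded by channel artifacts. If enough time
is available, and if the retrieval is sufficiently important, then a motivated user (aided by an efficient human interface) can
drill as deeply as necessary. This capability is simply unavailable with LVCSR.
Amenable to parallel execution. The phonetic searching architecture can take full advantage of any parallel processing
accommodations. For example, a computer with dual processors can index twice as fast. Additionally, PAT files can be
processed in parallel by banks of computers to search more media per unit time (or search tracks can be replicated in the
same implementation to handle more queries over the same media).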
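CURRENT IMPLEMENTATION OF PHONETIC SEARCHING
Nexidia provides a range of product offerings to support the needs of a wide range of environments. The most basic form,
called Nexidia Workbench, is a C++ toolkit that provides the basic functionality of indexing and searching on media files or
data streams. The Workbench requires users to develop their own end-to-end application. An extensive set of sample code
is provided to assist users in quickly adding phonetic-based search functionality to their applications.
The Nexidia Enterprise Speech Intelligence (ESI) solution is a full server-hosted application that can be configured to
automatically ingest media files from multiple sources, search for any number of user-defined term lists and queries, and
analyze these results for statistical patterns. Initially designed for easy integration into a commercial call center, Nexidia
ESI allows call center operators to easily determine script compliance statistics, monitor topic trends over time, and drill
down into call archives to the specific occurrence of desired events, all using an intuitive web interface.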
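Nexidia also offers Nexidia ESI Developer's Edition (DE), a web services toolkit to allow a custom-built application to directly
control and administer an ESI installation. Since the ESI DE toolkit uses web services, applications may be developed using
virtually any development environment, such as Java, Visual Basic, etc.
Other Nexidia products include AudioFinder, a standalone desktop solution ideal for e-discovery in the legal market and audio
forensics in general; Language Assessor, a web-based solution that automatically assesses the pronunciation and fluency of call
center agent applicants; and a product suite designed for the rich media market that includes automatic tagging of video assets.
All Nexidia products are designed to provide high performance for both indexing and search. On a typical 3.0 GHz Dual Processor
Dual Core server, media files are indexed between 82 and 340 times faster than real-time. Once the PAT files are loaded from
disk into memory (RAM), search speeds of over 1.5 million times faster than real-time can be achieved (or equivalently, more than
400 hours of audio searched in a second). The engine is designed to take maximum advantage of a multi-processor system,
such that a dual processor box achieves nearly double the throughput of a single processor configuration, with minimal overhead
between processors. Compared to alternative LVCSR approaches, the Nexidia phonetic-based search engine provides a level of
scalability not achievable by other systems.
The Nexidia engine comes with built-in support for a wide variety of common audio formats, including PCM, µ-law, A-law,
ADPCM, MP3, QuickTime, WMA, G.723.1, G.729, G.726, Dialogic VOX, GSM and many others. Nexidia also provides a
framework to support custom file-formats and devices, such as direct network feeds and proprietary codecs, through
a provided plug-in architecture.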
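Performance of Nexidia Phonetic Search
There are three key performance characteristics of Nexidia's Phonetic Search: accuracy of results, index speed and search
speed. All three are important when evaluating any audio search technology. This section will describe each of these in detail
for the Nexidia phonetic-based engine.
RESULT ACCURACY
Phonetic-based search results are returned as a list of putative hit locations, in descending likelihood order. As a user progresses
further down this list, they will find more and more instances of their query occurring. However, they will also eventually
encounter an increasing number of false alarms (results that do not correspond to the desired search term). This performance
characteristic is best shown by a curve common in detection theory: the Receiver Operating Characteristic curve, or ROC curve,
shown in Figure 2 and Figure 3.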
To generate this curve, one needs experimental results from the search engine (the ordered list of putative hits) and the ideal
results for the test set (acquired by manual review and documentation of the test data). For audio search, the ideal set is the
verbatim transcripts of what was spoken in the audio. For a single query, first the number of actual occurrences in the ideal
transcript is counted. The ROC curve begins at the 0,0 point on the graph of False Alarms per Hour versus Probability of Detection.
Results from the search engine are then examined, beginning from the top of the list. When a putative hit in the list matches the
transcript, the detection rate increases, as the percentage of the true occurrences detected has just gone up (the curve
goes up). When the hit is not a match, the false alarm rate then increases (the curve now moves to the right). This continues
until the false alarm rate reaches a pre-defined threshold. For any single query in generic speech, this curve normally has very few
points, since the same phrase will only happen a few times, unless the same topic is being discussed over and over in the
database. To produce a meaningful ROC curve, thousands of queries are tested with the results averaged together, generating
smooth, and statistically significant, ROC curves.
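The curve construction just described can be written down directly for a single query. This is a sketch under stated assumptions: the ranked list is reduced to booleans (True when a putative hit matches the transcript), the function and parameter names are hypothetical, and averaging over thousands of queries is omitted.

```python
from typing import List, Tuple


def roc_points(
    ranked_hits_are_true: List[bool],
    true_occurrences: int,
    hours_searched: float,
    max_false_alarms_per_hour: float,
) -> List[Tuple[float, float]]:
    """Trace (false alarms per hour, probability of detection) for one query.

    `ranked_hits_are_true` lists the putative hits from most to least
    confident; each entry says whether the hit matches the transcript.
    """
    points = [(0.0, 0.0)]  # the curve starts at the origin
    detected = 0
    false_alarms = 0
    for is_true in ranked_hits_are_true:
        if is_true:
            detected += 1       # curve goes up
        else:
            false_alarms += 1   # curve moves to the right
        fa_rate = false_alarms / hours_searched
        if fa_rate > max_false_alarms_per_hour:
            break               # stop once the false alarm threshold is exceeded
        points.append((fa_rate, detected / true_occurrences))
    return points


# Made-up example: 4 true occurrences in 2 hours of audio.
curve = roc_points(
    ranked_hits_are_true=[True, True, False, True, False],
    true_occurrences=4,
    hours_searched=2.0,
    max_false_alarms_per_hour=1.0,
)
for fa_rate, p_detect in curve:
    print(f"{fa_rate:.2f} false alarms/hour -> {p_detect:.0%} detected")
```

A single query gives only a handful of points, as the text notes; a smooth curve requires averaging the per-query curves over a large query set.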
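There are two major characteristics that affect the probability of detection of any given query.
1. The type of audio being searched; and
2. The length and phoneme composition of the search terms themselves.
To address the first issue, Nexidia provides two language packs for each language, one designed to search broadcast-quality
media and another for telephony-quality audio. The ROC curves for North American English in broadcast and telephony are
shown in Figures 2 and 3 respectively.
Figure 2: ROC Curves for the North American broadcast language pack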
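For example, using the North American English broadcast language pack and a query length of 12-15 phonemes, you
can expect, on average, to find 85% of the true occurrences, with less than one false hit per 2 hours of media searched.
The Nexidia engine provides an application the flexibility to choose either to ensure a high probability of detection, by accepting
results with a moderate confidence score, or to reduce false alarms (putative results will have a higher probability of being
an actual desired result), by raising the score threshold and only accepting those with high confidence scores.
In a word-spotting system such as Nexidia, more phonemes in the query mean more discriminative information is available at
search time. As shown by the four curves in the figures representing four different groups of query lengths, the difference can
be dramatic. Fortunately, rather than short, single word queries (such as "no" or "the"), most real-world searches are for
proper names, phrases, or other interesting speech that represent longer phoneme sequences.
For the broadcast results, the test set is a ten-hour selection of ABC, CNN, and other newscasts, professionally transcribed
and truthed by the Linguistic Data Consortium (LDC). For telephony, the test set is a 10-hour subset of the Switchboard and
Switchboard Cellular corpora, also available from the LDC. Query terms were generated by enumerating all possible word
and phrase sequences in the transcripts, and randomly choosing around ten thousand from this set.
Figure 3: ROC Curves for the North American telephony language pack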
INDEXING SPEED
Another significant metric of Nexidia's phonetic search is indexing speed (the speed at which new media can be made searchable).
This is a clear advantage for Nexidia, as the engine ingests media very rapidly. From call centers with hundreds of seats, to media
archives with tens of thousands of hours, to handheld devices with limited CPU and power resources, this speed is a primary
concern, as this relates directly to infrastructure cost.
Indexing requires a relatively constant amount of computation per media hour, unless a particular audio segment is mostly
silence, in which case the indexing rates are even greater. In the worst-case scenario of a call center or broadcast recording
that contains mostly non-silence, ingest speeds for a server-class PC are given below in Table 1.
These speeds indicate that the indexing time for 1,000 hours of media is less than 1 hour of real time. Put another way,
a single server at full capacity can index over 30,000 hours of media per day.
These results are for audio supplied in linear PCM or µ-law format to the indexing engine. If the audio is supplied in another format
such as MP3, WMA, or GSM, there will be a small amount of format-dependent overhead to decode the compressed audio.

Table 1: Indexing speed (* in times faster than real time) on a 2-processor, 4-core server
(Dell PowerEdge 2950, 2 x 3.16 GHz X5460 Quad Core, 4 GB RAM, 2 x 6 MB cache, 1.33 GHz FSB)
INDEXING SPEED*    SERVER UTILIZATION
190                12.5% (single thread, only one CPU core used)
1,306              100% (8 threads, one thread per CPU core)

SEARCH SPEED
A final performance measure is the speed at which media can be searched once it has been indexed. Two main factors influence
the speed of searching. The most important factor is whether the PAT files are in memory or on disk. Once an application
requests a search track be loaded (if it expects it to be needed soon), or else upon the first search of a track, the Nexidia
search engine will load it into memory. Any subsequent searching will use this in-memory version, greatly speeding up searches
when the same media is searched multiple times.
A second factor influencing search speed is the length, in phonemes, of the word or phrase in the query. Shorter queries run
faster, as there are fewer calculations to make internal to the search engine.
Table 2 below shows the search speeds for a fairly average (12 phonemes long) query over a large set of in-memory PAT files,
executed on a server-class PC.

Table 2: Search speed (* in times faster than real time) for a 12-phoneme query on a 2-processor, 4-core server
(Dell PowerEdge 2950, 2 x 3.16 GHz X5460 Quad Core, 4 GB RAM, 2 x 6 MB cache, 1.33 GHz FSB)
SEARCH SPEED*      SERVER UTILIZATION
667,210            12.5% (single thread, only one CPU core used)
5,068,783          100% (8 threads, one thread per CPU core)

Figure 4: Search speed, in hours of media searched per second of CPU time, for a range of query lengths

Applications of Phonetic Search
The phonetic search technology presented in this paper has already found many applications and is applicable to many more.
A brief summary of current and potential uses is:
Call center data mining. Automatically search recorded archives in call-centers for useful and potentially profitable
information, such as call trend analyses, finding problem areas in IVRs, fraud detection, and other uses.
Call center quality control. Reduce expenses associated with manual oversight of CSR script compliance. Further, allow
script compliance analysis across all calls, all seats of a center, rather than manual sampling of a small percentage.
Searchable voice mail. Many people have begun using their mail folders as their filing system, knowing text search can lead
them to desired addresses, notes, discussions, and other information. Phonetic search now allows similar capabilities for voicemail.
Real-time media search. Nexidia's phonetic search can run in monitor mode to return results with less than 1 second of
latency when monitoring up to 1,000 simultaneous audio streams per server.
Archived media search. With Nexidia's phonetic search, it is possible to first find a podcast, lecture, or program of interest,
and second, immediately jump to the point in the recording talking about the desired topic.
Notation, deposition and interview support. In scenarios where expensive transcription is not available, Nexidia's fast indexing
and extremely fast search allow key remarks to be quickly found.
Searchable notes for handheld users. Nexidia's search is lightweight enough to easily run on handheld devices. Quick notes
no longer have to be written down; instead, one can just search the audio itself.
Word or phrase detection. Lightweight search can easily power name-dialing, command-and-control, or other small
applications currently handled by small speech recognition engines.
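Conclusions
This paper has given an overview of the phonetic search technology developed at Nexidia. The method breaks searching into two
stages: indexing and searching. The index stage happens only once per media file, and is extremely fast, at more than 1,000 times
faster than real-time on standard PC hardware. That file can then be searched independently any number of times, at a rate
of more than 5,000,000 times faster than real time. Search queries can be words, phrases, or even structured queries that allow
operators such as AND, OR, and time constraints on groups of words. Search results are lists of time offsets into files, with an
accompanying score giving the likelihood that a match to the query happened at this time.
Phonetic searching has several advantages over previous methods of searching audio media. By not constraining the
pronunciation of searches, any proper name, slang, or even words that have been incorrectly spelled can be found, completely avoiding the
out-of-vocabulary problems of speech recognition systems. Phonetic search is also fast. For deployments such as call
centers with tens of thousands of hours of audio per day, a decision on selecting a subset to analyze need not be made, since
with even modest resources all recordings can be indexed for search. Unlike other approaches, Nexidia's search technology
is very scalable, allowing for fast and efficient searching and analysis of extremely large audio archives.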
References
[Chang] E. I. Chang and R. P. Lippmann, "Improving Wordspotting Performance with Artificially Generated Data," in Proceedings
of IEEE International Conference on Acoustics, Speech and Signal Processing, Atlanta, GA, Vol. 1, pp. 283-286, 1996.
[Choi] J. Choi, D. Hindle, J. Hirschberg, I. Magrin-Chagnolleau, C. Nakatani, F. Pereira, A. Singhal, and S. Whittaker, "SCAN -
Speech Content Based Audio Navigator: A Systems Overview," Proceedings Int. Conf. on Spoken Language Processing, 1998.
[Clements et al. 2001a] M. Clements, P. Cardillo, M. Miller, "Phonetic Searching of Digital Audio," NAB Broadcast Engineering
Conference, Las Vegas, NV, April 2001.
[Clements et al. 2001b] M. Clements, P. Cardillo, M. Miller, "Phonetic Searching vs. LVCSR: How to Find What You Really Want
in Audio Archives," AVIOS, San Jose, CA, April 2001.
[Clements et al. 2007] M. Clements and M. Gavalda, "Voice/Audio Information Retrieval: Minimizing the Need for Human
Ears," Proceedings IEEE ASRU, Kyoto, Japan, December 2007.
[Garofolo] J. Garofolo, C. Auzanne, and E. Voorhees, "The TREC Spoken Document Retrieval Track: A Success Story,"
Proceedings of TREC-8, pp. 107-116, Gaithersburg, MD, Nov. 1999.
[Graff] D. Graff, Z. Wu, R. McIntyre, and M. Liberman, "The 1996 Broadcast News Speech and Language-Model Corpus,"
Proceedings of the 1997 DARPA Speech Recognition Workshop, 1997.
[IBM] http://www-4.ibm.com/software/speech, ViaVoice.
[James] D. A. James and S. J. Young, "A Fast Lattice-Based Approach to Vocabulary Independent Wordspotting," in Proceedings
of IEEE International Conference on Acoustics, Speech and Signal Processing, Adelaide, SA, Australia, Vol. 1, pp. 377-380, 1994.
[Johnson] S.E. Johnson, P.C. Woodland, P. Jourlin, and K. Spärck Jones, "Spoken Document Retrieval for TREC-8 at Cambridge
University," Proceedings of TREC-8, pp. 197-206, Gaithersburg, MD, Nov. 1999.
[Jurafsky] D. Jurafsky and J. Martin, Speech and Language Processing, Prentice-Hall, 2000.
[Microsoft] X. Huang, A. Acero, F. Alleva, M. Hwang, L. Jiang, and M. Mahajan, "Microsoft Windows Highly Intelligent Speech
Recognizer: Whisper," Proceedings of ICASSP 95, volume 1, pp. 93-97.
[Ng] K. Ng and V. Zue, "Phonetic Recognition for Spoken Document Retrieval," Proceedings of ICASSP 98, Seattle, WA, 1998.
[Philips] http://www.speech.be.philips.com, Speech Pearl.
[Sarukkai] R. R. Sarukkai and D. H. Ballard, "Phonetic Set Indexing for Fast Lexical Access," IEEE Transactions on Pattern
Analysis and Machine Intelligence, Vol. 20, no. 1, pp. 78-82, January, 1998.
[Virage] http://www.virage.com, VideoLogger and AudioLogger.
[Wilpon] J. Wilpon, L. Rabiner, L. Lee, and E. Goldman, "Automatic Recognition of Keywords in Unconstrained Speech Using
Hidden Markov Models," IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol. 38, no. 11, pp. 1870-1878,
November, 1990.
[Wohlford] R. Wohlford, A. Smith, and M. Sambur, "The Enhancement of Wordspotting Techniques," in Proceedings of IEEE
International Conference on Acoustics, Speech and Signal Processing, Denver, CO, Vol. 1, pp. 209-212, 1980.
[Yu] P. Yu, K. Chen, C. Ma, and F. Seide, "Vocabulary-Independent Indexing of Spontaneous Speech," IEEE Transactions
on Speech and Audio Processing, volume 13, no. 5, September 2005.
Nexidia Inc.
3565 Piedmont Road NE
Building Two, Suite 400
Atlanta, GA 30305
404.495.7220 tel
404.495.7221 fax
866.355.1241 toll-free
nexidia.com