automatic detection of microchiroptera echolocation calls
TRANSCRIPT
![Page 1: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/1.jpg)
Automatic detection of microchiroptera echolocation calls from field recordings
using machine learning algorithms
Mark D. Skowronski and John G. Harris
Computational Neuro-Engineering Lab
Electrical and Computer Engineering
University of Florida, Gainesville, FL, USA
May 19, 2005
![Page 2: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/2.jpg)
Overview• Motivations for acoustic bat detection
• Machine learning paradigm
• Detection experiments
• Conclusions
![Page 3: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/3.jpg)
Bat detection motivations• Bats are among the most diverse yet least
studied mammals (~25% of all mammal species are bats).
• Bats affect agriculture and carry diseases (directly or through parasites).
• Acoustical domain is significant for echolocating bats and is non-invasive.
• Recorded data can be volumous automated algorithms for objective and repeatable detection & classification desired.
![Page 4: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/4.jpg)
Conventional methods• Conventional bat detection/classification parallels
acoustic-phonetic paradigm of automatic speech recognition from 1970s.
• Characteristics of acoustic phonetics:– Originally mimicked human expert methods– First, boundaries between regions determined – Second, features for each region were extracted– Third, features compared with decision trees, DFA
• Limitations:– Boundaries ill-defined, sensitive to noise– Many feature extraction algorithms with varying degrees of noise
robustness
![Page 5: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/5.jpg)
Machine learning• Acoustic phonetics gave way to machine
learning for ASR in 1980s:• Advantages:
– Decisions based on more information– Mature statistical foundation for algorithms– Frame-based features, from expert knowledge– Improved noise robustness
• For bats: increased detection range
![Page 6: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/6.jpg)
Detection experiments• Database of bat calls
– 7 different recording sites, 8 species– 1265 hand-labeled calls (from spectrogram
readings)
• Detection experiment design– Discrete events: 20-ms bins– Discrete outcomes: Yes or No: does a bin
contain any part of a bat call?
![Page 7: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/7.jpg)
Detectors• Baseline
– Threshold for frame energy
• Gaussian mixture model (GMM)– Model of probability distribution of call features– Threshold for model output probability
• Hidden Markov model (HMM)– Similar to GMM, but includes temporal constraints through piecewise-
stationary states– Threshold for model output probability along Viterbi path
![Page 8: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/8.jpg)
Feature extraction• Baseline
– Normalization: session noise floor at 0 dB– Feature: frame power
• Machine learning– Blackman window, zero-padded FFT– Normalization: log amplitude mean subtraction
• From ASR: ~cepstral mean subtraction• Removes transfer function of recording environment• Mean across time for each FFT bin
– Features:• Maximum FFT amplitude, dB• Frequency at maximum amplitude, Hz• First and second temporal derivatives (slope, concavity)
![Page 9: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/9.jpg)
Feature extraction examples
![Page 10: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/10.jpg)
Feature extraction examples
![Page 11: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/11.jpg)
Feature extraction examples
Six features: Power, Frequency, P, F P, F
![Page 12: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/12.jpg)
Detection example
![Page 13: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/13.jpg)
Experiment results
![Page 14: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/14.jpg)
Experiment results
![Page 15: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/15.jpg)
Conclusions• Machine learning algorithms improve detection
when specificity is high (>.6).• HMM slightly superior to GMM, uses more
temporal information, but slower to train/test.• Hand labels determined using spectrogram,
biased towards high-power calls.• Machine learning models applicable to other
species.
![Page 16: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/16.jpg)
Bioacoustic applications• To apply machine learning to other species:
– Determine ground truth training data through expert hand labels
– Extract relevant frame-based features, considering domain-specific noise sources (echos, propellor noise, other biological sources)
– Train models of features from hand-labeled data– Consider training “silence” models for discriminant
detection/classification
![Page 17: Automatic detection of microchiroptera echolocation calls](https://reader033.vdocument.in/reader033/viewer/2022052904/55836a98d8b42a360c8b4f5d/html5/thumbnails/17.jpg)
Further information• http://www.cnel.ufl.edu/~markskow• [email protected]
AcknowledgementsBat data kindly provided by:
Brock Fenton, U. of Western Ontario, Canada