6.003 signal processingof lec 05a sampling rate 8000 samples/sec, 9600 samples window_size=4096,...

6.003 Signal Processing

Week 9, Lecture A:Short Time Fourier Transform

6.003 Fall 2020

Time-varying SignalsReal-world signals (i.e., speech, music, . . .) often have frequency content that varies with time.

Fourier Transform: events that are local in time are global in frequency (and vice versa). Sudden changes and local variations can be difficult to detect.

Example: 2 tunes• cos0.wav• cos1.wav

How to tell them apart?

FFT of the two signals:

cos0 cos1

Short-Time Fourier Transform (STFT)The short-time Fourier Transform is a tradeoff between time and frequency-domain representations, representing the frequency content of the signal at various points in time.

Conceptually, we are taking the DFT of successive windowed regions of the original signal (and these regions may overlap).

Short-Time Fourier Transform (STFT)

Formally, we define the STFT of a signal x as:

𝑆𝑇𝐹𝑇 𝑥 [𝑚, 𝑘] =

𝑛=0

𝑁−1

𝑥 𝑛 + 𝑚 × 𝑠 ∙ 𝑤[𝑛]𝑒−𝑗2𝜋𝑘𝑁 𝑛

Where:• 𝑚 is a time index, 𝑘 is a frequency index

• 𝑁 is the length of a window

• 𝑤[𝑛] is window function

• 𝑠 is “step size”

STFT and Spectrograms

𝑛=0

𝑁−1

𝑥 𝑛 +𝑚 × 𝑠 ∙ 𝑤[𝑛]𝑒−𝑗2𝜋𝑘𝑁

The STFT is often visualized using a spectrogram, which is defined to be the magnitude squared of the STFT. Spectrogram:

Frequency content of the signal at various points in time.

Setting Suitable Parameters for Spectrograms

𝑛=0

𝑁−1

𝑥 𝑛 +𝑚 × 𝑠 ∙ 𝑤[𝑛]𝑒−𝑗2𝜋𝑘𝑁

• What window size N should be used?

• What step size s should be used?

Setting suitable window size N:

• Larger N will give rise to better frequency resolution

• Larger N will compromise time resolution

Setting suitable step size s:

• Smaller s helps with better time resolution

• Smaller s will mean more steps to perform calculation, increase computation cost

• What type of window function should be used ?

Spectrogram of signal from cos0.wav

• Window size large, we can not detect frequency change in time

• Need to optimize the windowsize to achieve reasonable resolution both in freq. and time

• Freq. conversion: see slide #13 of Lec 05A

Sampling rate 8000 samples/sec, 9600 samples

window_size=4096, step_size=256 window_size=2048, step_size=256 window_size=512, step_size=256

window_size=256, step_size=128 window_size=128, step_size=64

Spectrogram of signal from cos1.wav

• Onset of a note tend to have frequencies across all spectrum• Reducing step size, better resolution in time, longer time to compute

Sampling rate 8000 samples/sec, 9600 samples

window_size=2048, step_size=256 window_size=256, step_size=32window_size=256, step_size=256

Spectrogram of signal from mystery1.wav

• Larger window size N to resolve frequency better• If already know whole frequency range, can zoom into certain frequency range to see better • Now we can read music using spectrogram!

window_size=1024, step_size=64window_size=512, step_size=128 window_size=1024, step_size=64

sample_rate=22050, 214987 samples

• Larger window size N to resolve frequency better to see the chords more clearly

window_size=512, step_size=128 window_size=1024, step_size=64 window_size=1024, step_size=64

window_size=1024, step_size=64 window_size=2048, step_size=64

further zoom in in frequency

• Now we can use this to analyze all kinds of waveforms!

window_size=2048, step_size=32

Zoom in both in freq. and in time

Other Spectrograms

• Spectrogram gives visual representation of the spectrum of frequencies contained in signal, as a function of time. A very useful tool in many fields!

fs=8000, short speech

fs=44100

fs=44100, piano music

STFT in Processing Streaming Signals

• Breaking a signal into pieces so that it can be processed in small chunks is especially important in streaming applications where the signal may be arbitrarily long.

General procedure: • Divide the input signal into a sequence of windows, each of length N. • Process each window. • Assemble the processed pieces together to form the output.

• How to do this? How to choose N (good resolution with high speed)?

• How to assemble them smoothly?

Summary• Short-Time Fourier Transform: analyzing a sequence of finite-length

portions of an input signal

➢Spectrogram: visual representation of the spectrum of frequencies of a signal as it varies with time

➢Processing Streaming signals: (See Recitation 9B)

6.003 signal processingof lec 05a sampling rate 8000 samples/sec, 9600 samples window_size=4096,...

Documents

kirill rogozhin intel - core(s) 1 2 4 6 8 12 18 >18 threads...

synology diskstation ds212/ds212+ · 2019. 9. 18. · max....

s29gl512n, s29gl256n, s29gl128n 512, 256, 128 mbit, 3 v

s25fl256l/s25fl128l, 256-mb (32-mb)/128-mb (16-mb), 3.0 v...

proportional valve group pvg 128 and 256 technical...

128-bit and 256-bit xop and fma4 instructions

produced by digestion igg · 3 3 3 6 6 12 12 12 16 16 256...

spoc: an authenticated cipher submission to the nist lwc...

s25fl128s/s25fl256s 128 mbit (16 mbyte)/256 mbit (32...

1 kbit (128 x 8), 2 at24cs01/at24cs02 kbit (256 x 8) ·...

pvg-ex 32/128/256 proportional valve group

the zuc-256 stream cipher · snow 3g-256 aes in 128-eea2 &...

proportional valve group · 2020. 5. 17. · pvg 128/256...

s25fl128s/s25fl256s, 128 mbit (16 mbyte)/256 mbit (32...

high-speed 2.5v 256/128/64k x 36 synchronous dual-port

lpc2212/2214 single-chip 16/32-bit arm microcontrollers;...

single channel, 128-/256-position, i2c/spi, nonvolatile...

new-pixelbook-comparison-10.02.17[1]...i5 or i7 processor...

lecture 8: deep neural network...

processingof* electrocataly1cdatav1.1 · processingof*...