eusipco 2012 - eurasip.org · geometric space-time audio processing a. sarti and f. antonacci...

134
EUSIPCO 2012 20th European Signal Processing Conference August 27-31, 2012, Bucharest Geometric Space-Time Audio Processing Augusto Sarti and Fabio Antonacci Image and Sound processing Group Dipartimento di Elettronica ed Informazione Politecnico di Milano, Italy

Upload: dinhnhi

Post on 17-Feb-2019

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

EUSIPCO 2012 20th European Signal Processing Conference

August 27-31, 2012, Bucharest

Geometric Space-Time Audio Processing

Augusto Sarti and Fabio Antonacci

Image and Sound processing Group

Dipartimento di Elettronica ed Informazione

Politecnico di Milano, Italy

Page 2: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

2 Outline (1/2)

• Background and motivations – Wavefields, Rays and Beams

– Plenacoustic function

– Physical issues

– Perceptual limitations

• Building blocks – Domain: ray space

– “Sharp” primitives and descriptors

– “Graded” primitives and descriptors

• Modeling of soundfields – Ray tracing: overview

– Beam tracing: overview

– Visibility tracing

– Notes on the extension to the 3D case

– Aerial view

Page 3: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

3 Outline (2/2)

• Measurements and images

– Acoustic measurements and corresponding constraints

– Acoustic images

– Plenacoustic images

• Applications

– Analysis • Estimation from acoustic measurements

• Estimation from acoustic and plenacoustic images

– Acoustic scene rendering/synthesis

• Conclusions and perspectives

Page 4: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

EUSIPCO 2012 20th European Signal Processing Conference

August 27-31, 2012, Bucharest

Part 1:

Background and motivations

Page 5: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

5 From wave equation to rays

• Generalized D’Alembert equation

This equation can be reformulated as a differential relationship btw time (L-tf variable s) and space (L-tf variables sj), which describes propagation

– Any function that complies with this relationship is a suitable candidate!

– The dimensionality of the parameter space of all possible propagation modes is far from being manageable

– We need structuring!!!

D’Alembert equation

),(),(,

, tux

btut

aji

i

j

i

ji

kk

k

k xx

ji

i

jji

k

k

k sbsa,

,

Page 6: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

6 From wave equation to rays

• Can we operate in a “white-box” fashion? – Including sources and environment in the modeling process

• How?

– Strong assumptions on propagation, sources, etc.

– Working with well-established geometric and algebraic tools

• Why?

– Divide and conquer approach

– If we do things well, the method will be sufficiently general and we will be able to cover a wide range of problems (calibration, structure from sound, simulation, rendering, etc.)

Issues

Page 7: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

7 From wave equation to rays

• Instead of implementing and modeling the PDE, we implement and model the general solution of the PDE

• In 1D this is quite straightforward

– Example: Digital WaveGuides [Smith1985,Smith1986,Smith1998] • Accommodate discontinuities in the medium and boundary conditions

(scattering)

• Boundary conditions become lego blocks for the implementation!

• The extension to 2D or 3D is all but trivial

Roadmap

Page 8: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

8 From wave equation to rays

• Helmholtz equation:

• General solution

• Eikonal equation:

Eikonal equation

0),(),(2

22 xx

P

cP

)(),()(),( xxx TjeASP

0)(

1)]([

2

2 x

xc

THp: • High frequencies • Non dispersive medium

Eikonal equation operates only on the direction , i.e. the direction orthogonal to the wavefront . This direction is referred to as Acoustic Ray

)(xT

)(xT

Page 9: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

9 From wave equation to rays

• Point-like source:

• Acoustic Impulse Response (AIR)

Eikonal equation: examples

S The wavefield emitted by a point-like

source is described through a plurality (i.e. infinite) rays departing

from the source location

S R

t

h(t)

Rays bounce according to the Snell’s law when they meet reflectors.

In the AIR each peak corresponds to an acoustic path linking source and

receiver

Page 10: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

10 From rays to beams

Non-planar wavefronts require a plurality of rays need compact representation of a bundle of rays

acoustic beams

S

An acoustic beam is made of all rays that originate from the same pt and meet the same (planar) obstacle along their

propagation

reflectors

beams

Page 11: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

11 Beam description

• Accommodates wave propagation in enclosures

– A source with a given radial pattern can be seen as producing a number of uniform beams (piecewise constant approximation)

– Every time a beam encounters a reflector along its propagation path, beam splitting occurs

– Inherently open-loop modeling of propagation

Pros

• General enough to accommodate spherical as well as planar wavefronts

• We can separate the geometric aspects of propagation from the modifications

that the signal undergoes during the propagation (attenuation, dispersion, etc.) o We will focus on geometry of propagation and worry about signal modifications

afterwards

Page 12: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

12 Beam representation

– Only interactions with the environment that preserve the point-like nature of sources (both real and virtual) can be readily modeled

• In order to accommodate diffusive surfaces, we need to introduce simplifications

– The representation based on rays is not accurate at low frequency, as diffraction enables the wavefield to turn around obstacles Geometrical Theory of Diffraction (GTD)

Cons

s

s’

s

r

$ s’!

Specular reflections Diffusive reflections

Page 13: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

13 Additional assumptions

• 2D or 2Dx1D (separable) wavefield

– Extension to 3D is possible

• The environment is made of uniform and piecewise linear reflectors

– Generalization to curved reflectors is possible through the use of piecewise linear reflectors

Page 14: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

14 Perceptual limitations

What comes at handy:

• Phase ignored at high frequency

• Space perception drops at low frequency (wavefield is the same for both ears)

• Limited sensitivity outside vocal bandwidth

• Space perception limited to a fairly narrow bandwidth

Page 15: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

15 Plenacoustic function

• The Plenoptic function [Adelson1991] is a seven-dimensional function f(x,y,z,q,W,l,t), which describes the intensity of the light field seen at location (x,y,z) when looking in direction (q,W)

– Audio wavelengths are 106 times larger than light wavelengths difficult to construct a device that measures the sound pressure at any point along with its gradient

• The Plenacoustic function [Ajdler2003][Ajdler2006] is the four dimensional function p(x,y,z,t) which describes the sound pressure at location (x,y,z) and time t given an acoustic event in a room

– Knowing the plenacoustic function means knowing the soundfield at any possible location of the microphone

– Applications [Ajdler2006a]: • Head Related Transfer Function (HRTF) interpolation

• Interpolation of the Acoustic Impulse Response along a trajectory

• Modeling of the Room Impulse Response along an arbitrary trajectory

• …

Page 16: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

16 Plenacoustic function

• We want to work with rays, therefore we need to recuperate info on direction

• We CAN extract this information (field along a direction) using arrays of mictophones (with limits due to the aperture problem)

• If we keep track of these limits, we can go back to the equivalency btw plenoptic and plenacoustic function definition

• The 2D plenacoustic function p(x,y,q,t) is the soundfield measured at location (x,y) at time t and travelling in direction q

– This representation is inherently based on rays it is convenient to adopt a representation for which rays appear as an elementary primitive

– Extension to 3D is possible (see later on)

What geometry should we use?

Page 17: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

17 What geometry?

• Needs

– Beams (and rays) originate from points

– We need to treat planar and spherical wavefronts in a similar fashion

– We need to simplify the modeling of reflections on planar surfaces

• A natural choice: projective geometry (and beyond...)

– A “point” in P2 is a ray

– A planar wavefront originates from a source at infinity (in homogeneous coords it is a point like any other)

– A reflection on a planar surface is a homography (linear operation) Adopting projective geometry means assuming that all rays meet in some center of accumulation (projection center). Whenever this center changes (e.g. through reflection) we need to devise a projective transformation that accommodates such a change

– Extension with oriented projective geometry

Page 18: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

EUSIPCO 2012 20th European Signal Processing Conference

August 27-31, 2012, Bucharest

Part 2:

Building blocks

Page 19: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

19 Domain: ray space

• Acoustic ray

– In 2D a ray is characterized by three parameters. 2 possible representations:

• a point (that the ray passes through) and a direction

• the parameters of the line that the ray lies on

– We adopt the second parameterization:

– Note: rays defined in this way are also characterized by an orientation

– Projective geometry P2 does not account for the orientation

Ray characterization

],,[ 321 llll

Ray travelling from A to B Ray travelling from B to A

Oriented Projective Geometry (OPG) T2 [Stolfi87] is needed

xA

xB xB

xA

Page 20: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

20 Domain: ray space

• Given a line with parameters , all the points that lie on it must satisfy

• A ray is a point in the oriented projective space T2

• Its coordinates form a class of equivalence: – are all the same ray

– are all the same ray, with opposite orientation to the set

Definition

Tlll ],,[ 321

0,],,[ 321 kklklkl T

0,],,[ 321 kklklkl T

0k

l1

l2

l3(l1,l2,l3)

The Euclidean space spanned by such homogeneous coordinates of lines is called ray space

The acoustic ray is represented in the ray space by a half-line line with k>0

0,],,[ 321 kklklkl T

Tlll ],,[ 321

032211 lxlxl

Tlll ],,[ 321

Page 21: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

• Acoustic source in the Euclidean space:

• Homogeneous coordinates corresponding to :

• This point lies on the line l=[l1,l2,l3]T iff xTl=0.

• If x lies in the intersection between l1 and l2, it will also lie in the

intersection between kl1 and kl2 (k>0 or k<0)

therefore kx will be the same point as x

• This means that x is homogeneous as well

21 Geometric primitives Point-like sources and receivers

T

sss xx ],[ 21x

sx T

sss xx ]1,,[ 21x

( )( )

( )( ) 0

0

0

0

2

1

2

1

lx

lx

lx

lxT

T

T

T

k

k

k

k

Page 22: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

• A point is identified by the set of all rays that pass through it.

• In the ray space this set of lines corresponds to a plane passing through the origin, whose orthogonal is [x1,x2,1]T

• The plane [x1,x2,1]T divides the ray space in two half-spaces:

0lxT

s

0lxT

s

− lines going counterclockwise around the source

− lines going clockwise around the source

Geometric primitives Point-like sources and receivers 22

l1

l2

l3

(0,0,0)

P*

x

y

P(x,y)

Page 23: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Geometric primitives

• In the 2D Euclidian space the oriented (bounded) reflector is defined by its endpoints xA and xB

• The reflector divides the Euclidean space in two half-spaces: points lying at the left and at the right of the reflector.

• In the ray space the oriented reflector is defined by all the rays that intersect the reflector and travel towards one of the two half-spaces, as specified by the orientation of the reflector.

Bounded reflectors 23

An oriented reflector (AB) corresponds in the ray space to all the rays intermediate between the

planes A* and B* and with the same orientation of xA-xB

Page 24: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Geometric primitives

• A non-oriented reflector becomes a pair of oppositely oriented lines in T2

• An infinite non-oriented reflector is a line whose endpoints are at infinity

• All the rays in the geometric space cross an infinite reflector

Infinite Non-oriented reflector 24

an infinite non-oriented reflector is mapped onto the whole ray space

Page 25: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Geometric primitives

• For a compact visualization the “reduced” ray space is used

• The “reduced” ray space is obtained by intersecting the ray space with a plane, e.g. l3=1.

“Reduced” ray-space 25

Bounded reflector

Ray space Reduced ray space

Point

The point is represented by a line in the reduced ray space

The reflector is represented by two oppositely oriented beams in the reduced ray space

Page 26: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Geometric primitives

• An acoustic beam is the set of rays with origin xS and directions intermediate between lA and lB

• In the ray space:

– xS corresponds in the ray space to the plane with parameters xS

– lA and lB are lines both lying on the plane xS as they pass through xS .

Acoustic beams 26

xS

lA

lB

xS

lA

lB

The beam in the ray space corresponds to a portion of the plane xS limited by the directions lA and lB

l2

l1

l3

l1

l2 lA

lB

Ray space

“Reduced” ray space

Page 27: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Duality btw. Geometric and Ray spaces

Summary of representations in geometric and ray spaces

27

Geometric space Reduced ray space

Ray

Point

Oriented reflector

Beam

x1

x2

x1

x2

x1

x2

x1

x2

l1

l2

l1

l2

l1

l2

l1

l2

Page 28: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Duality btw. Geometric and Ray spaces

• A duality relationships exists between geometric and ray space:

– A point becomes a line

– A line becomes a point

– An (oriented) beam becomes an oriented segment

– A segment becomes an (oriented) beam

28

In the literature the ray space is also referred as dual space [Antonacci2008]

Page 29: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Visibility region

• Visibility region:

– Defines if (or what portion of) a reflector is visible from another reflector.

– In other words: which rays departing from reflector AC will hit reflector CD?

Non-oriented Reflector to reflector visibility 29

The visibility region V(RCD) of the reflector RCD as seen from the reflector RAB is the intersection of the duals of RAB and RCD

Geometric space Ray space

Page 30: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Visibility region

• For oriented reflectors, the visibility region is the intersection of the duals of the oriented reflectors

Oriented Reflector to Reflector visibility 30

All rays originated from a reflector RAB falling onto reflector RCD form the visibility region of RCD from RAB

Page 31: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Visibility diagram

• Visibility diagram (visibility from a reflector):

– Is a function that defines on which reflector (if any) a ray departing from reflector RAB falls

– In other words: given a ray l departing from RAB , which oriented reflector meets, if any, along its propagation?

Visibility from a reflector 31

The visibility diagram D(Ri) is the overlay of visibility regions V(R) of all the reflectors in the environment, according to front-to-back

order of reflectors

Page 32: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Visibility diagram Visibility from a reflector 32

Example 2

Example 1

Page 33: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Global visibility

• Global visibility:

– Defines the mutual visibility among reflectors

– The global visibility is used in the modeling of propagation, as it speeds up the evaluation of the visibility from a point (tracing of acoustic beams)…see next slides

Visibility of the whole environment 33

The global visibility is the collection of the visibility diagrams for all the reflectors present in the environment

Page 34: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Point to window visibility

• Given an acoustic beam that departs from a reflector Ri (i.e. it is originated from a reflection), what can we see of the real world from the position S’ behind Ri?

• How can we swiftly extract this information from the visibility diagram D(Ri) ?

34

The visibility from S’ corresponds to the lookup of the visibility diagram D(Ri) along the line S’*, representation of S’

Geometric space Reduced ray space

Note: what we see from behind the reflector is what we see from the surface of the reflector Lumigraph [Levoy1996]

Page 35: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Point to point visibility

• Once the location of a source S’ behind a reflector and the position of a receiver R are specified, which is the ray that goes from S’ to R?

• How can we swiftly extract this information from the visibility diagram?

35

The visibility from S’ to R is the intersection between S’* and R*, representations of S’ and R on the visibility diagram D(Ri).

R*

Geometric space Reduced ray space

Note: pt. to window and pt. to pt. visibility are used for modeling the sound propagation in enclosures (later on…)

Page 36: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

“Graded” primitives

• Until now:

– Point-like sources and receivers represented in the ray space as planes

– Reflectors are represented as beams in the ray space

– Beams as intersections of the duals of sources and reflectors (i.e. segments in the reduced ray space)

• But…something is missing

– Sources, receivers and reflectors are “omnidirectional”

– Next step: assigning directive patterns to the geometric primitives

• From a different viewpoint:

– The ray space is the domain of the plenacoustic function, the directivity information of the geometric primitives constitutes the co-domain.

36

Page 37: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Source radial pattern

• Acoustic source in the ray space: xs ↔l*s

• The line l*s defines the domain on which the source radial pattern is

mapped to the (reduced) ray space.

37

Given an acoustic source source at xs=[x1s,x2s]T, its beampattern f(q) is

mapped in the reduced ray space to f(l1*, l2

*), with

ss

ss

xxl

xxl

21

*

2

12

*

1

)tan(

1

)tan(

)tan(

q

q

q

)1

arctan(

)tan(

*

1

*

1

*

1

*

2

s

s

xl

yl

ll

q

q

Analogous relationships hold for microphone sensitivity radial patterns

Page 38: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Acoustic camera

• An acoustic camera is a device that captures all acoustic rays that pass through its optical center xc.

• Example: the optical center xc in an ordinary microphone array is the central microphone

• Representation in the reduced ray space: the line lc*

corresponding to the optical center of the array

• The acoustic image is mapped on lc* as for the acoustic sources

and receivers

From geometric to ray space 38

Example in 3D

Page 39: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Acoustic camera

• Limited information about the acoustic scene can be extracted from the acoustic camera.

– Reason: the acoustic environment is observed along the line lc* , which

is a 1D subspace of the two-dimensional reduced ray-space

• Idea: observe a two-dimensional portion of the reduced ray space to extract more information

– Such device should be able to read the intensity of all rays passing through it, not only those passing through the optical center

Limitations 39

Plenacoustic camera

Page 40: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Ideal plenacoustic camera

• The ideal plenacoustic camera is a device that is able to capture all the rays that pass through it all points on the device are optical centers

• Example:

Definition and representation 40

• A plenacoustic camera CD • An acoustic source in S

• Reflector AB. • Along with the real source, also rays coming from the image source S’ are acquired • The camera acquires

oAll the rays coming from S oAll the rays departing from S and reflected by AB, which can be modeled as departing from S’

• The plenacoustic image is mapped on the ray space

Geometric space

Page 41: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Ideal plenacoustic camera

• The ideal plenacoustic camera is a device that is able to capture all the rays that pass through it all points on the array are optical centers

• Example:

Definition and representation 41

Geometric space Reduced ray space

Page 42: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Ideal plenacoustic camera Definition and representation 42

Information contained in the plenacoustic image: o A segment on the line S, representation of S in the ray space

The segment is determined by the intersection btw. the region of the ray space visible from the plenacoustic camera (rays are bound to pass through the plenacoustic camera)

o A segment on the line S’, representation of S’ in the ray space

The segment is determined by the intersection btw. the region of the ray space visible from the plenacoustic camera and the visibility region of the reflector AB (rays are bound to pass through the region visible from the camera AND through the reflector AB)

Application: through the analysis of the plenacoustic imge it is possible to localize S and S’

Page 43: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Real plenacoustic camera

• A device with infinite optical centers is unfeasible.

• Real plenacoustic camera [Markovic2012]: a linear microphone array on which a finite number of optical centers are defined

• Implementation:

43

• The array is organized in sub-arrays.

•Each sub-array is composed by N>2 microphones •A sliding window selects microphones in each sub-array

• An acoustic image is acquired for each sub-array.

• The optical center of each acoustic image lies in the center of the sub-array.

Page 44: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Real plenacoustic camera

• Example:

• Each sub-array is composed by a limited number of microphones and its extension is limited acoustic images are blurred and exhibit a limited angular resolution

• The microphone array has a limited size only a portion of the whole ray space is observed.

Example and Limitations 44

Page 45: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Real plenacoustic camera

• Minimum angular distance between two acoustic sources to be observed as separate primitives in the acoustic image (and therefore in the plenacoustic image)

Finite resolution 45

))(cos(||||2

||||||min

'2'1

2

'1'2

2

'2

2

'1 q

mpmp

ppmpmp

ss

ssss

According to the specific technique adopted for the computation of the acoustic image (MVDR, Music, …) a different resolution min can be achieved. Two sources ps1’ and ps2’ must honor in order to appear as distinct in the plenacoustic image

Page 46: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

EUSIPCO 2012 20th European Signal Processing Conference

August 27-31, 2012, Bucharest

Part 3:

Modeling of soundfields

Page 47: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Modeling propagation

• Goal: given

– the geometry of the environment

– the location of the source and of the receiver

provide an estimate of the Acoustic Impulse Response btw. source and receiver

• Complex task using FEM or BEM approaches.

• Approximate solutions come from geometrical acoustics.

• Some techniques:

– Ray tracing [Krokstad1968], [Rindel1995], [Naylor1992]

– Image source [Allen1979], [Kirzenstein1984], [Huopaniemi 1997]

– Beam Tracing [Funkhouser1998], [Laine2009], [Dadoun1985]

– Visibility-based beam tracing [Antonacci2008, Markovic2010]

47

Page 48: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Ray tracing 48

Steps:

1. Cast a finite number of acoustic rays from the source 2. Propagate through the environment 3. Interact with objects in the environment

Determine the first intersection of the ray with the scene

surface (ray shooting ) Reflect rays Some sound is absorbed (absorption coefficient depends on material)

Page 49: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Ray tracing Implementation 49 49

Steps:

1. Cast a finite number of acoustic rays from the source 2. Propagate through the environment 3. Interact with objects in the environment

Delay:

Attenuation:

Room impulse response

Anechoic recording or synthetized signal

Output sound

d

c1

d

*

Page 50: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Ray tracing Pros and cons 50

Disadvantages:

Inefficient Suffers from aliasing New tracing whenever either the source or the receiver

moves

Advantages:

Simple to implement; can model specular reflections, diffusion and diffraction; models all types of surfaces (e.g. curved surfaces)

Page 51: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Image source

• The acoustic source is interactively mirrored to the other side of all reflectors that constitute the environment (virtual sources)

• All specular paths up to the desired reflection order are found

• Extremely efficient for simple, rectangular rooms

Implementation 51

Image sources

Page 52: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Image source

• Generates approximately n(n-1)r-1 virtual sources, where n is the number of reflectors and r the reflection order

• A big number of computed virtual sources are actually invisible, either from a reflected source (i.e. do not generate any acoustic field – physically invalid specular reflection) or the receiver

Cons 52

Page 53: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Image source

• All virtual sources should be tested for occlusion by all potential occludes – O(nr) visibility checks in arbitrary environments

Cons 53

Disadvantages: Only specular reflections Computationally demanding in

complex environments Dependence on source and

receiver positions

Advantages: Efficient in rectangular rooms Finds all specular reflections

Page 54: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Beam tracing

• Takes advantage of spatial coherence – groups rays in beams

• The interactive mirroring of the source is done only with respect to reflectors that beam encounters during the propagation

• Beams are organized in trees according to the splitting hierarchy

• Path tracing (finding paths between source and receiver) becomes an efficient process (lookup of the beam tree data structure)

54

Page 55: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Beam tracing

• Advantages:

– Good accuracy

– No need to recalculate the beam tree if receiver moves -> enables real-time rendering when receiver is moving (with static source)

– All paths between source and receiver are found (unlike ray tracing)

– Finds a minimum number of virtual sources (unlike image source method) – beam tracing automatically eliminates occluded paths (virtual sources that are never visible) during the construction of the beam tree

• Disadvantages:

– Moving the source requires the construction of a new beam tree which is generally a computationally demanding task

– Complex to implement

– In its initial implementation only specular reflections are modeled

Pros and cons 55

Page 56: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Visibility-based beam tracing (2D)

Goal: overcome the limitations of beam tracing, speeding up the computation of the beams

56

Iterative procedure For each beam that falls onto a reflector: 1.compute the reflected bundle of rays:

2.split the reflected bundle of rays into sub-beams (intersect its ray space representation with reflector’s visibility diagram); 3. add new beams to the beam tree data structure; 4.repeat the procedure for each beam that falls onto a reflector (the recursive procedure stops when the preassigned order of reflection is reached or when the beams die out).

3 33 1 3 1' ABR

S M S

( ) ( )1

3 1 3 3 3 3 3 1' det AB AB

T

R R

l M M l

Page 57: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

2D versus 3D

2D:

• In 2D there are two degrees of

freedom in the description of a line;

• A line in 2D is a hyper-plane;

57

3D:

• In 3D there are four degrees of freedom in the description of a line (increase is two, not one);

• A 3D line does not separate two half-spaces as in 2D;

• Visual events are complex in 3D.

Page 58: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Discontinuity surfaces 58

Discontinuity surfaces:

2D [Teller1992]

3D [Drettakis1994]

2D versus 3D

Page 59: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Extension to 3D Representation

Extension to 3D

• a line in 2D has 2 DOF; a line in 3D has 4 DOF ; • unlike a line in 2D, a line in 3D does not separate two half-spaces; • visual events are complex in 3D.

five-dimensional oriented projective space

Plucker coordinates of a oriented line l passing through points A and B:

6 1

B - Al

A×B

Not every point in corresponds to ray in 3D - only those points that lay on a four-dimensional ruled surface known as Plucker quadric:

0 T l Ql

3

3

0 IQ

I 0

5P

5P

Page 60: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Beam tracing in 3D

Visibility regions: Only events on the surface of the Plucker quadric have to be taken into account while evaluating the mutual visibility between reflectors.

Visibility evaluation is no more a linear program. A new, dedicated, algorithm has to be developed.

Beam Tracing (visibility from point evaluation): • similar procedure to 2D case; • increase in dimension is exactly one (2 dof); • no intersection with the quadric is needed.

The beam tracing procedure has a reasonable computational burden that allows efficient tracing of acoustic beams.

Page 61: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Beam tracing in 3D

DEMO

61

Page 62: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

EUSIPCO 2012 20th European Signal Processing Conference

August 27-31, 2012, Bucharest

Part 4:

Measurements and images

Page 63: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Acoustic measurements and images

• An acoustic system can extract:

– Acoustic measurements: Time Of Arrival, Time Difference Of Arrival, Direction Of Arrival,…

– Acoustic images, images of acoustic events extracted from traditional acoustic cameras

– Plenacoustic images, extracted from plenacoustic cameras

• We aim at

– finding a convenient representation in the ray space for measurements and images;

– estimate primitives in the acoustic scene (source localization, reflector localization, self-calibration of microphone arrays,…).

63

Page 64: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Acoustic measurements

• Examples of measurements that concern the propagation are – Time of Arrival (TOA)

– Time Difference of Arrival (TDOA)

depending on whether we have control over the source (sync)

• Such measurements may refer to – a direct acoustic path

– an indirect (specularly reflected) acoustic path

• A measurement carries informaton that can be turned into a constraint – This constraint will be useful for characterizing primitives only if it is expressed

in terms of the primitives themselves

Categorization 64

Page 65: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Time Of Arrival on direct path

• HP – Visibility between source and receiver

– First arrival corresponds to direct path from source to receiver

(source beam directly illuminates receiver)

• Given the TOA t and the speed of propagation c the source is bound to lie on a circle (or a sphere) of radius tc centered on the receiver

In the geometric domain 65

M

S

t c

Page 66: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Time Of Arrival on direct path

• The point xP2 corresponds to the location of the source iff

where C is the matrix representation of a circle (a=c, b=0) centered in the receiver [Contini2012]

• The (scalable) parameters (a, d, e, f) can be readily derived from the measured TOA, given the mic location

• x can be estimated from multiple measurements

Similar expressions can be derived for the location of the receiver, given the location of the source

In the ray space 66

fed

ecb

dba

x

xT

2/2/

2/2/

2/2/

1

,0 2

1

CxCxx

Page 67: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Time Of Arrival on indirect path

• Total TOA (time of flight)

• What constraints do we derive on the geometric primitives?

In the geometric domain 67

PMSP ttt

SPt PMt

M

S

P

t

Page 68: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Time Of Arrival on indirect path

• The point xP2 is a candidate reflection point for the acoustic path iff

i.e. x must lie on an ellipse whose foci are source and receiver, respectively

• The parameters can be easily obtained from the knowledge of the microphone position and of the TOA

In the ray space 68

fed

ecb

dba

x

x

xT

2/2/

2/2/

2/2/

,0

3

2

1

CxCxx

Page 69: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Time Of Arrival on indirect path

• The previous constraint not yet involves the geometric primitives of interest, as P (of hom. coordinates x) is just a reflection point of the reflector

• In order to express it directly in terms of the reflector l we need to rewrite the constraint in its adjoint form

where l are the hom. coords. of the line that the reflector lies on, and C* is the adjoint matrix of the ellipse C

In the ray space 69

0* lClT

11* )det( CCCC

MS

Pl

References: [Antonacci2010], [Antonacci2012], [Canclini2011],[Filos2010]

Page 70: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

TOA on indirect paths

• Note: the constraint derived by the TOA measured on an indirect path can be used for localizing reflectors (environment inference): we will see an example later

In the ray space 70

Page 71: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

TDOA on direct paths

• HP – Multiple receivers

– Visibility between source and receivers

– First arrivals correspond to direct path from source to receivers

(source beams directly illuminate the receivers)

• We have no information on the source time of emission, therefore all that we can measure are Time Differences Of Arrival (TDOAs)

71

21 ttt

S

1R

2R2t

1t t

Page 72: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

TDOA on direct paths

• Given t, R1 and R2, the source S is bound to lie on a hyperbola with foci in R1 and R2

72

S

1R

2R2t

1t t

Page 73: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

• The point of homogeneous coords x corresponds to the location of the source iff

where C is the matrix representation of a hyperbola det(C’)<0 with foci in R1 and R2

• The (scalable) parameters (a, b, c, d, e, f) can be readily derived from the measured TDOA, given the location of the mics

• x can be estimated from multiple measurements

TDOA on direct paths 73

fed

ecb

dba

x

x

xT

2/2/

2/2/

2/2/

,0

3

2

1

CxCxx

Page 74: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

TDOA on indirect paths

• Combination of two constraints

– Homography (linear projective mapping) that embodies the specular reflection

– Constraint obtained from TDOA on direct path from x’ to receivers

74

1

22

0

nnnIH

Hxx'

dT

'

'

'

' ,0''

3

2

1

x

x

xT xCxx

x

x’

d n

R1 R2

Page 75: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

TDOA on indirect paths

• The information on the reflector is carried by the homography H

• If we want to nail the reflector given x and the location of the receivers, we need to combine the above constraints wisely

75

Page 76: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

DOA on indirect paths

• Consider having a compact array that acquires the signal from the acoustic source xs along with the source from the image xs’

• Geometry:

• If we make an hypothesis on xs , the line l is the axis of the segment xs x’s and the triangle xp xs x’s is isosceles.

Definitions 76

xs : real source r: line on which xs lies q: DOA of xs x’s : image source r’: line on which x’s lies q’: DOA of x’s

xr:reference point of the array

Page 77: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

DOA on indirect paths

• Focal property: the tangent line at xP is the bisector of the angle formed by the line joining xS and xP and the perpendicular to r’ through xP

Constraint in the geometric space 77

xS

x

l

l is bound to be tangential to the parabola with focus in xS and directrix r’ [Canclini 2011a]

Same kind of constraint obtained for TOA on indirect paths

Page 78: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Summary of constraints 78

Direct Applications:source loc. and self-calibration

Indirect Applications: reflector localization

TOA

TDOA

DOA

R S

tc 0Cxx

TConstraint

Hp: Source-receiver synchronization

R

S

Hp: Source-receiver synchronization

0* lClT

Constraint

R1

R2

0CxxT

Constraint

Hp: Synchronization among receivers

x

x

d n R1

R2

Hp: Synchronization among receivers

M

S

0CHxHxTT

Constraint Line

Constraint

0* lClT

Constraint

Hp: Sources in the farfield

Hp: Sources in the farfield

S

r’

Page 79: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Acoustic images

• An acoustic image is an estimate of the distribution of the power incoming at a microphone array from different angles of arrival.

• Several techniques in the literature:

– Delay and sum Beamformer

– Minimum Variance Distortionless Response Beamformer

– MUltiple SIgnal Classification, etc.

• Example of acoustic image (acquired in a 3D geometry)

79

In the context of geometrical ST audio processing acoustic images are used for (see later on): -self-calibration of microphone arrays -estimation of the reflective properties of walls

Page 80: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Plenacoustic images

• Result from a combination of multiple acoustic images, which are mapped to the ray space. Each acoustic image is acquired by a subset of the microphones in the array [Markovic2012].

• In the context of geometrical acoustics plenacoustic images are used for:

– Source localization

– Reflector localization (along with its endpoints)

– …

80

Page 81: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

EUSIPCO 2012 20th European Signal Processing Conference

August 27-31, 2012, Bucharest

Part 5:

Applications

Page 82: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Analysis and synthesis

• In the context of space-time audio processing geometrical acoustics is beneficial for

– Soundfield analysis: analysis of the acoustic scene

– Soundfield synthesis: rendering through loudspeaker arrays

– Modeling of the soundfield: tracing of the visibility in the ray space

– Mixed scenarios : e.g. applications where a preliminary identification of the acoustic scene enables an improved rendering

82

Page 83: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Soundfield analysis

• Goals

– Source localization;

– Self-calibration of microphone arrays;

– Localization of reflectors in the environment;

– Estimation of the reflective properties of walls

• The estimation of acoustic and geometric properties of the environment is accomplished through

– Joint analysis of multiple measurements (TOAs, TDOAs, DOAs);

– Processing on acoustic and plenacoustic images.

Goals and contextualization 83

Page 84: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Self-calibration

• Given TOA measurements at each sensor we want to infer the position of the microphone array [Contini2012]

• In case of unknown array geometry, we need both intrinsic and extrinsic array calibration

• In case of known array geometry, we only need extrinsic calibration (estimate array position and orientation)

Using TOAs 84

• Each TOA tmn

generates a constraint on the mic position xm

• xm can be estimated from (at least 3) constraints tmn, n=1,…,N

S1

S2

Sn

SN

Rm

tmn

R1

R2

RM

0

0

0

2

1

mNmmT

mmmT

mmmT

xCx

xCx

xCx

N

n

mnm

T

mmJ1

2)()( xCxx

Cost function

Page 85: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Self-calibration

• In order to obtain a constraint that acts directly on the rotation and translation of the whole array, we need to nest the TOA-related constraint tmn with that associated to the homography H

Using TOAs 85

Constraint on mic position

Merge the homography

Solve for H to obtain rotation matrix R and translation vector t

Nn

Mmmmnm

T

,...,1

,...,1,0''

xCx

0

0

0

0

1121

1111

MMN

TM

T

mmn

Tm

T

TT

TT

HxCHx

HxCHx

HxCHx

HxCHx

Page 86: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Self-calibration

• Goal: estimate position and orientation of multiple arrays (extrinsic calibration) from individual acoustic images [Redondi2009]

Using DOAs (farfield) 86

We can use TDOA-based constraints on the acoustic images for robust peak localization

Conventional stereo-imaging constraints (epipolar geometry) can now be used for array extrinsic calibration

B

A

X X’

R, t

Page 87: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Self-calibration from TDOAs

• Multiple array geometry

– Relationship between x and x’ through the fundamental matrix F

– The acquisition of N>7 acoustic images enables estimation of R and t

(up to a scaling factor) • Robust estimation methods (e.g. RanSaC) can be adopted for more accurate results

Using DOAs (farfield) 87

0Fxx'T

Demo Calibration error as a function of the number of sources

Page 88: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Self-calibration

• From L>=5 correspondences Xl X’l

we find the homography that best fuses the coordinate system of the first acoustic camera with that of the second one

• Mutual positions estimated through Rigid Body Motion [Valente2010 ]or Maximum Likelihood analysis [Valente2010a] on the set of correspondences Xl X’l

Using TDOAs (near field) 88

Two (or more) arrays acquire volumetric acoustic maps and localize the same source in space in their local coordinates systems, Xl and X’l.

Page 89: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Self-calibration

• Comparison of the accuracy of the estimation for different source distances for Rigid Body Motion (RBM), Maximum Likelihood Estimation (MLE) and DOA-based technique

Using TDOAs (near-field) 89

Setup Results

Maximum likelihood analysis allows an accurate estimation in both the near- and in the far-field

Page 90: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Source localization

• A step backward: redefining the geometric space

Using TDOAs 90

Representation adopted: (x,y) coordinates (x,y,z) coordinates z: distance covered by the wavefront from the source position

Source localization: finding the apex of the cone, given a set of noisy measurements, possibly coming from unsynchronized arrays [Compagnoni2012].

Metrics

Cost function:

N

i

aeaeJ1

/2

/

222 )()()( ssse zzyyxx

)()()( 22

sssa zzyyxx

Cone equation

Cone aperture

Page 91: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Source localization

• Setup – 81 source positions on a square grid.

– 1000 realizations

– RMS localization error, expressed in meters.

Using TDOAs 91

Cone equation Cone aperture SRD-LS [Beck2008]

Page 92: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

The line parameters of the reflector are found as the global minimum of the cost function, i.e.

Inference of the geometry

• When multiple TOA measurements are conducted on indirect paths coming from the same reflector, we can reconstruct its location by finding the common tangent to all the ellipses, which corresponds to solve the system

Based on TOAs 92

0

...

0

0

*

2*

1*

lCl

lCl

lCl

NT

T

T

N

i

iTJ

1

2* )()( lCllCost function

)(minargˆ lll

J

Page 93: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Inference of the geometry

Using TOAs 93

• If l is a minimum of J(l) , then also kl is mimimum need of de-homogenization to obtain a single global minimum

• Without loss of generality the global minimum is found by minimizing the cost function on the reduced ray spaces l1=1 and l3=1

Page 94: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Inference of the geometry

• When only TDOAs are available (i.e. use of uncontrolled acoustic sources), the geometry of the environment is estimated through a three-steps procedure [Filos2010]:

1. TDOAs related to the direct path are used to localize the acoustic source

2. TDOAs related to indirect path are converted into TOAs once the location of the acoustic source is available.

3. Reflectors are localized by combining the resulting TOAs

Using TDOAs 94

Page 95: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Inference of the geometry

• Extension to multiple reflectors: Hough transform [Antonacci2012, Filos2011]

Hough transform 95

Accuracy on a per-wall basis as a function of the number of sources used for the estimation of the TOAs

Page 96: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Inference of the geometry

• DOAs related to reflective paths constrain the reflector to be tangent to parabolas, specified by the location of the acoustic source and by the

measured DOA [Canclini2011].

• When the source moves in space, the same reflector is observed from different viewpointsthe reflector is the common tangent to all the parabolas

Using DOAs 96

Hand-measured reflector line

Estimated reflector line

Parabolas associated to DOAs

Microphone array

Loudspeaker

Page 97: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Inference of the geometry

• The reflector line is estimated from the combination of multiple constraints, as for the TOA case [Canclini2011].

• Average reflector location error (angle and distance) as a function of the number of DOAs:

Using DOAs 97

An accurate localization of the reflector is possible even for N=4 sources

Note: extension to multiple reflectors is possible using the Hough transform

Page 98: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Estimation of the reflective properties of walls

• Goal: estimate the reflection coefficient of the walls in an enclosure

• Idea: – model the acoustic propagation and generation of the spatial pseudospectrum

– only unknowns are the reflection coefficients

– match the simulated pseudospectrum with the measured one acquired by a microphone array

98

Assumptions: • room geometry is known (or

estimated in a preliminary step) • Source and array locations are known

(or estimated) • Directive properties of the

microphone array are known

Data model

Page 99: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Estimation of the reflective properties of walls

Step 1: matching of the pseudospectra

99

Model

Measured

Page 100: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Estimation of the reflective properties of walls

Step 1: matching of the pseudospectra • q: amplitude factor of the pseudospectrum and depends on the reflection coefficient of

the walls unknown

• M: models the contributions to the pseudospetrum, up to a scale factor it is known with the a-priori information

100

Page 101: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Estimation of the reflective properties of walls

Step 2: estimation of the reflection coefficient

In each element of q the reflection coefficients are in a multiplicative relationship the logarithm of q is taken:

101

Example

Page 102: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Estimation of the reflective properties of walls

Step 2: estimation of the reflection coefficients

102

Least-squares solution:

counts the number of occurrences of variables in m for each of the components of q

= Reflection coefficients:

Page 103: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Estimation of the reflective properties of walls 103

Experimental setup Matching of the pseudospectra

wooden panels

sem

i-a

nec

ho

ic r

oo

m

Experimental results

Page 104: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Estimation of the reflective properties of walls 104

Sub-band reflection coefficient estimates

Typical reflection coefficients for wooden panels

Page 105: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Scene inference from plenacoustic images

• Source and reflector localization from plenacoustic images [Markovic2012]

105

Page 106: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Scene inference from plenacoustic images

• Source localization

106

Source localization error: below 3 cm for both real and image sources

Page 107: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Scene inference from plenacoustic images

• Room estimation

107

Page 108: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Scene inference from plenacoustic images 108

Estimated walls

Real walls

Wall with finite extension

Microphone array

Page 109: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Soundfield synthesis

• Goals:

– Render a virtual acoustic source using a loudspeaker array

– Render the acoustics of a virtual environment

– Compensate for the early reflections of the environment in which the loudspeaker array is operating (“room compensation”)

– Exploit the early reflections of the real environment, e.g. for a virtual home theater system

109

Page 110: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Beamshaping Rendering virtual acoustic sources 110

Goal:

Propagation matrix

Desired response

LS solution

• Rendering of a virtual source along with its radiation pattern in a listening region, by means of a loudspeaker array [Antonacci2009]

• We focus on directional sources, modeled as beams; we need to control:

• Source position

• Direction of emission

• Beam aperture Formulation as an inverse problem:

virtual source

loudspeakers

listening region

Page 111: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Beamshaping Rendering virtual acoustic sources 111

Why a LS solution (and not conventional beamforming)?

• no constraints on loudspeaker and control point positions

• easy for handling near-field

• better control of parameters

• efficiency: propagation matrix and its (pseudo) inverse stored in advance

Fact: conditioning of could be a problem, especially at low frequencies

• SVD regularization:

• SVD is preferable w.r.t. other methods (e.g., Tikhonov):

• More controllable

• Its solution depends only on

)( GGH

rUDVh

UDVG

H

H

1~ˆ

G

)0,0,/1,/1(diag~

1

1 K D

Page 112: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Virtual environment rendering 112

Virtual acoustics: rendering the effect of a source in a virtual environment (in free-field) [Markovic 2010] Assumption: a complex acoustic wave field can be modeled as the superposition of elementary beams (geometrical acoustics)

Example: simple environment

Idea: use loudspeakers for synthesizing elementary beams, and obtain the complex wave field through superposition

The source in the room generates a set of virtual image sources according to the geometry of the environment we want to render

Page 113: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Virtual environment rendering 113

What if the environment is more complex?

• Image sources are suitable only in simple convex environments

• Visibility tests are very time demanding

• A convenient solution is beam tracing, which compute visibility very efficiently

Page 114: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Virtual environment rendering 114

Examples

NMSE (%):

500 Hz 7.4 %

1000 Hz 7.9 %

NMSE (%):

500 Hz 5.1 %

1000 Hz 5.3 %

Page 115: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Room compensation Contextualization 115

Fact: from analysis methodologies we can extract the room geometry

Idea: we use again the beam tracing engine for predicting the room reverberation (at least early reflections)

• Using reflective TOAs

• Using reflective DOAs

Page 116: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Room compensation 116 Early reflections modeling

early reflections

room compensated beam-shaping:

We now consider the rendering system operating in an arbitrary room:

• Use beam-tracing for computing the set of image loudspeakers visible at control points

• Modify Green’s functions as

• The propagation matrix becomes

Page 117: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Room compensation 117

Desired wave field Free-field beam shaping

Non-compensated beam shaping

Room-compensated beam shaping

Normalized MSE at a reference

point •The non-compensated response exhibits relevant peaks at resonant modes

•Room compensation dampens most of that peaks

Page 118: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Environment-aware virtual environment rendering

Goal: render the acoustics of a virtual environment when the array operates in another environment

118

• Loudspeakers operate in an arbitrary reverberant room

• Image loudspeakers determined through beam-tracing

• Room compensation enables a free-field behaviour of the beam shaping

• Design of a virtual environment to be rendered…

• … and rendering through superposition of beams

Page 119: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Environment-aware virtual environment rendering

119

Desired wave field Free-field wf rendering

Non-compensated wf rendering

Room-compensated wf rendering

Page 120: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Exploiting the environment 120

So far we limited ourselves to room compensation…

… is it possible to think at the environment as an “augmented” rendering system?

image array (first order)

image array (first order)

image array (second order)

image array (second order)

IDEA: use the walls (i.e., image loudspeakers) to increase the array

Application: simulation of a 5.1 surround system [Canclini2012]

Page 121: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Exploiting the environment 121

Environment-aware virtual 5.1 surround

What about undesired reflections?

We can introduce, again, room compensation (slightly modified):

Desired response Actual response

CGG roomdes

desroomT

roomT

roomT

GGGGC1)(ˆ

ffcomp hCh ˆCompensated loudspeaker gains

listening area

C

L R RL RR

Page 122: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Exploiting the environment 122

Example: reproduction of the rear-right channel

Desired Non-compensated Room-compensated

listening area

RR

Page 123: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Exploiting the environment 123

Page 124: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

EUSIPCO 2012 20th European Signal Processing Conference

August 27-31, 2012, Bucharest

Part 6:

Conclusions and perspectives

Page 125: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Conclusions

• Projective geometry and oriented projective geometry offer powerful tools for organizing the information coming from measurements in a layered fashion

– Many nonlinear relationships become linear in projective space

– Seamless interworking with Computer Vision methodologies (e.g. camera self-calibration)

– Same approach for a variety of applications, from calibration to rendering

• Extension to 3D geometries is possible

– Examples: • inference of the geometry [Nastasia2011], [Filos2012], [Canclini2012a]

• Modeling of propagation in 3D geometries

125

Page 126: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Perspectives

• Areas of growth

– Environment geometry from plenacoustic imaging

– WF reconstruction from plenacoustic images

– Source separation/characterization from plenacoustic images

– Data-driven plenacoustic rendering

– Model-driven plenacoustic super-resolution

– Extension to algebraic geometry

– Joint calibration and synchronization of spatial distributions of unsynchronized sensors

126

Page 127: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

EUSIPCO 2012 20th European Signal Processing Conference

August 27-31, 2012, Bucharest

References

Page 128: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

References

*Adelson1991+ E. Adelson and J. Bergen, “The plenoptic function and the elements of early vision,” in Computational Models of Visual Processing. Cambridge, MA: MIT Press, 1991

*Ajdler2003+ T.Ajdler and M.Vetterli, “The Plenacoustic function and its sampling”, in proc. of Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA, 2003

[Ajdler2006] T.Ajdler, L.Sbaiz and M.Vetterli, "The Plenacoustic Function and Its Sampling," IEEE Transactions on Signal Processing, vol.54, no.10, pp.3790-3804, Oct. 2006

*Ajdler 2005+ T. Ajdler , L. Sbaiz , M. Vetterli “The plenacoustic function on the circle with application to HRTF interpolation”, In proceedings of IEEE ICASSP (2005)

*Antonacci2008+ F. Antonacci, M. Foco, A. Sarti, and S. Tubaro, “Fast tracing of acoustic beams and paths through visibility lookup,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 16, no. 4, pp. 812–824, May 2008.

[Antonacci2009] Soundfield rendering with loudspeaker arrays through multiple beamshaping, in proc. of Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009

*Antonacci2010+ F.Antonacci, A.Sarti, S.Tubaro, “Geometric reconstruction of the environment from its response to multiple acoustic emissions”, 2010 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)

[Antonacci2012] F.Antonacci, J.Filos, M.R.P. Thomas, E. A. Habets, A. Sarti, P.Naylor, S. Tubaro, “Inference of Room Geometry from Acoustic Impulse Responses”, to appear in IEEE Transactions on Audio, Speech and Language Processing

128

Page 129: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

References

*Beck2008+ A. Beck, P. Stoica, and J. Li, “Exact and approximate solutions of source localization problems,” IEEE Transactions on Signal Processing, vol. 56, no. 5, pp. 1770–1778, May 2008.

*Canclini 2011+ A. Canclini, M.R.P. Thomas, J. Filos, A. Sarti, P.Naylor, S. Tubaro, “Exact localization of acoustic reflectors from quadratic constraints”, proc. of IEEE International Workshop on Applications os Signal Processing to Audio and Acoustics, WASPAA, 2011

*Canclini2011a+ A.Canclini, P.Annibale, F.Antonacci, A. Sarti, R.Rabenstein, “From direction of arrival estimates to localization of planar reflectors in a two dimensional geometry”, proc. of 2011 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)

[Canclini2012] A.Canclini, D. Markovic, F. Antonacci, A. Sarti, S. Tubaro, "A Room-compensated virtual surround systesm exploiting early reflections in a reverberant room", in proc. of European Signal Processing Conference, EUSIPCO2012

[Canclini2012a]A.Canclini, F.Antonacci, J.Filos, A.Sarti, P.A.Naylor "Exact localization of planar acoustic reflectors in three-dimensional geometries", in proc. of 2012 International Workshop on Acoustic, Echo and Noise Control (IWAENC'12)

*Compagnoni2012+ M.Compagnoni, P.Bestagini, F. Antonacci, A.Sarti, S.Tubaro, “Localization of Acoustic Sources through the Fitting of Propagation Cones using Multiple Independent Arrays”, IEEE Transactions on Audio, Speech and Language Processing, volume 20, issue 7

129

Page 130: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

References

*Contini2012+A.Contini, F. Antonacci, A.Canclini, M.Compagnoni, A.Sarti, S.Tubaro, “Self-calibration of microphone arrays from measurement of Times Of Arrival of acoustic signals”, in proc. of International Symposium on Communications, Control and Signal Processing, ISCCSP 2012

*Dadoun1985+ N.Dadoun and D.G.Kirkpatrick and J.P.Walsh, “The geometry of beam tracing”, Proceedings of the first annual symposium on Computational geometry, 1985

*Drettakis1994+ G. Drettakis, E. Fiume, “A Fast Shadow Algorithm for Area Light Sources Using Backprojection”, Proceedings of the 21st annual conference on Computer graphics and interactive techniques, 1994

*Filos2010+ J. Filos, E. Habets, P.Naylor, “A Two-Step Approach to Blindly Infer Room Geometries”, proc. of International Workshop on Acoustic Echo and Noise Cancellation, IWAENC 2010

*Filos2011+J.Filos, M.Thomas, F. Antonacci, A.Sarti, P.Naylor, “Robust inference of room geometry from acoustic measurements using the Hough transform”, in proc. of 19th European Signal Processing Conference (EUSIPCO’11), pp. 161-165, Barcelona, Aug. 29, Sept. 2, 2011

[Filos2012] J.Filos, A.Canclini, F.Antonacci, A.Sarti, P.A.Naylor, "Localization of planar acoustic reflectors from the combination of linear estimates", in proc. of 20th European Signal Processing Conference, EUSIPCO 2012

130

Page 131: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

References

[Funkhouser1998]T. Funkhouser and I.Carlbom and G.Elko and G. Pingali, G and M. Sondhi and J.West, “A beam tracing approach to acoustic modeling for interactive virtual environments”, Proceedings of the 25th annual conference on Computer graphics and interactive techniques, 1998

*Huopaniemi1997+ J.Huopaniemi and L.Savioja and M.Karjalainen, “Modeling of reflections and air absorption in acoustical spaces a digital filter design approach”, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 1997

*Kirzenstein1984+ J. Kirszenstein “An image source computer model for room acoustics analysis and electroacoustic simulation”, Applied Acoustics, 1984

*Krokstad1968+ A. Krokstad, S. Strom, S. Sørsdal, “Calculating the acoustical room response by the use of a ray tracing technique”, Journal of Sound and Vibration, pag. 118-125, 1968

*Laine2009+ S.Laine and S.Siltanen and T.Lokki and L.Savioja, “Accelerated beam tracing algorithm”, Applied Acoustics, 2009

*Lehmann2008+ E.A. Lehmann and A.M. Johansson, “Prediction of energy decay in room impulse responses simulated with an image-source model”, Journal of the Acoustical Society of America, 2008

*Levoy1996+ M.Levoy, P.Hanrahan, “Light Field Rendering”, proc. SIGGRAPH’96

131

Page 132: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

References 132

*Markovic2010+ D.Markovic, A. Canclini, F. Antonacci, A. Sarti, S. Tubaro, “Visibility-Based Beam Tracing for Soundfield Rendering”, in proc. of International Workshop on Multimedia Signal Processing, MMSP2010, Saint Malo, France, October 2010

*Markovic2012+ D.Markovic, G. Sandrini, F. Antonacci, A. Sarti, S. Tubaro, “Plenacoustic imaging in the ray space”, accepted for publication at International Workshop on Acoustic Echo Noise and Cancellation (IWAENC), 2012

*Nastasia2011+E.Nastasia, F. Antonacci, A. Sarti, S.Tubaro, “Localization of planar acoustic reflectors through emission of controlled stimuli”, in proc. of 19th European Signal Processing Conference (EUSIPCO’11), pp. 156-160, Barcelona, Aug. 29, Sept. 2, 2011

*Naylor1992+ G. Naylor, “Treatment of Early and Late Reflections in a Hybrid Computer Model for Room Acoustics”, 124th ASA Meeting, New Orleans, 1992.

*Redondi2009+ A.Redondi, M.Tagliasacchi, F.Antonacci, A.Sarti, “Geometric calibration of distributed microphone arrays”, IEEE International Workshop on Multimedia Signal Processing, 2009. MMSP'09

*Rindel1995+ J.H. Rindel, “Computer Simulation Techniques for Acoustical design of Rooms”, Acoustics Australia, 1995

Page 133: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

References

[Smith1985] J. O. Smith, ``A new approach to digital reverberation using closed waveguide networks,'' in Proceedings of the 1985 International Computer Music Conference, Vancouver, pp. 47-53, Computer Music Association, 1985

[Smith1986] J. O. Smith, ``Efficient simulation of the reed-bore and bow-string mechanisms,'' in Proceedings of the 1986 International Computer Music Conference, The Hague, pp. 275-280, Computer Music Association, 1986

[Smith1998] J. O. Smith, ``Principles of digital waveguide models of musical instruments,'' in Applications of Digital Signal Processing to Audio and Acoustics (M. Kahrs and K. Brandenburg, eds.), pp. 417-466, Boston/Dordrecht/London: Kluwer Academic Publishers, 1998

*Stolfi87+ J.Stolfi, “Oriented projective geometry,” in Proc. of Symposium on Computational Geometry, 1987, pp. 76–85

*Teller1992+ S.J. Teller, “Computing the Antipenumbra of an Area Light Source”, Proceedings of the 19th annual conference on Computer graphics and interactive techniques, 1992

*Valente2010+ S.D. Valente, F.Antonacci, M.Tagliasacchi, A.Sarti, S.Tubaro, “Self-calibration of two microphone arrays from volumetric acoustic maps in non-reverberant rooms”, in proc. of International Symposium on Communications Control and Signal Processing, Cyprus, March 2010

*Valente2010a+ S.D. Valente, M.Tagliasacchi, F.Antonacci, P.Bestagini, A.Sarti, S.Tubaro, “Geometric calibration of distributed microphone arrays from acoustic source correspondences”, in proc. of International Workshop on Multimedia Signal Processing, MMSP2010, October 2010

133

Page 134: EUSIPCO 2012 - eurasip.org · Geometric Space-Time Audio Processing A. Sarti and F. Antonacci EUSIPCO 2012 , August 27th, 2012 Outline (1/2) 2 •Background and motivations

A. Sarti and F. Antonacci Geometric Space-Time Audio Processing EUSIPCO 2012 , August 27th, 2012

Contributors

• Paolo Bestagini

• Alberto Calatroni

• Antonio Canclini

• Marco Compagnoni

• Alessio Contini

• Andrea Galbiati

• Dejan Markovic

• Erica Nastasia

• Giorgio Sandrini

• Daniele Valente