scenes from video workshop talk

What’s so good about pieces, Lego and understanding?

Anton van den Hengel

Australian Centre for Visual Technologies (ACVT), The University of Adelaide, South Australia

People think in 3D

It has been a theme …

"the perception of solid objects is a process which can be based on the properties of three-dimensional transformations and the laws of nature"

Larry Roberts (1965)

Geometry is not enough

Structure and semantics interact

Structure and geometry interact

WHY PLANTS ARE LIKE LEGO

Developmental changes in response to drought

Boris Parent, ACPFG

[Figure: x-axis: time after sowing (d), 30 to 65; series: drought, well watered; annotations: 39 d after sowing, 46 d after sowing]

The escape response of Clipper under drought is reflected in an earlier time of absolute maximum growth

Morphological changes in response to drought

Boris Parent, ACPFG

[Figure: Barley cv Clipper; x-axis: time after sowing (d), 30 to 60; series: drought, well watered]

The reduced number of tillers under drought is reflected in the area/height ratio

Deep reasoning

• Try to explain as much as possible

• Fine-grained and detailed

• Deep semantics

• And the implied constraints

• Shape is only an intermediate step

Deconstruction

Silhouettes

• We’re only interested in shape (at least for now)

Deconstruction

• Render every possible building block in every possible position, and record the silhouette of each

• Then reconstruct object silhouettes from templates

• Requires enough camera information to achieve this

Template shapes

• nTemplates = nShapes x nPositions x nRotations

• So there are lots of them

• But they are sparsely used
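
To get a feel for how quickly the template count grows, here is a minimal sketch that enumerates one template per (shape, position, rotation) combination. The shape names, grid size and rotation steps are hypothetical values chosen only for illustration, not the ones used in the talk.

```python
from itertools import product

# Hypothetical discretisation, chosen only for illustration.
shapes = ["brick_2x4", "brick_2x2", "plate_1x2"]
positions = [(x, y, z) for x in range(10) for y in range(10) for z in range(5)]
rotations = [0, 90, 180, 270]


def enumerate_templates(shapes, positions, rotations):
    """Return one (shape, position, rotation) tuple per template."""
    return list(product(shapes, positions, rotations))


templates = enumerate_templates(shapes, positions, rotations)
n_templates = len(templates)  # nShapes x nPositions x nRotations
print(n_templates)
```

Even this toy grid with three shapes yields thousands of templates, while any one object uses only a handful of them.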

Sparse recovery

• \alpha a vector of binary template coefficients

• \Pi a matrix with one template silhouette per column

• y the silhouette of the shape to be recovered

• NP hard and fragile
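
One common way to write a problem of this kind is given below. The exact-fit constraint and the symbol $n$ for the number of templates are assumptions made for illustration, not taken from the slides.

```latex
\hat{\alpha} \;=\; \arg\min_{\alpha \in \{0,1\}^{n}} \; \|\alpha\|_{0}
\quad \text{subject to} \quad \Pi \alpha = y
```

Here $\|\alpha\|_{0}$ counts the templates that are switched on. Searching over all binary vectors is combinatorial, and an exact equality breaks under pixel noise.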

Sparse recovery – L_1 norm

• But there may still be millions of templates, and they’re enormous (|Pixels| x |Images|)
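
A typical $L_1$ relaxation can be sketched as follows, with $\alpha$ the template coefficients, $\Pi$ the matrix of template silhouettes and $y$ the observed silhouette. Relaxing the binary coefficients to the interval $[0,1]$ and writing $n$ for the number of templates are assumptions, not details from the slides.

```latex
\hat{\alpha} \;=\; \arg\min_{\alpha \in [0,1]^{n}} \; \|\alpha\|_{1}
\quad \text{subject to} \quad \Pi \alpha = y,
\qquad \Pi \in \mathbb{R}^{(|\mathrm{Pixels}| \cdot |\mathrm{Images}|) \times n}
```

The problem is now convex, but each column of $\Pi$ still has one entry per pixel per image.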

Sparse recovery – Random projections

• Random projection by DxS matrix \Phi

• D << S

• \Phi is sparsely sampled from N(0,1)

• But there are still too many templates
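
A minimal sketch of the projection step, assuming that "sparsely sampled" means most entries of \Phi are zero and the rest are drawn from N(0,1). The density, the toy sizes and the helper names are hypothetical. Because the projection is linear, projecting the silhouette and projecting the template matrix keep the two sides consistent.

```python
import random


def sparse_gaussian_matrix(d, s, density, rng):
    """D x S matrix: each entry is N(0,1) with probability `density`, else 0."""
    return [[rng.gauss(0.0, 1.0) if rng.random() < density else 0.0
             for _ in range(s)] for _ in range(d)]


def matvec(m, v):
    """Matrix-vector product for a list-of-rows matrix."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def matmul(a, b):
    """Matrix-matrix product for list-of-rows matrices."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


rng = random.Random(0)
S, D, n_templates = 200, 20, 6  # toy sizes with D << S

# Toy template matrix Pi (S x n_templates), one binary silhouette per column.
Pi = [[1.0 if rng.random() < 0.1 else 0.0 for _ in range(n_templates)]
      for _ in range(S)]
alpha = [1.0, 0.0, 1.0, 0.0, 0.0, 0.0]
y = matvec(Pi, alpha)  # linear composition, used here only for the demo

Phi = sparse_gaussian_matrix(D, S, density=0.3, rng=rng)
Pi_small = matmul(Phi, Pi)  # D x n_templates
y_small = matvec(Phi, y)    # length D

# Linearity: (Phi Pi) alpha equals Phi (Pi alpha) up to rounding error.
residual = max(abs(a - b) for a, b in zip(matvec(Pi_small, alpha), y_small))
```

The recovery problem can then be posed on `Pi_small` and `y_small`, which have D rows instead of S.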

Sparse recovery – Cropping

• Eliminate templates with a footprint that extends significantly beyond that of the object

• Reduces the number of templates by at least an order of magnitude

• Leaves anywhere from tens to tens of thousands of templates
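
The cropping rule can be sketched as a simple filter over template footprints. The 10% threshold, the set-of-pixels representation and the function name are assumptions for illustration; the slides do not say how "significantly" is measured.

```python
def crop_templates(templates, object_pixels, max_outside_fraction=0.1):
    """Keep templates whose footprint lies mostly inside the object silhouette.

    templates: dict mapping a name to a set of (row, col) foreground pixels
    object_pixels: set of (row, col) foreground pixels of the target object
    """
    kept = {}
    for name, footprint in templates.items():
        if not footprint:
            continue  # an empty template explains nothing
        outside = len(footprint - object_pixels)
        if outside / len(footprint) <= max_outside_fraction:
            kept[name] = footprint
    return kept


# Toy example: a 4x4 object and three candidate templates.
object_pixels = {(r, c) for r in range(4) for c in range(4)}
templates = {
    "inside": {(0, 0), (0, 1), (1, 0), (1, 1)},
    "straddling": {(3, 3), (3, 4), (4, 3), (4, 4)},
    "far_away": {(8, 8), (8, 9)},
}
kept = crop_templates(templates, object_pixels)
```

Only the template that sits inside the object survives; the other two extend too far beyond its footprint.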

Binarising the solution

• Solutions are not binary

• Randomly generate binary hypotheses from non-binary \alpha

• Evaluate using an accurate composition model
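
One way to sketch this step is to treat each relaxed coefficient as a probability, draw binary hypotheses, and score each with a union-of-silhouettes composition. The Bernoulli sampling, the union model and the pixel-mismatch score are all assumptions made for illustration, not the talk's stated procedure.

```python
import random


def compose(templates, alpha_bin):
    """Union (logical OR) of the selected template silhouettes."""
    pixels = set()
    for footprint, on in zip(templates, alpha_bin):
        if on:
            pixels |= footprint
    return pixels


def sample_binary(alpha, rng):
    """Treat each coefficient, clipped to [0, 1], as a Bernoulli probability."""
    return [1 if rng.random() < min(max(a, 0.0), 1.0) else 0 for a in alpha]


def best_hypothesis(templates, alpha, target, n_samples, rng):
    """Return the sampled binary vector with the fewest mismatched pixels."""
    best, best_err = None, None
    for _ in range(n_samples):
        h = sample_binary(alpha, rng)
        err = len(compose(templates, h) ^ target)  # symmetric difference
        if best_err is None or err < best_err:
            best, best_err = h, err
    return best, best_err


# Toy example: two overlapping templates explain the target, a third is spurious.
templates = [{(0, 0), (0, 1)}, {(0, 1), (0, 2)}, {(5, 5)}]
target = {(0, 0), (0, 1), (0, 2)}
alpha = [0.9, 0.8, 0.1]
best, err = best_hypothesis(templates, alpha, target, n_samples=200,
                            rng=random.Random(0))
```

Scoring with the union rather than a sum means overlapping templates are not counted twice.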

Results

Plants

Results

[Figure: x-axis: Number of Templates, 200 to 1000; series: Search, Viable]

Results

[Figure: x-axis: Noise Level (Fraction of Pixels Changed), 0 to 0.06; series: Search]

Composition problems

• Not a true model of silhouette formation

• So doesn’t deal well with template overlap

• Working on this by subtracting overlaps, graph-based approaches

• Somewhat overcome by…

Inequality

• Isn’t physically accurate for foreground pixels, so split

• Background (0) pixels

• And foreground pixels

Practicality again

• Only interested in the number of pixels outside the object silhouette, not the location

• So not

• but

Practicality again

• Want to ensure that

• Need to project to a lower dimension

• But \Phi_I must have only positive elements

A better model of composition

• Left with

Constraints - Intersection

• Form J where every row represents a constraint

• If templates i and k intersect then insert a row in J with only elements i and k set to 1
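
A minimal sketch of building J, assuming templates are represented as sets of occupied 3D cells and that the rows are used as J\alpha <= 1, so that at most one template of each intersecting pair is selected. Both the representation and the inequality are assumptions; the slides only describe how the rows are filled.

```python
from itertools import combinations


def intersection_constraints(templates):
    """One row per intersecting pair (i, k), with ones at columns i and k."""
    n = len(templates)
    rows = []
    for i, k in combinations(range(n), 2):
        if templates[i] & templates[k]:  # the two templates share a cell
            row = [0] * n
            row[i] = row[k] = 1
            rows.append(row)
    return rows


def satisfies(J, alpha_bin):
    """Assumed reading: every row of J @ alpha must be <= 1."""
    return all(sum(j * a for j, a in zip(row, alpha_bin)) <= 1 for row in J)


# Toy example: templates 0 and 1 share the cell (1, 0, 0).
templates = [{(0, 0, 0), (1, 0, 0)}, {(1, 0, 0), (2, 0, 0)}, {(5, 5, 0)}]
J = intersection_constraints(templates)
```

With this J, selecting templates 0 and 2 is allowed, while selecting 0 and 1 together is rejected.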

Constraints - Support

• Form K where every row represents a constraint

• If template i needs support t set K_ii = t

• If template j provides s support to template i then K_ij = -s
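
A minimal sketch of K, reading K_ij = -s as template j supplying s units of support to template i, and assuming the rows are used as K\alpha <= 0 so that a selected template's required support is covered by the selected templates beneath it. The inequality, the square layout of K and the support values are assumptions for illustration.

```python
def support_constraints(n, needs, provides):
    """Build K as a list of rows.

    needs: {i: t}, template i needs t units of support
    provides: {(i, j): s}, template j gives s units of support to template i
    """
    K = [[0.0] * n for _ in range(n)]
    for i, t in needs.items():
        K[i][i] = t
    for (i, j), s in provides.items():
        K[i][j] = -s
    return K


def supported(K, alpha_bin):
    """Assumed reading: every row of K @ alpha must be <= 0."""
    return all(sum(k * a for k, a in zip(row, alpha_bin)) <= 0 for row in K)


# Toy example: template 2 needs 1 unit; templates 0 and 1 each provide 0.5.
K = support_constraints(3, needs={2: 1.0}, provides={(2, 0): 0.5, (2, 1): 0.5})
```

Template 2 is then only admissible when both templates 0 and 1 are also selected.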

Measurement benefit tails off

[Figure: Accuracy vs noise for varying numbers of measurements; x-axis: Noise level (added to camera extrinsics), 0 to 0.4; y-axis: Accuracy]

Results

Limitations

• One template per value per parameter

• Fixable?
