semantic 3d reconstruction and localization marc...

58
Semantic 3D Reconstruction and localization Marc Pollefeys joint work with Christian Haene, Nikolay Savinov, Ian Cherabier, Lubor Ladicky, Martin Oswald, Christopher Zach, Andrea Cohen, Johannes Schoenberger. Andreas Geiger &

Upload: others

Post on 09-Oct-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Semantic 3D Reconstruction and localization

Marc Pollefeys

joint work with Christian Haene, Nikolay Savinov, Ian Cherabier, LuborLadicky, Martin Oswald, Christopher Zach, Andrea Cohen, Johannes

Schoenberger. Andreas Geiger

&

Page 2: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Classical 3D reconstruction from images

• Classical 3D from image approach – Relative pose between images (structure-from-motion)– Per pixel depth estimation (multi-view stereo matching)– Surface reconstruction (TSDF, poisson, graph energy minimization)

E(x) SS xS xS

isotropic shape priorunary depth evidence term

line-of-sight model

free-space

occupied-space

S optimal solution xs (spherical)

Page 3: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian
Page 4: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Semantic 3D reconstruction

Building Ground Vegetation Clutter

• Joint 3D reconstruction and class segmentation – Obtain separate surface for each class of object– Corresponds to multi-label volumetric segmentation problem

4

Page 5: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Discrete and Continuous Formulations

Discrete Domain Continuous Domain

Smoothness:transitions along edges

Smoothness:(anisotropic) boundary length

Formulation: Linear Program[Schlesinger 1976]Arbitrary smoothness cost allowed

Formulation: Convex Program[Chambolle, Cremers, Pock 2008]Smoothness needs to form a metric

Page 6: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Convex, Continuous Multi-Label Formulation[Zach, Häne, Pollefeys, CVPR 2012, TPAMI 2014]

Metric smoothness fulfills triangle inequality Truncated quadratic smoothness non-metric

Our continuously inspired formulation Takes best from both worlds Non-metric and anisotropic boundary length cost

Page 7: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Energy Formulation

Cost for boundary:

Page 8: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Energy Formulation

Cost for boundary:

Page 9: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Energy Formulation

Cost for boundary:

Page 10: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Wulff shapes

The shape of an equilibrium crystal is obtained, according to the Gibbs thermodynamic principle, by minimizing the total surface free energy associated to the crystal-medium interface. http://www.scholarpedia.org/

Wulff's theorem: The minimum surface energy for a given volume of a polyhedron will be achieved if the distances of its faces from one given point are proportional to their surface tension

Proposed use for anisotropic regularization [Esedoglu and Oscher 2004, Zach et al. 2009, Haene et al. 2013/14/15]

Page 11: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Input Images

Depth MapsClass Likelihoods

Dense Semantic 3D Reconstruction[Häne, Zach, Cohen, Angst, Pollefeys, CVPR 2013]

Semantic ClassifierSparse ReconstructionDense Matching

Page 12: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Depth MapsClass Likelihoods

Dense Semantic 3D Model

Dense Semantic 3D Reconstruction[Häne, Zach, Cohen, Angst, Pollefeys, CVPR 2013]

Joint Fusion, Convex Optimization

Page 13: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Formulation: Labeling of a Voxel Space[Häne, Zach, Cohen, Angst, Pollefeys, CVPR 2013]

Data Term: Described as per-voxel unary potentials

Regularization Term: Class-specific, direction dependent, surface area penalization

Learned fromtraining data

Page 14: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Class specific training/selection of

building

ground

vegetation

stuff

building ground vegetation stuff air

14

Page 15: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Joint 3D reconstruction and class segmentation(Haene et al CVPR13)

15

Page 16: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Weakly Observed Structures I

Buildings standing on the ground

Page 17: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Outline

Semantic 3D reconstruction• Joint reconstruction, recognition and segmentation• High-order ray potentials• Joint classifier• Modeling objects

17

Page 18: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Weakly Observed Structures II

Building separated from vegetation

Page 19: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Unobserved Surfaces

Labels can be separated

Page 20: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Unobserved Surfaces

Labels can be separated

Page 21: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Higher-order ray potentials to model visibility

• Volumetric formulation

Ray potentials Pairwise regularizer

• Cost based on the first occupied voxel along the ray

depth label

freespace

(Savinov et al, CVPR15/CVPR16)

Page 22: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Cost based on the first occupied voxel along the ray

3

(Savinov et al, CVPR15/CVPR16)

Page 23: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Visibility Consistency Constraint

(non-convex)

(Savinov et al, CVPR16)

Page 24: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

24

Results

(Savinov et al, CVPR15/16)

inference generative

minimize Δ

Page 25: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

2-label V-Constraint: SOTA on Middlebury

V-Constraint: nicely handles thin objects

Page 26: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Can our approach also handle objects?

• Extend approach from „Stuff“ to „Things“

26

Introduce location dependent anisotropic smoothness prior

Page 27: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Learning location-dependentanisotropic smoothnes prior

• Download training data from Google 3D warehouse

• At every voxel in 3D bounding box, estimate distributionof observed shape normals

• Determine convex Wulff shape which best representsobserved statistics

27

(Haene et al CVPR 2014)

Page 28: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

28

Cars

Page 29: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Car reconstructions

29

Page 30: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Advantages of multi-class segmentation

Learn separate statistics for car-air and car-ground transition likelihoods

30

Page 31: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Semantic multi-class 3D head reconstruction(Maninchedda et al. ECCV2016)

• Align prior to data by minimizing smoothness termtowards position

parametric shape model

implicit semantic shape model

Page 32: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Segment based 3D object shape priors

Computational 3D Photography32

represent non-convex object shape priors as combination of convex part priorsKarimi, Haene, Pollefeys (CVPR15)

build prior by performing (approximate) convex decomposition of example (and observe which transitions occur)

Page 33: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Result: Tables

Page 34: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Result: Trees

Page 35: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Result Mug

Page 36: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Detect and regularize for symmetries(Speciale et al. ECCV2016)

A preference for symmetry can be introduced by adding non-local regularization terms

Page 37: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Semantic 3D Reconstruction Compared to a fixed-grid ≈ 20 x faster, 30 – 40 x less memory Allows for city scale reconstructions

Large-Scale Semantic 3D Reconstruction: an Adaptive Multi-Resolution Model for Multi- Class Volumetric Labeling,Maros Blaha, Christoph Vogel, Audrey Richard, Jan D. Wegner, Thomas Pock, Konrad Schindler, CVPR 2016

Page 38: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Overview of Regularizers

Isotropic spatially homogeneous TV

Isotropic spatially varying weighted TV

Anisotropic spatially varying

weighted TV

Anisotropic spatially varying

Wulff shape

Page 39: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Learning Regularization

(Cherabier et al ECCV2018)

Generalize gradient operator in regularizer Learn label interactions

, ,

Multi-label segmentation/3D reconstruction

Saddle-point problem

Iterate primal-dualupdate steps

(Vogel and Pock GCPR2017)

Page 40: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Neural Network via Optimization Unrolling

Iterate primal-dualupdate steps

Pre-processing` Primal Dual Post-processing

Page 41: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Iterate primal-dualupdate steps

Pre-processing Primal Dual Post-processing

Neural Network via Optimization Unrolling

Page 42: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Pre-processing Variational Optimization Post-processing

Multi-scale Architecture

Page 43: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Potential to combine with Multi-Sensor Depth Map Fusion

Multi-Sensor Aggregation Semantic 3D Reconstruction

Page 44: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

2D Experiments

Page 45: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

3D Experiments

Page 46: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

3D Experiments on ScanNet

Page 47: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Learning Regularization

Hähne et al. (50 iter.)Hähne et al. (50 iter.) Hähne et al. (2750 iter.)Hähne et al. (2750 iter.)

Ours (50 iter.)Ours (50 iter.)TV‐L1 (50 iter.)TV‐L1 (50 iter.)

Page 48: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Learning Regularization

Hähne et al. (50 iter.)Hähne et al. (50 iter.) Hähne et al. (2750 iter.)Hähne et al. (2750 iter.)

Ours (50 iter.)Ours (50 iter.)TV‐L1 (50 iter.)TV‐L1 (50 iter.)

Page 49: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Learning Regularization

Hähne et al. (50 iter.)Hähne et al. (50 iter.) Hähne et al. (2750 iter.)Hähne et al. (2750 iter.)

Ours (50 iter.)Ours (50 iter.)TV‐L1 (50 iter.)TV‐L1 (50 iter.)

Page 50: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Learning Regularization

TV‐L1 (50 iter.)TV‐L1 (50 iter.) Ground TruthGround Truth

Ours‐300 (50 iter.)Ours‐300 (50 iter.)Ours‐5 (50 iter.)Ours‐5 (50 iter.)

Page 51: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Learning Regularization

TV‐L1 (50 iter.)TV‐L1 (50 iter.)Ground TruthGround Truth Ours‐300 (50 iter.)Ours‐300 (50 iter.)Ours‐5 (50 iter.)Ours‐5 (50 iter.)

Page 52: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Visual Localization

Compute exact position and orientation of query image.

Page 53: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Visual Localization

[Bernhard Zeisl, Torsten Sattler and Marc Pollefeys.Camera Pose Voting for Large-Scale Image-Based Localization. ICCV, 2015]

Page 54: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

3D Semantic Localization

[Schönberger, Pollefeys, Geiger, Sattler, Semantic Visual Localization, CVPR 2018]

3D semantic volume 3D semantic volume

Page 55: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

3D Semantic Localization

[Schönberger, Pollefeys, Geiger, Sattler, Semantic Visual Localization, CVPR 2018]

latent space

global semantic map

local semantic map

Page 56: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

The 180º Case

[Schönberger, Pollefeys, Geiger, Sattler, Semantic Visual Localization, CVPR 2018]

Page 57: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Strong Viewpoint Change

[Schönberger, Pollefeys, Geiger, Sattler, Semantic Visual Localization, CVPR 2018]

ours image features 3D features learning

KITTI datasetdepth from stereo

Understanding the Limitations of CNN-based Absolute Camera Pose Regression, Sattler et al CVPR 2019

Page 58: Semantic 3D Reconstruction and localization Marc Pollefeyshelper.ipam.ucla.edu/publications/glws4/glws4_15588.pdf · Semantic 3D Reconstruction and localization Marc Pollefeys jointworkwithChristian

Questions?

58