Coherent Scene Understanding with 3D Geometric Reasoning

Jiyan Pan

12/3/2012

Task

• Detect objects
• Identify surface regions
• Estimate ground plane
• Infer gravity direction

Geometrically coherent in the 3D world

3D geometric context

[Figure: coordinate system with the camera center, focal length, and image plane; the ground plane with its orientation and height; the (inverse) gravity direction; and, for each object, its landmarks, vertical orientation, pitch and roll angles, real-world height, and depth]

Deterministic relationships

Variables of global 3D geometries:

n_g (gravity direction), n_p (ground plane orientation), h_p (ground plane height)
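The slide's own equations are not preserved in this transcript. As one illustration of a deterministic relationship among the quantities in the figure (focal length, real-world height, object depth), the standard pinhole camera model can be sketched as follows. The function name and the numbers are hypothetical, and the sketch assumes an upright object roughly parallel to the image plane.

```python
def depth_from_height(focal_length_px, real_height_m, image_height_px):
    """Pinhole model: image_height = focal_length * real_height / depth.

    Solving for depth gives depth = focal_length * real_height / image_height.
    Assumes the object is upright and roughly parallel to the image plane.
    """
    if image_height_px <= 0:
        raise ValueError("image height must be positive")
    return focal_length_px * real_height_m / image_height_px


# Hypothetical numbers: a 1.5 m tall object spanning 150 px, focal length 800 px.
depth = depth_from_height(800.0, 1.5, 150.0)
print(depth)  # 8.0 (meters)
```

The same relation read the other way links a hypothesized depth to the object height expected in the image.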


Probabilistic relationships

Derived from appearance

Prior knowledge

Can we solve them all for a coherent solution?

• Non-linear
• Non-deterministic
• Even invalid equations from false detections

“Chicken and egg” problem:

• Local entities could be validated by global 3D context
• Global 3D context is induced from local entities

[Diagram: Global 3D context and Local 3D context]

Possible solution (All in PGM)

• Put both global 3D geometries and local entities in a PGM [1]
– Precision issue: continuous variables have to be quantized
– Complexity issue: a pairwise potential would contain up to ~1e6 entries, from 100 (pitch) × 100 (roll) × 100 (height) quantization levels

[Figure: Ground and Gravity nodes]

[1] D. Hoiem, A. A. Efros, and M. Hebert. Putting objects in perspective. IJCV, 2008
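The ~1e6 figure follows directly from the quantization levels on the slide. A minimal sketch of the arithmetic; the binary local variable in the second part is an assumption added for illustration.

```python
from math import prod

# Quantization levels per global variable, as given on the slide.
levels = {"pitch": 100, "roll": 100, "height": 100}

# Number of joint states of the quantized global 3D geometry.
global_states = prod(levels.values())
print(global_states)  # 1000000


def pairwise_table_entries(num_global_states, num_local_states):
    """Entries in a pairwise potential table linking a global and a local variable."""
    return num_global_states * num_local_states


# Assumption: a local entity with a binary label (valid / invalid).
print(pairwise_table_entries(global_states, 2))  # 2000000
```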

Possible solution (Fixed global geometries as hypotheses)

• Task much easier under a fixed hypothesis of global 3D geometries

[Figure: Ground and Gravity nodes]

How to generate global 3D geometry hypotheses?

Possible solution (Hypotheses by exhaustive search)

• Exhaustive search over the quantized space of global 3D geometries [2]

– Computational complexity tends to limit search space

[2] S. Bao et al. Toward coherent object detection and scene layout understanding. IVC, 2011

Possible solution (Hypotheses by Hough voting)

• Each local entity casts a vote in the Hough voting space of the global 3D geometries, and peaks are selected [3]
– False detections could corrupt the votes
– Would applying EM help? Not likely, if false detections overwhelm the true ones

[3] M. Sun et al. Object detection with geometrical context feedback loop. BMVC, 2010
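The failure mode named above can be shown with a toy one-dimensional voting space. This is only a sketch: the vote values, the bin width, and the function name are hypothetical, and the method in [3] votes over a higher-dimensional space.

```python
from collections import Counter


def hough_peak(votes_cm, bin_width_cm):
    """Quantize each vote into a bin and return the center of the fullest bin.

    Ties are broken in favor of the lower bin.
    """
    counts = Counter(v // bin_width_cm for v in votes_cm)
    best_bin, _ = max(counts.items(), key=lambda kv: (kv[1], -kv[0]))
    return best_bin * bin_width_cm + bin_width_cm // 2


# Hypothetical camera-height votes (centimeters) from true detections...
true_votes = [152, 158, 161, 149, 155]
# ...and from false detections that happen to agree with each other.
false_votes = [305, 310, 312, 318, 302, 315]

print(hough_peak(true_votes, 20))                # 150
print(hough_peak(true_votes + false_votes, 20))  # 310
```

Once the false detections outnumber the true ones in a single bin, the selected peak moves to the wrong hypothesis.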

Our solution

• We take a RANSAC-like approach: randomly mix the contributions of local entities
– Compared to averaging over all local entities: more robust against outliers
– Compared to directly using estimates from each single local entity: more robust against noise

[Figure: local entities L1 to L7, randomly combined into mixtures]
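A minimal sketch of the random-mixture idea on scalar estimates. The slide does not specify how contributions are mixed or how hypotheses are scored, so the random convex combination and the inlier-counting score below are assumptions for illustration, as are all names and numbers.

```python
import random


def random_mixture_hypotheses(estimates, num_mixtures, seed=0):
    """Generate hypotheses as random convex combinations of random subsets.

    Assumption: each mixture picks a random non-empty subset of the local
    estimates and averages it with random positive weights.
    """
    rng = random.Random(seed)
    hypotheses = []
    for _ in range(num_mixtures):
        k = rng.randint(1, len(estimates))
        subset = rng.sample(estimates, k)
        weights = [rng.random() + 1e-9 for _ in subset]
        total = sum(weights)
        hypotheses.append(sum(w * e for w, e in zip(weights, subset)) / total)
    return hypotheses


def support(hypothesis, estimates, tol):
    """Count the local estimates that agree with the hypothesis within tol."""
    return sum(abs(e - hypothesis) <= tol for e in estimates)


# Hypothetical horizon estimates (pixels): five inliers near 240, two outliers.
estimates = [236.0, 241.0, 238.0, 244.0, 240.0, 410.0, 95.0]
hyps = random_mixture_hypotheses(estimates, num_mixtures=50)
best = max(hyps, key=lambda h: support(h, estimates, tol=10.0))
print(best, support(best, estimates, tol=10.0))
```

A mixture drawn only from inliers averages out their individual noise, while the plain average over all seven estimates is pulled away by the two outliers.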

[Plots: Gravity Direction and Ground Plane Orientation; x-axis: number of random mixtures (0 to 50); curves: Individual, Mixture, Average]

[Diagram: Local 3D context and Global 3D context]

#1: Common ground (global)
#2: Gravity direction (global)
#3: Depth ordering (local)
#4: Space occupancy (local)

[Figures: detections marked valid or invalid against the ground plane, the ground plane orientation, and the (inverse) gravity direction under #1 and #2; object pairs marked incompatible under #3 and #4]

Given a global 3D geometry hypothesis

Global geometric compatibility for an object:

Orientation:

Height:

Global geometric compatibility for a surface:

Orientation: local estimates vs. hypothesized gravity direction or ground plane orientation

Location: horizontal surface region vs. ground horizon

Local geometric compatibility for two objects:

Depth ordering:

Space occupancy:
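The compatibility formulas themselves did not survive in this transcript. As a purely illustrative stand-in for the orientation term, the sketch below scores how well an object's vertical orientation aligns with a hypothesized (inverse) gravity direction. The Gaussian falloff, the `sigma_deg` parameter, and the function names are assumptions, not the formulas from the talk.

```python
import math


def angle_deg(u, v):
    """Angle in degrees between two non-zero 3D vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    cos_angle = max(-1.0, min(1.0, dot / (norm_u * norm_v)))
    return math.degrees(math.acos(cos_angle))


def orientation_compatibility(object_up, inverse_gravity, sigma_deg=10.0):
    """Hypothetical score in (0, 1]: 1 when aligned, decaying with the angle."""
    a = angle_deg(object_up, inverse_gravity)
    return math.exp(-0.5 * (a / sigma_deg) ** 2)


# An upright object is fully compatible; one tilted by 45 degrees is not.
print(orientation_compatibility((0.0, 1.0, 0.0), (0.0, 1.0, 0.0)))  # 1.0
print(orientation_compatibility((1.0, 1.0, 0.0), (0.0, 1.0, 0.0)) < 0.01)  # True
```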

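Objective function of the CRF:

[Equation not recoverable from the transcript; the surviving fragments show a piecewise-defined term with an "else" case, a min(·), the constants 0, 0.5, and 1, and the symbols o and s with indices i and j]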
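Since the objective itself is not recoverable here, the following is only a generic sketch of what maximizing a CRF-style objective over binary labels looks like under one fixed global hypothesis. The unary and pairwise scores are hypothetical, and exhaustive enumeration is used only because the example is tiny.

```python
from itertools import product


def best_labeling(unary, pairwise):
    """Exhaustively maximize the sum of unary and pairwise scores.

    unary[i] is added when entity i is labeled 1 (kept).
    pairwise[(i, j)] is added when both i and j are labeled 1.
    """
    n = len(unary)
    best_score, best_labels = float("-inf"), None
    for labels in product((0, 1), repeat=n):
        score = sum(unary[i] for i in range(n) if labels[i])
        score += sum(w for (i, j), w in pairwise.items()
                     if labels[i] and labels[j])
        if score > best_score:
            best_score, best_labels = score, labels
    return best_labels, best_score


# Hypothetical scores for three detections under one global hypothesis.
unary = [1.0, 0.5, -0.25]                # e.g. detector score plus global compatibility
pairwise = {(0, 1): -2.0, (0, 2): 0.5}   # negative value: geometrically incompatible pair
labels, score = best_labeling(unary, pairwise)
print(labels, score)  # (1, 0, 1) 1.25
```

In this toy case the weak third detection is kept because it is compatible with the first, while the second is dropped because it conflicts with the first. Repeating the maximization for every global hypothesis and keeping the highest score would give a best hypothesis.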

[Diagram: Local 3D context, Global 3D context, Best hypothesis]

3D reasoning agrees with raw detector

3D reasoning recovers detection rejected by raw detector

3D reasoning rejects detection accepted by raw detector

3D geometric reasoning improves object detection performance

[Plot: Deformable Part Model Detector; x-axis: false positives per image (0 to 5); curves include Baseline]

[Plot: Dalal-Triggs Detector; curves include Baseline]

D. Hoiem, A. A. Efros, and M. Hebert. Putting objects in perspective. IJCV, 2008

Improvement in AP over baseline detector

Ours               10.4%
Hoiem et al. [1]   4.8%
Sun et al. [3]     5.1%

Horizon estimation median error

Ours               2.05°
Hoiem et al. [1]   3.15°
Sun et al. [3]     2.41°


Contributions of different geometric context

[Plot: Detection ROC Curve; curves: Det+IdvlGeo, Det+PairGeo, Det+FullGeo]

Benefit is mutual

                         Error in gravity direction   Error in ground orientation
Vanishing points alone   2.62°                        4.85°
Whole system             2.05°                        2.21°

Extensions

– Improved depth ordering constraint
– Local geometric constraints involving vertical surfaces
– Multiple supporting planes
– Using more prior knowledge of objects
– Utilizing semantic categories of surface regions

[Figure (depth ordering): a closer object and a farther object, with the occlusion mask of the farther object and the intersection region of the two object masks]

Does the occlusion mask of the farther object fully cover the intersection region?
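The coverage question in the figure can be sketched with pixel sets. The mask representation, the helper names, and the reading that full coverage means the depth ordering is consistent are assumptions; the slide only poses the question.

```python
def rect(r0, r1, c0, c1):
    """Set of (row, col) pixels in the half-open rectangle [r0, r1) x [c0, c1)."""
    return {(r, c) for r in range(r0, r1) for c in range(c0, c1)}


def occlusion_covers_overlap(closer_mask, farther_mask, farther_occlusion_mask):
    """True if the farther object's occlusion mask fully covers the
    intersection region of the two object masks."""
    overlap = closer_mask & farther_mask
    return overlap <= farther_occlusion_mask


# Hypothetical masks: two overlapping boxes, overlap is rows 2-3, cols 2-3.
closer = rect(0, 4, 0, 4)
farther = rect(2, 6, 2, 6)

full_occlusion = rect(2, 4, 2, 6)      # covers the whole overlap
partial_occlusion = rect(2, 3, 2, 4)   # covers only one row of it

print(occlusion_covers_overlap(closer, farther, full_occlusion))     # True
print(occlusion_covers_overlap(closer, farther, partial_occlusion))  # False
```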

Occlusion: bottleneck in our system

– Missed detection
– Erroneous estimation of local properties
– Less effective depth ordering constraint

Generalized Hough voting: better at handling occlusions

K. Rematas et al. CORP 2011

B. Leibe et al. IJCV 2008

Occlusion-and-geometry-aware Hough voting


• So far we have treated the entire region labeled as "vertical" as a whole

Decompose vertical region into surface segments
– Occlusion boundary recovery (Hoiem et al. IJCV’11)
– Vanishing line sweeping (Lee et al. CVPR’09)

[Figures: vertical surface candidates and an object candidate on the ground plane, with the inverse gravity direction]

Given object layout, erect surfaces one by one
“Interpretation by synthesis” (Gupta et al. ECCV’10)

[Figure: supporting plane 1, supporting plane 2, and the ground plane; labels tx, td, tX]

• Spring 2013 (ICCV’13 submission)
– Improved depth ordering constraint
– Using more prior knowledge of objects
– Multiple supporting planes

• Fall 2013 (CVPR’14 submission)
– Local geometric constraints involving vertical surfaces
– Utilizing semantic categories of surface regions

• Spring semester of 2014
– Thesis writing

Expected Contributions

• Systematically model the relationships among global and local geometric variables

• Develop a RANSAC-CRF scheme to handle non-linear, non-deterministic, and possibly invalid relationships

• Occlusion-and-geometry-aware object detection for finer depth order reasoning

• Joint reasoning among global geometries, surface segments, and objects

Thank you!
