• No results found

Structure from Motion using Factorization

N/A
N/A
Protected

Academic year: 2022

Share "Structure from Motion using Factorization"

Copied!
40
0
0

Loading.... (view fulltext now)

Full text

(1)

Structure from Motion using Factorization

Sharat Chandran

ViGIL

Indian Institute of Technology Bombay http://www.cse.iitb.ac.in/∼sharat

March 2014

Note: These slides are best seen with accompanying video

(2)

Problem Definition

Can we understand motion using a single camera?

Given 2D point tracks of landmark points from asingle view point, recover 3D pose and orientation

Assumptions

2D tracks of major landmark points are provided Scaled-projective/orthographic projection model.

(3)

Problem Definition

Can we understand motion using a single camera?

Given 2D point tracks of landmark points from asingle view point, recover 3D pose and orientation

Assumptions

2D tracks of major landmark points are provided Scaled-projective/orthographic projection model.

(4)

Problem Definition

Can we understand motion using a single camera?

Given 2D point tracks of landmark points from asingle view point, recover 3D pose and orientation

Assumptions

2D tracks of major landmark points are provided Scaled-projective/orthographic projection model.

(5)

Why is this a hard problem?

The mapping between 2D tracked positions and 3D body pose is many-to-many1. This confounds standard regression

algorithms.

Rear

1

P (x, y, −z)2 P (x, y, z)

Front

Reference Plane

1

SOATTO, S.,ANDBROCKETT, R.

1998.

Optimal structure from motion: Local ambiguites and global estimates.

IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

(6)

Why this “may not” be such a hard problem after all?

Human brain perform thisdisambiguationwith very little ease.

Psycho-physical and neuro-physiological imaging

experiments have confirmed the fact that we can perceive structure even when we are presented with a video sequence containing only the point tracks of the major joints in the human body2

2

JOHANSSON, G.

1976.

Spatio temporal differentiation and integration in visual motion perception.

Psychological Research.

(7)

How can we mimic this ability?

Let’s observe the trajectories of joint

(a) The top view tra- jectories of a few dofs plotted

(b) One more dof added to the plot

(c) All the dofs in- cluded

(8)

How can we mimic this ability?

Let’s observe the trajectories of joint

(a) The top view tra- jectories of a few dofs plotted

(b) One more dof added to the plot

(c) All the dofs in- cluded

(9)

How can we mimic this ability?

Let’s observe the trajectories of joint

(a) The top view tra- jectories of a few dofs plotted

(b) One more dof added to the plot

(c) All the dofs in- cluded

(10)

How can we mimic this ability?

Let’s observe the trajectories of joint

(a) The top view tra- jectories of a few dofs plotted

(b) One more dof added to the plot

(c) All the dofs in- cluded

(11)

How can we mimic this ability?

Let’s observe the trajectories of joint

(a) The top view tra- jectories of a few dofs plotted

(b) One more dof added to the plot

(c) All the dofs in- cluded

(12)

How do we capture these structures?

Matrix Factorization

W2F×P=

x11 · · · x1p

y11 · · · y1p

... ... ...

xf1 · · · xfp

yf1 · · · yfp

If the object in the scene isrigidthis matrixWhas a very small rank!!

(13)

How do we capture these structures?

Matrix Factorization

W2F×P=

x11 · · · x1p y11 · · · y1p ... ... ... xf1 · · · xfp yf1 · · · yfp

If the object in the scene isrigidthis matrixWhas a very small rank!!

(14)

How do we capture these structures?

Matrix Factorization

W2F×P=

x11 · · · x1p y11 · · · y1p ... ... ... xf1 · · · xfp yf1 · · · yfp

If the object in the scene isrigidthis matrixWhas a very small rank!!

(15)

How do we capture these structures?

Matrix Factorization

W2F×P=

x11 · · · x1p y11 · · · y1p ... ... ... xf1 · · · xfp yf1 · · · yfp

If the object in the scene isrigidthis matrixWhas a very small rank!!

(16)

How do we capture these structures?

Matrix Factorization

W2F×P=

x11 · · · x1p y11 · · · y1p ... ... ... xf1 · · · xfp yf1 · · · yfp

If the object in the scene isrigidthis matrixWhas a very small rank!!

(17)

Rigid Body Geometry and Motion

Object centroid based World Co-ordinate System (WCS)

(18)

Rank Theorem

Definex˜ij =xij−x¯i andy˜ij =yij −y¯i where the bar notation refers to the centroid of the points in theith frame. We have the measurement matrix

2F×P=

11 · · · x˜1p

y11 · · · y1p ... ... ... x˜f1 · · · x˜fp yf1 · · · yfp

The matrix has rank 3

(19)

Rank Theorem

Definex˜ij =xij−x¯i andy˜ij =yij −y¯i where the bar notation refers to the centroid of the points in theith frame. We have the measurement matrix

2F×P=

11 · · · x˜1p

y11 · · · y1p ... ... ... x˜f1 · · · x˜fp yf1 · · · yfp

The matrix has rank 3

(20)

Rank Theorem Proof

xij = iTi (PjTi), yij=jTi(PjTi), 1 n

n

X

j=1

Pj =0

ij = iTi (PjTi)− 1 n

n

X

m=1

iTi(PmTi)

ij = jTi (PjTi)− 1 n

n

X

m=1

jTi(PmTi) x˜ij = iTi Pj ij=jTiPj W¯ = RS

R =

iT1 jT1 . . .

iTN jTN

S=

P1 P2 . . . PN

(21)

Rigid Body Geometry and Motion

Without noiseWis atmost of rankthree Using SVD,W=O1ΣO2where,

O1,O2are column orthogonal matrices andΣis a diagonal matrix with singular values in non-decreasing order

O1ΣO2=O01Σ0O02+O001Σ00O002 where,

O01hasfirst threecolumns ofO1,O02hasfirst threerows of O2andΣ0 is 3×3 matrix with 3 largest non-singular values.

The second term is completely due to noise and can be eliminated

Rˆ =O01h Σ0

i1/2

andSˆ = h

Σ0 i1/2

O02

(22)

Rigid Body Geometry and Motion

Without noiseWis atmost of rankthree Using SVD,W=O1ΣO2where,

O1,O2are column orthogonal matrices andΣis a diagonal matrix with singular values in non-decreasing order

O1ΣO2=O01Σ0O02+O001Σ00O002 where,

O01hasfirst threecolumns ofO1,O02hasfirst threerows of O2andΣ0 is 3×3 matrix with 3 largest non-singular values.

The second term is completely due to noise and can be eliminated

Rˆ =O01h Σ0

i1/2

andSˆ = h

Σ0 i1/2

O02

(23)

Rigid Body Geometry and Motion

Without noiseWis atmost of rankthree Using SVD,W=O1ΣO2where,

O1,O2are column orthogonal matrices andΣis a diagonal matrix with singular values in non-decreasing order

O1ΣO2=O01Σ0O02+O001Σ00O002 where,

O01hasfirst threecolumns ofO1,O02hasfirst threerows of O2andΣ0 is 3×3 matrix with 3 largest non-singular values.

The second term is completely due to noise and can be eliminated

Rˆ =O01h Σ0

i1/2

andSˆ = h

Σ0 i1/2

O02

(24)

Rigid Body Geometry and Motion

Without noiseWis atmost of rankthree Using SVD,W=O1ΣO2where,

O1,O2are column orthogonal matrices andΣis a diagonal matrix with singular values in non-decreasing order

O1ΣO2=O01Σ0O02+O001Σ00O002 where,

O01hasfirst threecolumns ofO1,O02hasfirst threerows of O2andΣ0 is 3×3 matrix with 3 largest non-singular values.

The second term is completely due to noise and can be eliminated

Rˆ =O01h Σ0

i1/2

andSˆ = h

Σ0 i1/2

O02

(25)

Rigid Body Geometry and Motion

Without noiseWis atmost of rankthree Using SVD,W=O1ΣO2where,

O1,O2are column orthogonal matrices andΣis a diagonal matrix with singular values in non-decreasing order

O1ΣO2=O01Σ0O02+O001Σ00O002 where,

O01hasfirst threecolumns ofO1,O02hasfirst threerows of O2andΣ0 is 3×3 matrix with 3 largest non-singular values.

The second term is completely due to noise and can be eliminated

Rˆ =O01h Σ0

i1/2

andSˆ = h

Σ0 i1/2

O02

(26)

Rigid Body Geometry and Motion

Solution is not uniqueany invertible 3×3,Qmatrix can be written asR= ( ˆRQ)andS= (Q−1S)ˆ

Rˆ is a linear transformation ofR, similarlySˆ is a linear transformation ofS.

Using the following orthonormality constraints we can find RandS

ˆiTf QQTˆif =1

ˆjTf QQTˆjf =1

ˆiTf QQTˆjf =0 (1)

(27)

Rigid Body Geometry and Motion

Solution is not unique any invertible 3×3,Qmatrix can be written asR= ( ˆRQ)andS= (Q−1S)ˆ

Rˆ is a linear transformation ofR, similarlySˆ is a linear transformation ofS.

Using the following orthonormality constraints we can find RandS

ˆiTf QQTˆif =1 ˆjTf QQTˆjf =1

ˆiTf QQTˆjf =0 (1)

(28)

Tomasi Kanade Factorisation (Recap)

. . .

(29)

Tomasi Kanade Factorisation (Recap)

. . .

27 61 · · · 96

97 53 · · · 122

28 62 · · · 97

97 53 · · · 122

... ... ... ...

... ... ... ...

94 ? · · · 131

109 ? · · · 135

W

(30)

Tomasi Kanade Factorisation (Recap)

. . .

27 61 · · · 96

97 53 · · · 122

28 62 · · · 97

97 53 · · · 122

... ... ... ...

... ... ... ...

94 ? · · · 131

109 ? · · · 135

W

(31)

Tomasi Kanade Factorisation (Recap)

. . .

27 61 · · · 96

97 53 · · · 122

28 62 · · · 97

97 53 · · · 122

... ... ... ...

... ... ... ...

94 ? · · · 131

109 ? · · · 135

W

(32)

Tomasi Kanade Factorisation (Recap)

. . .

27 61 · · · 96

97 53 · · · 122

28 62 · · · 97

97 53 · · · 122

... ... ... ...

... ... ... ...

94 ? · · · 131

109 ? · · · 135

W

(33)

Tomasi Kanade Factorisation (Recap)

. . .

27 61 · · · 96

97 53 · · · 122

28 62 · · · 97

97 53 · · · 122

... ... ... ...

... ... ... ...

94 ? · · · 131

109 ? · · · 135

W

(34)

Tomasi Kanade Factorisation (Recap)

. . .

27 61 · · · 96

97 53 · · · 122

28 62 · · · 97

97 53 · · · 122

... ... ... ...

... ... ... ...

94 ? · · · 131

109 ? · · · 135

W

(35)

Tomasi Kanade Factorisation (Recap)

. . .

27 61 · · · 96

97 53 · · · 122

28 62 · · · 97

97 53 · · · 122

... ... ... ...

... ... ... ...

94 ? · · · 131

109 ? · · · 135

Central Observation: This matrix is rank-limited.

If the object motion is rigid the observation matrix (discounting noise) will have a maximum rank of 4

W

(36)

Tomasi Kanade Factorisation (Recap)

. . .

27 61 · · · 96

97 53 · · · 122

28 62 · · · 97

97 53 · · · 122

... ... ... ...

... ... ... ...

94 ? · · · 131

109 ? · · · 135

=

Shape

Central Observation: This matrix is rank-limited.

If the object motion is rigid the observation matrix (discounting noise) will have a maximum rank of 4

R

1

R

2

R

N

...

R S

W

(37)

Tomasi Kanade Factorisation (Recap)

. . .

27 61 · · · 96

97 53 · · · 122

28 62 · · · 97

97 53 · · · 122

... ... ... ...

... ... ... ...

94 ? · · · 131

109 ? · · · 135

=

Shape

Central Observation: This matrix is rank-limited.

If the object motion is rigid the observation matrix (discounting noise) will have a maximum rank of 4

Orthographic Camera Model Single Object in FOV of camera Object undergoes rigid motion All the points are visible throughout the sequence

Assumptions

R

1

R

2

R

N

...

R S

W

(38)

Tomasi Kanade Factorisation (Recap)

. . .

27 61 · · · 96

97 53 · · · 122

28 62 · · · 97

97 53 · · · 122

... ... ... ...

... ... ... ...

94 ? · · · 131

109 ? · · · 135

=

Shape

Central Observation: This matrix is rank-limited.

If the object motion is rigid the observation matrix (discounting noise) will have a maximum rank of 4

Orthographic Camera Model Single Object in FOV of camera Object undergoes rigid motion All the points are visible throughout the sequence

Assumptions

R

1

R

2

R

N

...

R S

W

(39)

For Further Reading I

G. Golub and A. Loan Matrix Computations

John Hopkins U. Press, 1996 C. Tomasi and T. Kanade

Shape and motion from image stream: A factorization method

Image of Science: Science of Images, 90:9795–9802,1993 J. Xiao and J. Chai and T. Kanade

A Closed-Form Solution to Non-Rigid Shape and Motion Recovery

ECCV 2004

(40)

For Further Reading II

C. Bregler and A. Hertzmann and H. Biermann

Recovering Non-Rigid 3D Shape from Image Streams CVPR, 2000

M. Brand

Morphable 3D Models from Video CVPR, 2001

Appu Shaji and Aydin Varol and Pascal Fua and Yashoteja and Ankush Jain and Sharat Chandran

Resolving Occlusion in Multiframe Reconstruction of Deformable Surfaces

NORDIA,CVPRW, 2011

M. Kilian, N. Mitra and H. Pottmann. Geometric Modeling in Shape Space. Siggraph, 2008.

References

Related documents

indicating motivation and aims To make students understand cognitive science from the point of view of Psychology,. Neuroscience, Linguistics,

Ans. The point-in-polygon overlay determines the points from one input layer lying inside a particular polygon. In this method the output layer is still a point layer but has

● Just like in 2D Euclidean space, if we know the Laplace-Beltrami operator of the surface, and the function value at a single point, we can solve for the function at all points on

16.8 and 16.9, we observe that, from the point of view of kinetics, the most general plane motion of a rigid body symmetrical with respect to a reference plane can be replaced by

– Find the intersection point of given line and edge line of the window. – Repeat until both end points

• The projection is obtained on a plane known as Picture Plane and the view is taken from a point known as Station Point....

From a technical point of view, e-waste patents can be divided into two components: (1) material recovery from sources of e-waste- materials such as plastics

We propose a novel, end-to-end trainable, deep network called ParSeNet that decomposes a 3D point cloud into parametric surface patches, including B-spline patches as well as