Structure from Motion using Factorization

(1)

Sharat Chandran

ViGIL

Indian Institute of Technology Bombay
http://www.cse.iitb.ac.in/~sharat

March 2014

Note: These slides are best seen with accompanying video

(2)

Problem Definition

Can we understand motion using a single camera?

Given 2D point tracks of landmark points from a single viewpoint, recover 3D pose and orientation.

Assumptions

2D tracks of major landmark points are provided.
Scaled-projective/orthographic projection model.

(5)

Why is this a hard problem?

The mapping between 2D tracked positions and 3D body pose is many-to-many¹. This confounds standard regression algorithms.

[Figure: two points, P(x, y, −z) and P(x, y, z), on either side of a reference plane, with the two sides labelled Rear and Front.]

¹ Soatto, S., and Brockett, R. 1998. Optimal structure from motion: Local ambiguities and global estimates. IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
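One concrete source of this ambiguity can be illustrated with a minimal sketch. It assumes a pure orthographic camera looking along the z axis; the function name `orthographic_project` and the sample coordinates are made up for illustration. Two points that differ only in the sign of their depth land on the same image position.

```python
import numpy as np

def orthographic_project(points_3d):
    """Orthographic projection along z: keep (x, y), drop the depth."""
    return np.asarray(points_3d, dtype=float)[:, :2]

# Illustrative coordinates only (not taken from the slides).
front = np.array([[1.0, 2.0, 3.0]])            # P(x, y, z)
rear = front * np.array([1.0, 1.0, -1.0])      # P(x, y, -z), mirrored in depth

same_image_point = np.array_equal(
    orthographic_project(front), orthographic_project(rear)
)
print(same_image_point)
```

A single frame therefore cannot tell the two points apart; any disambiguation has to come from how the tracks evolve over time.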

(6)

Why this “may not” be such a hard problem after all?

The human brain performs this disambiguation with very little effort.

Psycho-physical and neuro-physiological imaging experiments have confirmed that we can perceive structure even when we are presented with a video sequence containing only the point tracks of the major joints of the human body².

² Johansson, G. 1976. Spatio-temporal differentiation and integration in visual motion perception. Psychological Research.

(7)

How can we mimic this ability?

Let's observe the trajectories of the joints.

[Figure: (a) the top-view trajectories of a few DOFs plotted; (b) one more DOF added to the plot; (c) all the DOFs included.]

(12)

How do we capture these structures?

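Matrix Factorization

\[
W_{2F \times P} =
\begin{bmatrix}
x_{11} & \cdots & x_{1P} \\
y_{11} & \cdots & y_{1P} \\
\vdots & \ddots & \vdots \\
x_{F1} & \cdots & x_{FP} \\
y_{F1} & \cdots & y_{FP}
\end{bmatrix}
\]

If the object in the scene is rigid, this matrix W has a very small rank!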
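The low-rank claim can be checked numerically. The sketch below is not from the slides: it assumes an orthographic camera, synthetic random points, and hypothetical helper names (`random_rotation`, `measurement_matrix`). It stacks the x and y tracks of each frame into a 2F × P matrix and reports its numerical rank.

```python
import numpy as np

def random_rotation(rng):
    """Random 3x3 rotation built from the QR decomposition of a Gaussian matrix."""
    q, r = np.linalg.qr(rng.standard_normal((3, 3)))
    q = q * np.sign(np.diag(r))        # fix the column signs of the QR factor
    if np.linalg.det(q) < 0:
        q[:, 0] = -q[:, 0]             # force determinant +1
    return q

def measurement_matrix(points, rotations, translations):
    """Stack orthographic projections: one x row and one y row per frame."""
    rows = []
    for rot, t in zip(rotations, translations):
        cam = rot @ (points - t[:, None])   # 3 x P points in camera coordinates
        rows.append(cam[0])                 # x coordinates in this frame
        rows.append(cam[1])                 # y coordinates in this frame
    return np.vstack(rows)

# Synthetic rigid scene (all sizes and values are assumptions).
rng = np.random.default_rng(0)
num_frames, num_points = 20, 30
points = rng.standard_normal((3, num_points))
rotations = [random_rotation(rng) for _ in range(num_frames)]
translations = rng.standard_normal((num_frames, 3))

W = measurement_matrix(points, rotations, translations)
rank_W = np.linalg.matrix_rank(W)
print(W.shape, rank_W)
```

Although W has 40 rows and 30 columns here, its rank stays small because every row is a linear combination of the three coordinate rows of the shape plus a constant row.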

(17)

Rigid Body Geometry and Motion

Object centroid based World Co-ordinate System (WCS)

(18)

Rank Theorem

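Define $\tilde{x}_{ij} = x_{ij} - \bar{x}_i$ and $\tilde{y}_{ij} = y_{ij} - \bar{y}_i$, where the bar notation refers to the centroid of the points in the $i$th frame. We have the registered measurement matrix

\[
\bar{W}_{2F \times P} =
\begin{bmatrix}
\tilde{x}_{11} & \cdots & \tilde{x}_{1P} \\
\tilde{y}_{11} & \cdots & \tilde{y}_{1P} \\
\vdots & \ddots & \vdots \\
\tilde{x}_{F1} & \cdots & \tilde{x}_{FP} \\
\tilde{y}_{F1} & \cdots & \tilde{y}_{FP}
\end{bmatrix}
\]

In the absence of noise, the matrix $\bar{W}$ has rank at most 3.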
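A small numerical check of the theorem, offered as a sketch under assumptions rather than as part of the original slides: the scene is synthetic, the camera is orthographic, and the variable names are illustrative. Subtracting each row's mean (the per-frame centroid) should leave a matrix of rank 3.

```python
import numpy as np

rng = np.random.default_rng(1)
F, P = 15, 25                                  # frames and points (assumed sizes)
S = rng.standard_normal((3, P))                # synthetic rigid shape

rows = []
for frame in range(F):
    axes, _ = np.linalg.qr(rng.standard_normal((3, 3)))  # orthonormal columns
    i_f, j_f = axes[:, 0], axes[:, 1]          # image x and y axes of this frame
    T_f = rng.standard_normal(3)               # camera position for this frame
    rows.append(i_f @ (S - T_f[:, None]))      # x tracks
    rows.append(j_f @ (S - T_f[:, None]))      # y tracks
W = np.vstack(rows)

# Registration: subtract the centroid of the points in each row.
W_bar = W - W.mean(axis=1, keepdims=True)

rank_raw = np.linalg.matrix_rank(W)
rank_registered = np.linalg.matrix_rank(W_bar)
print(rank_raw, rank_registered)
```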

(20)

Rank Theorem Proof

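With the world origin at the object centroid, the orthographic projections are

\[
x_{ij} = i_i^T (P_j - T_i), \qquad
y_{ij} = j_i^T (P_j - T_i), \qquad
\frac{1}{n} \sum_{j=1}^{n} P_j = 0 .
\]

Subtracting the per-frame centroid gives

\[
\tilde{x}_{ij} = i_i^T (P_j - T_i) - \frac{1}{n} \sum_{m=1}^{n} i_i^T (P_m - T_i)
\]

\[
\tilde{y}_{ij} = j_i^T (P_j - T_i) - \frac{1}{n} \sum_{m=1}^{n} j_i^T (P_m - T_i)
\]

The $T_i$ terms cancel, and the remaining sum vanishes because the centroid of the $P_m$ is the origin, so

\[
\tilde{x}_{ij} = i_i^T P_j, \qquad \tilde{y}_{ij} = j_i^T P_j, \qquad \bar{W} = R S
\]

\[
R =
\begin{bmatrix}
i_1^T \\ j_1^T \\ \vdots \\ i_F^T \\ j_F^T
\end{bmatrix},
\qquad
S =
\begin{bmatrix}
P_1 & P_2 & \cdots & P_n
\end{bmatrix}
\]

Here $F$ is the number of frames and $n$ the number of points. $R$ is $2F \times 3$ and $S$ is $3 \times n$, so $\bar{W}$ has rank at most 3.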
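The identity at the heart of the proof can be verified directly. The following sketch uses synthetic data and illustrative variable names (nothing here comes from the slides beyond the equations): it builds R from the per-frame axes, centres the shape at the origin, and compares the registered tracks with the product R S.

```python
import numpy as np

rng = np.random.default_rng(2)
num_frames, num_points = 10, 12                # assumed sizes
S = rng.standard_normal((3, num_points))
S -= S.mean(axis=1, keepdims=True)             # enforce (1/n) * sum_j P_j = 0

R_rows, W_rows = [], []
for frame in range(num_frames):
    axes, _ = np.linalg.qr(rng.standard_normal((3, 3)))
    i_f, j_f = axes[:, 0], axes[:, 1]          # orthonormal image axes
    T_f = rng.standard_normal(3)               # camera position
    R_rows += [i_f, j_f]
    W_rows += [i_f @ (S - T_f[:, None]), j_f @ (S - T_f[:, None])]

R = np.vstack(R_rows)                          # 2F x 3, rows i_1, j_1, ..., i_F, j_F
W = np.vstack(W_rows)                          # raw tracks, still contain T_f
W_bar = W - W.mean(axis=1, keepdims=True)      # registered tracks

max_error = np.abs(W_bar - R @ S).max()
print(max_error)
```

The translations never appear in R or S, yet the registered matrix matches R S up to floating-point error, which is the cancellation the proof relies on.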

(21)

Rigid Body Geometry and Motion

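Without noise, $\bar{W}$ is at most of rank three.

Using SVD, $\bar{W} = O_1 \Sigma O_2$, where $O_1$ has orthonormal columns, $O_2$ has orthonormal rows, and $\Sigma$ is a diagonal matrix with the singular values in non-increasing order.

\[
O_1 \Sigma O_2 = O_1' \Sigma' O_2' + O_1'' \Sigma'' O_2''
\]

where $O_1'$ has the first three columns of $O_1$, $O_2'$ has the first three rows of $O_2$, and $\Sigma'$ is the $3 \times 3$ diagonal matrix with the three largest singular values.

The second term is completely due to noise and can be eliminated.

\[
\hat{R} = O_1' \left[ \Sigma' \right]^{1/2}
\qquad \text{and} \qquad
\hat{S} = \left[ \Sigma' \right]^{1/2} O_2'
\]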
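The truncation step maps directly onto a library SVD. The sketch below assumes NumPy; the function name `factorize_rank3` is made up for illustration, and the test matrix is a generic noisy rank-3 product rather than real tracking data.

```python
import numpy as np

def factorize_rank3(W_bar):
    """Return (R_hat, S_hat) built from the three largest singular triplets."""
    # numpy returns the singular values sorted from largest to smallest.
    U, s, Vt = np.linalg.svd(W_bar, full_matrices=False)
    sqrt_sigma = np.diag(np.sqrt(s[:3]))
    R_hat = U[:, :3] @ sqrt_sigma          # first three columns of O_1
    S_hat = sqrt_sigma @ Vt[:3, :]         # first three rows of O_2
    return R_hat, S_hat

# Synthetic rank-3 matrix plus small noise (sizes and noise level are assumptions).
rng = np.random.default_rng(3)
clean = rng.standard_normal((40, 3)) @ rng.standard_normal((3, 30))
noisy = clean + 1e-3 * rng.standard_normal(clean.shape)

R_hat, S_hat = factorize_rank3(noisy)
relative_error = np.linalg.norm(R_hat @ S_hat - clean) / np.linalg.norm(clean)
print(R_hat.shape, S_hat.shape, relative_error)
```

Splitting the square root of the singular values evenly between the two factors is one convention; any other split is absorbed by the 3×3 ambiguity discussed on the following slide.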

(26)

Rigid Body Geometry and Motion

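The solution is not unique: for any invertible $3 \times 3$ matrix $Q$, $(\hat{R} Q)(Q^{-1} \hat{S}) = \hat{R} \hat{S}$, so the true factors can be written as $R = \hat{R} Q$ and $S = Q^{-1} \hat{S}$.

$\hat{R}$ is a linear transformation of $R$; similarly, $\hat{S}$ is a linear transformation of $S$.

Using the following orthonormality constraints on the rows $\hat{i}_f^T$, $\hat{j}_f^T$ of $\hat{R}$, we can find $R$ and $S$:

\[
\hat{i}_f^T Q Q^T \hat{i}_f = 1, \qquad
\hat{j}_f^T Q Q^T \hat{j}_f = 1, \qquad
\hat{i}_f^T Q Q^T \hat{j}_f = 0
\tag{1}
\]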
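One common way to use constraints (1) is to note that they are linear in the symmetric matrix L = Q Qᵀ. The sketch below follows that route as an assumption, not as the method prescribed by the slides: it solves for L by least squares and recovers Q by Cholesky factorization. The helper names `quad_row` and `metric_upgrade`, the synthetic cameras, and the matrix `Q_true` are all illustrative.

```python
import numpy as np

def quad_row(a, b):
    """Coefficients of a^T L b in the unknowns (l11, l12, l13, l22, l23, l33)."""
    return np.array([
        a[0] * b[0],
        a[0] * b[1] + a[1] * b[0],
        a[0] * b[2] + a[2] * b[0],
        a[1] * b[1],
        a[1] * b[2] + a[2] * b[1],
        a[2] * b[2],
    ])

def metric_upgrade(R_hat):
    """Solve constraints (1) for L = Q Q^T, then factor L to obtain one valid Q."""
    A, rhs = [], []
    for f in range(R_hat.shape[0] // 2):
        i_hat, j_hat = R_hat[2 * f], R_hat[2 * f + 1]
        A += [quad_row(i_hat, i_hat), quad_row(j_hat, j_hat), quad_row(i_hat, j_hat)]
        rhs += [1.0, 1.0, 0.0]
    l = np.linalg.lstsq(np.array(A), np.array(rhs), rcond=None)[0]
    L = np.array([[l[0], l[1], l[2]],
                  [l[1], l[3], l[4]],
                  [l[2], l[4], l[5]]])
    return np.linalg.cholesky(L)   # raises LinAlgError if L is not positive definite

# Synthetic test: distort true camera axes by a known invertible matrix.
rng = np.random.default_rng(4)
true_rows = []
for frame in range(10):
    axes, _ = np.linalg.qr(rng.standard_normal((3, 3)))
    true_rows += [axes[:, 0], axes[:, 1]]
R_true = np.vstack(true_rows)
Q_true = np.array([[2.0, 0.5, 0.0], [0.0, 1.0, 0.3], [0.1, 0.0, 1.5]])
R_hat = R_true @ np.linalg.inv(Q_true)

Q = metric_upgrade(R_hat)
R_up = R_hat @ Q
i_rows, j_rows = R_up[0::2], R_up[1::2]
worst_violation = max(
    np.abs((i_rows ** 2).sum(axis=1) - 1).max(),
    np.abs((j_rows ** 2).sum(axis=1) - 1).max(),
    np.abs((i_rows * j_rows).sum(axis=1)).max(),
)
print(worst_violation)
```

The recovered Q generally differs from `Q_true` by a rotation, since Q and Q times any rotation give the same Q Qᵀ; the reconstruction is therefore defined only up to the choice of world orientation. With noisy data the least-squares L may fail to be positive definite, in which case the Cholesky step raises an error and a different factorization of L would be needed.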

(28)

Tomasi Kanade Factorisation (Recap)

\[
W =
\begin{bmatrix}
27 & 61 & \cdots & 96 \\
97 & 53 & \cdots & 122 \\
28 & 62 & \cdots & 97 \\
97 & 53 & \cdots & 122 \\
\vdots & \vdots & \ddots & \vdots \\
94 & ? & \cdots & 131 \\
109 & ? & \cdots & 135
\end{bmatrix}
\]

(35)

Tomasi Kanade Factorisation (Recap)

Central Observation: This matrix is rank-limited.

If the object motion is rigid, the observation matrix W (discounting noise) will have a maximum rank of 4.
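The bound of 4 for the raw matrix, compared with 3 for the registered matrix of the Rank Theorem, can be seen from a short derivation. It is a sketch that reuses the projection equations from the proof; the symbols $t$ and $\mathbf{1}$ are introduced here for illustration and do not appear in the slides.

```latex
% x_{ij} = i_i^T (P_j - T_i) = i_i^T P_j - i_i^T T_i, and likewise for y_{ij}.
% Collect the per-frame offsets in a column vector t and let 1 be the all-ones vector:
\[
W = R S + t \mathbf{1}^T
  = \begin{bmatrix} R & t \end{bmatrix}
    \begin{bmatrix} S \\ \mathbf{1}^T \end{bmatrix},
\qquad
t = \begin{bmatrix} -i_1^T T_1 \\ -j_1^T T_1 \\ \vdots \\ -i_F^T T_F \\ -j_F^T T_F \end{bmatrix}
\]
% [R  t] has only 4 columns, so rank(W) <= 4.
% With the centroid of the points at the origin, subtracting each row's mean
% removes t 1^T and leaves \bar{W} = R S, of rank at most 3.
```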

(36)

Tomasi Kanade Factorisation (Recap)

\[
W = R S,
\qquad
R =
\begin{bmatrix}
R_1 \\ R_2 \\ \vdots \\ R_N
\end{bmatrix}
\]

Here $R$ stacks the per-frame blocks $R_1, R_2, \ldots, R_N$ and $S$ is the shape.

(37)

Tomasi Kanade Factorisation (Recap)

Assumptions

Orthographic camera model
Single object in the FOV of the camera
Object undergoes rigid motion
All the points are visible throughout the sequence

(39)

For Further Reading I

G. Golub and C. Van Loan. Matrix Computations. Johns Hopkins University Press, 1996.

C. Tomasi and T. Kanade. Shape and motion from image streams: A factorization method. Image of Science: Science of Images, 90:9795–9802, 1993.

J. Xiao, J. Chai and T. Kanade. A Closed-Form Solution to Non-Rigid Shape and Motion Recovery. ECCV, 2004.

(40)

For Further Reading II

C. Bregler, A. Hertzmann and H. Biermann. Recovering Non-Rigid 3D Shape from Image Streams. CVPR, 2000.

M. Brand. Morphable 3D Models from Video. CVPR, 2001.

Appu Shaji, Aydin Varol, Pascal Fua, Yashoteja, Ankush Jain and Sharat Chandran. Resolving Occlusion in Multiframe Reconstruction of Deformable Surfaces. NORDIA, CVPRW, 2011.

M. Kilian, N. Mitra and H. Pottmann. Geometric Modeling in Shape Space. Siggraph, 2008.