25
Evaluating Lookup-Based Monocular Human Pose Tracking on the HumanEva Test Data Nicholas R. Howe

Evaluating Lookup-Based Monocular Human Pose Tracking …

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Evaluating Lookup-Based Monocular Human Pose Tracking …

Evaluating Lookup-Based Monocular Human Pose Tracking

on theHumanEva Test Data

Nicholas R. Howe

Page 2: Evaluating Lookup-Based Monocular Human Pose Tracking …

Orientation

• Goal of work: Develop HumanEva results– Algorithms not necessarily state of the art– Useful as baseline

Single camera

3D poserecovery

Page 3: Evaluating Lookup-Based Monocular Human Pose Tracking …

Lookup-Based Motion Capture (1)

• Use silhouettes to retrieve known poses

Library of known 3D poses Library of known 3D poses with with observationsobservations

CandidateCandidateposesposes

ObservationsObservationsfrom videofrom video

Page 4: Evaluating Lookup-Based Monocular Human Pose Tracking …

Lookup-Based Motion Capture (2)

• Use second-order hidden Markov model to select pose sequence with low energy

Frame 1Frame 1 Frame 2Frame 2 Frame 3Frame 3 Optimize for:• Agreement with

frame observations• Agreement with

flow observations• Small inertial

changes between frames

Page 5: Evaluating Lookup-Based Monocular Human Pose Tracking …

Motion capturedata

Pose & flow rendering

PoseLibrary

Final Pose Trajectory

Frame i

Frame i-1

…Frame i+1

Inpu

t vid

eo Candidates i-1

Candidates i

Candidates i+1

Optical flow& foregroundsegmentation R

etrie

val

MarkovChain

Optimization

Flow Chart of Motion Capture

Page 6: Evaluating Lookup-Based Monocular Human Pose Tracking …

Some Related Work• Estimating Human Body Configuration Using Shape

Context MatchingMori & Malik, ECCV 2002

• 3D Tracking = Classification+InterpolationTomasi, Petrov, & Sastry, ICCV 2003

• Silhouette Lookup for Automatic Pose Tracking Howe, ANM 2004

• 3D Articulated Models and Multi-View Tracking with SilhouettesDelamarre & Faugeras, ICCV 1999

• Temporal Integration of Multiple Silhouette-based Body-part HypothesesKwatra, Bobick, & Johnson, CVPR 2001

Page 7: Evaluating Lookup-Based Monocular Human Pose Tracking …

Caveats

• Scalability of pose library is a concern– May limit technique to specific applications– Walking library: 805 poses– Boxing library = several thousand poses– Some work exists on sublinear retrieval

• Lookup employs background subtraction– Good segmentation result is often achievable– BS not required in principle for lookup-based methods

• Others have demonstrated edge-based techniques

Page 8: Evaluating Lookup-Based Monocular Human Pose Tracking …

Overview

• Feature from video– Background subtraction– Optical flow

• Lookup techniques• Markov chaining• Results

Page 9: Evaluating Lookup-Based Monocular Human Pose Tracking …

Background Subtraction

• Graph cut formulation uses edge data– Segmentation tends to follow edges

• HSV color space with shadow correction• Robust estimation of background• Failures mostly due to poor contrast

Typical good result Typical bad result

Page 10: Evaluating Lookup-Based Monocular Human Pose Tracking …

Optical Flow

• Optical flow from Krause method• Mask by foreground & compute moments

Page 11: Evaluating Lookup-Based Monocular Human Pose Tracking …

Pose Lookup

• Candidate pool combines multiple queries:

Half-chamferdistance (A)

Turning angledistance (B)

Flow momentdistance (C)

Most results areconstrained to lie near the previous frame’s candidates.

A few open retrievals are also included.

Combinations:A+B+CA+BC only

Page 12: Evaluating Lookup-Based Monocular Human Pose Tracking …

Frame “Stitching”

• First-order Markov chain sufficient for “smoothness”• Second-order chain is needed for conservation of

momentum• Flow match & momentum conservation intended to

prevent “shuffle-step” errors

?

Page 13: Evaluating Lookup-Based Monocular Human Pose Tracking …

Results

• Results available for:S3_Walking_1_(BW2): Mean error 11 pixels S3_Walking_1_(C2): Mean error 14 pixelsS2_Walking_1_(BW2): Mean error 13 pixels*S3_Walking_1_(C1): Mean error 18 pixels*

*Affected by error in background subtraction

• Boxing available soon (hopefully!)• Observations:

– Left-right inversion problems– Error is highest at extremities =

Page 14: Evaluating Lookup-Based Monocular Human Pose Tracking …

S3_Walking_1_(BW2)

• Mean error after swaps is 11 pixels.

* apparent error in mocap for head

*

Page 15: Evaluating Lookup-Based Monocular Human Pose Tracking …

Visualization

Page 16: Evaluating Lookup-Based Monocular Human Pose Tracking …

Optimization

• Match may be improved by optimization on pose parameters

• One frame at a time– Improve chamfer match with silhouette– Improve smoothness: use quadratic fit to

parameters over 11-frame window• Error improves to 10 pixels after one round

Page 17: Evaluating Lookup-Based Monocular Human Pose Tracking …

S2_Walking_1_(BW2) • Mean error after swaps is 13 pixels.

Page 18: Evaluating Lookup-Based Monocular Human Pose Tracking …

S3_Walking_1_(C2)

• Mean error after swaps is 14 pixels.

* apparent error in mocap for head

*

Page 19: Evaluating Lookup-Based Monocular Human Pose Tracking …

S3_Walking_1_(C1)

• Mean error after swaps is 18 pixels.

*

Unstable due to Unstable due to bad segmentationbad segmentation

Page 20: Evaluating Lookup-Based Monocular Human Pose Tracking …

Visualization

Page 21: Evaluating Lookup-Based Monocular Human Pose Tracking …

Conclusions

Few results, but some trends are clear:• Pixel accuracies in the teens• Limitations of silhouettes evident

– Left-right ambiguity is still an unsolved problem for this method

– Arm locations drift when obscured by torso• One set of results for HumanEva

Page 22: Evaluating Lookup-Based Monocular Human Pose Tracking …
Page 23: Evaluating Lookup-Based Monocular Human Pose Tracking …

Coordination Between Frames

• Need to pick from top matches at each frame.– Want good image match at all frames– Want small change between frames⇒ Markov chain minimization!

• Best local choices minimize global error

etc.

frame i-1 frame i frame i+1

Page 24: Evaluating Lookup-Based Monocular Human Pose Tracking …

2. Identify best (least expensive)

result

Markov Chain Minimization

Frame 1 Frame 2 Frame n

...

1. Compute cheapest path to each state from previous states(cost = estimate of plausibility)

State 2A

State 2C

State 2B

State 1A

State 1C

State 1B

State nA

State nC

State nB

3. Backtrack, picking out path that gave best result.

Page 25: Evaluating Lookup-Based Monocular Human Pose Tracking …

Silhouette Comparison

Turning angle(Captures morphology)

Chamfer distance(Captures overlap)

• Combine using Belkin technique(score = sum of individual ranks)