
A Compositional Object-Based Approach to Learning Physical Dynamics

Michael Chang, Tomer Ullman, Antonio Torralba, Joshua B. Tenenbaum

Project Webpage: http://mbchang.github.io/npe
Code: http://github.com/mbchang/dynamics

1 Vision

2 Steps

3 Neural Physics Engine (NPE)

Predict the velocity of each object in turn, given the pairwise interactions with its neighborhood context objects.
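As a formula (notation mine, summarizing the description above; the full model conditions on the past two timesteps, which is elided here): for focus object o with state x_o, context objects c in its neighborhood N(o), pairwise encoder f_enc, and decoder f_dec,

\hat{v}_o^{\,t+1} = f_{\mathrm{dec}}\Big( x_o^{\,t},\ \sum_{c \in N(o)} f_{\mathrm{enc}}\big(x_o^{\,t},\, x_c^{\,t}\big) \Big)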

[Architecture figure: scenario and model architectures, each applied to object 3.]

NPE: pairwise encoder → sum of pairwise effects → decoder.
No-Pairwise (NP) baseline: multi-layer encoder → multi-layer decoder; no pairwise factorization.
LSTM baseline: pairwise layer → multi-layer LSTM; no composition of independent effects.
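A minimal PyTorch sketch of the NPE computation summarized above (layer sizes, the 2-D velocity output, and all names are my assumptions; the released code at http://github.com/mbchang/dynamics is written in Torch/Lua):

```python
import torch
import torch.nn as nn

class NPE(nn.Module):
    """Sketch: pairwise encoder -> sum of pairwise effects -> decoder."""
    def __init__(self, state_dim, effect_dim=50, hidden=100):
        super().__init__()
        # Encodes a (focus, context) pair into that context object's effect.
        self.pair_encoder = nn.Sequential(
            nn.Linear(2 * state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, effect_dim))
        # Decodes the focus state plus the summed effects into a velocity.
        self.decoder = nn.Sequential(
            nn.Linear(state_dim + effect_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 2))  # 2-D velocity

    def forward(self, focus, contexts):
        # focus: (B, state_dim); contexts: (B, K, state_dim) holding the
        # K neighborhood context objects of the focus object.
        K = contexts.shape[1]
        pairs = torch.cat(
            [focus.unsqueeze(1).expand(-1, K, -1), contexts], dim=-1)
        effects = self.pair_encoder(pairs)   # (B, K, effect_dim)
        summed = effects.sum(dim=1)          # compose independent effects
        return self.decoder(torch.cat([focus, summed], dim=-1))
```

To produce a rollout, this per-object prediction is applied to each object in turn, and the predicted velocities update the positions fed into the next step.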

4a Generalization: different numbers of objects

4b Generalization: different scene configurations

[4a rollouts: Ground Truth vs. NPE, NP, and LSTM on worlds of 3, 4, 5, 6, 7, and 8 balls. Train: worlds with 3, 4, 5 objects (fewer objects); test: worlds with 6, 7, 8 objects (more objects).]

[4b rollouts: Ground Truth vs. NPE, NP, and LSTM in “O”, “L”, “U”, and “I” worlds. Train: “O” and “L” worlds (without internal obstacles); test: “U” and “I” worlds (with internal obstacles).]

In the 4b rollouts, balls predicted by the NPE do not overlap with internal obstacles, while those predicted by NP and LSTM do. This indicates that the NPE is invariant to position and scene configuration, whereas NP and LSTM memorize the training wall configuration.

a) Prediction task: cosine similarity (top) and relative magnitude (middle) of the predicted velocity vs. ground truth over 50 steps of simulation; log MSE on velocity (bottom) over the course of training. (The metrics are sketched in code after this list.)

b) Generalization task: same metrics as the prediction task, but trained on worlds with 3, 4, and 5 objects and tested on worlds with 6, 7, and 8 objects.

c) Inference task: inferring latent object mass from observed trajectories; the NPE reaches about 90% accuracy in both the prediction and generalization settings.

d) Neighborhood: constrains the search space of context objects
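A minimal NumPy sketch of the metrics in (a) and (b) and of a neighborhood mask as in (d); the function names and the distance-threshold form of the neighborhood are my assumptions, not taken from the released code:

```python
import numpy as np

def cosine_similarity(v_pred, v_true):
    """Cosine similarity between predicted and ground-truth velocities."""
    num = np.sum(v_pred * v_true, axis=-1)
    den = (np.linalg.norm(v_pred, axis=-1)
           * np.linalg.norm(v_true, axis=-1) + 1e-12)
    return num / den

def relative_magnitude(v_pred, v_true):
    """Ratio of predicted speed to ground-truth speed (1.0 is perfect)."""
    return (np.linalg.norm(v_pred, axis=-1)
            / (np.linalg.norm(v_true, axis=-1) + 1e-12))

def log_mse(v_pred, v_true):
    """Log mean squared error on velocity, tracked over training."""
    return np.log(np.mean((v_pred - v_true) ** 2))

def neighborhood_mask(positions, focus_idx, radius):
    """Boolean mask over objects within `radius` of the focus object,
    excluding the focus object itself; this constrains which context
    objects enter the pairwise sum."""
    d = np.linalg.norm(positions - positions[focus_idx], axis=-1)
    mask = d < radius
    mask[focus_idx] = False
    return mask
```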

5 Contributions

Ingredients useful for generalization:
• Object-based representations
• Context selection mechanism
• Factorization: factorize larger objects into smaller building blocks
• Compositionality: compute the prediction as a composition of pairwise interactions

Vision: In order to rapidly learn new tasks and flexibly adapt to changes in inputs and goals, an agent needs a prior on physics that allows it to naturally generalize its reasoning to novel scenes.

[Diagram: physics drives the world state; perception maps the world state to the agent state; the agent plans over beliefs and desires to produce actions.]

Approach: Endow the agent with this prior in the form of a learned physics simulator.

This work: We present the Neural Physics Engine, a framework for learning simulators of intuitive physics that naturally generalizes across variable object count and different scene configurations.

[Diagram: knowledge from prior experience transfers to novel scenes with different numbers of objects and different scene configurations.]

What are the primitives, means of combination, and means of abstraction that underlie the training and testing distributions, such that generalization comes for free?

By combining the expressiveness of physics engines and the adaptability of neural networks in a compositional architecture that naturally supports generalization in fundamental aspects of physical reasoning, the Neural Physics Engine is an important step toward lifting an agent’s ability to think at a level of abstraction where the concept of physics is primitive.

Contributions
• Presented a framework for learning a physics simulator
• Proposed ingredients useful for generalization
• Combined the strengths of symbolic and neural models in an object-based neural network
• Demonstrated an instantiation of the NPE framework on prediction, generalization, and inference tasks in worlds of balls and obstacles

Ingredients useful for generalization
• Object-based representations
• Context-selection mechanism
• Factorization
• Compositionality

Combining the advantages of symbolic and neural models

Neural Physics Engine (NPE)

• Generic notions of objects and their interactions
• Trained to model specific object properties and dynamics of different worlds
• Transfers knowledge across different object counts and scene configurations

Symbolic Physics Engines

• Expressive
• Knowledge encoded in structure
• Difficult to adapt to scenarios outside the description language

Neural Networks

• Adaptive
• Knowledge emerges through training
• Requires retraining for new scenarios

How can we incorporate inductive biases that are not only strong enough to generalize across scenes, but also flexible enough to learn from and adapt to different inputs?

Simulation videos: https://goo.gl/BWYuOF
