24
Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective: Krishna Rajan Iowa State University http://cosmic.mse.iastate.edu July 13 th 2011 Acknowledgements: National Science Foundation AFOSR Dept. of Homeland Security Army Research Office Office of Naval Research Dept. of Energy DARPA

Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Material Informatics: Data, Methodologies, and Applications

Informatics for the Materials Genome: a Minimalist Perspective: Krishna Rajan Iowa State University http://cosmic.mse.iastate.edu

July 13th 2011

Acknowledgements: • National Science Foundation • AFOSR

Dept. of Homeland Security • Army Research Office • Office of Naval Research • Dept. of Energy • DARPA

Page 2: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Do we really need more data?

Data vs Knowledge

http://www.genengnews.com/

Krishna Rajan

Page 3: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Functionality = F ( x1 , x2 , x3 , x4 , x5 , x6 , x7 , x8 ……)

Issues: • how many variables? • which variables are important? • classify behavior among variables • making quantitative predictions …relate functionality to variables …

• traditionally we describe them by empirical equations: •Quantitative Structure Activity Relationships (QSARs) are derived from data mining techniques not assuming a priori which physics is the most important

Need to build database with these variables

Krishna Rajan Krishna Rajan

High Dimensional Data

Page 4: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Multidimensionality of data

Krishna Rajan

Hume-Rothery 1926, 1934

Laves 1956, 1967

Engel-Brewer 1964, 1967

Pearson 1972

Villars 1995 ……………………….

http://www.chem.ox.ac.uk/icl/heyes/structure_of_solids/Lecture3/Lec3.html#anchor5

Page 5: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Multidimensionality of data

Krishna Rajan

Page 6: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

n

i

ii ppH1

log

Classification (partitioning of feature space)

= Minimization of information entropy

= Maximization of information gain

: Probability distribution of AB2 structure types occurred in Linus Pauling File (LPF)

Information Entropy

Krishna Rajan: ICME Symposium- MS&T

Krishna Rajan

Kong & Rajan-2012

Page 7: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Ranking descriptors

Krishna Rajan

Page 8: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Information entropy based “phase diagrams”

Krishna Rajan

Page 9: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

840 compounds

(34 structure types)

140 compounds

(14 structure types)

22 compounds

(2 structure types)

Recursive partioning to track Evolution of design rules

Tracking Structural Correlations

Krishna Rajan

Page 10: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Developing a design rules for intermetallics

Krishna Rajan

Page 11: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Size factor

Electrochemical factor

Valence-electron factor

GeX2

Crystal-structure design rules

Tracking design rules

Entropy scaled Structure map

Krishna Rajan

Kong & Rajan (2009/ 2012)

Page 12: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

The possible crystal structures of a hypothetical compound AuBe2 are suggested from the classification tree constructed by using known data.

A compound the structure is unknown

Potential structure types

Data-driven crystal chemistry (if-then) rules

Guiding Structure Prediction

Krishna Rajan

Page 13: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

“Minimalism”: Linking information entropy to irreducible representations

Krishna Rajan

Page 14: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Linking Crystal Chemistry with Crystal Symmetry

Krishna Rajan

Page 15: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Limited Data Problem—no data deluge!

Page 16: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Data Diversity

Page 17: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Modeling with Data Mining

Page 18: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

“omics” materials design Non-“omics” materials design

Accelerated Design- value of “omics” design

Krishna Rajan

2010/11- Rajan

Page 19: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Data Driven Modeling

Closing the gap : developing a QSAR- a new figure of merit

Knowledge: a unified & accelerated model of materials behavior

Information: the ‘tolerance factor’ Data: developing a descriptor database

Krishna Rajan

Page 20: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Discovering Classifiers

Krishna Rajan

Page 21: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Ranking Descriptors

Krishna Rajan

Page 22: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

Structure Maps from Data Mining

Krishna Rajan

Page 23: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

The 4 V’s of Materials Data

VARIETY: data in many forms

Engineering design / materials insertion

Multiscale data

VOLUME: data at rest

Materials reference data Thermodynamic Crystallography Property

VELOCITY: data in motion Materials Characterization

in-situ materials dynamics ( x-ray, ..)

Time-of-flight data

VERACITY: data in doubt

Incomplete data, ambiguities, missing

data

Phase diagrams, Property maps

Modeling

Krishna Rajan

Page 24: Informatics for the Materials Genome: a Minimalist ...€¦ · Material Informatics: Data, Methodologies, and Applications Informatics for the Materials Genome: a Minimalist Perspective:

veracity variety velocity volume :Discovery Materials

Summary: “Closing the Gap”’

Experiments and physical models

Informatics, statistical learning

To transform the “Materials Genome” from a concept to reality we need an information system that can enable and accelerate the Data to Knowledge transformation (the new paradigm for Materials-

by-Design)

Krishna Rajan