70
The implications of Open Notebook Science and other new forms of scientific communication for Nanoinformatics Jean-Claude Bradley November 3, 2010 Nanoinformatics 2010 Associate Professor of Chemistry Drexel University

The implications of Open Notebook Science

  • Upload
    arlen

  • View
    21

  • Download
    5

Embed Size (px)

DESCRIPTION

The implications of Open Notebook Science and other new forms of scientific communication for Nanoinformatics. Nanoinformatics 2010. Jean-Claude Bradley. Associate Professor of Chemistry Drexel University. November 3, 2010. The Evolution of Automation in Scientific Research. - PowerPoint PPT Presentation

Citation preview

Page 1: The  implications  of  Open  Notebook Science

The implications of Open Notebook Science and other new forms of

scientific communication for Nanoinformatics

Jean-Claude Bradley

November 3, 2010

Nanoinformatics 2010

Associate Professor of ChemistryDrexel University

Page 2: The  implications  of  Open  Notebook Science

LIMS CENS

Single Instrument Automation

Laboratory Information Management Systems

Collaborative Electronic Notebook Systems

Human /Autonomous Agent Hybrid Systems

Human ManagedFully AutonomousScientific Research Systems

TODAY

SMIRP bridge

The Evolution of Automation in Scientific Research

Page 3: The  implications  of  Open  Notebook Science

StandardModularIntegratedResearchProtocols

Capturing semantic structure in research

at the point of data entry

Page 4: The  implications  of  Open  Notebook Science
Page 5: The  implications  of  Open  Notebook Science

HumanAgent Autonomous

Agent

SMIRP

(Bot)

Browser

Excel

The SMIRP model for a hybrid Human/Autonomous Agent System

Anthropomimetic Design

Page 6: The  implications  of  Open  Notebook Science

Approaches to Collaborative Electronic Notebooks

rigid

SMIRPcompromise:Rigid information representationFlexible linking of modules

flexible

• Structured• Generallydomainspecific

• Adaptable• Unstructured

http://smirp.drexel.edu

Page 7: The  implications  of  Open  Notebook Science

Fundamental Information Representation in SMIRP

Module 1 Module 2

Parameter 1

Parameter 2

Parameter 4

Parameter 5

instance

Record 1

instance

Record 2

(People)

(Name)

(Employee of)

(Company)

(Name)

Parameter 3(email)

(Address)

Bill Gates Microsoft

Page 8: The  implications  of  Open  Notebook Science

Two approaches to the development of databases

Communicateanticipated need

Designdatabase structure

Let database structureevolvethrough useSMIRP

Page 9: The  implications  of  Open  Notebook Science

Case-study: Evolution of SMIRP structure in a nanoscience laboratory

Location Drexel UniversityDepartment of Chemistry

Users faculty, undergraduate students, graduatestudents, librarians and other university personnel

Period Feb 1999 – April 2001, with a detailed focus onlast 7 months (Sept 2000-April 2001)

Total accounts (last 7 months) 78

Active Accounts (added records) 50

Administrators (changed database structure)

9

Page 10: The  implications  of  Open  Notebook Science

HumanResource Management 13%

Maintenance1%

Knowledge Processing 72%

Most Active Module Categories (9/00 – 4/01)

Labwork14%

118 modules 1/3 account for 98% of activity

Page 11: The  implications  of  Open  Notebook Science

Activity Analysis by Category over Time

2000

-10-

3

2000

-10-

17

2000

-10-

30

2000

-11-

12

2000

-11-

25

2000

-12-

8

2000

-12-

21

2001

-1-3

2001

-1-1

6

2001

-1-3

0

2001

-2-1

2

2001

-2-2

5

2001

-3-1

0

2001

-3-2

3

2001

-4-5

2001

-4-1

8

Maintenance

Human Resource ManagementLaboratory Work

Knowledge Processing0

1000

2000

3000

4000

5000

6000

7000

8000

Page 12: The  implications  of  Open  Notebook Science

Recruitment events 2%

ProjectManager 5%Errors

5%

Productivity Tracking 14%

People 28%

Workstudy hours reporting 46%

Most Active Human Resource Management Modules

Page 13: The  implications  of  Open  Notebook Science

Most Active Maintenance Modules

SMIRPProblems22%

Orders 19%

Invoice (TEM/SEM and other instrument charges) 19%

Laboratorymaterials16%

Vendor15%

Orderforms9%

Page 14: The  implications  of  Open  Notebook Science

Most Active Knowledge Processing Modules

Journal 9%

Knowledge Filter 3%

ReformatReference requests 20%Find

Reference 66%

PublisherDocument ProductionReference ProcessingParameter CorrelationData source filesExperimental Conclusion GenerationKnowledge consolidation

Page 15: The  implications  of  Open  Notebook Science

Seamless Integration of Human and Autonomous Agents in Workflows

Real-Time Workflow Designs

Automated

Human(default)State A State B

Page 16: The  implications  of  Open  Notebook Science

Workflow for Extraction of Article information and URL

Queries Web and extracts information

Page 17: The  implications  of  Open  Notebook Science

Most Active Laboratory Modules

Preparation of Silver rods for SCBETEM Micrographs Of Pd on CSCBE on membranesHydrogenation of Crotonaldehyde using Pd CatalystsReduction of Methylene blue by Pd Metal Particles in a Field

Electrodeposition of Pd on Graphite 29%

Protocol Prototyping25%

Pd onto Carbon Nanofibers17%

Electroless plating on Membranes9%

Synthesis of Pd catalysts by Bipolar electrochemistry5%

TEM Micrographs Of Pd on C3%

Pd particle size analysis using TEM 3%

Page 18: The  implications  of  Open  Notebook Science

Keyword Search Results: example “nanotube”

Page 19: The  implications  of  Open  Notebook Science

From Keyword to Orders

Page 20: The  implications  of  Open  Notebook Science

From Keyword to Article

Page 21: The  implications  of  Open  Notebook Science

From Keyword to Knowledge Filter

Page 22: The  implications  of  Open  Notebook Science

From Keyword to Protocol Prototyping

Page 23: The  implications  of  Open  Notebook Science

Sharing results semi-automatically: SMIRP Knowledge Product

•Single Experiment•Full Context•Supporting Data•Not suitable for traditional peer-reviewed publications

Page 24: The  implications  of  Open  Notebook Science

Non-traditional publication options in 2003

(Elsevier)

Page 25: The  implications  of  Open  Notebook Science
Page 26: The  implications  of  Open  Notebook Science
Page 27: The  implications  of  Open  Notebook Science
Page 28: The  implications  of  Open  Notebook Science
Page 29: The  implications  of  Open  Notebook Science
Page 30: The  implications  of  Open  Notebook Science
Page 31: The  implications  of  Open  Notebook Science

To Cite or Not to Cite?

Page 32: The  implications  of  Open  Notebook Science
Page 33: The  implications  of  Open  Notebook Science

“I would never consider a claim made in a patent as blocking an author's claim of novelty.” Langmuir Editor

What is a Scientific Precedent in Academia?What is a Scientific Precedent in Patent Law?

Page 34: The  implications  of  Open  Notebook Science

What is Scholarship?*also indexed in Chemical Abstracts!

Page 35: The  implications  of  Open  Notebook Science

The UsefulChem Project (2005)

What would happen if a chemistry project was completely transparent

in real time?

Page 36: The  implications  of  Open  Notebook Science

Motivation: Faster Science, Better Science

Page 37: The  implications  of  Open  Notebook Science

TRUST

PROOF

Page 38: The  implications  of  Open  Notebook Science

First record then abstract structure

In order to be discoverable use Google friendly formats (simple HTML, no

login) In order to be replicable use free hosted tools (Wikispaces, Google

Spreadsheets)

Strategy for an Open Notebook:

Page 39: The  implications  of  Open  Notebook Science

UsefulChem Project: Open Primary Research in Drug Design using Web2.0

tools

Docking

Synthesis

Testing

Rajarshi GuhaIndiana U

JC BradleyDrexel U

Phil RosenthalUCSF

(malaria)

Dan ZaharevitzNCI

(tumors)

Tsu-Soo TanNanyang Inst.

Page 40: The  implications  of  Open  Notebook Science

Malaria Target: falcipain-2 involved in hemoglobin metabolism

Dana.org

Page 41: The  implications  of  Open  Notebook Science

Outcome of Guha-Bradley-Rosenthal collaboration

Page 42: The  implications  of  Open  Notebook Science

The Ugi reaction: can we predict precipitation?

Can we predict solubility in organic solvents?

Page 43: The  implications  of  Open  Notebook Science

Crowdsourcing Solubility Data

Page 44: The  implications  of  Open  Notebook Science

ONS Challenge Judges

Page 45: The  implications  of  Open  Notebook Science

ONS Submeta Award Winners

Page 46: The  implications  of  Open  Notebook Science

Data provenance: From Wikipedia to…

Page 47: The  implications  of  Open  Notebook Science

…the lab notebook and raw data

Page 48: The  implications  of  Open  Notebook Science

• Concentration (0.4, 0.2, 0.07 M)• Solvent (methanol, ethanol, acetonitrile, THF)• Excess of some reagents (1.2 eq.)

How does Open Notebook Science fit with traditional publication?

Page 49: The  implications  of  Open  Notebook Science

Paper written on Wiki

Page 50: The  implications  of  Open  Notebook Science

References to papers, blog posts, lab notebook pages, raw

data

Page 51: The  implications  of  Open  Notebook Science

Paper on Journal of Visualized Experiments (JoVE)

Page 52: The  implications  of  Open  Notebook Science

Pre-print on Nature Precedings

Page 53: The  implications  of  Open  Notebook Science

ONSArchive: Semi-Automated Snapshot of the Entire Scientific Record

Automated Download of

Spreadsheets and Parsing of

Web Pages

Manual Backup

of Spectral

Data Files

Manual Export

of Wikispac

es

Page 54: The  implications  of  Open  Notebook Science

Lulu.com Data Disks

Page 55: The  implications  of  Open  Notebook Science

Interactive NMR spectra using JSpecView and JCAMP-DX

Page 56: The  implications  of  Open  Notebook Science

Raw Data As Images

Splatter?

Some liquid

Page 57: The  implications  of  Open  Notebook Science

YouTube for demonstrating experimental set-up

Page 58: The  implications  of  Open  Notebook Science

The importance of raw data availability

Missed in a prior publication on

solubility for this compound

Page 59: The  implications  of  Open  Notebook Science

The Intersection of Open Notebooks (Bradley/Todd) and IP implications

Open Notebook could have blocked patent

if done earlier

Page 60: The  implications  of  Open  Notebook Science

Convenient web services for solubility measurement and

prediction

(Andrew Lang)

Page 61: The  implications  of  Open  Notebook Science

Other Web Services…

(Andrew Lang)

General Transparent Solubility Prediction

Page 62: The  implications  of  Open  Notebook Science

Semi-Automated Measurement of solubility via

web service analysis of JCAMP-DX files

(Andy Lang)

Page 63: The  implications  of  Open  Notebook Science

Integration of Multiple Web Services to Recommend Solvents

for Reactions

(Andrew Lang)

Page 64: The  implications  of  Open  Notebook Science
Page 65: The  implications  of  Open  Notebook Science
Page 66: The  implications  of  Open  Notebook Science

Reaction Attempts Book

Page 67: The  implications  of  Open  Notebook Science

Reaction Attempts Book: Reactants listed Alphabetically

Page 68: The  implications  of  Open  Notebook Science

For all Formats of ONS Projects

Page 69: The  implications  of  Open  Notebook Science

Dynamic links to private tagged Mendeley collections

(Andrew Lang)

Page 70: The  implications  of  Open  Notebook Science

Conclusions• Open Notebook Science can provide an additional channel to communicate useful scientific information

• Recording first for human consumption followed by abstracting the semantics later works but the format will be field specific

• As long as proof is valued over trust there is no limit to what useful forms of scientific communication will emerge.