21
Enhancing Computational Tools for LC-IM-MS- Based Small Molecule Workflows Aivett Bilbao 1 , John Fjeldsted 2 , Samuel H. Payne 1 1. Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory 2. Agilent Technologies 65 th American Society for Mass Spectrometry Conference June 6, 2017 Indiana Convention Center, Indianapolis, IN

Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows

Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1

1. Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory

2. Agilent Technologies

65th American Society for Mass Spectrometry Conference

June 6, 2017

Indiana Convention Center, Indianapolis, IN

Page 2: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017

Outline

2

• LC-IM-MS data acquisition

• Why data pre-processing algorithms?

• Multidimensional smoothing

• Saturation repair

• Application of untargeted small-molecule workflow

Presenter
Presentation Notes
In the first part I will talk about data acquisition, and why pre-processing algorithms and for what they are useful. Then I will explain the 2 algorithms and finally one application of the workflow
Page 3: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017

LC-IM-MS data acquisition

3

Liquid chromatography (LC) separationand electrospray ionization (ESI)

Analysis and detection of ions

3rd 2nd 1st : ELUTION

MASS SPECTROMETER

LC-MS data

min µs

ION MOBILITY (IM) SEPARATION

ms

LC-IM-MS data3rd 2nd 1st : ARRIVAL

Inte

nsity

Total ion chromatogram

Elution time (min)Arrival time (ms)

IM cycle or frame

Presenter
Presentation Notes
To analyze complex samples, liquid chromatography is used to separate the molecules and reduce the complexity, while, typically in line, the mass spectrometer is analyzing and detecting the ions as they elute. Then in the acquired data we can see the signal for each time point, according to the elution order, and for each time point there is a mass spectrum. When the sample complexity is high, the LC is not enough and some ions co-elute, sometimes even with the same mass. The MS is in the micro seconds (us) scale, the LC is in min, then we can add an ion mobility as an extra separation between the two, and fits very well because it is in the milisecond scale. https://en.wikipedia.org/wiki/Glucose https://en.wikipedia.org/wiki/Bilirubin https://en.wikipedia.org/wiki/Creatinine
Page 4: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017

Drift tube IM - time-of-flight MS

4

2D-XIC from LC-IM-MS data

IM-QTOF from Zhang et at. Clinical Mass Spectrometry, 2016

Page 5: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017

Why data pre-processing algorithms?

5

Dynamic range (decades)

3 4 5 6 7

Jagged peak→ low ion statistics

Peak with “flat top”→ saturation of detection system

Low-abundance analytes

High-abundance analytes

Computationally extend the limitsMultidimensional

smoothingSaturation

repair

Page 6: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017 6

Multidimensional SmoothingSample 1

Sample 2Sample 3..

Raw Data Files

Saturation Repair

Sample 1Sample 2Sample 3..

PNNL External Programming

Data processing workflow

Raw Data Files

Pre-processing Processing

4D feature (peak) finding

Multi-sample alignment

Compound identification

Quantitative comparisons and further statistical analyses

Presenter
Presentation Notes
A first step that generates native Agilent files does not interfere with the workflow
Page 7: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017

Smoothing heuristics: moving average

7

Nc+1

1) Smoothing the IM dimension, e.g. kernel size 3

X’ = ( Nd-1 + X + Nd+1 ) 3

Nd+1

X

Nd-1

m/z

Drift

tim

e

X’

Nc-1

Spectra in frame orIM cycle

2) Smoothing the chromatographic dimension, e.g. kernel size 3

X’’ = ( Nc-1 + X’ + Nc+1 )3

Page 8: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017

Smoothing software

8

Implemented in C#

Presenter
Presentation Notes
A graphical user interface, high computational skills are not required to use the tool.
Page 9: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017

LC-IM-MS dataset

9

4 sulfa drugs spiked at3 concentrations (1, 10 and 100ppb) x 4 replicates each (12 runs)

686 frames, 500 spectra per frame

TIC fruit tea (complex background) + sulfa drugs

Presenter
Presentation Notes
500 spectra per frame (0.12 ms per spectra, 60.23 ms per frame), 686 frames (0.032 min per frame, total 21.97 min). https://en.wikipedia.org/wiki/Sulfonamide_(medicine) https://pixabay.com/en/tee-fruit-tea-tea-bag-dried-592046/
Page 10: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017

Smoothing removes artifacts and enhances real signals

10

m/z

Drif

t tim

e

m/z

Frame at 9 min. IM-MS Browser software

UNSMOOTHED SMOOTHED

Presenter
Presentation Notes
Sulfas in Fruit Tea 1-1000 r1 c4-1.d
Page 11: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017

Smoothing improves peak shape and alignment

11

Sulfamethazine

Aligning low-abundancepeaks across 4 replicates

well alignedmissed

Mass Profiler software

UNSMOOTHED SMOOTHED

Page 12: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017

Improved reproducibility of spiked-in compounds

12

Smoothing improved detection and alignment for 3 of the 4 spiked-in sulfa drugs, at low-concentration

Page 13: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017 13

Fruit tea features detected andwell aligned across all 12 replicates

Consistent detection of more low-to-medium-abundance features

Abundancelowmediumhigh

SMOOTHED11491

UNSMOOTHED5957

UNSMOOTHED SMOOTHED

Presenter
Presentation Notes
Abundances below 2000 are beneath usable levels of ion statistics. Abundances about 20,000 have reproducible detection performance. summary(factor(datf$Smoothing)) no yes 5957 11491 Abundance factor: [2e+03,1e+04] (1e+04,1e+07] (1e+07,1e+09]
Page 14: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Saturation repair

Peak with “flat top”→ saturation of detection system

Page 15: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017

Example of saturation repair using theoretical isotopic distributions

15

1. Flag all peaks above a defined saturation threshold

2. Generate theoretical isotopic distribution

3. Correct using most intense isotopic peak below saturation threshold

Example lipid PC(16:0/16:0)

Page 16: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017

Saturation repair increases dynamic range

16

Software implementation to repair LC-IM-MS peptide data: example from blood serum

Corrected

TIC

LC scan

TIC

Uncorrected

Work in progress to automate processing for small molecules

Presenter
Presentation Notes
https://commons.wikimedia.org/wiki/File:Blood_bag.jpg
Page 17: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Integrating collision cross section libraries for untargeted

molecular characterization

Page 18: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017

Data set for performance evaluation

18

Urine dataset?

PNNL CCS BD

Page 19: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017

CCS information enhances identification

19

Comparison:Accurate mass, Accurate mass and CCS – ID-BrowserAccurate mass and CCS – Prototype scoring example

Message: CCS libraries and improved scoring enhance identification by reducing the number of hits per feature (reduces false positives, increases accuracy).

Page 20: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Aivett Bilbao | ASMS 2017

Conclusions

20

Software is as important as hardwaregood data from innovative instruments yield even better results with appropriate software!complex data from large-scale studies requires good software and automation

Multidimensional smoothing dramatically improves detection reproducibility of low-abundance analytes

Saturation repairincreases dynamic range in high-abundance analytes

Combining both computational strategies provide consistent characterization of more analytes in an extended dynamic range

Dynamic range (decades)

3 4 5 6 7

Presenter
Presentation Notes
Smoothing is good! Just use the appropriate parameters for your data.
Page 21: Enhancing Computational Tools for LC-IM-MS- …...Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1 1

Acknowledgments

Samuel H. PayneRichard D. Smith Erin S. Baker Thomas O. MetzXueyun ZhengBryson C. Gibbons

John FjeldstedEd DarlandEmma E. RennieBruce Wang

Thank you very much for your attention…