Upload
others
View
3
Download
0
Embed Size (px)
Citation preview
Enhancing Computational Tools for LC-IM-MS-Based Small Molecule Workflows
Aivett Bilbao 1, John Fjeldsted 2, Samuel H. Payne 1
1. Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory
2. Agilent Technologies
65th American Society for Mass Spectrometry Conference
June 6, 2017
Indiana Convention Center, Indianapolis, IN
Aivett Bilbao | ASMS 2017
Outline
2
• LC-IM-MS data acquisition
• Why data pre-processing algorithms?
• Multidimensional smoothing
• Saturation repair
• Application of untargeted small-molecule workflow
Aivett Bilbao | ASMS 2017
LC-IM-MS data acquisition
3
Liquid chromatography (LC) separationand electrospray ionization (ESI)
Analysis and detection of ions
3rd 2nd 1st : ELUTION
MASS SPECTROMETER
LC-MS data
min µs
ION MOBILITY (IM) SEPARATION
ms
LC-IM-MS data3rd 2nd 1st : ARRIVAL
Inte
nsity
Total ion chromatogram
Elution time (min)Arrival time (ms)
IM cycle or frame
Aivett Bilbao | ASMS 2017
Drift tube IM - time-of-flight MS
4
2D-XIC from LC-IM-MS data
IM-QTOF from Zhang et at. Clinical Mass Spectrometry, 2016
Aivett Bilbao | ASMS 2017
Why data pre-processing algorithms?
5
Dynamic range (decades)
3 4 5 6 7
Jagged peak→ low ion statistics
Peak with “flat top”→ saturation of detection system
Low-abundance analytes
High-abundance analytes
Computationally extend the limitsMultidimensional
smoothingSaturation
repair
Aivett Bilbao | ASMS 2017 6
Multidimensional SmoothingSample 1
Sample 2Sample 3..
Raw Data Files
Saturation Repair
Sample 1Sample 2Sample 3..
PNNL External Programming
Data processing workflow
Raw Data Files
Pre-processing Processing
4D feature (peak) finding
Multi-sample alignment
Compound identification
Quantitative comparisons and further statistical analyses
Aivett Bilbao | ASMS 2017
Smoothing heuristics: moving average
7
Nc+1
1) Smoothing the IM dimension, e.g. kernel size 3
X’ = ( Nd-1 + X + Nd+1 ) 3
Nd+1
X
Nd-1
m/z
Drift
tim
e
X’
Nc-1
Spectra in frame orIM cycle
2) Smoothing the chromatographic dimension, e.g. kernel size 3
X’’ = ( Nc-1 + X’ + Nc+1 )3
Aivett Bilbao | ASMS 2017
Smoothing software
8
Implemented in C#
Aivett Bilbao | ASMS 2017
LC-IM-MS dataset
9
4 sulfa drugs spiked at3 concentrations (1, 10 and 100ppb) x 4 replicates each (12 runs)
686 frames, 500 spectra per frame
TIC fruit tea (complex background) + sulfa drugs
Aivett Bilbao | ASMS 2017
Smoothing removes artifacts and enhances real signals
10
m/z
Drif
t tim
e
m/z
Frame at 9 min. IM-MS Browser software
UNSMOOTHED SMOOTHED
Aivett Bilbao | ASMS 2017
Smoothing improves peak shape and alignment
11
Sulfamethazine
Aligning low-abundancepeaks across 4 replicates
well alignedmissed
Mass Profiler software
UNSMOOTHED SMOOTHED
Aivett Bilbao | ASMS 2017
Improved reproducibility of spiked-in compounds
12
Smoothing improved detection and alignment for 3 of the 4 spiked-in sulfa drugs, at low-concentration
Aivett Bilbao | ASMS 2017 13
Fruit tea features detected andwell aligned across all 12 replicates
Consistent detection of more low-to-medium-abundance features
Abundancelowmediumhigh
SMOOTHED11491
UNSMOOTHED5957
UNSMOOTHED SMOOTHED
Saturation repair
Peak with “flat top”→ saturation of detection system
Aivett Bilbao | ASMS 2017
Example of saturation repair using theoretical isotopic distributions
15
1. Flag all peaks above a defined saturation threshold
2. Generate theoretical isotopic distribution
3. Correct using most intense isotopic peak below saturation threshold
Example lipid PC(16:0/16:0)
Aivett Bilbao | ASMS 2017
Saturation repair increases dynamic range
16
Software implementation to repair LC-IM-MS peptide data: example from blood serum
Corrected
TIC
LC scan
TIC
Uncorrected
Work in progress to automate processing for small molecules
Integrating collision cross section libraries for untargeted
molecular characterization
Aivett Bilbao | ASMS 2017
Data set for performance evaluation
18
Urine dataset?
PNNL CCS BD
Aivett Bilbao | ASMS 2017
CCS information enhances identification
19
Comparison:Accurate mass, Accurate mass and CCS – ID-BrowserAccurate mass and CCS – Prototype scoring example
Message: CCS libraries and improved scoring enhance identification by reducing the number of hits per feature (reduces false positives, increases accuracy).
Aivett Bilbao | ASMS 2017
Conclusions
20
Software is as important as hardwaregood data from innovative instruments yield even better results with appropriate software!complex data from large-scale studies requires good software and automation
Multidimensional smoothing dramatically improves detection reproducibility of low-abundance analytes
Saturation repairincreases dynamic range in high-abundance analytes
Combining both computational strategies provide consistent characterization of more analytes in an extended dynamic range
Dynamic range (decades)
3 4 5 6 7
Acknowledgments
Samuel H. PayneRichard D. Smith Erin S. Baker Thomas O. MetzXueyun ZhengBryson C. Gibbons
John FjeldstedEd DarlandEmma E. RennieBruce Wang
Thank you very much for your attention…