- Notifications
You must be signed in to change notification settings - Fork0
Fragmentation Spectra Analysis
License
idslme/IDSL.FSA
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Fragmentation Spectra Analysis (FSA) by theIntegrated Data Science Laboratory for Metabolomics and Exposomics (IDSL.ME) is an R package designed to annotate.msp (mass spectra format) and.mgf (Mascot generic format) files using a combination of mass spectral entropy similarity, dot product (cosine) similarity, and normalized Euclidean mass error (NEME) criteria followed by intelligent pre-filtering steps for rapid searches. IDSL.FSA also provides a number of modules to convert and manipulate.msp and.mgf files.
- Parameter selection through a user-friendly and well-describedparameter spreadsheet
- Analyzing population size untargeted studies (n > 500)
- Consistency with.msp and.mgf file formats
- Measuring matching variables including mass spectral entropy similarity, dot product (cosine) similarity, and normalized Euclidean mass error (NEME)
- Spectra annotation regardless of presence of precursor values
- Spectra annotation using optional criteria including retention time and adduct type
- Spectra annotation using accurate or nominal mass values
- Generating batch spectra match figures
- Parallel processing in Windows and Linux environments
install.packages("IDSL.FSA")
Note: In some instances, thereadxl R package is also required to read theFSA parameter spreadsheet.
install.packages("readxl")
IDSL.FSA requiresFragmentation Spectra DataBase (FSDB) objects (.Rdata) to rapidly annotate mass spectrometry data (.msp and.mgf). IDSL.FSA is also able to generateFSDB objects from several reference.msp files even with inconsistent settings. When the reference.msp files orFSDB objects are available, to annotate.msp and.mgf files, download theFSA parameter spreadsheet and select the parameters accordingly and then use this spreadsheet as the input for the IDSL.FSA workflow:
library(IDSL.FSA)FSA_workflow("Address of the FSA parameter spreadsheet")
Follow these steps for a quick msp annotation of an.msp file
DownloadKynurenine_Kynurenic_acid.msp. You should also ensure this file ends up with an.msp extension after downloading. You may also need to rename this file to have an.msp extension.
Download the positive modeFSDB
IDSL.FSA requires parameters for three tabs for a full scale run. For this study, use default parameter values presented in theFSA parameter spreadsheet. Then, provide information for
3.1.FSA0002 for theAddress of the MS/MS library (FSDB) generated using
FSDB
tab3.2.FSA0003 forLocation of the INPUT sample of .msp and/or .mgf files
3.3.FSA0004 forLocation to save OUTPUT results
3.4. You may also adjust the number of processing threads usingSPEC0001 in the
SpectraSimilarity
tab according to your computational powerRun this command in the R/Rstudio console or terminal.
library(IDSL.FSA)FSA_workflow("Address of the FSA parameter spreadsheet")
You see the results in the address you provided forFSA0004 including:
5.1. individualSpectraAnnotationTable for each.msp file in theannotated_spectra_tables directory in the.Rdata and.csv formats
5.2. if you had selected an integer greater than 0 forSPEC0019, match spectra figures are also available in theplot folder for each MSP block.
- Fragmentation Spectra DataBase (FSDB)
- MSP files management
- MSP Standardizations
- Spectral Similarity Metrics
[1] Fakouri Baygi, S., Kumar, Y. Barupal, D.K.IDSL.CSA: Composite Spectra Analysis for Chemical Annotation of Untargeted Metabolomics Datasets.Analytical Chemistry,2023,95(25), 9480–9487.