-
Notifications
You must be signed in to change notification settings - Fork 0
Spectral Similarity Metrics
Li et., al. suggested to use a noise removal ratio (Default = 1%) to eliminate low abundant noisy peaks. The IDSL.FSA workflow also incorporates this data clean-up technique.
To accelerate the fragment matching workflow, IDSL.FSA attempts to match a number of characteristic fragmentation peaks from library and sample fragmentation spectra before any other peaks. These characteristic fragmentation peaks are called spectra markers in the IDSL.FSA workflow. SPEC0009 and SPEC0010 parameters are used to define spectra markers for library and experimental fragmentation spectra in the SpectraSimilarity
tab of the FSA parameter spreadsheet. SPEC0009 indicates the minimum cutoff (%) for
Li et., al. demonstrated spectral entropy can outperform dot product (also known as cosine similarity) in spectra similarity measurement. We incorporated spectral entropy as well as dot product and normalized Euclidean mass error (NEME) in the IDSL.FSA package to provide multi-dimensional comparison between two fragmentation spectra.
Spectral entropy measurement includes all matched and unmatched peaks.
where
Cosine similarity measurement can only take into account peaks from the reference spectra.
where
Normalized Euclidean mass error (NEME) is a new metric which is able to utilize resolution power of high-resolution mass spectrometry (HRMS) instruments to evaluate quality of spectra matching. NEME is able to only cover matched peaks.
where
Li et., al. presented a weight transformation formula to boost the intensity of low abundant peaks. Intensity of spectra with lower spectral entropies
Li, Y., Kind, T., Folz, J., Vaniya, A., Mehta, S.S. Fiehn, O. Spectral entropy outperforms MS/MS dot product similarity for small-molecule compound identification. Nature Methods, 2021, 18(12), 1524-1531.
Fakouri Baygi, S., Banerjee S. K., Chakraborty P., Kumar, Y. Barupal, D.K. IDSL.UFA assigns high confidence molecular formula annotations for untargeted LC/HRMS datasets in metabolomics and exposomics. Analytical Chemistry, 2022, 94(39), 13315–13322.