Data toolkits for processing NMR, MALDI MSI, MALDI single cell, Raman Spectroscopy, LC-MS and GC-MS raw data, chemoinformatics data analysis and data visualization.
Mzkit
is an open source raw data file toolkit for mass spectrometry data analysis, provides by the BioNovoGene
corporation. The features of mzkit inlcudes: raw data file content viewer(XIC/TIC/Mass spectrum plot/MS-Imaging), build molecule network, formula de-novo search, de-novo annotation of the unknown metabolite features and targeted data quantification.
Download Software: http://www.biodeep.cn/downloads?lang=en-US
- MZKit® is a registered trademark of BioNovoGene Corporation, protected by copyright law and international treaties.
- RawFileReader reading tool. Copyright © 2016 by Thermo Fisher Scientific, Inc. All rights reserved.
- MS-DIAL was launched as a universal program for untargeted metabolomics that supports multiple instruments (GC/MS, GC/MS/MS, LC/MS, and LC/MS/MS) and MS vendors (Agilent, Bruker, LECO, Sciex, Shimadzu, Thermo, and Waters). Part of the spectrum algorithm in MZKit is developed based on the MS-DIAL project.
MIT License
Copyright (c) 2018 [email protected], BioNovoGene Co., LTD.
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.
-
-
- 1.1.1. Background Task
-
1.2. Search Feature
-
1.3. XIC plot
- 1.3.1. XIC overlay
-
1.4. TIC plot
-
1.5. Raw Scatter
-
1.6. View Mass spectra
-
1.7. Save Plot and Export matrix
- 1.7.1. Export XIC
-
1.8. Ms-Imaging
-
-
-
2.1. Formula search
- 2.1.1. Export Formula Search Result
-
2.2. Molecular Networking
- 2.2.1. step1 select ions
- 2.2.2. step2 build network
- 2.2.3. step3 view network data
- 2.2.4. step4 network visualization
- 2.2.5. Export Network Data
- 2.2.6. Save Network Visual
-
-
- 3.1. Switch Between Toolkit
- 3.2. Install Mzkit
- 3.3. Uninstall Mzkit
Credits
This open source mass spectrometry data toolkit is developed at the BioDeep R&D laboratory and brought to you by
BioNovoGene
corporation.
Important Note: this application only supports the open source mzXML/mzML/imzML
raw data file formats. For view the data in the vendor format file like the Thermo *.Raw
required convert to the mzXML file format at first. It is recommended that convert the vendor format file to mzXML via ProteoWizard.
For view the file content of the mzXML or mzML datafile in mzkit, you must imports the raw data file into the mzkit at first. Here is how: select the Main
tabpage of mzkit you will see the Open
command for the raw data imports operation. Then you are going to click this Open
command button, choose the raw data file for imports and wait for it finished.
Then you should see the files that you've imports into mzkit on the File Explorer
dock panel if there is no error occurs during the raw data file imports progress. Now you can click on the raw data file tree to expend it and click one of the feature in your raw file to view the content data.
For improvements of the user experience when you are using the mzkit application, the raw data files is imports to mzkit application under a background task. You can see the background task progress throught the Task List
window:
the search bar on the top of the file tree is the m/z
search input: you can input a specific m/z
value or formula
expression in the search bar for search the matched features in your raw data file. This operation is usually apply for the XIC
data search.
When you have click on the search button, then all of the M/z
feature in your raw data file that match the ppm
condition will be listed in the Featrue List
dock panel:
An example of search m/z 834.6 with tolerance error 30ppm. the result in the
Feature List
is usually used for create a XCI plot.
Expends the file content tree in the File Explorer
, and then mouse right click on one MS2 feature in your file, select XIC
for create a XIC plot for a specific ion feature:
The XIC plot is a kind of time-signal chromatography plot of a specific m/z ion.
you can click on the checkbox besides the Ms2 feature for select different ion feature for create the XIC overlay plot:
as the same as create a XIC plot, you also can create TIC plot for a single file or multiple file by select multiple file by check on checkbox:
The TIC plot is similar to the XIC plot, data is generated from all ions.
just click on the node of the raw file, then you will open the raw scatter plot of your specific raw data file, example as:
For view the mass spectra data in your file, just click on one of the scan feature in your raw data file:
The mzkit application provides the function for save the plot image and the plot data in your raw data file. for example, select the Data Viewer
tab page in mzkit, you will see two viewer action buttons in the menu:
- [
Snapshot
] for export the XIC/TIC/MS2 data plot image to a specific file. - [
Save Matrix
] for export the Mass spectra or Chromatography data to a specific Excel table file.
Export the XIC data to a specific file is also keeps simple, just mouse right click on the file content tree and then choose Export XIC Ions
.
1.8. Ms-Imaging
mzkit software is also supports the imzML
file format for run ms-imaging rendering, here is how to view the imzML raw data file in mzkit application:
a. click of the open button on the file menu:
b. then you will see a pop out dialog for select raw data file, select the imzML
file format in the file fiter, like:
c. after the imzML raw data file has been imported into mzkit, then a raw data view for ms-imaging will be open, it looks like:
two components for view ms-imaging contains:
[MsImaging Parameters]
Panel for select ions and adjust rendering parameters.[MS-Imaging Viewer]
tab page for show the rendering image output of the selected ions.
d. select target ions in the explorer panel, and then mouse right click on the ions, select rendering
menu item for run ms-imaging rendering:
e. finally, you will see the imaging on the [MS-Imaging Viewer]
tab page after loading pixels data from the related ibd
data files:
f. you can tweaks of the image from the [MsImaging Parameters]
Panel.
change different color schema for rendering the MS-image in mzkit application.
You can search for the formula of one feature in your raw data file with mzkit for do some de-novo annotation with just simply mouse right click on one of the feature and then select Search Formula
, and then wait for a while to let mzkit for search all of the candidate formula by enumerates all of the possible element combinations:
Once the mzkit have been done of formula search, then all of the matched formula will be shown on the result page. there are some information about the each formula search result is listed in the result table, includes: formula result, its corresponding exact mass value, mass error of the searched m/z and the m/z calculated from the exact mass, ion charge value from your raw file, precursor type information, etc.
You also can search of the formula by input any m/z
value in the input box of the search result page. Click on the formula then you could submit the candidate formula into the biodeep database for search of the metabolite information.
Export the formula search result just like other data that we've introduced before, just click on the Formula Result
tab page and then click on the Export button. this command will let you save the formula search result into a specific Excel table file.
The molecular networking in mass spectrum data is a kind of spectrum similarity network. You can build a molecule network use mzkit in a very easy way:
For create a spectrum similarity network, it required some ms2 ions data provides to mzkit for run spectrum matrix alignment and evaluate the simialrity scores between these spectrum matrix data. select the ions data just like plot XIC operation it does in mzkit: check on the ms2 feature in the file explorer
dock panel.
Once we have the mass spectrum data selected, then we could run the matrix similarity between these matrix for build a network. now we mouse right click of the file tree in the file explorer
, and then choose the Molecular Networking
menu item:
once the networking progress have been done, then we could view the network result data in the Molecular Networking
tool page. there are three tables in the tool page for show your network data: Network
, Compounds
and Network Statistics
.
-
- the
Network
tab page contains the edges data in your network, which is the spectrum cluster simialrity result. all of the spectrum alignment its simialrity value is less than theSpectrum Similarity
threshold value that will be removes from the network.
- the
there are columns in the edge table: CompoundA
and CompoundB
is the spectrum reference id in this edge connected. and the simialrity
, forward
and reverse
column is the simialrity score value of the two spectrum matrix and the last View
column contains the button that let you view the spectrum matrix alignment result visual plot.
-
- the
Compounds
tab page contains the spectrum cluster (network nodes) information in your network.
- the
The mzkit program provides a small build-in network visualization engine that could let you visual the resulted molecular network with just a simple mouse click. As you can see in the previous screenshot, there is a Render Network
command button on the top menu when the Network
tab is actived. No we just click on it, and wait for the network layout calculation progress complete, and then we will see a new tag page which is named Molecular Networking Viewer
will be shown in the document area of the mzkit program.
There is not too much style tweaking in the mzkit build-in network visualizer, so that you may be want to export the network data into table and then visualize it in other network visualization software like the famous Cytoscape
.
Just click on the Export
command button beside the Render Network
button in the Network
menu tab, then a dialog of save network data will be trigger and opened. Select a location in the dialog and then you can save the network table data into a given location for visualization in other software.
For export the network image, just mouse right click on the viewer panel, then you could see a popout menu Save Image
that could use for viewer image saved.
You also can save the network image via Save
command in the Main
menu tab page when the molecular networking viewer is current active document page in mzkit program.
Now you can use the saved network image for your publications:
For evaluate the quantify content result of the target compounds in your samples. You must create the linear models for the target compounds at first. The targeted quantification tools in mzkit contains the function for create linear models of LCMSMS MRM data and the GCMS SIM data.
NOTE about sample names in linear reference:
- It is recommended name your linear reference samples in patterns like
L-<level>
, examples as:L-1
,L-2
,L-3
,...
.- and using the content value like
100ppm
,50ppb
as the sample file name is also recommended.
You can switch between toolkit pages via the start menu of mzkit:
click
[File]
->[Mzkit Data Toolkits]
, and then click on one toolkit item then you can switch to the required toolkit page.
or just select a page from the app switcher toolstrip menu:
If you have already installed the mzkit application, please uninstall the previous version before you install the new version of the mzkit application.
When you have extract the zip package of the mzkit compression package, then you could found a setup.exe
install application in the top of folder. Click on this setup application, then your will going to install mzkit program into your computer system.
When you have launch the mzkit installer, then you will see a security warning dialog from your windows system, just click on the Install
for start the install progress, and then just needs to wait for the installer finish the progress of copy the mzkit application files.
if you want to removes mzkit from your computer system, then you could follow this instruction for uninstall:
- Open the control panel of your windows system, and then click of the link
Uninstall a program
- Then you will see a list of program that installed on your windows, located the mzkit application
- Then right click on the mzkit, select
uninstall
, then you can removes mzkit from your windows system
Just click OK
on the dialog:
The development of MZKit workbench is based on these works:
- X. Shen, R. Wang, X. Xiong, Y. Yin, Y. Cai, Z. Ma, N. Liu, and Z.-J. Zhu* (Corresponding Author), Metabolic Reaction Network-based Recursive Metabolite Annotation for Untargeted Metabolomics, Nature Communications, 2019, 10: 1516.
- Li S, Park Y, Duraisingham S, Strobel FH, Khan N, et al. (2013) Predicting Network Activity from High Throughput Metabolomics. PLOS Computational Biology 9(7): e1003123. https://doi.org/10.1371/journal.pcbi.1003123
- Pang, Z., Chong, J., Zhou, G., Morais D., Chang, L., Barrette, M., Gauthier, C., Jacques, PE., Li, S., and Xia, J. (2021) MetaboAnalyst 5.0: narrowing the gap between raw spectra and functional insights Nucl. Acids Res. (doi: 10.1093/nar/gkab382)
- Ogata, H., Goto, S., Sato, K., Fujibuchi, W., Bono, H., & Kanehisa, M. (1999). KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic acids research, 27(1), 29–34. https://doi.org/10.1093/nar/27.1.29
- Tsugawa, H., Cajka, T., Kind, T., Ma, Y., Higgins, B., Ikeda, K., Kanazawa, M., VanderGheynst, J., Fiehn, O., & Arita, M. (2015). MS-DIAL: data-independent MS/MS deconvolution for comprehensive metabolome analysis. Nature methods, 12(6), 523–526. https://doi.org/10.1038/nmeth.3393
- Sud M, Fahy E, Cotter D, Brown A, Dennis EA, Glass CK, Merrill AH Jr, Murphy RC, Raetz CR, Russell DW, Subramaniam S., LMSD: LIPID MAPS® structure database Nucleic Acids Research, 35: p. D527-32., DOI: 10.1093/nar/gkl838 , PMID: 17098933
- Fahy E, Sud M, Cotter D & Subramaniam S., LIPID MAPS® online tools for lipid research Nucleic Acids Research (2007), 35: p. W606-12., DOI: 10.1093/nar/gkm324 , PMID: 17584797
- Wishart DS, Guo AC, Oler E, et al., HMDB 5.0: the Human Metabolome Database for 2022. Nucleic Acids Res. 2022. Jan 7;50(D1):D622–31. 34986597
- Mingxun Wang, Jeremy J. Carver, Vanessa V. Phelan, Laura M. Sanchez, Neha Garg, Yao Peng, Don Duy Nguyen et al. "Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking." Nature biotechnology 34, no. 8 (2016): 828. PMID: 27504778
- Li, Y., Kind, T., Folz, J. et al. Spectral entropy outperforms MS/MS dot product similarity for small-molecule compound identification. Nat Methods 18, 1524–1531 (2021). https://doi.org/10.1038/s41592-021-01331-z
- Kind, T., Fiehn, O. Seven Golden Rules for heuristic filtering of molecular formulas obtained by accurate mass spectrometry. BMC Bioinformatics 8, 105 (2007). https://doi.org/10.1186/1471-2105-8-105
- Allen F, Greiner R, and Wishart D (2015) Competitive fragmentation modeling of ESI-MS/MS spectra for putative metabolite identification. Metabolomics. 11:98–110. Supporting Data