High Temperature Geochemistry Database
The fast development of statistical methods and machine learning algorithms applied to data from cyberinfrastructures offers genuine opportunities to reveal solid Earth's secular evolution and chemistry. However, in their present state, cyberinfrastructures are composed of raw high temperature geochemical data with missing categories, a non-negligible proportion of errors, including age information, and misplaced chemical compositions that may be inconsistent with publications. These unintended errors are mainly caused by (1) errors in manual data entry; (2) the lack of standards to publish high temperature geochemistry data, and the limitations of the optical character recognition techniques used to convert tabular data into readable documents. Furthermore, there is a lack of inherent relationships between rocks and minerals.
With joint efforts of 20 geochemists, we have constructed a benchmark dataset in high temperature geochemistry. Up to now, 200,000 rock and mineral data have been checked and corrected manually based on the FAIR principle (findable, accessible, interoperable, and reusable). Furthermore, the database provides raw data downloaded from cyberinfrastructures and their corresponding data after data cleaning, which can be used to check the effectiveness of data filtering algorithms. Therefore, the 200,000 rock and mineral data include (1) raw-clean data pairs; (2) mineral-mineral and mineral-rock relations. The database system is developed using Postgre SQL and Java, utilizing the VUE and SpringBoot frameworks for the front and back ends. The web portal offers a querying function to search for specific geochemistry data and a matching function to find rock-mineral combinations and mineral-mineral pairs generated under the same formation conditions. In the future, the database will be uploaded to the Deep-time Digital Earth program platform for data integration.
-
FAIR principle (Findable, Accessible, Interoperable, Reusable).
-
Manual cleaning ~ 200,000 rock and mineral data.
-
Raw-clean data pairs to test data filtering algorithms.
-
User-friendly Website: https://htgdb.deep-time.org
- Website Display
- Website Structure
-
Yang Lyu, J ZhangZhou, ZJU Earth Data Team. Bringing Data Clarify from Cyberinfrastructures: A Benchmark Database in High Temperature Geochemistry. Americal Geophysical Union Fall Meeting (AGU), 2023, San Francisco, United States.
-
Yang Lyu, J ZhangZhou. High temperature Geochemistry Database - high quality segmented domain database.Annual Meeting of Chinese Geoscience Union (CGU), 2023, Zhuhai, China.
-
Yang Lyu, J ZhangZhou. High temperature Geochemistry Database - high quality segmented domain database. The 7th Conference on Earth System Science (CESS), 2023, Shanghai, China.
Team Leader:
Yang Lyu
Technical Guidance:
- Shengfeng Pan
- Jinyuan Zhang
Data Arrangement:
-
ZJU Earth Data Team,
-
Siqi Huang
UI / Front-end:
-
Shuyi Li
-
Yutong Sun
Back-end:
- Junbo Wang
- Jianing Wang
- Ruitao Chang
User Docs:
- Nuoer Li
If you join the HTG team, you will get:
- Participate in development process and internal testing of the database.
- Give priority to using data and algorithms in the database.
- Have access to the latest updates to database.
Please send the email to join us: [email protected]
Fhase | Change Time | Content |
---|---|---|
V1.0 | 2023.12 | near 200,000 samples (rock, mineral, inclusion, experiment sample) |
V2.0 | wait for it | expected at 2024.03 |
Click 'Home' or the flame patten to return to the homepage of the website.
The login page.
Customize rock data filter.Users can select the criteria they want to get the relevant data. In this page, in addition to the data, users can also see the amount of related data and the historical query information.
Customize mineral data filter.Users can select the criteria they want to get the relevant data. In this page, in addition to the data, users can also see the amount of related data and the historical query information.
Customize Experiment data filter.Users can select the criteria they want to get the relevant data. In this page, in addition to the data, users can also see the amount of related data and the historical query information.
In this page, We provide three templates for rock, mineral, Inclusion and experiment sample respectively. Users can select the files they want to download.
We provides two comparative data sets before and after manual cleaning, which can be used to test the effect of data cleaning. In this page, here are the two files for Clinopyroxene mineral and Igneous rock comparison datasets respectively. Users can select the files they want to download.
The introductions of project and the team. If users want to join us, this part also mentions the related issues.
If users find that the data does not fit original literature, data missing from the sample, data template does not match and other problems during the use of this website,fill the form and feedback to us.
When using this website, users can register and log in to their account, then search for the required information on the website, and then download the required information.
-
Log in
Click 'Log in' to go to the login page.
If user has previously registered an account, he can log in directly. If user doesn't have an account, he can click 'Sign Up' to register first. After logging in, user will be redirected back to the original page.
-
Search and download data
-
Search&Match (Take 'Rock Data' as an example)
Step1: Click 'Search&Match'-'Rock Data'
Step2: Customize your rock data filter
Users can select the criteria they want to customize the filiter.
After selecting the criteria, click 'Submit Fliter'.
Step3: Check and read the data
Step4: Download
Click 'Download Data'.
-
Excel Available (Take 'Expert Data' as an example)
Step1: Click 'Excel Available'-'Expert Data'
Step2: Choose the file to download
Click 'xlsx' to download. 'xlsx' means the file format.
High T Geochemistry database mainly include natural rocks, natural minerals, natural inclusions and experimental synthetic samples which are formed in high temperature environment.Sample information includes sample age, rock property, mineral property, chemical composition, sampling location, geological environment and data source.
If you are submitting data to this database, please place the data correctly in the template provided on this page:https://htgdb.deep-time.org/dataTemplate. Here, we provide three templates for rock, mineral/Inclusion and experiment sample respectively. Your uploaded data would be published on the website as the open source.
If users find that the website does not work well with you, in the meantime, googling error messages cannot help. At that time please contact us and send the details of bug. Our experts are willing to solve the problems for you as soon as we can.
Please send the email to contact: [email protected]
If users find that the data does not fit original literature, data missing from the sample, data template does not match and other problems during the use of this website, click ‘About Us’-‘Contact Us' to fill the form and feedback to us.