A framework for identifying potential multi-gene prognostic signatures. The web application can be accessed at https://mulmarker.azurewebsites.net/.
MulMarker is a comprehensive framework for identifying potential multi-gene prognostic signatures across various diseases. MulMarker comprises three core modules: a GPT-driven chatbot for addressing user queries, a module for identifying multi-gene prognostic signatures, and a module for generating tailored reports.
MulMarker randomly divides patients into training and test groups. In the training group, it first employs univariate regression analysis to screen genes and then constructs a risk model using multivariate regression analysis. The combination of the identified genes is considered to be a potential prognostic signature. A KM survival analysis and a log-rank test are used to evaluate the performance of the prognosis signature. Next, the signature is evaluated in the test group and total dataset using KM survival analysis and log-rank test. Finally, MulMarker generates a report based on the analysis results to explain the findings. To use MulMarker, required files and parameters are listed as follows:
-
Analysis Name (string): Input the name of your analysis, such as "LungCancer".
-
Chosen Genes (.txt): A txt file with Candidate genes, one gene per line. Genes in the file must be included in Expression Data, such as "MulMarker/test/chosen_genes.txt".
-
Clinical Patients (.txt): A txt file with clinical information. There are three columns: "PATIENT_ID", "OS_STATUS" and "OS_TIME". Patients in the file should be the same as the patients in Expression Data, such as "MulMarker/test/clinical_info.txt".
-
Quantified Data (.txt): A txt file for transcriptomic and proteomic data. Each row corresponds to a gene and each column corresponds to a patient, such as "MulMarker/test/rna_info.txt".
-
Seed Number (number): Patients will be randomly divided into train group and test group when training the model. This parameter is the seed number of random grouping. We recommend you to adjust the paramneter to get a better risk model, such as "12".
-
Ratio (number): The ratio between the training group and test group, such as "0.5".