LncRNA_Sub_cellular_Localisation_Prediction_using_Multi-source_Features

This project develops and tests several machine learning models that use multi-source features to predict the sub-cellular localisation of lncRNAs.

The dataset was first assembled through an extensive search of databases for experimentally validated lncRNA sub-cellular localisation data. Data were found for a total of 7 locations.
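As a rough illustration of how a labelled FASTA file could be read and its class distribution checked, here is a minimal sketch. It assumes each header has the form `>id|location`; the actual header layout of the dataset file is not documented here, so the separator, the helper names and the example records are all hypothetical.

```python
from collections import Counter
from io import StringIO


def split_header(header, sep="|"):
    """Split a header into (sequence_id, label).

    ASSUMPTION: the label is the last `sep`-delimited field of the header.
    """
    if sep not in header:
        return (header.strip(), "unlabelled")
    seq_id, _, label = header.rpartition(sep)
    return (seq_id.strip(), label.strip())


def read_labelled_fasta(handle, sep="|"):
    """Yield (sequence_id, label, sequence) tuples from a FASTA stream."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there is one.
            if header is not None:
                yield split_header(header, sep) + ("".join(chunks).upper(),)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield split_header(header, sep) + ("".join(chunks).upper(),)


# Made-up records; the location names are placeholders only.
example = StringIO(
    ">lnc1|Nucleus\nACGU\nACGU\n>lnc2|Cytoplasm\nGGCC\n>lnc3|Nucleus\nAUAU\n"
)
records = list(read_labelled_fasta(example))
class_counts = Counter(label for _, label, _ in records)
print(class_counts)
```

To read a real file, pass an open file handle instead of the `StringIO` object and adjust `sep` to match the headers.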

Several sets of features were extracted from the sequences: sequence-based features, structural features and physico-chemical features.
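To make the idea of sequence-based features concrete, the sketch below computes k-mer frequencies and GC content for one sequence. These two descriptors are common examples chosen for illustration; the exact feature set used in the notebooks may differ, and structural and physico-chemical features are not covered here. The function names are hypothetical.

```python
from itertools import product

ALPHABET = "ACGT"


def kmer_frequencies(seq, k=3):
    """Return the 4**k k-mer frequencies of `seq` in lexicographic order.

    U is mapped to T so RNA and DNA spellings give the same vector.
    Windows containing other characters (for example N) are skipped.
    """
    seq = seq.upper().replace("U", "T")
    kmers = ["".join(p) for p in product(ALPHABET, repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    windows = 0
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer in counts:
            counts[kmer] += 1
            windows += 1
    return [counts[m] / windows if windows else 0.0 for m in kmers]


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


# One feature vector: 16 dinucleotide frequencies followed by GC content.
features = kmer_frequencies("ACGUACGU", k=2) + [gc_content("ACGUACGU")]
print(len(features))
```

Concatenating descriptors from different sources into one vector per sequence, as in the last step, is one simple way to build a multi-source feature table.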

Several feature selection methods were then applied to reduce dimensionality and remove redundant features, which lowers the risk of overfitting.
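The specific selection methods are in the feature selection notebook and are not listed here. As a stand-in that shows the general idea of removing uninformative and redundant features, the sketch below drops near-constant columns and then drops any column that is highly correlated with one already kept. The thresholds and function names are assumptions made for this example.

```python
from math import sqrt


def variance(col):
    """Population variance of a column of numbers."""
    mean = sum(col) / len(col)
    return sum((x - mean) ** 2 for x in col) / len(col)


def pearson(a, b):
    """Pearson correlation; returns 0.0 if either column is constant."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    sa = sqrt(sum((x - ma) ** 2 for x in a))
    sb = sqrt(sum((y - mb) ** 2 for y in b))
    return cov / (sa * sb) if sa and sb else 0.0


def select_features(rows, min_variance=1e-8, max_abs_corr=0.95):
    """Return the indices of the columns to keep.

    A column is dropped if it is near-constant or if its absolute
    correlation with an already kept column exceeds `max_abs_corr`.
    """
    columns = list(zip(*rows))
    kept = []
    for j, col in enumerate(columns):
        if variance(col) <= min_variance:
            continue
        if any(abs(pearson(col, columns[i])) > max_abs_corr for i in kept):
            continue
        kept.append(j)
    return kept


# Toy matrix: column 1 duplicates column 0 (scaled), column 2 is constant.
X = [
    [1.0, 2.0, 5.0, 0.3],
    [2.0, 4.0, 5.0, 0.9],
    [3.0, 6.0, 5.0, 0.1],
    [4.0, 8.0, 5.0, 0.7],
]
kept = select_features(X)
X_reduced = [[row[j] for j in kept] for row in X]
print(kept)
```

This filter ignores the class labels entirely; supervised methods that score features against the localisation label would be a natural next step.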

Nine different machine learning algorithms were trained and tested. ROC curves and confusion-matrix-based metrics were used to identify the best model, which in our case was Random Forest.
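For reference, the confusion-matrix-based metrics mentioned above can be computed for a multi-class problem roughly as follows. This is a minimal sketch with made-up predictions and placeholder class names; it does not reproduce the results of the study, it does not cover ROC curves, and the choice of macro averaging is an assumption.

```python
def confusion_matrix(y_true, y_pred, labels):
    """Rows are true classes, columns are predicted classes."""
    index = {label: i for i, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for truth, pred in zip(y_true, y_pred):
        matrix[index[truth]][index[pred]] += 1
    return matrix


def summarise(matrix):
    """Accuracy plus macro-averaged precision, recall and F1."""
    n = len(matrix)
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(n))
    precisions, recalls, f1s = [], [], []
    for i in range(n):
        tp = matrix[i][i]
        predicted = sum(matrix[r][i] for r in range(n))  # column sum
        actual = sum(matrix[i])                          # row sum
        p = tp / predicted if predicted else 0.0
        r = tp / actual if actual else 0.0
        precisions.append(p)
        recalls.append(r)
        f1s.append(2 * p * r / (p + r) if p + r else 0.0)
    return {
        "accuracy": correct / total if total else 0.0,
        "macro_precision": sum(precisions) / n,
        "macro_recall": sum(recalls) / n,
        "macro_f1": sum(f1s) / n,
    }


# Placeholder labels and predictions, for illustration only.
labels = ["loc_A", "loc_B", "loc_C"]
y_true = ["loc_A", "loc_A", "loc_A", "loc_B", "loc_B", "loc_C"]
y_pred = ["loc_A", "loc_A", "loc_B", "loc_B", "loc_B", "loc_C"]

matrix = confusion_matrix(y_true, y_pred, labels)
scores = summarise(matrix)
print(matrix)
print(scores)
```

Computing the same summary for each candidate model on the same held-out set gives a like-for-like basis for choosing among them.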

File descriptions:

Dataset_of_7_classes.fasta - Contains the sequences of the lncRNAs with their locations labelled
Features_extraction_all.ipynb - Contains the code to extract all the features used in the study from the input fasta file
feature_selection.ipynb - Contains the code for the feature selection methods used in our study
All_machine_learning_Models_.ipynb - Contains the code to train and test all the models

Repository: baibhav-bioinfo/LncRNA_Sub-cellular_Localisation_Prediction_using_Multi-source_Features
