Research:
Research and Academic Roles as Bioinformatician, Computational Biology and Genomics, NGS Specialist, Research Scientist, Bio-Software Developer, Bioinformatics CTO, and Lead researcher roles as Senior CTO, Senior Computational Lead and Bioinformatician/Associate. I am not interested in freelancing and technician jobs (so please kindly don't contact regarding the same). Genomic analysis and software development for illumina and long read sequencing approaches. I provide bio-software development, algorithms, bioinformatics, machine learning for Illumina and LongRead sequencing technologies across all species. I am specific to the analysis in the areas of RNA-seq and Single cell, Metagenomics and Pangenomics covering all species. Similarly, Biological Machine and Deep learning specific to biological questions, biological language models and bio-software, HPC management.
Academic and Research teaching: I only read and write, fullstack RUST, Python, and Golang for the scientific and academic research. As a part of the scientific and academic research, I can give the language classes to academics and research students. I am not a private teacher or YouTuber and not interested in such freelancing roles or forming collaboration on the same.
Sequencing specifics:
2010-2021: Plant, Bacterial, Fungi: RNASeq, GenomSeq, Phylogenomics, PacBio Sequencing, Single Cell Analysis, Bio-software.
2021:2023: Machine Learning, Bio-software.
2024: PanGenome, Bio-software.
2025: Human Genomics, Bio-software
Software Development: C++(2010-2021), 2024-: RUST
Web Development: RUST,GOlang and Python
Bioinformatics/Machine and Deep Learning: Python
HPC Management: Golang (2024:-).
Appplication specific: RUST 🦀 i use Axum, Rocket, Dioxus, Actix, Leptos web framework and for GOLang, I use Gin, Fiber, Echo, Beego, FastHTTP, Gorilla, Chi, Hertz. For Machine learning, Keras, XgBoost, Tensorflow and PyTorch.
Background:
I have a background in bioinformatics and plant biology and extensive experience in code development using several programming languages(prior to 2021 - C++, Bash, R - after 2021: Bash, Python,Golang, RUST) covering data analysis and data science, machine and deep learning and web and application development. Following my PhD, I developed bioinformatics methods for transcriptional and post-transcriptional genomics across nuclear and organelle genomes at Fondazione Edmund Mach (Italy). I analyzed and finished multiple RNA-seq and Organelle-Seq experiments for several plant and fungal species, including Arundo donax. Additionally i analyzed multiple metagenomics anlaysis coming from the fungal and bacterial species involving ITS metagenomics, as well as the bacterial metagenomics.Additionally, I have done a lot of work in the field of organelle genomics and have published the first Cardamine species' chloroplast genomes. I independently created an international partnership to find and create computational methods for a number of crop species. Following that, I spent two years (2014–2016) as a Research Fellow (Academic Level B) at the University of Technology, Sydney, Australia, where I developed computational methods for understanding seagrasses. Following that, I spent a short time spent in University of Connecticut, USA, where i analyzed the Douglas fir genome from the genome annotation to the phylogenomics and identifying genes and evolution of importance.
Since August 2017, I have worked as a Postdoctoral researcher at the Finnish Museum of Natural History and the University of Helsinki, conducting research on genome bioinformatics and sequencing the genomes of lower plants, including Coleochaete orbicularis, Blasia pusilla, Chaetospiridium orbicularis, Polytrichum commune, Mallomonas, and Cryptomonas species. My work has been focused on genome assembly, genome annotation, chloroplast genomics, and a variety of other topics. Additionally, I've worked for various other organisations, such as Edinburgh UK, to analyse the genomics data for PAFTOL species and the chloroplast genomes of the Ambrosia clade from Norway. Since 2019, my research has shifted to examining the genomes of fungi whose species have been sequenced using NextSeq methods. This work is currently concentrated on genome assembly, annotations, markers genes, and phylogenomics of those fungi. I have assembled, annotated, and identified ITS and other phylogenomics markers, as well as performed alignments, phylogenies, and downstream analyses on the fungal genomes of over 500 different species. The bioinformatics application of high throughput sequencing and methods to comprehend the biological and functional importance of the genes, evolution, and pathways in plants have been the main areas of my research up to this point.
From 2022-2023, i added several new skills as a carrier advancement. From 2024 onwards, I worked at Universitat Potsdam, Germany, where I self-learnt RUST and develop approaches for machine and deep learning. During the time, I bench-marked PacBioHifi genome analysis and created a complete HMTL, CSS, Javascript enabled web and also coded several approaches, packages in Python and RUST. Since 2025, I worked as Area Expert at Instytut Chemii Bioorganicznej Polskiej Akademii Nauk, Poland, and i worked on human genomics and developed computational approaches for human genomics and software development.
Python, RUST and GoLang development: Few of them and click the source for more of them. See the last commit tag as the final build release for each of the source repository.
-> eVaiUtilities: Variant analysis from the eVai.
-> panscape: Pangenome long reads
-> sequenceprofiler: Profiling sequence kmers for histograms
-> phyloevolve: Long reads and alignments from the multiple alignments.
-> hpcMapper: DevOPS system managment for the high performance computing.
-> bacdive: Bacterial genome analysis from Bacdive.
-> NLRanalyzer: Complete kit for analyzing NLR.
-> humanCAST: Complete kit for human genome analysis.
-> araseq: Complete kit for the Arabidopsis genome information.
-> minifyseq: Noise removal from the long reads inclduing the machine learning based.
-> CAGanalyzer: Analyzing the CAG repeats from the human genome.
-> doiTAG: Generating doi for the sequences for next generation sequencing.
-> varLinker: Analyzing and linking variants for annotation.
-> vcfFilter: Filtering VCF files
-> rustRet: Analyzing the massspectrometry data.
-> repgnerate: Analyzing the sequencing information post sequencing.
-> proteogenomics: implementing the proteogenomics methods.
-> geomapper: a complete kit for geospatial analysis for German geo.
-> ensemblcov: a complete kit for using the ensemblcov at your commandline.
-> bactiPAN: complete bacterial pangenome analyzer both in shell and RUST.
-> varView: Graphics enabled Variant terminal analyzer in GOlang.
-> doseqGO: A complete sequence information portal for standalone sequence information.