SkimLit: Let's make PubMed abstracts easier to read
PubMed is a platform that hosts millions of articles in the medical field.
The purpose of this project is to build an NLP model that makes medical abstracts easier to read.
Model: Replicate the model behind the paper "PubMed 200k RCT: a Dataset for Sequential Sentence Classification in Medical Abstracts"
(model structure available here: https://arxiv.org/pdf/1612.05251.pdf)
This is a multimodal model, and its implementation is carried out using TensorFlow-Keras.
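To give a rough idea of what a multi-input model looks like in TensorFlow-Keras, here is a minimal sketch built with the functional API. It is not the architecture from the paper or the final model of this project: the two inputs (token IDs and a one-hot line position), the layer sizes, the vocabulary size, and the function name build_sketch_model are all assumptions chosen for illustration. The five output classes correspond to the sentence labels used in the PubMed RCT dataset (background, objective, methods, results, conclusions).

```python
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers


def build_sketch_model(vocab_size=1000, seq_len=20, pos_depth=15, num_classes=5):
    """Hypothetical two-input classifier; all sizes are illustrative assumptions."""
    # Branch 1: integer token IDs -> embedding -> averaged sentence vector
    token_ids = layers.Input(shape=(seq_len,), dtype="int32", name="token_ids")
    x = layers.Embedding(input_dim=vocab_size, output_dim=32)(token_ids)
    x = layers.GlobalAveragePooling1D()(x)

    # Branch 2: one-hot position of the sentence within its abstract (assumed feature)
    line_position = layers.Input(shape=(pos_depth,), dtype="float32", name="line_position")
    p = layers.Dense(16, activation="relu")(line_position)

    # Merge both branches and predict one label per sentence
    merged = layers.Concatenate()([x, p])
    outputs = layers.Dense(num_classes, activation="softmax")(merged)
    return tf.keras.Model(inputs=[token_ids, line_position], outputs=outputs)


model = build_sketch_model()

# Dummy batch of two sentences, only to check that the shapes line up
dummy_tokens = np.zeros((2, 20), dtype="int32")
dummy_positions = np.zeros((2, 15), dtype="float32")
probs = model.predict([dummy_tokens, dummy_positions], verbose=0)
```

Each row of probs holds one probability per class for the corresponding sentence. A real pipeline would replace the dummy arrays with vectorized sentences and position features derived from the dataset.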
Dataset: https://www.kaggle.com/datasets/matthewjansen/pubmed-200k-rtc
Input and output: