Name		Name	Last commit message	Last commit date
parent directory ..
datasets		datasets
notebooks		notebooks
README.md		README.md
classes.pkl		classes.pkl
create_model.py		create_model.py
entrypoint.py		entrypoint.py
label_encoder.pkl		label_encoder.pkl
model.pkl		model.pkl
requirements.txt		requirements.txt

README.md

Medical Transcript Classifier

This model is based on the Kaggle Medical Transcriptions dataset. It has medical transcripts along with the medical specialty they represent. We will build a classifier that will predict the medical specialty given the transcription text. While the dataset has thousands of specialties, we limit ourselves to a subset of 10.

Setup

Create a virtual environment using your favorite environment management tool and install the requirements. As an example,

python3 -m venv env
source env/bin/activate

pip3 install arthurai
pip3 install -r requirements.txt

Note that the requirements.txt file in this directory assumes python versions 3.6-3.8, as these are currently the only supported versions for the arthur SDK.

Quickstart

The notebook notebooks/Quickstart.ipynb shows an example of onboarding a model and sending data.

Other Files

While this repo contains a pre-trained model and everything else you need to get started, the code used to generate the model is included for your reference.

create_model.py is the code used to create the model
entrypoint.py is the code used to enable explainability
Pickle files are result of running create_model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nlp_medical_transcript_classifier

nlp_medical_transcript_classifier

README.md

Medical Transcript Classifier

Setup

Quickstart

Other Files

Files

nlp_medical_transcript_classifier

Directory actions

More options

Directory actions

More options

Latest commit

History

nlp_medical_transcript_classifier

Folders and files

parent directory

README.md

Medical Transcript Classifier

Setup

Quickstart

Other Files