Folders and files

- NF1_Corrected_Second_Plate
- NF1_Pilot_Images
- NF1_Second_Plate
- example_images
- 0.correct_images.sh
- 0.download_NF1_data.yml
- README.md
- convert_crop_NF1_images.cppipe
- convert_crop_NF1_images.cpproj
- correct_images.ipynb
- correct_images.py
- correctionutils.py

README.md

Download NF1 Schwann Cell Images

In this module, we present the two datasets that are downloaded to perform image-based cell profiling: the pilot dataset and a second (new) dataset.

NF1 Project Information

Information regarding the data for the NF1 project is as follows:

Assay: Modified Cell Painting

Channels: DAPI (DNA/Nuclei), GFP (Endoplasmic Reticulum), RFP (Cytoplasm) 

Cells: Schwann cells

Samples: Two genotypes of the NF1 gene, WT +/+ and Null -/-

Microscope: GE Deltavision Elite

Magnification: 20X

Bit-size: 16-bit

Number of images: 

- 96 images (Pilot dataset)

- 384 images (New dataset)

Total storage size: 

- 212.9 MB (Pilot dataset)

- 204.2 MB (New dataset - post conversion and cropping)

Standard Metadata Name Structure

The standard metadata structure for this project is based on the pilot dataset.

Issue with new dataset

When comparing the pilot dataset and the second plate dataset (updated as of 12/09), there were two big differences between them:

- Image composition
- File naming (metadata)

Firstly, the images from the second plate were RGB (Figure 1).

The images from the pilot dataset are greyscale (which is the standard) and are 16-bit (Figure 2).

Secondly, the code within this project relies on the metadata structure from the pilot data (see above), so it would not run properly on the second plate, whose file names do not follow that structure.
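The image composition difference described above can be checked programmatically. The following is a minimal sketch, not the project's own code: the function name `matches_pilot_format`, the example image dimensions, and the assumption that the RGB images are stored as `uint8` are all hypothetical and only serve to illustrate the check.

```python
def matches_pilot_format(shape, dtype_name):
    """Return True for a single-plane (greyscale) 16-bit image, False otherwise."""
    is_greyscale = len(shape) == 2       # RGB images carry a third axis for colour
    is_16_bit = dtype_name == "uint16"   # the pilot images are 16-bit
    return is_greyscale and is_16_bit


# With a NumPy-like array, pass image.shape and image.dtype.name.
# The shapes and the uint8 dtype below are assumed values for illustration.
pilot_like = matches_pilot_format((1024, 1024), "uint16")
second_plate_like = matches_pilot_format((1024, 1024, 3), "uint8")
print(pilot_like, second_plate_like)
```

Any image for which this returns False would need the conversion described in the Solution section before it can be treated like the pilot data.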

Solution

This led to the need to correct the second plate dataset so that it matches the standards of the pilot dataset. The corrections are performed with CellProfiler and Python.

In CellProfiler, we split each RGB image into three greyscale images (called red, green, and blue), keep the one that corresponds to the imaged channel (e.g. DAPI is the blue channel, GFP is green, RFP is red), and crop it to remove the scale. The cropped images from each channel are saved as greyscale, 16-bit .tif files.
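To make the split-and-crop idea concrete, here is a small pure-Python sketch of the same logic. It is not the CellProfiler pipeline itself: `extract_channel`, the R, G, B plane order, and the top-left crop region are assumptions chosen for illustration (the actual position of the scale in the images is not stated here), and the sketch does not perform the 16-bit conversion.

```python
# Channel-to-colour mapping taken from the text; indices assume R, G, B order.
CHANNEL_INDEX = {"RFP": 0, "GFP": 1, "DAPI": 2}


def extract_channel(rgb_rows, channel, crop_height, crop_width):
    """Keep one colour plane and crop to the top-left crop_height x crop_width region."""
    index = CHANNEL_INDEX[channel]
    return [
        [pixel[index] for pixel in row[:crop_width]]
        for row in rgb_rows[:crop_height]
    ]


# Tiny 2 x 3 RGB image: each pixel is an (R, G, B) tuple.
image = [
    [(1, 2, 3), (4, 5, 6), (7, 8, 9)],
    [(10, 11, 12), (13, 14, 15), (16, 17, 18)],
]
dapi = extract_channel(image, "DAPI", crop_height=2, crop_width=2)
print(dapi)  # [[3, 6], [12, 15]]
```

With an array library, the same operation would typically be a single slice over rows, columns, and the colour axis.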

Lastly, using Python, we created a function to reorder the file names and add metadata to fit the standards for analysis downstream.
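As a rough illustration of this renaming step, the sketch below parses a file name, reorders its fields, and adds a metadata field. Both the input pattern (`<site>_<well>_<colour>.tif`) and the output pattern (`<plate>_<well>_<site>_<channel>.tif`) are hypothetical, as are the names `rename_image` and `plate2`; the real structures used by the project are not reproduced here. Only the colour-to-channel mapping comes from the text above.

```python
import re

# Colour-to-channel mapping from the text above.
COLOUR_TO_CHANNEL = {"blue": "DAPI", "green": "GFP", "red": "RFP"}

# HYPOTHETICAL input pattern: <site>_<well>_<colour>.tif
INPUT_PATTERN = re.compile(
    r"^(?P<site>\d+)_(?P<well>[A-Z]\d+)_(?P<colour>red|green|blue)\.tif$"
)


def rename_image(filename, plate="plate2"):
    """Reorder the parsed fields and prepend a plate label (hypothetical scheme)."""
    match = INPUT_PATTERN.match(filename)
    if match is None:
        raise ValueError(f"Unexpected file name: {filename}")
    channel = COLOUR_TO_CHANNEL[match["colour"]]
    # HYPOTHETICAL output pattern: <plate>_<well>_<site>_<channel>.tif
    return f"{plate}_{match['well']}_{match['site']}_{channel}.tif"


print(rename_image("1_B3_blue.tif"))  # plate2_B3_1_DAPI.tif
```

Raising an error on unexpected names, rather than silently skipping them, makes it easier to notice files that would otherwise be left out of downstream analysis.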

Steps to perform the conversion and correction of new dataset

Step 1: Create conda environment

# Run this command to create the conda environment 
conda env create -f 0.download_NF1_data.yml

Step 2: Activate conda environment

# Run this command to activate the conda environment
conda activate download-NF1-data

Step 3: Execute preprocessing NF1 data

# Run this script in terminal
bash 0.correct_images.sh
