The actual datasets are hosted at https://huggingface.co/datasets/medviz/bike-datasets
Due to the large size of data files, please install git-lfs
and enable it first before downloading the data files.
Install git-lfs
:
brew install git-lfs
git lfs install
Add your SSH key to HuggingFace https://huggingface.co/settings/keys.
Then you can download the dataset from HuggingFace.
git clone [email protected]:datasets/medviz/bike-datasets
After downloading this repo, check the file size in the data/medviz
folder.
The file size of those .tsv
files should be quite large. If not, usually it's caused by missing of git-lfs
![image](https://private-user-images.githubusercontent.com/140455389/369083801-350524d1-b9db-444e-9dd7-db296161b055.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkyOTY5ODEsIm5iZiI6MTczOTI5NjY4MSwicGF0aCI6Ii8xNDA0NTUzODkvMzY5MDgzODAxLTM1MDUyNGQxLWI5ZGItNDQ0ZS05ZGQ3LWRiMjk2MTYxYjA1NS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjExJTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxMVQxNzU4MDFaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1lMDE1NGE0ODJkNWM4YzAzYzFmMDVkMTIzOTFjOTFiYWY2NjJkMzM1MDgxMWY1ZDZhNmExMmM1ODIwODRiZmVhJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.UtzHJu0-pzWWX_M0nVkKdL9dcNRomwPzk0RB3o_fmlI)
sudo apt update
sudo apt install git-lfs
git lfs --version
git lfs install
All datasets should follow the following naming convention.
- File name all upper case.
- File type and extension is lower case.
To ensure the loading by frontend, the .tsv
file must be in the correct format:
pid, title, abstract, journal, year, mesh_terms, color, x, y, size
The mesh_terms
column should be a list of keywords seperated by ;
, for example: Infections;Kidney;Kidney Diseases;Microscopy, Electron;Poultry Diseases;Specific Pathogen-Free Organisms