Skip to content

Commit

Permalink
update to v1.2
Browse files Browse the repository at this point in the history
  • Loading branch information
alantian committed Sep 15, 2020
1 parent 0d673ec commit 0f609ee
Show file tree
Hide file tree
Showing 7 changed files with 26,182 additions and 8 deletions.
15 changes: 10 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,10 +2,15 @@

[![License: CC BY-SA 4.0](https://img.shields.io/badge/License-CC%20BY--SA%204.0-blue.svg)](https://creativecommons.org/licenses/by-sa/4.0/)

📚 [Read the paper](https://arxiv.org/abs/2002.08595) to learn more about Kaokore dataset, our motivations in making them, as well as creative usage of it! The paper would appear in the proceedings of the Eleventh International Conference on Computational Creativity, ICCC’20.
📚 [Read the paper](https://arxiv.org/abs/2002.08595) to learn more about Kaokore dataset, our motivations in making them, as well as creative usage of it! The paper is in [the proceedings](http://computationalcreativity.net/iccc20/proceedings/) of the Eleventh International Conference on Computational Creativity, ICCC’20.


**Dataset History** We are keeping expanding the dataset. Besides adding more images, all other settings remain the same.The update history is:

* Version `1.2`: Exapnded to `8848` images. Most recent version
* Version `1.1`: Exapnded to `8573` images.
* Version `1.0`: `5552` images. The initial relase.

**Dataset Expanded!** We have exapned the initial relase (version `1.0`) of our dataset with more images!
The most recent version `1.1` now contains `8573` images. Besides adding more images, all other settings remain the same.
_Note that the classification and the genertive results here and in the paper still correspond to the version `1.0` of our dataset._

## The Dataset
Expand All @@ -15,7 +20,7 @@ _KaoKore_ is a novel dataset of face images from Japanese illustrations along wi
_KaoKore_ dataset is build based on the [_Collection of Facial Expressions_](http://codh.rois.ac.jp/face/), which results from an effort by the ROIS-DS Center for Open Data in the Humanities (CODH) that has been publicly available since 2018.
It provides a dataset of cropped face images extracted from Japanese artworks publicly available from National Institute of Japanese Literature, Kyoto University Rare Materials Digital Archive and Keio University Media Center from the Late Muromachi Period (16th century) to the Early Edo Period (17th century) to facilitate research into art history, especially the study of artistic style. It also provides corresponding metadata annotated by researchers with domain expertise.

KaoKore dataset contains image files, each being an color (RGB) image of size `256 x 256` as well as two sets of labels _gender_ and _social status_. Version `1.0` contains `5552` images, while the most recent version 1.1 contains `8573` images.
KaoKore dataset contains image files, each being an color (RGB) image of size `256 x 256` as well as two sets of labels _gender_ and _social status_. The most recent version contains `8848` images.

<p align="center">
<img src="images/fig/dataset_example.png" width='768'>
Expand All @@ -33,7 +38,7 @@ KaoKore dataset contains image files, each being an color (RGB) image of size `2

## Get the data 💾

🌟 You can run [`python3 download.py`](download.py) download KaoKore datasets. The default setting downloads version `1.0` of the dataset. To try out version `1.1`, please use `python3 download.py --dataset_version 1.1`.
🌟 You can run [`python3 download.py`](download.py) download KaoKore datasets. The default setting downloads the most recent version `1.2` of the dataset. To try out older version (e.g. `1.0`), please use `python3 download.py --dataset_version 1.0`. For version numbers plese refer to **Dataset History** above.
Also, see the output of `download.py --help` for more details.

It is known that some conda installations may have trouble looking for SSL certificates. If that is the case, you could use `download.py --ssl_unverified_context`, **at your own risk and only if you know what you are doing**, to disable the certificate verification. Also it is reported that the default downlaod concurrency `--threads 16` may be too high for some network/machines. In that case please try a lower one.
Expand Down
Loading

0 comments on commit 0f609ee

Please sign in to comment.