Commit

upload
Finance-781 committed Apr 11, 2020
1 parent f29895e commit a39f211
Showing 10 changed files with 280,041 additions and 0 deletions.
3 changes: 3 additions & 0 deletions data/ihdp/README.md
@@ -0,0 +1,3 @@
Location of IHDP dataset introduced in [1].

1. J. L. Hill. Bayesian Nonparametric Modeling for Causal Inference. Journal of Computational and Graphical Statistics, 2012.
Binary file added data/jobs/Jobs_Lalonde_Data.csv.gz
Binary file not shown.
9 changes: 9 additions & 0 deletions data/jobs/README.md
@@ -0,0 +1,9 @@
# Jobs dataset

Data descriptions link: http://users.nber.org/~rdehejia/data/nswdata2.html

The dataset is derived from the following datasets:

- Non-randomized control data: https://users.nber.org/~rdehejia/data/psid_controls.txt
- Randomized treated data: https://users.nber.org/~rdehejia/data/nsw_treated.txt
- Randomized control data: https://users.nber.org/~rdehejia/data/nsw_control.txt
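
The derived file added in this commit, `data/jobs/Jobs_Lalonde_Data.csv.gz`, is gzip-compressed. A minimal sketch of reading it with the Python standard library follows. It assumes the file is comma-separated with a header row, which this README does not confirm; `read_gz_csv` is a hypothetical helper name, not part of this repository.

```python
import csv
import gzip


def read_gz_csv(path):
    """Read a gzip-compressed CSV into a list of dicts keyed by header name.

    Assumption: the first row is a header. The actual column names of the
    derived dataset are not documented in this README.
    """
    # Text mode ("rt") lets the csv module consume decoded lines directly.
    with gzip.open(path, mode="rt", newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))
```

For example, `rows = read_gz_csv("data/jobs/Jobs_Lalonde_Data.csv.gz")` would return one dict per data row, with every value left as a string.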
11 changes: 11 additions & 0 deletions data/kaggle_creditcardfraud/README.md
@@ -0,0 +1,11 @@
# license

Open Database

# source

https://www.kaggle.com/mlg-ulb/creditcardfraud

# notes

Feature "time" has been removed.
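
The commit does not show how the column was removed. As a rough sketch of one way to do it with the standard library, the hypothetical helper `drop_column` below copies a CSV while omitting one named column. It assumes the column header in the original Kaggle file is `Time`; the sample values are made up for illustration.

```python
import csv
import io


def drop_column(src, dst, column):
    """Copy CSV rows from src to dst, omitting one column (case-insensitive name match)."""
    reader = csv.reader(src)
    writer = csv.writer(dst, lineterminator="\n")
    header = next(reader)
    # Indices of the columns to keep, in their original order.
    keep = [i for i, name in enumerate(header) if name.lower() != column.lower()]
    writer.writerow([header[i] for i in keep])
    for row in reader:
        writer.writerow([row[i] for i in keep])


# Tiny in-memory example; the values are illustrative, not taken from the dataset.
src = io.StringIO("Time,V1,Amount,Class\n0,-1.36,149.62,0\n")
dst = io.StringIO()
drop_column(src, dst, "time")
print(dst.getvalue())
```

The same function works on real files by passing handles opened with `newline=""` instead of the `io.StringIO` objects.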
250,000 changes: 250,000 additions & 0 deletions data/kaggle_creditcardfraud/creditcard_modified.csv

Large diffs are not rendered by default.

Binary file added data/spambase.csv.gz
Binary file not shown.
3 changes: 3 additions & 0 deletions data/synthetic/README.md
@@ -0,0 +1,3 @@
Synthetic dataset created for DeepHit.

See "DeepHit: A Deep Learning Approach to Survival Analysis with Competing Risks" for more details.
30,001 changes: 30,001 additions & 0 deletions data/synthetic/synthetic_comprisk.csv

Large diffs are not rendered by default.

14 changes: 14 additions & 0 deletions data/twins/README.md
@@ -0,0 +1,14 @@
# Twins dataset

Source: National Center for Health Statistics 1989-1991

Data descriptions link: https://www.nber.org/data/linked-birth-infant-death-data-vital-statistics-data.html

The dataset is derived from:
- 1989 data: https://www.nber.org/lbid/1989/linkco1989us_num.csv.zip
- 1990 data: https://www.nber.org/lbid/1990/linkco1990us_num.csv.zip
- 1991 data: https://www.nber.org/lbid/1991/linkco1991us_num.csv.zip

The NCHS is responsible only for the initial data and not for the analyses, interpretations, or conclusions reached by the authors.


Binary file added data/twins/Twin_Data.csv.gz
Binary file not shown.
