
What to put inside the reproducibility report? #4

Open
rougier opened this issue Nov 4, 2019 · 12 comments


rougier commented Nov 4, 2019

This is a thread for defining what kind of information might usefully appear in the reproducibility report (independently of success or failure). @annakrystalli might have some more ideas, since she has organized reproducibility hackathons:

  • Location of the original source code (online, physical medium, what kind, etc.)
  • Presence of a license for the code
  • Presence of a README
  • Programming language
  • Operating system (if relevant)
  • Specific hardware (if relevant)
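A sketch of how these items could be captured in a machine-readable report header (all field names and values below are hypothetical, just to illustrate the idea):

```yaml
# Hypothetical report metadata; field names are illustrative only.
source_code:
  location: online            # online, CD-ROM, tape, etc.
  archived_at: software-heritage
license: GPL-3.0              # or "none"
readme: true
language: Fortran
operating_system: "SunOS 5.8 (original) / Ubuntu 18.04 (reproduction)"
hardware: none-specific       # only if relevant
```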

Feel free to add comments/ideas/suggestions

@annakrystalli

There are a couple of resources that could be useful here:

Will try and work on this soon!


khinsen commented Nov 4, 2019

Operating system and hardware should ideally be given both for the original computation and for the reproduction.

License is nice to know but not essential for reproduction by the original author.

We should also recommend that authors make their code available online (if not already done), add a license (if missing), and provide usage instructions. Then submit everything to Software Heritage.


rougier commented Nov 4, 2019

From @annakrystalli's resources, I think we could ask authors who managed to reproduce their results to make a dedicated compendium (a GitHub repository) and to save it at Software Heritage
(and we'll test it again in ten years :)). The reviewer template will also be super useful for the review.


khinsen commented Nov 19, 2019

Copying over some items from ReScience/template#6:

@rougier writes:
Among the things that might be interesting:

  • How did you preserve the sources?
  • Did you take care to record the RNG seed (if you used one)?
  • Did you save the command-line options (if any were needed)?
  • Did you need to adapt your sources?
  • Did you need to adapt your libraries?
  • What guided your choice of Fortran over other languages at that time?
  • etc.

@khinsen writes:
I'd like to emphasize the utility of communicating the choices (and the motivations behind them) made at the time of publication, even if they risk being distorted by hindsight. That's something we can only get out of authors doing reproductions of their own work. For example, I realized that I never preserved or published code for reproducibility, but only to make it available for reuse by others. As a consequence, I am always missing the last small steps: command-line arguments, that five-line script that ties computations together, etc.


khinsen commented Dec 3, 2019

One point from my own experience as a participant in the challenge: If reproduction requires any changes at all to the code or the installation instructions, discuss which knowledge or competence someone else would need to be able to do it. In my case (https://github.com/khinsen/rescience-ten-year-challenge-paper-3), I fixed a software collapse issue by changing a single line of my code, but I could do that quickly only because I had written or contributed to the entire software stack that my code depends on. For anyone else, figuring out that a dependency of a dependency of my code was broken by a change one more layer below would probably be a prohibitive effort.


khinsen commented Dec 11, 2019

I have written a first draft of the author guidelines. Comments (and pull requests!) welcome!


rougier commented Dec 12, 2019

Looks good to me. Maybe we need to emphasize the description of the language that was used, since we may want to compute some (simple) statistics across all the entries.


khinsen commented Dec 12, 2019

Good point. Anything else we might want to include in the statistics? In theory it would be interesting to include all dependencies, not just the language. It is unlikely that we will have enough submissions to do a meaningful analysis of anything but the most frequently listed dependencies, but I'd expect a few dependencies (e.g. NumPy) to be as frequent as languages.


khinsen commented Dec 13, 2019

Could we reasonably ask authors to provide a machine-readable list of dependencies for our analysis? Take ReScience/submissions#11 (comment) as an example: I think the author provided a nice and detailed explanation of his choice of technologies for a human reader, but it would be hard to extract [Fortran, IQPACK] as a dependency list from it.

Alternatively, we could ask reviewers to compile such a list and have authors verify it. Or scan for dependencies at the end, when doing our statistics, which is doable if the number of submissions doesn't explode in the coming months.
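If authors did provide such lists, the statistics step would be straightforward. A minimal sketch (the report format and all example data are invented for illustration):

```python
from collections import Counter

def tally_dependencies(reports):
    """Count how often each language or dependency appears across reports.

    `reports` is a list of per-submission dependency lists,
    e.g. [["Fortran", "IQPACK"], ["Python", "NumPy"]].
    """
    counts = Counter()
    for deps in reports:
        counts.update(set(deps))  # count each item once per submission
    return counts

# Invented example data, loosely mimicking submissions like [Fortran, IQPACK].
reports = [["Fortran", "IQPACK"], ["Python", "NumPy"], ["Python", "NumPy", "SciPy"]]
print(dict(tally_dependencies(reports)))
```

Even this trivial tally only works if the lists use consistent names ("NumPy" vs "numpy"), which is another argument for having reviewers verify them.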


nuest commented Dec 13, 2019

Would you go so far as to suggest creating a Binder?


khinsen commented Dec 13, 2019

No. We won't have any notebooks, due to the ten-year rule, so moving to Binder would require authors to rewrite their code, which is not the goal of the exercise.

What we could suggest is packaging a suitable computational environment as a container (a reproducible Dockerfile, for example) or using Nix or Guix. But I wouldn't want to make this a condition; we'd lose too many people.
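Such a Dockerfile could be very short. A sketch, in which the base image, packages, file names, and entry point are all placeholders:

```dockerfile
# Hypothetical recipe: pin an old, known-good environment
# rather than rewrite the legacy code for modern tooling.
FROM debian:8
RUN apt-get update && apt-get install -y gfortran make
COPY . /code
WORKDIR /code
RUN make
CMD ["./run_simulation"]
```

The point is reproducibility of the *environment*: the base image and package versions are pinned, so the build should give the same stack years later (as long as the image and packages remain archived).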


pdebuyl commented Dec 17, 2019

We can propose alternatives here:

  • a notebook, on Binder or not
  • a Makefile
  • a bash script
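For the Makefile option, something as small as this would already capture the "five-line script that ties computations together" mentioned above (all targets and file names here are hypothetical):

```makefile
# Hypothetical top-level Makefile recording how results and figures are built.
all: figure1.pdf

results.dat: simulate.f90
	gfortran -o simulate simulate.f90
	./simulate --seed 42 > results.dat

figure1.pdf: results.dat plot.py
	python plot.py results.dat figure1.pdf
```

It doubles as documentation: the exact compiler invocation, RNG seed, and command-line options are preserved in one place.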


5 participants