
Add tests #71

Merged: 29 commits from add_tests into main, Sep 20, 2023
Conversation

@jamesmbaazam (Member)

This PR adds tests for all the functions in the package to close #45.

@jamesmbaazam jamesmbaazam marked this pull request as draft September 13, 2023 20:54
@codecov-commenter commented Sep 13, 2023

Codecov Report

Merging #71 (8e78d6f) into main (88df856) will increase coverage by 36.32%.
The diff coverage is n/a.

❗ Current head 8e78d6f differs from pull request most recent head 537ceab. Consider uploading reports for the commit 537ceab to get more accurate results

@@             Coverage Diff             @@
##             main      #71       +/-   ##
===========================================
+ Coverage   62.50%   98.82%   +36.32%     
===========================================
  Files           8        8               
  Lines         424      424               
===========================================
+ Hits          265      419      +154     
+ Misses        159        5      -154     

see 5 files with indirect coverage changes


@jamesmbaazam jamesmbaazam marked this pull request as ready for review September 15, 2023 08:50
@jamesmbaazam (Member Author)

@pratikunterwegs @joshwlambert @Bisaloo any idea how to fix this failing test?

@jamesmbaazam jamesmbaazam requested a review from sbfnk September 15, 2023 14:17
@pratikunterwegs (Collaborator)

@pratikunterwegs @joshwlambert @Bisaloo any idea how to fix this failing test?

Perhaps a stupid question, but have you changed the behaviour of the function in a way that would legitimately change the snapshot? In which case accepting the new snapshot would be the way out, I'd guess.

@jamesmbaazam (Member Author)

@pratikunterwegs I haven't. Normally, that's the output you'd see in RStudio when you first run the test, and it goes away once you've accepted the new snapshot. However, I don't know how to achieve that in the CI. I'm tempted to delete these tests at the cost of some coverage.

@joshwlambert (Member)

@jamesmbaazam can you reproduce this issue locally? I've run the tests and calculated code coverage on this branch locally and everything works.

Have you tried re-running the failed workflow to check if it was a one-time issue?

@jamesmbaazam (Member Author) commented Sep 15, 2023

@joshwlambert It passes locally for me too but fails on the CI. I get the same error when I re-run the failed jobs.

@pratikunterwegs (Collaborator) commented Sep 18, 2023

@jamesmbaazam from what I understand the snapshot fails as its components are being wrapped in covr:::count() calls. This seems to be because {covr} identifies the outputs as R expressions whose coverage should be checked. This just appears to be how {covr} works. If I'm right, this is part of trace_calls(). From the {covr} vignette:

The core function in covr is trace_calls(). This function was adapted from ideas in Advanced R - Walking the Abstract Syntax Tree with recursive functions. This recursive function modifies each of the leaves (atomic or name objects) of an R expression by applying a given function to them. If the expression is not a leaf the walker function calls itself recursively on elements of the expression instead.

This issue on using {covr} with {drake} appears to suggest that there is not much to be done about this when there's a conflict with static code analysis tools - you would probably be better off removing this snapshot test.

I've tried replacing the expect_snapshot(body(...)) command with expect_snapshot(writeLines(deparse(body(...)))), but since the wrapping happens when the function factory output is generated, it's included in the string, and the snapshot test fails anyway.

One alternative which I haven't tried and which just might work is converting the function factory output to a string internally, and returning a string instead of a closure. I think this would also be caught by {covr} and would probably result in the same issue.

Another issue which you might find insightful regarding code coverage for function factories: r-lib/covr#363

An alternative would be to create snapshot tests for the outputs of the generated functions, rather than the function body itself, and/or testing for the statistical correctness.
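That last alternative could be sketched roughly as below. This is only an illustration: get_offspring_func() and its "pois" argument are taken from the discussion above, and the returned sampler's signature (n, lambda) is a guess, not the package's actual API.

```r
library(testthat)

test_that("generated offspring function behaves like a Poisson sampler", {
  # Assumed factory API; the returned closure is presumed to wrap rpois()
  offspring_fun <- get_offspring_func("pois")
  set.seed(42)
  samples <- offspring_fun(n = 1e4, lambda = 2)
  # Check behaviour rather than the deparsed body, which {covr} instruments
  expect_equal(mean(samples), 2, tolerance = 0.1)
  expect_true(all(samples >= 0))
})
```

Testing the sampled values sidesteps the covr:::count() wrapping entirely, because instrumentation changes the function's body but not its output.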

@jamesmbaazam (Member Author)

@pratikunterwegs Thanks for looking into this; really detailed explanation. I thought about it over the weekend and decided to remove the snapshot test for now, as it doesn't make much sense. There's another test that ensures the right distribution is passed to the spec argument. The discussion in the linked issue is succinct and insightful. In essence, there are just some things you can't test exactly.

@jamesmbaazam jamesmbaazam self-assigned this Sep 18, 2023
@Bisaloo (Member) commented Sep 18, 2023

For an immediate fix of the issue at hand, you can add skip_on_covr() to these tests.
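As a rough sketch (the test name and the get_offspring_func("pois") call are illustrative, taken from the discussion above rather than from the actual test file):

```r
library(testthat)

test_that("offspring function factory body matches snapshot", {
  # {covr} instrumentation wraps expressions in covr:::count() calls, which
  # changes the deparsed body and breaks the snapshot; skip under coverage
  skip_on_covr()
  expect_snapshot(body(get_offspring_func("pois")))
})
```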

More generally, and slightly beyond the scope of this specific PR, I don't see much benefit for this function in its current state. As far as I can tell, it is used just once, in simulate_tree_from_pop(), but it doesn't really simplify the code there because there is still branching based on the offspring_dist value. While I appreciate the value of splitting the code into conceptually distinct units in the form of functions, even when they are called just once, there is a risk of ending up with code so nested that you have to jump down a giant rabbit hole each time you want to understand anything about the codebase. I think we are dangerously tilting in the rabbit hole direction here.

I see two potential solutions:

  • removing get_offspring_func() and making the code part of the main simulate_tree_from_pop(). This is probably my preferred option in this specific case
  • making sure that get_offspring_func() allows you to simplify the code in the parent simulate_tree_from_pop() function. This could be achieved by moving the argument-checking code into get_offspring_func(), which allows you to remove the branching in simulate_tree_from_pop()

@jamesmbaazam (Member Author)

Thanks @Bisaloo. I will implement the first option. Creating an issue to fix this.

@jamesmbaazam (Member Author)

@pratikunterwegs Would you like to review this PR? I am hoping to merge it in by the close of Wednesday.

@pratikunterwegs (Collaborator)

Sure, will have feedback by tomorrow afternoon.

@pratikunterwegs pratikunterwegs self-requested a review September 18, 2023 17:27
Collaborator

I would suggest reorganising this file so that the mapping between the <epichains> and aggregated data created at the top and the actual tests is clearer for maintainers who are new to the package. You could create each object just before you test it, for instance.

I see that the class of the objects is tested in tests-simulate.R whereas this file tests methods, maybe a small comment saying that would be good here for future maintainers. Alternatively, you could combine the two files.

Member Author

You could create each object just before you test it, for instance.

Thanks. I'll make the change. This was how I organised it originally, but I noticed I was doing the same thing many times, so I decided to move it to the top. But it does make sense to have each object within the context in which it's tested, for ease of reading. It seems to be a tradeoff between readability and efficiency. I've made the suggested changes here: fdf45f8 and 395641d.

Alternatively, you could combine the two files.

I think I'll keep them separate to keep the script-to-test mapping organisation.

Comment on lines 157 to 162
test_that("head and tail methods work", {
expect_snapshot(head(epichains_tree))
expect_snapshot(head(epichains_tree2))
expect_snapshot(tail(epichains_tree))
expect_snapshot(tail(epichains_tree2))
})
Collaborator

I would suggest adding a check for the return type, which is currently a data.frame. Might be worth adding this to the method documentation as well, as users might be expecting an <epichains> object.
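A minimal sketch of such a check, reusing the epichains_tree object from the test excerpt above (the expected classes follow this comment, not verified package behaviour):

```r
library(testthat)

test_that("head() and tail() return data.frames, not <epichains> objects", {
  res_head <- head(epichains_tree)
  res_tail <- tail(epichains_tree)
  expect_s3_class(res_head, "data.frame")
  expect_s3_class(res_tail, "data.frame")
  # The <epichains> class is dropped, which the method docs should mention
  expect_false(inherits(res_head, "epichains"))
})
```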

Member Author

I've added the tests here 01c5ef9. I'll create an issue for the documentation issue.

Comment on lines 179 to 190
expect_s3_class(
aggreg_by_gen,
"epichains_aggregate_df"
)
expect_s3_class(
aggreg_by_time,
"epichains_aggregate_df"
)
expect_s3_class(
aggreg_by_both,
"epichains_aggregate_df"
)
Collaborator

Here it might be worth also checking that the data aggregated by "both" inherits from a list, whereas the other two inherit from data.frame. I think there should be a wider rethinking of classes in {epichains} as well, to avoid this ambiguous inheritance structure.
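A sketch of the suggested checks, using the object names from the excerpt above. Note that in R every data.frame is itself a base list, so the "both" case is distinguished by the absence of the data.frame class; this assumes the "both" aggregation returns a plain list, as suggested here, rather than verified package behaviour.

```r
library(testthat)

test_that("aggregation output inherits the expected base classes", {
  expect_s3_class(aggreg_by_gen, "data.frame")
  expect_s3_class(aggreg_by_time, "data.frame")
  # Aggregating by "both" is assumed to return a bare list of results
  expect_type(aggreg_by_both, "list")
  expect_false(inherits(aggreg_by_both, "data.frame"))
})
```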

Member Author

Thanks for the suggestion to rethink the class structures. I'll raise an issue for further discussion. For now, I've added the suggested test here: fdf45f8.

Collaborator

I'm reminded by the recent {cfr} review that test descriptions that include function names can be brittle to function name changes - might be good to change that.

Member Author

I agree that they are brittle, but I wonder if they are really that hard to maintain in a small code base like this. Even for "larger" code bases like that of dplyr, tests such as this one for across use function names.

Generic descriptions in long test files are hard to debug in my opinion.

Collaborator

That was pretty much my logic in {cfr} too - but you suggested to make them more descriptive. I'm happy for the function names to stay in.

Member Author

Yeah, the suggestion was to be more specific with the testing contexts. Maybe there was a lapse in communication.

@pratikunterwegs (Collaborator) left a comment

Thanks @jamesmbaazam - looks alright to me overall. I haven't really looked into {epichains} before. From what I can see, I think the tests cover the package functionality, and there seem to be tests for correctness so hopefully the chain simulation functions work as expected.

This PR is mostly adding tests, but one issue that I noticed is that <epichain> and <epichains_aggregated_df> objects can inherit from different base classes (data.frame and vector, and data.frame and list, respectively). I'm not sure whether it's a good idea to get into conditional inheritance in this way. This would probably make it difficult for future developers to easily account for what sort of object a function will return. If the data in each case (e.g. epichain_tree vs epichain_summary) is sufficiently different, perhaps it would be better to have separate classes. If you would find an overarching signature more convenient for some methods, the existing classes could be defined as abstract super-classes with sub-classes instead. Happy to discuss this further.

A minor point is that looking at the functions that are related to checking the offspring distribution function, you would be restricting users to pass functions that are available in {stats} and can thus be found by exists(). Are there likely to be cases where users would want to specify a function from another package, and pass it explicitly namespaced as "pkg::function"? If so, do you intend to support that, and would the check_*() functions need to account for it?
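For illustration, here is one hypothetical way a check_*() helper could accept both bare names found via exists() and explicitly namespaced "pkg::function" strings. The name resolve_offspring_fun() is invented for this sketch and is not a function in the package.

```r
# Hypothetical helper: resolve "rpois" or "stats::rpois" to a function
resolve_offspring_fun <- function(name) {
  if (grepl("::", name, fixed = TRUE)) {
    parts <- strsplit(name, "::", fixed = TRUE)[[1]]
    # Look the function up in the named package's namespace
    getExportedValue(parts[[1]], parts[[2]])
  } else if (exists(name, mode = "function")) {
    match.fun(name)
  } else {
    stop(sprintf("offspring function '%s' not found", name))
  }
}
```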

@jamesmbaazam (Member Author)

This PR is mostly adding tests, but one issue that I noticed is that <epichain> and <epichains_aggregated_df> objects can inherit from different base classes (data.frame and vector, and data.frame and list, respectively). I'm not sure whether it's a good idea to get into conditional inheritance in this way. This would probably make it difficult for future developers to easily account for what sort of object a function will return. If the data in each case (e.g. epichain_tree vs epichain_summary) is sufficiently different, perhaps it would be better to have separate classes. If you would find an overarching signature more convenient for some methods, the existing classes could be defined as abstract super-classes with sub-classes instead. Happy to discuss this further.

I've logged this here #79.

A minor point is that looking at the functions that are related to checking the offspring distribution function, you would be restricting users to pass functions that are available in {stats} and can thus be found by exists(). Are there likely to be cases where users would want to specify a function from another package, and pass it explicitly namespaced as "pkg::function"? If so, do you intend to support that, and would the check_*() functions need to account for it?

This issue of function look-up has been discussed extensively in #25 and #33 (comment).

@pratikunterwegs (Collaborator) left a comment

Thanks for the changes @jamesmbaazam - looks alright to me.

@jamesmbaazam (Member Author)

Thanks for your thorough review as always.

@jamesmbaazam jamesmbaazam added the enhancement New feature or request label Sep 20, 2023
@jamesmbaazam jamesmbaazam merged commit 021ed86 into main Sep 20, 2023
@jamesmbaazam jamesmbaazam deleted the add_tests branch September 20, 2023 09:05
Labels: enhancement (New feature or request), pkg_infrastructure