From 4d7be916c31036bf859d2704e806f2eced083c64 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sbfnk@users.noreply.github.com>
Date: Wed, 9 Jan 2019 06:42:06 +0000
Subject: [PATCH 001/828] initial commit

---
 .gitignore |  36 +++
 LICENSE    | 674 +++++++++++++++++++++++++++++++++++++++++++++++++++++
 README.md  |   2 +
 3 files changed, 712 insertions(+)
 create mode 100644 .gitignore
 create mode 100644 LICENSE
 create mode 100644 README.md

diff --git a/.gitignore b/.gitignore
new file mode 100644
index 00000000..26fad6fa
--- /dev/null
+++ b/.gitignore
@@ -0,0 +1,36 @@
+# History files
+.Rhistory
+.Rapp.history
+
+# Session Data files
+.RData
+
+# Example code in package build process
+*-Ex.R
+
+# Output files from R CMD build
+/*.tar.gz
+
+# Output files from R CMD check
+/*.Rcheck/
+
+# RStudio files
+.Rproj.user/
+
+# produced vignettes
+vignettes/*.html
+vignettes/*.pdf
+
+# OAuth2 token, see https://github.com/hadley/httr/releases/tag/v0.3
+.httr-oauth
+
+# knitr and R markdown default cache directories
+/*_cache/
+/cache/
+
+# Temporary files created by R markdown
+*.utf8.md
+*.knit.md
+
+# Shiny token, see https://shiny.rstudio.com/articles/shinyapps.html
+rsconnect/
diff --git a/LICENSE b/LICENSE
new file mode 100644
index 00000000..f288702d
--- /dev/null
+++ b/LICENSE
@@ -0,0 +1,674 @@
+                    GNU GENERAL PUBLIC LICENSE
+                       Version 3, 29 June 2007
+
+ Copyright (C) 2007 Free Software Foundation, Inc. <https://fsf.org/>
+ Everyone is permitted to copy and distribute verbatim copies
+ of this license document, but changing it is not allowed.
+
+                            Preamble
+
+  The GNU General Public License is a free, copyleft license for
+software and other kinds of works.
+
+  The licenses for most software and other practical works are designed
+to take away your freedom to share and change the works.  By contrast,
+the GNU General Public License is intended to guarantee your freedom to
+share and change all versions of a program--to make sure it remains free
+software for all its users.  We, the Free Software Foundation, use the
+GNU General Public License for most of our software; it applies also to
+any other work released this way by its authors.  You can apply it to
+your programs, too.
+
+  When we speak of free software, we are referring to freedom, not
+price.  Our General Public Licenses are designed to make sure that you
+have the freedom to distribute copies of free software (and charge for
+them if you wish), that you receive source code or can get it if you
+want it, that you can change the software or use pieces of it in new
+free programs, and that you know you can do these things.
+
+  To protect your rights, we need to prevent others from denying you
+these rights or asking you to surrender the rights.  Therefore, you have
+certain responsibilities if you distribute copies of the software, or if
+you modify it: responsibilities to respect the freedom of others.
+
+  For example, if you distribute copies of such a program, whether
+gratis or for a fee, you must pass on to the recipients the same
+freedoms that you received.  You must make sure that they, too, receive
+or can get the source code.  And you must show them these terms so they
+know their rights.
+
+  Developers that use the GNU GPL protect your rights with two steps:
+(1) assert copyright on the software, and (2) offer you this License
+giving you legal permission to copy, distribute and/or modify it.
+
+  For the developers' and authors' protection, the GPL clearly explains
+that there is no warranty for this free software.  For both users' and
+authors' sake, the GPL requires that modified versions be marked as
+changed, so that their problems will not be attributed erroneously to
+authors of previous versions.
+
+  Some devices are designed to deny users access to install or run
+modified versions of the software inside them, although the manufacturer
+can do so.  This is fundamentally incompatible with the aim of
+protecting users' freedom to change the software.  The systematic
+pattern of such abuse occurs in the area of products for individuals to
+use, which is precisely where it is most unacceptable.  Therefore, we
+have designed this version of the GPL to prohibit the practice for those
+products.  If such problems arise substantially in other domains, we
+stand ready to extend this provision to those domains in future versions
+of the GPL, as needed to protect the freedom of users.
+
+  Finally, every program is threatened constantly by software patents.
+States should not allow patents to restrict development and use of
+software on general-purpose computers, but in those that do, we wish to
+avoid the special danger that patents applied to a free program could
+make it effectively proprietary.  To prevent this, the GPL assures that
+patents cannot be used to render the program non-free.
+
+  The precise terms and conditions for copying, distribution and
+modification follow.
+
+                       TERMS AND CONDITIONS
+
+  0. Definitions.
+
+  "This License" refers to version 3 of the GNU General Public License.
+
+  "Copyright" also means copyright-like laws that apply to other kinds of
+works, such as semiconductor masks.
+
+  "The Program" refers to any copyrightable work licensed under this
+License.  Each licensee is addressed as "you".  "Licensees" and
+"recipients" may be individuals or organizations.
+
+  To "modify" a work means to copy from or adapt all or part of the work
+in a fashion requiring copyright permission, other than the making of an
+exact copy.  The resulting work is called a "modified version" of the
+earlier work or a work "based on" the earlier work.
+
+  A "covered work" means either the unmodified Program or a work based
+on the Program.
+
+  To "propagate" a work means to do anything with it that, without
+permission, would make you directly or secondarily liable for
+infringement under applicable copyright law, except executing it on a
+computer or modifying a private copy.  Propagation includes copying,
+distribution (with or without modification), making available to the
+public, and in some countries other activities as well.
+
+  To "convey" a work means any kind of propagation that enables other
+parties to make or receive copies.  Mere interaction with a user through
+a computer network, with no transfer of a copy, is not conveying.
+
+  An interactive user interface displays "Appropriate Legal Notices"
+to the extent that it includes a convenient and prominently visible
+feature that (1) displays an appropriate copyright notice, and (2)
+tells the user that there is no warranty for the work (except to the
+extent that warranties are provided), that licensees may convey the
+work under this License, and how to view a copy of this License.  If
+the interface presents a list of user commands or options, such as a
+menu, a prominent item in the list meets this criterion.
+
+  1. Source Code.
+
+  The "source code" for a work means the preferred form of the work
+for making modifications to it.  "Object code" means any non-source
+form of a work.
+
+  A "Standard Interface" means an interface that either is an official
+standard defined by a recognized standards body, or, in the case of
+interfaces specified for a particular programming language, one that
+is widely used among developers working in that language.
+
+  The "System Libraries" of an executable work include anything, other
+than the work as a whole, that (a) is included in the normal form of
+packaging a Major Component, but which is not part of that Major
+Component, and (b) serves only to enable use of the work with that
+Major Component, or to implement a Standard Interface for which an
+implementation is available to the public in source code form.  A
+"Major Component", in this context, means a major essential component
+(kernel, window system, and so on) of the specific operating system
+(if any) on which the executable work runs, or a compiler used to
+produce the work, or an object code interpreter used to run it.
+
+  The "Corresponding Source" for a work in object code form means all
+the source code needed to generate, install, and (for an executable
+work) run the object code and to modify the work, including scripts to
+control those activities.  However, it does not include the work's
+System Libraries, or general-purpose tools or generally available free
+programs which are used unmodified in performing those activities but
+which are not part of the work.  For example, Corresponding Source
+includes interface definition files associated with source files for
+the work, and the source code for shared libraries and dynamically
+linked subprograms that the work is specifically designed to require,
+such as by intimate data communication or control flow between those
+subprograms and other parts of the work.
+
+  The Corresponding Source need not include anything that users
+can regenerate automatically from other parts of the Corresponding
+Source.
+
+  The Corresponding Source for a work in source code form is that
+same work.
+
+  2. Basic Permissions.
+
+  All rights granted under this License are granted for the term of
+copyright on the Program, and are irrevocable provided the stated
+conditions are met.  This License explicitly affirms your unlimited
+permission to run the unmodified Program.  The output from running a
+covered work is covered by this License only if the output, given its
+content, constitutes a covered work.  This License acknowledges your
+rights of fair use or other equivalent, as provided by copyright law.
+
+  You may make, run and propagate covered works that you do not
+convey, without conditions so long as your license otherwise remains
+in force.  You may convey covered works to others for the sole purpose
+of having them make modifications exclusively for you, or provide you
+with facilities for running those works, provided that you comply with
+the terms of this License in conveying all material for which you do
+not control copyright.  Those thus making or running the covered works
+for you must do so exclusively on your behalf, under your direction
+and control, on terms that prohibit them from making any copies of
+your copyrighted material outside their relationship with you.
+
+  Conveying under any other circumstances is permitted solely under
+the conditions stated below.  Sublicensing is not allowed; section 10
+makes it unnecessary.
+
+  3. Protecting Users' Legal Rights From Anti-Circumvention Law.
+
+  No covered work shall be deemed part of an effective technological
+measure under any applicable law fulfilling obligations under article
+11 of the WIPO copyright treaty adopted on 20 December 1996, or
+similar laws prohibiting or restricting circumvention of such
+measures.
+
+  When you convey a covered work, you waive any legal power to forbid
+circumvention of technological measures to the extent such circumvention
+is effected by exercising rights under this License with respect to
+the covered work, and you disclaim any intention to limit operation or
+modification of the work as a means of enforcing, against the work's
+users, your or third parties' legal rights to forbid circumvention of
+technological measures.
+
+  4. Conveying Verbatim Copies.
+
+  You may convey verbatim copies of the Program's source code as you
+receive it, in any medium, provided that you conspicuously and
+appropriately publish on each copy an appropriate copyright notice;
+keep intact all notices stating that this License and any
+non-permissive terms added in accord with section 7 apply to the code;
+keep intact all notices of the absence of any warranty; and give all
+recipients a copy of this License along with the Program.
+
+  You may charge any price or no price for each copy that you convey,
+and you may offer support or warranty protection for a fee.
+
+  5. Conveying Modified Source Versions.
+
+  You may convey a work based on the Program, or the modifications to
+produce it from the Program, in the form of source code under the
+terms of section 4, provided that you also meet all of these conditions:
+
+    a) The work must carry prominent notices stating that you modified
+    it, and giving a relevant date.
+
+    b) The work must carry prominent notices stating that it is
+    released under this License and any conditions added under section
+    7.  This requirement modifies the requirement in section 4 to
+    "keep intact all notices".
+
+    c) You must license the entire work, as a whole, under this
+    License to anyone who comes into possession of a copy.  This
+    License will therefore apply, along with any applicable section 7
+    additional terms, to the whole of the work, and all its parts,
+    regardless of how they are packaged.  This License gives no
+    permission to license the work in any other way, but it does not
+    invalidate such permission if you have separately received it.
+
+    d) If the work has interactive user interfaces, each must display
+    Appropriate Legal Notices; however, if the Program has interactive
+    interfaces that do not display Appropriate Legal Notices, your
+    work need not make them do so.
+
+  A compilation of a covered work with other separate and independent
+works, which are not by their nature extensions of the covered work,
+and which are not combined with it such as to form a larger program,
+in or on a volume of a storage or distribution medium, is called an
+"aggregate" if the compilation and its resulting copyright are not
+used to limit the access or legal rights of the compilation's users
+beyond what the individual works permit.  Inclusion of a covered work
+in an aggregate does not cause this License to apply to the other
+parts of the aggregate.
+
+  6. Conveying Non-Source Forms.
+
+  You may convey a covered work in object code form under the terms
+of sections 4 and 5, provided that you also convey the
+machine-readable Corresponding Source under the terms of this License,
+in one of these ways:
+
+    a) Convey the object code in, or embodied in, a physical product
+    (including a physical distribution medium), accompanied by the
+    Corresponding Source fixed on a durable physical medium
+    customarily used for software interchange.
+
+    b) Convey the object code in, or embodied in, a physical product
+    (including a physical distribution medium), accompanied by a
+    written offer, valid for at least three years and valid for as
+    long as you offer spare parts or customer support for that product
+    model, to give anyone who possesses the object code either (1) a
+    copy of the Corresponding Source for all the software in the
+    product that is covered by this License, on a durable physical
+    medium customarily used for software interchange, for a price no
+    more than your reasonable cost of physically performing this
+    conveying of source, or (2) access to copy the
+    Corresponding Source from a network server at no charge.
+
+    c) Convey individual copies of the object code with a copy of the
+    written offer to provide the Corresponding Source.  This
+    alternative is allowed only occasionally and noncommercially, and
+    only if you received the object code with such an offer, in accord
+    with subsection 6b.
+
+    d) Convey the object code by offering access from a designated
+    place (gratis or for a charge), and offer equivalent access to the
+    Corresponding Source in the same way through the same place at no
+    further charge.  You need not require recipients to copy the
+    Corresponding Source along with the object code.  If the place to
+    copy the object code is a network server, the Corresponding Source
+    may be on a different server (operated by you or a third party)
+    that supports equivalent copying facilities, provided you maintain
+    clear directions next to the object code saying where to find the
+    Corresponding Source.  Regardless of what server hosts the
+    Corresponding Source, you remain obligated to ensure that it is
+    available for as long as needed to satisfy these requirements.
+
+    e) Convey the object code using peer-to-peer transmission, provided
+    you inform other peers where the object code and Corresponding
+    Source of the work are being offered to the general public at no
+    charge under subsection 6d.
+
+  A separable portion of the object code, whose source code is excluded
+from the Corresponding Source as a System Library, need not be
+included in conveying the object code work.
+
+  A "User Product" is either (1) a "consumer product", which means any
+tangible personal property which is normally used for personal, family,
+or household purposes, or (2) anything designed or sold for incorporation
+into a dwelling.  In determining whether a product is a consumer product,
+doubtful cases shall be resolved in favor of coverage.  For a particular
+product received by a particular user, "normally used" refers to a
+typical or common use of that class of product, regardless of the status
+of the particular user or of the way in which the particular user
+actually uses, or expects or is expected to use, the product.  A product
+is a consumer product regardless of whether the product has substantial
+commercial, industrial or non-consumer uses, unless such uses represent
+the only significant mode of use of the product.
+
+  "Installation Information" for a User Product means any methods,
+procedures, authorization keys, or other information required to install
+and execute modified versions of a covered work in that User Product from
+a modified version of its Corresponding Source.  The information must
+suffice to ensure that the continued functioning of the modified object
+code is in no case prevented or interfered with solely because
+modification has been made.
+
+  If you convey an object code work under this section in, or with, or
+specifically for use in, a User Product, and the conveying occurs as
+part of a transaction in which the right of possession and use of the
+User Product is transferred to the recipient in perpetuity or for a
+fixed term (regardless of how the transaction is characterized), the
+Corresponding Source conveyed under this section must be accompanied
+by the Installation Information.  But this requirement does not apply
+if neither you nor any third party retains the ability to install
+modified object code on the User Product (for example, the work has
+been installed in ROM).
+
+  The requirement to provide Installation Information does not include a
+requirement to continue to provide support service, warranty, or updates
+for a work that has been modified or installed by the recipient, or for
+the User Product in which it has been modified or installed.  Access to a
+network may be denied when the modification itself materially and
+adversely affects the operation of the network or violates the rules and
+protocols for communication across the network.
+
+  Corresponding Source conveyed, and Installation Information provided,
+in accord with this section must be in a format that is publicly
+documented (and with an implementation available to the public in
+source code form), and must require no special password or key for
+unpacking, reading or copying.
+
+  7. Additional Terms.
+
+  "Additional permissions" are terms that supplement the terms of this
+License by making exceptions from one or more of its conditions.
+Additional permissions that are applicable to the entire Program shall
+be treated as though they were included in this License, to the extent
+that they are valid under applicable law.  If additional permissions
+apply only to part of the Program, that part may be used separately
+under those permissions, but the entire Program remains governed by
+this License without regard to the additional permissions.
+
+  When you convey a copy of a covered work, you may at your option
+remove any additional permissions from that copy, or from any part of
+it.  (Additional permissions may be written to require their own
+removal in certain cases when you modify the work.)  You may place
+additional permissions on material, added by you to a covered work,
+for which you have or can give appropriate copyright permission.
+
+  Notwithstanding any other provision of this License, for material you
+add to a covered work, you may (if authorized by the copyright holders of
+that material) supplement the terms of this License with terms:
+
+    a) Disclaiming warranty or limiting liability differently from the
+    terms of sections 15 and 16 of this License; or
+
+    b) Requiring preservation of specified reasonable legal notices or
+    author attributions in that material or in the Appropriate Legal
+    Notices displayed by works containing it; or
+
+    c) Prohibiting misrepresentation of the origin of that material, or
+    requiring that modified versions of such material be marked in
+    reasonable ways as different from the original version; or
+
+    d) Limiting the use for publicity purposes of names of licensors or
+    authors of the material; or
+
+    e) Declining to grant rights under trademark law for use of some
+    trade names, trademarks, or service marks; or
+
+    f) Requiring indemnification of licensors and authors of that
+    material by anyone who conveys the material (or modified versions of
+    it) with contractual assumptions of liability to the recipient, for
+    any liability that these contractual assumptions directly impose on
+    those licensors and authors.
+
+  All other non-permissive additional terms are considered "further
+restrictions" within the meaning of section 10.  If the Program as you
+received it, or any part of it, contains a notice stating that it is
+governed by this License along with a term that is a further
+restriction, you may remove that term.  If a license document contains
+a further restriction but permits relicensing or conveying under this
+License, you may add to a covered work material governed by the terms
+of that license document, provided that the further restriction does
+not survive such relicensing or conveying.
+
+  If you add terms to a covered work in accord with this section, you
+must place, in the relevant source files, a statement of the
+additional terms that apply to those files, or a notice indicating
+where to find the applicable terms.
+
+  Additional terms, permissive or non-permissive, may be stated in the
+form of a separately written license, or stated as exceptions;
+the above requirements apply either way.
+
+  8. Termination.
+
+  You may not propagate or modify a covered work except as expressly
+provided under this License.  Any attempt otherwise to propagate or
+modify it is void, and will automatically terminate your rights under
+this License (including any patent licenses granted under the third
+paragraph of section 11).
+
+  However, if you cease all violation of this License, then your
+license from a particular copyright holder is reinstated (a)
+provisionally, unless and until the copyright holder explicitly and
+finally terminates your license, and (b) permanently, if the copyright
+holder fails to notify you of the violation by some reasonable means
+prior to 60 days after the cessation.
+
+  Moreover, your license from a particular copyright holder is
+reinstated permanently if the copyright holder notifies you of the
+violation by some reasonable means, this is the first time you have
+received notice of violation of this License (for any work) from that
+copyright holder, and you cure the violation prior to 30 days after
+your receipt of the notice.
+
+  Termination of your rights under this section does not terminate the
+licenses of parties who have received copies or rights from you under
+this License.  If your rights have been terminated and not permanently
+reinstated, you do not qualify to receive new licenses for the same
+material under section 10.
+
+  9. Acceptance Not Required for Having Copies.
+
+  You are not required to accept this License in order to receive or
+run a copy of the Program.  Ancillary propagation of a covered work
+occurring solely as a consequence of using peer-to-peer transmission
+to receive a copy likewise does not require acceptance.  However,
+nothing other than this License grants you permission to propagate or
+modify any covered work.  These actions infringe copyright if you do
+not accept this License.  Therefore, by modifying or propagating a
+covered work, you indicate your acceptance of this License to do so.
+
+  10. Automatic Licensing of Downstream Recipients.
+
+  Each time you convey a covered work, the recipient automatically
+receives a license from the original licensors, to run, modify and
+propagate that work, subject to this License.  You are not responsible
+for enforcing compliance by third parties with this License.
+
+  An "entity transaction" is a transaction transferring control of an
+organization, or substantially all assets of one, or subdividing an
+organization, or merging organizations.  If propagation of a covered
+work results from an entity transaction, each party to that
+transaction who receives a copy of the work also receives whatever
+licenses to the work the party's predecessor in interest had or could
+give under the previous paragraph, plus a right to possession of the
+Corresponding Source of the work from the predecessor in interest, if
+the predecessor has it or can get it with reasonable efforts.
+
+  You may not impose any further restrictions on the exercise of the
+rights granted or affirmed under this License.  For example, you may
+not impose a license fee, royalty, or other charge for exercise of
+rights granted under this License, and you may not initiate litigation
+(including a cross-claim or counterclaim in a lawsuit) alleging that
+any patent claim is infringed by making, using, selling, offering for
+sale, or importing the Program or any portion of it.
+
+  11. Patents.
+
+  A "contributor" is a copyright holder who authorizes use under this
+License of the Program or a work on which the Program is based.  The
+work thus licensed is called the contributor's "contributor version".
+
+  A contributor's "essential patent claims" are all patent claims
+owned or controlled by the contributor, whether already acquired or
+hereafter acquired, that would be infringed by some manner, permitted
+by this License, of making, using, or selling its contributor version,
+but do not include claims that would be infringed only as a
+consequence of further modification of the contributor version.  For
+purposes of this definition, "control" includes the right to grant
+patent sublicenses in a manner consistent with the requirements of
+this License.
+
+  Each contributor grants you a non-exclusive, worldwide, royalty-free
+patent license under the contributor's essential patent claims, to
+make, use, sell, offer for sale, import and otherwise run, modify and
+propagate the contents of its contributor version.
+
+  In the following three paragraphs, a "patent license" is any express
+agreement or commitment, however denominated, not to enforce a patent
+(such as an express permission to practice a patent or covenant not to
+sue for patent infringement).  To "grant" such a patent license to a
+party means to make such an agreement or commitment not to enforce a
+patent against the party.
+
+  If you convey a covered work, knowingly relying on a patent license,
+and the Corresponding Source of the work is not available for anyone
+to copy, free of charge and under the terms of this License, through a
+publicly available network server or other readily accessible means,
+then you must either (1) cause the Corresponding Source to be so
+available, or (2) arrange to deprive yourself of the benefit of the
+patent license for this particular work, or (3) arrange, in a manner
+consistent with the requirements of this License, to extend the patent
+license to downstream recipients.  "Knowingly relying" means you have
+actual knowledge that, but for the patent license, your conveying the
+covered work in a country, or your recipient's use of the covered work
+in a country, would infringe one or more identifiable patents in that
+country that you have reason to believe are valid.
+
+  If, pursuant to or in connection with a single transaction or
+arrangement, you convey, or propagate by procuring conveyance of, a
+covered work, and grant a patent license to some of the parties
+receiving the covered work authorizing them to use, propagate, modify
+or convey a specific copy of the covered work, then the patent license
+you grant is automatically extended to all recipients of the covered
+work and works based on it.
+
+  A patent license is "discriminatory" if it does not include within
+the scope of its coverage, prohibits the exercise of, or is
+conditioned on the non-exercise of one or more of the rights that are
+specifically granted under this License.  You may not convey a covered
+work if you are a party to an arrangement with a third party that is
+in the business of distributing software, under which you make payment
+to the third party based on the extent of your activity of conveying
+the work, and under which the third party grants, to any of the
+parties who would receive the covered work from you, a discriminatory
+patent license (a) in connection with copies of the covered work
+conveyed by you (or copies made from those copies), or (b) primarily
+for and in connection with specific products or compilations that
+contain the covered work, unless you entered into that arrangement,
+or that patent license was granted, prior to 28 March 2007.
+
+  Nothing in this License shall be construed as excluding or limiting
+any implied license or other defenses to infringement that may
+otherwise be available to you under applicable patent law.
+
+  12. No Surrender of Others' Freedom.
+
+  If conditions are imposed on you (whether by court order, agreement or
+otherwise) that contradict the conditions of this License, they do not
+excuse you from the conditions of this License.  If you cannot convey a
+covered work so as to satisfy simultaneously your obligations under this
+License and any other pertinent obligations, then as a consequence you may
+not convey it at all.  For example, if you agree to terms that obligate you
+to collect a royalty for further conveying from those to whom you convey
+the Program, the only way you could satisfy both those terms and this
+License would be to refrain entirely from conveying the Program.
+
+  13. Use with the GNU Affero General Public License.
+
+  Notwithstanding any other provision of this License, you have
+permission to link or combine any covered work with a work licensed
+under version 3 of the GNU Affero General Public License into a single
+combined work, and to convey the resulting work.  The terms of this
+License will continue to apply to the part which is the covered work,
+but the special requirements of the GNU Affero General Public License,
+section 13, concerning interaction through a network will apply to the
+combination as such.
+
+  14. Revised Versions of this License.
+
+  The Free Software Foundation may publish revised and/or new versions of
+the GNU General Public License from time to time.  Such new versions will
+be similar in spirit to the present version, but may differ in detail to
+address new problems or concerns.
+
+  Each version is given a distinguishing version number.  If the
+Program specifies that a certain numbered version of the GNU General
+Public License "or any later version" applies to it, you have the
+option of following the terms and conditions either of that numbered
+version or of any later version published by the Free Software
+Foundation.  If the Program does not specify a version number of the
+GNU General Public License, you may choose any version ever published
+by the Free Software Foundation.
+
+  If the Program specifies that a proxy can decide which future
+versions of the GNU General Public License can be used, that proxy's
+public statement of acceptance of a version permanently authorizes you
+to choose that version for the Program.
+
+  Later license versions may give you additional or different
+permissions.  However, no additional obligations are imposed on any
+author or copyright holder as a result of your choosing to follow a
+later version.
+
+  15. Disclaimer of Warranty.
+
+  THERE IS NO WARRANTY FOR THE PROGRAM, TO THE EXTENT PERMITTED BY
+APPLICABLE LAW.  EXCEPT WHEN OTHERWISE STATED IN WRITING THE COPYRIGHT
+HOLDERS AND/OR OTHER PARTIES PROVIDE THE PROGRAM "AS IS" WITHOUT WARRANTY
+OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO,
+THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR
+PURPOSE.  THE ENTIRE RISK AS TO THE QUALITY AND PERFORMANCE OF THE PROGRAM
+IS WITH YOU.  SHOULD THE PROGRAM PROVE DEFECTIVE, YOU ASSUME THE COST OF
+ALL NECESSARY SERVICING, REPAIR OR CORRECTION.
+
+  16. Limitation of Liability.
+
+  IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN WRITING
+WILL ANY COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MODIFIES AND/OR CONVEYS
+THE PROGRAM AS PERMITTED ABOVE, BE LIABLE TO YOU FOR DAMAGES, INCLUDING ANY
+GENERAL, SPECIAL, INCIDENTAL OR CONSEQUENTIAL DAMAGES ARISING OUT OF THE
+USE OR INABILITY TO USE THE PROGRAM (INCLUDING BUT NOT LIMITED TO LOSS OF
+DATA OR DATA BEING RENDERED INACCURATE OR LOSSES SUSTAINED BY YOU OR THIRD
+PARTIES OR A FAILURE OF THE PROGRAM TO OPERATE WITH ANY OTHER PROGRAMS),
+EVEN IF SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE POSSIBILITY OF
+SUCH DAMAGES.
+
+  17. Interpretation of Sections 15 and 16.
+
+  If the disclaimer of warranty and limitation of liability provided
+above cannot be given local legal effect according to their terms,
+reviewing courts shall apply local law that most closely approximates
+an absolute waiver of all civil liability in connection with the
+Program, unless a warranty or assumption of liability accompanies a
+copy of the Program in return for a fee.
+
+                     END OF TERMS AND CONDITIONS
+
+            How to Apply These Terms to Your New Programs
+
+  If you develop a new program, and you want it to be of the greatest
+possible use to the public, the best way to achieve this is to make it
+free software which everyone can redistribute and change under these terms.
+
+  To do so, attach the following notices to the program.  It is safest
+to attach them to the start of each source file to most effectively
+state the exclusion of warranty; and each file should have at least
+the "copyright" line and a pointer to where the full notice is found.
+
+    <one line to give the program's name and a brief idea of what it does.>
+    Copyright (C) <year>  <name of author>
+
+    This program is free software: you can redistribute it and/or modify
+    it under the terms of the GNU General Public License as published by
+    the Free Software Foundation, either version 3 of the License, or
+    (at your option) any later version.
+
+    This program is distributed in the hope that it will be useful,
+    but WITHOUT ANY WARRANTY; without even the implied warranty of
+    MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+    GNU General Public License for more details.
+
+    You should have received a copy of the GNU General Public License
+    along with this program.  If not, see <https://www.gnu.org/licenses/>.
+
+Also add information on how to contact you by electronic and paper mail.
+
+  If the program does terminal interaction, make it output a short
+notice like this when it starts in an interactive mode:
+
+    <program>  Copyright (C) <year>  <name of author>
+    This program comes with ABSOLUTELY NO WARRANTY; for details type `show w'.
+    This is free software, and you are welcome to redistribute it
+    under certain conditions; type `show c' for details.
+
+The hypothetical commands `show w' and `show c' should show the appropriate
+parts of the General Public License.  Of course, your program's commands
+might be different; for a GUI interface, you would use an "about box".
+
+  You should also get your employer (if you work as a programmer) or school,
+if any, to sign a "copyright disclaimer" for the program, if necessary.
+For more information on this, and how to apply and follow the GNU GPL, see
+<https://www.gnu.org/licenses/>.
+
+  The GNU General Public License does not permit incorporating your program
+into proprietary programs.  If your program is a subroutine library, you
+may consider it more useful to permit linking proprietary applications with
+the library.  If this is what you want to do, use the GNU Lesser General
+Public License instead of this License.  But first, please read
+<https://www.gnu.org/licenses/why-not-lgpl.html>.
diff --git a/README.md b/README.md
new file mode 100644
index 00000000..9b3e57ea
--- /dev/null
+++ b/README.md
@@ -0,0 +1,2 @@
+# epichains
+Methods for analysing the distribution of epidemiological chain sizes and lengths

From 172fef08b94d1443390d6c28ea42e94f06c11c5f Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 9 Jan 2019 06:45:32 +0000
Subject: [PATCH 002/828] function code

---
 R/borel.r       |  25 ++++++++
 R/likelihoods.R | 153 ++++++++++++++++++++++++++++++++++++++++++++++++
 R/simulate.r    |  43 ++++++++++++++
 3 files changed, 221 insertions(+)
 create mode 100644 R/borel.r
 create mode 100644 R/likelihoods.R
 create mode 100644 R/simulate.r

diff --git a/R/borel.r b/R/borel.r
new file mode 100644
index 00000000..a03e34f3
--- /dev/null
+++ b/R/borel.r
@@ -0,0 +1,25 @@
+##' Density of the Borel distribution
+##'
+##' @param x vector of integers.
+##' @param mu mu parameter.
+##' @param log logical; if TRUE, probabilities p are given as log(p).
+##' @return probability mass.
+##' @author Sebastian Funk
+dborel <- function(x, mu, log=FALSE) {
+    if (x < 1) stop("'x' must be greater than 0")
+    ld <- -mu * x + (x - 1) * log(mu * x) - lgamma(x + 1)
+    if (!log) ld <- exp(ld)
+    return(ld)
+}
+
+##' Generate random numbers from the Borel distribution
+##'
+##' Random numbers are generated by simulating from a Poisson branching process
+##' @param n number of random variates to generate.
+##' @param mu mu parameter.
+##' @param infinite any number to treat as infinite; simulations will be stopped if this number is reached
+##' @return vector of random numbers
+##' @author Sebastian Funk
+rborel <- function(n, mu, infinite=Inf) {
+    chain_sim(n, "pois", "size", infinite=infinite, lambda=mu)
+}
diff --git a/R/likelihoods.R b/R/likelihoods.R
new file mode 100644
index 00000000..364ca519
--- /dev/null
+++ b/R/likelihoods.R
@@ -0,0 +1,153 @@
+##' Likelihood of the size of chains with Poisson offspring distribution
+##'
+##' @param x vector of sizes
+##' @param lambda rate of the Poisson distributino
+##' @return log-likelihood values
+##' @author Sebastian Funk
+pois_size_ll <- function(x, lambda)
+{
+  (x - 1) * log(lambda) - lambda * x + (x - 2) * log(x) - lgamma(x)
+}
+
+##' Likelihood of the size of chains with Negative-Binomial offspring distribution
+##'
+##' @param x vector of sizes
+##' @param size the dispersion parameter (often called \code{k} in ecological applications)
+##' @param prob probability of success (in the parameterisation with \code{prob}, see also \code{\link[stats]{NegBinomial}})
+##' @param mu mean parameter
+##' @return log-likelihood values
+##' @author Sebastian Funk
+nbinom_size_ll <- function(x, size, prob, mu)
+{
+  if (!missing(prob)) {
+    if (!missing(mu)) stop("'prob' and 'mu' both specified")
+    mu <- size * (1 - prob) / prob
+  }
+  lgamma(size * x + (x - 1)) - (lgamma(size * x) + lgamma(x + 1)) +
+    (x - 1) * log (mu / size) -
+    (size * x + (x - 1)) * log(1 + mu / size)
+}
+
+##' Likelihood of the size of chains with gamma-Borel offspring distribution
+##'
+##' @param x vector of sizes
+##' @param size the dispersion parameter (often called \code{k} in ecological applications)
+##' @param prob probability of success (in the parameterisation with \code{prob}, see also \code{\link[stats]{NegBinomial}})
+##' @param mu mean parameter
+##' @return log-likelihood values
+##' @author Sebastian Funk
+gborel_size_ll <- function(x, size, prob, mu) {
+  if (!missing(prob)) {
+    if (!missing(mu)) stop("'prob' and 'mu' both specified")
+    mu <- size * (1 - prob) / prob
+  }
+  lgamma(size + x - 1) - (lgamma(x + 1) + lgamma(size)) - size * log(mu / size) +
+    (x - 1) * log(x) - (size + x - 1) * log(x + size / mu)
+}
+
+##' Likelihood of the length of chains with Poisson offspring distribution
+##'
+##' @param x vector of sizes
+##' @param lambda rate of the Poisson distributino
+##' @return log-likelihood values
+##' @author Sebastian Funk
+pois_length_ll <- function(x, lambda) {
+
+  ## iterated exponential function
+  arg <- exp(lambda * exp(-lambda))
+  itex <- 1
+  for (i in seq_len(max(x))) itex <- c(itex, arg ^ itex[i])
+
+  Gk <- c(0, exp(-lambda) * itex) ## set G_{0}=1
+
+  log(Gk[x + 1] - Gk[x])
+}
+
+##' Likelihood of the length of chains with geometric offspring distribution
+##'
+##' @param x vector of sizes
+##' @param prob probability of the geometric distribution with mean \code{1/prob}
+##' @return log-likelihood values
+##' @author Sebastian Funkgeom_length_ll <- function(x, prob) {
+geom_length_ll <- function(x, prob) {
+
+  lambda <- 1 / prob
+  ## G(k) - G(k - 1)
+  GkmGkm1 <- (1 - lambda ^ (x)) / (1 - lambda ^ (x + 1)) -
+    (1 - lambda ^ (x - 1)) / (1 - lambda ^ (x))
+
+  log(GkmGkm1)
+}
+
+##' Likelihood of the length of chains with generic offspring distribution
+##'
+##' The likelihoods are calculated with a crude approximation using simulated
+##'   chains by linearly approximating any missing values in the empirical
+##'   cumulative distribution function (ecdf).
+##' @param x vector of sizes
+##' @param ... any paramaters to pass to \code{\link{chain_sim}}
+##' @return log-likelihood values
+##' @author Sebastian Funkgeom_length_ll <- function(x, prob) {
+##' @inheritParams chain_ll chain_sim
+offspring_ll <- function(x, offspring, stat, n=100, ...) {
+
+  dist <- chain_sim(n, offspring, stat, ...)
+
+  ## linear approximation
+  f <- ecdf(dist)
+  acdf <- diff(c(0, approx(unique(dist), f(unique(dist)), seq_len(max(dist[is.finite(dist)])))$y))
+  lik <- acdf[x]
+  lik[is.na(lik)] <- 0
+  log(lik)
+}
+
+##' Likelihood for the outcome of a branching process
+##'
+##' @param x vector of sizes or lengths of transmission chains
+##' @param stat statistic given as \code{x} ("size" or "length" of chains)
+##' @param infinite any chains of this size/length will be treated as infinite
+##' @param exclude any sizes/lengths to exclude from the likelihood calculation
+##' @param ... parameters for the offspring distribution
+##' @return likelihood
+##' @inheritParams chain_sim
+##' @seealso pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll geom_length_ll offspring_ll
+##' @author Sebastian Funk
+chain_ll <- function(x, offspring, stat=c("size", "length"), infinite = Inf, exclude, ...)
+{
+  stat <- match.arg(stat)
+
+  if (any(x >= infinite)) {
+    calc_sizes <- seq_len(infinite)
+  } else {
+    calc_sizes <- unique(c(1, x))
+  }
+
+  ## first, get likelihood function as given by `offspring` and `stat``
+  likelihoods <- c()
+  ll_func <- paste(offspring, stat, "ll", sep="_")
+  if (exists(ll_func)) {
+    func <- get(ll_func)
+    if (!is.function(func)) stop("'", ll_func, "' is not a function.")
+    likelihoods[calc_sizes] <- func(calc_sizes, ...)
+  } else {
+    likelihoods[calc_sizes] <- offspring_ll(calc_sizes, offspring, stat, ...)
+  }
+
+  if (!missing(exclude)) {
+    likelihoods <- likelihoods - log(-expm1(sum(likelihoods[exclude])))
+    likelihoods[exclude] <- -Inf
+  }
+
+  sexpl <- sum(exp(likelihoods), na.rm = TRUE)
+  if (sexpl < 1) {
+    maxl <- log(1 - sum(exp(likelihoods), na.rm = TRUE))
+  } else {
+    maxl <- -Inf
+  }
+  likelihoods <- c(likelihoods, maxl)
+
+  x[x > infinite] <- infinite + 1
+  chain_likelihoods <- likelihoods[x]
+
+  return(sum(chain_likelihoods))
+}
diff --git a/R/simulate.r b/R/simulate.r
new file mode 100644
index 00000000..f43e02e8
--- /dev/null
+++ b/R/simulate.r
@@ -0,0 +1,43 @@
+##' Simulate chains using a branching process
+##'
+##' @param n number of simulations to run.
+##' @param offspring offspring distribution as character string, e.g. "pois" for
+##'     the Poisson offspring distribution. 
+##' @param stat statistic to calculate ("size" or "length" of chains)
+##' @param infinite a size or length from which the size/length is to be considered infinite
+##' @param ... parameters of the offspring distribution
+##' @return a vector of sizes/lengths
+##' @author Sebastian Funk
+chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf, ...) {
+
+    stat <- match.arg(stat)
+
+    ## first, get random function as given by `offspring`
+    random_func <- paste0("r", offspring)
+    if (!exists(random_func)) stop("Random sampling function '", random_func, "' does not exist.")
+    func <- get(random_func)
+    if (!is.function(func)) stop("'", random_func, "' is not a function.")
+
+    ## next, simulate n chains
+    dist <- c()
+    for (i in seq_len(n)) {
+        stat_track <- 1 ## variable to track length or size (depending on `stat`)
+        state <- 1
+        while (state > 0 && state < infinite) {
+            offspring <- sum(func(n=state, ...))
+            if (stat=="size") {
+                stat_track <- stat_track + offspring
+            } else if (stat=="length"){
+                if (offspring > 0) stat_track <- stat_track + 1
+            } else {
+                stop("Unknown statistic: '", stat, "'.")
+            }
+            state <- offspring
+        }
+        if (state >= infinite) stat_track <- Inf
+        dist[i] <- stat_track
+    }
+
+    return(dist)
+}
+

From 9e0d38d5ad6bbf2a5cd51a2c156390e28442a33a Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 10 Jan 2019 23:30:34 +0000
Subject: [PATCH 003/828] use more robust log1p for maximal likelihood

---
 R/likelihoods.R | 12 ++++--------
 1 file changed, 4 insertions(+), 8 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 364ca519..24828c29 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -138,15 +138,11 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), infinite = Inf, exc
     likelihoods[exclude] <- -Inf
   }
 
-  sexpl <- sum(exp(likelihoods), na.rm = TRUE)
-  if (sexpl < 1) {
-    maxl <- log(1 - sum(exp(likelihoods), na.rm = TRUE))
-  } else {
-    maxl <- -Inf
+  if (any(x >= infinite)) {
+    maxl <- log1p(-sum(exp(likelihoods), na.rm = TRUE))
+    likelihoods <- c(likelihoods, maxl)
+    x[x > infinite] <- infinite + 1
   }
-  likelihoods <- c(likelihoods, maxl)
-
-  x[x > infinite] <- infinite + 1
   chain_likelihoods <- likelihoods[x]
 
   return(sum(chain_likelihoods))

From 2ec446e259aba5f6e456b176557dd7475d074b11 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 10 Jan 2019 23:30:56 +0000
Subject: [PATCH 004/828] convert vectors of parameters to lists in chain_ll

---
 R/likelihoods.R | 7 +++++--
 1 file changed, 5 insertions(+), 2 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 24828c29..0db365f0 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -125,12 +125,15 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), infinite = Inf, exc
   ## first, get likelihood function as given by `offspring` and `stat``
   likelihoods <- c()
   ll_func <- paste(offspring, stat, "ll", sep="_")
+  pars <- as.list(unlist(list(...))) ## converts vectors to lists
   if (exists(ll_func)) {
     func <- get(ll_func)
     if (!is.function(func)) stop("'", ll_func, "' is not a function.")
-    likelihoods[calc_sizes] <- func(calc_sizes, ...)
+    likelihoods[calc_sizes] <- do.call(func, c(list(x=calc_sizes), pars))
   } else {
-    likelihoods[calc_sizes] <- offspring_ll(calc_sizes, offspring, stat, ...)
+    likelihoods[calc_sizes] <-
+      do.call(offspring_ll,
+              c(list(x=calc_sizes, offspring=offspring, stat=stat), pars))
   }
 
   if (!missing(exclude)) {

From b0931e6d9ad602692e60dc2a2af48a2db5d41a1d Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 10 Jan 2019 23:31:25 +0000
Subject: [PATCH 005/828] add documentation

---
 DESCRIPTION           | 10 ++++++++++
 NAMESPACE             |  2 ++
 R/likelihoods.R       | 11 ++++++-----
 man/chain_ll.Rd       | 35 +++++++++++++++++++++++++++++++++++
 man/chain_sim.Rd      | 30 ++++++++++++++++++++++++++++++
 man/dborel.Rd         | 24 ++++++++++++++++++++++++
 man/gborel_size_ll.Rd | 26 ++++++++++++++++++++++++++
 man/geom_length_ll.Rd | 22 ++++++++++++++++++++++
 man/nbinom_size_ll.Rd | 26 ++++++++++++++++++++++++++
 man/offspring_ll.Rd   | 31 +++++++++++++++++++++++++++++++
 man/pois_length_ll.Rd | 22 ++++++++++++++++++++++
 man/pois_size_ll.Rd   | 22 ++++++++++++++++++++++
 man/rborel.Rd         | 24 ++++++++++++++++++++++++
 13 files changed, 280 insertions(+), 5 deletions(-)
 create mode 100644 DESCRIPTION
 create mode 100644 NAMESPACE
 create mode 100644 man/chain_ll.Rd
 create mode 100644 man/chain_sim.Rd
 create mode 100644 man/dborel.Rd
 create mode 100644 man/gborel_size_ll.Rd
 create mode 100644 man/geom_length_ll.Rd
 create mode 100644 man/nbinom_size_ll.Rd
 create mode 100644 man/offspring_ll.Rd
 create mode 100644 man/pois_length_ll.Rd
 create mode 100644 man/pois_size_ll.Rd
 create mode 100644 man/rborel.Rd

diff --git a/DESCRIPTION b/DESCRIPTION
new file mode 100644
index 00000000..28fca7e4
--- /dev/null
+++ b/DESCRIPTION
@@ -0,0 +1,10 @@
+Package: epichains
+Version: 0.1
+Title: Analysis of transmission chains
+Authors@R: c(person("Sebastian", "Funk", email = "sebastian.funk@lshtm.ac.uk", role = c("aut", "cre")))
+Description: Performs analysis of chain sizes
+License: GPL-3
+URL: https://github.com/sbfnk/epichains
+BugReports: https://github.com/sbfnk/epichains/issues
+NeedsCompilation: no
+RoxygenNote: 6.1.1
diff --git a/NAMESPACE b/NAMESPACE
new file mode 100644
index 00000000..6ae92683
--- /dev/null
+++ b/NAMESPACE
@@ -0,0 +1,2 @@
+# Generated by roxygen2: do not edit by hand
+
diff --git a/R/likelihoods.R b/R/likelihoods.R
index 0db365f0..d47da195 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -68,7 +68,7 @@ pois_length_ll <- function(x, lambda) {
 ##' @param x vector of sizes
 ##' @param prob probability of the geometric distribution with mean \code{1/prob}
 ##' @return log-likelihood values
-##' @author Sebastian Funkgeom_length_ll <- function(x, prob) {
+##' @author Sebastian Funk
 geom_length_ll <- function(x, prob) {
 
   lambda <- 1 / prob
@@ -87,8 +87,9 @@ geom_length_ll <- function(x, prob) {
 ##' @param x vector of sizes
 ##' @param ... any paramaters to pass to \code{\link{chain_sim}}
 ##' @return log-likelihood values
-##' @author Sebastian Funkgeom_length_ll <- function(x, prob) {
-##' @inheritParams chain_ll chain_sim
+##' @author Sebastian Funk
+##' @inheritParams chain_ll
+##' @inheritParams chain_sim
 offspring_ll <- function(x, offspring, stat, n=100, ...) {
 
   dist <- chain_sim(n, offspring, stat, ...)
@@ -104,15 +105,15 @@ offspring_ll <- function(x, offspring, stat, n=100, ...) {
 ##' Likelihood for the outcome of a branching process
 ##'
 ##' @param x vector of sizes or lengths of transmission chains
+##' @param ... parameters for the offspring distribution
 ##' @param stat statistic given as \code{x} ("size" or "length" of chains)
 ##' @param infinite any chains of this size/length will be treated as infinite
 ##' @param exclude any sizes/lengths to exclude from the likelihood calculation
-##' @param ... parameters for the offspring distribution
 ##' @return likelihood
 ##' @inheritParams chain_sim
 ##' @seealso pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll geom_length_ll offspring_ll
 ##' @author Sebastian Funk
-chain_ll <- function(x, offspring, stat=c("size", "length"), infinite = Inf, exclude, ...)
+chain_ll <- function(x, offspring, ..., stat=c("size", "length"), infinite = Inf, exclude)
 {
   stat <- match.arg(stat)
 
diff --git a/man/chain_ll.Rd b/man/chain_ll.Rd
new file mode 100644
index 00000000..450be841
--- /dev/null
+++ b/man/chain_ll.Rd
@@ -0,0 +1,35 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/likelihoods.R
+\name{chain_ll}
+\alias{chain_ll}
+\title{Likelihood for the outcome of a branching process}
+\usage{
+chain_ll(x, offspring, stat = c("size", "length"), infinite = Inf,
+  exclude, ...)
+}
+\arguments{
+\item{x}{vector of sizes or lengths of transmission chains}
+
+\item{offspring}{offspring distribution as character string, e.g. "pois" for
+the Poisson offspring distribution.}
+
+\item{stat}{statistic given as \code{x} ("size" or "length" of chains)}
+
+\item{infinite}{any chains of this size/length will be treated as infinite}
+
+\item{exclude}{any sizes/lengths to exclude from the likelihood calculation}
+
+\item{...}{parameters for the offspring distribution}
+}
+\value{
+likelihood
+}
+\description{
+Likelihood for the outcome of a branching process
+}
+\seealso{
+pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll geom_length_ll offspring_ll
+}
+\author{
+Sebastian Funk
+}
diff --git a/man/chain_sim.Rd b/man/chain_sim.Rd
new file mode 100644
index 00000000..e3d09a24
--- /dev/null
+++ b/man/chain_sim.Rd
@@ -0,0 +1,30 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/simulate.r
+\name{chain_sim}
+\alias{chain_sim}
+\title{Simulate chains using a branching process}
+\usage{
+chain_sim(n, offspring, stat = c("size", "length"), infinite = Inf,
+  ...)
+}
+\arguments{
+\item{n}{number of simulations to run.}
+
+\item{offspring}{offspring distribution as character string, e.g. "pois" for
+the Poisson offspring distribution.}
+
+\item{stat}{statistic to calculate ("size" or "length" of chains)}
+
+\item{infinite}{a size or length from which the size/length is to be considered infinite}
+
+\item{...}{parameters of the offspring distribution}
+}
+\value{
+a vector of sizes/lengths
+}
+\description{
+Simulate chains using a branching process
+}
+\author{
+Sebastian Funk
+}
diff --git a/man/dborel.Rd b/man/dborel.Rd
new file mode 100644
index 00000000..14d269d0
--- /dev/null
+++ b/man/dborel.Rd
@@ -0,0 +1,24 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/borel.r
+\name{dborel}
+\alias{dborel}
+\title{Density of the Borel distribution}
+\usage{
+dborel(x, mu, log = FALSE)
+}
+\arguments{
+\item{x}{vector of integers.}
+
+\item{mu}{mu parameter.}
+
+\item{log}{logical; if TRUE, probabilities p are given as log(p).}
+}
+\value{
+probability mass.
+}
+\description{
+Density of the Borel distribution
+}
+\author{
+Sebastian Funk
+}
diff --git a/man/gborel_size_ll.Rd b/man/gborel_size_ll.Rd
new file mode 100644
index 00000000..1e6c2fc4
--- /dev/null
+++ b/man/gborel_size_ll.Rd
@@ -0,0 +1,26 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/likelihoods.R
+\name{gborel_size_ll}
+\alias{gborel_size_ll}
+\title{Likelihood of the size of chains with gamma-Borel offspring distribution}
+\usage{
+gborel_size_ll(x, size, prob, mu)
+}
+\arguments{
+\item{x}{vector of sizes}
+
+\item{size}{the dispersion parameter (often called \code{k} in ecological applications)}
+
+\item{prob}{probability of success (in the parameterisation with \code{prob}, see also \code{\link[stats]{NegBinomial}})}
+
+\item{mu}{mean parameter}
+}
+\value{
+log-likelihood values
+}
+\description{
+Likelihood of the size of chains with gamma-Borel offspring distribution
+}
+\author{
+Sebastian Funk
+}
diff --git a/man/geom_length_ll.Rd b/man/geom_length_ll.Rd
new file mode 100644
index 00000000..428bd355
--- /dev/null
+++ b/man/geom_length_ll.Rd
@@ -0,0 +1,22 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/likelihoods.R
+\name{geom_length_ll}
+\alias{geom_length_ll}
+\title{Likelihood of the length of chains with geometric offspring distribution}
+\usage{
+geom_length_ll(x, prob)
+}
+\arguments{
+\item{x}{vector of sizes}
+
+\item{prob}{probability of the geometric distribution with mean \code{1/prob}}
+}
+\value{
+log-likelihood values
+}
+\description{
+Likelihood of the length of chains with geometric offspring distribution
+}
+\author{
+Sebastian Funk
+}
diff --git a/man/nbinom_size_ll.Rd b/man/nbinom_size_ll.Rd
new file mode 100644
index 00000000..4d58ee7f
--- /dev/null
+++ b/man/nbinom_size_ll.Rd
@@ -0,0 +1,26 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/likelihoods.R
+\name{nbinom_size_ll}
+\alias{nbinom_size_ll}
+\title{Likelihood of the size of chains with Negative-Binomial offspring distribution}
+\usage{
+nbinom_size_ll(x, size, prob, mu)
+}
+\arguments{
+\item{x}{vector of sizes}
+
+\item{size}{the dispersion parameter (often called \code{k} in ecological applications)}
+
+\item{prob}{probability of success (in the parameterisation with \code{prob}, see also \code{\link[stats]{NegBinomial}})}
+
+\item{mu}{mean parameter}
+}
+\value{
+log-likelihood values
+}
+\description{
+Likelihood of the size of chains with Negative-Binomial offspring distribution
+}
+\author{
+Sebastian Funk
+}
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
new file mode 100644
index 00000000..aab921f4
--- /dev/null
+++ b/man/offspring_ll.Rd
@@ -0,0 +1,31 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/likelihoods.R
+\name{offspring_ll}
+\alias{offspring_ll}
+\title{Likelihood of the length of chains with generic offspring distribution}
+\usage{
+offspring_ll(x, offspring, stat, n = 100, ...)
+}
+\arguments{
+\item{x}{vector of sizes}
+
+\item{offspring}{offspring distribution as character string, e.g. "pois" for
+the Poisson offspring distribution.}
+
+\item{stat}{statistic given as \code{x} ("size" or "length" of chains)}
+
+\item{n}{number of simulations to run.}
+
+\item{...}{any paramaters to pass to \code{\link{chain_sim}}}
+}
+\value{
+log-likelihood values
+}
+\description{
+The likelihoods are calculated with a crude approximation using simulated
+  chains by linearly approximating any missing values in the empirical
+  cumulative distribution function (ecdf).
+}
+\author{
+Sebastian Funk
+}
diff --git a/man/pois_length_ll.Rd b/man/pois_length_ll.Rd
new file mode 100644
index 00000000..4d80bda1
--- /dev/null
+++ b/man/pois_length_ll.Rd
@@ -0,0 +1,22 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/likelihoods.R
+\name{pois_length_ll}
+\alias{pois_length_ll}
+\title{Likelihood of the length of chains with Poisson offspring distribution}
+\usage{
+pois_length_ll(x, lambda)
+}
+\arguments{
+\item{x}{vector of sizes}
+
+\item{lambda}{rate of the Poisson distributino}
+}
+\value{
+log-likelihood values
+}
+\description{
+Likelihood of the length of chains with Poisson offspring distribution
+}
+\author{
+Sebastian Funk
+}
diff --git a/man/pois_size_ll.Rd b/man/pois_size_ll.Rd
new file mode 100644
index 00000000..c5c0bd28
--- /dev/null
+++ b/man/pois_size_ll.Rd
@@ -0,0 +1,22 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/likelihoods.R
+\name{pois_size_ll}
+\alias{pois_size_ll}
+\title{Likelihood of the size of chains with Poisson offspring distribution}
+\usage{
+pois_size_ll(x, lambda)
+}
+\arguments{
+\item{x}{vector of sizes}
+
+\item{lambda}{rate of the Poisson distributino}
+}
+\value{
+log-likelihood values
+}
+\description{
+Likelihood of the size of chains with Poisson offspring distribution
+}
+\author{
+Sebastian Funk
+}
diff --git a/man/rborel.Rd b/man/rborel.Rd
new file mode 100644
index 00000000..8923dc65
--- /dev/null
+++ b/man/rborel.Rd
@@ -0,0 +1,24 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/borel.r
+\name{rborel}
+\alias{rborel}
+\title{Generate random numbers from the Borel distribution}
+\usage{
+rborel(n, mu, infinite = Inf)
+}
+\arguments{
+\item{n}{number of random variates to generate.}
+
+\item{mu}{mu parameter.}
+
+\item{infinite}{any number to treat as infinite; simulations will be stopped if this number is reached}
+}
+\value{
+vector of random numbers
+}
+\description{
+Random numbers are generated by simulating from a Poisson branching process
+}
+\author{
+Sebastian Funk
+}

From f14c22aca963e040996fe022fe6b732441a95ecc Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Tue, 15 Jan 2019 17:09:39 +0000
Subject: [PATCH 006/828] set infinite as lower limit for infinite outbreak
 sizes

---
 R/likelihoods.R | 8 +++++---
 1 file changed, 5 insertions(+), 3 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index d47da195..c3e44381 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -118,15 +118,18 @@ chain_ll <- function(x, offspring, ..., stat=c("size", "length"), infinite = Inf
   stat <- match.arg(stat)
 
   if (any(x >= infinite)) {
-    calc_sizes <- seq_len(infinite)
+    calc_sizes <- seq_len(infinite-1)
+    x[x >= infinite] <- infinite
   } else {
-    calc_sizes <- unique(c(1, x))
+    calc_sizes <- unique(x)
   }
 
   ## first, get likelihood function as given by `offspring` and `stat``
   likelihoods <- c()
   ll_func <- paste(offspring, stat, "ll", sep="_")
   pars <- as.list(unlist(list(...))) ## converts vectors to lists
+
+  ## calculate likelihoods
   if (exists(ll_func)) {
     func <- get(ll_func)
     if (!is.function(func)) stop("'", ll_func, "' is not a function.")
@@ -145,7 +148,6 @@ chain_ll <- function(x, offspring, ..., stat=c("size", "length"), infinite = Inf
   if (any(x >= infinite)) {
     maxl <- log1p(-sum(exp(likelihoods), na.rm = TRUE))
     likelihoods <- c(likelihoods, maxl)
-    x[x > infinite] <- infinite + 1
   }
   chain_likelihoods <- likelihoods[x]
 

From a7d49c619ebcb256c6184aa1cbf32a63b4c3bdb4 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Tue, 15 Jan 2019 21:11:31 +0000
Subject: [PATCH 007/828] catch machine precision errors

---
 R/likelihoods.R | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index c3e44381..09fc9e31 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -146,7 +146,8 @@ chain_ll <- function(x, offspring, ..., stat=c("size", "length"), infinite = Inf
   }
 
   if (any(x >= infinite)) {
-    maxl <- log1p(-sum(exp(likelihoods), na.rm = TRUE))
+    maxl <-
+      tryCatch(log1p(-sum(exp(likelihoods), na.rm = TRUE)), error=function(e) -Inf)
     likelihoods <- c(likelihoods, maxl)
   }
   chain_likelihoods <- likelihoods[x]

From 31e87ca60b8d6b1a4276c0c896643933a019b9c0 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Tue, 15 Jan 2019 21:11:52 +0000
Subject: [PATCH 008/828] export functions

---
 NAMESPACE    | 2 ++
 R/simulate.r | 1 +
 2 files changed, 3 insertions(+)

diff --git a/NAMESPACE b/NAMESPACE
index 6ae92683..ddaddf44 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -1,2 +1,4 @@
 # Generated by roxygen2: do not edit by hand
 
+export(chain_ll)
+export(chain_sim)
diff --git a/R/simulate.r b/R/simulate.r
index f43e02e8..70d09b39 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -8,6 +8,7 @@
 ##' @param ... parameters of the offspring distribution
 ##' @return a vector of sizes/lengths
 ##' @author Sebastian Funk
+##' @export
 chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf, ...) {
 
     stat <- match.arg(stat)

From 218f7a9165783cae4e074763654715ad00a0c81a Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Tue, 15 Jan 2019 23:38:35 +0000
Subject: [PATCH 009/828] doc update

---
 R/likelihoods.R | 1 +
 man/chain_ll.Rd | 8 ++++----
 2 files changed, 5 insertions(+), 4 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 09fc9e31..3af7537c 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -113,6 +113,7 @@ offspring_ll <- function(x, offspring, stat, n=100, ...) {
 ##' @inheritParams chain_sim
 ##' @seealso pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll geom_length_ll offspring_ll
 ##' @author Sebastian Funk
+##' @export
 chain_ll <- function(x, offspring, ..., stat=c("size", "length"), infinite = Inf, exclude)
 {
   stat <- match.arg(stat)
diff --git a/man/chain_ll.Rd b/man/chain_ll.Rd
index 450be841..b91ebdbb 100644
--- a/man/chain_ll.Rd
+++ b/man/chain_ll.Rd
@@ -4,8 +4,8 @@
 \alias{chain_ll}
 \title{Likelihood for the outcome of a branching process}
 \usage{
-chain_ll(x, offspring, stat = c("size", "length"), infinite = Inf,
-  exclude, ...)
+chain_ll(x, offspring, ..., stat = c("size", "length"), infinite = Inf,
+  exclude)
 }
 \arguments{
 \item{x}{vector of sizes or lengths of transmission chains}
@@ -13,13 +13,13 @@ chain_ll(x, offspring, stat = c("size", "length"), infinite = Inf,
 \item{offspring}{offspring distribution as character string, e.g. "pois" for
 the Poisson offspring distribution.}
 
+\item{...}{parameters for the offspring distribution}
+
 \item{stat}{statistic given as \code{x} ("size" or "length" of chains)}
 
 \item{infinite}{any chains of this size/length will be treated as infinite}
 
 \item{exclude}{any sizes/lengths to exclude from the likelihood calculation}
-
-\item{...}{parameters for the offspring distribution}
 }
 \value{
 likelihood

From 199372a881db3e6575af54aefa0b5024d63b0194 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Tue, 15 Jan 2019 17:07:10 +0000
Subject: [PATCH 010/828] observation probabilities <1

---
 R/likelihoods.R              | 42 +++++++++++++++++++++++++++---------
 R/utils.r                    | 24 +++++++++++++++++++++
 man/chain_ll.Rd              |  6 ++++--
 man/complementary_logprob.Rd | 21 ++++++++++++++++++
 man/rbinom_size.Rd           | 26 ++++++++++++++++++++++
 5 files changed, 107 insertions(+), 12 deletions(-)
 create mode 100644 R/utils.r
 create mode 100644 man/complementary_logprob.Rd
 create mode 100644 man/rbinom_size.Rd

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 3af7537c..4f872d16 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -106,6 +106,8 @@ offspring_ll <- function(x, offspring, stat, n=100, ...) {
 ##'
 ##' @param x vector of sizes or lengths of transmission chains
 ##' @param ... parameters for the offspring distribution
+##' @param obs_prob observation probability (assumed constant)
+##' @param n number of samples for estimating the likelihood if obs_prob < 1
 ##' @param stat statistic given as \code{x} ("size" or "length" of chains)
 ##' @param infinite any chains of this size/length will be treated as infinite
 ##' @param exclude any sizes/lengths to exclude from the likelihood calculation
@@ -114,15 +116,27 @@ offspring_ll <- function(x, offspring, stat, n=100, ...) {
 ##' @seealso pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll geom_length_ll offspring_ll
 ##' @author Sebastian Funk
 ##' @export
-chain_ll <- function(x, offspring, ..., stat=c("size", "length"), infinite = Inf, exclude)
+chain_ll <- function(x, offspring, ..., obs_prob=1, n, stat=c("size", "length"), infinite = Inf, exclude)
 {
   stat <- match.arg(stat)
 
-  if (any(x >= infinite)) {
-    calc_sizes <- seq_len(infinite-1)
+  ## checks
+  if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
+  if (obs_prob < 1) {
+    if (missing(n)) stop("'n' must be specified if 'obs_prob' is <1")
+    sampled_x <- replicate(n, pmin(rbinom_size(length(x), x, obs_prob), infinite))
+    size_x <- unlist(sampled_x)
+    if (!is.finite(infinite)) infinite <- max(size_x) + 1
+  } else {
     x[x >= infinite] <- infinite
+    size_x <- x
+  }
+
+  ## determine for which sizes to calculate the likelihood (for true chain size)
+  if (any(size_x == infinite)) {
+    calc_sizes <- seq_len(infinite-1)
   } else {
-    calc_sizes <- unique(x)
+    calc_sizes <- unique(size_x)
   }
 
   ## first, get likelihood function as given by `offspring` and `stat``
@@ -141,17 +155,25 @@ chain_ll <- function(x, offspring, ..., stat=c("size", "length"), infinite = Inf
               c(list(x=calc_sizes, offspring=offspring, stat=stat), pars))
   }
 
+  ## assign probabilities to infinite outbreak sizes
+  if (any(size_x == infinite)) {
+    likelihoods[infinite] <- complementary_logprob(likelihoods)
+  }
+
   if (!missing(exclude)) {
     likelihoods <- likelihoods - log(-expm1(sum(likelihoods[exclude])))
     likelihoods[exclude] <- -Inf
   }
 
-  if (any(x >= infinite)) {
-    maxl <-
-      tryCatch(log1p(-sum(exp(likelihoods), na.rm = TRUE)), error=function(e) -Inf)
-    likelihoods <- c(likelihoods, maxl)
+  ## adjust for binomial observation probabilities
+  if (obs_prob < 1) {
+    chains_likelihood <- mean(apply(sampled_x, 2, function(sx) {
+      sum(likelihoods[sx])
+    }))
+  } else {
+    chains_likelihood <- sum(likelihoods[x])
   }
-  chain_likelihoods <- likelihoods[x]
 
-  return(sum(chain_likelihoods))
+  return(chains_likelihood)
 }
+
diff --git a/R/utils.r b/R/utils.r
new file mode 100644
index 00000000..6876b962
--- /dev/null
+++ b/R/utils.r
@@ -0,0 +1,24 @@
+##' Calculates the complementary log-probability
+##'
+##' Given x and norm, this calculates log(1-sum(exp(x)))
+##' @param x log-probabilities
+##' @return value
+##' @author Sebastian Funk
+##' @keywords internal
+complementary_logprob <- function(x) {
+    tryCatch(log1p(-sum(exp(x))), error=function(e) -Inf)
+}
+
+##' Samples size (the number of trials) of a binomial distribution
+##'
+##' Samples the size parameter from the binomial distribution with fixed x
+##' (number of sucesses) and p (sucess probability)
+##' @param n number of samples to generate
+##' @param x number of successes
+##' @param prob probability of success
+##' @return a sampled size
+##' @author Sebastian Funk
+##' @keywords internal
+rbinom_size <- function(n, x, prob) {
+    x + rnbinom(n, x, prob) + rnbinom(n, 1, prob)
+}
diff --git a/man/chain_ll.Rd b/man/chain_ll.Rd
index b91ebdbb..0621031d 100644
--- a/man/chain_ll.Rd
+++ b/man/chain_ll.Rd
@@ -4,8 +4,8 @@
 \alias{chain_ll}
 \title{Likelihood for the outcome of a branching process}
 \usage{
-chain_ll(x, offspring, ..., stat = c("size", "length"), infinite = Inf,
-  exclude)
+chain_ll(x, offspring, ..., obs_prob = 1, stat = c("size", "length"),
+  infinite = Inf, exclude)
 }
 \arguments{
 \item{x}{vector of sizes or lengths of transmission chains}
@@ -15,6 +15,8 @@ the Poisson offspring distribution.}
 
 \item{...}{parameters for the offspring distribution}
 
+\item{obs_prob}{observation probability (assumed constant)}
+
 \item{stat}{statistic given as \code{x} ("size" or "length" of chains)}
 
 \item{infinite}{any chains of this size/length will be treated as infinite}
diff --git a/man/complementary_logprob.Rd b/man/complementary_logprob.Rd
new file mode 100644
index 00000000..221bccb0
--- /dev/null
+++ b/man/complementary_logprob.Rd
@@ -0,0 +1,21 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/utils.r
+\name{complementary_logprob}
+\alias{complementary_logprob}
+\title{Calculates the complementary log-probability}
+\usage{
+complementary_logprob(x)
+}
+\arguments{
+\item{x}{log-probabilities}
+}
+\value{
+value
+}
+\description{
+Given x and norm, this calculates log(1-sum(exp(x)))
+}
+\author{
+Sebastian Funk
+}
+\keyword{internal}
diff --git a/man/rbinom_size.Rd b/man/rbinom_size.Rd
new file mode 100644
index 00000000..c50027b4
--- /dev/null
+++ b/man/rbinom_size.Rd
@@ -0,0 +1,26 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/utils.r
+\name{rbinom_size}
+\alias{rbinom_size}
+\title{Samples size (the number of trials) of a binomial distribution}
+\usage{
+rbinom_size(n, x, prob)
+}
+\arguments{
+\item{n}{number of samples to generate}
+
+\item{x}{number of successes}
+
+\item{prob}{probability of success}
+}
+\value{
+a sampled size
+}
+\description{
+Samples the size parameter from the binomial distribution with fixed x
+(number of sucesses) and p (sucess probability)
+}
+\author{
+Sebastian Funk
+}
+\keyword{internal}

From a99bb79f3ededd5b5d20d833a3bf8ca884cd4771 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 08:22:59 +0000
Subject: [PATCH 011/828] update DESCRIPTION

---
 DESCRIPTION | 12 ++++++------
 1 file changed, 6 insertions(+), 6 deletions(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 28fca7e4..21988041 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -1,10 +1,10 @@
-Package: epichains
+Package: bpmodels
 Version: 0.1
-Title: Analysis of transmission chains
+Title: Analysing chain statistics using branching process models
 Authors@R: c(person("Sebastian", "Funk", email = "sebastian.funk@lshtm.ac.uk", role = c("aut", "cre")))
-Description: Performs analysis of chain sizes
+Description: Provides methods to analyse and simulate the size and length of branching processes with an arbitrary offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks, as discussed in Farrington et al. (2003) <doi:10.1093/biostatistics/4.2.279>.
+Imports: matrixStats
 License: GPL-3
-URL: https://github.com/sbfnk/epichains
-BugReports: https://github.com/sbfnk/epichains/issues
-NeedsCompilation: no
+URL: https://github.com/sbfnk/bpmodels
+BugReports: https://github.com/sbfnk/bpmodels
 RoxygenNote: 6.1.1

From 91da15a555354b285b341bae032dd56cb7a01e79 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 08:23:45 +0000
Subject: [PATCH 012/828] update order of chain_ll parameters

---
 R/likelihoods.R | 7 +++----
 1 file changed, 3 insertions(+), 4 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 4f872d16..69d69d6a 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -105,18 +105,17 @@ offspring_ll <- function(x, offspring, stat, n=100, ...) {
 ##' Likelihood for the outcome of a branching process
 ##'
 ##' @param x vector of sizes or lengths of transmission chains
-##' @param ... parameters for the offspring distribution
-##' @param obs_prob observation probability (assumed constant)
-##' @param n number of samples for estimating the likelihood if obs_prob < 1
 ##' @param stat statistic given as \code{x} ("size" or "length" of chains)
+##' @param obs_prob observation probability (assumed constant)
 ##' @param infinite any chains of this size/length will be treated as infinite
 ##' @param exclude any sizes/lengths to exclude from the likelihood calculation
+##' @param ... parameters for the offspring distribution
 ##' @return likelihood
 ##' @inheritParams chain_sim
 ##' @seealso pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll geom_length_ll offspring_ll
 ##' @author Sebastian Funk
 ##' @export
-chain_ll <- function(x, offspring, ..., obs_prob=1, n, stat=c("size", "length"), infinite = Inf, exclude)
+chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1, infinite = Inf, exclude, ...)
 {
   stat <- match.arg(stat)
 

From 293e2b111496935727f5f1e5d19586d8d90bf7f9 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 08:23:59 +0000
Subject: [PATCH 013/828] update documentation (internals and examples)

---
 DESCRIPTION     | 2 +-
 R/likelihoods.R | 9 +++++++++
 2 files changed, 10 insertions(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 21988041..164e4615 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -1,5 +1,5 @@
 Package: bpmodels
-Version: 0.1
+Version: 0.1.0
 Title: Analysing chain statistics using branching process models
 Authors@R: c(person("Sebastian", "Funk", email = "sebastian.funk@lshtm.ac.uk", role = c("aut", "cre")))
 Description: Provides methods to analyse and simulate the size and length of branching processes with an arbitrary offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks, as discussed in Farrington et al. (2003) <doi:10.1093/biostatistics/4.2.279>.
diff --git a/R/likelihoods.R b/R/likelihoods.R
index 69d69d6a..ee5b8489 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -4,6 +4,7 @@
 ##' @param lambda rate of the Poisson distributino
 ##' @return log-likelihood values
 ##' @author Sebastian Funk
+##' @keywords internal
 pois_size_ll <- function(x, lambda)
 {
   (x - 1) * log(lambda) - lambda * x + (x - 2) * log(x) - lgamma(x)
@@ -17,6 +18,7 @@ pois_size_ll <- function(x, lambda)
 ##' @param mu mean parameter
 ##' @return log-likelihood values
 ##' @author Sebastian Funk
+##' @keywords internal
 nbinom_size_ll <- function(x, size, prob, mu)
 {
   if (!missing(prob)) {
@@ -36,6 +38,7 @@ nbinom_size_ll <- function(x, size, prob, mu)
 ##' @param mu mean parameter
 ##' @return log-likelihood values
 ##' @author Sebastian Funk
+##' @keywords internal
 gborel_size_ll <- function(x, size, prob, mu) {
   if (!missing(prob)) {
     if (!missing(mu)) stop("'prob' and 'mu' both specified")
@@ -51,6 +54,7 @@ gborel_size_ll <- function(x, size, prob, mu) {
 ##' @param lambda rate of the Poisson distributino
 ##' @return log-likelihood values
 ##' @author Sebastian Funk
+##' @keywords internal
 pois_length_ll <- function(x, lambda) {
 
   ## iterated exponential function
@@ -69,6 +73,7 @@ pois_length_ll <- function(x, lambda) {
 ##' @param prob probability of the geometric distribution with mean \code{1/prob}
 ##' @return log-likelihood values
 ##' @author Sebastian Funk
+##' @keywords internal
 geom_length_ll <- function(x, prob) {
 
   lambda <- 1 / prob
@@ -90,6 +95,7 @@ geom_length_ll <- function(x, prob) {
 ##' @author Sebastian Funk
 ##' @inheritParams chain_ll
 ##' @inheritParams chain_sim
+##' @keywords internal
 offspring_ll <- function(x, offspring, stat, n=100, ...) {
 
   dist <- chain_sim(n, offspring, stat, ...)
@@ -115,6 +121,9 @@ offspring_ll <- function(x, offspring, stat, n=100, ...) {
 ##' @seealso pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll geom_length_ll offspring_ll
 ##' @author Sebastian Funk
 ##' @export
+##' @examples
+##' chain_sizes <- c(1,1,4,7) # example of observed chain sizes
+##' chain_ll(chain_sizes, "pois", "size", lambda=0.5)
 chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1, infinite = Inf, exclude, ...)
 {
   stat <- match.arg(stat)

From dc7d0a1e3732a03a332d23e8fa566f81dba77787 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 09:21:33 +0000
Subject: [PATCH 014/828] chain_ll: update parameters

---
 R/likelihoods.R | 19 +++++++++++--------
 1 file changed, 11 insertions(+), 8 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index ee5b8489..12dca021 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -90,15 +90,16 @@ geom_length_ll <- function(x, prob) {
 ##'   chains by linearly approximating any missing values in the empirical
 ##'   cumulative distribution function (ecdf).
 ##' @param x vector of sizes
+##' @param nsim_offspring number of simulations of the offspring distribution for approximation the size/length distribution
 ##' @param ... any paramaters to pass to \code{\link{chain_sim}}
 ##' @return log-likelihood values
 ##' @author Sebastian Funk
 ##' @inheritParams chain_ll
 ##' @inheritParams chain_sim
 ##' @keywords internal
-offspring_ll <- function(x, offspring, stat, n=100, ...) {
+offspring_ll <- function(x, offspring, stat, nsim_offspring=100, ...) {
 
-  dist <- chain_sim(n, offspring, stat, ...)
+  dist <- chain_sim(nsim_offspring, offspring, stat, ...)
 
   ## linear approximation
   f <- ecdf(dist)
@@ -115,6 +116,7 @@ offspring_ll <- function(x, offspring, stat, n=100, ...) {
 ##' @param obs_prob observation probability (assumed constant)
 ##' @param infinite any chains of this size/length will be treated as infinite
 ##' @param exclude any sizes/lengths to exclude from the likelihood calculation
+##' @param nsim_obs number of simulations if the likelihood is to be approximated for imperfect observations
 ##' @param ... parameters for the offspring distribution
 ##' @return likelihood
 ##' @inheritParams chain_sim
@@ -124,15 +126,15 @@ offspring_ll <- function(x, offspring, stat, n=100, ...) {
 ##' @examples
 ##' chain_sizes <- c(1,1,4,7) # example of observed chain sizes
 ##' chain_ll(chain_sizes, "pois", "size", lambda=0.5)
-chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1, infinite = Inf, exclude, ...)
+chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1, infinite = Inf, exclude, nsim_obs, ...)
 {
   stat <- match.arg(stat)
 
   ## checks
   if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
   if (obs_prob < 1) {
-    if (missing(n)) stop("'n' must be specified if 'obs_prob' is <1")
-    sampled_x <- replicate(n, pmin(rbinom_size(length(x), x, obs_prob), infinite))
+    if (missing(nsim_obs)) stop("'nsim_obs' must be specified if 'obs_prob' is <1")
+    sampled_x <- replicate(nsim_obs, pmin(rbinom_size(length(x), x, obs_prob), infinite))
     size_x <- unlist(sampled_x)
     if (!is.finite(infinite)) infinite <- max(size_x) + 1
   } else {
@@ -160,7 +162,8 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1, infinit
   } else {
     likelihoods[calc_sizes] <-
       do.call(offspring_ll,
-              c(list(x=calc_sizes, offspring=offspring, stat=stat), pars))
+              c(list(x=calc_sizes, offspring=offspring,
+                     stat=stat, infinite=infinite), pars))
   }
 
   ## assign probabilities to infinite outbreak sizes
@@ -175,9 +178,9 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1, infinit
 
   ## adjust for binomial observation probabilities
   if (obs_prob < 1) {
-    chains_likelihood <- mean(apply(sampled_x, 2, function(sx) {
+    chains_likelihood <- apply(sampled_x, 2, function(sx) {
       sum(likelihoods[sx])
-    }))
+    })
   } else {
     chains_likelihood <- sum(likelihoods[x])
   }

From 44efdd630dcf48af4517fa650089c35ba6849759 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 09:22:10 +0000
Subject: [PATCH 015/828] Roxygen documentation update

---
 man/chain_ll.Rd       | 16 +++++++++++-----
 man/gborel_size_ll.Rd |  1 +
 man/geom_length_ll.Rd |  1 +
 man/nbinom_size_ll.Rd |  1 +
 man/offspring_ll.Rd   |  5 +++--
 man/pois_length_ll.Rd |  1 +
 man/pois_size_ll.Rd   |  1 +
 7 files changed, 19 insertions(+), 7 deletions(-)

diff --git a/man/chain_ll.Rd b/man/chain_ll.Rd
index 0621031d..71f39d7e 100644
--- a/man/chain_ll.Rd
+++ b/man/chain_ll.Rd
@@ -4,8 +4,8 @@
 \alias{chain_ll}
 \title{Likelihood for the outcome of a branching process}
 \usage{
-chain_ll(x, offspring, ..., obs_prob = 1, stat = c("size", "length"),
-  infinite = Inf, exclude)
+chain_ll(x, offspring, stat = c("size", "length"), obs_prob = 1,
+  infinite = Inf, exclude, nsim_obs, ...)
 }
 \arguments{
 \item{x}{vector of sizes or lengths of transmission chains}
@@ -13,15 +13,17 @@ chain_ll(x, offspring, ..., obs_prob = 1, stat = c("size", "length"),
 \item{offspring}{offspring distribution as character string, e.g. "pois" for
 the Poisson offspring distribution.}
 
-\item{...}{parameters for the offspring distribution}
+\item{stat}{statistic given as \code{x} ("size" or "length" of chains)}
 
 \item{obs_prob}{observation probability (assumed constant)}
 
-\item{stat}{statistic given as \code{x} ("size" or "length" of chains)}
-
 \item{infinite}{any chains of this size/length will be treated as infinite}
 
 \item{exclude}{any sizes/lengths to exclude from the likelihood calculation}
+
+\item{nsim_obs}{number of simulations if the likelihood is to be approximated for imperfect observations}
+
+\item{...}{parameters for the offspring distribution}
 }
 \value{
 likelihood
@@ -29,6 +31,10 @@ likelihood
 \description{
 Likelihood for the outcome of a branching process
 }
+\examples{
+chain_sizes <- c(1,1,4,7) # example of observed chain sizes
+chain_ll(chain_sizes, "pois", "size", lambda=0.5)
+}
 \seealso{
 pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll geom_length_ll offspring_ll
 }
diff --git a/man/gborel_size_ll.Rd b/man/gborel_size_ll.Rd
index 1e6c2fc4..13ee9646 100644
--- a/man/gborel_size_ll.Rd
+++ b/man/gborel_size_ll.Rd
@@ -24,3 +24,4 @@ Likelihood of the size of chains with gamma-Borel offspring distribution
 \author{
 Sebastian Funk
 }
+\keyword{internal}
diff --git a/man/geom_length_ll.Rd b/man/geom_length_ll.Rd
index 428bd355..98015fe7 100644
--- a/man/geom_length_ll.Rd
+++ b/man/geom_length_ll.Rd
@@ -20,3 +20,4 @@ Likelihood of the length of chains with geometric offspring distribution
 \author{
 Sebastian Funk
 }
+\keyword{internal}
diff --git a/man/nbinom_size_ll.Rd b/man/nbinom_size_ll.Rd
index 4d58ee7f..974b5916 100644
--- a/man/nbinom_size_ll.Rd
+++ b/man/nbinom_size_ll.Rd
@@ -24,3 +24,4 @@ Likelihood of the size of chains with Negative-Binomial offspring distribution
 \author{
 Sebastian Funk
 }
+\keyword{internal}
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index aab921f4..d2cd9b8f 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -4,7 +4,7 @@
 \alias{offspring_ll}
 \title{Likelihood of the length of chains with generic offspring distribution}
 \usage{
-offspring_ll(x, offspring, stat, n = 100, ...)
+offspring_ll(x, offspring, stat, nsim_offspring = 100, ...)
 }
 \arguments{
 \item{x}{vector of sizes}
@@ -14,7 +14,7 @@ the Poisson offspring distribution.}
 
 \item{stat}{statistic given as \code{x} ("size" or "length" of chains)}
 
-\item{n}{number of simulations to run.}
+\item{nsim_offspring}{number of simulations of the offspring distribution for approximation the size/length distribution}
 
 \item{...}{any paramaters to pass to \code{\link{chain_sim}}}
 }
@@ -29,3 +29,4 @@ The likelihoods are calculated with a crude approximation using simulated
 \author{
 Sebastian Funk
 }
+\keyword{internal}
diff --git a/man/pois_length_ll.Rd b/man/pois_length_ll.Rd
index 4d80bda1..8bcf37d4 100644
--- a/man/pois_length_ll.Rd
+++ b/man/pois_length_ll.Rd
@@ -20,3 +20,4 @@ Likelihood of the length of chains with Poisson offspring distribution
 \author{
 Sebastian Funk
 }
+\keyword{internal}
diff --git a/man/pois_size_ll.Rd b/man/pois_size_ll.Rd
index c5c0bd28..19163265 100644
--- a/man/pois_size_ll.Rd
+++ b/man/pois_size_ll.Rd
@@ -20,3 +20,4 @@ Likelihood of the size of chains with Poisson offspring distribution
 \author{
 Sebastian Funk
 }
+\keyword{internal}

From 54f60997c644cb0a0e030152803965d410f36cca Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 09:22:21 +0000
Subject: [PATCH 016/828] tests

---
 tests/testthat.R             |  4 ++++
 tests/testthat/tests-borel.r | 15 +++++++++++++++
 tests/testthat/tests-ll.r    | 33 +++++++++++++++++++++++++++++++++
 tests/testthat/tests-sim.r   | 15 +++++++++++++++
 4 files changed, 67 insertions(+)
 create mode 100644 tests/testthat.R
 create mode 100644 tests/testthat/tests-borel.r
 create mode 100644 tests/testthat/tests-ll.r
 create mode 100644 tests/testthat/tests-sim.r

diff --git a/tests/testthat.R b/tests/testthat.R
new file mode 100644
index 00000000..b9a1b439
--- /dev/null
+++ b/tests/testthat.R
@@ -0,0 +1,4 @@
+library(testthat)
+library(bpmodels)
+
+test_check("bpmodels")
diff --git a/tests/testthat/tests-borel.r b/tests/testthat/tests-borel.r
new file mode 100644
index 00000000..266997e3
--- /dev/null
+++ b/tests/testthat/tests-borel.r
@@ -0,0 +1,15 @@
+context("The Borel distribution is implemented")
+
+test_that("We can calculate probabilities and sample",
+{
+    expect_gt(dborel(1, 0.5), 0)
+    expect_equal(dborel(1, 0.5, log=TRUE), -0.5)
+    expect_length(rborel(2, 0.9), 2)
+})
+
+test_that("Errors are thrown",
+{
+    expect_error(dborel(0, 0.5), "greater than 0")
+})
+
+
diff --git a/tests/testthat/tests-ll.r b/tests/testthat/tests-ll.r
new file mode 100644
index 00000000..f1dfdfea
--- /dev/null
+++ b/tests/testthat/tests-ll.r
@@ -0,0 +1,33 @@
+context("Calculating the likelihood from a branching process model")
+
+chains <- c(1,1,4,7)
+
+test_that("Likelihoods can be calculated",
+{
+    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5), 0)
+    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5, exclude=1), 0)
+    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5, infinite = 5), 0)
+    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5, obs_prob = 0.5, nsim_obs=1), 0)
+    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5, infinite = 5, obs_prob = 0.5, nsim_obs=1), 0)
+    expect_lt(chain_ll(chains, "binom", "size", size=1, prob=0.5), 0)
+})
+
+test_that("Analytical size/length distributions are implemented",
+{
+    expect_true(all(pois_size_ll(chains, lambda=0.5) < 0))
+    expect_true(all(nbinom_size_ll(chains, mu=0.5, size=0.2) < 0))
+    expect_true(all(nbinom_size_ll(chains, prob=0.5, size=0.2) < 0))
+    expect_true(all(gborel_size_ll(chains, prob=0.5, size=0.2) < 0))
+    expect_true(all(gborel_size_ll(chains, prob=0.5, size=0.2) < 0))
+    expect_true(all(pois_length_ll(chains, lambda=0.5) < 0))
+    expect_true(all(geom_length_ll(chains, prob=0.5) < 0))
+})
+
+test_that("Errors are thrown",
+{
+    expect_error(chain_ll(chain_sizes, "pois", "size", lambda=0.5, obs_prob = 3), "must be within")
+    expect_error(chain_ll(chain_sizes, "pois", "size", lambda=0.5, obs_prob = 0.5), "must be specified")
+    expect_error(nbinom_size_ll(chains, mu=0.5, size=0.2, prob=0.1), "both specified")
+    expect_error(gborel_size_ll(chains, mu=0.5, size=0.2, prob=0.1), "both specified")
+    expect_error(chain_sim(n=2, "test"), "is not a function")
+})
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
new file mode 100644
index 00000000..8cb7795a
--- /dev/null
+++ b/tests/testthat/tests-sim.r
@@ -0,0 +1,15 @@
+context("Simulating from a branching process model")
+
+test_that("Chains can be simulated",
+{
+    expect_length(chain_sim(n=2, "pois", lambda=0.5), 2)
+    expect_length(chain_sim(n=2, "pois", "length", lambda=0.5), 2)
+    expect_false(any(is.finite(chain_sim(n=2, "pois", "length", lambda=0.5, infinite=1))))
+})
+
+test_that("Errors are thrown",
+{
+    rtest <- 0
+    expect_error(chain_sim(n=2, "dummy"), "does not exist")
+    expect_error(chain_sim(n=2, "test"), "is not a function")
+})

From 7ab18338ac78c8b192d114f834b0c5db2c494a57 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 09:22:27 +0000
Subject: [PATCH 017/828] vignette

---
 vignettes/introduction.Rmd | 73 ++++++++++++++++++++++++++++++++++++++
 1 file changed, 73 insertions(+)
 create mode 100644 vignettes/introduction.Rmd

diff --git a/vignettes/introduction.Rmd b/vignettes/introduction.Rmd
new file mode 100644
index 00000000..24abe368
--- /dev/null
+++ b/vignettes/introduction.Rmd
@@ -0,0 +1,73 @@
+---
+title: "Analysing chain statistics using branching process models"
+author: "Sebastian Funk"
+date: "`r Sys.Date()`"
+output: rmarkdown::html_vignette
+vignette: >
+  %\VignetteIndexEntry{Analysing chain statistics using branching process models}
+  %\VignetteEngine{knitr::rmarkdown}
+  %\VignetteEncoding{UTF-8}
+---
+
+```{r setup, include = FALSE}
+library('knitr')
+knitr::opts_chunk$set(
+  collapse = TRUE,
+  comment = "#>"
+)
+```
+
+[bpmodels](https://github.com/sbfnk/bpmodels) is an `R` package to analyse and simulate the size and length of branching processes with an arbitrary offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks.
+
+# Usage
+
+To load the package, use
+```{r eval=FALSE}
+library('bpmodels')
+```
+```{r echo=FALSE}
+suppressWarnings(library('bpmodels'))
+```
+
+At the heart of the `bpmodels` package are the `chains_ll` and `chains_sim` functions. The `chains_ll` function calculates the log-likelihood of a distribution of chain sizes or lengths given an offspring distribution and associated parameters. For example, to get the log-likelihood for a given observed distribution of chain sizes assuming a mean number of 0.5 Poisson-distributed offspring per generation, use
+
+```{r}
+chain_sizes <- c(1,1,4,7) # example of observed chain sizes
+chain_ll(chain_sizes, "pois", "size", lambda=0.5)
+```
+
+The first argument of `chain_ll` is the size (or length) distribution to analyse. The second argument (called `offspring`) specifies the offspring distribution. This is given as a character string that refers to the function used to generate random offspring. It can be any probability distribution implemented in R, that is, one that has a corresponding function for generating random numbers beginning with the letter `r`. In the case of the example above, since random Poisson numbers are generated in R using a function called `rpois`, "pois" is the corresponding string to pass to the `offspring` argument.
+
+The third argument (called `stat`) determines whether to analyse chain sizes ("size", the default if this argument is not specified) or lengths ("length"). Lastly, any named arguments not recognised by `chain_ll` are interpreted as parameters of the corresponding probability distribution, here `lambda=0.5` as the mean of the Poisson distribution (see the R help page for the Poisson distribution for more information).
+
+You can use the `R` help to find out about usage of the `chains_ll` function,
+
+```{r eval=FALSE}
+?chains_ll
+```
+
+To simulate from a branching process, use the `chain_sim` function, which follows the same syntax as the `chain_ll` function:
+
+```{r}
+chain_sim(n=5, "pois", "size", lambda=0.5)
+```
+
+# Methodology
+
+If the probability distribution of chain sizes or lengths has an analytical solution, this will be used (size distribution: Poisson and negative binomial; length distribution: Poisson and geometric). If not, simulations are used to approximate this probability distributions (using a linear approximation to the cumulative distribution for unobserved sizes/lengths), requiring an additional parameter `nsim_offspring` for the number of simulations to be used for this approximation.
+
+# Imperfect observations
+
+The `chain_ll` function has an `obs_prob` parameter that can be used to determine the likelihood if observations are imperfect. This only works when analysing chain sizes (`stat="size"`). In that case, true chain sizes are simulated repeatedly (the number of times given by the `nsim_obs` argument) and the likelihood calculated for each of these simulations. For example, if the probability of observing each case is 30%, use
+
+```{r}
+ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda=0.5, nsim_obs=10)
+summary(ll)
+```
+
+This returns `nsim_obs=10` likelihood values which can be averaged to come up with an overall likelihood estimate.
+
+# References
+
+* Farrington, C.P., Kanaan, M.N. and Gay, N.J. (2003). [Branching process models for surveillance of infectious diseases controlled by mass vaccination](https://doi.org/10.1093/biostatistics/4.2.279).
+* Blumberg, S. and Lloyd-Smith, J.O. (2013). [Comparing methods for estimating R0 from the size distribution of subcritical transmission chains](https://doi.org/10.1016/j.epidem.2013.05.002).

From 99f1adad871ee24832676fb79a23d4c06bbb777e Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 09:22:36 +0000
Subject: [PATCH 018/828] update package structure

---
 .Rbuildignore       |   4 +
 .gitignore          |  10 +-
 .travis.yml         |   5 +
 CODE_OF_CONDUCT.md  |  25 ++
 DESCRIPTION         |   5 +
 LICENSE             | 674 --------------------------------------------
 NEWS.md             |   3 +
 README.md           |  11 +-
 appveyor.yml        |  41 +++
 man/offspring_ll.Rd |   3 +-
 10 files changed, 96 insertions(+), 685 deletions(-)
 create mode 100644 .Rbuildignore
 create mode 100644 .travis.yml
 create mode 100644 CODE_OF_CONDUCT.md
 delete mode 100644 LICENSE
 create mode 100644 NEWS.md
 create mode 100644 appveyor.yml

diff --git a/.Rbuildignore b/.Rbuildignore
new file mode 100644
index 00000000..f0fea78d
--- /dev/null
+++ b/.Rbuildignore
@@ -0,0 +1,4 @@
+^CODE_OF_CONDUCT\.md$
+^appveyor\.yml$
+^\.travis\.yml$
+cran-comments.md
diff --git a/.gitignore b/.gitignore
index 26fad6fa..57133000 100644
--- a/.gitignore
+++ b/.gitignore
@@ -1,36 +1,28 @@
+inst/doc
 # History files
 .Rhistory
 .Rapp.history
 
 # Session Data files
 .RData
-
 # Example code in package build process
 *-Ex.R
-
 # Output files from R CMD build
 /*.tar.gz
-
 # Output files from R CMD check
 /*.Rcheck/
-
 # RStudio files
 .Rproj.user/
-
 # produced vignettes
 vignettes/*.html
 vignettes/*.pdf
-
 # OAuth2 token, see https://github.com/hadley/httr/releases/tag/v0.3
 .httr-oauth
-
 # knitr and R markdown default cache directories
 /*_cache/
 /cache/
-
 # Temporary files created by R markdown
 *.utf8.md
 *.knit.md
-
 # Shiny token, see https://shiny.rstudio.com/articles/shinyapps.html
 rsconnect/
diff --git a/.travis.yml b/.travis.yml
new file mode 100644
index 00000000..8d139ac6
--- /dev/null
+++ b/.travis.yml
@@ -0,0 +1,5 @@
+# R for travis: see documentation at https://docs.travis-ci.com/user/languages/r
+
+language: R
+sudo: false
+cache: packages
diff --git a/CODE_OF_CONDUCT.md b/CODE_OF_CONDUCT.md
new file mode 100644
index 00000000..24aa0a3c
--- /dev/null
+++ b/CODE_OF_CONDUCT.md
@@ -0,0 +1,25 @@
+# Contributor Code of Conduct
+
+As contributors and maintainers of this project, we pledge to respect all people who 
+contribute through reporting issues, posting feature requests, updating documentation,
+submitting pull requests or patches, and other activities.
+
+We are committed to making participation in this project a harassment-free experience for
+everyone, regardless of level of experience, gender, gender identity and expression,
+sexual orientation, disability, personal appearance, body size, race, ethnicity, age, or religion.
+
+Examples of unacceptable behavior by participants include the use of sexual language or
+imagery, derogatory comments or personal attacks, trolling, public or private harassment,
+insults, or other unprofessional conduct.
+
+Project maintainers have the right and responsibility to remove, edit, or reject comments,
+commits, code, wiki edits, issues, and other contributions that are not aligned to this 
+Code of Conduct. Project maintainers who do not follow the Code of Conduct may be removed 
+from the project team.
+
+Instances of abusive, harassing, or otherwise unacceptable behavior may be reported by 
+opening an issue or contacting one or more of the project maintainers.
+
+This Code of Conduct is adapted from the Contributor Covenant 
+(http://contributor-covenant.org), version 1.0.0, available at 
+http://contributor-covenant.org/version/1/0/0/
diff --git a/DESCRIPTION b/DESCRIPTION
index 164e4615..16c0fb0a 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -4,7 +4,12 @@ Title: Analysing chain statistics using branching process models
 Authors@R: c(person("Sebastian", "Funk", email = "sebastian.funk@lshtm.ac.uk", role = c("aut", "cre")))
 Description: Provides methods to analyse and simulate the size and length of branching processes with an arbitrary offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks, as discussed in Farrington et al. (2003) <doi:10.1093/biostatistics/4.2.279>.
 Imports: matrixStats
+Suggests: 
+    testthat,
+    knitr,
+    rmarkdown
 License: GPL-3
 URL: https://github.com/sbfnk/bpmodels
 BugReports: https://github.com/sbfnk/bpmodels
 RoxygenNote: 6.1.1
+VignetteBuilder: knitr
diff --git a/LICENSE b/LICENSE
deleted file mode 100644
index f288702d..00000000
--- a/LICENSE
+++ /dev/null
@@ -1,674 +0,0 @@
-                    GNU GENERAL PUBLIC LICENSE
-                       Version 3, 29 June 2007
-
- Copyright (C) 2007 Free Software Foundation, Inc. <https://fsf.org/>
- Everyone is permitted to copy and distribute verbatim copies
- of this license document, but changing it is not allowed.
-
-                            Preamble
-
-  The GNU General Public License is a free, copyleft license for
-software and other kinds of works.
-
-  The licenses for most software and other practical works are designed
-to take away your freedom to share and change the works.  By contrast,
-the GNU General Public License is intended to guarantee your freedom to
-share and change all versions of a program--to make sure it remains free
-software for all its users.  We, the Free Software Foundation, use the
-GNU General Public License for most of our software; it applies also to
-any other work released this way by its authors.  You can apply it to
-your programs, too.
-
-  When we speak of free software, we are referring to freedom, not
-price.  Our General Public Licenses are designed to make sure that you
-have the freedom to distribute copies of free software (and charge for
-them if you wish), that you receive source code or can get it if you
-want it, that you can change the software or use pieces of it in new
-free programs, and that you know you can do these things.
-
-  To protect your rights, we need to prevent others from denying you
-these rights or asking you to surrender the rights.  Therefore, you have
-certain responsibilities if you distribute copies of the software, or if
-you modify it: responsibilities to respect the freedom of others.
-
-  For example, if you distribute copies of such a program, whether
-gratis or for a fee, you must pass on to the recipients the same
-freedoms that you received.  You must make sure that they, too, receive
-or can get the source code.  And you must show them these terms so they
-know their rights.
-
-  Developers that use the GNU GPL protect your rights with two steps:
-(1) assert copyright on the software, and (2) offer you this License
-giving you legal permission to copy, distribute and/or modify it.
-
-  For the developers' and authors' protection, the GPL clearly explains
-that there is no warranty for this free software.  For both users' and
-authors' sake, the GPL requires that modified versions be marked as
-changed, so that their problems will not be attributed erroneously to
-authors of previous versions.
-
-  Some devices are designed to deny users access to install or run
-modified versions of the software inside them, although the manufacturer
-can do so.  This is fundamentally incompatible with the aim of
-protecting users' freedom to change the software.  The systematic
-pattern of such abuse occurs in the area of products for individuals to
-use, which is precisely where it is most unacceptable.  Therefore, we
-have designed this version of the GPL to prohibit the practice for those
-products.  If such problems arise substantially in other domains, we
-stand ready to extend this provision to those domains in future versions
-of the GPL, as needed to protect the freedom of users.
-
-  Finally, every program is threatened constantly by software patents.
-States should not allow patents to restrict development and use of
-software on general-purpose computers, but in those that do, we wish to
-avoid the special danger that patents applied to a free program could
-make it effectively proprietary.  To prevent this, the GPL assures that
-patents cannot be used to render the program non-free.
-
-  The precise terms and conditions for copying, distribution and
-modification follow.
-
-                       TERMS AND CONDITIONS
-
-  0. Definitions.
-
-  "This License" refers to version 3 of the GNU General Public License.
-
-  "Copyright" also means copyright-like laws that apply to other kinds of
-works, such as semiconductor masks.
-
-  "The Program" refers to any copyrightable work licensed under this
-License.  Each licensee is addressed as "you".  "Licensees" and
-"recipients" may be individuals or organizations.
-
-  To "modify" a work means to copy from or adapt all or part of the work
-in a fashion requiring copyright permission, other than the making of an
-exact copy.  The resulting work is called a "modified version" of the
-earlier work or a work "based on" the earlier work.
-
-  A "covered work" means either the unmodified Program or a work based
-on the Program.
-
-  To "propagate" a work means to do anything with it that, without
-permission, would make you directly or secondarily liable for
-infringement under applicable copyright law, except executing it on a
-computer or modifying a private copy.  Propagation includes copying,
-distribution (with or without modification), making available to the
-public, and in some countries other activities as well.
-
-  To "convey" a work means any kind of propagation that enables other
-parties to make or receive copies.  Mere interaction with a user through
-a computer network, with no transfer of a copy, is not conveying.
-
-  An interactive user interface displays "Appropriate Legal Notices"
-to the extent that it includes a convenient and prominently visible
-feature that (1) displays an appropriate copyright notice, and (2)
-tells the user that there is no warranty for the work (except to the
-extent that warranties are provided), that licensees may convey the
-work under this License, and how to view a copy of this License.  If
-the interface presents a list of user commands or options, such as a
-menu, a prominent item in the list meets this criterion.
-
-  1. Source Code.
-
-  The "source code" for a work means the preferred form of the work
-for making modifications to it.  "Object code" means any non-source
-form of a work.
-
-  A "Standard Interface" means an interface that either is an official
-standard defined by a recognized standards body, or, in the case of
-interfaces specified for a particular programming language, one that
-is widely used among developers working in that language.
-
-  The "System Libraries" of an executable work include anything, other
-than the work as a whole, that (a) is included in the normal form of
-packaging a Major Component, but which is not part of that Major
-Component, and (b) serves only to enable use of the work with that
-Major Component, or to implement a Standard Interface for which an
-implementation is available to the public in source code form.  A
-"Major Component", in this context, means a major essential component
-(kernel, window system, and so on) of the specific operating system
-(if any) on which the executable work runs, or a compiler used to
-produce the work, or an object code interpreter used to run it.
-
-  The "Corresponding Source" for a work in object code form means all
-the source code needed to generate, install, and (for an executable
-work) run the object code and to modify the work, including scripts to
-control those activities.  However, it does not include the work's
-System Libraries, or general-purpose tools or generally available free
-programs which are used unmodified in performing those activities but
-which are not part of the work.  For example, Corresponding Source
-includes interface definition files associated with source files for
-the work, and the source code for shared libraries and dynamically
-linked subprograms that the work is specifically designed to require,
-such as by intimate data communication or control flow between those
-subprograms and other parts of the work.
-
-  The Corresponding Source need not include anything that users
-can regenerate automatically from other parts of the Corresponding
-Source.
-
-  The Corresponding Source for a work in source code form is that
-same work.
-
-  2. Basic Permissions.
-
-  All rights granted under this License are granted for the term of
-copyright on the Program, and are irrevocable provided the stated
-conditions are met.  This License explicitly affirms your unlimited
-permission to run the unmodified Program.  The output from running a
-covered work is covered by this License only if the output, given its
-content, constitutes a covered work.  This License acknowledges your
-rights of fair use or other equivalent, as provided by copyright law.
-
-  You may make, run and propagate covered works that you do not
-convey, without conditions so long as your license otherwise remains
-in force.  You may convey covered works to others for the sole purpose
-of having them make modifications exclusively for you, or provide you
-with facilities for running those works, provided that you comply with
-the terms of this License in conveying all material for which you do
-not control copyright.  Those thus making or running the covered works
-for you must do so exclusively on your behalf, under your direction
-and control, on terms that prohibit them from making any copies of
-your copyrighted material outside their relationship with you.
-
-  Conveying under any other circumstances is permitted solely under
-the conditions stated below.  Sublicensing is not allowed; section 10
-makes it unnecessary.
-
-  3. Protecting Users' Legal Rights From Anti-Circumvention Law.
-
-  No covered work shall be deemed part of an effective technological
-measure under any applicable law fulfilling obligations under article
-11 of the WIPO copyright treaty adopted on 20 December 1996, or
-similar laws prohibiting or restricting circumvention of such
-measures.
-
-  When you convey a covered work, you waive any legal power to forbid
-circumvention of technological measures to the extent such circumvention
-is effected by exercising rights under this License with respect to
-the covered work, and you disclaim any intention to limit operation or
-modification of the work as a means of enforcing, against the work's
-users, your or third parties' legal rights to forbid circumvention of
-technological measures.
-
-  4. Conveying Verbatim Copies.
-
-  You may convey verbatim copies of the Program's source code as you
-receive it, in any medium, provided that you conspicuously and
-appropriately publish on each copy an appropriate copyright notice;
-keep intact all notices stating that this License and any
-non-permissive terms added in accord with section 7 apply to the code;
-keep intact all notices of the absence of any warranty; and give all
-recipients a copy of this License along with the Program.
-
-  You may charge any price or no price for each copy that you convey,
-and you may offer support or warranty protection for a fee.
-
-  5. Conveying Modified Source Versions.
-
-  You may convey a work based on the Program, or the modifications to
-produce it from the Program, in the form of source code under the
-terms of section 4, provided that you also meet all of these conditions:
-
-    a) The work must carry prominent notices stating that you modified
-    it, and giving a relevant date.
-
-    b) The work must carry prominent notices stating that it is
-    released under this License and any conditions added under section
-    7.  This requirement modifies the requirement in section 4 to
-    "keep intact all notices".
-
-    c) You must license the entire work, as a whole, under this
-    License to anyone who comes into possession of a copy.  This
-    License will therefore apply, along with any applicable section 7
-    additional terms, to the whole of the work, and all its parts,
-    regardless of how they are packaged.  This License gives no
-    permission to license the work in any other way, but it does not
-    invalidate such permission if you have separately received it.
-
-    d) If the work has interactive user interfaces, each must display
-    Appropriate Legal Notices; however, if the Program has interactive
-    interfaces that do not display Appropriate Legal Notices, your
-    work need not make them do so.
-
-  A compilation of a covered work with other separate and independent
-works, which are not by their nature extensions of the covered work,
-and which are not combined with it such as to form a larger program,
-in or on a volume of a storage or distribution medium, is called an
-"aggregate" if the compilation and its resulting copyright are not
-used to limit the access or legal rights of the compilation's users
-beyond what the individual works permit.  Inclusion of a covered work
-in an aggregate does not cause this License to apply to the other
-parts of the aggregate.
-
-  6. Conveying Non-Source Forms.
-
-  You may convey a covered work in object code form under the terms
-of sections 4 and 5, provided that you also convey the
-machine-readable Corresponding Source under the terms of this License,
-in one of these ways:
-
-    a) Convey the object code in, or embodied in, a physical product
-    (including a physical distribution medium), accompanied by the
-    Corresponding Source fixed on a durable physical medium
-    customarily used for software interchange.
-
-    b) Convey the object code in, or embodied in, a physical product
-    (including a physical distribution medium), accompanied by a
-    written offer, valid for at least three years and valid for as
-    long as you offer spare parts or customer support for that product
-    model, to give anyone who possesses the object code either (1) a
-    copy of the Corresponding Source for all the software in the
-    product that is covered by this License, on a durable physical
-    medium customarily used for software interchange, for a price no
-    more than your reasonable cost of physically performing this
-    conveying of source, or (2) access to copy the
-    Corresponding Source from a network server at no charge.
-
-    c) Convey individual copies of the object code with a copy of the
-    written offer to provide the Corresponding Source.  This
-    alternative is allowed only occasionally and noncommercially, and
-    only if you received the object code with such an offer, in accord
-    with subsection 6b.
-
-    d) Convey the object code by offering access from a designated
-    place (gratis or for a charge), and offer equivalent access to the
-    Corresponding Source in the same way through the same place at no
-    further charge.  You need not require recipients to copy the
-    Corresponding Source along with the object code.  If the place to
-    copy the object code is a network server, the Corresponding Source
-    may be on a different server (operated by you or a third party)
-    that supports equivalent copying facilities, provided you maintain
-    clear directions next to the object code saying where to find the
-    Corresponding Source.  Regardless of what server hosts the
-    Corresponding Source, you remain obligated to ensure that it is
-    available for as long as needed to satisfy these requirements.
-
-    e) Convey the object code using peer-to-peer transmission, provided
-    you inform other peers where the object code and Corresponding
-    Source of the work are being offered to the general public at no
-    charge under subsection 6d.
-
-  A separable portion of the object code, whose source code is excluded
-from the Corresponding Source as a System Library, need not be
-included in conveying the object code work.
-
-  A "User Product" is either (1) a "consumer product", which means any
-tangible personal property which is normally used for personal, family,
-or household purposes, or (2) anything designed or sold for incorporation
-into a dwelling.  In determining whether a product is a consumer product,
-doubtful cases shall be resolved in favor of coverage.  For a particular
-product received by a particular user, "normally used" refers to a
-typical or common use of that class of product, regardless of the status
-of the particular user or of the way in which the particular user
-actually uses, or expects or is expected to use, the product.  A product
-is a consumer product regardless of whether the product has substantial
-commercial, industrial or non-consumer uses, unless such uses represent
-the only significant mode of use of the product.
-
-  "Installation Information" for a User Product means any methods,
-procedures, authorization keys, or other information required to install
-and execute modified versions of a covered work in that User Product from
-a modified version of its Corresponding Source.  The information must
-suffice to ensure that the continued functioning of the modified object
-code is in no case prevented or interfered with solely because
-modification has been made.
-
-  If you convey an object code work under this section in, or with, or
-specifically for use in, a User Product, and the conveying occurs as
-part of a transaction in which the right of possession and use of the
-User Product is transferred to the recipient in perpetuity or for a
-fixed term (regardless of how the transaction is characterized), the
-Corresponding Source conveyed under this section must be accompanied
-by the Installation Information.  But this requirement does not apply
-if neither you nor any third party retains the ability to install
-modified object code on the User Product (for example, the work has
-been installed in ROM).
-
-  The requirement to provide Installation Information does not include a
-requirement to continue to provide support service, warranty, or updates
-for a work that has been modified or installed by the recipient, or for
-the User Product in which it has been modified or installed.  Access to a
-network may be denied when the modification itself materially and
-adversely affects the operation of the network or violates the rules and
-protocols for communication across the network.
-
-  Corresponding Source conveyed, and Installation Information provided,
-in accord with this section must be in a format that is publicly
-documented (and with an implementation available to the public in
-source code form), and must require no special password or key for
-unpacking, reading or copying.
-
-  7. Additional Terms.
-
-  "Additional permissions" are terms that supplement the terms of this
-License by making exceptions from one or more of its conditions.
-Additional permissions that are applicable to the entire Program shall
-be treated as though they were included in this License, to the extent
-that they are valid under applicable law.  If additional permissions
-apply only to part of the Program, that part may be used separately
-under those permissions, but the entire Program remains governed by
-this License without regard to the additional permissions.
-
-  When you convey a copy of a covered work, you may at your option
-remove any additional permissions from that copy, or from any part of
-it.  (Additional permissions may be written to require their own
-removal in certain cases when you modify the work.)  You may place
-additional permissions on material, added by you to a covered work,
-for which you have or can give appropriate copyright permission.
-
-  Notwithstanding any other provision of this License, for material you
-add to a covered work, you may (if authorized by the copyright holders of
-that material) supplement the terms of this License with terms:
-
-    a) Disclaiming warranty or limiting liability differently from the
-    terms of sections 15 and 16 of this License; or
-
-    b) Requiring preservation of specified reasonable legal notices or
-    author attributions in that material or in the Appropriate Legal
-    Notices displayed by works containing it; or
-
-    c) Prohibiting misrepresentation of the origin of that material, or
-    requiring that modified versions of such material be marked in
-    reasonable ways as different from the original version; or
-
-    d) Limiting the use for publicity purposes of names of licensors or
-    authors of the material; or
-
-    e) Declining to grant rights under trademark law for use of some
-    trade names, trademarks, or service marks; or
-
-    f) Requiring indemnification of licensors and authors of that
-    material by anyone who conveys the material (or modified versions of
-    it) with contractual assumptions of liability to the recipient, for
-    any liability that these contractual assumptions directly impose on
-    those licensors and authors.
-
-  All other non-permissive additional terms are considered "further
-restrictions" within the meaning of section 10.  If the Program as you
-received it, or any part of it, contains a notice stating that it is
-governed by this License along with a term that is a further
-restriction, you may remove that term.  If a license document contains
-a further restriction but permits relicensing or conveying under this
-License, you may add to a covered work material governed by the terms
-of that license document, provided that the further restriction does
-not survive such relicensing or conveying.
-
-  If you add terms to a covered work in accord with this section, you
-must place, in the relevant source files, a statement of the
-additional terms that apply to those files, or a notice indicating
-where to find the applicable terms.
-
-  Additional terms, permissive or non-permissive, may be stated in the
-form of a separately written license, or stated as exceptions;
-the above requirements apply either way.
-
-  8. Termination.
-
-  You may not propagate or modify a covered work except as expressly
-provided under this License.  Any attempt otherwise to propagate or
-modify it is void, and will automatically terminate your rights under
-this License (including any patent licenses granted under the third
-paragraph of section 11).
-
-  However, if you cease all violation of this License, then your
-license from a particular copyright holder is reinstated (a)
-provisionally, unless and until the copyright holder explicitly and
-finally terminates your license, and (b) permanently, if the copyright
-holder fails to notify you of the violation by some reasonable means
-prior to 60 days after the cessation.
-
-  Moreover, your license from a particular copyright holder is
-reinstated permanently if the copyright holder notifies you of the
-violation by some reasonable means, this is the first time you have
-received notice of violation of this License (for any work) from that
-copyright holder, and you cure the violation prior to 30 days after
-your receipt of the notice.
-
-  Termination of your rights under this section does not terminate the
-licenses of parties who have received copies or rights from you under
-this License.  If your rights have been terminated and not permanently
-reinstated, you do not qualify to receive new licenses for the same
-material under section 10.
-
-  9. Acceptance Not Required for Having Copies.
-
-  You are not required to accept this License in order to receive or
-run a copy of the Program.  Ancillary propagation of a covered work
-occurring solely as a consequence of using peer-to-peer transmission
-to receive a copy likewise does not require acceptance.  However,
-nothing other than this License grants you permission to propagate or
-modify any covered work.  These actions infringe copyright if you do
-not accept this License.  Therefore, by modifying or propagating a
-covered work, you indicate your acceptance of this License to do so.
-
-  10. Automatic Licensing of Downstream Recipients.
-
-  Each time you convey a covered work, the recipient automatically
-receives a license from the original licensors, to run, modify and
-propagate that work, subject to this License.  You are not responsible
-for enforcing compliance by third parties with this License.
-
-  An "entity transaction" is a transaction transferring control of an
-organization, or substantially all assets of one, or subdividing an
-organization, or merging organizations.  If propagation of a covered
-work results from an entity transaction, each party to that
-transaction who receives a copy of the work also receives whatever
-licenses to the work the party's predecessor in interest had or could
-give under the previous paragraph, plus a right to possession of the
-Corresponding Source of the work from the predecessor in interest, if
-the predecessor has it or can get it with reasonable efforts.
-
-  You may not impose any further restrictions on the exercise of the
-rights granted or affirmed under this License.  For example, you may
-not impose a license fee, royalty, or other charge for exercise of
-rights granted under this License, and you may not initiate litigation
-(including a cross-claim or counterclaim in a lawsuit) alleging that
-any patent claim is infringed by making, using, selling, offering for
-sale, or importing the Program or any portion of it.
-
-  11. Patents.
-
-  A "contributor" is a copyright holder who authorizes use under this
-License of the Program or a work on which the Program is based.  The
-work thus licensed is called the contributor's "contributor version".
-
-  A contributor's "essential patent claims" are all patent claims
-owned or controlled by the contributor, whether already acquired or
-hereafter acquired, that would be infringed by some manner, permitted
-by this License, of making, using, or selling its contributor version,
-but do not include claims that would be infringed only as a
-consequence of further modification of the contributor version.  For
-purposes of this definition, "control" includes the right to grant
-patent sublicenses in a manner consistent with the requirements of
-this License.
-
-  Each contributor grants you a non-exclusive, worldwide, royalty-free
-patent license under the contributor's essential patent claims, to
-make, use, sell, offer for sale, import and otherwise run, modify and
-propagate the contents of its contributor version.
-
-  In the following three paragraphs, a "patent license" is any express
-agreement or commitment, however denominated, not to enforce a patent
-(such as an express permission to practice a patent or covenant not to
-sue for patent infringement).  To "grant" such a patent license to a
-party means to make such an agreement or commitment not to enforce a
-patent against the party.
-
-  If you convey a covered work, knowingly relying on a patent license,
-and the Corresponding Source of the work is not available for anyone
-to copy, free of charge and under the terms of this License, through a
-publicly available network server or other readily accessible means,
-then you must either (1) cause the Corresponding Source to be so
-available, or (2) arrange to deprive yourself of the benefit of the
-patent license for this particular work, or (3) arrange, in a manner
-consistent with the requirements of this License, to extend the patent
-license to downstream recipients.  "Knowingly relying" means you have
-actual knowledge that, but for the patent license, your conveying the
-covered work in a country, or your recipient's use of the covered work
-in a country, would infringe one or more identifiable patents in that
-country that you have reason to believe are valid.
-
-  If, pursuant to or in connection with a single transaction or
-arrangement, you convey, or propagate by procuring conveyance of, a
-covered work, and grant a patent license to some of the parties
-receiving the covered work authorizing them to use, propagate, modify
-or convey a specific copy of the covered work, then the patent license
-you grant is automatically extended to all recipients of the covered
-work and works based on it.
-
-  A patent license is "discriminatory" if it does not include within
-the scope of its coverage, prohibits the exercise of, or is
-conditioned on the non-exercise of one or more of the rights that are
-specifically granted under this License.  You may not convey a covered
-work if you are a party to an arrangement with a third party that is
-in the business of distributing software, under which you make payment
-to the third party based on the extent of your activity of conveying
-the work, and under which the third party grants, to any of the
-parties who would receive the covered work from you, a discriminatory
-patent license (a) in connection with copies of the covered work
-conveyed by you (or copies made from those copies), or (b) primarily
-for and in connection with specific products or compilations that
-contain the covered work, unless you entered into that arrangement,
-or that patent license was granted, prior to 28 March 2007.
-
-  Nothing in this License shall be construed as excluding or limiting
-any implied license or other defenses to infringement that may
-otherwise be available to you under applicable patent law.
-
-  12. No Surrender of Others' Freedom.
-
-  If conditions are imposed on you (whether by court order, agreement or
-otherwise) that contradict the conditions of this License, they do not
-excuse you from the conditions of this License.  If you cannot convey a
-covered work so as to satisfy simultaneously your obligations under this
-License and any other pertinent obligations, then as a consequence you may
-not convey it at all.  For example, if you agree to terms that obligate you
-to collect a royalty for further conveying from those to whom you convey
-the Program, the only way you could satisfy both those terms and this
-License would be to refrain entirely from conveying the Program.
-
-  13. Use with the GNU Affero General Public License.
-
-  Notwithstanding any other provision of this License, you have
-permission to link or combine any covered work with a work licensed
-under version 3 of the GNU Affero General Public License into a single
-combined work, and to convey the resulting work.  The terms of this
-License will continue to apply to the part which is the covered work,
-but the special requirements of the GNU Affero General Public License,
-section 13, concerning interaction through a network will apply to the
-combination as such.
-
-  14. Revised Versions of this License.
-
-  The Free Software Foundation may publish revised and/or new versions of
-the GNU General Public License from time to time.  Such new versions will
-be similar in spirit to the present version, but may differ in detail to
-address new problems or concerns.
-
-  Each version is given a distinguishing version number.  If the
-Program specifies that a certain numbered version of the GNU General
-Public License "or any later version" applies to it, you have the
-option of following the terms and conditions either of that numbered
-version or of any later version published by the Free Software
-Foundation.  If the Program does not specify a version number of the
-GNU General Public License, you may choose any version ever published
-by the Free Software Foundation.
-
-  If the Program specifies that a proxy can decide which future
-versions of the GNU General Public License can be used, that proxy's
-public statement of acceptance of a version permanently authorizes you
-to choose that version for the Program.
-
-  Later license versions may give you additional or different
-permissions.  However, no additional obligations are imposed on any
-author or copyright holder as a result of your choosing to follow a
-later version.
-
-  15. Disclaimer of Warranty.
-
-  THERE IS NO WARRANTY FOR THE PROGRAM, TO THE EXTENT PERMITTED BY
-APPLICABLE LAW.  EXCEPT WHEN OTHERWISE STATED IN WRITING THE COPYRIGHT
-HOLDERS AND/OR OTHER PARTIES PROVIDE THE PROGRAM "AS IS" WITHOUT WARRANTY
-OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO,
-THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR
-PURPOSE.  THE ENTIRE RISK AS TO THE QUALITY AND PERFORMANCE OF THE PROGRAM
-IS WITH YOU.  SHOULD THE PROGRAM PROVE DEFECTIVE, YOU ASSUME THE COST OF
-ALL NECESSARY SERVICING, REPAIR OR CORRECTION.
-
-  16. Limitation of Liability.
-
-  IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN WRITING
-WILL ANY COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MODIFIES AND/OR CONVEYS
-THE PROGRAM AS PERMITTED ABOVE, BE LIABLE TO YOU FOR DAMAGES, INCLUDING ANY
-GENERAL, SPECIAL, INCIDENTAL OR CONSEQUENTIAL DAMAGES ARISING OUT OF THE
-USE OR INABILITY TO USE THE PROGRAM (INCLUDING BUT NOT LIMITED TO LOSS OF
-DATA OR DATA BEING RENDERED INACCURATE OR LOSSES SUSTAINED BY YOU OR THIRD
-PARTIES OR A FAILURE OF THE PROGRAM TO OPERATE WITH ANY OTHER PROGRAMS),
-EVEN IF SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE POSSIBILITY OF
-SUCH DAMAGES.
-
-  17. Interpretation of Sections 15 and 16.
-
-  If the disclaimer of warranty and limitation of liability provided
-above cannot be given local legal effect according to their terms,
-reviewing courts shall apply local law that most closely approximates
-an absolute waiver of all civil liability in connection with the
-Program, unless a warranty or assumption of liability accompanies a
-copy of the Program in return for a fee.
-
-                     END OF TERMS AND CONDITIONS
-
-            How to Apply These Terms to Your New Programs
-
-  If you develop a new program, and you want it to be of the greatest
-possible use to the public, the best way to achieve this is to make it
-free software which everyone can redistribute and change under these terms.
-
-  To do so, attach the following notices to the program.  It is safest
-to attach them to the start of each source file to most effectively
-state the exclusion of warranty; and each file should have at least
-the "copyright" line and a pointer to where the full notice is found.
-
-    <one line to give the program's name and a brief idea of what it does.>
-    Copyright (C) <year>  <name of author>
-
-    This program is free software: you can redistribute it and/or modify
-    it under the terms of the GNU General Public License as published by
-    the Free Software Foundation, either version 3 of the License, or
-    (at your option) any later version.
-
-    This program is distributed in the hope that it will be useful,
-    but WITHOUT ANY WARRANTY; without even the implied warranty of
-    MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
-    GNU General Public License for more details.
-
-    You should have received a copy of the GNU General Public License
-    along with this program.  If not, see <https://www.gnu.org/licenses/>.
-
-Also add information on how to contact you by electronic and paper mail.
-
-  If the program does terminal interaction, make it output a short
-notice like this when it starts in an interactive mode:
-
-    <program>  Copyright (C) <year>  <name of author>
-    This program comes with ABSOLUTELY NO WARRANTY; for details type `show w'.
-    This is free software, and you are welcome to redistribute it
-    under certain conditions; type `show c' for details.
-
-The hypothetical commands `show w' and `show c' should show the appropriate
-parts of the General Public License.  Of course, your program's commands
-might be different; for a GUI interface, you would use an "about box".
-
-  You should also get your employer (if you work as a programmer) or school,
-if any, to sign a "copyright disclaimer" for the program, if necessary.
-For more information on this, and how to apply and follow the GNU GPL, see
-<https://www.gnu.org/licenses/>.
-
-  The GNU General Public License does not permit incorporating your program
-into proprietary programs.  If your program is a subroutine library, you
-may consider it more useful to permit linking proprietary applications with
-the library.  If this is what you want to do, use the GNU Lesser General
-Public License instead of this License.  But first, please read
-<https://www.gnu.org/licenses/why-not-lgpl.html>.
diff --git a/NEWS.md b/NEWS.md
new file mode 100644
index 00000000..1d3b8bbd
--- /dev/null
+++ b/NEWS.md
@@ -0,0 +1,3 @@
+# bpmodels 0.1.0
+
+* initial release
diff --git a/README.md b/README.md
index 9b3e57ea..fede0da4 100644
--- a/README.md
+++ b/README.md
@@ -1,2 +1,11 @@
-# epichains
+# bpmodels
+
 Methods for analysing the distribution of epidemiological chain sizes and lengths
+
+The latest development version of the `bpmodels` package can be installed via
+
+```{r eval=FALSE}
+devtools::install_github('sbfnk/bpmodels')
+```
+
+Please note that the 'bpmodels' project is released with a [Contributor Code of Conduct](CODE_OF_CONDUCT.md). By contributing to this project, you agree to abide by its terms.
diff --git a/appveyor.yml b/appveyor.yml
new file mode 100644
index 00000000..057d78b3
--- /dev/null
+++ b/appveyor.yml
@@ -0,0 +1,41 @@
+# DO NOT CHANGE the "init" and "install" sections below
+
+# Download script file from GitHub
+init:
+  ps: |
+        $ErrorActionPreference = "Stop"
+        Invoke-WebRequest http://raw.github.com/krlmlr/r-appveyor/master/scripts/appveyor-tool.ps1 -OutFile "..\appveyor-tool.ps1"
+        Import-Module '..\appveyor-tool.ps1'
+install:
+  ps: Bootstrap
+
+# Adapt as necessary starting from here
+
+build_script:
+  - travis-tool.sh install_deps
+
+test_script:
+  - travis-tool.sh run_tests
+
+on_failure:
+  - 7z a failure.zip *.Rcheck\*
+  - appveyor PushArtifact failure.zip
+
+artifacts:
+  - path: '*.Rcheck\**\*.log'
+    name: Logs
+
+  - path: '*.Rcheck\**\*.out'
+    name: Logs
+
+  - path: '*.Rcheck\**\*.fail'
+    name: Logs
+
+  - path: '*.Rcheck\**\*.Rout'
+    name: Logs
+
+  - path: '\*_*.tar.gz'
+    name: Bits
+
+  - path: '\*_*.zip'
+name: Bits
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index d2cd9b8f..d9827f27 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -14,7 +14,8 @@ the Poisson offspring distribution.}
 
 \item{stat}{statistic given as \code{x} ("size" or "length" of chains)}
 
-\item{nsim_offspring}{number of simulations of the offspring distribution for approximation the size/length distribution}
+\item{nsim_offspring}{number of simulations of the offspring distribution
+for approximation the size/length distribution}
 
 \item{...}{any paramaters to pass to \code{\link{chain_sim}}}
 }

From 97679dc2936881031457cac0010da3fc21e904ae Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 09:51:26 +0000
Subject: [PATCH 019/828] remove spurious tests

---
 tests/testthat/tests-ll.r  | 1 -
 tests/testthat/tests-sim.r | 2 --
 2 files changed, 3 deletions(-)

diff --git a/tests/testthat/tests-ll.r b/tests/testthat/tests-ll.r
index f1dfdfea..e12b72ce 100644
--- a/tests/testthat/tests-ll.r
+++ b/tests/testthat/tests-ll.r
@@ -29,5 +29,4 @@ test_that("Errors are thrown",
     expect_error(chain_ll(chain_sizes, "pois", "size", lambda=0.5, obs_prob = 0.5), "must be specified")
     expect_error(nbinom_size_ll(chains, mu=0.5, size=0.2, prob=0.1), "both specified")
     expect_error(gborel_size_ll(chains, mu=0.5, size=0.2, prob=0.1), "both specified")
-    expect_error(chain_sim(n=2, "test"), "is not a function")
 })
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 8cb7795a..697886ab 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -9,7 +9,5 @@ test_that("Chains can be simulated",
 
 test_that("Errors are thrown",
 {
-    rtest <- 0
     expect_error(chain_sim(n=2, "dummy"), "does not exist")
-    expect_error(chain_sim(n=2, "test"), "is not a function")
 })

From fe3718868677481622fd110f7cb534c899c99d71 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 09:52:22 +0000
Subject: [PATCH 020/828] qualify stats functions

---
 R/likelihoods.R | 9 ++++++---
 R/utils.r       | 2 +-
 2 files changed, 7 insertions(+), 4 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 12dca021..34911a31 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -90,7 +90,8 @@ geom_length_ll <- function(x, prob) {
 ##'   chains by linearly approximating any missing values in the empirical
 ##'   cumulative distribution function (ecdf).
 ##' @param x vector of sizes
-##' @param nsim_offspring number of simulations of the offspring distribution for approximation the size/length distribution
+##' @param nsim_offspring number of simulations of the offspring distribution
+##'   for approximation the size/length distribution 
 ##' @param ... any paramaters to pass to \code{\link{chain_sim}}
 ##' @return log-likelihood values
 ##' @author Sebastian Funk
@@ -102,8 +103,10 @@ offspring_ll <- function(x, offspring, stat, nsim_offspring=100, ...) {
   dist <- chain_sim(nsim_offspring, offspring, stat, ...)
 
   ## linear approximation
-  f <- ecdf(dist)
-  acdf <- diff(c(0, approx(unique(dist), f(unique(dist)), seq_len(max(dist[is.finite(dist)])))$y))
+  f <- stats::ecdf(dist)
+  acdf <-
+    diff(c(0, stats::approx(unique(dist), f(unique(dist)),
+                            seq_len(max(dist[is.finite(dist)])))$y))
   lik <- acdf[x]
   lik[is.na(lik)] <- 0
   log(lik)
diff --git a/R/utils.r b/R/utils.r
index 6876b962..86fe3f7d 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -20,5 +20,5 @@ complementary_logprob <- function(x) {
 ##' @author Sebastian Funk
 ##' @keywords internal
 rbinom_size <- function(n, x, prob) {
-    x + rnbinom(n, x, prob) + rnbinom(n, 1, prob)
+    x + stats::rnbinom(n, x, prob) + stats::rnbinom(n, 1, prob)
 }

From c6d27e42895d10a4fa2b389d400d3264bf08e0a6 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 09:56:30 +0000
Subject: [PATCH 021/828] update travis.yml for osx and codecov

---
 .travis.yml | 27 ++++++++++++++++++++++++---
 1 file changed, 24 insertions(+), 3 deletions(-)

diff --git a/.travis.yml b/.travis.yml
index 8d139ac6..a7279d6d 100644
--- a/.travis.yml
+++ b/.travis.yml
@@ -1,5 +1,26 @@
 # R for travis: see documentation at https://docs.travis-ci.com/user/languages/r
-
-language: R
-sudo: false
+language: r
 cache: packages
+
+matrix:
+  include:
+    - os: linux
+      r: release
+      env:
+        - R_CODECOV=true
+    - os: linux
+      r: devel
+    - os: linux
+      r: oldrel
+    - os: osx
+      osx_image: xcode8.3
+
+warnings_are_errors: true
+
+notifications:
+  email:
+    on_success: change
+    on_failure: change
+
+after_success:
+- if [[ "${R_CODECOV}" ]]; then Rscript -e 'covr::codecov()'; fi

From 4925cccf9bb0aaf5b1b9222556c22cba982eb33d Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 09:58:18 +0000
Subject: [PATCH 022/828] fix typo in appveyor.yml

---
 appveyor.yml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/appveyor.yml b/appveyor.yml
index 057d78b3..bc46a87c 100644
--- a/appveyor.yml
+++ b/appveyor.yml
@@ -38,4 +38,4 @@ artifacts:
     name: Bits
 
   - path: '\*_*.zip'
-name: Bits
+    name: Bits

From b7346c161f35ae970018a7a5b55a52579ad5ad57 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 10:15:22 +0000
Subject: [PATCH 023/828] README badges

---
 DESCRIPTION | 3 ++-
 README.md   | 4 ++++
 2 files changed, 6 insertions(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 16c0fb0a..5020a6e7 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -7,7 +7,8 @@ Imports: matrixStats
 Suggests: 
     testthat,
     knitr,
-    rmarkdown
+    rmarkdown,
+    covr
 License: GPL-3
 URL: https://github.com/sbfnk/bpmodels
 BugReports: https://github.com/sbfnk/bpmodels
diff --git a/README.md b/README.md
index fede0da4..028c7a9e 100644
--- a/README.md
+++ b/README.md
@@ -1,5 +1,9 @@
 # bpmodels
 
+[![Travis-CI Build Status](https://travis-ci.org/sbfnk/bpmodels.svg?branch=master)](https://travis-ci.org/sbfnk/bpmodels)
+[![Appveyor Build Status](https://ci.appveyor.com/api/projects/status/github/sbfnk)](https://ci.appveyor.com/project/sbfnk/bpmodels)
+[![codecov](https://codecov.io/github/sbfnk/bpmodels/branch/master/graphs/badge.svg)](https://codecov.io/github/sbfnk/bpmodels) 
+
 Methods for analysing the distribution of epidemiological chain sizes and lengths
 
 The latest development version of the `bpmodels` package can be installed via

From d8f3e186b003bebd28ad9bae9b449af9c6f946e0 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 10:23:14 +0000
Subject: [PATCH 024/828] remove obsolete test

---
 R/simulate.r | 4 +---
 1 file changed, 1 insertion(+), 3 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 70d09b39..8e13b68a 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -28,10 +28,8 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
             offspring <- sum(func(n=state, ...))
             if (stat=="size") {
                 stat_track <- stat_track + offspring
-            } else if (stat=="length"){
+            } else if (stat=="length") {
                 if (offspring > 0) stat_track <- stat_track + 1
-            } else {
-                stop("Unknown statistic: '", stat, "'.")
             }
             state <- offspring
         }

From cb5258b65688ea4e0e6318a77aa30d25053d0378 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 10:28:45 +0000
Subject: [PATCH 025/828] give chain length test higher chance of finding
 length > 1

---
 tests/testthat/tests-sim.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 697886ab..88116f63 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -3,7 +3,7 @@ context("Simulating from a branching process model")
 test_that("Chains can be simulated",
 {
     expect_length(chain_sim(n=2, "pois", lambda=0.5), 2)
-    expect_length(chain_sim(n=2, "pois", "length", lambda=0.5), 2)
+    expect_length(chain_sim(n=10, "pois", "length", lambda=0.9), 2)
     expect_false(any(is.finite(chain_sim(n=2, "pois", "length", lambda=0.5, infinite=1))))
 })
 

From 2308ff3e57081f74075e0d1cdfb7937f825ded80 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 10:42:52 +0000
Subject: [PATCH 026/828] fix simulation test

---
 tests/testthat/tests-sim.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 88116f63..093e5a6b 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -3,7 +3,7 @@ context("Simulating from a branching process model")
 test_that("Chains can be simulated",
 {
     expect_length(chain_sim(n=2, "pois", lambda=0.5), 2)
-    expect_length(chain_sim(n=10, "pois", "length", lambda=0.9), 2)
+    expect_length(chain_sim(n=10, "pois", "length", lambda=0.9), 10)
     expect_false(any(is.finite(chain_sim(n=2, "pois", "length", lambda=0.5, infinite=1))))
 })
 

From 5555ffbc750cd4ef7e9e12bd9f8f57506d6e9310 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 11:16:01 +0000
Subject: [PATCH 027/828] use actual function to generate offspring instead of
 name

---
 DESCRIPTION                |  1 -
 R/borel.r                  |  5 ++--
 R/likelihoods.R            | 57 +++++++++++++++++++++++++-------------
 R/simulate.r               | 30 +++++++++++---------
 man/chain_ll.Rd            | 13 +++++----
 man/chain_sim.Rd           | 11 ++++++--
 man/gborel_size_ll.Rd      |  6 ++--
 man/geom_length_ll.Rd      |  3 +-
 man/nbinom_size_ll.Rd      | 12 +++++---
 man/offspring_ll.Rd        |  5 ++--
 man/rborel.Rd              |  3 +-
 tests/testthat/tests-ll.r  | 28 ++++++++++++-------
 tests/testthat/tests-sim.r |  9 +++---
 vignettes/introduction.Rmd | 14 ++++++----
 14 files changed, 125 insertions(+), 72 deletions(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 5020a6e7..bf73caf5 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -3,7 +3,6 @@ Version: 0.1.0
 Title: Analysing chain statistics using branching process models
 Authors@R: c(person("Sebastian", "Funk", email = "sebastian.funk@lshtm.ac.uk", role = c("aut", "cre")))
 Description: Provides methods to analyse and simulate the size and length of branching processes with an arbitrary offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks, as discussed in Farrington et al. (2003) <doi:10.1093/biostatistics/4.2.279>.
-Imports: matrixStats
 Suggests: 
     testthat,
     knitr,
diff --git a/R/borel.r b/R/borel.r
index a03e34f3..9e97df5d 100644
--- a/R/borel.r
+++ b/R/borel.r
@@ -17,9 +17,10 @@ dborel <- function(x, mu, log=FALSE) {
 ##' Random numbers are generated by simulating from a Poisson branching process
 ##' @param n number of random variates to generate.
 ##' @param mu mu parameter.
-##' @param infinite any number to treat as infinite; simulations will be stopped if this number is reached
+##' @param infinite any number to treat as infinite; simulations will be stopped
+##'     if this number is reached
 ##' @return vector of random numbers
 ##' @author Sebastian Funk
 rborel <- function(n, mu, infinite=Inf) {
-    chain_sim(n, "pois", "size", infinite=infinite, lambda=mu)
+    chain_sim(n, stats::rpois, "size", infinite=infinite, lambda=mu)
 }
diff --git a/R/likelihoods.R b/R/likelihoods.R
index 34911a31..f84b383f 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -10,11 +10,14 @@ pois_size_ll <- function(x, lambda)
   (x - 1) * log(lambda) - lambda * x + (x - 2) * log(x) - lgamma(x)
 }
 
-##' Likelihood of the size of chains with Negative-Binomial offspring distribution
+##' Likelihood of the size of chains with Negative-Binomial offspring
+##' distribution
 ##'
 ##' @param x vector of sizes
-##' @param size the dispersion parameter (often called \code{k} in ecological applications)
-##' @param prob probability of success (in the parameterisation with \code{prob}, see also \code{\link[stats]{NegBinomial}})
+##' @param size the dispersion parameter (often called \code{k} in ecological
+##'   applications)
+##' @param prob probability of success (in the parameterisation with
+##'   \code{prob}, see also \code{\link[stats]{NegBinomial}})
 ##' @param mu mean parameter
 ##' @return log-likelihood values
 ##' @author Sebastian Funk
@@ -33,8 +36,10 @@ nbinom_size_ll <- function(x, size, prob, mu)
 ##' Likelihood of the size of chains with gamma-Borel offspring distribution
 ##'
 ##' @param x vector of sizes
-##' @param size the dispersion parameter (often called \code{k} in ecological applications)
-##' @param prob probability of success (in the parameterisation with \code{prob}, see also \code{\link[stats]{NegBinomial}})
+##' @param size the dispersion parameter (often called \code{k} in ecological
+##'   applications)
+##' @param prob probability of success (in the parameterisation with
+##'   \code{prob}, see also \code{\link[stats]{NegBinomial}})
 ##' @param mu mean parameter
 ##' @return log-likelihood values
 ##' @author Sebastian Funk
@@ -44,7 +49,8 @@ gborel_size_ll <- function(x, size, prob, mu) {
     if (!missing(mu)) stop("'prob' and 'mu' both specified")
     mu <- size * (1 - prob) / prob
   }
-  lgamma(size + x - 1) - (lgamma(x + 1) + lgamma(size)) - size * log(mu / size) +
+  lgamma(size + x - 1) -
+    (lgamma(x + 1) + lgamma(size)) - size * log(mu / size) +
     (x - 1) * log(x) - (size + x - 1) * log(x + size / mu)
 }
 
@@ -70,7 +76,8 @@ pois_length_ll <- function(x, lambda) {
 ##' Likelihood of the length of chains with geometric offspring distribution
 ##'
 ##' @param x vector of sizes
-##' @param prob probability of the geometric distribution with mean \code{1/prob}
+##' @param prob probability of the geometric distribution with mean
+##' \code{1/prob}
 ##' @return log-likelihood values
 ##' @author Sebastian Funk
 ##' @keywords internal
@@ -91,7 +98,7 @@ geom_length_ll <- function(x, prob) {
 ##'   cumulative distribution function (ecdf).
 ##' @param x vector of sizes
 ##' @param nsim_offspring number of simulations of the offspring distribution
-##'   for approximation the size/length distribution 
+##'   for approximation the size/length distribution
 ##' @param ... any paramaters to pass to \code{\link{chain_sim}}
 ##' @return log-likelihood values
 ##' @author Sebastian Funk
@@ -119,25 +126,30 @@ offspring_ll <- function(x, offspring, stat, nsim_offspring=100, ...) {
 ##' @param obs_prob observation probability (assumed constant)
 ##' @param infinite any chains of this size/length will be treated as infinite
 ##' @param exclude any sizes/lengths to exclude from the likelihood calculation
-##' @param nsim_obs number of simulations if the likelihood is to be approximated for imperfect observations
+##' @param nsim_obs number of simulations if the likelihood is to be
+##'   approximated for imperfect observations
 ##' @param ... parameters for the offspring distribution
 ##' @return likelihood
 ##' @inheritParams chain_sim
-##' @seealso pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll geom_length_ll offspring_ll
+##' @seealso pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll
+##'   geom_length_ll offspring_ll
 ##' @author Sebastian Funk
 ##' @export
 ##' @examples
 ##' chain_sizes <- c(1,1,4,7) # example of observed chain sizes
-##' chain_ll(chain_sizes, "pois", "size", lambda=0.5)
-chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1, infinite = Inf, exclude, nsim_obs, ...)
-{
+##' chain_ll(chain_sizes, rpois, "size", lambda=0.5)
+chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
+                     infinite = Inf, exclude, nsim_obs, ...) {
   stat <- match.arg(stat)
 
   ## checks
   if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
   if (obs_prob < 1) {
-    if (missing(nsim_obs)) stop("'nsim_obs' must be specified if 'obs_prob' is <1")
-    sampled_x <- replicate(nsim_obs, pmin(rbinom_size(length(x), x, obs_prob), infinite))
+    if (missing(nsim_obs)) {
+      stop("'nsim_obs' must be specified if 'obs_prob' is <1")
+    }
+    sampled_x <-
+      replicate(nsim_obs, pmin(rbinom_size(length(x), x, obs_prob), infinite))
     size_x <- unlist(sampled_x)
     if (!is.finite(infinite)) infinite <- max(size_x) + 1
   } else {
@@ -152,15 +164,22 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1, infinit
     calc_sizes <- unique(size_x)
   }
 
-  ## first, get likelihood function as given by `offspring` and `stat``
+  ## get random function as given by `offspring`
+  if (!is.function(offspring)) {
+    stop("object passed as 'offspring' is not a function.")
+  }
+
+  ## get likelihood function as given by `offspring` and `stat``
   likelihoods <- c()
-  ll_func <- paste(offspring, stat, "ll", sep="_")
+  ## get offspring distribution by stripping first letter from offspring
+  ## function 
+  offspring_dist <- sub("^.", "", deparse(substitute(offspring)))
+  ll_func <- paste(offspring_dist, stat, "ll", sep="_")
   pars <- as.list(unlist(list(...))) ## converts vectors to lists
 
   ## calculate likelihoods
-  if (exists(ll_func)) {
+  if (exists(ll_func, where=asNamespace('bpmodels'), mode='function')) {
     func <- get(ll_func)
-    if (!is.function(func)) stop("'", ll_func, "' is not a function.")
     likelihoods[calc_sizes] <- do.call(func, c(list(x=calc_sizes), pars))
   } else {
     likelihoods[calc_sizes] <-
diff --git a/R/simulate.r b/R/simulate.r
index 8e13b68a..6ce66d1f 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,37 +1,41 @@
 ##' Simulate chains using a branching process
 ##'
 ##' @param n number of simulations to run.
-##' @param offspring offspring distribution as character string, e.g. "pois" for
-##'     the Poisson offspring distribution. 
+##' @param offspring offspring distribution, given as the function used to
+##'     generate the number of offspring in each generation, e.g. `rpois` for
+##'     Poisson distributed offspring
 ##' @param stat statistic to calculate ("size" or "length" of chains)
-##' @param infinite a size or length from which the size/length is to be considered infinite
+##' @param infinite a size or length from which the size/length is to be
+##'     considered infinite
 ##' @param ... parameters of the offspring distribution
 ##' @return a vector of sizes/lengths
 ##' @author Sebastian Funk
 ##' @export
-chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf, ...) {
+##' @examples
+##' chain_sim(n=5, rpois, "size", lambda=0.5)
+chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
+                      ...) {
 
     stat <- match.arg(stat)
 
     ## first, get random function as given by `offspring`
-    random_func <- paste0("r", offspring)
-    if (!exists(random_func)) stop("Random sampling function '", random_func, "' does not exist.")
-    func <- get(random_func)
-    if (!is.function(func)) stop("'", random_func, "' is not a function.")
+    if (!is.function(offspring)) {
+        stop("object passed as 'offspring' is not a function.")
+    }
 
     ## next, simulate n chains
     dist <- c()
     for (i in seq_len(n)) {
-        stat_track <- 1 ## variable to track length or size (depending on `stat`)
+        stat_track <- 1 ## track length or size (depending on `stat`)
         state <- 1
         while (state > 0 && state < infinite) {
-            offspring <- sum(func(n=state, ...))
+            n_offspring <- sum(offspring(n=state, ...))
             if (stat=="size") {
-                stat_track <- stat_track + offspring
+                stat_track <- stat_track + n_offspring
             } else if (stat=="length") {
-                if (offspring > 0) stat_track <- stat_track + 1
+                if (n_offspring > 0) stat_track <- stat_track + 1
             }
-            state <- offspring
+            state <- n_offspring
         }
         if (state >= infinite) stat_track <- Inf
         dist[i] <- stat_track
diff --git a/man/chain_ll.Rd b/man/chain_ll.Rd
index 71f39d7e..0ed5fa56 100644
--- a/man/chain_ll.Rd
+++ b/man/chain_ll.Rd
@@ -10,8 +10,9 @@ chain_ll(x, offspring, stat = c("size", "length"), obs_prob = 1,
 \arguments{
 \item{x}{vector of sizes or lengths of transmission chains}
 
-\item{offspring}{offspring distribution as character string, e.g. "pois" for
-the Poisson offspring distribution.}
+\item{offspring}{offspring distribution, given as the function used to
+generate the number of offspring in each generation, e.g. `rpois` for
+Poisson distributed offspring}
 
 \item{stat}{statistic given as \code{x} ("size" or "length" of chains)}
 
@@ -21,7 +22,8 @@ the Poisson offspring distribution.}
 
 \item{exclude}{any sizes/lengths to exclude from the likelihood calculation}
 
-\item{nsim_obs}{number of simulations if the likelihood is to be approximated for imperfect observations}
+\item{nsim_obs}{number of simulations if the likelihood is to be
+approximated for imperfect observations}
 
 \item{...}{parameters for the offspring distribution}
 }
@@ -33,10 +35,11 @@ Likelihood for the outcome of a branching process
 }
 \examples{
 chain_sizes <- c(1,1,4,7) # example of observed chain sizes
-chain_ll(chain_sizes, "pois", "size", lambda=0.5)
+chain_ll(chain_sizes, rpois, "size", lambda=0.5)
 }
 \seealso{
-pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll geom_length_ll offspring_ll
+pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll
+  geom_length_ll offspring_ll
 }
 \author{
 Sebastian Funk
diff --git a/man/chain_sim.Rd b/man/chain_sim.Rd
index e3d09a24..9030121b 100644
--- a/man/chain_sim.Rd
+++ b/man/chain_sim.Rd
@@ -10,12 +10,14 @@ chain_sim(n, offspring, stat = c("size", "length"), infinite = Inf,
 \arguments{
 \item{n}{number of simulations to run.}
 
-\item{offspring}{offspring distribution as character string, e.g. "pois" for
-the Poisson offspring distribution.}
+\item{offspring}{offspring distribution, given as the function used to
+generate the number of offspring in each generation, e.g. `rpois` for
+Poisson distributed offspring}
 
 \item{stat}{statistic to calculate ("size" or "length" of chains)}
 
-\item{infinite}{a size or length from which the size/length is to be considered infinite}
+\item{infinite}{a size or length from which the size/length is to be
+considered infinite}
 
 \item{...}{parameters of the offspring distribution}
 }
@@ -25,6 +27,9 @@ a vector of sizes/lengths
 \description{
 Simulate chains using a branching process
 }
+\examples{
+chain_sim(n=5, rpois, "size", lambda=0.5)
+}
 \author{
 Sebastian Funk
 }
diff --git a/man/gborel_size_ll.Rd b/man/gborel_size_ll.Rd
index 13ee9646..221bf270 100644
--- a/man/gborel_size_ll.Rd
+++ b/man/gborel_size_ll.Rd
@@ -9,9 +9,11 @@ gborel_size_ll(x, size, prob, mu)
 \arguments{
 \item{x}{vector of sizes}
 
-\item{size}{the dispersion parameter (often called \code{k} in ecological applications)}
+\item{size}{the dispersion parameter (often called \code{k} in ecological
+applications)}
 
-\item{prob}{probability of success (in the parameterisation with \code{prob}, see also \code{\link[stats]{NegBinomial}})}
+\item{prob}{probability of success (in the parameterisation with
+\code{prob}, see also \code{\link[stats]{NegBinomial}})}
 
 \item{mu}{mean parameter}
 }
diff --git a/man/geom_length_ll.Rd b/man/geom_length_ll.Rd
index 98015fe7..bdc6082d 100644
--- a/man/geom_length_ll.Rd
+++ b/man/geom_length_ll.Rd
@@ -9,7 +9,8 @@ geom_length_ll(x, prob)
 \arguments{
 \item{x}{vector of sizes}
 
-\item{prob}{probability of the geometric distribution with mean \code{1/prob}}
+\item{prob}{probability of the geometric distribution with mean
+\code{1/prob}}
 }
 \value{
 log-likelihood values
diff --git a/man/nbinom_size_ll.Rd b/man/nbinom_size_ll.Rd
index 974b5916..363ecd30 100644
--- a/man/nbinom_size_ll.Rd
+++ b/man/nbinom_size_ll.Rd
@@ -2,16 +2,19 @@
 % Please edit documentation in R/likelihoods.R
 \name{nbinom_size_ll}
 \alias{nbinom_size_ll}
-\title{Likelihood of the size of chains with Negative-Binomial offspring distribution}
+\title{Likelihood of the size of chains with Negative-Binomial offspring
+distribution}
 \usage{
 nbinom_size_ll(x, size, prob, mu)
 }
 \arguments{
 \item{x}{vector of sizes}
 
-\item{size}{the dispersion parameter (often called \code{k} in ecological applications)}
+\item{size}{the dispersion parameter (often called \code{k} in ecological
+applications)}
 
-\item{prob}{probability of success (in the parameterisation with \code{prob}, see also \code{\link[stats]{NegBinomial}})}
+\item{prob}{probability of success (in the parameterisation with
+\code{prob}, see also \code{\link[stats]{NegBinomial}})}
 
 \item{mu}{mean parameter}
 }
@@ -19,7 +22,8 @@ nbinom_size_ll(x, size, prob, mu)
 log-likelihood values
 }
 \description{
-Likelihood of the size of chains with Negative-Binomial offspring distribution
+Likelihood of the size of chains with Negative-Binomial offspring
+distribution
 }
 \author{
 Sebastian Funk
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index d9827f27..19d8fee4 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -9,8 +9,9 @@ offspring_ll(x, offspring, stat, nsim_offspring = 100, ...)
 \arguments{
 \item{x}{vector of sizes}
 
-\item{offspring}{offspring distribution as character string, e.g. "pois" for
-the Poisson offspring distribution.}
+\item{offspring}{offspring distribution, given as the function used to
+generate the number of offspring in each generation, e.g. `rpois` for
+Poisson distributed offspring}
 
 \item{stat}{statistic given as \code{x} ("size" or "length" of chains)}
 
diff --git a/man/rborel.Rd b/man/rborel.Rd
index 8923dc65..e32484ed 100644
--- a/man/rborel.Rd
+++ b/man/rborel.Rd
@@ -11,7 +11,8 @@ rborel(n, mu, infinite = Inf)
 
 \item{mu}{mu parameter.}
 
-\item{infinite}{any number to treat as infinite; simulations will be stopped if this number is reached}
+\item{infinite}{any number to treat as infinite; simulations will be stopped
+if this number is reached}
 }
 \value{
 vector of random numbers
diff --git a/tests/testthat/tests-ll.r b/tests/testthat/tests-ll.r
index e12b72ce..51c67a89 100644
--- a/tests/testthat/tests-ll.r
+++ b/tests/testthat/tests-ll.r
@@ -4,12 +4,14 @@ chains <- c(1,1,4,7)
 
 test_that("Likelihoods can be calculated",
 {
-    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5), 0)
-    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5, exclude=1), 0)
-    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5, infinite = 5), 0)
-    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5, obs_prob = 0.5, nsim_obs=1), 0)
-    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5, infinite = 5, obs_prob = 0.5, nsim_obs=1), 0)
-    expect_lt(chain_ll(chains, "binom", "size", size=1, prob=0.5), 0)
+    expect_lt(chain_ll(chains, rpois, "size", lambda=0.5), 0)
+    expect_lt(chain_ll(chains, rpois, "size", lambda=0.5, exclude=1), 0)
+    expect_lt(chain_ll(chains, rpois, "size", lambda=0.5, infinite = 5), 0)
+    expect_lt(chain_ll(chains, rpois, "size", lambda=0.5, obs_prob = 0.5,
+                       nsim_obs=1), 0)
+    expect_lt(chain_ll(chains, rpois, "size", lambda=0.5, infinite = 5,
+                       obs_prob = 0.5, nsim_obs=1), 0)
+    expect_lt(chain_ll(chains, rbinom, "size", size=1, prob=0.5), 0)
 })
 
 test_that("Analytical size/length distributions are implemented",
@@ -25,8 +27,14 @@ test_that("Analytical size/length distributions are implemented",
 
 test_that("Errors are thrown",
 {
-    expect_error(chain_ll(chain_sizes, "pois", "size", lambda=0.5, obs_prob = 3), "must be within")
-    expect_error(chain_ll(chain_sizes, "pois", "size", lambda=0.5, obs_prob = 0.5), "must be specified")
-    expect_error(nbinom_size_ll(chains, mu=0.5, size=0.2, prob=0.1), "both specified")
-    expect_error(gborel_size_ll(chains, mu=0.5, size=0.2, prob=0.1), "both specified")
+    expect_error(chain_ll(chains, "dummy", "size", lambda=0.5),
+                 "not a function")
+    expect_error(chain_ll(chains, rpois, "size", lambda=0.5, obs_prob = 3),
+                 "must be within")
+    expect_error(chain_ll(chains, rpois, "size", lambda=0.5, obs_prob = 0.5),
+                 "must be specified")
+    expect_error(nbinom_size_ll(chains, mu=0.5, size=0.2, prob=0.1),
+                 "both specified")
+    expect_error(gborel_size_ll(chains, mu=0.5, size=0.2, prob=0.1),
+                 "both specified")
 })
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 093e5a6b..c543e497 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -2,12 +2,13 @@ context("Simulating from a branching process model")
 
 test_that("Chains can be simulated",
 {
-    expect_length(chain_sim(n=2, "pois", lambda=0.5), 2)
-    expect_length(chain_sim(n=10, "pois", "length", lambda=0.9), 10)
-    expect_false(any(is.finite(chain_sim(n=2, "pois", "length", lambda=0.5, infinite=1))))
+    expect_length(chain_sim(n=2, rpois, lambda=0.5), 2)
+    expect_length(chain_sim(n=10, rpois, "length", lambda=0.9), 10)
+    expect_false(any(is.finite(chain_sim(n=2, rpois, "length", lambda=0.5,
+                                         infinite=1))))
 })
 
 test_that("Errors are thrown",
 {
-    expect_error(chain_sim(n=2, "dummy"), "does not exist")
+    expect_error(chain_sim(n=2, "dummy"), "is not a function")
 })
diff --git a/vignettes/introduction.Rmd b/vignettes/introduction.Rmd
index 24abe368..07e9cada 100644
--- a/vignettes/introduction.Rmd
+++ b/vignettes/introduction.Rmd
@@ -33,10 +33,10 @@ At the heart of the `bpmodels` package are the `chains_ll` and `chains_sim` func
 
 ```{r}
 chain_sizes <- c(1,1,4,7) # example of observed chain sizes
-chain_ll(chain_sizes, "pois", "size", lambda=0.5)
+chain_ll(chain_sizes, rpois, "size", lambda=0.5)
 ```
 
-The first argument of `chain_ll` is the size (or length) distribution to analyse. The second argument (called `offspring`) specifies the offspring distribution. This is given as a character string that refers to the function used to generate random offspring. It can be any probability distribution implemented in R, that is, one that has a corresponding function for generating random numbers beginning with the letter `r`. In the case of the example above, since random Poisson numbers are generated in R using a function called `rpois`, "pois" is the corresponding string to pass to the `offspring` argument.
+The first argument of `chain_ll` is the size (or length) distribution to analyse. The second argument (called `offspring`) specifies the offspring distribution. This is given as a the function used to generate random offspring. It can be any probability distribution implemented in R, that is, one that has a corresponding function for generating random numbers beginning with the letter `r`. In the case of the example above, since random Poisson numbers are generated in R using a function called `rpois`, this is the function to pass as the `offspring` argument.
 
 The third argument (called `stat`) determines whether to analyse chain sizes ("size", the default if this argument is not specified) or lengths ("length"). Lastly, any named arguments not recognised by `chain_ll` are interpreted as parameters of the corresponding probability distribution, here `lambda=0.5` as the mean of the Poisson distribution (see the R help page for the Poisson distribution for more information).
 
@@ -49,19 +49,23 @@ You can use the `R` help to find out about usage of the `chains_ll` function,
 To simulate from a branching process, use the `chain_sim` function, which follows the same syntax as the `chain_ll` function:
 
 ```{r}
-chain_sim(n=5, "pois", "size", lambda=0.5)
+chain_sim(n=5, rpois, "size", lambda=0.5)
 ```
 
 # Methodology
 
-If the probability distribution of chain sizes or lengths has an analytical solution, this will be used (size distribution: Poisson and negative binomial; length distribution: Poisson and geometric). If not, simulations are used to approximate this probability distributions (using a linear approximation to the cumulative distribution for unobserved sizes/lengths), requiring an additional parameter `nsim_offspring` for the number of simulations to be used for this approximation.
+If the probability distribution of chain sizes or lengths has an analytical solution, this will be used (size distribution: Poisson and negative binomial; length distribution: Poisson and geometric). If not, simulations are used to approximate this probability distributions (using a linear approximation to the cumulative distribution for unobserved sizes/lengths), requiring an additional parameter `nsim_offspring` for the number of simulations to be used for this approximation. For example, to get offspring drawn from a binomial distribution with probability `p=0.5`.
+
+```{r}
+chain_ll(chain_sizes, rbinom, "size", size=1, prob=0.5, nsim_offspring=100)
+```
 
 # Imperfect observations
 
 The `chain_ll` function has an `obs_prob` parameter that can be used to determine the likelihood if observations are imperfect. This only works when analysing chain sizes (`stat="size"`). In that case, true chain sizes are simulated repeatedly (the number of times given by the `nsim_obs` argument) and the likelihood calculated for each of these simulations. For example, if the probability of observing each case is 30%, use
 
 ```{r}
-ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda=0.5, nsim_obs=10)
+ll <- chain_ll(chain_sizes, rpois, "size", obs_prob = 0.3, lambda=0.5, nsim_obs=10)
 summary(ll)
 ```
 

From 48f83ce782d90019da048b88af46c93ccac99316 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 11:32:26 +0000
Subject: [PATCH 028/828] update link for Appveyor badge

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 028c7a9e..2654ac3e 100644
--- a/README.md
+++ b/README.md
@@ -1,7 +1,7 @@
 # bpmodels
 
 [![Travis-CI Build Status](https://travis-ci.org/sbfnk/bpmodels.svg?branch=master)](https://travis-ci.org/sbfnk/bpmodels)
-[![Appveyor Build Status](https://ci.appveyor.com/api/projects/status/github/sbfnk)](https://ci.appveyor.com/project/sbfnk/bpmodels)
+[![Appveyor Build Status](https://ci.appveyor.com/api/projects/status/y37i8x0wo9o8s2wf?svg=true)](https://ci.appveyor.com/project/sbfnk/bpmodels)
 [![codecov](https://codecov.io/github/sbfnk/bpmodels/branch/master/graphs/badge.svg)](https://codecov.io/github/sbfnk/bpmodels) 
 
 Methods for analysing the distribution of epidemiological chain sizes and lengths

From 0ae93936d50883c4f4cb150dac0716869a6874a9 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 11:36:20 +0000
Subject: [PATCH 029/828] fix typos

---
 R/likelihoods.R            | 6 +++---
 R/utils.r                  | 2 +-
 man/offspring_ll.Rd        | 2 +-
 man/pois_length_ll.Rd      | 2 +-
 man/pois_size_ll.Rd        | 2 +-
 man/rbinom_size.Rd         | 2 +-
 vignettes/introduction.Rmd | 8 ++++----
 7 files changed, 12 insertions(+), 12 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index f84b383f..cbcc04ec 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -1,7 +1,7 @@
 ##' Likelihood of the size of chains with Poisson offspring distribution
 ##'
 ##' @param x vector of sizes
-##' @param lambda rate of the Poisson distributino
+##' @param lambda rate of the Poisson distribution
 ##' @return log-likelihood values
 ##' @author Sebastian Funk
 ##' @keywords internal
@@ -57,7 +57,7 @@ gborel_size_ll <- function(x, size, prob, mu) {
 ##' Likelihood of the length of chains with Poisson offspring distribution
 ##'
 ##' @param x vector of sizes
-##' @param lambda rate of the Poisson distributino
+##' @param lambda rate of the Poisson distribution
 ##' @return log-likelihood values
 ##' @author Sebastian Funk
 ##' @keywords internal
@@ -99,7 +99,7 @@ geom_length_ll <- function(x, prob) {
 ##' @param x vector of sizes
 ##' @param nsim_offspring number of simulations of the offspring distribution
 ##'   for approximation the size/length distribution
-##' @param ... any paramaters to pass to \code{\link{chain_sim}}
+##' @param ... any parameters to pass to \code{\link{chain_sim}}
 ##' @return log-likelihood values
 ##' @author Sebastian Funk
 ##' @inheritParams chain_ll
diff --git a/R/utils.r b/R/utils.r
index 86fe3f7d..a1b74e9f 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -12,7 +12,7 @@ complementary_logprob <- function(x) {
 ##' Samples size (the number of trials) of a binomial distribution
 ##'
 ##' Samples the size parameter from the binomial distribution with fixed x
-##' (number of sucesses) and p (sucess probability)
+##' (number of successes) and p (success probability)
 ##' @param n number of samples to generate
 ##' @param x number of successes
 ##' @param prob probability of success
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index 19d8fee4..7bfe36c6 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -18,7 +18,7 @@ Poisson distributed offspring}
 \item{nsim_offspring}{number of simulations of the offspring distribution
 for approximation the size/length distribution}
 
-\item{...}{any paramaters to pass to \code{\link{chain_sim}}}
+\item{...}{any parameters to pass to \code{\link{chain_sim}}}
 }
 \value{
 log-likelihood values
diff --git a/man/pois_length_ll.Rd b/man/pois_length_ll.Rd
index 8bcf37d4..4a767a99 100644
--- a/man/pois_length_ll.Rd
+++ b/man/pois_length_ll.Rd
@@ -9,7 +9,7 @@ pois_length_ll(x, lambda)
 \arguments{
 \item{x}{vector of sizes}
 
-\item{lambda}{rate of the Poisson distributino}
+\item{lambda}{rate of the Poisson distribution}
 }
 \value{
 log-likelihood values
diff --git a/man/pois_size_ll.Rd b/man/pois_size_ll.Rd
index 19163265..931b1430 100644
--- a/man/pois_size_ll.Rd
+++ b/man/pois_size_ll.Rd
@@ -9,7 +9,7 @@ pois_size_ll(x, lambda)
 \arguments{
 \item{x}{vector of sizes}
 
-\item{lambda}{rate of the Poisson distributino}
+\item{lambda}{rate of the Poisson distribution}
 }
 \value{
 log-likelihood values
diff --git a/man/rbinom_size.Rd b/man/rbinom_size.Rd
index c50027b4..89b2e539 100644
--- a/man/rbinom_size.Rd
+++ b/man/rbinom_size.Rd
@@ -18,7 +18,7 @@ a sampled size
 }
 \description{
 Samples the size parameter from the binomial distribution with fixed x
-(number of sucesses) and p (sucess probability)
+(number of successes) and p (success probability)
 }
 \author{
 Sebastian Funk
diff --git a/vignettes/introduction.Rmd b/vignettes/introduction.Rmd
index 07e9cada..8a456e68 100644
--- a/vignettes/introduction.Rmd
+++ b/vignettes/introduction.Rmd
@@ -17,7 +17,7 @@ knitr::opts_chunk$set(
 )
 ```
 
-[bpmodels](https://github.com/sbfnk/bpmodels) is an `R` package to analyse and simulate the size and length of branching processes with an arbitrary offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks.
+[bpmodels](https://github.com/sbfnk/bpmodels) is an `R` package to analyse and simulate the size and length of branching processes with a given offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks.
 
 # Usage
 
@@ -29,7 +29,7 @@ library('bpmodels')
 suppressWarnings(library('bpmodels'))
 ```
 
-At the heart of the `bpmodels` package are the `chains_ll` and `chains_sim` functions. The `chains_ll` function calculates the log-likelihood of a distribution of chain sizes or lengths given an offspring distribution and associated parameters. For example, to get the log-likelihood for a given observed distribution of chain sizes assuming a mean number of 0.5 Poisson-distributed offspring per generation, use
+At the heart of the package are the `chains_ll` and `chains_sim` functions. The `chains_ll` function calculates the log-likelihood of a distribution of chain sizes or lengths given an offspring distribution and associated parameters. For example, to get the log-likelihood for a given observed distribution of chain sizes assuming a mean number of 0.5 Poisson-distributed offspring per generation, use
 
 ```{r}
 chain_sizes <- c(1,1,4,7) # example of observed chain sizes
@@ -38,7 +38,7 @@ chain_ll(chain_sizes, rpois, "size", lambda=0.5)
 
 The first argument of `chain_ll` is the size (or length) distribution to analyse. The second argument (called `offspring`) specifies the offspring distribution. This is given as a the function used to generate random offspring. It can be any probability distribution implemented in R, that is, one that has a corresponding function for generating random numbers beginning with the letter `r`. In the case of the example above, since random Poisson numbers are generated in R using a function called `rpois`, this is the function to pass as the `offspring` argument.
 
-The third argument (called `stat`) determines whether to analyse chain sizes ("size", the default if this argument is not specified) or lengths ("length"). Lastly, any named arguments not recognised by `chain_ll` are interpreted as parameters of the corresponding probability distribution, here `lambda=0.5` as the mean of the Poisson distribution (see the R help page for the Poisson distribution for more information).
+The third argument (called `stat`) determines whether to analyse chain sizes (`"size"`, the default if this argument is not specified) or lengths (`"length"`). Lastly, any named arguments not recognised by `chain_ll` are interpreted as parameters of the corresponding probability distribution, here `lambda=0.5` as the mean of the Poisson distribution (see the R help page for the Poisson distribution for more information).
 
 You can use the `R` help to find out about usage of the `chains_ll` function,
 
@@ -54,7 +54,7 @@ chain_sim(n=5, rpois, "size", lambda=0.5)
 
 # Methodology
 
-If the probability distribution of chain sizes or lengths has an analytical solution, this will be used (size distribution: Poisson and negative binomial; length distribution: Poisson and geometric). If not, simulations are used to approximate this probability distributions (using a linear approximation to the cumulative distribution for unobserved sizes/lengths), requiring an additional parameter `nsim_offspring` for the number of simulations to be used for this approximation. For example, to get offspring drawn from a binomial distribution with probability `p=0.5`.
+If the probability distribution of chain sizes or lengths has an analytical solution, this will be used (size distribution: Poisson and negative binomial; length distribution: Poisson and geometric). If not, simulations are used to approximate this probability distributions (using a linear approximation to the cumulative distribution for unobserved sizes/lengths), requiring an additional parameter `nsim_offspring` for the number of simulations to be used for this approximation. For example, to get offspring drawn from a binomial distribution with probability `prob=0.5`.
 
 ```{r}
 chain_ll(chain_sizes, rbinom, "size", size=1, prob=0.5, nsim_offspring=100)

From addfa091979f6195685b5932215a0c2c0c7319f6 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 16 Jan 2019 11:37:57 +0000
Subject: [PATCH 030/828] set seed in vignette

---
 vignettes/introduction.Rmd | 1 +
 1 file changed, 1 insertion(+)

diff --git a/vignettes/introduction.Rmd b/vignettes/introduction.Rmd
index 8a456e68..0a237859 100644
--- a/vignettes/introduction.Rmd
+++ b/vignettes/introduction.Rmd
@@ -27,6 +27,7 @@ library('bpmodels')
 ```
 ```{r echo=FALSE}
 suppressWarnings(library('bpmodels'))
+set.seed(13)
 ```
 
 At the heart of the package are the `chains_ll` and `chains_sim` functions. The `chains_ll` function calculates the log-likelihood of a distribution of chain sizes or lengths given an offspring distribution and associated parameters. For example, to get the log-likelihood for a given observed distribution of chain sizes assuming a mean number of 0.5 Poisson-distributed offspring per generation, use

From dd31949ae1af69d03cc4ef169fc1542ec5bb6a1a Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 17 Jan 2019 12:28:50 +0000
Subject: [PATCH 031/828] fix for obs_size with only one chain

---
 R/likelihoods.R | 1 +
 1 file changed, 1 insertion(+)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index cbcc04ec..9e60a688 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -150,6 +150,7 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
     }
     sampled_x <-
       replicate(nsim_obs, pmin(rbinom_size(length(x), x, obs_prob), infinite))
+    if (length(x) == 1) sampled_x <- matrix(sampled_x, nrow=1)
     size_x <- unlist(sampled_x)
     if (!is.finite(infinite)) infinite <- max(size_x) + 1
   } else {

From 67bc19d7158f06701e96ee55dd477927eea79a5b Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 17 Jan 2019 12:29:21 +0000
Subject: [PATCH 032/828] fix typo

---
 R/utils.r          | 2 +-
 man/rbinom_size.Rd | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/utils.r b/R/utils.r
index a1b74e9f..a03ffdd5 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -16,7 +16,7 @@ complementary_logprob <- function(x) {
 ##' @param n number of samples to generate
 ##' @param x number of successes
 ##' @param prob probability of success
-##' @return a sampled size
+##' @return sampled sizes
 ##' @author Sebastian Funk
 ##' @keywords internal
 rbinom_size <- function(n, x, prob) {
diff --git a/man/rbinom_size.Rd b/man/rbinom_size.Rd
index 89b2e539..5e19360d 100644
--- a/man/rbinom_size.Rd
+++ b/man/rbinom_size.Rd
@@ -14,7 +14,7 @@ rbinom_size(n, x, prob)
 \item{prob}{probability of success}
 }
 \value{
-a sampled size
+sampled sizes
 }
 \description{
 Samples the size parameter from the binomial distribution with fixed x

From 43d5c883666a35b51330381c006f524b235a0778 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 17 Jan 2019 12:29:34 +0000
Subject: [PATCH 033/828] imperfect observation for chain lengths

---
 R/likelihoods.R            |  9 +++++++--
 R/utils.r                  | 15 +++++++++++++++
 man/rgen_length.Rd         | 27 +++++++++++++++++++++++++++
 tests/testthat/tests-ll.r  |  2 ++
 vignettes/introduction.Rmd |  2 +-
 5 files changed, 52 insertions(+), 3 deletions(-)
 create mode 100644 man/rgen_length.Rd

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 9e60a688..5da0966c 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -148,8 +148,13 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
     if (missing(nsim_obs)) {
       stop("'nsim_obs' must be specified if 'obs_prob' is <1")
     }
+    if (stat=="size") {
+      sample_func <- rbinom_size
+    } else if (stat=="length"){
+      sample_func <- rgen_length
+    }
     sampled_x <-
-      replicate(nsim_obs, pmin(rbinom_size(length(x), x, obs_prob), infinite))
+      replicate(nsim_obs, pmin(sample_func(length(x), x, obs_prob), infinite))
     if (length(x) == 1) sampled_x <- matrix(sampled_x, nrow=1)
     size_x <- unlist(sampled_x)
     if (!is.finite(infinite)) infinite <- max(size_x) + 1
@@ -173,7 +178,7 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
   ## get likelihood function as given by `offspring` and `stat``
   likelihoods <- c()
   ## get offspring distribution by stripping first letter from offspring
-  ## function 
+  ## function
   offspring_dist <- sub("^.", "", deparse(substitute(offspring)))
   ll_func <- paste(offspring_dist, stat, "ll", sep="_")
   pars <- as.list(unlist(list(...))) ## converts vectors to lists
diff --git a/R/utils.r b/R/utils.r
index a03ffdd5..233e1b9d 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -22,3 +22,18 @@ complementary_logprob <- function(x) {
 rbinom_size <- function(n, x, prob) {
     x + stats::rnbinom(n, x, prob) + stats::rnbinom(n, 1, prob)
 }
+
+##' Samples chain lengths with given observation probabilities
+##'
+##' Samples the length of a transmission chain where each individual element is
+##' observed with binomial probability
+##' (number of successes) and p (success probability)
+##' @param n number of samples to generate
+##' @param x observed chain lengths
+##' @param prob probability of observation
+##' @return sampled lengths
+##' @author Sebastian Funk
+##' @keywords internal
+rgen_length <- function(n, x, prob) {
+    x + ceiling(log(stats::runif(n, 0, 1)) / log(1 - prob) - 1)
+}
diff --git a/man/rgen_length.Rd b/man/rgen_length.Rd
new file mode 100644
index 00000000..14ebbb17
--- /dev/null
+++ b/man/rgen_length.Rd
@@ -0,0 +1,27 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/utils.r
+\name{rgen_length}
+\alias{rgen_length}
+\title{Samples chain lengths with given observation probabilities}
+\usage{
+rgen_length(n, x, prob)
+}
+\arguments{
+\item{n}{number of samples to generate}
+
+\item{x}{observed chain lengths}
+
+\item{prob}{probability of observation}
+}
+\value{
+sampled lengths
+}
+\description{
+Samples the length of a transmission chain where each individual element is
+observed with binomial probability
+(number of successes) and p (success probability)
+}
+\author{
+Sebastian Funk
+}
+\keyword{internal}
diff --git a/tests/testthat/tests-ll.r b/tests/testthat/tests-ll.r
index 51c67a89..65d10719 100644
--- a/tests/testthat/tests-ll.r
+++ b/tests/testthat/tests-ll.r
@@ -9,6 +9,8 @@ test_that("Likelihoods can be calculated",
     expect_lt(chain_ll(chains, rpois, "size", lambda=0.5, infinite = 5), 0)
     expect_lt(chain_ll(chains, rpois, "size", lambda=0.5, obs_prob = 0.5,
                        nsim_obs=1), 0)
+    expect_lt(chain_ll(chains, rpois, "length", lambda=0.5, obs_prob = 0.5,
+                       nsim_obs=1), 0)
     expect_lt(chain_ll(chains, rpois, "size", lambda=0.5, infinite = 5,
                        obs_prob = 0.5, nsim_obs=1), 0)
     expect_lt(chain_ll(chains, rbinom, "size", size=1, prob=0.5), 0)
diff --git a/vignettes/introduction.Rmd b/vignettes/introduction.Rmd
index 0a237859..a56b810d 100644
--- a/vignettes/introduction.Rmd
+++ b/vignettes/introduction.Rmd
@@ -63,7 +63,7 @@ chain_ll(chain_sizes, rbinom, "size", size=1, prob=0.5, nsim_offspring=100)
 
 # Imperfect observations
 
-The `chain_ll` function has an `obs_prob` parameter that can be used to determine the likelihood if observations are imperfect. This only works when analysing chain sizes (`stat="size"`). In that case, true chain sizes are simulated repeatedly (the number of times given by the `nsim_obs` argument) and the likelihood calculated for each of these simulations. For example, if the probability of observing each case is 30%, use
+The `chain_ll` function has an `obs_prob` parameter that can be used to determine the likelihood if observations are imperfect. In that case, true chain sizes or lengths are simulated repeatedly (the number of times given by the `nsim_obs` argument) and the likelihood calculated for each of these simulations. For example, if the probability of observing each case is 30%, use
 
 ```{r}
 ll <- chain_ll(chain_sizes, rpois, "size", obs_prob = 0.3, lambda=0.5, nsim_obs=10)

From 42d742a691e9e3fdf49e8931f8863e0f0b337fa7 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Fri, 18 Jan 2019 07:53:01 +0000
Subject: [PATCH 034/828] simpler negative binomial sampling

---
 R/utils.r | 6 ++++--
 1 file changed, 4 insertions(+), 2 deletions(-)

diff --git a/R/utils.r b/R/utils.r
index 233e1b9d..3dd3cc87 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -20,7 +20,7 @@ complementary_logprob <- function(x) {
 ##' @author Sebastian Funk
 ##' @keywords internal
 rbinom_size <- function(n, x, prob) {
-    x + stats::rnbinom(n, x, prob) + stats::rnbinom(n, 1, prob)
+    x + stats::rnbinom(n, x + 1, prob) - 1
 }
 
 ##' Samples chain lengths with given observation probabilities
@@ -35,5 +35,7 @@ rbinom_size <- function(n, x, prob) {
 ##' @author Sebastian Funk
 ##' @keywords internal
 rgen_length <- function(n, x, prob) {
-    x + ceiling(log(stats::runif(n, 0, 1)) / log(1 - prob) - 1)
+    x +
+      ceiling(log(stats::runif(n, 0, 1)) / log(1 - prob) - 1) +
+      ceiling(log(stats::runif(n, 0, 1)) / log(1 - prob) - 1)
 }

From 444b4dc7659a8d57dccb342cabb01fa05dca94fa Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Fri, 18 Jan 2019 07:54:16 +0000
Subject: [PATCH 035/828] sample missed generations at the beginning and end of
 chains

---
 R/utils.r | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/utils.r b/R/utils.r
index 3dd3cc87..b5efaebc 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -26,8 +26,8 @@ rbinom_size <- function(n, x, prob) {
 ##' Samples chain lengths with given observation probabilities
 ##'
 ##' Samples the length of a transmission chain where each individual element is
-##' observed with binomial probability
-##' (number of successes) and p (success probability)
+##' observed with binomial probability (number of successes) and p (success
+##' probability)
 ##' @param n number of samples to generate
 ##' @param x observed chain lengths
 ##' @param prob probability of observation

From b8d4145b8709b37a12a9a24e94a2475ed9ffa89a Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Fri, 18 Jan 2019 08:09:47 +0000
Subject: [PATCH 036/828] rbinom_size fix (no need to subtract 1)

---
 R/utils.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/utils.r b/R/utils.r
index b5efaebc..015b091b 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -20,7 +20,7 @@ complementary_logprob <- function(x) {
 ##' @author Sebastian Funk
 ##' @keywords internal
 rbinom_size <- function(n, x, prob) {
-    x + stats::rnbinom(n, x + 1, prob) - 1
+    x + stats::rnbinom(n, x + 1, prob)
 }
 
 ##' Samples chain lengths with given observation probabilities

From 773048931d8d67f69eeb0ccf5c6b5db20db41bf5 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Tue, 22 Jan 2019 11:40:59 +0000
Subject: [PATCH 037/828] throw error for non-integer offspring distributions

---
 R/simulate.r               | 3 +++
 tests/testthat/tests-sim.r | 1 +
 2 files changed, 4 insertions(+)

diff --git a/R/simulate.r b/R/simulate.r
index 6ce66d1f..f5ede015 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -30,6 +30,9 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
         state <- 1
         while (state > 0 && state < infinite) {
             n_offspring <- sum(offspring(n=state, ...))
+            if (n_offspring %% 1 > 0) {
+                stop("Offspring distribution must return integers")
+            }
             if (stat=="size") {
                 stat_track <- stat_track + n_offspring
             } else if (stat=="length") {
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index c543e497..98eac794 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -11,4 +11,5 @@ test_that("Chains can be simulated",
 test_that("Errors are thrown",
 {
     expect_error(chain_sim(n=2, "dummy"), "is not a function")
+    expect_error(chain_sim(n=2, rlnorm, meanlog=log(1.6)), "integer")
 })

From bf2d11825b1f20b51244f2a264f209f5a0b6d26d Mon Sep 17 00:00:00 2001
From: "Zhian N. Kamvar" <zkamvar@gmail.com>
Date: Thu, 7 Feb 2019 12:40:46 +0900
Subject: [PATCH 038/828] pre-allocate dist

Growing a vector in R tends to be slow. This process will run faster due to the pre-allocation:

```r
Unit: relative
                                                                  expr      min       lq     mean   median       uq      max neval cld
            {     slow <- c()     for (i in seq(1e+06)) slow[i] <- 1 } 5.156572 5.071423 5.084367 5.023986 5.271332 3.261024   100   b
 {     fast <- integer(1e+06)     for (i in seq(1e+06)) fast[i] <- 1 } 1.000000 1.000000 1.000000 1.000000 1.000000 1.000000   100  a
```
---
 R/simulate.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index f5ede015..5237efc9 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -24,7 +24,7 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
     }
 
     ## next, simulate n chains
-    dist <- c()
+    dist <- integer(n)
     for (i in seq_len(n)) {
         stat_track <- 1 ## track length or size (depending on `stat`)
         state <- 1

From f65e40f11d739480818898029baaee186b107bfc Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 7 Feb 2019 08:14:22 +0000
Subject: [PATCH 039/828] vectorise chain simulations

---
 R/simulate.r | 46 ++++++++++++++++++++++++++++------------------
 1 file changed, 28 insertions(+), 18 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 5237efc9..4be6f8d6 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -23,27 +23,37 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
         stop("object passed as 'offspring' is not a function.")
     }
 
+    stat_track <- rep(1, n) ## track length or size (depending on `stat`)
+    n_offspring <- rep(1, n) ## current number of offspring
+    sim <- seq_len(n) ## track chains that are still being simulated
+
     ## next, simulate n chains
-    dist <- integer(n)
-    for (i in seq_len(n)) {
-        stat_track <- 1 ## track length or size (depending on `stat`)
-        state <- 1
-        while (state > 0 && state < infinite) {
-            n_offspring <- sum(offspring(n=state, ...))
-            if (n_offspring %% 1 > 0) {
-                stop("Offspring distribution must return integers")
-            }
-            if (stat=="size") {
-                stat_track <- stat_track + n_offspring
-            } else if (stat=="length") {
-                if (n_offspring > 0) stat_track <- stat_track + 1
-            }
-            state <- n_offspring
+    while (length(sim) > 0) {
+        ## simulate next generation
+        next_gen <- offspring(n=sum(n_offspring[sim]), ...)
+        if (any(next_gen %% 1 > 0)) {
+            stop("Offspring distribution must return integers")
+        }
+        ## record indices corresponding the number of offspring of last
+        ## iteration, for the tapply call below
+        indices <- rep(sim, n_offspring[sim])
+        ## initialise number of offspring
+        n_offspring <- rep(0, n)
+        ## assign offspring sum to indices still being simulated
+        n_offspring[sim] <- tapply(next_gen, indices, sum)
+        ## track size/length
+        if (stat=="size") {
+            stat_track <- stat_track + n_offspring
+        } else if (stat=="length") {
+            stat_track <- stat_track + pmin(1, n_offspring)
         }
-        if (state >= infinite) stat_track <- Inf
-        dist[i] <- stat_track
+        ## only continue to simulate chains that offspring and aren't of
+        ## infinite size/length
+        sim <- which(n_offspring > 0 & stat_track < infinite)
     }
 
-    return(dist)
+    stat_track[stat_track >= infinite] <- Inf
+
+    return(stat_track)
 }
 

From e22374d44a0ac007e76e1d4169118a0cc32d92e4 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 7 Feb 2019 08:18:58 +0000
Subject: [PATCH 040/828] documentation update

---
 R/utils.r          | 4 ++--
 man/rgen_length.Rd | 4 ++--
 2 files changed, 4 insertions(+), 4 deletions(-)

diff --git a/R/utils.r b/R/utils.r
index 015b091b..f5d85fce 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -26,8 +26,8 @@ rbinom_size <- function(n, x, prob) {
 ##' Samples chain lengths with given observation probabilities
 ##'
 ##' Samples the length of a transmission chain where each individual element is
-##' observed with binomial probability (number of successes) and p (success
-##' probability)
+##' observed with binomial probability with parameters n (number of successes)
+##' and p (success probability)
 ##' @param n number of samples to generate
 ##' @param x observed chain lengths
 ##' @param prob probability of observation
diff --git a/man/rgen_length.Rd b/man/rgen_length.Rd
index 14ebbb17..21a6359e 100644
--- a/man/rgen_length.Rd
+++ b/man/rgen_length.Rd
@@ -18,8 +18,8 @@ sampled lengths
 }
 \description{
 Samples the length of a transmission chain where each individual element is
-observed with binomial probability
-(number of successes) and p (success probability)
+observed with binomial probability with parameters n (number of successes)
+and p (success probability)
 }
 \author{
 Sebastian Funk

From f0480f0285650b8d3035ca9c5a3776926c87b2a6 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 7 Feb 2019 08:25:13 +0000
Subject: [PATCH 041/828] update DESCRIPTION and NEWS

---
 DESCRIPTION | 2 +-
 NEWS.md     | 4 ++++
 2 files changed, 5 insertions(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index bf73caf5..c8f4b3c1 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -1,7 +1,7 @@
 Package: bpmodels
 Version: 0.1.0
 Title: Analysing chain statistics using branching process models
-Authors@R: c(person("Sebastian", "Funk", email = "sebastian.funk@lshtm.ac.uk", role = c("aut", "cre")))
+Authors@R: c(person("Sebastian", "Funk", email = "sebastian.funk@lshtm.ac.uk", role = c("aut", "cre")), person("Zhian N.", "Kamvar", email = "zkamvar@gmail.com", role = c("ctb")))
 Description: Provides methods to analyse and simulate the size and length of branching processes with an arbitrary offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks, as discussed in Farrington et al. (2003) <doi:10.1093/biostatistics/4.2.279>.
 Suggests: 
     testthat,
diff --git a/NEWS.md b/NEWS.md
index 1d3b8bbd..0e74623f 100644
--- a/NEWS.md
+++ b/NEWS.md
@@ -1,3 +1,7 @@
+# bpmodels 0.1.9999
+
+* faster, vectorised chain simulations
+
 # bpmodels 0.1.0
 
 * initial release

From 466bcbd48f0d3641e74e1fab78b79e4722a6ec82 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 7 Feb 2019 10:36:49 +0000
Subject: [PATCH 042/828] simulate trees

---
 R/simulate.r               | 50 ++++++++++++++++++++++++++++++++------
 tests/testthat/tests-sim.r |  2 ++
 2 files changed, 45 insertions(+), 7 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 4be6f8d6..5709ac26 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -7,14 +7,18 @@
 ##' @param stat statistic to calculate ("size" or "length" of chains)
 ##' @param infinite a size or length from which the size/length is to be
 ##'     considered infinite
+##' @param tree return the tree of infectors
 ##' @param ... parameters of the offspring distribution
-##' @return a vector of sizes/lengths
+##' @return a vector of sizes/lengths (if \code{tree==FALSE}), or a data frame
+##'     with columns `n` (simulation ID), `id` (a unique ID within each
+##'     simulation for each individual element of the chain), `ancestor` (the ID
+##'     of the ancestor of each element) and `generation`.
 ##' @author Sebastian Funk
 ##' @export
 ##' @examples
 ##' chain_sim(n=5, rpois, "size", lambda=0.5)
 chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
-                      ...) {
+                      tree=FALSE, ...) {
 
     stat <- match.arg(stat)
 
@@ -27,6 +31,17 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
     n_offspring <- rep(1, n) ## current number of offspring
     sim <- seq_len(n) ## track chains that are still being simulated
 
+    ## initialise data frame to hold the trees
+    if (tree) {
+        generation <- 1L
+        tdf <-
+            data.frame(n=seq_len(n),
+                       id=1L,
+                       ancestor=NA_integer_,
+                       generation=generation)
+        ancestor_ids <- rep(1, n)
+    }
+
     ## next, simulate n chains
     while (length(sim) > 0) {
         ## simulate next generation
@@ -34,26 +49,47 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
         if (any(next_gen %% 1 > 0)) {
             stop("Offspring distribution must return integers")
         }
-        ## record indices corresponding the number of offspring of last
-        ## iteration, for the tapply call below
+
+        ## record indices corresponding the number of offspring
         indices <- rep(sim, n_offspring[sim])
+
         ## initialise number of offspring
         n_offspring <- rep(0, n)
         ## assign offspring sum to indices still being simulated
         n_offspring[sim] <- tapply(next_gen, indices, sum)
+
         ## track size/length
         if (stat=="size") {
             stat_track <- stat_track + n_offspring
         } else if (stat=="length") {
             stat_track <- stat_track + pmin(1, n_offspring)
         }
+
+        ## record ancestors (if tree==TRUE)
+        if (tree && sum(n_offspring[sim]) > 0) {
+            ancestors <- rep(ancestor_ids, next_gen)
+            ids <- ancestors + unlist(lapply(n_offspring[sim], seq_len))
+            generation <- generation + 1L
+            ## record indices corresponding the number of offspring
+            new_df <-
+                data.frame(n=rep(sim, n_offspring[sim]),
+                           id=ids,
+                           ancestor=ancestors,
+                           generation=generation)
+            tdf <- rbind(tdf, new_df)
+        }
+
         ## only continue to simulate chains that offspring and aren't of
         ## infinite size/length
         sim <- which(n_offspring > 0 & stat_track < infinite)
+        if (tree) ancestor_ids <- unlist(lapply(n_offspring[sim], seq_len))
     }
 
-    stat_track[stat_track >= infinite] <- Inf
-
-    return(stat_track)
+    if (tree) {
+        return(tdf)
+    } else {
+        stat_track[stat_track >= infinite] <- Inf
+        return(stat_track)
+    }
 }
 
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 98eac794..240b743d 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -4,6 +4,8 @@ test_that("Chains can be simulated",
 {
     expect_length(chain_sim(n=2, rpois, lambda=0.5), 2)
     expect_length(chain_sim(n=10, rpois, "length", lambda=0.9), 10)
+    expect_true(is.data.frame(chain_sim(n=10, rpois, lambda=2, tree=TRUE,
+                                        infinite=10)))
     expect_false(any(is.finite(chain_sim(n=2, rpois, "length", lambda=0.5,
                                          infinite=1))))
 })

From b0f022515271fd38cfd90cbf47314700619a8123 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 7 Feb 2019 12:43:43 +0000
Subject: [PATCH 043/828] update `chain_sim` documentation

---
 man/chain_sim.Rd | 9 +++++++--
 1 file changed, 7 insertions(+), 2 deletions(-)

diff --git a/man/chain_sim.Rd b/man/chain_sim.Rd
index 9030121b..c7c2de75 100644
--- a/man/chain_sim.Rd
+++ b/man/chain_sim.Rd
@@ -5,7 +5,7 @@
 \title{Simulate chains using a branching process}
 \usage{
 chain_sim(n, offspring, stat = c("size", "length"), infinite = Inf,
-  ...)
+  tree = FALSE, ...)
 }
 \arguments{
 \item{n}{number of simulations to run.}
@@ -19,10 +19,15 @@ Poisson distributed offspring}
 \item{infinite}{a size or length from which the size/length is to be
 considered infinite}
 
+\item{tree}{return the tree of infectors}
+
 \item{...}{parameters of the offspring distribution}
 }
 \value{
-a vector of sizes/lengths
+a vector of sizes/lengths (if \code{tree==FALSE}), or a data frame
+    with columns `n` (simulation ID), `id` (a unique ID within each
+    simulation for each individual element of the chain), `ancestor` (the ID
+    of the ancestor of each element) and `generation`.
 }
 \description{
 Simulate chains using a branching process

From bd05e51aef62ae4a7ade3f58b27050dd555dea0f Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 7 Feb 2019 13:23:22 +0000
Subject: [PATCH 044/828] tree simulations: fix IDs

---
 R/simulate.r | 10 +++++++---
 1 file changed, 7 insertions(+), 3 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 5709ac26..fdffe01a 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -40,6 +40,7 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
                        ancestor=NA_integer_,
                        generation=generation)
         ancestor_ids <- rep(1, n)
+        current_max_id <- rep(1, n)
     }
 
     ## next, simulate n chains
@@ -68,11 +69,14 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
         ## record ancestors (if tree==TRUE)
         if (tree && sum(n_offspring[sim]) > 0) {
             ancestors <- rep(ancestor_ids, next_gen)
-            ids <- ancestors + unlist(lapply(n_offspring[sim], seq_len))
+            current_max_id <- unname(tapply(ancestor_ids, indices, max))
+            indices <- rep(sim, n_offspring[sim])
+            ids <- rep(current_max_id, n_offspring[sim]) +
+                unlist(lapply(n_offspring[sim], seq_len))
             generation <- generation + 1L
             ## record indices corresponding the number of offspring
             new_df <-
-                data.frame(n=rep(sim, n_offspring[sim]),
+                data.frame(n=indices,
                            id=ids,
                            ancestor=ancestors,
                            generation=generation)
@@ -82,7 +86,7 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
         ## only continue to simulate chains that offspring and aren't of
         ## infinite size/length
         sim <- which(n_offspring > 0 & stat_track < infinite)
-        if (tree) ancestor_ids <- unlist(lapply(n_offspring[sim], seq_len))
+        if (tree) ancestor_ids <- ids[indices %in% sim]
     }
 
     if (tree) {

From 16dfadf691b171a090f84980d4139b8caa1ceec2 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 7 Mar 2019 13:31:11 +0000
Subject: [PATCH 045/828] improved handling of excluded sizes

---
 R/likelihoods.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 5da0966c..bd908666 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -167,7 +167,7 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
   if (any(size_x == infinite)) {
     calc_sizes <- seq_len(infinite-1)
   } else {
-    calc_sizes <- unique(size_x)
+    calc_sizes <- unique(c(size_x, exclude))
   }
 
   ## get random function as given by `offspring`

From 09ef8cbec7aeeb81287a91c9a9e6efb9d2c82eb9 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 7 Mar 2019 13:31:31 +0000
Subject: [PATCH 046/828] get the correct function name, even in optim etc.

---
 R/likelihoods.R |  4 ++--
 R/utils.r       | 18 ++++++++++++++++++
 2 files changed, 20 insertions(+), 2 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index bd908666..758b7626 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -139,7 +139,7 @@ offspring_ll <- function(x, offspring, stat, nsim_offspring=100, ...) {
 ##' chain_sizes <- c(1,1,4,7) # example of observed chain sizes
 ##' chain_ll(chain_sizes, rpois, "size", lambda=0.5)
 chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
-                     infinite = Inf, exclude, nsim_obs, ...) {
+                     infinite = Inf, exclude=c(), nsim_obs, ...) {
   stat <- match.arg(stat)
 
   ## checks
@@ -179,7 +179,7 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
   likelihoods <- c()
   ## get offspring distribution by stripping first letter from offspring
   ## function
-  offspring_dist <- sub("^.", "", deparse(substitute(offspring)))
+  offspring_dist <- sub("^.", "", find_function_name(offspring))
   ll_func <- paste(offspring_dist, stat, "ll", sep="_")
   pars <- as.list(unlist(list(...))) ## converts vectors to lists
 
diff --git a/R/utils.r b/R/utils.r
index f5d85fce..9f912db2 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -39,3 +39,21 @@ rgen_length <- function(n, x, prob) {
       ceiling(log(stats::runif(n, 0, 1)) / log(1 - prob) - 1) +
       ceiling(log(stats::runif(n, 0, 1)) / log(1 - prob) - 1)
 }
+
+##' Finds the name of a function passed as an argument
+##'
+##' This works even when a function is passed multiple times (e.g., when used
+##' inside an \code{\link{optim}} call).
+##' See https://stackoverflow.com/a/46740314/10886760
+##' @param fun function of which the name is to be determined
+##' @return function name
+##' @author Sebastian Funk
+##' @keywords internal
+find_function_name <- function(fun) {
+  objects <- ls(envir = environment(fun))
+  for (i in objects) {
+    if (identical(fun, get(i, envir = environment(fun)))) {
+      return(i)
+    }
+  }
+}

From d512afbe8555eb44cc52fd6125ce2c91bcccc96b Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Sat, 17 Aug 2019 17:46:20 +0100
Subject: [PATCH 047/828] exclude sizes in likelihood as desired

---
 R/likelihoods.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 758b7626..88064f11 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -207,10 +207,10 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
   ## adjust for binomial observation probabilities
   if (obs_prob < 1) {
     chains_likelihood <- apply(sampled_x, 2, function(sx) {
-      sum(likelihoods[sx])
+      sum(likelihoods[sx[!(sx %in% exclude)]])
     })
   } else {
-    chains_likelihood <- sum(likelihoods[x])
+    chains_likelihood <- sum(likelihoods[x[!(x %in% exclude)]])
   }
 
   return(chains_likelihood)

From 3617f47fe57b9aa551af8fe1314ca0ae7027237a Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Mon, 19 Aug 2019 14:31:33 +0100
Subject: [PATCH 048/828] Likelihood: allow passing string for offspring

This is useful if the random-number-generating function does not exist, but a
closed form does
---
 R/likelihoods.R           | 24 +++++++++++++++++++-----
 man/chain_ll.Rd           |  6 ++----
 man/find_function_name.Rd | 23 +++++++++++++++++++++++
 man/offspring_ll.Rd       |  4 +---
 tests/testthat/tests-ll.r |  5 +++++
 5 files changed, 50 insertions(+), 12 deletions(-)
 create mode 100644 man/find_function_name.Rd

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 88064f11..f61c5e97 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -122,6 +122,7 @@ offspring_ll <- function(x, offspring, stat, nsim_offspring=100, ...) {
 ##' Likelihood for the outcome of a branching process
 ##'
 ##' @param x vector of sizes or lengths of transmission chains
+##' @param offspring offspring distribution: either a function (e.g., \code{rpois} for Poisson) or a character string (e.g., "pois" for Poisson)
 ##' @param stat statistic given as \code{x} ("size" or "length" of chains)
 ##' @param obs_prob observation probability (assumed constant)
 ##' @param infinite any chains of this size/length will be treated as infinite
@@ -143,6 +144,9 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
   stat <- match.arg(stat)
 
   ## checks
+  if (!is.function(offspring) && !is.character(offspring)) {
+    stop("object passed as 'offspring' is not a function or character string.")
+  }
   if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
   if (obs_prob < 1) {
     if (missing(nsim_obs)) {
@@ -171,15 +175,16 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
   }
 
   ## get random function as given by `offspring`
-  if (!is.function(offspring)) {
-    stop("object passed as 'offspring' is not a function.")
+  if (is.character(offspring)) {
+    offspring_dist <- offspring
+  } else {
+    ## get offspring distribution by stripping first letter from offspring
+    ## function
+    offspring_dist <- sub("^.", "", find_function_name(offspring))
   }
 
   ## get likelihood function as given by `offspring` and `stat``
   likelihoods <- c()
-  ## get offspring distribution by stripping first letter from offspring
-  ## function
-  offspring_dist <- sub("^.", "", find_function_name(offspring))
   ll_func <- paste(offspring_dist, stat, "ll", sep="_")
   pars <- as.list(unlist(list(...))) ## converts vectors to lists
 
@@ -188,6 +193,15 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
     func <- get(ll_func)
     likelihoods[calc_sizes] <- do.call(func, c(list(x=calc_sizes), pars))
   } else {
+    if (is.character(offspring)) {
+      roffspring_name <- paste0("r", offspring)
+      if (exists(roffspring_name)) {
+        offspring <- get(roffspring_name)
+        if (!is.function(offspring)) stop(roffspring_name, " is not a function.")
+      } else {
+        stop("Function ", roffspring_name, " does not exist.")
+      }
+    }
     likelihoods[calc_sizes] <-
       do.call(offspring_ll,
               c(list(x=calc_sizes, offspring=offspring,
diff --git a/man/chain_ll.Rd b/man/chain_ll.Rd
index 0ed5fa56..a1619dc8 100644
--- a/man/chain_ll.Rd
+++ b/man/chain_ll.Rd
@@ -5,14 +5,12 @@
 \title{Likelihood for the outcome of a branching process}
 \usage{
 chain_ll(x, offspring, stat = c("size", "length"), obs_prob = 1,
-  infinite = Inf, exclude, nsim_obs, ...)
+  infinite = Inf, exclude = c(), nsim_obs, ...)
 }
 \arguments{
 \item{x}{vector of sizes or lengths of transmission chains}
 
-\item{offspring}{offspring distribution, given as the function used to
-generate the number of offspring in each generation, e.g. `rpois` for
-Poisson distributed offspring}
+\item{offspring}{offspring distribution: either a function (e.g., \code{rpois} for Poisson) or a character string (e.g., "pois" for Poisson)}
 
 \item{stat}{statistic given as \code{x} ("size" or "length" of chains)}
 
diff --git a/man/find_function_name.Rd b/man/find_function_name.Rd
new file mode 100644
index 00000000..d330baed
--- /dev/null
+++ b/man/find_function_name.Rd
@@ -0,0 +1,23 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/utils.r
+\name{find_function_name}
+\alias{find_function_name}
+\title{Finds the name of a function passed as an argument}
+\usage{
+find_function_name(fun)
+}
+\arguments{
+\item{fun}{function of which the name is to be determined}
+}
+\value{
+function name
+}
+\description{
+This works even when a function is passed multiple times (e.g., when used
+inside an \code{\link{optim}} call).
+See https://stackoverflow.com/a/46740314/10886760
+}
+\author{
+Sebastian Funk
+}
+\keyword{internal}
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index 7bfe36c6..cc55a913 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -9,9 +9,7 @@ offspring_ll(x, offspring, stat, nsim_offspring = 100, ...)
 \arguments{
 \item{x}{vector of sizes}
 
-\item{offspring}{offspring distribution, given as the function used to
-generate the number of offspring in each generation, e.g. `rpois` for
-Poisson distributed offspring}
+\item{offspring}{offspring distribution: either a function (e.g., \code{rpois} for Poisson) or a character string (e.g., "pois" for Poisson)}
 
 \item{stat}{statistic given as \code{x} ("size" or "length" of chains)}
 
diff --git a/tests/testthat/tests-ll.r b/tests/testthat/tests-ll.r
index 65d10719..cff7b997 100644
--- a/tests/testthat/tests-ll.r
+++ b/tests/testthat/tests-ll.r
@@ -1,6 +1,7 @@
 context("Calculating the likelihood from a branching process model")
 
 chains <- c(1,1,4,7)
+rtest <- "test"
 
 test_that("Likelihoods can be calculated",
 {
@@ -30,6 +31,10 @@ test_that("Analytical size/length distributions are implemented",
 test_that("Errors are thrown",
 {
     expect_error(chain_ll(chains, "dummy", "size", lambda=0.5),
+                 "does not exist")
+    expect_error(chain_ll(chains, list(), "size", lambda=0.5),
+                 "not a function or")
+    expect_error(chain_ll(chains, "test", "size", lambda=0.5),
                  "not a function")
     expect_error(chain_ll(chains, rpois, "size", lambda=0.5, obs_prob = 3),
                  "must be within")

From 4311dd222df3498b1de9f3fa10ba4938485271e6 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Mon, 19 Aug 2019 17:39:19 +0100
Subject: [PATCH 049/828] likelihoods: don't allow passing offspring functions

This is really, really slow
---
 R/borel.r                  |  2 +-
 R/likelihoods.R            | 52 +++++++++++++-------------------------
 R/simulate.r               | 19 +++++++++-----
 man/chain_ll.Rd            | 12 ++++++---
 man/chain_sim.Rd           |  8 +++---
 man/offspring_ll.Rd        |  4 ++-
 tests/testthat/tests-ll.r  | 25 ++++++++----------
 tests/testthat/tests-sim.r | 12 ++++-----
 vignettes/introduction.Rmd | 10 ++++----
 9 files changed, 67 insertions(+), 77 deletions(-)

diff --git a/R/borel.r b/R/borel.r
index 9e97df5d..56dc4331 100644
--- a/R/borel.r
+++ b/R/borel.r
@@ -22,5 +22,5 @@ dborel <- function(x, mu, log=FALSE) {
 ##' @return vector of random numbers
 ##' @author Sebastian Funk
 rborel <- function(n, mu, infinite=Inf) {
-    chain_sim(n, stats::rpois, "size", infinite=infinite, lambda=mu)
+    chain_sim(n, "pois", "size", infinite=infinite, lambda=mu)
 }
diff --git a/R/likelihoods.R b/R/likelihoods.R
index f61c5e97..bdebd49a 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -122,15 +122,15 @@ offspring_ll <- function(x, offspring, stat, nsim_offspring=100, ...) {
 ##' Likelihood for the outcome of a branching process
 ##'
 ##' @param x vector of sizes or lengths of transmission chains
-##' @param offspring offspring distribution: either a function (e.g., \code{rpois} for Poisson) or a character string (e.g., "pois" for Poisson)
 ##' @param stat statistic given as \code{x} ("size" or "length" of chains)
 ##' @param obs_prob observation probability (assumed constant)
 ##' @param infinite any chains of this size/length will be treated as infinite
 ##' @param exclude any sizes/lengths to exclude from the likelihood calculation
+##' @param individual if TRUE, a vector of individual log-likelihood contributions will be returned rather than the sum
 ##' @param nsim_obs number of simulations if the likelihood is to be
 ##'   approximated for imperfect observations
 ##' @param ... parameters for the offspring distribution
-##' @return likelihood
+##' @return likelihood, or vector of likelihoods (if \code{obs_prob} < 1), or a list of individual likelihood contributions (if \code{individual=TRUE})
 ##' @inheritParams chain_sim
 ##' @seealso pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll
 ##'   geom_length_ll offspring_ll
@@ -138,14 +138,14 @@ offspring_ll <- function(x, offspring, stat, nsim_offspring=100, ...) {
 ##' @export
 ##' @examples
 ##' chain_sizes <- c(1,1,4,7) # example of observed chain sizes
-##' chain_ll(chain_sizes, rpois, "size", lambda=0.5)
+##' chain_ll(chain_sizes, "pois", "size", lambda=0.5)
 chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
-                     infinite = Inf, exclude=c(), nsim_obs, ...) {
+                     infinite = Inf, exclude=c(), individual=FALSE, nsim_obs, ...) {
   stat <- match.arg(stat)
 
   ## checks
-  if (!is.function(offspring) && !is.character(offspring)) {
-    stop("object passed as 'offspring' is not a function or character string.")
+  if (!is.character(offspring)) {
+    stop("object passed as 'offspring' is not a character string.")
   }
   if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
   if (obs_prob < 1) {
@@ -158,13 +158,13 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
       sample_func <- rgen_length
     }
     sampled_x <-
-      replicate(nsim_obs, pmin(sample_func(length(x), x, obs_prob), infinite))
-    if (length(x) == 1) sampled_x <- matrix(sampled_x, nrow=1)
+      replicate(nsim_obs, pmin(sample_func(length(x), x, obs_prob), infinite), simplify = FALSE)
     size_x <- unlist(sampled_x)
     if (!is.finite(infinite)) infinite <- max(size_x) + 1
   } else {
     x[x >= infinite] <- infinite
     size_x <- x
+    sampled_x <- list(x)
   }
 
   ## determine for which sizes to calculate the likelihood (for true chain size)
@@ -174,18 +174,9 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
     calc_sizes <- unique(c(size_x, exclude))
   }
 
-  ## get random function as given by `offspring`
-  if (is.character(offspring)) {
-    offspring_dist <- offspring
-  } else {
-    ## get offspring distribution by stripping first letter from offspring
-    ## function
-    offspring_dist <- sub("^.", "", find_function_name(offspring))
-  }
-
   ## get likelihood function as given by `offspring` and `stat``
   likelihoods <- c()
-  ll_func <- paste(offspring_dist, stat, "ll", sep="_")
+  ll_func <- paste(offspring, stat, "ll", sep="_")
   pars <- as.list(unlist(list(...))) ## converts vectors to lists
 
   ## calculate likelihoods
@@ -193,15 +184,6 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
     func <- get(ll_func)
     likelihoods[calc_sizes] <- do.call(func, c(list(x=calc_sizes), pars))
   } else {
-    if (is.character(offspring)) {
-      roffspring_name <- paste0("r", offspring)
-      if (exists(roffspring_name)) {
-        offspring <- get(roffspring_name)
-        if (!is.function(offspring)) stop(roffspring_name, " is not a function.")
-      } else {
-        stop("Function ", roffspring_name, " does not exist.")
-      }
-    }
     likelihoods[calc_sizes] <-
       do.call(offspring_ll,
               c(list(x=calc_sizes, offspring=offspring,
@@ -216,17 +198,19 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
   if (!missing(exclude)) {
     likelihoods <- likelihoods - log(-expm1(sum(likelihoods[exclude])))
     likelihoods[exclude] <- -Inf
-  }
 
-  ## adjust for binomial observation probabilities
-  if (obs_prob < 1) {
-    chains_likelihood <- apply(sampled_x, 2, function(sx) {
-      sum(likelihoods[sx[!(sx %in% exclude)]])
+    sampled_x <- lapply(sampled_x, function(y) {
+      y[!(y %in% exclude)]
     })
-  } else {
-    chains_likelihood <- sum(likelihoods[x[!(x %in% exclude)]])
   }
 
+  ## assign likelihoods
+  chains_likelihood <- lapply(sampled_x, function(sx) {
+    likelihoods[sx[!(sx %in% exclude)]]
+  })
+
+  if (!individual) chains_likelihood <- vapply(chains_likelihood, sum, 0)
+
   return(chains_likelihood)
 }
 
diff --git a/R/simulate.r b/R/simulate.r
index fdffe01a..1d041eda 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,9 +1,9 @@
 ##' Simulate chains using a branching process
 ##'
 ##' @param n number of simulations to run.
-##' @param offspring offspring distribution, given as the function used to
-##'     generate the number of offspring in each generation, e.g. `rpois` for
-##'     Poisson distributed offspring
+##' @param offspring offspring distribution: a character string corresponding to
+##'   the R distribution function (e.g., "pois" for Poisson, where
+##'   \code{\link{rpois}} is the R function to generate Poisson random numbers) 
 ##' @param stat statistic to calculate ("size" or "length" of chains)
 ##' @param infinite a size or length from which the size/length is to be
 ##'     considered infinite
@@ -16,15 +16,20 @@
 ##' @author Sebastian Funk
 ##' @export
 ##' @examples
-##' chain_sim(n=5, rpois, "size", lambda=0.5)
+##' chain_sim(n=5, "pois", "size", lambda=0.5)
 chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
                       tree=FALSE, ...) {
 
     stat <- match.arg(stat)
 
     ## first, get random function as given by `offspring`
-    if (!is.function(offspring)) {
-        stop("object passed as 'offspring' is not a function.")
+    if (!is.character(offspring)) {
+        stop("object passed as 'offspring' is not a character string.")
+    }
+
+    roffspring_name <- paste0("r", offspring)
+    if (!(exists(roffspring_name)) || !is.function(get(roffspring_name))) {
+        stop("Function ", roffspring_name, " does not exist.")
     }
 
     stat_track <- rep(1, n) ## track length or size (depending on `stat`)
@@ -46,7 +51,7 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
     ## next, simulate n chains
     while (length(sim) > 0) {
         ## simulate next generation
-        next_gen <- offspring(n=sum(n_offspring[sim]), ...)
+        next_gen <- get(roffspring_name)(n=sum(n_offspring[sim]), ...)
         if (any(next_gen %% 1 > 0)) {
             stop("Offspring distribution must return integers")
         }
diff --git a/man/chain_ll.Rd b/man/chain_ll.Rd
index a1619dc8..82276f74 100644
--- a/man/chain_ll.Rd
+++ b/man/chain_ll.Rd
@@ -5,12 +5,14 @@
 \title{Likelihood for the outcome of a branching process}
 \usage{
 chain_ll(x, offspring, stat = c("size", "length"), obs_prob = 1,
-  infinite = Inf, exclude = c(), nsim_obs, ...)
+  infinite = Inf, exclude = c(), individual = FALSE, nsim_obs, ...)
 }
 \arguments{
 \item{x}{vector of sizes or lengths of transmission chains}
 
-\item{offspring}{offspring distribution: either a function (e.g., \code{rpois} for Poisson) or a character string (e.g., "pois" for Poisson)}
+\item{offspring}{offspring distribution: a character string corresponding to
+the R distribution function (e.g., "pois" for Poisson, where
+\code{\link{rpois}} is the R function to generate Poisson random numbers)}
 
 \item{stat}{statistic given as \code{x} ("size" or "length" of chains)}
 
@@ -20,20 +22,22 @@ chain_ll(x, offspring, stat = c("size", "length"), obs_prob = 1,
 
 \item{exclude}{any sizes/lengths to exclude from the likelihood calculation}
 
+\item{individual}{if TRUE, a vector of individual log-likelihood contributions will be returned rather than the sum}
+
 \item{nsim_obs}{number of simulations if the likelihood is to be
 approximated for imperfect observations}
 
 \item{...}{parameters for the offspring distribution}
 }
 \value{
-likelihood
+likelihood, or vector of likelihoods (if \code{obs_prob} < 1), or a list of individual likelihood contributions (if \code{individual=TRUE})
 }
 \description{
 Likelihood for the outcome of a branching process
 }
 \examples{
 chain_sizes <- c(1,1,4,7) # example of observed chain sizes
-chain_ll(chain_sizes, rpois, "size", lambda=0.5)
+chain_ll(chain_sizes, "pois", "size", lambda=0.5)
 }
 \seealso{
 pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll
diff --git a/man/chain_sim.Rd b/man/chain_sim.Rd
index c7c2de75..e1a546d5 100644
--- a/man/chain_sim.Rd
+++ b/man/chain_sim.Rd
@@ -10,9 +10,9 @@ chain_sim(n, offspring, stat = c("size", "length"), infinite = Inf,
 \arguments{
 \item{n}{number of simulations to run.}
 
-\item{offspring}{offspring distribution, given as the function used to
-generate the number of offspring in each generation, e.g. `rpois` for
-Poisson distributed offspring}
+\item{offspring}{offspring distribution: a character string corresponding to
+the R distribution function (e.g., "pois" for Poisson, where
+\code{\link{rpois}} is the R function to generate Poisson random numbers)}
 
 \item{stat}{statistic to calculate ("size" or "length" of chains)}
 
@@ -33,7 +33,7 @@ a vector of sizes/lengths (if \code{tree==FALSE}), or a data frame
 Simulate chains using a branching process
 }
 \examples{
-chain_sim(n=5, rpois, "size", lambda=0.5)
+chain_sim(n=5, "pois", "size", lambda=0.5)
 }
 \author{
 Sebastian Funk
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index cc55a913..260f36cd 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -9,7 +9,9 @@ offspring_ll(x, offspring, stat, nsim_offspring = 100, ...)
 \arguments{
 \item{x}{vector of sizes}
 
-\item{offspring}{offspring distribution: either a function (e.g., \code{rpois} for Poisson) or a character string (e.g., "pois" for Poisson)}
+\item{offspring}{offspring distribution: a character string corresponding to
+the R distribution function (e.g., "pois" for Poisson, where
+\code{\link{rpois}} is the R function to generate Poisson random numbers)}
 
 \item{stat}{statistic given as \code{x} ("size" or "length" of chains)}
 
diff --git a/tests/testthat/tests-ll.r b/tests/testthat/tests-ll.r
index cff7b997..7f2f638f 100644
--- a/tests/testthat/tests-ll.r
+++ b/tests/testthat/tests-ll.r
@@ -1,20 +1,19 @@
 context("Calculating the likelihood from a branching process model")
 
 chains <- c(1,1,4,7)
-rtest <- "test"
 
 test_that("Likelihoods can be calculated",
 {
-    expect_lt(chain_ll(chains, rpois, "size", lambda=0.5), 0)
-    expect_lt(chain_ll(chains, rpois, "size", lambda=0.5, exclude=1), 0)
-    expect_lt(chain_ll(chains, rpois, "size", lambda=0.5, infinite = 5), 0)
-    expect_lt(chain_ll(chains, rpois, "size", lambda=0.5, obs_prob = 0.5,
+    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5), 0)
+    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5, exclude=1), 0)
+    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5, infinite = 5), 0)
+    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5, obs_prob = 0.5,
                        nsim_obs=1), 0)
-    expect_lt(chain_ll(chains, rpois, "length", lambda=0.5, obs_prob = 0.5,
+    expect_lt(chain_ll(chains, "pois", "length", lambda=0.5, obs_prob = 0.5,
                        nsim_obs=1), 0)
-    expect_lt(chain_ll(chains, rpois, "size", lambda=0.5, infinite = 5,
+    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5, infinite = 5,
                        obs_prob = 0.5, nsim_obs=1), 0)
-    expect_lt(chain_ll(chains, rbinom, "size", size=1, prob=0.5), 0)
+    expect_lt(chain_ll(chains, "binom", "size", size=1, prob=0.5), 0)
 })
 
 test_that("Analytical size/length distributions are implemented",
@@ -30,15 +29,11 @@ test_that("Analytical size/length distributions are implemented",
 
 test_that("Errors are thrown",
 {
-    expect_error(chain_ll(chains, "dummy", "size", lambda=0.5),
-                 "does not exist")
     expect_error(chain_ll(chains, list(), "size", lambda=0.5),
-                 "not a function or")
-    expect_error(chain_ll(chains, "test", "size", lambda=0.5),
-                 "not a function")
-    expect_error(chain_ll(chains, rpois, "size", lambda=0.5, obs_prob = 3),
+                 "not a character")
+    expect_error(chain_ll(chains, "pois", "size", lambda=0.5, obs_prob = 3),
                  "must be within")
-    expect_error(chain_ll(chains, rpois, "size", lambda=0.5, obs_prob = 0.5),
+    expect_error(chain_ll(chains, "pois", "size", lambda=0.5, obs_prob = 0.5),
                  "must be specified")
     expect_error(nbinom_size_ll(chains, mu=0.5, size=0.2, prob=0.1),
                  "both specified")
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 240b743d..a30e2f7a 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -2,16 +2,16 @@ context("Simulating from a branching process model")
 
 test_that("Chains can be simulated",
 {
-    expect_length(chain_sim(n=2, rpois, lambda=0.5), 2)
-    expect_length(chain_sim(n=10, rpois, "length", lambda=0.9), 10)
-    expect_true(is.data.frame(chain_sim(n=10, rpois, lambda=2, tree=TRUE,
+    expect_length(chain_sim(n=2, "pois", lambda=0.5), 2)
+    expect_length(chain_sim(n=10, "pois", "length", lambda=0.9), 10)
+    expect_true(is.data.frame(chain_sim(n=10, "pois", lambda=2, tree=TRUE,
                                         infinite=10)))
-    expect_false(any(is.finite(chain_sim(n=2, rpois, "length", lambda=0.5,
+    expect_false(any(is.finite(chain_sim(n=2, "pois", "length", lambda=0.5,
                                          infinite=1))))
 })
 
 test_that("Errors are thrown",
 {
-    expect_error(chain_sim(n=2, "dummy"), "is not a function")
-    expect_error(chain_sim(n=2, rlnorm, meanlog=log(1.6)), "integer")
+    expect_error(chain_sim(n=2, "dummy"), "does not exist")
+    expect_error(chain_sim(n=2, "lnorm", meanlog=log(1.6)), "integer")
 })
diff --git a/vignettes/introduction.Rmd b/vignettes/introduction.Rmd
index a56b810d..5d3e0c67 100644
--- a/vignettes/introduction.Rmd
+++ b/vignettes/introduction.Rmd
@@ -34,10 +34,10 @@ At the heart of the package are the `chains_ll` and `chains_sim` functions. The
 
 ```{r}
 chain_sizes <- c(1,1,4,7) # example of observed chain sizes
-chain_ll(chain_sizes, rpois, "size", lambda=0.5)
+chain_ll(chain_sizes, "pois", "size", lambda=0.5)
 ```
 
-The first argument of `chain_ll` is the size (or length) distribution to analyse. The second argument (called `offspring`) specifies the offspring distribution. This is given as a the function used to generate random offspring. It can be any probability distribution implemented in R, that is, one that has a corresponding function for generating random numbers beginning with the letter `r`. In the case of the example above, since random Poisson numbers are generated in R using a function called `rpois`, this is the function to pass as the `offspring` argument.
+The first argument of `chain_ll` is the size (or length) distribution to analyse. The second argument (called `offspring`) specifies the offspring distribution. This is given as a the function used to generate random offspring. It can be any probability distribution implemented in R, that is, one that has a corresponding function for generating random numbers beginning with the letter `r`. In the case of the example above, since random Poisson numbers are generated in R using a function called `rpois`, the string to pass to the `offspring` argument is `"pois"`.
 
 The third argument (called `stat`) determines whether to analyse chain sizes (`"size"`, the default if this argument is not specified) or lengths (`"length"`). Lastly, any named arguments not recognised by `chain_ll` are interpreted as parameters of the corresponding probability distribution, here `lambda=0.5` as the mean of the Poisson distribution (see the R help page for the Poisson distribution for more information).
 
@@ -50,7 +50,7 @@ You can use the `R` help to find out about usage of the `chains_ll` function,
 To simulate from a branching process, use the `chain_sim` function, which follows the same syntax as the `chain_ll` function:
 
 ```{r}
-chain_sim(n=5, rpois, "size", lambda=0.5)
+chain_sim(n=5, "pois", "size", lambda=0.5)
 ```
 
 # Methodology
@@ -58,7 +58,7 @@ chain_sim(n=5, rpois, "size", lambda=0.5)
 If the probability distribution of chain sizes or lengths has an analytical solution, this will be used (size distribution: Poisson and negative binomial; length distribution: Poisson and geometric). If not, simulations are used to approximate this probability distributions (using a linear approximation to the cumulative distribution for unobserved sizes/lengths), requiring an additional parameter `nsim_offspring` for the number of simulations to be used for this approximation. For example, to get offspring drawn from a binomial distribution with probability `prob=0.5`.
 
 ```{r}
-chain_ll(chain_sizes, rbinom, "size", size=1, prob=0.5, nsim_offspring=100)
+chain_ll(chain_sizes, "binom", "size", size=1, prob=0.5, nsim_offspring=100)
 ```
 
 # Imperfect observations
@@ -66,7 +66,7 @@ chain_ll(chain_sizes, rbinom, "size", size=1, prob=0.5, nsim_offspring=100)
 The `chain_ll` function has an `obs_prob` parameter that can be used to determine the likelihood if observations are imperfect. In that case, true chain sizes or lengths are simulated repeatedly (the number of times given by the `nsim_obs` argument) and the likelihood calculated for each of these simulations. For example, if the probability of observing each case is 30%, use
 
 ```{r}
-ll <- chain_ll(chain_sizes, rpois, "size", obs_prob = 0.3, lambda=0.5, nsim_obs=10)
+ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda=0.5, nsim_obs=10)
 summary(ll)
 ```
 

From fd5c0d854755ccdce518f1fb556e3fef245fef3e Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Fri, 24 Jan 2020 09:59:55 +0000
Subject: [PATCH 050/828] simulate serial intervals

---
 R/simulate.r | 73 ++++++++++++++++++++++++++++++++++++++++------------
 1 file changed, 56 insertions(+), 17 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 1d041eda..85715f34 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -8,17 +8,26 @@
 ##' @param infinite a size or length from which the size/length is to be
 ##'     considered infinite
 ##' @param tree return the tree of infectors
+##' @param serial the serial interval; a function that takes one parameter
+##' (`n`), the number of serial intervals to randomly sample; if this parameter
+##'   is set, `chain_sim` returns times of infection, too; implies (`tree`=TRUE)
+##' @param t0 start time (if serial interval is given); either a single value (0
+##'     by default for all simulations, or a vector of length `n` with initial
+##'     times) 
+##' @param tf end time (if serial interval is given)
 ##' @param ... parameters of the offspring distribution
-##' @return a vector of sizes/lengths (if \code{tree==FALSE}), or a data frame
-##'     with columns `n` (simulation ID), `id` (a unique ID within each
-##'     simulation for each individual element of the chain), `ancestor` (the ID
-##'     of the ancestor of each element) and `generation`.
+##' @return a vector of sizes/lengths (if \code{tree==FALSE} and no serial
+##'   interval given), or a data frame with columns `n` (simulation ID), `time`
+##'   (if the serial interval is given) and (if \code{tree==TRUE}) `id` (a
+##'   unique ID within each simulation for each individual element of the
+##'   chain), `ancestor` (the ID of the ancestor of each element) and
+##'   `generation`. 
 ##' @author Sebastian Funk
 ##' @export
 ##' @examples
 ##' chain_sim(n=5, "pois", "size", lambda=0.5)
 chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
-                      tree=FALSE, ...) {
+                      tree = FALSE, serial, init_time, t0 = 0, tf = Inf, ...) {
 
     stat <- match.arg(stat)
 
@@ -32,6 +41,18 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
         stop("Function ", roffspring_name, " does not exist.")
     }
 
+    if (!missing(serial)) {
+        if (!is.function(serial)) {
+            stop("The `serial` argument must be a function.")
+        }
+        if (!missing(tree) && tree == FALSE) {
+            stop("The `serial` argument can't be used with `tree==FALSE`.")
+        }
+        tree <- TRUE
+    } else if (!missing(tf)) {
+        stop("The `tf` argument needs a `serial` argument.")
+    }
+
     stat_track <- rep(1, n) ## track length or size (depending on `stat`)
     n_offspring <- rep(1, n) ## current number of offspring
     sim <- seq_len(n) ## track chains that are still being simulated
@@ -40,12 +61,16 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
     if (tree) {
         generation <- 1L
         tdf <-
-            data.frame(n=seq_len(n),
-                       id=1L,
-                       ancestor=NA_integer_,
-                       generation=generation)
+            data.frame(n = seq_len(n),
+                       id = 1L,
+                       ancestor = NA_integer_,
+                       generation = generation)
+
         ancestor_ids <- rep(1, n)
-        current_max_id <- rep(1, n)
+        if (!missing(serial)) {
+            tdf$time <- t0
+            times <- tdf$time
+        }
     }
 
     ## next, simulate n chains
@@ -71,7 +96,7 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
             stat_track <- stat_track + pmin(1, n_offspring)
         }
 
-        ## record ancestors (if tree==TRUE)
+        ## record times/ancestors (if tree==TRUE)
         if (tree && sum(n_offspring[sim]) > 0) {
             ancestors <- rep(ancestor_ids, next_gen)
             current_max_id <- unname(tapply(ancestor_ids, indices, max))
@@ -79,22 +104,36 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
             ids <- rep(current_max_id, n_offspring[sim]) +
                 unlist(lapply(n_offspring[sim], seq_len))
             generation <- generation + 1L
-            ## record indices corresponding the number of offspring
             new_df <-
-                data.frame(n=indices,
-                           id=ids,
-                           ancestor=ancestors,
-                           generation=generation)
+                data.frame(n = indices,
+                           id = ids,
+                           ancestor = ancestors,
+                           generation = generation)
+            if (!missing(serial)) {
+                times <- rep(times, next_gen) + serial(sum(n_offspring))
+                current_min_time <- unname(tapply(times, indices, min))
+                new_df$time <- times
+            }
             tdf <- rbind(tdf, new_df)
         }
 
         ## only continue to simulate chains that offspring and aren't of
         ## infinite size/length
         sim <- which(n_offspring > 0 & stat_track < infinite)
-        if (tree) ancestor_ids <- ids[indices %in% sim]
+        if (tree) {
+            if (!missing(serial)) {
+                sim <- sim[current_min_time < tf]
+                times <- times[indices %in% sim]
+            }
+            ancestor_ids <- ids[indices %in% sim]
+        }
     }
 
     if (tree) {
+        if (!missing(tf)) {
+            tdf <- tdf[tdf$time < tf, ]
+        }
+        rownames(tdf) <- NULL
         return(tdf)
     } else {
         stat_track[stat_track >= infinite] <- Inf

From e84891ed1d49552b09e962c860bd0100fde5ccb2 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Fri, 24 Jan 2020 13:31:45 +0000
Subject: [PATCH 051/828] fix bug in `infinite` argument

---
 R/simulate.r | 7 +++++--
 1 file changed, 5 insertions(+), 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 85715f34..605f62d3 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -27,7 +27,7 @@
 ##' @examples
 ##' chain_sim(n=5, "pois", "size", lambda=0.5)
 chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
-                      tree = FALSE, serial, init_time, t0 = 0, tf = Inf, ...) {
+                      tree = FALSE, serial, t0 = 0, tf = Inf, ...) {
 
     stat <- match.arg(stat)
 
@@ -120,9 +120,12 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
         ## only continue to simulate chains that offspring and aren't of
         ## infinite size/length
         sim <- which(n_offspring > 0 & stat_track < infinite)
+        if (!missing(serial)) {
+            ## only continue to simulate chains that don't go beyond tf
+            sim <- intersect(sim, unique(indices)[current_min_time < tf])
+        }
         if (tree) {
             if (!missing(serial)) {
-                sim <- sim[current_min_time < tf]
                 times <- times[indices %in% sim]
             }
             ancestor_ids <- ids[indices %in% sim]

From e7c6d6ff8a3b0c99f16ce0ab3281954d2d96532b Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Fri, 24 Jan 2020 13:31:57 +0000
Subject: [PATCH 052/828] roxygen update

---
 DESCRIPTION      |  2 +-
 man/chain_ll.Rd  | 13 +++++++++++--
 man/chain_sim.Rd | 33 +++++++++++++++++++++++++++------
 3 files changed, 39 insertions(+), 9 deletions(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index c8f4b3c1..faa891a9 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -11,5 +11,5 @@ Suggests:
 License: GPL-3
 URL: https://github.com/sbfnk/bpmodels
 BugReports: https://github.com/sbfnk/bpmodels
-RoxygenNote: 6.1.1
+RoxygenNote: 7.0.2
 VignetteBuilder: knitr
diff --git a/man/chain_ll.Rd b/man/chain_ll.Rd
index 82276f74..8dda43e3 100644
--- a/man/chain_ll.Rd
+++ b/man/chain_ll.Rd
@@ -4,8 +4,17 @@
 \alias{chain_ll}
 \title{Likelihood for the outcome of a branching process}
 \usage{
-chain_ll(x, offspring, stat = c("size", "length"), obs_prob = 1,
-  infinite = Inf, exclude = c(), individual = FALSE, nsim_obs, ...)
+chain_ll(
+  x,
+  offspring,
+  stat = c("size", "length"),
+  obs_prob = 1,
+  infinite = Inf,
+  exclude = c(),
+  individual = FALSE,
+  nsim_obs,
+  ...
+)
 }
 \arguments{
 \item{x}{vector of sizes or lengths of transmission chains}
diff --git a/man/chain_sim.Rd b/man/chain_sim.Rd
index e1a546d5..08593cab 100644
--- a/man/chain_sim.Rd
+++ b/man/chain_sim.Rd
@@ -4,8 +4,17 @@
 \alias{chain_sim}
 \title{Simulate chains using a branching process}
 \usage{
-chain_sim(n, offspring, stat = c("size", "length"), infinite = Inf,
-  tree = FALSE, ...)
+chain_sim(
+  n,
+  offspring,
+  stat = c("size", "length"),
+  infinite = Inf,
+  tree = FALSE,
+  serial,
+  t0 = 0,
+  tf = Inf,
+  ...
+)
 }
 \arguments{
 \item{n}{number of simulations to run.}
@@ -21,13 +30,25 @@ considered infinite}
 
 \item{tree}{return the tree of infectors}
 
+\item{serial}{the serial interval; a function that takes one parameter
+(`n`), the number of serial intervals to randomly sample; if this parameter
+  is set, `chain_sim` returns times of infection, too; implies (`tree`=TRUE)}
+
+\item{t0}{start time (if serial interval is given); either a single value (0
+by default for all simulations, or a vector of length `n` with initial
+times)}
+
+\item{tf}{end time (if serial interval is given)}
+
 \item{...}{parameters of the offspring distribution}
 }
 \value{
-a vector of sizes/lengths (if \code{tree==FALSE}), or a data frame
-    with columns `n` (simulation ID), `id` (a unique ID within each
-    simulation for each individual element of the chain), `ancestor` (the ID
-    of the ancestor of each element) and `generation`.
+a vector of sizes/lengths (if \code{tree==FALSE} and no serial
+  interval given), or a data frame with columns `n` (simulation ID), `time`
+  (if the serial interval is given) and (if \code{tree==TRUE}) `id` (a
+  unique ID within each simulation for each individual element of the
+  chain), `ancestor` (the ID of the ancestor of each element) and
+  `generation`.
 }
 \description{
 Simulate chains using a branching process

From aadf40ee16bf8f788040bc19dd9bf68d991cea9d Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Fri, 24 Jan 2020 14:18:56 +0000
Subject: [PATCH 053/828] fix for error where 'current_min_time' is not found

---
 R/simulate.r | 16 +++++++++-------
 1 file changed, 9 insertions(+), 7 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 605f62d3..89f62cde 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -120,15 +120,17 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
         ## only continue to simulate chains that offspring and aren't of
         ## infinite size/length
         sim <- which(n_offspring > 0 & stat_track < infinite)
-        if (!missing(serial)) {
-            ## only continue to simulate chains that don't go beyond tf
-            sim <- intersect(sim, unique(indices)[current_min_time < tf])
-        }
-        if (tree) {
+        if (length(sim) > 0) {
             if (!missing(serial)) {
-                times <- times[indices %in% sim]
+                ## only continue to simulate chains that don't go beyond tf
+                sim <- intersect(sim, unique(indices)[current_min_time < tf])
+            }
+            if (tree) {
+                if (!missing(serial)) {
+                    times <- times[indices %in% sim]
+                }
+                ancestor_ids <- ids[indices %in% sim]
             }
-            ancestor_ids <- ids[indices %in% sim]
         }
     }
 

From 6de58a2fe7c24541f488fe58b0a2dfe04f45b58f Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Tue, 28 Jan 2020 10:00:28 +0000
Subject: [PATCH 054/828] rendered vignette

---
 vignettes/introduction.md | 95 +++++++++++++++++++++++++++++++++++++++
 1 file changed, 95 insertions(+)
 create mode 100644 vignettes/introduction.md

diff --git a/vignettes/introduction.md b/vignettes/introduction.md
new file mode 100644
index 00000000..641d09d0
--- /dev/null
+++ b/vignettes/introduction.md
@@ -0,0 +1,95 @@
+[bpmodels](https://github.com/sbfnk/bpmodels) is an `R` package to
+analyse and simulate the size and length of branching processes with a
+given offspring distribution. These can be used, for example, to analyse
+the distribution of chain sizes or length of infectious disease
+outbreaks.
+
+Usage
+=====
+
+To load the package, use
+
+    library('bpmodels')
+
+At the heart of the package are the `chains_ll` and `chains_sim`
+functions. The `chains_ll` function calculates the log-likelihood of a
+distribution of chain sizes or lengths given an offspring distribution
+and associated parameters. For example, to get the log-likelihood for a
+given observed distribution of chain sizes assuming a mean number of 0.5
+Poisson-distributed offspring per generation, use
+
+    chain_sizes <- c(1,1,4,7) # example of observed chain sizes
+    chain_ll(chain_sizes, "pois", "size", lambda=0.5)
+    #> [1] -8.607196
+
+The first argument of `chain_ll` is the size (or length) distribution to
+analyse. The second argument (called `offspring`) specifies the
+offspring distribution. This is given as a the function used to generate
+random offspring. It can be any probability distribution implemented in
+R, that is, one that has a corresponding function for generating random
+numbers beginning with the letter `r`. In the case of the example above,
+since random Poisson numbers are generated in R using a function called
+`rpois`, the string to pass to the `offspring` argument is `"pois"`.
+
+The third argument (called `stat`) determines whether to analyse chain
+sizes (`"size"`, the default if this argument is not specified) or
+lengths (`"length"`). Lastly, any named arguments not recognised by
+`chain_ll` are interpreted as parameters of the corresponding
+probability distribution, here `lambda=0.5` as the mean of the Poisson
+distribution (see the R help page for the Poisson distribution for more
+information).
+
+You can use the `R` help to find out about usage of the `chains_ll`
+function,
+
+    ?chains_ll
+
+To simulate from a branching process, use the `chain_sim` function,
+which follows the same syntax as the `chain_ll` function:
+
+    chain_sim(n=5, "pois", "size", lambda=0.5)
+    #> [1] 2 1 1 1 5
+
+Methodology
+===========
+
+If the probability distribution of chain sizes or lengths has an
+analytical solution, this will be used (size distribution: Poisson and
+negative binomial; length distribution: Poisson and geometric). If not,
+simulations are used to approximate this probability distributions
+(using a linear approximation to the cumulative distribution for
+unobserved sizes/lengths), requiring an additional parameter
+`nsim_offspring` for the number of simulations to be used for this
+approximation. For example, to get offspring drawn from a binomial
+distribution with probability `prob=0.5`.
+
+    chain_ll(chain_sizes, "binom", "size", size=1, prob=0.5, nsim_offspring=100)
+    #> [1] -8.477588
+
+Imperfect observations
+======================
+
+The `chain_ll` function has an `obs_prob` parameter that can be used to
+determine the likelihood if observations are imperfect. In that case,
+true chain sizes or lengths are simulated repeatedly (the number of
+times given by the `nsim_obs` argument) and the likelihood calculated
+for each of these simulations. For example, if the probability of
+observing each case is 30%, use
+
+    ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda=0.5, nsim_obs=10)
+    summary(ll)
+    #>    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
+    #>  -35.30  -25.68  -23.23  -24.19  -20.89  -18.91
+
+This returns `nsim_obs=10` likelihood values which can be averaged to
+come up with an overall likelihood estimate.
+
+References
+==========
+
+-   Farrington, C.P., Kanaan, M.N. and Gay, N.J. (2003). [Branching
+    process models for surveillance of infectious diseases controlled by
+    mass vaccination](https://doi.org/10.1093/biostatistics/4.2.279).
+-   Blumberg, S. and Lloyd-Smith, J.O. (2013). [Comparing methods for
+    estimating R0 from the size distribution of subcritical transmission
+    chains](https://doi.org/10.1016/j.epidem.2013.05.002).

From f39cc7fd243bca0cb49a77a5122dbd0b6fa9b0dc Mon Sep 17 00:00:00 2001
From: ffinger <12323626+ffinger@users.noreply.github.com>
Date: Wed, 4 Mar 2020 21:32:54 +0100
Subject: [PATCH 055/828] simulator function accounting for susceptible
 depletion

corrected error in time checking

corrected bug

minor fixes

changed function name

update doc
---
 DESCRIPTION               |  10 ++-
 NAMESPACE                 |   2 +
 R/globals.R               |   2 +
 R/simulate_susceptibles.R | 166 ++++++++++++++++++++++++++++++++++++++
 R/utils.r                 |  23 +++++-
 man/chain_ll.Rd           |  13 +--
 man/chain_sim.Rd          |  13 +--
 man/chain_sim_susc.Rd     |  56 +++++++++++++
 man/rbinom_size.Rd        |  12 ++-
 man/rnbinom_mean_disp.Rd  |  29 +++++++
 10 files changed, 300 insertions(+), 26 deletions(-)
 create mode 100644 R/globals.R
 create mode 100644 R/simulate_susceptibles.R
 create mode 100644 man/chain_sim_susc.Rd
 create mode 100644 man/rnbinom_mean_disp.Rd

diff --git a/DESCRIPTION b/DESCRIPTION
index faa891a9..df1508bb 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -1,13 +1,19 @@
 Package: bpmodels
 Version: 0.1.0
 Title: Analysing chain statistics using branching process models
-Authors@R: c(person("Sebastian", "Funk", email = "sebastian.funk@lshtm.ac.uk", role = c("aut", "cre")), person("Zhian N.", "Kamvar", email = "zkamvar@gmail.com", role = c("ctb")))
+Authors@R: c(
+    person("Sebastian", "Funk", email = "sebastian.funk@lshtm.ac.uk", role = c("aut", "cre")),
+    person("Zhian N.", "Kamvar", email = "zkamvar@gmail.com", role = c("ctb")),
+    person("Flavio", "Finger", email = "flavio.finger@epicentre.msf.org", role = c("aut"))
+    )
 Description: Provides methods to analyse and simulate the size and length of branching processes with an arbitrary offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks, as discussed in Farrington et al. (2003) <doi:10.1093/biostatistics/4.2.279>.
 Suggests: 
     testthat,
     knitr,
     rmarkdown,
-    covr
+    covr,
+    extraDistr,
+    truncdist
 License: GPL-3
 URL: https://github.com/sbfnk/bpmodels
 BugReports: https://github.com/sbfnk/bpmodels
diff --git a/NAMESPACE b/NAMESPACE
index ddaddf44..1b73b8d9 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -2,3 +2,5 @@
 
 export(chain_ll)
 export(chain_sim)
+export(chain_sim_susc)
+export(rnbinom_mean_disp)
diff --git a/R/globals.R b/R/globals.R
new file mode 100644
index 00000000..75781a1a
--- /dev/null
+++ b/R/globals.R
@@ -0,0 +1,2 @@
+## avoid "no visible bindings" warning for dplyr verbs
+utils::globalVariables(c("generation", "time", "id"))
\ No newline at end of file
diff --git a/R/simulate_susceptibles.R b/R/simulate_susceptibles.R
new file mode 100644
index 00000000..df7cb793
--- /dev/null
+++ b/R/simulate_susceptibles.R
@@ -0,0 +1,166 @@
+##' Simulate a single chain using a branching process while accounting
+##' for depletion of susceptibles.
+##'
+##' @param offspring offspring distribution: a character string corresponding to
+##'   the R distribution function. Currently only "pois" & "nbinom" are
+##'   supported. Internally truncated distributions are used to avoid infecting
+##'   more people than susceptibles available.
+##' @param mn_offspring the average number of secondary cases for each case
+##' @param disp_offspring the dispersion coefficient (var/mean) of the number of
+##'      secondary cases. Ignored if offspring == "pois". Must be > 1.
+##' @param serial the serial interval. A function that takes one parameter
+##'     (`n`), the number of serial intervals to randomly sample.
+##'     Value must be >= 0.
+##' @param t0 start time
+##' @param tf end time
+##' @param pop the population
+##' @param initial_immune the number of initial immunes in the population
+##' @return a data frame with columns `time`, `id` (a unique ID for each
+##'     individual element of the chain), `ancestor` (the ID of the ancestor
+##'      of each element), and `generation`.
+##'
+##' @details This function has a couple of key differences with chain_sim:
+##'     it can only simulate one chain at a time,
+##'     it can only handle implemented offspring distributions
+##'         ("pois" and "nbinom"),
+##'     it always tracks and returns a data frame containing the entire tree,
+##'     the maximal length of chains is limited with pop instead of infinite.
+##'
+##' @author Flavio Finger
+##' @export
+##' @examples
+##' chain_sim_susc("pois", mn_offspring=0.5, serial = function(x) 3, pop = 100)
+chain_sim_susc <- function(
+    offspring = c("pois", "nbinom"),
+    mn_offspring,
+    disp_offspring,
+    serial,
+    t0 = 0,
+    tf = Inf,
+    pop,
+    initial_immune = 0
+) {
+
+    offspring <- match.arg(offspring)
+
+    if (missing(pop)) {
+        stop("Argument pop required.")
+    }
+
+    if (missing(mn_offspring)) {
+        stop("Argument mn_offspring reequired.")
+    }
+
+    if (offspring == "pois") {
+        if (!missing(disp_offspring)) {
+            warning("argument disp_offspring not used for
+                poisson offspring distribution.")
+        }
+
+        ## using a right truncated poisson distribution
+        ## to avoid more cases than susceptibles
+        offspring_fun <- function(n, susc) {
+            extraDistr::rtpois(
+                n,
+                lambda = mn_offspring * susc / pop,
+                b = susc)
+            }
+
+    } else if (offspring  == "nbinom") {
+
+        if (missing(disp_offspring) | disp_offspring <= 1) { ## dispersion index
+            stop("Offspring distribution 'nbinom' requires argument
+                disp_offspring > 1. Use 'pois' if there is no overdispersion.")
+        }
+
+        offspring_fun <- function(n, susc) {
+            ## get distribution params from mean and dispersion
+            ## see ?rnbinom for parameter definition
+            new_mn <- mn_offspring * susc / pop ##apply susceptibility
+            size <- new_mn / (disp_offspring - 1)
+
+            ## using a right truncated nbinom distribution
+            ## to avoid more cases than susceptibles
+            truncdist::rtrunc(
+                n,
+                spec = "nbinom",
+                b = susc,
+                mu = new_mn,
+                size = size)
+        }
+    }
+
+    ## initializations
+    tdf <- data.frame(
+        id = 1L,
+        ancestor = NA_integer_,
+        generation = 1L,
+        time = t0,
+        offspring_generated = FALSE
+    )
+
+    susc <- pop - initial_immune - 1L
+    t <- t0
+
+    ## continue if any unsimulated has t <= tf
+    ## AND there is still susceptibles left
+    while (
+        any(tdf$time[!tdf$offspring_generated] <= tf) &
+        susc > 0
+        ) {
+
+        ## select from which case to generate offspring
+        t <- min(tdf$time[!tdf$offspring_generated]) #lowest unsimulated t
+
+        ## index of the first in df with t, extract vars
+        idx <- which(tdf$time == t & !tdf$offspring_generated)[1]
+        id_parent <- tdf$id[idx]
+        t_parent <- tdf$time[idx]
+        gen_parent <- tdf$generation[idx]
+
+        ## generate it
+        current_max_id <- max(tdf$id)
+        n_offspring <- offspring_fun(1, susc)
+
+        if (n_offspring %% 1 > 0) {
+            stop("Offspring distribution must return integers")
+        }
+
+        ## mark as done
+        tdf$offspring_generated[idx] <- TRUE
+
+        ## add to df
+        if (n_offspring > 0) {
+            ## draw times
+            new_times <- serial(n_offspring)
+
+            if (any(new_times < 0)) {
+                stop("Serial interval must be >= 0.")
+            }
+
+            new_df <- data.frame(
+                id = current_max_id + seq_len(n_offspring),
+                time = new_times + t_parent,
+                ancestor = id_parent,
+                generation = gen_parent + 1L,
+                offspring_generated = FALSE
+            )
+
+            ## add new cases to tdf
+            tdf <- rbind(tdf, new_df)
+        }
+
+        ## adjust susceptibles
+        susc <- susc - n_offspring
+    }
+
+    ## remove cases with time > tf that could
+    ## have been generated in the last generation
+    tdf <- tdf[tdf$time <= tf, ]
+
+    ## sort output and remove columns not needed
+    tdf <- tdf[order(tdf$time, tdf$id), ]
+    tdf$offspring_generated <- NULL
+
+    return(tdf)
+}
\ No newline at end of file
diff --git a/R/utils.r b/R/utils.r
index 9f912db2..7e9f6f46 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -13,7 +13,13 @@ complementary_logprob <- function(x) {
 ##'
 ##' Samples the size parameter from the binomial distribution with fixed x
 ##' (number of successes) and p (success probability)
-##' @param n number of samples to generate
+##' @param n number of samples to generate##'      secondary cases. Ignored if offspring == "pois". Must be > 1.
+##' @param serial the serial interval. A function that takes one parameter
+##'     (`n`), the number of serial intervals to randomly sample.
+##'     Value must be >= 0.
+##' @param t0 start time
+##' @param tf end time
+##' @param pop the population
 ##' @param x number of successes
 ##' @param prob probability of success
 ##' @return sampled sizes
@@ -57,3 +63,18 @@ find_function_name <- function(fun) {
     }
   }
 }
+
+##' Negative binomial random numbers parametrized
+##' in terms of mean and dispersion coefficient
+##' @param n number of samples to draw
+##' @param mn mean of distribution
+##' @param disp dispersion coefficient (var/mean)
+##' @return vector containing the random numbers
+##' @author Flavio Finger
+##' @export
+##' @examples
+##' rnbinom_mean_disp(n = 5, mn = 4, disp = 2)
+rnbinom_mean_disp <- function(n, mn, disp) {
+  size <- mn / (disp - 1)
+  stats::rnbinom(n, size = size, mu = mn)
+  }
\ No newline at end of file
diff --git a/man/chain_ll.Rd b/man/chain_ll.Rd
index 8dda43e3..82276f74 100644
--- a/man/chain_ll.Rd
+++ b/man/chain_ll.Rd
@@ -4,17 +4,8 @@
 \alias{chain_ll}
 \title{Likelihood for the outcome of a branching process}
 \usage{
-chain_ll(
-  x,
-  offspring,
-  stat = c("size", "length"),
-  obs_prob = 1,
-  infinite = Inf,
-  exclude = c(),
-  individual = FALSE,
-  nsim_obs,
-  ...
-)
+chain_ll(x, offspring, stat = c("size", "length"), obs_prob = 1,
+  infinite = Inf, exclude = c(), individual = FALSE, nsim_obs, ...)
 }
 \arguments{
 \item{x}{vector of sizes or lengths of transmission chains}
diff --git a/man/chain_sim.Rd b/man/chain_sim.Rd
index 08593cab..5d2cf74d 100644
--- a/man/chain_sim.Rd
+++ b/man/chain_sim.Rd
@@ -4,17 +4,8 @@
 \alias{chain_sim}
 \title{Simulate chains using a branching process}
 \usage{
-chain_sim(
-  n,
-  offspring,
-  stat = c("size", "length"),
-  infinite = Inf,
-  tree = FALSE,
-  serial,
-  t0 = 0,
-  tf = Inf,
-  ...
-)
+chain_sim(n, offspring, stat = c("size", "length"), infinite = Inf,
+  tree = FALSE, serial, t0 = 0, tf = Inf, ...)
 }
 \arguments{
 \item{n}{number of simulations to run.}
diff --git a/man/chain_sim_susc.Rd b/man/chain_sim_susc.Rd
new file mode 100644
index 00000000..09b5858b
--- /dev/null
+++ b/man/chain_sim_susc.Rd
@@ -0,0 +1,56 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/simulate_susceptibles.R
+\name{chain_sim_susc}
+\alias{chain_sim_susc}
+\title{Simulate a single chain using a branching process while accounting
+for depletion of susceptibles.}
+\usage{
+chain_sim_susc(offspring = c("pois", "nbinom"), mn_offspring,
+  disp_offspring, serial, t0 = 0, tf = Inf, pop, initial_immune = 0)
+}
+\arguments{
+\item{offspring}{offspring distribution: a character string corresponding to
+the R distribution function. Currently only "pois" & "nbinom" are
+supported. Internally truncated distributions are used to avoid infecting
+more people than susceptibles available.}
+
+\item{mn_offspring}{the average number of secondary cases for each case}
+
+\item{disp_offspring}{the dispersion coefficient (var/mean) of the number of
+secondary cases. Ignored if offspring == "pois". Must be > 1.}
+
+\item{serial}{the serial interval. A function that takes one parameter
+(`n`), the number of serial intervals to randomly sample.
+Value must be >= 0.}
+
+\item{t0}{start time}
+
+\item{tf}{end time}
+
+\item{pop}{the population}
+
+\item{initial_immune}{the number of initial immunes in the population}
+}
+\value{
+a data frame with columns `time`, `id` (a unique ID for each
+    individual element of the chain), `ancestor` (the ID of the ancestor
+     of each element), and `generation`.
+}
+\description{
+Simulate a single chain using a branching process while accounting
+for depletion of susceptibles.
+}
+\details{
+This function has a couple of key differences with chain_sim:
+    it can only simulate one chain at a time,
+    it can only handle implemented offspring distributions
+        ("pois" and "nbinom"),
+    it always tracks and returns a data frame containing the entire tree,
+    the maximal length of chains is limited with pop instead of infinite.
+}
+\examples{
+chain_sim_susc("pois", mn_offspring=0.5, serial = function(x) 3, pop = 100)
+}
+\author{
+Flavio Finger
+}
diff --git a/man/rbinom_size.Rd b/man/rbinom_size.Rd
index 5e19360d..4be7a76a 100644
--- a/man/rbinom_size.Rd
+++ b/man/rbinom_size.Rd
@@ -7,11 +7,21 @@
 rbinom_size(n, x, prob)
 }
 \arguments{
-\item{n}{number of samples to generate}
+\item{n}{number of samples to generate##'      secondary cases. Ignored if offspring == "pois". Must be > 1.}
 
 \item{x}{number of successes}
 
 \item{prob}{probability of success}
+
+\item{serial}{the serial interval. A function that takes one parameter
+(`n`), the number of serial intervals to randomly sample.
+Value must be >= 0.}
+
+\item{t0}{start time}
+
+\item{tf}{end time}
+
+\item{pop}{the population}
 }
 \value{
 sampled sizes
diff --git a/man/rnbinom_mean_disp.Rd b/man/rnbinom_mean_disp.Rd
new file mode 100644
index 00000000..698836d6
--- /dev/null
+++ b/man/rnbinom_mean_disp.Rd
@@ -0,0 +1,29 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/utils.r
+\name{rnbinom_mean_disp}
+\alias{rnbinom_mean_disp}
+\title{Negative binomial random numbers parametrized
+in terms of mean and dispersion coefficient}
+\usage{
+rnbinom_mean_disp(n, mn, disp)
+}
+\arguments{
+\item{n}{number of samples to draw}
+
+\item{mn}{mean of distribution}
+
+\item{disp}{dispersion coefficient (var/mean)}
+}
+\value{
+vector containing the random numbers
+}
+\description{
+Negative binomial random numbers parametrized
+in terms of mean and dispersion coefficient
+}
+\examples{
+rnbinom_mean_disp(n = 5, mn = 4, disp = 2)
+}
+\author{
+Flavio Finger
+}

From da842ca24863b165de386b30ff72fdec31cc271e Mon Sep 17 00:00:00 2001
From: ffinger <12323626+ffinger@users.noreply.github.com>
Date: Sat, 7 Mar 2020 22:32:18 +0100
Subject: [PATCH 056/828] implemented tests and improved argument checking

fix doc

fix doc

remove flawed test

remove globals (unused)
---
 R/globals.R                |   2 -
 R/simulate_susceptibles.R  |  10 +---
 R/utils.r                  |   8 +--
 man/rbinom_size.Rd         |  12 +----
 tests/testthat/tests-sim.r | 103 +++++++++++++++++++++++++++++++++++++
 5 files changed, 106 insertions(+), 29 deletions(-)
 delete mode 100644 R/globals.R

diff --git a/R/globals.R b/R/globals.R
deleted file mode 100644
index 75781a1a..00000000
--- a/R/globals.R
+++ /dev/null
@@ -1,2 +0,0 @@
-## avoid "no visible bindings" warning for dplyr verbs
-utils::globalVariables(c("generation", "time", "id"))
\ No newline at end of file
diff --git a/R/simulate_susceptibles.R b/R/simulate_susceptibles.R
index df7cb793..f6fd180c 100644
--- a/R/simulate_susceptibles.R
+++ b/R/simulate_susceptibles.R
@@ -43,14 +43,6 @@ chain_sim_susc <- function(
 
     offspring <- match.arg(offspring)
 
-    if (missing(pop)) {
-        stop("Argument pop required.")
-    }
-
-    if (missing(mn_offspring)) {
-        stop("Argument mn_offspring reequired.")
-    }
-
     if (offspring == "pois") {
         if (!missing(disp_offspring)) {
             warning("argument disp_offspring not used for
@@ -68,7 +60,7 @@ chain_sim_susc <- function(
 
     } else if (offspring  == "nbinom") {
 
-        if (missing(disp_offspring) | disp_offspring <= 1) { ## dispersion index
+        if (disp_offspring <= 1) { ## dispersion index
             stop("Offspring distribution 'nbinom' requires argument
                 disp_offspring > 1. Use 'pois' if there is no overdispersion.")
         }
diff --git a/R/utils.r b/R/utils.r
index 7e9f6f46..79c225e9 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -13,13 +13,7 @@ complementary_logprob <- function(x) {
 ##'
 ##' Samples the size parameter from the binomial distribution with fixed x
 ##' (number of successes) and p (success probability)
-##' @param n number of samples to generate##'      secondary cases. Ignored if offspring == "pois". Must be > 1.
-##' @param serial the serial interval. A function that takes one parameter
-##'     (`n`), the number of serial intervals to randomly sample.
-##'     Value must be >= 0.
-##' @param t0 start time
-##' @param tf end time
-##' @param pop the population
+##' @param n number of samples to generate
 ##' @param x number of successes
 ##' @param prob probability of success
 ##' @return sampled sizes
diff --git a/man/rbinom_size.Rd b/man/rbinom_size.Rd
index 4be7a76a..5e19360d 100644
--- a/man/rbinom_size.Rd
+++ b/man/rbinom_size.Rd
@@ -7,21 +7,11 @@
 rbinom_size(n, x, prob)
 }
 \arguments{
-\item{n}{number of samples to generate##'      secondary cases. Ignored if offspring == "pois". Must be > 1.}
+\item{n}{number of samples to generate}
 
 \item{x}{number of successes}
 
 \item{prob}{probability of success}
-
-\item{serial}{the serial interval. A function that takes one parameter
-(`n`), the number of serial intervals to randomly sample.
-Value must be >= 0.}
-
-\item{t0}{start time}
-
-\item{tf}{end time}
-
-\item{pop}{the population}
 }
 \value{
 sampled sizes
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index a30e2f7a..5f28b6a1 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -15,3 +15,106 @@ test_that("Errors are thrown",
     expect_error(chain_sim(n=2, "dummy"), "does not exist")
     expect_error(chain_sim(n=2, "lnorm", meanlog=log(1.6)), "integer")
 })
+
+context("Simulating from a branching process model
+    accounting for depletion of susceptibles")
+
+
+test_that("Chains can be simulated",
+{
+    expect_true(
+        is.data.frame(
+            chain_sim_susc(
+                "pois",
+                mn_offspring = 2,
+                serial = function(x) 3,
+                pop = 100
+            )
+        )
+    )
+
+    expect_true(
+        is.data.frame(
+            chain_sim_susc(
+                "nbinom",
+                mn_offspring = 2,
+                disp = 1.5,
+                serial = function(x) 3,
+                pop = 100
+            )
+        )
+    )
+
+    expect_true(
+        nrow(
+            chain_sim_susc(
+                "pois",
+                mn_offspring = 2,
+                serial = function(x) 3,
+                pop = 1
+            )
+        ) == 1
+    )
+
+    expect_true(
+        nrow(
+            chain_sim_susc(
+                "pois",
+                mn_offspring = 100,
+                tf = 2,
+                serial = function(x) 3,
+                pop = 999
+            )
+        ) == 1
+    )
+
+    expect_true(
+        nrow(
+            chain_sim_susc(
+                "pois",
+                mn_offspring = 100,
+                serial = function(x) 3,
+                pop = 999,
+                initial_immune = 998
+            )
+        ) == 1
+    )
+
+})
+
+test_that("Errors are thrown",
+{
+    expect_error(
+        chain_sim_susc(
+            "dummy",
+            mn_offspring = 3,
+            serial = function(x) 3,
+            pop = 100),
+        "'arg' should be one of \"pois\", \"nbinom\"")
+    expect_error(
+        chain_sim_susc(
+            "nbinom",
+            mn_offspring = 3,
+            disp_offspring = 1,
+            serial = function(x) 3,
+            pop = 100
+            ),
+        "Offspring distribution 'nbinom' requires argument
+                disp_offspring > 1. Use 'pois' if there is no overdispersion.")
+    expect_error(
+        chain_sim_susc(
+            "nbinom",
+            mn_offspring = 3,
+            serial = function(x) 3,
+            pop = 100
+            ),
+        "argument \"disp_offspring\" is missing, with no default")
+    expect_error(
+        chain_sim_susc(
+            "pois",
+            mn_offspring = 3,
+            serial = function(x) -3,
+            pop = 100),
+        "Serial interval must be >= 0.")
+
+})
\ No newline at end of file

From 3d48c609bafe413f5d00f5ec7ede4d32e5d7618a Mon Sep 17 00:00:00 2001
From: ffinger <12323626+ffinger@users.noreply.github.com>
Date: Tue, 10 Mar 2020 11:32:48 +0100
Subject: [PATCH 057/828] removed extraDistr dependency and test that sometimes
 fails

---
 DESCRIPTION                | 1 -
 R/simulate_susceptibles.R  | 3 ++-
 tests/testthat/tests-sim.r | 7 -------
 3 files changed, 2 insertions(+), 9 deletions(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index df1508bb..53a89a8a 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -12,7 +12,6 @@ Suggests:
     knitr,
     rmarkdown,
     covr,
-    extraDistr,
     truncdist
 License: GPL-3
 URL: https://github.com/sbfnk/bpmodels
diff --git a/R/simulate_susceptibles.R b/R/simulate_susceptibles.R
index f6fd180c..b292d5f0 100644
--- a/R/simulate_susceptibles.R
+++ b/R/simulate_susceptibles.R
@@ -52,8 +52,9 @@ chain_sim_susc <- function(
         ## using a right truncated poisson distribution
         ## to avoid more cases than susceptibles
         offspring_fun <- function(n, susc) {
-            extraDistr::rtpois(
+            truncdist::rtrunc(
                 n,
+                spec = "pois",
                 lambda = mn_offspring * susc / pop,
                 b = susc)
             }
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 5f28b6a1..348c4bf3 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -109,12 +109,5 @@ test_that("Errors are thrown",
             pop = 100
             ),
         "argument \"disp_offspring\" is missing, with no default")
-    expect_error(
-        chain_sim_susc(
-            "pois",
-            mn_offspring = 3,
-            serial = function(x) -3,
-            pop = 100),
-        "Serial interval must be >= 0.")
 
 })
\ No newline at end of file

From c46eb565880184d89d8125ffe200ef37bf27b158 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 25 Jan 2023 16:31:11 +0000
Subject: [PATCH 058/828] Replace travis CI with GHA (#24)

* removed appveyor file

* removed travis yml

* removed all references to travis and appveyor

* ignore .Rproj

* add GHA badge to README

* GHA setup

* ignore .Rproj files

* Edited the README badge URLs to redirect to the Epiverse TRACE org
---
 .Rbuildignore                        |  5 +--
 .github/.gitignore                   |  1 +
 .github/workflows/R-CMD-check.yaml   | 49 +++++++++++++++++++++++++++
 .github/workflows/pkgdown.yaml       | 46 +++++++++++++++++++++++++
 .github/workflows/test-coverage.yaml | 50 ++++++++++++++++++++++++++++
 .gitignore                           |  1 +
 .travis.yml                          | 26 ---------------
 README.md                            | 10 +++---
 appveyor.yml                         | 41 -----------------------
 9 files changed, 155 insertions(+), 74 deletions(-)
 create mode 100644 .github/.gitignore
 create mode 100644 .github/workflows/R-CMD-check.yaml
 create mode 100644 .github/workflows/pkgdown.yaml
 create mode 100644 .github/workflows/test-coverage.yaml
 delete mode 100644 .travis.yml
 delete mode 100644 appveyor.yml

diff --git a/.Rbuildignore b/.Rbuildignore
index f0fea78d..0a6b13a1 100644
--- a/.Rbuildignore
+++ b/.Rbuildignore
@@ -1,4 +1,5 @@
 ^CODE_OF_CONDUCT\.md$
-^appveyor\.yml$
-^\.travis\.yml$
 cran-comments.md
+^\.github$
+^.*\.Rproj$
+^\.Rproj\.user$
diff --git a/.github/.gitignore b/.github/.gitignore
new file mode 100644
index 00000000..2d19fc76
--- /dev/null
+++ b/.github/.gitignore
@@ -0,0 +1 @@
+*.html
diff --git a/.github/workflows/R-CMD-check.yaml b/.github/workflows/R-CMD-check.yaml
new file mode 100644
index 00000000..a3ac6182
--- /dev/null
+++ b/.github/workflows/R-CMD-check.yaml
@@ -0,0 +1,49 @@
+# Workflow derived from https://github.com/r-lib/actions/tree/v2/examples
+# Need help debugging build failures? Start at https://github.com/r-lib/actions#where-to-find-help
+on:
+  push:
+    branches: [main, master]
+  pull_request:
+    branches: [main, master]
+
+name: R-CMD-check
+
+jobs:
+  R-CMD-check:
+    runs-on: ${{ matrix.config.os }}
+
+    name: ${{ matrix.config.os }} (${{ matrix.config.r }})
+
+    strategy:
+      fail-fast: false
+      matrix:
+        config:
+          - {os: macos-latest,   r: 'release'}
+          - {os: windows-latest, r: 'release'}
+          - {os: ubuntu-latest,   r: 'devel', http-user-agent: 'release'}
+          - {os: ubuntu-latest,   r: 'release'}
+          - {os: ubuntu-latest,   r: 'oldrel-1'}
+
+    env:
+      GITHUB_PAT: ${{ secrets.GITHUB_TOKEN }}
+      R_KEEP_PKG_SOURCE: yes
+
+    steps:
+      - uses: actions/checkout@v3
+
+      - uses: r-lib/actions/setup-pandoc@v2
+
+      - uses: r-lib/actions/setup-r@v2
+        with:
+          r-version: ${{ matrix.config.r }}
+          http-user-agent: ${{ matrix.config.http-user-agent }}
+          use-public-rspm: true
+
+      - uses: r-lib/actions/setup-r-dependencies@v2
+        with:
+          extra-packages: any::rcmdcheck
+          needs: check
+
+      - uses: r-lib/actions/check-r-package@v2
+        with:
+          upload-snapshots: true
diff --git a/.github/workflows/pkgdown.yaml b/.github/workflows/pkgdown.yaml
new file mode 100644
index 00000000..087f0b05
--- /dev/null
+++ b/.github/workflows/pkgdown.yaml
@@ -0,0 +1,46 @@
+# Workflow derived from https://github.com/r-lib/actions/tree/v2/examples
+# Need help debugging build failures? Start at https://github.com/r-lib/actions#where-to-find-help
+on:
+  push:
+    branches: [main, master]
+  pull_request:
+    branches: [main, master]
+  release:
+    types: [published]
+  workflow_dispatch:
+
+name: pkgdown
+
+jobs:
+  pkgdown:
+    runs-on: ubuntu-latest
+    # Only restrict concurrency for non-PR jobs
+    concurrency:
+      group: pkgdown-${{ github.event_name != 'pull_request' || github.run_id }}
+    env:
+      GITHUB_PAT: ${{ secrets.GITHUB_TOKEN }}
+    steps:
+      - uses: actions/checkout@v3
+
+      - uses: r-lib/actions/setup-pandoc@v2
+
+      - uses: r-lib/actions/setup-r@v2
+        with:
+          use-public-rspm: true
+
+      - uses: r-lib/actions/setup-r-dependencies@v2
+        with:
+          extra-packages: any::pkgdown, local::.
+          needs: website
+
+      - name: Build site
+        run: pkgdown::build_site_github_pages(new_process = FALSE, install = FALSE)
+        shell: Rscript {0}
+
+      - name: Deploy to GitHub pages 🚀
+        if: github.event_name != 'pull_request'
+        uses: JamesIves/github-pages-deploy-action@v4.4.1
+        with:
+          clean: false
+          branch: gh-pages
+          folder: docs
diff --git a/.github/workflows/test-coverage.yaml b/.github/workflows/test-coverage.yaml
new file mode 100644
index 00000000..2c5bb502
--- /dev/null
+++ b/.github/workflows/test-coverage.yaml
@@ -0,0 +1,50 @@
+# Workflow derived from https://github.com/r-lib/actions/tree/v2/examples
+# Need help debugging build failures? Start at https://github.com/r-lib/actions#where-to-find-help
+on:
+  push:
+    branches: [main, master]
+  pull_request:
+    branches: [main, master]
+
+name: test-coverage
+
+jobs:
+  test-coverage:
+    runs-on: ubuntu-latest
+    env:
+      GITHUB_PAT: ${{ secrets.GITHUB_TOKEN }}
+
+    steps:
+      - uses: actions/checkout@v3
+
+      - uses: r-lib/actions/setup-r@v2
+        with:
+          use-public-rspm: true
+
+      - uses: r-lib/actions/setup-r-dependencies@v2
+        with:
+          extra-packages: any::covr
+          needs: coverage
+
+      - name: Test coverage
+        run: |
+          covr::codecov(
+            quiet = FALSE,
+            clean = FALSE,
+            install_path = file.path(Sys.getenv("RUNNER_TEMP"), "package")
+          )
+        shell: Rscript {0}
+
+      - name: Show testthat output
+        if: always()
+        run: |
+          ## --------------------------------------------------------------------
+          find ${{ runner.temp }}/package -name 'testthat.Rout*' -exec cat '{}' \; || true
+        shell: bash
+
+      - name: Upload test results
+        if: failure()
+        uses: actions/upload-artifact@v3
+        with:
+          name: coverage-test-failures
+          path: ${{ runner.temp }}/package
diff --git a/.gitignore b/.gitignore
index 57133000..97ff1c51 100644
--- a/.gitignore
+++ b/.gitignore
@@ -13,6 +13,7 @@ inst/doc
 /*.Rcheck/
 # RStudio files
 .Rproj.user/
+*.Rproj
 # produced vignettes
 vignettes/*.html
 vignettes/*.pdf
diff --git a/.travis.yml b/.travis.yml
deleted file mode 100644
index a7279d6d..00000000
--- a/.travis.yml
+++ /dev/null
@@ -1,26 +0,0 @@
-# R for travis: see documentation at https://docs.travis-ci.com/user/languages/r
-language: r
-cache: packages
-
-matrix:
-  include:
-    - os: linux
-      r: release
-      env:
-        - R_CODECOV=true
-    - os: linux
-      r: devel
-    - os: linux
-      r: oldrel
-    - os: osx
-      osx_image: xcode8.3
-
-warnings_are_errors: true
-
-notifications:
-  email:
-    on_success: change
-    on_failure: change
-
-after_success:
-- if [[ "${R_CODECOV}" ]]; then Rscript -e 'covr::codecov()'; fi
diff --git a/README.md b/README.md
index 2654ac3e..37b7901a 100644
--- a/README.md
+++ b/README.md
@@ -1,15 +1,15 @@
 # bpmodels
-
-[![Travis-CI Build Status](https://travis-ci.org/sbfnk/bpmodels.svg?branch=master)](https://travis-ci.org/sbfnk/bpmodels)
-[![Appveyor Build Status](https://ci.appveyor.com/api/projects/status/y37i8x0wo9o8s2wf?svg=true)](https://ci.appveyor.com/project/sbfnk/bpmodels)
-[![codecov](https://codecov.io/github/sbfnk/bpmodels/branch/master/graphs/badge.svg)](https://codecov.io/github/sbfnk/bpmodels) 
+<!-- badges: start -->
+[![R-CMD-check](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml)
+[![codecov](https://codecov.io/github/epiverse-trace/bpmodels/branch/master/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/bpmodels) 
+<!-- badges: end -->
 
 Methods for analysing the distribution of epidemiological chain sizes and lengths
 
 The latest development version of the `bpmodels` package can be installed via
 
 ```{r eval=FALSE}
-devtools::install_github('sbfnk/bpmodels')
+devtools::install_github('epiverse-trace/bpmodels')
 ```
 
 Please note that the 'bpmodels' project is released with a [Contributor Code of Conduct](CODE_OF_CONDUCT.md). By contributing to this project, you agree to abide by its terms.
diff --git a/appveyor.yml b/appveyor.yml
deleted file mode 100644
index bc46a87c..00000000
--- a/appveyor.yml
+++ /dev/null
@@ -1,41 +0,0 @@
-# DO NOT CHANGE the "init" and "install" sections below
-
-# Download script file from GitHub
-init:
-  ps: |
-        $ErrorActionPreference = "Stop"
-        Invoke-WebRequest http://raw.github.com/krlmlr/r-appveyor/master/scripts/appveyor-tool.ps1 -OutFile "..\appveyor-tool.ps1"
-        Import-Module '..\appveyor-tool.ps1'
-install:
-  ps: Bootstrap
-
-# Adapt as necessary starting from here
-
-build_script:
-  - travis-tool.sh install_deps
-
-test_script:
-  - travis-tool.sh run_tests
-
-on_failure:
-  - 7z a failure.zip *.Rcheck\*
-  - appveyor PushArtifact failure.zip
-
-artifacts:
-  - path: '*.Rcheck\**\*.log'
-    name: Logs
-
-  - path: '*.Rcheck\**\*.out'
-    name: Logs
-
-  - path: '*.Rcheck\**\*.fail'
-    name: Logs
-
-  - path: '*.Rcheck\**\*.Rout'
-    name: Logs
-
-  - path: '\*_*.tar.gz'
-    name: Bits
-
-  - path: '\*_*.zip'
-    name: Bits

From 45a482219211ba52bc09080aafa3ab035ea155ea Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Thu, 26 Jan 2023 14:07:55 +0000
Subject: [PATCH 059/828] Revised chain_sim() documentation (#14)

* Added a bib file for the references

* added UTF-8 encoding to DESCRIPTION to fix Roxygen warning

* Changed the comment tags

* gitignore

* Changed the comment tags

* Updated the documentation of chain_sim()

* Converted the introduction vignette from md to rmd format

* Edited the README.md file

* Deleted this after converting to rmd format

* gitignore

* Starting to convert README to rmd format to provide more flexibility in adding code and output

* revised the chain_sim() doc

* Updated the DESCRIPTION and turned on markdown support

* Revised chain_sim() documentation

* Revised chain_sim() documentation

* added yours truly as an author and maintainer

* updated chain_sim() documentation

* Rendered README.md

* Fixed an issue with the author file

* Added an example

* Added our assumptions about the serial interval

* .Rd files generated

* Update chain_sim

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>

* Update chain_sim() documentation

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>

* Update chain_sim() documentation

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>

* Update chain_sim() documentation

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>

* Update chain_sim() documentation

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>

* updated the README

* deleted unwanted file byproduct

* ignore .bib.sav files

* ignore README.Rmd

* rebuilt the documentation

* added bookdown to Suggests

* fixed badges

* added the MIT license

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 .Rbuildignore                    |   2 +
 .gitignore                       |   8 ++
 DESCRIPTION                      |  35 ++++---
 LICENSE                          |   2 +
 LICENSE.md                       |  21 ++++
 R/likelihoods.R                  | 158 +++++++++++++++----------------
 R/simulate.r                     | 132 ++++++++++++++++++++------
 R/simulate_susceptibles.R        |  64 ++++++-------
 R/utils.r                        |  94 +++++++++---------
 README.Rmd                       |  32 +++++++
 README.md                        |  21 ++--
 man/chain_ll.Rd                  |  17 +++-
 man/chain_sim.Rd                 | 133 +++++++++++++++++++++-----
 man/chain_sim_susc.Rd            |  30 +++---
 man/offspring_ll.Rd              |   6 +-
 vignettes/introduction.R         |  31 ++++++
 vignettes/introduction.Rmd       |  61 ++++++++----
 vignettes/introduction.md        |  95 -------------------
 vignettes/projecting_incidence.R | 120 +++++++++++++++++++++++
 vignettes/references.bib         |  23 +++++
 20 files changed, 719 insertions(+), 366 deletions(-)
 create mode 100644 LICENSE
 create mode 100644 LICENSE.md
 create mode 100644 README.Rmd
 create mode 100644 vignettes/introduction.R
 delete mode 100644 vignettes/introduction.md
 create mode 100644 vignettes/projecting_incidence.R
 create mode 100644 vignettes/references.bib

diff --git a/.Rbuildignore b/.Rbuildignore
index 0a6b13a1..2bd66f52 100644
--- a/.Rbuildignore
+++ b/.Rbuildignore
@@ -3,3 +3,5 @@ cran-comments.md
 ^\.github$
 ^.*\.Rproj$
 ^\.Rproj\.user$
+^README\.Rmd$
+^LICENSE\.md$
diff --git a/.gitignore b/.gitignore
index 97ff1c51..8b89fb81 100644
--- a/.gitignore
+++ b/.gitignore
@@ -27,3 +27,11 @@ vignettes/*.pdf
 *.knit.md
 # Shiny token, see https://shiny.rstudio.com/articles/shinyapps.html
 rsconnect/
+/doc/
+/Meta/
+
+.Rbuildignore
+
+*.Rproj
+
+*.bib.sav
diff --git a/DESCRIPTION b/DESCRIPTION
index 53a89a8a..86bd2ff4 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -1,20 +1,29 @@
 Package: bpmodels
-Version: 0.1.0
 Title: Analysing chain statistics using branching process models
+Version: 0.1.0
 Authors@R: c(
-    person("Sebastian", "Funk", email = "sebastian.funk@lshtm.ac.uk", role = c("aut", "cre")),
-    person("Zhian N.", "Kamvar", email = "zkamvar@gmail.com", role = c("ctb")),
-    person("Flavio", "Finger", email = "flavio.finger@epicentre.msf.org", role = c("aut"))
-    )
-Description: Provides methods to analyse and simulate the size and length of branching processes with an arbitrary offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks, as discussed in Farrington et al. (2003) <doi:10.1093/biostatistics/4.2.279>.
+    person("Sebastian", "Funk", , "sebastian.funk@lshtm.ac.uk", role = c("aut", "cre")),
+    person("Zhian N.", "Kamvar", , "zkamvar@gmail.com", role = "ctb"),
+    person("Flavio", "Finger", , "flavio.finger@epicentre.msf.org", role = "aut"),
+    person("James", "Azam", "Mba", "james.azam@lshtm.ac.uk", role = c("aut"))
+  )
+Description: Provides methods to analyse and simulate the size and length
+    of branching processes with an arbitrary offspring distribution. These
+    can be used, for example, to analyse the distribution of chain sizes
+    or length of infectious disease outbreaks, as discussed in Farrington
+    et al. (2003) <doi:10.1093/biostatistics/4.2.279>.
+License: MIT + file LICENSE
+URL: https://github.com/sbfnk/bpmodels
+BugReports: https://github.com/sbfnk/bpmodels/issues
 Suggests: 
-    testthat,
+    covr,
     knitr,
     rmarkdown,
-    covr,
+    bookdown,
+    testthat,
     truncdist
-License: GPL-3
-URL: https://github.com/sbfnk/bpmodels
-BugReports: https://github.com/sbfnk/bpmodels
-RoxygenNote: 7.0.2
-VignetteBuilder: knitr
+VignetteBuilder: 
+    knitr
+Encoding: UTF-8
+Roxygen: list(markdown = TRUE)
+RoxygenNote: 7.2.3
diff --git a/LICENSE b/LICENSE
new file mode 100644
index 00000000..bad553b7
--- /dev/null
+++ b/LICENSE
@@ -0,0 +1,2 @@
+YEAR: 2023
+COPYRIGHT HOLDER: bpmodels authors
diff --git a/LICENSE.md b/LICENSE.md
new file mode 100644
index 00000000..9293f3eb
--- /dev/null
+++ b/LICENSE.md
@@ -0,0 +1,21 @@
+# MIT License
+
+Copyright (c) 2023 bpmodels authors
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
diff --git a/R/likelihoods.R b/R/likelihoods.R
index bdebd49a..70778346 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -1,27 +1,27 @@
-##' Likelihood of the size of chains with Poisson offspring distribution
-##'
-##' @param x vector of sizes
-##' @param lambda rate of the Poisson distribution
-##' @return log-likelihood values
-##' @author Sebastian Funk
-##' @keywords internal
+#' Likelihood of the size of chains with Poisson offspring distribution
+#'
+#' @param x vector of sizes
+#' @param lambda rate of the Poisson distribution
+#' @return log-likelihood values
+#' @author Sebastian Funk
+#' @keywords internal
 pois_size_ll <- function(x, lambda)
 {
   (x - 1) * log(lambda) - lambda * x + (x - 2) * log(x) - lgamma(x)
 }
 
-##' Likelihood of the size of chains with Negative-Binomial offspring
-##' distribution
-##'
-##' @param x vector of sizes
-##' @param size the dispersion parameter (often called \code{k} in ecological
-##'   applications)
-##' @param prob probability of success (in the parameterisation with
-##'   \code{prob}, see also \code{\link[stats]{NegBinomial}})
-##' @param mu mean parameter
-##' @return log-likelihood values
-##' @author Sebastian Funk
-##' @keywords internal
+#' Likelihood of the size of chains with Negative-Binomial offspring
+#' distribution
+#'
+#' @param x vector of sizes
+#' @param size the dispersion parameter (often called \code{k} in ecological
+#'   applications)
+#' @param prob probability of success (in the parameterisation with
+#'   \code{prob}, see also \code{\link[stats]{NegBinomial}})
+#' @param mu mean parameter
+#' @return log-likelihood values
+#' @author Sebastian Funk
+#' @keywords internal
 nbinom_size_ll <- function(x, size, prob, mu)
 {
   if (!missing(prob)) {
@@ -33,17 +33,17 @@ nbinom_size_ll <- function(x, size, prob, mu)
     (size * x + (x - 1)) * log(1 + mu / size)
 }
 
-##' Likelihood of the size of chains with gamma-Borel offspring distribution
-##'
-##' @param x vector of sizes
-##' @param size the dispersion parameter (often called \code{k} in ecological
-##'   applications)
-##' @param prob probability of success (in the parameterisation with
-##'   \code{prob}, see also \code{\link[stats]{NegBinomial}})
-##' @param mu mean parameter
-##' @return log-likelihood values
-##' @author Sebastian Funk
-##' @keywords internal
+#' Likelihood of the size of chains with gamma-Borel offspring distribution
+#'
+#' @param x vector of sizes
+#' @param size the dispersion parameter (often called \code{k} in ecological
+#'   applications)
+#' @param prob probability of success (in the parameterisation with
+#'   \code{prob}, see also \code{\link[stats]{NegBinomial}})
+#' @param mu mean parameter
+#' @return log-likelihood values
+#' @author Sebastian Funk
+#' @keywords internal
 gborel_size_ll <- function(x, size, prob, mu) {
   if (!missing(prob)) {
     if (!missing(mu)) stop("'prob' and 'mu' both specified")
@@ -54,13 +54,13 @@ gborel_size_ll <- function(x, size, prob, mu) {
     (x - 1) * log(x) - (size + x - 1) * log(x + size / mu)
 }
 
-##' Likelihood of the length of chains with Poisson offspring distribution
-##'
-##' @param x vector of sizes
-##' @param lambda rate of the Poisson distribution
-##' @return log-likelihood values
-##' @author Sebastian Funk
-##' @keywords internal
+#' Likelihood of the length of chains with Poisson offspring distribution
+#'
+#' @param x vector of sizes
+#' @param lambda rate of the Poisson distribution
+#' @return log-likelihood values
+#' @author Sebastian Funk
+#' @keywords internal
 pois_length_ll <- function(x, lambda) {
 
   ## iterated exponential function
@@ -73,14 +73,14 @@ pois_length_ll <- function(x, lambda) {
   log(Gk[x + 1] - Gk[x])
 }
 
-##' Likelihood of the length of chains with geometric offspring distribution
-##'
-##' @param x vector of sizes
-##' @param prob probability of the geometric distribution with mean
-##' \code{1/prob}
-##' @return log-likelihood values
-##' @author Sebastian Funk
-##' @keywords internal
+#' Likelihood of the length of chains with geometric offspring distribution
+#'
+#' @param x vector of sizes
+#' @param prob probability of the geometric distribution with mean
+#' \code{1/prob}
+#' @return log-likelihood values
+#' @author Sebastian Funk
+#' @keywords internal
 geom_length_ll <- function(x, prob) {
 
   lambda <- 1 / prob
@@ -91,20 +91,20 @@ geom_length_ll <- function(x, prob) {
   log(GkmGkm1)
 }
 
-##' Likelihood of the length of chains with generic offspring distribution
-##'
-##' The likelihoods are calculated with a crude approximation using simulated
-##'   chains by linearly approximating any missing values in the empirical
-##'   cumulative distribution function (ecdf).
-##' @param x vector of sizes
-##' @param nsim_offspring number of simulations of the offspring distribution
-##'   for approximation the size/length distribution
-##' @param ... any parameters to pass to \code{\link{chain_sim}}
-##' @return log-likelihood values
-##' @author Sebastian Funk
-##' @inheritParams chain_ll
-##' @inheritParams chain_sim
-##' @keywords internal
+#' Likelihood of the length of chains with generic offspring distribution
+#'
+#' The likelihoods are calculated with a crude approximation using simulated
+#'   chains by linearly approximating any missing values in the empirical
+#'   cumulative distribution function (ecdf).
+#' @param x vector of sizes
+#' @param nsim_offspring number of simulations of the offspring distribution
+#'   for approximation the size/length distribution
+#' @param ... any parameters to pass to \code{\link{chain_sim}}
+#' @return log-likelihood values
+#' @author Sebastian Funk
+#' @inheritParams chain_ll
+#' @inheritParams chain_sim
+#' @keywords internal
 offspring_ll <- function(x, offspring, stat, nsim_offspring=100, ...) {
 
   dist <- chain_sim(nsim_offspring, offspring, stat, ...)
@@ -119,26 +119,26 @@ offspring_ll <- function(x, offspring, stat, nsim_offspring=100, ...) {
   log(lik)
 }
 
-##' Likelihood for the outcome of a branching process
-##'
-##' @param x vector of sizes or lengths of transmission chains
-##' @param stat statistic given as \code{x} ("size" or "length" of chains)
-##' @param obs_prob observation probability (assumed constant)
-##' @param infinite any chains of this size/length will be treated as infinite
-##' @param exclude any sizes/lengths to exclude from the likelihood calculation
-##' @param individual if TRUE, a vector of individual log-likelihood contributions will be returned rather than the sum
-##' @param nsim_obs number of simulations if the likelihood is to be
-##'   approximated for imperfect observations
-##' @param ... parameters for the offspring distribution
-##' @return likelihood, or vector of likelihoods (if \code{obs_prob} < 1), or a list of individual likelihood contributions (if \code{individual=TRUE})
-##' @inheritParams chain_sim
-##' @seealso pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll
-##'   geom_length_ll offspring_ll
-##' @author Sebastian Funk
-##' @export
-##' @examples
-##' chain_sizes <- c(1,1,4,7) # example of observed chain sizes
-##' chain_ll(chain_sizes, "pois", "size", lambda=0.5)
+#' Likelihood for the outcome of a branching process
+#'
+#' @param x vector of sizes or lengths of transmission chains
+#' @param stat statistic given as \code{x} ("size" or "length" of chains)
+#' @param obs_prob observation probability (assumed constant)
+#' @param infinite any chains of this size/length will be treated as infinite
+#' @param exclude any sizes/lengths to exclude from the likelihood calculation
+#' @param individual if TRUE, a vector of individual log-likelihood contributions will be returned rather than the sum
+#' @param nsim_obs number of simulations if the likelihood is to be
+#'   approximated for imperfect observations
+#' @param ... parameters for the offspring distribution
+#' @return likelihood, or vector of likelihoods (if \code{obs_prob} < 1), or a list of individual likelihood contributions (if \code{individual=TRUE})
+#' @inheritParams chain_sim
+#' @seealso pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll
+#'   geom_length_ll offspring_ll
+#' @author Sebastian Funk
+#' @export
+#' @examples
+#' chain_sizes <- c(1,1,4,7) # example of observed chain sizes
+#' chain_ll(chain_sizes, "pois", "size", lambda=0.5)
 chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
                      infinite = Inf, exclude=c(), individual=FALSE, nsim_obs, ...) {
   stat <- match.arg(stat)
diff --git a/R/simulate.r b/R/simulate.r
index 89f62cde..24f66ba8 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,31 +1,100 @@
-##' Simulate chains using a branching process
-##'
-##' @param n number of simulations to run.
-##' @param offspring offspring distribution: a character string corresponding to
-##'   the R distribution function (e.g., "pois" for Poisson, where
-##'   \code{\link{rpois}} is the R function to generate Poisson random numbers) 
-##' @param stat statistic to calculate ("size" or "length" of chains)
-##' @param infinite a size or length from which the size/length is to be
-##'     considered infinite
-##' @param tree return the tree of infectors
-##' @param serial the serial interval; a function that takes one parameter
-##' (`n`), the number of serial intervals to randomly sample; if this parameter
-##'   is set, `chain_sim` returns times of infection, too; implies (`tree`=TRUE)
-##' @param t0 start time (if serial interval is given); either a single value (0
-##'     by default for all simulations, or a vector of length `n` with initial
-##'     times) 
-##' @param tf end time (if serial interval is given)
-##' @param ... parameters of the offspring distribution
-##' @return a vector of sizes/lengths (if \code{tree==FALSE} and no serial
-##'   interval given), or a data frame with columns `n` (simulation ID), `time`
-##'   (if the serial interval is given) and (if \code{tree==TRUE}) `id` (a
-##'   unique ID within each simulation for each individual element of the
-##'   chain), `ancestor` (the ID of the ancestor of each element) and
-##'   `generation`. 
-##' @author Sebastian Funk
-##' @export
-##' @examples
-##' chain_sim(n=5, "pois", "size", lambda=0.5)
+#' Simulate transmission chains using a branching process
+#' @description \code{chain_sim()} is a stochastic simulator for generating 
+#' transmission chain data given information on the offspring distribution, 
+#' serial interval, time since the first case, etc. 
+#' @param n Number of simulations to run.
+#' @param offspring Offspring distribution: a character string corresponding to
+#'   the R distribution function (e.g., "pois" for Poisson, where
+#'   \code{\link{rpois}} is the R function to generate Poisson random numbers) 
+#' @param stat String; Statistic to calculate. Can be one of:
+#' \itemize{
+#'   \item "size": the total number of offspring.
+#'   \item "length": the total number of ancestors. 
+#' }
+#' @param infinite A size or length above which the simulation results should be 
+#' set to `Inf`. Defaults to `Inf`, resulting in no results ever set to `Inf`
+#' @param tree Logical. Should the transmission tree be returned? Defaults to `FALSE`.
+#' @param serial The serial interval generator function; the name of a user-defined 
+#' named or anonymous function with only one argument `n`, representing the number 
+#' of serial intervals to generate.
+#' @param t0 Start time (if serial interval is given); either a single value or a 
+#' vector of length `n` (number of simulations) with initial times. Defaults to 0.  
+#' @param tf End time (if serial interval is given).
+#' @param ... Parameters of the offspring distribution as required by R.
+#' @return Either: 
+#' \itemize{
+#'  \item{A vector of sizes/lengths (if \code{tree == FALSE} OR serial
+#'   interval function not specified, since that implies \code{tree == FALSE})}, or 
+#'   \item {a data frame with 
+#'   columns `n` (simulation ID), `time` (if the serial interval is given) and 
+#'   (if \code{tree == TRUE}), `id` (a unique ID within each simulation for each 
+#'   individual element of the chain), `ancestor` (the ID of the ancestor of each 
+#'   element), and `generation`.}
+#' }
+#' @author Sebastian Funk, James M. Azam
+#' @export
+#' @details 
+#' `chain_sim()` either returns a vector or a data.frame. The output is either a 
+#' vector if `serial` is not provided, which automatically sets \code{tree = FALSE},
+#' or a `data.frame`, which means that `serial` was provided as a function. When `serial`
+#' is provided, it means \code{tree = TRUE} automatically. However, setting 
+#' \code{tree = TRUE} would require providing a function for `serial`.
+#' 
+#' # The serial interval (`serial`):
+#' 
+#' ## Assumptions/disambiguation
+#' 
+#' In epidemiology, the generation interval is the duration between successive 
+#' infectious events in a chain of transmission. Similarly, the serial interval is the 
+#' duration between observed symptom onset times between successive 
+#' cases in a transmission chain. The generation interval is often hard to observe 
+#' because exact times of infection are hard to measure hence, the serial interval
+#' is often used instead. Here, we use the serial interval to represent what would 
+#' normally be called the generation interval, that is, the time between successive
+#' cases. 
+#' 
+#' ## Specifying `serial` in `chain_sim()`
+#' 
+#' `serial` must be specified as a named or 
+#' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R) 
+#' with one argument. 
+#' 
+#' If `serial` is specified, `chain_sim()` returns times of 
+#' infection as a column in the output. Moreover, specifying a function for `serial` implies 
+#' \code{tree = TRUE} and a tree of infectors (`ancestor`) and infectees (`id`) 
+#' will be generated in the output. 
+#' 
+#' For example, assuming we want to specify the serial interval 
+#' generator as a random log-normally distributed variable with `meanlog = 0.58` 
+#' and `sdlog = 1.58`, we could define a named function, let's call it 
+#' "serial_interval", with only one argument representing the number of serial 
+#' intervals to sample: \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}}, 
+#' and assign the name of the function to serial in `chain_sim()` like so 
+#' \code{chain_sim(..., serial = serial_interval)}, 
+#' where `...` are the other arguments to `chain_sim()`. Alternatively, we 
+#' could assign an anonymous function to serial in the `chain_sim()` call like so
+#' \code{chain_sim(..., serial = function(n){rlnorm(n, 0.58, 1.38)})}, 
+#' where `...` are the other arguments to `chain_sim()`.
+#' @examples
+#' # Specifying no `serial` and `tree == FALSE` (default) returns a vector
+#' set.seed(123)
+#' chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5, tree = FALSE)
+#' 
+#' # Specifying `serial` without specifying `tree` will set `tree = TRUE` internally.
+#'  
+#' # We'll first define the serial function 
+#' set.seed(123)
+#' serial_interval <- function(n){rlnorm(n, meanlog = 0.58, sdlog = 1.58)}
+#' chain_sim(n = 5, offspring = 'pois', lambda = 0.5, stat = 'length', infinite = 100, 
+#' serial = serial_interval)
+#' 
+#' # Specifying `serial` and `tree = FALSE` will throw an error 
+#' set.seed(123)
+#' \dontrun{
+#' try(chain_sim(n = 10, serial = function(x) 3, offspring = "pois", lambda = 2, 
+#' infinite = 10, tree = FALSE)
+#' )
+#' }
 chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
                       tree = FALSE, serial, t0 = 0, tf = Inf, ...) {
 
@@ -33,7 +102,8 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
 
     ## first, get random function as given by `offspring`
     if (!is.character(offspring)) {
-        stop("object passed as 'offspring' is not a character string.")
+        stop("object passed as 'offspring' is not a character string. Did you forget
+             to enclose it in quotes?")
     }
 
     roffspring_name <- paste0("r", offspring)
@@ -43,7 +113,7 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
 
     if (!missing(serial)) {
         if (!is.function(serial)) {
-            stop("The `serial` argument must be a function.")
+            stop("The `serial` argument must be a function (see details in ?chain_sim()).")
         }
         if (!missing(tree) && tree == FALSE) {
             stop("The `serial` argument can't be used with `tree==FALSE`.")
@@ -81,7 +151,7 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
             stop("Offspring distribution must return integers")
         }
 
-        ## record indices corresponding the number of offspring
+        ## record indices corresponding to the number of offspring
         indices <- rep(sim, n_offspring[sim])
 
         ## initialise number of offspring
diff --git a/R/simulate_susceptibles.R b/R/simulate_susceptibles.R
index b292d5f0..1ece7158 100644
--- a/R/simulate_susceptibles.R
+++ b/R/simulate_susceptibles.R
@@ -1,35 +1,35 @@
-##' Simulate a single chain using a branching process while accounting
-##' for depletion of susceptibles.
-##'
-##' @param offspring offspring distribution: a character string corresponding to
-##'   the R distribution function. Currently only "pois" & "nbinom" are
-##'   supported. Internally truncated distributions are used to avoid infecting
-##'   more people than susceptibles available.
-##' @param mn_offspring the average number of secondary cases for each case
-##' @param disp_offspring the dispersion coefficient (var/mean) of the number of
-##'      secondary cases. Ignored if offspring == "pois". Must be > 1.
-##' @param serial the serial interval. A function that takes one parameter
-##'     (`n`), the number of serial intervals to randomly sample.
-##'     Value must be >= 0.
-##' @param t0 start time
-##' @param tf end time
-##' @param pop the population
-##' @param initial_immune the number of initial immunes in the population
-##' @return a data frame with columns `time`, `id` (a unique ID for each
-##'     individual element of the chain), `ancestor` (the ID of the ancestor
-##'      of each element), and `generation`.
-##'
-##' @details This function has a couple of key differences with chain_sim:
-##'     it can only simulate one chain at a time,
-##'     it can only handle implemented offspring distributions
-##'         ("pois" and "nbinom"),
-##'     it always tracks and returns a data frame containing the entire tree,
-##'     the maximal length of chains is limited with pop instead of infinite.
-##'
-##' @author Flavio Finger
-##' @export
-##' @examples
-##' chain_sim_susc("pois", mn_offspring=0.5, serial = function(x) 3, pop = 100)
+#' Simulate a single chain using a branching process while accounting
+#' for depletion of susceptibles.
+#'
+#' @param offspring offspring distribution: a character string corresponding to
+#'   the R distribution function. Currently only "pois" & "nbinom" are
+#'   supported. Internally truncated distributions are used to avoid infecting
+#'   more people than susceptibles available.
+#' @param mn_offspring the average number of secondary cases for each case
+#' @param disp_offspring the dispersion coefficient (var/mean) of the number of
+#'      secondary cases. Ignored if offspring == "pois". Must be > 1.
+#' @param serial the serial interval. A function that takes one parameter
+#'     (`n`), the number of serial intervals to randomly sample.
+#'     Value must be >= 0.
+#' @param t0 start time
+#' @param tf end time
+#' @param pop the population
+#' @param initial_immune the number of initial immunes in the population
+#' @return a data frame with columns `time`, `id` (a unique ID for each
+#'     individual element of the chain), `ancestor` (the ID of the ancestor
+#'      of each element), and `generation`.
+#'
+#' @details This function has a couple of key differences with chain_sim:
+#'     it can only simulate one chain at a time,
+#'     it can only handle implemented offspring distributions
+#'         ("pois" and "nbinom"),
+#'     it always tracks and returns a data frame containing the entire tree,
+#'     the maximal length of chains is limited with pop instead of infinite.
+#'
+#' @author Flavio Finger
+#' @export
+#' @examples
+#' chain_sim_susc("pois", mn_offspring=0.5, serial = function(x) 3, pop = 100)
 chain_sim_susc <- function(
     offspring = c("pois", "nbinom"),
     mn_offspring,
diff --git a/R/utils.r b/R/utils.r
index 79c225e9..573e342c 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -1,54 +1,54 @@
-##' Calculates the complementary log-probability
-##'
-##' Given x and norm, this calculates log(1-sum(exp(x)))
-##' @param x log-probabilities
-##' @return value
-##' @author Sebastian Funk
-##' @keywords internal
+#' Calculates the complementary log-probability
+#'
+#' Given x and norm, this calculates log(1-sum(exp(x)))
+#' @param x log-probabilities
+#' @return value
+#' @author Sebastian Funk
+#' @keywords internal
 complementary_logprob <- function(x) {
     tryCatch(log1p(-sum(exp(x))), error=function(e) -Inf)
 }
 
-##' Samples size (the number of trials) of a binomial distribution
-##'
-##' Samples the size parameter from the binomial distribution with fixed x
-##' (number of successes) and p (success probability)
-##' @param n number of samples to generate
-##' @param x number of successes
-##' @param prob probability of success
-##' @return sampled sizes
-##' @author Sebastian Funk
-##' @keywords internal
+#' Samples size (the number of trials) of a binomial distribution
+#'
+#' Samples the size parameter from the binomial distribution with fixed x
+#' (number of successes) and p (success probability)
+#' @param n number of samples to generate
+#' @param x number of successes
+#' @param prob probability of success
+#' @return sampled sizes
+#' @author Sebastian Funk
+#' @keywords internal
 rbinom_size <- function(n, x, prob) {
     x + stats::rnbinom(n, x + 1, prob)
 }
 
-##' Samples chain lengths with given observation probabilities
-##'
-##' Samples the length of a transmission chain where each individual element is
-##' observed with binomial probability with parameters n (number of successes)
-##' and p (success probability)
-##' @param n number of samples to generate
-##' @param x observed chain lengths
-##' @param prob probability of observation
-##' @return sampled lengths
-##' @author Sebastian Funk
-##' @keywords internal
+#' Samples chain lengths with given observation probabilities
+#'
+#' Samples the length of a transmission chain where each individual element is
+#' observed with binomial probability with parameters n (number of successes)
+#' and p (success probability)
+#' @param n number of samples to generate
+#' @param x observed chain lengths
+#' @param prob probability of observation
+#' @return sampled lengths
+#' @author Sebastian Funk
+#' @keywords internal
 rgen_length <- function(n, x, prob) {
     x +
       ceiling(log(stats::runif(n, 0, 1)) / log(1 - prob) - 1) +
       ceiling(log(stats::runif(n, 0, 1)) / log(1 - prob) - 1)
 }
 
-##' Finds the name of a function passed as an argument
-##'
-##' This works even when a function is passed multiple times (e.g., when used
-##' inside an \code{\link{optim}} call).
-##' See https://stackoverflow.com/a/46740314/10886760
-##' @param fun function of which the name is to be determined
-##' @return function name
-##' @author Sebastian Funk
-##' @keywords internal
+#' Finds the name of a function passed as an argument
+#'
+#' This works even when a function is passed multiple times (e.g., when used
+#' inside an \code{\link{optim}} call).
+#' See https://stackoverflow.com/a/46740314/10886760
+#' @param fun function of which the name is to be determined
+#' @return function name
+#' @author Sebastian Funk
+#' @keywords internal
 find_function_name <- function(fun) {
   objects <- ls(envir = environment(fun))
   for (i in objects) {
@@ -58,16 +58,16 @@ find_function_name <- function(fun) {
   }
 }
 
-##' Negative binomial random numbers parametrized
-##' in terms of mean and dispersion coefficient
-##' @param n number of samples to draw
-##' @param mn mean of distribution
-##' @param disp dispersion coefficient (var/mean)
-##' @return vector containing the random numbers
-##' @author Flavio Finger
-##' @export
-##' @examples
-##' rnbinom_mean_disp(n = 5, mn = 4, disp = 2)
+#' Negative binomial random numbers parametrized
+#' in terms of mean and dispersion coefficient
+#' @param n number of samples to draw
+#' @param mn mean of distribution
+#' @param disp dispersion coefficient (var/mean)
+#' @return vector containing the random numbers
+#' @author Flavio Finger
+#' @export
+#' @examples
+#' rnbinom_mean_disp(n = 5, mn = 4, disp = 2)
 rnbinom_mean_disp <- function(n, mn, disp) {
   size <- mn / (disp - 1)
   stats::rnbinom(n, size = size, mu = mn)
diff --git a/README.Rmd b/README.Rmd
new file mode 100644
index 00000000..38d313ed
--- /dev/null
+++ b/README.Rmd
@@ -0,0 +1,32 @@
+---
+output: github_document
+bibliography: vignettes/references.bib
+link-citations: true
+---
+
+```{r, include = FALSE}
+knitr::opts_chunk$set(
+  collapse = TRUE,
+  comment = "#>",
+  fig.path = "man/figures/README-",
+  out.width = "100%"
+)
+```
+<!-- badges: start -->
+[![R-CMD-check](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml)
+[![codecov](https://codecov.io/github/epiverse-trace/bpmodels/branch/master/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/bpmodels) 
+<!-- badges: end -->
+```{r setup, include=FALSE}
+knitr::opts_chunk$set(echo = TRUE)
+```
+
+`bpmodels` is an R package to simulate and analyse the size and length of branching processes with a given offspring distribution.
+
+# Installation
+The latest development version of the `bpmodels` package can be installed via
+
+```{r eval=FALSE}
+devtools::install_github('sbfnk/bpmodels')
+```
+
+Please note that the 'bpmodels' project is released with a [Contributor Code of Conduct](CODE_OF_CONDUCT.md). By contributing to this project, you agree to abide by its terms.
diff --git a/README.md b/README.md
index 37b7901a..f64355ad 100644
--- a/README.md
+++ b/README.md
@@ -1,15 +1,22 @@
-# bpmodels
+
 <!-- badges: start -->
+
 [![R-CMD-check](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml)
-[![codecov](https://codecov.io/github/epiverse-trace/bpmodels/branch/master/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/bpmodels) 
+[![codecov](https://codecov.io/github/epiverse-trace/bpmodels/branch/master/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/bpmodels)
 <!-- badges: end -->
 
-Methods for analysing the distribution of epidemiological chain sizes and lengths
+`bpmodels` is an R package to simulate and analyse the size and length
+of branching processes with a given offspring distribution.
+
+# Installation
 
-The latest development version of the `bpmodels` package can be installed via
+The latest development version of the `bpmodels` package can be
+installed via
 
-```{r eval=FALSE}
-devtools::install_github('epiverse-trace/bpmodels')
+``` r
+devtools::install_github('sbfnk/bpmodels')
 ```
 
-Please note that the 'bpmodels' project is released with a [Contributor Code of Conduct](CODE_OF_CONDUCT.md). By contributing to this project, you agree to abide by its terms.
+Please note that the ‘bpmodels’ project is released with a [Contributor
+Code of Conduct](CODE_OF_CONDUCT.md). By contributing to this project,
+you agree to abide by its terms.
diff --git a/man/chain_ll.Rd b/man/chain_ll.Rd
index 82276f74..b110fdce 100644
--- a/man/chain_ll.Rd
+++ b/man/chain_ll.Rd
@@ -4,13 +4,22 @@
 \alias{chain_ll}
 \title{Likelihood for the outcome of a branching process}
 \usage{
-chain_ll(x, offspring, stat = c("size", "length"), obs_prob = 1,
-  infinite = Inf, exclude = c(), individual = FALSE, nsim_obs, ...)
+chain_ll(
+  x,
+  offspring,
+  stat = c("size", "length"),
+  obs_prob = 1,
+  infinite = Inf,
+  exclude = c(),
+  individual = FALSE,
+  nsim_obs,
+  ...
+)
 }
 \arguments{
 \item{x}{vector of sizes or lengths of transmission chains}
 
-\item{offspring}{offspring distribution: a character string corresponding to
+\item{offspring}{Offspring distribution: a character string corresponding to
 the R distribution function (e.g., "pois" for Poisson, where
 \code{\link{rpois}} is the R function to generate Poisson random numbers)}
 
@@ -41,7 +50,7 @@ chain_ll(chain_sizes, "pois", "size", lambda=0.5)
 }
 \seealso{
 pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll
-  geom_length_ll offspring_ll
+geom_length_ll offspring_ll
 }
 \author{
 Sebastian Funk
diff --git a/man/chain_sim.Rd b/man/chain_sim.Rd
index 5d2cf74d..fb86c0bf 100644
--- a/man/chain_sim.Rd
+++ b/man/chain_sim.Rd
@@ -2,51 +2,132 @@
 % Please edit documentation in R/simulate.r
 \name{chain_sim}
 \alias{chain_sim}
-\title{Simulate chains using a branching process}
+\title{Simulate transmission chains using a branching process}
 \usage{
-chain_sim(n, offspring, stat = c("size", "length"), infinite = Inf,
-  tree = FALSE, serial, t0 = 0, tf = Inf, ...)
+chain_sim(
+  n,
+  offspring,
+  stat = c("size", "length"),
+  infinite = Inf,
+  tree = FALSE,
+  serial,
+  t0 = 0,
+  tf = Inf,
+  ...
+)
 }
 \arguments{
-\item{n}{number of simulations to run.}
+\item{n}{Number of simulations to run.}
 
-\item{offspring}{offspring distribution: a character string corresponding to
+\item{offspring}{Offspring distribution: a character string corresponding to
 the R distribution function (e.g., "pois" for Poisson, where
 \code{\link{rpois}} is the R function to generate Poisson random numbers)}
 
-\item{stat}{statistic to calculate ("size" or "length" of chains)}
+\item{stat}{String; Statistic to calculate. Can be one of:
+\itemize{
+\item "size": the total number of offspring.
+\item "length": the total number of ancestors.
+}}
 
-\item{infinite}{a size or length from which the size/length is to be
-considered infinite}
+\item{infinite}{A size or length above which the simulation results should be
+set to \code{Inf}. Defaults to \code{Inf}, resulting in no results ever set to \code{Inf}}
 
-\item{tree}{return the tree of infectors}
+\item{tree}{Logical. Should the transmission tree be returned? Defaults to \code{FALSE}.}
 
-\item{serial}{the serial interval; a function that takes one parameter
-(`n`), the number of serial intervals to randomly sample; if this parameter
-  is set, `chain_sim` returns times of infection, too; implies (`tree`=TRUE)}
+\item{serial}{The serial interval generator function; the name of a user-defined
+named or anonymous function with only one argument \code{n}, representing the number
+of serial intervals to generate.}
 
-\item{t0}{start time (if serial interval is given); either a single value (0
-by default for all simulations, or a vector of length `n` with initial
-times)}
+\item{t0}{Start time (if serial interval is given); either a single value or a
+vector of length \code{n} (number of simulations) with initial times. Defaults to 0.}
 
-\item{tf}{end time (if serial interval is given)}
+\item{tf}{End time (if serial interval is given).}
 
-\item{...}{parameters of the offspring distribution}
+\item{...}{Parameters of the offspring distribution as required by R.}
 }
 \value{
-a vector of sizes/lengths (if \code{tree==FALSE} and no serial
-  interval given), or a data frame with columns `n` (simulation ID), `time`
-  (if the serial interval is given) and (if \code{tree==TRUE}) `id` (a
-  unique ID within each simulation for each individual element of the
-  chain), `ancestor` (the ID of the ancestor of each element) and
-  `generation`.
+Either:
+\itemize{
+\item{A vector of sizes/lengths (if \code{tree == FALSE} OR serial
+interval function not specified, since that implies \code{tree == FALSE})}, or
+\item {a data frame with
+columns \code{n} (simulation ID), \code{time} (if the serial interval is given) and
+(if \code{tree == TRUE}), \code{id} (a unique ID within each simulation for each
+individual element of the chain), \code{ancestor} (the ID of the ancestor of each
+element), and \code{generation}.}
+}
 }
 \description{
-Simulate chains using a branching process
+\code{chain_sim()} is a stochastic simulator for generating
+transmission chain data given information on the offspring distribution,
+serial interval, time since the first case, etc.
+}
+\details{
+\code{chain_sim()} either returns a vector or a data.frame. The output is either a
+vector if \code{serial} is not provided, which automatically sets \code{tree = FALSE},
+or a \code{data.frame}, which means that \code{serial} was provided as a function. When \code{serial}
+is provided, it means \code{tree = TRUE} automatically. However, setting
+\code{tree = TRUE} would require providing a function for \code{serial}.
+}
+\section{The serial interval (\code{serial}):}{
+\subsection{Assumptions/disambiguation}{
+
+In epidemiology, the generation interval is the duration between successive
+infectious events in a chain of transmission. Similarly, the serial interval is the
+duration between observed symptom onset times between successive
+cases in a transmission chain. The generation interval is often hard to observe
+because exact times of infection are hard to measure hence, the serial interval
+is often used instead. Here, we use the serial interval to represent what would
+normally be called the generation interval, that is, the time between successive
+cases.
+}
+
+\subsection{Specifying \code{serial} in \code{chain_sim()}}{
+
+\code{serial} must be specified as a named or
+\href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function}
+with one argument.
+
+If \code{serial} is specified, \code{chain_sim()} returns times of
+infection as a column in the output. Moreover, specifying a function for \code{serial} implies
+\code{tree = TRUE} and a tree of infectors (\code{ancestor}) and infectees (\code{id})
+will be generated in the output.
+
+For example, assuming we want to specify the serial interval
+generator as a random log-normally distributed variable with \code{meanlog = 0.58}
+and \code{sdlog = 1.58}, we could define a named function, let's call it
+"serial_interval", with only one argument representing the number of serial
+intervals to sample: \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
+and assign the name of the function to serial in \code{chain_sim()} like so
+\code{chain_sim(..., serial = serial_interval)},
+where \code{...} are the other arguments to \code{chain_sim()}. Alternatively, we
+could assign an anonymous function to serial in the \code{chain_sim()} call like so
+\code{chain_sim(..., serial = function(n){rlnorm(n, 0.58, 1.38)})},
+where \code{...} are the other arguments to \code{chain_sim()}.
 }
+}
+
 \examples{
-chain_sim(n=5, "pois", "size", lambda=0.5)
+# Specifying no `serial` and `tree == FALSE` (default) returns a vector
+set.seed(123)
+chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5, tree = FALSE)
+
+# Specifying `serial` without specifying `tree` will set `tree = TRUE` internally.
+ 
+# We'll first define the serial function 
+set.seed(123)
+serial_interval <- function(n){rlnorm(n, meanlog = 0.58, sdlog = 1.58)}
+chain_sim(n = 5, offspring = 'pois', lambda = 0.5, stat = 'length', infinite = 100, 
+serial = serial_interval)
+
+# Specifying `serial` and `tree = FALSE` will throw an error 
+set.seed(123)
+\dontrun{
+try(chain_sim(n = 10, serial = function(x) 3, offspring = "pois", lambda = 2, 
+infinite = 10, tree = FALSE)
+)
+}
 }
 \author{
-Sebastian Funk
+Sebastian Funk, James M. Azam
 }
diff --git a/man/chain_sim_susc.Rd b/man/chain_sim_susc.Rd
index 09b5858b..c06e52f1 100644
--- a/man/chain_sim_susc.Rd
+++ b/man/chain_sim_susc.Rd
@@ -5,8 +5,16 @@
 \title{Simulate a single chain using a branching process while accounting
 for depletion of susceptibles.}
 \usage{
-chain_sim_susc(offspring = c("pois", "nbinom"), mn_offspring,
-  disp_offspring, serial, t0 = 0, tf = Inf, pop, initial_immune = 0)
+chain_sim_susc(
+  offspring = c("pois", "nbinom"),
+  mn_offspring,
+  disp_offspring,
+  serial,
+  t0 = 0,
+  tf = Inf,
+  pop,
+  initial_immune = 0
+)
 }
 \arguments{
 \item{offspring}{offspring distribution: a character string corresponding to
@@ -20,7 +28,7 @@ more people than susceptibles available.}
 secondary cases. Ignored if offspring == "pois". Must be > 1.}
 
 \item{serial}{the serial interval. A function that takes one parameter
-(`n`), the number of serial intervals to randomly sample.
+(\code{n}), the number of serial intervals to randomly sample.
 Value must be >= 0.}
 
 \item{t0}{start time}
@@ -32,9 +40,9 @@ Value must be >= 0.}
 \item{initial_immune}{the number of initial immunes in the population}
 }
 \value{
-a data frame with columns `time`, `id` (a unique ID for each
-    individual element of the chain), `ancestor` (the ID of the ancestor
-     of each element), and `generation`.
+a data frame with columns \code{time}, \code{id} (a unique ID for each
+individual element of the chain), \code{ancestor} (the ID of the ancestor
+of each element), and \code{generation}.
 }
 \description{
 Simulate a single chain using a branching process while accounting
@@ -42,11 +50,11 @@ for depletion of susceptibles.
 }
 \details{
 This function has a couple of key differences with chain_sim:
-    it can only simulate one chain at a time,
-    it can only handle implemented offspring distributions
-        ("pois" and "nbinom"),
-    it always tracks and returns a data frame containing the entire tree,
-    the maximal length of chains is limited with pop instead of infinite.
+it can only simulate one chain at a time,
+it can only handle implemented offspring distributions
+("pois" and "nbinom"),
+it always tracks and returns a data frame containing the entire tree,
+the maximal length of chains is limited with pop instead of infinite.
 }
 \examples{
 chain_sim_susc("pois", mn_offspring=0.5, serial = function(x) 3, pop = 100)
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index 260f36cd..427eb61a 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -9,7 +9,7 @@ offspring_ll(x, offspring, stat, nsim_offspring = 100, ...)
 \arguments{
 \item{x}{vector of sizes}
 
-\item{offspring}{offspring distribution: a character string corresponding to
+\item{offspring}{Offspring distribution: a character string corresponding to
 the R distribution function (e.g., "pois" for Poisson, where
 \code{\link{rpois}} is the R function to generate Poisson random numbers)}
 
@@ -25,8 +25,8 @@ log-likelihood values
 }
 \description{
 The likelihoods are calculated with a crude approximation using simulated
-  chains by linearly approximating any missing values in the empirical
-  cumulative distribution function (ecdf).
+chains by linearly approximating any missing values in the empirical
+cumulative distribution function (ecdf).
 }
 \author{
 Sebastian Funk
diff --git a/vignettes/introduction.R b/vignettes/introduction.R
new file mode 100644
index 00000000..af96b639
--- /dev/null
+++ b/vignettes/introduction.R
@@ -0,0 +1,31 @@
+## ----setup, include = FALSE---------------------------------------------------
+library('knitr')
+knitr::opts_chunk$set(
+  collapse = TRUE,
+  comment = "#>"
+)
+
+## ----eval=FALSE---------------------------------------------------------------
+#  library('bpmodels')
+
+## ----echo=FALSE---------------------------------------------------------------
+suppressWarnings(library('bpmodels'))
+set.seed(13)
+
+## -----------------------------------------------------------------------------
+chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
+chain_ll(chain_sizes, "pois", "size", lambda = 0.5)
+
+## ----eval=FALSE---------------------------------------------------------------
+#  ?chains_ll
+
+## -----------------------------------------------------------------------------
+chain_sim(n = 5, "pois", "size", lambda = 0.5)
+
+## -----------------------------------------------------------------------------
+chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5, nsim_offspring = 100)
+
+## -----------------------------------------------------------------------------
+ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5, nsim_obs = 10)
+summary(ll)
+
diff --git a/vignettes/introduction.Rmd b/vignettes/introduction.Rmd
index 5d3e0c67..c66bb4f4 100644
--- a/vignettes/introduction.Rmd
+++ b/vignettes/introduction.Rmd
@@ -1,14 +1,23 @@
 ---
 title: "Analysing chain statistics using branching process models"
 author: "Sebastian Funk"
-date: "`r Sys.Date()`"
-output: rmarkdown::html_vignette
+output:
+  bookdown::html_vignette2:
+    fig_caption: yes
+    code_folding: show
+pkgdown:
+  as_is: true
+bibliography: references.bib
+link-citations: true
 vignette: >
   %\VignetteIndexEntry{Analysing chain statistics using branching process models}
-  %\VignetteEngine{knitr::rmarkdown}
   %\VignetteEncoding{UTF-8}
+  %\VignetteEngine{knitr::rmarkdown}
+editor_options: 
+  chunk_output_type: console
 ---
 
+
 ```{r setup, include = FALSE}
 library('knitr')
 knitr::opts_chunk$set(
@@ -17,9 +26,9 @@ knitr::opts_chunk$set(
 )
 ```
 
-[bpmodels](https://github.com/sbfnk/bpmodels) is an `R` package to analyse and simulate the size and length of branching processes with a given offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks.
+[bpmodels](https://github.com/sbfnk/bpmodels) is an `R` package to simulate and analyse the size and length of branching processes with a given offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks.
 
-# Usage
+# Quick start
 
 To load the package, use
 ```{r eval=FALSE}
@@ -30,47 +39,63 @@ suppressWarnings(library('bpmodels'))
 set.seed(13)
 ```
 
-At the heart of the package are the `chains_ll` and `chains_sim` functions. The `chains_ll` function calculates the log-likelihood of a distribution of chain sizes or lengths given an offspring distribution and associated parameters. For example, to get the log-likelihood for a given observed distribution of chain sizes assuming a mean number of 0.5 Poisson-distributed offspring per generation, use
+At the heart of the package are the `chains_ll()` and `chains_sim()` functions. 
+
+## Calculating log-likelihoods
+
+The `chains_ll()` function calculates the log-likelihood of a distribution of chain sizes or lengths given an offspring distribution and its associated parameters. 
+
+If we have observed a distribution of chains of sizes $1, 1, 4, 7$, we can calculate the log-likelihood of this observed chain by assuming the offspring per generation is Poisson distributed with a mean number of 0.5. 
+
+To do this, we run 
 
 ```{r}
-chain_sizes <- c(1,1,4,7) # example of observed chain sizes
-chain_ll(chain_sizes, "pois", "size", lambda=0.5)
+chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
+chain_ll(chain_sizes, "pois", "size", lambda = 0.5)
 ```
 
-The first argument of `chain_ll` is the size (or length) distribution to analyse. The second argument (called `offspring`) specifies the offspring distribution. This is given as a the function used to generate random offspring. It can be any probability distribution implemented in R, that is, one that has a corresponding function for generating random numbers beginning with the letter `r`. In the case of the example above, since random Poisson numbers are generated in R using a function called `rpois`, the string to pass to the `offspring` argument is `"pois"`.
+The first argument of `chain_ll()` is the size (or length) distribution to analyse. The second argument (called `offspring`) specifies the offspring distribution. This is given as a function used to generate random offspring. It can be any probability distribution implemented in `R`, that is, one that has a corresponding function for generating random numbers beginning with the letter `r`. In the case of the example above, since random Poisson numbers are generated in `R` using a function called `rpois()`, the string to pass to the `offspring` argument is `"pois"`.
 
-The third argument (called `stat`) determines whether to analyse chain sizes (`"size"`, the default if this argument is not specified) or lengths (`"length"`). Lastly, any named arguments not recognised by `chain_ll` are interpreted as parameters of the corresponding probability distribution, here `lambda=0.5` as the mean of the Poisson distribution (see the R help page for the Poisson distribution for more information).
+The third argument (called `stat`) determines whether to analyse chain sizes (`"size"`, the default if this argument is not specified) or lengths (`"length"`). Lastly, any named arguments not recognised by `chain_ll()` are interpreted as parameters of the corresponding probability distribution, here `lambda = 0.5` as the mean of the Poisson distribution (see the `R` help page for the [Poisson distribution](https://stat.ethz.ch/R-manual/R-devel/library/stats/html/Poisson.html) for more information).
 
-You can use the `R` help to find out about usage of the `chains_ll` function,
+To find out about usage of the `chains_ll()` function, you can use the `R` help file
 
 ```{r eval=FALSE}
 ?chains_ll
 ```
 
-To simulate from a branching process, use the `chain_sim` function, which follows the same syntax as the `chain_ll` function:
+## Simulating branching processes
+
+To simulate a branching process, we use the `chain_sim()` function. This function follows the same syntax as `chain_ll()`, that is:
 
 ```{r}
-chain_sim(n=5, "pois", "size", lambda=0.5)
+chain_sim(n = 5, "pois", "size", lambda = 0.5)
 ```
 
 # Methodology
 
-If the probability distribution of chain sizes or lengths has an analytical solution, this will be used (size distribution: Poisson and negative binomial; length distribution: Poisson and geometric). If not, simulations are used to approximate this probability distributions (using a linear approximation to the cumulative distribution for unobserved sizes/lengths), requiring an additional parameter `nsim_offspring` for the number of simulations to be used for this approximation. For example, to get offspring drawn from a binomial distribution with probability `prob=0.5`.
+If the probability distribution of chain sizes or lengths has an analytical solution, this will be used (size distribution: Poisson and negative binomial; length distribution: Poisson and geometric). 
+
+If an analytical solution does not exist, simulations are used to approximate this probability distributions (using a linear approximation to the cumulative distribution for unobserved sizes/lengths). The argument `nsim_offspring` is used to specify the number of simulations to be used for this approximation. 
+
+For example, to get offspring drawn from a binomial distribution with probability `prob = 0.5`, we run
 
 ```{r}
-chain_ll(chain_sizes, "binom", "size", size=1, prob=0.5, nsim_offspring=100)
+chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5, nsim_offspring = 100)
 ```
 
 # Imperfect observations
 
-The `chain_ll` function has an `obs_prob` parameter that can be used to determine the likelihood if observations are imperfect. In that case, true chain sizes or lengths are simulated repeatedly (the number of times given by the `nsim_obs` argument) and the likelihood calculated for each of these simulations. For example, if the probability of observing each case is 30%, use
+If observations are imperfect, the `chain_ll()` function has an `obs_prob` argument that can be used to determine the likelihood. In that case, true chain sizes or lengths are simulated repeatedly (the number of times given by the `nsim_obs` argument), and the likelihood calculated for each of these simulations. 
+
+For example, if the probability of observing each case is $30%$, we use
 
 ```{r}
-ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda=0.5, nsim_obs=10)
+ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5, nsim_obs = 10)
 summary(ll)
 ```
 
-This returns `nsim_obs=10` likelihood values which can be averaged to come up with an overall likelihood estimate.
+This returns `nsim_obs = 10` likelihood values which can be averaged to come up with an overall likelihood estimate.
 
 # References
 
diff --git a/vignettes/introduction.md b/vignettes/introduction.md
deleted file mode 100644
index 641d09d0..00000000
--- a/vignettes/introduction.md
+++ /dev/null
@@ -1,95 +0,0 @@
-[bpmodels](https://github.com/sbfnk/bpmodels) is an `R` package to
-analyse and simulate the size and length of branching processes with a
-given offspring distribution. These can be used, for example, to analyse
-the distribution of chain sizes or length of infectious disease
-outbreaks.
-
-Usage
-=====
-
-To load the package, use
-
-    library('bpmodels')
-
-At the heart of the package are the `chains_ll` and `chains_sim`
-functions. The `chains_ll` function calculates the log-likelihood of a
-distribution of chain sizes or lengths given an offspring distribution
-and associated parameters. For example, to get the log-likelihood for a
-given observed distribution of chain sizes assuming a mean number of 0.5
-Poisson-distributed offspring per generation, use
-
-    chain_sizes <- c(1,1,4,7) # example of observed chain sizes
-    chain_ll(chain_sizes, "pois", "size", lambda=0.5)
-    #> [1] -8.607196
-
-The first argument of `chain_ll` is the size (or length) distribution to
-analyse. The second argument (called `offspring`) specifies the
-offspring distribution. This is given as a the function used to generate
-random offspring. It can be any probability distribution implemented in
-R, that is, one that has a corresponding function for generating random
-numbers beginning with the letter `r`. In the case of the example above,
-since random Poisson numbers are generated in R using a function called
-`rpois`, the string to pass to the `offspring` argument is `"pois"`.
-
-The third argument (called `stat`) determines whether to analyse chain
-sizes (`"size"`, the default if this argument is not specified) or
-lengths (`"length"`). Lastly, any named arguments not recognised by
-`chain_ll` are interpreted as parameters of the corresponding
-probability distribution, here `lambda=0.5` as the mean of the Poisson
-distribution (see the R help page for the Poisson distribution for more
-information).
-
-You can use the `R` help to find out about usage of the `chains_ll`
-function,
-
-    ?chains_ll
-
-To simulate from a branching process, use the `chain_sim` function,
-which follows the same syntax as the `chain_ll` function:
-
-    chain_sim(n=5, "pois", "size", lambda=0.5)
-    #> [1] 2 1 1 1 5
-
-Methodology
-===========
-
-If the probability distribution of chain sizes or lengths has an
-analytical solution, this will be used (size distribution: Poisson and
-negative binomial; length distribution: Poisson and geometric). If not,
-simulations are used to approximate this probability distributions
-(using a linear approximation to the cumulative distribution for
-unobserved sizes/lengths), requiring an additional parameter
-`nsim_offspring` for the number of simulations to be used for this
-approximation. For example, to get offspring drawn from a binomial
-distribution with probability `prob=0.5`.
-
-    chain_ll(chain_sizes, "binom", "size", size=1, prob=0.5, nsim_offspring=100)
-    #> [1] -8.477588
-
-Imperfect observations
-======================
-
-The `chain_ll` function has an `obs_prob` parameter that can be used to
-determine the likelihood if observations are imperfect. In that case,
-true chain sizes or lengths are simulated repeatedly (the number of
-times given by the `nsim_obs` argument) and the likelihood calculated
-for each of these simulations. For example, if the probability of
-observing each case is 30%, use
-
-    ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda=0.5, nsim_obs=10)
-    summary(ll)
-    #>    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
-    #>  -35.30  -25.68  -23.23  -24.19  -20.89  -18.91
-
-This returns `nsim_obs=10` likelihood values which can be averaged to
-come up with an overall likelihood estimate.
-
-References
-==========
-
--   Farrington, C.P., Kanaan, M.N. and Gay, N.J. (2003). [Branching
-    process models for surveillance of infectious diseases controlled by
-    mass vaccination](https://doi.org/10.1093/biostatistics/4.2.279).
--   Blumberg, S. and Lloyd-Smith, J.O. (2013). [Comparing methods for
-    estimating R0 from the size distribution of subcritical transmission
-    chains](https://doi.org/10.1016/j.epidem.2013.05.002).
diff --git a/vignettes/projecting_incidence.R b/vignettes/projecting_incidence.R
new file mode 100644
index 00000000..f46303b5
--- /dev/null
+++ b/vignettes/projecting_incidence.R
@@ -0,0 +1,120 @@
+## ----setup, include=FALSE-----------------------------------------------------
+knitr::opts_chunk$set(echo = TRUE, 
+                      message = FALSE, 
+                      warning = FALSE, 
+                      collapse = TRUE,
+                      comment = "#>"
+                      )
+
+
+## ----loading_packages, include=TRUE-------------------------------------------
+library("bpmodels")
+library('dplyr')
+library('ggplot2')
+library('lubridate')
+
+## ----data_generation, message=FALSE-------------------------------------------
+set.seed(12)
+cases_df <- data.frame(date = as.Date('2023-01-01') + seq_len(12),
+                       cases = rnbinom(12, size = 7.5, mu = 5)
+                       )
+head(cases_df)
+
+ggplot(cases_df, 
+       aes(x = date, y = cases)
+       ) + 
+  geom_col(fill = 'tomato3', size = 1)
+
+## ----input_prep, message=FALSE------------------------------------------------
+# We will create a vector of starting times for each case, using the time of the index cases as the reference point
+cases_df$days_since_index <- as.integer(cases_df$date - min(cases_df$date))
+
+#'Disaggregate the time series 
+case_times <- unlist(mapply(function(x, y) rep(x, times = ifelse(y == 0, 1, y)), 
+                       cases_df$days_since_index, 
+                       cases_df$cases
+                       )
+                       )
+                       
+
+
+#' Date to end simulation (14 day projection in this case)
+projection_window <- 14 #2 week ahead projection
+project_to_date <- max(cases_df$days_since_index) + projection_window 
+
+
+#' Number of simulations and maximum chain size
+sim_rep <- 1000
+cases_to_project <- 1000
+
+
+### Specifying the `serial` argument to `chain_sim()`
+#' Assume serial interval follows log-normal distribution with mean, mu = 4.7, 
+#' and standard deviation, sigma = 2.9, then the desired standard deviation, si_sd, 
+#' and mean, si_mean, are
+sigma = 2.9
+mu = 4.7
+
+si_sd <- sqrt(log(1 + (sigma/mu)^2)) #log standard deviation
+si_mean <- log((mu^2)/(sqrt(sigma^2 + mu^2))) #log mean
+
+#' serial interval function
+serial_interval <- function(sample_size) {
+  si <- rlnorm(sample_size, meanlog = si_mean, sdlog = si_sd)
+  return(si)
+}
+
+## ----simulations, message=FALSE-----------------------------------------------
+## Chain log-likelihood simulation
+sim_chain_sizes <- lapply(seq_len(sim_rep),
+                           function(sim){chain_sim(
+                               n = length(case_times),
+                               offspring = "nbinom",
+                               mu = 2.0,
+                               size = 0.38,
+                               stat = "size",
+                               infinite = cases_to_project,
+                               serial = serial_interval,
+                               t0 = case_times,
+                               tf = project_to_date,
+                               tree = TRUE
+                           ) |> 
+                               mutate(sim = sim)} 
+                          )
+
+sim_output <- do.call(rbind, sim_chain_sizes) 
+
+## ----post_processing----------------------------------------------------------
+ref_date <- min(cases_df$date)
+
+incidence_ts <- sim_output |> 
+  mutate(day = floor(time)) |> 
+  group_by(sim, day) |> 
+  summarise(cases = n()) |>  
+  ungroup()
+
+
+## Median cases by date.  
+median_daily_cases <- incidence_ts |>
+  group_by(day)|>
+  summarise(median_cases = median(cases)) |>
+  ungroup()|>
+  arrange(day) |>
+  mutate(date = ymd(ref_date) + 0:(project_to_date - 1))
+
+
+## ----visualisation------------------------------------------------------------
+# Visualization
+cases_plot <- ggplot(data = median_daily_cases) +
+  geom_col(aes(x = date, y = median_cases),
+           fill = "tomato3",
+           size = 1
+  ) +
+  scale_y_continuous(breaks = seq(0, max(median_daily_cases$median_cases) + 20, 20),
+                     labels = seq(0, max(median_daily_cases$median_cases) + 20, 20)
+  ) +
+  labs(x = 'Date', y = 'Daily cases (median)') + 
+  theme_minimal(base_size = 14)
+
+print(cases_plot)
+
diff --git a/vignettes/references.bib b/vignettes/references.bib
new file mode 100644
index 00000000..64afb555
--- /dev/null
+++ b/vignettes/references.bib
@@ -0,0 +1,23 @@
+@Article{Farrington2003,
+  author    = {Farrington, CP and Kanaan, MN and Gay, NJ},
+  journal   = {Biostatistics},
+  title     = {Branching process models for surveillance of infectious diseases controlled by mass vaccination},
+  year      = {2003},
+  number    = {2},
+  pages     = {279--295},
+  volume    = {4},
+  publisher = {Oxford University Press},
+}
+
+@Article{Blumberg2013,
+  author    = {Blumberg, Seth and Lloyd-Smith, James O},
+  journal   = {Epidemics},
+  title     = {Comparing methods for estimating R0 from the size distribution of subcritical transmission chains},
+  year      = {2013},
+  number    = {3},
+  pages     = {131--145},
+  volume    = {5},
+  publisher = {Elsevier},
+}
+
+@Comment{jabref-meta: databaseType:bibtex;}

From 16b5de5ba7b7cc847f97be68d9a16b3fe508b709 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 27 Jan 2023 11:15:20 +0000
Subject: [PATCH 060/828] Removed error when tree = FALSE and serial specified
 (#20)

Previously, an error would be thrown if the user specified a serial interval but also set tree=FALSE. The allowed specification here is tree=TRUE. This update will remove the error and rather throw a warning message.
---
 R/simulate.r | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 24f66ba8..520a3baf 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -116,8 +116,9 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
             stop("The `serial` argument must be a function (see details in ?chain_sim()).")
         }
         if (!missing(tree) && tree == FALSE) {
-            stop("The `serial` argument can't be used with `tree==FALSE`.")
-        }
+            warning("`serial` can't be used with `tree = FALSE`; Setting `tree = TRUE` internally.")
+          tree <- TRUE
+          }
         tree <- TRUE
     } else if (!missing(tf)) {
         stop("The `tf` argument needs a `serial` argument.")

From aca877129aa216775d58034d588ab422187aa3cb Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 27 Jan 2023 11:36:45 +0000
Subject: [PATCH 061/828] Tests and updated error messaging (#19)

* fixed partial argument in an old test

* fixed a failing test

* revised some error messages

* added more unit tests to improve coverage
---
 R/simulate.r                 |  8 +++----
 tests/testthat/tests-sim.r   | 41 +++++++++++++++++++++++++++++++++---
 vignettes/references.bib.sav | 23 ++++++++++++++++++++
 3 files changed, 64 insertions(+), 8 deletions(-)
 create mode 100644 vignettes/references.bib.sav

diff --git a/R/simulate.r b/R/simulate.r
index 520a3baf..f7df35a4 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -97,7 +97,6 @@
 #' }
 chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
                       tree = FALSE, serial, t0 = 0, tf = Inf, ...) {
-
     stat <- match.arg(stat)
 
     ## first, get random function as given by `offspring`
@@ -116,12 +115,11 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
             stop("The `serial` argument must be a function (see details in ?chain_sim()).")
         }
         if (!missing(tree) && tree == FALSE) {
-            warning("`serial` can't be used with `tree = FALSE`; Setting `tree = TRUE` internally.")
-          tree <- TRUE
-          }
+            stop("If `serial` is specified, then `tree` cannot be set to `FALSE`.")
+        }
         tree <- TRUE
     } else if (!missing(tf)) {
-        stop("The `tf` argument needs a `serial` argument.")
+        stop("If `tf` is specified, `serial` must be specified too.")
     }
 
     stat_track <- rep(1, n) ## track length or size (depending on `stat`)
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 348c4bf3..3c74acb4 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -8,12 +8,33 @@ test_that("Chains can be simulated",
                                         infinite=10)))
     expect_false(any(is.finite(chain_sim(n=2, "pois", "length", lambda=0.5,
                                          infinite=1))))
+    expect_no_error(chain_sim(n = 2, offspring = 'pois', "size", lambda = 0.9, 
+                                               tree = TRUE)
+                    )
 })
 
 test_that("Errors are thrown",
 {
     expect_error(chain_sim(n=2, "dummy"), "does not exist")
     expect_error(chain_sim(n=2, "lnorm", meanlog=log(1.6)), "integer")
+    expect_error(chain_sim(n = 2, offspring = pois, "length", lambda = 0.9), 
+                 "not found"
+                 )
+    expect_error(chain_sim(n = 2, offspring = 'pois', "size", lambda = 0.9, 
+                           serial = c(1:2), "must be a function")
+                 )
+    expect_error(chain_sim(n = 2, offspring = c(1, 2), "length", lambda = 0.9),
+                 "not a character string")
+    expect_error(chain_sim(n = 2, offspring = list(1, 2), "length", lambda = 0.9),
+                 "not a character string")
+    expect_error(chain_sim(n = 2, offspring = 'pois', "size", lambda = 0.9, 
+                           serial = function(x) rpois(x, 0.9), tree = FALSE),
+                 "If `serial` is specified, then `tree` cannot be set to `FALSE`."
+                 )
+    expect_error(chain_sim(n = 2, offspring = 'pois', "size", lambda = 0.9, 
+                           tf = 5, tree = FALSE),
+                 "If `tf` is specified, `serial` must be specified too."
+    )
 })
 
 context("Simulating from a branching process model
@@ -38,7 +59,7 @@ test_that("Chains can be simulated",
             chain_sim_susc(
                 "nbinom",
                 mn_offspring = 2,
-                disp = 1.5,
+                disp_offspring = 1.5,
                 serial = function(x) 3,
                 pop = 100
             )
@@ -90,7 +111,7 @@ test_that("Errors are thrown",
             mn_offspring = 3,
             serial = function(x) 3,
             pop = 100),
-        "'arg' should be one of \"pois\", \"nbinom\"")
+        paste0("'arg' should be one of ", dQuote('pois'), ', ', dQuote('nbinom')))
     expect_error(
         chain_sim_susc(
             "nbinom",
@@ -110,4 +131,18 @@ test_that("Errors are thrown",
             ),
         "argument \"disp_offspring\" is missing, with no default")
 
-})
\ No newline at end of file
+})
+
+test_that('warnings work as expected', {
+  expect_warning(
+    chain_sim_susc(
+      "pois",
+      mn_offspring = 3,
+      disp_offspring = 1,
+      serial = function(x) 3,
+      pop = 100
+      ),
+    "argument disp_offspring not used for
+                poisson offspring distribution."
+    )
+})
diff --git a/vignettes/references.bib.sav b/vignettes/references.bib.sav
new file mode 100644
index 00000000..f0c72507
--- /dev/null
+++ b/vignettes/references.bib.sav
@@ -0,0 +1,23 @@
+@Article{farrington2003branching,
+  author    = {Farrington, CP and Kanaan, MN and Gay, NJ},
+  journal   = {Biostatistics},
+  title     = {Branching process models for surveillance of infectious diseases controlled by mass vaccination},
+  year      = {2003},
+  number    = {2},
+  pages     = {279--295},
+  volume    = {4},
+  publisher = {Oxford University Press},
+}
+
+@Article{blumberg2013comparing,
+  author    = {Blumberg, Seth and Lloyd-Smith, James O},
+  journal   = {Epidemics},
+  title     = {Comparing methods for estimating R0 from the size distribution of subcritical transmission chains},
+  year      = {2013},
+  number    = {3},
+  pages     = {131--145},
+  volume    = {5},
+  publisher = {Elsevier},
+}
+
+@Comment{jabref-meta: databaseType:bibtex;}

From 96583748cf1a4330c22889021f997ea88e3ea3f7 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 27 Jan 2023 12:00:42 +0000
Subject: [PATCH 062/828] fixed covr badge pointing to master (#26)

---
 README.Rmd | 2 +-
 README.md  | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.Rmd b/README.Rmd
index 38d313ed..5405893f 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -14,7 +14,7 @@ knitr::opts_chunk$set(
 ```
 <!-- badges: start -->
 [![R-CMD-check](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml)
-[![codecov](https://codecov.io/github/epiverse-trace/bpmodels/branch/master/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/bpmodels) 
+[![codecov](https://codecov.io/github/epiverse-trace/bpmodels/branch/main/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/bpmodels) 
 <!-- badges: end -->
 ```{r setup, include=FALSE}
 knitr::opts_chunk$set(echo = TRUE)
diff --git a/README.md b/README.md
index f64355ad..b1cbaa53 100644
--- a/README.md
+++ b/README.md
@@ -2,7 +2,7 @@
 <!-- badges: start -->
 
 [![R-CMD-check](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml)
-[![codecov](https://codecov.io/github/epiverse-trace/bpmodels/branch/master/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/bpmodels)
+[![codecov](https://codecov.io/github/epiverse-trace/bpmodels/branch/main/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/bpmodels)
 <!-- badges: end -->
 
 `bpmodels` is an R package to simulate and analyse the size and length

From 1d5a90861f151887770a87746e625601f260f0fa Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Thu, 5 Jan 2023 17:24:36 +0000
Subject: [PATCH 063/828] gitignore

---
 .gitignore | 6 ------
 1 file changed, 6 deletions(-)

diff --git a/.gitignore b/.gitignore
index 8b89fb81..2d674619 100644
--- a/.gitignore
+++ b/.gitignore
@@ -29,9 +29,3 @@ vignettes/*.pdf
 rsconnect/
 /doc/
 /Meta/
-
-.Rbuildignore
-
-*.Rproj
-
-*.bib.sav

From 9b28b1491a4b366aa32dcbead4e4e87a277a8485 Mon Sep 17 00:00:00 2001
From: jamesmbaazam <jamesazam@sun.ac.za>
Date: Fri, 13 Jan 2023 12:45:15 +0000
Subject: [PATCH 064/828] removed references.bib.sav

---
 vignettes/references.bib.sav | 23 -----------------------
 1 file changed, 23 deletions(-)
 delete mode 100644 vignettes/references.bib.sav

diff --git a/vignettes/references.bib.sav b/vignettes/references.bib.sav
deleted file mode 100644
index f0c72507..00000000
--- a/vignettes/references.bib.sav
+++ /dev/null
@@ -1,23 +0,0 @@
-@Article{farrington2003branching,
-  author    = {Farrington, CP and Kanaan, MN and Gay, NJ},
-  journal   = {Biostatistics},
-  title     = {Branching process models for surveillance of infectious diseases controlled by mass vaccination},
-  year      = {2003},
-  number    = {2},
-  pages     = {279--295},
-  volume    = {4},
-  publisher = {Oxford University Press},
-}
-
-@Article{blumberg2013comparing,
-  author    = {Blumberg, Seth and Lloyd-Smith, James O},
-  journal   = {Epidemics},
-  title     = {Comparing methods for estimating R0 from the size distribution of subcritical transmission chains},
-  year      = {2013},
-  number    = {3},
-  pages     = {131--145},
-  volume    = {5},
-  publisher = {Elsevier},
-}
-
-@Comment{jabref-meta: databaseType:bibtex;}

From bf1ddecd709f5e4a76ced68359eb3065f0dfbaf1 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Sat, 14 Jan 2023 21:13:59 +0000
Subject: [PATCH 065/828] skeletal version of vignette about projecting
 incidence

---
 vignettes/projecting_incidence.Rmd | 139 +++++++++++++++++++++++++++++
 1 file changed, 139 insertions(+)
 create mode 100644 vignettes/projecting_incidence.Rmd

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
new file mode 100644
index 00000000..08c1ea4c
--- /dev/null
+++ b/vignettes/projecting_incidence.Rmd
@@ -0,0 +1,139 @@
+---
+title: "Projecting future disease incidence given early outbreak data"
+author: "James Azam, Sebastian Funk"
+date: '2023-01-13'
+output: html_document
+---
+
+```{r setup, include=FALSE}
+knitr::opts_chunk$set(echo = TRUE, message = FALSE, warning = FALSE)
+
+library("bpmodels")
+library("readr")
+library("lubridate")
+library('dplyr')
+library('ggplot2')
+```
+
+## Description
+Branching processes can be used to project future disease incidence given early 
+outbreak data. `bpmodels` can simulate branching processes using its `chain_sim()` function. 
+
+## Disease data
+
+Let's create some early outbreak data, assuming the cases have a negative binomial \
+distribution
+
+```{r data_generation, message=FALSE}
+set.seed(12)
+cases_df <- data.frame(date = as.Date('2023-01-01') + seq_len(12),
+                       cases = rnbinom(12, size = 7.5, mu = 5)
+                       )
+head(cases_df)
+ggplot(cases_df, aes(x = date, y = cases)) + geom_col()
+```
+
+## Preparing the inputs  
+
+```{r input_prep, message=FALSE}
+# We will create a vector of starting times for each case, using the time of the index cases as the reference point
+cases_df$days_since_index <- as.integer(cases_df$date - min(cases_df$date))
+
+#'Disaggregate the time series 
+case_times <- unlist(mapply(function(x, y) rep(x, times = ifelse(y == 0, 1, y)), 
+                       cases_df$days_since_index, 
+                       cases_df$cases
+                       )
+                       )
+                       
+
+
+#' Date to end simulation (14 day projection in this case)
+projection_window <- 14 #2 week ahead projection
+project_to_date <- max(cases_df$days_since_index) + projection_window 
+
+
+#' Number of simulations and maximum chain size
+sim_rep <- 1000
+cases_to_project <- 1000
+
+
+### Specifying the `serial` argument to `chain_sim()`
+#' Assume serial interval follows log-normal distribution with mean, mu = 4.7, 
+#' and standard deviation, sigma = 2.9, then the desired standard deviation, si_sd, 
+#' and mean, si_mean, are
+sigma = 2.9
+mu = 4.7
+
+si_sd <- sqrt(log(1 + (sigma/mu)^2)) 
+si_mean <- log((mu^2)/(sqrt(sigma^2 + mu^2))) #the desired mean
+
+#' serial interval function
+serial_interval <- function(sample_size = 1) {
+  si <- rlnorm(sample_size, meanlog = si_mean, sdlog = si_sd)
+  return(si)
+}
+```
+
+## Simulations
+```{r simulations, message=FALSE}
+## Chain log-likelihood simulation
+sim_chain_sizes <- lapply(seq_len(sim_rep),
+                           function(sim){chain_sim(
+                               n = length(case_times),
+                               offspring = "nbinom",
+                               mu = 2.0,
+                               size = 0.38,
+                               stat = "size",
+                               infinite = cases_to_project,
+                               serial = serial_interval,
+                               t0 = case_times,
+                               tf = project_to_date,
+                               tree = TRUE
+                           ) |> 
+                               mutate(sim = sim)} 
+                          )
+
+sim_output <- do.call(rbind, sim_chain_sizes) 
+```
+
+
+### Post-processing
+```{r post_processing}
+ref_date <- min(cases_df$date)
+
+incidence_ts <- sim_output |> 
+  mutate(day = floor(time)) |> 
+  group_by(sim, day) |> 
+  summarise(cases = n()) |>  
+  ungroup()
+
+
+## Median cases by date.  
+median_daily_cases <- incidence_ts |>
+  group_by(day)|>
+  summarise(median_cases = median(cases)) |>
+  ungroup()|>
+  arrange(day) |>
+  mutate(date = ymd(ref_date) + 0:(project_to_date - 1))
+
+```
+
+
+## Visualization
+```{r visualisation}
+# Visualization
+cases_plot <- ggplot(data = median_daily_cases) +
+  geom_col(aes(x = date, y = median_cases),
+           fill = "tomato3",
+           size = 1
+  ) +
+  scale_y_continuous(breaks = seq(0, max(median_daily_cases$median_cases) + 20, 20),
+                     labels = seq(0, max(median_daily_cases$median_cases) + 20, 20)
+  ) +
+  labs(x = 'Date', y = 'Daily cases (median)') + 
+  theme_minimal(base_size = 14)
+
+print(cases_plot)
+```
+

From 744fe768aa204862aeb2285ea619f38069fe345d Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 17 Jan 2023 14:03:39 +0000
Subject: [PATCH 066/828] added a bib file for the references

---
 vignettes/references.bib | 240 +++++++++++++++++++++++++++++++++++----
 1 file changed, 217 insertions(+), 23 deletions(-)

diff --git a/vignettes/references.bib b/vignettes/references.bib
index 64afb555..c19f323b 100644
--- a/vignettes/references.bib
+++ b/vignettes/references.bib
@@ -1,23 +1,217 @@
-@Article{Farrington2003,
-  author    = {Farrington, CP and Kanaan, MN and Gay, NJ},
-  journal   = {Biostatistics},
-  title     = {Branching process models for surveillance of infectious diseases controlled by mass vaccination},
-  year      = {2003},
-  number    = {2},
-  pages     = {279--295},
-  volume    = {4},
-  publisher = {Oxford University Press},
-}
-
-@Article{Blumberg2013,
-  author    = {Blumberg, Seth and Lloyd-Smith, James O},
-  journal   = {Epidemics},
-  title     = {Comparing methods for estimating R0 from the size distribution of subcritical transmission chains},
-  year      = {2013},
-  number    = {3},
-  pages     = {131--145},
-  volume    = {5},
-  publisher = {Elsevier},
-}
-
-@Comment{jabref-meta: databaseType:bibtex;}
+@article{Farrington2003,
+abstract = {Mass vaccination programmes aim to maintain the effective reproduction number R of an infection below unity. We describe methods for monitoring the value of R using surveillance data. The models are based on branching processes in which R is identified with the offspring mean. We derive unconditional likelihoods for the offspring mean using data on outbreak size and outbreak duration. We also discuss Bayesian methods, implemented by Metropolis-Hastings sampling. We investigate by simulation the validity of the models with respect to depletion of susceptibles and under-ascertainment of cases. The methods are illustrated using surveillance data on measles in the USA.},
+author = {Farrington, C. P. and Kanaan, M. N. and Gay, N. J.},
+doi = {10.1093/biostatistics/4.2.279},
+issn = {14654644},
+journal = {Biostatistics (Oxford, England)},
+number = {2},
+pages = {279--295},
+title = {{Branching process models for surveillance of infectious diseases controlled by mass vaccination.}},
+volume = {4},
+year = {2003}
+}
+@article{Jacob2010,
+abstract = {Branching processes are stochastic individual-based processes leading consequently to a bottom-up approach. In addition, since the state variables are random integer variables (representing population sizes), the extinction occurs at random finite time on the extinction set, thus leading to fine and realistic predictions. Starting from the simplest and well-known single-type Bienaym{\'{e}}-Galton-Watson branching process that was used by several authors for approximating the beginning of an epidemic, we then present a general branching model with age and population dependent individual transitions. However contrary to the classical Bienaym{\'{e}}-Galton-Watson or asymptotically Bienaym{\'{e}}-Galton-Watson setting, where the asymptotic behavior of the process, as time tends to infinity, is well understood, the asymptotic behavior of this general process is a new question. Here we give some solutions for dealing with this problem depending on whether the initial population size is large or small, and whether the disease is rare or non-rare when the initial population size is large.},
+author = {Jacob, Christine},
+doi = {10.3390/ijerph7031204},
+issn = {16604601},
+journal = {International Journal of Environmental Research and Public Health},
+keywords = {Age-dependence,Branching process,Epidemic size,Extinction time,Population-dependence},
+number = {3},
+pages = {1186--1204},
+title = {{Branching processes: Their role in epidemiology}},
+volume = {7},
+year = {2010}
+}
+@article{Blumberg2013,
+abstract = {Many diseases exhibit subcritical transmission (i.e. 0<R0<1) so that infections occur as self-limited 'stuttering chains'. Given an ensemble of stuttering chains, information about the number of cases in each chain can be used to infer R0, which is of crucial importance for monitoring the risk that a disease will emerge to establish endemic circulation. However, the challenge of imperfect case detection has led authors to adopt a variety of work-around measures when inferring R0, such as discarding data on isolated cases or aggregating intermediate-sized chains together. Each of these methods has the potential to introduce bias, but a quantitative comparison of these approaches has not been reported. By adapting a model based on a negative binomial offspring distribution that permits a variable degree of transmission heterogeneity, we present a unified analysis of existing R0 estimation methods. Simulation studies show that the degree of transmission heterogeneity, when improperly modeled, can significantly impact the bias of R0 estimation methods designed for imperfect observation. These studies also highlight the importance of isolated cases in assessing whether an estimation technique is consistent with observed data. Analysis of data from measles outbreaks shows that likelihood scores are highest for models that allow a flexible degree of transmission heterogeneity. Aggregating intermediate sized chains often has similar performance to analyzing a complete chain size distribution. However, truncating isolated cases is beneficial only when surveillance systems clearly favor full observation of large chains but not small chains. Meanwhile, if data on the type and proportion of cases that are unobserved were known, we demonstrate that maximum likelihood inference of R0 could be adjusted accordingly. This motivates the need for future empirical and theoretical work to quantify observation error and incorporate relevant mechanisms into stuttering chain models used to estimate transmission parameters. {\textcopyright} 2013 Elsevier B.V.},
+author = {Blumberg, S. and Lloyd-Smith, J. O.},
+doi = {10.1016/j.epidem.2013.05.002},
+issn = {17554365},
+journal = {Epidemics},
+keywords = {Basic reproductive number,Imperfect observation,Measles,Stuttering chain,Transmission heterogeneity},
+number = {3},
+pages = {131--145},
+pmid = {24021520},
+publisher = {Elsevier B.V.},
+title = {{Comparing methods for estimating R0 from the size distribution of subcritical transmission chains}},
+url = {http://dx.doi.org/10.1016/j.epidem.2013.05.002},
+volume = {5},
+year = {2013}
+}
+@article{Nishiura2012,
+abstract = {Use of the final size distribution of minor outbreaks for the estimation of the reproduction numbers of supercritical epidemic processes has yet to be considered. We used a branching process model to derive the final size distribution of minor outbreaks, assuming a reproduction number above unity, and applying the method to final size data for pneumonic plague. Pneumonic plague is a rare disease with only one documented major epidemic in a spatially limited setting. Because the final size distribution of a minor outbreak needs to be normalized by the probability of extinction, we assume that the dispersion parameter (k) of the negative-binomial offspring distribution is known, and examine the sensitivity of the reproduction number to variation in dispersion. Assuming a geometric offspring distribution with k=1, the reproduction number was estimated at 1.16 (95% confidence interval: 0.97-1.38). When less dispersed with k=2, the maximum likelihood estimate of the reproduction number was 1.14. These estimates agreed with those published from transmission network analysis, indicating that the human-to-human transmission potential of the pneumonic plague is not very high. Given only minor outbreaks, transmission potential is not sufficiently assessed by directly counting the number of offspring. Since the absence of a major epidemic does not guarantee a subcritical process, the proposed method allows us to conservatively regard epidemic data from minor outbreaks as supercritical, and yield estimates of threshold values above unity. {\textcopyright} 2011.},
+author = {Nishiura, Hiroshi and Yan, Ping and Sleeman, Candace K. and Mode, Charles J.},
+doi = {10.1016/j.jtbi.2011.10.039},
+issn = {00225193},
+journal = {Journal of Theoretical Biology},
+keywords = {Basic reproduction number,Branching process,Confidence interval,Likelihood function,Statistical model},
+pages = {48--55},
+pmid = {22079419},
+publisher = {Elsevier},
+title = {{Estimating the transmission potential of supercritical processes based on the final size distribution of minor outbreaks}},
+url = {http://dx.doi.org/10.1016/j.jtbi.2011.10.039},
+volume = {294},
+year = {2012}
+}
+@article{Society2010,
+author = {Becker, Niels and Society, International Biometric},
+issn = {0006-341X},
+journal = {Biometrics},
+number = {3},
+pages = {515--522},
+publisher = {JSTOR},
+title = {{Estimation for discrete time branching processes with application to epidemics}},
+volume = {33},
+year = {1977}
+}
+@article{Allen2012,
+abstract = {The basic reproduction number, ℛ(0), one of the most well-known thresholds in deterministic epidemic theory, predicts a disease outbreak if ℛ(0)>1. In stochastic epidemic theory, there are also thresholds that predict a major outbreak. In the case of a single infectious group, if ℛ(0)>1 and i infectious individuals are introduced into a susceptible population, then the probability of a major outbreak is approximately 1-(1/ℛ(0))( i ). With multiple infectious groups from which the disease could emerge, this result no longer holds. Stochastic thresholds for multiple groups depend on the number of individuals within each group, i ( j ), j=1, {\ldots}, n, and on the probability of disease extinction for each group, q ( j ). It follows from multitype branching processes that the probability of a major outbreak is approximately [Formula: see text]. In this investigation, we summarize some of the deterministic and stochastic threshold theory, illustrate how to calculate the stochastic thresholds, and derive some new relationships between the deterministic and stochastic thresholds.},
+author = {Allen, Linda J.S. and Lahodny, Glenn E.},
+doi = {10.1080/17513758.2012.665502},
+issn = {17513758},
+journal = {Journal of Biological Dynamics},
+keywords = {multitype branching processes,reproduction numbers},
+number = {2},
+pages = {590--611},
+title = {{Extinction thresholds in deterministic and stochastic epidemic models}},
+volume = {6},
+year = {2012}
+}
+@article{Blumberg2013a,
+abstract = {For many infectious disease processes such as emerging zoonoses and vaccine-preventable diseases, 0<R0<1 and infections occur as self-limited stuttering transmission chains. A mechanistic understanding of transmission is essential for characterizing the risk of emerging diseases and monitoring spatio-temporal dynamics. Thus methods for inferring R0 and the degree of heterogeneity in transmission from stuttering chain data have important applications in disease surveillance and management. Previous researchers have used chain size distributions to infer R0, but estimation of the degree of individual-level variation in infectiousness (as quantified by the dispersion parameter, k) has typically required contact tracing data. Utilizing branching process theory along with a negative binomial offspring distribution, we demonstrate how maximum likelihood estimation can be applied to chain size data to infer both R0 and the dispersion parameter that characterizes heterogeneity. While the maximum likelihood value for R0 is a simple function of the average chain size, the associated confidence intervals are dependent on the inferred degree of transmission heterogeneity. As demonstrated for monkeypox data from the Democratic Republic of Congo, this impacts when a statistically significant change in R0 is detectable. In addition, by allowing for superspreading events, inference of k shifts the threshold above which a transmission chain should be considered anomalously large for a given value of R0 (thus reducing the probability of false alarms about pathogen adaptation). Our analysis of monkeypox also clarifies the various ways that imperfect observation can impact inference of transmission parameters, and highlights the need to quantitatively evaluate whether observation is likely to significantly bias results.},
+author = {Blumberg, Seth and Lloyd-Smith, James O.},
+doi = {10.1371/journal.pcbi.1002993},
+issn = {15537358},
+journal = {PLoS Computational Biology},
+number = {5},
+pages = {1--17},
+pmid = {23658504},
+title = {{Inference of R0 and Transmission Heterogeneity from the Size Distribution of Stuttering Chains}},
+volume = {9},
+year = {2013}
+}
+@article{Chen2022,
+abstract = {The generation time distribution, reflecting the time between successive infections in transmission chains, is a key epidemiological parameter for describing COVID-19 transmission dynamics. However, because exact infection times are rarely known, it is often approximated by the serial interval distribution. This approximation holds under the assumption that infectors and infectees share the same incubation period distribution, which may not always be true. We estimated incubation period and serial interval distributions using 629 transmission pairs reconstructed by investigating 2989 confirmed cases in China in January-February 2020, and developed an inferential framework to estimate the generation time distribution that accounts for variation over time due to changes in epidemiology, sampling biases and public health and social measures. We identified substantial reductions over time in the serial interval and generation time distributions. Our proposed method provides more reliable estimation of the temporal variation in the generation time distribution, improving assessment of transmission dynamics.},
+author = {Chen, Dongxuan and Lau, Yiu Chung and Xu, Xiao Ke and Wang, Lin and Du, Zhanwei and Tsang, Tim K. and Wu, Peng and Lau, Eric H.Y. and Wallinga, Jacco and Cowling, Benjamin J. and Ali, Sheikh Taslim},
+doi = {10.1038/s41467-022-35496-8},
+issn = {20411723},
+journal = {Nature Communications},
+number = {1},
+publisher = {Springer US},
+title = {{Inferring time-varying generation time, serial interval, and incubation period distributions for COVID-19}},
+volume = {13},
+year = {2022}
+}
+@article{Lehtinen2021,
+abstract = {The timing of transmission plays a key role in the dynamics and controllability of an epidemic. However, observing generation times - the time interval between the infection of an infector and an infectee in a transmission pair - requires data on infection times, which are generally unknown. The timing of symptom onset is more easily observed; generation times are therefore often estimated based on serial intervals - the time interval between symptom onset of an infector and an infectee. This estimation follows one of two approaches: (i) approximating the generation time distribution by the serial interval distribution or (ii) deriving the generation time distribution from the serial interval and incubation period - the time interval between infection and symptom onset in a single individual - distributions. These two approaches make different - and not always explicitly stated - assumptions about the relationship between infectiousness and symptoms, resulting in different generation time distributions with the same mean but unequal variances. Here, we clarify the assumptions that each approach makes and show that neither set of assumptions is plausible for most pathogens. However, the variances of the generation time distribution derived under each assumption can reasonably be considered as upper (approximation with serial interval) and lower (derivation from serial interval) bounds. Thus, we suggest a pragmatic solution is to use both approaches and treat these as edge cases in downstream analysis. We discuss the impact of the variance of the generation time distribution on the controllability of an epidemic through strategies based on contact tracing, and we show that underestimating this variance is likely to overestimate controllability.},
+author = {Lehtinen, Sonja and Ashcroft, Peter and Bonhoeffer, Sebastian},
+doi = {10.1098/rsif.2020.0756},
+issn = {17425662},
+journal = {Journal of the Royal Society Interface},
+keywords = {SARS-CoV-2,contact tracing,epidemiology,generation time,infectiousness,modelling},
+number = {174},
+pmid = {33402022},
+title = {{On the relationship between serial interval, infectiousness profile and generation time: On the relationship between serial interval, infectiousness profile and generation time}},
+volume = {18},
+year = {2021}
+}
+@article{Pearson2020,
+abstract = {For 45 African countries/territories already reporting COVID-19 cases before 23 March 2020, we estimate the dates of reporting 1,000 and 10,000 cases. Assuming early epidemic trends without interventions, all 45 were likely to exceed 1,000 confirmed cases by the end of April 2020, with most exceeding 10,000 a few weeks later.},
+author = {Pearson, Carl A.B. and van Schalkwyk, Cari and Foss, Anna M. and O'Reilly, Kathleen M. and Pulliam, Juliet R.C.},
+doi = {10.2807/1560-7917.ES.2020.25.18.2000543},
+issn = {15607917},
+journal = {Eurosurveillance},
+number = {18},
+pages = {1--6},
+pmid = {32400361},
+publisher = {European Centre for Disease Prevention and Control (ECDC)},
+title = {{Projected early spread of COVID-19 in Africa through 1 June 2020}},
+url = {http://dx.doi.org/10.2807/1560-7917.ES.2020.25.18.2000543},
+volume = {25},
+year = {2020}
+}
+@article{Griffin2020,
+abstract = {The serial interval is the time between symptom onsets in an infector-infectee pair. The generation time, also known as the generation interval, is the time between infection events in an infector-infectee pair. The serial interval and the generation time are key parameters for assessing the dynamics of a disease. A number of scientific papers reported information pertaining to the serial interval and/or generation time for COVID-19. Objective Conduct a review of available evidence to advise on appropriate parameter values for serial interval and generation time in national COVID-19 transmission models for Ireland and on methodological issues relating to those parameters. Methods We conducted a rapid review of the literature covering the period 1 January 2020 and 21 August 2020, following predefined eligibility criteria. Forty scientific papers met our inclusion criteria and were included in the review. Results The mean of the serial interval ranged from 3.03 to 7.6 days, based on 38 estimates, and the median from 1.0 to 6.0 days (based on 15 estimates). Only three estimates were provided for the mean of the generation time. These ranged from 3.95 to 5.20 days. One estimate of 5.0 days was provided for the median of the generation time. Discussion Estimates of the serial interval and the generation time are very dependent on the specific factors that apply at the time that the data are collected, including the level of social contact. Consequently, the estimates may not be entirely relevant to other environments. Therefore, local estimates should be obtained as soon as possible. Careful consideration should be given to the methodology that is used. Real-time estimations of the serial interval/generation time, allowing for variations over time, may provide more accurate estimates of reproduction numbers than using conventionally fixed serial interval/generation time distributions.},
+author = {Griffin, John and Casey, Miriam and Collins, {\'{A}}ine and Hunt, Kevin and McEvoy, David and Byrne, Andrew and McAloon, Conor and Barber, Ann and Lane, Elizabeth Ann and More, Simon},
+doi = {10.1136/bmjopen-2020-040263},
+isbn = {9789241512763},
+issn = {20446055},
+journal = {BMJ Open},
+keywords = {COVID-19,epidemiology,public health,virology},
+number = {11},
+pages = {1--9},
+pmid = {33234640},
+title = {{Rapid review of available evidence on the serial interval and generation time of COVID-19}},
+volume = {10},
+year = {2020}
+}
+@article{Grassly2006a,
+abstract = {Seasonal change in the incidence of infectious diseases is a common phenomenon in both temperate and tropical climates. However, the mechanisms responsible for seasonal disease incidence, and the epidemiological consequences of seasonality, are poorly understood with rare exception. Standard epidemiological theory and concepts such as the basic reproductive number R  0  no longer apply, and the implications for interventions that themselves may be periodic, such as pulse vaccination, have not been formally examined. This paper examines the causes and consequences of seasonality, and in so doing derives several new results concerning vaccination strategy and the interpretation of disease outbreak data. It begins with a brief review of published scientific studies in support of different causes of seasonality in infectious diseases of humans, identifying four principal mechanisms and their association with different routes of transmission. It then describes the consequences of seasonality for R 0 , disease outbreaks, endemic dynamics and persistence. Finally, a mathematical analysis of routine and pulse vaccination programmes for seasonal infections is presented. The synthesis of seasonal infectious disease epidemiology attempted by this paper highlights the need for further empirical and theoretical work. {\textcopyright} 2006 The Royal Society.},
+author = {Grassly, Nicholas C. and Fraser, Christophe},
+doi = {10.1098/rspb.2006.3604},
+issn = {14712970},
+journal = {Proceedings of the Royal Society B: Biological Sciences},
+keywords = {Communicable diseases,Disease outbreaks,Epidemiology,Seasons,Vaccination},
+number = {1600},
+pages = {2541--2550},
+title = {{Seasonal infectious disease epidemiology}},
+volume = {273},
+year = {2006}
+}
+@article{Alene2021,
+abstract = {Background: Understanding the epidemiological parameters that determine the transmission dynamics of COVID-19 is essential for public health intervention. Globally, a number of studies were conducted to estimate the average serial interval and incubation period of COVID-19. Combining findings of existing studies that estimate the average serial interval and incubation period of COVID-19 significantly improves the quality of evidence. Hence, this study aimed to determine the overall average serial interval and incubation period of COVID-19. Methods: We followed the PRISMA checklist to present this study. A comprehensive search strategy was carried out from international electronic databases (Google Scholar, PubMed, Science Direct, Web of Science, CINAHL, and Cochrane Library) by two experienced reviewers (MAA and DBK) authors between the 1st of June and the 31st of July 2020. All observational studies either reporting the serial interval or incubation period in persons diagnosed with COVID-19 were included in this study. Heterogeneity across studies was assessed using the I2 and Higgins test. The NOS adapted for cross-sectional studies was used to evaluate the quality of studies. A random effect Meta-analysis was employed to determine the pooled estimate with 95% (CI). Microsoft Excel was used for data extraction and R software was used for analysis. Results: We combined a total of 23 studies to estimate the overall mean serial interval of COVID-19. The mean serial interval of COVID-19 ranged from 4. 2 to 7.5 days. Our meta-analysis showed that the weighted pooled mean serial interval of COVID-19 was 5.2 (95%CI: 4.9–5.5) days. Additionally, to pool the mean incubation period of COVID-19, we included 14 articles. The mean incubation period of COVID-19 also ranged from 4.8 to 9 days. Accordingly, the weighted pooled mean incubation period of COVID-19 was 6.5 (95%CI: 5.9–7.1) days. Conclusions: This systematic review and meta-analysis showed that the weighted pooled mean serial interval and incubation period of COVID-19 were 5.2, and 6.5 days, respectively. In this study, the average serial interval of COVID-19 is shorter than the average incubation period, which suggests that substantial numbers of COVID-19 cases will be attributed to presymptomatic transmission.},
+author = {Alene, Muluneh and Yismaw, Leltework and Assemie, Moges Agazhe and Ketema, Daniel Bekele and Gietaneh, Wodaje and Birhan, Tilahun Yemanu},
+doi = {10.1186/s12879-021-05950-x},
+issn = {14712334},
+journal = {BMC Infectious Diseases},
+keywords = {COVID-19,Incubation period,Meta-analysis,Serial interval},
+number = {1},
+pages = {1--9},
+pmid = {33706702},
+publisher = {BMC Infectious Diseases},
+title = {{Serial interval and incubation period of COVID-19: a systematic review and meta-analysis}},
+volume = {21},
+year = {2021}
+}
+@article{Farrington1999a,
+abstract = {We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.},
+author = {Farrington, C. P. and Grant, A. D.},
+doi = {10.1239/jap/1032374633},
+issn = {00219002},
+journal = {Journal of Applied Probability},
+keywords = {Branching process,Epidemic model,Extinction,Generation distribution,Maximum likelihood estimation,Power series family},
+number = {3},
+pages = {771--779},
+title = {{The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease}},
+volume = {36},
+year = {1999}
+}
+@article{Farrington1999,
+abstract = {We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.},
+author = {Farrington, C. P. and Grant, A. D.},
+doi = {10.1239/jap/1032374633},
+issn = {00219002},
+journal = {Journal of Applied Probability},
+keywords = {Branching process,Epidemic model,Extinction,Generation distribution,Maximum likelihood estimation,Power series family},
+number = {3},
+pages = {771--779},
+title = {{The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease}},
+volume = {36},
+year = {1999}
+}
+@article{Fine2003,
+abstract = {The interval between successive cases of an infectious disease is determined by the time from infection to infectiousness, the duration of infectiousness, the time from infection to disease onset (incubation period), the duration of any extra-human phase of the infectious agent, and the proportion clinically affected among infected individuals. The interval is important in the interpretation of infectious disease surveillance and trend data, in the identification of outbreaks, and in the optimization of quarantine and contact tracing. This paper discusses the properties of these intervals, as measured between transmission events or between clinical onsets of successive infected individuals, noting the determinants of their ranges and frequency distributions, the circumstances under which secondary cases may arise before primaries, and under which the infection transmission interval will be different from the interval between clinical onsets of successive cases. It discusses the derivation of interval distribution statistics from descriptive data given in standard textbooks, with illustrations from published data on outbreaks, households, and epidemiologic tracing. Finally, it discusses the implications of such measures for studies of secondary attack rates, for the persistence of infection in human communities, for outbreak response, and for elimination or eradication programs.},
+author = {Fine, Paul E.M.},
+doi = {10.1093/aje/kwg251},
+isbn = {0002-9262 (Print) 0002-9262 (Linking)},
+issn = {00029262},
+journal = {American Journal of Epidemiology},
+keywords = {Communicable diseases,Disease outbreaks},
+number = {11},
+pages = {1039--1047},
+pmid = {14630599},
+title = {{The Interval between Successive Cases of an Infectious Disease}},
+volume = {158},
+year = {2003}
+}

From 6cd4c4f9870549fcac954373e435e3fe338fa6ac Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 17 Jan 2023 15:59:43 +0000
Subject: [PATCH 067/828] updated vignette

---
 vignettes/projecting_incidence.Rmd | 18 +++++++++++-------
 1 file changed, 11 insertions(+), 7 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 08c1ea4c..9af16902 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -3,6 +3,7 @@ title: "Projecting future disease incidence given early outbreak data"
 author: "James Azam, Sebastian Funk"
 date: '2023-01-13'
 output: html_document
+bibliography: vignettes/references.bib 
 ---
 
 ```{r setup, include=FALSE}
@@ -16,13 +17,16 @@ library('ggplot2')
 ```
 
 ## Description
-Branching processes can be used to project future disease incidence given early 
-outbreak data. `bpmodels` can simulate branching processes using its `chain_sim()` function. 
+Branching processes can be used to project disease incidence provided we have some 
+information on the distribution of times between successive cases (serial interval), 
+and the distribution of secondary cases produced by a single individual (offspring 
+distribution). Such simulations can be achieved in `bpmodels` with the `chain_sim()` function. 
 
 ## Disease data
 
-Let's create some early outbreak data, assuming the cases have a negative binomial \
-distribution
+Let's create an outbreak dataset, assuming the cases are sampled from a negative binomial
+distribution with mean = 5 and dispersion = 7.5. These parameter values are arbitrarily
+chosen for illustrative purposes.
 
 ```{r data_generation, message=FALSE}
 set.seed(12)
@@ -65,11 +69,11 @@ cases_to_project <- 1000
 sigma = 2.9
 mu = 4.7
 
-si_sd <- sqrt(log(1 + (sigma/mu)^2)) 
-si_mean <- log((mu^2)/(sqrt(sigma^2 + mu^2))) #the desired mean
+si_sd <- sqrt(log(1 + (sigma/mu)^2)) #log standard deviation
+si_mean <- log((mu^2)/(sqrt(sigma^2 + mu^2))) #log mean
 
 #' serial interval function
-serial_interval <- function(sample_size = 1) {
+serial_interval <- function(sample_size) {
   si <- rlnorm(sample_size, meanlog = si_mean, sdlog = si_sd)
   return(si)
 }

From ebcfb54c6573517c6730e71608fdf453fe91bc86 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Wed, 18 Jan 2023 10:29:08 +0000
Subject: [PATCH 068/828] setting up the new vignette

---
 vignettes/.gitignore | 2 ++
 1 file changed, 2 insertions(+)
 create mode 100644 vignettes/.gitignore

diff --git a/vignettes/.gitignore b/vignettes/.gitignore
new file mode 100644
index 00000000..097b2416
--- /dev/null
+++ b/vignettes/.gitignore
@@ -0,0 +1,2 @@
+*.html
+*.R

From 6381a9584d4d6eb0afdce8b2ba9eb391f9adfe1d Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Wed, 18 Jan 2023 15:38:13 +0000
Subject: [PATCH 069/828] updated DESC packages to include vignette
 requirements

---
 DESCRIPTION                        |  5 +++-
 vignettes/projecting_incidence.Rmd | 44 +++++++++++++++++++++++-------
 2 files changed, 38 insertions(+), 11 deletions(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 86bd2ff4..7284e3f5 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -21,7 +21,10 @@ Suggests:
     rmarkdown,
     bookdown,
     testthat,
-    truncdist
+    truncdist,
+    dplyr,
+    ggplot2,
+    lubridate
 VignetteBuilder: 
     knitr
 Encoding: UTF-8
diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 9af16902..9256f35f 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -1,19 +1,30 @@
 ---
 title: "Projecting future disease incidence given early outbreak data"
 author: "James Azam, Sebastian Funk"
-date: '2023-01-13'
-output: html_document
-bibliography: vignettes/references.bib 
+output:
+  bookdown::html_vignette2:
+    fig_caption: yes
+    code_folding: show
+pkgdown:
+  as_is: true
+bibliography: references.bib
+link-citations: true
+vignette: >
+  %\VignetteIndexEntry{Projecting future disease incidence given early outbreak data}
+  %\VignetteEncoding{UTF-8}
+  %\VignetteEngine{knitr::rmarkdown}
+editor_options: 
+  chunk_output_type: console
 ---
 
 ```{r setup, include=FALSE}
-knitr::opts_chunk$set(echo = TRUE, message = FALSE, warning = FALSE)
+knitr::opts_chunk$set(echo = TRUE, 
+                      message = FALSE, 
+                      warning = FALSE, 
+                      collapse = TRUE,
+                      comment = "#>"
+                      )
 
-library("bpmodels")
-library("readr")
-library("lubridate")
-library('dplyr')
-library('ggplot2')
 ```
 
 ## Description
@@ -22,6 +33,15 @@ information on the distribution of times between successive cases (serial interv
 and the distribution of secondary cases produced by a single individual (offspring 
 distribution). Such simulations can be achieved in `bpmodels` with the `chain_sim()` function. 
 
+Let's load the required packages
+
+```{r loading_packages, include=TRUE}
+library("bpmodels")
+library('dplyr')
+library('ggplot2')
+library('lubridate')
+```
+
 ## Disease data
 
 Let's create an outbreak dataset, assuming the cases are sampled from a negative binomial
@@ -34,7 +54,11 @@ cases_df <- data.frame(date = as.Date('2023-01-01') + seq_len(12),
                        cases = rnbinom(12, size = 7.5, mu = 5)
                        )
 head(cases_df)
-ggplot(cases_df, aes(x = date, y = cases)) + geom_col()
+
+ggplot(cases_df, 
+       aes(x = date, y = cases)
+       ) + 
+  geom_col(fill = 'tomato3', size = 1)
 ```
 
 ## Preparing the inputs  

From 6db065e5237f7d8b2119df9ad53faf2525324846 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Thu, 26 Jan 2023 16:35:43 +0000
Subject: [PATCH 070/828] added a bibliography of branching process
 applciations to outbreaks

---
 vignettes/articles/bp_literature.Rmd | 24 ++++++++++++++++++++++++
 1 file changed, 24 insertions(+)
 create mode 100644 vignettes/articles/bp_literature.Rmd

diff --git a/vignettes/articles/bp_literature.Rmd b/vignettes/articles/bp_literature.Rmd
new file mode 100644
index 00000000..780062f3
--- /dev/null
+++ b/vignettes/articles/bp_literature.Rmd
@@ -0,0 +1,24 @@
+---
+title: "Applications of branching process models to outbreak modelling"
+---
+
+```{r, include = FALSE}
+knitr::opts_chunk$set(
+  collapse = TRUE,
+  comment = "#>"
+)
+```
+
+## Single-type models
+
+- Blumberg S, Lloyd-Smith J. Comparing methods for estimating R0 from the size distribution of sub- critical transmission chains. Epidemics. 2013; 5(3):131–45. doi: https://doi.org/10.1016/j.epidem.2013.05.002 PMID: 24021520
+
+- Blumberg S, Lloyd-Smith JO. Inference of R0 and transmission heterogeneity from the size distribution of stuttering chains. PLoS Comput Biol. 2013; 9(5):e1002993. doi: https://doi.org/10.1371/journal.pcbi.1002993 PMID: 23658504
+
+- Farrington C, Kanaan M, Gay N. Branching process models for surveillance of infectious diseases con- trolled by mass vaccination. Biostatistics. 2003; 4(2):279. doi: https://doi.org/10.1093/biostatistics/4.2.279 PMID: 12925522
+
+- Nishiura H, Yan P, Sleeman CK, Mode CJ. Estimating the transmission potential of supercritical pro- cesses based on the final size distribution of minor outbreaks. J Theor Biol. 2012; 294:48–55. doi: https://doi.org/10.1016/j.jtbi.2011.10.039 PMID: 22079419
+
+## Multi-type models
+
+- Kucharski, A. J., & Edmunds, W. J. (2015). Characterizing the Transmission Potential of Zoonotic Infections from Minor Outbreaks. PLoS Computational Biology, 11(4), 1–17. https://doi.org/10.1371/journal.pcbi.1004154

From 195c29f34431c7c7770f3f01e17ddc452532f292 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Thu, 26 Jan 2023 16:40:38 +0000
Subject: [PATCH 071/828] added a vignette on branching process theory

---
 vignettes/articles/bp_theory.Rmd | 19 +++++++++++++++++++
 1 file changed, 19 insertions(+)
 create mode 100644 vignettes/articles/bp_theory.Rmd

diff --git a/vignettes/articles/bp_theory.Rmd b/vignettes/articles/bp_theory.Rmd
new file mode 100644
index 00000000..fc1903a8
--- /dev/null
+++ b/vignettes/articles/bp_theory.Rmd
@@ -0,0 +1,19 @@
+---
+title: "Model and chain likelihood definitions"
+---
+
+```{r, include = FALSE}
+knitr::opts_chunk$set(
+  collapse = TRUE,
+  comment = "#>"
+)
+```
+
+# Branching process model definition
+
+This is a work in progress to document how the single and multi-type models used in this
+package are defined.
+
+# Likelihoods
+This is a work in progress to document the derivation of analytical solutions
+to the likelihoods used here. 
\ No newline at end of file

From 9dd7d0e9826a22c9ee45ddf36481b4fc1781b0ed Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 27 Jan 2023 22:13:36 +0000
Subject: [PATCH 072/828] removed CI files and code of conduct

---
 CODE_OF_CONDUCT.md | 25 -------------------------
 1 file changed, 25 deletions(-)
 delete mode 100644 CODE_OF_CONDUCT.md

diff --git a/CODE_OF_CONDUCT.md b/CODE_OF_CONDUCT.md
deleted file mode 100644
index 24aa0a3c..00000000
--- a/CODE_OF_CONDUCT.md
+++ /dev/null
@@ -1,25 +0,0 @@
-# Contributor Code of Conduct
-
-As contributors and maintainers of this project, we pledge to respect all people who 
-contribute through reporting issues, posting feature requests, updating documentation,
-submitting pull requests or patches, and other activities.
-
-We are committed to making participation in this project a harassment-free experience for
-everyone, regardless of level of experience, gender, gender identity and expression,
-sexual orientation, disability, personal appearance, body size, race, ethnicity, age, or religion.
-
-Examples of unacceptable behavior by participants include the use of sexual language or
-imagery, derogatory comments or personal attacks, trolling, public or private harassment,
-insults, or other unprofessional conduct.
-
-Project maintainers have the right and responsibility to remove, edit, or reject comments,
-commits, code, wiki edits, issues, and other contributions that are not aligned to this 
-Code of Conduct. Project maintainers who do not follow the Code of Conduct may be removed 
-from the project team.
-
-Instances of abusive, harassing, or otherwise unacceptable behavior may be reported by 
-opening an issue or contacting one or more of the project maintainers.
-
-This Code of Conduct is adapted from the Contributor Covenant 
-(http://contributor-covenant.org), version 1.0.0, available at 
-http://contributor-covenant.org/version/1/0/0/

From 86b229ba21a93792f26a41fe2eb79dde30de0d7f Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Mon, 30 Jan 2023 10:41:25 +0000
Subject: [PATCH 073/828] removed draft vignette on branching process theory

---
 vignettes/articles/bp_theory.Rmd | 19 -------------------
 1 file changed, 19 deletions(-)
 delete mode 100644 vignettes/articles/bp_theory.Rmd

diff --git a/vignettes/articles/bp_theory.Rmd b/vignettes/articles/bp_theory.Rmd
deleted file mode 100644
index fc1903a8..00000000
--- a/vignettes/articles/bp_theory.Rmd
+++ /dev/null
@@ -1,19 +0,0 @@
----
-title: "Model and chain likelihood definitions"
----
-
-```{r, include = FALSE}
-knitr::opts_chunk$set(
-  collapse = TRUE,
-  comment = "#>"
-)
-```
-
-# Branching process model definition
-
-This is a work in progress to document how the single and multi-type models used in this
-package are defined.
-
-# Likelihoods
-This is a work in progress to document the derivation of analytical solutions
-to the likelihoods used here. 
\ No newline at end of file

From 3c33e3563af291a1a71df1a84f59588f3b09bd7e Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Mon, 30 Jan 2023 10:48:25 +0000
Subject: [PATCH 074/828] moved introduction vignette to README quick start

---
 README.Rmd | 154 ++++++++++++++++++++++++++++++++++++++++-
 README.md  | 196 +++++++++++++++++++++++++++++++++++++++++++++++++++--
 2 files changed, 343 insertions(+), 7 deletions(-)

diff --git a/README.Rmd b/README.Rmd
index 5405893f..2c055924 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -12,21 +12,169 @@ knitr::opts_chunk$set(
   out.width = "100%"
 )
 ```
+
+# _bpmodels_: Methods for analysing the size and length of chains from branching process models
+
 <!-- badges: start -->
 [![R-CMD-check](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml)
 [![codecov](https://codecov.io/github/epiverse-trace/bpmodels/branch/main/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/bpmodels) 
 <!-- badges: end -->
+
 ```{r setup, include=FALSE}
 knitr::opts_chunk$set(echo = TRUE)
 ```
 
-`bpmodels` is an R package to simulate and analyse the size and length of branching processes with a given offspring distribution.
+`bpmodels` is an R package to simulate and analyse the size and length of 
+branching processes with a given offspring distribution.
 
 # Installation
 The latest development version of the `bpmodels` package can be installed via
 
 ```{r eval=FALSE}
-devtools::install_github('sbfnk/bpmodels')
+devtools::install_github('epiverse-trace/bpmodels')
+```
+
+# Quick start
+
+To load the package, use
+
+```{r echo=FALSE}
+suppressWarnings(library('bpmodels'))
+```
+
+At the heart of the package are the `chains_ll()` and `chains_sim()` functions. 
+
+## Calculating log-likelihoods
+
+The `chains_ll()` function calculates the log-likelihood of a distribution of 
+chain sizes or lengths given an offspring distribution and its associated 
+parameters. 
+
+If we have observed a distribution of chains of sizes $1, 1, 4, 7$, we can 
+calculate the log-likelihood of this observed chain by assuming the offspring 
+per generation is Poisson distributed with a mean number of $0.5$. 
+
+To do this, we run 
+
+```{r}
+set.seed(13)
+chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
+chain_ll(x = chain_sizes, offspring = "pois", stat = "size", lambda = 0.5)
 ```
 
-Please note that the 'bpmodels' project is released with a [Contributor Code of Conduct](CODE_OF_CONDUCT.md). By contributing to this project, you agree to abide by its terms.
+The first argument of `chain_ll()` is the size (or length) distribution to 
+analyse. The second argument, `offspring`, specifies the offspring 
+distribution. This is given as a function used to generate random offspring. 
+It can be any probability distribution implemented in `R`, that is, one that 
+has a corresponding function for generating random numbers beginning with the 
+letter `r`. In the case of the example above, since random Poisson numbers are 
+generated in `R` using a function called `rpois()`, the string to pass to the 
+`offspring` argument is `"pois"`.
+
+The third argument, `stat`, determines whether to analyse chain sizes 
+(`"size"`, the default if this argument is not specified) or lengths 
+(`"length"`). Lastly, any named arguments not recognised by `chain_ll()` 
+are interpreted as parameters of the corresponding probability distribution, 
+here `lambda = 0.5` as the mean of the Poisson distribution (see the `R` help 
+page for the [Poisson distribution](https://stat.ethz.ch/R-manual/R-devel/library/stats/html/Poisson.html) for more information).
+
+# Imperfect observations
+
+By default, `chain_ll` assumes perfect observation, where `obs_prob = 1` 
+(See `?chain_ll`). If observations are imperfect, the `chain_ll()` function has 
+an `obs_prob` argument that can be used to determine the likelihood. In that 
+case, true chain sizes or lengths are simulated repeatedly (the number of times 
+given by the `nsim_obs` argument), and the likelihood calculated for each of 
+these simulations. 
+
+For example, if the probability of observing each case is $30%$, we use
+
+```{r}
+chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
+ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5, 
+               nsim_obs = 10)
+summary(ll)
+```
+
+This returns `10` likelihood values (because `nsim_obs = 10`), which can be 
+averaged to come up with an overall likelihood estimate.
+
+To find out about usage of the `chains_ll()` function, you can use the `R` help 
+file
+
+```{r eval=FALSE}
+?chains_ll
+```
+
+## Simulating branching processes
+
+To simulate a branching process, we use the `chain_sim()` function. This function 
+follows the same syntax as `chain_ll()`.
+
+Below, we are simulating $5$ chains, assuming the offspring are generated using
+a Poisson distribution with mean, `lambda = 5`. By default, `chain_sim()` returns
+a vector of chain sizes/lengths. However, to override that so that a tree of
+infectees and infectors is returned, we need to specify a function for the serial 
+interval and set `tree = TRUE`
+
+```{r}
+chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5)
+```
+
+### Simulating trees
+To simulate a tree of branching processes, we do specify the serial interval 
+generation function and set `tree = TRUE` as follows:
+
+```{r}
+set.seed(13)
+
+serial_interval <- function(n){rlnorm(n, meanlog = 0.58, sdlog = 1.58)}
+
+chains_df <- chain_sim(n = 5, offspring = 'pois', lambda = 0.5, stat = 'length', 
+                       infinite = 100, serial = serial_interval)
+
+chains_df
+```
+
+
+# Methodology
+
+If the probability distribution of chain sizes or lengths has an analytical 
+solution, this will be used (size distribution: Poisson and negative binomial; 
+length distribution: Poisson and geometric). 
+
+If an analytical solution does not exist, simulations are used to approximate 
+this probability distributions (using a linear approximation to the cumulative 
+distribution for unobserved sizes/lengths). The argument `nsim_offspring` is 
+used to specify the number of simulations to be used for this approximation. 
+
+For example, to get offspring drawn from a binomial distribution with 
+probability `prob = 0.5`, we run
+
+```{r}
+chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5, nsim_offspring = 100)
+```
+
+## Package vignettes
+
+Specific use cases of _bpmodels_ can be found in the [online documentation as package vignettes](https://epiverse-trace.github.io/bpmodels/), under "Articles".
+
+## Reporting bugs 
+
+To report a bug please open an [issue](https://github.com/epiverse-trace/bpmodels/issues/new/choose).
+
+## Contribute
+
+We welcome contributions to enhance the package's functionalities. If you wish to
+do so, please follow the [package contributing guide](https://github.com/epiverse-trace/.github/blob/main/CONTRIBUTING.md).
+
+## Code of conduct
+
+Please note that the _bpmodels_ project is released with a [Contributor Code of Conduct](https://github.com/epiverse-trace/.github/blob/main/CODE_OF_CONDUCT.md). 
+By contributing to this project, you agree to abide by its terms.
+
+## Citing this package
+
+```{r message=FALSE, warning=FALSE}
+citation("bpmodels")
+```
\ No newline at end of file
diff --git a/README.md b/README.md
index b1cbaa53..b2dccbf9 100644
--- a/README.md
+++ b/README.md
@@ -1,4 +1,6 @@
 
+# *bpmodels*: Methods for analysing the size and length of chains from branching process models
+
 <!-- badges: start -->
 
 [![R-CMD-check](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml)
@@ -14,9 +16,195 @@ The latest development version of the `bpmodels` package can be
 installed via
 
 ``` r
-devtools::install_github('sbfnk/bpmodels')
+devtools::install_github('epiverse-trace/bpmodels')
+```
+
+# Quick start
+
+To load the package, use
+
+At the heart of the package are the `chains_ll()` and `chains_sim()`
+functions.
+
+## Calculating log-likelihoods
+
+The `chains_ll()` function calculates the log-likelihood of a
+distribution of chain sizes or lengths given an offspring distribution
+and its associated parameters.
+
+If we have observed a distribution of chains of sizes $1, 1, 4, 7$, we
+can calculate the log-likelihood of this observed chain by assuming the
+offspring per generation is Poisson distributed with a mean number of
+$0.5$.
+
+To do this, we run
+
+``` r
+set.seed(13)
+chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
+chain_ll(x = chain_sizes, offspring = "pois", stat = "size", lambda = 0.5)
+#> [1] -8.607
+```
+
+The first argument of `chain_ll()` is the size (or length) distribution
+to analyse. The second argument, `offspring`, specifies the offspring
+distribution. This is given as a function used to generate random
+offspring. It can be any probability distribution implemented in `R`,
+that is, one that has a corresponding function for generating random
+numbers beginning with the letter `r`. In the case of the example above,
+since random Poisson numbers are generated in `R` using a function
+called `rpois()`, the string to pass to the `offspring` argument is
+`"pois"`.
+
+The third argument, `stat`, determines whether to analyse chain sizes
+(`"size"`, the default if this argument is not specified) or lengths
+(`"length"`). Lastly, any named arguments not recognised by `chain_ll()`
+are interpreted as parameters of the corresponding probability
+distribution, here `lambda = 0.5` as the mean of the Poisson
+distribution (see the `R` help page for the [Poisson
+distribution](https://stat.ethz.ch/R-manual/R-devel/library/stats/html/Poisson.html)
+for more information).
+
+# Imperfect observations
+
+By default, `chain_ll` assumes perfect observation, where `obs_prob = 1`
+(See `?chain_ll`). If observations are imperfect, the `chain_ll()`
+function has an `obs_prob` argument that can be used to determine the
+likelihood. In that case, true chain sizes or lengths are simulated
+repeatedly (the number of times given by the `nsim_obs` argument), and
+the likelihood calculated for each of these simulations.
+
+For example, if the probability of observing each case is $30%$, we use
+
+``` r
+chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
+ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5, 
+               nsim_obs = 10)
+summary(ll)
+#>    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
+#>   -32.1   -26.5   -24.1   -24.9   -22.5   -19.1
+```
+
+This returns `10` likelihood values (because `nsim_obs = 10`), which can
+be averaged to come up with an overall likelihood estimate.
+
+To find out about usage of the `chains_ll()` function, you can use the
+`R` help file
+
+``` r
+?chains_ll
+```
+
+## Simulating branching processes
+
+To simulate a branching process, we use the `chain_sim()` function. This
+function follows the same syntax as `chain_ll()`.
+
+Below, we are simulating $5$ chains, assuming the offspring are
+generated using a Poisson distribution with mean, `lambda = 5`. By
+default, `chain_sim()` returns a vector of chain sizes/lengths. However,
+to override that so that a tree of infectees and infectors is returned,
+we need to specify a function for the serial interval and set
+`tree = TRUE`
+
+``` r
+chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5)
+#> [1] 5 1 1 1 1
+```
+
+### Simulating trees
+
+To simulate a tree of branching processes, we do specify the serial
+interval generation function and set `tree = TRUE` as follows:
+
+``` r
+set.seed(13)
+
+serial_interval <- function(n){rlnorm(n, meanlog = 0.58, sdlog = 1.58)}
+
+chains_df <- chain_sim(n = 5, offspring = 'pois', lambda = 0.5, stat = 'length', 
+                       infinite = 100, serial = serial_interval)
+
+chains_df
+#>    n id ancestor generation     time
+#> 1  1  1       NA          1  0.00000
+#> 2  2  1       NA          1  0.00000
+#> 3  3  1       NA          1  0.00000
+#> 4  4  1       NA          1  0.00000
+#> 5  5  1       NA          1  0.00000
+#> 6  1  2        1          2  0.04772
+#> 7  5  2        1          2  5.57573
+#> 8  5  3        1          2  0.11454
+#> 9  1  3        2          3  2.64367
+#> 10 5  4        2          3  6.57843
+#> 11 1  4        3          4  2.96098
+#> 12 5  5        4          4 10.28370
+#> 13 5  6        5          5 10.37883
+```
+
+# Methodology
+
+If the probability distribution of chain sizes or lengths has an
+analytical solution, this will be used (size distribution: Poisson and
+negative binomial; length distribution: Poisson and geometric).
+
+If an analytical solution does not exist, simulations are used to
+approximate this probability distributions (using a linear approximation
+to the cumulative distribution for unobserved sizes/lengths). The
+argument `nsim_offspring` is used to specify the number of simulations
+to be used for this approximation.
+
+For example, to get offspring drawn from a binomial distribution with
+probability `prob = 0.5`, we run
+
+``` r
+chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5, nsim_offspring = 100)
+#> [1] -8.761
 ```
 
-Please note that the ‘bpmodels’ project is released with a [Contributor
-Code of Conduct](CODE_OF_CONDUCT.md). By contributing to this project,
-you agree to abide by its terms.
+## Package vignettes
+
+Specific use cases of *bpmodels* can be found in the [online
+documentation as package
+vignettes](https://epiverse-trace.github.io/bpmodels/), under
+“Articles”.
+
+## Reporting bugs
+
+To report a bug please open an
+[issue](https://github.com/epiverse-trace/bpmodels/issues/new/choose).
+
+## Contribute
+
+We welcome contributions to enhance the package’s functionalities. If
+you wish to do so, please follow the [package contributing
+guide](https://github.com/epiverse-trace/.github/blob/main/CONTRIBUTING.md).
+
+## Code of conduct
+
+Please note that the *bpmodels* project is released with a [Contributor
+Code of
+Conduct](https://github.com/epiverse-trace/.github/blob/main/CODE_OF_CONDUCT.md).
+By contributing to this project, you agree to abide by its terms.
+
+## Citing this package
+
+``` r
+citation("bpmodels")
+#> 
+#> To cite package 'bpmodels' in publications use:
+#> 
+#>   Funk S, Finger F (2023). _bpmodels: Analysing chain statistics using
+#>   branching process models_. R package version 0.1.0,
+#>   <https://github.com/sbfnk/bpmodels>.
+#> 
+#> A BibTeX entry for LaTeX users is
+#> 
+#>   @Manual{,
+#>     title = {bpmodels: Analysing chain statistics using branching process models},
+#>     author = {Sebastian Funk and Flavio Finger},
+#>     year = {2023},
+#>     note = {R package version 0.1.0},
+#>     url = {https://github.com/sbfnk/bpmodels},
+#>   }
+```

From 67644e860cc4ead304ff486f0c9ba517e5323345 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Wed, 1 Feb 2023 12:29:42 +0000
Subject: [PATCH 075/828] updated README

---
 README.Rmd |  7 +++++++
 README.md  | 49 ++++++++++++++++++++++++++++++-------------------
 2 files changed, 37 insertions(+), 19 deletions(-)

diff --git a/README.Rmd b/README.Rmd
index 2c055924..92e69135 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -16,8 +16,15 @@ knitr::opts_chunk$set(
 # _bpmodels_: Methods for analysing the size and length of chains from branching process models
 
 <!-- badges: start -->
+![CRAN/METACRAN](https://img.shields.io/cran/v/bpmodels)
+![GitHub R package version](https://img.shields.io/github/r-package/v/epiverse-trace/bpmodels)
+![GitHub all releases](https://img.shields.io/github/downloads/epiverse-trace/bpmodels/total?style=flat)
+![GitHub issues](https://img.shields.io/github/issues/epiverse-trace/bpmodels)
 [![R-CMD-check](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml)
 [![codecov](https://codecov.io/github/epiverse-trace/bpmodels/branch/main/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/bpmodels) 
+![GitHub contributors](https://img.shields.io/github/contributors/epiverse-trace/bpmodels)
+![GitHub commit activity](https://img.shields.io/github/commit-activity/m/epiverse-trace/bpmodels)
+![GitHub](https://img.shields.io/github/license/epiverse-trace/bpmodels)
 <!-- badges: end -->
 
 ```{r setup, include=FALSE}
diff --git a/README.md b/README.md
index b2dccbf9..d0f94754 100644
--- a/README.md
+++ b/README.md
@@ -3,8 +3,20 @@
 
 <!-- badges: start -->
 
+![CRAN/METACRAN](https://img.shields.io/cran/v/bpmodels) ![GitHub R
+package
+version](https://img.shields.io/github/r-package/v/epiverse-trace/bpmodels)
+![GitHub all
+releases](https://img.shields.io/github/downloads/epiverse-trace/bpmodels/total?style=flat)
+![GitHub
+issues](https://img.shields.io/github/issues/epiverse-trace/bpmodels)
 [![R-CMD-check](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml)
 [![codecov](https://codecov.io/github/epiverse-trace/bpmodels/branch/main/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/bpmodels)
+![GitHub
+contributors](https://img.shields.io/github/contributors/epiverse-trace/bpmodels)
+![GitHub commit
+activity](https://img.shields.io/github/commit-activity/m/epiverse-trace/bpmodels)
+![GitHub](https://img.shields.io/github/license/epiverse-trace/bpmodels)
 <!-- badges: end -->
 
 `bpmodels` is an R package to simulate and analyse the size and length
@@ -43,7 +55,7 @@ To do this, we run
 set.seed(13)
 chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
 chain_ll(x = chain_sizes, offspring = "pois", stat = "size", lambda = 0.5)
-#> [1] -8.607
+#> [1] -8.607196
 ```
 
 The first argument of `chain_ll()` is the size (or length) distribution
@@ -82,7 +94,7 @@ ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5,
                nsim_obs = 10)
 summary(ll)
 #>    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
-#>   -32.1   -26.5   -24.1   -24.9   -22.5   -19.1
+#>  -32.09  -26.52  -24.06  -24.94  -22.49  -19.14
 ```
 
 This returns `10` likelihood values (because `nsim_obs = 10`), which can
@@ -126,20 +138,20 @@ chains_df <- chain_sim(n = 5, offspring = 'pois', lambda = 0.5, stat = 'length',
                        infinite = 100, serial = serial_interval)
 
 chains_df
-#>    n id ancestor generation     time
-#> 1  1  1       NA          1  0.00000
-#> 2  2  1       NA          1  0.00000
-#> 3  3  1       NA          1  0.00000
-#> 4  4  1       NA          1  0.00000
-#> 5  5  1       NA          1  0.00000
-#> 6  1  2        1          2  0.04772
-#> 7  5  2        1          2  5.57573
-#> 8  5  3        1          2  0.11454
-#> 9  1  3        2          3  2.64367
-#> 10 5  4        2          3  6.57843
-#> 11 1  4        3          4  2.96098
-#> 12 5  5        4          4 10.28370
-#> 13 5  6        5          5 10.37883
+#>    n id ancestor generation        time
+#> 1  1  1       NA          1  0.00000000
+#> 2  2  1       NA          1  0.00000000
+#> 3  3  1       NA          1  0.00000000
+#> 4  4  1       NA          1  0.00000000
+#> 5  5  1       NA          1  0.00000000
+#> 6  1  2        1          2  0.04771887
+#> 7  5  2        1          2  5.57573333
+#> 8  5  3        1          2  0.11454421
+#> 9  1  3        2          3  2.64367236
+#> 10 5  4        2          3  6.57843219
+#> 11 1  4        3          4  2.96098160
+#> 12 5  5        4          4 10.28370183
+#> 13 5  6        5          5 10.37883069
 ```
 
 # Methodology
@@ -159,7 +171,7 @@ probability `prob = 0.5`, we run
 
 ``` r
 chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5, nsim_offspring = 100)
-#> [1] -8.761
+#> [1] -8.760539
 ```
 
 ## Package vignettes
@@ -194,7 +206,7 @@ citation("bpmodels")
 #> 
 #> To cite package 'bpmodels' in publications use:
 #> 
-#>   Funk S, Finger F (2023). _bpmodels: Analysing chain statistics using
+#>   Funk S, Finger F (????). _bpmodels: Analysing chain statistics using
 #>   branching process models_. R package version 0.1.0,
 #>   <https://github.com/sbfnk/bpmodels>.
 #> 
@@ -203,7 +215,6 @@ citation("bpmodels")
 #>   @Manual{,
 #>     title = {bpmodels: Analysing chain statistics using branching process models},
 #>     author = {Sebastian Funk and Flavio Finger},
-#>     year = {2023},
 #>     note = {R package version 0.1.0},
 #>     url = {https://github.com/sbfnk/bpmodels},
 #>   }

From f923b96f8f7e8617d94932950444cb70bf3105ab Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 3 Feb 2023 21:37:31 +0000
Subject: [PATCH 076/828] added more dependencies

---
 DESCRIPTION | 12 ++++++++----
 1 file changed, 8 insertions(+), 4 deletions(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 7284e3f5..0c6510f2 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -15,18 +15,22 @@ Description: Provides methods to analyse and simulate the size and length
 License: MIT + file LICENSE
 URL: https://github.com/sbfnk/bpmodels
 BugReports: https://github.com/sbfnk/bpmodels/issues
+Depends: 
+    R (>= 2.10)
 Suggests: 
+    bookdown,
     covr,
+    dplyr,
+    ggplot2,
     knitr,
+    lubridate,
     rmarkdown,
     bookdown,
     testthat,
-    truncdist,
-    dplyr,
-    ggplot2,
-    lubridate
+    truncdist
 VignetteBuilder: 
     knitr
 Encoding: UTF-8
+LazyData: true
 Roxygen: list(markdown = TRUE)
 RoxygenNote: 7.2.3

From a5881a028410f75bda970abf7c67ff507cadc7fc Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 3 Feb 2023 21:38:09 +0000
Subject: [PATCH 077/828] added more references to bib lib

---
 vignettes/references.bib | 16 ++++++++++++++++
 1 file changed, 16 insertions(+)

diff --git a/vignettes/references.bib b/vignettes/references.bib
index c19f323b..ae7dc255 100644
--- a/vignettes/references.bib
+++ b/vignettes/references.bib
@@ -1,3 +1,19 @@
+@article{abbott2020,
+  title={The transmissibility of novel Coronavirus in the early stages of the 2019-20 outbreak in Wuhan: Exploring initial point-source exposure sizes and durations using scenario analysis},
+  author={Abbott, Sam and Hellewell, Joel and Munday, James and Funk, Sebastian and CMMID nCoV working group and others},
+  journal={Wellcome open research},
+  volume={5},
+  year={2020},
+  publisher={The Wellcome Trust}
+}
+
+@article{marivate2020,
+  title={Use of available data to inform the COVID-19 outbreak in South Africa: a case study},
+  author={Marivate, Vukosi and Combrink, Herkulaas MvE},
+  journal={arXiv preprint arXiv:2004.04813},
+  year={2020}
+}
+
 @article{Farrington2003,
 abstract = {Mass vaccination programmes aim to maintain the effective reproduction number R of an infection below unity. We describe methods for monitoring the value of R using surveillance data. The models are based on branching processes in which R is identified with the offspring mean. We derive unconditional likelihoods for the offspring mean using data on outbreak size and outbreak duration. We also discuss Bayesian methods, implemented by Metropolis-Hastings sampling. We investigate by simulation the validity of the models with respect to depletion of susceptibles and under-ascertainment of cases. The methods are illustrated using surveillance data on measles in the USA.},
 author = {Farrington, C. P. and Kanaan, M. N. and Gay, N. J.},

From dae4dea54b2edbaa31536cba658b2484e64fdfb6 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 3 Feb 2023 21:40:20 +0000
Subject: [PATCH 078/828] added the covid-19 SA data and associated doc

---
 R/data.R              |  19 +++++++++++++++++++
 data-raw/covid19_sa.R |  17 +++++++++++++++++
 data/covid19_sa.rda   | Bin 0 -> 227 bytes
 man/covid19_sa.Rd     |  34 ++++++++++++++++++++++++++++++++++
 4 files changed, 70 insertions(+)
 create mode 100644 R/data.R
 create mode 100644 data-raw/covid19_sa.R
 create mode 100644 data/covid19_sa.rda
 create mode 100644 man/covid19_sa.Rd

diff --git a/R/data.R b/R/data.R
new file mode 100644
index 00000000..e134c1c3
--- /dev/null
+++ b/R/data.R
@@ -0,0 +1,19 @@
+#' COVID-19 Data Repository for South Africa
+#'
+#' An aggregated subset of the COVID-19 Data Repository for South Africa created, 
+#' maintained and hosted by Data Science for Social Impact research group, 
+#' led by Dr. Vukosi Marivate ...
+#' 
+#' The data is originally provided as a linelist but has been subsetted and 
+#' cleaned in `data-raw/covid19_sa.R`.
+#'
+#' @format ## `covid19_sa`
+#' A data frame with 19 rows and 2 columns:
+#' \describe{
+#'   \item{date}{Date case was reported}
+#'   \item{cases}{Number of cases}
+#'   ...
+#' }
+#' @source <https://github.com/dsfsi/covid19za>
+#' Further details in `data-raw/covid19_sa.R`.
+"covid19_sa"
\ No newline at end of file
diff --git a/data-raw/covid19_sa.R b/data-raw/covid19_sa.R
new file mode 100644
index 00000000..40afe3be
--- /dev/null
+++ b/data-raw/covid19_sa.R
@@ -0,0 +1,17 @@
+## code to prepare `covid_sa` dataset
+
+data_url <- 'https://raw.githubusercontent.com/dsfsi/covid19za/master/data/covid19za_timeline_confirmed.csv'
+
+#Read the data in using the url
+covid19_sa <- read.csv(data_url)
+
+#Clean and subset the data we need
+covid19_sa <- covid19_sa %>% 
+  dplyr::select(date) %>% 
+  dplyr::mutate(date = lubridate::dmy(date)) %>%
+  dplyr::filter(date <= lubridate::dmy('20-03-2020')) %>%   
+  dplyr::group_by(date) %>% 
+  dplyr::summarise(cases = n()) %>%   
+  dplyr::ungroup()
+
+usethis::use_data(covid19_sa, overwrite = TRUE)
diff --git a/data/covid19_sa.rda b/data/covid19_sa.rda
new file mode 100644
index 0000000000000000000000000000000000000000..43295c0cf928039ee39e9cd1d6c7adea1c3b70e1
GIT binary patch
literal 227
zcmV<903829T4*^jL0KkKS$q4essI3sf5ZRtNB{r<Fd#$#5J0~toq#|9AOHh^03omd
zwqVgFf@t!ZG-;C&=+tSLgc_%*<xfo~qIzVS7$Z!WnKWo5N{m1m02@#SfDe4gzHDd%
z6dbS+Jmxnw9KKM%8cKQ`8U7)=0KC$xX)0Lv!TN~Q(m^vp(wkF3g_ceeF%<}66GMhW
z2w)9}Tz5cF)4}q|_)1c{iH)^=?{8Mw2s~*eGaQ2<JB?aK^DwEMtOD*NsxFvij}r^v
dOmYUJb~En8TaiB!N&-KNxgwk>NIm`5RRBkATR{K-

literal 0
HcmV?d00001

diff --git a/man/covid19_sa.Rd b/man/covid19_sa.Rd
new file mode 100644
index 00000000..f772efb3
--- /dev/null
+++ b/man/covid19_sa.Rd
@@ -0,0 +1,34 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/data.R
+\docType{data}
+\name{covid19_sa}
+\alias{covid19_sa}
+\title{COVID-19 Data Repository for South Africa}
+\format{
+\subsection{\code{covid19_sa}}{
+
+A data frame with 19 rows and 2 columns:
+\describe{
+\item{date}{Date case was reported}
+\item{cases}{Number of cases}
+...
+}
+}
+}
+\source{
+\url{https://github.com/dsfsi/covid19za}
+Further details in \code{data-raw/covid19_sa.R}.
+}
+\usage{
+covid19_sa
+}
+\description{
+An aggregated subset of the COVID-19 Data Repository for South Africa created,
+maintained and hosted by Data Science for Social Impact research group,
+led by Dr. Vukosi Marivate ...
+}
+\details{
+The data is originally provided as a linelist but has been subsetted and
+cleaned in \code{data-raw/covid19_sa.R}.
+}
+\keyword{datasets}

From 5ab97bf7bbe02df900d50143191f4a50200073e9 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 3 Feb 2023 21:42:44 +0000
Subject: [PATCH 079/828] changed the vignette title

---
 vignettes/projecting_incidence.Rmd | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 9256f35f..3cde3865 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -1,5 +1,5 @@
 ---
-title: "Projecting future disease incidence given early outbreak data"
+title: "Projecting COVID-19 incidence using early outbreak data"
 author: "James Azam, Sebastian Funk"
 output:
   bookdown::html_vignette2:
@@ -10,7 +10,7 @@ pkgdown:
 bibliography: references.bib
 link-citations: true
 vignette: >
-  %\VignetteIndexEntry{Projecting future disease incidence given early outbreak data}
+  %\VignetteIndexEntry{Projecting COVID-19 incidence using early outbreak data}
   %\VignetteEncoding{UTF-8}
   %\VignetteEngine{knitr::rmarkdown}
 editor_options: 

From 5c37fc92452140aa95526171bdea6b927154b73b Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 3 Feb 2023 21:44:00 +0000
Subject: [PATCH 080/828] updated section on specifying serial interval

---
 vignettes/projecting_incidence.Rmd | 109 +++++++++++++++++------------
 1 file changed, 65 insertions(+), 44 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 3cde3865..b25d09ca 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -22,79 +22,100 @@ knitr::opts_chunk$set(echo = TRUE,
                       message = FALSE, 
                       warning = FALSE, 
                       collapse = TRUE,
-                      comment = "#>"
+                      comment = "#>",
+                      dpi = 300
                       )
 
 ```
 
-## Description
-Branching processes can be used to project disease incidence provided we have some 
-information on the distribution of times between successive cases (serial interval), 
-and the distribution of secondary cases produced by a single individual (offspring 
-distribution). Such simulations can be achieved in `bpmodels` with the `chain_sim()` function. 
+## Overview
+Branching processes can be used to project infectious disease trends provided 
+we have some information on the distribution of times between 
+successive cases (serial interval), and the distribution of secondary cases 
+produced by a single individual (offspring distribution). Such simulations can be achieved in `bpmodels` with the `chain_sim()` function. @Pearson2020, and 
+@abbott2020 illustrate its application to COVID-19. 
+
+The purpose of this vignette is to use early data on COVID-19 in South Africa [@marivate2020] to illustrate how `bpmodels` can be used to forecast 
+an outbreak. 
+
 
 Let's load the required packages
 
-```{r loading_packages, include=TRUE}
+```{r packages, include=TRUE}
 library("bpmodels")
 library('dplyr')
 library('ggplot2')
 library('lubridate')
 ```
 
-## Disease data
+### The data
+
+We will get and clean the first $15$ days of the COVID-19 
+outbreak in South Africa to seed the simulation for this example.
 
-Let's create an outbreak dataset, assuming the cases are sampled from a negative binomial
-distribution with mean = 5 and dispersion = 7.5. These parameter values are arbitrarily
-chosen for illustrative purposes.
+```{r data, message=FALSE}
+data_url <- 'https://raw.githubusercontent.com/dsfsi/covid19za/master/data/covid19za_timeline_confirmed.csv'
 
-```{r data_generation, message=FALSE}
-set.seed(12)
-cases_df <- data.frame(date = as.Date('2023-01-01') + seq_len(12),
-                       cases = rnbinom(12, size = 7.5, mu = 5)
-                       )
-head(cases_df)
+#Read the data in using the url
+covid19_sa <- read.csv(data_url)
 
-ggplot(cases_df, 
-       aes(x = date, y = cases)
-       ) + 
-  geom_col(fill = 'tomato3', size = 1)
+# Subset the first 15 days and count the number of cases per date
+covid19_sa <- covid19_sa %>%
+  dplyr::select(date) %>%
+  dplyr::mutate(date = lubridate::dmy(date)) %>%
+  dplyr::filter(date <= lubridate::dmy("20-03-2020")) %>%
+  dplyr::group_by(date) %>%
+  dplyr::summarise(cases = n()) %>%
+  dplyr::ungroup()
 ```
 
-## Preparing the inputs  
+### Preparing the inputs  
 
-```{r input_prep, message=FALSE}
-# We will create a vector of starting times for each case, using the time of the index cases as the reference point
-cases_df$days_since_index <- as.integer(cases_df$date - min(cases_df$date))
+```{r linelist_gen, message=FALSE}
+days_since_index <- as.integer(covid19_sa$date - min(covid19_sa$date))
 
-#'Disaggregate the time series 
-case_times <- unlist(mapply(function(x, y) rep(x, times = ifelse(y == 0, 1, y)), 
-                       cases_df$days_since_index, 
-                       cases_df$cases
-                       )
-                       )
+start_times <- unlist(mapply(
+  function(x, y) rep(x, times = ifelse(y == 0, 1, y)),
+  days_since_index,
+  covid19_sa$cases
+))
                        
+```
+
 
+Additionally, `chain_sim()` requires other inputs, which we will specify below: 
 
+```{r input_prep2, message=FALSE}
 #' Date to end simulation (14 day projection in this case)
-projection_window <- 14 #2 week ahead projection
-project_to_date <- max(cases_df$days_since_index) + projection_window 
+projection_window <- 14 # 14 days/ 2-week ahead projection
+
+projection_end_day <- max(days_since_index) + projection_window
+
+#' Number of simulations
+sim_rep <- 100
+
+#' Maximum chain size allowed
+chain_threshold <- 1000
+
+```
+
+#### Serial interval
+
+We also assume based on COVID-19 literature that the 
+serial interval, $si$, is lognormal distributed as follows:
 
+$ E[\text{si}] = \ln \left( \dfrac{\mu^2}{(\sqrt{\mu^2 + \sigma^2}} \right)$
 
-#' Number of simulations and maximum chain size
-sim_rep <- 1000
-cases_to_project <- 1000
+$\text{SD} [\text{si}] = \sqrt {\ln \left(1 + \dfrac{\sigma^2}{\mu^2} \right)}$
 
+with $\mu = 4.7$ and standard deviation $\sigma = 2.9$.
 
-### Specifying the `serial` argument to `chain_sim()`
-#' Assume serial interval follows log-normal distribution with mean, mu = 4.7, 
-#' and standard deviation, sigma = 2.9, then the desired standard deviation, si_sd, 
-#' and mean, si_mean, are
-sigma = 2.9
-mu = 4.7
+```{r input_prep3, message=FALSE}
+mu <- 4.7
+sigma <- 2.9
 
-si_sd <- sqrt(log(1 + (sigma/mu)^2)) #log standard deviation
-si_mean <- log((mu^2)/(sqrt(sigma^2 + mu^2))) #log mean
+si_sd <- sqrt(log(1 + (sigma / mu)^2)) # log standard deviation
+si_mean <- log((mu^2) / (sqrt(sigma^2 + mu^2))) # log mean
 
 #' serial interval function
 serial_interval <- function(sample_size) {

From 0bc64c48529d502c0e8cd192f9473a1798d930f3 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 3 Feb 2023 21:45:39 +0000
Subject: [PATCH 081/828] changed x- & y-axis, theme label customizations

---
 vignettes/projecting_incidence.Rmd | 142 ++++++++++++++++++++---------
 1 file changed, 100 insertions(+), 42 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index b25d09ca..e62f695d 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -124,65 +124,123 @@ serial_interval <- function(sample_size) {
 }
 ```
 
-## Simulations
+#### Offspring distribution
+
+We assume an offspring distribution that is distributed as a negative binomial with $R = 2.5$ [@abbott2020] and $k = 0.58$. In this parameterization, R represents the $\mathcal{R_0}$, which is defined as the average number of cases produced by a single individual in an entirely susceptible population. The parameter $k$ represents superspreading, that is, the degree of heterogeneity in transmission by single individuals.
+
+### Simulations
+To summarize the simulation set up, for each of the `r sim_rep` simulations, we want to project cases over a `r projection_window` day period since the last case, assuming that no chain would exceed `r chain_threshold`. 
+
+#### Model assumptions
+
+`chain_sim()` makes the following simplifying assumptions:
+
+1. All cases are observed
+1. There is no reporting delay
+1. Reporting rate is constant through the course of the epidemic
+1. No interventions have been implemented
+1. Population is homogeneous and well-mixed
+
 ```{r simulations, message=FALSE}
-## Chain log-likelihood simulation
-sim_chain_sizes <- lapply(seq_len(sim_rep),
-                           function(sim){chain_sim(
-                               n = length(case_times),
-                               offspring = "nbinom",
-                               mu = 2.0,
-                               size = 0.38,
-                               stat = "size",
-                               infinite = cases_to_project,
-                               serial = serial_interval,
-                               t0 = case_times,
-                               tf = project_to_date,
-                               tree = TRUE
-                           ) |> 
-                               mutate(sim = sim)} 
-                          )
-
-sim_output <- do.call(rbind, sim_chain_sizes) 
+set.seed(1234)
+
+
+sim_chain_sizes <- lapply(
+  seq_len(sim_rep),
+  function(sim) {
+    chain_sim(
+      n = length(start_times),
+      offspring = "nbinom",
+      mu = 2.5,
+      size = 0.58,
+      stat = "size",
+      infinite = chain_threshold,
+      serial = serial_interval,
+      t0 = start_times,
+      tf = projection_end_day,
+      tree = TRUE
+    ) %>%
+      mutate(sim = sim)
+  }
+)
+
+sim_output <- do.call(rbind, sim_chain_sizes)
+
+head(sim_output)
 ```
 
+From the simulated data, we count the median daily cases across 
+all simulations and overlay that over a plot of all the projections through time.
+
+#### Post-processing
 
-### Post-processing
 ```{r post_processing}
-ref_date <- min(cases_df$date)
+index_date <- min(covid19_sa$date)
 
-incidence_ts <- sim_output |> 
-  mutate(day = floor(time)) |> 
-  group_by(sim, day) |> 
-  summarise(cases = n()) |>  
+# Daily number of cases for each simulation
+incidence_ts <- sim_output %>%
+  mutate(day = ceiling(time)) %>%
+  group_by(sim, day) %>%
+  summarise(cases = n()) %>%
   ungroup()
 
+# Add dates
+incidence_ts <- incidence_ts %>%
+  group_by(sim) %>%
+  mutate(date = index_date + (0:(n() - 1))) %>%
+  ungroup()
+
+## Median daily number of cases aggregated across all simulations
+median_daily_cases <- incidence_ts %>%
+  group_by(day) %>%
+  summarise(median_cases = median(cases)) %>%
+  ungroup() %>%
+  arrange(day)
 
-## Median cases by date.  
-median_daily_cases <- incidence_ts |>
-  group_by(day)|>
-  summarise(median_cases = median(cases)) |>
-  ungroup()|>
-  arrange(day) |>
-  mutate(date = ymd(ref_date) + 0:(project_to_date - 1))
+# Add dates
+median_daily_cases <- median_daily_cases %>%
+  mutate(date = index_date + 0:projection_end_day) %>%
+  ungroup()
 
 ```
 
 
-## Visualization
-```{r visualisation}
+### Visualization
+
+```{r viz, fig.cap ="Projected COVID-19 epidemiological trend. Gray lines represent individual simulation results and red dots represent the median daily cases across all simulations.", fig.width=2.0, fig.height=1.8}
 # Visualization
-cases_plot <- ggplot(data = median_daily_cases) +
-  geom_col(aes(x = date, y = median_cases),
-           fill = "tomato3",
-           size = 1
+cases_plot <- ggplot(data = incidence_ts) +
+  geom_line(aes(
+    x = date,
+    y = cases,
+  group = sim
+  ),
+  color = "grey",
+  linewidth = 1.2,
+  alpha = 0.25
+  ) +
+  geom_point(
+    data = median_daily_cases,
+    aes(
+      x = date,
+      y = median_cases
+    ),
+    color = "tomato3",
+    size = 0.75
+  ) +
+  scale_x_continuous(
+    breaks = seq(min(incidence_ts$date), max(incidence_ts$date), 10),
+    labels = seq(min(incidence_ts$date), max(incidence_ts$date), 10)
   ) +
-  scale_y_continuous(breaks = seq(0, max(median_daily_cases$median_cases) + 20, 20),
-                     labels = seq(0, max(median_daily_cases$median_cases) + 20, 20)
+  scale_y_continuous(
+    breaks = seq(0, max(incidence_ts$cases) + 200, 100),
+    labels = seq(0, max(incidence_ts$cases) + 200, 100)
   ) +
-  labs(x = 'Date', y = 'Daily cases (median)') + 
-  theme_minimal(base_size = 14)
+  labs(x = "Date", y = "Daily cases (median)") +
+  theme_minimal(base_size = 4) +
+  NULL
 
 print(cases_plot)
 ```
 
+### References

From 3b16119f80faa6385df2ed9b9583a9ba41851f94 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Mon, 6 Feb 2023 11:50:25 +0000
Subject: [PATCH 082/828] generated basic package-level documentation.

---
 R/bpmodels-package.R | 6 ++++++
 1 file changed, 6 insertions(+)
 create mode 100644 R/bpmodels-package.R

diff --git a/R/bpmodels-package.R b/R/bpmodels-package.R
new file mode 100644
index 00000000..a65cf643
--- /dev/null
+++ b/R/bpmodels-package.R
@@ -0,0 +1,6 @@
+#' @keywords internal
+"_PACKAGE"
+
+## usethis namespace: start
+## usethis namespace: end
+NULL

From ff087bfee4ff844c236630d4411e09921578b6f4 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Mon, 6 Feb 2023 15:09:16 +0000
Subject: [PATCH 083/828] updated bib lib

---
 vignettes/references.bib | 439 ++++++++++++++++++++-------------------
 1 file changed, 227 insertions(+), 212 deletions(-)

diff --git a/vignettes/references.bib b/vignettes/references.bib
index ae7dc255..7cec9b45 100644
--- a/vignettes/references.bib
+++ b/vignettes/references.bib
@@ -1,233 +1,248 @@
 @article{abbott2020,
-  title={The transmissibility of novel Coronavirus in the early stages of the 2019-20 outbreak in Wuhan: Exploring initial point-source exposure sizes and durations using scenario analysis},
-  author={Abbott, Sam and Hellewell, Joel and Munday, James and Funk, Sebastian and CMMID nCoV working group and others},
-  journal={Wellcome open research},
-  volume={5},
-  year={2020},
-  publisher={The Wellcome Trust}
+  title     = {The transmissibility of novel Coronavirus in the early stages of the 2019-20 outbreak in Wuhan: Exploring initial point-source exposure sizes and durations using scenario analysis},
+  author    = {Abbott, Sam and Hellewell, Joel and Munday, James and Funk, Sebastian and CMMID nCoV working group and others},
+  journal   = {Wellcome open research},
+  volume    = {5},
+  year      = {2020},
+  publisher = {The Wellcome Trust}
 }
-
-@article{marivate2020,
-  title={Use of available data to inform the COVID-19 outbreak in South Africa: a case study},
-  author={Marivate, Vukosi and Combrink, Herkulaas MvE},
-  journal={arXiv preprint arXiv:2004.04813},
-  year={2020}
+@article{Alene2021,
+  abstract  = {Background: Understanding the epidemiological parameters that determine the transmission dynamics of COVID-19 is essential for public health intervention. Globally, a number of studies were conducted to estimate the average serial interval and incubation period of COVID-19. Combining findings of existing studies that estimate the average serial interval and incubation period of COVID-19 significantly improves the quality of evidence. Hence, this study aimed to determine the overall average serial interval and incubation period of COVID-19. Methods: We followed the PRISMA checklist to present this study. A comprehensive search strategy was carried out from international electronic databases (Google Scholar, PubMed, Science Direct, Web of Science, CINAHL, and Cochrane Library) by two experienced reviewers (MAA and DBK) authors between the 1st of June and the 31st of July 2020. All observational studies either reporting the serial interval or incubation period in persons diagnosed with COVID-19 were included in this study. Heterogeneity across studies was assessed using the I2 and Higgins test. The NOS adapted for cross-sectional studies was used to evaluate the quality of studies. A random effect Meta-analysis was employed to determine the pooled estimate with 95% (CI). Microsoft Excel was used for data extraction and R software was used for analysis. Results: We combined a total of 23 studies to estimate the overall mean serial interval of COVID-19. The mean serial interval of COVID-19 ranged from 4. 2 to 7.5 days. Our meta-analysis showed that the weighted pooled mean serial interval of COVID-19 was 5.2 (95%CI: 4.9–5.5) days. Additionally, to pool the mean incubation period of COVID-19, we included 14 articles. The mean incubation period of COVID-19 also ranged from 4.8 to 9 days. Accordingly, the weighted pooled mean incubation period of COVID-19 was 6.5 (95%CI: 5.9–7.1) days. Conclusions: This systematic review and meta-analysis showed that the weighted pooled mean serial interval and incubation period of COVID-19 were 5.2, and 6.5 days, respectively. In this study, the average serial interval of COVID-19 is shorter than the average incubation period, which suggests that substantial numbers of COVID-19 cases will be attributed to presymptomatic transmission.},
+  author    = {Alene, Muluneh and Yismaw, Leltework and Assemie, Moges Agazhe and Ketema, Daniel Bekele and Gietaneh, Wodaje and Birhan, Tilahun Yemanu},
+  doi       = {10.1186/s12879-021-05950-x},
+  issn      = {14712334},
+  journal   = {BMC Infectious Diseases},
+  keywords  = {COVID-19,Incubation period,Meta-analysis,Serial interval},
+  number    = {1},
+  pages     = {1--9},
+  pmid      = {33706702},
+  publisher = {BMC Infectious Diseases},
+  title     = {{Serial interval and incubation period of COVID-19: a systematic review and meta-analysis}},
+  volume    = {21},
+  year      = {2021}
 }
 
-@article{Farrington2003,
-abstract = {Mass vaccination programmes aim to maintain the effective reproduction number R of an infection below unity. We describe methods for monitoring the value of R using surveillance data. The models are based on branching processes in which R is identified with the offspring mean. We derive unconditional likelihoods for the offspring mean using data on outbreak size and outbreak duration. We also discuss Bayesian methods, implemented by Metropolis-Hastings sampling. We investigate by simulation the validity of the models with respect to depletion of susceptibles and under-ascertainment of cases. The methods are illustrated using surveillance data on measles in the USA.},
-author = {Farrington, C. P. and Kanaan, M. N. and Gay, N. J.},
-doi = {10.1093/biostatistics/4.2.279},
-issn = {14654644},
-journal = {Biostatistics (Oxford, England)},
-number = {2},
-pages = {279--295},
-title = {{Branching process models for surveillance of infectious diseases controlled by mass vaccination.}},
-volume = {4},
-year = {2003}
-}
-@article{Jacob2010,
-abstract = {Branching processes are stochastic individual-based processes leading consequently to a bottom-up approach. In addition, since the state variables are random integer variables (representing population sizes), the extinction occurs at random finite time on the extinction set, thus leading to fine and realistic predictions. Starting from the simplest and well-known single-type Bienaym{\'{e}}-Galton-Watson branching process that was used by several authors for approximating the beginning of an epidemic, we then present a general branching model with age and population dependent individual transitions. However contrary to the classical Bienaym{\'{e}}-Galton-Watson or asymptotically Bienaym{\'{e}}-Galton-Watson setting, where the asymptotic behavior of the process, as time tends to infinity, is well understood, the asymptotic behavior of this general process is a new question. Here we give some solutions for dealing with this problem depending on whether the initial population size is large or small, and whether the disease is rare or non-rare when the initial population size is large.},
-author = {Jacob, Christine},
-doi = {10.3390/ijerph7031204},
-issn = {16604601},
-journal = {International Journal of Environmental Research and Public Health},
-keywords = {Age-dependence,Branching process,Epidemic size,Extinction time,Population-dependence},
-number = {3},
-pages = {1186--1204},
-title = {{Branching processes: Their role in epidemiology}},
-volume = {7},
-year = {2010}
+@article{Allen2012,
+  abstract = {The basic reproduction number, ℛ(0), one of the most well-known thresholds in deterministic epidemic theory, predicts a disease outbreak if ℛ(0)>1. In stochastic epidemic theory, there are also thresholds that predict a major outbreak. In the case of a single infectious group, if ℛ(0)>1 and i infectious individuals are introduced into a susceptible population, then the probability of a major outbreak is approximately 1-(1/ℛ(0))( i ). With multiple infectious groups from which the disease could emerge, this result no longer holds. Stochastic thresholds for multiple groups depend on the number of individuals within each group, i ( j ), j=1, {\ldots}, n, and on the probability of disease extinction for each group, q ( j ). It follows from multitype branching processes that the probability of a major outbreak is approximately [Formula: see text]. In this investigation, we summarize some of the deterministic and stochastic threshold theory, illustrate how to calculate the stochastic thresholds, and derive some new relationships between the deterministic and stochastic thresholds.},
+  author   = {Allen, Linda J.S. and Lahodny, Glenn E.},
+  doi      = {10.1080/17513758.2012.665502},
+  issn     = {17513758},
+  journal  = {Journal of Biological Dynamics},
+  keywords = {multitype branching processes,reproduction numbers},
+  number   = {2},
+  pages    = {590--611},
+  title    = {{Extinction thresholds in deterministic and stochastic epidemic models}},
+  volume   = {6},
+  year     = {2012}
 }
+
 @article{Blumberg2013,
-abstract = {Many diseases exhibit subcritical transmission (i.e. 0<R0<1) so that infections occur as self-limited 'stuttering chains'. Given an ensemble of stuttering chains, information about the number of cases in each chain can be used to infer R0, which is of crucial importance for monitoring the risk that a disease will emerge to establish endemic circulation. However, the challenge of imperfect case detection has led authors to adopt a variety of work-around measures when inferring R0, such as discarding data on isolated cases or aggregating intermediate-sized chains together. Each of these methods has the potential to introduce bias, but a quantitative comparison of these approaches has not been reported. By adapting a model based on a negative binomial offspring distribution that permits a variable degree of transmission heterogeneity, we present a unified analysis of existing R0 estimation methods. Simulation studies show that the degree of transmission heterogeneity, when improperly modeled, can significantly impact the bias of R0 estimation methods designed for imperfect observation. These studies also highlight the importance of isolated cases in assessing whether an estimation technique is consistent with observed data. Analysis of data from measles outbreaks shows that likelihood scores are highest for models that allow a flexible degree of transmission heterogeneity. Aggregating intermediate sized chains often has similar performance to analyzing a complete chain size distribution. However, truncating isolated cases is beneficial only when surveillance systems clearly favor full observation of large chains but not small chains. Meanwhile, if data on the type and proportion of cases that are unobserved were known, we demonstrate that maximum likelihood inference of R0 could be adjusted accordingly. This motivates the need for future empirical and theoretical work to quantify observation error and incorporate relevant mechanisms into stuttering chain models used to estimate transmission parameters. {\textcopyright} 2013 Elsevier B.V.},
-author = {Blumberg, S. and Lloyd-Smith, J. O.},
-doi = {10.1016/j.epidem.2013.05.002},
-issn = {17554365},
-journal = {Epidemics},
-keywords = {Basic reproductive number,Imperfect observation,Measles,Stuttering chain,Transmission heterogeneity},
-number = {3},
-pages = {131--145},
-pmid = {24021520},
-publisher = {Elsevier B.V.},
-title = {{Comparing methods for estimating R0 from the size distribution of subcritical transmission chains}},
-url = {http://dx.doi.org/10.1016/j.epidem.2013.05.002},
-volume = {5},
-year = {2013}
+  abstract  = {Many diseases exhibit subcritical transmission (i.e. 0<R0<1) so that infections occur as self-limited 'stuttering chains'. Given an ensemble of stuttering chains, information about the number of cases in each chain can be used to infer R0, which is of crucial importance for monitoring the risk that a disease will emerge to establish endemic circulation. However, the challenge of imperfect case detection has led authors to adopt a variety of work-around measures when inferring R0, such as discarding data on isolated cases or aggregating intermediate-sized chains together. Each of these methods has the potential to introduce bias, but a quantitative comparison of these approaches has not been reported. By adapting a model based on a negative binomial offspring distribution that permits a variable degree of transmission heterogeneity, we present a unified analysis of existing R0 estimation methods. Simulation studies show that the degree of transmission heterogeneity, when improperly modeled, can significantly impact the bias of R0 estimation methods designed for imperfect observation. These studies also highlight the importance of isolated cases in assessing whether an estimation technique is consistent with observed data. Analysis of data from measles outbreaks shows that likelihood scores are highest for models that allow a flexible degree of transmission heterogeneity. Aggregating intermediate sized chains often has similar performance to analyzing a complete chain size distribution. However, truncating isolated cases is beneficial only when surveillance systems clearly favor full observation of large chains but not small chains. Meanwhile, if data on the type and proportion of cases that are unobserved were known, we demonstrate that maximum likelihood inference of R0 could be adjusted accordingly. This motivates the need for future empirical and theoretical work to quantify observation error and incorporate relevant mechanisms into stuttering chain models used to estimate transmission parameters. {\textcopyright} 2013 Elsevier B.V.},
+  author    = {Blumberg, S. and Lloyd-Smith, J. O.},
+  doi       = {10.1016/j.epidem.2013.05.002},
+  issn      = {17554365},
+  journal   = {Epidemics},
+  keywords  = {Basic reproductive number,Imperfect observation,Measles,Stuttering chain,Transmission heterogeneity},
+  number    = {3},
+  pages     = {131--145},
+  pmid      = {24021520},
+  publisher = {Elsevier B.V.},
+  title     = {{Comparing methods for estimating R0 from the size distribution of subcritical transmission chains}},
+  url       = {http://dx.doi.org/10.1016/j.epidem.2013.05.002},
+  volume    = {5},
+  year      = {2013}
 }
-@article{Nishiura2012,
-abstract = {Use of the final size distribution of minor outbreaks for the estimation of the reproduction numbers of supercritical epidemic processes has yet to be considered. We used a branching process model to derive the final size distribution of minor outbreaks, assuming a reproduction number above unity, and applying the method to final size data for pneumonic plague. Pneumonic plague is a rare disease with only one documented major epidemic in a spatially limited setting. Because the final size distribution of a minor outbreak needs to be normalized by the probability of extinction, we assume that the dispersion parameter (k) of the negative-binomial offspring distribution is known, and examine the sensitivity of the reproduction number to variation in dispersion. Assuming a geometric offspring distribution with k=1, the reproduction number was estimated at 1.16 (95% confidence interval: 0.97-1.38). When less dispersed with k=2, the maximum likelihood estimate of the reproduction number was 1.14. These estimates agreed with those published from transmission network analysis, indicating that the human-to-human transmission potential of the pneumonic plague is not very high. Given only minor outbreaks, transmission potential is not sufficiently assessed by directly counting the number of offspring. Since the absence of a major epidemic does not guarantee a subcritical process, the proposed method allows us to conservatively regard epidemic data from minor outbreaks as supercritical, and yield estimates of threshold values above unity. {\textcopyright} 2011.},
-author = {Nishiura, Hiroshi and Yan, Ping and Sleeman, Candace K. and Mode, Charles J.},
-doi = {10.1016/j.jtbi.2011.10.039},
-issn = {00225193},
-journal = {Journal of Theoretical Biology},
-keywords = {Basic reproduction number,Branching process,Confidence interval,Likelihood function,Statistical model},
-pages = {48--55},
-pmid = {22079419},
-publisher = {Elsevier},
-title = {{Estimating the transmission potential of supercritical processes based on the final size distribution of minor outbreaks}},
-url = {http://dx.doi.org/10.1016/j.jtbi.2011.10.039},
-volume = {294},
-year = {2012}
+@article{Blumberg2013a,
+  abstract = {For many infectious disease processes such as emerging zoonoses and vaccine-preventable diseases, 0<R0<1 and infections occur as self-limited stuttering transmission chains. A mechanistic understanding of transmission is essential for characterizing the risk of emerging diseases and monitoring spatio-temporal dynamics. Thus methods for inferring R0 and the degree of heterogeneity in transmission from stuttering chain data have important applications in disease surveillance and management. Previous researchers have used chain size distributions to infer R0, but estimation of the degree of individual-level variation in infectiousness (as quantified by the dispersion parameter, k) has typically required contact tracing data. Utilizing branching process theory along with a negative binomial offspring distribution, we demonstrate how maximum likelihood estimation can be applied to chain size data to infer both R0 and the dispersion parameter that characterizes heterogeneity. While the maximum likelihood value for R0 is a simple function of the average chain size, the associated confidence intervals are dependent on the inferred degree of transmission heterogeneity. As demonstrated for monkeypox data from the Democratic Republic of Congo, this impacts when a statistically significant change in R0 is detectable. In addition, by allowing for superspreading events, inference of k shifts the threshold above which a transmission chain should be considered anomalously large for a given value of R0 (thus reducing the probability of false alarms about pathogen adaptation). Our analysis of monkeypox also clarifies the various ways that imperfect observation can impact inference of transmission parameters, and highlights the need to quantitatively evaluate whether observation is likely to significantly bias results.},
+  author   = {Blumberg, Seth and Lloyd-Smith, James O.},
+  doi      = {10.1371/journal.pcbi.1002993},
+  issn     = {15537358},
+  journal  = {PLoS Computational Biology},
+  number   = {5},
+  pages    = {1--17},
+  pmid     = {23658504},
+  title    = {{Inference of R0 and Transmission Heterogeneity from the Size Distribution of Stuttering Chains}},
+  volume   = {9},
+  year     = {2013}
 }
-@article{Society2010,
-author = {Becker, Niels and Society, International Biometric},
-issn = {0006-341X},
-journal = {Biometrics},
-number = {3},
-pages = {515--522},
-publisher = {JSTOR},
-title = {{Estimation for discrete time branching processes with application to epidemics}},
-volume = {33},
-year = {1977}
+@article{Chen2022,
+  abstract  = {The generation time distribution, reflecting the time between successive infections in transmission chains, is a key epidemiological parameter for describing COVID-19 transmission dynamics. However, because exact infection times are rarely known, it is often approximated by the serial interval distribution. This approximation holds under the assumption that infectors and infectees share the same incubation period distribution, which may not always be true. We estimated incubation period and serial interval distributions using 629 transmission pairs reconstructed by investigating 2989 confirmed cases in China in January-February 2020, and developed an inferential framework to estimate the generation time distribution that accounts for variation over time due to changes in epidemiology, sampling biases and public health and social measures. We identified substantial reductions over time in the serial interval and generation time distributions. Our proposed method provides more reliable estimation of the temporal variation in the generation time distribution, improving assessment of transmission dynamics.},
+  author    = {Chen, Dongxuan and Lau, Yiu Chung and Xu, Xiao Ke and Wang, Lin and Du, Zhanwei and Tsang, Tim K. and Wu, Peng and Lau, Eric H.Y. and Wallinga, Jacco and Cowling, Benjamin J. and Ali, Sheikh Taslim},
+  doi       = {10.1038/s41467-022-35496-8},
+  issn      = {20411723},
+  journal   = {Nature Communications},
+  number    = {1},
+  publisher = {Springer US},
+  title     = {{Inferring time-varying generation time, serial interval, and incubation period distributions for COVID-19}},
+  volume    = {13},
+  year      = {2022}
 }
-@article{Allen2012,
-abstract = {The basic reproduction number, ℛ(0), one of the most well-known thresholds in deterministic epidemic theory, predicts a disease outbreak if ℛ(0)>1. In stochastic epidemic theory, there are also thresholds that predict a major outbreak. In the case of a single infectious group, if ℛ(0)>1 and i infectious individuals are introduced into a susceptible population, then the probability of a major outbreak is approximately 1-(1/ℛ(0))( i ). With multiple infectious groups from which the disease could emerge, this result no longer holds. Stochastic thresholds for multiple groups depend on the number of individuals within each group, i ( j ), j=1, {\ldots}, n, and on the probability of disease extinction for each group, q ( j ). It follows from multitype branching processes that the probability of a major outbreak is approximately [Formula: see text]. In this investigation, we summarize some of the deterministic and stochastic threshold theory, illustrate how to calculate the stochastic thresholds, and derive some new relationships between the deterministic and stochastic thresholds.},
-author = {Allen, Linda J.S. and Lahodny, Glenn E.},
-doi = {10.1080/17513758.2012.665502},
-issn = {17513758},
-journal = {Journal of Biological Dynamics},
-keywords = {multitype branching processes,reproduction numbers},
-number = {2},
-pages = {590--611},
-title = {{Extinction thresholds in deterministic and stochastic epidemic models}},
-volume = {6},
-year = {2012}
+@article{Farrington1999,
+  abstract = {We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.},
+  author   = {Farrington, C. P. and Grant, A. D.},
+  doi      = {10.1239/jap/1032374633},
+  issn     = {00219002},
+  journal  = {Journal of Applied Probability},
+  keywords = {Branching process,Epidemic model,Extinction,Generation distribution,Maximum likelihood estimation,Power series family},
+  number   = {3},
+  pages    = {771--779},
+  title    = {{The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease}},
+  volume   = {36},
+  year     = {1999}
 }
-@article{Blumberg2013a,
-abstract = {For many infectious disease processes such as emerging zoonoses and vaccine-preventable diseases, 0<R0<1 and infections occur as self-limited stuttering transmission chains. A mechanistic understanding of transmission is essential for characterizing the risk of emerging diseases and monitoring spatio-temporal dynamics. Thus methods for inferring R0 and the degree of heterogeneity in transmission from stuttering chain data have important applications in disease surveillance and management. Previous researchers have used chain size distributions to infer R0, but estimation of the degree of individual-level variation in infectiousness (as quantified by the dispersion parameter, k) has typically required contact tracing data. Utilizing branching process theory along with a negative binomial offspring distribution, we demonstrate how maximum likelihood estimation can be applied to chain size data to infer both R0 and the dispersion parameter that characterizes heterogeneity. While the maximum likelihood value for R0 is a simple function of the average chain size, the associated confidence intervals are dependent on the inferred degree of transmission heterogeneity. As demonstrated for monkeypox data from the Democratic Republic of Congo, this impacts when a statistically significant change in R0 is detectable. In addition, by allowing for superspreading events, inference of k shifts the threshold above which a transmission chain should be considered anomalously large for a given value of R0 (thus reducing the probability of false alarms about pathogen adaptation). Our analysis of monkeypox also clarifies the various ways that imperfect observation can impact inference of transmission parameters, and highlights the need to quantitatively evaluate whether observation is likely to significantly bias results.},
-author = {Blumberg, Seth and Lloyd-Smith, James O.},
-doi = {10.1371/journal.pcbi.1002993},
-issn = {15537358},
-journal = {PLoS Computational Biology},
-number = {5},
-pages = {1--17},
-pmid = {23658504},
-title = {{Inference of R0 and Transmission Heterogeneity from the Size Distribution of Stuttering Chains}},
-volume = {9},
-year = {2013}
+@article{Farrington1999a,
+  abstract = {We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.},
+  author   = {Farrington, C. P. and Grant, A. D.},
+  doi      = {10.1239/jap/1032374633},
+  issn     = {00219002},
+  journal  = {Journal of Applied Probability},
+  keywords = {Branching process,Epidemic model,Extinction,Generation distribution,Maximum likelihood estimation,Power series family},
+  number   = {3},
+  pages    = {771--779},
+  title    = {{The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease}},
+  volume   = {36},
+  year     = {1999}
 }
-@article{Chen2022,
-abstract = {The generation time distribution, reflecting the time between successive infections in transmission chains, is a key epidemiological parameter for describing COVID-19 transmission dynamics. However, because exact infection times are rarely known, it is often approximated by the serial interval distribution. This approximation holds under the assumption that infectors and infectees share the same incubation period distribution, which may not always be true. We estimated incubation period and serial interval distributions using 629 transmission pairs reconstructed by investigating 2989 confirmed cases in China in January-February 2020, and developed an inferential framework to estimate the generation time distribution that accounts for variation over time due to changes in epidemiology, sampling biases and public health and social measures. We identified substantial reductions over time in the serial interval and generation time distributions. Our proposed method provides more reliable estimation of the temporal variation in the generation time distribution, improving assessment of transmission dynamics.},
-author = {Chen, Dongxuan and Lau, Yiu Chung and Xu, Xiao Ke and Wang, Lin and Du, Zhanwei and Tsang, Tim K. and Wu, Peng and Lau, Eric H.Y. and Wallinga, Jacco and Cowling, Benjamin J. and Ali, Sheikh Taslim},
-doi = {10.1038/s41467-022-35496-8},
-issn = {20411723},
-journal = {Nature Communications},
-number = {1},
-publisher = {Springer US},
-title = {{Inferring time-varying generation time, serial interval, and incubation period distributions for COVID-19}},
-volume = {13},
-year = {2022}
+@article{Farrington2003,
+  abstract = {Mass vaccination programmes aim to maintain the effective reproduction number R of an infection below unity. We describe methods for monitoring the value of R using surveillance data. The models are based on branching processes in which R is identified with the offspring mean. We derive unconditional likelihoods for the offspring mean using data on outbreak size and outbreak duration. We also discuss Bayesian methods, implemented by Metropolis-Hastings sampling. We investigate by simulation the validity of the models with respect to depletion of susceptibles and under-ascertainment of cases. The methods are illustrated using surveillance data on measles in the USA.},
+  author   = {Farrington, C. P. and Kanaan, M. N. and Gay, N. J.},
+  doi      = {10.1093/biostatistics/4.2.279},
+  issn     = {14654644},
+  journal  = {Biostatistics (Oxford, England)},
+  number   = {2},
+  pages    = {279--295},
+  title    = {{Branching process models for surveillance of infectious diseases controlled by mass vaccination.}},
+  volume   = {4},
+  year     = {2003}
 }
-@article{Lehtinen2021,
-abstract = {The timing of transmission plays a key role in the dynamics and controllability of an epidemic. However, observing generation times - the time interval between the infection of an infector and an infectee in a transmission pair - requires data on infection times, which are generally unknown. The timing of symptom onset is more easily observed; generation times are therefore often estimated based on serial intervals - the time interval between symptom onset of an infector and an infectee. This estimation follows one of two approaches: (i) approximating the generation time distribution by the serial interval distribution or (ii) deriving the generation time distribution from the serial interval and incubation period - the time interval between infection and symptom onset in a single individual - distributions. These two approaches make different - and not always explicitly stated - assumptions about the relationship between infectiousness and symptoms, resulting in different generation time distributions with the same mean but unequal variances. Here, we clarify the assumptions that each approach makes and show that neither set of assumptions is plausible for most pathogens. However, the variances of the generation time distribution derived under each assumption can reasonably be considered as upper (approximation with serial interval) and lower (derivation from serial interval) bounds. Thus, we suggest a pragmatic solution is to use both approaches and treat these as edge cases in downstream analysis. We discuss the impact of the variance of the generation time distribution on the controllability of an epidemic through strategies based on contact tracing, and we show that underestimating this variance is likely to overestimate controllability.},
-author = {Lehtinen, Sonja and Ashcroft, Peter and Bonhoeffer, Sebastian},
-doi = {10.1098/rsif.2020.0756},
-issn = {17425662},
-journal = {Journal of the Royal Society Interface},
-keywords = {SARS-CoV-2,contact tracing,epidemiology,generation time,infectiousness,modelling},
-number = {174},
-pmid = {33402022},
-title = {{On the relationship between serial interval, infectiousness profile and generation time: On the relationship between serial interval, infectiousness profile and generation time}},
-volume = {18},
-year = {2021}
+@article{Fine2003,
+  abstract = {The interval between successive cases of an infectious disease is determined by the time from infection to infectiousness, the duration of infectiousness, the time from infection to disease onset (incubation period), the duration of any extra-human phase of the infectious agent, and the proportion clinically affected among infected individuals. The interval is important in the interpretation of infectious disease surveillance and trend data, in the identification of outbreaks, and in the optimization of quarantine and contact tracing. This paper discusses the properties of these intervals, as measured between transmission events or between clinical onsets of successive infected individuals, noting the determinants of their ranges and frequency distributions, the circumstances under which secondary cases may arise before primaries, and under which the infection transmission interval will be different from the interval between clinical onsets of successive cases. It discusses the derivation of interval distribution statistics from descriptive data given in standard textbooks, with illustrations from published data on outbreaks, households, and epidemiologic tracing. Finally, it discusses the implications of such measures for studies of secondary attack rates, for the persistence of infection in human communities, for outbreak response, and for elimination or eradication programs.},
+  author   = {Fine, Paul E.M.},
+  doi      = {10.1093/aje/kwg251},
+  isbn     = {0002-9262 (Print) 0002-9262 (Linking)},
+  issn     = {00029262},
+  journal  = {American Journal of Epidemiology},
+  keywords = {Communicable diseases,Disease outbreaks},
+  number   = {11},
+  pages    = {1039--1047},
+  pmid     = {14630599},
+  title    = {{The Interval between Successive Cases of an Infectious Disease}},
+  volume   = {158},
+  year     = {2003}
 }
-@article{Pearson2020,
-abstract = {For 45 African countries/territories already reporting COVID-19 cases before 23 March 2020, we estimate the dates of reporting 1,000 and 10,000 cases. Assuming early epidemic trends without interventions, all 45 were likely to exceed 1,000 confirmed cases by the end of April 2020, with most exceeding 10,000 a few weeks later.},
-author = {Pearson, Carl A.B. and van Schalkwyk, Cari and Foss, Anna M. and O'Reilly, Kathleen M. and Pulliam, Juliet R.C.},
-doi = {10.2807/1560-7917.ES.2020.25.18.2000543},
-issn = {15607917},
-journal = {Eurosurveillance},
-number = {18},
-pages = {1--6},
-pmid = {32400361},
-publisher = {European Centre for Disease Prevention and Control (ECDC)},
-title = {{Projected early spread of COVID-19 in Africa through 1 June 2020}},
-url = {http://dx.doi.org/10.2807/1560-7917.ES.2020.25.18.2000543},
-volume = {25},
-year = {2020}
+@article{Grassly2006a,
+  abstract = {Seasonal change in the incidence of infectious diseases is a common phenomenon in both temperate and tropical climates. However, the mechanisms responsible for seasonal disease incidence, and the epidemiological consequences of seasonality, are poorly understood with rare exception. Standard epidemiological theory and concepts such as the basic reproductive number R  0  no longer apply, and the implications for interventions that themselves may be periodic, such as pulse vaccination, have not been formally examined. This paper examines the causes and consequences of seasonality, and in so doing derives several new results concerning vaccination strategy and the interpretation of disease outbreak data. It begins with a brief review of published scientific studies in support of different causes of seasonality in infectious diseases of humans, identifying four principal mechanisms and their association with different routes of transmission. It then describes the consequences of seasonality for R 0 , disease outbreaks, endemic dynamics and persistence. Finally, a mathematical analysis of routine and pulse vaccination programmes for seasonal infections is presented. The synthesis of seasonal infectious disease epidemiology attempted by this paper highlights the need for further empirical and theoretical work. {\textcopyright} 2006 The Royal Society.},
+  author   = {Grassly, Nicholas C. and Fraser, Christophe},
+  doi      = {10.1098/rspb.2006.3604},
+  issn     = {14712970},
+  journal  = {Proceedings of the Royal Society B: Biological Sciences},
+  keywords = {Communicable diseases,Disease outbreaks,Epidemiology,Seasons,Vaccination},
+  number   = {1600},
+  pages    = {2541--2550},
+  title    = {{Seasonal infectious disease epidemiology}},
+  volume   = {273},
+  year     = {2006}
 }
 @article{Griffin2020,
-abstract = {The serial interval is the time between symptom onsets in an infector-infectee pair. The generation time, also known as the generation interval, is the time between infection events in an infector-infectee pair. The serial interval and the generation time are key parameters for assessing the dynamics of a disease. A number of scientific papers reported information pertaining to the serial interval and/or generation time for COVID-19. Objective Conduct a review of available evidence to advise on appropriate parameter values for serial interval and generation time in national COVID-19 transmission models for Ireland and on methodological issues relating to those parameters. Methods We conducted a rapid review of the literature covering the period 1 January 2020 and 21 August 2020, following predefined eligibility criteria. Forty scientific papers met our inclusion criteria and were included in the review. Results The mean of the serial interval ranged from 3.03 to 7.6 days, based on 38 estimates, and the median from 1.0 to 6.0 days (based on 15 estimates). Only three estimates were provided for the mean of the generation time. These ranged from 3.95 to 5.20 days. One estimate of 5.0 days was provided for the median of the generation time. Discussion Estimates of the serial interval and the generation time are very dependent on the specific factors that apply at the time that the data are collected, including the level of social contact. Consequently, the estimates may not be entirely relevant to other environments. Therefore, local estimates should be obtained as soon as possible. Careful consideration should be given to the methodology that is used. Real-time estimations of the serial interval/generation time, allowing for variations over time, may provide more accurate estimates of reproduction numbers than using conventionally fixed serial interval/generation time distributions.},
-author = {Griffin, John and Casey, Miriam and Collins, {\'{A}}ine and Hunt, Kevin and McEvoy, David and Byrne, Andrew and McAloon, Conor and Barber, Ann and Lane, Elizabeth Ann and More, Simon},
-doi = {10.1136/bmjopen-2020-040263},
-isbn = {9789241512763},
-issn = {20446055},
-journal = {BMJ Open},
-keywords = {COVID-19,epidemiology,public health,virology},
-number = {11},
-pages = {1--9},
-pmid = {33234640},
-title = {{Rapid review of available evidence on the serial interval and generation time of COVID-19}},
-volume = {10},
-year = {2020}
+  abstract = {The serial interval is the time between symptom onsets in an infector-infectee pair. The generation time, also known as the generation interval, is the time between infection events in an infector-infectee pair. The serial interval and the generation time are key parameters for assessing the dynamics of a disease. A number of scientific papers reported information pertaining to the serial interval and/or generation time for COVID-19. Objective Conduct a review of available evidence to advise on appropriate parameter values for serial interval and generation time in national COVID-19 transmission models for Ireland and on methodological issues relating to those parameters. Methods We conducted a rapid review of the literature covering the period 1 January 2020 and 21 August 2020, following predefined eligibility criteria. Forty scientific papers met our inclusion criteria and were included in the review. Results The mean of the serial interval ranged from 3.03 to 7.6 days, based on 38 estimates, and the median from 1.0 to 6.0 days (based on 15 estimates). Only three estimates were provided for the mean of the generation time. These ranged from 3.95 to 5.20 days. One estimate of 5.0 days was provided for the median of the generation time. Discussion Estimates of the serial interval and the generation time are very dependent on the specific factors that apply at the time that the data are collected, including the level of social contact. Consequently, the estimates may not be entirely relevant to other environments. Therefore, local estimates should be obtained as soon as possible. Careful consideration should be given to the methodology that is used. Real-time estimations of the serial interval/generation time, allowing for variations over time, may provide more accurate estimates of reproduction numbers than using conventionally fixed serial interval/generation time distributions.},
+  author   = {Griffin, John and Casey, Miriam and Collins, {\'{A}}ine and Hunt, Kevin and McEvoy, David and Byrne, Andrew and McAloon, Conor and Barber, Ann and Lane, Elizabeth Ann and More, Simon},
+  doi      = {10.1136/bmjopen-2020-040263},
+  isbn     = {9789241512763},
+  issn     = {20446055},
+  journal  = {BMJ Open},
+  keywords = {COVID-19,epidemiology,public health,virology},
+  number   = {11},
+  pages    = {1--9},
+  pmid     = {33234640},
+  title    = {{Rapid review of available evidence on the serial interval and generation time of COVID-19}},
+  volume   = {10},
+  year     = {2020}
 }
-@article{Grassly2006a,
-abstract = {Seasonal change in the incidence of infectious diseases is a common phenomenon in both temperate and tropical climates. However, the mechanisms responsible for seasonal disease incidence, and the epidemiological consequences of seasonality, are poorly understood with rare exception. Standard epidemiological theory and concepts such as the basic reproductive number R  0  no longer apply, and the implications for interventions that themselves may be periodic, such as pulse vaccination, have not been formally examined. This paper examines the causes and consequences of seasonality, and in so doing derives several new results concerning vaccination strategy and the interpretation of disease outbreak data. It begins with a brief review of published scientific studies in support of different causes of seasonality in infectious diseases of humans, identifying four principal mechanisms and their association with different routes of transmission. It then describes the consequences of seasonality for R 0 , disease outbreaks, endemic dynamics and persistence. Finally, a mathematical analysis of routine and pulse vaccination programmes for seasonal infections is presented. The synthesis of seasonal infectious disease epidemiology attempted by this paper highlights the need for further empirical and theoretical work. {\textcopyright} 2006 The Royal Society.},
-author = {Grassly, Nicholas C. and Fraser, Christophe},
-doi = {10.1098/rspb.2006.3604},
-issn = {14712970},
-journal = {Proceedings of the Royal Society B: Biological Sciences},
-keywords = {Communicable diseases,Disease outbreaks,Epidemiology,Seasons,Vaccination},
-number = {1600},
-pages = {2541--2550},
-title = {{Seasonal infectious disease epidemiology}},
-volume = {273},
-year = {2006}
+@article{Jacob2010,
+  abstract = {Branching processes are stochastic individual-based processes leading consequently to a bottom-up approach. In addition, since the state variables are random integer variables (representing population sizes), the extinction occurs at random finite time on the extinction set, thus leading to fine and realistic predictions. Starting from the simplest and well-known single-type Bienaym{\'{e}}-Galton-Watson branching process that was used by several authors for approximating the beginning of an epidemic, we then present a general branching model with age and population dependent individual transitions. However contrary to the classical Bienaym{\'{e}}-Galton-Watson or asymptotically Bienaym{\'{e}}-Galton-Watson setting, where the asymptotic behavior of the process, as time tends to infinity, is well understood, the asymptotic behavior of this general process is a new question. Here we give some solutions for dealing with this problem depending on whether the initial population size is large or small, and whether the disease is rare or non-rare when the initial population size is large.},
+  author   = {Jacob, Christine},
+  doi      = {10.3390/ijerph7031204},
+  issn     = {16604601},
+  journal  = {International Journal of Environmental Research and Public Health},
+  keywords = {Age-dependence,Branching process,Epidemic size,Extinction time,Population-dependence},
+  number   = {3},
+  pages    = {1186--1204},
+  title    = {{Branching processes: Their role in epidemiology}},
+  volume   = {7},
+  year     = {2010}
 }
-@article{Alene2021,
-abstract = {Background: Understanding the epidemiological parameters that determine the transmission dynamics of COVID-19 is essential for public health intervention. Globally, a number of studies were conducted to estimate the average serial interval and incubation period of COVID-19. Combining findings of existing studies that estimate the average serial interval and incubation period of COVID-19 significantly improves the quality of evidence. Hence, this study aimed to determine the overall average serial interval and incubation period of COVID-19. Methods: We followed the PRISMA checklist to present this study. A comprehensive search strategy was carried out from international electronic databases (Google Scholar, PubMed, Science Direct, Web of Science, CINAHL, and Cochrane Library) by two experienced reviewers (MAA and DBK) authors between the 1st of June and the 31st of July 2020. All observational studies either reporting the serial interval or incubation period in persons diagnosed with COVID-19 were included in this study. Heterogeneity across studies was assessed using the I2 and Higgins test. The NOS adapted for cross-sectional studies was used to evaluate the quality of studies. A random effect Meta-analysis was employed to determine the pooled estimate with 95% (CI). Microsoft Excel was used for data extraction and R software was used for analysis. Results: We combined a total of 23 studies to estimate the overall mean serial interval of COVID-19. The mean serial interval of COVID-19 ranged from 4. 2 to 7.5 days. Our meta-analysis showed that the weighted pooled mean serial interval of COVID-19 was 5.2 (95%CI: 4.9–5.5) days. Additionally, to pool the mean incubation period of COVID-19, we included 14 articles. The mean incubation period of COVID-19 also ranged from 4.8 to 9 days. Accordingly, the weighted pooled mean incubation period of COVID-19 was 6.5 (95%CI: 5.9–7.1) days. Conclusions: This systematic review and meta-analysis showed that the weighted pooled mean serial interval and incubation period of COVID-19 were 5.2, and 6.5 days, respectively. In this study, the average serial interval of COVID-19 is shorter than the average incubation period, which suggests that substantial numbers of COVID-19 cases will be attributed to presymptomatic transmission.},
-author = {Alene, Muluneh and Yismaw, Leltework and Assemie, Moges Agazhe and Ketema, Daniel Bekele and Gietaneh, Wodaje and Birhan, Tilahun Yemanu},
-doi = {10.1186/s12879-021-05950-x},
-issn = {14712334},
-journal = {BMC Infectious Diseases},
-keywords = {COVID-19,Incubation period,Meta-analysis,Serial interval},
-number = {1},
-pages = {1--9},
-pmid = {33706702},
-publisher = {BMC Infectious Diseases},
-title = {{Serial interval and incubation period of COVID-19: a systematic review and meta-analysis}},
-volume = {21},
-year = {2021}
+@article{Lehtinen2021,
+  abstract = {The timing of transmission plays a key role in the dynamics and controllability of an epidemic. However, observing generation times - the time interval between the infection of an infector and an infectee in a transmission pair - requires data on infection times, which are generally unknown. The timing of symptom onset is more easily observed; generation times are therefore often estimated based on serial intervals - the time interval between symptom onset of an infector and an infectee. This estimation follows one of two approaches: (i) approximating the generation time distribution by the serial interval distribution or (ii) deriving the generation time distribution from the serial interval and incubation period - the time interval between infection and symptom onset in a single individual - distributions. These two approaches make different - and not always explicitly stated - assumptions about the relationship between infectiousness and symptoms, resulting in different generation time distributions with the same mean but unequal variances. Here, we clarify the assumptions that each approach makes and show that neither set of assumptions is plausible for most pathogens. However, the variances of the generation time distribution derived under each assumption can reasonably be considered as upper (approximation with serial interval) and lower (derivation from serial interval) bounds. Thus, we suggest a pragmatic solution is to use both approaches and treat these as edge cases in downstream analysis. We discuss the impact of the variance of the generation time distribution on the controllability of an epidemic through strategies based on contact tracing, and we show that underestimating this variance is likely to overestimate controllability.},
+  author   = {Lehtinen, Sonja and Ashcroft, Peter and Bonhoeffer, Sebastian},
+  doi      = {10.1098/rsif.2020.0756},
+  issn     = {17425662},
+  journal  = {Journal of the Royal Society Interface},
+  keywords = {SARS-CoV-2,contact tracing,epidemiology,generation time,infectiousness,modelling},
+  number   = {174},
+  pmid     = {33402022},
+  title    = {{On the relationship between serial interval, infectiousness profile and generation time: On the relationship between serial interval, infectiousness profile and generation time}},
+  volume   = {18},
+  year     = {2021}
 }
-@article{Farrington1999a,
-abstract = {We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.},
-author = {Farrington, C. P. and Grant, A. D.},
-doi = {10.1239/jap/1032374633},
-issn = {00219002},
-journal = {Journal of Applied Probability},
-keywords = {Branching process,Epidemic model,Extinction,Generation distribution,Maximum likelihood estimation,Power series family},
-number = {3},
-pages = {771--779},
-title = {{The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease}},
-volume = {36},
-year = {1999}
+@article{marivate2020,
+  title   = {Use of available data to inform the COVID-19 outbreak in South Africa: a case study},
+  author  = {Marivate, Vukosi and Combrink, Herkulaas MvE},
+  journal = {arXiv preprint arXiv:2004.04813},
+  year    = {2020}
 }
-@article{Farrington1999,
-abstract = {We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.},
-author = {Farrington, C. P. and Grant, A. D.},
-doi = {10.1239/jap/1032374633},
-issn = {00219002},
-journal = {Journal of Applied Probability},
-keywords = {Branching process,Epidemic model,Extinction,Generation distribution,Maximum likelihood estimation,Power series family},
-number = {3},
-pages = {771--779},
-title = {{The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease}},
-volume = {36},
-year = {1999}
+@article{Nishiura2012,
+  abstract  = {Use of the final size distribution of minor outbreaks for the estimation of the reproduction numbers of supercritical epidemic processes has yet to be considered. We used a branching process model to derive the final size distribution of minor outbreaks, assuming a reproduction number above unity, and applying the method to final size data for pneumonic plague. Pneumonic plague is a rare disease with only one documented major epidemic in a spatially limited setting. Because the final size distribution of a minor outbreak needs to be normalized by the probability of extinction, we assume that the dispersion parameter (k) of the negative-binomial offspring distribution is known, and examine the sensitivity of the reproduction number to variation in dispersion. Assuming a geometric offspring distribution with k=1, the reproduction number was estimated at 1.16 (95% confidence interval: 0.97-1.38). When less dispersed with k=2, the maximum likelihood estimate of the reproduction number was 1.14. These estimates agreed with those published from transmission network analysis, indicating that the human-to-human transmission potential of the pneumonic plague is not very high. Given only minor outbreaks, transmission potential is not sufficiently assessed by directly counting the number of offspring. Since the absence of a major epidemic does not guarantee a subcritical process, the proposed method allows us to conservatively regard epidemic data from minor outbreaks as supercritical, and yield estimates of threshold values above unity. {\textcopyright} 2011.},
+  author    = {Nishiura, Hiroshi and Yan, Ping and Sleeman, Candace K. and Mode, Charles J.},
+  doi       = {10.1016/j.jtbi.2011.10.039},
+  issn      = {00225193},
+  journal   = {Journal of Theoretical Biology},
+  keywords  = {Basic reproduction number,Branching process,Confidence interval,Likelihood function,Statistical model},
+  pages     = {48--55},
+  pmid      = {22079419},
+  publisher = {Elsevier},
+  title     = {{Estimating the transmission potential of supercritical processes based on the final size distribution of minor outbreaks}},
+  url       = {http://dx.doi.org/10.1016/j.jtbi.2011.10.039},
+  volume    = {294},
+  year      = {2012}
 }
-@article{Fine2003,
-abstract = {The interval between successive cases of an infectious disease is determined by the time from infection to infectiousness, the duration of infectiousness, the time from infection to disease onset (incubation period), the duration of any extra-human phase of the infectious agent, and the proportion clinically affected among infected individuals. The interval is important in the interpretation of infectious disease surveillance and trend data, in the identification of outbreaks, and in the optimization of quarantine and contact tracing. This paper discusses the properties of these intervals, as measured between transmission events or between clinical onsets of successive infected individuals, noting the determinants of their ranges and frequency distributions, the circumstances under which secondary cases may arise before primaries, and under which the infection transmission interval will be different from the interval between clinical onsets of successive cases. It discusses the derivation of interval distribution statistics from descriptive data given in standard textbooks, with illustrations from published data on outbreaks, households, and epidemiologic tracing. Finally, it discusses the implications of such measures for studies of secondary attack rates, for the persistence of infection in human communities, for outbreak response, and for elimination or eradication programs.},
-author = {Fine, Paul E.M.},
-doi = {10.1093/aje/kwg251},
-isbn = {0002-9262 (Print) 0002-9262 (Linking)},
-issn = {00029262},
-journal = {American Journal of Epidemiology},
-keywords = {Communicable diseases,Disease outbreaks},
-number = {11},
-pages = {1039--1047},
-pmid = {14630599},
-title = {{The Interval between Successive Cases of an Infectious Disease}},
-volume = {158},
-year = {2003}
+@article{Pearson2020,
+  abstract  = {For 45 African countries/territories already reporting COVID-19 cases before 23 March 2020, we estimate the dates of reporting 1,000 and 10,000 cases. Assuming early epidemic trends without interventions, all 45 were likely to exceed 1,000 confirmed cases by the end of April 2020, with most exceeding 10,000 a few weeks later.},
+  author    = {Pearson, Carl A.B. and van Schalkwyk, Cari and Foss, Anna M. and O'Reilly, Kathleen M. and Pulliam, Juliet R.C.},
+  doi       = {10.2807/1560-7917.ES.2020.25.18.2000543},
+  issn      = {15607917},
+  journal   = {Eurosurveillance},
+  number    = {18},
+  pages     = {1--6},
+  pmid      = {32400361},
+  publisher = {European Centre for Disease Prevention and Control (ECDC)},
+  title     = {{Projected early spread of COVID-19 in Africa through 1 June 2020}},
+  url       = {http://dx.doi.org/10.2807/1560-7917.ES.2020.25.18.2000543},
+  volume    = {25},
+  year      = {2020}
+}
+@article{Society2010,
+  author    = {Becker, Niels and Society, International Biometric},
+  issn      = {0006-341X},
+  journal   = {Biometrics},
+  number    = {3},
+  pages     = {515--522},
+  publisher = {JSTOR},
+  title     = {{Estimation for discrete time branching processes with application to epidemics}},
+  volume    = {33},
+  year      = {1977}
+}
+@article{Wang2020,
+  abstract  = {Coronavirus disease 2019 (COVID-19) was first identified in late 2019 in Wuhan, Hubei Province, China and spread globally in months, sparking worldwide concern. However, it is unclear whether super-spreading events occurred during the early outbreak phase, as has been observed for other emerging viruses. Here, we analyse 208 publicly available SARS-CoV-2 genome sequences collected during the early outbreak phase. We combine phylogenetic analysis with Bayesian inference under an epidemiological model to trace person-to-person transmission. The dispersion parameter of the offspring distribution in the inferred transmission chain was estimated to be 0.23 (95% CI: 0.13–0.38), indicating there are individuals who directly infected a disproportionately large number of people. Our results showed that super-spreading events played an important role in the early stage of the COVID-19 outbreak.},
+  author    = {Wang, Liang and Didelot, Xavier and Yang, Jing and Wong, Gary and Shi, Yi and Liu, Wenjun and Gao, George F. and Bi, Yuhai},
+  doi       = {10.1038/s41467-020-18836-4},
+  issn      = {20411723},
+  journal   = {Nature Communications},
+  number    = {1},
+  pages     = {1--6},
+  pmid      = {33024095},
+  publisher = {Springer US},
+  title     = {{Inference of person-to-person transmission of COVID-19 reveals hidden super-spreading events during the early outbreak phase}},
+  url       = {http://dx.doi.org/10.1038/s41467-020-18836-4},
+  volume    = {11},
+  year      = {2020}
 }

From 881e55e7a596823bd4f8db2be9b13ff3b717df7d Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Mon, 6 Feb 2023 15:18:27 +0000
Subject: [PATCH 084/828] updated the title

---
 vignettes/projecting_incidence.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index e62f695d..3be1eeec 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -1,5 +1,5 @@
 ---
-title: "Projecting COVID-19 incidence using early outbreak data"
+title: "Projecting infectious disease incidence using early outbreak data"
 author: "James Azam, Sebastian Funk"
 output:
   bookdown::html_vignette2:

From 6e0b1bd5483f5f868b97103d6565362f3add48f6 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Mon, 6 Feb 2023 15:18:51 +0000
Subject: [PATCH 085/828] updated the title

---
 vignettes/projecting_incidence.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 3be1eeec..8810ead7 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -10,7 +10,7 @@ pkgdown:
 bibliography: references.bib
 link-citations: true
 vignette: >
-  %\VignetteIndexEntry{Projecting COVID-19 incidence using early outbreak data}
+  %\VignetteIndexEntry{Projecting infectious disease incidence using early outbreak data}
   %\VignetteEncoding{UTF-8}
   %\VignetteEngine{knitr::rmarkdown}
 editor_options: 

From 63a8b3b2dbac0047c98516df4c97799ad3b58dca Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Mon, 6 Feb 2023 15:19:23 +0000
Subject: [PATCH 086/828] updated the overview section

---
 vignettes/projecting_incidence.Rmd | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 8810ead7..768daf53 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -29,14 +29,14 @@ knitr::opts_chunk$set(echo = TRUE,
 ```
 
 ## Overview
+
 Branching processes can be used to project infectious disease trends provided 
-we have some information on the distribution of times between 
+we can characterize the distribution of times between 
 successive cases (serial interval), and the distribution of secondary cases 
 produced by a single individual (offspring distribution). Such simulations can be achieved in `bpmodels` with the `chain_sim()` function. @Pearson2020, and 
 @abbott2020 illustrate its application to COVID-19. 
 
-The purpose of this vignette is to use early data on COVID-19 in South Africa [@marivate2020] to illustrate how `bpmodels` can be used to forecast 
-an outbreak. 
+The purpose of this vignette is to use early data on COVID-19 in South Africa [@marivate2020] to illustrate how `bpmodels` can be used to forecast an outbreak. 
 
 
 Let's load the required packages

From bf3a521c17b43b624f3647156f23f9e348f8a780 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Mon, 6 Feb 2023 15:21:02 +0000
Subject: [PATCH 087/828] revised typesetting lognormal mean and SD

---
 vignettes/projecting_incidence.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 768daf53..c6c78280 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -106,7 +106,7 @@ serial interval, $si$, is lognormal distributed as follows:
 
 $ E[\text{si}] = \ln \left( \dfrac{\mu^2}{(\sqrt{\mu^2 + \sigma^2}} \right)$
 
-$\text{SD} [\text{si}] = \sqrt {\ln \left(1 + \dfrac{\sigma^2}{\mu^2} \right)}$
+$ SD [\text{si}] = \sqrt {\ln \left(1 + \dfrac{\sigma^2}{\mu^2} \right)}$
 
 with $\mu = 4.7$ and standard deviation $\sigma = 2.9$.
 

From 7495c930df266ec34e5312965e53dfb21cbdc5f9 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Mon, 6 Feb 2023 15:21:40 +0000
Subject: [PATCH 088/828] added a reference for dispersion param, k

---
 vignettes/projecting_incidence.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index c6c78280..ef5566ec 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -126,7 +126,7 @@ serial_interval <- function(sample_size) {
 
 #### Offspring distribution
 
-We assume an offspring distribution that is distributed as a negative binomial with $R = 2.5$ [@abbott2020] and $k = 0.58$. In this parameterization, R represents the $\mathcal{R_0}$, which is defined as the average number of cases produced by a single individual in an entirely susceptible population. The parameter $k$ represents superspreading, that is, the degree of heterogeneity in transmission by single individuals.
+We assume an offspring distribution that is distributed as a negative binomial with $R = 2.5$ [@abbott2020] and $k = 0.58$ [@Wang2020]. In this parameterization, R represents the $\mathcal{R_0}$, which is defined as the average number of cases produced by a single individual in an entirely susceptible population. The parameter $k$ represents superspreading, that is, the degree of heterogeneity in transmission by single individuals.
 
 ### Simulations
 To summarize the simulation set up, for each of the `r sim_rep` simulations, we want to project cases over a `r projection_window` day period since the last case, assuming that no chain would exceed `r chain_threshold`. 

From 1455e830d8558c8f2f1ed0057481d9e20383c132 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Mon, 6 Feb 2023 15:22:18 +0000
Subject: [PATCH 089/828] updated plot caption and axes scales

---
 vignettes/projecting_incidence.Rmd | 27 +++++++++++++++++++++++----
 1 file changed, 23 insertions(+), 4 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index ef5566ec..c6d32423 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -207,13 +207,13 @@ median_daily_cases <- median_daily_cases %>%
 
 ### Visualization
 
-```{r viz, fig.cap ="Projected COVID-19 epidemiological trend. Gray lines represent individual simulation results and red dots represent the median daily cases across all simulations.", fig.width=2.0, fig.height=1.8}
+```{r viz, fig.cap ="COVID-19 incidence projected over a two week window. The gray lines represent individual simulations, red connected dots represent the median daily cases across all simulations, and the black triangles represent the observed data.", fig.width=2.0, fig.height=1.8}
 # Visualization
 cases_plot <- ggplot(data = incidence_ts) +
   geom_line(aes(
     x = date,
     y = cases,
-  group = sim
+    group = sim
   ),
   color = "grey",
   linewidth = 1.2,
@@ -228,13 +228,32 @@ cases_plot <- ggplot(data = incidence_ts) +
     color = "tomato3",
     size = 0.75
   ) +
+  geom_line(
+    data = median_daily_cases,
+    aes(
+      x = date,
+      y = median_cases
+    ),
+    color = "tomato3",
+    linewidth = 0.25
+  ) +
+  geom_point(
+    data = covid19_sa,
+    aes(
+      x = date,
+      y = cases
+    ),
+    color = "black",
+    size = 0.25,
+    shape = 24
+  ) +
   scale_x_continuous(
     breaks = seq(min(incidence_ts$date), max(incidence_ts$date), 10),
     labels = seq(min(incidence_ts$date), max(incidence_ts$date), 10)
   ) +
   scale_y_continuous(
-    breaks = seq(0, max(incidence_ts$cases) + 200, 100),
-    labels = seq(0, max(incidence_ts$cases) + 200, 100)
+    breaks = seq(0, max(incidence_ts$cases) + 200, 250),
+    labels = seq(0, max(incidence_ts$cases) + 200, 250)
   ) +
   labs(x = "Date", y = "Daily cases (median)") +
   theme_minimal(base_size = 4) +

From 03c1fc2e5664208f99de8f097535a38b6c8e9831 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Mon, 6 Feb 2023 15:22:57 +0000
Subject: [PATCH 090/828] generated the package doc Rd file

---
 man/bpmodels-package.Rd | 33 +++++++++++++++++++++++++++++++++
 1 file changed, 33 insertions(+)
 create mode 100644 man/bpmodels-package.Rd

diff --git a/man/bpmodels-package.Rd b/man/bpmodels-package.Rd
new file mode 100644
index 00000000..dd44c49d
--- /dev/null
+++ b/man/bpmodels-package.Rd
@@ -0,0 +1,33 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/bpmodels-package.R
+\docType{package}
+\name{bpmodels-package}
+\alias{bpmodels}
+\alias{bpmodels-package}
+\title{bpmodels: Analysing chain statistics using branching process models}
+\description{
+Provides methods to analyse and simulate the size and length of branching processes with an arbitrary offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks, as discussed in Farrington et al. (2003) \doi{10.1093/biostatistics/4.2.279}.
+}
+\seealso{
+Useful links:
+\itemize{
+  \item \url{https://github.com/sbfnk/bpmodels}
+  \item Report bugs at \url{https://github.com/sbfnk/bpmodels/issues}
+}
+
+}
+\author{
+\strong{Maintainer}: Sebastian Funk \email{sebastian.funk@lshtm.ac.uk}
+
+Authors:
+\itemize{
+  \item Flavio Finger \email{flavio.finger@epicentre.msf.org}
+}
+
+Other contributors:
+\itemize{
+  \item Zhian N. Kamvar \email{zkamvar@gmail.com} [contributor]
+}
+
+}
+\keyword{internal}

From 3ae929d04e4089cf76b5f38e4ce9a838a2f4906b Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 7 Feb 2023 11:42:49 +0000
Subject: [PATCH 091/828] removed bp_literature vignette (not necessarily)

---
 vignettes/articles/bp_literature.Rmd | 24 ------------------------
 1 file changed, 24 deletions(-)
 delete mode 100644 vignettes/articles/bp_literature.Rmd

diff --git a/vignettes/articles/bp_literature.Rmd b/vignettes/articles/bp_literature.Rmd
deleted file mode 100644
index 780062f3..00000000
--- a/vignettes/articles/bp_literature.Rmd
+++ /dev/null
@@ -1,24 +0,0 @@
----
-title: "Applications of branching process models to outbreak modelling"
----
-
-```{r, include = FALSE}
-knitr::opts_chunk$set(
-  collapse = TRUE,
-  comment = "#>"
-)
-```
-
-## Single-type models
-
-- Blumberg S, Lloyd-Smith J. Comparing methods for estimating R0 from the size distribution of sub- critical transmission chains. Epidemics. 2013; 5(3):131–45. doi: https://doi.org/10.1016/j.epidem.2013.05.002 PMID: 24021520
-
-- Blumberg S, Lloyd-Smith JO. Inference of R0 and transmission heterogeneity from the size distribution of stuttering chains. PLoS Comput Biol. 2013; 9(5):e1002993. doi: https://doi.org/10.1371/journal.pcbi.1002993 PMID: 23658504
-
-- Farrington C, Kanaan M, Gay N. Branching process models for surveillance of infectious diseases con- trolled by mass vaccination. Biostatistics. 2003; 4(2):279. doi: https://doi.org/10.1093/biostatistics/4.2.279 PMID: 12925522
-
-- Nishiura H, Yan P, Sleeman CK, Mode CJ. Estimating the transmission potential of supercritical pro- cesses based on the final size distribution of minor outbreaks. J Theor Biol. 2012; 294:48–55. doi: https://doi.org/10.1016/j.jtbi.2011.10.039 PMID: 22079419
-
-## Multi-type models
-
-- Kucharski, A. J., & Edmunds, W. J. (2015). Characterizing the Transmission Potential of Zoonotic Infections from Minor Outbreaks. PLoS Computational Biology, 11(4), 1–17. https://doi.org/10.1371/journal.pcbi.1004154

From 4ce514d213f0e3307ef2bbabb15c4c205afd1d3d Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 7 Feb 2023 11:43:30 +0000
Subject: [PATCH 092/828] removed introduction vignette (content moved
 elsewhere)

---
 vignettes/introduction.Rmd | 103 -------------------------------------
 1 file changed, 103 deletions(-)
 delete mode 100644 vignettes/introduction.Rmd

diff --git a/vignettes/introduction.Rmd b/vignettes/introduction.Rmd
deleted file mode 100644
index c66bb4f4..00000000
--- a/vignettes/introduction.Rmd
+++ /dev/null
@@ -1,103 +0,0 @@
----
-title: "Analysing chain statistics using branching process models"
-author: "Sebastian Funk"
-output:
-  bookdown::html_vignette2:
-    fig_caption: yes
-    code_folding: show
-pkgdown:
-  as_is: true
-bibliography: references.bib
-link-citations: true
-vignette: >
-  %\VignetteIndexEntry{Analysing chain statistics using branching process models}
-  %\VignetteEncoding{UTF-8}
-  %\VignetteEngine{knitr::rmarkdown}
-editor_options: 
-  chunk_output_type: console
----
-
-
-```{r setup, include = FALSE}
-library('knitr')
-knitr::opts_chunk$set(
-  collapse = TRUE,
-  comment = "#>"
-)
-```
-
-[bpmodels](https://github.com/sbfnk/bpmodels) is an `R` package to simulate and analyse the size and length of branching processes with a given offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks.
-
-# Quick start
-
-To load the package, use
-```{r eval=FALSE}
-library('bpmodels')
-```
-```{r echo=FALSE}
-suppressWarnings(library('bpmodels'))
-set.seed(13)
-```
-
-At the heart of the package are the `chains_ll()` and `chains_sim()` functions. 
-
-## Calculating log-likelihoods
-
-The `chains_ll()` function calculates the log-likelihood of a distribution of chain sizes or lengths given an offspring distribution and its associated parameters. 
-
-If we have observed a distribution of chains of sizes $1, 1, 4, 7$, we can calculate the log-likelihood of this observed chain by assuming the offspring per generation is Poisson distributed with a mean number of 0.5. 
-
-To do this, we run 
-
-```{r}
-chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
-chain_ll(chain_sizes, "pois", "size", lambda = 0.5)
-```
-
-The first argument of `chain_ll()` is the size (or length) distribution to analyse. The second argument (called `offspring`) specifies the offspring distribution. This is given as a function used to generate random offspring. It can be any probability distribution implemented in `R`, that is, one that has a corresponding function for generating random numbers beginning with the letter `r`. In the case of the example above, since random Poisson numbers are generated in `R` using a function called `rpois()`, the string to pass to the `offspring` argument is `"pois"`.
-
-The third argument (called `stat`) determines whether to analyse chain sizes (`"size"`, the default if this argument is not specified) or lengths (`"length"`). Lastly, any named arguments not recognised by `chain_ll()` are interpreted as parameters of the corresponding probability distribution, here `lambda = 0.5` as the mean of the Poisson distribution (see the `R` help page for the [Poisson distribution](https://stat.ethz.ch/R-manual/R-devel/library/stats/html/Poisson.html) for more information).
-
-To find out about usage of the `chains_ll()` function, you can use the `R` help file
-
-```{r eval=FALSE}
-?chains_ll
-```
-
-## Simulating branching processes
-
-To simulate a branching process, we use the `chain_sim()` function. This function follows the same syntax as `chain_ll()`, that is:
-
-```{r}
-chain_sim(n = 5, "pois", "size", lambda = 0.5)
-```
-
-# Methodology
-
-If the probability distribution of chain sizes or lengths has an analytical solution, this will be used (size distribution: Poisson and negative binomial; length distribution: Poisson and geometric). 
-
-If an analytical solution does not exist, simulations are used to approximate this probability distributions (using a linear approximation to the cumulative distribution for unobserved sizes/lengths). The argument `nsim_offspring` is used to specify the number of simulations to be used for this approximation. 
-
-For example, to get offspring drawn from a binomial distribution with probability `prob = 0.5`, we run
-
-```{r}
-chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5, nsim_offspring = 100)
-```
-
-# Imperfect observations
-
-If observations are imperfect, the `chain_ll()` function has an `obs_prob` argument that can be used to determine the likelihood. In that case, true chain sizes or lengths are simulated repeatedly (the number of times given by the `nsim_obs` argument), and the likelihood calculated for each of these simulations. 
-
-For example, if the probability of observing each case is $30%$, we use
-
-```{r}
-ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5, nsim_obs = 10)
-summary(ll)
-```
-
-This returns `nsim_obs = 10` likelihood values which can be averaged to come up with an overall likelihood estimate.
-
-# References
-
-* Farrington, C.P., Kanaan, M.N. and Gay, N.J. (2003). [Branching process models for surveillance of infectious diseases controlled by mass vaccination](https://doi.org/10.1093/biostatistics/4.2.279).
-* Blumberg, S. and Lloyd-Smith, J.O. (2013). [Comparing methods for estimating R0 from the size distribution of subcritical transmission chains](https://doi.org/10.1016/j.epidem.2013.05.002).

From 28440c1e37b5535c315e24cce93998eb41b85ed5 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 7 Feb 2023 11:53:50 +0000
Subject: [PATCH 093/828] removed a trailing ellipsis

---
 R/data.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/data.R b/R/data.R
index e134c1c3..4ecafc32 100644
--- a/R/data.R
+++ b/R/data.R
@@ -2,7 +2,7 @@
 #'
 #' An aggregated subset of the COVID-19 Data Repository for South Africa created, 
 #' maintained and hosted by Data Science for Social Impact research group, 
-#' led by Dr. Vukosi Marivate ...
+#' led by Dr. Vukosi Marivate.
 #' 
 #' The data is originally provided as a linelist but has been subsetted and 
 #' cleaned in `data-raw/covid19_sa.R`.

From 82ef9d9632333ec7f39f32235a0b98382434718a Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 7 Feb 2023 11:55:09 +0000
Subject: [PATCH 094/828] removed an unnecessary ellipsis

---
 R/data.R | 1 -
 1 file changed, 1 deletion(-)

diff --git a/R/data.R b/R/data.R
index 4ecafc32..760306bd 100644
--- a/R/data.R
+++ b/R/data.R
@@ -12,7 +12,6 @@
 #' \describe{
 #'   \item{date}{Date case was reported}
 #'   \item{cases}{Number of cases}
-#'   ...
 #' }
 #' @source <https://github.com/dsfsi/covid19za>
 #' Further details in `data-raw/covid19_sa.R`.

From 7b5fa1fdd198520992aee6c8aede0ca74f755e5e Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 7 Feb 2023 11:56:22 +0000
Subject: [PATCH 095/828] added a new line

---
 R/simulate.r | 1 +
 1 file changed, 1 insertion(+)

diff --git a/R/simulate.r b/R/simulate.r
index f7df35a4..c84a5bfa 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,4 +1,5 @@
 #' Simulate transmission chains using a branching process
+#' 
 #' @description \code{chain_sim()} is a stochastic simulator for generating 
 #' transmission chain data given information on the offspring distribution, 
 #' serial interval, time since the first case, etc. 

From 8612397f39e8d3a488b942fa9faa11179e51bb1c Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 7 Feb 2023 11:59:52 +0000
Subject: [PATCH 096/828] revised the description of chain_sim()

---
 R/simulate.r | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index c84a5bfa..4a54ee8c 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,8 +1,8 @@
 #' Simulate transmission chains using a branching process
 #' 
 #' @description \code{chain_sim()} is a stochastic simulator for generating 
-#' transmission chain data given information on the offspring distribution, 
-#' serial interval, time since the first case, etc. 
+#' transmission chain data with key inputs such as the offspring distribution and 
+#' serial interval distribution. 
 #' @param n Number of simulations to run.
 #' @param offspring Offspring distribution: a character string corresponding to
 #'   the R distribution function (e.g., "pois" for Poisson, where

From 6769f9bb48ac57ce6705f456b7331b027cda2201 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 7 Feb 2023 12:15:37 +0000
Subject: [PATCH 097/828] added calls to loaded packages

---
 data-raw/covid19_sa.R | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/data-raw/covid19_sa.R b/data-raw/covid19_sa.R
index 40afe3be..b6846d44 100644
--- a/data-raw/covid19_sa.R
+++ b/data-raw/covid19_sa.R
@@ -1,5 +1,9 @@
 ## code to prepare `covid_sa` dataset
 
+library(dplyr)
+library(lubridate)
+
+#Link to data
 data_url <- 'https://raw.githubusercontent.com/dsfsi/covid19za/master/data/covid19za_timeline_confirmed.csv'
 
 #Read the data in using the url

From 43a350073f6cd3ff90395ccfffd4106018ca8a9b Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Thu, 9 Feb 2023 09:53:30 +0000
Subject: [PATCH 098/828] stoppped suppressing warnings on package load

---
 README.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index 92e69135..e17fa4eb 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -46,7 +46,7 @@ devtools::install_github('epiverse-trace/bpmodels')
 To load the package, use
 
 ```{r echo=FALSE}
-suppressWarnings(library('bpmodels'))
+library('bpmodels')
 ```
 
 At the heart of the package are the `chains_ll()` and `chains_sim()` functions. 

From 41e928a4309fd07e65cddd6fdfd1c40ec09e188d Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Thu, 9 Feb 2023 14:08:22 +0000
Subject: [PATCH 099/828] added lintr

---
 .lintr | 4 ++++
 1 file changed, 4 insertions(+)
 create mode 100644 .lintr

diff --git a/.lintr b/.lintr
new file mode 100644
index 00000000..77eb4c15
--- /dev/null
+++ b/.lintr
@@ -0,0 +1,4 @@
+linters:linters_with_defaults() # see vignette("lintr")
+encoding:"UTF-8"
+
+

From 81200e4b0709691a3831b127abe78e198026dd52 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Thu, 9 Feb 2023 14:09:42 +0000
Subject: [PATCH 100/828] minor formatting of overview lines over 80 chars

---
 vignettes/projecting_incidence.Rmd | 8 +++++---
 1 file changed, 5 insertions(+), 3 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index c6d32423..baa2a5c7 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -33,10 +33,12 @@ knitr::opts_chunk$set(echo = TRUE,
 Branching processes can be used to project infectious disease trends provided 
 we can characterize the distribution of times between 
 successive cases (serial interval), and the distribution of secondary cases 
-produced by a single individual (offspring distribution). Such simulations can be achieved in `bpmodels` with the `chain_sim()` function. @Pearson2020, and 
+produced by a single individual (offspring distribution). Such simulations can 
+be achieved in `bpmodels` with the `chain_sim()` function. @Pearson2020, and 
 @abbott2020 illustrate its application to COVID-19. 
 
-The purpose of this vignette is to use early data on COVID-19 in South Africa [@marivate2020] to illustrate how `bpmodels` can be used to forecast an outbreak. 
+The purpose of this vignette is to use early data on COVID-19 in South Africa 
+[@marivate2020] to illustrate how `bpmodels` can be used to forecast an outbreak. 
 
 
 Let's load the required packages
@@ -54,7 +56,7 @@ We will get and clean the first $15$ days of the COVID-19
 outbreak in South Africa to seed the simulation for this example.
 
 ```{r data, message=FALSE}
-data_url <- 'https://raw.githubusercontent.com/dsfsi/covid19za/master/data/covid19za_timeline_confirmed.csv'
+data_url <- 'https://raw.githubusercontent.com/dsfsi/covid19za/master/data/covid19za_timeline_confirmed.csv' # nolint: line_length_linter. this comment overflows the default 80 chars line length.
 
 #Read the data in using the url
 covid19_sa <- read.csv(data_url)

From 919b3dc50ab146055982ab6d96f525a85b49bd49 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Thu, 9 Feb 2023 14:10:12 +0000
Subject: [PATCH 101/828] replaced rbind with bind_rows for uniformity

---
 vignettes/projecting_incidence.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index baa2a5c7..39ba2c8e 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -166,7 +166,7 @@ sim_chain_sizes <- lapply(
   }
 )
 
-sim_output <- do.call(rbind, sim_chain_sizes)
+sim_output <- bind_rows(sim_chain_sizes)
 
 head(sim_output)
 ```

From 74a634ab9316a281f6dbc818201b97a36c3dd20c Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Thu, 9 Feb 2023 14:25:16 +0000
Subject: [PATCH 102/828] removed namespacing and hardcoding of filter date

---
 data-raw/covid19_sa.R | 12 ++++++------
 1 file changed, 6 insertions(+), 6 deletions(-)

diff --git a/data-raw/covid19_sa.R b/data-raw/covid19_sa.R
index b6846d44..16af4fbf 100644
--- a/data-raw/covid19_sa.R
+++ b/data-raw/covid19_sa.R
@@ -11,11 +11,11 @@ covid19_sa <- read.csv(data_url)
 
 #Clean and subset the data we need
 covid19_sa <- covid19_sa %>% 
-  dplyr::select(date) %>% 
-  dplyr::mutate(date = lubridate::dmy(date)) %>%
-  dplyr::filter(date <= lubridate::dmy('20-03-2020')) %>%   
-  dplyr::group_by(date) %>% 
-  dplyr::summarise(cases = n()) %>%   
-  dplyr::ungroup()
+  select(date) %>% 
+  mutate(date = lubridate::dmy(date)) %>%
+  filter(date <= min(date) + lubridate::days(15)) %>%   
+  group_by(date) %>% 
+  summarise(cases = n()) %>%   
+  ungroup()
 
 usethis::use_data(covid19_sa, overwrite = TRUE)

From 8b8f11995f9080e30e45fde62c015fd842e14c91 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Thu, 9 Feb 2023 22:34:58 +0000
Subject: [PATCH 103/828] generated Rd files

---
 man/chain_sim.Rd  | 4 ++--
 man/covid19_sa.Rd | 3 +--
 2 files changed, 3 insertions(+), 4 deletions(-)

diff --git a/man/chain_sim.Rd b/man/chain_sim.Rd
index fb86c0bf..49caed28 100644
--- a/man/chain_sim.Rd
+++ b/man/chain_sim.Rd
@@ -59,8 +59,8 @@ element), and \code{generation}.}
 }
 \description{
 \code{chain_sim()} is a stochastic simulator for generating
-transmission chain data given information on the offspring distribution,
-serial interval, time since the first case, etc.
+transmission chain data with key inputs such as the offspring distribution and
+serial interval distribution.
 }
 \details{
 \code{chain_sim()} either returns a vector or a data.frame. The output is either a
diff --git a/man/covid19_sa.Rd b/man/covid19_sa.Rd
index f772efb3..395e13c3 100644
--- a/man/covid19_sa.Rd
+++ b/man/covid19_sa.Rd
@@ -11,7 +11,6 @@ A data frame with 19 rows and 2 columns:
 \describe{
 \item{date}{Date case was reported}
 \item{cases}{Number of cases}
-...
 }
 }
 }
@@ -25,7 +24,7 @@ covid19_sa
 \description{
 An aggregated subset of the COVID-19 Data Repository for South Africa created,
 maintained and hosted by Data Science for Social Impact research group,
-led by Dr. Vukosi Marivate ...
+led by Dr. Vukosi Marivate.
 }
 \details{
 The data is originally provided as a linelist but has been subsetted and

From 355eb368339796bc5575e21b96113480d3a0d4cf Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Thu, 9 Feb 2023 22:35:58 +0000
Subject: [PATCH 104/828] removed namespacing

---
 vignettes/projecting_incidence.Rmd | 30 ++++++++++++++++--------------
 1 file changed, 16 insertions(+), 14 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 39ba2c8e..b07c6ca6 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -38,40 +38,42 @@ be achieved in `bpmodels` with the `chain_sim()` function. @Pearson2020, and
 @abbott2020 illustrate its application to COVID-19. 
 
 The purpose of this vignette is to use early data on COVID-19 in South Africa 
-[@marivate2020] to illustrate how `bpmodels` can be used to forecast an outbreak. 
+[@marivate2020] to illustrate how `bpmodels` can be used to forecast an 
+outbreak. 
 
 
 Let's load the required packages
 
 ```{r packages, include=TRUE}
 library("bpmodels")
-library('dplyr')
-library('ggplot2')
-library('lubridate')
+library("dplyr")
+library("ggplot2")
+library("lubridate")
 ```
 
-### The data
+## Data
 
 We will get and clean the first $15$ days of the COVID-19 
 outbreak in South Africa to seed the simulation for this example.
 
 ```{r data, message=FALSE}
-data_url <- 'https://raw.githubusercontent.com/dsfsi/covid19za/master/data/covid19za_timeline_confirmed.csv' # nolint: line_length_linter. this comment overflows the default 80 chars line length.
+data_url <- 'https://raw.githubusercontent.com/dsfsi/covid19za/master/data/covid19za_timeline_confirmed.csv' # nolint: line_length_linter. 
 
 #Read the data in using the url
 covid19_sa <- read.csv(data_url)
 
 # Subset the first 15 days and count the number of cases per date
-covid19_sa <- covid19_sa %>%
-  dplyr::select(date) %>%
-  dplyr::mutate(date = lubridate::dmy(date)) %>%
-  dplyr::filter(date <= lubridate::dmy("20-03-2020")) %>%
-  dplyr::group_by(date) %>%
-  dplyr::summarise(cases = n()) %>%
-  dplyr::ungroup()
+covid19_sa <- covid19_sa %>% 
+  select(date) %>% 
+  mutate(date = lubridate::dmy(date)) %>%
+  filter(date <= min(date) + lubridate::days(15)) %>%   
+  group_by(date) %>% 
+  summarise(cases = n()) %>%   
+  ungroup()
 ```
 
-### Preparing the inputs  
+## Inputs  
+Using the data above, we will set up a vector of start times for each case.
 
 ```{r linelist_gen, message=FALSE}
 days_since_index <- as.integer(covid19_sa$date - min(covid19_sa$date))

From 13b46ff3f20fe3d5616f35c06645fc467d4b7d9d Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Thu, 9 Feb 2023 22:37:00 +0000
Subject: [PATCH 105/828] updated the vignette

---
 vignettes/projecting_incidence.Rmd | 77 +++++++++++++++++++-----------
 1 file changed, 50 insertions(+), 27 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index b07c6ca6..40050a62 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -87,10 +87,15 @@ start_times <- unlist(mapply(
 ```
 
 
-Additionally, `chain_sim()` requires other inputs, which we will specify below: 
+Additionally, `chain_sim()` requires the end time for the simulations and the 
+maximum size of each chain. Since each result of `chain_sim()` is stochastic,
+it is also best to run it many times. 
+
+We will specify these as follows:
 
 ```{r input_prep2, message=FALSE}
 #' Date to end simulation (14 day projection in this case)
+
 projection_window <- 14 # 14 days/ 2-week ahead projection
 
 projection_end_day <- max(days_since_index) + projection_window
@@ -103,39 +108,59 @@ chain_threshold <- 1000
 
 ```
 
-#### Serial interval
+### Serial interval
+
+In this example, we will assume based on COVID-19 literature that the 
+serial interval, $\mathcal{S}$, is log-normal distributed with parameters, 
+$\mu = 4.7$ and $\sigma = 2.9$ [@Pearson2020]. The log-normal mean, 
+$E[ \mathcal{S} ]$ and standard deviation $SD[ \mathcal{S} ]$ are 
+characterised as:
 
-We also assume based on COVID-19 literature that the 
-serial interval, $si$, is lognormal distributed as follows:
+\begin{align}
+E[ \mathcal{S} ] &= \ln \left( \dfrac{\mu^2}{(\sqrt{\mu^2 + \sigma^2}} \right) \\
 
-$ E[\text{si}] = \ln \left( \dfrac{\mu^2}{(\sqrt{\mu^2 + \sigma^2}} \right)$
+SD [ \mathcal{S} ] &= \sqrt {\ln \left(1 + \dfrac{\sigma^2}{\mu^2} \right)}
+ 
+\end{align}
 
-$ SD [\text{si}] = \sqrt {\ln \left(1 + \dfrac{\sigma^2}{\mu^2} \right)}$
+See [Wikipedia](https://en.wikipedia.org/wiki/Log-normal_distribution) for a 
+detailed explanation of this parametrisation.
 
-with $\mu = 4.7$ and standard deviation $\sigma = 2.9$.
+The following is how we set up the serial interval function using the
+information provided above:
 
 ```{r input_prep3, message=FALSE}
 mu <- 4.7
+
 sigma <- 2.9
 
-si_sd <- sqrt(log(1 + (sigma / mu)^2)) # log standard deviation
-si_mean <- log((mu^2) / (sqrt(sigma^2 + mu^2))) # log mean
+log_sd <- sqrt(log(1 + (sigma / mu)^2)) # log standard deviation
+
+log_mean <- log((mu^2) / (sqrt(sigma^2 + mu^2))) # log mean
 
 #' serial interval function
 serial_interval <- function(sample_size) {
-  si <- rlnorm(sample_size, meanlog = si_mean, sdlog = si_sd)
+  si <- rlnorm(sample_size, meanlog = log_mean, sdlog = log_sd)
   return(si)
 }
 ```
 
-#### Offspring distribution
+### Offspring distribution
 
-We assume an offspring distribution that is distributed as a negative binomial with $R = 2.5$ [@abbott2020] and $k = 0.58$ [@Wang2020]. In this parameterization, R represents the $\mathcal{R_0}$, which is defined as the average number of cases produced by a single individual in an entirely susceptible population. The parameter $k$ represents superspreading, that is, the degree of heterogeneity in transmission by single individuals.
+We will also assume that the offspring distribution is characterised by a 
+negative binomial with $\mathcal{R} = 2.5$ [@abbott2020] and 
+$\mathcal{k} = 0.58$ [@Wang2020]. In this parameterization, $\mathcal{R}$ 
+represents the $\mathcal{R_0}$, which is defined as the average number of 
+cases produced by a single individual in an entirely susceptible population. 
+The parameter $k$ represents superspreading, that is, the degree of 
+heterogeneity in transmission by single individuals.
 
-### Simulations
-To summarize the simulation set up, for each of the `r sim_rep` simulations, we want to project cases over a `r projection_window` day period since the last case, assuming that no chain would exceed `r chain_threshold`. 
+## Simulations
+To summarize the simulation set up, for each of the `r sim_rep` simulations,
+we want to project cases over a `r projection_window` day period since the 
+last case, assuming that no chain would exceed `r chain_threshold`. 
 
-#### Model assumptions
+### Model assumptions
 
 `chain_sim()` makes the following simplifying assumptions:
 
@@ -173,10 +198,11 @@ sim_output <- bind_rows(sim_chain_sizes)
 head(sim_output)
 ```
 
-From the simulated data, we count the median daily cases across 
-all simulations and overlay that over a plot of all the projections through time.
+## Post-processing
 
-#### Post-processing
+From the simulated data, we will count the median daily cases 
+(`median_daily_cases`) across all simulations and overlay that on the results 
+of all the projections through time (`incidence_ts`).
 
 ```{r post_processing}
 index_date <- min(covid19_sa$date)
@@ -191,7 +217,7 @@ incidence_ts <- sim_output %>%
 # Add dates
 incidence_ts <- incidence_ts %>%
   group_by(sim) %>%
-  mutate(date = index_date + (0:(n() - 1))) %>%
+  mutate(date = index_date + days(seq(0, n() - 1))) %>%
   ungroup()
 
 ## Median daily number of cases aggregated across all simulations
@@ -203,17 +229,17 @@ median_daily_cases <- incidence_ts %>%
 
 # Add dates
 median_daily_cases <- median_daily_cases %>%
-  mutate(date = index_date + 0:projection_end_day) %>%
+  mutate(date = index_date + days(seq(0, projection_end_day))) %>%
   ungroup()
 
 ```
 
 
-### Visualization
+## Visualization
 
 ```{r viz, fig.cap ="COVID-19 incidence projected over a two week window. The gray lines represent individual simulations, red connected dots represent the median daily cases across all simulations, and the black triangles represent the observed data.", fig.width=2.0, fig.height=1.8}
 # Visualization
-cases_plot <- ggplot(data = incidence_ts) +
+ggplot(data = incidence_ts) +
   geom_line(aes(
     x = date,
     y = cases,
@@ -260,10 +286,7 @@ cases_plot <- ggplot(data = incidence_ts) +
     labels = seq(0, max(incidence_ts$cases) + 200, 250)
   ) +
   labs(x = "Date", y = "Daily cases (median)") +
-  theme_minimal(base_size = 4) +
-  NULL
-
-print(cases_plot)
+  theme_minimal() 
 ```
 
-### References
+## References

From c1e2202ae7a0ecec7a026c7a540278aeca21700c Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 12:54:44 +0000
Subject: [PATCH 106/828] fixed url

---
 DESCRIPTION | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 0c6510f2..b9f1b68f 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -13,8 +13,8 @@ Description: Provides methods to analyse and simulate the size and length
     or length of infectious disease outbreaks, as discussed in Farrington
     et al. (2003) <doi:10.1093/biostatistics/4.2.279>.
 License: MIT + file LICENSE
-URL: https://github.com/sbfnk/bpmodels
-BugReports: https://github.com/sbfnk/bpmodels/issues
+URL: https://github.com/epiverse-trace/bpmodels
+BugReports: https://github.com/epiverse-trace/bpmodels/issues
 Depends: 
     R (>= 2.10)
 Suggests: 

From 548b415f6893d47d68b61f8bd970edea5f88cc78 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 12:55:17 +0000
Subject: [PATCH 107/828] added lintr workflow for changed files

---
 .github/workflows/lint_changed_files.yaml | 44 +++++++++++++++++++++++
 1 file changed, 44 insertions(+)
 create mode 100644 .github/workflows/lint_changed_files.yaml

diff --git a/.github/workflows/lint_changed_files.yaml b/.github/workflows/lint_changed_files.yaml
new file mode 100644
index 00000000..f39da76c
--- /dev/null
+++ b/.github/workflows/lint_changed_files.yaml
@@ -0,0 +1,44 @@
+# Workflow derived from https://github.com/r-lib/actions/tree/v2/examples
+# Need help debugging build failures? Start at https://github.com/r-lib/actions#where-to-find-help
+on:
+  pull_request:
+    branches: [main, master]
+
+name: lint-changed-files
+
+jobs:
+  lint-changed-files:
+    runs-on: ubuntu-latest
+    env:
+      GITHUB_PAT: ${{ secrets.GITHUB_TOKEN }}
+    steps:
+      - uses: actions/checkout@v3
+
+      - uses: r-lib/actions/setup-r@v2
+
+      - uses: r-lib/actions/setup-r-dependencies@v2
+        with:
+          extra-packages: |
+            any::gh
+            any::lintr
+            any::purrr
+          needs: check
+
+      - name: Add lintr options
+        run: |
+          cat('\noptions(lintr.linter_file = ".lintr")\n', file = "~/.Rprofile", append = TRUE)
+        shell: Rscript {0}
+
+      - name: Install package
+        run: R CMD INSTALL .
+
+      - name: Extract and lint files changed by this PR
+        run: |
+          files <- gh::gh("GET https://api.github.com/repos/${{ github.repository }}/pulls/${{ github.event.pull_request.number }}/files")
+          changed_files <- purrr::map_chr(files, "filename")
+          all_files <- list.files(recursive = TRUE)
+          exclusions_list <- as.list(setdiff(all_files, changed_files))
+          lintr::lint_package(exclusions = exclusions_list)
+        shell: Rscript {0}
+        env:
+          LINTR_ERROR_ON_LINT: true
\ No newline at end of file

From c01de4c01abcd1f9e563b1972b91a6f409bbc73b Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 13:11:09 +0000
Subject: [PATCH 108/828] fixed wrong reference to chain_ll() as chains_ll()

---
 README.Rmd | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/README.Rmd b/README.Rmd
index e17fa4eb..b1d3051c 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -49,11 +49,11 @@ To load the package, use
 library('bpmodels')
 ```
 
-At the heart of the package are the `chains_ll()` and `chains_sim()` functions. 
+At the heart of the package are the `chain_ll()` and `chain_sim()` functions. 
 
 ## Calculating log-likelihoods
 
-The `chains_ll()` function calculates the log-likelihood of a distribution of 
+The `chain_ll()` function calculates the log-likelihood of a distribution of 
 chain sizes or lengths given an offspring distribution and its associated 
 parameters. 
 
@@ -106,11 +106,11 @@ summary(ll)
 This returns `10` likelihood values (because `nsim_obs = 10`), which can be 
 averaged to come up with an overall likelihood estimate.
 
-To find out about usage of the `chains_ll()` function, you can use the `R` help 
+To find out about usage of the `chain_ll()` function, you can use the `R` help 
 file
 
 ```{r eval=FALSE}
-?chains_ll
+?chain_ll
 ```
 
 ## Simulating branching processes

From 439f284aaaad935484da178f03c7950070f20174 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 13:11:47 +0000
Subject: [PATCH 109/828] revised wrongly stated probability

---
 README.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index b1d3051c..78821492 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -94,7 +94,7 @@ case, true chain sizes or lengths are simulated repeatedly (the number of times
 given by the `nsim_obs` argument), and the likelihood calculated for each of 
 these simulations. 
 
-For example, if the probability of observing each case is $30%$, we use
+For example, if the probability of observing each case is $0.30$, we use
 
 ```{r}
 chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes

From 4be39a0026fb0618f7a638de9171da2019f526f5 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 13:12:14 +0000
Subject: [PATCH 110/828] polished some grammar

---
 README.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index 78821492..c19aad45 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -129,7 +129,7 @@ chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5)
 ```
 
 ### Simulating trees
-To simulate a tree of branching processes, we do specify the serial interval 
+To simulate a tree of branching processes, we specify the serial interval 
 generation function and set `tree = TRUE` as follows:
 
 ```{r}

From 19166cf9091c5b26d6d41dffa27d50da3bef9298 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 13:12:58 +0000
Subject: [PATCH 111/828] explicitly set

---
 README.Rmd | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.Rmd b/README.Rmd
index c19aad45..be7aceae 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -138,9 +138,9 @@ set.seed(13)
 serial_interval <- function(n){rlnorm(n, meanlog = 0.58, sdlog = 1.58)}
 
 chains_df <- chain_sim(n = 5, offspring = 'pois', lambda = 0.5, stat = 'length', 
-                       infinite = 100, serial = serial_interval)
+                       infinite = 100, serial = serial_interval, tree = TRUE)
 
-chains_df
+head(chains_df)
 ```
 
 
From 5be87151a7a5491143254aaf5f10887a5912db67 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 13:13:23 +0000
Subject: [PATCH 112/828] Generated README.md

---
 README.md | 48 +++++++++++++++++++++---------------------------
 1 file changed, 21 insertions(+), 27 deletions(-)

diff --git a/README.md b/README.md
index d0f94754..8fc1f96b 100644
--- a/README.md
+++ b/README.md
@@ -35,12 +35,12 @@ devtools::install_github('epiverse-trace/bpmodels')
 
 To load the package, use
 
-At the heart of the package are the `chains_ll()` and `chains_sim()`
+At the heart of the package are the `chain_ll()` and `chain_sim()`
 functions.
 
 ## Calculating log-likelihoods
 
-The `chains_ll()` function calculates the log-likelihood of a
+The `chain_ll()` function calculates the log-likelihood of a
 distribution of chain sizes or lengths given an offspring distribution
 and its associated parameters.
 
@@ -55,7 +55,7 @@ To do this, we run
 set.seed(13)
 chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
 chain_ll(x = chain_sizes, offspring = "pois", stat = "size", lambda = 0.5)
-#> [1] -8.607196
+#> [1] -8.607
 ```
 
 The first argument of `chain_ll()` is the size (or length) distribution
@@ -86,7 +86,7 @@ likelihood. In that case, true chain sizes or lengths are simulated
 repeatedly (the number of times given by the `nsim_obs` argument), and
 the likelihood calculated for each of these simulations.
 
-For example, if the probability of observing each case is $30%$, we use
+For example, if the probability of observing each case is $0.30$, we use
 
 ``` r
 chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
@@ -94,17 +94,17 @@ ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5,
                nsim_obs = 10)
 summary(ll)
 #>    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
-#>  -32.09  -26.52  -24.06  -24.94  -22.49  -19.14
+#>   -32.1   -26.5   -24.1   -24.9   -22.5   -19.1
 ```
 
 This returns `10` likelihood values (because `nsim_obs = 10`), which can
 be averaged to come up with an overall likelihood estimate.
 
-To find out about usage of the `chains_ll()` function, you can use the
+To find out about usage of the `chain_ll()` function, you can use the
 `R` help file
 
 ``` r
-?chains_ll
+?chain_ll
 ```
 
 ## Simulating branching processes
@@ -126,7 +126,7 @@ chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5)
 
 ### Simulating trees
 
-To simulate a tree of branching processes, we do specify the serial
+To simulate a tree of branching processes, we specify the serial
 interval generation function and set `tree = TRUE` as follows:
 
 ``` r
@@ -135,23 +135,16 @@ set.seed(13)
 serial_interval <- function(n){rlnorm(n, meanlog = 0.58, sdlog = 1.58)}
 
 chains_df <- chain_sim(n = 5, offspring = 'pois', lambda = 0.5, stat = 'length', 
-                       infinite = 100, serial = serial_interval)
-
-chains_df
-#>    n id ancestor generation        time
-#> 1  1  1       NA          1  0.00000000
-#> 2  2  1       NA          1  0.00000000
-#> 3  3  1       NA          1  0.00000000
-#> 4  4  1       NA          1  0.00000000
-#> 5  5  1       NA          1  0.00000000
-#> 6  1  2        1          2  0.04771887
-#> 7  5  2        1          2  5.57573333
-#> 8  5  3        1          2  0.11454421
-#> 9  1  3        2          3  2.64367236
-#> 10 5  4        2          3  6.57843219
-#> 11 1  4        3          4  2.96098160
-#> 12 5  5        4          4 10.28370183
-#> 13 5  6        5          5 10.37883069
+                       infinite = 100, serial = serial_interval, tree = TRUE)
+
+head(chains_df)
+#>   n id ancestor generation    time
+#> 1 1  1       NA          1 0.00000
+#> 2 2  1       NA          1 0.00000
+#> 3 3  1       NA          1 0.00000
+#> 4 4  1       NA          1 0.00000
+#> 5 5  1       NA          1 0.00000
+#> 6 1  2        1          2 0.04772
 ```
 
 # Methodology
@@ -171,7 +164,7 @@ probability `prob = 0.5`, we run
 
 ``` r
 chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5, nsim_offspring = 100)
-#> [1] -8.760539
+#> [1] -8.761
 ```
 
 ## Package vignettes
@@ -206,7 +199,7 @@ citation("bpmodels")
 #> 
 #> To cite package 'bpmodels' in publications use:
 #> 
-#>   Funk S, Finger F (????). _bpmodels: Analysing chain statistics using
+#>   Funk S, Finger F (2023). _bpmodels: Analysing chain statistics using
 #>   branching process models_. R package version 0.1.0,
 #>   <https://github.com/sbfnk/bpmodels>.
 #> 
@@ -215,6 +208,7 @@ citation("bpmodels")
 #>   @Manual{,
 #>     title = {bpmodels: Analysing chain statistics using branching process models},
 #>     author = {Sebastian Funk and Flavio Finger},
+#>     year = {2023},
 #>     note = {R package version 0.1.0},
 #>     url = {https://github.com/sbfnk/bpmodels},
 #>   }

From 05decc7f17162bba67c759bded9a6645f15cbef4 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 13:19:22 +0000
Subject: [PATCH 113/828] updated .gitignore

---
 .gitignore | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/.gitignore b/.gitignore
index 2d674619..211a19d8 100644
--- a/.gitignore
+++ b/.gitignore
@@ -29,3 +29,5 @@ vignettes/*.pdf
 rsconnect/
 /doc/
 /Meta/
+/docs/
+.DS_Store

From a6489f717091b002d21c5db03fbb228dfef89539 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 14:13:32 +0000
Subject: [PATCH 114/828] deleted introduction vignette

---
 vignettes/introduction.R | 31 -------------------------------
 1 file changed, 31 deletions(-)
 delete mode 100644 vignettes/introduction.R

diff --git a/vignettes/introduction.R b/vignettes/introduction.R
deleted file mode 100644
index af96b639..00000000
--- a/vignettes/introduction.R
+++ /dev/null
@@ -1,31 +0,0 @@
-## ----setup, include = FALSE---------------------------------------------------
-library('knitr')
-knitr::opts_chunk$set(
-  collapse = TRUE,
-  comment = "#>"
-)
-
-## ----eval=FALSE---------------------------------------------------------------
-#  library('bpmodels')
-
-## ----echo=FALSE---------------------------------------------------------------
-suppressWarnings(library('bpmodels'))
-set.seed(13)
-
-## -----------------------------------------------------------------------------
-chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
-chain_ll(chain_sizes, "pois", "size", lambda = 0.5)
-
-## ----eval=FALSE---------------------------------------------------------------
-#  ?chains_ll
-
-## -----------------------------------------------------------------------------
-chain_sim(n = 5, "pois", "size", lambda = 0.5)
-
-## -----------------------------------------------------------------------------
-chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5, nsim_offspring = 100)
-
-## -----------------------------------------------------------------------------
-ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5, nsim_obs = 10)
-summary(ll)
-

From f254355ec2f283a4d2162efa8d5fa932a7d6752b Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 15:40:28 +0000
Subject: [PATCH 115/828] turned off some lints

---
 .lintr | 6 +++++-
 1 file changed, 5 insertions(+), 1 deletion(-)

diff --git a/.lintr b/.lintr
index 77eb4c15..ae732225 100644
--- a/.lintr
+++ b/.lintr
@@ -1,4 +1,8 @@
-linters:linters_with_defaults() # see vignette("lintr")
+linters: linters_with_defaults(
+    line_length_linter(90), 
+    commented_code_linter = NULL,
+      object_name_linter = NULL
+  )
 encoding:"UTF-8"
 
 
From 3a086b6835862d138e1525ba85cf3c93d49090f4 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 15:41:14 +0000
Subject: [PATCH 116/828] removed .R file from vignettes

---
 vignettes/projecting_incidence.R | 120 -------------------------------
 1 file changed, 120 deletions(-)
 delete mode 100644 vignettes/projecting_incidence.R

diff --git a/vignettes/projecting_incidence.R b/vignettes/projecting_incidence.R
deleted file mode 100644
index f46303b5..00000000
--- a/vignettes/projecting_incidence.R
+++ /dev/null
@@ -1,120 +0,0 @@
-## ----setup, include=FALSE-----------------------------------------------------
-knitr::opts_chunk$set(echo = TRUE, 
-                      message = FALSE, 
-                      warning = FALSE, 
-                      collapse = TRUE,
-                      comment = "#>"
-                      )
-
-
-## ----loading_packages, include=TRUE-------------------------------------------
-library("bpmodels")
-library('dplyr')
-library('ggplot2')
-library('lubridate')
-
-## ----data_generation, message=FALSE-------------------------------------------
-set.seed(12)
-cases_df <- data.frame(date = as.Date('2023-01-01') + seq_len(12),
-                       cases = rnbinom(12, size = 7.5, mu = 5)
-                       )
-head(cases_df)
-
-ggplot(cases_df, 
-       aes(x = date, y = cases)
-       ) + 
-  geom_col(fill = 'tomato3', size = 1)
-
-## ----input_prep, message=FALSE------------------------------------------------
-# We will create a vector of starting times for each case, using the time of the index cases as the reference point
-cases_df$days_since_index <- as.integer(cases_df$date - min(cases_df$date))
-
-#'Disaggregate the time series 
-case_times <- unlist(mapply(function(x, y) rep(x, times = ifelse(y == 0, 1, y)), 
-                       cases_df$days_since_index, 
-                       cases_df$cases
-                       )
-                       )
-                       
-
-
-#' Date to end simulation (14 day projection in this case)
-projection_window <- 14 #2 week ahead projection
-project_to_date <- max(cases_df$days_since_index) + projection_window 
-
-
-#' Number of simulations and maximum chain size
-sim_rep <- 1000
-cases_to_project <- 1000
-
-
-### Specifying the `serial` argument to `chain_sim()`
-#' Assume serial interval follows log-normal distribution with mean, mu = 4.7, 
-#' and standard deviation, sigma = 2.9, then the desired standard deviation, si_sd, 
-#' and mean, si_mean, are
-sigma = 2.9
-mu = 4.7
-
-si_sd <- sqrt(log(1 + (sigma/mu)^2)) #log standard deviation
-si_mean <- log((mu^2)/(sqrt(sigma^2 + mu^2))) #log mean
-
-#' serial interval function
-serial_interval <- function(sample_size) {
-  si <- rlnorm(sample_size, meanlog = si_mean, sdlog = si_sd)
-  return(si)
-}
-
-## ----simulations, message=FALSE-----------------------------------------------
-## Chain log-likelihood simulation
-sim_chain_sizes <- lapply(seq_len(sim_rep),
-                           function(sim){chain_sim(
-                               n = length(case_times),
-                               offspring = "nbinom",
-                               mu = 2.0,
-                               size = 0.38,
-                               stat = "size",
-                               infinite = cases_to_project,
-                               serial = serial_interval,
-                               t0 = case_times,
-                               tf = project_to_date,
-                               tree = TRUE
-                           ) |> 
-                               mutate(sim = sim)} 
-                          )
-
-sim_output <- do.call(rbind, sim_chain_sizes) 
-
-## ----post_processing----------------------------------------------------------
-ref_date <- min(cases_df$date)
-
-incidence_ts <- sim_output |> 
-  mutate(day = floor(time)) |> 
-  group_by(sim, day) |> 
-  summarise(cases = n()) |>  
-  ungroup()
-
-
-## Median cases by date.  
-median_daily_cases <- incidence_ts |>
-  group_by(day)|>
-  summarise(median_cases = median(cases)) |>
-  ungroup()|>
-  arrange(day) |>
-  mutate(date = ymd(ref_date) + 0:(project_to_date - 1))
-
-
-## ----visualisation------------------------------------------------------------
-# Visualization
-cases_plot <- ggplot(data = median_daily_cases) +
-  geom_col(aes(x = date, y = median_cases),
-           fill = "tomato3",
-           size = 1
-  ) +
-  scale_y_continuous(breaks = seq(0, max(median_daily_cases$median_cases) + 20, 20),
-                     labels = seq(0, max(median_daily_cases$median_cases) + 20, 20)
-  ) +
-  labs(x = 'Date', y = 'Daily cases (median)') + 
-  theme_minimal(base_size = 14)
-
-print(cases_plot)
-

From fdf752613a3d3ce7aaa9ee1ec0570bfd36902491 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 15:43:03 +0000
Subject: [PATCH 117/828] styled the R/files

---
 R/borel.r                 |  14 +-
 R/data.R                  |  12 +-
 R/likelihoods.R           |  65 +++----
 R/simulate.r              | 357 ++++++++++++++++++++------------------
 R/simulate_susceptibles.R | 235 +++++++++++++------------
 R/utils.r                 |  12 +-
 6 files changed, 357 insertions(+), 338 deletions(-)

diff --git a/R/borel.r b/R/borel.r
index 56dc4331..9804cc55 100644
--- a/R/borel.r
+++ b/R/borel.r
@@ -5,11 +5,11 @@
 ##' @param log logical; if TRUE, probabilities p are given as log(p).
 ##' @return probability mass.
 ##' @author Sebastian Funk
-dborel <- function(x, mu, log=FALSE) {
-    if (x < 1) stop("'x' must be greater than 0")
-    ld <- -mu * x + (x - 1) * log(mu * x) - lgamma(x + 1)
-    if (!log) ld <- exp(ld)
-    return(ld)
+dborel <- function(x, mu, log = FALSE) {
+  if (x < 1) stop("'x' must be greater than 0")
+  ld <- -mu * x + (x - 1) * log(mu * x) - lgamma(x + 1)
+  if (!log) ld <- exp(ld)
+  return(ld)
 }
 
 ##' Generate random numbers from the Borel distribution
@@ -21,6 +21,6 @@ dborel <- function(x, mu, log=FALSE) {
 ##'     if this number is reached
 ##' @return vector of random numbers
 ##' @author Sebastian Funk
-rborel <- function(n, mu, infinite=Inf) {
-    chain_sim(n, "pois", "size", infinite=infinite, lambda=mu)
+rborel <- function(n, mu, infinite = Inf) {
+  chain_sim(n, "pois", "size", infinite = infinite, lambda = mu)
 }
diff --git a/R/data.R b/R/data.R
index 760306bd..3aaf5b78 100644
--- a/R/data.R
+++ b/R/data.R
@@ -1,10 +1,10 @@
 #' COVID-19 Data Repository for South Africa
 #'
-#' An aggregated subset of the COVID-19 Data Repository for South Africa created, 
-#' maintained and hosted by Data Science for Social Impact research group, 
-#' led by Dr. Vukosi Marivate.
-#' 
-#' The data is originally provided as a linelist but has been subsetted and 
+#' An aggregated subset of the COVID-19 Data Repository for South Africa 
+#' created, maintained and hosted by Data Science for Social Impact research 
+#' group, led by Dr. Vukosi Marivate.
+#'
+#' The data is originally provided as a linelist but has been subsetted and
 #' cleaned in `data-raw/covid19_sa.R`.
 #'
 #' @format ## `covid19_sa`
@@ -15,4 +15,4 @@
 #' }
 #' @source <https://github.com/dsfsi/covid19za>
 #' Further details in `data-raw/covid19_sa.R`.
-"covid19_sa"
\ No newline at end of file
+"covid19_sa"
diff --git a/R/likelihoods.R b/R/likelihoods.R
index 70778346..3be65722 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -5,8 +5,7 @@
 #' @return log-likelihood values
 #' @author Sebastian Funk
 #' @keywords internal
-pois_size_ll <- function(x, lambda)
-{
+pois_size_ll <- function(x, lambda) {
   (x - 1) * log(lambda) - lambda * x + (x - 2) * log(x) - lgamma(x)
 }
 
@@ -22,14 +21,13 @@ pois_size_ll <- function(x, lambda)
 #' @return log-likelihood values
 #' @author Sebastian Funk
 #' @keywords internal
-nbinom_size_ll <- function(x, size, prob, mu)
-{
+nbinom_size_ll <- function(x, size, prob, mu) {
   if (!missing(prob)) {
     if (!missing(mu)) stop("'prob' and 'mu' both specified")
     mu <- size * (1 - prob) / prob
   }
   lgamma(size * x + (x - 1)) - (lgamma(size * x) + lgamma(x + 1)) +
-    (x - 1) * log (mu / size) -
+    (x - 1) * log(mu / size) -
     (size * x + (x - 1)) * log(1 + mu / size)
 }
 
@@ -66,7 +64,7 @@ pois_length_ll <- function(x, lambda) {
   ## iterated exponential function
   arg <- exp(lambda * exp(-lambda))
   itex <- 1
-  for (i in seq_len(max(x))) itex <- c(itex, arg ^ itex[i])
+  for (i in seq_len(max(x))) itex <- c(itex, arg^itex[i])
 
   Gk <- c(0, exp(-lambda) * itex) ## set G_{0}=1
 
@@ -82,11 +80,10 @@ pois_length_ll <- function(x, lambda) {
 #' @author Sebastian Funk
 #' @keywords internal
 geom_length_ll <- function(x, prob) {
-
   lambda <- 1 / prob
   ## G(k) - G(k - 1)
-  GkmGkm1 <- (1 - lambda ^ (x)) / (1 - lambda ^ (x + 1)) -
-    (1 - lambda ^ (x - 1)) / (1 - lambda ^ (x))
+  GkmGkm1 <- (1 - lambda^(x)) / (1 - lambda^(x + 1)) -
+    (1 - lambda^(x - 1)) / (1 - lambda^(x))
 
   log(GkmGkm1)
 }
@@ -105,15 +102,16 @@ geom_length_ll <- function(x, prob) {
 #' @inheritParams chain_ll
 #' @inheritParams chain_sim
 #' @keywords internal
-offspring_ll <- function(x, offspring, stat, nsim_offspring=100, ...) {
-
+offspring_ll <- function(x, offspring, stat, nsim_offspring = 100, ...) {
   dist <- chain_sim(nsim_offspring, offspring, stat, ...)
 
   ## linear approximation
   f <- stats::ecdf(dist)
   acdf <-
-    diff(c(0, stats::approx(unique(dist), f(unique(dist)),
-                            seq_len(max(dist[is.finite(dist)])))$y))
+    diff(c(0, stats::approx(
+      unique(dist), f(unique(dist)),
+      seq_len(max(dist[is.finite(dist)]))
+    )$y))
   lik <- acdf[x]
   lik[is.na(lik)] <- 0
   log(lik)
@@ -126,21 +124,24 @@ offspring_ll <- function(x, offspring, stat, nsim_offspring=100, ...) {
 #' @param obs_prob observation probability (assumed constant)
 #' @param infinite any chains of this size/length will be treated as infinite
 #' @param exclude any sizes/lengths to exclude from the likelihood calculation
-#' @param individual if TRUE, a vector of individual log-likelihood contributions will be returned rather than the sum
+#' @param individual if TRUE, a vector of individual log-likelihood 
+#' contributions will be returned rather than the sum
 #' @param nsim_obs number of simulations if the likelihood is to be
 #'   approximated for imperfect observations
 #' @param ... parameters for the offspring distribution
-#' @return likelihood, or vector of likelihoods (if \code{obs_prob} < 1), or a list of individual likelihood contributions (if \code{individual=TRUE})
+#' @return likelihood, or vector of likelihoods (if \code{obs_prob} < 1), or
+#'  a list of individual likelihood contributions (if \code{individual=TRUE})
 #' @inheritParams chain_sim
 #' @seealso pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll
 #'   geom_length_ll offspring_ll
 #' @author Sebastian Funk
 #' @export
 #' @examples
-#' chain_sizes <- c(1,1,4,7) # example of observed chain sizes
-#' chain_ll(chain_sizes, "pois", "size", lambda=0.5)
-chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
-                     infinite = Inf, exclude=c(), individual=FALSE, nsim_obs, ...) {
+#' chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
+#' chain_ll(chain_sizes, "pois", "size", lambda = 0.5)
+chain_ll <- function(x, offspring, stat = c("size", "length"), obs_prob = 1,
+                     infinite = Inf, exclude = c(), individual = FALSE, 
+                     nsim_obs, ...) {
   stat <- match.arg(stat)
 
   ## checks
@@ -152,13 +153,14 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
     if (missing(nsim_obs)) {
       stop("'nsim_obs' must be specified if 'obs_prob' is <1")
     }
-    if (stat=="size") {
+    if (stat == "size") {
       sample_func <- rbinom_size
-    } else if (stat=="length"){
+    } else if (stat == "length") {
       sample_func <- rgen_length
     }
     sampled_x <-
-      replicate(nsim_obs, pmin(sample_func(length(x), x, obs_prob), infinite), simplify = FALSE)
+      replicate(nsim_obs, pmin(sample_func(length(x), x, obs_prob), 
+                               infinite), simplify = FALSE)
     size_x <- unlist(sampled_x)
     if (!is.finite(infinite)) infinite <- max(size_x) + 1
   } else {
@@ -169,25 +171,29 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
 
   ## determine for which sizes to calculate the likelihood (for true chain size)
   if (any(size_x == infinite)) {
-    calc_sizes <- seq_len(infinite-1)
+    calc_sizes <- seq_len(infinite - 1)
   } else {
     calc_sizes <- unique(c(size_x, exclude))
   }
 
   ## get likelihood function as given by `offspring` and `stat``
   likelihoods <- c()
-  ll_func <- paste(offspring, stat, "ll", sep="_")
+  ll_func <- paste(offspring, stat, "ll", sep = "_")
   pars <- as.list(unlist(list(...))) ## converts vectors to lists
 
   ## calculate likelihoods
-  if (exists(ll_func, where=asNamespace('bpmodels'), mode='function')) {
+  if (exists(ll_func, where = asNamespace("bpmodels"), mode = "function")) {
     func <- get(ll_func)
-    likelihoods[calc_sizes] <- do.call(func, c(list(x=calc_sizes), pars))
+    likelihoods[calc_sizes] <- do.call(func, c(list(x = calc_sizes), pars))
   } else {
     likelihoods[calc_sizes] <-
-      do.call(offspring_ll,
-              c(list(x=calc_sizes, offspring=offspring,
-                     stat=stat, infinite=infinite), pars))
+      do.call(
+        offspring_ll,
+        c(list(
+          x = calc_sizes, offspring = offspring,
+          stat = stat, infinite = infinite
+        ), pars)
+      )
   }
 
   ## assign probabilities to infinite outbreak sizes
@@ -213,4 +219,3 @@ chain_ll <- function(x, offspring, stat=c("size", "length"), obs_prob=1,
 
   return(chains_likelihood)
 }
-
diff --git a/R/simulate.r b/R/simulate.r
index 4a54ee8c..774369f8 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,218 +1,235 @@
 #' Simulate transmission chains using a branching process
-#' 
-#' @description \code{chain_sim()} is a stochastic simulator for generating 
-#' transmission chain data with key inputs such as the offspring distribution and 
-#' serial interval distribution. 
+#'
+#' @description \code{chain_sim()} is a stochastic simulator for generating
+#' transmission chain data with key inputs such as the offspring distribution 
+#' and serial interval distribution.
 #' @param n Number of simulations to run.
 #' @param offspring Offspring distribution: a character string corresponding to
 #'   the R distribution function (e.g., "pois" for Poisson, where
-#'   \code{\link{rpois}} is the R function to generate Poisson random numbers) 
+#'   \code{\link{rpois}} is the R function to generate Poisson random numbers)
 #' @param stat String; Statistic to calculate. Can be one of:
 #' \itemize{
 #'   \item "size": the total number of offspring.
-#'   \item "length": the total number of ancestors. 
+#'   \item "length": the total number of ancestors.
 #' }
-#' @param infinite A size or length above which the simulation results should be 
-#' set to `Inf`. Defaults to `Inf`, resulting in no results ever set to `Inf`
-#' @param tree Logical. Should the transmission tree be returned? Defaults to `FALSE`.
-#' @param serial The serial interval generator function; the name of a user-defined 
-#' named or anonymous function with only one argument `n`, representing the number 
-#' of serial intervals to generate.
-#' @param t0 Start time (if serial interval is given); either a single value or a 
-#' vector of length `n` (number of simulations) with initial times. Defaults to 0.  
+#' @param infinite A size or length above which the simulation results 
+#' should be set to `Inf`. Defaults to `Inf`, resulting in no results 
+#' ever set to `Inf`
+#' @param tree Logical. Should the transmission tree be returned? Defaults 
+#' to `FALSE`.
+#' @param serial The serial interval generator function; the name of a 
+#' user-defined named or anonymous function with only one argument `n`, 
+#' representing the number of serial intervals to generate.
+#' @param t0 Start time (if serial interval is given); either a single value 
+#' or a vector of length `n` (number of simulations) with initial times. 
+#' Defaults to 0.
 #' @param tf End time (if serial interval is given).
 #' @param ... Parameters of the offspring distribution as required by R.
-#' @return Either: 
+#' @return Either:
 #' \itemize{
 #'  \item{A vector of sizes/lengths (if \code{tree == FALSE} OR serial
-#'   interval function not specified, since that implies \code{tree == FALSE})}, or 
-#'   \item {a data frame with 
-#'   columns `n` (simulation ID), `time` (if the serial interval is given) and 
-#'   (if \code{tree == TRUE}), `id` (a unique ID within each simulation for each 
-#'   individual element of the chain), `ancestor` (the ID of the ancestor of each 
-#'   element), and `generation`.}
+#'   interval function not specified, since that implies 
+#'   \code{tree == FALSE})}, or
+#'   \item {a data frame with
+#'   columns `n` (simulation ID), `time` (if the serial interval is given) and
+#'   (if \code{tree == TRUE}), `id` (a unique ID within each simulation for 
+#'   each individual element of the chain), `ancestor` (the ID of the 
+#'   ancestor of each element), and `generation`.}
 #' }
 #' @author Sebastian Funk, James M. Azam
 #' @export
-#' @details 
-#' `chain_sim()` either returns a vector or a data.frame. The output is either a 
-#' vector if `serial` is not provided, which automatically sets \code{tree = FALSE},
-#' or a `data.frame`, which means that `serial` was provided as a function. When `serial`
-#' is provided, it means \code{tree = TRUE} automatically. However, setting 
-#' \code{tree = TRUE} would require providing a function for `serial`.
-#' 
+#' @details
+#' `chain_sim()` either returns a vector or a data.frame. The output is 
+#' either a vector if `serial` is not provided, which automatically sets 
+#' \code{tree = FALSE}, or a `data.frame`, which means that `serial` was 
+#' provided as a function. When `serial` is provided, it means 
+#' \code{tree = TRUE} automatically. However, setting \code{tree = TRUE} 
+#' would require providing a function for `serial`.
+#'
 #' # The serial interval (`serial`):
-#' 
+#'
 #' ## Assumptions/disambiguation
-#' 
-#' In epidemiology, the generation interval is the duration between successive 
-#' infectious events in a chain of transmission. Similarly, the serial interval is the 
-#' duration between observed symptom onset times between successive 
-#' cases in a transmission chain. The generation interval is often hard to observe 
-#' because exact times of infection are hard to measure hence, the serial interval
-#' is often used instead. Here, we use the serial interval to represent what would 
-#' normally be called the generation interval, that is, the time between successive
-#' cases. 
-#' 
+#'
+#' In epidemiology, the generation interval is the duration between successive
+#' infectious events in a chain of transmission. Similarly, the serial 
+#' interval is the duration between observed symptom onset times between 
+#' successive cases in a transmission chain. The generation interval is 
+#' often hard to observe because exact times of infection are hard to 
+#' measure hence, the serial interval is often used instead. Here, we 
+#' use the serial interval to represent what would normally be called the 
+#' generation interval, that is, the time between successive cases.
+#'
 #' ## Specifying `serial` in `chain_sim()`
-#' 
-#' `serial` must be specified as a named or 
-#' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R) 
-#' with one argument. 
-#' 
-#' If `serial` is specified, `chain_sim()` returns times of 
-#' infection as a column in the output. Moreover, specifying a function for `serial` implies 
-#' \code{tree = TRUE} and a tree of infectors (`ancestor`) and infectees (`id`) 
-#' will be generated in the output. 
-#' 
-#' For example, assuming we want to specify the serial interval 
-#' generator as a random log-normally distributed variable with `meanlog = 0.58` 
-#' and `sdlog = 1.58`, we could define a named function, let's call it 
-#' "serial_interval", with only one argument representing the number of serial 
-#' intervals to sample: \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}}, 
-#' and assign the name of the function to serial in `chain_sim()` like so 
-#' \code{chain_sim(..., serial = serial_interval)}, 
-#' where `...` are the other arguments to `chain_sim()`. Alternatively, we 
-#' could assign an anonymous function to serial in the `chain_sim()` call like so
-#' \code{chain_sim(..., serial = function(n){rlnorm(n, 0.58, 1.38)})}, 
+#'
+#' `serial` must be specified as a named or
+#' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R) # nolint
+#' with one argument.
+#'
+#' If `serial` is specified, `chain_sim()` returns times of
+#' infection as a column in the output. Moreover, specifying a function 
+#' for `serial` implies \code{tree = TRUE} and a tree of 
+#' infectors (`ancestor`) and infectees (`id`) will be generated in the output.
+#'
+#' For example, assuming we want to specify the serial interval
+#' generator as a random log-normally distributed variable with 
+#' `meanlog = 0.58` and `sdlog = 1.58`, we could define a named function, 
+#' let's call it "serial_interval", with only one argument representing the 
+#' number of serial intervals to sample: 
+#' \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
+#' and assign the name of the function to serial in `chain_sim()` like so
+#' \code{chain_sim(..., serial = serial_interval)},
+#' where `...` are the other arguments to `chain_sim()`. Alternatively, we
+#' could assign an anonymous function to serial in the `chain_sim()` call 
+#' like so \code{chain_sim(..., serial = function(n){rlnorm(n, 0.58, 1.38)})},
 #' where `...` are the other arguments to `chain_sim()`.
 #' @examples
 #' # Specifying no `serial` and `tree == FALSE` (default) returns a vector
 #' set.seed(123)
-#' chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5, tree = FALSE)
-#' 
-#' # Specifying `serial` without specifying `tree` will set `tree = TRUE` internally.
-#'  
-#' # We'll first define the serial function 
+#' chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5, 
+#' tree = FALSE)
+#'
+#' # Specifying `serial` without specifying `tree` will set `tree = TRUE` 
+#' internally.
+#'
+#' # We'll first define the serial function
 #' set.seed(123)
-#' serial_interval <- function(n){rlnorm(n, meanlog = 0.58, sdlog = 1.58)}
-#' chain_sim(n = 5, offspring = 'pois', lambda = 0.5, stat = 'length', infinite = 100, 
-#' serial = serial_interval)
-#' 
-#' # Specifying `serial` and `tree = FALSE` will throw an error 
+#' serial_interval <- function(n) {
+#'   rlnorm(n, meanlog = 0.58, sdlog = 1.58)
+#' }
+#' chain_sim(
+#'   n = 5, offspring = "pois", lambda = 0.5, stat = "length", 
+#'   infinite = 100,
+#'   serial = serial_interval
+#' )
+#'
+#' # Specifying `serial` and `tree = FALSE` will throw an error
 #' set.seed(123)
 #' \dontrun{
-#' try(chain_sim(n = 10, serial = function(x) 3, offspring = "pois", lambda = 2, 
-#' infinite = 10, tree = FALSE)
-#' )
+#' try(chain_sim(
+#'   n = 10, serial = function(x) 3, offspring = "pois", lambda = 2,
+#'   infinite = 10, tree = FALSE
+#' ))
 #' }
 chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
                       tree = FALSE, serial, t0 = 0, tf = Inf, ...) {
-    stat <- match.arg(stat)
+  stat <- match.arg(stat)
 
-    ## first, get random function as given by `offspring`
-    if (!is.character(offspring)) {
-        stop("object passed as 'offspring' is not a character string. Did you forget
+  ## first, get random function as given by `offspring`
+  if (!is.character(offspring)) {
+    stop("object passed as 'offspring' is not a character string. Did you forget
              to enclose it in quotes?")
-    }
+  }
 
-    roffspring_name <- paste0("r", offspring)
-    if (!(exists(roffspring_name)) || !is.function(get(roffspring_name))) {
-        stop("Function ", roffspring_name, " does not exist.")
-    }
+  roffspring_name <- paste0("r", offspring)
+  if (!(exists(roffspring_name)) || !is.function(get(roffspring_name))) {
+    stop("Function ", roffspring_name, " does not exist.")
+  }
 
-    if (!missing(serial)) {
-        if (!is.function(serial)) {
-            stop("The `serial` argument must be a function (see details in ?chain_sim()).")
-        }
-        if (!missing(tree) && tree == FALSE) {
-            stop("If `serial` is specified, then `tree` cannot be set to `FALSE`.")
-        }
-        tree <- TRUE
-    } else if (!missing(tf)) {
-        stop("If `tf` is specified, `serial` must be specified too.")
+  if (!missing(serial)) {
+    if (!is.function(serial)) {
+      stop("The `serial` argument must be a function (see details in ?chain_sim()).") # nolint
+    }
+    if (!missing(tree) && tree == FALSE) {
+      stop("If `serial` is specified, then `tree` cannot be set to `FALSE`.")
     }
+    tree <- TRUE
+  } else if (!missing(tf)) {
+    stop("If `tf` is specified, `serial` must be specified too.")
+  }
 
-    stat_track <- rep(1, n) ## track length or size (depending on `stat`)
-    n_offspring <- rep(1, n) ## current number of offspring
-    sim <- seq_len(n) ## track chains that are still being simulated
+  stat_track <- rep(1, n) ## track length or size (depending on `stat`)
+  n_offspring <- rep(1, n) ## current number of offspring
+  sim <- seq_len(n) ## track chains that are still being simulated
 
-    ## initialise data frame to hold the trees
-    if (tree) {
-        generation <- 1L
-        tdf <-
-            data.frame(n = seq_len(n),
-                       id = 1L,
-                       ancestor = NA_integer_,
-                       generation = generation)
+  ## initialise data frame to hold the trees
+  if (tree) {
+    generation <- 1L
+    tdf <-
+      data.frame(
+        n = seq_len(n),
+        id = 1L,
+        ancestor = NA_integer_,
+        generation = generation
+      )
 
-        ancestor_ids <- rep(1, n)
-        if (!missing(serial)) {
-            tdf$time <- t0
-            times <- tdf$time
-        }
+    ancestor_ids <- rep(1, n)
+    if (!missing(serial)) {
+      tdf$time <- t0
+      times <- tdf$time
     }
+  }
 
-    ## next, simulate n chains
-    while (length(sim) > 0) {
-        ## simulate next generation
-        next_gen <- get(roffspring_name)(n=sum(n_offspring[sim]), ...)
-        if (any(next_gen %% 1 > 0)) {
-            stop("Offspring distribution must return integers")
-        }
+  ## next, simulate n chains
+  while (length(sim) > 0) {
+    ## simulate next generation
+    next_gen <- get(roffspring_name)(n = sum(n_offspring[sim]), ...)
+    if (any(next_gen %% 1 > 0)) {
+      stop("Offspring distribution must return integers")
+    }
 
-        ## record indices corresponding to the number of offspring
-        indices <- rep(sim, n_offspring[sim])
+    ## record indices corresponding to the number of offspring
+    indices <- rep(sim, n_offspring[sim])
 
-        ## initialise number of offspring
-        n_offspring <- rep(0, n)
-        ## assign offspring sum to indices still being simulated
-        n_offspring[sim] <- tapply(next_gen, indices, sum)
+    ## initialise number of offspring
+    n_offspring <- rep(0, n)
+    ## assign offspring sum to indices still being simulated
+    n_offspring[sim] <- tapply(next_gen, indices, sum)
 
-        ## track size/length
-        if (stat=="size") {
-            stat_track <- stat_track + n_offspring
-        } else if (stat=="length") {
-            stat_track <- stat_track + pmin(1, n_offspring)
-        }
+    ## track size/length
+    if (stat == "size") {
+      stat_track <- stat_track + n_offspring
+    } else if (stat == "length") {
+      stat_track <- stat_track + pmin(1, n_offspring)
+    }
 
-        ## record times/ancestors (if tree==TRUE)
-        if (tree && sum(n_offspring[sim]) > 0) {
-            ancestors <- rep(ancestor_ids, next_gen)
-            current_max_id <- unname(tapply(ancestor_ids, indices, max))
-            indices <- rep(sim, n_offspring[sim])
-            ids <- rep(current_max_id, n_offspring[sim]) +
-                unlist(lapply(n_offspring[sim], seq_len))
-            generation <- generation + 1L
-            new_df <-
-                data.frame(n = indices,
-                           id = ids,
-                           ancestor = ancestors,
-                           generation = generation)
-            if (!missing(serial)) {
-                times <- rep(times, next_gen) + serial(sum(n_offspring))
-                current_min_time <- unname(tapply(times, indices, min))
-                new_df$time <- times
-            }
-            tdf <- rbind(tdf, new_df)
-        }
+    ## record times/ancestors (if tree==TRUE)
+    if (tree && sum(n_offspring[sim]) > 0) {
+      ancestors <- rep(ancestor_ids, next_gen)
+      current_max_id <- unname(tapply(ancestor_ids, indices, max))
+      indices <- rep(sim, n_offspring[sim])
+      ids <- rep(current_max_id, n_offspring[sim]) +
+        unlist(lapply(n_offspring[sim], seq_len))
+      generation <- generation + 1L
+      new_df <-
+        data.frame(
+          n = indices,
+          id = ids,
+          ancestor = ancestors,
+          generation = generation
+        )
+      if (!missing(serial)) {
+        times <- rep(times, next_gen) + serial(sum(n_offspring))
+        current_min_time <- unname(tapply(times, indices, min))
+        new_df$time <- times
+      }
+      tdf <- rbind(tdf, new_df)
+    }
 
-        ## only continue to simulate chains that offspring and aren't of
-        ## infinite size/length
-        sim <- which(n_offspring > 0 & stat_track < infinite)
-        if (length(sim) > 0) {
-            if (!missing(serial)) {
-                ## only continue to simulate chains that don't go beyond tf
-                sim <- intersect(sim, unique(indices)[current_min_time < tf])
-            }
-            if (tree) {
-                if (!missing(serial)) {
-                    times <- times[indices %in% sim]
-                }
-                ancestor_ids <- ids[indices %in% sim]
-            }
+    ## only continue to simulate chains that offspring and aren't of
+    ## infinite size/length
+    sim <- which(n_offspring > 0 & stat_track < infinite)
+    if (length(sim) > 0) {
+      if (!missing(serial)) {
+        ## only continue to simulate chains that don't go beyond tf
+        sim <- intersect(sim, unique(indices)[current_min_time < tf])
+      }
+      if (tree) {
+        if (!missing(serial)) {
+          times <- times[indices %in% sim]
         }
+        ancestor_ids <- ids[indices %in% sim]
+      }
     }
+  }
 
-    if (tree) {
-        if (!missing(tf)) {
-            tdf <- tdf[tdf$time < tf, ]
-        }
-        rownames(tdf) <- NULL
-        return(tdf)
-    } else {
-        stat_track[stat_track >= infinite] <- Inf
-        return(stat_track)
+  if (tree) {
+    if (!missing(tf)) {
+      tdf <- tdf[tdf$time < tf, ]
     }
+    rownames(tdf) <- NULL
+    return(tdf)
+  } else {
+    stat_track[stat_track >= infinite] <- Inf
+    return(stat_track)
+  }
 }
-
diff --git a/R/simulate_susceptibles.R b/R/simulate_susceptibles.R
index 1ece7158..32013be8 100644
--- a/R/simulate_susceptibles.R
+++ b/R/simulate_susceptibles.R
@@ -29,131 +29,128 @@
 #' @author Flavio Finger
 #' @export
 #' @examples
-#' chain_sim_susc("pois", mn_offspring=0.5, serial = function(x) 3, pop = 100)
-chain_sim_susc <- function(
-    offspring = c("pois", "nbinom"),
-    mn_offspring,
-    disp_offspring,
-    serial,
-    t0 = 0,
-    tf = Inf,
-    pop,
-    initial_immune = 0
-) {
-
-    offspring <- match.arg(offspring)
-
-    if (offspring == "pois") {
-        if (!missing(disp_offspring)) {
-            warning("argument disp_offspring not used for
+#' chain_sim_susc("pois", mn_offspring = 0.5, serial = function(x) 3, pop = 100)
+chain_sim_susc <- function(offspring = c("pois", "nbinom"),
+                           mn_offspring,
+                           disp_offspring,
+                           serial,
+                           t0 = 0,
+                           tf = Inf,
+                           pop,
+                           initial_immune = 0) {
+  offspring <- match.arg(offspring)
+
+  if (offspring == "pois") {
+    if (!missing(disp_offspring)) {
+      warning("argument disp_offspring not used for
                 poisson offspring distribution.")
-        }
-
-        ## using a right truncated poisson distribution
-        ## to avoid more cases than susceptibles
-        offspring_fun <- function(n, susc) {
-            truncdist::rtrunc(
-                n,
-                spec = "pois",
-                lambda = mn_offspring * susc / pop,
-                b = susc)
-            }
-
-    } else if (offspring  == "nbinom") {
-
-        if (disp_offspring <= 1) { ## dispersion index
-            stop("Offspring distribution 'nbinom' requires argument
+    }
+
+    ## using a right truncated poisson distribution
+    ## to avoid more cases than susceptibles
+    offspring_fun <- function(n, susc) {
+      truncdist::rtrunc(
+        n,
+        spec = "pois",
+        lambda = mn_offspring * susc / pop,
+        b = susc
+      )
+    }
+  } else if (offspring == "nbinom") {
+    if (disp_offspring <= 1) { ## dispersion index
+      stop("Offspring distribution 'nbinom' requires argument
                 disp_offspring > 1. Use 'pois' if there is no overdispersion.")
-        }
-
-        offspring_fun <- function(n, susc) {
-            ## get distribution params from mean and dispersion
-            ## see ?rnbinom for parameter definition
-            new_mn <- mn_offspring * susc / pop ##apply susceptibility
-            size <- new_mn / (disp_offspring - 1)
-
-            ## using a right truncated nbinom distribution
-            ## to avoid more cases than susceptibles
-            truncdist::rtrunc(
-                n,
-                spec = "nbinom",
-                b = susc,
-                mu = new_mn,
-                size = size)
-        }
     }
 
-    ## initializations
-    tdf <- data.frame(
-        id = 1L,
-        ancestor = NA_integer_,
-        generation = 1L,
-        time = t0,
+    offspring_fun <- function(n, susc) {
+      ## get distribution params from mean and dispersion
+      ## see ?rnbinom for parameter definition
+      new_mn <- mn_offspring * susc / pop ## apply susceptibility
+      size <- new_mn / (disp_offspring - 1)
+
+      ## using a right truncated nbinom distribution
+      ## to avoid more cases than susceptibles
+      truncdist::rtrunc(
+        n,
+        spec = "nbinom",
+        b = susc,
+        mu = new_mn,
+        size = size
+      )
+    }
+  }
+
+  ## initializations
+  tdf <- data.frame(
+    id = 1L,
+    ancestor = NA_integer_,
+    generation = 1L,
+    time = t0,
+    offspring_generated = FALSE
+  )
+
+  susc <- pop - initial_immune - 1L
+  t <- t0
+
+  ## continue if any unsimulated has t <= tf
+  ## AND there is still susceptibles left
+  while (
+    any(tdf$time[!tdf$offspring_generated] <= tf) &&
+      susc > 0
+  ) {
+
+    ## select from which case to generate offspring
+    t <- min(tdf$time[!tdf$offspring_generated]) # lowest unsimulated t
+
+    ## index of the first in df with t, extract vars
+    idx <- which(tdf$time == t & !tdf$offspring_generated)[1]
+    id_parent <- tdf$id[idx]
+    t_parent <- tdf$time[idx]
+    gen_parent <- tdf$generation[idx]
+
+    ## generate it
+    current_max_id <- max(tdf$id)
+    n_offspring <- offspring_fun(1, susc)
+
+    if (n_offspring %% 1 > 0) {
+      stop("Offspring distribution must return integers")
+    }
+
+    ## mark as done
+    tdf$offspring_generated[idx] <- TRUE
+
+    ## add to df
+    if (n_offspring > 0) {
+      ## draw times
+      new_times <- serial(n_offspring)
+
+      if (any(new_times < 0)) {
+        stop("Serial interval must be >= 0.")
+      }
+
+      new_df <- data.frame(
+        id = current_max_id + seq_len(n_offspring),
+        time = new_times + t_parent,
+        ancestor = id_parent,
+        generation = gen_parent + 1L,
         offspring_generated = FALSE
-    )
-
-    susc <- pop - initial_immune - 1L
-    t <- t0
-
-    ## continue if any unsimulated has t <= tf
-    ## AND there is still susceptibles left
-    while (
-        any(tdf$time[!tdf$offspring_generated] <= tf) &
-        susc > 0
-        ) {
-
-        ## select from which case to generate offspring
-        t <- min(tdf$time[!tdf$offspring_generated]) #lowest unsimulated t
-
-        ## index of the first in df with t, extract vars
-        idx <- which(tdf$time == t & !tdf$offspring_generated)[1]
-        id_parent <- tdf$id[idx]
-        t_parent <- tdf$time[idx]
-        gen_parent <- tdf$generation[idx]
-
-        ## generate it
-        current_max_id <- max(tdf$id)
-        n_offspring <- offspring_fun(1, susc)
-
-        if (n_offspring %% 1 > 0) {
-            stop("Offspring distribution must return integers")
-        }
-
-        ## mark as done
-        tdf$offspring_generated[idx] <- TRUE
-
-        ## add to df
-        if (n_offspring > 0) {
-            ## draw times
-            new_times <- serial(n_offspring)
-
-            if (any(new_times < 0)) {
-                stop("Serial interval must be >= 0.")
-            }
-
-            new_df <- data.frame(
-                id = current_max_id + seq_len(n_offspring),
-                time = new_times + t_parent,
-                ancestor = id_parent,
-                generation = gen_parent + 1L,
-                offspring_generated = FALSE
-            )
-
-            ## add new cases to tdf
-            tdf <- rbind(tdf, new_df)
-        }
-
-        ## adjust susceptibles
-        susc <- susc - n_offspring
+      )
+
+      ## add new cases to tdf
+      tdf <- rbind(tdf, new_df)
     }
 
-    ## remove cases with time > tf that could
-    ## have been generated in the last generation
-    tdf <- tdf[tdf$time <= tf, ]
+    ## adjust susceptibles
+    susc <- susc - n_offspring
+  }
+
+  ## remove cases with time > tf that could
+  ## have been generated in the last generation
+  tdf <- tdf[tdf$time <= tf, ]
 
-    ## sort output and remove columns not needed
-    tdf <- tdf[order(tdf$time, tdf$id), ]
-    tdf$offspring_generated <- NULL
+  ## sort output and remove columns not needed
+  tdf <- tdf[order(tdf$time, tdf$id), ]
+  tdf$offspring_generated <- NULL
 
-    return(tdf)
-}
\ No newline at end of file
+  return(tdf)
+}
diff --git a/R/utils.r b/R/utils.r
index 573e342c..4ff2e0e0 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -6,7 +6,7 @@
 #' @author Sebastian Funk
 #' @keywords internal
 complementary_logprob <- function(x) {
-    tryCatch(log1p(-sum(exp(x))), error=function(e) -Inf)
+  tryCatch(log1p(-sum(exp(x))), error = function(e) -Inf)
 }
 
 #' Samples size (the number of trials) of a binomial distribution
@@ -20,7 +20,7 @@ complementary_logprob <- function(x) {
 #' @author Sebastian Funk
 #' @keywords internal
 rbinom_size <- function(n, x, prob) {
-    x + stats::rnbinom(n, x + 1, prob)
+  x + stats::rnbinom(n, x + 1, prob)
 }
 
 #' Samples chain lengths with given observation probabilities
@@ -35,9 +35,9 @@ rbinom_size <- function(n, x, prob) {
 #' @author Sebastian Funk
 #' @keywords internal
 rgen_length <- function(n, x, prob) {
-    x +
-      ceiling(log(stats::runif(n, 0, 1)) / log(1 - prob) - 1) +
-      ceiling(log(stats::runif(n, 0, 1)) / log(1 - prob) - 1)
+  x +
+    ceiling(log(stats::runif(n, 0, 1)) / log(1 - prob) - 1) +
+    ceiling(log(stats::runif(n, 0, 1)) / log(1 - prob) - 1)
 }
 
 #' Finds the name of a function passed as an argument
@@ -71,4 +71,4 @@ find_function_name <- function(fun) {
 rnbinom_mean_disp <- function(n, mn, disp) {
   size <- mn / (disp - 1)
   stats::rnbinom(n, size = size, mu = mn)
-  }
\ No newline at end of file
+}

From 487be152704a34128bcf79a416405742522bd588 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 15:43:21 +0000
Subject: [PATCH 118/828] styled the /man/ files

---
 man/bpmodels-package.Rd |  5 ++-
 man/chain_ll.Rd         | 10 +++--
 man/chain_sim.Rd        | 99 ++++++++++++++++++++++++++---------------
 man/chain_sim_susc.Rd   |  2 +-
 man/covid19_sa.Rd       |  6 +--
 5 files changed, 77 insertions(+), 45 deletions(-)

diff --git a/man/bpmodels-package.Rd b/man/bpmodels-package.Rd
index dd44c49d..c32b684c 100644
--- a/man/bpmodels-package.Rd
+++ b/man/bpmodels-package.Rd
@@ -11,8 +11,8 @@ Provides methods to analyse and simulate the size and length of branching proces
 \seealso{
 Useful links:
 \itemize{
-  \item \url{https://github.com/sbfnk/bpmodels}
-  \item Report bugs at \url{https://github.com/sbfnk/bpmodels/issues}
+  \item \url{https://github.com/epiverse-trace/bpmodels}
+  \item Report bugs at \url{https://github.com/epiverse-trace/bpmodels/issues}
 }
 
 }
@@ -22,6 +22,7 @@ Useful links:
 Authors:
 \itemize{
   \item Flavio Finger \email{flavio.finger@epicentre.msf.org}
+  \item James Mba Azam \email{james.azam@lshtm.ac.uk}
 }
 
 Other contributors:
diff --git a/man/chain_ll.Rd b/man/chain_ll.Rd
index b110fdce..8505e6af 100644
--- a/man/chain_ll.Rd
+++ b/man/chain_ll.Rd
@@ -31,7 +31,8 @@ the R distribution function (e.g., "pois" for Poisson, where
 
 \item{exclude}{any sizes/lengths to exclude from the likelihood calculation}
 
-\item{individual}{if TRUE, a vector of individual log-likelihood contributions will be returned rather than the sum}
+\item{individual}{if TRUE, a vector of individual log-likelihood
+contributions will be returned rather than the sum}
 
 \item{nsim_obs}{number of simulations if the likelihood is to be
 approximated for imperfect observations}
@@ -39,14 +40,15 @@ approximated for imperfect observations}
 \item{...}{parameters for the offspring distribution}
 }
 \value{
-likelihood, or vector of likelihoods (if \code{obs_prob} < 1), or a list of individual likelihood contributions (if \code{individual=TRUE})
+likelihood, or vector of likelihoods (if \code{obs_prob} < 1), or
+a list of individual likelihood contributions (if \code{individual=TRUE})
 }
 \description{
 Likelihood for the outcome of a branching process
 }
 \examples{
-chain_sizes <- c(1,1,4,7) # example of observed chain sizes
-chain_ll(chain_sizes, "pois", "size", lambda=0.5)
+chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
+chain_ll(chain_sizes, "pois", "size", lambda = 0.5)
 }
 \seealso{
 pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll
diff --git a/man/chain_sim.Rd b/man/chain_sim.Rd
index 49caed28..bc364955 100644
--- a/man/chain_sim.Rd
+++ b/man/chain_sim.Rd
@@ -29,17 +29,23 @@ the R distribution function (e.g., "pois" for Poisson, where
 \item "length": the total number of ancestors.
 }}
 
-\item{infinite}{A size or length above which the simulation results should be
+\item{infinite}{A size or length above which the simulation results
+should be
 set to \code{Inf}. Defaults to \code{Inf}, resulting in no results ever set to \code{Inf}}
 
-\item{tree}{Logical. Should the transmission tree be returned? Defaults to \code{FALSE}.}
+\item{tree}{Logical. Should the transmission tree be returned? Defaults
+to \code{FALSE}.}
 
-\item{serial}{The serial interval generator function; the name of a user-defined
-named or anonymous function with only one argument \code{n}, representing the number
+\item{serial}{The serial interval generator function; the name of a
+user-defined
+named or anonymous function with only one argument \code{n}, representing
+the number
 of serial intervals to generate.}
 
-\item{t0}{Start time (if serial interval is given); either a single value or a
-vector of length \code{n} (number of simulations) with initial times. Defaults to 0.}
+\item{t0}{Start time (if serial interval is given); either a single value
+or a
+vector of length \code{n} (number of simulations) with initial times. Defaults
+to 0.}
 
 \item{tf}{End time (if serial interval is given).}
 
@@ -49,23 +55,29 @@ vector of length \code{n} (number of simulations) with initial times. Defaults t
 Either:
 \itemize{
 \item{A vector of sizes/lengths (if \code{tree == FALSE} OR serial
-interval function not specified, since that implies \code{tree == FALSE})}, or
+interval function not specified, since that implies
+\code{tree == FALSE})}, or
 \item {a data frame with
 columns \code{n} (simulation ID), \code{time} (if the serial interval is given) and
-(if \code{tree == TRUE}), \code{id} (a unique ID within each simulation for each
-individual element of the chain), \code{ancestor} (the ID of the ancestor of each
+(if \code{tree == TRUE}), \code{id} (a unique ID within each simulation for
+each
+individual element of the chain), \code{ancestor} (the ID of the ancestor of
+each
 element), and \code{generation}.}
 }
 }
 \description{
 \code{chain_sim()} is a stochastic simulator for generating
-transmission chain data with key inputs such as the offspring distribution and
-serial interval distribution.
+transmission chain data with key inputs such as the offspring distribution
+and serial interval distribution.
 }
 \details{
-\code{chain_sim()} either returns a vector or a data.frame. The output is either a
-vector if \code{serial} is not provided, which automatically sets \code{tree = FALSE},
-or a \code{data.frame}, which means that \code{serial} was provided as a function. When \code{serial}
+\code{chain_sim()} either returns a vector or a data.frame. The output is
+either a
+vector if \code{serial} is not provided, which automatically sets
+\code{tree = FALSE},
+or a \code{data.frame}, which means that \code{serial} was provided as a function.
+When \code{serial}
 is provided, it means \code{tree = TRUE} automatically. However, setting
 \code{tree = TRUE} would require providing a function for \code{serial}.
 }
@@ -73,35 +85,44 @@ is provided, it means \code{tree = TRUE} automatically. However, setting
 \subsection{Assumptions/disambiguation}{
 
 In epidemiology, the generation interval is the duration between successive
-infectious events in a chain of transmission. Similarly, the serial interval is the
+infectious events in a chain of transmission. Similarly, the serial
+interval is the
 duration between observed symptom onset times between successive
-cases in a transmission chain. The generation interval is often hard to observe
-because exact times of infection are hard to measure hence, the serial interval
-is often used instead. Here, we use the serial interval to represent what would
-normally be called the generation interval, that is, the time between successive
+cases in a transmission chain. The generation interval is often hard to
+observe
+because exact times of infection are hard to measure hence, the serial
+interval
+is often used instead. Here, we use the serial interval to represent
+what would
+normally be called the generation interval, that is, the time between
+successive
 cases.
 }
 
 \subsection{Specifying \code{serial} in \code{chain_sim()}}{
 
 \code{serial} must be specified as a named or
-\href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function}
+\href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function} # nolint
 with one argument.
 
 If \code{serial} is specified, \code{chain_sim()} returns times of
-infection as a column in the output. Moreover, specifying a function for \code{serial} implies
+infection as a column in the output. Moreover, specifying a function
+for \code{serial} implies
 \code{tree = TRUE} and a tree of infectors (\code{ancestor}) and infectees (\code{id})
 will be generated in the output.
 
 For example, assuming we want to specify the serial interval
-generator as a random log-normally distributed variable with \code{meanlog = 0.58}
+generator as a random log-normally distributed variable with
+\code{meanlog = 0.58}
 and \code{sdlog = 1.58}, we could define a named function, let's call it
 "serial_interval", with only one argument representing the number of serial
-intervals to sample: \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
+intervals to sample:
+\code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
 and assign the name of the function to serial in \code{chain_sim()} like so
 \code{chain_sim(..., serial = serial_interval)},
 where \code{...} are the other arguments to \code{chain_sim()}. Alternatively, we
-could assign an anonymous function to serial in the \code{chain_sim()} call like so
+could assign an anonymous function to serial in the \code{chain_sim()} call
+like so
 \code{chain_sim(..., serial = function(n){rlnorm(n, 0.58, 1.38)})},
 where \code{...} are the other arguments to \code{chain_sim()}.
 }
@@ -110,22 +131,30 @@ where \code{...} are the other arguments to \code{chain_sim()}.
 \examples{
 # Specifying no `serial` and `tree == FALSE` (default) returns a vector
 set.seed(123)
-chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5, tree = FALSE)
+chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5, 
+tree = FALSE)
 
-# Specifying `serial` without specifying `tree` will set `tree = TRUE` internally.
- 
-# We'll first define the serial function 
+# Specifying `serial` without specifying `tree` will set `tree = TRUE` 
+internally.
+
+# We'll first define the serial function
 set.seed(123)
-serial_interval <- function(n){rlnorm(n, meanlog = 0.58, sdlog = 1.58)}
-chain_sim(n = 5, offspring = 'pois', lambda = 0.5, stat = 'length', infinite = 100, 
-serial = serial_interval)
+serial_interval <- function(n) {
+  rlnorm(n, meanlog = 0.58, sdlog = 1.58)
+}
+chain_sim(
+  n = 5, offspring = "pois", lambda = 0.5, stat = "length", 
+  infinite = 100,
+  serial = serial_interval
+)
 
-# Specifying `serial` and `tree = FALSE` will throw an error 
+# Specifying `serial` and `tree = FALSE` will throw an error
 set.seed(123)
 \dontrun{
-try(chain_sim(n = 10, serial = function(x) 3, offspring = "pois", lambda = 2, 
-infinite = 10, tree = FALSE)
-)
+try(chain_sim(
+  n = 10, serial = function(x) 3, offspring = "pois", lambda = 2,
+  infinite = 10, tree = FALSE
+))
 }
 }
 \author{
diff --git a/man/chain_sim_susc.Rd b/man/chain_sim_susc.Rd
index c06e52f1..3d1c7832 100644
--- a/man/chain_sim_susc.Rd
+++ b/man/chain_sim_susc.Rd
@@ -57,7 +57,7 @@ it always tracks and returns a data frame containing the entire tree,
 the maximal length of chains is limited with pop instead of infinite.
 }
 \examples{
-chain_sim_susc("pois", mn_offspring=0.5, serial = function(x) 3, pop = 100)
+chain_sim_susc("pois", mn_offspring = 0.5, serial = function(x) 3, pop = 100)
 }
 \author{
 Flavio Finger
diff --git a/man/covid19_sa.Rd b/man/covid19_sa.Rd
index 395e13c3..1bc989c4 100644
--- a/man/covid19_sa.Rd
+++ b/man/covid19_sa.Rd
@@ -22,9 +22,9 @@ Further details in \code{data-raw/covid19_sa.R}.
 covid19_sa
 }
 \description{
-An aggregated subset of the COVID-19 Data Repository for South Africa created,
-maintained and hosted by Data Science for Social Impact research group,
-led by Dr. Vukosi Marivate.
+An aggregated subset of the COVID-19 Data Repository for South Africa
+created, maintained and hosted by Data Science for Social Impact research
+group, led by Dr. Vukosi Marivate.
 }
 \details{
 The data is originally provided as a linelist but has been subsetted and

From a994eb6ba235e64ad4b1f35785fa6327a279ea2c Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 15:44:13 +0000
Subject: [PATCH 119/828] styled files in data-raw/ and tests/

---
 data-raw/covid19_sa.R        |  18 +--
 tests/testthat/tests-borel.r |  16 +--
 tests/testthat/tests-ll.r    |  81 ++++++-----
 tests/testthat/tests-sim.r   | 251 +++++++++++++++++++----------------
 4 files changed, 195 insertions(+), 171 deletions(-)

diff --git a/data-raw/covid19_sa.R b/data-raw/covid19_sa.R
index 16af4fbf..3991eff3 100644
--- a/data-raw/covid19_sa.R
+++ b/data-raw/covid19_sa.R
@@ -3,19 +3,19 @@
 library(dplyr)
 library(lubridate)
 
-#Link to data
-data_url <- 'https://raw.githubusercontent.com/dsfsi/covid19za/master/data/covid19za_timeline_confirmed.csv'
+# Link to data
+data_url <- "https://raw.githubusercontent.com/dsfsi/covid19za/master/data/covid19za_timeline_confirmed.csv" # nolint: line_length_linter.
 
-#Read the data in using the url
+# Read the data in using the url
 covid19_sa <- read.csv(data_url)
 
-#Clean and subset the data we need
-covid19_sa <- covid19_sa %>% 
-  select(date) %>% 
+# Clean and subset the data we need
+covid19_sa <- covid19_sa %>%
+  select(date) %>%
   mutate(date = lubridate::dmy(date)) %>%
-  filter(date <= min(date) + lubridate::days(15)) %>%   
-  group_by(date) %>% 
-  summarise(cases = n()) %>%   
+  filter(date <= min(date) + lubridate::days(15)) %>%
+  group_by(date) %>%
+  summarise(cases = n()) %>%
   ungroup()
 
 usethis::use_data(covid19_sa, overwrite = TRUE)
diff --git a/tests/testthat/tests-borel.r b/tests/testthat/tests-borel.r
index 266997e3..e45fa23a 100644
--- a/tests/testthat/tests-borel.r
+++ b/tests/testthat/tests-borel.r
@@ -1,15 +1,11 @@
 context("The Borel distribution is implemented")
 
-test_that("We can calculate probabilities and sample",
-{
-    expect_gt(dborel(1, 0.5), 0)
-    expect_equal(dborel(1, 0.5, log=TRUE), -0.5)
-    expect_length(rborel(2, 0.9), 2)
+test_that("We can calculate probabilities and sample", {
+  expect_gt(dborel(1, 0.5), 0)
+  expect_equal(dborel(1, 0.5, log = TRUE), -0.5)
+  expect_length(rborel(2, 0.9), 2)
 })
 
-test_that("Errors are thrown",
-{
-    expect_error(dborel(0, 0.5), "greater than 0")
+test_that("Errors are thrown", {
+  expect_error(dborel(0, 0.5), "greater than 0")
 })
-
-
diff --git a/tests/testthat/tests-ll.r b/tests/testthat/tests-ll.r
index 7f2f638f..0c52d3f4 100644
--- a/tests/testthat/tests-ll.r
+++ b/tests/testthat/tests-ll.r
@@ -1,42 +1,55 @@
 context("Calculating the likelihood from a branching process model")
 
-chains <- c(1,1,4,7)
+chains <- c(1, 1, 4, 7)
 
-test_that("Likelihoods can be calculated",
-{
-    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5), 0)
-    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5, exclude=1), 0)
-    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5, infinite = 5), 0)
-    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5, obs_prob = 0.5,
-                       nsim_obs=1), 0)
-    expect_lt(chain_ll(chains, "pois", "length", lambda=0.5, obs_prob = 0.5,
-                       nsim_obs=1), 0)
-    expect_lt(chain_ll(chains, "pois", "size", lambda=0.5, infinite = 5,
-                       obs_prob = 0.5, nsim_obs=1), 0)
-    expect_lt(chain_ll(chains, "binom", "size", size=1, prob=0.5), 0)
+test_that("Likelihoods can be calculated", {
+  expect_lt(chain_ll(chains, "pois", "size", lambda = 0.5), 0)
+  expect_lt(chain_ll(chains, "pois", "size", lambda = 0.5, exclude = 1), 0)
+  expect_lt(chain_ll(chains, "pois", "size", lambda = 0.5, infinite = 5), 0)
+  expect_lt(chain_ll(chains, "pois", "size",
+    lambda = 0.5, obs_prob = 0.5,
+    nsim_obs = 1
+  ), 0)
+  expect_lt(chain_ll(chains, "pois", "length",
+    lambda = 0.5, obs_prob = 0.5,
+    nsim_obs = 1
+  ), 0)
+  expect_lt(chain_ll(chains, "pois", "size",
+    lambda = 0.5, infinite = 5,
+    obs_prob = 0.5, nsim_obs = 1
+  ), 0)
+  expect_lt(chain_ll(chains, "binom", "size", size = 1, prob = 0.5), 0)
 })
 
-test_that("Analytical size/length distributions are implemented",
-{
-    expect_true(all(pois_size_ll(chains, lambda=0.5) < 0))
-    expect_true(all(nbinom_size_ll(chains, mu=0.5, size=0.2) < 0))
-    expect_true(all(nbinom_size_ll(chains, prob=0.5, size=0.2) < 0))
-    expect_true(all(gborel_size_ll(chains, prob=0.5, size=0.2) < 0))
-    expect_true(all(gborel_size_ll(chains, prob=0.5, size=0.2) < 0))
-    expect_true(all(pois_length_ll(chains, lambda=0.5) < 0))
-    expect_true(all(geom_length_ll(chains, prob=0.5) < 0))
+test_that("Analytical size/length distributions are implemented", {
+  expect_true(all(pois_size_ll(chains, lambda = 0.5) < 0))
+  expect_true(all(nbinom_size_ll(chains, mu = 0.5, size = 0.2) < 0))
+  expect_true(all(nbinom_size_ll(chains, prob = 0.5, size = 0.2) < 0))
+  expect_true(all(gborel_size_ll(chains, prob = 0.5, size = 0.2) < 0))
+  expect_true(all(gborel_size_ll(chains, prob = 0.5, size = 0.2) < 0))
+  expect_true(all(pois_length_ll(chains, lambda = 0.5) < 0))
+  expect_true(all(geom_length_ll(chains, prob = 0.5) < 0))
 })
 
-test_that("Errors are thrown",
-{
-    expect_error(chain_ll(chains, list(), "size", lambda=0.5),
-                 "not a character")
-    expect_error(chain_ll(chains, "pois", "size", lambda=0.5, obs_prob = 3),
-                 "must be within")
-    expect_error(chain_ll(chains, "pois", "size", lambda=0.5, obs_prob = 0.5),
-                 "must be specified")
-    expect_error(nbinom_size_ll(chains, mu=0.5, size=0.2, prob=0.1),
-                 "both specified")
-    expect_error(gborel_size_ll(chains, mu=0.5, size=0.2, prob=0.1),
-                 "both specified")
+test_that("Errors are thrown", {
+  expect_error(
+    chain_ll(chains, list(), "size", lambda = 0.5),
+    "not a character"
+  )
+  expect_error(
+    chain_ll(chains, "pois", "size", lambda = 0.5, obs_prob = 3),
+    "must be within"
+  )
+  expect_error(
+    chain_ll(chains, "pois", "size", lambda = 0.5, obs_prob = 0.5),
+    "must be specified"
+  )
+  expect_error(
+    nbinom_size_ll(chains, mu = 0.5, size = 0.2, prob = 0.1),
+    "both specified"
+  )
+  expect_error(
+    gborel_size_ll(chains, mu = 0.5, size = 0.2, prob = 0.1),
+    "both specified"
+  )
 })
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 3c74acb4..ed8f31c7 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -1,139 +1,154 @@
 context("Simulating from a branching process model")
 
-test_that("Chains can be simulated",
-{
-    expect_length(chain_sim(n=2, "pois", lambda=0.5), 2)
-    expect_length(chain_sim(n=10, "pois", "length", lambda=0.9), 10)
-    expect_true(is.data.frame(chain_sim(n=10, "pois", lambda=2, tree=TRUE,
-                                        infinite=10)))
-    expect_false(any(is.finite(chain_sim(n=2, "pois", "length", lambda=0.5,
-                                         infinite=1))))
-    expect_no_error(chain_sim(n = 2, offspring = 'pois', "size", lambda = 0.9, 
-                                               tree = TRUE)
-                    )
+test_that("Chains can be simulated", {
+  expect_length(chain_sim(n = 2, "pois", lambda = 0.5), 2)
+  expect_length(chain_sim(n = 10, "pois", "length", lambda = 0.9), 10)
+  expect_true(is.data.frame(chain_sim(
+    n = 10, "pois", lambda = 2, tree = TRUE,
+    infinite = 10
+  )))
+  expect_false(any(is.finite(chain_sim(
+    n = 2, "pois", "length", lambda = 0.5,
+    infinite = 1
+  ))))
+  expect_no_error(chain_sim(
+    n = 2, offspring = "pois", "size", lambda = 0.9,
+    tree = TRUE
+  ))
 })
 
-test_that("Errors are thrown",
-{
-    expect_error(chain_sim(n=2, "dummy"), "does not exist")
-    expect_error(chain_sim(n=2, "lnorm", meanlog=log(1.6)), "integer")
-    expect_error(chain_sim(n = 2, offspring = pois, "length", lambda = 0.9), 
-                 "not found"
-                 )
-    expect_error(chain_sim(n = 2, offspring = 'pois', "size", lambda = 0.9, 
-                           serial = c(1:2), "must be a function")
-                 )
-    expect_error(chain_sim(n = 2, offspring = c(1, 2), "length", lambda = 0.9),
-                 "not a character string")
-    expect_error(chain_sim(n = 2, offspring = list(1, 2), "length", lambda = 0.9),
-                 "not a character string")
-    expect_error(chain_sim(n = 2, offspring = 'pois', "size", lambda = 0.9, 
-                           serial = function(x) rpois(x, 0.9), tree = FALSE),
-                 "If `serial` is specified, then `tree` cannot be set to `FALSE`."
-                 )
-    expect_error(chain_sim(n = 2, offspring = 'pois', "size", lambda = 0.9, 
-                           tf = 5, tree = FALSE),
-                 "If `tf` is specified, `serial` must be specified too."
-    )
+test_that("Errors are thrown", {
+  expect_error(chain_sim(n = 2, "dummy"), "does not exist")
+  expect_error(chain_sim(n = 2, "lnorm", meanlog = log(1.6)), "integer")
+  expect_error(
+    chain_sim(n = 2, offspring = pois, "length", lambda = 0.9),
+    "not found"
+  )
+  expect_error(chain_sim(
+    n = 2, offspring = "pois", "size", lambda = 0.9,
+    serial = c(1:2), "must be a function"
+  ))
+  expect_error(
+    chain_sim(n = 2, offspring = c(1, 2), "length", lambda = 0.9),
+    "not a character string"
+  )
+  expect_error(
+    chain_sim(n = 2, offspring = list(1, 2), "length", lambda = 0.9),
+    "not a character string"
+  )
+  expect_error(
+    chain_sim(
+      n = 2, offspring = "pois", "size", lambda = 0.9,
+      serial = function(x) rpois(x, 0.9), tree = FALSE
+    ),
+    "If `serial` is specified, then `tree` cannot be set to `FALSE`."
+  )
+  expect_error(
+    chain_sim(
+      n = 2, offspring = "pois", "size", lambda = 0.9,
+      tf = 5, tree = FALSE
+    ),
+    "If `tf` is specified, `serial` must be specified too."
+  )
 })
 
 context("Simulating from a branching process model
     accounting for depletion of susceptibles")
 
 
-test_that("Chains can be simulated",
-{
-    expect_true(
-        is.data.frame(
-            chain_sim_susc(
-                "pois",
-                mn_offspring = 2,
-                serial = function(x) 3,
-                pop = 100
-            )
-        )
+test_that("Chains can be simulated", {
+  expect_true(
+    is.data.frame(
+      chain_sim_susc(
+        "pois",
+        mn_offspring = 2,
+        serial = function(x) 3,
+        pop = 100
+      )
     )
+  )
 
-    expect_true(
-        is.data.frame(
-            chain_sim_susc(
-                "nbinom",
-                mn_offspring = 2,
-                disp_offspring = 1.5,
-                serial = function(x) 3,
-                pop = 100
-            )
-        )
+  expect_true(
+    is.data.frame(
+      chain_sim_susc(
+        "nbinom",
+        mn_offspring = 2,
+        disp_offspring = 1.5,
+        serial = function(x) 3,
+        pop = 100
+      )
     )
+  )
 
-    expect_true(
-        nrow(
-            chain_sim_susc(
-                "pois",
-                mn_offspring = 2,
-                serial = function(x) 3,
-                pop = 1
-            )
-        ) == 1
-    )
+  expect_true(
+    nrow(
+      chain_sim_susc(
+        "pois",
+        mn_offspring = 2,
+        serial = function(x) 3,
+        pop = 1
+      )
+    ) == 1
+  )
 
-    expect_true(
-        nrow(
-            chain_sim_susc(
-                "pois",
-                mn_offspring = 100,
-                tf = 2,
-                serial = function(x) 3,
-                pop = 999
-            )
-        ) == 1
-    )
-
-    expect_true(
-        nrow(
-            chain_sim_susc(
-                "pois",
-                mn_offspring = 100,
-                serial = function(x) 3,
-                pop = 999,
-                initial_immune = 998
-            )
-        ) == 1
-    )
+  expect_true(
+    nrow(
+      chain_sim_susc(
+        "pois",
+        mn_offspring = 100,
+        tf = 2,
+        serial = function(x) 3,
+        pop = 999
+      )
+    ) == 1
+  )
 
+  expect_true(
+    nrow(
+      chain_sim_susc(
+        "pois",
+        mn_offspring = 100,
+        serial = function(x) 3,
+        pop = 999,
+        initial_immune = 998
+      )
+    ) == 1
+  )
 })
 
-test_that("Errors are thrown",
-{
-    expect_error(
-        chain_sim_susc(
-            "dummy",
-            mn_offspring = 3,
-            serial = function(x) 3,
-            pop = 100),
-        paste0("'arg' should be one of ", dQuote('pois'), ', ', dQuote('nbinom')))
-    expect_error(
-        chain_sim_susc(
-            "nbinom",
-            mn_offspring = 3,
-            disp_offspring = 1,
-            serial = function(x) 3,
-            pop = 100
-            ),
-        "Offspring distribution 'nbinom' requires argument
-                disp_offspring > 1. Use 'pois' if there is no overdispersion.")
-    expect_error(
-        chain_sim_susc(
-            "nbinom",
-            mn_offspring = 3,
-            serial = function(x) 3,
-            pop = 100
-            ),
-        "argument \"disp_offspring\" is missing, with no default")
-
+test_that("Errors are thrown", {
+  expect_error(
+    chain_sim_susc(
+      "dummy",
+      mn_offspring = 3,
+      serial = function(x) 3,
+      pop = 100
+    ),
+    paste0("'arg' should be one of ", dQuote("pois"), ", ", dQuote("nbinom"))
+  )
+  expect_error(
+    chain_sim_susc(
+      "nbinom",
+      mn_offspring = 3,
+      disp_offspring = 1,
+      serial = function(x) 3,
+      pop = 100
+    ),
+    "Offspring distribution 'nbinom' requires argument
+                disp_offspring > 1. Use 'pois' if there is no overdispersion."
+  )
+  expect_error(
+    chain_sim_susc(
+      "nbinom",
+      mn_offspring = 3,
+      serial = function(x) 3,
+      pop = 100
+    ),
+    "argument \"disp_offspring\" is missing, with no default"
+  )
 })
 
-test_that('warnings work as expected', {
+test_that("warnings work as expected", {
   expect_warning(
     chain_sim_susc(
       "pois",
@@ -141,8 +156,8 @@ test_that('warnings work as expected', {
       disp_offspring = 1,
       serial = function(x) 3,
       pop = 100
-      ),
+    ),
     "argument disp_offspring not used for
                 poisson offspring distribution."
-    )
+  )
 })

From 23055b377c6032b84ce086b98b55d2af5146543a Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 16:21:11 +0000
Subject: [PATCH 120/828] changed single quote to double quote

---
 vignettes/projecting_incidence.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 40050a62..e183c138 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -57,7 +57,7 @@ We will get and clean the first $15$ days of the COVID-19
 outbreak in South Africa to seed the simulation for this example.
 
 ```{r data, message=FALSE}
-data_url <- 'https://raw.githubusercontent.com/dsfsi/covid19za/master/data/covid19za_timeline_confirmed.csv' # nolint: line_length_linter. 
+data_url <- "https://raw.githubusercontent.com/dsfsi/covid19za/master/data/covid19za_timeline_confirmed.csv" # nolint: line_length_linter. 
 
 #Read the data in using the url
 covid19_sa <- read.csv(data_url)

From c1cb6d2a04c915cea7daad51559a2bba0d16eadf Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 16:22:45 +0000
Subject: [PATCH 121/828] broke long lines to be shorter than 80 chars

---
 README.Rmd | 33 ++++++++++++++++++---------------
 README.md  | 16 ++++++++++------
 2 files changed, 28 insertions(+), 21 deletions(-)

diff --git a/README.Rmd b/README.Rmd
index be7aceae..dc29224b 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -35,20 +35,21 @@ knitr::opts_chunk$set(echo = TRUE)
 branching processes with a given offspring distribution.
 
 # Installation
+
 The latest development version of the `bpmodels` package can be installed via
 
-```{r eval=FALSE}
+```{r include=TRUE,eval=FALSE}
 devtools::install_github('epiverse-trace/bpmodels')
 ```
 
-# Quick start
-
 To load the package, use
 
-```{r echo=FALSE}
+```{r eval=TRUE}
 library('bpmodels')
 ```
 
+# Quick start
+
 At the heart of the package are the `chain_ll()` and `chain_sim()` functions. 
 
 ## Calculating log-likelihoods
@@ -83,7 +84,7 @@ The third argument, `stat`, determines whether to analyse chain sizes
 (`"length"`). Lastly, any named arguments not recognised by `chain_ll()` 
 are interpreted as parameters of the corresponding probability distribution, 
 here `lambda = 0.5` as the mean of the Poisson distribution (see the `R` help 
-page for the [Poisson distribution](https://stat.ethz.ch/R-manual/R-devel/library/stats/html/Poisson.html) for more information).
+page for the [Poisson distribution](https://stat.ethz.ch/R-manual/R-devel/library/stats/html/Poisson.html) for more information). 
 
 # Imperfect observations
 
@@ -115,20 +116,21 @@ file
 
 ## Simulating branching processes
 
-To simulate a branching process, we use the `chain_sim()` function. This function 
-follows the same syntax as `chain_ll()`.
+To simulate a branching process, we use the `chain_sim()` function. This 
+function follows the same syntax as `chain_ll()`.
 
 Below, we are simulating $5$ chains, assuming the offspring are generated using
-a Poisson distribution with mean, `lambda = 5`. By default, `chain_sim()` returns
-a vector of chain sizes/lengths. However, to override that so that a tree of
-infectees and infectors is returned, we need to specify a function for the serial 
-interval and set `tree = TRUE`
+a Poisson distribution with mean, `lambda = 5`. By default, `chain_sim()` 
+returns a vector of chain sizes/lengths. However, to override that so that 
+a tree of infectees and infectors is returned, we need to specify a function 
+for the serial interval and set `tree = TRUE`.
 
 ```{r}
 chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5)
 ```
 
 ### Simulating trees
+
 To simulate a tree of branching processes, we specify the serial interval 
 generation function and set `tree = TRUE` as follows:
 
@@ -164,7 +166,8 @@ chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5, nsim_offspring = 10
 
 ## Package vignettes
 
-Specific use cases of _bpmodels_ can be found in the [online documentation as package vignettes](https://epiverse-trace.github.io/bpmodels/), under "Articles".
+Specific use cases of _bpmodels_ can be found in 
+the [online documentation as package vignettes](https://epiverse-trace.github.io/bpmodels/), under "Articles".
 
 ## Reporting bugs 
 
@@ -172,8 +175,8 @@ To report a bug please open an [issue](https://github.com/epiverse-trace/bpmodel
 
 ## Contribute
 
-We welcome contributions to enhance the package's functionalities. If you wish to
-do so, please follow the [package contributing guide](https://github.com/epiverse-trace/.github/blob/main/CONTRIBUTING.md).
+We welcome contributions to enhance the package's functionalities. If you 
+wish to do so, please follow the [package contributing guide](https://github.com/epiverse-trace/.github/blob/main/CONTRIBUTING.md).
 
 ## Code of conduct
 
@@ -184,4 +187,4 @@ By contributing to this project, you agree to abide by its terms.
 
 ```{r message=FALSE, warning=FALSE}
 citation("bpmodels")
-```
\ No newline at end of file
+```
diff --git a/README.md b/README.md
index 8fc1f96b..bbe9b262 100644
--- a/README.md
+++ b/README.md
@@ -31,10 +31,14 @@ installed via
 devtools::install_github('epiverse-trace/bpmodels')
 ```
 
-# Quick start
-
 To load the package, use
 
+``` r
+library('bpmodels')
+```
+
+# Quick start
+
 At the heart of the package are the `chain_ll()` and `chain_sim()`
 functions.
 
@@ -117,7 +121,7 @@ generated using a Poisson distribution with mean, `lambda = 5`. By
 default, `chain_sim()` returns a vector of chain sizes/lengths. However,
 to override that so that a tree of infectees and infectors is returned,
 we need to specify a function for the serial interval and set
-`tree = TRUE`
+`tree = TRUE`.
 
 ``` r
 chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5)
@@ -199,15 +203,15 @@ citation("bpmodels")
 #> 
 #> To cite package 'bpmodels' in publications use:
 #> 
-#>   Funk S, Finger F (2023). _bpmodels: Analysing chain statistics using
-#>   branching process models_. R package version 0.1.0,
+#>   Funk S, Finger F, Azam JM (2023). _bpmodels: Analysing chain
+#>   statistics using branching process models_. R package version 0.1.0,
 #>   <https://github.com/sbfnk/bpmodels>.
 #> 
 #> A BibTeX entry for LaTeX users is
 #> 
 #>   @Manual{,
 #>     title = {bpmodels: Analysing chain statistics using branching process models},
-#>     author = {Sebastian Funk and Flavio Finger},
+#>     author = {Sebastian Funk and Flavio Finger and James Mba Azam},
 #>     year = {2023},
 #>     note = {R package version 0.1.0},
 #>     url = {https://github.com/sbfnk/bpmodels},

From b3bbed1110721202cf7d36c09e4965816aa5c197 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 16:28:50 +0000
Subject: [PATCH 122/828] remove trailing shite space

---
 R/data.R        |  4 ++--
 R/likelihoods.R |  6 +++---
 R/simulate.r    | 28 ++++++++++++++--------------
 3 files changed, 19 insertions(+), 19 deletions(-)

diff --git a/R/data.R b/R/data.R
index 3aaf5b78..d7b105d9 100644
--- a/R/data.R
+++ b/R/data.R
@@ -1,7 +1,7 @@
 #' COVID-19 Data Repository for South Africa
 #'
-#' An aggregated subset of the COVID-19 Data Repository for South Africa 
-#' created, maintained and hosted by Data Science for Social Impact research 
+#' An aggregated subset of the COVID-19 Data Repository for South Africa
+#' created, maintained and hosted by Data Science for Social Impact research
 #' group, led by Dr. Vukosi Marivate.
 #'
 #' The data is originally provided as a linelist but has been subsetted and
diff --git a/R/likelihoods.R b/R/likelihoods.R
index 3be65722..db7dc194 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -124,7 +124,7 @@ offspring_ll <- function(x, offspring, stat, nsim_offspring = 100, ...) {
 #' @param obs_prob observation probability (assumed constant)
 #' @param infinite any chains of this size/length will be treated as infinite
 #' @param exclude any sizes/lengths to exclude from the likelihood calculation
-#' @param individual if TRUE, a vector of individual log-likelihood 
+#' @param individual if TRUE, a vector of individual log-likelihood
 #' contributions will be returned rather than the sum
 #' @param nsim_obs number of simulations if the likelihood is to be
 #'   approximated for imperfect observations
@@ -140,7 +140,7 @@ offspring_ll <- function(x, offspring, stat, nsim_offspring = 100, ...) {
 #' chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
 #' chain_ll(chain_sizes, "pois", "size", lambda = 0.5)
 chain_ll <- function(x, offspring, stat = c("size", "length"), obs_prob = 1,
-                     infinite = Inf, exclude = c(), individual = FALSE, 
+                     infinite = Inf, exclude = c(), individual = FALSE,
                      nsim_obs, ...) {
   stat <- match.arg(stat)
 
@@ -159,7 +159,7 @@ chain_ll <- function(x, offspring, stat = c("size", "length"), obs_prob = 1,
       sample_func <- rgen_length
     }
     sampled_x <-
-      replicate(nsim_obs, pmin(sample_func(length(x), x, obs_prob), 
+      replicate(nsim_obs, pmin(sample_func(length(x), x, obs_prob),
                                infinite), simplify = FALSE)
     size_x <- unlist(sampled_x)
     if (!is.finite(infinite)) infinite <- max(size_x) + 1
diff --git a/R/simulate.r b/R/simulate.r
index 774369f8..fbe70167 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,7 +1,7 @@
 #' Simulate transmission chains using a branching process
 #'
 #' @description \code{chain_sim()} is a stochastic simulator for generating
-#' transmission chain data with key inputs such as the offspring distribution 
+#' transmission chain data with key inputs such as the offspring distribution
 #' and serial interval distribution.
 #' @param n Number of simulations to run.
 #' @param offspring Offspring distribution: a character string corresponding to
@@ -12,36 +12,36 @@
 #'   \item "size": the total number of offspring.
 #'   \item "length": the total number of ancestors.
 #' }
-#' @param infinite A size or length above which the simulation results 
-#' should be set to `Inf`. Defaults to `Inf`, resulting in no results 
+#' @param infinite A size or length above which the simulation results
+#' should be set to `Inf`. Defaults to `Inf`, resulting in no results
 #' ever set to `Inf`
-#' @param tree Logical. Should the transmission tree be returned? Defaults 
+#' @param tree Logical. Should the transmission tree be returned? Defaults
 #' to `FALSE`.
-#' @param serial The serial interval generator function; the name of a 
-#' user-defined named or anonymous function with only one argument `n`, 
+#' @param serial The serial interval generator function; the name of a
+#' user-defined named or anonymous function with only one argument `n`,
 #' representing the number of serial intervals to generate.
-#' @param t0 Start time (if serial interval is given); either a single value 
-#' or a vector of length `n` (number of simulations) with initial times. 
+#' @param t0 Start time (if serial interval is given); either a single value
+#' or a vector of length `n` (number of simulations) with initial times.
 #' Defaults to 0.
 #' @param tf End time (if serial interval is given).
 #' @param ... Parameters of the offspring distribution as required by R.
 #' @return Either:
 #' \itemize{
 #'  \item{A vector of sizes/lengths (if \code{tree == FALSE} OR serial
-#'   interval function not specified, since that implies 
+#'   interval function not specified, since that implies
 #'   \code{tree == FALSE})}, or
 #'   \item {a data frame with
 #'   columns `n` (simulation ID), `time` (if the serial interval is given) and
-#'   (if \code{tree == TRUE}), `id` (a unique ID within each simulation for 
-#'   each individual element of the chain), `ancestor` (the ID of the 
+#'   (if \code{tree == TRUE}), `id` (a unique ID within each simulation for
+#'   each individual element of the chain), `ancestor` (the ID of the
 #'   ancestor of each element), and `generation`.}
 #' }
 #' @author Sebastian Funk, James M. Azam
 #' @export
 #' @details
-#' `chain_sim()` either returns a vector or a data.frame. The output is 
-#' either a vector if `serial` is not provided, which automatically sets 
-#' \code{tree = FALSE}, or a `data.frame`, which means that `serial` was 
+#' `chain_sim()` either returns a vector or a data.frame. The output is
+#' either a vector if `serial` is not provided, which automatically sets
+#' \code{tree = FALSE}, or a `data.frame`, which means that `serial` was
 #' provided as a function. When `serial` is provided, it means 
 #' \code{tree = TRUE} automatically. However, setting \code{tree = TRUE} 
 #' would require providing a function for `serial`.

From 75712d25ecd7c39e14d394c682de22c26c7c04da Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 16:29:22 +0000
Subject: [PATCH 123/828] removed trailing whitespace linting

---
 .lintr | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/.lintr b/.lintr
index ae732225..56463147 100644
--- a/.lintr
+++ b/.lintr
@@ -1,7 +1,8 @@
 linters: linters_with_defaults(
     line_length_linter(90), 
     commented_code_linter = NULL,
-      object_name_linter = NULL
+    object_name_linter = NULL,
+    trailing_whitespace_linter = NULL
   )
 encoding:"UTF-8"
 

From 10d736135b6f1a9d32cc42d7ee347eb025ea895b Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 16:39:39 +0000
Subject: [PATCH 124/828] added .lintr to .Rbuildignore

---
 .Rbuildignore | 1 +
 1 file changed, 1 insertion(+)

diff --git a/.Rbuildignore b/.Rbuildignore
index 2bd66f52..e39d1aea 100644
--- a/.Rbuildignore
+++ b/.Rbuildignore
@@ -5,3 +5,4 @@ cran-comments.md
 ^\.Rproj\.user$
 ^README\.Rmd$
 ^LICENSE\.md$
+.lintr
\ No newline at end of file

From bd93e5603ad4c625a63b0a93f11b83cadb954df5 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 16:40:59 +0000
Subject: [PATCH 125/828] removed extra bookdown item in Suggests

---
 DESCRIPTION | 1 -
 1 file changed, 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index b9f1b68f..7fe2f0d1 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -25,7 +25,6 @@ Suggests:
     knitr,
     lubridate,
     rmarkdown,
-    bookdown,
     testthat,
     truncdist
 VignetteBuilder: 

From 7809425de5f8d9f8715ae04a25292ae9b8d8e164 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 16:42:19 +0000
Subject: [PATCH 126/828] updated .Rbuildignore

---
 .Rbuildignore | 16 +++++++++++-----
 1 file changed, 11 insertions(+), 5 deletions(-)

diff --git a/.Rbuildignore b/.Rbuildignore
index e39d1aea..b21681d2 100644
--- a/.Rbuildignore
+++ b/.Rbuildignore
@@ -1,8 +1,14 @@
-^CODE_OF_CONDUCT\.md$
-cran-comments.md
-^\.github$
 ^.*\.Rproj$
 ^\.Rproj\.user$
-^README\.Rmd$
 ^LICENSE\.md$
-.lintr
\ No newline at end of file
+^\.github$
+^codecov\.yml$
+^README\.Rmd$
+^\.lintr$
+^\_pkgdown.yml$
+^cran-comments\.md$
+^doc$
+^docs$
+^Meta$
+^pkgdown$
+^data-raw$
\ No newline at end of file

From d924aec1acbfc277888c661d42579146eb766170 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 17:11:11 +0000
Subject: [PATCH 127/828] removed given name

---
 DESCRIPTION | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 7fe2f0d1..46932565 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -5,7 +5,7 @@ Authors@R: c(
     person("Sebastian", "Funk", , "sebastian.funk@lshtm.ac.uk", role = c("aut", "cre")),
     person("Zhian N.", "Kamvar", , "zkamvar@gmail.com", role = "ctb"),
     person("Flavio", "Finger", , "flavio.finger@epicentre.msf.org", role = "aut"),
-    person("James", "Azam", "Mba", "james.azam@lshtm.ac.uk", role = c("aut"))
+    person("James M.", "Azam", , "james.azam@lshtm.ac.uk", role = c("aut"))
   )
 Description: Provides methods to analyse and simulate the size and length
     of branching processes with an arbitrary offspring distribution. These

From 8547c04581a2132dd86d3b6efd30c662f791f956 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 17:13:59 +0000
Subject: [PATCH 128/828] fixed uncommented example section causing errors in
 checks

---
 R/simulate.r            |  2 +-
 man/bpmodels-package.Rd |  2 +-
 man/chain_sim.Rd        | 67 ++++++++++++++++-------------------------
 3 files changed, 28 insertions(+), 43 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index fbe70167..3e8cc099 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -89,7 +89,7 @@
 #' tree = FALSE)
 #'
 #' # Specifying `serial` without specifying `tree` will set `tree = TRUE` 
-#' internally.
+#' # internally.
 #'
 #' # We'll first define the serial function
 #' set.seed(123)
diff --git a/man/bpmodels-package.Rd b/man/bpmodels-package.Rd
index c32b684c..0d3715d6 100644
--- a/man/bpmodels-package.Rd
+++ b/man/bpmodels-package.Rd
@@ -22,7 +22,7 @@ Useful links:
 Authors:
 \itemize{
   \item Flavio Finger \email{flavio.finger@epicentre.msf.org}
-  \item James Mba Azam \email{james.azam@lshtm.ac.uk}
+  \item James M. Azam \email{james.azam@lshtm.ac.uk}
 }
 
 Other contributors:
diff --git a/man/chain_sim.Rd b/man/chain_sim.Rd
index bc364955..5dd691a6 100644
--- a/man/chain_sim.Rd
+++ b/man/chain_sim.Rd
@@ -30,22 +30,19 @@ the R distribution function (e.g., "pois" for Poisson, where
 }}
 
 \item{infinite}{A size or length above which the simulation results
-should be
-set to \code{Inf}. Defaults to \code{Inf}, resulting in no results ever set to \code{Inf}}
+should be set to \code{Inf}. Defaults to \code{Inf}, resulting in no results
+ever set to \code{Inf}}
 
 \item{tree}{Logical. Should the transmission tree be returned? Defaults
 to \code{FALSE}.}
 
 \item{serial}{The serial interval generator function; the name of a
-user-defined
-named or anonymous function with only one argument \code{n}, representing
-the number
-of serial intervals to generate.}
+user-defined named or anonymous function with only one argument \code{n},
+representing the number of serial intervals to generate.}
 
 \item{t0}{Start time (if serial interval is given); either a single value
-or a
-vector of length \code{n} (number of simulations) with initial times. Defaults
-to 0.}
+or a vector of length \code{n} (number of simulations) with initial times.
+Defaults to 0.}
 
 \item{tf}{End time (if serial interval is given).}
 
@@ -60,10 +57,8 @@ interval function not specified, since that implies
 \item {a data frame with
 columns \code{n} (simulation ID), \code{time} (if the serial interval is given) and
 (if \code{tree == TRUE}), \code{id} (a unique ID within each simulation for
-each
-individual element of the chain), \code{ancestor} (the ID of the ancestor of
-each
-element), and \code{generation}.}
+each individual element of the chain), \code{ancestor} (the ID of the
+ancestor of each element), and \code{generation}.}
 }
 }
 \description{
@@ -73,30 +68,23 @@ and serial interval distribution.
 }
 \details{
 \code{chain_sim()} either returns a vector or a data.frame. The output is
-either a
-vector if \code{serial} is not provided, which automatically sets
-\code{tree = FALSE},
-or a \code{data.frame}, which means that \code{serial} was provided as a function.
-When \code{serial}
-is provided, it means \code{tree = TRUE} automatically. However, setting
-\code{tree = TRUE} would require providing a function for \code{serial}.
+either a vector if \code{serial} is not provided, which automatically sets
+\code{tree = FALSE}, or a \code{data.frame}, which means that \code{serial} was
+provided as a function. When \code{serial} is provided, it means
+\code{tree = TRUE} automatically. However, setting \code{tree = TRUE}
+would require providing a function for \code{serial}.
 }
 \section{The serial interval (\code{serial}):}{
 \subsection{Assumptions/disambiguation}{
 
 In epidemiology, the generation interval is the duration between successive
 infectious events in a chain of transmission. Similarly, the serial
-interval is the
-duration between observed symptom onset times between successive
-cases in a transmission chain. The generation interval is often hard to
-observe
-because exact times of infection are hard to measure hence, the serial
-interval
-is often used instead. Here, we use the serial interval to represent
-what would
-normally be called the generation interval, that is, the time between
-successive
-cases.
+interval is the duration between observed symptom onset times between
+successive cases in a transmission chain. The generation interval is
+often hard to observe because exact times of infection are hard to
+measure hence, the serial interval is often used instead. Here, we
+use the serial interval to represent what would normally be called the
+generation interval, that is, the time between successive cases.
 }
 
 \subsection{Specifying \code{serial} in \code{chain_sim()}}{
@@ -107,23 +95,20 @@ with one argument.
 
 If \code{serial} is specified, \code{chain_sim()} returns times of
 infection as a column in the output. Moreover, specifying a function
-for \code{serial} implies
-\code{tree = TRUE} and a tree of infectors (\code{ancestor}) and infectees (\code{id})
-will be generated in the output.
+for \code{serial} implies \code{tree = TRUE} and a tree of
+infectors (\code{ancestor}) and infectees (\code{id}) will be generated in the output.
 
 For example, assuming we want to specify the serial interval
 generator as a random log-normally distributed variable with
-\code{meanlog = 0.58}
-and \code{sdlog = 1.58}, we could define a named function, let's call it
-"serial_interval", with only one argument representing the number of serial
-intervals to sample:
+\code{meanlog = 0.58} and \code{sdlog = 1.58}, we could define a named function,
+let's call it "serial_interval", with only one argument representing the
+number of serial intervals to sample:
 \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
 and assign the name of the function to serial in \code{chain_sim()} like so
 \code{chain_sim(..., serial = serial_interval)},
 where \code{...} are the other arguments to \code{chain_sim()}. Alternatively, we
 could assign an anonymous function to serial in the \code{chain_sim()} call
-like so
-\code{chain_sim(..., serial = function(n){rlnorm(n, 0.58, 1.38)})},
+like so \code{chain_sim(..., serial = function(n){rlnorm(n, 0.58, 1.38)})},
 where \code{...} are the other arguments to \code{chain_sim()}.
 }
 }
@@ -135,7 +120,7 @@ chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5,
 tree = FALSE)
 
 # Specifying `serial` without specifying `tree` will set `tree = TRUE` 
-internally.
+# internally.
 
 # We'll first define the serial function
 set.seed(123)

From ba0e70a15507de59193586bf4cf4f7d1686970c6 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Fri, 10 Feb 2023 21:30:23 +0000
Subject: [PATCH 129/828] turning off cyclocomp_linter for next PR

---
 .lintr | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/.lintr b/.lintr
index 56463147..736ca976 100644
--- a/.lintr
+++ b/.lintr
@@ -2,7 +2,8 @@ linters: linters_with_defaults(
     line_length_linter(90), 
     commented_code_linter = NULL,
     object_name_linter = NULL,
-    trailing_whitespace_linter = NULL
+    trailing_whitespace_linter = NULL,
+    cyclocomp_linter = NULL
   )
 encoding:"UTF-8"
 

From 856e491e11663fe7bd176fdf2c09dc9d25747342 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 14 Feb 2023 13:00:48 +0000
Subject: [PATCH 130/828] updated lint_changed_files with version from package
 template

---
 .github/workflows/lint_changed_files.yaml | 1 +
 1 file changed, 1 insertion(+)

diff --git a/.github/workflows/lint_changed_files.yaml b/.github/workflows/lint_changed_files.yaml
index f39da76c..5f16f852 100644
--- a/.github/workflows/lint_changed_files.yaml
+++ b/.github/workflows/lint_changed_files.yaml
@@ -22,6 +22,7 @@ jobs:
             any::gh
             any::lintr
             any::purrr
+            epiverse-trace/etdev
           needs: check
 
       - name: Add lintr options

From b08e840091ecf2ef17e779c4170a2e8977fd6990 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 14 Feb 2023 13:03:29 +0000
Subject: [PATCH 131/828] updated lintr with version from package template

---
 .lintr | 11 ++++++++---
 1 file changed, 8 insertions(+), 3 deletions(-)

diff --git a/.lintr b/.lintr
index 736ca976..c7425f4f 100644
--- a/.lintr
+++ b/.lintr
@@ -1,8 +1,13 @@
 linters: linters_with_defaults(
-    line_length_linter(90), 
-    commented_code_linter = NULL,
+tags = NULL, # include all linters
     object_name_linter = NULL,
-    trailing_whitespace_linter = NULL,
+    undesirable_function_linter = NULL,
+    implicit_integer_linter = NULL,
+    extraction_operator_linter = NULL,
+    todo_comment_linter = NULL,
+    function_argument_linter = NULL,
+    # Use minimum R declared in DESCRIPTION or fall back to current R version
+    backport_linter(if (length(x <- etdev::extract_min_r_version())) x else getRversion()),
     cyclocomp_linter = NULL
   )
 encoding:"UTF-8"

From b698422990fe9affe10db3e9cf306469be780107 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 14 Feb 2023 13:18:35 +0000
Subject: [PATCH 132/828] fixed wrong call to linters

---
 .lintr | 9 +++------
 1 file changed, 3 insertions(+), 6 deletions(-)

diff --git a/.lintr b/.lintr
index c7425f4f..a86e3973 100644
--- a/.lintr
+++ b/.lintr
@@ -1,5 +1,5 @@
-linters: linters_with_defaults(
-tags = NULL, # include all linters
+linters: linters_with_tags(
+    tags = NULL, # include all linters
     object_name_linter = NULL,
     undesirable_function_linter = NULL,
     implicit_integer_linter = NULL,
@@ -9,7 +9,4 @@ tags = NULL, # include all linters
     # Use minimum R declared in DESCRIPTION or fall back to current R version
     backport_linter(if (length(x <- etdev::extract_min_r_version())) x else getRversion()),
     cyclocomp_linter = NULL
-  )
-encoding:"UTF-8"
-
-
+  )
\ No newline at end of file

From 9225a1ac2221e972ef1cbcf799267beae1fcf3f2 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 14 Feb 2023 13:55:51 +0000
Subject: [PATCH 133/828] fixed lintr to help pass checks

---
 .lintr | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/.lintr b/.lintr
index a86e3973..c99344cb 100644
--- a/.lintr
+++ b/.lintr
@@ -6,7 +6,7 @@ linters: linters_with_tags(
     extraction_operator_linter = NULL,
     todo_comment_linter = NULL,
     function_argument_linter = NULL,
+    cyclocomp_linter = NULL,
     # Use minimum R declared in DESCRIPTION or fall back to current R version
-    backport_linter(if (length(x <- etdev::extract_min_r_version())) x else getRversion()),
-    cyclocomp_linter = NULL
+    backport_linter(if (length(x <- etdev::extract_min_r_version())) x else getRversion())
   )
\ No newline at end of file

From 46419dce63154b4cea78a88bdd35d9fe4d795be0 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 14 Feb 2023 14:10:48 +0000
Subject: [PATCH 134/828] fixed lintr to help pass checks

---
 .lintr | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/.lintr b/.lintr
index c99344cb..ce4840c6 100644
--- a/.lintr
+++ b/.lintr
@@ -9,4 +9,5 @@ linters: linters_with_tags(
     cyclocomp_linter = NULL,
     # Use minimum R declared in DESCRIPTION or fall back to current R version
     backport_linter(if (length(x <- etdev::extract_min_r_version())) x else getRversion())
-  )
\ No newline at end of file
+  )
+  
\ No newline at end of file

From 36add89ee604c7540a616777c07d0ff7211f7663 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Wed, 15 Feb 2023 16:46:14 +0000
Subject: [PATCH 135/828] fixed new line issue

---
 .lintr | 3 +--
 1 file changed, 1 insertion(+), 2 deletions(-)

diff --git a/.lintr b/.lintr
index ce4840c6..12d8a253 100644
--- a/.lintr
+++ b/.lintr
@@ -9,5 +9,4 @@ linters: linters_with_tags(
     cyclocomp_linter = NULL,
     # Use minimum R declared in DESCRIPTION or fall back to current R version
     backport_linter(if (length(x <- etdev::extract_min_r_version())) x else getRversion())
-  )
-  
\ No newline at end of file
+    )

From 1ad16ef65e5ead9b1b6a96b8af9a33a3517bcd34 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Wed, 15 Feb 2023 16:46:43 +0000
Subject: [PATCH 136/828] fixed R version specification

---
 DESCRIPTION | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 46932565..7ff53e58 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -16,7 +16,7 @@ License: MIT + file LICENSE
 URL: https://github.com/epiverse-trace/bpmodels
 BugReports: https://github.com/epiverse-trace/bpmodels/issues
 Depends: 
-    R (>= 2.10)
+    R (>= 2.10.0)
 Suggests: 
     bookdown,
     covr,

From c6b457246b5c33b252e48a9a0b8e6fa1a30a936f Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 20 Feb 2023 16:47:49 +0000
Subject: [PATCH 137/828] Updated README.Rmd

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 README.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index dc29224b..2e34818b 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -58,7 +58,7 @@ The `chain_ll()` function calculates the log-likelihood of a distribution of
 chain sizes or lengths given an offspring distribution and its associated 
 parameters. 
 
-If we have observed a distribution of chains of sizes $1, 1, 4, 7$, we can 
+For example, if we have observed a distribution of chains of sizes $1, 1, 4, 7$, we can 
 calculate the log-likelihood of this observed chain by assuming the offspring 
 per generation is Poisson distributed with a mean number of $0.5$. 
 

From 5426b18f7c1eb7d5fad4512c99387f78cfca4bb3 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 20 Feb 2023 16:48:10 +0000
Subject: [PATCH 138/828] Updated README.Rmd

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 README.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index 2e34818b..8adbf79a 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -120,7 +120,7 @@ To simulate a branching process, we use the `chain_sim()` function. This
 function follows the same syntax as `chain_ll()`.
 
 Below, we are simulating $5$ chains, assuming the offspring are generated using
-a Poisson distribution with mean, `lambda = 5`. By default, `chain_sim()` 
+a Poisson distribution with mean, `lambda = 0.5`. By default, `chain_sim()` 
 returns a vector of chain sizes/lengths. However, to override that so that 
 a tree of infectees and infectors is returned, we need to specify a function 
 for the serial interval and set `tree = TRUE`.

From 37018d51afc1f0285c55feed4918b65a98c3525d Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 20 Feb 2023 16:49:14 +0000
Subject: [PATCH 139/828] Updated README.Rmd

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 README.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index 8adbf79a..a5647ad1 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -60,7 +60,7 @@ parameters.
 
 For example, if we have observed a distribution of chains of sizes $1, 1, 4, 7$, we can 
 calculate the log-likelihood of this observed chain by assuming the offspring 
-per generation is Poisson distributed with a mean number of $0.5$. 
+per generation is Poisson distributed with a mean number (which can be interpreted as the reproduction number $R$) of $0.5$. 
 
 To do this, we run 
 

From 544fd8700dbbe707b481b1ddfec324b35fa9ad7c Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 20 Feb 2023 16:49:46 +0000
Subject: [PATCH 140/828] Updated README.Rmd

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 README.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index a5647ad1..82f1f53f 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -70,7 +70,7 @@ chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
 chain_ll(x = chain_sizes, offspring = "pois", stat = "size", lambda = 0.5)
 ```
 
-The first argument of `chain_ll()` is the size (or length) distribution to 
+The first argument of `chain_ll()` is the chain size (or length, in number of generations that a chain lasted) distribution to 
 analyse. The second argument, `offspring`, specifies the offspring 
 distribution. This is given as a function used to generate random offspring. 
 It can be any probability distribution implemented in `R`, that is, one that 

From 4c141389833d01d53ac1806258ee7d65b7e56811 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 20 Feb 2023 16:51:05 +0000
Subject: [PATCH 141/828] Updated README.Rmd

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 README.Rmd | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index 82f1f53f..4ed4e76c 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -32,7 +32,10 @@ knitr::opts_chunk$set(echo = TRUE)
 ```
 
 `bpmodels` is an R package to simulate and analyse the size and length of 
-branching processes with a given offspring distribution.
+branching processes with a given offspring distribution. These models are often 
+used in infectious disease epidemiology, where the chains represent chains of
+transmission, and offspring distribution, the distribution of secondary infections
+caused by an infected individual.
 
 # Installation
 

From 48a3563dbbda3758b6aa7a53efeab1940d972a48 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 20 Feb 2023 16:51:55 +0000
Subject: [PATCH 142/828] Updated vignettes/projecting_incidence.Rmd

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 vignettes/projecting_incidence.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index e183c138..aee2f1b6 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -220,7 +220,7 @@ incidence_ts <- incidence_ts %>%
   mutate(date = index_date + days(seq(0, n() - 1))) %>%
   ungroup()
 
-## Median daily number of cases aggregated across all simulations
+# Median daily number of cases aggregated across all simulations
 median_daily_cases <- incidence_ts %>%
   group_by(day) %>%
   summarise(median_cases = median(cases)) %>%

From ad38a176be822bb4cbecc69093402304922c0289 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Tue, 21 Feb 2023 12:09:39 +0000
Subject: [PATCH 143/828] Updated vignette

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 vignettes/projecting_incidence.Rmd | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index aee2f1b6..ba331179 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -83,7 +83,8 @@ start_times <- unlist(mapply(
   days_since_index,
   covid19_sa$cases
 ))
-                       
+
+start_times
 ```
 
 
From 0956718c5fad64e6fe42efc566d68c1c28ccbce8 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 21 Feb 2023 18:50:19 +0000
Subject: [PATCH 144/828] updated README

---
 README.Rmd |  84 ++++++++++++++++++++++------------------
 README.md  | 112 +++++++++++++++++++++++++++++------------------------
 2 files changed, 107 insertions(+), 89 deletions(-)

diff --git a/README.Rmd b/README.Rmd
index 4ed4e76c..57e936ec 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -31,15 +31,15 @@ knitr::opts_chunk$set(
 knitr::opts_chunk$set(echo = TRUE)
 ```
 
-`bpmodels` is an R package to simulate and analyse the size and length of 
+_bpmodels_ is an R package to simulate and analyse the size and length of 
 branching processes with a given offspring distribution. These models are often 
 used in infectious disease epidemiology, where the chains represent chains of
-transmission, and offspring distribution, the distribution of secondary infections
-caused by an infected individual.
+transmission, and the offspring distribution represents the distribution of 
+secondary infections caused by an infected individual.
 
 # Installation
 
-The latest development version of the `bpmodels` package can be installed via
+The latest development version of the _bpmodels_ package can be installed via
 
 ```{r include=TRUE,eval=FALSE}
 devtools::install_github('epiverse-trace/bpmodels')
@@ -61,9 +61,11 @@ The `chain_ll()` function calculates the log-likelihood of a distribution of
 chain sizes or lengths given an offspring distribution and its associated 
 parameters. 
 
-For example, if we have observed a distribution of chains of sizes $1, 1, 4, 7$, we can 
+For example, if we have observed a distribution of chains of sizes 
+$1, 1, 4, 7$, we can 
 calculate the log-likelihood of this observed chain by assuming the offspring 
-per generation is Poisson distributed with a mean number (which can be interpreted as the reproduction number $R$) of $0.5$. 
+per generation is Poisson distributed with a mean number (which can 
+be interpreted as the reproduction number $\mathcal{R_0}$) of $0.5$. 
 
 To do this, we run 
 
@@ -73,7 +75,8 @@ chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
 chain_ll(x = chain_sizes, offspring = "pois", stat = "size", lambda = 0.5)
 ```
 
-The first argument of `chain_ll()` is the chain size (or length, in number of generations that a chain lasted) distribution to 
+The first argument of `chain_ll()` is the chain size (or length, in number of 
+generations that a chain lasted) distribution to 
 analyse. The second argument, `offspring`, specifies the offspring 
 distribution. This is given as a function used to generate random offspring. 
 It can be any probability distribution implemented in `R`, that is, one that 
@@ -89,22 +92,26 @@ are interpreted as parameters of the corresponding probability distribution,
 here `lambda = 0.5` as the mean of the Poisson distribution (see the `R` help 
 page for the [Poisson distribution](https://stat.ethz.ch/R-manual/R-devel/library/stats/html/Poisson.html) for more information). 
 
-# Imperfect observations
+### Imperfect observations
 
-By default, `chain_ll` assumes perfect observation, where `obs_prob = 1` 
-(See `?chain_ll`). If observations are imperfect, the `chain_ll()` function has 
-an `obs_prob` argument that can be used to determine the likelihood. In that 
-case, true chain sizes or lengths are simulated repeatedly (the number of times 
-given by the `nsim_obs` argument), and the likelihood calculated for each of 
+By default, `chain_ll()` assumes perfect observation, where `obs_prob = 1` 
+(See `?chain_ll`), meaning that all transmission events are observed and 
+recorded in the data. If observations are imperfect, `chain_ll()` provides 
+the argument, `obs_prob`, for specifying the probability of observation. 
+This probability is used to determine the likelihood of observing the specified
+chain sizes or lengths. In the case of imperfect observation, true chain sizes 
+or lengths are simulated repeatedly (the number of times given by the 
+`nsim_obs` argument), and the likelihood calculated for each of 
 these simulations. 
 
-For example, if the probability of observing each case is $0.30$, we use
+For example, if the probability of observing each case is `obs_prob = 0.30`, 
+we use
 
 ```{r}
 chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
 ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5, 
                nsim_obs = 10)
-summary(ll)
+ll
 ```
 
 This returns `10` likelihood values (because `nsim_obs = 10`), which can be 
@@ -117,6 +124,29 @@ file
 ?chain_ll
 ```
 
+### How `chain_ll()` works
+
+If the probability distribution of chain sizes or lengths has an analytical 
+solution, this will be used. `chain_ll()` currently supports the Poisson and 
+negative binomial size distribution and the Poisson and geometric length 
+distribution. 
+
+If an analytical solution does not exist, simulations are used to approximate 
+this probability distributions ([using a linear approximation to the cumulative 
+distribution](https://en.wikipedia.org/wiki/Empirical_distribution_function) 
+for unobserved sizes/lengths). In that case, an extra argument `nsim_offspring` 
+must be passed to `chain_ll()` to specify the number of simulations to be 
+used for this approximation. 
+
+For example, to get offspring drawn from a binomial distribution with 
+probability `prob = 0.5`, we run
+
+```{r}
+chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
+chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5, 
+         nsim_offspring = 100)
+```
+
 ## Simulating branching processes
 
 To simulate a branching process, we use the `chain_sim()` function. This 
@@ -134,39 +164,17 @@ chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5)
 
 ### Simulating trees
 
-To simulate a tree of branching processes, we specify the serial interval 
+To simulate a tree of transmission chains, we specify the serial interval 
 generation function and set `tree = TRUE` as follows:
 
 ```{r}
 set.seed(13)
-
 serial_interval <- function(n){rlnorm(n, meanlog = 0.58, sdlog = 1.58)}
-
 chains_df <- chain_sim(n = 5, offspring = 'pois', lambda = 0.5, stat = 'length', 
                        infinite = 100, serial = serial_interval, tree = TRUE)
-
 head(chains_df)
 ```
 
-
-# Methodology
-
-If the probability distribution of chain sizes or lengths has an analytical 
-solution, this will be used (size distribution: Poisson and negative binomial; 
-length distribution: Poisson and geometric). 
-
-If an analytical solution does not exist, simulations are used to approximate 
-this probability distributions (using a linear approximation to the cumulative 
-distribution for unobserved sizes/lengths). The argument `nsim_offspring` is 
-used to specify the number of simulations to be used for this approximation. 
-
-For example, to get offspring drawn from a binomial distribution with 
-probability `prob = 0.5`, we run
-
-```{r}
-chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5, nsim_offspring = 100)
-```
-
 ## Package vignettes
 
 Specific use cases of _bpmodels_ can be found in 
diff --git a/README.md b/README.md
index bbe9b262..dc0d058e 100644
--- a/README.md
+++ b/README.md
@@ -19,12 +19,16 @@ activity](https://img.shields.io/github/commit-activity/m/epiverse-trace/bpmodel
 ![GitHub](https://img.shields.io/github/license/epiverse-trace/bpmodels)
 <!-- badges: end -->
 
-`bpmodels` is an R package to simulate and analyse the size and length
-of branching processes with a given offspring distribution.
+*bpmodels* is an R package to simulate and analyse the size and length
+of branching processes with a given offspring distribution. These models
+are often used in infectious disease epidemiology, where the chains
+represent chains of transmission, and the offspring distribution
+represents the distribution of secondary infections caused by an
+infected individual.
 
 # Installation
 
-The latest development version of the `bpmodels` package can be
+The latest development version of the *bpmodels* package can be
 installed via
 
 ``` r
@@ -48,10 +52,11 @@ The `chain_ll()` function calculates the log-likelihood of a
 distribution of chain sizes or lengths given an offspring distribution
 and its associated parameters.
 
-If we have observed a distribution of chains of sizes $1, 1, 4, 7$, we
-can calculate the log-likelihood of this observed chain by assuming the
-offspring per generation is Poisson distributed with a mean number of
-$0.5$.
+For example, if we have observed a distribution of chains of sizes
+$1, 1, 4, 7$, we can calculate the log-likelihood of this observed chain
+by assuming the offspring per generation is Poisson distributed with a
+mean number (which can be interpreted as the reproduction number
+$\mathcal{R_0}$) of $0.5$.
 
 To do this, we run
 
@@ -62,15 +67,15 @@ chain_ll(x = chain_sizes, offspring = "pois", stat = "size", lambda = 0.5)
 #> [1] -8.607
 ```
 
-The first argument of `chain_ll()` is the size (or length) distribution
-to analyse. The second argument, `offspring`, specifies the offspring
-distribution. This is given as a function used to generate random
-offspring. It can be any probability distribution implemented in `R`,
-that is, one that has a corresponding function for generating random
-numbers beginning with the letter `r`. In the case of the example above,
-since random Poisson numbers are generated in `R` using a function
-called `rpois()`, the string to pass to the `offspring` argument is
-`"pois"`.
+The first argument of `chain_ll()` is the chain size (or length, in
+number of generations that a chain lasted) distribution to analyse. The
+second argument, `offspring`, specifies the offspring distribution. This
+is given as a function used to generate random offspring. It can be any
+probability distribution implemented in `R`, that is, one that has a
+corresponding function for generating random numbers beginning with the
+letter `r`. In the case of the example above, since random Poisson
+numbers are generated in `R` using a function called `rpois()`, the
+string to pass to the `offspring` argument is `"pois"`.
 
 The third argument, `stat`, determines whether to analyse chain sizes
 (`"size"`, the default if this argument is not specified) or lengths
@@ -81,24 +86,27 @@ distribution (see the `R` help page for the [Poisson
 distribution](https://stat.ethz.ch/R-manual/R-devel/library/stats/html/Poisson.html)
 for more information).
 
-# Imperfect observations
+### Imperfect observations
 
-By default, `chain_ll` assumes perfect observation, where `obs_prob = 1`
-(See `?chain_ll`). If observations are imperfect, the `chain_ll()`
-function has an `obs_prob` argument that can be used to determine the
-likelihood. In that case, true chain sizes or lengths are simulated
+By default, `chain_ll()` assumes perfect observation, where
+`obs_prob = 1` (See `?chain_ll`), meaning that all transmission events
+are observed and recorded in the data. If observations are imperfect,
+`chain_ll()` provides the argument, `obs_prob`, for specifying the
+probability of observation. This probability is used to determine the
+likelihood of observing the specified chain sizes or lengths. In the
+case of imperfect observation, true chain sizes or lengths are simulated
 repeatedly (the number of times given by the `nsim_obs` argument), and
 the likelihood calculated for each of these simulations.
 
-For example, if the probability of observing each case is $0.30$, we use
+For example, if the probability of observing each case is
+`obs_prob = 0.30`, we use
 
 ``` r
 chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
 ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5, 
                nsim_obs = 10)
-summary(ll)
-#>    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
-#>   -32.1   -26.5   -24.1   -24.9   -22.5   -19.1
+ll
+#>  [1] -26.54 -23.26 -24.33 -20.80 -30.76 -26.47 -23.79 -19.14 -32.09 -22.23
 ```
 
 This returns `10` likelihood values (because `nsim_obs = 10`), which can
@@ -111,13 +119,38 @@ To find out about usage of the `chain_ll()` function, you can use the
 ?chain_ll
 ```
 
+### How `chain_ll()` works
+
+If the probability distribution of chain sizes or lengths has an
+analytical solution, this will be used. `chain_ll()` currently supports
+the Poisson and negative binomial size distribution and the Poisson and
+geometric length distribution.
+
+If an analytical solution does not exist, simulations are used to
+approximate this probability distributions ([using a linear
+approximation to the cumulative
+distribution](https://en.wikipedia.org/wiki/Empirical_distribution_function)
+for unobserved sizes/lengths). In that case, an extra argument
+`nsim_offspring` must be passed to `chain_ll()` to specify the number of
+simulations to be used for this approximation.
+
+For example, to get offspring drawn from a binomial distribution with
+probability `prob = 0.5`, we run
+
+``` r
+chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
+chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5, 
+         nsim_offspring = 100)
+#> [1] -Inf
+```
+
 ## Simulating branching processes
 
 To simulate a branching process, we use the `chain_sim()` function. This
 function follows the same syntax as `chain_ll()`.
 
 Below, we are simulating $5$ chains, assuming the offspring are
-generated using a Poisson distribution with mean, `lambda = 5`. By
+generated using a Poisson distribution with mean, `lambda = 0.5`. By
 default, `chain_sim()` returns a vector of chain sizes/lengths. However,
 to override that so that a tree of infectees and infectors is returned,
 we need to specify a function for the serial interval and set
@@ -125,22 +158,19 @@ we need to specify a function for the serial interval and set
 
 ``` r
 chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5)
-#> [1] 5 1 1 1 1
+#> [1] 2 6 2 1 2
 ```
 
 ### Simulating trees
 
-To simulate a tree of branching processes, we specify the serial
+To simulate a tree of transmission chains, we specify the serial
 interval generation function and set `tree = TRUE` as follows:
 
 ``` r
 set.seed(13)
-
 serial_interval <- function(n){rlnorm(n, meanlog = 0.58, sdlog = 1.58)}
-
 chains_df <- chain_sim(n = 5, offspring = 'pois', lambda = 0.5, stat = 'length', 
                        infinite = 100, serial = serial_interval, tree = TRUE)
-
 head(chains_df)
 #>   n id ancestor generation    time
 #> 1 1  1       NA          1 0.00000
@@ -151,26 +181,6 @@ head(chains_df)
 #> 6 1  2        1          2 0.04772
 ```
 
-# Methodology
-
-If the probability distribution of chain sizes or lengths has an
-analytical solution, this will be used (size distribution: Poisson and
-negative binomial; length distribution: Poisson and geometric).
-
-If an analytical solution does not exist, simulations are used to
-approximate this probability distributions (using a linear approximation
-to the cumulative distribution for unobserved sizes/lengths). The
-argument `nsim_offspring` is used to specify the number of simulations
-to be used for this approximation.
-
-For example, to get offspring drawn from a binomial distribution with
-probability `prob = 0.5`, we run
-
-``` r
-chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5, nsim_offspring = 100)
-#> [1] -8.761
-```
-
 ## Package vignettes
 
 Specific use cases of *bpmodels* can be found in the [online

From 916fbd15fbca2b28dd4858c801da6797d0de991b Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 21 Feb 2023 19:06:25 +0000
Subject: [PATCH 145/828] updated vignette

---
 vignettes/projecting_incidence.Rmd | 34 +++++++++++++++---------------
 1 file changed, 17 insertions(+), 17 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index ba331179..5cf87c75 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -132,13 +132,9 @@ information provided above:
 
 ```{r input_prep3, message=FALSE}
 mu <- 4.7
-
-sigma <- 2.9
-
-log_sd <- sqrt(log(1 + (sigma / mu)^2)) # log standard deviation
-
-log_mean <- log((mu^2) / (sqrt(sigma^2 + mu^2))) # log mean
-
+sgma <- 2.9
+log_sd <- sqrt(log(1 + (sgma / mu)^2)) # log standard deviation
+log_mean <- log((mu^2) / (sqrt(sgma^2 + mu^2))) # log mean
 #' serial interval function
 serial_interval <- function(sample_size) {
   si <- rlnorm(sample_size, meanlog = log_mean, sdlog = log_sd)
@@ -173,8 +169,6 @@ last case, assuming that no chain would exceed `r chain_threshold`.
 
 ```{r simulations, message=FALSE}
 set.seed(1234)
-
-
 sim_chain_sizes <- lapply(
   seq_len(sim_rep),
   function(sim) {
@@ -207,7 +201,7 @@ of all the projections through time (`incidence_ts`).
 
 ```{r post_processing}
 index_date <- min(covid19_sa$date)
-
+index_date
 # Daily number of cases for each simulation
 incidence_ts <- sim_output %>%
   mutate(day = ceiling(time)) %>%
@@ -215,12 +209,15 @@ incidence_ts <- sim_output %>%
   summarise(cases = n()) %>%
   ungroup()
 
+head(incidence_ts)
+
 # Add dates
-incidence_ts <- incidence_ts %>%
+incidence_ts_by_date <- incidence_ts %>%
   group_by(sim) %>%
   mutate(date = index_date + days(seq(0, n() - 1))) %>%
   ungroup()
 
+head(incidence_ts_by_date)
 # Median daily number of cases aggregated across all simulations
 median_daily_cases <- incidence_ts %>%
   group_by(day) %>%
@@ -228,19 +225,22 @@ median_daily_cases <- incidence_ts %>%
   ungroup() %>%
   arrange(day)
 
+head(median_daily_cases)
+
 # Add dates
 median_daily_cases <- median_daily_cases %>%
   mutate(date = index_date + days(seq(0, projection_end_day))) %>%
   ungroup()
 
+head(median_daily_cases)
 ```
 
 
 ## Visualization
 
-```{r viz, fig.cap ="COVID-19 incidence projected over a two week window. The gray lines represent individual simulations, red connected dots represent the median daily cases across all simulations, and the black triangles represent the observed data.", fig.width=2.0, fig.height=1.8}
+```{r viz, fig.cap ="COVID-19 incidence projected over a two week window. The gray lines represent individual simulations, red connected dots represent the median daily cases across all simulations, and the black triangles represent the observed data.", fig.width=4.0, fig.height=3.8}
 # Visualization
-ggplot(data = incidence_ts) +
+ggplot(data = incidence_ts_by_date) +
   geom_line(aes(
     x = date,
     y = cases,
@@ -279,12 +279,12 @@ ggplot(data = incidence_ts) +
     shape = 24
   ) +
   scale_x_continuous(
-    breaks = seq(min(incidence_ts$date), max(incidence_ts$date), 10),
-    labels = seq(min(incidence_ts$date), max(incidence_ts$date), 10)
+    breaks = seq(min(incidence_ts_by_date$date), max(incidence_ts_by_date$date), 10),
+    labels = seq(min(incidence_ts_by_date$date), max(incidence_ts_by_date$date), 10)
   ) +
   scale_y_continuous(
-    breaks = seq(0, max(incidence_ts$cases) + 200, 250),
-    labels = seq(0, max(incidence_ts$cases) + 200, 250)
+    breaks = seq(0, max(incidence_ts_by_date$cases) + 200, 250),
+    labels = seq(0, max(incidence_ts_by_date$cases) + 200, 250)
   ) +
   labs(x = "Date", y = "Daily cases (median)") +
   theme_minimal() 

From 3ff710dacc144e18f40f384e8b3c576c07edca9e Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Wed, 22 Feb 2023 16:53:46 +0000
Subject: [PATCH 146/828] Updated a function doc

---
 R/likelihoods.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index db7dc194..30101fdf 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -132,8 +132,8 @@ offspring_ll <- function(x, offspring, stat, nsim_offspring = 100, ...) {
 #' @return likelihood, or vector of likelihoods (if \code{obs_prob} < 1), or
 #'  a list of individual likelihood contributions (if \code{individual=TRUE})
 #' @inheritParams chain_sim
-#' @seealso pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll
-#'   geom_length_ll offspring_ll
+#' @seealso pois_size_ll, nbinom_size_ll, gborel_size_ll, pois_length_ll,
+#'   geom_length_ll, offspring_ll
 #' @author Sebastian Funk
 #' @export
 #' @examples

From 4a8bab8ce40018a86e7f79cec383b79d2b8d589d Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 28 Feb 2023 19:42:56 +0000
Subject: [PATCH 147/828] modified the data

---
 data/covid19_sa.rda | Bin 227 -> 252 bytes
 1 file changed, 0 insertions(+), 0 deletions(-)

diff --git a/data/covid19_sa.rda b/data/covid19_sa.rda
index 43295c0cf928039ee39e9cd1d6c7adea1c3b70e1..775ad1b468a4925b77155174c9d0f3628b2a3c63 100644
GIT binary patch
literal 252
zcmV<Y00aL*T4*^jL0KkKS$Ss)umAwBf5-oONPv(5G$2F(5J0~toq#|9KmZ^B1K<z=
zumIL%G}R`ari0RG00HT!(dsmMGJz8!Xqp(FjZFqdCIrAtOeBc}01Sgb000J;b`cxM
zi!A?WmP`Qu0E37E02X)7wIT+(5rINNWZ}zbPY8-}W64Ft9Hp;BNC}v2?o%u4WFbb#
zI!y@Nz{hC<HUe=BAq<9g9f8;Zm<7jkfL<UHn4?;ysjX{ws)7kZP`Zs&yF0f+7Kf`u
z#X_-y!nfVDkh!dk1pHD}+{sV2;UatJ$H*c-F^sB7A`wz0>uQn5{x0N-aG@ab&J|!;
C8)pIl

literal 227
zcmV<903829T4*^jL0KkKS$q4essI3sf5ZRtNB{r<Fd#$#5J0~toq#|9AOHh^03omd
zwqVgFf@t!ZG-;C&=+tSLgc_%*<xfo~qIzVS7$Z!WnKWo5N{m1m02@#SfDe4gzHDd%
z6dbS+Jmxnw9KKM%8cKQ`8U7)=0KC$xX)0Lv!TN~Q(m^vp(wkF3g_ceeF%<}66GMhW
z2w)9}Tz5cF)4}q|_)1c{iH)^=?{8Mw2s~*eGaQ2<JB?aK^DwEMtOD*NsxFvij}r^v
dOmYUJb~En8TaiB!N&-KNxgwk>NIm`5RRBkATR{K-


From 29795abf79b1a5b99d8cd614e472312f1526b1e6 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 28 Feb 2023 19:43:53 +0000
Subject: [PATCH 148/828] removed dpi setting

---
 vignettes/projecting_incidence.Rmd | 3 +--
 1 file changed, 1 insertion(+), 2 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 5cf87c75..e30090fb 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -22,8 +22,7 @@ knitr::opts_chunk$set(echo = TRUE,
                       message = FALSE, 
                       warning = FALSE, 
                       collapse = TRUE,
-                      comment = "#>",
-                      dpi = 300
+                      comment = "#>"
                       )
 
 ```

From efdcffaa48b9a2d24eb756acc28b5e6699867361 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 28 Feb 2023 19:45:11 +0000
Subject: [PATCH 149/828] revised the overview

---
 vignettes/projecting_incidence.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index e30090fb..cde35519 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -33,7 +33,7 @@ Branching processes can be used to project infectious disease trends provided
 we can characterize the distribution of times between 
 successive cases (serial interval), and the distribution of secondary cases 
 produced by a single individual (offspring distribution). Such simulations can 
-be achieved in `bpmodels` with the `chain_sim()` function. @Pearson2020, and 
+be achieved in `bpmodels` with the `chain_sim()` function and @Pearson2020, and 
 @abbott2020 illustrate its application to COVID-19. 
 
 The purpose of this vignette is to use early data on COVID-19 in South Africa 

From 8d49c7cb4e017732562d14d5bc739baee4866204 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 28 Feb 2023 19:47:04 +0000
Subject: [PATCH 150/828] reworded the inputs setup section

---
 vignettes/projecting_incidence.Rmd | 45 +++++++++++++++++-------------
 1 file changed, 26 insertions(+), 19 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index cde35519..c4c85308 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -52,31 +52,38 @@ library("lubridate")
 
 ## Data
 
-We will get and clean the first $15$ days of the COVID-19 
-outbreak in South Africa to seed the simulation for this example.
-
-```{r data, message=FALSE}
-data_url <- "https://raw.githubusercontent.com/dsfsi/covid19za/master/data/covid19za_timeline_confirmed.csv" # nolint: line_length_linter. 
-
-#Read the data in using the url
-covid19_sa <- read.csv(data_url)
-
-# Subset the first 15 days and count the number of cases per date
-covid19_sa <- covid19_sa %>% 
-  select(date) %>% 
-  mutate(date = lubridate::dmy(date)) %>%
-  filter(date <= min(date) + lubridate::days(15)) %>%   
-  group_by(date) %>% 
-  summarise(cases = n()) %>%   
-  ungroup()
+Included in `bpmodels` is a cleaned time series of the first 15 days of 
+the COVID-19 outbreak in South Africa. This can be loaded into 
+memory as follows: 
+```{r}
+data('covid19_sa', package = 'bpmodels')
+```
+
+Let us examine the first 6 entries of the dataset.
+```{r}
+head(covid19_sa)
 ```
 
-## Inputs  
-Using the data above, we will set up a vector of start times for each case.
+## Setting up the inputs  
 
+### Onset times 
+`chain_sim()` requires a vector of onset times, `t0`, for each 
+chain/individual/simulation. 
+
+The `covid19_sa` dataset above is aggregated, so we will have to disaggregate
+it into a linelist with each row representing a case and their onset time. 
+
+To achieve this, we will first use the date of the index case as the reference 
+and find the difference between each date and the reference. 
 ```{r linelist_gen, message=FALSE}
 days_since_index <- as.integer(covid19_sa$date - min(covid19_sa$date))
+days_since_index
+```
 
+Using the times created above, we will then create the linelist
+by disaggregating the time series so that each case has a 
+corresponding start time.
+```{r}
 start_times <- unlist(mapply(
   function(x, y) rep(x, times = ifelse(y == 0, 1, y)),
   days_since_index,

From af0b11883fcf55f12e91e638e7f22603852bc781 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 28 Feb 2023 19:48:28 +0000
Subject: [PATCH 151/828] clarified lognormal distribution usage and adopted
 epiparameter

---
 vignettes/projecting_incidence.Rmd | 58 ++++++++++++++----------------
 1 file changed, 26 insertions(+), 32 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index c4c85308..532d317d 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -93,54 +93,48 @@ start_times <- unlist(mapply(
 start_times
 ```
 
-
-Additionally, `chain_sim()` requires the end time for the simulations and the 
-maximum size of each chain. Since each result of `chain_sim()` is stochastic,
-it is also best to run it many times. 
-
-We will specify these as follows:
-
-```{r input_prep2, message=FALSE}
-#' Date to end simulation (14 day projection in this case)
-
-projection_window <- 14 # 14 days/ 2-week ahead projection
-
-projection_end_day <- max(days_since_index) + projection_window
-
-#' Number of simulations
-sim_rep <- 100
-
-#' Maximum chain size allowed
-chain_threshold <- 1000
-
-```
-
 ### Serial interval
 
+The log-normal distribution is [commonly used in epidemiology](https://ete-online.biomedcentral.com/articles/10.1186/1742-7622-4-2) 
+to characterise quantities such as the serial interval because it tends to 
+have a large variance and can only be positive-valued. The log-normal 
+distribution is right-skewed and assumes positive real-numbered values, hence
+often models the serial interval appropriately. 
+
 In this example, we will assume based on COVID-19 literature that the 
-serial interval, $\mathcal{S}$, is log-normal distributed with parameters, 
-$\mu = 4.7$ and $\sigma = 2.9$ [@Pearson2020]. The log-normal mean, 
-$E[ \mathcal{S} ]$ and standard deviation $SD[ \mathcal{S} ]$ are 
-characterised as:
+serial interval, S, is log-normal distributed with parameters, 
+$\mu = 4.7$ and $\sigma = 2.9$ [@Pearson2020]. Note that when the distribution
+is described this way, it means $\mu$ and $\sigma$ are the expected value 
+and standard deviation of the natural logarithm of the serial interval. Hence, 
+in order to sample the untransformed serial interval with expectation/mean, 
+$E[S]$ and standard deviation, $SD [S]$, we can use the following 
+parametrisation:
 
 \begin{align}
-E[ \mathcal{S} ] &= \ln \left( \dfrac{\mu^2}{(\sqrt{\mu^2 + \sigma^2}} \right) \\
+E[S] &= \ln \left( \dfrac{\mu^2}{(\sqrt{\mu^2 + \sigma^2}} \right) \\
 
-SD [ \mathcal{S} ] &= \sqrt {\ln \left(1 + \dfrac{\sigma^2}{\mu^2} \right)}
+SD [S] &= \sqrt {\ln \left(1 + \dfrac{\sigma^2}{\mu^2} \right)}
  
 \end{align}
 
 See [Wikipedia](https://en.wikipedia.org/wiki/Log-normal_distribution) for a 
 detailed explanation of this parametrisation.
 
-The following is how we set up the serial interval function using the
-information provided above:
+The [epiparameter](https://github.com/epiverse-trace/epiparameter) R package 
+provides the function `epiparameter::lnorm_meansd2musigma()` for implementing 
+this parametrisation. It takes as inputs the mean, $\mu$ and standard 
+deviation, $\sigma$ and returns a list with the transformed mean and 
+standard deviation. Refer to `?epiparamete::lnorm_meansd2musigma` 
+for more details.
 
+Let us set up the serial interval with this information:
 ```{r input_prep3, message=FALSE}
 mu <- 4.7
 sgma <- 2.9
-log_sd <- sqrt(log(1 + (sgma / mu)^2)) # log standard deviation
-log_mean <- log((mu^2) / (sqrt(sgma^2 + mu^2))) # log mean
+
+log_mean <- epiparameter::lnorm_meansd2musigma(mu, sgma)[[1]]  # log mean
+log_sd <- epiparameter::lnorm_meansd2musigma(mu, sgma)[[2]] # log standard deviation
+
 #' serial interval function
 serial_interval <- function(sample_size) {
   si <- rlnorm(sample_size, meanlog = log_mean, sdlog = log_sd)

From 0683309485f1476c0c838b0ad86471a9f94cd2af Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 28 Feb 2023 19:51:40 +0000
Subject: [PATCH 152/828] explained negbin epidemiology application and split
 up inputs section into subsections

---
 vignettes/projecting_incidence.Rmd | 49 ++++++++++++++++++++++++------
 1 file changed, 40 insertions(+), 9 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 532d317d..9acb32df 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -144,20 +144,51 @@ serial_interval <- function(sample_size) {
 
 ### Offspring distribution
 
-We will also assume that the offspring distribution is characterised by a 
-negative binomial with $\mathcal{R} = 2.5$ [@abbott2020] and 
-$\mathcal{k} = 0.58$ [@Wang2020]. In this parameterization, $\mathcal{R}$ 
-represents the $\mathcal{R_0}$, which is defined as the average number of 
+The negative binomial distribution is commonly used in epidemiology to
+account for [individual variation in transmissibility](https://www.nature.com/articles/nature04153), also known as superspreading.
+
+For this example, we will assume that the offspring distribution is 
+characterised by a negative binomial with $R = 2.5$ [@abbott2020] and 
+$k = 0.58$ [@Wang2020]. In this parameterization, $R$ 
+represents the $R_0$, which is defined as the average number of 
 cases produced by a single individual in an entirely susceptible population. 
 The parameter $k$ represents superspreading, that is, the degree of 
 heterogeneity in transmission by single individuals.
 
-## Simulations
-To summarize the simulation set up, for each of the `r sim_rep` simulations,
-we want to project cases over a `r projection_window` day period since the 
-last case, assuming that no chain would exceed `r chain_threshold`. 
 
-### Model assumptions
+### Simulation controls
+
+`chain_sim()` also requires the end time for the simulations. For this 
+example, we will simulate outbreaks that end 14 days after the last date 
+of observations in `covid19_sa`.   
+```{r input_prep2, message=FALSE}
+#' Date to end simulation (14 day projection in this case)
+projection_window <- 14 # 14 days/ 2-week ahead projection
+projection_end_day <- max(days_since_index) + projection_window
+projection_end_day
+```
+
+`chain_sim()` is stochastic, meaning the results are different every 
+time it is run for the same set of parameters, so we will run the simulations
+many times and summarise the results. 
+
+We will, therefore, run each simulation $100$ times.
+```{r}
+#' Number of simulations
+sim_rep <- 100
+```
+
+Lastly, `chain_sim()` requires the maximum size of each chain. 
+Above this value, the simulation is cut off. If this value is 
+not specified, it assumes a value of infinity. Here, we will
+assume a maximum chain size of $1000$.
+```{r}
+#' Maximum chain size allowed
+chain_threshold <- 1000
+```
+
+
+## Modelling assumptions
 
 `chain_sim()` makes the following simplifying assumptions:
 

From c7fa79bcc5476bfdd9153569d2b7bee46b019a49 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 28 Feb 2023 19:52:15 +0000
Subject: [PATCH 153/828] Revised summary of simulation set up

---
 vignettes/projecting_incidence.Rmd | 9 +++++++++
 1 file changed, 9 insertions(+)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 9acb32df..dd7ba883 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -198,6 +198,15 @@ chain_threshold <- 1000
 1. No interventions have been implemented
 1. Population is homogeneous and well-mixed
 
+To summarise the whole set up so far, we are going to simulate 
+each chain `r sim_rep` times, projecting COVID-19 cases over
+`r projection_window` days after the first $15$ days, and 
+assuming that no outbreak size exceeds `r chain_threshold` cases. 
+
+## Running the simulations
+
+We will use the function `lapply()` to run the simulations and bind them
+by rows with `dplyr::bind_rows()`.
 ```{r simulations, message=FALSE}
 set.seed(1234)
 sim_chain_sizes <- lapply(

From c39a3f34c022f7c4b220ff6af18d3bd29aad57c0 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 28 Feb 2023 19:52:49 +0000
Subject: [PATCH 154/828] Added a chunk to examine head of simulation output

---
 vignettes/projecting_incidence.Rmd | 14 +++++++++-----
 1 file changed, 9 insertions(+), 5 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index dd7ba883..ba7380e8 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -229,19 +229,23 @@ sim_chain_sizes <- lapply(
 )
 
 sim_output <- bind_rows(sim_chain_sizes)
+```
 
+Let us view the first few rows of the simulation results.
+```{r sim_output_head}
 head(sim_output)
 ```
 
 ## Post-processing
 
-From the simulated data, we will count the median daily cases 
-(`median_daily_cases`) across all simulations and overlay that on the results 
-of all the projections through time (`incidence_ts`).
+Now, we will summarise the simulation results. 
+
+We want to plot the individual simulated daily time series and show 
+the median cases per day aggregated over all simulations.
 
+First, we will create the daily time series per simulation by
+aggregating the number of cases per day of each simulation.
 ```{r post_processing}
-index_date <- min(covid19_sa$date)
-index_date
 # Daily number of cases for each simulation
 incidence_ts <- sim_output %>%
   mutate(day = ceiling(time)) %>%

From 41193e90e989bf6fd00c576028c652930a52b38e Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 28 Feb 2023 19:53:26 +0000
Subject: [PATCH 155/828] Split up post-processing into smaller chunks

---
 vignettes/projecting_incidence.Rmd | 21 ++++++++++++++++++++-
 1 file changed, 20 insertions(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index ba7380e8..72d38183 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -254,14 +254,29 @@ incidence_ts <- sim_output %>%
   ungroup()
 
 head(incidence_ts)
+```
 
-# Add dates
+Next, we will add a date column to the results of each simulation 
+set. We will use the date of the first case in the observed data 
+as the reference start date.
+```{r}
+# Get start date from the observed data
+index_date <- min(covid19_sa$date)
+index_date
+
+# Add a dates column to each simulation result
 incidence_ts_by_date <- incidence_ts %>%
   group_by(sim) %>%
   mutate(date = index_date + days(seq(0, n() - 1))) %>%
   ungroup()
 
 head(incidence_ts_by_date)
+```
+
+
+Now we will aggregate the simulations by day and evaluate the median 
+daily cases across all simulations.
+```{r}
 # Median daily number of cases aggregated across all simulations
 median_daily_cases <- incidence_ts %>%
   group_by(day) %>%
@@ -270,7 +285,11 @@ median_daily_cases <- incidence_ts %>%
   arrange(day)
 
 head(median_daily_cases)
+```
 
+As was done for the individual simulations, we will add a date column in the
+same manner.
+```{r}
 # Add dates
 median_daily_cases <- median_daily_cases %>%
   mutate(date = index_date + days(seq(0, projection_end_day))) %>%

From 42ef4fc786daba0f692c168f4bc1eb9f8da34563 Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 28 Feb 2023 19:53:48 +0000
Subject: [PATCH 156/828] Modified the plot

---
 vignettes/projecting_incidence.Rmd | 69 +++++++++++++++++++-----------
 1 file changed, 43 insertions(+), 26 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 72d38183..db151fd4 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -301,26 +301,20 @@ head(median_daily_cases)
 
 ## Visualization
 
-```{r viz, fig.cap ="COVID-19 incidence projected over a two week window. The gray lines represent individual simulations, red connected dots represent the median daily cases across all simulations, and the black triangles represent the observed data.", fig.width=4.0, fig.height=3.8}
-# Visualization
+We will now plot the individual simulation results alongside the median
+of the aggregated results.
+```{r viz, fig.cap ="COVID-19 incidence projected over a two week window. The gray lines represent individual simulations, red connected dots represent the median daily cases across all simulations, and the black triangles represent the observed data.", fig.width=6.0, fig.height=6}
+
 ggplot(data = incidence_ts_by_date) +
-  geom_line(aes(
-    x = date,
-    y = cases,
-    group = sim
-  ),
-  color = "grey",
-  linewidth = 1.2,
-  alpha = 0.25
-  ) +
-  geom_point(
-    data = median_daily_cases,
+  geom_line(
     aes(
       x = date,
-      y = median_cases
+      y = cases,
+      group = sim
     ),
-    color = "tomato3",
-    size = 0.75
+    color = "grey",
+    linewidth = 1.2,
+    alpha = 0.25
   ) +
   geom_line(
     data = median_daily_cases,
@@ -329,7 +323,7 @@ ggplot(data = incidence_ts_by_date) +
       y = median_cases
     ),
     color = "tomato3",
-    linewidth = 0.25
+    linewidth = 1.8
   ) +
   geom_point(
     data = covid19_sa,
@@ -338,19 +332,42 @@ ggplot(data = incidence_ts_by_date) +
       y = cases
     ),
     color = "black",
-    size = 0.25,
-    shape = 24
+    size = 1.75,
+    shape = 21
+  ) +
+  geom_line(
+    data = covid19_sa,
+    aes(
+      x = date,
+      y = cases
+    ),
+    color = "black",
+    size = 1.75,
+    shape = 21
   ) +
   scale_x_continuous(
-    breaks = seq(min(incidence_ts_by_date$date), max(incidence_ts_by_date$date), 10),
-    labels = seq(min(incidence_ts_by_date$date), max(incidence_ts_by_date$date), 10)
+    breaks = seq(
+      min(incidence_ts_by_date$date),
+      max(incidence_ts_by_date$date),
+      10
+    ),
+    labels = seq(
+      min(incidence_ts_by_date$date),
+      max(incidence_ts_by_date$date),
+      10
+    )
   ) +
   scale_y_continuous(
-    breaks = seq(0, max(incidence_ts_by_date$cases) + 200, 250),
-    labels = seq(0, max(incidence_ts_by_date$cases) + 200, 250)
+    breaks = seq(
+      0,
+      max(incidence_ts_by_date$cases) + 200,
+      250
+    ),
+    labels = seq(
+      0,
+      max(incidence_ts_by_date$cases) + 200, 250
+    )
   ) +
-  labs(x = "Date", y = "Daily cases (median)") +
-  theme_minimal() 
+  labs(x = "Date", y = "Daily cases (median)")
 ```
-
 ## References

From 955eb9ca254c085501de568f954b177a1294530a Mon Sep 17 00:00:00 2001
From: James Azam <jamesazam@sun.ac.za>
Date: Tue, 28 Feb 2023 19:54:07 +0000
Subject: [PATCH 157/828] Metadata from vignette

---
 man/Meta/vignette.rds | Bin 0 -> 248 bytes
 1 file changed, 0 insertions(+), 0 deletions(-)
 create mode 100644 man/Meta/vignette.rds

diff --git a/man/Meta/vignette.rds b/man/Meta/vignette.rds
new file mode 100644
index 0000000000000000000000000000000000000000..3419d044842410f2b479d671df6848ef0b2df328
GIT binary patch
literal 248
zcmV<U00;jciwFP!0000025nKnio!4uP1CBYAP9v;Z~21$gCJhKNZEt8CEJW{cH2af
z6zR<`U!CmQvW*SQWD?$+c`rH42qBatloA?a8K;=W7z>PuBxH*F@@(`M6i%wsyHte~
zpbE(HN(8v|zQZx8j=s{hWkOou7Fb7Rwe=9-rfit5-G>4G%>;KmXt)|2{OPJP0KN_@
zL~H3U>JN=8q5oJT#VfEutH}n=poG8v8Rkc~fbz0~=Auo@>0!nXOtO_Fv~%C2>kjdL
yvwf6N9%^{%-_t)e`jWLC=KdqEm~Oa2qeaPWXmsWuJUbfXd);>-U$Myw0ssK@u6NA<

literal 0
HcmV?d00001


From db40b132798dce069a45c8377d00a67364ec425f Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 18:17:51 +0000
Subject: [PATCH 158/828] Revised section on simulating branching processes

---
 README.Rmd | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/README.Rmd b/README.Rmd
index 57e936ec..bb5143e5 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -154,9 +154,9 @@ function follows the same syntax as `chain_ll()`.
 
 Below, we are simulating $5$ chains, assuming the offspring are generated using
 a Poisson distribution with mean, `lambda = 0.5`. By default, `chain_sim()` 
-returns a vector of chain sizes/lengths. However, to override that so that 
-a tree of infectees and infectors is returned, we need to specify a function 
-for the serial interval and set `tree = TRUE`.
+returns a vector of chain sizes/lengths. If we instead want to return 
+a tree of infectees and infectors, we need to specify a function for 
+the serial interval and set `tree = TRUE` (see next section).
 
 ```{r}
 chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5)

From 5ffeeea3d4812050ba900910e4c8c9cbb432ce30 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 18:18:13 +0000
Subject: [PATCH 159/828] Changed the vignette title

---
 vignettes/projecting_incidence.Rmd | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index db151fd4..3006c56c 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -1,5 +1,5 @@
 ---
-title: "Projecting infectious disease incidence using early outbreak data"
+title: "Projecting infectious disease incidence: a COVID-19 example"
 author: "James Azam, Sebastian Funk"
 output:
   bookdown::html_vignette2:
@@ -10,7 +10,7 @@ pkgdown:
 bibliography: references.bib
 link-citations: true
 vignette: >
-  %\VignetteIndexEntry{Projecting infectious disease incidence using early outbreak data}
+  %\VignetteIndexEntry{Projecting infectious disease incidence: a COVID-19 example}
   %\VignetteEncoding{UTF-8}
   %\VignetteEngine{knitr::rmarkdown}
 editor_options: 

From 0cfec1651e530bce5f8ad7514651aaf314b2c39b Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 18:18:30 +0000
Subject: [PATCH 160/828] Revised the overview

---
 vignettes/projecting_incidence.Rmd | 5 ++---
 1 file changed, 2 insertions(+), 3 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 3006c56c..07c3bebc 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -29,8 +29,8 @@ knitr::opts_chunk$set(echo = TRUE,
 
 ## Overview
 
-Branching processes can be used to project infectious disease trends provided 
-we can characterize the distribution of times between 
+Branching processes can be used to project infectious disease trends in time 
+provided we can characterize the distribution of times between 
 successive cases (serial interval), and the distribution of secondary cases 
 produced by a single individual (offspring distribution). Such simulations can 
 be achieved in `bpmodels` with the `chain_sim()` function and @Pearson2020, and 
@@ -40,7 +40,6 @@ The purpose of this vignette is to use early data on COVID-19 in South Africa
 [@marivate2020] to illustrate how `bpmodels` can be used to forecast an 
 outbreak. 
 
-
 Let's load the required packages
 
 ```{r packages, include=TRUE}

From 668c4f3c57337cacb55b8165368279437d09dac2 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 18:19:03 +0000
Subject: [PATCH 161/828] Revised description of setup of onset times linelist

---
 vignettes/projecting_incidence.Rmd | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 07c3bebc..cf57958f 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -79,9 +79,9 @@ days_since_index <- as.integer(covid19_sa$date - min(covid19_sa$date))
 days_since_index
 ```
 
-Using the times created above, we will then create the linelist
-by disaggregating the time series so that each case has a 
-corresponding start time.
+Using the vector of start times for the time series, we will then 
+create the linelist by disaggregating the time series so 
+that each case has a corresponding start time.
 ```{r}
 start_times <- unlist(mapply(
   function(x, y) rep(x, times = ifelse(y == 0, 1, y)),

From 79b53cf1a60c8890c8d1655d7f9bb60efbe1f88a Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 18:19:38 +0000
Subject: [PATCH 162/828] Revised description of serial interval set up

---
 vignettes/projecting_incidence.Rmd | 18 ++++++++----------
 1 file changed, 8 insertions(+), 10 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index cf57958f..31d7f28d 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -94,20 +94,18 @@ start_times
 
 ### Serial interval
 
-The log-normal distribution is [commonly used in epidemiology](https://ete-online.biomedcentral.com/articles/10.1186/1742-7622-4-2) 
-to characterise quantities such as the serial interval because it tends to 
-have a large variance and can only be positive-valued. The log-normal 
-distribution is right-skewed and assumes positive real-numbered values, hence
-often models the serial interval appropriately. 
+The log-normal distribution is commonly used in epidemiology to characterise 
+quantities such as the serial interval because it has a large variance 
+and can only be positive-valued [@Nishiura2007; @Limpert2001]. 
 
 In this example, we will assume based on COVID-19 literature that the 
 serial interval, S, is log-normal distributed with parameters, 
 $\mu = 4.7$ and $\sigma = 2.9$ [@Pearson2020]. Note that when the distribution
 is described this way, it means $\mu$ and $\sigma$ are the expected value 
 and standard deviation of the natural logarithm of the serial interval. Hence, 
-in order to sample the untransformed serial interval with expectation/mean, 
-$E[S]$ and standard deviation, $SD [S]$, we can use the following 
-parametrisation:
+in order to sample the "back-transformed" measured serial interval with 
+expectation/mean, $E[S]$ and standard deviation, $SD [S]$, 
+we can use the following parametrisation:
 
 \begin{align}
 E[S] &= \ln \left( \dfrac{\mu^2}{(\sqrt{\mu^2 + \sigma^2}} \right) \\
@@ -123,10 +121,10 @@ The [epiparameter](https://github.com/epiverse-trace/epiparameter) R package
 provides the function `epiparameter::lnorm_meansd2musigma()` for implementing 
 this parametrisation. It takes as inputs the mean, $\mu$ and standard 
 deviation, $\sigma$ and returns a list with the transformed mean and 
-standard deviation. Refer to `?epiparamete::lnorm_meansd2musigma` 
+standard deviation. Refer to `?epiparameter::lnorm_meansd2musigma` 
 for more details.
 
-Let us set up the serial interval with this information:
+Let us set up the serial interval function with the appropriate inputs:
 ```{r input_prep3, message=FALSE}
 mu <- 4.7
 sgma <- 2.9

From 0dff3359aa27bbab70141fe86ff4425f31673107 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 18:20:25 +0000
Subject: [PATCH 163/828] Replaced hyperlinks with citations

---
 vignettes/projecting_incidence.Rmd | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 31d7f28d..10174049 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -142,7 +142,8 @@ serial_interval <- function(sample_size) {
 ### Offspring distribution
 
 The negative binomial distribution is commonly used in epidemiology to
-account for [individual variation in transmissibility](https://www.nature.com/articles/nature04153), also known as superspreading.
+account for individual variation in transmissibility, 
+also known as superspreading [@Lloyd-Smith2005a].
 
 For this example, we will assume that the offspring distribution is 
 characterised by a negative binomial with $R = 2.5$ [@abbott2020] and 

From 1c2b06e763ee630d10d2e2dc5f988ea7cfc1d580 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 18:20:48 +0000
Subject: [PATCH 164/828] Made individual trajectories thinner

---
 vignettes/projecting_incidence.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 10174049..e9178301 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -311,7 +311,7 @@ ggplot(data = incidence_ts_by_date) +
       group = sim
     ),
     color = "grey",
-    linewidth = 1.2,
+    linewidth = 0.2,
     alpha = 0.25
   ) +
   geom_line(

From 840e6d8fdfdcb7852df9af50cf38671c1aebd45b Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 18:21:10 +0000
Subject: [PATCH 165/828] Styled plotting code

---
 vignettes/projecting_incidence.Rmd | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index e9178301..52a02b95 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -363,7 +363,8 @@ ggplot(data = incidence_ts_by_date) +
     ),
     labels = seq(
       0,
-      max(incidence_ts_by_date$cases) + 200, 250
+      max(incidence_ts_by_date$cases) + 200, 
+      250
     )
   ) +
   labs(x = "Date", y = "Daily cases (median)")

From 64362e5b433c61f778d76719474bd7d03116c7c7 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 18:21:43 +0000
Subject: [PATCH 166/828] Added new references to bibtex library

---
 vignettes/references.bib | 54 ++++++++++++++++++++++++++++++++++++++--
 1 file changed, 52 insertions(+), 2 deletions(-)

diff --git a/vignettes/references.bib b/vignettes/references.bib
index 7cec9b45..e2fcc49f 100644
--- a/vignettes/references.bib
+++ b/vignettes/references.bib
@@ -21,7 +21,6 @@ @article{Alene2021
   volume    = {21},
   year      = {2021}
 }
-
 @article{Allen2012,
   abstract = {The basic reproduction number, ℛ(0), one of the most well-known thresholds in deterministic epidemic theory, predicts a disease outbreak if ℛ(0)>1. In stochastic epidemic theory, there are also thresholds that predict a major outbreak. In the case of a single infectious group, if ℛ(0)>1 and i infectious individuals are introduced into a susceptible population, then the probability of a major outbreak is approximately 1-(1/ℛ(0))( i ). With multiple infectious groups from which the disease could emerge, this result no longer holds. Stochastic thresholds for multiple groups depend on the number of individuals within each group, i ( j ), j=1, {\ldots}, n, and on the probability of disease extinction for each group, q ( j ). It follows from multitype branching processes that the probability of a major outbreak is approximately [Formula: see text]. In this investigation, we summarize some of the deterministic and stochastic threshold theory, illustrate how to calculate the stochastic thresholds, and derive some new relationships between the deterministic and stochastic thresholds.},
   author   = {Allen, Linda J.S. and Lahodny, Glenn E.},
@@ -35,7 +34,6 @@ @article{Allen2012
   volume   = {6},
   year     = {2012}
 }
-
 @article{Blumberg2013,
   abstract  = {Many diseases exhibit subcritical transmission (i.e. 0<R0<1) so that infections occur as self-limited 'stuttering chains'. Given an ensemble of stuttering chains, information about the number of cases in each chain can be used to infer R0, which is of crucial importance for monitoring the risk that a disease will emerge to establish endemic circulation. However, the challenge of imperfect case detection has led authors to adopt a variety of work-around measures when inferring R0, such as discarding data on isolated cases or aggregating intermediate-sized chains together. Each of these methods has the potential to introduce bias, but a quantitative comparison of these approaches has not been reported. By adapting a model based on a negative binomial offspring distribution that permits a variable degree of transmission heterogeneity, we present a unified analysis of existing R0 estimation methods. Simulation studies show that the degree of transmission heterogeneity, when improperly modeled, can significantly impact the bias of R0 estimation methods designed for imperfect observation. These studies also highlight the importance of isolated cases in assessing whether an estimation technique is consistent with observed data. Analysis of data from measles outbreaks shows that likelihood scores are highest for models that allow a flexible degree of transmission heterogeneity. Aggregating intermediate sized chains often has similar performance to analyzing a complete chain size distribution. However, truncating isolated cases is beneficial only when surveillance systems clearly favor full observation of large chains but not small chains. Meanwhile, if data on the type and proportion of cases that are unobserved were known, we demonstrate that maximum likelihood inference of R0 could be adjusted accordingly. This motivates the need for future empirical and theoretical work to quantify observation error and incorporate relevant mechanisms into stuttering chain models used to estimate transmission parameters. {\textcopyright} 2013 Elsevier B.V.},
   author    = {Blumberg, S. and Lloyd-Smith, J. O.},
@@ -77,6 +75,7 @@ @article{Chen2022
   volume    = {13},
   year      = {2022}
 }
+
 @article{Farrington1999,
   abstract = {We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.},
   author   = {Farrington, C. P. and Grant, A. D.},
@@ -90,6 +89,7 @@ @article{Farrington1999
   volume   = {36},
   year     = {1999}
 }
+
 @article{Farrington1999a,
   abstract = {We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.},
   author   = {Farrington, C. P. and Grant, A. D.},
@@ -184,12 +184,48 @@ @article{Lehtinen2021
   volume   = {18},
   year     = {2021}
 }
+@article{Limpert2001,
+  abstract = {On the charms of statistics, and how mechanical models resembling gambling machines offer a link to a handy way to characterize log-normal distributions, which can provide deeper insight into variability and probability - Normal or log-normal: That is the question.},
+  author   = {Limpert, Eckhard and Stahel, Werner A. and Abbt, Markus},
+  doi      = {10.1641/0006-3568(2001)051[0341:LNDATS]2.0.CO;2},
+  issn     = {00063568},
+  journal  = {BioScience},
+  number   = {5},
+  pages    = {341--352},
+  title    = {{Log-normal distributions across the sciences: Keys and clues}},
+  volume   = {51},
+  year     = {2001}
+}
+@article{Lloyd-Smith2005a,
+  abstract = {Population-level analyses often use average quantities to describe heterogeneous systems, particularly when variation does not arise from identifiable groups. A prominent example, central to our current understanding of epidemic spread, is the basic reproductive number, R0, which is defined as the mean number of infections caused by an infected individual in a susceptible population. Population estimates of R0 can obscure considerable individual variation in infectiousness, as highlighted during the global emergence of severe acute respiratory syndrome (SARS) by numerous 'superspreading events' in which certain individuals infected unusually large numbers of secondary cases. For diseases transmitted by non-sexual direct contacts, such as SARS or smallpox, individual variation is difficult to measure empirically, and thus its importance for outbreak dynamics has been unclear. Here we present an integrated theoretical and statistical analysis of the influence of individual variation in infectiousness on disease emergence. Using contact tracing data from eight directly transmitted diseases, we show that the distribution of individual infectiousness around R0 is often highly skewed. Model predictions accounting for this variation differ sharply from average-based approaches, with disease extinction more likely and outbreaks rarer but more explosive. Using these models, we explore implications for outbreak control, showing that individual-specific control measures outperform population-wide measures. Moreover, the dramatic improvements achieved through targeted control policies emphasize the need to identify predictive correlates of higher infectiousness. Our findings indicate that superspreading is a normal feature of disease spread, and to frame ongoing discussion we propose a rigorous definition for superspreading events and a method to predict their frequency. {\textcopyright} 2005 Nature Publishing Group.},
+  author   = {Lloyd-Smith, J. O. and Schreiber, S. J. and Kopp, P. E. and Getz, W. M.},
+  doi      = {10.1038/nature04153},
+  issn     = {14764687},
+  journal  = {Nature},
+  number   = {7066},
+  pages    = {355--359},
+  pmid     = {16292310},
+  title    = {{Superspreading and the effect of individual variation on disease emergence}},
+  volume   = {438},
+  year     = {2005}
+}
 @article{marivate2020,
   title   = {Use of available data to inform the COVID-19 outbreak in South Africa: a case study},
   author  = {Marivate, Vukosi and Combrink, Herkulaas MvE},
   journal = {arXiv preprint arXiv:2004.04813},
   year    = {2020}
 }
+@article{Nishiura2007,
+  abstract = {The incubation period of infectious diseases, the time from infection with a microorganism to onset of disease, is directly relevant to prevention and control. Since explicit models of the incubation period enhance our understanding of the spread of disease, previous classic studies were revisited, focusing on the modeling methods employed and paying particular attention to relatively unknown historical efforts. The earliest study on the incubation period of pandemic influenza was published in 1919, providing estimates of the incubation period of Spanish flu using the daily incidence on ships departing from several ports in Australia. Although the study explicitly dealt with an unknown time of exposure, the assumed periods of exposure, which had an equal probability of infection, were too long, and thus, likely resulted in slight underestimates of the incubation period. After the suggestion that the incubation period follows lognormal distribution, Japanese epidemiologists extended this assumption to estimates of the time of exposure during a point source outbreak. Although the reason why the incubation period of acute infectious diseases tends to reveal a right-skewed distribution has been explored several times, the validity of the lognormal assumption is yet to be fully clarified. At present, various different distributions are assumed, and the lack of validity in assuming lognormal distribution is particularly apparent in the case of slowly progressing diseases. The present paper indicates that (1) analysis using well-defined short periods of exposure with appropriate statistical methods is critical when the exact time of exposure is unknown, and (2) when assuming a specific distribution for the incubation period, comparisons using different distributions are needed in addition to estimations using different datasets, analyses of the determinants of incubation period, and an understanding of the underlying disease mechanisms. {\textcopyright} 2007 Nishiura; licensee BioMed Central Ltd.},
+  author   = {Nishiura, Hiroshi},
+  doi      = {10.1186/1742-7622-4-2},
+  issn     = {17427622},
+  journal  = {Emerging Themes in Epidemiology},
+  pages    = {1--12},
+  title    = {{Early efforts in modeling the incubation period of infectious diseases with an acute course of illness}},
+  volume   = {4},
+  year     = {2007}
+}
 @article{Nishiura2012,
   abstract  = {Use of the final size distribution of minor outbreaks for the estimation of the reproduction numbers of supercritical epidemic processes has yet to be considered. We used a branching process model to derive the final size distribution of minor outbreaks, assuming a reproduction number above unity, and applying the method to final size data for pneumonic plague. Pneumonic plague is a rare disease with only one documented major epidemic in a spatially limited setting. Because the final size distribution of a minor outbreak needs to be normalized by the probability of extinction, we assume that the dispersion parameter (k) of the negative-binomial offspring distribution is known, and examine the sensitivity of the reproduction number to variation in dispersion. Assuming a geometric offspring distribution with k=1, the reproduction number was estimated at 1.16 (95% confidence interval: 0.97-1.38). When less dispersed with k=2, the maximum likelihood estimate of the reproduction number was 1.14. These estimates agreed with those published from transmission network analysis, indicating that the human-to-human transmission potential of the pneumonic plague is not very high. Given only minor outbreaks, transmission potential is not sufficiently assessed by directly counting the number of offspring. Since the absence of a major epidemic does not guarantee a subcritical process, the proposed method allows us to conservatively regard epidemic data from minor outbreaks as supercritical, and yield estimates of threshold values above unity. {\textcopyright} 2011.},
   author    = {Nishiura, Hiroshi and Yan, Ping and Sleeman, Candace K. and Mode, Charles J.},
@@ -246,3 +282,17 @@ @article{Wang2020
   volume    = {11},
   year      = {2020}
 }
+@article{Yadav2021,
+  abstract = {In this review, we have discussed the different statistical modeling and prediction techniques for various infectious diseases including the recent pandemic of COVID-19. The distribution fitting, time series modeling along with predictive monitoring approaches, and epidemiological modeling are illustrated. When the epidemiology data is sufficient to fit with the required sample size, the normal distribution in general or other theoretical distributions are fitted and the best-fitted distribution is chosen for the prediction of the spread of the disease. The infectious diseases develop over time and we have data on the single variable that is the number of infections that happened, therefore, time series models are fitted and the prediction is done based on the best-fitted model. Monitoring approaches may also be applied to time series models which could estimate the parameters more precisely. In epidemiological modeling, more biological parameters are incorporated in the models and the forecasting of the disease spread is carried out. We came up with, how to improve the existing modeling methods, the use of fuzzy variables, and detection of fraud in the available data. Ultimately, we have reviewed the results of recent statistical modeling efforts to predict the course of COVID-19 spread.},
+  author   = {Yadav, Subhash Kumar and Akhter, Yusuf},
+  doi      = {10.3389/fpubh.2021.645405},
+  issn     = {22962565},
+  journal  = {Frontiers in Public Health},
+  keywords = {distribution fitting models,epidemiological models of disease,estimation,parameters,prediction,time series regression models},
+  number   = {June},
+  pages    = {1--27},
+  pmid     = {34222166},
+  title    = {{Statistical Modeling for the Prediction of Infectious Disease Dissemination With Special Reference to COVID-19 Spread}},
+  volume   = {9},
+  year     = {2021}
+}

From 778120397f3ac726df69eaccc5bdb816b474fb27 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:03:57 +0000
Subject: [PATCH 167/828] Edited chain_ll's @seealso tags

---
 man/chain_ll.Rd | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/man/chain_ll.Rd b/man/chain_ll.Rd
index 8505e6af..63d048d9 100644
--- a/man/chain_ll.Rd
+++ b/man/chain_ll.Rd
@@ -51,8 +51,8 @@ chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
 chain_ll(chain_sizes, "pois", "size", lambda = 0.5)
 }
 \seealso{
-pois_size_ll nbinom_size_ll gborel_size_ll pois_length_ll
-geom_length_ll offspring_ll
+pois_size_ll, nbinom_size_ll, gborel_size_ll, pois_length_ll,
+geom_length_ll, offspring_ll
 }
 \author{
 Sebastian Funk

From bed2eb943b1f07b6b773e0a4a1776cb18288e764 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:04:23 +0000
Subject: [PATCH 168/828] Increased R dependency to >= 3.0

---
 DESCRIPTION | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 7ff53e58..f3a5ef90 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -16,7 +16,7 @@ License: MIT + file LICENSE
 URL: https://github.com/epiverse-trace/bpmodels
 BugReports: https://github.com/epiverse-trace/bpmodels/issues
 Depends: 
-    R (>= 2.10.0)
+    R (>= 3.0.0)
 Suggests: 
     bookdown,
     covr,

From 8da0cc9c0db255cfa9ceb918337dc63a62b13e84 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:04:43 +0000
Subject: [PATCH 169/828] Added epiparameter to the dependencies

---
 DESCRIPTION | 1 +
 1 file changed, 1 insertion(+)

diff --git a/DESCRIPTION b/DESCRIPTION
index f3a5ef90..0d11a90f 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -21,6 +21,7 @@ Suggests:
     bookdown,
     covr,
     dplyr,
+    epiparameter,
     ggplot2,
     knitr,
     lubridate,

From e8649a7dc98754ec8337cbf9adfb078b23595333 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:05:16 +0000
Subject: [PATCH 170/828] Removed namespacing of epiparameter

---
 vignettes/projecting_incidence.Rmd | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 52a02b95..2d630051 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -129,8 +129,8 @@ Let us set up the serial interval function with the appropriate inputs:
 mu <- 4.7
 sgma <- 2.9
 
-log_mean <- epiparameter::lnorm_meansd2musigma(mu, sgma)[[1]]  # log mean
-log_sd <- epiparameter::lnorm_meansd2musigma(mu, sgma)[[2]] # log standard deviation
+log_mean <- lnorm_meansd2musigma(mu, sgma)[[1]]  # log mean
+log_sd <- lnorm_meansd2musigma(mu, sgma)[[2]] # log sd
 
 #' serial interval function
 serial_interval <- function(sample_size) {

From 4d5eb502d4b653e825c66eb542aab758ca296872 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:05:34 +0000
Subject: [PATCH 171/828] Added epiparameter to loaded packages

---
 vignettes/projecting_incidence.Rmd | 1 +
 1 file changed, 1 insertion(+)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 2d630051..e121f982 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -47,6 +47,7 @@ library("bpmodels")
 library("dplyr")
 library("ggplot2")
 library("lubridate")
+library("epiparameter")
 ```
 
 ## Data

From 9f11dfeb04991cee8812932d893eeae1e7811dfc Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:51:47 +0000
Subject: [PATCH 172/828] Regenerated docs for chain_sim and chain_ll

---
 man/chain_ll.Rd  | 2 +-
 man/chain_sim.Rd | 6 +++---
 2 files changed, 4 insertions(+), 4 deletions(-)

diff --git a/man/chain_ll.Rd b/man/chain_ll.Rd
index 63d048d9..cbd5e549 100644
--- a/man/chain_ll.Rd
+++ b/man/chain_ll.Rd
@@ -10,7 +10,7 @@ chain_ll(
   stat = c("size", "length"),
   obs_prob = 1,
   infinite = Inf,
-  exclude = c(),
+  exclude = NULL,
   individual = FALSE,
   nsim_obs,
   ...
diff --git a/man/chain_sim.Rd b/man/chain_sim.Rd
index 5dd691a6..ae3ae0c8 100644
--- a/man/chain_sim.Rd
+++ b/man/chain_sim.Rd
@@ -116,10 +116,10 @@ where \code{...} are the other arguments to \code{chain_sim()}.
 \examples{
 # Specifying no `serial` and `tree == FALSE` (default) returns a vector
 set.seed(123)
-chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5, 
+chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5,
 tree = FALSE)
 
-# Specifying `serial` without specifying `tree` will set `tree = TRUE` 
+# Specifying `serial` without specifying `tree` will set `tree = TRUE`
 # internally.
 
 # We'll first define the serial function
@@ -128,7 +128,7 @@ serial_interval <- function(n) {
   rlnorm(n, meanlog = 0.58, sdlog = 1.58)
 }
 chain_sim(
-  n = 5, offspring = "pois", lambda = 0.5, stat = "length", 
+  n = 5, offspring = "pois", lambda = 0.5, stat = "length",
   infinite = 100,
   serial = serial_interval
 )

From 19fa1cf6ef53509901a211b144f61febe03c9dc3 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:52:41 +0000
Subject: [PATCH 173/828] lintr: changed expect_equal to expect_identical

---
 tests/testthat/tests-borel.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tests/testthat/tests-borel.r b/tests/testthat/tests-borel.r
index e45fa23a..673b4f12 100644
--- a/tests/testthat/tests-borel.r
+++ b/tests/testthat/tests-borel.r
@@ -2,7 +2,7 @@ context("The Borel distribution is implemented")
 
 test_that("We can calculate probabilities and sample", {
   expect_gt(dborel(1, 0.5), 0)
-  expect_equal(dborel(1, 0.5, log = TRUE), -0.5)
+  expect_identical(dborel(1, 0.5, log = TRUE), -0.5)
   expect_length(rborel(2, 0.9), 2)
 })
 

From 4e6c4f4aa239b09d3b9bc86ee2cc88393c013ded Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:53:13 +0000
Subject: [PATCH 174/828] lintr: removed loading of bpmodels from testthat

---
 tests/testthat.R | 1 -
 1 file changed, 1 deletion(-)

diff --git a/tests/testthat.R b/tests/testthat.R
index b9a1b439..2d4e1df3 100644
--- a/tests/testthat.R
+++ b/tests/testthat.R
@@ -1,4 +1,3 @@
 library(testthat)
-library(bpmodels)
 
 test_check("bpmodels")

From d961ac72247092e9d737cf56324a10b01641ee64 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:54:04 +0000
Subject: [PATCH 175/828] lintr: replaced expect_true of data.frame output to
 expect_s3_class

---
 tests/testthat/tests-sim.r | 29 ++++++++++++++++-------------
 1 file changed, 16 insertions(+), 13 deletions(-)

diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index ed8f31c7..b14256cc 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -3,10 +3,15 @@ context("Simulating from a branching process model")
 test_that("Chains can be simulated", {
   expect_length(chain_sim(n = 2, "pois", lambda = 0.5), 2)
   expect_length(chain_sim(n = 10, "pois", "length", lambda = 0.9), 10)
-  expect_true(is.data.frame(chain_sim(
-    n = 10, "pois", lambda = 2, tree = TRUE,
-    infinite = 10
-  )))
+  expect_s3_class(
+    chain_sim(n = 10,
+              "pois",
+              lambda = 2,
+              tree = TRUE,
+              infinite = 10
+              ),
+    "data.frame"
+    )
   expect_false(any(is.finite(chain_sim(
     n = 2, "pois", "length", lambda = 0.5,
     infinite = 1
@@ -57,30 +62,28 @@ context("Simulating from a branching process model
 
 
 test_that("Chains can be simulated", {
-  expect_true(
-    is.data.frame(
+  expect_s3_class(
       chain_sim_susc(
         "pois",
         mn_offspring = 2,
         serial = function(x) 3,
         pop = 100
-      )
-    )
+      ),
+      "data.frame"
   )
 
-  expect_true(
-    is.data.frame(
+  expect_s3_class(
       chain_sim_susc(
         "nbinom",
         mn_offspring = 2,
         disp_offspring = 1.5,
         serial = function(x) 3,
         pop = 100
-      )
-    )
+      ),
+      "data.frame"
   )
 
-  expect_true(
+  expect_equal(
     nrow(
       chain_sim_susc(
         "pois",

From a6a3a7e60a965fc21c76c25226f3f06d68b03a2b Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:54:55 +0000
Subject: [PATCH 176/828] lintr: replaced expect_true tests with expect_equal

---
 tests/testthat/tests-sim.r | 13 ++++++++-----
 1 file changed, 8 insertions(+), 5 deletions(-)

diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index b14256cc..eca94b61 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -91,10 +91,11 @@ test_that("Chains can be simulated", {
         serial = function(x) 3,
         pop = 1
       )
-    ) == 1
+    ), 
+    1
   )
 
-  expect_true(
+  expect_equal(
     nrow(
       chain_sim_susc(
         "pois",
@@ -103,10 +104,11 @@ test_that("Chains can be simulated", {
         serial = function(x) 3,
         pop = 999
       )
-    ) == 1
+    ),
+    1
   )
 
-  expect_true(
+  expect_equal(
     nrow(
       chain_sim_susc(
         "pois",
@@ -115,7 +117,8 @@ test_that("Chains can be simulated", {
         pop = 999,
         initial_immune = 998
       )
-    ) == 1
+    ),
+    1
   )
 })
 

From d1bbd1420bc319d57c06903793c62ea594526059 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:55:49 +0000
Subject: [PATCH 177/828] lintr:  fixed minor styling issues

---
 tests/testthat/tests-ll.r  | 2 +-
 tests/testthat/tests-sim.r | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/tests/testthat/tests-ll.r b/tests/testthat/tests-ll.r
index 0c52d3f4..ca4ef9be 100644
--- a/tests/testthat/tests-ll.r
+++ b/tests/testthat/tests-ll.r
@@ -21,7 +21,7 @@ test_that("Likelihoods can be calculated", {
   expect_lt(chain_ll(chains, "binom", "size", size = 1, prob = 0.5), 0)
 })
 
-test_that("Analytical size/length distributions are implemented", {
+test_that("Analytical size or length distributions are implemented", {
   expect_true(all(pois_size_ll(chains, lambda = 0.5) < 0))
   expect_true(all(nbinom_size_ll(chains, mu = 0.5, size = 0.2) < 0))
   expect_true(all(nbinom_size_ll(chains, prob = 0.5, size = 0.2) < 0))
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index eca94b61..ec212d93 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -31,7 +31,7 @@ test_that("Errors are thrown", {
   )
   expect_error(chain_sim(
     n = 2, offspring = "pois", "size", lambda = 0.9,
-    serial = c(1:2), "must be a function"
+    serial = c(1, 2), "must be a function"
   ))
   expect_error(
     chain_sim(n = 2, offspring = c(1, 2), "length", lambda = 0.9),

From b14b25c80b78900f47b39c418ea97cbcc482eed8 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:56:25 +0000
Subject: [PATCH 178/828] lintr: replaced explicit paths with file.path

---
 README.Rmd | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/README.Rmd b/README.Rmd
index bb5143e5..990dd02b 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -8,7 +8,7 @@ link-citations: true
 knitr::opts_chunk$set(
   collapse = TRUE,
   comment = "#>",
-  fig.path = "man/figures/README-",
+  fig.path = file.path("man", "figures", "README-"),
   out.width = "100%"
 )
 ```
@@ -42,13 +42,13 @@ secondary infections caused by an infected individual.
 The latest development version of the _bpmodels_ package can be installed via
 
 ```{r include=TRUE,eval=FALSE}
-devtools::install_github('epiverse-trace/bpmodels')
+devtools::install_github(file.path("epiverse-trace", "bpmodels"))
 ```
 
 To load the package, use
 
 ```{r eval=TRUE}
-library('bpmodels')
+library("bpmodels")
 ```
 
 # Quick start

From d91301a99c0ef458784aecec5e490f14fc871259 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:56:54 +0000
Subject: [PATCH 179/828] removed trailing white space

---
 R/simulate.r | 36 ++++++++++++++++++------------------
 1 file changed, 18 insertions(+), 18 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 3e8cc099..477d8f74 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -42,8 +42,8 @@
 #' `chain_sim()` either returns a vector or a data.frame. The output is
 #' either a vector if `serial` is not provided, which automatically sets
 #' \code{tree = FALSE}, or a `data.frame`, which means that `serial` was
-#' provided as a function. When `serial` is provided, it means 
-#' \code{tree = TRUE} automatically. However, setting \code{tree = TRUE} 
+#' provided as a function. When `serial` is provided, it means
+#' \code{tree = TRUE} automatically. However, setting \code{tree = TRUE}
 #' would require providing a function for `serial`.
 #'
 #' # The serial interval (`serial`):
@@ -51,12 +51,12 @@
 #' ## Assumptions/disambiguation
 #'
 #' In epidemiology, the generation interval is the duration between successive
-#' infectious events in a chain of transmission. Similarly, the serial 
-#' interval is the duration between observed symptom onset times between 
-#' successive cases in a transmission chain. The generation interval is 
-#' often hard to observe because exact times of infection are hard to 
-#' measure hence, the serial interval is often used instead. Here, we 
-#' use the serial interval to represent what would normally be called the 
+#' infectious events in a chain of transmission. Similarly, the serial
+#' interval is the duration between observed symptom onset times between
+#' successive cases in a transmission chain. The generation interval is
+#' often hard to observe because exact times of infection are hard to
+#' measure hence, the serial interval is often used instead. Here, we
+#' use the serial interval to represent what would normally be called the
 #' generation interval, that is, the time between successive cases.
 #'
 #' ## Specifying `serial` in `chain_sim()`
@@ -66,29 +66,29 @@
 #' with one argument.
 #'
 #' If `serial` is specified, `chain_sim()` returns times of
-#' infection as a column in the output. Moreover, specifying a function 
-#' for `serial` implies \code{tree = TRUE} and a tree of 
+#' infection as a column in the output. Moreover, specifying a function
+#' for `serial` implies \code{tree = TRUE} and a tree of
 #' infectors (`ancestor`) and infectees (`id`) will be generated in the output.
 #'
 #' For example, assuming we want to specify the serial interval
-#' generator as a random log-normally distributed variable with 
-#' `meanlog = 0.58` and `sdlog = 1.58`, we could define a named function, 
-#' let's call it "serial_interval", with only one argument representing the 
-#' number of serial intervals to sample: 
+#' generator as a random log-normally distributed variable with
+#' `meanlog = 0.58` and `sdlog = 1.58`, we could define a named function,
+#' let's call it "serial_interval", with only one argument representing the
+#' number of serial intervals to sample:
 #' \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
 #' and assign the name of the function to serial in `chain_sim()` like so
 #' \code{chain_sim(..., serial = serial_interval)},
 #' where `...` are the other arguments to `chain_sim()`. Alternatively, we
-#' could assign an anonymous function to serial in the `chain_sim()` call 
+#' could assign an anonymous function to serial in the `chain_sim()` call
 #' like so \code{chain_sim(..., serial = function(n){rlnorm(n, 0.58, 1.38)})},
 #' where `...` are the other arguments to `chain_sim()`.
 #' @examples
 #' # Specifying no `serial` and `tree == FALSE` (default) returns a vector
 #' set.seed(123)
-#' chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5, 
+#' chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5,
 #' tree = FALSE)
 #'
-#' # Specifying `serial` without specifying `tree` will set `tree = TRUE` 
+#' # Specifying `serial` without specifying `tree` will set `tree = TRUE`
 #' # internally.
 #'
 #' # We'll first define the serial function
@@ -97,7 +97,7 @@
 #'   rlnorm(n, meanlog = 0.58, sdlog = 1.58)
 #' }
 #' chain_sim(
-#'   n = 5, offspring = "pois", lambda = 0.5, stat = "length", 
+#'   n = 5, offspring = "pois", lambda = 0.5, stat = "length",
 #'   infinite = 100,
 #'   serial = serial_interval
 #' )

From 6132ac9819e7941fa4c3d46856d97a127bfb521b Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:57:10 +0000
Subject: [PATCH 180/828] lintr: removed commented out code

---
 R/likelihoods.R | 1 -
 1 file changed, 1 deletion(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 30101fdf..8dd32d9c 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -81,7 +81,6 @@ pois_length_ll <- function(x, lambda) {
 #' @keywords internal
 geom_length_ll <- function(x, prob) {
   lambda <- 1 / prob
-  ## G(k) - G(k - 1)
   GkmGkm1 <- (1 - lambda^(x)) / (1 - lambda^(x + 1)) -
     (1 - lambda^(x - 1)) / (1 - lambda^(x))
 

From a28812017b381cc4d8c93104ccab405480f2edf0 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:57:39 +0000
Subject: [PATCH 181/828] lintr: replaced c() with vector()

---
 R/likelihoods.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 8dd32d9c..f18f95da 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -176,7 +176,7 @@ chain_ll <- function(x, offspring, stat = c("size", "length"), obs_prob = 1,
   }
 
   ## get likelihood function as given by `offspring` and `stat``
-  likelihoods <- c()
+  likelihoods <- vector(mode = "numeric")
   ll_func <- paste(offspring, stat, "ll", sep = "_")
   pars <- as.list(unlist(list(...))) ## converts vectors to lists
 

From dcf2a26f98f948fdcef044f270ea96d9bf31443c Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:58:05 +0000
Subject: [PATCH 182/828] lintr: replaced arg = c() with arg = NULL

---
 R/likelihoods.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index f18f95da..48288152 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -139,7 +139,7 @@ offspring_ll <- function(x, offspring, stat, nsim_offspring = 100, ...) {
 #' chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
 #' chain_ll(chain_sizes, "pois", "size", lambda = 0.5)
 chain_ll <- function(x, offspring, stat = c("size", "length"), obs_prob = 1,
-                     infinite = Inf, exclude = c(), individual = FALSE,
+                     infinite = Inf, exclude = NULL, individual = FALSE,
                      nsim_obs, ...) {
   stat <- match.arg(stat)
 

From 2e3f2e8e194f289cb92a0605c7348925362de110 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:58:35 +0000
Subject: [PATCH 183/828] lintr: styled README to break long lines

---
 README.Rmd | 14 +++++++++-----
 1 file changed, 9 insertions(+), 5 deletions(-)

diff --git a/README.Rmd b/README.Rmd
index 990dd02b..3254548c 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -109,7 +109,7 @@ we use
 
 ```{r}
 chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
-ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5, 
+ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5,
                nsim_obs = 10)
 ll
 ```
@@ -143,7 +143,7 @@ probability `prob = 0.5`, we run
 
 ```{r}
 chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
-chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5, 
+chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5,
          nsim_offspring = 100)
 ```
 
@@ -169,9 +169,13 @@ generation function and set `tree = TRUE` as follows:
 
 ```{r}
 set.seed(13)
-serial_interval <- function(n){rlnorm(n, meanlog = 0.58, sdlog = 1.58)}
-chains_df <- chain_sim(n = 5, offspring = 'pois', lambda = 0.5, stat = 'length', 
-                       infinite = 100, serial = serial_interval, tree = TRUE)
+serial_interval <- function(n) {
+  rlnorm(n, meanlog = 0.58, sdlog = 1.58)
+}
+chains_df <- chain_sim(
+  n = 5, offspring = "pois", lambda = 0.5, stat = "length",
+  infinite = 100, serial = serial_interval, tree = TRUE
+)
 head(chains_df)
 ```
 

From 3895ef9da9f77a4c4b7514a9d370f34d4b56064e Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:59:37 +0000
Subject: [PATCH 184/828] lintr: removed newlines and trailing white space

---
 vignettes/projecting_incidence.Rmd | 11 ++++-------
 1 file changed, 4 insertions(+), 7 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index e121f982..1e7c2f85 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -18,9 +18,9 @@ editor_options:
 ---
 
 ```{r setup, include=FALSE}
-knitr::opts_chunk$set(echo = TRUE, 
-                      message = FALSE, 
-                      warning = FALSE, 
+knitr::opts_chunk$set(echo = TRUE,
+                      message = FALSE,
+                      warning = FALSE,
                       collapse = TRUE,
                       comment = "#>"
                       )
@@ -67,6 +67,7 @@ head(covid19_sa)
 ## Setting up the inputs  
 
 ### Onset times 
+
 `chain_sim()` requires a vector of onset times, `t0`, for each 
 chain/individual/simulation. 
 
@@ -154,7 +155,6 @@ cases produced by a single individual in an entirely susceptible population.
 The parameter $k$ represents superspreading, that is, the degree of 
 heterogeneity in transmission by single individuals.
 
-
 ### Simulation controls
 
 `chain_sim()` also requires the end time for the simulations. For this 
@@ -186,7 +186,6 @@ assume a maximum chain size of $1000$.
 chain_threshold <- 1000
 ```
 
-
 ## Modelling assumptions
 
 `chain_sim()` makes the following simplifying assumptions:
@@ -272,7 +271,6 @@ incidence_ts_by_date <- incidence_ts %>%
 head(incidence_ts_by_date)
 ```
 
-
 Now we will aggregate the simulations by day and evaluate the median 
 daily cases across all simulations.
 ```{r}
@@ -297,7 +295,6 @@ median_daily_cases <- median_daily_cases %>%
 head(median_daily_cases)
 ```
 
-
 ## Visualization
 
 We will now plot the individual simulation results alongside the median

From 012fb8b02909b383eea679d58a9f6e7cce3cceed Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 19:59:58 +0000
Subject: [PATCH 185/828] lintr: replaced single quotes with double quotes

---
 vignettes/projecting_incidence.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 1e7c2f85..d8f59bd8 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -56,7 +56,7 @@ Included in `bpmodels` is a cleaned time series of the first 15 days of
 the COVID-19 outbreak in South Africa. This can be loaded into 
 memory as follows: 
 ```{r}
-data('covid19_sa', package = 'bpmodels')
+data("covid19_sa", package = "bpmodels")
 ```
 
 Let us examine the first 6 entries of the dataset.

From 93f13781a7c8d492e17eb0d8e79bcd466779e92e Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 20:00:23 +0000
Subject: [PATCH 186/828] lintr: styled the plotting code

---
 vignettes/projecting_incidence.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index d8f59bd8..145babbd 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -361,7 +361,7 @@ ggplot(data = incidence_ts_by_date) +
     ),
     labels = seq(
       0,
-      max(incidence_ts_by_date$cases) + 200, 
+      max(incidence_ts_by_date$cases) + 200,
       250
     )
   ) +

From 3006da55c060f725db6c98baf18d264925c1af05 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 20:01:23 +0000
Subject: [PATCH 187/828] rebuilt the readme

---
 README.md | 54 +++++++++++++++++++++++++++++-------------------------
 1 file changed, 29 insertions(+), 25 deletions(-)

diff --git a/README.md b/README.md
index dc0d058e..3a9b601b 100644
--- a/README.md
+++ b/README.md
@@ -32,13 +32,13 @@ The latest development version of the *bpmodels* package can be
 installed via
 
 ``` r
-devtools::install_github('epiverse-trace/bpmodels')
+devtools::install_github(file.path("epiverse-trace", "bpmodels"))
 ```
 
 To load the package, use
 
 ``` r
-library('bpmodels')
+library("bpmodels")
 ```
 
 # Quick start
@@ -64,7 +64,7 @@ To do this, we run
 set.seed(13)
 chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
 chain_ll(x = chain_sizes, offspring = "pois", stat = "size", lambda = 0.5)
-#> [1] -8.607
+#> [1] -8.607196
 ```
 
 The first argument of `chain_ll()` is the chain size (or length, in
@@ -103,10 +103,11 @@ For example, if the probability of observing each case is
 
 ``` r
 chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
-ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5, 
+ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5,
                nsim_obs = 10)
 ll
-#>  [1] -26.54 -23.26 -24.33 -20.80 -30.76 -26.47 -23.79 -19.14 -32.09 -22.23
+#>  [1] -26.54167 -23.26117 -24.33027 -20.80310 -30.76152 -26.46751 -23.79326
+#>  [8] -19.14490 -32.08875 -22.23401
 ```
 
 This returns `10` likelihood values (because `nsim_obs = 10`), which can
@@ -139,7 +140,7 @@ probability `prob = 0.5`, we run
 
 ``` r
 chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
-chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5, 
+chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5,
          nsim_offspring = 100)
 #> [1] -Inf
 ```
@@ -151,10 +152,10 @@ function follows the same syntax as `chain_ll()`.
 
 Below, we are simulating $5$ chains, assuming the offspring are
 generated using a Poisson distribution with mean, `lambda = 0.5`. By
-default, `chain_sim()` returns a vector of chain sizes/lengths. However,
-to override that so that a tree of infectees and infectors is returned,
-we need to specify a function for the serial interval and set
-`tree = TRUE`.
+default, `chain_sim()` returns a vector of chain sizes/lengths. If we
+instead want to return a tree of infectees and infectors, we need to
+specify a function for the serial interval and set `tree = TRUE` (see
+next section).
 
 ``` r
 chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5)
@@ -168,17 +169,21 @@ interval generation function and set `tree = TRUE` as follows:
 
 ``` r
 set.seed(13)
-serial_interval <- function(n){rlnorm(n, meanlog = 0.58, sdlog = 1.58)}
-chains_df <- chain_sim(n = 5, offspring = 'pois', lambda = 0.5, stat = 'length', 
-                       infinite = 100, serial = serial_interval, tree = TRUE)
+serial_interval <- function(n) {
+  rlnorm(n, meanlog = 0.58, sdlog = 1.58)
+}
+chains_df <- chain_sim(
+  n = 5, offspring = "pois", lambda = 0.5, stat = "length",
+  infinite = 100, serial = serial_interval, tree = TRUE
+)
 head(chains_df)
-#>   n id ancestor generation    time
-#> 1 1  1       NA          1 0.00000
-#> 2 2  1       NA          1 0.00000
-#> 3 3  1       NA          1 0.00000
-#> 4 4  1       NA          1 0.00000
-#> 5 5  1       NA          1 0.00000
-#> 6 1  2        1          2 0.04772
+#>   n id ancestor generation       time
+#> 1 1  1       NA          1 0.00000000
+#> 2 2  1       NA          1 0.00000000
+#> 3 3  1       NA          1 0.00000000
+#> 4 4  1       NA          1 0.00000000
+#> 5 5  1       NA          1 0.00000000
+#> 6 1  2        1          2 0.04771887
 ```
 
 ## Package vignettes
@@ -213,17 +218,16 @@ citation("bpmodels")
 #> 
 #> To cite package 'bpmodels' in publications use:
 #> 
-#>   Funk S, Finger F, Azam JM (2023). _bpmodels: Analysing chain
+#>   Funk S, Finger F, Azam J (????). _bpmodels: Analysing chain
 #>   statistics using branching process models_. R package version 0.1.0,
-#>   <https://github.com/sbfnk/bpmodels>.
+#>   <https://github.com/epiverse-trace/bpmodels>.
 #> 
 #> A BibTeX entry for LaTeX users is
 #> 
 #>   @Manual{,
 #>     title = {bpmodels: Analysing chain statistics using branching process models},
-#>     author = {Sebastian Funk and Flavio Finger and James Mba Azam},
-#>     year = {2023},
+#>     author = {Sebastian Funk and Flavio Finger and James M. Azam},
 #>     note = {R package version 0.1.0},
-#>     url = {https://github.com/sbfnk/bpmodels},
+#>     url = {https://github.com/epiverse-trace/bpmodels},
 #>   }
 ```

From ff485db4f16235b4b3546efba78580e3a18c65ac Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 20:12:32 +0000
Subject: [PATCH 188/828] lintr: fixed contributor list

---
 DESCRIPTION | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 0d11a90f..3c21556f 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -5,7 +5,7 @@ Authors@R: c(
     person("Sebastian", "Funk", , "sebastian.funk@lshtm.ac.uk", role = c("aut", "cre")),
     person("Zhian N.", "Kamvar", , "zkamvar@gmail.com", role = "ctb"),
     person("Flavio", "Finger", , "flavio.finger@epicentre.msf.org", role = "aut"),
-    person("James M.", "Azam", , "james.azam@lshtm.ac.uk", role = c("aut"))
+    person("James M.", "Azam", , "james.azam@lshtm.ac.uk", role = "aut")
   )
 Description: Provides methods to analyse and simulate the size and length
     of branching processes with an arbitrary offspring distribution. These

From 9bb8d42e261e7a041a3866a14ef2312bcafccd2e Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 20:12:57 +0000
Subject: [PATCH 189/828] Failing checks: added epiparameter remotes

---
 DESCRIPTION | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/DESCRIPTION b/DESCRIPTION
index 3c21556f..0e94d89e 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -30,6 +30,8 @@ Suggests:
     truncdist
 VignetteBuilder: 
     knitr
+Remotes: 
+    github::epiverse-trace/epiparameter
 Encoding: UTF-8
 LazyData: true
 Roxygen: list(markdown = TRUE)

From d20a386f0b88dc05ab386b989181541e1ffccfb7 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 1 Mar 2023 20:20:03 +0000
Subject: [PATCH 190/828] Failing checks: Loaded usethis

---
 data-raw/covid19_sa.R | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/data-raw/covid19_sa.R b/data-raw/covid19_sa.R
index 3991eff3..b64b45b0 100644
--- a/data-raw/covid19_sa.R
+++ b/data-raw/covid19_sa.R
@@ -2,6 +2,7 @@
 
 library(dplyr)
 library(lubridate)
+library(usethis)
 
 # Link to data
 data_url <- "https://raw.githubusercontent.com/dsfsi/covid19za/master/data/covid19za_timeline_confirmed.csv" # nolint: line_length_linter.
@@ -18,4 +19,4 @@ covid19_sa <- covid19_sa %>%
   summarise(cases = n()) %>%
   ungroup()
 
-usethis::use_data(covid19_sa, overwrite = TRUE)
+use_data(covid19_sa, overwrite = TRUE)

From 00f2673c7c99a30df51cb352caf793510ab9d498 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Thu, 2 Mar 2023 19:48:34 +0000
Subject: [PATCH 191/828] added usethis ad dependency

---
 DESCRIPTION | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 0e94d89e..4872bee5 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -27,7 +27,8 @@ Suggests:
     lubridate,
     rmarkdown,
     testthat,
-    truncdist
+    truncdist,
+    usethis
 VignetteBuilder: 
     knitr
 Remotes: 

From 5ccf3bae3a90465edb2e05506120259c64f33949 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Thu, 2 Mar 2023 20:04:57 +0000
Subject: [PATCH 192/828] removed trailing whitespace

---
 tests/testthat/tests-sim.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index ec212d93..bc603146 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -91,7 +91,7 @@ test_that("Chains can be simulated", {
         serial = function(x) 3,
         pop = 1
       )
-    ), 
+    ),
     1
   )
 

From df60f205d758dc93990befe55ebbe74c53869bb3 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Thu, 2 Mar 2023 21:34:37 +0000
Subject: [PATCH 193/828] changed expect_equal to expect_identical and made
 expected an integer

---
 tests/testthat/tests-sim.r | 12 ++++++------
 1 file changed, 6 insertions(+), 6 deletions(-)

diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index bc603146..e45677cd 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -83,7 +83,7 @@ test_that("Chains can be simulated", {
       "data.frame"
   )
 
-  expect_equal(
+  expect_identical(
     nrow(
       chain_sim_susc(
         "pois",
@@ -92,10 +92,10 @@ test_that("Chains can be simulated", {
         pop = 1
       )
     ),
-    1
+    1L
   )
 
-  expect_equal(
+  expect_identical(
     nrow(
       chain_sim_susc(
         "pois",
@@ -105,10 +105,10 @@ test_that("Chains can be simulated", {
         pop = 999
       )
     ),
-    1
+    1L
   )
 
-  expect_equal(
+  expect_identical(
     nrow(
       chain_sim_susc(
         "pois",
@@ -118,7 +118,7 @@ test_that("Chains can be simulated", {
         initial_immune = 998
       )
     ),
-    1
+    1L
   )
 })
 

From 90bec53caa61867fba51514ba2caa6f62001cd95 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Mon, 6 Mar 2023 14:24:25 +0000
Subject: [PATCH 194/828] removed deprecated testthat contexts

---
 tests/testthat/tests-borel.r | 2 --
 tests/testthat/tests-ll.r    | 2 --
 tests/testthat/tests-sim.r   | 6 ------
 3 files changed, 10 deletions(-)

diff --git a/tests/testthat/tests-borel.r b/tests/testthat/tests-borel.r
index 673b4f12..e17512e1 100644
--- a/tests/testthat/tests-borel.r
+++ b/tests/testthat/tests-borel.r
@@ -1,5 +1,3 @@
-context("The Borel distribution is implemented")
-
 test_that("We can calculate probabilities and sample", {
   expect_gt(dborel(1, 0.5), 0)
   expect_identical(dborel(1, 0.5, log = TRUE), -0.5)
diff --git a/tests/testthat/tests-ll.r b/tests/testthat/tests-ll.r
index ca4ef9be..73085038 100644
--- a/tests/testthat/tests-ll.r
+++ b/tests/testthat/tests-ll.r
@@ -1,5 +1,3 @@
-context("Calculating the likelihood from a branching process model")
-
 chains <- c(1, 1, 4, 7)
 
 test_that("Likelihoods can be calculated", {
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index e45677cd..6582bc39 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -1,5 +1,3 @@
-context("Simulating from a branching process model")
-
 test_that("Chains can be simulated", {
   expect_length(chain_sim(n = 2, "pois", lambda = 0.5), 2)
   expect_length(chain_sim(n = 10, "pois", "length", lambda = 0.9), 10)
@@ -57,10 +55,6 @@ test_that("Errors are thrown", {
   )
 })
 
-context("Simulating from a branching process model
-    accounting for depletion of susceptibles")
-
-
 test_that("Chains can be simulated", {
   expect_s3_class(
       chain_sim_susc(

From 251ddd130573a2a2c7c5503909059c321e79dae5 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Mon, 6 Mar 2023 15:24:01 +0000
Subject: [PATCH 195/828] added GHA for Rmd readme

---
 .github/workflows/render_readme.yml | 47 +++++++++++++++++++++++++++++
 1 file changed, 47 insertions(+)
 create mode 100644 .github/workflows/render_readme.yml

diff --git a/.github/workflows/render_readme.yml b/.github/workflows/render_readme.yml
new file mode 100644
index 00000000..d3817f15
--- /dev/null
+++ b/.github/workflows/render_readme.yml
@@ -0,0 +1,47 @@
+# Name of the workflow
+name: render-readme
+
+# Controls when the action will run. Triggers include:
+#
+# - button trigger from github action page
+# - on changes to readme.Rmd
+
+on:
+  workflow_dispatch:
+  push:
+    paths:
+      - 'README.Rmd'
+
+# A workflow run is made up of one or more jobs that can run sequentially or in parallel
+jobs:
+  render-readme:
+    runs-on: macos-latest
+    env:
+      GITHUB_PAT: ${{ secrets.GITHUB_TOKEN }}
+    steps:
+      - name: Checkout repos
+        uses: actions/checkout@v2
+
+      - name: Setup R
+        uses: r-lib/actions/setup-r@v2
+
+      - name: Setup pandoc
+        uses: r-lib/actions/setup-pandoc@v2
+
+      - name: Install dependencies
+        uses: r-lib/actions/setup-r-dependencies@v2
+        with:
+          extra-packages: any::rmarkdown, local::.
+
+      - name: Compile the readme
+        run: |
+          rmarkdown::render("README.Rmd")
+        shell: Rscript {0}
+
+      - name: Commit files
+        run: |
+          git config --local user.email "action@github.com"
+          git config --local user.name "GitHub Action"
+          git add README.md man/figures/
+          git diff-index --quiet HEAD || git commit -m "Automatic readme update"
+          git push origin || echo "No changes to push"
\ No newline at end of file

From 0d7150509ac584362b0ed2718d10d0137455574d Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Mon, 6 Mar 2023 16:18:42 +0000
Subject: [PATCH 196/828] linked covidza data to commit

---
 data-raw/covid19_sa.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/data-raw/covid19_sa.R b/data-raw/covid19_sa.R
index b64b45b0..187654ef 100644
--- a/data-raw/covid19_sa.R
+++ b/data-raw/covid19_sa.R
@@ -5,7 +5,7 @@ library(lubridate)
 library(usethis)
 
 # Link to data
-data_url <- "https://raw.githubusercontent.com/dsfsi/covid19za/master/data/covid19za_timeline_confirmed.csv" # nolint: line_length_linter.
+data_url <- "https://raw.githubusercontent.com/dsfsi/covid19za/1943f5e0d80fa296d9171ced473eebd3f2cde109/data/covid19za_timeline_confirmed.csv" # nolint: line_length_linter.
 
 # Read the data in using the url
 covid19_sa <- read.csv(data_url)

From 3988a6c3c04e2b6ba37dc1a62554560df8c5b2a3 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Mon, 6 Mar 2023 16:56:06 +0000
Subject: [PATCH 197/828] added a contribution guide and templates for
 submitting issues and PR

---
 .github/CONTRIBUTING.md                       | 51 +++++++++++++++++++
 .github/ISSUE_TEMPLATE/bug_report.md          | 26 ++++++++++
 .github/ISSUE_TEMPLATE/feature_request.md     | 17 +++++++
 .../pull_request_template.md                  | 25 +++++++++
 4 files changed, 119 insertions(+)
 create mode 100644 .github/CONTRIBUTING.md
 create mode 100644 .github/ISSUE_TEMPLATE/bug_report.md
 create mode 100644 .github/ISSUE_TEMPLATE/feature_request.md
 create mode 100644 .github/PULL_REQUEST_TEMPLATE/pull_request_template.md

diff --git a/.github/CONTRIBUTING.md b/.github/CONTRIBUTING.md
new file mode 100644
index 00000000..f1a643c5
--- /dev/null
+++ b/.github/CONTRIBUTING.md
@@ -0,0 +1,51 @@
+# Contributing to bpmodels
+
+This outlines how to propose a change to bpmodels.
+
+## Making changes
+
+If you want to make a change, it's a good idea to first file an issue and make 
+sure someone from the team agrees that it’s needed.
+If you’ve found a bug, please file an issue that illustrates the bug with 
+a minimal [reprex](https://www.tidyverse.org/help/#reprex) (this will also 
+help you write a unit test, if needed). See [bug report template](../.github/ISSUE_TEMPLATE/bug_report.md). If you have a 
+feature request see [feature request](../.github/ISSUE_TEMPLATE/feature_request.md).
+
+### Pull request process
+
+See [pull request template](../.github/PULL_REQUEST_TEMPLATE/pull_request_template.md)
+
+*   Fork the package and clone onto your computer. If you haven't done 
+this before, we recommend using `usethis::create_from_github("epiverse-trace/bpmodels", fork = TRUE)`.
+
+*   Install all development dependencies with `devtools::install_dev_deps()`, 
+and then make sure the package passes R CMD check by running `devtools::check()`. 
+    If R CMD check doesn't pass cleanly, it's a good idea to ask for 
+    help before continuing. 
+*   Create a Git branch for your pull request (PR). We recommend using `usethis::pr_init("brief-description-of-change")`.
+
+*   Make your changes, commit to git, and then create a PR by running `usethis::pr_push()`, and following the prompts in your browser.
+    The title of your PR should briefly describe the change.
+    The body of your PR should contain `Fixes #issue-number`.
+
+*  For user-facing changes, add a bullet to the top of `NEWS.md` (i.e. just 
+below the first header). Follow the style described in <https://style.tidyverse.org/news.html>.
+
+### Code style
+
+*   New code should follow the tidyverse [style guide](https://style.tidyverse.org). 
+    You can use the [styler](https://CRAN.R-project.org/package=styler) 
+    package to apply these styles, but please don't restyle code that has 
+    nothing to do with your PR.  
+
+*  We use [roxygen2](https://cran.r-project.org/package=roxygen2), with [Markdown syntax](https://cran.r-project.org/web/packages/roxygen2/vignettes/rd-formatting.html), for documentation.  
+
+*  We use [testthat](https://cran.r-project.org/package=testthat) for 
+unit tests. 
+   Contributions with test cases included are easier to accept.  
+
+## Code of Conduct
+
+Please note that `bpmodels` is released with a
+[Contributor Code of Conduct](https://github.com/epiverse-trace/.github/blob/main/CODE_OF_CONDUCT.md). By contributing to this
+project you agree to abide by its terms.
diff --git a/.github/ISSUE_TEMPLATE/bug_report.md b/.github/ISSUE_TEMPLATE/bug_report.md
new file mode 100644
index 00000000..31bbc095
--- /dev/null
+++ b/.github/ISSUE_TEMPLATE/bug_report.md
@@ -0,0 +1,26 @@
+---
+name: Bug report
+about: Create a report to help us improve
+title: ''
+labels: ''
+assignees: ''
+
+---
+  
+Please place an "x" in all the boxes that apply
+---------------------------------------------
+  
+- [ ] I have the most recent version of bpmodels and R
+- [ ] I have found a bug
+- [ ] I have a [reproducible example](http://reprex.tidyverse.org/articles/reprex-dos-and-donts.html)
+- [ ] I want to request a new feature
+
+--------
+  
+Please include a brief description of the problem with a code example:
+  
+```r
+# insert reprex here
+```
+
+---------
diff --git a/.github/ISSUE_TEMPLATE/feature_request.md b/.github/ISSUE_TEMPLATE/feature_request.md
new file mode 100644
index 00000000..3fe3d3f8
--- /dev/null
+++ b/.github/ISSUE_TEMPLATE/feature_request.md
@@ -0,0 +1,17 @@
+---
+name: Feature request
+about: Suggest an idea for this project
+title: ''
+labels: ''
+assignees: ''
+
+---
+  
+**Is your feature request related to a problem? Please describe.**
+A clear and concise description of what the problem is. E.g., I'm always frustrated when [...]
+
+**Describe the solution you'd like**
+A clear and concise description of what you want to happen.
+
+**Additional context**
+Add any other context or screenshots about the feature request here.
diff --git a/.github/PULL_REQUEST_TEMPLATE/pull_request_template.md b/.github/PULL_REQUEST_TEMPLATE/pull_request_template.md
new file mode 100644
index 00000000..706632aa
--- /dev/null
+++ b/.github/PULL_REQUEST_TEMPLATE/pull_request_template.md
@@ -0,0 +1,25 @@
+* **Please check if the PR fulfils these requirements**
+  
+- [ ] I have read the CONTRIBUTING guidelines 
+- [ ] The commit message follows our guidelines
+- [ ] Tests for the changes have been added (for bug fixes / features)
+- [ ] Docs have been added / updated (for bug fixes / features)
+
+
+* **What kind of change does this PR introduce?** (Bug fix, feature, docs update, ...)
+
+
+
+* **What is the current behaviour?** (You can also link to an open issue here)
+
+
+
+* **What is the new behaviour (if this is a feature change)?**
+  
+  
+  
+* **Does this PR introduce a breaking change?** (What changes might users need to make in their application due to this PR?)
+
+
+
+* **Other information**:

From 3f0b545f69bd57052e7d1202d5a2c2f6f4d33652 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Mon, 6 Mar 2023 17:09:45 +0000
Subject: [PATCH 198/828] matched package version in DESCRIPTION with news.md

---
 DESCRIPTION | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 4872bee5..2aefa977 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -1,6 +1,6 @@
 Package: bpmodels
 Title: Analysing chain statistics using branching process models
-Version: 0.1.0
+Version: 0.1.9999
 Authors@R: c(
     person("Sebastian", "Funk", , "sebastian.funk@lshtm.ac.uk", role = c("aut", "cre")),
     person("Zhian N.", "Kamvar", , "zkamvar@gmail.com", role = "ctb"),

From bfaa7b8ac1f80cf7e593dc2dd789cef052a206d8 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Mon, 6 Mar 2023 17:10:47 +0000
Subject: [PATCH 199/828] updated README with link to contribution guide

---
 README.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index 3254548c..258d5fa0 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -191,7 +191,7 @@ To report a bug please open an [issue](https://github.com/epiverse-trace/bpmodel
 ## Contribute
 
 We welcome contributions to enhance the package's functionalities. If you 
-wish to do so, please follow the [package contributing guide](https://github.com/epiverse-trace/.github/blob/main/CONTRIBUTING.md).
+wish to do so, please follow the [package contributing guide](https://github.com/epiverse-trace/bpmodels/blob/main/.github/CONTRIBUTING.md).
 
 ## Code of conduct
 

From c8590589a7bf2e0f14294e8bbd29d111f0638e42 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Mon, 6 Mar 2023 17:23:10 +0000
Subject: [PATCH 200/828] removed non-existent path from workflow

---
 .github/workflows/render_readme.yml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/.github/workflows/render_readme.yml b/.github/workflows/render_readme.yml
index d3817f15..6df5f98f 100644
--- a/.github/workflows/render_readme.yml
+++ b/.github/workflows/render_readme.yml
@@ -42,6 +42,6 @@ jobs:
         run: |
           git config --local user.email "action@github.com"
           git config --local user.name "GitHub Action"
-          git add README.md man/figures/
+          git add README.md 
           git diff-index --quiet HEAD || git commit -m "Automatic readme update"
           git push origin || echo "No changes to push"
\ No newline at end of file

From 798cbdc4ab2eeb2ee0328d6c4159d3cfd2d4aa43 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Tue, 7 Mar 2023 15:20:51 +0000
Subject: [PATCH 201/828] changed maintainer to James

---
 DESCRIPTION | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 2aefa977..45228b60 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -2,10 +2,10 @@ Package: bpmodels
 Title: Analysing chain statistics using branching process models
 Version: 0.1.9999
 Authors@R: c(
-    person("Sebastian", "Funk", , "sebastian.funk@lshtm.ac.uk", role = c("aut", "cre")),
+    person("Sebastian", "Funk", , "sebastian.funk@lshtm.ac.uk", role = "aut"),
     person("Zhian N.", "Kamvar", , "zkamvar@gmail.com", role = "ctb"),
     person("Flavio", "Finger", , "flavio.finger@epicentre.msf.org", role = "aut"),
-    person("James M.", "Azam", , "james.azam@lshtm.ac.uk", role = "aut")
+    person("James M.", "Azam", , "james.azam@lshtm.ac.uk", role = c("aut", "cre"))
   )
 Description: Provides methods to analyse and simulate the size and length
     of branching processes with an arbitrary offspring distribution. These
@@ -13,7 +13,7 @@ Description: Provides methods to analyse and simulate the size and length
     or length of infectious disease outbreaks, as discussed in Farrington
     et al. (2003) <doi:10.1093/biostatistics/4.2.279>.
 License: MIT + file LICENSE
-URL: https://github.com/epiverse-trace/bpmodels
+URL: https://github.com/epiverse-trace/bpmodels, https://epiverse-trace.github.io/bpmodels/
 BugReports: https://github.com/epiverse-trace/bpmodels/issues
 Depends: 
     R (>= 3.0.0)

From f85919641caa803e548c851f02322c0435794b39 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Tue, 7 Mar 2023 15:28:21 +0000
Subject: [PATCH 202/828] removed some badges and fixed license badge

---
 README.Rmd |  6 +-----
 README.md  | 36 +++++++++++++++---------------------
 2 files changed, 16 insertions(+), 26 deletions(-)

diff --git a/README.Rmd b/README.Rmd
index 258d5fa0..4827753b 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -16,15 +16,11 @@ knitr::opts_chunk$set(
 # _bpmodels_: Methods for analysing the size and length of chains from branching process models
 
 <!-- badges: start -->
-![CRAN/METACRAN](https://img.shields.io/cran/v/bpmodels)
 ![GitHub R package version](https://img.shields.io/github/r-package/v/epiverse-trace/bpmodels)
-![GitHub all releases](https://img.shields.io/github/downloads/epiverse-trace/bpmodels/total?style=flat)
-![GitHub issues](https://img.shields.io/github/issues/epiverse-trace/bpmodels)
 [![R-CMD-check](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml)
 [![codecov](https://codecov.io/github/epiverse-trace/bpmodels/branch/main/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/bpmodels) 
 ![GitHub contributors](https://img.shields.io/github/contributors/epiverse-trace/bpmodels)
-![GitHub commit activity](https://img.shields.io/github/commit-activity/m/epiverse-trace/bpmodels)
-![GitHub](https://img.shields.io/github/license/epiverse-trace/bpmodels)
+[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
 <!-- badges: end -->
 
 ```{r setup, include=FALSE}
diff --git a/README.md b/README.md
index 3a9b601b..25efeb8b 100644
--- a/README.md
+++ b/README.md
@@ -3,20 +3,14 @@
 
 <!-- badges: start -->
 
-![CRAN/METACRAN](https://img.shields.io/cran/v/bpmodels) ![GitHub R
-package
+![GitHub R package
 version](https://img.shields.io/github/r-package/v/epiverse-trace/bpmodels)
-![GitHub all
-releases](https://img.shields.io/github/downloads/epiverse-trace/bpmodels/total?style=flat)
-![GitHub
-issues](https://img.shields.io/github/issues/epiverse-trace/bpmodels)
 [![R-CMD-check](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml)
 [![codecov](https://codecov.io/github/epiverse-trace/bpmodels/branch/main/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/bpmodels)
 ![GitHub
 contributors](https://img.shields.io/github/contributors/epiverse-trace/bpmodels)
-![GitHub commit
-activity](https://img.shields.io/github/commit-activity/m/epiverse-trace/bpmodels)
-![GitHub](https://img.shields.io/github/license/epiverse-trace/bpmodels)
+[![License:
+MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
 <!-- badges: end -->
 
 *bpmodels* is an R package to simulate and analyse the size and length
@@ -64,7 +58,7 @@ To do this, we run
 set.seed(13)
 chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
 chain_ll(x = chain_sizes, offspring = "pois", stat = "size", lambda = 0.5)
-#> [1] -8.607196
+#> [1] -8.607
 ```
 
 The first argument of `chain_ll()` is the chain size (or length, in
@@ -106,8 +100,7 @@ chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
 ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5,
                nsim_obs = 10)
 ll
-#>  [1] -26.54167 -23.26117 -24.33027 -20.80310 -30.76152 -26.46751 -23.79326
-#>  [8] -19.14490 -32.08875 -22.23401
+#>  [1] -26.54 -23.26 -24.33 -20.80 -30.76 -26.47 -23.79 -19.14 -32.09 -22.23
 ```
 
 This returns `10` likelihood values (because `nsim_obs = 10`), which can
@@ -177,13 +170,13 @@ chains_df <- chain_sim(
   infinite = 100, serial = serial_interval, tree = TRUE
 )
 head(chains_df)
-#>   n id ancestor generation       time
-#> 1 1  1       NA          1 0.00000000
-#> 2 2  1       NA          1 0.00000000
-#> 3 3  1       NA          1 0.00000000
-#> 4 4  1       NA          1 0.00000000
-#> 5 5  1       NA          1 0.00000000
-#> 6 1  2        1          2 0.04771887
+#>   n id ancestor generation    time
+#> 1 1  1       NA          1 0.00000
+#> 2 2  1       NA          1 0.00000
+#> 3 3  1       NA          1 0.00000
+#> 4 4  1       NA          1 0.00000
+#> 5 5  1       NA          1 0.00000
+#> 6 1  2        1          2 0.04772
 ```
 
 ## Package vignettes
@@ -202,7 +195,7 @@ To report a bug please open an
 
 We welcome contributions to enhance the package’s functionalities. If
 you wish to do so, please follow the [package contributing
-guide](https://github.com/epiverse-trace/.github/blob/main/CONTRIBUTING.md).
+guide](https://github.com/epiverse-trace/bpmodels/blob/main/.github/CONTRIBUTING.md).
 
 ## Code of conduct
 
@@ -218,7 +211,7 @@ citation("bpmodels")
 #> 
 #> To cite package 'bpmodels' in publications use:
 #> 
-#>   Funk S, Finger F, Azam J (????). _bpmodels: Analysing chain
+#>   Funk S, Finger F, Azam J (2023). _bpmodels: Analysing chain
 #>   statistics using branching process models_. R package version 0.1.0,
 #>   <https://github.com/epiverse-trace/bpmodels>.
 #> 
@@ -227,6 +220,7 @@ citation("bpmodels")
 #>   @Manual{,
 #>     title = {bpmodels: Analysing chain statistics using branching process models},
 #>     author = {Sebastian Funk and Flavio Finger and James M. Azam},
+#>     year = {2023},
 #>     note = {R package version 0.1.0},
 #>     url = {https://github.com/epiverse-trace/bpmodels},
 #>   }

From b01536b8513e128497d4d397bc15ea551feec843 Mon Sep 17 00:00:00 2001
From: GitHub Action <action@github.com>
Date: Tue, 7 Mar 2023 15:36:03 +0000
Subject: [PATCH 203/828] Automatic readme update

---
 README.md | 28 +++++++++++++++-------------
 1 file changed, 15 insertions(+), 13 deletions(-)

diff --git a/README.md b/README.md
index 25efeb8b..46136865 100644
--- a/README.md
+++ b/README.md
@@ -58,7 +58,7 @@ To do this, we run
 set.seed(13)
 chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
 chain_ll(x = chain_sizes, offspring = "pois", stat = "size", lambda = 0.5)
-#> [1] -8.607
+#> [1] -8.607196
 ```
 
 The first argument of `chain_ll()` is the chain size (or length, in
@@ -100,7 +100,8 @@ chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
 ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5,
                nsim_obs = 10)
 ll
-#>  [1] -26.54 -23.26 -24.33 -20.80 -30.76 -26.47 -23.79 -19.14 -32.09 -22.23
+#>  [1] -26.54167 -23.26117 -24.33027 -20.80310 -30.76152 -26.46751 -23.79326
+#>  [8] -19.14490 -32.08875 -22.23401
 ```
 
 This returns `10` likelihood values (because `nsim_obs = 10`), which can
@@ -170,13 +171,13 @@ chains_df <- chain_sim(
   infinite = 100, serial = serial_interval, tree = TRUE
 )
 head(chains_df)
-#>   n id ancestor generation    time
-#> 1 1  1       NA          1 0.00000
-#> 2 2  1       NA          1 0.00000
-#> 3 3  1       NA          1 0.00000
-#> 4 4  1       NA          1 0.00000
-#> 5 5  1       NA          1 0.00000
-#> 6 1  2        1          2 0.04772
+#>   n id ancestor generation       time
+#> 1 1  1       NA          1 0.00000000
+#> 2 2  1       NA          1 0.00000000
+#> 3 3  1       NA          1 0.00000000
+#> 4 4  1       NA          1 0.00000000
+#> 5 5  1       NA          1 0.00000000
+#> 6 1  2        1          2 0.04771887
 ```
 
 ## Package vignettes
@@ -212,8 +213,9 @@ citation("bpmodels")
 #> To cite package 'bpmodels' in publications use:
 #> 
 #>   Funk S, Finger F, Azam J (2023). _bpmodels: Analysing chain
-#>   statistics using branching process models_. R package version 0.1.0,
-#>   <https://github.com/epiverse-trace/bpmodels>.
+#>   statistics using branching process models_.
+#>   https://github.com/epiverse-trace/bpmodels,
+#>   https://epiverse-trace.github.io/bpmodels/.
 #> 
 #> A BibTeX entry for LaTeX users is
 #> 
@@ -221,7 +223,7 @@ citation("bpmodels")
 #>     title = {bpmodels: Analysing chain statistics using branching process models},
 #>     author = {Sebastian Funk and Flavio Finger and James M. Azam},
 #>     year = {2023},
-#>     note = {R package version 0.1.0},
-#>     url = {https://github.com/epiverse-trace/bpmodels},
+#>     note = {https://github.com/epiverse-trace/bpmodels,
+#> https://epiverse-trace.github.io/bpmodels/},
 #>   }
 ```

From 53e35c71d6014f2a5873d5febdfc0e23999b0a59 Mon Sep 17 00:00:00 2001
From: jamesmbaazam <james.azam@lshtm.ac.uk>
Date: Wed, 8 Mar 2023 12:54:33 +0000
Subject: [PATCH 204/828] formatted the contributor details and added ORCHID
 IDs

---
 DESCRIPTION             | 40 ++++++++++++++++++++++++++++++++--------
 man/bpmodels-package.Rd |  9 +++++----
 2 files changed, 37 insertions(+), 12 deletions(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 45228b60..2398e41d 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -2,10 +2,33 @@ Package: bpmodels
 Title: Analysing chain statistics using branching process models
 Version: 0.1.9999
 Authors@R: c(
-    person("Sebastian", "Funk", , "sebastian.funk@lshtm.ac.uk", role = "aut"),
-    person("Zhian N.", "Kamvar", , "zkamvar@gmail.com", role = "ctb"),
-    person("Flavio", "Finger", , "flavio.finger@epicentre.msf.org", role = "aut"),
-    person("James M.", "Azam", , "james.azam@lshtm.ac.uk", role = c("aut", "cre"))
+    person(
+    given = "Sebastian",
+    family = "Funk",
+    email = "sebastian.funk@lshtm.ac.uk",
+    role = "aut",
+    comment = c(ORCHID = "https://orcid.org/0000-0002-2842-3406")
+    ),
+    person(
+    given = "Zhian N.",
+    family = "Kamvar",
+    email = "zkamvar@gmail.com",
+    role = "ctb",
+    comment = c(ORCHID = "https://orcid.org/0000-0003-1458-7108")
+    ),
+    person(
+    given = "Flavio",
+    family = "Finger",
+    email = "flavio.finger@epicentre.msf.org",
+    role = "aut",
+    comment = c(ORCHID = "https://orcid.org/0000-0002-8613-5170")
+    ),
+    person(
+    given = "James M.",
+    family = "Azam",
+    email = "james.azam@lshtm.ac.uk",
+    role = c("aut", "cre"),
+    comment = c(ORCHID = "https://orcid.org/0000-0001-5782-7330"))
   )
 Description: Provides methods to analyse and simulate the size and length
     of branching processes with an arbitrary offspring distribution. These
@@ -15,9 +38,9 @@ Description: Provides methods to analyse and simulate the size and length
 License: MIT + file LICENSE
 URL: https://github.com/epiverse-trace/bpmodels, https://epiverse-trace.github.io/bpmodels/
 BugReports: https://github.com/epiverse-trace/bpmodels/issues
-Depends: 
+Depends:
     R (>= 3.0.0)
-Suggests: 
+Suggests:
     bookdown,
     covr,
     dplyr,
@@ -29,9 +52,10 @@ Suggests:
     testthat,
     truncdist,
     usethis
-VignetteBuilder: 
+Config/testthat/edition: 3
+VignetteBuilder:
     knitr
-Remotes: 
+Remotes:
     github::epiverse-trace/epiparameter
 Encoding: UTF-8
 LazyData: true
diff --git a/man/bpmodels-package.Rd b/man/bpmodels-package.Rd
index 0d3715d6..c8ee18b2 100644
--- a/man/bpmodels-package.Rd
+++ b/man/bpmodels-package.Rd
@@ -12,22 +12,23 @@ Provides methods to analyse and simulate the size and length of branching proces
 Useful links:
 \itemize{
   \item \url{https://github.com/epiverse-trace/bpmodels}
+  \item \url{https://epiverse-trace.github.io/bpmodels/}
   \item Report bugs at \url{https://github.com/epiverse-trace/bpmodels/issues}
 }
 
 }
 \author{
-\strong{Maintainer}: Sebastian Funk \email{sebastian.funk@lshtm.ac.uk}
+\strong{Maintainer}: James M. Azam \email{james.azam@lshtm.ac.uk} (\href{https://orcid.org/0000-0001-5782-7330}{ORCID})
 
 Authors:
 \itemize{
-  \item Flavio Finger \email{flavio.finger@epicentre.msf.org}
-  \item James M. Azam \email{james.azam@lshtm.ac.uk}
+  \item Sebastian Funk \email{sebastian.funk@lshtm.ac.uk} (\href{https://orcid.org/0000-0002-2842-3406}{ORCID})
+  \item Flavio Finger \email{flavio.finger@epicentre.msf.org} (\href{https://orcid.org/0000-0002-8613-5170}{ORCID})
 }
 
 Other contributors:
 \itemize{
-  \item Zhian N. Kamvar \email{zkamvar@gmail.com} [contributor]
+  \item Zhian N. Kamvar \email{zkamvar@gmail.com} (\href{https://orcid.org/0000-0003-1458-7108}{ORCID}) [contributor]
 }
 
 }

From 04c47f25433780f0b3a0a4ada2ec78466ae0deee Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Tue, 7 Mar 2023 19:32:08 +0000
Subject: [PATCH 205/828] Standardized all warnings and errors to sentence case

---
 R/likelihoods.R            |   2 +-
 R/simulate.r               |  11 ++++++++---
 R/simulate_susceptibles.R  |   5 +++--
 man/Meta/vignette.rds      | Bin 248 -> 0 bytes
 tests/testthat/tests-sim.r |  13 +++++++------
 5 files changed, 19 insertions(+), 12 deletions(-)
 delete mode 100644 man/Meta/vignette.rds

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 48288152..23ce84f2 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -145,7 +145,7 @@ chain_ll <- function(x, offspring, stat = c("size", "length"), obs_prob = 1,
 
   ## checks
   if (!is.character(offspring)) {
-    stop("object passed as 'offspring' is not a character string.")
+    stop("Object passed as 'offspring' is not a character string.")
   }
   if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
   if (obs_prob < 1) {
diff --git a/R/simulate.r b/R/simulate.r
index 477d8f74..2c9df565 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -116,8 +116,10 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
 
   ## first, get random function as given by `offspring`
   if (!is.character(offspring)) {
-    stop("object passed as 'offspring' is not a character string. Did you forget
-             to enclose it in quotes?")
+    stop(paste("Object passed as 'offspring' is not a character string.",
+               "Did you forget to enclose it in quotes?"
+               )
+         )
   }
 
   roffspring_name <- paste0("r", offspring)
@@ -127,7 +129,10 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
 
   if (!missing(serial)) {
     if (!is.function(serial)) {
-      stop("The `serial` argument must be a function (see details in ?chain_sim()).") # nolint
+      stop(paste("The `serial` argument must be a function",
+                 "(see details in ?chain_sim)."
+                 )
+           )
     }
     if (!missing(tree) && tree == FALSE) {
       stop("If `serial` is specified, then `tree` cannot be set to `FALSE`.")
diff --git a/R/simulate_susceptibles.R b/R/simulate_susceptibles.R
index 32013be8..aec46938 100644
--- a/R/simulate_susceptibles.R
+++ b/R/simulate_susceptibles.R
@@ -42,8 +42,9 @@ chain_sim_susc <- function(offspring = c("pois", "nbinom"),
 
   if (offspring == "pois") {
     if (!missing(disp_offspring)) {
-      warning("argument disp_offspring not used for
-                poisson offspring distribution.")
+      warning(paste("Argument 'disp_offspring' not used for",
+                    "poisson offspring distribution.")
+              )
     }
 
     ## using a right truncated poisson distribution
diff --git a/man/Meta/vignette.rds b/man/Meta/vignette.rds
deleted file mode 100644
index 3419d044842410f2b479d671df6848ef0b2df328..0000000000000000000000000000000000000000
GIT binary patch
literal 0
HcmV?d00001

literal 248
zcmV<U00;jciwFP!0000025nKnio!4uP1CBYAP9v;Z~21$gCJhKNZEt8CEJW{cH2af
z6zR<`U!CmQvW*SQWD?$+c`rH42qBatloA?a8K;=W7z>PuBxH*F@@(`M6i%wsyHte~
zpbE(HN(8v|zQZx8j=s{hWkOou7Fb7Rwe=9-rfit5-G>4G%>;KmXt)|2{OPJP0KN_@
zL~H3U>JN=8q5oJT#VfEutH}n=poG8v8Rkc~fbz0~=Auo@>0!nXOtO_Fv~%C2>kjdL
yvwf6N9%^{%-_t)e`jWLC=KdqEm~Oa2qeaPWXmsWuJUbfXd);>-U$Myw0ssK@u6NA<

diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 6582bc39..748f104e 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -134,9 +134,11 @@ test_that("Errors are thrown", {
       serial = function(x) 3,
       pop = 100
     ),
-    "Offspring distribution 'nbinom' requires argument
-                disp_offspring > 1. Use 'pois' if there is no overdispersion."
-  )
+    paste("Offspring distribution 'nbinom'",
+          "requires argument 'disp_offspring' > 1.",
+          "Use 'pois' if there is no overdispersion."
+          )
+    )
   expect_error(
     chain_sim_susc(
       "nbinom",
@@ -144,7 +146,7 @@ test_that("Errors are thrown", {
       serial = function(x) 3,
       pop = 100
     ),
-    "argument \"disp_offspring\" is missing, with no default"
+    "Argument 'disp_offspring' was not specified."
   )
 })
 
@@ -157,7 +159,6 @@ test_that("warnings work as expected", {
       serial = function(x) 3,
       pop = 100
     ),
-    "argument disp_offspring not used for
-                poisson offspring distribution."
+    "Argument 'disp_offspring' not used for poisson offspring distribution."
   )
 })

From a079ceb17939a39653592b2349abb86ff65f0d29 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Tue, 7 Mar 2023 19:33:38 +0000
Subject: [PATCH 206/828] Added input validation to stop when "nbinom"
 offspring is specified but "disp_offspring" missing

---
 R/simulate_susceptibles.R | 14 +++++++++-----
 1 file changed, 9 insertions(+), 5 deletions(-)

diff --git a/R/simulate_susceptibles.R b/R/simulate_susceptibles.R
index aec46938..5f34db4a 100644
--- a/R/simulate_susceptibles.R
+++ b/R/simulate_susceptibles.R
@@ -58,11 +58,15 @@ chain_sim_susc <- function(offspring = c("pois", "nbinom"),
       )
     }
   } else if (offspring == "nbinom") {
-    if (disp_offspring <= 1) { ## dispersion index
-      stop("Offspring distribution 'nbinom' requires argument
-                disp_offspring > 1. Use 'pois' if there is no overdispersion.")
-    }
-
+  if (missing(disp_offspring)) {
+    stop(paste("Argument 'disp_offspring' was not specified."))
+  } else if (disp_offspring <= 1) { ## dispersion index
+    stop(paste(
+      "Offspring distribution 'nbinom' requires",
+      "argument 'disp_offspring' > 1.",
+      "Use 'pois' if there is no overdispersion."
+    ))
+  }
     offspring_fun <- function(n, susc) {
       ## get distribution params from mean and dispersion
       ## see ?rnbinom for parameter definition

From d9d3058ef556e3506f18097d9bdcacfe0f646180 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 8 Mar 2023 15:51:06 +0000
Subject: [PATCH 207/828] replaced paste() with sprintf() in constructed
 messages

---
 R/simulate.r              |  8 +++++---
 R/simulate_susceptibles.R | 10 ++++++----
 2 files changed, 11 insertions(+), 7 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 2c9df565..c6f0147a 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -116,8 +116,9 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
 
   ## first, get random function as given by `offspring`
   if (!is.character(offspring)) {
-    stop(paste("Object passed as 'offspring' is not a character string.",
-               "Did you forget to enclose it in quotes?"
+    stop(sprintf("%s %s",
+                 "Object passed as 'offspring' is not a character string.",
+                 "Did you forget to enclose it in quotes?"
                )
          )
   }
@@ -129,7 +130,8 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
 
   if (!missing(serial)) {
     if (!is.function(serial)) {
-      stop(paste("The `serial` argument must be a function",
+      stop(sprintf("%s %s",
+                   "The `serial` argument must be a function",
                  "(see details in ?chain_sim)."
                  )
            )
diff --git a/R/simulate_susceptibles.R b/R/simulate_susceptibles.R
index 5f34db4a..20fa9f83 100644
--- a/R/simulate_susceptibles.R
+++ b/R/simulate_susceptibles.R
@@ -42,8 +42,10 @@ chain_sim_susc <- function(offspring = c("pois", "nbinom"),
 
   if (offspring == "pois") {
     if (!missing(disp_offspring)) {
-      warning(paste("Argument 'disp_offspring' not used for",
-                    "poisson offspring distribution.")
+      warning(sprintf("%s %s",
+                     "Argument 'disp_offspring' not used for",
+                    "poisson offspring distribution."
+                    )
               )
     }
 
@@ -59,9 +61,9 @@ chain_sim_susc <- function(offspring = c("pois", "nbinom"),
     }
   } else if (offspring == "nbinom") {
   if (missing(disp_offspring)) {
-    stop(paste("Argument 'disp_offspring' was not specified."))
+    stop(sprintf("%s", "Argument 'disp_offspring' was not specified."))
   } else if (disp_offspring <= 1) { ## dispersion index
-    stop(paste(
+    stop(sprintf("%s %s %s",
       "Offspring distribution 'nbinom' requires",
       "argument 'disp_offspring' > 1.",
       "Use 'pois' if there is no overdispersion."

From aaec3eca55704b198f3970be0d32b9e221677d8c Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Mon, 6 Mar 2023 17:17:34 +0000
Subject: [PATCH 208/828] updated NEWS with new changes in release

---
 NEWS.md | 11 +++++++++++
 1 file changed, 11 insertions(+)

diff --git a/NEWS.md b/NEWS.md
index 0e74623f..1e6e590f 100644
--- a/NEWS.md
+++ b/NEWS.md
@@ -1,3 +1,14 @@
+# bpmodels 0.1.1
+
+* `chain_sim()`'s help file has been updated with more details and examples
+
+* `chain_sim()` now throws a warning instead of an error when `tree` is set 
+to `FALSE` with `serial` also specified
+
+* README has been updated with what was previously the introduction vignette
+
+* A new vignette has been added to showcase a use-case with COVID-19 data
+
 # bpmodels 0.1.9999
 
 * faster, vectorised chain simulations

From db44913d28a9b168643af8107c3b5ae005e02d82 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Tue, 7 Mar 2023 17:32:30 +0000
Subject: [PATCH 209/828] incremented version to 0.2.0

---
 DESCRIPTION | 1 +
 1 file changed, 1 insertion(+)

diff --git a/DESCRIPTION b/DESCRIPTION
index 2398e41d..50603bcf 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -1,6 +1,7 @@
 Package: bpmodels
 Title: Analysing chain statistics using branching process models
 Version: 0.1.9999
+Version: 0.2.0
 Authors@R: c(
     person(
     given = "Sebastian",

From 12b1e15fa5f5f2a8f92d719e35bc284220eef24c Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Tue, 7 Mar 2023 17:52:12 +0000
Subject: [PATCH 210/828] updated changelog for the minor release

---
 NEWS.md | 27 +++++++++++++++++++++------
 1 file changed, 21 insertions(+), 6 deletions(-)

diff --git a/NEWS.md b/NEWS.md
index 1e6e590f..a0d42bcd 100644
--- a/NEWS.md
+++ b/NEWS.md
@@ -1,13 +1,28 @@
-# bpmodels 0.1.1
+# bpmodels 0.2.0
 
-* `chain_sim()`'s help file has been updated with more details and examples
+## Documentation
 
-* `chain_sim()` now throws a warning instead of an error when `tree` is set 
-to `FALSE` with `serial` also specified
+* `chain_sim()`'s help file has been updated with more details. In particular,
+we describe in detail how to specify the `serial` argument as a function. We 
+have also added more examples.
 
-* README has been updated with what was previously the introduction vignette
+* A new vignette describing how to project COVID-19 incidence with `chain_sim()`
+has been added and can be accessed on the 
+[bpmodels website](https://epiverse-trace.github.io/bpmodels/) under "Articles".
 
-* A new vignette has been added to showcase a use-case with COVID-19 data
+* The README's "quick start" section has been updated with what was 
+previously the introduction vignette.
+
+## Minor functionality change
+
+* `chain_sim()` now throws a warning, instead of an error, when `tree` is set 
+to `FALSE` with `serial` also specified. Providing a serial interval implicitly
+means you want the tree of transmissions to be simulated, so `chain_sim()`
+internally sets `tree = TRUE` and throws a warning explaining what happened. 
+This behaviour should not break any simulations with previous versions 
+with `bpmodels`, but if it does, please submit an issue. 
+To remove the warning, the user should explicitly set `tree = TRUE` when 
+they specify `serial`. 
 
 # bpmodels 0.1.9999
 

From 17523c3dec2ecfed30a5fbba10a2f0a258baa25a Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 8 Mar 2023 18:02:45 +0000
Subject: [PATCH 211/828] removed erroneous version number

---
 DESCRIPTION | 1 -
 1 file changed, 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 50603bcf..bf767d70 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -1,6 +1,5 @@
 Package: bpmodels
 Title: Analysing chain statistics using branching process models
-Version: 0.1.9999
 Version: 0.2.0
 Authors@R: c(
     person(

From 4e8b35acb8180aefb7ca2931ed120f3b2950551c Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 8 Mar 2023 18:20:54 +0000
Subject: [PATCH 212/828] Added patterns to enable Github count .Rmd files as R
 code

---
 .gitattributes | 1 +
 1 file changed, 1 insertion(+)
 create mode 100644 .gitattributes

diff --git a/.gitattributes b/.gitattributes
new file mode 100644
index 00000000..35c5dbc8
--- /dev/null
+++ b/.gitattributes
@@ -0,0 +1 @@
+*.[Rr]md  linguist-detectable

From 141b5656c9a1c5d6816f8a6364d91612b6659bac Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 8 Mar 2023 18:21:37 +0000
Subject: [PATCH 213/828] Updated rbuild ignore files

---
 .Rbuildignore | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/.Rbuildignore b/.Rbuildignore
index b21681d2..be512134 100644
--- a/.Rbuildignore
+++ b/.Rbuildignore
@@ -5,10 +5,11 @@
 ^codecov\.yml$
 ^README\.Rmd$
 ^\.lintr$
+^\.gitattributes$
 ^\_pkgdown.yml$
 ^cran-comments\.md$
 ^doc$
 ^docs$
 ^Meta$
 ^pkgdown$
-^data-raw$
\ No newline at end of file
+^data-raw$

From 1e1809c44cb9f65c04d775d701c05e568cbf1960 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 8 Mar 2023 18:32:52 +0000
Subject: [PATCH 214/828] updated the pattern instruction for Github linguist

---
 .gitattributes | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/.gitattributes b/.gitattributes
index 35c5dbc8..d8d3291f 100644
--- a/.gitattributes
+++ b/.gitattributes
@@ -1 +1 @@
-*.[Rr]md  linguist-detectable
+*.[Rr]md linguist-language=R

From c72aa26f32ff66e6ecbd72948b0643fdea37a0a4 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 8 Mar 2023 18:39:19 +0000
Subject: [PATCH 215/828] Updated pattern for Github linguist

---
 .gitattributes | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/.gitattributes b/.gitattributes
index d8d3291f..2b8d2ea0 100644
--- a/.gitattributes
+++ b/.gitattributes
@@ -1 +1 @@
-*.[Rr]md linguist-language=R
+*.* linguist-language=R

From a20d1de4a0536a46b60df1cd79ae6eacf57d2567 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Wed, 8 Mar 2023 18:46:11 +0000
Subject: [PATCH 216/828] Updated pattern for Gihub linguist to detect .md and
 .Rmd files

---
 .gitattributes | 10 +++++++++-
 1 file changed, 9 insertions(+), 1 deletion(-)

diff --git a/.gitattributes b/.gitattributes
index 2b8d2ea0..cdf5f6f6 100644
--- a/.gitattributes
+++ b/.gitattributes
@@ -1 +1,9 @@
-*.* linguist-language=R
+*.md linguist-vendored=false
+*.md linguist-generated=false
+*.md linguist-documentation=false
+*.md linguist-detectable=true
+*.Rmd linguist-vendored=false
+*.Rmd linguist-generated=false
+*.Rmd linguist-documentation=false
+*.Rmd linguist-detectable=true
+

From 3a1f1ffeac9b3e9033610f472f4016e30299fe9d Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Thu, 9 Mar 2023 22:25:50 +0000
Subject: [PATCH 217/828] Set linguist to detect .rmd and .md files as R (#46)

* Added patterns to enable Github count .Rmd files as R code

* Updated rbuild ignore files

* updated the pattern instruction for Github linguist

* Updated pattern for Github linguist

* Updated pattern for Gihub linguist to detect .md and .Rmd files

* set linguist to count .rmd and .md files as R

* set linguist to count .rmd and .md files as R
---
 .gitattributes | 11 ++---------
 1 file changed, 2 insertions(+), 9 deletions(-)

diff --git a/.gitattributes b/.gitattributes
index cdf5f6f6..1240be10 100644
--- a/.gitattributes
+++ b/.gitattributes
@@ -1,9 +1,2 @@
-*.md linguist-vendored=false
-*.md linguist-generated=false
-*.md linguist-documentation=false
-*.md linguist-detectable=true
-*.Rmd linguist-vendored=false
-*.Rmd linguist-generated=false
-*.Rmd linguist-documentation=false
-*.Rmd linguist-detectable=true
-
+*.[Rr]md linguist-language=R
+*.md linguist-language=R

From 6f3556ae215bce686d772f3285080bf4e955b264 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Thu, 9 Mar 2023 23:24:53 +0000
Subject: [PATCH 218/828] Set linguist to vignette files in repo language stats
 (#47)

* set linguist to count .rmd and .md files as R

* set linguist to ignore files in vignette directory
---
 .gitattributes | 1 +
 1 file changed, 1 insertion(+)

diff --git a/.gitattributes b/.gitattributes
index 1240be10..d08dc7c9 100644
--- a/.gitattributes
+++ b/.gitattributes
@@ -1,2 +1,3 @@
 *.[Rr]md linguist-language=R
 *.md linguist-language=R
+vignettes/* linguist-documentation
\ No newline at end of file

From f3689b0b13916e3ec3a07a8c276e9eae5c3c5231 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Fri, 21 Apr 2023 12:44:57 +0100
Subject: [PATCH 219/828] rephrased the package title

---
 DESCRIPTION | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index bf767d70..1ea6d21a 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -1,5 +1,5 @@
 Package: bpmodels
-Title: Analysing chain statistics using branching process models
+Title: Analysing transmission chain statistics using branching process models
 Version: 0.2.0
 Authors@R: c(
     person(

From 89bd11881860c8350104a4dc8d03ac9a8ef9f4f6 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Fri, 21 Apr 2023 13:11:43 +0100
Subject: [PATCH 220/828] rendered readme to reflect rephrased package title

---
 README.Rmd |  2 +-
 README.md  | 18 ++++++++----------
 2 files changed, 9 insertions(+), 11 deletions(-)

diff --git a/README.Rmd b/README.Rmd
index 4827753b..0b48e9f1 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -13,7 +13,7 @@ knitr::opts_chunk$set(
 )
 ```
 
-# _bpmodels_: Methods for analysing the size and length of chains from branching process models
+# _bpmodels_: Methods for analysing the size and length of transmission chains from branching process models
 
 <!-- badges: start -->
 ![GitHub R package version](https://img.shields.io/github/r-package/v/epiverse-trace/bpmodels)
diff --git a/README.md b/README.md
index 46136865..648f931c 100644
--- a/README.md
+++ b/README.md
@@ -1,5 +1,5 @@
 
-# *bpmodels*: Methods for analysing the size and length of chains from branching process models
+# *bpmodels*: Methods for analysing the size and length of transmission chains from branching process models
 
 <!-- badges: start -->
 
@@ -210,20 +210,18 @@ By contributing to this project, you agree to abide by its terms.
 ``` r
 citation("bpmodels")
 #> 
-#> To cite package 'bpmodels' in publications use:
+#> To cite package bpmodels in publications use:
 #> 
-#>   Funk S, Finger F, Azam J (2023). _bpmodels: Analysing chain
-#>   statistics using branching process models_.
-#>   https://github.com/epiverse-trace/bpmodels,
-#>   https://epiverse-trace.github.io/bpmodels/.
+#>   Sebastian Funk, Flavio Finger, and James M. Azam (2023). bpmodels:
+#>   Analysing transmission chain statistics using branching process
+#>   models, website: https://github.com/epiverse-trace/bpmodels/
 #> 
 #> A BibTeX entry for LaTeX users is
 #> 
 #>   @Manual{,
-#>     title = {bpmodels: Analysing chain statistics using branching process models},
-#>     author = {Sebastian Funk and Flavio Finger and James M. Azam},
+#>     title = {bpmodels: Analysing transmission chain statistics using branching process models},
+#>     author = {{Sebastian Funk} and {Flavio Finger} and {James M. Azam}},
 #>     year = {2023},
-#>     note = {https://github.com/epiverse-trace/bpmodels,
-#> https://epiverse-trace.github.io/bpmodels/},
+#>     url = {https://github.com/epiverse-trace/bpmodels/},
 #>   }
 ```

From d95f21161b98ebd1d4a87bb39877e5c91dbad6df Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Fri, 21 Apr 2023 13:11:58 +0100
Subject: [PATCH 221/828] added citation files

---
 .Rbuildignore           |   1 +
 CITATION.cff            | 267 ++++++++++++++++++++++++++++++++++++++++
 inst/CITATION           |  16 +++
 man/bpmodels-package.Rd |   2 +-
 4 files changed, 285 insertions(+), 1 deletion(-)
 create mode 100644 CITATION.cff
 create mode 100644 inst/CITATION

diff --git a/.Rbuildignore b/.Rbuildignore
index be512134..6bb079a6 100644
--- a/.Rbuildignore
+++ b/.Rbuildignore
@@ -13,3 +13,4 @@
 ^Meta$
 ^pkgdown$
 ^data-raw$
+^CITATION\.cff$
diff --git a/CITATION.cff b/CITATION.cff
new file mode 100644
index 00000000..1b32ebeb
--- /dev/null
+++ b/CITATION.cff
@@ -0,0 +1,267 @@
+# -----------------------------------------------------------
+# CITATION file created with {cffr} R package, v0.4.1
+# See also: https://docs.ropensci.org/cffr/
+# -----------------------------------------------------------
+ 
+cff-version: 1.2.0
+message: 'To cite package "bpmodels" in publications use:'
+type: software
+license: MIT
+title: 'bpmodels: Analysing transmission chain statistics using branching process
+  models'
+version: 0.2.0
+abstract: Provides methods to analyse and simulate the size and length of branching
+  processes with an arbitrary offspring distribution. These can be used, for example,
+  to analyse the distribution of chain sizes or length of infectious disease outbreaks,
+  as discussed in Farrington et al. (2003) <doi:10.1093/biostatistics/4.2.279>.
+authors:
+- family-names: Funk
+  given-names: Sebastian
+  email: sebastian.funk@lshtm.ac.uk
+  orcid: https://orcid.org/0000-0002-2842-3406
+- family-names: Finger
+  given-names: Flavio
+  email: flavio.finger@epicentre.msf.org
+  orcid: https://orcid.org/0000-0002-8613-5170
+- family-names: Azam
+  given-names: James M.
+  email: james.azam@lshtm.ac.uk
+  orcid: https://orcid.org/0000-0001-5782-7330
+repository-code: https://github.com/epiverse-trace/bpmodels
+url: https://epiverse-trace.github.io/bpmodels/
+contact:
+- family-names: Azam
+  given-names: James M.
+  email: james.azam@lshtm.ac.uk
+  orcid: https://orcid.org/0000-0001-5782-7330
+keywords:
+- branching-process
+- epidemic-dynamics
+- epidemic-modelling
+- epidemic-simulations
+- outbreak-simulator
+- r
+- r-package
+- transmission-chain
+- transmission-chain-reconstruction
+references:
+- type: software
+  title: 'R: A Language and Environment for Statistical Computing'
+  notes: Depends
+  url: https://www.R-project.org/
+  authors:
+  - name: R Core Team
+  location:
+    name: Vienna, Austria
+  year: '2023'
+  institution:
+    name: R Foundation for Statistical Computing
+  version: '>= 3.0.0'
+- type: software
+  title: bookdown
+  abstract: 'bookdown: Authoring Books and Technical Documents with R Markdown'
+  notes: Suggests
+  url: https://pkgs.rstudio.com/bookdown/
+  repository: https://CRAN.R-project.org/package=bookdown
+  authors:
+  - family-names: Xie
+    given-names: Yihui
+    email: xie@yihui.name
+    orcid: https://orcid.org/0000-0003-0645-5666
+  year: '2023'
+- type: software
+  title: covr
+  abstract: 'covr: Test Coverage for Packages'
+  notes: Suggests
+  url: https://covr.r-lib.org
+  repository: https://CRAN.R-project.org/package=covr
+  authors:
+  - family-names: Hester
+    given-names: Jim
+    email: james.f.hester@gmail.com
+  year: '2023'
+- type: software
+  title: dplyr
+  abstract: 'dplyr: A Grammar of Data Manipulation'
+  notes: Suggests
+  url: https://dplyr.tidyverse.org
+  repository: https://CRAN.R-project.org/package=dplyr
+  authors:
+  - family-names: Wickham
+    given-names: Hadley
+    email: hadley@posit.co
+    orcid: https://orcid.org/0000-0003-4757-117X
+  - family-names: François
+    given-names: Romain
+    orcid: https://orcid.org/0000-0002-2444-4226
+  - family-names: Henry
+    given-names: Lionel
+  - family-names: Müller
+    given-names: Kirill
+    orcid: https://orcid.org/0000-0002-1416-3412
+  - family-names: Vaughan
+    given-names: Davis
+    email: davis@posit.co
+    orcid: https://orcid.org/0000-0003-4777-038X
+  year: '2023'
+- type: software
+  title: epiparameter
+  abstract: 'epiparameter: Library of Epidemiological Parameters'
+  notes: Suggests
+  url: https://epiverse-trace.github.io/epiparameter/
+  authors:
+  - family-names: Lambert
+    given-names: Joshua W.
+    email: joshua.lambert@lshtm.ac.uk
+    orcid: https://orcid.org/0000-0001-5218-3046
+  - family-names: Kucharski
+    given-names: Adam
+    email: adam.kucharski@lshtm.ac.uk
+    orcid: https://orcid.org/0000-0001-8814-9421
+  year: '2023'
+- type: software
+  title: ggplot2
+  abstract: 'ggplot2: Create Elegant Data Visualisations Using the Grammar of Graphics'
+  notes: Suggests
+  url: https://ggplot2.tidyverse.org
+  repository: https://CRAN.R-project.org/package=ggplot2
+  authors:
+  - family-names: Wickham
+    given-names: Hadley
+    email: hadley@posit.co
+    orcid: https://orcid.org/0000-0003-4757-117X
+  - family-names: Chang
+    given-names: Winston
+    orcid: https://orcid.org/0000-0002-1576-2126
+  - family-names: Henry
+    given-names: Lionel
+  - family-names: Pedersen
+    given-names: Thomas Lin
+    email: thomas.pedersen@posit.co
+    orcid: https://orcid.org/0000-0002-5147-4711
+  - family-names: Takahashi
+    given-names: Kohske
+  - family-names: Wilke
+    given-names: Claus
+    orcid: https://orcid.org/0000-0002-7470-9261
+  - family-names: Woo
+    given-names: Kara
+    orcid: https://orcid.org/0000-0002-5125-4188
+  - family-names: Yutani
+    given-names: Hiroaki
+    orcid: https://orcid.org/0000-0002-3385-7233
+  - family-names: Dunnington
+    given-names: Dewey
+    orcid: https://orcid.org/0000-0002-9415-4582
+  year: '2023'
+- type: software
+  title: knitr
+  abstract: 'knitr: A General-Purpose Package for Dynamic Report Generation in R'
+  notes: Suggests
+  url: https://yihui.org/knitr/
+  repository: https://CRAN.R-project.org/package=knitr
+  authors:
+  - family-names: Xie
+    given-names: Yihui
+    email: xie@yihui.name
+    orcid: https://orcid.org/0000-0003-0645-5666
+  year: '2023'
+- type: software
+  title: lubridate
+  abstract: 'lubridate: Make Dealing with Dates a Little Easier'
+  notes: Suggests
+  url: https://lubridate.tidyverse.org
+  repository: https://CRAN.R-project.org/package=lubridate
+  authors:
+  - family-names: Spinu
+    given-names: Vitalie
+    email: spinuvit@gmail.com
+  - family-names: Grolemund
+    given-names: Garrett
+  - family-names: Wickham
+    given-names: Hadley
+  year: '2023'
+- type: software
+  title: rmarkdown
+  abstract: 'rmarkdown: Dynamic Documents for R'
+  notes: Suggests
+  url: https://pkgs.rstudio.com/rmarkdown/
+  repository: https://CRAN.R-project.org/package=rmarkdown
+  authors:
+  - family-names: Allaire
+    given-names: JJ
+    email: jj@rstudio.com
+  - family-names: Xie
+    given-names: Yihui
+    email: xie@yihui.name
+    orcid: https://orcid.org/0000-0003-0645-5666
+  - family-names: McPherson
+    given-names: Jonathan
+    email: jonathan@rstudio.com
+  - family-names: Luraschi
+    given-names: Javier
+    email: javier@rstudio.com
+  - family-names: Ushey
+    given-names: Kevin
+    email: kevin@rstudio.com
+  - family-names: Atkins
+    given-names: Aron
+    email: aron@rstudio.com
+  - family-names: Wickham
+    given-names: Hadley
+    email: hadley@rstudio.com
+  - family-names: Cheng
+    given-names: Joe
+    email: joe@rstudio.com
+  - family-names: Chang
+    given-names: Winston
+    email: winston@rstudio.com
+  - family-names: Iannone
+    given-names: Richard
+    email: rich@rstudio.com
+    orcid: https://orcid.org/0000-0003-3925-190X
+  year: '2023'
+- type: software
+  title: testthat
+  abstract: 'testthat: Unit Testing for R'
+  notes: Suggests
+  url: https://testthat.r-lib.org
+  repository: https://CRAN.R-project.org/package=testthat
+  authors:
+  - family-names: Wickham
+    given-names: Hadley
+    email: hadley@rstudio.com
+  year: '2023'
+- type: software
+  title: truncdist
+  abstract: 'truncdist: Truncated Random Variables'
+  notes: Suggests
+  repository: https://CRAN.R-project.org/package=truncdist
+  authors:
+  - family-names: Novomestky
+    given-names: Frederick
+    email: fn334@nyu.edu
+  - family-names: Nadarajah
+    given-names: Saralees
+    email: saralees.nadarajah@manchester.ac.uk
+  year: '2023'
+- type: software
+  title: usethis
+  abstract: 'usethis: Automate Package and Project Setup'
+  notes: Suggests
+  url: https://usethis.r-lib.org
+  repository: https://CRAN.R-project.org/package=usethis
+  authors:
+  - family-names: Wickham
+    given-names: Hadley
+    email: hadley@rstudio.com
+    orcid: https://orcid.org/0000-0003-4757-117X
+  - family-names: Bryan
+    given-names: Jennifer
+    email: jenny@rstudio.com
+    orcid: https://orcid.org/0000-0002-6983-2759
+  - family-names: Barrett
+    given-names: Malcolm
+    email: malcolmbarrett@gmail.com
+    orcid: https://orcid.org/0000-0003-0299-5825
+  year: '2023'
diff --git a/inst/CITATION b/inst/CITATION
new file mode 100644
index 00000000..372018e3
--- /dev/null
+++ b/inst/CITATION
@@ -0,0 +1,16 @@
+citHeader("To cite package bpmodels in publications use:")
+
+citEntry(
+  entry = "Manual",
+  title = "bpmodels: Analysing transmission chain statistics using branching process models",
+  author = c(person("Sebastian Funk"), person("Flavio Finger"), person("James M. Azam")),
+  year     = "2023",
+  url      = "https://github.com/epiverse-trace/bpmodels/",
+  textVersion =
+  sprintf("%s %s %s %s",
+  "Sebastian Funk, Flavio Finger, and James M. Azam (2023).",
+  "bpmodels: Analysing transmission chain statistics",
+  "using branching process models,",
+  "website: https://github.com/epiverse-trace/bpmodels/"
+  )
+)
diff --git a/man/bpmodels-package.Rd b/man/bpmodels-package.Rd
index c8ee18b2..4b6b9458 100644
--- a/man/bpmodels-package.Rd
+++ b/man/bpmodels-package.Rd
@@ -4,7 +4,7 @@
 \name{bpmodels-package}
 \alias{bpmodels}
 \alias{bpmodels-package}
-\title{bpmodels: Analysing chain statistics using branching process models}
+\title{bpmodels: Analysing transmission chain statistics using branching process models}
 \description{
 Provides methods to analyse and simulate the size and length of branching processes with an arbitrary offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks, as discussed in Farrington et al. (2003) \doi{10.1093/biostatistics/4.2.279}.
 }

From 4a578724189e1aac8bb2a1dab94c723fb2cfccf2 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Tue, 28 Mar 2023 19:19:06 +0100
Subject: [PATCH 222/828] change error to warning

---
 R/simulate.r | 12 ++++++++----
 1 file changed, 8 insertions(+), 4 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index c6f0147a..e51ff0bf 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -136,10 +136,14 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
                  )
            )
     }
-    if (!missing(tree) && tree == FALSE) {
-      stop("If `serial` is specified, then `tree` cannot be set to `FALSE`.")
-    }
-    tree <- TRUE
+      if (!missing(tree) && isFALSE(tree)) {
+            warning(sprintf("%s %s",
+                            "`serial` can't be used with `tree = FALSE`;",
+                          "Setting `tree = TRUE` internally."
+                          )
+                    )
+          tree <- TRUE
+          }
   } else if (!missing(tf)) {
     stop("If `tf` is specified, `serial` must be specified too.")
   }

From 12ff7c7da0ba2f72b7f40200d2712ae667f7b45a Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Tue, 28 Mar 2023 19:20:14 +0100
Subject: [PATCH 223/828] added tests for change in chain_sim

---
 tests/testthat/tests-sim.r | 29 ++++++++++++++++++++---------
 1 file changed, 20 insertions(+), 9 deletions(-)

diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 748f104e..bb053a43 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -41,15 +41,12 @@ test_that("Errors are thrown", {
   )
   expect_error(
     chain_sim(
-      n = 2, offspring = "pois", "size", lambda = 0.9,
-      serial = function(x) rpois(x, 0.9), tree = FALSE
-    ),
-    "If `serial` is specified, then `tree` cannot be set to `FALSE`."
-  )
-  expect_error(
-    chain_sim(
-      n = 2, offspring = "pois", "size", lambda = 0.9,
-      tf = 5, tree = FALSE
+      n = 2,
+      offspring = "pois",
+      "size",
+      lambda = 0.9,
+      tf = 5,
+      tree = FALSE
     ),
     "If `tf` is specified, `serial` must be specified too."
   )
@@ -161,4 +158,18 @@ test_that("warnings work as expected", {
     ),
     "Argument 'disp_offspring' not used for poisson offspring distribution."
   )
+  expect_warning(
+    chain_sim(
+      n = 2,
+      offspring = "pois",
+      "size",
+      lambda = 0.9,
+      serial = function(x) rpois(x, 0.9),
+      tree = FALSE
+    ),
+    sprintf("%s %s",
+            "`serial` can't be used with `tree = FALSE`;",
+            "Setting `tree = TRUE` internally."
+    )
+  )
 })

From 4a10c5185d96b71c79e24f61dd118a18da82dc38 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Tue, 4 Apr 2023 21:45:23 +0100
Subject: [PATCH 224/828] Assigned tree to TRUE after throwing warning

---
 R/simulate.r | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index e51ff0bf..ef9fec91 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -142,8 +142,8 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
                           "Setting `tree = TRUE` internally."
                           )
                     )
-          tree <- TRUE
-          }
+      }
+    tree <- TRUE
   } else if (!missing(tf)) {
     stop("If `tf` is specified, `serial` must be specified too.")
   }

From 7be58e14441cbfdb151cef9f80678d0f3505eaa5 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Tue, 4 Apr 2023 21:59:07 +0100
Subject: [PATCH 225/828] bumped up R version

---
 DESCRIPTION | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 1ea6d21a..da4a4da2 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -39,7 +39,7 @@ License: MIT + file LICENSE
 URL: https://github.com/epiverse-trace/bpmodels, https://epiverse-trace.github.io/bpmodels/
 BugReports: https://github.com/epiverse-trace/bpmodels/issues
 Depends:
-    R (>= 3.0.0)
+    R (>= 3.6.0)
 Suggests:
     bookdown,
     covr,

From 5e412670a73cde3954c8b8f46436e477fbfae229 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Fri, 21 Apr 2023 16:56:46 +0100
Subject: [PATCH 226/828] bumped up the version number

---
 DESCRIPTION | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index da4a4da2..42f02eb3 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -1,6 +1,6 @@
 Package: bpmodels
 Title: Analysing transmission chain statistics using branching process models
-Version: 0.2.0
+Version: 0.2.1
 Authors@R: c(
     person(
     given = "Sebastian",

From 3b18492734d44a73bfb7e029757d46341688fda3 Mon Sep 17 00:00:00 2001
From: James Azam <james.azam@lshtm.ac.uk>
Date: Fri, 21 Apr 2023 17:00:05 +0100
Subject: [PATCH 227/828] updated the changelog

---
 NEWS.md | 24 +++++++++++++-----------
 1 file changed, 13 insertions(+), 11 deletions(-)

diff --git a/NEWS.md b/NEWS.md
index a0d42bcd..2c1c5eef 100644
--- a/NEWS.md
+++ b/NEWS.md
@@ -1,3 +1,16 @@
+# bpmodels 0.2.1
+
+## Minor functionality change
+
+* `chain_sim()` now throws a warning, instead of an error, when `tree` is set 
+to `FALSE` with `serial` also specified. We assume that providing a serial 
+interval means you want the tree of transmissions to be simulated, 
+so `chain_sim()` internally sets `tree = TRUE` and throws a warning explaining 
+what happened. This behaviour should not break any simulations with previous 
+versions with `bpmodels`, but if it does, please submit an issue. 
+To remove the warning, the user should explicitly set `tree = TRUE` when 
+they specify `serial`. 
+
 # bpmodels 0.2.0
 
 ## Documentation
@@ -13,17 +26,6 @@ has been added and can be accessed on the
 * The README's "quick start" section has been updated with what was 
 previously the introduction vignette.
 
-## Minor functionality change
-
-* `chain_sim()` now throws a warning, instead of an error, when `tree` is set 
-to `FALSE` with `serial` also specified. Providing a serial interval implicitly
-means you want the tree of transmissions to be simulated, so `chain_sim()`
-internally sets `tree = TRUE` and throws a warning explaining what happened. 
-This behaviour should not break any simulations with previous versions 
-with `bpmodels`, but if it does, please submit an issue. 
-To remove the warning, the user should explicitly set `tree = TRUE` when 
-they specify `serial`. 
-
 # bpmodels 0.1.9999
 
 * faster, vectorised chain simulations

From ae8589ad510eae8d0541f753bbf1141786fd012e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 22 May 2023 12:29:32 +0100
Subject: [PATCH 228/828] replaced bibtex library with csl-json

---
 vignettes/references.json | 853 ++++++++++++++++++++++++++++++++++++++
 1 file changed, 853 insertions(+)
 create mode 100644 vignettes/references.json

diff --git a/vignettes/references.json b/vignettes/references.json
new file mode 100644
index 00000000..f1678a28
--- /dev/null
+++ b/vignettes/references.json
@@ -0,0 +1,853 @@
+[
+    {
+        "id": "abbott2020",
+        "author": [
+            {
+                "family": "Abbott",
+                "given": "Sam"
+            },
+            {
+                "family": "Hellewell",
+                "given": "Joel"
+            },
+            {
+                "family": "Munday",
+                "given": "James"
+            },
+            {
+                "family": "Funk",
+                "given": "Sebastian"
+            },
+            {
+                "family": "group",
+                "given": "CMMID",
+                "dropping-particle": "nCoV working"
+            },
+            {
+                "literal": "others"
+            }
+        ],
+        "citation-key": "abbott2020",
+        "container-title": "Wellcome open research",
+        "issued": {
+            "date-parts": [
+                [
+                    2020
+                ]
+            ]
+        },
+        "publisher": "The Wellcome Trust",
+        "title": "The transmissibility of novel Coronavirus in the early stages of the 2019-20 outbreak in Wuhan: Exploring initial point-source exposure sizes and durations using scenario analysis",
+        "type": "article-journal",
+        "volume": "5"
+    },
+    {
+        "id": "alene2021",
+        "abstract": "Background: Understanding the epidemiological parameters that determine the transmission dynamics of COVID-19 is essential for public health intervention. Globally, a number of studies were conducted to estimate the average serial interval and incubation period of COVID-19. Combining findings of existing studies that estimate the average serial interval and incubation period of COVID-19 significantly improves the quality of evidence. Hence, this study aimed to determine the overall average serial interval and incubation period of COVID-19. Methods: We followed the PRISMA checklist to present this study. A comprehensive search strategy was carried out from international electronic databases (Google Scholar, PubMed, Science Direct, Web of Science, CINAHL, and Cochrane Library) by two experienced reviewers (MAA and DBK) authors between the 1st of June and the 31st of July 2020. All observational studies either reporting the serial interval or incubation period in persons diagnosed with COVID-19 were included in this study. Heterogeneity across studies was assessed using the I2 and Higgins test. The NOS adapted for cross-sectional studies was used to evaluate the quality of studies. A random effect Meta-analysis was employed to determine the pooled estimate with 95% (CI). Microsoft Excel was used for data extraction and R software was used for analysis. Results: We combined a total of 23 studies to estimate the overall mean serial interval of COVID-19. The mean serial interval of COVID-19 ranged from 4. 2 to 7.5 days. Our meta-analysis showed that the weighted pooled mean serial interval of COVID-19 was 5.2 (95%CI: 4.9–5.5) days. Additionally, to pool the mean incubation period of COVID-19, we included 14 articles. The mean incubation period of COVID-19 also ranged from 4.8 to 9 days. Accordingly, the weighted pooled mean incubation period of COVID-19 was 6.5 (95%CI: 5.9–7.1) days. Conclusions: This systematic review and meta-analysis showed that the weighted pooled mean serial interval and incubation period of COVID-19 were 5.2, and 6.5 days, respectively. In this study, the average serial interval of COVID-19 is shorter than the average incubation period, which suggests that substantial numbers of COVID-19 cases will be attributed to presymptomatic transmission.",
+        "author": [
+            {
+                "family": "Alene",
+                "given": "Muluneh"
+            },
+            {
+                "family": "Yismaw",
+                "given": "Leltework"
+            },
+            {
+                "family": "Assemie",
+                "given": "Moges Agazhe"
+            },
+            {
+                "family": "Ketema",
+                "given": "Daniel Bekele"
+            },
+            {
+                "family": "Gietaneh",
+                "given": "Wodaje"
+            },
+            {
+                "family": "Birhan",
+                "given": "Tilahun Yemanu"
+            }
+        ],
+        "citation-key": "alene2021",
+        "container-title": "BMC Infectious Diseases",
+        "DOI": "10.1186/s12879-021-05950-x",
+        "ISSN": "14712334",
+        "issue": "1",
+        "issued": {
+            "date-parts": [
+                [
+                    2021
+                ]
+            ]
+        },
+        "page": "1–9",
+        "PMID": "33706702",
+        "publisher": "BMC Infectious Diseases",
+        "title": "Serial interval and incubation period of COVID-19: a systematic review and meta-analysis",
+        "type": "article-journal",
+        "volume": "21"
+    },
+    {
+        "id": "allen2012",
+        "abstract": "The basic reproduction number, ℛ(0), one of the most well-known thresholds in deterministic epidemic theory, predicts a disease outbreak if ℛ(0)>1. In stochastic epidemic theory, there are also thresholds that predict a major outbreak. In the case of a single infectious group, if ℛ(0)>1 and i infectious individuals are introduced into a susceptible population, then the probability of a major outbreak is approximately 1-(1/ℛ(0))( i ). With multiple infectious groups from which the disease could emerge, this result no longer holds. Stochastic thresholds for multiple groups depend on the number of individuals within each group, i ( j ), j=1, \\ldots, n, and on the probability of disease extinction for each group, q ( j ). It follows from multitype branching processes that the probability of a major outbreak is approximately [Formula: see text]. In this investigation, we summarize some of the deterministic and stochastic threshold theory, illustrate how to calculate the stochastic thresholds, and derive some new relationships between the deterministic and stochastic thresholds.",
+        "author": [
+            {
+                "family": "Allen",
+                "given": "Linda J.S."
+            },
+            {
+                "family": "Lahodny",
+                "given": "Glenn E."
+            }
+        ],
+        "citation-key": "allen2012",
+        "container-title": "Journal of Biological Dynamics",
+        "DOI": "10.1080/17513758.2012.665502",
+        "ISSN": "17513758",
+        "issue": "2",
+        "issued": {
+            "date-parts": [
+                [
+                    2012
+                ]
+            ]
+        },
+        "page": "590–611",
+        "title": "Extinction thresholds in deterministic and stochastic epidemic models",
+        "type": "article-journal",
+        "volume": "6"
+    },
+    {
+        "id": "becker1977",
+        "author": [
+            {
+                "family": "Becker",
+                "given": "Niels"
+            },
+            {
+                "family": "Society",
+                "given": "International Biometric"
+            }
+        ],
+        "citation-key": "becker1977",
+        "container-title": "Biometrics",
+        "ISSN": "0006-341X",
+        "issue": "3",
+        "issued": {
+            "date-parts": [
+                [
+                    1977
+                ]
+            ]
+        },
+        "page": "515–522",
+        "publisher": "JSTOR",
+        "title": "Estimation for discrete time branching processes with application to epidemics",
+        "type": "article-journal",
+        "volume": "33"
+    },
+    {
+        "id": "blumberg2013",
+        "abstract": "Many diseases exhibit subcritical transmission (i.e. 0<R0<1) so that infections occur as self-limited 'stuttering chains'. Given an ensemble of stuttering chains, information about the number of cases in each chain can be used to infer R0, which is of crucial importance for monitoring the risk that a disease will emerge to establish endemic circulation. However, the challenge of imperfect case detection has led authors to adopt a variety of work-around measures when inferring R0, such as discarding data on isolated cases or aggregating intermediate-sized chains together. Each of these methods has the potential to introduce bias, but a quantitative comparison of these approaches has not been reported. By adapting a model based on a negative binomial offspring distribution that permits a variable degree of transmission heterogeneity, we present a unified analysis of existing R0 estimation methods. Simulation studies show that the degree of transmission heterogeneity, when improperly modeled, can significantly impact the bias of R0 estimation methods designed for imperfect observation. These studies also highlight the importance of isolated cases in assessing whether an estimation technique is consistent with observed data. Analysis of data from measles outbreaks shows that likelihood scores are highest for models that allow a flexible degree of transmission heterogeneity. Aggregating intermediate sized chains often has similar performance to analyzing a complete chain size distribution. However, truncating isolated cases is beneficial only when surveillance systems clearly favor full observation of large chains but not small chains. Meanwhile, if data on the type and proportion of cases that are unobserved were known, we demonstrate that maximum likelihood inference of R0 could be adjusted accordingly. This motivates the need for future empirical and theoretical work to quantify observation error and incorporate relevant mechanisms into stuttering chain models used to estimate transmission parameters. © 2013 Elsevier B.V.",
+        "author": [
+            {
+                "family": "Blumberg",
+                "given": "S."
+            },
+            {
+                "family": "Lloyd-Smith",
+                "given": "J. O."
+            }
+        ],
+        "citation-key": "blumberg2013",
+        "container-title": "Epidemics",
+        "DOI": "10.1016/j.epidem.2013.05.002",
+        "ISSN": "17554365",
+        "issue": "3",
+        "issued": {
+            "date-parts": [
+                [
+                    2013
+                ]
+            ]
+        },
+        "page": "131–145",
+        "PMID": "24021520",
+        "publisher": "Elsevier B.V.",
+        "title": "Comparing methods for estimating R0 from the size distribution of subcritical transmission chains",
+        "type": "article-journal",
+        "URL": "http://dx.doi.org/10.1016/j.epidem.2013.05.002",
+        "volume": "5"
+    },
+    {
+        "id": "blumberg2013a",
+        "abstract": "For many infectious disease processes such as emerging zoonoses and vaccine-preventable diseases, 0<R0<1 and infections occur as self-limited stuttering transmission chains. A mechanistic understanding of transmission is essential for characterizing the risk of emerging diseases and monitoring spatio-temporal dynamics. Thus methods for inferring R0 and the degree of heterogeneity in transmission from stuttering chain data have important applications in disease surveillance and management. Previous researchers have used chain size distributions to infer R0, but estimation of the degree of individual-level variation in infectiousness (as quantified by the dispersion parameter, k) has typically required contact tracing data. Utilizing branching process theory along with a negative binomial offspring distribution, we demonstrate how maximum likelihood estimation can be applied to chain size data to infer both R0 and the dispersion parameter that characterizes heterogeneity. While the maximum likelihood value for R0 is a simple function of the average chain size, the associated confidence intervals are dependent on the inferred degree of transmission heterogeneity. As demonstrated for monkeypox data from the Democratic Republic of Congo, this impacts when a statistically significant change in R0 is detectable. In addition, by allowing for superspreading events, inference of k shifts the threshold above which a transmission chain should be considered anomalously large for a given value of R0 (thus reducing the probability of false alarms about pathogen adaptation). Our analysis of monkeypox also clarifies the various ways that imperfect observation can impact inference of transmission parameters, and highlights the need to quantitatively evaluate whether observation is likely to significantly bias results.",
+        "author": [
+            {
+                "family": "Blumberg",
+                "given": "Seth"
+            },
+            {
+                "family": "Lloyd-Smith",
+                "given": "James O."
+            }
+        ],
+        "citation-key": "blumberg2013a",
+        "container-title": "PLoS Computational Biology",
+        "DOI": "10.1371/journal.pcbi.1002993",
+        "ISSN": "15537358",
+        "issue": "5",
+        "issued": {
+            "date-parts": [
+                [
+                    2013
+                ]
+            ]
+        },
+        "page": "1–17",
+        "PMID": "23658504",
+        "title": "Inference of R0 and Transmission Heterogeneity from the Size Distribution of Stuttering Chains",
+        "type": "article-journal",
+        "volume": "9"
+    },
+    {
+        "id": "chen2022",
+        "abstract": "The generation time distribution, reflecting the time between successive infections in transmission chains, is a key epidemiological parameter for describing COVID-19 transmission dynamics. However, because exact infection times are rarely known, it is often approximated by the serial interval distribution. This approximation holds under the assumption that infectors and infectees share the same incubation period distribution, which may not always be true. We estimated incubation period and serial interval distributions using 629 transmission pairs reconstructed by investigating 2989 confirmed cases in China in January-February 2020, and developed an inferential framework to estimate the generation time distribution that accounts for variation over time due to changes in epidemiology, sampling biases and public health and social measures. We identified substantial reductions over time in the serial interval and generation time distributions. Our proposed method provides more reliable estimation of the temporal variation in the generation time distribution, improving assessment of transmission dynamics.",
+        "author": [
+            {
+                "family": "Chen",
+                "given": "Dongxuan"
+            },
+            {
+                "family": "Lau",
+                "given": "Yiu Chung"
+            },
+            {
+                "family": "Xu",
+                "given": "Xiao Ke"
+            },
+            {
+                "family": "Wang",
+                "given": "Lin"
+            },
+            {
+                "family": "Du",
+                "given": "Zhanwei"
+            },
+            {
+                "family": "Tsang",
+                "given": "Tim K."
+            },
+            {
+                "family": "Wu",
+                "given": "Peng"
+            },
+            {
+                "family": "Lau",
+                "given": "Eric H.Y."
+            },
+            {
+                "family": "Wallinga",
+                "given": "Jacco"
+            },
+            {
+                "family": "Cowling",
+                "given": "Benjamin J."
+            },
+            {
+                "family": "Ali",
+                "given": "Sheikh Taslim"
+            }
+        ],
+        "citation-key": "chen2022",
+        "container-title": "Nature Communications",
+        "DOI": "10.1038/s41467-022-35496-8",
+        "ISSN": "20411723",
+        "issue": "1",
+        "issued": {
+            "date-parts": [
+                [
+                    2022
+                ]
+            ]
+        },
+        "publisher": "Springer US",
+        "title": "Inferring time-varying generation time, serial interval, and incubation period distributions for COVID-19",
+        "type": "article-journal",
+        "volume": "13"
+    },
+    {
+        "id": "farrington1999",
+        "abstract": "We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.",
+        "author": [
+            {
+                "family": "Farrington",
+                "given": "C. P."
+            },
+            {
+                "family": "Grant",
+                "given": "A. D."
+            }
+        ],
+        "citation-key": "farrington1999",
+        "container-title": "Journal of Applied Probability",
+        "DOI": "10.1239/jap/1032374633",
+        "ISSN": "00219002",
+        "issue": "3",
+        "issued": {
+            "date-parts": [
+                [
+                    1999
+                ]
+            ]
+        },
+        "page": "771–779",
+        "title": "The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease",
+        "type": "article-journal",
+        "volume": "36"
+    },
+    {
+        "id": "farrington1999a",
+        "abstract": "We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.",
+        "author": [
+            {
+                "family": "Farrington",
+                "given": "C. P."
+            },
+            {
+                "family": "Grant",
+                "given": "A. D."
+            }
+        ],
+        "citation-key": "farrington1999a",
+        "container-title": "Journal of Applied Probability",
+        "DOI": "10.1239/jap/1032374633",
+        "ISSN": "00219002",
+        "issue": "3",
+        "issued": {
+            "date-parts": [
+                [
+                    1999
+                ]
+            ]
+        },
+        "page": "771–779",
+        "title": "The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease",
+        "type": "article-journal",
+        "volume": "36"
+    },
+    {
+        "id": "farrington2003",
+        "abstract": "Mass vaccination programmes aim to maintain the effective reproduction number R of an infection below unity. We describe methods for monitoring the value of R using surveillance data. The models are based on branching processes in which R is identified with the offspring mean. We derive unconditional likelihoods for the offspring mean using data on outbreak size and outbreak duration. We also discuss Bayesian methods, implemented by Metropolis-Hastings sampling. We investigate by simulation the validity of the models with respect to depletion of susceptibles and under-ascertainment of cases. The methods are illustrated using surveillance data on measles in the USA.",
+        "author": [
+            {
+                "family": "Farrington",
+                "given": "C. P."
+            },
+            {
+                "family": "Kanaan",
+                "given": "M. N."
+            },
+            {
+                "family": "Gay",
+                "given": "N. J."
+            }
+        ],
+        "citation-key": "farrington2003",
+        "container-title": "Biostatistics (Oxford, England)",
+        "DOI": "10.1093/biostatistics/4.2.279",
+        "ISSN": "14654644",
+        "issue": "2",
+        "issued": {
+            "date-parts": [
+                [
+                    2003
+                ]
+            ]
+        },
+        "page": "279–295",
+        "title": "Branching process models for surveillance of infectious diseases controlled by mass vaccination.",
+        "type": "article-journal",
+        "volume": "4"
+    },
+    {
+        "id": "fine2003",
+        "abstract": "The interval between successive cases of an infectious disease is determined by the time from infection to infectiousness, the duration of infectiousness, the time from infection to disease onset (incubation period), the duration of any extra-human phase of the infectious agent, and the proportion clinically affected among infected individuals. The interval is important in the interpretation of infectious disease surveillance and trend data, in the identification of outbreaks, and in the optimization of quarantine and contact tracing. This paper discusses the properties of these intervals, as measured between transmission events or between clinical onsets of successive infected individuals, noting the determinants of their ranges and frequency distributions, the circumstances under which secondary cases may arise before primaries, and under which the infection transmission interval will be different from the interval between clinical onsets of successive cases. It discusses the derivation of interval distribution statistics from descriptive data given in standard textbooks, with illustrations from published data on outbreaks, households, and epidemiologic tracing. Finally, it discusses the implications of such measures for studies of secondary attack rates, for the persistence of infection in human communities, for outbreak response, and for elimination or eradication programs.",
+        "author": [
+            {
+                "family": "Fine",
+                "given": "Paul E.M."
+            }
+        ],
+        "citation-key": "fine2003",
+        "container-title": "American Journal of Epidemiology",
+        "DOI": "10.1093/aje/kwg251",
+        "ISBN": "0002-9262 (Print) 0002-9262 (Linking)",
+        "ISSN": "00029262",
+        "issue": "11",
+        "issued": {
+            "date-parts": [
+                [
+                    2003
+                ]
+            ]
+        },
+        "page": "1039–1047",
+        "PMID": "14630599",
+        "title": "The Interval between Successive Cases of an Infectious Disease",
+        "type": "article-journal",
+        "volume": "158"
+    },
+    {
+        "id": "grassly2006",
+        "abstract": "Seasonal change in the incidence of infectious diseases is a common phenomenon in both temperate and tropical climates. However, the mechanisms responsible for seasonal disease incidence, and the epidemiological consequences of seasonality, are poorly understood with rare exception. Standard epidemiological theory and concepts such as the basic reproductive number R 0 no longer apply, and the implications for interventions that themselves may be periodic, such as pulse vaccination, have not been formally examined. This paper examines the causes and consequences of seasonality, and in so doing derives several new results concerning vaccination strategy and the interpretation of disease outbreak data. It begins with a brief review of published scientific studies in support of different causes of seasonality in infectious diseases of humans, identifying four principal mechanisms and their association with different routes of transmission. It then describes the consequences of seasonality for R 0 , disease outbreaks, endemic dynamics and persistence. Finally, a mathematical analysis of routine and pulse vaccination programmes for seasonal infections is presented. The synthesis of seasonal infectious disease epidemiology attempted by this paper highlights the need for further empirical and theoretical work. © 2006 The Royal Society.",
+        "author": [
+            {
+                "family": "Grassly",
+                "given": "Nicholas C."
+            },
+            {
+                "family": "Fraser",
+                "given": "Christophe"
+            }
+        ],
+        "citation-key": "grassly2006",
+        "container-title": "Proceedings of the Royal Society B: Biological Sciences",
+        "DOI": "10.1098/rspb.2006.3604",
+        "ISSN": "14712970",
+        "issue": "1600",
+        "issued": {
+            "date-parts": [
+                [
+                    2006
+                ]
+            ]
+        },
+        "page": "2541–2550",
+        "title": "Seasonal infectious disease epidemiology",
+        "type": "article-journal",
+        "volume": "273"
+    },
+    {
+        "id": "griffin2020",
+        "abstract": "The serial interval is the time between symptom onsets in an infector-infectee pair. The generation time, also known as the generation interval, is the time between infection events in an infector-infectee pair. The serial interval and the generation time are key parameters for assessing the dynamics of a disease. A number of scientific papers reported information pertaining to the serial interval and/or generation time for COVID-19. Objective Conduct a review of available evidence to advise on appropriate parameter values for serial interval and generation time in national COVID-19 transmission models for Ireland and on methodological issues relating to those parameters. Methods We conducted a rapid review of the literature covering the period 1 January 2020 and 21 August 2020, following predefined eligibility criteria. Forty scientific papers met our inclusion criteria and were included in the review. Results The mean of the serial interval ranged from 3.03 to 7.6 days, based on 38 estimates, and the median from 1.0 to 6.0 days (based on 15 estimates). Only three estimates were provided for the mean of the generation time. These ranged from 3.95 to 5.20 days. One estimate of 5.0 days was provided for the median of the generation time. Discussion Estimates of the serial interval and the generation time are very dependent on the specific factors that apply at the time that the data are collected, including the level of social contact. Consequently, the estimates may not be entirely relevant to other environments. Therefore, local estimates should be obtained as soon as possible. Careful consideration should be given to the methodology that is used. Real-time estimations of the serial interval/generation time, allowing for variations over time, may provide more accurate estimates of reproduction numbers than using conventionally fixed serial interval/generation time distributions.",
+        "author": [
+            {
+                "family": "Griffin",
+                "given": "John"
+            },
+            {
+                "family": "Casey",
+                "given": "Miriam"
+            },
+            {
+                "family": "Collins",
+                "given": "Áine"
+            },
+            {
+                "family": "Hunt",
+                "given": "Kevin"
+            },
+            {
+                "family": "McEvoy",
+                "given": "David"
+            },
+            {
+                "family": "Byrne",
+                "given": "Andrew"
+            },
+            {
+                "family": "McAloon",
+                "given": "Conor"
+            },
+            {
+                "family": "Barber",
+                "given": "Ann"
+            },
+            {
+                "family": "Lane",
+                "given": "Elizabeth Ann"
+            },
+            {
+                "family": "More",
+                "given": "Simon"
+            }
+        ],
+        "citation-key": "griffin2020",
+        "container-title": "BMJ Open",
+        "DOI": "10.1136/bmjopen-2020-040263",
+        "ISBN": "9789241512763",
+        "ISSN": "20446055",
+        "issue": "11",
+        "issued": {
+            "date-parts": [
+                [
+                    2020
+                ]
+            ]
+        },
+        "page": "1–9",
+        "PMID": "33234640",
+        "title": "Rapid review of available evidence on the serial interval and generation time of COVID-19",
+        "type": "article-journal",
+        "volume": "10"
+    },
+    {
+        "id": "jacob2010",
+        "abstract": "Branching processes are stochastic individual-based processes leading consequently to a bottom-up approach. In addition, since the state variables are random integer variables (representing population sizes), the extinction occurs at random finite time on the extinction set, thus leading to fine and realistic predictions. Starting from the simplest and well-known single-type Bienaymé-Galton-Watson branching process that was used by several authors for approximating the beginning of an epidemic, we then present a general branching model with age and population dependent individual transitions. However contrary to the classical Bienaymé-Galton-Watson or asymptotically Bienaymé-Galton-Watson setting, where the asymptotic behavior of the process, as time tends to infinity, is well understood, the asymptotic behavior of this general process is a new question. Here we give some solutions for dealing with this problem depending on whether the initial population size is large or small, and whether the disease is rare or non-rare when the initial population size is large.",
+        "author": [
+            {
+                "family": "Jacob",
+                "given": "Christine"
+            }
+        ],
+        "citation-key": "jacob2010",
+        "container-title": "International Journal of Environmental Research and Public Health",
+        "DOI": "10.3390/ijerph7031204",
+        "ISSN": "16604601",
+        "issue": "3",
+        "issued": {
+            "date-parts": [
+                [
+                    2010
+                ]
+            ]
+        },
+        "page": "1186–1204",
+        "title": "Branching processes: Their role in epidemiology",
+        "type": "article-journal",
+        "volume": "7"
+    },
+    {
+        "id": "lehtinen2021",
+        "abstract": "The timing of transmission plays a key role in the dynamics and controllability of an epidemic. However, observing generation times - the time interval between the infection of an infector and an infectee in a transmission pair - requires data on infection times, which are generally unknown. The timing of symptom onset is more easily observed; generation times are therefore often estimated based on serial intervals - the time interval between symptom onset of an infector and an infectee. This estimation follows one of two approaches: (i) approximating the generation time distribution by the serial interval distribution or (ii) deriving the generation time distribution from the serial interval and incubation period - the time interval between infection and symptom onset in a single individual - distributions. These two approaches make different - and not always explicitly stated - assumptions about the relationship between infectiousness and symptoms, resulting in different generation time distributions with the same mean but unequal variances. Here, we clarify the assumptions that each approach makes and show that neither set of assumptions is plausible for most pathogens. However, the variances of the generation time distribution derived under each assumption can reasonably be considered as upper (approximation with serial interval) and lower (derivation from serial interval) bounds. Thus, we suggest a pragmatic solution is to use both approaches and treat these as edge cases in downstream analysis. We discuss the impact of the variance of the generation time distribution on the controllability of an epidemic through strategies based on contact tracing, and we show that underestimating this variance is likely to overestimate controllability.",
+        "author": [
+            {
+                "family": "Lehtinen",
+                "given": "Sonja"
+            },
+            {
+                "family": "Ashcroft",
+                "given": "Peter"
+            },
+            {
+                "family": "Bonhoeffer",
+                "given": "Sebastian"
+            }
+        ],
+        "citation-key": "lehtinen2021",
+        "container-title": "Journal of the Royal Society Interface",
+        "DOI": "10.1098/rsif.2020.0756",
+        "ISSN": "17425662",
+        "issue": "174",
+        "issued": {
+            "date-parts": [
+                [
+                    2021
+                ]
+            ]
+        },
+        "PMID": "33402022",
+        "title": "On the relationship between serial interval, infectiousness profile and generation time: On the relationship between serial interval, infectiousness profile and generation time",
+        "type": "article-journal",
+        "volume": "18"
+    },
+    {
+        "id": "limpert2001",
+        "abstract": "On the charms of statistics, and how mechanical models resembling gambling machines offer a link to a handy way to characterize log-normal distributions, which can provide deeper insight into variability and probability - Normal or log-normal: That is the question.",
+        "author": [
+            {
+                "family": "Limpert",
+                "given": "Eckhard"
+            },
+            {
+                "family": "Stahel",
+                "given": "Werner A."
+            },
+            {
+                "family": "Abbt",
+                "given": "Markus"
+            }
+        ],
+        "citation-key": "limpert2001",
+        "container-title": "BioScience",
+        "DOI": "10.1641/0006-3568(2001)051[0341:LNDATS]2.0.CO;2",
+        "ISSN": "00063568",
+        "issue": "5",
+        "issued": {
+            "date-parts": [
+                [
+                    2001
+                ]
+            ]
+        },
+        "page": "341–352",
+        "title": "Log-normal distributions across the sciences: Keys and clues",
+        "type": "article-journal",
+        "volume": "51"
+    },
+    {
+        "id": "lloyd-smith2005",
+        "abstract": "Population-level analyses often use average quantities to describe heterogeneous systems, particularly when variation does not arise from identifiable groups. A prominent example, central to our current understanding of epidemic spread, is the basic reproductive number, R0, which is defined as the mean number of infections caused by an infected individual in a susceptible population. Population estimates of R0 can obscure considerable individual variation in infectiousness, as highlighted during the global emergence of severe acute respiratory syndrome (SARS) by numerous 'superspreading events' in which certain individuals infected unusually large numbers of secondary cases. For diseases transmitted by non-sexual direct contacts, such as SARS or smallpox, individual variation is difficult to measure empirically, and thus its importance for outbreak dynamics has been unclear. Here we present an integrated theoretical and statistical analysis of the influence of individual variation in infectiousness on disease emergence. Using contact tracing data from eight directly transmitted diseases, we show that the distribution of individual infectiousness around R0 is often highly skewed. Model predictions accounting for this variation differ sharply from average-based approaches, with disease extinction more likely and outbreaks rarer but more explosive. Using these models, we explore implications for outbreak control, showing that individual-specific control measures outperform population-wide measures. Moreover, the dramatic improvements achieved through targeted control policies emphasize the need to identify predictive correlates of higher infectiousness. Our findings indicate that superspreading is a normal feature of disease spread, and to frame ongoing discussion we propose a rigorous definition for superspreading events and a method to predict their frequency. © 2005 Nature Publishing Group.",
+        "author": [
+            {
+                "family": "Lloyd-Smith",
+                "given": "J. O."
+            },
+            {
+                "family": "Schreiber",
+                "given": "S. J."
+            },
+            {
+                "family": "Kopp",
+                "given": "P. E."
+            },
+            {
+                "family": "Getz",
+                "given": "W. M."
+            }
+        ],
+        "citation-key": "lloyd-smith2005",
+        "container-title": "Nature",
+        "DOI": "10.1038/nature04153",
+        "ISSN": "14764687",
+        "issue": "7066",
+        "issued": {
+            "date-parts": [
+                [
+                    2005
+                ]
+            ]
+        },
+        "page": "355–359",
+        "PMID": "16292310",
+        "title": "Superspreading and the effect of individual variation on disease emergence",
+        "type": "article-journal",
+        "volume": "438"
+    },
+    {
+        "id": "marivate2020",
+        "author": [
+            {
+                "family": "Marivate",
+                "given": "Vukosi"
+            },
+            {
+                "family": "Combrink",
+                "given": "Herkulaas MvE"
+            }
+        ],
+        "citation-key": "marivate2020",
+        "container-title": "arXiv preprint arXiv:2004.04813",
+        "issued": {
+            "date-parts": [
+                [
+                    2020
+                ]
+            ]
+        },
+        "title": "Use of available data to inform the COVID-19 outbreak in South Africa: a case study",
+        "type": "article-journal"
+    },
+    {
+        "id": "nishiura2007",
+        "abstract": "The incubation period of infectious diseases, the time from infection with a microorganism to onset of disease, is directly relevant to prevention and control. Since explicit models of the incubation period enhance our understanding of the spread of disease, previous classic studies were revisited, focusing on the modeling methods employed and paying particular attention to relatively unknown historical efforts. The earliest study on the incubation period of pandemic influenza was published in 1919, providing estimates of the incubation period of Spanish flu using the daily incidence on ships departing from several ports in Australia. Although the study explicitly dealt with an unknown time of exposure, the assumed periods of exposure, which had an equal probability of infection, were too long, and thus, likely resulted in slight underestimates of the incubation period. After the suggestion that the incubation period follows lognormal distribution, Japanese epidemiologists extended this assumption to estimates of the time of exposure during a point source outbreak. Although the reason why the incubation period of acute infectious diseases tends to reveal a right-skewed distribution has been explored several times, the validity of the lognormal assumption is yet to be fully clarified. At present, various different distributions are assumed, and the lack of validity in assuming lognormal distribution is particularly apparent in the case of slowly progressing diseases. The present paper indicates that (1) analysis using well-defined short periods of exposure with appropriate statistical methods is critical when the exact time of exposure is unknown, and (2) when assuming a specific distribution for the incubation period, comparisons using different distributions are needed in addition to estimations using different datasets, analyses of the determinants of incubation period, and an understanding of the underlying disease mechanisms. © 2007 Nishiura; licensee BioMed Central Ltd.",
+        "author": [
+            {
+                "family": "Nishiura",
+                "given": "Hiroshi"
+            }
+        ],
+        "citation-key": "nishiura2007",
+        "container-title": "Emerging Themes in Epidemiology",
+        "DOI": "10.1186/1742-7622-4-2",
+        "ISSN": "17427622",
+        "issued": {
+            "date-parts": [
+                [
+                    2007
+                ]
+            ]
+        },
+        "page": "1–12",
+        "title": "Early efforts in modeling the incubation period of infectious diseases with an acute course of illness",
+        "type": "article-journal",
+        "volume": "4"
+    },
+    {
+        "id": "nishiura2012",
+        "abstract": "Use of the final size distribution of minor outbreaks for the estimation of the reproduction numbers of supercritical epidemic processes has yet to be considered. We used a branching process model to derive the final size distribution of minor outbreaks, assuming a reproduction number above unity, and applying the method to final size data for pneumonic plague. Pneumonic plague is a rare disease with only one documented major epidemic in a spatially limited setting. Because the final size distribution of a minor outbreak needs to be normalized by the probability of extinction, we assume that the dispersion parameter (k) of the negative-binomial offspring distribution is known, and examine the sensitivity of the reproduction number to variation in dispersion. Assuming a geometric offspring distribution with k=1, the reproduction number was estimated at 1.16 (95% confidence interval: 0.97-1.38). When less dispersed with k=2, the maximum likelihood estimate of the reproduction number was 1.14. These estimates agreed with those published from transmission network analysis, indicating that the human-to-human transmission potential of the pneumonic plague is not very high. Given only minor outbreaks, transmission potential is not sufficiently assessed by directly counting the number of offspring. Since the absence of a major epidemic does not guarantee a subcritical process, the proposed method allows us to conservatively regard epidemic data from minor outbreaks as supercritical, and yield estimates of threshold values above unity. © 2011.",
+        "author": [
+            {
+                "family": "Nishiura",
+                "given": "Hiroshi"
+            },
+            {
+                "family": "Yan",
+                "given": "Ping"
+            },
+            {
+                "family": "Sleeman",
+                "given": "Candace K."
+            },
+            {
+                "family": "Mode",
+                "given": "Charles J."
+            }
+        ],
+        "citation-key": "nishiura2012",
+        "container-title": "Journal of Theoretical Biology",
+        "DOI": "10.1016/j.jtbi.2011.10.039",
+        "ISSN": "00225193",
+        "issued": {
+            "date-parts": [
+                [
+                    2012
+                ]
+            ]
+        },
+        "page": "48–55",
+        "PMID": "22079419",
+        "publisher": "Elsevier",
+        "title": "Estimating the transmission potential of supercritical processes based on the final size distribution of minor outbreaks",
+        "type": "article-journal",
+        "URL": "http://dx.doi.org/10.1016/j.jtbi.2011.10.039",
+        "volume": "294"
+    },
+    {
+        "id": "pearson2020",
+        "abstract": "For 45 African countries/territories already reporting COVID-19 cases before 23 March 2020, we estimate the dates of reporting 1,000 and 10,000 cases. Assuming early epidemic trends without interventions, all 45 were likely to exceed 1,000 confirmed cases by the end of April 2020, with most exceeding 10,000 a few weeks later.",
+        "author": [
+            {
+                "family": "Pearson",
+                "given": "Carl A.B."
+            },
+            {
+                "family": "Schalkwyk",
+                "given": "Cari",
+                "non-dropping-particle": "van"
+            },
+            {
+                "family": "Foss",
+                "given": "Anna M."
+            },
+            {
+                "family": "O'Reilly",
+                "given": "Kathleen M."
+            },
+            {
+                "family": "Pulliam",
+                "given": "Juliet R.C."
+            }
+        ],
+        "citation-key": "pearson2020",
+        "container-title": "Eurosurveillance",
+        "DOI": "10.2807/1560-7917.ES.2020.25.18.2000543",
+        "ISSN": "15607917",
+        "issue": "18",
+        "issued": {
+            "date-parts": [
+                [
+                    2020
+                ]
+            ]
+        },
+        "page": "1–6",
+        "PMID": "32400361",
+        "publisher": "European Centre for Disease Prevention and Control (ECDC)",
+        "title": "Projected early spread of COVID-19 in Africa through 1 June 2020",
+        "type": "article-journal",
+        "URL": "http://dx.doi.org/10.2807/1560-7917.ES.2020.25.18.2000543",
+        "volume": "25"
+    },
+    {
+        "id": "wang2020",
+        "abstract": "Coronavirus disease 2019 (COVID-19) was first identified in late 2019 in Wuhan, Hubei Province, China and spread globally in months, sparking worldwide concern. However, it is unclear whether super-spreading events occurred during the early outbreak phase, as has been observed for other emerging viruses. Here, we analyse 208 publicly available SARS-CoV-2 genome sequences collected during the early outbreak phase. We combine phylogenetic analysis with Bayesian inference under an epidemiological model to trace person-to-person transmission. The dispersion parameter of the offspring distribution in the inferred transmission chain was estimated to be 0.23 (95% CI: 0.13–0.38), indicating there are individuals who directly infected a disproportionately large number of people. Our results showed that super-spreading events played an important role in the early stage of the COVID-19 outbreak.",
+        "author": [
+            {
+                "family": "Wang",
+                "given": "Liang"
+            },
+            {
+                "family": "Didelot",
+                "given": "Xavier"
+            },
+            {
+                "family": "Yang",
+                "given": "Jing"
+            },
+            {
+                "family": "Wong",
+                "given": "Gary"
+            },
+            {
+                "family": "Shi",
+                "given": "Yi"
+            },
+            {
+                "family": "Liu",
+                "given": "Wenjun"
+            },
+            {
+                "family": "Gao",
+                "given": "George F."
+            },
+            {
+                "family": "Bi",
+                "given": "Yuhai"
+            }
+        ],
+        "citation-key": "wang2020",
+        "container-title": "Nature Communications",
+        "DOI": "10.1038/s41467-020-18836-4",
+        "ISSN": "20411723",
+        "issue": "1",
+        "issued": {
+            "date-parts": [
+                [
+                    2020
+                ]
+            ]
+        },
+        "page": "1–6",
+        "PMID": "33024095",
+        "publisher": "Springer US",
+        "title": "Inference of person-to-person transmission of COVID-19 reveals hidden super-spreading events during the early outbreak phase",
+        "type": "article-journal",
+        "URL": "http://dx.doi.org/10.1038/s41467-020-18836-4",
+        "volume": "11"
+    },
+    {
+        "id": "yadav2021",
+        "abstract": "In this review, we have discussed the different statistical modeling and prediction techniques for various infectious diseases including the recent pandemic of COVID-19. The distribution fitting, time series modeling along with predictive monitoring approaches, and epidemiological modeling are illustrated. When the epidemiology data is sufficient to fit with the required sample size, the normal distribution in general or other theoretical distributions are fitted and the best-fitted distribution is chosen for the prediction of the spread of the disease. The infectious diseases develop over time and we have data on the single variable that is the number of infections that happened, therefore, time series models are fitted and the prediction is done based on the best-fitted model. Monitoring approaches may also be applied to time series models which could estimate the parameters more precisely. In epidemiological modeling, more biological parameters are incorporated in the models and the forecasting of the disease spread is carried out. We came up with, how to improve the existing modeling methods, the use of fuzzy variables, and detection of fraud in the available data. Ultimately, we have reviewed the results of recent statistical modeling efforts to predict the course of COVID-19 spread.",
+        "author": [
+            {
+                "family": "Yadav",
+                "given": "Subhash Kumar"
+            },
+            {
+                "family": "Akhter",
+                "given": "Yusuf"
+            }
+        ],
+        "citation-key": "yadav2021",
+        "container-title": "Frontiers in Public Health",
+        "DOI": "10.3389/fpubh.2021.645405",
+        "ISSN": "22962565",
+        "issue": "June",
+        "issued": {
+            "date-parts": [
+                [
+                    2021
+                ]
+            ]
+        },
+        "page": "1–27",
+        "PMID": "34222166",
+        "title": "Statistical Modeling for the Prediction of Infectious Disease Dissemination With Special Reference to COVID-19 Spread",
+        "type": "article-journal",
+        "volume": "9"
+    }
+]
\ No newline at end of file

From 38d05ffa9a55354b54296f9c445ac02cf22bad12 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 22 May 2023 12:31:14 +0100
Subject: [PATCH 229/828] Changed extension from .bib to json

---
 vignettes/projecting_incidence.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 145babbd..26fb725f 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -7,7 +7,7 @@ output:
     code_folding: show
 pkgdown:
   as_is: true
-bibliography: references.bib
+bibliography: references.json
 link-citations: true
 vignette: >
   %\VignetteIndexEntry{Projecting infectious disease incidence: a COVID-19 example}

From cace31fba14deca3b96f3e052d7f581e8c3fdfd8 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 22 May 2023 12:31:34 +0100
Subject: [PATCH 230/828] Deleted .bib file

---
 vignettes/references.bib | 298 ---------------------------------------
 1 file changed, 298 deletions(-)
 delete mode 100644 vignettes/references.bib

diff --git a/vignettes/references.bib b/vignettes/references.bib
deleted file mode 100644
index e2fcc49f..00000000
--- a/vignettes/references.bib
+++ /dev/null
@@ -1,298 +0,0 @@
-@article{abbott2020,
-  title     = {The transmissibility of novel Coronavirus in the early stages of the 2019-20 outbreak in Wuhan: Exploring initial point-source exposure sizes and durations using scenario analysis},
-  author    = {Abbott, Sam and Hellewell, Joel and Munday, James and Funk, Sebastian and CMMID nCoV working group and others},
-  journal   = {Wellcome open research},
-  volume    = {5},
-  year      = {2020},
-  publisher = {The Wellcome Trust}
-}
-@article{Alene2021,
-  abstract  = {Background: Understanding the epidemiological parameters that determine the transmission dynamics of COVID-19 is essential for public health intervention. Globally, a number of studies were conducted to estimate the average serial interval and incubation period of COVID-19. Combining findings of existing studies that estimate the average serial interval and incubation period of COVID-19 significantly improves the quality of evidence. Hence, this study aimed to determine the overall average serial interval and incubation period of COVID-19. Methods: We followed the PRISMA checklist to present this study. A comprehensive search strategy was carried out from international electronic databases (Google Scholar, PubMed, Science Direct, Web of Science, CINAHL, and Cochrane Library) by two experienced reviewers (MAA and DBK) authors between the 1st of June and the 31st of July 2020. All observational studies either reporting the serial interval or incubation period in persons diagnosed with COVID-19 were included in this study. Heterogeneity across studies was assessed using the I2 and Higgins test. The NOS adapted for cross-sectional studies was used to evaluate the quality of studies. A random effect Meta-analysis was employed to determine the pooled estimate with 95% (CI). Microsoft Excel was used for data extraction and R software was used for analysis. Results: We combined a total of 23 studies to estimate the overall mean serial interval of COVID-19. The mean serial interval of COVID-19 ranged from 4. 2 to 7.5 days. Our meta-analysis showed that the weighted pooled mean serial interval of COVID-19 was 5.2 (95%CI: 4.9–5.5) days. Additionally, to pool the mean incubation period of COVID-19, we included 14 articles. The mean incubation period of COVID-19 also ranged from 4.8 to 9 days. Accordingly, the weighted pooled mean incubation period of COVID-19 was 6.5 (95%CI: 5.9–7.1) days. Conclusions: This systematic review and meta-analysis showed that the weighted pooled mean serial interval and incubation period of COVID-19 were 5.2, and 6.5 days, respectively. In this study, the average serial interval of COVID-19 is shorter than the average incubation period, which suggests that substantial numbers of COVID-19 cases will be attributed to presymptomatic transmission.},
-  author    = {Alene, Muluneh and Yismaw, Leltework and Assemie, Moges Agazhe and Ketema, Daniel Bekele and Gietaneh, Wodaje and Birhan, Tilahun Yemanu},
-  doi       = {10.1186/s12879-021-05950-x},
-  issn      = {14712334},
-  journal   = {BMC Infectious Diseases},
-  keywords  = {COVID-19,Incubation period,Meta-analysis,Serial interval},
-  number    = {1},
-  pages     = {1--9},
-  pmid      = {33706702},
-  publisher = {BMC Infectious Diseases},
-  title     = {{Serial interval and incubation period of COVID-19: a systematic review and meta-analysis}},
-  volume    = {21},
-  year      = {2021}
-}
-@article{Allen2012,
-  abstract = {The basic reproduction number, ℛ(0), one of the most well-known thresholds in deterministic epidemic theory, predicts a disease outbreak if ℛ(0)>1. In stochastic epidemic theory, there are also thresholds that predict a major outbreak. In the case of a single infectious group, if ℛ(0)>1 and i infectious individuals are introduced into a susceptible population, then the probability of a major outbreak is approximately 1-(1/ℛ(0))( i ). With multiple infectious groups from which the disease could emerge, this result no longer holds. Stochastic thresholds for multiple groups depend on the number of individuals within each group, i ( j ), j=1, {\ldots}, n, and on the probability of disease extinction for each group, q ( j ). It follows from multitype branching processes that the probability of a major outbreak is approximately [Formula: see text]. In this investigation, we summarize some of the deterministic and stochastic threshold theory, illustrate how to calculate the stochastic thresholds, and derive some new relationships between the deterministic and stochastic thresholds.},
-  author   = {Allen, Linda J.S. and Lahodny, Glenn E.},
-  doi      = {10.1080/17513758.2012.665502},
-  issn     = {17513758},
-  journal  = {Journal of Biological Dynamics},
-  keywords = {multitype branching processes,reproduction numbers},
-  number   = {2},
-  pages    = {590--611},
-  title    = {{Extinction thresholds in deterministic and stochastic epidemic models}},
-  volume   = {6},
-  year     = {2012}
-}
-@article{Blumberg2013,
-  abstract  = {Many diseases exhibit subcritical transmission (i.e. 0<R0<1) so that infections occur as self-limited 'stuttering chains'. Given an ensemble of stuttering chains, information about the number of cases in each chain can be used to infer R0, which is of crucial importance for monitoring the risk that a disease will emerge to establish endemic circulation. However, the challenge of imperfect case detection has led authors to adopt a variety of work-around measures when inferring R0, such as discarding data on isolated cases or aggregating intermediate-sized chains together. Each of these methods has the potential to introduce bias, but a quantitative comparison of these approaches has not been reported. By adapting a model based on a negative binomial offspring distribution that permits a variable degree of transmission heterogeneity, we present a unified analysis of existing R0 estimation methods. Simulation studies show that the degree of transmission heterogeneity, when improperly modeled, can significantly impact the bias of R0 estimation methods designed for imperfect observation. These studies also highlight the importance of isolated cases in assessing whether an estimation technique is consistent with observed data. Analysis of data from measles outbreaks shows that likelihood scores are highest for models that allow a flexible degree of transmission heterogeneity. Aggregating intermediate sized chains often has similar performance to analyzing a complete chain size distribution. However, truncating isolated cases is beneficial only when surveillance systems clearly favor full observation of large chains but not small chains. Meanwhile, if data on the type and proportion of cases that are unobserved were known, we demonstrate that maximum likelihood inference of R0 could be adjusted accordingly. This motivates the need for future empirical and theoretical work to quantify observation error and incorporate relevant mechanisms into stuttering chain models used to estimate transmission parameters. {\textcopyright} 2013 Elsevier B.V.},
-  author    = {Blumberg, S. and Lloyd-Smith, J. O.},
-  doi       = {10.1016/j.epidem.2013.05.002},
-  issn      = {17554365},
-  journal   = {Epidemics},
-  keywords  = {Basic reproductive number,Imperfect observation,Measles,Stuttering chain,Transmission heterogeneity},
-  number    = {3},
-  pages     = {131--145},
-  pmid      = {24021520},
-  publisher = {Elsevier B.V.},
-  title     = {{Comparing methods for estimating R0 from the size distribution of subcritical transmission chains}},
-  url       = {http://dx.doi.org/10.1016/j.epidem.2013.05.002},
-  volume    = {5},
-  year      = {2013}
-}
-@article{Blumberg2013a,
-  abstract = {For many infectious disease processes such as emerging zoonoses and vaccine-preventable diseases, 0<R0<1 and infections occur as self-limited stuttering transmission chains. A mechanistic understanding of transmission is essential for characterizing the risk of emerging diseases and monitoring spatio-temporal dynamics. Thus methods for inferring R0 and the degree of heterogeneity in transmission from stuttering chain data have important applications in disease surveillance and management. Previous researchers have used chain size distributions to infer R0, but estimation of the degree of individual-level variation in infectiousness (as quantified by the dispersion parameter, k) has typically required contact tracing data. Utilizing branching process theory along with a negative binomial offspring distribution, we demonstrate how maximum likelihood estimation can be applied to chain size data to infer both R0 and the dispersion parameter that characterizes heterogeneity. While the maximum likelihood value for R0 is a simple function of the average chain size, the associated confidence intervals are dependent on the inferred degree of transmission heterogeneity. As demonstrated for monkeypox data from the Democratic Republic of Congo, this impacts when a statistically significant change in R0 is detectable. In addition, by allowing for superspreading events, inference of k shifts the threshold above which a transmission chain should be considered anomalously large for a given value of R0 (thus reducing the probability of false alarms about pathogen adaptation). Our analysis of monkeypox also clarifies the various ways that imperfect observation can impact inference of transmission parameters, and highlights the need to quantitatively evaluate whether observation is likely to significantly bias results.},
-  author   = {Blumberg, Seth and Lloyd-Smith, James O.},
-  doi      = {10.1371/journal.pcbi.1002993},
-  issn     = {15537358},
-  journal  = {PLoS Computational Biology},
-  number   = {5},
-  pages    = {1--17},
-  pmid     = {23658504},
-  title    = {{Inference of R0 and Transmission Heterogeneity from the Size Distribution of Stuttering Chains}},
-  volume   = {9},
-  year     = {2013}
-}
-@article{Chen2022,
-  abstract  = {The generation time distribution, reflecting the time between successive infections in transmission chains, is a key epidemiological parameter for describing COVID-19 transmission dynamics. However, because exact infection times are rarely known, it is often approximated by the serial interval distribution. This approximation holds under the assumption that infectors and infectees share the same incubation period distribution, which may not always be true. We estimated incubation period and serial interval distributions using 629 transmission pairs reconstructed by investigating 2989 confirmed cases in China in January-February 2020, and developed an inferential framework to estimate the generation time distribution that accounts for variation over time due to changes in epidemiology, sampling biases and public health and social measures. We identified substantial reductions over time in the serial interval and generation time distributions. Our proposed method provides more reliable estimation of the temporal variation in the generation time distribution, improving assessment of transmission dynamics.},
-  author    = {Chen, Dongxuan and Lau, Yiu Chung and Xu, Xiao Ke and Wang, Lin and Du, Zhanwei and Tsang, Tim K. and Wu, Peng and Lau, Eric H.Y. and Wallinga, Jacco and Cowling, Benjamin J. and Ali, Sheikh Taslim},
-  doi       = {10.1038/s41467-022-35496-8},
-  issn      = {20411723},
-  journal   = {Nature Communications},
-  number    = {1},
-  publisher = {Springer US},
-  title     = {{Inferring time-varying generation time, serial interval, and incubation period distributions for COVID-19}},
-  volume    = {13},
-  year      = {2022}
-}
-
-@article{Farrington1999,
-  abstract = {We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.},
-  author   = {Farrington, C. P. and Grant, A. D.},
-  doi      = {10.1239/jap/1032374633},
-  issn     = {00219002},
-  journal  = {Journal of Applied Probability},
-  keywords = {Branching process,Epidemic model,Extinction,Generation distribution,Maximum likelihood estimation,Power series family},
-  number   = {3},
-  pages    = {771--779},
-  title    = {{The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease}},
-  volume   = {36},
-  year     = {1999}
-}
-
-@article{Farrington1999a,
-  abstract = {We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.},
-  author   = {Farrington, C. P. and Grant, A. D.},
-  doi      = {10.1239/jap/1032374633},
-  issn     = {00219002},
-  journal  = {Journal of Applied Probability},
-  keywords = {Branching process,Epidemic model,Extinction,Generation distribution,Maximum likelihood estimation,Power series family},
-  number   = {3},
-  pages    = {771--779},
-  title    = {{The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease}},
-  volume   = {36},
-  year     = {1999}
-}
-@article{Farrington2003,
-  abstract = {Mass vaccination programmes aim to maintain the effective reproduction number R of an infection below unity. We describe methods for monitoring the value of R using surveillance data. The models are based on branching processes in which R is identified with the offspring mean. We derive unconditional likelihoods for the offspring mean using data on outbreak size and outbreak duration. We also discuss Bayesian methods, implemented by Metropolis-Hastings sampling. We investigate by simulation the validity of the models with respect to depletion of susceptibles and under-ascertainment of cases. The methods are illustrated using surveillance data on measles in the USA.},
-  author   = {Farrington, C. P. and Kanaan, M. N. and Gay, N. J.},
-  doi      = {10.1093/biostatistics/4.2.279},
-  issn     = {14654644},
-  journal  = {Biostatistics (Oxford, England)},
-  number   = {2},
-  pages    = {279--295},
-  title    = {{Branching process models for surveillance of infectious diseases controlled by mass vaccination.}},
-  volume   = {4},
-  year     = {2003}
-}
-@article{Fine2003,
-  abstract = {The interval between successive cases of an infectious disease is determined by the time from infection to infectiousness, the duration of infectiousness, the time from infection to disease onset (incubation period), the duration of any extra-human phase of the infectious agent, and the proportion clinically affected among infected individuals. The interval is important in the interpretation of infectious disease surveillance and trend data, in the identification of outbreaks, and in the optimization of quarantine and contact tracing. This paper discusses the properties of these intervals, as measured between transmission events or between clinical onsets of successive infected individuals, noting the determinants of their ranges and frequency distributions, the circumstances under which secondary cases may arise before primaries, and under which the infection transmission interval will be different from the interval between clinical onsets of successive cases. It discusses the derivation of interval distribution statistics from descriptive data given in standard textbooks, with illustrations from published data on outbreaks, households, and epidemiologic tracing. Finally, it discusses the implications of such measures for studies of secondary attack rates, for the persistence of infection in human communities, for outbreak response, and for elimination or eradication programs.},
-  author   = {Fine, Paul E.M.},
-  doi      = {10.1093/aje/kwg251},
-  isbn     = {0002-9262 (Print) 0002-9262 (Linking)},
-  issn     = {00029262},
-  journal  = {American Journal of Epidemiology},
-  keywords = {Communicable diseases,Disease outbreaks},
-  number   = {11},
-  pages    = {1039--1047},
-  pmid     = {14630599},
-  title    = {{The Interval between Successive Cases of an Infectious Disease}},
-  volume   = {158},
-  year     = {2003}
-}
-@article{Grassly2006a,
-  abstract = {Seasonal change in the incidence of infectious diseases is a common phenomenon in both temperate and tropical climates. However, the mechanisms responsible for seasonal disease incidence, and the epidemiological consequences of seasonality, are poorly understood with rare exception. Standard epidemiological theory and concepts such as the basic reproductive number R  0  no longer apply, and the implications for interventions that themselves may be periodic, such as pulse vaccination, have not been formally examined. This paper examines the causes and consequences of seasonality, and in so doing derives several new results concerning vaccination strategy and the interpretation of disease outbreak data. It begins with a brief review of published scientific studies in support of different causes of seasonality in infectious diseases of humans, identifying four principal mechanisms and their association with different routes of transmission. It then describes the consequences of seasonality for R 0 , disease outbreaks, endemic dynamics and persistence. Finally, a mathematical analysis of routine and pulse vaccination programmes for seasonal infections is presented. The synthesis of seasonal infectious disease epidemiology attempted by this paper highlights the need for further empirical and theoretical work. {\textcopyright} 2006 The Royal Society.},
-  author   = {Grassly, Nicholas C. and Fraser, Christophe},
-  doi      = {10.1098/rspb.2006.3604},
-  issn     = {14712970},
-  journal  = {Proceedings of the Royal Society B: Biological Sciences},
-  keywords = {Communicable diseases,Disease outbreaks,Epidemiology,Seasons,Vaccination},
-  number   = {1600},
-  pages    = {2541--2550},
-  title    = {{Seasonal infectious disease epidemiology}},
-  volume   = {273},
-  year     = {2006}
-}
-@article{Griffin2020,
-  abstract = {The serial interval is the time between symptom onsets in an infector-infectee pair. The generation time, also known as the generation interval, is the time between infection events in an infector-infectee pair. The serial interval and the generation time are key parameters for assessing the dynamics of a disease. A number of scientific papers reported information pertaining to the serial interval and/or generation time for COVID-19. Objective Conduct a review of available evidence to advise on appropriate parameter values for serial interval and generation time in national COVID-19 transmission models for Ireland and on methodological issues relating to those parameters. Methods We conducted a rapid review of the literature covering the period 1 January 2020 and 21 August 2020, following predefined eligibility criteria. Forty scientific papers met our inclusion criteria and were included in the review. Results The mean of the serial interval ranged from 3.03 to 7.6 days, based on 38 estimates, and the median from 1.0 to 6.0 days (based on 15 estimates). Only three estimates were provided for the mean of the generation time. These ranged from 3.95 to 5.20 days. One estimate of 5.0 days was provided for the median of the generation time. Discussion Estimates of the serial interval and the generation time are very dependent on the specific factors that apply at the time that the data are collected, including the level of social contact. Consequently, the estimates may not be entirely relevant to other environments. Therefore, local estimates should be obtained as soon as possible. Careful consideration should be given to the methodology that is used. Real-time estimations of the serial interval/generation time, allowing for variations over time, may provide more accurate estimates of reproduction numbers than using conventionally fixed serial interval/generation time distributions.},
-  author   = {Griffin, John and Casey, Miriam and Collins, {\'{A}}ine and Hunt, Kevin and McEvoy, David and Byrne, Andrew and McAloon, Conor and Barber, Ann and Lane, Elizabeth Ann and More, Simon},
-  doi      = {10.1136/bmjopen-2020-040263},
-  isbn     = {9789241512763},
-  issn     = {20446055},
-  journal  = {BMJ Open},
-  keywords = {COVID-19,epidemiology,public health,virology},
-  number   = {11},
-  pages    = {1--9},
-  pmid     = {33234640},
-  title    = {{Rapid review of available evidence on the serial interval and generation time of COVID-19}},
-  volume   = {10},
-  year     = {2020}
-}
-@article{Jacob2010,
-  abstract = {Branching processes are stochastic individual-based processes leading consequently to a bottom-up approach. In addition, since the state variables are random integer variables (representing population sizes), the extinction occurs at random finite time on the extinction set, thus leading to fine and realistic predictions. Starting from the simplest and well-known single-type Bienaym{\'{e}}-Galton-Watson branching process that was used by several authors for approximating the beginning of an epidemic, we then present a general branching model with age and population dependent individual transitions. However contrary to the classical Bienaym{\'{e}}-Galton-Watson or asymptotically Bienaym{\'{e}}-Galton-Watson setting, where the asymptotic behavior of the process, as time tends to infinity, is well understood, the asymptotic behavior of this general process is a new question. Here we give some solutions for dealing with this problem depending on whether the initial population size is large or small, and whether the disease is rare or non-rare when the initial population size is large.},
-  author   = {Jacob, Christine},
-  doi      = {10.3390/ijerph7031204},
-  issn     = {16604601},
-  journal  = {International Journal of Environmental Research and Public Health},
-  keywords = {Age-dependence,Branching process,Epidemic size,Extinction time,Population-dependence},
-  number   = {3},
-  pages    = {1186--1204},
-  title    = {{Branching processes: Their role in epidemiology}},
-  volume   = {7},
-  year     = {2010}
-}
-@article{Lehtinen2021,
-  abstract = {The timing of transmission plays a key role in the dynamics and controllability of an epidemic. However, observing generation times - the time interval between the infection of an infector and an infectee in a transmission pair - requires data on infection times, which are generally unknown. The timing of symptom onset is more easily observed; generation times are therefore often estimated based on serial intervals - the time interval between symptom onset of an infector and an infectee. This estimation follows one of two approaches: (i) approximating the generation time distribution by the serial interval distribution or (ii) deriving the generation time distribution from the serial interval and incubation period - the time interval between infection and symptom onset in a single individual - distributions. These two approaches make different - and not always explicitly stated - assumptions about the relationship between infectiousness and symptoms, resulting in different generation time distributions with the same mean but unequal variances. Here, we clarify the assumptions that each approach makes and show that neither set of assumptions is plausible for most pathogens. However, the variances of the generation time distribution derived under each assumption can reasonably be considered as upper (approximation with serial interval) and lower (derivation from serial interval) bounds. Thus, we suggest a pragmatic solution is to use both approaches and treat these as edge cases in downstream analysis. We discuss the impact of the variance of the generation time distribution on the controllability of an epidemic through strategies based on contact tracing, and we show that underestimating this variance is likely to overestimate controllability.},
-  author   = {Lehtinen, Sonja and Ashcroft, Peter and Bonhoeffer, Sebastian},
-  doi      = {10.1098/rsif.2020.0756},
-  issn     = {17425662},
-  journal  = {Journal of the Royal Society Interface},
-  keywords = {SARS-CoV-2,contact tracing,epidemiology,generation time,infectiousness,modelling},
-  number   = {174},
-  pmid     = {33402022},
-  title    = {{On the relationship between serial interval, infectiousness profile and generation time: On the relationship between serial interval, infectiousness profile and generation time}},
-  volume   = {18},
-  year     = {2021}
-}
-@article{Limpert2001,
-  abstract = {On the charms of statistics, and how mechanical models resembling gambling machines offer a link to a handy way to characterize log-normal distributions, which can provide deeper insight into variability and probability - Normal or log-normal: That is the question.},
-  author   = {Limpert, Eckhard and Stahel, Werner A. and Abbt, Markus},
-  doi      = {10.1641/0006-3568(2001)051[0341:LNDATS]2.0.CO;2},
-  issn     = {00063568},
-  journal  = {BioScience},
-  number   = {5},
-  pages    = {341--352},
-  title    = {{Log-normal distributions across the sciences: Keys and clues}},
-  volume   = {51},
-  year     = {2001}
-}
-@article{Lloyd-Smith2005a,
-  abstract = {Population-level analyses often use average quantities to describe heterogeneous systems, particularly when variation does not arise from identifiable groups. A prominent example, central to our current understanding of epidemic spread, is the basic reproductive number, R0, which is defined as the mean number of infections caused by an infected individual in a susceptible population. Population estimates of R0 can obscure considerable individual variation in infectiousness, as highlighted during the global emergence of severe acute respiratory syndrome (SARS) by numerous 'superspreading events' in which certain individuals infected unusually large numbers of secondary cases. For diseases transmitted by non-sexual direct contacts, such as SARS or smallpox, individual variation is difficult to measure empirically, and thus its importance for outbreak dynamics has been unclear. Here we present an integrated theoretical and statistical analysis of the influence of individual variation in infectiousness on disease emergence. Using contact tracing data from eight directly transmitted diseases, we show that the distribution of individual infectiousness around R0 is often highly skewed. Model predictions accounting for this variation differ sharply from average-based approaches, with disease extinction more likely and outbreaks rarer but more explosive. Using these models, we explore implications for outbreak control, showing that individual-specific control measures outperform population-wide measures. Moreover, the dramatic improvements achieved through targeted control policies emphasize the need to identify predictive correlates of higher infectiousness. Our findings indicate that superspreading is a normal feature of disease spread, and to frame ongoing discussion we propose a rigorous definition for superspreading events and a method to predict their frequency. {\textcopyright} 2005 Nature Publishing Group.},
-  author   = {Lloyd-Smith, J. O. and Schreiber, S. J. and Kopp, P. E. and Getz, W. M.},
-  doi      = {10.1038/nature04153},
-  issn     = {14764687},
-  journal  = {Nature},
-  number   = {7066},
-  pages    = {355--359},
-  pmid     = {16292310},
-  title    = {{Superspreading and the effect of individual variation on disease emergence}},
-  volume   = {438},
-  year     = {2005}
-}
-@article{marivate2020,
-  title   = {Use of available data to inform the COVID-19 outbreak in South Africa: a case study},
-  author  = {Marivate, Vukosi and Combrink, Herkulaas MvE},
-  journal = {arXiv preprint arXiv:2004.04813},
-  year    = {2020}
-}
-@article{Nishiura2007,
-  abstract = {The incubation period of infectious diseases, the time from infection with a microorganism to onset of disease, is directly relevant to prevention and control. Since explicit models of the incubation period enhance our understanding of the spread of disease, previous classic studies were revisited, focusing on the modeling methods employed and paying particular attention to relatively unknown historical efforts. The earliest study on the incubation period of pandemic influenza was published in 1919, providing estimates of the incubation period of Spanish flu using the daily incidence on ships departing from several ports in Australia. Although the study explicitly dealt with an unknown time of exposure, the assumed periods of exposure, which had an equal probability of infection, were too long, and thus, likely resulted in slight underestimates of the incubation period. After the suggestion that the incubation period follows lognormal distribution, Japanese epidemiologists extended this assumption to estimates of the time of exposure during a point source outbreak. Although the reason why the incubation period of acute infectious diseases tends to reveal a right-skewed distribution has been explored several times, the validity of the lognormal assumption is yet to be fully clarified. At present, various different distributions are assumed, and the lack of validity in assuming lognormal distribution is particularly apparent in the case of slowly progressing diseases. The present paper indicates that (1) analysis using well-defined short periods of exposure with appropriate statistical methods is critical when the exact time of exposure is unknown, and (2) when assuming a specific distribution for the incubation period, comparisons using different distributions are needed in addition to estimations using different datasets, analyses of the determinants of incubation period, and an understanding of the underlying disease mechanisms. {\textcopyright} 2007 Nishiura; licensee BioMed Central Ltd.},
-  author   = {Nishiura, Hiroshi},
-  doi      = {10.1186/1742-7622-4-2},
-  issn     = {17427622},
-  journal  = {Emerging Themes in Epidemiology},
-  pages    = {1--12},
-  title    = {{Early efforts in modeling the incubation period of infectious diseases with an acute course of illness}},
-  volume   = {4},
-  year     = {2007}
-}
-@article{Nishiura2012,
-  abstract  = {Use of the final size distribution of minor outbreaks for the estimation of the reproduction numbers of supercritical epidemic processes has yet to be considered. We used a branching process model to derive the final size distribution of minor outbreaks, assuming a reproduction number above unity, and applying the method to final size data for pneumonic plague. Pneumonic plague is a rare disease with only one documented major epidemic in a spatially limited setting. Because the final size distribution of a minor outbreak needs to be normalized by the probability of extinction, we assume that the dispersion parameter (k) of the negative-binomial offspring distribution is known, and examine the sensitivity of the reproduction number to variation in dispersion. Assuming a geometric offspring distribution with k=1, the reproduction number was estimated at 1.16 (95% confidence interval: 0.97-1.38). When less dispersed with k=2, the maximum likelihood estimate of the reproduction number was 1.14. These estimates agreed with those published from transmission network analysis, indicating that the human-to-human transmission potential of the pneumonic plague is not very high. Given only minor outbreaks, transmission potential is not sufficiently assessed by directly counting the number of offspring. Since the absence of a major epidemic does not guarantee a subcritical process, the proposed method allows us to conservatively regard epidemic data from minor outbreaks as supercritical, and yield estimates of threshold values above unity. {\textcopyright} 2011.},
-  author    = {Nishiura, Hiroshi and Yan, Ping and Sleeman, Candace K. and Mode, Charles J.},
-  doi       = {10.1016/j.jtbi.2011.10.039},
-  issn      = {00225193},
-  journal   = {Journal of Theoretical Biology},
-  keywords  = {Basic reproduction number,Branching process,Confidence interval,Likelihood function,Statistical model},
-  pages     = {48--55},
-  pmid      = {22079419},
-  publisher = {Elsevier},
-  title     = {{Estimating the transmission potential of supercritical processes based on the final size distribution of minor outbreaks}},
-  url       = {http://dx.doi.org/10.1016/j.jtbi.2011.10.039},
-  volume    = {294},
-  year      = {2012}
-}
-@article{Pearson2020,
-  abstract  = {For 45 African countries/territories already reporting COVID-19 cases before 23 March 2020, we estimate the dates of reporting 1,000 and 10,000 cases. Assuming early epidemic trends without interventions, all 45 were likely to exceed 1,000 confirmed cases by the end of April 2020, with most exceeding 10,000 a few weeks later.},
-  author    = {Pearson, Carl A.B. and van Schalkwyk, Cari and Foss, Anna M. and O'Reilly, Kathleen M. and Pulliam, Juliet R.C.},
-  doi       = {10.2807/1560-7917.ES.2020.25.18.2000543},
-  issn      = {15607917},
-  journal   = {Eurosurveillance},
-  number    = {18},
-  pages     = {1--6},
-  pmid      = {32400361},
-  publisher = {European Centre for Disease Prevention and Control (ECDC)},
-  title     = {{Projected early spread of COVID-19 in Africa through 1 June 2020}},
-  url       = {http://dx.doi.org/10.2807/1560-7917.ES.2020.25.18.2000543},
-  volume    = {25},
-  year      = {2020}
-}
-@article{Society2010,
-  author    = {Becker, Niels and Society, International Biometric},
-  issn      = {0006-341X},
-  journal   = {Biometrics},
-  number    = {3},
-  pages     = {515--522},
-  publisher = {JSTOR},
-  title     = {{Estimation for discrete time branching processes with application to epidemics}},
-  volume    = {33},
-  year      = {1977}
-}
-@article{Wang2020,
-  abstract  = {Coronavirus disease 2019 (COVID-19) was first identified in late 2019 in Wuhan, Hubei Province, China and spread globally in months, sparking worldwide concern. However, it is unclear whether super-spreading events occurred during the early outbreak phase, as has been observed for other emerging viruses. Here, we analyse 208 publicly available SARS-CoV-2 genome sequences collected during the early outbreak phase. We combine phylogenetic analysis with Bayesian inference under an epidemiological model to trace person-to-person transmission. The dispersion parameter of the offspring distribution in the inferred transmission chain was estimated to be 0.23 (95% CI: 0.13–0.38), indicating there are individuals who directly infected a disproportionately large number of people. Our results showed that super-spreading events played an important role in the early stage of the COVID-19 outbreak.},
-  author    = {Wang, Liang and Didelot, Xavier and Yang, Jing and Wong, Gary and Shi, Yi and Liu, Wenjun and Gao, George F. and Bi, Yuhai},
-  doi       = {10.1038/s41467-020-18836-4},
-  issn      = {20411723},
-  journal   = {Nature Communications},
-  number    = {1},
-  pages     = {1--6},
-  pmid      = {33024095},
-  publisher = {Springer US},
-  title     = {{Inference of person-to-person transmission of COVID-19 reveals hidden super-spreading events during the early outbreak phase}},
-  url       = {http://dx.doi.org/10.1038/s41467-020-18836-4},
-  volume    = {11},
-  year      = {2020}
-}
-@article{Yadav2021,
-  abstract = {In this review, we have discussed the different statistical modeling and prediction techniques for various infectious diseases including the recent pandemic of COVID-19. The distribution fitting, time series modeling along with predictive monitoring approaches, and epidemiological modeling are illustrated. When the epidemiology data is sufficient to fit with the required sample size, the normal distribution in general or other theoretical distributions are fitted and the best-fitted distribution is chosen for the prediction of the spread of the disease. The infectious diseases develop over time and we have data on the single variable that is the number of infections that happened, therefore, time series models are fitted and the prediction is done based on the best-fitted model. Monitoring approaches may also be applied to time series models which could estimate the parameters more precisely. In epidemiological modeling, more biological parameters are incorporated in the models and the forecasting of the disease spread is carried out. We came up with, how to improve the existing modeling methods, the use of fuzzy variables, and detection of fraud in the available data. Ultimately, we have reviewed the results of recent statistical modeling efforts to predict the course of COVID-19 spread.},
-  author   = {Yadav, Subhash Kumar and Akhter, Yusuf},
-  doi      = {10.3389/fpubh.2021.645405},
-  issn     = {22962565},
-  journal  = {Frontiers in Public Health},
-  keywords = {distribution fitting models,epidemiological models of disease,estimation,parameters,prediction,time series regression models},
-  number   = {June},
-  pages    = {1--27},
-  pmid     = {34222166},
-  title    = {{Statistical Modeling for the Prediction of Infectious Disease Dissemination With Special Reference to COVID-19 Spread}},
-  volume   = {9},
-  year     = {2021}
-}

From d77d7b3d6b6023966b3bb7b29b479f8c8ff01eb6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 22 May 2023 13:09:17 +0100
Subject: [PATCH 231/828] replaced old library with new one exported as
 csl-json

---
 vignettes/references.json | 1643 ++++++++++++++++++-------------------
 1 file changed, 792 insertions(+), 851 deletions(-)

diff --git a/vignettes/references.json b/vignettes/references.json
index f1678a28..dcbb4440 100644
--- a/vignettes/references.json
+++ b/vignettes/references.json
@@ -1,853 +1,794 @@
 [
-    {
-        "id": "abbott2020",
-        "author": [
-            {
-                "family": "Abbott",
-                "given": "Sam"
-            },
-            {
-                "family": "Hellewell",
-                "given": "Joel"
-            },
-            {
-                "family": "Munday",
-                "given": "James"
-            },
-            {
-                "family": "Funk",
-                "given": "Sebastian"
-            },
-            {
-                "family": "group",
-                "given": "CMMID",
-                "dropping-particle": "nCoV working"
-            },
-            {
-                "literal": "others"
-            }
-        ],
-        "citation-key": "abbott2020",
-        "container-title": "Wellcome open research",
-        "issued": {
-            "date-parts": [
-                [
-                    2020
-                ]
-            ]
-        },
-        "publisher": "The Wellcome Trust",
-        "title": "The transmissibility of novel Coronavirus in the early stages of the 2019-20 outbreak in Wuhan: Exploring initial point-source exposure sizes and durations using scenario analysis",
-        "type": "article-journal",
-        "volume": "5"
-    },
-    {
-        "id": "alene2021",
-        "abstract": "Background: Understanding the epidemiological parameters that determine the transmission dynamics of COVID-19 is essential for public health intervention. Globally, a number of studies were conducted to estimate the average serial interval and incubation period of COVID-19. Combining findings of existing studies that estimate the average serial interval and incubation period of COVID-19 significantly improves the quality of evidence. Hence, this study aimed to determine the overall average serial interval and incubation period of COVID-19. Methods: We followed the PRISMA checklist to present this study. A comprehensive search strategy was carried out from international electronic databases (Google Scholar, PubMed, Science Direct, Web of Science, CINAHL, and Cochrane Library) by two experienced reviewers (MAA and DBK) authors between the 1st of June and the 31st of July 2020. All observational studies either reporting the serial interval or incubation period in persons diagnosed with COVID-19 were included in this study. Heterogeneity across studies was assessed using the I2 and Higgins test. The NOS adapted for cross-sectional studies was used to evaluate the quality of studies. A random effect Meta-analysis was employed to determine the pooled estimate with 95% (CI). Microsoft Excel was used for data extraction and R software was used for analysis. Results: We combined a total of 23 studies to estimate the overall mean serial interval of COVID-19. The mean serial interval of COVID-19 ranged from 4. 2 to 7.5 days. Our meta-analysis showed that the weighted pooled mean serial interval of COVID-19 was 5.2 (95%CI: 4.9–5.5) days. Additionally, to pool the mean incubation period of COVID-19, we included 14 articles. The mean incubation period of COVID-19 also ranged from 4.8 to 9 days. Accordingly, the weighted pooled mean incubation period of COVID-19 was 6.5 (95%CI: 5.9–7.1) days. Conclusions: This systematic review and meta-analysis showed that the weighted pooled mean serial interval and incubation period of COVID-19 were 5.2, and 6.5 days, respectively. In this study, the average serial interval of COVID-19 is shorter than the average incubation period, which suggests that substantial numbers of COVID-19 cases will be attributed to presymptomatic transmission.",
-        "author": [
-            {
-                "family": "Alene",
-                "given": "Muluneh"
-            },
-            {
-                "family": "Yismaw",
-                "given": "Leltework"
-            },
-            {
-                "family": "Assemie",
-                "given": "Moges Agazhe"
-            },
-            {
-                "family": "Ketema",
-                "given": "Daniel Bekele"
-            },
-            {
-                "family": "Gietaneh",
-                "given": "Wodaje"
-            },
-            {
-                "family": "Birhan",
-                "given": "Tilahun Yemanu"
-            }
-        ],
-        "citation-key": "alene2021",
-        "container-title": "BMC Infectious Diseases",
-        "DOI": "10.1186/s12879-021-05950-x",
-        "ISSN": "14712334",
-        "issue": "1",
-        "issued": {
-            "date-parts": [
-                [
-                    2021
-                ]
-            ]
-        },
-        "page": "1–9",
-        "PMID": "33706702",
-        "publisher": "BMC Infectious Diseases",
-        "title": "Serial interval and incubation period of COVID-19: a systematic review and meta-analysis",
-        "type": "article-journal",
-        "volume": "21"
-    },
-    {
-        "id": "allen2012",
-        "abstract": "The basic reproduction number, ℛ(0), one of the most well-known thresholds in deterministic epidemic theory, predicts a disease outbreak if ℛ(0)>1. In stochastic epidemic theory, there are also thresholds that predict a major outbreak. In the case of a single infectious group, if ℛ(0)>1 and i infectious individuals are introduced into a susceptible population, then the probability of a major outbreak is approximately 1-(1/ℛ(0))( i ). With multiple infectious groups from which the disease could emerge, this result no longer holds. Stochastic thresholds for multiple groups depend on the number of individuals within each group, i ( j ), j=1, \\ldots, n, and on the probability of disease extinction for each group, q ( j ). It follows from multitype branching processes that the probability of a major outbreak is approximately [Formula: see text]. In this investigation, we summarize some of the deterministic and stochastic threshold theory, illustrate how to calculate the stochastic thresholds, and derive some new relationships between the deterministic and stochastic thresholds.",
-        "author": [
-            {
-                "family": "Allen",
-                "given": "Linda J.S."
-            },
-            {
-                "family": "Lahodny",
-                "given": "Glenn E."
-            }
-        ],
-        "citation-key": "allen2012",
-        "container-title": "Journal of Biological Dynamics",
-        "DOI": "10.1080/17513758.2012.665502",
-        "ISSN": "17513758",
-        "issue": "2",
-        "issued": {
-            "date-parts": [
-                [
-                    2012
-                ]
-            ]
-        },
-        "page": "590–611",
-        "title": "Extinction thresholds in deterministic and stochastic epidemic models",
-        "type": "article-journal",
-        "volume": "6"
-    },
-    {
-        "id": "becker1977",
-        "author": [
-            {
-                "family": "Becker",
-                "given": "Niels"
-            },
-            {
-                "family": "Society",
-                "given": "International Biometric"
-            }
-        ],
-        "citation-key": "becker1977",
-        "container-title": "Biometrics",
-        "ISSN": "0006-341X",
-        "issue": "3",
-        "issued": {
-            "date-parts": [
-                [
-                    1977
-                ]
-            ]
-        },
-        "page": "515–522",
-        "publisher": "JSTOR",
-        "title": "Estimation for discrete time branching processes with application to epidemics",
-        "type": "article-journal",
-        "volume": "33"
-    },
-    {
-        "id": "blumberg2013",
-        "abstract": "Many diseases exhibit subcritical transmission (i.e. 0<R0<1) so that infections occur as self-limited 'stuttering chains'. Given an ensemble of stuttering chains, information about the number of cases in each chain can be used to infer R0, which is of crucial importance for monitoring the risk that a disease will emerge to establish endemic circulation. However, the challenge of imperfect case detection has led authors to adopt a variety of work-around measures when inferring R0, such as discarding data on isolated cases or aggregating intermediate-sized chains together. Each of these methods has the potential to introduce bias, but a quantitative comparison of these approaches has not been reported. By adapting a model based on a negative binomial offspring distribution that permits a variable degree of transmission heterogeneity, we present a unified analysis of existing R0 estimation methods. Simulation studies show that the degree of transmission heterogeneity, when improperly modeled, can significantly impact the bias of R0 estimation methods designed for imperfect observation. These studies also highlight the importance of isolated cases in assessing whether an estimation technique is consistent with observed data. Analysis of data from measles outbreaks shows that likelihood scores are highest for models that allow a flexible degree of transmission heterogeneity. Aggregating intermediate sized chains often has similar performance to analyzing a complete chain size distribution. However, truncating isolated cases is beneficial only when surveillance systems clearly favor full observation of large chains but not small chains. Meanwhile, if data on the type and proportion of cases that are unobserved were known, we demonstrate that maximum likelihood inference of R0 could be adjusted accordingly. This motivates the need for future empirical and theoretical work to quantify observation error and incorporate relevant mechanisms into stuttering chain models used to estimate transmission parameters. © 2013 Elsevier B.V.",
-        "author": [
-            {
-                "family": "Blumberg",
-                "given": "S."
-            },
-            {
-                "family": "Lloyd-Smith",
-                "given": "J. O."
-            }
-        ],
-        "citation-key": "blumberg2013",
-        "container-title": "Epidemics",
-        "DOI": "10.1016/j.epidem.2013.05.002",
-        "ISSN": "17554365",
-        "issue": "3",
-        "issued": {
-            "date-parts": [
-                [
-                    2013
-                ]
-            ]
-        },
-        "page": "131–145",
-        "PMID": "24021520",
-        "publisher": "Elsevier B.V.",
-        "title": "Comparing methods for estimating R0 from the size distribution of subcritical transmission chains",
-        "type": "article-journal",
-        "URL": "http://dx.doi.org/10.1016/j.epidem.2013.05.002",
-        "volume": "5"
-    },
-    {
-        "id": "blumberg2013a",
-        "abstract": "For many infectious disease processes such as emerging zoonoses and vaccine-preventable diseases, 0<R0<1 and infections occur as self-limited stuttering transmission chains. A mechanistic understanding of transmission is essential for characterizing the risk of emerging diseases and monitoring spatio-temporal dynamics. Thus methods for inferring R0 and the degree of heterogeneity in transmission from stuttering chain data have important applications in disease surveillance and management. Previous researchers have used chain size distributions to infer R0, but estimation of the degree of individual-level variation in infectiousness (as quantified by the dispersion parameter, k) has typically required contact tracing data. Utilizing branching process theory along with a negative binomial offspring distribution, we demonstrate how maximum likelihood estimation can be applied to chain size data to infer both R0 and the dispersion parameter that characterizes heterogeneity. While the maximum likelihood value for R0 is a simple function of the average chain size, the associated confidence intervals are dependent on the inferred degree of transmission heterogeneity. As demonstrated for monkeypox data from the Democratic Republic of Congo, this impacts when a statistically significant change in R0 is detectable. In addition, by allowing for superspreading events, inference of k shifts the threshold above which a transmission chain should be considered anomalously large for a given value of R0 (thus reducing the probability of false alarms about pathogen adaptation). Our analysis of monkeypox also clarifies the various ways that imperfect observation can impact inference of transmission parameters, and highlights the need to quantitatively evaluate whether observation is likely to significantly bias results.",
-        "author": [
-            {
-                "family": "Blumberg",
-                "given": "Seth"
-            },
-            {
-                "family": "Lloyd-Smith",
-                "given": "James O."
-            }
-        ],
-        "citation-key": "blumberg2013a",
-        "container-title": "PLoS Computational Biology",
-        "DOI": "10.1371/journal.pcbi.1002993",
-        "ISSN": "15537358",
-        "issue": "5",
-        "issued": {
-            "date-parts": [
-                [
-                    2013
-                ]
-            ]
-        },
-        "page": "1–17",
-        "PMID": "23658504",
-        "title": "Inference of R0 and Transmission Heterogeneity from the Size Distribution of Stuttering Chains",
-        "type": "article-journal",
-        "volume": "9"
-    },
-    {
-        "id": "chen2022",
-        "abstract": "The generation time distribution, reflecting the time between successive infections in transmission chains, is a key epidemiological parameter for describing COVID-19 transmission dynamics. However, because exact infection times are rarely known, it is often approximated by the serial interval distribution. This approximation holds under the assumption that infectors and infectees share the same incubation period distribution, which may not always be true. We estimated incubation period and serial interval distributions using 629 transmission pairs reconstructed by investigating 2989 confirmed cases in China in January-February 2020, and developed an inferential framework to estimate the generation time distribution that accounts for variation over time due to changes in epidemiology, sampling biases and public health and social measures. We identified substantial reductions over time in the serial interval and generation time distributions. Our proposed method provides more reliable estimation of the temporal variation in the generation time distribution, improving assessment of transmission dynamics.",
-        "author": [
-            {
-                "family": "Chen",
-                "given": "Dongxuan"
-            },
-            {
-                "family": "Lau",
-                "given": "Yiu Chung"
-            },
-            {
-                "family": "Xu",
-                "given": "Xiao Ke"
-            },
-            {
-                "family": "Wang",
-                "given": "Lin"
-            },
-            {
-                "family": "Du",
-                "given": "Zhanwei"
-            },
-            {
-                "family": "Tsang",
-                "given": "Tim K."
-            },
-            {
-                "family": "Wu",
-                "given": "Peng"
-            },
-            {
-                "family": "Lau",
-                "given": "Eric H.Y."
-            },
-            {
-                "family": "Wallinga",
-                "given": "Jacco"
-            },
-            {
-                "family": "Cowling",
-                "given": "Benjamin J."
-            },
-            {
-                "family": "Ali",
-                "given": "Sheikh Taslim"
-            }
-        ],
-        "citation-key": "chen2022",
-        "container-title": "Nature Communications",
-        "DOI": "10.1038/s41467-022-35496-8",
-        "ISSN": "20411723",
-        "issue": "1",
-        "issued": {
-            "date-parts": [
-                [
-                    2022
-                ]
-            ]
-        },
-        "publisher": "Springer US",
-        "title": "Inferring time-varying generation time, serial interval, and incubation period distributions for COVID-19",
-        "type": "article-journal",
-        "volume": "13"
-    },
-    {
-        "id": "farrington1999",
-        "abstract": "We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.",
-        "author": [
-            {
-                "family": "Farrington",
-                "given": "C. P."
-            },
-            {
-                "family": "Grant",
-                "given": "A. D."
-            }
-        ],
-        "citation-key": "farrington1999",
-        "container-title": "Journal of Applied Probability",
-        "DOI": "10.1239/jap/1032374633",
-        "ISSN": "00219002",
-        "issue": "3",
-        "issued": {
-            "date-parts": [
-                [
-                    1999
-                ]
-            ]
-        },
-        "page": "771–779",
-        "title": "The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease",
-        "type": "article-journal",
-        "volume": "36"
-    },
-    {
-        "id": "farrington1999a",
-        "abstract": "We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.",
-        "author": [
-            {
-                "family": "Farrington",
-                "given": "C. P."
-            },
-            {
-                "family": "Grant",
-                "given": "A. D."
-            }
-        ],
-        "citation-key": "farrington1999a",
-        "container-title": "Journal of Applied Probability",
-        "DOI": "10.1239/jap/1032374633",
-        "ISSN": "00219002",
-        "issue": "3",
-        "issued": {
-            "date-parts": [
-                [
-                    1999
-                ]
-            ]
-        },
-        "page": "771–779",
-        "title": "The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease",
-        "type": "article-journal",
-        "volume": "36"
-    },
-    {
-        "id": "farrington2003",
-        "abstract": "Mass vaccination programmes aim to maintain the effective reproduction number R of an infection below unity. We describe methods for monitoring the value of R using surveillance data. The models are based on branching processes in which R is identified with the offspring mean. We derive unconditional likelihoods for the offspring mean using data on outbreak size and outbreak duration. We also discuss Bayesian methods, implemented by Metropolis-Hastings sampling. We investigate by simulation the validity of the models with respect to depletion of susceptibles and under-ascertainment of cases. The methods are illustrated using surveillance data on measles in the USA.",
-        "author": [
-            {
-                "family": "Farrington",
-                "given": "C. P."
-            },
-            {
-                "family": "Kanaan",
-                "given": "M. N."
-            },
-            {
-                "family": "Gay",
-                "given": "N. J."
-            }
-        ],
-        "citation-key": "farrington2003",
-        "container-title": "Biostatistics (Oxford, England)",
-        "DOI": "10.1093/biostatistics/4.2.279",
-        "ISSN": "14654644",
-        "issue": "2",
-        "issued": {
-            "date-parts": [
-                [
-                    2003
-                ]
-            ]
-        },
-        "page": "279–295",
-        "title": "Branching process models for surveillance of infectious diseases controlled by mass vaccination.",
-        "type": "article-journal",
-        "volume": "4"
-    },
-    {
-        "id": "fine2003",
-        "abstract": "The interval between successive cases of an infectious disease is determined by the time from infection to infectiousness, the duration of infectiousness, the time from infection to disease onset (incubation period), the duration of any extra-human phase of the infectious agent, and the proportion clinically affected among infected individuals. The interval is important in the interpretation of infectious disease surveillance and trend data, in the identification of outbreaks, and in the optimization of quarantine and contact tracing. This paper discusses the properties of these intervals, as measured between transmission events or between clinical onsets of successive infected individuals, noting the determinants of their ranges and frequency distributions, the circumstances under which secondary cases may arise before primaries, and under which the infection transmission interval will be different from the interval between clinical onsets of successive cases. It discusses the derivation of interval distribution statistics from descriptive data given in standard textbooks, with illustrations from published data on outbreaks, households, and epidemiologic tracing. Finally, it discusses the implications of such measures for studies of secondary attack rates, for the persistence of infection in human communities, for outbreak response, and for elimination or eradication programs.",
-        "author": [
-            {
-                "family": "Fine",
-                "given": "Paul E.M."
-            }
-        ],
-        "citation-key": "fine2003",
-        "container-title": "American Journal of Epidemiology",
-        "DOI": "10.1093/aje/kwg251",
-        "ISBN": "0002-9262 (Print) 0002-9262 (Linking)",
-        "ISSN": "00029262",
-        "issue": "11",
-        "issued": {
-            "date-parts": [
-                [
-                    2003
-                ]
-            ]
-        },
-        "page": "1039–1047",
-        "PMID": "14630599",
-        "title": "The Interval between Successive Cases of an Infectious Disease",
-        "type": "article-journal",
-        "volume": "158"
-    },
-    {
-        "id": "grassly2006",
-        "abstract": "Seasonal change in the incidence of infectious diseases is a common phenomenon in both temperate and tropical climates. However, the mechanisms responsible for seasonal disease incidence, and the epidemiological consequences of seasonality, are poorly understood with rare exception. Standard epidemiological theory and concepts such as the basic reproductive number R 0 no longer apply, and the implications for interventions that themselves may be periodic, such as pulse vaccination, have not been formally examined. This paper examines the causes and consequences of seasonality, and in so doing derives several new results concerning vaccination strategy and the interpretation of disease outbreak data. It begins with a brief review of published scientific studies in support of different causes of seasonality in infectious diseases of humans, identifying four principal mechanisms and their association with different routes of transmission. It then describes the consequences of seasonality for R 0 , disease outbreaks, endemic dynamics and persistence. Finally, a mathematical analysis of routine and pulse vaccination programmes for seasonal infections is presented. The synthesis of seasonal infectious disease epidemiology attempted by this paper highlights the need for further empirical and theoretical work. © 2006 The Royal Society.",
-        "author": [
-            {
-                "family": "Grassly",
-                "given": "Nicholas C."
-            },
-            {
-                "family": "Fraser",
-                "given": "Christophe"
-            }
-        ],
-        "citation-key": "grassly2006",
-        "container-title": "Proceedings of the Royal Society B: Biological Sciences",
-        "DOI": "10.1098/rspb.2006.3604",
-        "ISSN": "14712970",
-        "issue": "1600",
-        "issued": {
-            "date-parts": [
-                [
-                    2006
-                ]
-            ]
-        },
-        "page": "2541–2550",
-        "title": "Seasonal infectious disease epidemiology",
-        "type": "article-journal",
-        "volume": "273"
-    },
-    {
-        "id": "griffin2020",
-        "abstract": "The serial interval is the time between symptom onsets in an infector-infectee pair. The generation time, also known as the generation interval, is the time between infection events in an infector-infectee pair. The serial interval and the generation time are key parameters for assessing the dynamics of a disease. A number of scientific papers reported information pertaining to the serial interval and/or generation time for COVID-19. Objective Conduct a review of available evidence to advise on appropriate parameter values for serial interval and generation time in national COVID-19 transmission models for Ireland and on methodological issues relating to those parameters. Methods We conducted a rapid review of the literature covering the period 1 January 2020 and 21 August 2020, following predefined eligibility criteria. Forty scientific papers met our inclusion criteria and were included in the review. Results The mean of the serial interval ranged from 3.03 to 7.6 days, based on 38 estimates, and the median from 1.0 to 6.0 days (based on 15 estimates). Only three estimates were provided for the mean of the generation time. These ranged from 3.95 to 5.20 days. One estimate of 5.0 days was provided for the median of the generation time. Discussion Estimates of the serial interval and the generation time are very dependent on the specific factors that apply at the time that the data are collected, including the level of social contact. Consequently, the estimates may not be entirely relevant to other environments. Therefore, local estimates should be obtained as soon as possible. Careful consideration should be given to the methodology that is used. Real-time estimations of the serial interval/generation time, allowing for variations over time, may provide more accurate estimates of reproduction numbers than using conventionally fixed serial interval/generation time distributions.",
-        "author": [
-            {
-                "family": "Griffin",
-                "given": "John"
-            },
-            {
-                "family": "Casey",
-                "given": "Miriam"
-            },
-            {
-                "family": "Collins",
-                "given": "Áine"
-            },
-            {
-                "family": "Hunt",
-                "given": "Kevin"
-            },
-            {
-                "family": "McEvoy",
-                "given": "David"
-            },
-            {
-                "family": "Byrne",
-                "given": "Andrew"
-            },
-            {
-                "family": "McAloon",
-                "given": "Conor"
-            },
-            {
-                "family": "Barber",
-                "given": "Ann"
-            },
-            {
-                "family": "Lane",
-                "given": "Elizabeth Ann"
-            },
-            {
-                "family": "More",
-                "given": "Simon"
-            }
-        ],
-        "citation-key": "griffin2020",
-        "container-title": "BMJ Open",
-        "DOI": "10.1136/bmjopen-2020-040263",
-        "ISBN": "9789241512763",
-        "ISSN": "20446055",
-        "issue": "11",
-        "issued": {
-            "date-parts": [
-                [
-                    2020
-                ]
-            ]
-        },
-        "page": "1–9",
-        "PMID": "33234640",
-        "title": "Rapid review of available evidence on the serial interval and generation time of COVID-19",
-        "type": "article-journal",
-        "volume": "10"
-    },
-    {
-        "id": "jacob2010",
-        "abstract": "Branching processes are stochastic individual-based processes leading consequently to a bottom-up approach. In addition, since the state variables are random integer variables (representing population sizes), the extinction occurs at random finite time on the extinction set, thus leading to fine and realistic predictions. Starting from the simplest and well-known single-type Bienaymé-Galton-Watson branching process that was used by several authors for approximating the beginning of an epidemic, we then present a general branching model with age and population dependent individual transitions. However contrary to the classical Bienaymé-Galton-Watson or asymptotically Bienaymé-Galton-Watson setting, where the asymptotic behavior of the process, as time tends to infinity, is well understood, the asymptotic behavior of this general process is a new question. Here we give some solutions for dealing with this problem depending on whether the initial population size is large or small, and whether the disease is rare or non-rare when the initial population size is large.",
-        "author": [
-            {
-                "family": "Jacob",
-                "given": "Christine"
-            }
-        ],
-        "citation-key": "jacob2010",
-        "container-title": "International Journal of Environmental Research and Public Health",
-        "DOI": "10.3390/ijerph7031204",
-        "ISSN": "16604601",
-        "issue": "3",
-        "issued": {
-            "date-parts": [
-                [
-                    2010
-                ]
-            ]
-        },
-        "page": "1186–1204",
-        "title": "Branching processes: Their role in epidemiology",
-        "type": "article-journal",
-        "volume": "7"
-    },
-    {
-        "id": "lehtinen2021",
-        "abstract": "The timing of transmission plays a key role in the dynamics and controllability of an epidemic. However, observing generation times - the time interval between the infection of an infector and an infectee in a transmission pair - requires data on infection times, which are generally unknown. The timing of symptom onset is more easily observed; generation times are therefore often estimated based on serial intervals - the time interval between symptom onset of an infector and an infectee. This estimation follows one of two approaches: (i) approximating the generation time distribution by the serial interval distribution or (ii) deriving the generation time distribution from the serial interval and incubation period - the time interval between infection and symptom onset in a single individual - distributions. These two approaches make different - and not always explicitly stated - assumptions about the relationship between infectiousness and symptoms, resulting in different generation time distributions with the same mean but unequal variances. Here, we clarify the assumptions that each approach makes and show that neither set of assumptions is plausible for most pathogens. However, the variances of the generation time distribution derived under each assumption can reasonably be considered as upper (approximation with serial interval) and lower (derivation from serial interval) bounds. Thus, we suggest a pragmatic solution is to use both approaches and treat these as edge cases in downstream analysis. We discuss the impact of the variance of the generation time distribution on the controllability of an epidemic through strategies based on contact tracing, and we show that underestimating this variance is likely to overestimate controllability.",
-        "author": [
-            {
-                "family": "Lehtinen",
-                "given": "Sonja"
-            },
-            {
-                "family": "Ashcroft",
-                "given": "Peter"
-            },
-            {
-                "family": "Bonhoeffer",
-                "given": "Sebastian"
-            }
-        ],
-        "citation-key": "lehtinen2021",
-        "container-title": "Journal of the Royal Society Interface",
-        "DOI": "10.1098/rsif.2020.0756",
-        "ISSN": "17425662",
-        "issue": "174",
-        "issued": {
-            "date-parts": [
-                [
-                    2021
-                ]
-            ]
-        },
-        "PMID": "33402022",
-        "title": "On the relationship between serial interval, infectiousness profile and generation time: On the relationship between serial interval, infectiousness profile and generation time",
-        "type": "article-journal",
-        "volume": "18"
-    },
-    {
-        "id": "limpert2001",
-        "abstract": "On the charms of statistics, and how mechanical models resembling gambling machines offer a link to a handy way to characterize log-normal distributions, which can provide deeper insight into variability and probability - Normal or log-normal: That is the question.",
-        "author": [
-            {
-                "family": "Limpert",
-                "given": "Eckhard"
-            },
-            {
-                "family": "Stahel",
-                "given": "Werner A."
-            },
-            {
-                "family": "Abbt",
-                "given": "Markus"
-            }
-        ],
-        "citation-key": "limpert2001",
-        "container-title": "BioScience",
-        "DOI": "10.1641/0006-3568(2001)051[0341:LNDATS]2.0.CO;2",
-        "ISSN": "00063568",
-        "issue": "5",
-        "issued": {
-            "date-parts": [
-                [
-                    2001
-                ]
-            ]
-        },
-        "page": "341–352",
-        "title": "Log-normal distributions across the sciences: Keys and clues",
-        "type": "article-journal",
-        "volume": "51"
-    },
-    {
-        "id": "lloyd-smith2005",
-        "abstract": "Population-level analyses often use average quantities to describe heterogeneous systems, particularly when variation does not arise from identifiable groups. A prominent example, central to our current understanding of epidemic spread, is the basic reproductive number, R0, which is defined as the mean number of infections caused by an infected individual in a susceptible population. Population estimates of R0 can obscure considerable individual variation in infectiousness, as highlighted during the global emergence of severe acute respiratory syndrome (SARS) by numerous 'superspreading events' in which certain individuals infected unusually large numbers of secondary cases. For diseases transmitted by non-sexual direct contacts, such as SARS or smallpox, individual variation is difficult to measure empirically, and thus its importance for outbreak dynamics has been unclear. Here we present an integrated theoretical and statistical analysis of the influence of individual variation in infectiousness on disease emergence. Using contact tracing data from eight directly transmitted diseases, we show that the distribution of individual infectiousness around R0 is often highly skewed. Model predictions accounting for this variation differ sharply from average-based approaches, with disease extinction more likely and outbreaks rarer but more explosive. Using these models, we explore implications for outbreak control, showing that individual-specific control measures outperform population-wide measures. Moreover, the dramatic improvements achieved through targeted control policies emphasize the need to identify predictive correlates of higher infectiousness. Our findings indicate that superspreading is a normal feature of disease spread, and to frame ongoing discussion we propose a rigorous definition for superspreading events and a method to predict their frequency. © 2005 Nature Publishing Group.",
-        "author": [
-            {
-                "family": "Lloyd-Smith",
-                "given": "J. O."
-            },
-            {
-                "family": "Schreiber",
-                "given": "S. J."
-            },
-            {
-                "family": "Kopp",
-                "given": "P. E."
-            },
-            {
-                "family": "Getz",
-                "given": "W. M."
-            }
-        ],
-        "citation-key": "lloyd-smith2005",
-        "container-title": "Nature",
-        "DOI": "10.1038/nature04153",
-        "ISSN": "14764687",
-        "issue": "7066",
-        "issued": {
-            "date-parts": [
-                [
-                    2005
-                ]
-            ]
-        },
-        "page": "355–359",
-        "PMID": "16292310",
-        "title": "Superspreading and the effect of individual variation on disease emergence",
-        "type": "article-journal",
-        "volume": "438"
-    },
-    {
-        "id": "marivate2020",
-        "author": [
-            {
-                "family": "Marivate",
-                "given": "Vukosi"
-            },
-            {
-                "family": "Combrink",
-                "given": "Herkulaas MvE"
-            }
-        ],
-        "citation-key": "marivate2020",
-        "container-title": "arXiv preprint arXiv:2004.04813",
-        "issued": {
-            "date-parts": [
-                [
-                    2020
-                ]
-            ]
-        },
-        "title": "Use of available data to inform the COVID-19 outbreak in South Africa: a case study",
-        "type": "article-journal"
-    },
-    {
-        "id": "nishiura2007",
-        "abstract": "The incubation period of infectious diseases, the time from infection with a microorganism to onset of disease, is directly relevant to prevention and control. Since explicit models of the incubation period enhance our understanding of the spread of disease, previous classic studies were revisited, focusing on the modeling methods employed and paying particular attention to relatively unknown historical efforts. The earliest study on the incubation period of pandemic influenza was published in 1919, providing estimates of the incubation period of Spanish flu using the daily incidence on ships departing from several ports in Australia. Although the study explicitly dealt with an unknown time of exposure, the assumed periods of exposure, which had an equal probability of infection, were too long, and thus, likely resulted in slight underestimates of the incubation period. After the suggestion that the incubation period follows lognormal distribution, Japanese epidemiologists extended this assumption to estimates of the time of exposure during a point source outbreak. Although the reason why the incubation period of acute infectious diseases tends to reveal a right-skewed distribution has been explored several times, the validity of the lognormal assumption is yet to be fully clarified. At present, various different distributions are assumed, and the lack of validity in assuming lognormal distribution is particularly apparent in the case of slowly progressing diseases. The present paper indicates that (1) analysis using well-defined short periods of exposure with appropriate statistical methods is critical when the exact time of exposure is unknown, and (2) when assuming a specific distribution for the incubation period, comparisons using different distributions are needed in addition to estimations using different datasets, analyses of the determinants of incubation period, and an understanding of the underlying disease mechanisms. © 2007 Nishiura; licensee BioMed Central Ltd.",
-        "author": [
-            {
-                "family": "Nishiura",
-                "given": "Hiroshi"
-            }
-        ],
-        "citation-key": "nishiura2007",
-        "container-title": "Emerging Themes in Epidemiology",
-        "DOI": "10.1186/1742-7622-4-2",
-        "ISSN": "17427622",
-        "issued": {
-            "date-parts": [
-                [
-                    2007
-                ]
-            ]
-        },
-        "page": "1–12",
-        "title": "Early efforts in modeling the incubation period of infectious diseases with an acute course of illness",
-        "type": "article-journal",
-        "volume": "4"
-    },
-    {
-        "id": "nishiura2012",
-        "abstract": "Use of the final size distribution of minor outbreaks for the estimation of the reproduction numbers of supercritical epidemic processes has yet to be considered. We used a branching process model to derive the final size distribution of minor outbreaks, assuming a reproduction number above unity, and applying the method to final size data for pneumonic plague. Pneumonic plague is a rare disease with only one documented major epidemic in a spatially limited setting. Because the final size distribution of a minor outbreak needs to be normalized by the probability of extinction, we assume that the dispersion parameter (k) of the negative-binomial offspring distribution is known, and examine the sensitivity of the reproduction number to variation in dispersion. Assuming a geometric offspring distribution with k=1, the reproduction number was estimated at 1.16 (95% confidence interval: 0.97-1.38). When less dispersed with k=2, the maximum likelihood estimate of the reproduction number was 1.14. These estimates agreed with those published from transmission network analysis, indicating that the human-to-human transmission potential of the pneumonic plague is not very high. Given only minor outbreaks, transmission potential is not sufficiently assessed by directly counting the number of offspring. Since the absence of a major epidemic does not guarantee a subcritical process, the proposed method allows us to conservatively regard epidemic data from minor outbreaks as supercritical, and yield estimates of threshold values above unity. © 2011.",
-        "author": [
-            {
-                "family": "Nishiura",
-                "given": "Hiroshi"
-            },
-            {
-                "family": "Yan",
-                "given": "Ping"
-            },
-            {
-                "family": "Sleeman",
-                "given": "Candace K."
-            },
-            {
-                "family": "Mode",
-                "given": "Charles J."
-            }
-        ],
-        "citation-key": "nishiura2012",
-        "container-title": "Journal of Theoretical Biology",
-        "DOI": "10.1016/j.jtbi.2011.10.039",
-        "ISSN": "00225193",
-        "issued": {
-            "date-parts": [
-                [
-                    2012
-                ]
-            ]
-        },
-        "page": "48–55",
-        "PMID": "22079419",
-        "publisher": "Elsevier",
-        "title": "Estimating the transmission potential of supercritical processes based on the final size distribution of minor outbreaks",
-        "type": "article-journal",
-        "URL": "http://dx.doi.org/10.1016/j.jtbi.2011.10.039",
-        "volume": "294"
-    },
-    {
-        "id": "pearson2020",
-        "abstract": "For 45 African countries/territories already reporting COVID-19 cases before 23 March 2020, we estimate the dates of reporting 1,000 and 10,000 cases. Assuming early epidemic trends without interventions, all 45 were likely to exceed 1,000 confirmed cases by the end of April 2020, with most exceeding 10,000 a few weeks later.",
-        "author": [
-            {
-                "family": "Pearson",
-                "given": "Carl A.B."
-            },
-            {
-                "family": "Schalkwyk",
-                "given": "Cari",
-                "non-dropping-particle": "van"
-            },
-            {
-                "family": "Foss",
-                "given": "Anna M."
-            },
-            {
-                "family": "O'Reilly",
-                "given": "Kathleen M."
-            },
-            {
-                "family": "Pulliam",
-                "given": "Juliet R.C."
-            }
-        ],
-        "citation-key": "pearson2020",
-        "container-title": "Eurosurveillance",
-        "DOI": "10.2807/1560-7917.ES.2020.25.18.2000543",
-        "ISSN": "15607917",
-        "issue": "18",
-        "issued": {
-            "date-parts": [
-                [
-                    2020
-                ]
-            ]
-        },
-        "page": "1–6",
-        "PMID": "32400361",
-        "publisher": "European Centre for Disease Prevention and Control (ECDC)",
-        "title": "Projected early spread of COVID-19 in Africa through 1 June 2020",
-        "type": "article-journal",
-        "URL": "http://dx.doi.org/10.2807/1560-7917.ES.2020.25.18.2000543",
-        "volume": "25"
-    },
-    {
-        "id": "wang2020",
-        "abstract": "Coronavirus disease 2019 (COVID-19) was first identified in late 2019 in Wuhan, Hubei Province, China and spread globally in months, sparking worldwide concern. However, it is unclear whether super-spreading events occurred during the early outbreak phase, as has been observed for other emerging viruses. Here, we analyse 208 publicly available SARS-CoV-2 genome sequences collected during the early outbreak phase. We combine phylogenetic analysis with Bayesian inference under an epidemiological model to trace person-to-person transmission. The dispersion parameter of the offspring distribution in the inferred transmission chain was estimated to be 0.23 (95% CI: 0.13–0.38), indicating there are individuals who directly infected a disproportionately large number of people. Our results showed that super-spreading events played an important role in the early stage of the COVID-19 outbreak.",
-        "author": [
-            {
-                "family": "Wang",
-                "given": "Liang"
-            },
-            {
-                "family": "Didelot",
-                "given": "Xavier"
-            },
-            {
-                "family": "Yang",
-                "given": "Jing"
-            },
-            {
-                "family": "Wong",
-                "given": "Gary"
-            },
-            {
-                "family": "Shi",
-                "given": "Yi"
-            },
-            {
-                "family": "Liu",
-                "given": "Wenjun"
-            },
-            {
-                "family": "Gao",
-                "given": "George F."
-            },
-            {
-                "family": "Bi",
-                "given": "Yuhai"
-            }
-        ],
-        "citation-key": "wang2020",
-        "container-title": "Nature Communications",
-        "DOI": "10.1038/s41467-020-18836-4",
-        "ISSN": "20411723",
-        "issue": "1",
-        "issued": {
-            "date-parts": [
-                [
-                    2020
-                ]
-            ]
-        },
-        "page": "1–6",
-        "PMID": "33024095",
-        "publisher": "Springer US",
-        "title": "Inference of person-to-person transmission of COVID-19 reveals hidden super-spreading events during the early outbreak phase",
-        "type": "article-journal",
-        "URL": "http://dx.doi.org/10.1038/s41467-020-18836-4",
-        "volume": "11"
-    },
-    {
-        "id": "yadav2021",
-        "abstract": "In this review, we have discussed the different statistical modeling and prediction techniques for various infectious diseases including the recent pandemic of COVID-19. The distribution fitting, time series modeling along with predictive monitoring approaches, and epidemiological modeling are illustrated. When the epidemiology data is sufficient to fit with the required sample size, the normal distribution in general or other theoretical distributions are fitted and the best-fitted distribution is chosen for the prediction of the spread of the disease. The infectious diseases develop over time and we have data on the single variable that is the number of infections that happened, therefore, time series models are fitted and the prediction is done based on the best-fitted model. Monitoring approaches may also be applied to time series models which could estimate the parameters more precisely. In epidemiological modeling, more biological parameters are incorporated in the models and the forecasting of the disease spread is carried out. We came up with, how to improve the existing modeling methods, the use of fuzzy variables, and detection of fraud in the available data. Ultimately, we have reviewed the results of recent statistical modeling efforts to predict the course of COVID-19 spread.",
-        "author": [
-            {
-                "family": "Yadav",
-                "given": "Subhash Kumar"
-            },
-            {
-                "family": "Akhter",
-                "given": "Yusuf"
-            }
-        ],
-        "citation-key": "yadav2021",
-        "container-title": "Frontiers in Public Health",
-        "DOI": "10.3389/fpubh.2021.645405",
-        "ISSN": "22962565",
-        "issue": "June",
-        "issued": {
-            "date-parts": [
-                [
-                    2021
-                ]
-            ]
-        },
-        "page": "1–27",
-        "PMID": "34222166",
-        "title": "Statistical Modeling for the Prediction of Infectious Disease Dissemination With Special Reference to COVID-19 Spread",
-        "type": "article-journal",
-        "volume": "9"
-    }
+	{
+		"id": "abbott2020",
+		"type": "article-journal",
+		"container-title": "Wellcome open research",
+		"note": "publisher: The Wellcome Trust",
+		"title": "The transmissibility of novel Coronavirus in the early stages of the 2019-20 outbreak in Wuhan: Exploring initial point-source exposure sizes and durations using scenario analysis",
+		"volume": "5",
+		"author": [
+			{
+				"family": "Abbott",
+				"given": "Sam"
+			},
+			{
+				"family": "Hellewell",
+				"given": "Joel"
+			},
+			{
+				"family": "Munday",
+				"given": "James"
+			},
+			{
+				"family": "Funk",
+				"given": "Sebastian"
+			},
+			{
+				"family": "group",
+				"given": "CMMID",
+				"dropping-particle": "nCoV working"
+			},
+			{
+				"literal": "others"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "alene2021",
+		"type": "article-journal",
+		"abstract": "Background: Understanding the epidemiological parameters that determine the transmission dynamics of COVID-19 is essential for public health intervention. Globally, a number of studies were conducted to estimate the average serial interval and incubation period of COVID-19. Combining findings of existing studies that estimate the average serial interval and incubation period of COVID-19 significantly improves the quality of evidence. Hence, this study aimed to determine the overall average serial interval and incubation period of COVID-19. Methods: We followed the PRISMA checklist to present this study. A comprehensive search strategy was carried out from international electronic databases (Google Scholar, PubMed, Science Direct, Web of Science, CINAHL, and Cochrane Library) by two experienced reviewers (MAA and DBK) authors between the 1st of June and the 31st of July 2020. All observational studies either reporting the serial interval or incubation period in persons diagnosed with COVID-19 were included in this study. Heterogeneity across studies was assessed using the I2 and Higgins test. The NOS adapted for cross-sectional studies was used to evaluate the quality of studies. A random effect Meta-analysis was employed to determine the pooled estimate with 95% (CI). Microsoft Excel was used for data extraction and R software was used for analysis. Results: We combined a total of 23 studies to estimate the overall mean serial interval of COVID-19. The mean serial interval of COVID-19 ranged from 4. 2 to 7.5 days. Our meta-analysis showed that the weighted pooled mean serial interval of COVID-19 was 5.2 (95%CI: 4.9–5.5) days. Additionally, to pool the mean incubation period of COVID-19, we included 14 articles. The mean incubation period of COVID-19 also ranged from 4.8 to 9 days. Accordingly, the weighted pooled mean incubation period of COVID-19 was 6.5 (95%CI: 5.9–7.1) days. Conclusions: This systematic review and meta-analysis showed that the weighted pooled mean serial interval and incubation period of COVID-19 were 5.2, and 6.5 days, respectively. In this study, the average serial interval of COVID-19 is shorter than the average incubation period, which suggests that substantial numbers of COVID-19 cases will be attributed to presymptomatic transmission.",
+		"container-title": "BMC Infectious Diseases",
+		"DOI": "10.1186/s12879-021-05950-x",
+		"ISSN": "14712334",
+		"issue": "1",
+		"note": "publisher: BMC Infectious Diseases\nPMID: 33706702",
+		"page": "1–9",
+		"title": "Serial interval and incubation period of COVID-19: a systematic review and meta-analysis",
+		"volume": "21",
+		"author": [
+			{
+				"family": "Alene",
+				"given": "Muluneh"
+			},
+			{
+				"family": "Yismaw",
+				"given": "Leltework"
+			},
+			{
+				"family": "Assemie",
+				"given": "Moges Agazhe"
+			},
+			{
+				"family": "Ketema",
+				"given": "Daniel Bekele"
+			},
+			{
+				"family": "Gietaneh",
+				"given": "Wodaje"
+			},
+			{
+				"family": "Birhan",
+				"given": "Tilahun Yemanu"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2021"
+				]
+			]
+		}
+	},
+	{
+		"id": "allen2012",
+		"type": "article-journal",
+		"abstract": "The basic reproduction number, ℛ(0), one of the most well-known thresholds in deterministic epidemic theory, predicts a disease outbreak if ℛ(0)>1. In stochastic epidemic theory, there are also thresholds that predict a major outbreak. In the case of a single infectious group, if ℛ(0)>1 and i infectious individuals are introduced into a susceptible population, then the probability of a major outbreak is approximately 1-(1/ℛ(0))( i ). With multiple infectious groups from which the disease could emerge, this result no longer holds. Stochastic thresholds for multiple groups depend on the number of individuals within each group, i ( j ), j=1, \\ldots, n, and on the probability of disease extinction for each group, q ( j ). It follows from multitype branching processes that the probability of a major outbreak is approximately [Formula: see text]. In this investigation, we summarize some of the deterministic and stochastic threshold theory, illustrate how to calculate the stochastic thresholds, and derive some new relationships between the deterministic and stochastic thresholds.",
+		"container-title": "Journal of Biological Dynamics",
+		"DOI": "10.1080/17513758.2012.665502",
+		"ISSN": "17513758",
+		"issue": "2",
+		"page": "590–611",
+		"title": "Extinction thresholds in deterministic and stochastic epidemic models",
+		"volume": "6",
+		"author": [
+			{
+				"family": "Allen",
+				"given": "Linda J.S."
+			},
+			{
+				"family": "Lahodny",
+				"given": "Glenn E."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2012"
+				]
+			]
+		}
+	},
+	{
+		"id": "blumberg2013",
+		"type": "article-journal",
+		"abstract": "Many diseases exhibit subcritical transmission (i.e. 0<R0<1) so that infections occur as self-limited 'stuttering chains'. Given an ensemble of stuttering chains, information about the number of cases in each chain can be used to infer R0, which is of crucial importance for monitoring the risk that a disease will emerge to establish endemic circulation. However, the challenge of imperfect case detection has led authors to adopt a variety of work-around measures when inferring R0, such as discarding data on isolated cases or aggregating intermediate-sized chains together. Each of these methods has the potential to introduce bias, but a quantitative comparison of these approaches has not been reported. By adapting a model based on a negative binomial offspring distribution that permits a variable degree of transmission heterogeneity, we present a unified analysis of existing R0 estimation methods. Simulation studies show that the degree of transmission heterogeneity, when improperly modeled, can significantly impact the bias of R0 estimation methods designed for imperfect observation. These studies also highlight the importance of isolated cases in assessing whether an estimation technique is consistent with observed data. Analysis of data from measles outbreaks shows that likelihood scores are highest for models that allow a flexible degree of transmission heterogeneity. Aggregating intermediate sized chains often has similar performance to analyzing a complete chain size distribution. However, truncating isolated cases is beneficial only when surveillance systems clearly favor full observation of large chains but not small chains. Meanwhile, if data on the type and proportion of cases that are unobserved were known, we demonstrate that maximum likelihood inference of R0 could be adjusted accordingly. This motivates the need for future empirical and theoretical work to quantify observation error and incorporate relevant mechanisms into stuttering chain models used to estimate transmission parameters. © 2013 Elsevier B.V.",
+		"container-title": "Epidemics",
+		"DOI": "10.1016/j.epidem.2013.05.002",
+		"ISSN": "17554365",
+		"issue": "3",
+		"note": "publisher: Elsevier B.V.\nPMID: 24021520",
+		"page": "131–145",
+		"title": "Comparing methods for estimating R0 from the size distribution of subcritical transmission chains",
+		"URL": "http://dx.doi.org/10.1016/j.epidem.2013.05.002",
+		"volume": "5",
+		"author": [
+			{
+				"family": "Blumberg",
+				"given": "S."
+			},
+			{
+				"family": "Lloyd-Smith",
+				"given": "J. O."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2013"
+				]
+			]
+		}
+	},
+	{
+		"id": "blumberg2013a",
+		"type": "article-journal",
+		"abstract": "For many infectious disease processes such as emerging zoonoses and vaccine-preventable diseases, 0<R0<1 and infections occur as self-limited stuttering transmission chains. A mechanistic understanding of transmission is essential for characterizing the risk of emerging diseases and monitoring spatio-temporal dynamics. Thus methods for inferring R0 and the degree of heterogeneity in transmission from stuttering chain data have important applications in disease surveillance and management. Previous researchers have used chain size distributions to infer R0, but estimation of the degree of individual-level variation in infectiousness (as quantified by the dispersion parameter, k) has typically required contact tracing data. Utilizing branching process theory along with a negative binomial offspring distribution, we demonstrate how maximum likelihood estimation can be applied to chain size data to infer both R0 and the dispersion parameter that characterizes heterogeneity. While the maximum likelihood value for R0 is a simple function of the average chain size, the associated confidence intervals are dependent on the inferred degree of transmission heterogeneity. As demonstrated for monkeypox data from the Democratic Republic of Congo, this impacts when a statistically significant change in R0 is detectable. In addition, by allowing for superspreading events, inference of k shifts the threshold above which a transmission chain should be considered anomalously large for a given value of R0 (thus reducing the probability of false alarms about pathogen adaptation). Our analysis of monkeypox also clarifies the various ways that imperfect observation can impact inference of transmission parameters, and highlights the need to quantitatively evaluate whether observation is likely to significantly bias results.",
+		"container-title": "PLoS Computational Biology",
+		"DOI": "10.1371/journal.pcbi.1002993",
+		"ISSN": "15537358",
+		"issue": "5",
+		"note": "PMID: 23658504",
+		"page": "1–17",
+		"title": "Inference of R0 and Transmission Heterogeneity from the Size Distribution of Stuttering Chains",
+		"volume": "9",
+		"author": [
+			{
+				"family": "Blumberg",
+				"given": "Seth"
+			},
+			{
+				"family": "Lloyd-Smith",
+				"given": "James O."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2013"
+				]
+			]
+		}
+	},
+	{
+		"id": "chen2022",
+		"type": "article-journal",
+		"abstract": "The generation time distribution, reflecting the time between successive infections in transmission chains, is a key epidemiological parameter for describing COVID-19 transmission dynamics. However, because exact infection times are rarely known, it is often approximated by the serial interval distribution. This approximation holds under the assumption that infectors and infectees share the same incubation period distribution, which may not always be true. We estimated incubation period and serial interval distributions using 629 transmission pairs reconstructed by investigating 2989 confirmed cases in China in January-February 2020, and developed an inferential framework to estimate the generation time distribution that accounts for variation over time due to changes in epidemiology, sampling biases and public health and social measures. We identified substantial reductions over time in the serial interval and generation time distributions. Our proposed method provides more reliable estimation of the temporal variation in the generation time distribution, improving assessment of transmission dynamics.",
+		"container-title": "Nature Communications",
+		"DOI": "10.1038/s41467-022-35496-8",
+		"ISSN": "20411723",
+		"issue": "1",
+		"note": "publisher: Springer US",
+		"title": "Inferring time-varying generation time, serial interval, and incubation period distributions for COVID-19",
+		"volume": "13",
+		"author": [
+			{
+				"family": "Chen",
+				"given": "Dongxuan"
+			},
+			{
+				"family": "Lau",
+				"given": "Yiu Chung"
+			},
+			{
+				"family": "Xu",
+				"given": "Xiao Ke"
+			},
+			{
+				"family": "Wang",
+				"given": "Lin"
+			},
+			{
+				"family": "Du",
+				"given": "Zhanwei"
+			},
+			{
+				"family": "Tsang",
+				"given": "Tim K."
+			},
+			{
+				"family": "Wu",
+				"given": "Peng"
+			},
+			{
+				"family": "Lau",
+				"given": "Eric H.Y."
+			},
+			{
+				"family": "Wallinga",
+				"given": "Jacco"
+			},
+			{
+				"family": "Cowling",
+				"given": "Benjamin J."
+			},
+			{
+				"family": "Ali",
+				"given": "Sheikh Taslim"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2022"
+				]
+			]
+		}
+	},
+	{
+		"id": "farrington1999",
+		"type": "article-journal",
+		"abstract": "We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.",
+		"container-title": "Journal of Applied Probability",
+		"DOI": "10.1239/jap/1032374633",
+		"ISSN": "00219002",
+		"issue": "3",
+		"page": "771–779",
+		"title": "The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease",
+		"volume": "36",
+		"author": [
+			{
+				"family": "Farrington",
+				"given": "C. P."
+			},
+			{
+				"family": "Grant",
+				"given": "A. D."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"1999"
+				]
+			]
+		}
+	},
+	{
+		"id": "farrington2003",
+		"type": "article-journal",
+		"abstract": "Mass vaccination programmes aim to maintain the effective reproduction number R of an infection below unity. We describe methods for monitoring the value of R using surveillance data. The models are based on branching processes in which R is identified with the offspring mean. We derive unconditional likelihoods for the offspring mean using data on outbreak size and outbreak duration. We also discuss Bayesian methods, implemented by Metropolis-Hastings sampling. We investigate by simulation the validity of the models with respect to depletion of susceptibles and under-ascertainment of cases. The methods are illustrated using surveillance data on measles in the USA.",
+		"container-title": "Biostatistics (Oxford, England)",
+		"DOI": "10.1093/biostatistics/4.2.279",
+		"ISSN": "14654644",
+		"issue": "2",
+		"page": "279–295",
+		"title": "Branching process models for surveillance of infectious diseases controlled by mass vaccination.",
+		"volume": "4",
+		"author": [
+			{
+				"family": "Farrington",
+				"given": "C. P."
+			},
+			{
+				"family": "Kanaan",
+				"given": "M. N."
+			},
+			{
+				"family": "Gay",
+				"given": "N. J."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2003"
+				]
+			]
+		}
+	},
+	{
+		"id": "fine2003",
+		"type": "article-journal",
+		"abstract": "The interval between successive cases of an infectious disease is determined by the time from infection to infectiousness, the duration of infectiousness, the time from infection to disease onset (incubation period), the duration of any extra-human phase of the infectious agent, and the proportion clinically affected among infected individuals. The interval is important in the interpretation of infectious disease surveillance and trend data, in the identification of outbreaks, and in the optimization of quarantine and contact tracing. This paper discusses the properties of these intervals, as measured between transmission events or between clinical onsets of successive infected individuals, noting the determinants of their ranges and frequency distributions, the circumstances under which secondary cases may arise before primaries, and under which the infection transmission interval will be different from the interval between clinical onsets of successive cases. It discusses the derivation of interval distribution statistics from descriptive data given in standard textbooks, with illustrations from published data on outbreaks, households, and epidemiologic tracing. Finally, it discusses the implications of such measures for studies of secondary attack rates, for the persistence of infection in human communities, for outbreak response, and for elimination or eradication programs.",
+		"container-title": "American Journal of Epidemiology",
+		"DOI": "10.1093/aje/kwg251",
+		"ISSN": "00029262",
+		"issue": "11",
+		"note": "ISBN: 0002-9262 (Print) 0002-9262 (Linking)\nPMID: 14630599",
+		"page": "1039–1047",
+		"title": "The Interval between Successive Cases of an Infectious Disease",
+		"volume": "158",
+		"author": [
+			{
+				"family": "Fine",
+				"given": "Paul E.M."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2003"
+				]
+			]
+		}
+	},
+	{
+		"id": "grassly2006",
+		"type": "article-journal",
+		"abstract": "Seasonal change in the incidence of infectious diseases is a common phenomenon in both temperate and tropical climates. However, the mechanisms responsible for seasonal disease incidence, and the epidemiological consequences of seasonality, are poorly understood with rare exception. Standard epidemiological theory and concepts such as the basic reproductive number R 0 no longer apply, and the implications for interventions that themselves may be periodic, such as pulse vaccination, have not been formally examined. This paper examines the causes and consequences of seasonality, and in so doing derives several new results concerning vaccination strategy and the interpretation of disease outbreak data. It begins with a brief review of published scientific studies in support of different causes of seasonality in infectious diseases of humans, identifying four principal mechanisms and their association with different routes of transmission. It then describes the consequences of seasonality for R 0 , disease outbreaks, endemic dynamics and persistence. Finally, a mathematical analysis of routine and pulse vaccination programmes for seasonal infections is presented. The synthesis of seasonal infectious disease epidemiology attempted by this paper highlights the need for further empirical and theoretical work. © 2006 The Royal Society.",
+		"container-title": "Proceedings of the Royal Society B: Biological Sciences",
+		"DOI": "10.1098/rspb.2006.3604",
+		"ISSN": "14712970",
+		"issue": "1600",
+		"page": "2541–2550",
+		"title": "Seasonal infectious disease epidemiology",
+		"volume": "273",
+		"author": [
+			{
+				"family": "Grassly",
+				"given": "Nicholas C."
+			},
+			{
+				"family": "Fraser",
+				"given": "Christophe"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2006"
+				]
+			]
+		}
+	},
+	{
+		"id": "griffin2020",
+		"type": "article-journal",
+		"abstract": "The serial interval is the time between symptom onsets in an infector-infectee pair. The generation time, also known as the generation interval, is the time between infection events in an infector-infectee pair. The serial interval and the generation time are key parameters for assessing the dynamics of a disease. A number of scientific papers reported information pertaining to the serial interval and/or generation time for COVID-19. Objective Conduct a review of available evidence to advise on appropriate parameter values for serial interval and generation time in national COVID-19 transmission models for Ireland and on methodological issues relating to those parameters. Methods We conducted a rapid review of the literature covering the period 1 January 2020 and 21 August 2020, following predefined eligibility criteria. Forty scientific papers met our inclusion criteria and were included in the review. Results The mean of the serial interval ranged from 3.03 to 7.6 days, based on 38 estimates, and the median from 1.0 to 6.0 days (based on 15 estimates). Only three estimates were provided for the mean of the generation time. These ranged from 3.95 to 5.20 days. One estimate of 5.0 days was provided for the median of the generation time. Discussion Estimates of the serial interval and the generation time are very dependent on the specific factors that apply at the time that the data are collected, including the level of social contact. Consequently, the estimates may not be entirely relevant to other environments. Therefore, local estimates should be obtained as soon as possible. Careful consideration should be given to the methodology that is used. Real-time estimations of the serial interval/generation time, allowing for variations over time, may provide more accurate estimates of reproduction numbers than using conventionally fixed serial interval/generation time distributions.",
+		"container-title": "BMJ Open",
+		"DOI": "10.1136/bmjopen-2020-040263",
+		"ISSN": "20446055",
+		"issue": "11",
+		"note": "ISBN: 9789241512763\nPMID: 33234640",
+		"page": "1–9",
+		"title": "Rapid review of available evidence on the serial interval and generation time of COVID-19",
+		"volume": "10",
+		"author": [
+			{
+				"family": "Griffin",
+				"given": "John"
+			},
+			{
+				"family": "Casey",
+				"given": "Miriam"
+			},
+			{
+				"family": "Collins",
+				"given": "Áine"
+			},
+			{
+				"family": "Hunt",
+				"given": "Kevin"
+			},
+			{
+				"family": "McEvoy",
+				"given": "David"
+			},
+			{
+				"family": "Byrne",
+				"given": "Andrew"
+			},
+			{
+				"family": "McAloon",
+				"given": "Conor"
+			},
+			{
+				"family": "Barber",
+				"given": "Ann"
+			},
+			{
+				"family": "Lane",
+				"given": "Elizabeth Ann"
+			},
+			{
+				"family": "More",
+				"given": "Simon"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "jacob2010",
+		"type": "article-journal",
+		"abstract": "Branching processes are stochastic individual-based processes leading consequently to a bottom-up approach. In addition, since the state variables are random integer variables (representing population sizes), the extinction occurs at random finite time on the extinction set, thus leading to fine and realistic predictions. Starting from the simplest and well-known single-type Bienaymé-Galton-Watson branching process that was used by several authors for approximating the beginning of an epidemic, we then present a general branching model with age and population dependent individual transitions. However contrary to the classical Bienaymé-Galton-Watson or asymptotically Bienaymé-Galton-Watson setting, where the asymptotic behavior of the process, as time tends to infinity, is well understood, the asymptotic behavior of this general process is a new question. Here we give some solutions for dealing with this problem depending on whether the initial population size is large or small, and whether the disease is rare or non-rare when the initial population size is large.",
+		"container-title": "International Journal of Environmental Research and Public Health",
+		"DOI": "10.3390/ijerph7031204",
+		"ISSN": "16604601",
+		"issue": "3",
+		"page": "1186–1204",
+		"title": "Branching processes: Their role in epidemiology",
+		"volume": "7",
+		"author": [
+			{
+				"family": "Jacob",
+				"given": "Christine"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2010"
+				]
+			]
+		}
+	},
+	{
+		"id": "lehtinen2021",
+		"type": "article-journal",
+		"abstract": "The timing of transmission plays a key role in the dynamics and controllability of an epidemic. However, observing generation times - the time interval between the infection of an infector and an infectee in a transmission pair - requires data on infection times, which are generally unknown. The timing of symptom onset is more easily observed; generation times are therefore often estimated based on serial intervals - the time interval between symptom onset of an infector and an infectee. This estimation follows one of two approaches: (i) approximating the generation time distribution by the serial interval distribution or (ii) deriving the generation time distribution from the serial interval and incubation period - the time interval between infection and symptom onset in a single individual - distributions. These two approaches make different - and not always explicitly stated - assumptions about the relationship between infectiousness and symptoms, resulting in different generation time distributions with the same mean but unequal variances. Here, we clarify the assumptions that each approach makes and show that neither set of assumptions is plausible for most pathogens. However, the variances of the generation time distribution derived under each assumption can reasonably be considered as upper (approximation with serial interval) and lower (derivation from serial interval) bounds. Thus, we suggest a pragmatic solution is to use both approaches and treat these as edge cases in downstream analysis. We discuss the impact of the variance of the generation time distribution on the controllability of an epidemic through strategies based on contact tracing, and we show that underestimating this variance is likely to overestimate controllability.",
+		"container-title": "Journal of the Royal Society Interface",
+		"DOI": "10.1098/rsif.2020.0756",
+		"ISSN": "17425662",
+		"issue": "174",
+		"note": "PMID: 33402022",
+		"title": "On the relationship between serial interval, infectiousness profile and generation time: On the relationship between serial interval, infectiousness profile and generation time",
+		"volume": "18",
+		"author": [
+			{
+				"family": "Lehtinen",
+				"given": "Sonja"
+			},
+			{
+				"family": "Ashcroft",
+				"given": "Peter"
+			},
+			{
+				"family": "Bonhoeffer",
+				"given": "Sebastian"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2021"
+				]
+			]
+		}
+	},
+	{
+		"id": "limpert2001",
+		"type": "article-journal",
+		"abstract": "On the charms of statistics, and how mechanical models resembling gambling machines offer a link to a handy way to characterize log-normal distributions, which can provide deeper insight into variability and probability - Normal or log-normal: That is the question.",
+		"container-title": "BioScience",
+		"DOI": "10.1641/0006-3568(2001)051[0341:LNDATS]2.0.CO;2",
+		"ISSN": "00063568",
+		"issue": "5",
+		"page": "341–352",
+		"title": "Log-normal distributions across the sciences: Keys and clues",
+		"volume": "51",
+		"author": [
+			{
+				"family": "Limpert",
+				"given": "Eckhard"
+			},
+			{
+				"family": "Stahel",
+				"given": "Werner A."
+			},
+			{
+				"family": "Abbt",
+				"given": "Markus"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2001"
+				]
+			]
+		}
+	},
+	{
+		"id": "lloyd-smith2005",
+		"type": "article-journal",
+		"abstract": "Population-level analyses often use average quantities to describe heterogeneous systems, particularly when variation does not arise from identifiable groups. A prominent example, central to our current understanding of epidemic spread, is the basic reproductive number, R0, which is defined as the mean number of infections caused by an infected individual in a susceptible population. Population estimates of R0 can obscure considerable individual variation in infectiousness, as highlighted during the global emergence of severe acute respiratory syndrome (SARS) by numerous 'superspreading events' in which certain individuals infected unusually large numbers of secondary cases. For diseases transmitted by non-sexual direct contacts, such as SARS or smallpox, individual variation is difficult to measure empirically, and thus its importance for outbreak dynamics has been unclear. Here we present an integrated theoretical and statistical analysis of the influence of individual variation in infectiousness on disease emergence. Using contact tracing data from eight directly transmitted diseases, we show that the distribution of individual infectiousness around R0 is often highly skewed. Model predictions accounting for this variation differ sharply from average-based approaches, with disease extinction more likely and outbreaks rarer but more explosive. Using these models, we explore implications for outbreak control, showing that individual-specific control measures outperform population-wide measures. Moreover, the dramatic improvements achieved through targeted control policies emphasize the need to identify predictive correlates of higher infectiousness. Our findings indicate that superspreading is a normal feature of disease spread, and to frame ongoing discussion we propose a rigorous definition for superspreading events and a method to predict their frequency. © 2005 Nature Publishing Group.",
+		"container-title": "Nature",
+		"DOI": "10.1038/nature04153",
+		"ISSN": "14764687",
+		"issue": "7066",
+		"note": "PMID: 16292310",
+		"page": "355–359",
+		"title": "Superspreading and the effect of individual variation on disease emergence",
+		"volume": "438",
+		"author": [
+			{
+				"family": "Lloyd-Smith",
+				"given": "J. O."
+			},
+			{
+				"family": "Schreiber",
+				"given": "S. J."
+			},
+			{
+				"family": "Kopp",
+				"given": "P. E."
+			},
+			{
+				"family": "Getz",
+				"given": "W. M."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2005"
+				]
+			]
+		}
+	},
+	{
+		"id": "marivate2020",
+		"type": "article-journal",
+		"container-title": "arXiv preprint arXiv:2004.04813",
+		"title": "Use of available data to inform the COVID-19 outbreak in South Africa: a case study",
+		"author": [
+			{
+				"family": "Marivate",
+				"given": "Vukosi"
+			},
+			{
+				"family": "Combrink",
+				"given": "Herkulaas MvE"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "nishiura2007",
+		"type": "article-journal",
+		"abstract": "The incubation period of infectious diseases, the time from infection with a microorganism to onset of disease, is directly relevant to prevention and control. Since explicit models of the incubation period enhance our understanding of the spread of disease, previous classic studies were revisited, focusing on the modeling methods employed and paying particular attention to relatively unknown historical efforts. The earliest study on the incubation period of pandemic influenza was published in 1919, providing estimates of the incubation period of Spanish flu using the daily incidence on ships departing from several ports in Australia. Although the study explicitly dealt with an unknown time of exposure, the assumed periods of exposure, which had an equal probability of infection, were too long, and thus, likely resulted in slight underestimates of the incubation period. After the suggestion that the incubation period follows lognormal distribution, Japanese epidemiologists extended this assumption to estimates of the time of exposure during a point source outbreak. Although the reason why the incubation period of acute infectious diseases tends to reveal a right-skewed distribution has been explored several times, the validity of the lognormal assumption is yet to be fully clarified. At present, various different distributions are assumed, and the lack of validity in assuming lognormal distribution is particularly apparent in the case of slowly progressing diseases. The present paper indicates that (1) analysis using well-defined short periods of exposure with appropriate statistical methods is critical when the exact time of exposure is unknown, and (2) when assuming a specific distribution for the incubation period, comparisons using different distributions are needed in addition to estimations using different datasets, analyses of the determinants of incubation period, and an understanding of the underlying disease mechanisms. © 2007 Nishiura; licensee BioMed Central Ltd.",
+		"container-title": "Emerging Themes in Epidemiology",
+		"DOI": "10.1186/1742-7622-4-2",
+		"ISSN": "17427622",
+		"page": "1–12",
+		"title": "Early efforts in modeling the incubation period of infectious diseases with an acute course of illness",
+		"volume": "4",
+		"author": [
+			{
+				"family": "Nishiura",
+				"given": "Hiroshi"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2007"
+				]
+			]
+		}
+	},
+	{
+		"id": "nishiura2012",
+		"type": "article-journal",
+		"abstract": "Use of the final size distribution of minor outbreaks for the estimation of the reproduction numbers of supercritical epidemic processes has yet to be considered. We used a branching process model to derive the final size distribution of minor outbreaks, assuming a reproduction number above unity, and applying the method to final size data for pneumonic plague. Pneumonic plague is a rare disease with only one documented major epidemic in a spatially limited setting. Because the final size distribution of a minor outbreak needs to be normalized by the probability of extinction, we assume that the dispersion parameter (k) of the negative-binomial offspring distribution is known, and examine the sensitivity of the reproduction number to variation in dispersion. Assuming a geometric offspring distribution with k=1, the reproduction number was estimated at 1.16 (95% confidence interval: 0.97-1.38). When less dispersed with k=2, the maximum likelihood estimate of the reproduction number was 1.14. These estimates agreed with those published from transmission network analysis, indicating that the human-to-human transmission potential of the pneumonic plague is not very high. Given only minor outbreaks, transmission potential is not sufficiently assessed by directly counting the number of offspring. Since the absence of a major epidemic does not guarantee a subcritical process, the proposed method allows us to conservatively regard epidemic data from minor outbreaks as supercritical, and yield estimates of threshold values above unity. © 2011.",
+		"container-title": "Journal of Theoretical Biology",
+		"DOI": "10.1016/j.jtbi.2011.10.039",
+		"ISSN": "00225193",
+		"note": "publisher: Elsevier\nPMID: 22079419",
+		"page": "48–55",
+		"title": "Estimating the transmission potential of supercritical processes based on the final size distribution of minor outbreaks",
+		"URL": "http://dx.doi.org/10.1016/j.jtbi.2011.10.039",
+		"volume": "294",
+		"author": [
+			{
+				"family": "Nishiura",
+				"given": "Hiroshi"
+			},
+			{
+				"family": "Yan",
+				"given": "Ping"
+			},
+			{
+				"family": "Sleeman",
+				"given": "Candace K."
+			},
+			{
+				"family": "Mode",
+				"given": "Charles J."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2012"
+				]
+			]
+		}
+	},
+	{
+		"id": "pearson2020",
+		"type": "article-journal",
+		"abstract": "For 45 African countries/territories already reporting COVID-19 cases before 23 March 2020, we estimate the dates of reporting 1,000 and 10,000 cases. Assuming early epidemic trends without interventions, all 45 were likely to exceed 1,000 confirmed cases by the end of April 2020, with most exceeding 10,000 a few weeks later.",
+		"container-title": "Eurosurveillance",
+		"DOI": "10.2807/1560-7917.ES.2020.25.18.2000543",
+		"ISSN": "15607917",
+		"issue": "18",
+		"note": "publisher: European Centre for Disease Prevention and Control (ECDC)\nPMID: 32400361",
+		"page": "1–6",
+		"title": "Projected early spread of COVID-19 in Africa through 1 June 2020",
+		"URL": "http://dx.doi.org/10.2807/1560-7917.ES.2020.25.18.2000543",
+		"volume": "25",
+		"author": [
+			{
+				"family": "Pearson",
+				"given": "Carl A.B."
+			},
+			{
+				"family": "Schalkwyk",
+				"given": "Cari",
+				"non-dropping-particle": "van"
+			},
+			{
+				"family": "Foss",
+				"given": "Anna M."
+			},
+			{
+				"family": "O'Reilly",
+				"given": "Kathleen M."
+			},
+			{
+				"family": "Pulliam",
+				"given": "Juliet R.C."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "becker1977",
+		"type": "article-journal",
+		"container-title": "Biometrics",
+		"ISSN": "0006-341X",
+		"issue": "3",
+		"note": "publisher: JSTOR",
+		"page": "515–522",
+		"title": "Estimation for discrete time branching processes with application to epidemics",
+		"volume": "33",
+		"author": [
+			{
+				"family": "Becker",
+				"given": "Niels"
+			},
+			{
+				"family": "Society",
+				"given": "International Biometric"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"1977"
+				]
+			]
+		}
+	},
+	{
+		"id": "wang2020",
+		"type": "article-journal",
+		"abstract": "Coronavirus disease 2019 (COVID-19) was first identified in late 2019 in Wuhan, Hubei Province, China and spread globally in months, sparking worldwide concern. However, it is unclear whether super-spreading events occurred during the early outbreak phase, as has been observed for other emerging viruses. Here, we analyse 208 publicly available SARS-CoV-2 genome sequences collected during the early outbreak phase. We combine phylogenetic analysis with Bayesian inference under an epidemiological model to trace person-to-person transmission. The dispersion parameter of the offspring distribution in the inferred transmission chain was estimated to be 0.23 (95% CI: 0.13–0.38), indicating there are individuals who directly infected a disproportionately large number of people. Our results showed that super-spreading events played an important role in the early stage of the COVID-19 outbreak.",
+		"container-title": "Nature Communications",
+		"DOI": "10.1038/s41467-020-18836-4",
+		"ISSN": "20411723",
+		"issue": "1",
+		"note": "publisher: Springer US\nPMID: 33024095",
+		"page": "1–6",
+		"title": "Inference of person-to-person transmission of COVID-19 reveals hidden super-spreading events during the early outbreak phase",
+		"URL": "http://dx.doi.org/10.1038/s41467-020-18836-4",
+		"volume": "11",
+		"author": [
+			{
+				"family": "Wang",
+				"given": "Liang"
+			},
+			{
+				"family": "Didelot",
+				"given": "Xavier"
+			},
+			{
+				"family": "Yang",
+				"given": "Jing"
+			},
+			{
+				"family": "Wong",
+				"given": "Gary"
+			},
+			{
+				"family": "Shi",
+				"given": "Yi"
+			},
+			{
+				"family": "Liu",
+				"given": "Wenjun"
+			},
+			{
+				"family": "Gao",
+				"given": "George F."
+			},
+			{
+				"family": "Bi",
+				"given": "Yuhai"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "yadav2021",
+		"type": "article-journal",
+		"abstract": "In this review, we have discussed the different statistical modeling and prediction techniques for various infectious diseases including the recent pandemic of COVID-19. The distribution fitting, time series modeling along with predictive monitoring approaches, and epidemiological modeling are illustrated. When the epidemiology data is sufficient to fit with the required sample size, the normal distribution in general or other theoretical distributions are fitted and the best-fitted distribution is chosen for the prediction of the spread of the disease. The infectious diseases develop over time and we have data on the single variable that is the number of infections that happened, therefore, time series models are fitted and the prediction is done based on the best-fitted model. Monitoring approaches may also be applied to time series models which could estimate the parameters more precisely. In epidemiological modeling, more biological parameters are incorporated in the models and the forecasting of the disease spread is carried out. We came up with, how to improve the existing modeling methods, the use of fuzzy variables, and detection of fraud in the available data. Ultimately, we have reviewed the results of recent statistical modeling efforts to predict the course of COVID-19 spread.",
+		"container-title": "Frontiers in Public Health",
+		"DOI": "10.3389/fpubh.2021.645405",
+		"ISSN": "22962565",
+		"issue": "June",
+		"note": "PMID: 34222166",
+		"page": "1–27",
+		"title": "Statistical Modeling for the Prediction of Infectious Disease Dissemination With Special Reference to COVID-19 Spread",
+		"volume": "9",
+		"author": [
+			{
+				"family": "Yadav",
+				"given": "Subhash Kumar"
+			},
+			{
+				"family": "Akhter",
+				"given": "Yusuf"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2021"
+				]
+			]
+		}
+	}
 ]
\ No newline at end of file

From 8c55ac6488664e53b3beed88b29f2210a5be209e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 22 May 2023 13:10:47 +0100
Subject: [PATCH 232/828] changed the bibtex keys to align with new library

---
 vignettes/projecting_incidence.Rmd | 10 +++++-----
 1 file changed, 5 insertions(+), 5 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 26fb725f..1dc7246c 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -33,7 +33,7 @@ Branching processes can be used to project infectious disease trends in time
 provided we can characterize the distribution of times between 
 successive cases (serial interval), and the distribution of secondary cases 
 produced by a single individual (offspring distribution). Such simulations can 
-be achieved in `bpmodels` with the `chain_sim()` function and @Pearson2020, and 
+be achieved in `bpmodels` with the `chain_sim()` function and @pearson2020, and 
 @abbott2020 illustrate its application to COVID-19. 
 
 The purpose of this vignette is to use early data on COVID-19 in South Africa 
@@ -98,11 +98,11 @@ start_times
 
 The log-normal distribution is commonly used in epidemiology to characterise 
 quantities such as the serial interval because it has a large variance 
-and can only be positive-valued [@Nishiura2007; @Limpert2001]. 
+and can only be positive-valued [@nishiura2007; @limpert2001]. 
 
 In this example, we will assume based on COVID-19 literature that the 
 serial interval, S, is log-normal distributed with parameters, 
-$\mu = 4.7$ and $\sigma = 2.9$ [@Pearson2020]. Note that when the distribution
+$\mu = 4.7$ and $\sigma = 2.9$ [@pearson2020]. Note that when the distribution
 is described this way, it means $\mu$ and $\sigma$ are the expected value 
 and standard deviation of the natural logarithm of the serial interval. Hence, 
 in order to sample the "back-transformed" measured serial interval with 
@@ -145,11 +145,11 @@ serial_interval <- function(sample_size) {
 
 The negative binomial distribution is commonly used in epidemiology to
 account for individual variation in transmissibility, 
-also known as superspreading [@Lloyd-Smith2005a].
+also known as superspreading [@lloyd-smith2005].
 
 For this example, we will assume that the offspring distribution is 
 characterised by a negative binomial with $R = 2.5$ [@abbott2020] and 
-$k = 0.58$ [@Wang2020]. In this parameterization, $R$ 
+$k = 0.58$ [@wang2020]. In this parameterization, $R$ 
 represents the $R_0$, which is defined as the average number of 
 cases produced by a single individual in an entirely susceptible population. 
 The parameter $k$ represents superspreading, that is, the degree of 

From ae24211d5708d4c14501feb2cf88b96703530338 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 22 May 2023 13:12:21 +0100
Subject: [PATCH 233/828] replaced the function  with to reflect change in
 epiparameter's development

---
 vignettes/projecting_incidence.Rmd | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 1dc7246c..4a840091 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -120,10 +120,10 @@ See [Wikipedia](https://en.wikipedia.org/wiki/Log-normal_distribution) for a
 detailed explanation of this parametrisation.
 
 The [epiparameter](https://github.com/epiverse-trace/epiparameter) R package 
-provides the function `epiparameter::lnorm_meansd2musigma()` for implementing 
+provides the function `epiparameter::lnorm_meansd2meanlogsdlog()` for implementing 
 this parametrisation. It takes as inputs the mean, $\mu$ and standard 
 deviation, $\sigma$ and returns a list with the transformed mean and 
-standard deviation. Refer to `?epiparameter::lnorm_meansd2musigma` 
+standard deviation. Refer to `?epiparameter::lnorm_meansd2meanlogsdlog` 
 for more details.
 
 Let us set up the serial interval function with the appropriate inputs:
@@ -131,8 +131,8 @@ Let us set up the serial interval function with the appropriate inputs:
 mu <- 4.7
 sgma <- 2.9
 
-log_mean <- lnorm_meansd2musigma(mu, sgma)[[1]]  # log mean
-log_sd <- lnorm_meansd2musigma(mu, sgma)[[2]] # log sd
+log_mean <- lnorm_meansd2meanlogsdlog(mu, sgma)[[1]]  # log mean
+log_sd <- lnorm_meansd2meanlogsdlog(mu, sgma)[[2]] # log sd
 
 #' serial interval function
 serial_interval <- function(sample_size) {

From a2c7dd729c82d34901e9c8a5e600637da294d277 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 22 May 2023 13:25:20 +0100
Subject: [PATCH 234/828] reset github linguist

---
 .gitattributes | 3 ---
 1 file changed, 3 deletions(-)

diff --git a/.gitattributes b/.gitattributes
index d08dc7c9..e69de29b 100644
--- a/.gitattributes
+++ b/.gitattributes
@@ -1,3 +0,0 @@
-*.[Rr]md linguist-language=R
-*.md linguist-language=R
-vignettes/* linguist-documentation
\ No newline at end of file

From df757ada37c48d8be56befa3526bcc47b074e5c5 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Sat, 27 May 2023 22:20:23 +0100
Subject: [PATCH 235/828] replaced bpmodels with epichains everywhere

---
 .github/ISSUE_TEMPLATE/bug_report.md          |  2 +-
 DESCRIPTION                                   |  6 +--
 R/{bpmodels-package.R => epichains-package.R} |  0
 README.Rmd                                    | 32 +++++++--------
 README.md                                     | 41 +++++++++----------
 inst/CITATION                                 | 10 ++---
 ...models-package.Rd => epichains-package.Rd} | 16 ++++----
 vignettes/projecting_incidence.Rmd            | 12 +++---
 8 files changed, 59 insertions(+), 60 deletions(-)
 rename R/{bpmodels-package.R => epichains-package.R} (100%)
 rename man/{bpmodels-package.Rd => epichains-package.Rd} (70%)

diff --git a/.github/ISSUE_TEMPLATE/bug_report.md b/.github/ISSUE_TEMPLATE/bug_report.md
index 31bbc095..6d32efc5 100644
--- a/.github/ISSUE_TEMPLATE/bug_report.md
+++ b/.github/ISSUE_TEMPLATE/bug_report.md
@@ -10,7 +10,7 @@ assignees: ''
 Please place an "x" in all the boxes that apply
 ---------------------------------------------
   
-- [ ] I have the most recent version of bpmodels and R
+- [ ] I have the most recent version of epichains and R
 - [ ] I have found a bug
 - [ ] I have a [reproducible example](http://reprex.tidyverse.org/articles/reprex-dos-and-donts.html)
 - [ ] I want to request a new feature
diff --git a/DESCRIPTION b/DESCRIPTION
index 42f02eb3..3873301a 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -1,4 +1,4 @@
-Package: bpmodels
+Package: epichains
 Title: Analysing transmission chain statistics using branching process models
 Version: 0.2.1
 Authors@R: c(
@@ -36,8 +36,8 @@ Description: Provides methods to analyse and simulate the size and length
     or length of infectious disease outbreaks, as discussed in Farrington
     et al. (2003) <doi:10.1093/biostatistics/4.2.279>.
 License: MIT + file LICENSE
-URL: https://github.com/epiverse-trace/bpmodels, https://epiverse-trace.github.io/bpmodels/
-BugReports: https://github.com/epiverse-trace/bpmodels/issues
+URL: https://github.com/epiverse-trace/epichains, https://epiverse-trace.github.io/epichains/
+BugReports: https://github.com/epiverse-trace/epichains/issues
 Depends:
     R (>= 3.6.0)
 Suggests:
diff --git a/R/bpmodels-package.R b/R/epichains-package.R
similarity index 100%
rename from R/bpmodels-package.R
rename to R/epichains-package.R
diff --git a/README.Rmd b/README.Rmd
index 0b48e9f1..dbc7d1bc 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -1,6 +1,6 @@
 ---
 output: github_document
-bibliography: vignettes/references.bib
+bibliography: vignettes/references.json
 link-citations: true
 ---
 
@@ -13,13 +13,13 @@ knitr::opts_chunk$set(
 )
 ```
 
-# _bpmodels_: Methods for analysing the size and length of transmission chains from branching process models
+# _epichains_: Methods for analysing the size and length of transmission chains from branching process models
 
 <!-- badges: start -->
-![GitHub R package version](https://img.shields.io/github/r-package/v/epiverse-trace/bpmodels)
-[![R-CMD-check](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml)
-[![codecov](https://codecov.io/github/epiverse-trace/bpmodels/branch/main/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/bpmodels) 
-![GitHub contributors](https://img.shields.io/github/contributors/epiverse-trace/bpmodels)
+![GitHub R package version](https://img.shields.io/github/r-package/v/epiverse-trace/epichains)
+[![R-CMD-check](https://github.com/epiverse-trace/epichains/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/epichains/actions/workflows/R-CMD-check.yaml)
+[![codecov](https://codecov.io/github/epiverse-trace/epichains/branch/main/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/epichains)
+![GitHub contributors](https://img.shields.io/github/contributors/epiverse-trace/epichains)
 [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
 <!-- badges: end -->
 
@@ -27,7 +27,7 @@ knitr::opts_chunk$set(
 knitr::opts_chunk$set(echo = TRUE)
 ```
 
-_bpmodels_ is an R package to simulate and analyse the size and length of 
+_epichains_ is an R package to simulate and analyse the size and length of
 branching processes with a given offspring distribution. These models are often 
 used in infectious disease epidemiology, where the chains represent chains of
 transmission, and the offspring distribution represents the distribution of 
@@ -35,16 +35,16 @@ secondary infections caused by an infected individual.
 
 # Installation
 
-The latest development version of the _bpmodels_ package can be installed via
+The latest development version of the _epichains_ package can be installed via
 
 ```{r include=TRUE,eval=FALSE}
-devtools::install_github(file.path("epiverse-trace", "bpmodels"))
+pak::pkg_install("epiverse-trace/epichains")
 ```
 
 To load the package, use
 
 ```{r eval=TRUE}
-library("bpmodels")
+library("epichains")
 ```
 
 # Quick start
@@ -177,25 +177,25 @@ head(chains_df)
 
 ## Package vignettes
 
-Specific use cases of _bpmodels_ can be found in 
-the [online documentation as package vignettes](https://epiverse-trace.github.io/bpmodels/), under "Articles".
+Specific use cases of _epichains_ can be found in 
+the [online documentation as package vignettes](https://epiverse-trace.github.io/epichains/), under "Articles".
 
 ## Reporting bugs 
 
-To report a bug please open an [issue](https://github.com/epiverse-trace/bpmodels/issues/new/choose).
+To report a bug please open an [issue](https://github.com/epiverse-trace/epichains/issues/new/choose).
 
 ## Contribute
 
 We welcome contributions to enhance the package's functionalities. If you 
-wish to do so, please follow the [package contributing guide](https://github.com/epiverse-trace/bpmodels/blob/main/.github/CONTRIBUTING.md).
+wish to do so, please follow the [package contributing guide](https://github.com/epiverse-trace/epichains/blob/main/.github/CONTRIBUTING.md).
 
 ## Code of conduct
 
-Please note that the _bpmodels_ project is released with a [Contributor Code of Conduct](https://github.com/epiverse-trace/.github/blob/main/CODE_OF_CONDUCT.md). 
+Please note that the _epichains_ project is released with a [Contributor Code of Conduct](https://github.com/epiverse-trace/.github/blob/main/CODE_OF_CONDUCT.md). 
 By contributing to this project, you agree to abide by its terms.
 
 ## Citing this package
 
 ```{r message=FALSE, warning=FALSE}
-citation("bpmodels")
+citation("epichains")
 ```
diff --git a/README.md b/README.md
index 648f931c..febabb13 100644
--- a/README.md
+++ b/README.md
@@ -1,19 +1,19 @@
 
-# *bpmodels*: Methods for analysing the size and length of transmission chains from branching process models
+# *epichains*: Methods for analysing the size and length of transmission chains from branching process models
 
 <!-- badges: start -->
 
 ![GitHub R package
-version](https://img.shields.io/github/r-package/v/epiverse-trace/bpmodels)
-[![R-CMD-check](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/bpmodels/actions/workflows/R-CMD-check.yaml)
-[![codecov](https://codecov.io/github/epiverse-trace/bpmodels/branch/main/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/bpmodels)
+version](https://img.shields.io/github/r-package/v/epiverse-trace/epichains)
+[![R-CMD-check](https://github.com/epiverse-trace/epichains/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/epichains/actions/workflows/R-CMD-check.yaml)
+[![codecov](https://codecov.io/github/epiverse-trace/epichains/branch/main/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/epichains)
 ![GitHub
-contributors](https://img.shields.io/github/contributors/epiverse-trace/bpmodels)
+contributors](https://img.shields.io/github/contributors/epiverse-trace/epichains)
 [![License:
 MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
 <!-- badges: end -->
 
-*bpmodels* is an R package to simulate and analyse the size and length
+*epichains* is an R package to simulate and analyse the size and length
 of branching processes with a given offspring distribution. These models
 are often used in infectious disease epidemiology, where the chains
 represent chains of transmission, and the offspring distribution
@@ -22,17 +22,17 @@ infected individual.
 
 # Installation
 
-The latest development version of the *bpmodels* package can be
+The latest development version of the *epichains* package can be
 installed via
 
 ``` r
-devtools::install_github(file.path("epiverse-trace", "bpmodels"))
+pak::pkg_install("epiverse-trace/epichains")
 ```
 
 To load the package, use
 
 ``` r
-library("bpmodels")
+library("epichains")
 ```
 
 # Quick start
@@ -182,25 +182,25 @@ head(chains_df)
 
 ## Package vignettes
 
-Specific use cases of *bpmodels* can be found in the [online
+Specific use cases of *epichains* can be found in the [online
 documentation as package
-vignettes](https://epiverse-trace.github.io/bpmodels/), under
+vignettes](https://epiverse-trace.github.io/epichains/), under
 “Articles”.
 
 ## Reporting bugs
 
 To report a bug please open an
-[issue](https://github.com/epiverse-trace/bpmodels/issues/new/choose).
+[issue](https://github.com/epiverse-trace/epichains/issues/new/choose).
 
 ## Contribute
 
 We welcome contributions to enhance the package’s functionalities. If
 you wish to do so, please follow the [package contributing
-guide](https://github.com/epiverse-trace/bpmodels/blob/main/.github/CONTRIBUTING.md).
+guide](https://github.com/epiverse-trace/epichains/blob/main/.github/CONTRIBUTING.md).
 
 ## Code of conduct
 
-Please note that the *bpmodels* project is released with a [Contributor
+Please note that the *epichains* project is released with a [Contributor
 Code of
 Conduct](https://github.com/epiverse-trace/.github/blob/main/CODE_OF_CONDUCT.md).
 By contributing to this project, you agree to abide by its terms.
@@ -208,20 +208,19 @@ By contributing to this project, you agree to abide by its terms.
 ## Citing this package
 
 ``` r
-citation("bpmodels")
+citation("epichains")
+#> To cite package epichains in publications use:
 #> 
-#> To cite package bpmodels in publications use:
-#> 
-#>   Sebastian Funk, Flavio Finger, and James M. Azam (2023). bpmodels:
+#>   Sebastian Funk, Flavio Finger, and James M. Azam (2023). epichains:
 #>   Analysing transmission chain statistics using branching process
-#>   models, website: https://github.com/epiverse-trace/bpmodels/
+#>   models, website: https://github.com/epiverse-trace/epichains/
 #> 
 #> A BibTeX entry for LaTeX users is
 #> 
 #>   @Manual{,
-#>     title = {bpmodels: Analysing transmission chain statistics using branching process models},
+#>     title = {epichains: Analysing transmission chain statistics using branching process models},
 #>     author = {{Sebastian Funk} and {Flavio Finger} and {James M. Azam}},
 #>     year = {2023},
-#>     url = {https://github.com/epiverse-trace/bpmodels/},
+#>     url = {https://github.com/epiverse-trace/epichains/},
 #>   }
 ```
diff --git a/inst/CITATION b/inst/CITATION
index 372018e3..3c65e6cb 100644
--- a/inst/CITATION
+++ b/inst/CITATION
@@ -1,16 +1,16 @@
-citHeader("To cite package bpmodels in publications use:")
+citHeader("To cite package epichains in publications use:")
 
 citEntry(
   entry = "Manual",
-  title = "bpmodels: Analysing transmission chain statistics using branching process models",
+  title = "epichains: Analysing transmission chain statistics using branching process models",
   author = c(person("Sebastian Funk"), person("Flavio Finger"), person("James M. Azam")),
   year     = "2023",
-  url      = "https://github.com/epiverse-trace/bpmodels/",
+  url      = "https://github.com/epiverse-trace/epichains/",
   textVersion =
   sprintf("%s %s %s %s",
   "Sebastian Funk, Flavio Finger, and James M. Azam (2023).",
-  "bpmodels: Analysing transmission chain statistics",
+  "epichains: Analysing transmission chain statistics",
   "using branching process models,",
-  "website: https://github.com/epiverse-trace/bpmodels/"
+  "website: https://github.com/epiverse-trace/epichains/"
   )
 )
diff --git a/man/bpmodels-package.Rd b/man/epichains-package.Rd
similarity index 70%
rename from man/bpmodels-package.Rd
rename to man/epichains-package.Rd
index 4b6b9458..ab3d4d31 100644
--- a/man/bpmodels-package.Rd
+++ b/man/epichains-package.Rd
@@ -1,19 +1,19 @@
 % Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/bpmodels-package.R
+% Please edit documentation in R/epichains-package.R
 \docType{package}
-\name{bpmodels-package}
-\alias{bpmodels}
-\alias{bpmodels-package}
-\title{bpmodels: Analysing transmission chain statistics using branching process models}
+\name{epichains-package}
+\alias{epichains}
+\alias{epichains-package}
+\title{epichains: Analysing transmission chain statistics using branching process models}
 \description{
 Provides methods to analyse and simulate the size and length of branching processes with an arbitrary offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks, as discussed in Farrington et al. (2003) \doi{10.1093/biostatistics/4.2.279}.
 }
 \seealso{
 Useful links:
 \itemize{
-  \item \url{https://github.com/epiverse-trace/bpmodels}
-  \item \url{https://epiverse-trace.github.io/bpmodels/}
-  \item Report bugs at \url{https://github.com/epiverse-trace/bpmodels/issues}
+  \item \url{https://github.com/epiverse-trace/epichains}
+  \item \url{https://epiverse-trace.github.io/epichains/}
+  \item Report bugs at \url{https://github.com/epiverse-trace/epichains/issues}
 }
 
 }
diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 4a840091..fb36b764 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -10,8 +10,8 @@ pkgdown:
 bibliography: references.json
 link-citations: true
 vignette: >
-  %\VignetteIndexEntry{Projecting infectious disease incidence: a COVID-19 example}
   %\VignetteEncoding{UTF-8}
+  %\VignetteIndexEntry{Projecting infectious disease incidence: a COVID-19 example}
   %\VignetteEngine{knitr::rmarkdown}
 editor_options: 
   chunk_output_type: console
@@ -33,17 +33,17 @@ Branching processes can be used to project infectious disease trends in time
 provided we can characterize the distribution of times between 
 successive cases (serial interval), and the distribution of secondary cases 
 produced by a single individual (offspring distribution). Such simulations can 
-be achieved in `bpmodels` with the `chain_sim()` function and @pearson2020, and 
+be achieved in `epichains` with the `chain_sim()` function and @pearson2020, and 
 @abbott2020 illustrate its application to COVID-19. 
 
 The purpose of this vignette is to use early data on COVID-19 in South Africa 
-[@marivate2020] to illustrate how `bpmodels` can be used to forecast an 
+[@marivate2020] to illustrate how `epichains` can be used to forecast an 
 outbreak. 
 
 Let's load the required packages
 
 ```{r packages, include=TRUE}
-library("bpmodels")
+library("epichains")
 library("dplyr")
 library("ggplot2")
 library("lubridate")
@@ -52,11 +52,11 @@ library("epiparameter")
 
 ## Data
 
-Included in `bpmodels` is a cleaned time series of the first 15 days of 
+Included in `epichains` is a cleaned time series of the first 15 days of 
 the COVID-19 outbreak in South Africa. This can be loaded into 
 memory as follows: 
 ```{r}
-data("covid19_sa", package = "bpmodels")
+data("covid19_sa", package = "epichains")
 ```
 
 Let us examine the first 6 entries of the dataset.

From b2fe317d45f9c4c9ac038e340278b36e6a0e29c0 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Sat, 27 May 2023 22:39:22 +0100
Subject: [PATCH 236/828] Replaced bpmodels with epichains in chain_ll and the
 testthat script

---
 R/likelihoods.R  | 2 +-
 tests/testthat.R | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 23ce84f2..bfb9ee91 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -181,7 +181,7 @@ chain_ll <- function(x, offspring, stat = c("size", "length"), obs_prob = 1,
   pars <- as.list(unlist(list(...))) ## converts vectors to lists
 
   ## calculate likelihoods
-  if (exists(ll_func, where = asNamespace("bpmodels"), mode = "function")) {
+  if (exists(ll_func, where = asNamespace("epichains"), mode = "function")) {
     func <- get(ll_func)
     likelihoods[calc_sizes] <- do.call(func, c(list(x = calc_sizes), pars))
   } else {
diff --git a/tests/testthat.R b/tests/testthat.R
index 2d4e1df3..68af5393 100644
--- a/tests/testthat.R
+++ b/tests/testthat.R
@@ -1,3 +1,3 @@
 library(testthat)
 
-test_check("bpmodels")
+test_check("epichains")

From 627c928a0cd8187e607c0e1b7f1a24a9087097a9 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 31 May 2023 17:24:50 +0100
Subject: [PATCH 237/828] added spellchecking to tests

---
 DESCRIPTION      |  2 ++
 inst/WORDLIST    | 31 +++++++++++++++++++++++++++++++
 tests/spelling.R |  3 +++
 3 files changed, 36 insertions(+)
 create mode 100644 inst/WORDLIST
 create mode 100644 tests/spelling.R

diff --git a/DESCRIPTION b/DESCRIPTION
index 3873301a..ff4097c8 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -49,6 +49,7 @@ Suggests:
     knitr,
     lubridate,
     rmarkdown,
+    spelling,
     testthat,
     truncdist,
     usethis
@@ -61,3 +62,4 @@ Encoding: UTF-8
 LazyData: true
 Roxygen: list(markdown = TRUE)
 RoxygenNote: 7.2.3
+Language: en-GB
diff --git a/inst/WORDLIST b/inst/WORDLIST
new file mode 100644
index 00000000..9ecd63a6
--- /dev/null
+++ b/inst/WORDLIST
@@ -0,0 +1,31 @@
+Borel
+CMD
+COVID
+Marivate
+ORCID
+Poisson
+README's
+Vukosi
+Zhian
+bpmodels
+codecov
+com
+dfrac
+doi
+ecdf
+epiparameter
+gborel
+immunes
+infectees
+infectors
+linelist
+ln
+mathcal
+nbinom
+nolint
+pois
+sim
+stackoverflow
+superspreading
+susceptibles
+var
diff --git a/tests/spelling.R b/tests/spelling.R
new file mode 100644
index 00000000..6713838f
--- /dev/null
+++ b/tests/spelling.R
@@ -0,0 +1,3 @@
+if(requireNamespace('spelling', quietly = TRUE))
+  spelling::spell_check_test(vignettes = TRUE, error = FALSE,
+                             skip_on_cran = TRUE)

From 8b5ba41af12af53b2f729e7ff8daaa45a2aaca94 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:34:19 +0100
Subject: [PATCH 238/828] Linting

---
 tests/spelling.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tests/spelling.R b/tests/spelling.R
index 6713838f..33ef2c73 100644
--- a/tests/spelling.R
+++ b/tests/spelling.R
@@ -1,3 +1,3 @@
-if(requireNamespace('spelling', quietly = TRUE))
+if (requireNamespace("spelling", quietly = TRUE))
   spelling::spell_check_test(vignettes = TRUE, error = FALSE,
                              skip_on_cran = TRUE)

From 82f7d7a7365cc7b2224c5dffcce7414e8c85865d Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 16 Jun 2023 14:30:35 +0100
Subject: [PATCH 239/828] Set up README and workflows (#28)

* Added an experimental lifecycle badge

* Deleted old README content from bpmodels

* Replaced the package name with the dynamic placeholder from packagetemplate

* Rendered the README to .md

* Aligned pkgdown workflows with that of packagetemplate

* Aligned RCMDCHECK workflows with that of packagetemplate

* Aligned render-readme workflows with that of packagetemplate

* Aligned test-coverage workflows with that of packagetemplate

* added package logo file

* Added package logo to README

* Replaced the logo file with the thumbnail sized version

* resized the logo

* resized the logo

* rendered the readme to .md

* Resized the logo

* Resized the README logo

* fixed the render-readme workflow

* fixed an issue in the render-readme workflow

* fixed the render-readme workflow

* fixed the render-readme workflow

* fixed the render-readme workflow

* updated README to mention epichains as a re-implementaion of bpmodels

* Automatic readme update

* Fixed an issue in the README that's causing to render-readme to fail

* Aligned render-readme workflows with that of packagetemplate

* resized the logo

* Resized the logo

* fixed the render-readme workflow

* fixed an issue in the render-readme workflow

* fixed the render-readme workflow

* fixed the render-readme workflow

* fixed the render-readme workflow

* Automatic readme update

* Replaced logo with dark background with the new version

* Changed package name to italics

---------

Co-authored-by: GitHub Action <action@github.com>
---
 .github/workflows/R-CMD-check.yaml   |  40 ++++++
 .github/workflows/pkgdown.yaml       |  35 ++++-
 .github/workflows/render_readme.yml  |  32 ++++-
 .github/workflows/test-coverage.yaml |  39 +++---
 README.Rmd                           | 159 ++++------------------
 README.md                            | 189 +++++----------------------
 man/figures/epichains_logo.png       | Bin 0 -> 31438 bytes
 7 files changed, 178 insertions(+), 316 deletions(-)
 create mode 100644 man/figures/epichains_logo.png

diff --git a/.github/workflows/R-CMD-check.yaml b/.github/workflows/R-CMD-check.yaml
index a3ac6182..8adbac68 100644
--- a/.github/workflows/R-CMD-check.yaml
+++ b/.github/workflows/R-CMD-check.yaml
@@ -3,11 +3,41 @@
 on:
   push:
     branches: [main, master]
+    paths:
+      - 'data/**'
+      - 'R/**'
+      - 'inst/**'
+      - 'man/**'
+      - 'src/**'
+      - 'tests/**'
+      - 'vignettes/**'
+      - 'DESCRIPTION'
+      - 'NAMESPACE'
+      - 'LICENSE'
+      - '.Rbuildignore'
+      - '.github/workflows/R-CMD-check.yaml'
   pull_request:
     branches: [main, master]
+    paths:
+      - 'data/**'
+      - 'R/**'
+      - 'inst/**'
+      - 'man/**'
+      - 'src/**'
+      - 'tests/**'
+      - 'vignettes/**'
+      - 'DESCRIPTION'
+      - 'NAMESPACE'
+      - 'LICENSE'
+      - '.Rbuildignore'
+      - '.github/workflows/R-CMD-check.yaml'
 
 name: R-CMD-check
 
+concurrency:
+  group: ${{ github.workflow }}-${{ github.ref }}
+  cancel-in-progress: true
+
 jobs:
   R-CMD-check:
     runs-on: ${{ matrix.config.os }}
@@ -27,6 +57,8 @@ jobs:
     env:
       GITHUB_PAT: ${{ secrets.GITHUB_TOKEN }}
       R_KEEP_PKG_SOURCE: yes
+      # Exists since R 4.3.0 but `false` by default
+      _R_CHECK_LENGTH_COLON_: true
 
     steps:
       - uses: actions/checkout@v3
@@ -45,5 +77,13 @@ jobs:
           needs: check
 
       - uses: r-lib/actions/check-r-package@v2
+        id: rcmdcheck
         with:
           upload-snapshots: true
+
+      # fail-fast but only if rcmdcheck step fails
+      - name: Manual fail-fast
+        env:
+          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
+        if: always() && steps.rcmdcheck.outcome == 'failure'
+        run: gh run cancel ${{ github.run_id }}
diff --git a/.github/workflows/pkgdown.yaml b/.github/workflows/pkgdown.yaml
index 087f0b05..269728a4 100644
--- a/.github/workflows/pkgdown.yaml
+++ b/.github/workflows/pkgdown.yaml
@@ -3,22 +3,49 @@
 on:
   push:
     branches: [main, master]
+    paths:
+      - 'README.Rmd'
+      - 'README.md'
+      - 'index.Rmd'
+      - 'index.md'
+      - 'man/**'
+      - 'vignettes/**'
+      - '_pkgdown.yml'
+      - 'pkgdown/**'
+      - 'DESCRIPTION'
+      - '.Rbuildignore'
+      - '.github/**'
   pull_request:
     branches: [main, master]
+    paths:
+      - 'README.Rmd'
+      - 'README.md'
+      - 'index.Rmd'
+      - 'index.md'
+      - 'man/**'
+      - 'vignettes/**'
+      - '_pkgdown.yml'
+      - 'pkgdown/**'
+      - 'DESCRIPTION'
+      - '.Rbuildignore'
+      - '.github/**'
   release:
     types: [published]
   workflow_dispatch:
 
 name: pkgdown
 
+concurrency:
+  group: ${{ github.workflow }}-${{ github.ref }}
+  cancel-in-progress: true
+
 jobs:
   pkgdown:
     runs-on: ubuntu-latest
-    # Only restrict concurrency for non-PR jobs
-    concurrency:
-      group: pkgdown-${{ github.event_name != 'pull_request' || github.run_id }}
     env:
       GITHUB_PAT: ${{ secrets.GITHUB_TOKEN }}
+    permissions:
+      contents: write
     steps:
       - uses: actions/checkout@v3
 
@@ -39,7 +66,7 @@ jobs:
 
       - name: Deploy to GitHub pages 🚀
         if: github.event_name != 'pull_request'
-        uses: JamesIves/github-pages-deploy-action@v4.4.1
+        uses: JamesIves/github-pages-deploy-action@4.1.4
         with:
           clean: false
           branch: gh-pages
diff --git a/.github/workflows/render_readme.yml b/.github/workflows/render_readme.yml
index 6df5f98f..0c427323 100644
--- a/.github/workflows/render_readme.yml
+++ b/.github/workflows/render_readme.yml
@@ -11,6 +11,15 @@ on:
   push:
     paths:
       - 'README.Rmd'
+      - '.github/workflows/render_readme.yml'
+
+concurrency:
+  group: ${{ github.workflow }}-${{ github.ref }}
+  cancel-in-progress: true
+
+concurrency:
+  group: ${{ github.workflow }}-${{ github.ref }}
+  cancel-in-progress: true
 
 # A workflow run is made up of one or more jobs that can run sequentially or in parallel
 jobs:
@@ -20,10 +29,12 @@ jobs:
       GITHUB_PAT: ${{ secrets.GITHUB_TOKEN }}
     steps:
       - name: Checkout repos
-        uses: actions/checkout@v2
+        uses: actions/checkout@v3
 
       - name: Setup R
         uses: r-lib/actions/setup-r@v2
+        with:
+          use-public-rspm: true
 
       - name: Setup pandoc
         uses: r-lib/actions/setup-pandoc@v2
@@ -35,13 +46,26 @@ jobs:
 
       - name: Compile the readme
         run: |
-          rmarkdown::render("README.Rmd")
+          writeLines(
+            knitr::knit_expand(
+              "README.Rmd",
+              packagename = read.dcf("DESCRIPTION", "Package"),
+              gh_repo = Sys.getenv("GITHUB_REPOSITORY")
+            ),
+            "README.Rmd"
+          )
+          rmarkdown::render("README.Rmd", output_file = "README.md", output_dir = ".")
         shell: Rscript {0}
 
       - name: Commit files
         run: |
           git config --local user.email "action@github.com"
           git config --local user.name "GitHub Action"
-          git add README.md 
+          git add README.md
+          # Also add README figures if they exist
+          if [ -d man/figures ]
+          then
+            git add man/figures/
+          fi
           git diff-index --quiet HEAD || git commit -m "Automatic readme update"
-          git push origin || echo "No changes to push"
\ No newline at end of file
+          git push origin || echo "No changes to push"
diff --git a/.github/workflows/test-coverage.yaml b/.github/workflows/test-coverage.yaml
index 2c5bb502..cba9c5b1 100644
--- a/.github/workflows/test-coverage.yaml
+++ b/.github/workflows/test-coverage.yaml
@@ -3,8 +3,26 @@
 on:
   push:
     branches: [main, master]
+    paths:
+      - 'R/**'
+      - 'src/**'
+      - 'tests/**'
+      - 'inst/**'
+      - 'DESCRIPTION'
+      - '.github/workflows/test-coverage.yaml'
   pull_request:
     branches: [main, master]
+    paths:
+      - 'R/**'
+      - 'src/**'
+      - 'tests/**'
+      - 'inst/**'
+      - 'DESCRIPTION'
+      - '.github/workflows/test-coverage.yaml'
+
+concurrency:
+  group: ${{ github.workflow }}-${{ github.ref }}
+  cancel-in-progress: true
 
 name: test-coverage
 
@@ -27,24 +45,5 @@ jobs:
           needs: coverage
 
       - name: Test coverage
-        run: |
-          covr::codecov(
-            quiet = FALSE,
-            clean = FALSE,
-            install_path = file.path(Sys.getenv("RUNNER_TEMP"), "package")
-          )
+        run: covr::codecov(quiet = FALSE)
         shell: Rscript {0}
-
-      - name: Show testthat output
-        if: always()
-        run: |
-          ## --------------------------------------------------------------------
-          find ${{ runner.temp }}/package -name 'testthat.Rout*' -exec cat '{}' \; || true
-        shell: bash
-
-      - name: Upload test results
-        if: failure()
-        uses: actions/upload-artifact@v3
-        with:
-          name: coverage-test-failures
-          path: ${{ runner.temp }}/package
diff --git a/README.Rmd b/README.Rmd
index dbc7d1bc..de86e1e6 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -4,6 +4,12 @@ bibliography: vignettes/references.json
 link-citations: true
 ---
 
+<!-- README.md is generated from README.Rmd. Please edit that file. -->
+<!-- The code to render this README is stored in .github/workflows/render-readme.yaml -->
+<!-- Variables marked with double curly braces will be transformed beforehand: -->
+<!-- `packagename` is extracted from the DESCRIPTION file -->
+<!-- `gh_repo` is extracted via a special environment variable in GitHub Actions -->
+
 ```{r, include = FALSE}
 knitr::opts_chunk$set(
   collapse = TRUE,
@@ -13,7 +19,7 @@ knitr::opts_chunk$set(
 )
 ```
 
-# _epichains_: Methods for analysing the size and length of transmission chains from branching process models
+# _{{ packagename }}_: Methods for analysing the size and length of transmission chains from branching process models <img src="man/figures/epichains_logo.png" align="right" height="130" />
 
 <!-- badges: start -->
 ![GitHub R package version](https://img.shields.io/github/r-package/v/epiverse-trace/epichains)
@@ -21,24 +27,33 @@ knitr::opts_chunk$set(
 [![codecov](https://codecov.io/github/epiverse-trace/epichains/branch/main/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/epichains)
 ![GitHub contributors](https://img.shields.io/github/contributors/epiverse-trace/epichains)
 [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
+[![Lifecycle:
+experimental](https://img.shields.io/badge/lifecycle-experimental-orange.svg)](https://lifecycle.r-lib.org/articles/stages.html#experimental)
 <!-- badges: end -->
 
 ```{r setup, include=FALSE}
 knitr::opts_chunk$set(echo = TRUE)
 ```
 
-_epichains_ is an R package to simulate and analyse the size and length of
-branching processes with a given offspring distribution. These models are often 
-used in infectious disease epidemiology, where the chains represent chains of
+_{{ packagename }}_ is an R package to simulate, analyse, and visualize the size 
+and length of branching processes with a given offspring distribution. These 
+models are often used in infectious disease epidemiology, where the chains represent chains of
 transmission, and the offspring distribution represents the distribution of 
-secondary infections caused by an infected individual.
+secondary infections caused by an infected individual. 
+
+_{{ packagename }}_ re-implements [bpmodels]("https://github.com/epiverse-trace/bpmodels/") by providing dedicated classes that allow easy manipulation and interoperability with other existing
+packages for handling transmission chain and contact-tracing data.
+
+_{{ packagename }}_ is developed at the [Centre for the Mathematical Modelling of Infectious Diseases](https://www.lshtm.ac.uk/research/centres/centre-mathematical-modelling-infectious-diseases) at the London School of Hygiene and Tropical Medicine as part of the [Epiverse Initiative](https://data.org/initiatives/epiverse/).
 
 # Installation
 
-The latest development version of the _epichains_ package can be installed via
+The latest development version of the _{{ packagename }}_ package can be installed via
 
 ```{r include=TRUE,eval=FALSE}
-pak::pkg_install("epiverse-trace/epichains")
+# check whether {pak} is installed
+if(!require("pak")) install.packages("pak")
+pak::pak("{{ gh_repo }}")
 ```
 
 To load the package, use
@@ -49,135 +64,11 @@ library("epichains")
 
 # Quick start
 
-At the heart of the package are the `chain_ll()` and `chain_sim()` functions. 
-
-## Calculating log-likelihoods
-
-The `chain_ll()` function calculates the log-likelihood of a distribution of 
-chain sizes or lengths given an offspring distribution and its associated 
-parameters. 
-
-For example, if we have observed a distribution of chains of sizes 
-$1, 1, 4, 7$, we can 
-calculate the log-likelihood of this observed chain by assuming the offspring 
-per generation is Poisson distributed with a mean number (which can 
-be interpreted as the reproduction number $\mathcal{R_0}$) of $0.5$. 
-
-To do this, we run 
-
-```{r}
-set.seed(13)
-chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
-chain_ll(x = chain_sizes, offspring = "pois", stat = "size", lambda = 0.5)
-```
-
-The first argument of `chain_ll()` is the chain size (or length, in number of 
-generations that a chain lasted) distribution to 
-analyse. The second argument, `offspring`, specifies the offspring 
-distribution. This is given as a function used to generate random offspring. 
-It can be any probability distribution implemented in `R`, that is, one that 
-has a corresponding function for generating random numbers beginning with the 
-letter `r`. In the case of the example above, since random Poisson numbers are 
-generated in `R` using a function called `rpois()`, the string to pass to the 
-`offspring` argument is `"pois"`.
-
-The third argument, `stat`, determines whether to analyse chain sizes 
-(`"size"`, the default if this argument is not specified) or lengths 
-(`"length"`). Lastly, any named arguments not recognised by `chain_ll()` 
-are interpreted as parameters of the corresponding probability distribution, 
-here `lambda = 0.5` as the mean of the Poisson distribution (see the `R` help 
-page for the [Poisson distribution](https://stat.ethz.ch/R-manual/R-devel/library/stats/html/Poisson.html) for more information). 
-
-### Imperfect observations
-
-By default, `chain_ll()` assumes perfect observation, where `obs_prob = 1` 
-(See `?chain_ll`), meaning that all transmission events are observed and 
-recorded in the data. If observations are imperfect, `chain_ll()` provides 
-the argument, `obs_prob`, for specifying the probability of observation. 
-This probability is used to determine the likelihood of observing the specified
-chain sizes or lengths. In the case of imperfect observation, true chain sizes 
-or lengths are simulated repeatedly (the number of times given by the 
-`nsim_obs` argument), and the likelihood calculated for each of 
-these simulations. 
-
-For example, if the probability of observing each case is `obs_prob = 0.30`, 
-we use
-
-```{r}
-chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
-ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5,
-               nsim_obs = 10)
-ll
-```
-
-This returns `10` likelihood values (because `nsim_obs = 10`), which can be 
-averaged to come up with an overall likelihood estimate.
-
-To find out about usage of the `chain_ll()` function, you can use the `R` help 
-file
-
-```{r eval=FALSE}
-?chain_ll
-```
-
-### How `chain_ll()` works
-
-If the probability distribution of chain sizes or lengths has an analytical 
-solution, this will be used. `chain_ll()` currently supports the Poisson and 
-negative binomial size distribution and the Poisson and geometric length 
-distribution. 
-
-If an analytical solution does not exist, simulations are used to approximate 
-this probability distributions ([using a linear approximation to the cumulative 
-distribution](https://en.wikipedia.org/wiki/Empirical_distribution_function) 
-for unobserved sizes/lengths). In that case, an extra argument `nsim_offspring` 
-must be passed to `chain_ll()` to specify the number of simulations to be 
-used for this approximation. 
-
-For example, to get offspring drawn from a binomial distribution with 
-probability `prob = 0.5`, we run
-
-```{r}
-chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
-chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5,
-         nsim_offspring = 100)
-```
-
-## Simulating branching processes
-
-To simulate a branching process, we use the `chain_sim()` function. This 
-function follows the same syntax as `chain_ll()`.
-
-Below, we are simulating $5$ chains, assuming the offspring are generated using
-a Poisson distribution with mean, `lambda = 0.5`. By default, `chain_sim()` 
-returns a vector of chain sizes/lengths. If we instead want to return 
-a tree of infectees and infectors, we need to specify a function for 
-the serial interval and set `tree = TRUE` (see next section).
-
-```{r}
-chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5)
-```
-
-### Simulating trees
-
-To simulate a tree of transmission chains, we specify the serial interval 
-generation function and set `tree = TRUE` as follows:
-
-```{r}
-set.seed(13)
-serial_interval <- function(n) {
-  rlnorm(n, meanlog = 0.58, sdlog = 1.58)
-}
-chains_df <- chain_sim(
-  n = 5, offspring = "pois", lambda = 0.5, stat = "length",
-  infinite = 100, serial = serial_interval, tree = TRUE
-)
-head(chains_df)
-```
+Work in progress
 
 ## Package vignettes
 
-Specific use cases of _epichains_ can be found in 
+Specific use cases of _{{ packagename }}_ can be found in 
 the [online documentation as package vignettes](https://epiverse-trace.github.io/epichains/), under "Articles".
 
 ## Reporting bugs 
@@ -191,7 +82,7 @@ wish to do so, please follow the [package contributing guide](https://github.com
 
 ## Code of conduct
 
-Please note that the _epichains_ project is released with a [Contributor Code of Conduct](https://github.com/epiverse-trace/.github/blob/main/CODE_OF_CONDUCT.md). 
+Please note that the _{{ packagename }}_ project is released with a [Contributor Code of Conduct](https://github.com/epiverse-trace/.github/blob/main/CODE_OF_CONDUCT.md). 
 By contributing to this project, you agree to abide by its terms.
 
 ## Citing this package
diff --git a/README.md b/README.md
index febabb13..5af3535b 100644
--- a/README.md
+++ b/README.md
@@ -1,5 +1,11 @@
 
-# *epichains*: Methods for analysing the size and length of transmission chains from branching process models
+<!-- README.md is generated from README.Rmd. Please edit that file. -->
+<!-- The code to render this README is stored in .github/workflows/render-readme.yaml -->
+<!-- Variables marked with double curly braces will be transformed beforehand: -->
+<!-- `packagename` is extracted from the DESCRIPTION file -->
+<!-- `gh_repo` is extracted via a special environment variable in GitHub Actions -->
+
+# epichains: Methods for analysing the size and length of transmission chains from branching process models <img src="man/figures/epichains_logo.png" align="right" height="130" />
 
 <!-- badges: start -->
 
@@ -11,22 +17,38 @@ version](https://img.shields.io/github/r-package/v/epiverse-trace/epichains)
 contributors](https://img.shields.io/github/contributors/epiverse-trace/epichains)
 [![License:
 MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
+[![Lifecycle:
+experimental](https://img.shields.io/badge/lifecycle-experimental-orange.svg)](https://lifecycle.r-lib.org/articles/stages.html#experimental)
 <!-- badges: end -->
 
-*epichains* is an R package to simulate and analyse the size and length
-of branching processes with a given offspring distribution. These models
-are often used in infectious disease epidemiology, where the chains
-represent chains of transmission, and the offspring distribution
-represents the distribution of secondary infections caused by an
-infected individual.
+epichains is an R package to simulate, analyse, and visualize the size
+and length of branching processes with a given offspring distribution.
+These models are often used in infectious disease epidemiology, where
+the chains represent chains of transmission, and the offspring
+distribution represents the distribution of secondary infections caused
+by an infected individual.
+
+epichains re-implements
+[bpmodels](%22https://github.com/epiverse-trace/bpmodels/%22) by
+providing dedicated classes that allow easy manipulation and
+interoperability with other existing packages for handling transmission
+chain and contact-tracing data.
+
+epichains is developed at the [Centre for the Mathematical Modelling of
+Infectious
+Diseases](https://www.lshtm.ac.uk/research/centres/centre-mathematical-modelling-infectious-diseases)
+at the London School of Hygiene and Tropical Medicine as part of the
+[Epiverse Initiative](https://data.org/initiatives/epiverse/).
 
 # Installation
 
-The latest development version of the *epichains* package can be
-installed via
+The latest development version of the epichains package can be installed
+via
 
 ``` r
-pak::pkg_install("epiverse-trace/epichains")
+# check whether {pak} is installed
+if(!require("pak")) install.packages("pak")
+pak::pak("epiverse-trace/epichains")
 ```
 
 To load the package, use
@@ -37,152 +59,11 @@ library("epichains")
 
 # Quick start
 
-At the heart of the package are the `chain_ll()` and `chain_sim()`
-functions.
-
-## Calculating log-likelihoods
-
-The `chain_ll()` function calculates the log-likelihood of a
-distribution of chain sizes or lengths given an offspring distribution
-and its associated parameters.
-
-For example, if we have observed a distribution of chains of sizes
-$1, 1, 4, 7$, we can calculate the log-likelihood of this observed chain
-by assuming the offspring per generation is Poisson distributed with a
-mean number (which can be interpreted as the reproduction number
-$\mathcal{R_0}$) of $0.5$.
-
-To do this, we run
-
-``` r
-set.seed(13)
-chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
-chain_ll(x = chain_sizes, offspring = "pois", stat = "size", lambda = 0.5)
-#> [1] -8.607196
-```
-
-The first argument of `chain_ll()` is the chain size (or length, in
-number of generations that a chain lasted) distribution to analyse. The
-second argument, `offspring`, specifies the offspring distribution. This
-is given as a function used to generate random offspring. It can be any
-probability distribution implemented in `R`, that is, one that has a
-corresponding function for generating random numbers beginning with the
-letter `r`. In the case of the example above, since random Poisson
-numbers are generated in `R` using a function called `rpois()`, the
-string to pass to the `offspring` argument is `"pois"`.
-
-The third argument, `stat`, determines whether to analyse chain sizes
-(`"size"`, the default if this argument is not specified) or lengths
-(`"length"`). Lastly, any named arguments not recognised by `chain_ll()`
-are interpreted as parameters of the corresponding probability
-distribution, here `lambda = 0.5` as the mean of the Poisson
-distribution (see the `R` help page for the [Poisson
-distribution](https://stat.ethz.ch/R-manual/R-devel/library/stats/html/Poisson.html)
-for more information).
-
-### Imperfect observations
-
-By default, `chain_ll()` assumes perfect observation, where
-`obs_prob = 1` (See `?chain_ll`), meaning that all transmission events
-are observed and recorded in the data. If observations are imperfect,
-`chain_ll()` provides the argument, `obs_prob`, for specifying the
-probability of observation. This probability is used to determine the
-likelihood of observing the specified chain sizes or lengths. In the
-case of imperfect observation, true chain sizes or lengths are simulated
-repeatedly (the number of times given by the `nsim_obs` argument), and
-the likelihood calculated for each of these simulations.
-
-For example, if the probability of observing each case is
-`obs_prob = 0.30`, we use
-
-``` r
-chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
-ll <- chain_ll(chain_sizes, "pois", "size", obs_prob = 0.3, lambda = 0.5,
-               nsim_obs = 10)
-ll
-#>  [1] -26.54167 -23.26117 -24.33027 -20.80310 -30.76152 -26.46751 -23.79326
-#>  [8] -19.14490 -32.08875 -22.23401
-```
-
-This returns `10` likelihood values (because `nsim_obs = 10`), which can
-be averaged to come up with an overall likelihood estimate.
-
-To find out about usage of the `chain_ll()` function, you can use the
-`R` help file
-
-``` r
-?chain_ll
-```
-
-### How `chain_ll()` works
-
-If the probability distribution of chain sizes or lengths has an
-analytical solution, this will be used. `chain_ll()` currently supports
-the Poisson and negative binomial size distribution and the Poisson and
-geometric length distribution.
-
-If an analytical solution does not exist, simulations are used to
-approximate this probability distributions ([using a linear
-approximation to the cumulative
-distribution](https://en.wikipedia.org/wiki/Empirical_distribution_function)
-for unobserved sizes/lengths). In that case, an extra argument
-`nsim_offspring` must be passed to `chain_ll()` to specify the number of
-simulations to be used for this approximation.
-
-For example, to get offspring drawn from a binomial distribution with
-probability `prob = 0.5`, we run
-
-``` r
-chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
-chain_ll(chain_sizes, "binom", "size", size = 1, prob = 0.5,
-         nsim_offspring = 100)
-#> [1] -Inf
-```
-
-## Simulating branching processes
-
-To simulate a branching process, we use the `chain_sim()` function. This
-function follows the same syntax as `chain_ll()`.
-
-Below, we are simulating $5$ chains, assuming the offspring are
-generated using a Poisson distribution with mean, `lambda = 0.5`. By
-default, `chain_sim()` returns a vector of chain sizes/lengths. If we
-instead want to return a tree of infectees and infectors, we need to
-specify a function for the serial interval and set `tree = TRUE` (see
-next section).
-
-``` r
-chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5)
-#> [1] 2 6 2 1 2
-```
-
-### Simulating trees
-
-To simulate a tree of transmission chains, we specify the serial
-interval generation function and set `tree = TRUE` as follows:
-
-``` r
-set.seed(13)
-serial_interval <- function(n) {
-  rlnorm(n, meanlog = 0.58, sdlog = 1.58)
-}
-chains_df <- chain_sim(
-  n = 5, offspring = "pois", lambda = 0.5, stat = "length",
-  infinite = 100, serial = serial_interval, tree = TRUE
-)
-head(chains_df)
-#>   n id ancestor generation       time
-#> 1 1  1       NA          1 0.00000000
-#> 2 2  1       NA          1 0.00000000
-#> 3 3  1       NA          1 0.00000000
-#> 4 4  1       NA          1 0.00000000
-#> 5 5  1       NA          1 0.00000000
-#> 6 1  2        1          2 0.04771887
-```
+Work in progress
 
 ## Package vignettes
 
-Specific use cases of *epichains* can be found in the [online
+Specific use cases of epichains can be found in the [online
 documentation as package
 vignettes](https://epiverse-trace.github.io/epichains/), under
 “Articles”.
@@ -200,7 +81,7 @@ guide](https://github.com/epiverse-trace/epichains/blob/main/.github/CONTRIBUTIN
 
 ## Code of conduct
 
-Please note that the *epichains* project is released with a [Contributor
+Please note that the epichains project is released with a [Contributor
 Code of
 Conduct](https://github.com/epiverse-trace/.github/blob/main/CODE_OF_CONDUCT.md).
 By contributing to this project, you agree to abide by its terms.
diff --git a/man/figures/epichains_logo.png b/man/figures/epichains_logo.png
new file mode 100644
index 0000000000000000000000000000000000000000..7963de5bca351f5bf27316d333b322ad1204637c
GIT binary patch
literal 31438
zcmXt8V~{3Iw|vIj+2M|D+qP}nwr%ftc5K_WZQHha=lvq?kLrr(=#GjyQFTt9%nX;4
z5&Z>?0Sy2Eeu;|-Dg4Z9KjQ-u{O5^q(suJRLD`9^JN``3{|z85+VGd3m*`HyYEFu_
zCQh#U4#ogiS63Qy8%sw+eLG_sTL;thD=rKG01qH8#INL*aiMMMhP;UJHI<mTnYk%l
z{TCDxs`I%_Nw^I<j{q-%3^D+00Q@N+Vx=_<3=E|(%<o~#6Y4LYfQKxL@%e(IZ;&1^
zc{?Wv`H^?l`-Z0=gQLk&$FbKj<#j0C(2QRjP_<{a70vj7Cm;xS$SuZsxDJMyy9m4m
ze1WfFEq2F~q?YBMI}~6N0#pM48U}zeYycMfVjfSvcC_?}`c;DQOQ?L14-((yW?>i}
z9v7I84gkytF#b*_5ibXQc{C86NpYJNs1<?O@ROM3YtVq6&nfK$LXf2AQ{_8(lSMdb
zz>ta~Qp1{3Xad3m^#}=X=4?Gpw%)#|s^w?RQDB0vOy=h`b0CB+d8Q8tmm8xC0^oxZ
zzB?gqW%5kv&spAbQhqP_biV=!M;Q`{^ZXaDT-XMRp_{$3<Q!#Z{nGzi6L`V1W2(f^
z)R3<l7;pa<5(0pODfTZ<#_QjF+4d8MqTSW<(F4itidpPVr|32x6aXk6d{BX5>NRLF
z9j)*9@u2IAJcb+)MwkkW9>&9q7<#`#P>wlNs{|y{$v$!S$BNi765;hxIA;4&<%-h@
z75BBT@0mjV<Ulv5aL&91G(w1~BZs?pBq~kf^~^p~4)D*ez<altTaozlrU5y=>JXtJ
zdHQ!ZJ{x(YSFUMloszgCQ!ppxjhxKc6{}T^?CwfW9$OVxWzqY^X{<gjH!M6dAZ2_Y
zvlf7&-h|JZux&t)39x|CsH|ideSxOu?*3{lM_MqrM*~RVAqu>0#Y)bn<F@%PaM;2l
zQUb^ulSj>ateDfM=<LFhN2*_TzkUg|2l8nb&r?}u`@`Ro;N4KX>O&!<vBD*9Wxv+k
zZ{c8rGG94!vBYIGU&wBjwjh^Xc$=jmN>#X+JBS>eyG{R=VZK`QJNpSXCuqOlXdpr|
z4+!2GpeH)~+S=$=5=oT9l*{h7Xk1X)^8i}^G-r(YpU)QgbWJ1Tv90AC;|atbNp@eK
zht4uqYrbGJyfy2P((MDYe~CQ&XwhWn;ss>TI5=yezUQU(aK5(l)}N3pq_I5e^?|Qm
zn=~Ovg>yTY0AR=)maJ*=2HCo=&(YR(_K<H5j4+q<9mXAdTmB<m5xxna9x3%>q3)Q<
zfmjpnGc!zdn#~wb7&H)V{}cuu>QJ?oJKN2n{^QfSeSW~o`iKpgorA3A67^Iow&7cK
zzP}j-NG>Z>({)lfWUm1Lk_uDGW{pkE$NMZW#y-%a%mMBmxqW&-ftGeVr#&N0IC+w0
z*21hfYt}23K<;{%8Rj(&%g|d8WL7Dt_M6ZZD5TIYM{pIY!jS1@^|rs|HDQ<oJ*{k&
zUni7o^rnBQKh0IucJUB(U2YYucff*?7>YNu@u)MJnPbEw2t2P8;5{84O|IebyZ^DF
z^qd5pAJ0uskM`@+6xSK_)nj=K)taU$*Hy)O@vG(&l2C#nk(t~(`VGmX+&jX!e>la$
zE!7$4myjZ0!8vg&XMPeS>Ig`n?VJ~k50I(Bl)#h5Jvw_|)KI$SXQe|Fz@*g5rfns`
z!tVR(ae3vadRixg8Bb)4OZcuB*Uz)uaTJ(`fo9YBb0WC_0+8q)+%I?n2L@4gLO*we
z#H6OX`r|EOKQnf7Ia^e@K)MtR9R!|Y%Vd5Q=HcU-^-Q4-W-JH<-VA+AB<>(x-C(2G
zZ8B|rZ=iJA3@<HR&oky)K&}Rv?t$FW<g@|S$KeA0D`^<l3FBdn^GU%SJy1lt2nawc
zadTM|IPu<Xy{us?aeT*oQoYJ<V{w)C{8Z{O?-2u@*9zJ64T2vO_6I136e#yR?(p1P
zk(&c5yNhjcmS~{<XRi@bjtI9;WtVHp?XngJ45wOt@GoeO@6>}aqhROTvSq#aOXxpW
z7bB-?-aL6M+aJ;m2Jlb9M~z{3O_Vu~$IT`pU$=Ov@d~R56txm-R!DN3O1Um8qa|Ko
zO1D<tbx*?L)rlqo8P?;Hp;%kzZ@o!w;}EzO#0*~ph2GiYju@;P6d;1?9Pj3qVGorw
z+Ih;$j7=2rtVp`_mrgQ&ejghB(9LnL<HI>+^U2(O<oM-DMKU=N)Lmhe?v|QACW+F|
z)zJW>d-SOx9=R(<%hf05CI2VKB;1=QAJ`f^kL%)rzRGdC2Q3tLlS{TJrGZJBBfE5^
z&$C?<yDNmAr0v%cZB?;(>c;im>>5+%-2D~+o|okB+Ks>y&qoc6X955yvxKd8TWa3!
zx5{#8Fk5cy*n{{bZU`96PHQ&$VsR9cd%6e$R<_4)Tx0s*5}hC0SD87-)&jxvfB+bD
zeheHP#Rjm~fB?TbBvJ+{As^c%<YDj6wk5MQWPqP`GQTd2l(Ibd3pJ%KBmF|`wkLj8
zTDL*{m4XF}x{H3EdHk~Z2{C#rJIrnJbSnn79d(ayme~+aJ~%K54+!YxSX*r;ER~PV
zp4qWuY|JiGBa!u)3tlK{IA1iwyuuKg&#JdZz1<><woYxO7F&<ejLCfm7pd#wd3Pz}
zUR(G#)oe#wlnqV74GtiRl9L`1u@@xp{SJ4^yANS!U-?>h$~Zm2YS2QNJ`|?HrNQlz
z=d#W>hM|LqfAtwd&p`HY%{qa)e7mh=sPhct%TWmC%I>%2yjfNG7lHLpW+oBZp0XbN
zeJtbL5Noq!2dC)Q(xA0>n9XwTk|o_joH@Rp^S){2I6<S+Nmnfb;U0RMjgKJ!S*tC|
z6#^J5`AU}Z3lr(R>odi1j>25)9-7vd_&XgLlr6w0N1Z|K^4EQNZsr5`dscgNgmPAx
zT>5xU&V@SPowBm!bl7dO<$ECSDH{)C-CK<nI6)Ffu-c)ln}gctDNcT)qjaCWQbh4v
zHwa?HE<&he$?*<49(3D^ea!3Tn5S}ZG)YmF<@6s@xnhP&!&_seMRapho(@v<B^Oho
z&;tZY=@$=M1Ml}%M(LEE@3uxQ$OddERbbU0H#FC@$kN^}lEHtpuY)WPSGb;y+t*yq
zk!^L^sGc=7J@ln^la}EU*NU3*wBaEhvCHV0ORH$T)Mee78q6s?Ubp0;)o_oL)?+9o
znWxuKGQ>uI?|nZ+T`w#&*qx2jh=c>PUYGDks}HJDO-onh%VIr5nVQDE2@!@Q5d@?q
zeH8BS13$bESxuxIcvu{6u*r2xquJE-3g6z)ECzw=Plc#H=VUdV&D{ruR)cGZGGTNk
zUVOHmTZB^UbaJ-fqc7cbNrMii0l8lQK%vo`1(VyFAmIzxJdQnvw$i7y-+9eAv6UY8
z3WarsU)Pp{(*3P&XoHIkS*~U&Y&O5|;1N8|^tbz$zA}WCV9C7@<0IMysiRT&87<)T
zS{}Cd@0NCu?9rqX(=V8+q+>ebk`jqFk{a&nus?J?HvrmL?xxaL;eo!z_C{ZvaPsV_
zV8BPz#Sm68-R4`rq`F0PD~X(Gcu1Q|p?a??IG&UC<Ma0K1)8?fhuuh7&No+UUCN1q
zHelvDp~Rh)BqoQivCFslWrndrM|tGl`Y03eG(GhoUo|z-#iVnhS=h@^Sa4u(aDerg
z3GPd3)Jq{LDVr$|N7GVD^@2#nG|f!Zue(zDf9&XnYK`9+CbPDl&aJG4j_A%(EU|~|
zt=pYy$8)V`$g@(|5TfH~JV1PeW27|-bcbL18+oS1Y@JadZTEb}a+({j%^<BEVxTRp
ziW-?*8g;jH^-lNnAt?n8x2(7-t_<5}85W<$U%~72TkHND{7l(0ARS=gZs_o7AONY=
zikPoWj)=%(?tL?)#RuLv{T&LIbm|2@UBl?~DHZt;dsD>dZ>PwI5o2Fw#%wpoNuA9h
zBvms&OPsa<yx5aTWA`DawIkIw?TE7DL^DWZni*uLc#ZXwG$^n)D1hJCwble7`tbRP
z=MarPM>SI>y-ZT9#!;vst2B;rW9ojn1g!gFVbC~jyp)oW$#@z5ATR2rkJxRx4?|at
zLa>rJt@g90oi5}D)$RWZzy~D(i9D9_8DT%Zy?t!+DrDIU9H6?~xWOY8T#~|4_-piH
zKdnC5Zu$FcLPhlFNaxEW9PJdI9^qzfD-Z^jgEz*Qy}^NCAR&7EhNnqk5r&7vX>rB-
z-j$qY1OD`~SkAcySVVGmv-DWm=eaIjEq~QKV-b!nFSbgwr(-m{M`Z3T^r1LbE#A<L
z=Ptqja=t8MwHtCr+jj?usj_J1xTZYq?Y%N$V`~UAXY?!e#h$uPB#L}B=H0ap&$7)&
zEFm;CkvyO6O_3NU-1}@5a(Xalw$bat1H}Kv%LrEQRJ;^^=$fExXUVsRVvznMD|C(R
zM?7$MdOU)?H(4NfJ~O$TkF^Q%DRIM4W+#?Xkg!?~aS;9;5|2sjO?TGJHi>E7PR*1w
zQ}UFlLc<dQ@nqdmTdf9IS}nEwCBA6HXzRsl82A%L&=T8GGYMVzKvQY-=}FHsdYSQs
z={uCh%~unCfi5QX{A&wik*=Z8q8l+Vb%LzEsmPP^N>b)*8=k60ZP{arOj(l7CyCnV
zY>kio3_({9t-=z7yxPk5P<Tk%J~VD1NE^2}c>bOde(&JtWA5`X$X^z0u`Q6WfZ_-&
zBURc>->rrzvKsj6&o15eC$$Rj{_ZJ6Awa6r#GiCwuc43mi$qTDDozQeg+|Ir4QEC_
zzh*#>(i8l8t#abSZop~`k2c0!kp;+>%*dP!?CoLtmOf(*f!E*EN^W(d!mkUo(>jd1
z@E|FNtuyX6#pSw^y89?1`01I2;Z>O@>yQ3K&g3cQ;jp2mGgd@+od7zaWan{ku$R&M
z?0*+t4j*J$CB^>`viuT^_0Cq`$q<R0)(nclUobWqsFAF=;x3#JK|VJq`3BK;QUdAI
zgne=5jx;WWR$tO}1dr_lUJQC&>wO)L)3WxOkR})EY+=w3hyiA5t8S?wMf!JvgsrT{
zQjTe7W@`(VLF({Ox*FK7T|q!J>exn{hLdT@?Gt~_mZj>C+F1~uINf#sgA}Z|raapo
zA-h5dNs;3g%%srReN5*O>c=+6Xf~L$9qZ5c2=4X>01Ej}>I4?%247kITS`Z)ID}}+
z4XW165ptJ2<oaMcXhi6H<3l^$d@UsA;yhTpt-Tx$&r>DFaYr|VQz&3qWO>s+qc0c0
z2D90rPMET!iIT?xel<Fx-{vT{^~ttbKd}8-CDY8T2OyEXn;HS=1OWgUPY#rrAk4kr
zo3LMZwtp2#vx7pZV6{wqI6RacDTv^!KWz(iKHi<kNC%^aG`NDNU1|MpT=#FTHa6F+
ziDKdzDIV%32KRfHB}L=0SzWo3h*-IbBwq;FembYwPMj*t730KnW96kiHtaCTEyE{P
z4mYt{HN;DvMdX|5x&U!Yk8;~$RLo!`<sOJL`s8p;9DHr?x|=<0Up?@$O!_^_!1Q&o
zyY&>>#&T~K56uvQT+Ao51S}f7_CnbH7YFs=a>=t7K=+~4u{)ov|CC%|*_xV1)&Kk5
zfa&_vSu4d?kt2m)w0IS(?;-DyQCad>A=pf3vYjbK?2NnTP}!F}K1g(EYq<Ndj}gvy
z`z@2jnSee8>-JS8P!L|50S*D!?b{zG2HmyUg_BdT4=qASy|G2THfVP_%FDz<ZPFwH
zIh^6lfoRF8R<@kr2(A<oWrgkG>5Xs=_qUdTka|en{c_JtC{@5m;^F#fO5&C%!mL~<
zTt#n{Hxx*h@ME*?GtJ)K@)5^clbO!7A!q9!(zC}ea6_5e{>sZV0>~H3CGI63`Cev>
zIe25ggh7|G;%1#j`vMMkfkZ%BowMO`^dwWlz3+PUrXzW0MjHb^JfdL2I2ca1mCln=
zT=DXAm>DGgfQXAXEW8(_UqJOYJOGd;5;ST%;XX!h^D=|cym^g$^x>oS{OQj&l<?Ck
ztf+FZ`8E`GK{-;3kLgq8MQxekXi*kGFk}$ePZ^Mgf~A&aq@+&~&C!0_^QFYQZhmxs
z=`#@-#{s(KQ;+Y1NzcY?kfNh6wxapm$iYvPlV3eL;dnU@^=-Z;NHd~BTwJl7|FGx?
zNgr3TunUhGmVG>4a4$7Y0vo%Q|D_ZpA%#z|mT|~Qzb$L?LyJs^nq!P_FXTv|C8~X&
zV4*mzVp+VB=pgH@<3>$=X3bjAaL-*>1^59TV7*sqZssGqeUAJ%pI!f%S-Bm{74YK%
zM38`be^EmO;O+YF3cmdQGhq)3_OjUKh&BZ9++!#KQDrQBxiFa9L&lpa=)c5#7yIpG
zLlNXwA3^fGeH?!(nsRNDq-A*2)3ova?CMC?Dx$6j*fEB+$aI7jLhr;%FVbCN*(e$V
zL&1CNA+9cs_G(Q@Z*5k*tAkzb5DPNbW;JGnu^wp0jJ5P-$0{bT93szr=J?eZcpO{|
z-Q~_Pnq;Xg$%MS5Ijk_X%9o=G)T8!Pvr_17dD_3MQ}cYW(WR2r@1i49tfTI#g_^Fu
zHK_v~fv4xyEcVs;-p=l&K9n!KmvDKU%F3FxM2>zu!xA0Ln~koppU^@<wB{}_=~N-5
zLu5eZ;t;00ShO*@qol>BHCQ@;<*;=f>O>k}H(u-*2P%s&fGXxdGxlmBl60!C+@726
zj5azK6X~}D0-#I&;v%Bs`+5spzVuxFGApel^0XaCzYNs9rc)AO@IyqsJ~LEh?%dwi
zi<&I^crFVVBq#ei8;m)tj@vn~V4}%$ncCl84SpC7Z`E-9zLH6R1>{K1^O+xi)GndO
z!qx@9eOLmC$5-qMoq`FOBY9_O)1%{;IN70GnS~T(asc-z@XdU`RsV2d9Mplu(pb~o
z44!H8Jxg0Ar@EJ|15|3B6P4N|JvsfMf1>I@Jx@2&<Ro2n^mMlE>gGH*YuL(q!k!TN
z&WgcRTJy<EYr5jKg+A;V_#JtE`n?R`v3=o#<KeyHb|HuU^-w4!?y=GVIR%D;WBwDv
znOiqQQ`*dy7U6t}NjFZ~JlUo!6P|5*|6QR{tq<w)+$jeFpSF|n;n>X&*Fz!Jj=r@w
z3FKDzowhUEf8-1gRa<?<wlsE3P%xE?S8J53!UK8TGHsm-M;h2>LuG<(&Wsz3yBr~}
zs;&9$bze<32ild&J5Zx_k(XvywxW;ev_$jG6~*N4b|+E{D0_t|yco;#^lgSg!!Zd=
z#e5sSM`^om)qj|ECb@3C5IqZKM6}O!JLmUJ{+1l@D)U$8-kj&Ragsh;57MsLX}uqh
zTBgnUIzlUmf}V)&WF%Q$^zfQ~UW{JM`3j=#$KNGs>4W2`fkJkSn>aVvZVJ4eJi5np
zadRw(-z5$Et`zHfo^6M^qH<>8+MLc(%@{kX{WS{nICQ10pHv?uE%TA(e7Pf&d>`^D
zOWQe~yBnp~wPHO`<*2$#WiFw9zEYdUW+0czf^_YlmBY4*hozjm`?rb!WCJ#fbctoZ
zdPH>HE;v~1X~<XtW*K`zQ-qqRs!9Sy3jQIFueRNvJ-s&)>#N49w*Qv}5Fic^)AX3R
ze|+kz=H;D#ofNfQvwWvffySiG`&SNkDy;sr^4HW1@yZ<O=q^tH5t2<@1ft4Fg<w?q
zQ7)vxhH3(5LB(EV2=^aY8q0G1J<@$dxMzwLvx>}@rac#Zs6d!s7Y_JzCz*43dn_t@
z$*bzz<8%mwT@WU}2yR-Re8p4D<TA;Zh`U9%idSkjKaUG_sM8GrFdrUB5uJ$ZKe5e%
zX4Y(RQf3{0C_g#9(d?v>FP_^Gx2xFe<h8fwGZIARO}X)38RAofzHA5TJ1xioMERs|
zO-_`#4UR&74VMNKmo_G0;^n=(yzD(krgB+&Sh>q1Y3HVu!(HqB9(x<?9@Ob=&DyCo
z%R{r6j*%xm6isf3^x&?LNLBfV#{|He1gn`n?va5H^Kva0E${*}_!GKmbH&8822oQ)
z$8iipkN|%oL)xI376&9lxo@4Rg!N?<T6a(Xe43>&;`DNvBWE!7LAd7biY$`bGjkAY
z1RwAeRRwD4U0j^*@juuHg=1%yc#<xOg14&+UlOj%W%yzYUUipPjQ9hgzJrr&%E~Qz
z_J5_HwI5W@JtY2`9fyT=(r;xG8Mk}K#5Qw=d&&%9;&Nsim7HvMK4N@atz5-`(8e}B
zMM9G|izTI}+D>OHBCj&#$)tv(_~&xn&7=fg=wF!zWy^0p?um{6U5YN}#uS2Z7w6Pz
zndT^v-luSdT@B3raN*k8zb6OuB;S`}U|wq&G?bf}E{M0c3zx19B7yXLP@c9?$=F%i
zTNf;jNIyg^#5>Eb!2$3D#z#Otqxuwgzm#bNDh0g@z@D8cT(>G!6}t7i#_G6!s^)7t
zSC>eReB|;Jn!!41iukA3Lz&5U5-de|$v}n?8RGULfsj{?B%{98vI&=Zw0Kr86etj&
z7xr?=O@{cGT*KN;Jfxod)hPRjKZyod1u9TFegm3Ss5Yu>3a1_~S9cC3^$9%l!K|cu
zyr?ULuco@)w4fLYw0w>AmaH~M2cc5Xkax6XQ7<y#*-iLrmP+on^`Pnx{+>3qgj8sZ
z?U#95vL*fzob_<0gDz>xWUU-4luE<}!Shr&3V!~G&f)tz1-odv-f`G><1ba|c)Z@;
zggB1qaifQP@eH0E>s*cJ?#&Y3I0>>_0M7>D-2RvvFn;O+=UY=)<5npnH$mdFca=@a
zeL5KXBGD=er|1JWhUo;#fCfFTuhjH%6t81tpJOUzSfc|GY0|UTas-4T()4fhVfS;c
zam|iyM@4?>V<B>hX?0M79-2`-q{UXRfK1Xk3bt$^a5tN`<UHY;w(A@Yp>y<Ct7{cu
zAc}B=q`mtCEfYnX-1S1DZWCm>(1?q~{y;&8pZ~zKJ1n$PG?lKp9%IaMw<<mc^Sv(?
zGyN(0!V9n%@@I<9ILW2$1pzsvP<rHzG_m}$TNv$UD5Vn*kH(EH#&(bYD3o`tY7;h0
zn_8HA1JNX$X`hQn+1H+{jN3DBI!h%B4z;e62RVKp$+aXWXG7mRy0nVhS8O5t0_Zsn
zL+ZaCv|DMQ29`*FMj?vB|IM%^4)N_;{98(}Dgoig*D9|1C(c9UsglpXOqx(vJpA9A
z+_PAgO38U0&w7P=rh1Nr{2uN<lV8_IaP`HSiI3G&lbbGHGoZk}zwaf?fqOCI@TX4F
zJJKHG%>aPkF6I<i?#@w%TTS``#_gJZ{GqzIv#a(NyPwI0zn`0sAo&Gk=;=T1E3H)G
zw3A#FF{t^PU!@IUZw9Z>j!u&x)MOXQz0|3F?9Ds}Z|hm}1+d50N>0dXZwD?`oIZ6|
z!EAzQ($1I^daTbx|BgZlW$k#>+IHS?Y(<R#5;~4fdz%)3DQj6-5+bI_353uIlXLB2
zi}($;OYmx$o>)p%k!cOlk3KwvaI+PqVqPsU+s&8g%DZddDt^g>mrM{)MUS_}@BZC8
zfWj8{Yd1~OibUdmo#*+?T7ks?;0<WfA|XSay8lTzt;B~?<0;Li&Ive9q6-n?lx1?2
z*O#yss>5*CjXl{jOKOiET6Z(>^g-Yz-01X@KnHPR$FejaN7TJsUMoB}^pI{)2zo_3
z81@iWK9|mzYX^&^@7dUnIiWR1jwkG1u~=d9OkJFoYji9uGgMt=)NsBbB6pRw0%x{9
z^zk--Tdrx+?`w{IY&N~+qJJ*_Z0o;WUGy<bPzg+tJ~OSR^6Q$mD&&=#FllnOzN&lj
zR#r<+8$N(R^e9ySJKkQrMg#(Ids3kAoU$Ml1@(Zs>hJ^xVwz`kfR}FjVEu;5YYiNJ
zZ`d)8+I`)QJ)t#0{!7^XGF)T+L|0ynvQo%L@%~JV-tVa?W68Ve!%sPih87XTD&p+I
z#hF7L3h(#3JnqZ=is#%Fg4U8}8$G{L%rZV#jH^u%Nq@G4W@?Bh3E~~%4L#W3=j=4L
zz!8{2C8t&aVM}~uAsXu8XuS>J^UM5P<VpK=Fp(5yqZen^CJB@oz7m*gwwd^PNBH6F
z4XG{l0oY;SSo2r9>F@ftnyY7vc6$*;doT7u8K%eKPAhX6LqlDaFa5F2ru1=7c;M8f
zYLsuIZIzGP0jxVI3m#+p8$NizEdQ3n^@v_=+vc0gPUR7P8HM4h5qg)zs3GD7)~z7t
zG9#--y+&yxM5$U<j=x#b&Z@ul4~OPv{VA?3)%FkTeUZgw|H9Lng!*!6U|=~j+QP~V
z0;XbNH3-XLbj?tzYET>ifVIV?@nWpa-qh56CGdmii#73AP$%f}rk1p9UfaP3$#*M`
zXtp01HQ;2%A1D^#sILPBN^fIoZct#KgaPPj+8}IBrlWY98}EjCoI5Ll)I&7xZ{f@@
z0s-teh+{E62U(W1qyPGXERCg-j4@oWBa3!E&HHHotWAd;zTMfKzQ0@yzFJ10ZtO-w
zFv8xq&3$?~W>>c&QaNqUOQ(gd6Ct@A&7QfVVj_Z?ATw7^#oEo@oSKE0&=<p<d19X|
zM*gFn{>sAKi1BoELWKbIlpbhVOr)%<Q66)w+OP>CzdS3#R19=Ve$i8HdFW6Y!p8ZN
zxF1LMSJs1j=+IdZ#98#-L*aLQTw|w|74030Cai=iu=;(@B{^-5MR_&%fp~YS*9PlQ
zawDEqx)s~dVQcqn;9ve1XUBc>&G3T%t+x8pDp=evwd;u(D2bRf85r3C%c~>a7CDkX
zOejM<g1Nu}0k~Eg(Z65&JF&r!mOLHnU_tS6&gE)tgzZ%8XHhDq$n*rzHL{S;jv355
z%n+|Hb7kCR`(us>bkNrPJ#Vw(^{##nhsEBVcEx6}pYbVrB}Ea`T%{L^G5Xe)lLY5&
z`=*tDrr_Fn(cZJ-Rh_C31Hez(eej{`!$4W8MRc09cDvw+qD+N$aP0zkvH(UrLTE(t
zdHMgn047NWvdtxXm)e`ti1IMMIMXoQ42@8Wqv*ZbJ_B677o{vdSUFdCv(OC|XOmL=
zKF^&iGRzTJ%}I(<D+dy<pU-4T4u3of^87k;v5MDC&6Wq6hx*_Ce9lu^003`OWKA5!
ztAur4R0T!by=ayU9TB_Roj|q<4W+j5c9Guuz<u~dFED3gxn={gg08!{;?K@W{zRXh
zWiS?0A_4TilcdgaZCx?21P<@NMNGmXZ9Vo^joPN)ebFVwpY)bZKE7V_D;DFVtY;wo
zHU69wi1=)@$yBj=RKsz1J{p+jdt}-I09#Shb22|)OFM*th8idCIjOt`W4bW?zF9i4
zkx>VK@luU)vDE@+)?bxznVb)_HZDZQiid@REs?^i9J|LNm8j9%wBL*Xsf8T^c>Idy
z^)4yikHeXKm2MS&fJ8Z%pHjyn0tCQ`9M<N$LCA*8Bo^o}2b{Y#{%f;m1Ndb#PDDw^
z!^hH_ZQQ_LpW_&lBbHyYZt*ewzMWdJkwFJ}b8v;NCuIrDs<A?SUw_~yHbNryv*t8k
zX^Q+Z-i9hb0J~xTY-j1#mek;^>CVA*&-Q!xK0+or)(jh^d^(dYK9C0xrxEaS)AwwV
zEQkUF%nB9cm`8h&&<vHso(5ixylrt+ym&d3>>Xp=?aki=!CcPdWS{G(DFUSvbJdc=
z;aM4Oe9u=F(uSdeIEg^W5VR^|i~v7UZjkG+i{lCyP!})!@I_mOjO?oqZF@(he|^_S
zk&cu!@eis?`l{;;vok6}in(k**&CFsea5`MY?FV6I+r1xtyjgJL)A^ub*r&t{2CX1
zV|z&R-CVDFFm?gU2U$cy*_`Jd%S$$2oh(45yn%3!@+YlIV_?q`Yi-XQSAfu#=78#X
ze4uufFAn?3AsMpF3F^Ww0Vzgd(0B3qfgMDJkBV8q_<qFuo}6?;H~qbou${(~+Bdx{
z7|M#p1c&MJUyk>yhL}&{vFDoc004N%kR-{23)e!WdbE*h3k)+M<i3E{t37@d%!N{@
zDCVo3X#`f;)a7^K5k2KR3=?nT?`HS9n?<NR;)6d(l1;iZU-owx-j1<myeg5y&X!}{
zPO_q=!o)Zh`$zF;K>!}llGyy!LNtSm<TG)VKip4N__)^leyKizg}B9A*7r&wdb^9T
zo9Xgn(k<O}Fy@g&`QbJt63QXC^LME+Iu$Fk(L~#0cq3?7bx~3@f9|H>ejfn%7JW9h
z3lAAfWwKt#`zvmVRO^>KvAa3p%TFJ-bcd2etIQ}Y2C{4&^K74WOxil@w;Eo(#;k3b
zHFj6GO@#NGe?Y-+3YueWc2LdX#^+Qc06?}~T~-+#UVyK(L9tSdcE+kzoV8yOme+Z5
z<yNdhJlYZnaKWlsT1d`rGYHvjj%4FWwvw(pv+y{VRadgexrW+qn5w#*X(xW)Yvq{t
zUqeNczOl_xB{|Qzrm*xh3?7}Bus@q848sM<`#IK=4#Tw-C%~0h)Q+C7#MO^2N1hK1
z*ol+Fy{enc)xv(}Fwv1QzIQ{2zJ2~<6Qi#ZLPL_dFo=9)zvx!EISV&vs#slJ-EplZ
z3~!8XQpr)!ou=p{nmzvY%p0L%GkjDTu@HRDS;TaA$QxDFVfwG#jgVo$KD%2SQnj`<
zPdZtD@0IAO_~8TEbxcFD0d*N`)}3J5`WURiuAf}E(q=P&DiGdNc-DgQFg-3Y&V9W3
zg}!nx_J~F#D}uFTchU1(BiUu?P^t5d5{SzkKm3zKI1{-VNtneUcJ-{znT6&jG_y2)
zMJ6IArs3>s+OL1!CN0%bEZ&wS^0386ifBxb-MHMp!^LwImCW3Bp`q_#pf2n>xfP86
z-cL4b8A=%PDo^Q@b22q5FY8UCII)}-@N893pren%)e2}!$leSLBD*LMN0V~((*84<
zd|?a+Pd|7Id3H18?Lr?PiDfN&NnadeT$>s<j~F5zMN7VD>3Oihq0-2}iZ_DIz{7{2
z1p?<2j)_lAibmx_rr#q*zgU*kjmQA*V*oEE^H)TUqXJ1BYPrY=0~H77@6$V$^wFgZ
zx6?QU0Rj9i!{)c~moMBhets!;4OySn?gZVY)nslWU2$8~*@1vkZ}Z|8=r&FLSE~O-
zL+z3aOO1P`<KEM$f_N#RQoEPZw8uwPzsFZWQ6#Qx7mr+UE<H0+g;(lN<R4+>A?k>J
zj`a49Hu%Mpqj9xQT8XjFYJ@t6g!x+bhoB@$2+8;nvGNyev^cPT2tkdE;$2H2$|y%f
zAFMDiKOR{UnO-AP5frc)w0>vN!0>J*B|DQPGn1Kl(9iesnt6@-24-}2$ILa3#lvt2
zRmzl5^jFnT^|Xa&ytbEFq*T*;LZk_fr9a4=06t&@K>ONCRsXgAgwUkJ11@Zo{U>1}
ztDa+YpF8Qf?#h8^YGGKC)MrbhmFHkC5sPGYDlWUx>8EP$n<8G_GM1jTRwW5Bo^H@i
zcB8|DxH{3G9eF(eA>PT(q&OzSTeDPRb3eRv9-_}A2!h&7znAx~GT&|&tWy+sKM2*F
zws6&w$S+Js_H9HC@0J`h6$o*4V7X=W=op){M!Xo=Vf(4%e80#cloec$X}ZLw7T125
zQ{2swD#ymCJm`<VHcM2plZ#(7P2rWnz08@23i~K6B}(i^;iAK<0l=S@8D9T{T^c?O
z(g;#s>p|A+<SAGyN??ub;FA^RA$RNEw}vCXqIW)%UHQAB3E_rK6~EVA|EYUY#L+n1
zkTLWG{j!MwQV#n4#a`_Fb_{ZnbU{!@AX-Ic=tQ3H_i=ioEFIR2(`<X8E-vF2(l@Cz
zHlBfYHvas~TxJ0G^uQqY`4J`H87Pxf%k54dEbV+JluMS2kTig!IH6*&So?y=2$s7h
z-oTdtN4t$X?zZMVpR%|oOZrf!NKv6v6>z$#?R3Q{Hs1RBqldu4C2Llz>$jF`O8OH)
zhD}qGfQpN}vc#<T5+{6FOcR?77cxbp5d9R0<jqU7v2nCN)@-GpZtLuol-EDn=)-id
z632)nsVnqzH-D27sq6`4ka#e#U<Xy8G`OCUSmFc%kq{8@Sg<AY+?)d?3g(DteN^J!
zLHKV?76+x|KQ4^37KNU?6T^7CGlTaBYsFU<qouPZ%h)4Sk=r<o+8lehyK~p`l^S9|
z<=yob1bqix^1XX0u}`(S?>oDBi}T>C98HjMOhiBoLv{q@DIUAe`EMgBzRcV=k8n@N
zgsRLgsG8ewTV7)=PO4FwooU8V`dhfe5A>%;n6x)v^~<o?B*$%=_pc^Wk>dKCxNQf8
z4;(D&Au<rO?wgJjTO|(~06<{uFlIY$G$RxZ%>{^8^xn`^+|rPZ(ZRyYl`=YyTTZ*7
zt`>OGsY!h}OD@k`<g-%?p=rH>S_ZJkMLYM~X6@<2H1}2Lt3eSH@hQ`YeREp8lA7V_
z&52u#_Z~f*{Go1l42ix_sNH`fcV;TTdzyG2bBbuT-EiJqCUGW4(){7S5zsh%JJS>$
z{LaaYa%H`&R$_LXOdryxD2`)|ok7-njwxfYJdI@m0$`i1FQjlbxE5t~$!J_buJ)ti
z=ldazSLV}|2Nf}|?czJ4a{2zs<%qPtEwvgzv{vVMty2tyG!GUQ?e9)n)~z?2yRU2S
zRqYA<ulI3jQc~^WV0vU7(>#W)4{@WsHol}l{2?gqw=ayUjZotNfY^tyy>7RH=1dYO
z_yk3$-9BLj8Zv>A9Edh)%5a-4DF^lC9gaC@tARJh@Q09j@rUqpM2JGjr@2u=4k5^A
zR||@VXy`8cqE&bD%(n0IOB@J$sGyyTK8T1UHqkTa_zSK$TCtMI6plwT+QGteD|X?N
zl<nB0f~CbJMh)@+DGsh54jMjPS&ZmVokAN50T^w)6{_3h4L-eF-gTI$Dpq?OU(~mD
zGgU*67#`T|bn9tqaNmPNVqFMfOD8qPhQ^YkA?ZYgnrLns)ssNoYR>$JPw<~R*qZ%F
zm5~O`Ob*E~fRHykj!QzCJw4#ESgduMoEELNV1T|9O0DMqI=4dfrlKF-w++uNSk?%o
z>$V`>&IP74Mj%EmnUPF2Iu5UV6?W1Lyo34{O;#5qM0__d`@Q5jPXwpjd#V&+7N{~s
z+mOF$njq=I9Ar^_sulssoBy#~|GWaHHJPV^9le|(K1$38Ejcktr>k;>f=<M<RSdTM
z#1gr@p%vP6<%Ib8Fb(k_!!HCYyhn?BscAiXBupv#G7AbyInrc{vPbyp0@*+Q-rQtZ
zkK)6NE^Et`kT=@MA(yr);#Er-@@k+8yML(fxm(u~uY5AM)K14Ed0dQdTu4wxt1Z5a
zOVOZ!6p*yoUuX|Z|L)MXl}&OR+*Q@6Tn5`gC0HJq(n$nsSF2l2umomV5UxV#%94co
zyX;YjQn<TBuT~d*{zN99YP4$wyT7CBt3YN5?T7o(VYbexl)cZf7+xC6i8+4{CRw)3
z^Lm(JWRUwX@}eEyeX%-g-iJzaE=Dq^B+Vg!C|+qK9a|~1ytpWm;aGgO?zbie78sie
z%RTh>5#f*SkOBf^)J4&W^o%p~A_j?W*G$1zvhXDuyXj9NZ2pF*rUL+|13C+Mvv3T`
zPaR?X;eU?YpLY?k$jIU6Anla7Q?T>c)A;*DVigFqYpZ>JLgBqG9`L~fK0r-b=ef3V
z8L%Y0U4v<)0Xo2oY&v~)S>oBpI%tqQ=f7(pDZa!@A+&M(0clA-y1;M)BP2h|Iy+0W
zvB@HZw!|o54;?4#hpKaJ!nqz%^V4?gWTaNIan%rZr($@&rRB<j(JztG64j4uDH*`c
zTO5Q@c$8q%X_k0{B~!3g{@5b@8h|nFr{gC>v{UBI0e+#LOvlT-oC_LFk&<xLQFw(b
zNve+!j$q_)S?~fK@cDR_yepEV$LS3NZB~a4pqLKOr`MF^VNMXdh3WtHg;~z5;<V%3
zTgSDetfwf8eh4IasV}wi1T0^gQMz5N_+5kA`Z;^%)R@H`Hv_oWn{SueA05~qz9;gz
z!L^GgZ=Bk`4Ud%xw7f5{cg!=Yk%jIq9!~nQ#kA<^J<uLU`sNtC`Pf7Ryga_Mt3$d)
zTSb3dceN}0G9rmkgM#ZE?v6P;Ht6MsyECyI8<Pw%MWQ&JNvKeE^~Ymbvnz%8xiE&A
zCH!(1$Md7Ii!MR2v-u&$E&uHNG-B;cC0Hkpe~;0%SwUG?;+H!IFd#9$0y(?Es-lGg
z0LZCLPDZ-6P=o|1yWE=7VN>@0hpoxCuZ=SU!n__}b$A_1l%Yng>R|Jr6%Vv(`w!s`
zf4Ugo>9%EzBTKv2DeV5w0_J&$buIIZ)c#1bv0x&VQ=7(NtK5)mDLM}1Gis|*^1OAd
zj*CWO(SNO$e~1<?#WGIg?!#%!Z|8*`E9-vz%t%D&{TbVE=Q`T10;Gnp(x1_HO{gK7
zZxpc92loW>ROGzF*pzkhaI9yKX)l<E-n7vT8@ke0WYUOqhJ2Vd>l?d&Xi+)COxVHp
z)Z#Mh#Ktm^>{%e(mn=x@Q6w<2$m`@y_=Q3vw3dIDLZ&-Jo-mQhUQ9NZk?5+GE7sUR
zOt8RVOOF)oIZ7th1w?sla8iiD-qVk$$nYhoTr6I9C@JL4(^9BuF;CVnH8vvDss|sG
z5vralj=C#J_$T4P2bg@3l!_YpF}12P;d($QF595esAm{i26uwd0k~Y*g|}A+A`~%N
zw_0kb;B+Plt~K9e#pfKgM_}b)R2?J2Sid<6*InqNj|Mcm1yjNR^xH>x@m0fFBu=(d
zzK>?Op3<Yy+i(rAPsv+2>y<i^BxJ<seBxMF_bOwQ?c;f06L}OjpSfz%*Uw)JHe4xM
z_@e-cH$~DzbtpbSHSG1t$991aO@h>~SPTGPK&Ijc_Z?{5izZm`4)YJ32M;#I2KYft
zX)%6e6;l04Ar&i`%pKR^EjT2vYjR1CEFJQsb(?NU%@bpwU8R9X@HFgL$BtlbeLFqp
zo=v{&o|yQ&49jw49tj7UgSoKDWLLV?2H^`3Y=z}Rpyh8<Y>m>)2qdjqp@@ZPjR58s
zA(zpab-RcUx`>DUx12Q0C0vn2NDTXTiX3y=y*&DWK7>mL?}A2RZt<LP?Z-#rFf*JU
zopfoStpA$WZuQ&Q+C{W*5Yz4*!O;G%bri>7DRDN5u_d8%6c@)xdmrP*q-ldkXMX47
zvw0=ye`K!VCsbydh1RbUY%|SYqgZvY9WdLUuO$3aV`3@@XVkL_KL~J`cpA)yA@X9N
zDV0g1@?JF=nq0$N#=k{R-e$@>DQ(<{!l_V`7(FB_iuDiW1m_14z2^G9S;n3WeP_(j
zKMm}TgTezx;uSXni*MrVWC%l)aq5jI56K2^`+td{YdowqoGjYR)fJTGM$R=63~EBl
z4+3QZJhb@jeLHl(04Na1l}3fMF+ZTdFnTg?&bvo?cZw5GfL@9*5eT^W=|Cyqaj7#q
zQMbYkq4h@}S~#;p*AaedE+-S`Rmci>5HYGTkVZ)+(o2DMHlt<v<<hmH^y(H<inq2l
zRgx6*WF?Q4qp)S6zOarx**hLVj+mfN!kQ_W3&xs;&6M*b;G|y+y=-w%CZu9r2U{F@
zVPh-zm~0m9on*>xgvuT!ntW}Lulu@r{!bU~ta+$l&I_!dzEtne(G5ZF!Y??$jj;qx
zLiG)o8kxjNoTGS!0?1kZfYoKmJXAzm#|U8NZO6FU-n5FmZK}DnRqAC6?Rph@UFMO)
zVUX#=DPZ?fVvSKb#KV%q1?pK<3wYOdW{;~J>2N*6Y;m?cP5B2hG%y9f3xh=(axfzd
zIav^(H8QCfHX<53PuXLDUPV}K0t17{LHa8s^zb37ohm~D^u&S!(TtC<!r{RIgVRbL
zi8&j{>SfLLOi@5<v~T?fHsNCAKeYUPBn}zNx_iUb1M^!o<ET?<ffjiKa`?M&XxM*C
zi8zPBYe^4T^KAwTCgb<g{IxEh-T0P0{E_<6jAX-~%%+dD!dNLo-RfsZ8X_Ow!f2)}
zTL%Y@V95Um*G3Pt#?(a?oSv#fa~xZ`*@_&I^tDn<1Zq;c)==06t%&3AD}%^J;nB_F
z`!~r<&HdhLMi~m$f+=7mok4*X<0=Ri#<71H`muN8w&CaP(-442-}%4hQ~jh~*71oX
z0`_eIE3OnN+xEXAxH2O}W8lqDK{=8_P^bTei0O`gL|rBha#5jfbtu>PotB;#x3x^1
zaeYjieL#{{_5^2?m$e1^czlZjwVAs*^~`K;+#kBrKz87#pstIoB&wzQ8l^FxJ4T!=
zLK7dg;f6B`*G3fxNgby|c}+^HGCcf-@Vw7%TS+S=aANaXP3#XJh)<5bKoUq4-&%af
zzBGy7TxQWCbFw?J6KIm0UZd(EvrT1RuxOzH*FgA5P7frezpTb5F;<et>)B>`=}R9%
z7{wp9T3|K)8nMMCN@j|U75-NXP_Yk)f1RV@3%y|+G_z(d9cg5s=rGrh$WX1?^S9sK
zC?7?NzO9HlLDHrG7cIG<<{A<e9!vQCq7E&aQ5lsL9P@{Eq5%SUnM2R}^~9>UV1tMY
z(1h*e2T;9CmZ);BO%h?w@(8?~hPPXT3fz40nl|WOwGFILic1j#^bD%azAouEv3M$u
zO)G&hV_7m~i<$=u<~V5z72$g80gS@%H;)5dPf*Tc9JxGveE^Gx_bzwQa#o@Ce?~t(
z6d1~EG(AeqGlM4klmBp;;H|n+-Ks!XJZ$bMdC6lE+8&Uv#Kpros|xe5(XnvDy94;+
z{VZjguYrQOEA#2kS$v+rG`B`NQ>?Zexk6@O_lJ~k@%*`V9A-^3&#HuZO0)?P^x&!Q
z@4FP@;DDWR2Dc%<rH!&5r61xrwh7DIxZ>~_hkvSgKUQGpbRXHpqv%RgNxUmtM{pMb
zyHBzW$h}6CLl~;F<EdpTv)i4UM=+&~4o8(t?Bt6P3cT=XUh`Icjej_)QKb(6@4XYh
ze8M=kXP-@CV#$dc!_4i%GfIs8cZ_c51!aC>-2CmwVX^r}<D6wyNJN>WTpneV<bS%X
z2ny)o`u%a6fJ1zNF*^2$^wV(*h(FJLsxp*F{sW_lEpJDqe4}tmmJ=F9mnsEA>HEv3
zRcr;p4}kA8JD>~ZlqaO7Mk*0%lo)k}U2bF$WWLIjJ#SzXA|C|+N{Xb7D42x)Ffu?L
z_Yxw{cgL>#UzOeA|70lz2}Esr@D*qFObhJ2v{yEUoIRE5s~*1i7hmhdlsj1%n4O|v
ze}mOF@6DuRB_8Cxao&QN1W15gfoP;bJsyECD>jL<i)cTZ-c_2fI7^GO;qGv$YrAlr
z0$pcYH~_G*yI)p{lSIH$E@!VUDb;q)x#o+42wqc~bL`qOBcaozq7xy62XGY!mlF+l
z2{eL`5MwD(Cfxl|oYuIki^m^m=y6|H*nXNms%BpDPnV7yuGW9;JYkY55Utx4QWC!a
zJ4ux2ZTEv?ZB59~h-V%|ONpg))pwOIRMzvKsGW)N1)I5&I+ee5?pzwVZG_p>&5Ci@
z63nn11TvF5jpdKSR1ije(s@P<<ADHlpB_+t@P0%ft_N<d;aciglgemHW_HJww5=h(
zrFQ|rrYL@hg(Z4FGk3V1WTy&)`^r!WvNdcJR-vJh%5J31xJ4m6HFW}X9Xh7D`9FLp
zt31k-rzd3Rv#1dIo_mTPJD4Br&9IY^qIg)hh(l<MeA|0SSrN)yp|~F;AVPrTvWit5
zR^N<WD0pXF2xW=_Bfw9WgUmMifv@KGw;%K;_>z-F5gQXyUg2TEmo+joO$Ds8i6|xU
zYQ#Aa=86J#d{V&asx~OV4u`$yjNMvpTuxrpGs`>&9|~<TchpyhNbDE8Gr(_~5r*kC
zz(GAcWpB<ISIe;zML82sLzw9O)BI5k9MJP{i4P123m*+i{CBwISz1>7k9K@V)RXzy
zZP2I2k_cCCp-pPR(j~@?6o0-FO;*_L)uQbx6ZF}libCMMzqaT{x`?poQOMvz=%O9+
zOzzXD8w_7vW3M%TgP3k9D%C|_GC?u(8Kq;COmJ*4>1Ccw)DUH=j~0ZeHj&4iCYkp%
zU?i(g5~cmHKWK`F1@NsG7l)M~N~UqT4NxbvwaNUA6mfPAd2h}Bhif!9dL46lWOr;E
zfUbeFW2w{u@^r@#w%PFz!KeWKTDOPin~--4YDUJMW?b+E-R{4Ix-;oHkBoC-D#w)7
z{wWo}D!pfyDe{!7ea^3HsZg_&DU<ize2wV%pds_xf+Rq!iLt>GpG08#01D#!MNo*#
zo$>s65h$@Q_Vnm$`7(bj-k*@bMnNz^46HpthBxfh8a+wl($di4eva1RKGnH2cdrg|
z_QygSS|u)1)UbPPO6Jw)w<jC^`_hihzi-)i&=s~r3b`cy_4WZTrz>8iQ+8=73H%A#
zj<V*jJ50rwvvrzu25SNDf=pW2=BHTbQWxc`B_h5JUf+VlZXpI8Fl}CWaJ5YN95pcU
zJ^<2>jA%PV$=WL^pD#q|0&TW3N6o6Sq!Nmas(31|Yr~ApnItZ#3SJ~D&LvGlCbYL0
zgt>0A8pvkwh*ou75>(whzWq>_MA#wi(Q(@9S$S>(+Zc1X5ky;ViTD%i{2x}umxpnn
z?LOGo({MX#P<&N`*@JD{lVQtDl`PY)ZA3;GyXD|e2D3EcMpy!F-bgp<87#QAv$eML
z4Rvqs;T|Om>U)g3!1Yk`#JF^<;cGlkWb?gV1$Ut}{n75nr^wU(mSc0c095Ak4gkzN
zoKO%@2i7JCK0T(JeFo;!?d}JX<6dKqS7xM({_X0jTdp{j*<6vQS$~_QB1}ovg|*of
z!`f(z6FrWW2ioc=jo6S-y;W-IR;-rOB3dg!8V2{cRW2224dCx0xi!nxHp{I+nphC=
z`EZX+@M2ofm0w$i^rhYPE#j&z)XBWc12DUZaAIXkZ@#boiDj%~<z<~%lht?ZaeybT
zsZ_5ff7?OIw5t7TH^c*Vx;KgUnXucUyT#{mopV}xc|gjz+UoswPw@niup690fRh)&
zmTJR<5w2v<_pX1AClIJ+Lpye%#M|97uG<+@eT<R*PGnRLUZFI43bP*yP~BcgAw&lN
zhVZnc@go2DR*hdc#zOu0#Q#ok>&c16Pg9AVoG)x>GUIFn1<`Xk@WeTj!l|JDXJ3A<
zr7ZH49mbqi)_+@qoHdHl|IufKP0nS0OUO>#>nyJ?*(@APtM`nVF#ae!ve|@=xM>FP
zdsQlyePb$=do)7|G18j0t*(EHxgendHCp<==>UF=!Y*gz!Q4Fn0FUIDu?XdYF(|<8
z!wYf!sK<EF<7rx3X0aPPyHK;bW_#_Q0ck7gwuEnHe157Kk-WlKAi%GpFQ_VxQKM?m
zANnE-MLjuhWi-EnfO9uuO^JNO51{VHhA59r<MyT8ssFa0;XVc2sbyvFO;rbbgsNi<
zlFnK`L!ED%3f&7!Y)BdkFT81&>=q!N+Z8b`vG&htk5W1i#TMT%urugS#hv5Wy>vi^
z>w9bUl_%pL;Sja}CS;sHCmyIN5i^UDq|{-s4nYiK#%qC0iCFX1%C(B%kfnKWH6mdq
z%A>;9$QzuT@7|BKo2{O_h34BLd(*l^_r=rt#*vsr?Qj$KaxZV3ZGOBeTO-wuSKNGu
zyo@Qt9Ub;NZ@LvmVWvo#@?}2I#CuuN4`ZbAK`MlzB{PgUo#Ld8w2nTvKOzpH??zoM
zg-p%ru0IW}Stv}Ul0RPVZqmY=a!L(APO!+ZCIzRR{6PcX>*4C(d6;pn%>kW%6cgSb
z<p1{qK&Jfr@)n|U>!NcYotEo6WK)n5C$#FWTtx1;?ZV+?Co(og+SoO8rcl=;cS8^C
zJ7NL={$3n}Un|pP8pG1PyeRL()Q)X@6r#DZ%+gVl1V_uyFqDgg4Ox(F3{>y2#g&`k
zn8hcOfjEkCEklwEUj+^Sa#ze^xiOti2>HL3zBxRP?)m#plg4I~#<q>dR%0iPZ99!^
z+i2X_w(X>`ZNK}x-``%>{<pjLoZ)BY%$fVTfa{M!X~prh>O5k5d2bTK<{(NLb$!P>
zEmz8lUT!<R_02P>i@RpkP8i>bvcLV&NO`zr^TCkuk#0wNnG#MreKM)j$N_27K8>$9
z2&5k=#;!1Ggl%E9k2hBa(`7XDH3q4mZ#9U<;T$ey2mR2VDT^o6hu_PwK)L_v?2dXO
z?tnVe6$HwLz1^*sc*c3wIA!FKrmXJJ+@$X%FOOZhHZC8GW*KnNEvQjT<`CWS1)5<X
z%6`HDu%$l5lP-^5BWrJlSnIh}j16)Jrug1=;M3NZ)^K-y>SkJz)W=1@QdADys*ajn
zw>Bbe?Lr7$R6<^wW~k854^&|)jnhtvz?#w<qKN+l34$)q4MjJEFQcN`hdkLmjffI6
zEPA|K`gw!wn?N0*0L3vG_}wA=$V!f|ID`t$DP?Es><jl;Pm<>Y4vD5AfvS(Uk?G->
zFTNvB&e;Z3E0<7vQiCyZdk2O*vyt-pN~sVj1v%;?BCdMX#@XSPm*IDI1Cgly%Zm8G
zdWhc}$GN@sxH$7owk@V^4j*HaVvqc^^6)CUZmXBe7=~&L9rYq#4~Ofpl+<BBkk8nG
z2OgCM`SO-$Q=bY)lvrQ>%xDh*=RFRx@s<}WRrXp0t+VD{8XR7QxgPNf9R`0GP=4Ie
z(D~8_qgRnp`?$zvsP3!EHW-)#2@m=}m=x2ClYgH2SB8ODwClkjwIqt=3x@g`+tS3+
zjI+IwWoi+p^`1~r!t{nTgV-MdX1{o;r%7_}mIB!*&!Nq>u8acb(wO>R=S(_r0o6_;
zcg77*l}tK#2_B;E^*ERvpJj5PZBjR;3_sWNND>?6ZR4hoYtHQ4tUi*+iBgEw^^vL-
zBN6U)Qotw?i$4~)6fySqbvE)N8mNzqV(vMveIHazZc>P!s~UqWR?fE?qwJ*P^{wT=
zIyi~MAN3=|9@@Ng%HzuIfy2o%XDgWm8%#n^+dnHzUI(1G$vgb&?Y}S|o3)6>Hu2^g
z23|=#9<(=oVIY|+3!v=RAuFq4T=`tOtduAKTQLz#zVy8wJ{1cJ9DbIcTKJCd8OkD)
zF)n$G4Pl@`81Us(=aj4i$KY-$PpOQmnMZ9UR!O9UldCstSwrQClu0cGFwCiV$9mUi
z9Bs{Ui?1^vP^2c#6#jg42486wHN&onTi0uAKo;xolHNX}LaN$v8Ans@b1j!y#bnRf
zuM*WmA6>L{Uxa?Xt#GW4>jre}^%4iNvLb3XjY>7yXq21`1I=hN%0t6aTy}-iQdv!$
zBggt5xS|Sf{YNwV6)^`hGxS|4c@$MRYu_4XQNLQJN?m(drEe0Am}wgN%9$5|fx5+K
z@*c<}S5}d<0$X_-HHGWf5dA@|DiIC@IN0#ZpF`EwEok^0!$G3-vMl{(HCv_x%Vj3A
z#vssE5y9$2-Qn@M?`J^|ZcLzbt&y|Hnq}ip*H;`2HGE!QF4<3Wn*B+?p~;e(oO<Zm
z#~UU}O~S@Kj5#Pct9j@Ao_L0?J2xviY(IgM;ZAR4j2($xh(bsptpN`5_ILiMIyWk6
zCK2i5MKU;3Q5>Bh+1oJ)tk;xr^VH{VZ4r5T{&U)rV2oTv9;+xtO{8TFee9HacR!$9
zP8n9(-5}Ex8vljk&0-|ynwLg9yz<6QHYjAQlc2+R>F!7|npBrz!evxcH1z7aXS_o=
zuz6XXnSur`ZNkJqpYm+~Jc=ZpHLkC>4_GK00GzMP%LgCw^NG<NKX^5Mv6EmMUOhez
z&CqE|e1L(`cP;ht71+eB)6-H4*w@w?DJY;gET2o1EEzJ`+OD_CelKIjdS9Cm-k9Ym
zgMIaU;~|e^>eua*F)1uUQO!WWR>uqp3?8M^YD$L28fNMI;@<J44qGQIf&>m0Hnz`3
zoK@Cv@t5DfSY-E*#T3m9(;2HHT=V<9d7{^Il62Yt!&~;%hgXka9_RE5TDSlU9HQe(
zJmQ)Jf`fAIEl$AV$LkI^elp}25EuFJ`*zm;k-$z1+#lPI^uAS<6_xA_%@>$!2{3cB
z`cm@UCkAA!y`U_TT46q;ou3{!=HP<3BbP<@sEb_CGe|-1ICz-tlANApEEvp|yqldl
zT&dX-&$U<=Qr29$voXU@D1guCR`YaZ7-Mg}+MrbLdA3>`p!+ewF}^ZaLM2J<OXW|P
z>%7)DTVImng7}?DNPxa|zmAzDOoc|xqS1Oqh=PnfzWK|7UOZ`@%IV^402;a7F1gG<
zHjH*85GZ;#S2}}zpZP)0V;|#iuT04n7eV#&t?nQJ=|}T+P`l+nf$cel&gNRUgTwv2
z{Vks{tEXFea(@2L<dCcD`+>e-q(fl>cWt@Fqp-*P?<$gaflqD*_oGSUljm#4k#~30
zisw|Z`K#US+kN+Xe7Av>e@&gBnu5%FpCbyNpTBw#5!GrwmZzr5VEW$TnQPQ4TBhGd
z_HlYsLt#`Z`iG99N;Iz6zS#~9($zoj2YH;?sd5+8P=0ZJLm$B77iHS|B!2N}>z9Ls
z99ER0-(If`*^$dB@o%i8NmiU4uE;YSEkZMYeL#gFu+>HxQB+b4#WI;%%;9qWv+A0b
zk|F(eeOjH)@AIz0;<YF75!JNi^7QiD(qI7V4}#P6QuthIbx!i})<`BgcyqsL!Dd;G
z`pyo`^5dm{Wpk=c(Lg#MdaUy!{n_GJ5pPQEd_IX3#(-X%(2!nx`m@=q<>A(N%&Jne
zsdInzn_<8GK9*YEctaa*7+X-`{tj?`?=o!qe3eDQ;}qy%w8Ydx>5)?S#a^($|2dx9
zaN5mvs^Z4*QVSbKE{RI(^XYvqPorukI`h>~2>o}v7vh8N4{@Y->!Rb1fr#^t*N4g*
zq=YkGrX87fCe7-kMHRw>KOVRf*N^;8lTVV6Y7(lLsfT$-1x2lI*i&_cEf>v>Hd~^H
zQ##$MqJJ}bvbe<eH~D9}bNJ+qk+7|4-OjB$ilxO-m@s$<kB`j4cs$-PiShWnQgL{^
znrATLHzFr|KP#~vx4Z`v_l%l4d*JF!R3!gq)LOo}yo{Kd{7o{Jkc@|LG@YDh&}eVg
z1YRgCB;DD5z2wjibL7uD*X2(@uAEbr_%IZYQJf?2ZI_Bq`r5eW_r8a~>3w4qw6(pD
zr8`}%-Z;({zY%y>&uXx`Hxj(T<EpCh*<9WY^iHC3Ybe`u#s1;J@jXK~_PJ)mOnLql
zm;!SevNguS*A{N%W|%g8D%i*)Fgi~B1kCEl)>qKcBG7xyWHF`IDWVGX(+M+(sKxqO
zFo-{4Y1=YL=@Qj3L@CDFQcId{DVuLLzl`B|mcA^AjE-EGa$Iu$2o669Pr}RzP1$vL
z9nd#k_S!YJTE70AGK<SizoVoChm-MO6i8`0U{TiB%VnE+Y1n{cJgGqoUT~x{Iwwr{
zhZ9bM8oWZ0h@L47`7fMpiycFJXvIoIp-?m`*q3h1PD8(neKN|XOU%<roqSTfH3y)t
zt)L(*Y<F-Z%F_sZL0jjm9zTE2<R(6xKL|ill;qs$^Q8;qTe5Oe#KGPlEjvfYb5f!x
zRg#d1@6Qz8Z$}Ei22I}2f35eYw}{cvsicy_<HG_o^4YOmm=`3cC!#x7zu%p5KHlM|
z%=hETW_O38F;J^QN=p7Im1F7^S*Xa&u@)8;=hx3G6W?4L&bs~iqgp<5H<Dl+BA@va
z(={B(c=8)V9(=LLcDlxYER`;pnMj0=Gvc&*x<NO1dAqj=fq8V`>ynZp*UIO_`)l;G
z57{#s-WaWvMPNBnov76P@O_|lw)^x+cunBsG-y@PN?jt6P+Ay(;o;;qNKy=&ot+I$
z85ucRPh-hn<FLiV4b=aZ%3~OO*5BgT5-lf(xix1dk3}A`9jYql5Mj}nHeI8OgiTau
zJV>12Vl%~EsZ&OTdj_YcEg-DBS_x;#ZyitXc`a4;5~J2p?=WO*u+rTb*^)Gkdrtsm
zMnv-+B^q(r=FKe2O29U_LbGVPWtoDh@Wt2DY=Pi)U@+e(r$08nk)TdM;TON*-wWr%
znQU&FUK|7|0u}`Y>n+5hQgaZ$^=zRK@hL8Pf3QqWRUK=Rfd=MeCO6rO&jR&vo#_-T
zcXA;XLnzyp4~@oO2je(AIAG=qJRI8V{GWlTw}bCGB~>~a@vL|f5Q2V=C@A?D5tx!_
zsHiexqk8PNm>F<%)Xeeg9flzzj2GwUbQSOj3-x9TqXq$zfD)L<Wl^{J>`Mu*g?ZhD
z@;dJXQZ{=&Ho2g@dN0v)b5VsI0c6Q#ut}@6$>I5kD+WIq6o<bR<uyTq2>17;4f-Cp
zA|5{;GDaFG0GWk+z9n36*!}A9#)n7ZKuN2MfHP712uvt+5vg0ecVmE)gVgi6Px86U
z=DTZOse|jaVrFKZ&Ut;9DHx1tYJLv{{YsBHwOosxD2_fk?+SZPO!VKKiTZ9gnbA9s
zDQe>N8<olB#idqk_Fz|yAq-34d2UM7hSabD(i{;FL!rUqrfTSs=A1loQ419t<X75A
zaM|pTx41v_lPz^Yepi<?PNngqlwNV($|s}mTa9W&H2x1-kGrcpNg4t+mkOV`#Y)U2
z8W4y$4!YI%g{tkcx+~G=<AZ#Q3mIRoTt$?JYpcsOCQ+)W=JHBkWyxkkM&v=Dhyv^;
zk}#{y#rTucNkU^{VWALAl+Q;do#**tt1X2cKg%o(g1hoP%sp!ubR(8Zx7Jvuqpd9y
z?fVjF?hz8rk)23sbugXdJKHB7HEX-ts-n{-x5>j5mn<!6R0<n?d2&Erq28g%qYtPF
z!s}kGA^8#t0fFsJ_Nwzy_s2?JnuY6bI<+)03*%7)LuyPqJDWSAzHe46?^0rgRqc6T
z<E4B;cAwAc61ND3W48$O8F~5xdTWH5NOwo)xg$&%#4Ba^pTEWd=}wMxi}`Y^9W>=n
zUVbC)&CQ1puUGbQmhk4)HWlxQFx-ci$z~$fKMs7h>V^u@uF6y*a`=3GYGu}<qobGK
z+PEt*b=vLnhC}`{P>=goXB;S~5TUG*nR=^)L6rsz72c2XYS{aB*YU-8j%pp2XWs((
zRfh#jx6W&g(LINCGR|eAyVln!e)6I8^IemMvd@_Ss6y59pcXi+^fwoIzZmW+?`>*^
zgkrgc7UePnhmOy59Vp}Ebor76Vq0uFCkOXiOzC$>1R0!MYPPS|)cSAPvVj0-La~yu
z5;P>+bD(|RGb~rl-Z7Eo6zOuPMRiCDjf$qy1odQLk1h>Sr%((W<pts7et66b!V3}i
zJP_#}8wI>DaW~++8#W#R$coYMM-)wX<N2z6q6Wnsc|oY_M7q$1$35)>ZylYC2DdX+
zW*N=>zI1{D3mMtK-ly!Fcgr?D{lMQ)x1|d6n{}4iIoari%0C?o$sv1%r1@Ljzu`5;
z5j9Cs1=|quWF`}m(!x3;e28FL)38D{tM!EwOkL=S{keS~tZL;L^=C=nKS;zAUt@3;
zs#e?d&^UduITl{;=lb$%MsT9suZ9bgfG*Fk^tGYh;Y$L`RDM&#v`_pw@!=x(C+!cf
z@h+8>K52>7F#@fea9(02J@*}w(}TQWk)-35juYt|O()zE;r>6~Us(n%dDYaGh)b4#
zt7aDjr=$odpnua^O*DB{s9Nu|RuL#*DqODbjjA>Our}Mw4^nxJ_|agI=0#u#ZyG;a
zI`l>I=G(93P?YgaKF|K(!`!9QnPwCcbil(u1Nymtx>z~px%7u@{_c9aqrKznF5>jW
zhK`-1D-_6Y5ai|ShZ^#@EMg|qT85n*;VssA_Eh$M+Nq~q;~p)_GVCP}XGyf7U0a(Q
zaq9CG6q}r^S_*!BbyrV`3SaH&^UlsXPyJY!{$nv}9ptXWyE}0!(UASN?VfHF@48^{
zIQq>{LRywgx6><3mz)BPAt50QKqPlvjww=PGgYQhI~WK|zf9HKsG6H<n=~)}g-RwK
z>OEVVm6~i;XfW?0GCv$E{dXrIV#cAnHMha&Zf^SLbn$;G{32W`w$_?l?U0jR7Biz>
zyE8dJ##|&J_f3(Oz5~V1*K-V}?=Vo%-me?jZ<7i}5(+@lAYZV<X2p#dEn-n}tUWy@
zhX%^)oYOswu4rf+<?y(czJ7j4i;0THQ9bvLO2|`4{ryx{+=C5jc+@z0g$6~=5#1ec
z<&-Vuf_=`t-0#$xPKG*hk9k(5PDNPx#-&PVzrWgA9DaUKV1|iIvvH=T@c|P&3Ow$G
zE}5hDL!qSTX08!zoY_%uwvDKv<Mik@eQtcESX0=z<VeOadBIip4Ou^QqRTWsWRn#P
zGc&W`_^gT)!!k2vz1hwEL9W+p3$E<NoIkv6g@g#Tr47=XNQr)At9KX#-Hn$i<-1aq
zE-7SUy~y6d9dF>$xmTp=)#7=c&R33AZ#63vES+v&@PGc4&(9A9(g0@QO}DsMf&PIZ
zG)=fND2?PGL-FY86b2ns>#aKRg&KRq1or3S^OA4~I^l{x=Xu!M&oB3tvVmghJdlR)
zv%0ojB{$^FB~#GX6qOY(2nU@1P!nV{E7{Buv^Umay&uhG;E(sut&>XAIc&~ivyxP<
z42I-Wb2GSHFSK0c7OyU|c=l5D`_PlwH!xgWmM8jzudc3;*Voq}fI*^X7spi107$vO
z$M2034mm>D^8H-wDI+5a*;l|BODR1>i5Qs9SD~pAE$=-i=i&Lz=Y13Cb$5i~e)42L
zb6>7T70_jBBZrudf=bNe^H4D|=;cZA)iZM}FJl!$r+$W9s33#=z>l!I27CO6ryT4G
zSEwDW$BAj=$EY*)V+b8N!=(F$l&nIr=lKY<<`1tL6*%RCRaw{!QI$C`gzyf~W3^6(
ztBZY4n%IRP)>Kvjx66h0Su`0yNt&l!v%6dSJ@-E&k7&w}KE~mR>!_vPTi)?h-W8*{
zjgb{>`E+iTB{IrToy-a8ABqxBFuE#qHq+!3WJwx0kvS)x7@ppq^u$7UPus%8aznAO
z@%^1Dna_D7V(L~bfbZw55rv}%Zky%dxEH-9E0k*O#)|IaXMp4GmlrzB{|aJYSI5%1
zvsc-i!(B`uYvSB<>|6%xuY(yF{7f!$R9kpiTrU$|EICU3eV1$%1HEU27Pg*4=Ce9}
zzTv+4gId0`W1IvXCl53op+JCHuVoxq^EUhPzcX{SEKf1NJ#LvhPCczUZ~ahbEe!`>
z+gfm7*KxGjXqV*USW{HL-f&IVClNo{;NuOPxVJRZrC`pI3z^LRVeNb#@b}-#Liv0v
zn`I%Cue$st?5pnEZmaXaq{gi4bgq!U7v^`Zd|op4)1<vV$c*;Hbpj6Qa3#(!gj3$q
zcs|1-sl<8&_#@%;@$?86y(V(mbaPoh9tTsR?>&ZJOloC(Z=**j>1`G=3)*2B93l+X
z17SRNy;+i^g2*GWN#xmXBHi(P^;$)&)te?o&H&{Sib$s0^7tJH@}n*8+J<Anq62|G
zxkcOB!S*zWQs}iQ?H{iw8xj#%3Q&zZtbS^PgL0G8p7}{g!1+9`a@Eh7+4<_GA4zB9
zvHw|7)Ps3{i`y#CKux4hWVdS;>bON}Esg9R-*`1z{igs+|ByB!I}+maF-VnHVakD7
zM=M3ZuaW&RpdMs$36>reP9Bu09jJjBI4;w_{t}}2(JY!d+Yr;S+DmE4uOGQ^n$PWg
zdIQ8;=3{mI^)C5`quBwQiP5C^H4dAf!F)84Or#WKC?)cl$g&?d(kn-K!Wfz?W5&aG
z$J~~<4%T2KRGD<IH$nNPP*e<hs6=Sw!@a(rI&b&$W#m&6kk3kRc6p-y-(^?!LIpS4
z>JF}rh|nlTFFsyv;|!!_Wyz8mQf+66;z?nmq`rxp@mzYxgHmMmPv$>_&*4@11j~dp
z;8Mo^s(Hs82j9E>X$|o+a)`v{LuF=0d3}NzA;ZXa2-9LeL^mNR98hX)gALynz8lqR
zjBr+~R%$I@+;30u-IQmrd6Wzjiys(Eu`DRaANIX$$02h3TQ`#&^_@(M-G=tT-sA3U
z<gSTPw6?zzmo9ChwlpH*UKi`2{i;kj7X52`T+#KK@mS%1KCbb6%@J~`xT1n~2QUo#
z-Bq@ell?LDK1Q>Du|$zAp7C_AVO*k|hnf)u31}ZvOM0h~2pm%eh9h7d9xLwd?l5fL
z-tL~Bo>r!&AzabITwGj~Oh2+iz22IC1`60cUPO|i`P$c*r^4@_k|?h{Bv2dfz_-}<
zey^Cxeaju=z!#ov#W?)yp0#RlRr2-BG{`IsDn{bHAwzW|qTW(`(SBqdHDq&+6`|{F
zIWL>tJpOE_g+pG4?pmYEXWZ3t3DZ-5Yg}*?e7V@?`7YRp+vLuy-g+$AKgu*%|6h|%
zTYuNNhqp=1qV=j>L<+-_QDI3&d#)=FzSfXh0>vpR8fqx`>q8p6_4W0&NPAkr{b}h|
zE?Ae>{RwIlR)$veQ88HiX|1=^Gmu{uh3+j0s@kJuCQmLYuK?YS70dM^#3;%uosvaQ
z2C<v#EL*67+R=E$-j$9k=*2!dxu*8^c6QxeR$EoNHBM!V>sp<Z7B5fG{5C83h&o-m
zO4?nWv%ePCTULJy2Qv<&HA)pK@&agTqcMv|#rFFuKO3{c%Sx7dMnz9JxMX~d#kuB0
zQ(hj9<t7@)PlULqU@&^&)BY;=PpQ*^ZNBm*^Y&h7mMX)-@>FW!96mSs>{h!h6B4l`
z5e*vlqGezJUvawyv{QgbW}y;~+-xnL*6bIb(BdBZfiIN?^Pk5HU6)v=)m!<?sU;#l
z`T~TRnGg!r{DQL|XsV0VSW3!5ghsX@#vPuiGW!x!OAK!)LX2MXl<%>5!uJ%>_!v4Z
zUK&n^i*(FvWI;doLpnJ*xp#1Mb$i8S)9tcaU5wZ2Er&1`n$Pj@uDg4R*D%>CoH#ry
zX5+Fv?oA{Q7U0_z1pG#vVn`*`TRa*}x^o3zI=bG;JH2nolaLAIuXvH5iAoh4{`>?|
zzRiZ`q28hwI*~blElio?Lgut%He#b28LIH!j+Z;87N~&QB2zX`cELW_59gDsgGZ|T
z%CU`OwYJ7zKJe0bU_#K4ULRa@CBD7~vq5nAp-LsCT^}vt<$9#Vij@#V{Mr)4Nd`zr
zNIhM0czMnl>{d<qeCpEl>QVIk4usHOwybrU)Iahs+tqm&k5S^)^@7!1585fc*GXHh
zO{>Hos6`QIJ?p{uG;p?D=BW`gK>`|5j=g^c;*DxMxc_p3QDNh|k4Wj_4|jVfWut=y
zx59=kM7TzY1&r%-vbKZ%8jOFFis&oj%C}KD*H{gX=#fb#p*JZT`LTej4ulA^^uYmd
zwR?aT5Xf!4*eCA_&#!l2XY0AWh?JP-x@GhCFo-q+o{%B34=0oI%Nliu21!tJ|Eojt
z8}*o?5w)Bj1Hr(U>4MXOT%~!2RUZg6Ni21wG$9BY&LGyjT@$xH!P-K@8*LsICEJIZ
z3~n=+%SSERupMZ3))xYY4fhi!c>Y7Kkv99PogCkqRo);;1n*dZr$!T`D(Id&ANP2x
z^sSwa3SJt%<+~}%Zod_Q+WQH2V;Rt=A5;+OxiP-&9O)ZNNZ|b0@;0%>D%uBUM<lF~
zzml!n&A4Zi`F3fQG~#s7CkdmA_!I7nABeE2aX`IWT<q2ZY|W@=M=B!FTh_};(S6X4
zS8cV`{I={)o4Ry#?Gg#+R$7>$p%6j=Dd1v|>SE8Vl!oqfZ!vL3HR>KmSKyPnL7B}8
zr$yfZ!%?j&i7(1iH@iuMn$^L-CTti4+9L&<BwkZKqqTJ4#E)wK)V}^{zGkiq;Gv~i
zL6ZhI(|M<bdq+p33Jf5LPYCUAvuG{Lx6ODFNms`ML;i^#7J_(|(dKZWT+>v?4P8C|
z>yn?;@$<DVOgeJ69b5CS*oeysKdXZ}KG(8DYWc1rDaT(GWjD07c|95`4P0?fg+`c)
zhOZhHe6W1ZYJZ>KNSONIe%8bbL&vU{D_{u#U7HA?pC4PuoXczWEL}L|X{;`(0?By;
z$HHVu+t7%HSIAu*WjSb|1769*pw(P>Yxmy+-`c16?SPr`vK%GJ=h}@@{(eknoezP!
zPFIbrIz^&`++W~l{C?q#7?t!-t~K17#-+=PKzqYO8-bwjJc2o2ADyrB&`_Fnt@V1^
zTE#UsrGr4i*^ZTT7Uv_9U)EMiQ+a-a{QoQi6JspFLzEmvoc9{3bl_sKivE!jUw^u*
z>Q_+N3>&}`=!E^nLIM4L5QVZ#CjkkhWH7l%Wed~c?(u@UTZXabu`=XGnm*q}D)hPf
zkRkmDzx%0IkgAZ}@q0ag!yXdlnm0QdSK1>6v7&$^cBzbq*emDxD^iCRfag(U$u5z;
zfR4U`S*ggM6=e&b+wj)LSXgdsDw*f*Gu1ca8}l!ehx<6)Q>$8IQF4Ku=z1JIg%$Y2
z-B;yncwGgPzNuPueurgtQ-$-JvH6W@@~3yO_?u4zZPGz)Ikq|yi4Ptjx|J7lSSqF%
z5_Ya8=Q+PRQHmOD74G%VB9iZk-EjS=6f6#+YRn7ssJ^Zl3p}2BKTo*OU2dY?`i@@>
z_=ZS!d!Uh9qmfbKZ+A;i54!3oS;22K!8(wUPr+hiuEBK5x$EFcWakkHmbKiL?3FL!
zGC{fMR!yn=X>rnD*9g6ek+B^wK_DEC|NbMO-Hu0G`k@oe5NoYduyI+8<aEuCN#Lo{
zY{nmasYr6gqIY>3>>Db|?}PmhRu_uFpgISzc0pj+N~ylEW)-F^$X^{2<T41?F?erj
z*`>!DVZOkc>)`b2rw;*!SZp+xOl_Hr4tC>^>r}1SJ`WFhabPhL8mX<I@Rg!sUsao$
z>olgTs42VX=ioU3xEZn#zXWm{?|qjZhWT&rf@%2Sj&u)Xbj?5%95`s`J{TDPz$t$o
zYGJH#WbN`!@B9%dDS{3^qKsW=wU=|cbP&kiA~+R0ixTd$)DEoEb)tS)WTj^wSHY^S
zx8-F^<Pgx53j?X;h2~;lpbC|=XvN!UvC&czr6%QYuq84ME|KKPX$8R$FT-SQ1RX25
z!xrWQOyH8CUkscaJP5TcFo)Bi#^j>VzI|D$+D~e8Jjn`UVO2ptp=>(~7r|M+h)mq0
zb$MK{iB~~!>@w7IYHy(*U3dGmYQ^iK(`|n{1C`CtKoSD}2(g!v)b;jhY!P?OA`c8i
z7#Q2jU<ZP%#{?}5|2KKi!maf*MJ6YgV4kIU-HJ3BqZ%%U3Wf_e2G`2yf&uA`fl0WX
z{)dWJlXC~^;U@z^^cxyQdfjC_>kplz#{=CJa%N<qF$b0`-+oUhO=~CWxM^4)Tx3Eu
zGg20&e=B}JJD0?=Z}<`6e}>Sdk8qUkKK8&><ri!-oq47a;cuH~akW!&f)R=pP@GWr
zt2V@_<4HG1Pzp+|E<!(YQYB=*zmgq44}~RO%OD^dF#U@n&c!sb!MDA4n(x+vneZEL
zO2ZP28w2ZO+#Mhmjeok`>^Z@x>pUpu)S{Qv3_G8>`LcJ@KC$>~g*Il9Pd<tEn#R-7
zx1fvUJU|CMvq6_02r8Cb)T6w$3dK?($pr1>&-b&!Ot$Kw`%QWfc4CCX$os_%1Rdwj
zYa(WlXK$L${Sbj*{)2-D@;pJA8GQPMCbPptNf+AHL{ZzJf9kDe-l^cl+vn{myEoZi
zE%sr^hDg4OYT`9AvM~!ba&Tq%%+xBdgFqC}SEoTxci{mXg=rR*$3EY{6eRr2x%lOI
zP0CLj%cemfWTC)zmxw%&2qj3FtrLvk^5lihQTT3SwW%souJf?MGnx8>LrvIvs_9=6
z8&LbG2qptKDAeEQc9OA0FF~Sb5=Z{35A-TcyN;gSIlEmFXIQI%68)#)IoWMa`;S{z
z{HB#bxl!86xrV0)C-)I1b`9;+8!e;Y)IV$qtV%=9A9_GHI2<~N>=_6mUE{>bXP;Ug
zPm^=Jb(})646I<#%Tbau@cV<Gg~@|0(y{<|0K!Q?pM0R1f?rO;&?4wTA@`?cx=g$L
z?O8kT_J^?W@bxV8-8S7D8#<gmZ*FlNuLQntM6hshXj54{YFp)M)k`+>)b3!}<{CHF
zH~!xWuppXKi}G8OYfUBTrwTk$@VP`9%W@29LM%X_q-@dC=uKo>PJ8kELpMg75YSPk
zUm(?=9N~+V6R%`S=REA6^Uuw2vGb7K&DqyZfB!_F@tF2XY&7@eTsM}2sV+XfzOG3Z
zDX#Zch*T&gQTL{+l!xD`Hrww)x}VHbDNJSat;9r9C#MDw*;X2S;-|eLkRE-@5AJ{Q
z8|yBQdnkY7tJjIjM~8^if7}nb+wu05o>Rs#;*}Kv!GIX~O)ZBJGjzOA+F1ar%Qcx0
zaY(wOA^b%JTK=x(hO=<JP??3*TMrBbnLg$4_4V~6ARzddFH;H<tF~OKimNl3h?mP?
zm-W5e<h$7$Nk~+t4$9{9{F^{7J4?Xp_Osr8SL}K?jwCI|_an{4-o9shd)q=7iq-qY
z>3AZ8V`O^zXQgJPSO$k}@$SwJG!_=tdYh}Q%X+i@c&*`(h~;7h%6~j>GM&Qb=H^a6
zT5X_daynMrXtZ8Cy1KGc@bKW?{`b$1$LXkhGLuV5evVbU%_ZvXdWc|viHRw&pnwDs
z7IueRF0&M?i#%c3|A@4#BoOmMsn3P{vq|4$9`1NPec^85cK6YypERNFAv0k!%u{37
zI$WTJT+L29$QSQ<a%Xwdd=lB~6cY|~x_`fRPKK--$rK3&LIwTvnOBm$8m5ZB!26*Z
zGKsL=v0gbwsg*4uBSQ*_gm0xNnlBoG=`~j@``q6zV(@se;iRdFa|4u@h)GCB5WcgA
zo}b_Ti5?srL?x9p(9_e~etW)E77>A1Xs}eHCnEaI>Gf1IJT@l%O9M0NJBQrE2t|%v
zUoe7U0kGY-P_8D;ii6<b=!k}jh8A|d)-;!$o&7XZAZ~V8rQ_s+p|&HjtocVWf!wRh
z=RLKwl+pnxz#62<Kne;9u3v5U8H0g=y-nu{uK={&ZGC>IWu~QtB{S&R)H2cEK2ka=
zbun4qS|*a2P=^J1GDtn>#T3?>S5ZeI2P)w*kvP8YJ-K>&8#AjZ@wcW{c)1-*!%u$!
zM{oZ7D{_0eh32L^n*=>#=~L$TP9_K*;U6a=896!X;OFez9NdK{dBXPX;Z*s<`C7?J
zn`<===wE*r8hO^HA4-8b$PbUl1@J69t{3Rnyhw&a(F-9VArT-DxARFq7dQ9L!9>O_
zP=0z^t~LDHX?wofuqz}i+z<eTWP%K8W@+^=1CnvLkGnbf6^%NR<JCs%u^`}m&S+ua
zyL-5Jc=iAbzjAPJY(HJ^vfmf+#3z%yjEn83j?XaCYC1lpH?MH4o%L3W&XS|C!8ZvA
zSUWb#VJl-2gH!I(s-K8sT@nVaQk^sr_z9|m`?Z6C>OJ^SUdHeTssc)uzb1$si`|_n
z<UIsfELND?pSXRzxx<3G$0$2o?@tz(1a$GNb}t{ywvw&ao0+`#M=?&e`$CM|+&Is7
zcKl!OPlvL(ouwc^R{(AP(ae;Tp-CgU2>FWUx=fM!eZfqC>@u60n*Qm{X>mFpz&q~^
zqs#!DgaOL4LPA1!TwLzgsetB_X63}US+6zX3PCdt{wn`jim!sVT<qddzO{d@jpxpz
zCL2|4v%+yZd~P;Vv0+*ihAq|GUk*-)&8S;(r|{vAMMph{t(aDl(dQn~?_H`<R{-0w
zK^u8ZJIs)F!^|hlYOhdgHq?W*dU+^k0)C<j?^*>EI1B(;kVkxnf`U3bOmTfboXRQj
z`FJ*MwOIJ`1gr-gBoF|J0K2hPs!)(xrc@%;^>!m|?&87@3j;F~F<@XeUy9<p-TRfA
zkkIdPIuD!x4^K7{k4IIrT9>b9YHF&GsI9ftKut~U2R(iCa*cjKv-w=Hm%4gtvi`TD
zwWjK~-XJ*TYTb@yKo!C*mZ~OvA687Yy}Wo&cswfB=VrisBVs2OnSdT!iHfbnuIkxX
zwZF<ifi4seWL%!Efp;EGsK(@skl?E#b@^Yh+sKpP^hRAR;HeP*`b96m;8#KfhF~u1
zqkQofnC2)AAa|NvvQjzR!E*Io`p-pcF<*Lmc)*S&)6GTU^J%iXUHv*3O`=K6Gm0%K
zDXFnsqL&3MyTR+(VI&-brd~K0Uf(@CBpQ#08Zcj&FcJSxpv0Awkbt&m#%ea6$~1er
zq_+@2a(<>iR2+5E+{|h4@%9X4(^w~B(YP8l4CK&XWO9D!csyO{$8BfPpu@I%+*?>L
zRq3?Appi>V0p5^NrM|AN4+vL_kWf%5ZNFyWiy$kPeKl@QpWA}gi$QrL5R$=0wpplJ
zgDK`hEokXrZG%~>hVc5JtGMgVkSw9e5O{*ZEcO@2Xw0N%Kkd-HJ)_=P7BWdxexJ8g
zTt1I_tGB1?m+J&_rbnAiQpxzh?rs5Hz_fwqc8=#t0=v5SFHcUw!EP+vj7?2R(qvQ_
ziE-SXu87iNWBqV(aqWOG0RgOB57?B+=5-f(d3}YuxVQ)si+Or-+gYhI4FrlX2so_B
zdty{n`GC(`URD>%`zas*951cO_re4T9Mz$R(a6H|@oHPOUR(R^e_eLENp@T4p+PND
zwM`ZS5u<=9Ir?O|=d(_}HzsQR2Q2FYV>y}`(V^>cK-y6RRuecE<E|_eNBq{G_CXq5
zS^U1)%+S_AFxl^=wUGv@1+2gQ{yNB}l$t`jYLcR>_`Z&Js9Q`<vdmI2uuN<eGoy$}
zJEF0D4jf`8waqLEv|y%#m5}4%X=JB@L(77JYHNAaPRXO{PY>mHPM#NOAN_$MrYMKa
zdajqZ_p`_SNyoj1Bf!R4Dy3q^`_m-`r%T$gUcj4=s8(ufc+qR2&JGlmw;feth?-A7
zn?!D)@^e^!FX{bVsQ?_&%+x6<LV3(iYm>G2n`W1}zD-)|l=N&;v*N=LGK0iThj_mA
zzv2U(m}l#_i)`7X8xt7<N#}6ZSSorjiMF{EG6WdCPOFGJptyQo*Fo)?4HiOfZq4L;
ze0YGnCI*82_VevwKM=-9<>cg^)@-{L;z(hDtJLLu)jvW(flm<m-+!Veg#LGUNO5%U
z(a#9a$;tWe=*X;hWd*yWsEDXRGs?igAhag{s;h22R)ZQ3md{J$@AIBDQ{U|`?(n?2
zFTqrQbm}g<%1_<3s1v7@CizS9Ra~d;zQF5%{ma{Sj6R%MyfVD;;0qpGx`f8M77AjN
zOOH)TN?MY2#N9hMz{D}pz7;l0rqv+EX0u47WoC|_KfN`{`40h<mzUe!>`5vBc7dU*
zt1G0ejIL22hCoC?0oBsh1_y<Ri|l;5D67vnf8@l?YBo*y-0A%)G899Ab$>J?jE{vS
z4)OIX6#zHH#KpxE@VK3RLi_t+(y9|mN=gzPh*L<`sdp-e2!I(byKn1p+YOSx5=84_
zv1E?;;Rc|<k8W_aIf|lU3ve-Bra&xo%$feIGm+h^xK;D}#RTH~0@|Zom~XGCoWJNI
zzT2954mU?zE}bt)KtVx47ghMTyGydSw@2tt*gY^Hno9}$x2`Tu9XN`q?+37(vnn^8
zypJ)o*jz>d-N@LODA_pAcr3YHw_1FuT8{z`53e>W3$IG6sS25ZK>pjqW;6l692X#(
z|L~b~3Vjj?#L2_6tJCH(!<~hwOpP%*JWP2cfQO5FHJQa@o@}fjCI&e=F(LY&;E!es
z=CgQQhyVTir3y3(0n)tzkb^P+zrw@9h+19FM*-23qK4I|(*_3yLKqZPXEd^QTu*Jp
z6m_n+Yrar2O;EZs04Xv`Z(a*}8|-Imsmz>jN+K*<K6ZfZhJlK!d?f|f9^5x}kihR!
zd_HgTiyuCX8>VQ7VL4<)i6?wOs~unyhgI4wLhKgvp{5#`iBe?w<~YD&!~n^KMn;mt
z`1_@%e(Rl^Q<}DDkSLfDFfk#Ql9LmY1zHy|!qpJbSY?HO{UXfC$cV7n>XgNiza&G2
z`6k9p<hzH@X_tF;vQS=_k?|dJ__k&S=9^o#`1{L!!d8c8N`-1=Iw<xKpWR_kV&%VJ
z3$)4~{yw}Fqv|8)0Qq%v@ZmEVK-~W80spI1BDW40ixU}2S~yzn?L>YA>R`zsa)IxY
z(xT_S%|Jmv>n=6zH_#=z2ys_tmLry$avYfQuRQEwcROOG`^ynzP#!i&oAqOD#qv`k
z<MA_QC&QV4j=G^rxtLe85|hvCDfVo>4DAv4F-79IU%(b6ZJdN%6b>5%0|P@IV3h4r
zMM7*A^WDs8@HlK1{RyNnKwHZnwd&~0%?{~!MJh<pFoEDzrB>7Mf6ls6mBiNe(!b>j
z!0z|xqCGG7rvv{*Fb`ng!N<o3fD~V5ilnisY_~dTg`i0xzW5W`Rcbao0pTcLKIexv
z0BKkL^VK)|V~IeNn=X+jcmN!Rh?NyB3NG$AfbW`qAtVDF|IdP^QkjHtFfqw5W(q>q
z{|Oo(0Hw~*10N}AU!_xx`nwp%_+q6`%cjdw@H*JbqhiXMkBaq;+u(AatmoO)+hh*v
zM5aQ{VQ|kop>NgV(9e6`@l9#RCuigInVj1E6|Pv5$ty`pN=iUO(1O%HkqHoSBAGb3
zI(}@0lVMRsGW};t>BV%51*+u>!!^FebH}lS(1ZyiSA&OTJ4A@AhEE$|XcUyuyRe9x
zo!*|4lam*fml%fd+iLjXm}gvc-uegZLC7Azk+U<}OnWTrK2$|EdHze976S@hxEG8f
zf8m4vviQV--M`&gn)-~?TWMFX+d{R+tVg;FPQKiy-Yw1*dc6w#Rtv!d-y}?uR4ug5
zGPZ}P7;aAyVj;A}*%Kt5p^nV0;ZZweProdVG1SqaHgTa3Xz8k`-WAcTq7<FVM_Gt?
zfRr2^CriHMp;+<0(9`T*=F}tp6&U@KH$Ah%)n}cjd5?L!1BgQI5f>9MGF4FU1mC-U
zl_?X;;Lb{On2!G{W^Qvh?zZE#;`Whkci3_d)0|2b;X+fiZ_k}UQu?SpGMEuBd-6gS
z0v{f$qP1Og{PyJu5wsrJA>F>NlQw`B@)tZ{HOaiTb9OM<d?2DP?t6?#ohxq2V9CKZ
z<M#>Ij6XZ|qBCt9m4*MvU$++S6Uhi7r{SBwLX0i2@}Yq+Myu9aCBe^Ek9V7cKlZn>
z%6V@&e<WpgyqDgscPtBO_<~J~g$qXf{m@R-NnU!Ydj1z^(p{^er<aY~`PJ_?`AQPi
z7vIgw0+;8dzGZ4TMB@j$diILuXZAChQDmfBN&qR#6hYE`zDF9Z9cvljA7V-SlGU>a
z^q}u&7a>LP4ttJ5(XfKM(0RIk`K=Bm;^VUa7OY<#dtE1nFzd}%NFkjZ0I8R~?FQz<
z=M6Mbdj_)3#af@JgcX=|_w(*|lD&?;aF4|!>Hb&pGZY_{JLSYNR7ZsVgTt_Fsp4v?
zDvvh-(PX1(Bg_a-daThaY?a~lIzYB#!^&p5reJPl@JDjOF&7%stzEHb@)?t0?P@a!
zLPNS}I?~Q+Z7$5TYyi}+1`@#L$dsc4uh(Z8toyM*W8(rKfviL^+D&IgkEa*mI1R5<
zKhQa*8Lv`<si>5~&{J$FdivM13}00!2?so}QqN5acO@be(d}(M9?KNjH~vLA-R0eU
zVh1uU;RcsP49|Mi<8OEGzi)RCLg=OLXl>ge!6T=gQSBt;o<@ZRX8E0{gH$0N2#}6~
zz^cIoItLN{q}P#fn9hUVQ#%D|72;>(F_3|<hhZO~+kP6zN~dP6_!SLAW)OK$_Nm>1
zrp8Vgp$vPDkZYnBzNb8>7V1|78vb$F*Y2`7TwP4>v})GqEU@AF7cUnDA~xx_{(;!M
zqr(A1NTI8N?p}**N4laVBN+cM^vKP8N5p8mkF$HRqS=7RW>ptYPu>8?44yENVDvW#
z1=_=FR_1jh5v8I@YvIx2Jvla8mUh)g2G!@8KI?Gn)q*Y<3=c&=HSzwiX=?Nel?FaZ
zs1DV=T<|vpj6P_3a5<7LE;Ohb%6VxLe<dejI^W^;r}Pe5*&@qPiEzr%^c8eiu)gfX
zJ;)V#P{{$Qs#6(f>9tdlsvgx3StvXP3SJ;oI{@W@-R021(cS<n@_Rr0XT5Nc(;>IR
zd8gTb3kow|DYj99W@hYWZa7+ral71<n!=_jLtoYMOTSO1DxB5uo1qn;<mct8ezaYw
ziGFx3BWWJQ=+8QJapL8AO$nGijmgon%OGYCH>+oKy!?ZTCH*;FhMtZP+km5)g?dfH
zv=_3w$Ds`5B~8LK6X~(~RtwBdht%Pb+8CO(uZ2cyBjr2og<>WE6+($n!dtECFClV8
z(^jE6Z49g6!Hfb|akbmsa`gKp@E2tn%zH|UfG7P5-)#WaPgW!H7Z>UKXDg-d{i|e{
zqG9$umM9Fgyb^)7Wu3t>0XM)+CTKdMJ?m_EQBXNpnz7nL>0rf+2j%TCh3x2j*6DcG
z(EJ%4RFS-HAv$$(G|0Ooae1x%e-uc;ltdTcYWz+M@Js#TH!!I5cQNYJtgkqF@8sF7
zbM9VC*$bBG@DO~K<x?blyg2lWkQqnuF!m_8rBu=XeF6Kmk;DRyv<p*zer|bKK!Fth
z+T8Pe_BufQRng^eRPoPDI=`GW`=_(u_!XqI@9*hJL&D<H4^7V|*6MBH5=FC1Xwe}Q
z9pr;PP$(qmXARyzv+94QUNcIM_uaoTLUy6Q^;OL+9gBA1#}|KJFNRmrh(u)syH)WG
z>PYzC`aMJbzm7Xhs6poq_{AD-hPWeib;>)xug2=jWDGi6F=z2U%9FC0j)5H)9F7!T
zyMu@j2-OP|_p5pjeJX2UBx!JxglNEVHDjDH%}h`TS!b5B;E)A|x*^jMl*H}WHlz<G
zF>=x+X_sAoI{fR!Teaec2hX+37GYrx=^HJKz73HnnjS?K9a1153S0sg;6ZbxPsbd*
zyU0!B&u6HP4U4$<pXcV&d>x{?4t_5g@JQyc&kzDNff15EE+hG-Y;Gtkq5~8_ExY0`
zNO#xZ6J3ocg%Gv<1#kPMtapB8V;#6;tTy;f_$g>D|Em^~gkGWXcjI?a5HuNy-@s_H
z%IN=za}=iZp0F%S`3E*Qb)rs7TuzG1#@?r)*5V@@%Gks(r^Di4$E=pZ)cD^IRCth8
zK1wa&?sBk$0`2>Mg4l<5pZ<mvAJV&$gyKK73Q4^rgG+nm@0Wb}B>newexX;x7lu;^
z1v%b(uf##+|K34=oh;A-Ctfu4GZ_?$AM5930L?PN>+dZ5-u?lSQOI6*YpkehiC<Ep
zs&{}Ff*xs$#1Hmk9Ib-?b+8Z@ghH!ux-sTldxw-Im8Sw)_YWCc@aN=jW0trH8U)L=
z7GS<j8fr{>{hy%A1vs6)pHe{gYl33GfGH(WG<wBq_|g!6B7J154kCU)ihzJiQdCZ)
JO6ZsW{{!A)PU-*v

literal 0
HcmV?d00001


From 90def06035668596dc1327cb0ee6cda9a72a349a Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 6 Jun 2023 20:13:08 +0100
Subject: [PATCH 240/828] Added print and summary methods for epichains class

---
 R/epichains.R | 152 ++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 152 insertions(+)
 create mode 100644 R/epichains.R

diff --git a/R/epichains.R b/R/epichains.R
new file mode 100644
index 00000000..62920f32
--- /dev/null
+++ b/R/epichains.R
@@ -0,0 +1,152 @@
+print.epichains <- function(x, ...) {
+  format(x, ...)
+}
+
+#' Format method for epichains class
+#'
+#' @param x epichains object
+#' @param ... further arguments passed to or from other methods
+#' @importFrom tibble as_tibble
+#' @return Invisibly returns an [`epichains`]. Called for printing side-effects.
+#' @export
+#'
+#' @examples
+format.epichains <- function(x, ...) {
+  chain_info <- summary(x)
+  if (attributes(x)$chain_type == "chains_tree") {
+    cat("head starting from first known ancestor \n")
+    print(tibble::as_tibble(head(subset(x, !is.na(ancestor)))))
+    cat("--- \n")
+    print(tail(tibble::as_tibble(x)))
+    writeLines(
+      c(
+        sprintf("`epichains` `chains_tree` object"),
+        sprintf("Chains simulated: %s", chain_info[["chains"]]),
+        sprintf(
+          "Unique number of ancestors: %s",
+          chain_info[["unique_ancestors"]]
+        ),
+        sprintf(
+          "Unique number of generations: %s", chain_info[["unique_generations"]]
+        )
+      )
+    )
+    writeLines(sprintf("Use View(<object_name>) to view the full output."))
+    invisible(x)
+  } else if (attributes(x)$chain_type == "chains_vec") {
+    cat(sprintf("epichains object \n"))
+    print(as.vector(x))
+    cat(sprintf("Number of chains simulated: %s",
+                chain_info[["unique_chains"]]
+                )
+        )
+    writeLines(
+      c(
+        cat("\n Simulated chain stats: \n"),
+        sprintf("Max: %s", chain_info[["max_chain_stat"]]),
+        sprintf("Min: %s", chain_info[["min_chain_stat"]])
+      )
+    )
+  }
+}
+
+
+
+#' Summary method for epichains class
+#'
+#' @param object epichains object
+#' @param ... further arguments passed to or from other methods
+#'
+#' @return data frame of information
+#' @export
+#'
+#' @examples
+summary.epichains <- function(x, ...) {
+  if (attributes(x)$chain_type == "chains_tree") {
+    is_epichains(x)
+
+    chains_ran <- length(x$n)
+
+    max_time <- max(x$time)
+
+    n_unique_ancestors <- length(
+      unique(x$ancestor[!is.na(x$ancestor)])
+    )
+
+    num_generations <- length(unique(x$generations))
+
+    # out of summary
+    res <- list(
+      unique_chains = chains_ran,
+      max_time = max_time,
+      unique_ancestors = n_unique_ancestors,
+      unique_generations = n_unique_ancestors,
+      num_generations = num_generations
+      # WIP
+    )
+  } else if (attributes(x)$chain_type == "chains_vec") {
+    chains_ran <- length(x)
+    max_chain_stat <- max(!is.infinite(x))
+    min_chain_stat <- min(!is.infinite(x))
+
+    res <- list(
+      unique_chains = chains_ran,
+      max_chain_stat = max_chain_stat,
+      min_chain_stat = min_chain_stat
+    )
+  }
+
+  return(res)
+}
+
+#' Checks whether the object is an `epichains`
+#'
+#' @param x An R object
+#'
+#' @return logical, `TRUE` if the object is an `epichains` and `FALSE`
+#' otherwise
+#' @export
+#'
+#' @examples
+is_epichains <- function(x) {
+  inherits(x, "epichains")
+}
+
+#' `epichains` class validator
+#'
+#' @param x An `epichains` object
+#'
+#' @return Checks if an object is of class `epichains` and if so
+#' checks that it's in the right format as a "data.frame" or vector.
+validate_epichains <- function(x) {
+  if (!is_epichains(x)) {
+    stop("Object must have an epichains class")
+  }
+
+  # check for class invariants
+
+  if (attributes(x)$is_tree) {
+    stopifnot(
+      "object does not contain the correct columns" =
+        c("n", "id", "ancestor", "generation", "time") %in%
+          colnames(x),
+      "column `n` must be a numeric" =
+        is.numeric(x$n),
+      "column `id` must be a numeric" =
+        is.numeric(x$id),
+      "column `ancestor` must be a numeric" =
+        is.numeric(x$ancestor),
+      "column `generation` must be a numeric" =
+        is.numeric(x$generation),
+      "column `time` must be a numeric" =
+        is.numeric(x$time)
+    )
+  } else {
+    stopifnot(
+      "object must be a numeric vector" =
+        is.numeric(x)
+    )
+  }
+
+  invisible(x)
+}

From 67573301ea50a12887c51e9c9225118b0913efe6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 6 Jun 2023 20:14:47 +0100
Subject: [PATCH 241/828] Added separate simulators for transmission chain
 trees and transmission chain vectors

---
 R/simulate.r | 351 +++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 351 insertions(+)

diff --git a/R/simulate.r b/R/simulate.r
index ef9fec91..516be648 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -244,3 +244,354 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
     return(stat_track)
   }
 }
+
+#' Simulate tree of infections
+#'
+#' @param nchains number of chains to simulate
+#' @param offspring_sampler Offspring distribution: a character string
+#' corresponding to the R distribution function (e.g., "pois" for Poisson,
+#' where \code{\link{rpois}} is the R function to generate Poisson random
+#' numbers)
+#' @param chain_statistic String; Statistic to calculate. Can be one of:
+#' \itemize{
+#'   \item "size": the total number of offspring.
+#'   \item "length": the total number of ancestors.
+#' }
+#' @param infinite A size or length above which the simulation results
+#' should be set to `Inf`. Defaults to `Inf`, resulting in no results
+#' ever set to `Inf`
+#' @param serials_sampler The serial interval generator function; the name of a
+#' user-defined named or anonymous function with only one argument `n`,
+#' representing the number of serial intervals to generate.
+#' @param t0 Start time (if serial interval is given); either a single value
+#' or a vector of same length as `nchains` (number of simulations) with
+#' initial times. Defaults to 0.
+#' @param tf End time (if serial interval is given).
+#' @param ... Parameters of the offspring distribution as required by R.
+#' @return an `epichains` object, which is basically a `data.frame` with
+#' columns `chain_id` (chain ID), `sim_id` (a unique ID within each simulation
+#' for each individual element of the chain), `ancestor`
+#' (the ID of the ancestor of each element), `generation`, and
+#' `time` (of infection)
+#' @author James M. Azam, Sebastian Funk
+#' @export
+#' @details
+#' `sim_chain_tree()` simulates a branching process of the form:
+#' WIP
+#' # The serial interval (`serials_sampler`):
+#'
+#' ## Assumptions/disambiguation
+#'
+#' In epidemiology, the generation interval is the duration between successive
+#' infectious events in a chain of transmission. Similarly, the serial
+#' interval is the duration between observed symptom onset times between
+#' successive cases in a transmission chain. The generation interval is
+#' often hard to observe because exact times of infection are hard to
+#' measure hence, the serial interval is often used instead . Here, we
+#' use the serial interval to represent what would normally be called the
+#' generation interval, that is, the time between successive cases.
+#'
+#' See References below for some literature on the subject.
+#'
+#' ## Specifying `serials_sampler` in `sim_chain_tree()`
+#'
+#' `serials_sampler` must be specified as a named or
+#' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R) # nolint
+#' with one argument.
+#'
+#' For example, assuming we want to specify the serial interval
+#' generator as a random log-normally distributed variable with
+#' `meanlog = 0.58` and `sdlog = 1.58`, we could define a named function,
+#' let's call it "serial_interval", with only one argument representing the
+#' number of serial intervals to sample:
+#' \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
+#' and assign the name of the function to `serials_sampler` in
+#' `sim_chain_tree()` like so
+#' \code{sim_chain_tree(..., serials_sampler = serial_interval)},
+#' where `...` are the other arguments to `sim_chain_tree()`.
+#'
+#' Alternatively, we could assign an anonymous function to `serials_sampler`
+#' in the `sim_chain_tree()` call like so
+#' \code{sim_chain_tree(..., serials_sampler = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
+#' where `...` are the other arguments to `sim_chain_tree()`.
+#' @seealso [sim_chain_vec()] for simulating transmission chains as a vector
+#' @examples
+#' set.seed(123)
+#' chains <- sim_chain_tree(nchains = 10, serials_sampler = function(x) 3,
+#' offspring = "pois", lambda = 2, infinite = 10)
+#' chains
+#' \references{Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
+#' between serial interval, infectiousness profile and generation time.
+#' J R Soc Interface. 2021 Jan;18(174):20200756.
+#' doi: 10.1098/rsif.2020.0756. Epub 2021 Jan 6.
+#' PMID: 33402022; PMCID: PMC7879757.
+#' }
+#'
+#' \references{Fine PE. The interval between successive cases of an
+#' infectious disease. Am J Epidemiol. 2003 Dec 1;158(11):1039-47.
+#' doi: 10.1093/aje/kwg251. PMID: 14630599.
+#' }
+sim_chain_tree <- function(nchains, offspring_sampler,
+                           chain_statistic = c("size", "length"),
+                           infinite = Inf, serials_sampler, t0 = 0,
+                           tf = Inf, ...) {
+  chain_statistic <- match.arg(chain_statistic)
+
+  check_nchains_valid(nchains = nchains)
+
+  # check that offspring is properly specified
+  check_offspring_valid(offspring_sampler)
+
+  # check that offspring function exists in base R
+  roffspring_name <- paste0("r", offspring_sampler)
+  check_offspring_func_valid(roffspring_name)
+
+  if (!missing(serials_sampler)) {
+    check_serial_valid(serials_sampler)
+  } else if (!missing(tf)) {
+    stop("If `tf` is specified, `serials_sampler` must be specified too.")
+  }
+
+  # Initialisations
+  stat_track <- rep(1, nchains) # track length or size (depending on `chain_statistic`) #nolint
+  n_offspring <- rep(1, nchains) # current number of offspring
+  sim <- seq_len(nchains) # track chains that are still being simulated
+  ancestor_ids <- rep(1, nchains) # all chains start in generation 1
+
+  # initialise data frame to hold the transmission trees
+  generation <- 1L
+  tdf <- data.frame(
+    n = seq_len(nchains),
+    id = 1L,
+    ancestor = NA_integer_,
+    generation = generation
+  )
+
+  if (!missing(serials_sampler)) {
+    tdf$time <- t0
+    times <- tdf$time
+  }
+
+  # next, simulate n chains
+  while (length(sim) > 0) {
+    # simulate next generation
+    next_gen <- get(roffspring_name)(n = sum(n_offspring[sim]), ...)
+    if (any(next_gen %% 1 > 0)) {
+      stop("Offspring distribution must return integers")
+    }
+
+    # record indices corresponding to the number of offspring
+    indices <- rep(sim, n_offspring[sim])
+
+    # initialise placeholder for the number of offspring
+    n_offspring <- rep(0, nchains)
+    # assign offspring sum to indices still being simulated
+    n_offspring[sim] <- tapply(next_gen, indices, sum)
+
+    # track size/length
+    stat_track <- update_chain_stat(stat_type = chain_statistic,
+                                    stat_latest = stat_track,
+                                    n_offspring = n_offspring)
+
+    # record times/ancestors
+    if (sum(n_offspring[sim]) > 0) {
+      ancestors <- rep(ancestor_ids, next_gen)
+      current_max_id <- unname(tapply(ancestor_ids, indices, max))
+      indices <- rep(sim, n_offspring[sim])
+
+      # create new ids
+      ids <- rep(current_max_id, n_offspring[sim]) +
+        unlist(lapply(n_offspring[sim], seq_len))
+
+      # increment the generation
+      generation <- generation + 1L
+
+      # store new simulation results
+      new_df <-
+        data.frame(
+          n = indices,
+          id = ids,
+          ancestor = ancestors,
+          generation = generation
+        )
+
+      # if a serial interval model/function was specified, use it
+      # to generate serial intervals for the cases
+      if (!missing(serials_sampler)) {
+        times <- rep(times, next_gen) + serials_sampler(sum(n_offspring))
+        current_min_time <- unname(tapply(times, indices, min))
+        new_df$time <- times
+      }
+      tdf <- rbind(tdf, new_df)
+    }
+
+    ## only continue to simulate chains that have offspring and aren't of
+    ## infinite size/length
+    sim <- which(n_offspring > 0 & stat_track < infinite)
+    if (length(sim) > 0) {
+      if (!missing(serials_sampler)) {
+        ## only continue to simulate chains that don't go beyond tf
+        sim <- intersect(sim, unique(indices)[current_min_time < tf])
+      }
+      if (!missing(serials_sampler)) {
+          times <- times[indices %in% sim]
+          }
+        ancestor_ids <- ids[indices %in% sim]
+    }
+    }
+
+  if (!missing(tf)) {
+    tdf <- tdf[tdf$time < tf, ]
+  }
+
+  structure(
+    tdf,
+    chain_type = "chains_tree",
+    chains = nchains,
+    rownames = NULL,
+    class = c("epichains", "tbl", "data.frame")
+  )
+}
+
+
+
+#' Simulate transmission chains without tree (as a vector)
+#'
+#' @inheritParams sim_chain_tree
+#'
+#' @examples #' sim_chain_vect(n = 10, offspring_sampler = "pois", lambda = 2,
+#' infinite = 10)
+sim_chain_vect <- function(nchains, offspring_sampler,
+                           chain_statistic = c("size", "length"),
+                           infinite = Inf, ...) {
+  chain_statistic <- match.arg(chain_statistic)
+
+  check_nchains_valid(nchains = nchains)
+
+  # check that offspring is properly specified
+  check_offspring_valid(offspring_sampler)
+
+  # check that offspring function exists in base R
+  roffspring_name <- paste0("r", offspring_sampler)
+  check_offspring_func_valid(roffspring_name)
+
+  # Initialisations
+  stat_track <- rep(1, nchains) ## track length or size (depending on `stat`)
+  n_offspring <- rep(1, nchains) ## current number of offspring
+  sim <- seq_len(nchains) ## track chains that are still being simulated
+
+  ## next, simulate nchains chains
+  while (length(sim) > 0) {
+    ## simulate next generation
+    next_gen <- get(roffspring_name)(n = sum(n_offspring[sim]), ...)
+    if (any(next_gen %% 1 > 0)) {
+      stop("Offspring distribution must return integers")
+    }
+
+    ## record indices corresponding to the number of offspring
+    indices <- rep(sim, n_offspring[sim])
+
+    ## initialise number of offspring
+    n_offspring <- rep(0, nchains)
+    ## assign offspring sum to indices still being simulated
+    n_offspring[sim] <- tapply(next_gen, indices, sum)
+
+    # track size/length
+    stat_track <- update_chain_stat(stat_type = chain_statistic,
+                                    stat_latest = stat_track,
+                                    n_offspring = n_offspring
+                                    )
+
+    ## only continue to simulate chains that offspring and aren't of
+    ## infinite size/length
+    sim <- which(n_offspring > 0 & stat_track < infinite)
+  }
+
+  stat_track[stat_track >= infinite] <- Inf
+
+  structure(
+    stat_track,
+    chain_type = "chains_vec",
+    chains = nchains,
+    class = c("epichains", class(stat_track))
+  )
+}
+
+
+#' Check if offspring argument is specified as a character string
+#'
+#' @param offspring
+#'
+#' @return
+#' @export
+#' @keywords internal
+#' @examples
+check_offspring_valid <- function(offspring) {
+  if (!is.character(offspring)) {
+    stop(sprintf(
+      "%s %s",
+      "'offspring' must be specified as a character string.",
+      "Did you forget to enclose it in quotes?"
+    ))
+  }
+}
+
+
+#' Check if constructed random number generator for offspring exists
+#'
+#' @param roffspring_name
+#'
+#' @return
+#' @export
+#'
+#' @examples check_offspring_exists("rpois")
+check_offspring_func_valid <- function(roffspring_name) {
+  if (!(exists(roffspring_name)) || !is.function(get(roffspring_name))) {
+    stop("Function ", roffspring_name, " does not exist.")
+  }
+}
+
+
+#' Check if the serials_sampler argument is specified as a function
+#'
+#' @param serials_sampler
+#'
+#' @return
+#' @export
+#' @keywords internal
+#' @examples
+check_serial_valid <- function(serials_sampler) {
+  if (!is.function(serials_sampler)) {
+    stop(sprintf(
+      "%s %s",
+      "The `serials_sampler` argument must be a function",
+      "(see details in ?sim_chain_tree)."
+    ))
+  }
+}
+
+
+check_nchains_valid <- function(nchains) {
+  if (nchains < 1 || is.infinite(nchains)) {
+    stop("`nchains` must be > 0 but less than `Inf`")
+  }
+}
+
+#' Determine and update the chain statistic being tracked
+#'
+#' @param stat_type
+#' @param noffspring
+#'
+#' @return
+#' @export
+#' @keywords internal
+#' @examples
+update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
+  if (stat_type == "size") {
+    stat_latest <- stat_latest + n_offspring
+  } else if (stat_type == "length") {
+    stat_latest <- stat_latest + pmin(1, n_offspring)
+  }
+
+  return(stat_latest)
+}

From 2f7c3ccb438a439b42d7fa4f0575e2b8d68060cf Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 13:02:23 +0100
Subject: [PATCH 242/828] added new CITATION file

---
 CITATION.cff | 57 +++++++++++++++++++++++++++-------------------------
 1 file changed, 30 insertions(+), 27 deletions(-)

diff --git a/CITATION.cff b/CITATION.cff
index 1b32ebeb..a4b1966a 100644
--- a/CITATION.cff
+++ b/CITATION.cff
@@ -1,15 +1,15 @@
 # -----------------------------------------------------------
-# CITATION file created with {cffr} R package, v0.4.1
+# CITATION file created with {cffr} R package, v0.5.0
 # See also: https://docs.ropensci.org/cffr/
 # -----------------------------------------------------------
  
 cff-version: 1.2.0
-message: 'To cite package "bpmodels" in publications use:'
+message: 'To cite package "epichains" in publications use:'
 type: software
 license: MIT
-title: 'bpmodels: Analysing transmission chain statistics using branching process
+title: 'epichains: Analysing transmission chain statistics using branching process
   models'
-version: 0.2.0
+version: 0.2.1
 abstract: Provides methods to analyse and simulate the size and length of branching
   processes with an arbitrary offspring distribution. These can be used, for example,
   to analyse the distribution of chain sizes or length of infectious disease outbreaks,
@@ -27,23 +27,23 @@ authors:
   given-names: James M.
   email: james.azam@lshtm.ac.uk
   orcid: https://orcid.org/0000-0001-5782-7330
-repository-code: https://github.com/epiverse-trace/bpmodels
-url: https://epiverse-trace.github.io/bpmodels/
+preferred-citation:
+  type: manual
+  title: 'epichains: Analysing transmission chain statistics using branching process
+    models'
+  authors:
+  - name: Sebastian Funk
+  - name: Flavio Finger
+  - name: James M. Azam
+  year: '2023'
+  url: https://github.com/epiverse-trace/epichains/
+repository-code: https://github.com/epiverse-trace/epichains
+url: https://epiverse-trace.github.io/epichains/
 contact:
 - family-names: Azam
   given-names: James M.
   email: james.azam@lshtm.ac.uk
   orcid: https://orcid.org/0000-0001-5782-7330
-keywords:
-- branching-process
-- epidemic-dynamics
-- epidemic-modelling
-- epidemic-simulations
-- outbreak-simulator
-- r
-- r-package
-- transmission-chain
-- transmission-chain-reconstruction
 references:
 - type: software
   title: 'R: A Language and Environment for Statistical Computing'
@@ -56,7 +56,7 @@ references:
   year: '2023'
   institution:
     name: R Foundation for Statistical Computing
-  version: '>= 3.0.0'
+  version: '>= 3.6.0'
 - type: software
   title: bookdown
   abstract: 'bookdown: Authoring Books and Technical Documents with R Markdown'
@@ -190,35 +190,38 @@ references:
   authors:
   - family-names: Allaire
     given-names: JJ
-    email: jj@rstudio.com
+    email: jj@posit.co
   - family-names: Xie
     given-names: Yihui
     email: xie@yihui.name
     orcid: https://orcid.org/0000-0003-0645-5666
+  - family-names: Dervieux
+    given-names: Christophe
+    email: cderv@posit.co
+    orcid: https://orcid.org/0000-0003-4474-2498
   - family-names: McPherson
     given-names: Jonathan
-    email: jonathan@rstudio.com
+    email: jonathan@posit.co
   - family-names: Luraschi
     given-names: Javier
-    email: javier@rstudio.com
   - family-names: Ushey
     given-names: Kevin
-    email: kevin@rstudio.com
+    email: kevin@posit.co
   - family-names: Atkins
     given-names: Aron
-    email: aron@rstudio.com
+    email: aron@posit.co
   - family-names: Wickham
     given-names: Hadley
-    email: hadley@rstudio.com
+    email: hadley@posit.co
   - family-names: Cheng
     given-names: Joe
-    email: joe@rstudio.com
+    email: joe@posit.co
   - family-names: Chang
     given-names: Winston
-    email: winston@rstudio.com
+    email: winston@posit.co
   - family-names: Iannone
     given-names: Richard
-    email: rich@rstudio.com
+    email: rich@posit.co
     orcid: https://orcid.org/0000-0003-3925-190X
   year: '2023'
 - type: software
@@ -230,7 +233,7 @@ references:
   authors:
   - family-names: Wickham
     given-names: Hadley
-    email: hadley@rstudio.com
+    email: hadley@posit.co
   year: '2023'
 - type: software
   title: truncdist

From 741fb617a7dafda3ca058f43ec34b893807f594e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 13:03:27 +0100
Subject: [PATCH 243/828] Added a script with input checking functions

---
 R/checks.R | 58 ++++++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 58 insertions(+)
 create mode 100644 R/checks.R

diff --git a/R/checks.R b/R/checks.R
new file mode 100644
index 00000000..dea04268
--- /dev/null
+++ b/R/checks.R
@@ -0,0 +1,58 @@
+#' Check if offspring argument is specified as a character string
+#'
+#' @param offspring
+#'
+#' @return
+#' @export
+#' @keywords internal
+#' @examples
+check_offspring_valid <- function(offspring) {
+  if (!is.character(offspring)) {
+    stop(sprintf(
+      "%s %s",
+      "'offspring' must be specified as a character string.",
+      "Did you forget to enclose it in quotes?"
+    ))
+  }
+}
+
+
+#' Check if constructed random number generator for offspring exists
+#'
+#' @param roffspring_name
+#'
+#' @return
+#' @export
+#'
+#' @examples check_offspring_exists("rpois")
+check_offspring_func_valid <- function(roffspring_name) {
+  if (!(exists(roffspring_name)) || !is.function(get(roffspring_name))) {
+    stop("Function ", roffspring_name, " does not exist.")
+  }
+}
+
+
+#' Check if the serials_sampler argument is specified as a function
+#'
+#' @param serials_sampler
+#'
+#' @return
+#' @export
+#' @keywords internal
+#' @examples
+check_serial_valid <- function(serials_sampler) {
+  if (!is.function(serials_sampler)) {
+    stop(sprintf(
+      "%s %s",
+      "The `serials_sampler` argument must be a function",
+      "(see details in ?sim_chain_tree)."
+    ))
+  }
+}
+
+
+check_nchains_valid <- function(nchains) {
+  if (nchains < 1 || is.infinite(nchains)) {
+    stop("`nchains` must be > 0 but less than `Inf`")
+  }
+}

From acf504cff7f58b81f1d563138a76567cb9547378 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 13:03:55 +0100
Subject: [PATCH 244/828] Added a script for helper functions

---
 R/helpers.R | 18 ++++++++++++++++++
 1 file changed, 18 insertions(+)
 create mode 100644 R/helpers.R

diff --git a/R/helpers.R b/R/helpers.R
new file mode 100644
index 00000000..53c93dbd
--- /dev/null
+++ b/R/helpers.R
@@ -0,0 +1,18 @@
+#' Determine and update the chain statistic being tracked
+#'
+#' @param stat_type
+#' @param noffspring
+#'
+#' @return
+#' @export
+#' @keywords internal
+#' @examples
+update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
+  if (stat_type == "size") {
+    stat_latest <- stat_latest + n_offspring
+  } else if (stat_type == "length") {
+    stat_latest <- stat_latest + pmin(1, n_offspring)
+  }
+
+  return(stat_latest)
+}

From 338371b9eb658387345adc99930a60dba200ff51 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 15:18:57 +0100
Subject: [PATCH 245/828] Restructured the references

---
 R/simulate.r | 8 +++++---
 1 file changed, 5 insertions(+), 3 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 516be648..10af068b 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -320,14 +320,16 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
 #' chains <- sim_chain_tree(nchains = 10, serials_sampler = function(x) 3,
 #' offspring = "pois", lambda = 2, infinite = 10)
 #' chains
-#' \references{Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
+#' @references
+#'
+#' Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
 #' between serial interval, infectiousness profile and generation time.
 #' J R Soc Interface. 2021 Jan;18(174):20200756.
 #' doi: 10.1098/rsif.2020.0756. Epub 2021 Jan 6.
 #' PMID: 33402022; PMCID: PMC7879757.
-#' }
 #'
-#' \references{Fine PE. The interval between successive cases of an
+#'
+#' Fine PE. The interval between successive cases of an
 #' infectious disease. Am J Epidemiol. 2003 Dec 1;158(11):1039-47.
 #' doi: 10.1093/aje/kwg251. PMID: 14630599.
 #' }

From d8475d4d5ed568f5dd496d8ac0be79fc37baa60c Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 15:19:29 +0100
Subject: [PATCH 246/828] Renamed infinite to chain_stat_max

---
 R/simulate.r | 12 ++++++------
 1 file changed, 6 insertions(+), 6 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 10af068b..79e79093 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -257,9 +257,9 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
 #'   \item "size": the total number of offspring.
 #'   \item "length": the total number of ancestors.
 #' }
-#' @param infinite A size or length above which the simulation results
-#' should be set to `Inf`. Defaults to `Inf`, resulting in no results
-#' ever set to `Inf`
+#' @param chain_stat_max A cut off for the chain statistic (size/length) being
+#' computed. Results above the specified value, are set to this value.
+#' Defaults to `Inf`.
 #' @param serials_sampler The serial interval generator function; the name of a
 #' user-defined named or anonymous function with only one argument `n`,
 #' representing the number of serial intervals to generate.
@@ -332,10 +332,10 @@ chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
 #' Fine PE. The interval between successive cases of an
 #' infectious disease. Am J Epidemiol. 2003 Dec 1;158(11):1039-47.
 #' doi: 10.1093/aje/kwg251. PMID: 14630599.
-#' }
-sim_chain_tree <- function(nchains, offspring_sampler,
+#'
+simulate_tree <- function(nchains, offspring_sampler,
                            chain_statistic = c("size", "length"),
-                           infinite = Inf, serials_sampler, t0 = 0,
+                           chain_stat_max = Inf, serials_sampler, t0 = 0,
                            tf = Inf, ...) {
   chain_statistic <- match.arg(chain_statistic)
 

From 7bcadbc6ddc0af327470148c11a23075ab5562b4 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 15:20:02 +0100
Subject: [PATCH 247/828] Deleted chain_sim function

---
 R/simulate.r | 247 ---------------------------------------------------
 1 file changed, 247 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 79e79093..820f3cf2 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,250 +1,3 @@
-#' Simulate transmission chains using a branching process
-#'
-#' @description \code{chain_sim()} is a stochastic simulator for generating
-#' transmission chain data with key inputs such as the offspring distribution
-#' and serial interval distribution.
-#' @param n Number of simulations to run.
-#' @param offspring Offspring distribution: a character string corresponding to
-#'   the R distribution function (e.g., "pois" for Poisson, where
-#'   \code{\link{rpois}} is the R function to generate Poisson random numbers)
-#' @param stat String; Statistic to calculate. Can be one of:
-#' \itemize{
-#'   \item "size": the total number of offspring.
-#'   \item "length": the total number of ancestors.
-#' }
-#' @param infinite A size or length above which the simulation results
-#' should be set to `Inf`. Defaults to `Inf`, resulting in no results
-#' ever set to `Inf`
-#' @param tree Logical. Should the transmission tree be returned? Defaults
-#' to `FALSE`.
-#' @param serial The serial interval generator function; the name of a
-#' user-defined named or anonymous function with only one argument `n`,
-#' representing the number of serial intervals to generate.
-#' @param t0 Start time (if serial interval is given); either a single value
-#' or a vector of length `n` (number of simulations) with initial times.
-#' Defaults to 0.
-#' @param tf End time (if serial interval is given).
-#' @param ... Parameters of the offspring distribution as required by R.
-#' @return Either:
-#' \itemize{
-#'  \item{A vector of sizes/lengths (if \code{tree == FALSE} OR serial
-#'   interval function not specified, since that implies
-#'   \code{tree == FALSE})}, or
-#'   \item {a data frame with
-#'   columns `n` (simulation ID), `time` (if the serial interval is given) and
-#'   (if \code{tree == TRUE}), `id` (a unique ID within each simulation for
-#'   each individual element of the chain), `ancestor` (the ID of the
-#'   ancestor of each element), and `generation`.}
-#' }
-#' @author Sebastian Funk, James M. Azam
-#' @export
-#' @details
-#' `chain_sim()` either returns a vector or a data.frame. The output is
-#' either a vector if `serial` is not provided, which automatically sets
-#' \code{tree = FALSE}, or a `data.frame`, which means that `serial` was
-#' provided as a function. When `serial` is provided, it means
-#' \code{tree = TRUE} automatically. However, setting \code{tree = TRUE}
-#' would require providing a function for `serial`.
-#'
-#' # The serial interval (`serial`):
-#'
-#' ## Assumptions/disambiguation
-#'
-#' In epidemiology, the generation interval is the duration between successive
-#' infectious events in a chain of transmission. Similarly, the serial
-#' interval is the duration between observed symptom onset times between
-#' successive cases in a transmission chain. The generation interval is
-#' often hard to observe because exact times of infection are hard to
-#' measure hence, the serial interval is often used instead. Here, we
-#' use the serial interval to represent what would normally be called the
-#' generation interval, that is, the time between successive cases.
-#'
-#' ## Specifying `serial` in `chain_sim()`
-#'
-#' `serial` must be specified as a named or
-#' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R) # nolint
-#' with one argument.
-#'
-#' If `serial` is specified, `chain_sim()` returns times of
-#' infection as a column in the output. Moreover, specifying a function
-#' for `serial` implies \code{tree = TRUE} and a tree of
-#' infectors (`ancestor`) and infectees (`id`) will be generated in the output.
-#'
-#' For example, assuming we want to specify the serial interval
-#' generator as a random log-normally distributed variable with
-#' `meanlog = 0.58` and `sdlog = 1.58`, we could define a named function,
-#' let's call it "serial_interval", with only one argument representing the
-#' number of serial intervals to sample:
-#' \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
-#' and assign the name of the function to serial in `chain_sim()` like so
-#' \code{chain_sim(..., serial = serial_interval)},
-#' where `...` are the other arguments to `chain_sim()`. Alternatively, we
-#' could assign an anonymous function to serial in the `chain_sim()` call
-#' like so \code{chain_sim(..., serial = function(n){rlnorm(n, 0.58, 1.38)})},
-#' where `...` are the other arguments to `chain_sim()`.
-#' @examples
-#' # Specifying no `serial` and `tree == FALSE` (default) returns a vector
-#' set.seed(123)
-#' chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5,
-#' tree = FALSE)
-#'
-#' # Specifying `serial` without specifying `tree` will set `tree = TRUE`
-#' # internally.
-#'
-#' # We'll first define the serial function
-#' set.seed(123)
-#' serial_interval <- function(n) {
-#'   rlnorm(n, meanlog = 0.58, sdlog = 1.58)
-#' }
-#' chain_sim(
-#'   n = 5, offspring = "pois", lambda = 0.5, stat = "length",
-#'   infinite = 100,
-#'   serial = serial_interval
-#' )
-#'
-#' # Specifying `serial` and `tree = FALSE` will throw an error
-#' set.seed(123)
-#' \dontrun{
-#' try(chain_sim(
-#'   n = 10, serial = function(x) 3, offspring = "pois", lambda = 2,
-#'   infinite = 10, tree = FALSE
-#' ))
-#' }
-chain_sim <- function(n, offspring, stat = c("size", "length"), infinite = Inf,
-                      tree = FALSE, serial, t0 = 0, tf = Inf, ...) {
-  stat <- match.arg(stat)
-
-  ## first, get random function as given by `offspring`
-  if (!is.character(offspring)) {
-    stop(sprintf("%s %s",
-                 "Object passed as 'offspring' is not a character string.",
-                 "Did you forget to enclose it in quotes?"
-               )
-         )
-  }
-
-  roffspring_name <- paste0("r", offspring)
-  if (!(exists(roffspring_name)) || !is.function(get(roffspring_name))) {
-    stop("Function ", roffspring_name, " does not exist.")
-  }
-
-  if (!missing(serial)) {
-    if (!is.function(serial)) {
-      stop(sprintf("%s %s",
-                   "The `serial` argument must be a function",
-                 "(see details in ?chain_sim)."
-                 )
-           )
-    }
-      if (!missing(tree) && isFALSE(tree)) {
-            warning(sprintf("%s %s",
-                            "`serial` can't be used with `tree = FALSE`;",
-                          "Setting `tree = TRUE` internally."
-                          )
-                    )
-      }
-    tree <- TRUE
-  } else if (!missing(tf)) {
-    stop("If `tf` is specified, `serial` must be specified too.")
-  }
-
-  stat_track <- rep(1, n) ## track length or size (depending on `stat`)
-  n_offspring <- rep(1, n) ## current number of offspring
-  sim <- seq_len(n) ## track chains that are still being simulated
-
-  ## initialise data frame to hold the trees
-  if (tree) {
-    generation <- 1L
-    tdf <-
-      data.frame(
-        n = seq_len(n),
-        id = 1L,
-        ancestor = NA_integer_,
-        generation = generation
-      )
-
-    ancestor_ids <- rep(1, n)
-    if (!missing(serial)) {
-      tdf$time <- t0
-      times <- tdf$time
-    }
-  }
-
-  ## next, simulate n chains
-  while (length(sim) > 0) {
-    ## simulate next generation
-    next_gen <- get(roffspring_name)(n = sum(n_offspring[sim]), ...)
-    if (any(next_gen %% 1 > 0)) {
-      stop("Offspring distribution must return integers")
-    }
-
-    ## record indices corresponding to the number of offspring
-    indices <- rep(sim, n_offspring[sim])
-
-    ## initialise number of offspring
-    n_offspring <- rep(0, n)
-    ## assign offspring sum to indices still being simulated
-    n_offspring[sim] <- tapply(next_gen, indices, sum)
-
-    ## track size/length
-    if (stat == "size") {
-      stat_track <- stat_track + n_offspring
-    } else if (stat == "length") {
-      stat_track <- stat_track + pmin(1, n_offspring)
-    }
-
-    ## record times/ancestors (if tree==TRUE)
-    if (tree && sum(n_offspring[sim]) > 0) {
-      ancestors <- rep(ancestor_ids, next_gen)
-      current_max_id <- unname(tapply(ancestor_ids, indices, max))
-      indices <- rep(sim, n_offspring[sim])
-      ids <- rep(current_max_id, n_offspring[sim]) +
-        unlist(lapply(n_offspring[sim], seq_len))
-      generation <- generation + 1L
-      new_df <-
-        data.frame(
-          n = indices,
-          id = ids,
-          ancestor = ancestors,
-          generation = generation
-        )
-      if (!missing(serial)) {
-        times <- rep(times, next_gen) + serial(sum(n_offspring))
-        current_min_time <- unname(tapply(times, indices, min))
-        new_df$time <- times
-      }
-      tdf <- rbind(tdf, new_df)
-    }
-
-    ## only continue to simulate chains that offspring and aren't of
-    ## infinite size/length
-    sim <- which(n_offspring > 0 & stat_track < infinite)
-    if (length(sim) > 0) {
-      if (!missing(serial)) {
-        ## only continue to simulate chains that don't go beyond tf
-        sim <- intersect(sim, unique(indices)[current_min_time < tf])
-      }
-      if (tree) {
-        if (!missing(serial)) {
-          times <- times[indices %in% sim]
-        }
-        ancestor_ids <- ids[indices %in% sim]
-      }
-    }
-  }
-
-  if (tree) {
-    if (!missing(tf)) {
-      tdf <- tdf[tdf$time < tf, ]
-    }
-    rownames(tdf) <- NULL
-    return(tdf)
-  } else {
-    stat_track[stat_track >= infinite] <- Inf
-    return(stat_track)
-  }
-}
-
 #' Simulate tree of infections
 #'
 #' @param nchains number of chains to simulate

From 176bf592879c51ef34d8e5bd9ced5d6d59beec67 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 15:21:42 +0100
Subject: [PATCH 248/828] Renamed tdf to tree_df

---
 R/simulate.r | 17 ++++++++---------
 1 file changed, 8 insertions(+), 9 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 820f3cf2..d7951972 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -115,7 +115,7 @@ simulate_tree <- function(nchains, offspring_sampler,
 
   # initialise data frame to hold the transmission trees
   generation <- 1L
-  tdf <- data.frame(
+  tree_df <- data.frame(
     n = seq_len(nchains),
     id = 1L,
     ancestor = NA_integer_,
@@ -123,8 +123,8 @@ simulate_tree <- function(nchains, offspring_sampler,
   )
 
   if (!missing(serials_sampler)) {
-    tdf$time <- t0
-    times <- tdf$time
+    tree_df$time <- t0
+    times <- tree_df$time
   }
 
   # next, simulate n chains
@@ -177,12 +177,12 @@ simulate_tree <- function(nchains, offspring_sampler,
         current_min_time <- unname(tapply(times, indices, min))
         new_df$time <- times
       }
-      tdf <- rbind(tdf, new_df)
+      tree_df <- rbind(tree_df, new_df)
     }
 
     ## only continue to simulate chains that have offspring and aren't of
     ## infinite size/length
-    sim <- which(n_offspring > 0 & stat_track < infinite)
+    sim <- which(n_offspring > 0 & stat_track < chain_stat_max)
     if (length(sim) > 0) {
       if (!missing(serials_sampler)) {
         ## only continue to simulate chains that don't go beyond tf
@@ -196,15 +196,14 @@ simulate_tree <- function(nchains, offspring_sampler,
     }
 
   if (!missing(tf)) {
-    tdf <- tdf[tdf$time < tf, ]
+    tree_df <- tree_df[tree_df$time < tf, ]
   }
 
   structure(
-    tdf,
-    chain_type = "chains_tree",
+    tree_df,
     chains = nchains,
     rownames = NULL,
-    class = c("epichains", "tbl", "data.frame")
+    class = c("epichains_tree", "tbl", "data.frame")
   )
 }
 

From 79d9ff9e1b02b907df446040a785c0d17f5670bc Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 15:23:13 +0100
Subject: [PATCH 249/828] Renamed infinite to chain_stat_max

---
 R/simulate.r | 14 +++++++-------
 1 file changed, 7 insertions(+), 7 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index d7951972..99579f74 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -213,11 +213,11 @@ simulate_tree <- function(nchains, offspring_sampler,
 #'
 #' @inheritParams sim_chain_tree
 #'
-#' @examples #' sim_chain_vect(n = 10, offspring_sampler = "pois", lambda = 2,
-#' infinite = 10)
-sim_chain_vect <- function(nchains, offspring_sampler,
+#' @examples #' simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,
+#' chain_stat_max = 10)
+simulate_vect <- function(nchains, offspring_sampler,
                            chain_statistic = c("size", "length"),
-                           infinite = Inf, ...) {
+                           chain_stat_max = Inf, ...) {
   chain_statistic <- match.arg(chain_statistic)
 
   check_nchains_valid(nchains = nchains)
@@ -257,11 +257,11 @@ sim_chain_vect <- function(nchains, offspring_sampler,
                                     )
 
     ## only continue to simulate chains that offspring and aren't of
-    ## infinite size/length
-    sim <- which(n_offspring > 0 & stat_track < infinite)
+    ## chain_stat_max size/length
+    sim <- which(n_offspring > 0 & stat_track < chain_stat_max)
   }
 
-  stat_track[stat_track >= infinite] <- Inf
+  stat_track[stat_track >= chain_stat_max] <- Inf
 
   structure(
     stat_track,

From adaa03e82a38ab1cc0672a877c44de5a0f9ad510 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:05:53 +0100
Subject: [PATCH 250/828] Modified simulate_tree title

---
 R/simulate.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index 99579f74..a1661026 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,4 +1,4 @@
-#' Simulate tree of infections
+#' Simulate a tree of infections with a serial and offspring distributions
 #'
 #' @param nchains number of chains to simulate
 #' @param offspring_sampler Offspring distribution: a character string

From 87eb4fc2de1ba339d771d484cb5b242e879ec64d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:06:27 +0100
Subject: [PATCH 251/828] Changed the tree_df column names

---
 R/simulate.r | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index a1661026..cc9c2ee1 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -116,8 +116,8 @@ simulate_tree <- function(nchains, offspring_sampler,
   # initialise data frame to hold the transmission trees
   generation <- 1L
   tree_df <- data.frame(
-    n = seq_len(nchains),
-    id = 1L,
+    chain_id = seq_len(nchains),
+    sim_id = 1L,
     ancestor = NA_integer_,
     generation = generation
   )
@@ -164,8 +164,8 @@ simulate_tree <- function(nchains, offspring_sampler,
       # store new simulation results
       new_df <-
         data.frame(
-          n = indices,
-          id = ids,
+          chain_id = indices,
+          sim_id = ids,
           ancestor = ancestors,
           generation = generation
         )

From 0c533168861678a2e3a9c9819302525ac49ad8c1 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:08:25 +0100
Subject: [PATCH 252/828] FixModified epichains object attributes

---
 R/simulate.r | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index cc9c2ee1..fcc47686 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -202,8 +202,9 @@ simulate_tree <- function(nchains, offspring_sampler,
   structure(
     tree_df,
     chains = nchains,
+    chain_type = "chains_tree",
     rownames = NULL,
-    class = c("epichains_tree", "tbl", "data.frame")
+    class = c("epichains", "tbl", "data.frame")
   )
 }
 

From d1364ce203f67314dcffb64ebeb9fe3d61dd6092 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:09:08 +0100
Subject: [PATCH 253/828] Replaced old function names with new in function docs

---
 R/simulate.r | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index fcc47686..b867f28b 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -65,12 +65,12 @@
 #'
 #' Alternatively, we could assign an anonymous function to `serials_sampler`
 #' in the `sim_chain_tree()` call like so
-#' \code{sim_chain_tree(..., serials_sampler = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
-#' where `...` are the other arguments to `sim_chain_tree()`.
-#' @seealso [sim_chain_vec()] for simulating transmission chains as a vector
+#' \code{simulate_tree(..., serials_sampler = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
+#' where `...` are the other arguments to `simulate_tree()`.
+#' @seealso [simulate_vec()] for simulating transmission chains as a vector
 #' @examples
 #' set.seed(123)
-#' chains <- sim_chain_tree(nchains = 10, serials_sampler = function(x) 3,
+#' chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,
 #' offspring = "pois", lambda = 2, infinite = 10)
 #' chains
 #' @references

From 05dd1af0674176c5c76ff88bd06d28dd9b01240f Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:09:32 +0100
Subject: [PATCH 254/828] Documented chain_stat_max in simulate_vec()

---
 R/simulate.r | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index b867f28b..0fe282c2 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -213,7 +213,8 @@ simulate_tree <- function(nchains, offspring_sampler,
 #' Simulate transmission chains without tree (as a vector)
 #'
 #' @inheritParams sim_chain_tree
-#'
+#' @param chain_stat_max A cut off for the chain statistic (size/length) being
+#' computed. Results above the specified value, are set to `Inf`.
 #' @examples #' simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,
 #' chain_stat_max = 10)
 simulate_vect <- function(nchains, offspring_sampler,

From 01fcd78f9a5079e980e1dc668d40703f3d5cdd28 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:23:13 +0100
Subject: [PATCH 255/828] Moved checking functions

---
 R/simulate.r | 41 -----------------------------------------
 1 file changed, 41 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 0fe282c2..b32ce592 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -274,13 +274,11 @@ simulate_vect <- function(nchains, offspring_sampler,
 }
 
 
-#' Check if offspring argument is specified as a character string
 #'
 #' @param offspring
 #'
 #' @return
 #' @export
-#' @keywords internal
 #' @examples
 check_offspring_valid <- function(offspring) {
   if (!is.character(offspring)) {
@@ -295,34 +293,12 @@ check_offspring_valid <- function(offspring) {
 
 #' Check if constructed random number generator for offspring exists
 #'
-#' @param roffspring_name
-#'
-#' @return
-#' @export
-#'
-#' @examples check_offspring_exists("rpois")
-check_offspring_func_valid <- function(roffspring_name) {
-  if (!(exists(roffspring_name)) || !is.function(get(roffspring_name))) {
-    stop("Function ", roffspring_name, " does not exist.")
   }
 }
 
 
 #' Check if the serials_sampler argument is specified as a function
 #'
-#' @param serials_sampler
-#'
-#' @return
-#' @export
-#' @keywords internal
-#' @examples
-check_serial_valid <- function(serials_sampler) {
-  if (!is.function(serials_sampler)) {
-    stop(sprintf(
-      "%s %s",
-      "The `serials_sampler` argument must be a function",
-      "(see details in ?sim_chain_tree)."
-    ))
   }
 }
 
@@ -330,24 +306,7 @@ check_serial_valid <- function(serials_sampler) {
 check_nchains_valid <- function(nchains) {
   if (nchains < 1 || is.infinite(nchains)) {
     stop("`nchains` must be > 0 but less than `Inf`")
-  }
-}
 
 #' Determine and update the chain statistic being tracked
-#'
-#' @param stat_type
-#' @param noffspring
-#'
-#' @return
-#' @export
-#' @keywords internal
-#' @examples
-update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
-  if (stat_type == "size") {
-    stat_latest <- stat_latest + n_offspring
-  } else if (stat_type == "length") {
-    stat_latest <- stat_latest + pmin(1, n_offspring)
   }
 
-  return(stat_latest)
-}

From 00d7fa93a60ad4cd6a5b5a5b573b63dfe514433e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:24:47 +0100
Subject: [PATCH 256/828] Updated the column names for col-type validation

---
 R/epichains.R | 12 ++++++------
 1 file changed, 6 insertions(+), 6 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 62920f32..0bfc2593 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -125,15 +125,15 @@ validate_epichains <- function(x) {
 
   # check for class invariants
 
-  if (attributes(x)$is_tree) {
+  if (attributes(x)$chain_type == "chains_tree") {
     stopifnot(
       "object does not contain the correct columns" =
-        c("n", "id", "ancestor", "generation", "time") %in%
+        c("chain_id", "sim_id", "ancestor", "generation", "time") %in%
           colnames(x),
-      "column `n` must be a numeric" =
-        is.numeric(x$n),
-      "column `id` must be a numeric" =
-        is.numeric(x$id),
+      "column `chain_id` must be a numeric" =
+        is.numeric(x$chain_id),
+      "column `sim_id` must be a numeric" =
+        is.numeric(x$sim_id),
       "column `ancestor` must be a numeric" =
         is.numeric(x$ancestor),
       "column `generation` must be a numeric" =

From 58c0c136c986c86b5c98641ee5c743cff0f0f354 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:25:21 +0100
Subject: [PATCH 257/828] Restructured the format method for epichains objects

---
 R/epichains.R | 35 ++++++++++++++++++++++++++++-------
 1 file changed, 28 insertions(+), 7 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 0bfc2593..02ef7bf6 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -12,15 +12,32 @@ print.epichains <- function(x, ...) {
 #'
 #' @examples
 format.epichains <- function(x, ...) {
+  # check that x is an epichains object
+  validate_epichains(x)
+
+  # summarise the information stored in x
   chain_info <- summary(x)
+
   if (attributes(x)$chain_type == "chains_tree") {
-    cat("head starting from first known ancestor \n")
-    print(tibble::as_tibble(head(subset(x, !is.na(ancestor)))))
-    cat("--- \n")
-    print(tail(tibble::as_tibble(x)))
     writeLines(
       c(
-        sprintf("`epichains` `chains_tree` object"),
+        sprintf("`epichains` object"),
+
+        "< tree head (from first known ancestor) >\n"
+        )
+      )
+
+    # print head of the simulation output
+    print(head(subset(as.data.frame(x), !is.na(ancestor))))
+
+    cat("< tree tail >\n")
+
+    # print tail of object
+    print(tail(as.data.frame(x)))
+
+    # print summary information
+    writeLines(
+      c(
         sprintf("Chains simulated: %s", chain_info[["chains"]]),
         sprintf(
           "Unique number of ancestors: %s",
@@ -31,8 +48,10 @@ format.epichains <- function(x, ...) {
         )
       )
     )
+
+    # Offer more information to view the full dataset
     writeLines(sprintf("Use View(<object_name>) to view the full output."))
-    invisible(x)
+
   } else if (attributes(x)$chain_type == "chains_vec") {
     cat(sprintf("epichains object \n"))
     print(as.vector(x))
@@ -42,12 +61,14 @@ format.epichains <- function(x, ...) {
         )
     writeLines(
       c(
-        cat("\n Simulated chain stats: \n"),
+        "\n Simulated chain stats: \n",
         sprintf("Max: %s", chain_info[["max_chain_stat"]]),
         sprintf("Min: %s", chain_info[["min_chain_stat"]])
       )
     )
   }
+
+  invisible(x)
 }
 
 
From bfaa94dcde874dcfba089e00dd53d80fee323d8e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:26:27 +0100
Subject: [PATCH 258/828] Added a functions for simulating infections with an
 initial susceptible pool

---
 R/simulate.r | 175 ++++++++++++++++++++++++++++++++++++++++++++-------
 1 file changed, 152 insertions(+), 23 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index b32ce592..19567632 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -273,40 +273,169 @@ simulate_vect <- function(nchains, offspring_sampler,
   )
 }
 
-
+#' Simulate a tree of infections from an initial susceptible population
+#' with initial immunity
+#'
+#' @param offspring_sampler offspring distribution sampler: a character string
+#' corresponding to the R distribution function. Currently only "pois" &
+#' "nbinom" are supported. Internally truncated distributions are used to
+#' avoid infecting more people than susceptibles available.
+#' @param mn_offspring the average number of secondary cases for each case
+#' @param disp_offspring the dispersion coefficient (var/mean) of the number of
+#'      secondary cases. Ignored if offspring == "pois". Must be > 1.
+#' @param serial_sampler the serial interval. A function that takes one
+#' parameter (`n`), the number of serial intervals to randomly sample.
+#'     Value must be >= 0.
+#' @param t0 start time
+#' @param tf end time
+#' @param pop the population
+#' @param initial_immune the number of initial immunes in the population
+#' @return a data frame with columns `time`, `id` (a unique ID for each
+#'     individual element of the chain), `ancestor` (the ID of the ancestor
+#'      of each element), and `generation`.
 #'
-#' @param offspring
+#' @details This function has a couple of key differences with chain_sim:
+#'     it can only simulate one chain at a time,
+#'     it can only handle implemented offspring distributions
+#'         ("pois" and "nbinom"),
+#'     it always tracks and returns a data frame containing the entire tree,
+#'     the maximal length of chains is limited with pop instead of infinite.
 #'
-#' @return
+#' @author Flavio Finger
+#' @author James M. Azam
 #' @export
 #' @examples
-check_offspring_valid <- function(offspring) {
-  if (!is.character(offspring)) {
-    stop(sprintf(
-      "%s %s",
-      "'offspring' must be specified as a character string.",
-      "Did you forget to enclose it in quotes?"
-    ))
+#' chain_sim_susc(pop = 100, offspring_sampler = "pois", mn_offspring = 0.5,
+#' serial_sampler = function(x) 3)
+simulate_tree_tracked <- function(pop = 100,
+                          offspring_sampler = c("pois", "nbinom"),
+                          mn_offspring,
+                          disp_offspring,
+                          serial_sampler,
+                          t0 = 0,
+                          tf = Inf,
+                          initial_immune = 0) {
+  offspring_sampler <- match.arg(offspring_sampler)
+
+  if (offspring_sampler == "pois") {
+    if (!missing(disp_offspring)) {
+      warning(sprintf("%s %s",
+                      "Argument 'disp_offspring' not used for",
+                      "poisson offspring distribution."
+                      )
+              )
+    }
+
+    ## using a right truncated poisson distribution
+    ## to avoid more cases than susceptibles
+    offspring_fun <- function(n, susc) {
+      truncdist::rtrunc(
+        n,
+        spec = "pois",
+        lambda = mn_offspring * susc / pop,
+        b = susc
+      )
+    }
+  } else if (offspring_sampler == "nbinom") {
+    if (missing(disp_offspring)) {
+      stop(sprintf("%s", "Argument 'disp_offspring' must be specified."))
+    } else if (disp_offspring <= 1) { ## dispersion coefficient
+      stop(sprintf("%s %s %s",
+                   "Offspring distribution 'nbinom' requires",
+                   "argument 'disp_offspring' > 1.",
+                   "Use 'pois' if there is no overdispersion."
+      ))
+    }
+    offspring_fun <- function(n, susc) {
+      ## get distribution params from mean and dispersion
+      ## see ?rnbinom for parameter definition
+      new_mn <- mn_offspring * susc / pop ## apply susceptibility
+      size <- new_mn / (disp_offspring - 1)
+
+      ## using a right truncated nbinom distribution
+      ## to avoid more cases than susceptibles
+      truncdist::rtrunc(
+        n,
+        spec = "nbinom",
+        b = susc,
+        mu = new_mn,
+        size = size
+      )
+    }
   }
-}
 
+  ## initializations
+  tdf <- data.frame(
+    id = 1L,
+    ancestor = NA_integer_,
+    generation = 1L,
+    time = t0,
+    offspring_generated = FALSE
+  )
 
-#' Check if constructed random number generator for offspring exists
-#'
-  }
-}
+  susc <- pop - initial_immune - 1L
+  t <- t0
 
+  ## continue if any unsimulated has t <= tf
+  ## AND there is still susceptibles left
+  while (
+    any(tdf$time[!tdf$offspring_generated] <= tf) &&
+    susc > 0
+  ) {
 
-#' Check if the serials_sampler argument is specified as a function
-#'
-  }
-}
+    ## select from which case to generate offspring
+    t <- min(tdf$time[!tdf$offspring_generated]) # lowest unsimulated t
+
+    ## index of the first in df with t, extract vars
+    idx <- which(tdf$time == t & !tdf$offspring_generated)[1]
+    id_parent <- tdf$id[idx]
+    t_parent <- tdf$time[idx]
+    gen_parent <- tdf$generation[idx]
+
+    ## generate it
+    current_max_id <- max(tdf$id)
+    n_offspring <- offspring_fun(1, susc)
+
+    if (n_offspring %% 1 > 0) {
+      stop("Offspring distribution must return integers")
+    }
 
+    ## mark as done
+    tdf$offspring_generated[idx] <- TRUE
 
-check_nchains_valid <- function(nchains) {
-  if (nchains < 1 || is.infinite(nchains)) {
-    stop("`nchains` must be > 0 but less than `Inf`")
+    ## add to df
+    if (n_offspring > 0) {
+      ## draw times
+      new_times <- serial(n_offspring)
+
+      if (any(new_times < 0)) {
+        stop("Serial interval must be >= 0.")
+      }
 
-#' Determine and update the chain statistic being tracked
+      new_df <- data.frame(
+        id = current_max_id + seq_len(n_offspring),
+        time = new_times + t_parent,
+        ancestor = id_parent,
+        generation = gen_parent + 1L,
+        offspring_generated = FALSE
+      )
+
+      ## add new cases to tdf
+      tdf <- rbind(tdf, new_df)
+    }
+
+    ## adjust susceptibles
+    susc <- susc - n_offspring
   }
 
+  ## remove cases with time > tf that could
+  ## have been generated in the last generation
+  tdf <- tdf[tdf$time <= tf, ]
+
+  ## sort output and remove columns not needed
+  tdf <- tdf[order(tdf$time, tdf$id), ]
+  tdf$offspring_generated <- NULL
+
+  return(tdf)
+}
+

From 0dd2321390f5e098fe1419a2fdf3a7defb82be46 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:23:23 +0100
Subject: [PATCH 259/828] Added an epichains attribute to indicate if pop is
 tracked

---
 R/simulate.r | 1 +
 1 file changed, 1 insertion(+)

diff --git a/R/simulate.r b/R/simulate.r
index 19567632..c9f20cfc 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -204,6 +204,7 @@ simulate_tree <- function(nchains, offspring_sampler,
     chains = nchains,
     chain_type = "chains_tree",
     rownames = NULL,
+    track_pop = FALSE,
     class = c("epichains", "tbl", "data.frame")
   )
 }

From 1c7ef52d5bf98de50b28c240022d5236ac9f7f5f Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:23:54 +0100
Subject: [PATCH 260/828] Now summarising maximum generations

---
 R/epichains.R | 8 +++++---
 1 file changed, 5 insertions(+), 3 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 02ef7bf6..3ec42473 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -94,7 +94,9 @@ summary.epichains <- function(x, ...) {
       unique(x$ancestor[!is.na(x$ancestor)])
     )
 
-    num_generations <- length(unique(x$generations))
+    num_generations <- length(unique(x$generation))
+
+    max_generation <- max(x$generation)
 
     # out of summary
     res <- list(
@@ -102,8 +104,8 @@ summary.epichains <- function(x, ...) {
       max_time = max_time,
       unique_ancestors = n_unique_ancestors,
       unique_generations = n_unique_ancestors,
-      num_generations = num_generations
-      # WIP
+      num_generations = num_generations,
+      max_generation = max_generation
     )
   } else if (attributes(x)$chain_type == "chains_vec") {
     chains_ran <- length(x)

From 557861aedd5575087c59c1b198b1968aa75b8f03 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:24:13 +0100
Subject: [PATCH 261/828] Added epichains validation to summary method

---
 R/epichains.R | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 3ec42473..46ce1484 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -83,8 +83,9 @@ format.epichains <- function(x, ...) {
 #'
 #' @examples
 summary.epichains <- function(x, ...) {
+  validate_epichains(x)
+
   if (attributes(x)$chain_type == "chains_tree") {
-    is_epichains(x)
 
     chains_ran <- length(x$n)
 

From 57284a186d196d5b85fa32c50abc4cacad74135d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:24:37 +0100
Subject: [PATCH 262/828] Removed chain_id column as an invariant of epichains
 class

---
 R/epichains.R | 4 +---
 1 file changed, 1 insertion(+), 3 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 46ce1484..55ce2043 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -152,10 +152,8 @@ validate_epichains <- function(x) {
   if (attributes(x)$chain_type == "chains_tree") {
     stopifnot(
       "object does not contain the correct columns" =
-        c("chain_id", "sim_id", "ancestor", "generation", "time") %in%
+        c("sim_id", "ancestor", "generation", "time") %in%
           colnames(x),
-      "column `chain_id` must be a numeric" =
-        is.numeric(x$chain_id),
       "column `sim_id` must be a numeric" =
         is.numeric(x$sim_id),
       "column `ancestor` must be a numeric" =

From 0c027f338504deb9a5cb5a5f04dc8ce1307e2a37 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:25:10 +0100
Subject: [PATCH 263/828] Added a function to extract truncated poisson or
 nbinom function

---
 R/helpers.R | 40 ++++++++++++++++++++++++++++++++++++++++
 1 file changed, 40 insertions(+)

diff --git a/R/helpers.R b/R/helpers.R
index 53c93dbd..403e98da 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -16,3 +16,43 @@ update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
 
   return(stat_latest)
 }
+
+
+#' Get offspring sampling function
+#'
+#' @param offspring_sampler
+#'
+#' @return
+#' @export
+#'
+#' @examples
+get_offspring_func <- function(offspring_sampler) {
+  if (offspring_sampler == "nbinom") {
+    function(n, susc, pop, mean_offspring, disp_offspring) {
+      ## get distribution params from mean and dispersion
+      new_mn <- mean_offspring * susc / pop ## apply susceptibility
+      size <- new_mn / (disp_offspring - 1)
+
+      ## using a right truncated nbinom distribution
+      ## to avoid more cases than susceptibles
+      truncdist::rtrunc(
+        n,
+        spec = "nbinom",
+        b = susc,
+        mu = new_mn,
+        size = size
+      )
+    }
+  } else if (offspring_sampler == "pois") {
+    function(n, susc, pop, mean_offspring) {
+      truncdist::rtrunc(
+        n,
+        spec = "pois",
+        lambda = mean_offspring * susc / pop,
+        b = susc
+      )
+    }
+  } else{
+    stop("offspring_sampler must either be 'pois' or 'nbinom'")
+  }
+}

From 451ae90829872a6ef8f0aae154922a91bc2188c3 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:26:04 +0100
Subject: [PATCH 264/828] Added epichains class to simulation function

---
 R/simulate.r | 8 +++++++-
 1 file changed, 7 insertions(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index c9f20cfc..8ec8e5b2 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -437,6 +437,12 @@ simulate_tree_tracked <- function(pop = 100,
   tdf <- tdf[order(tdf$time, tdf$id), ]
   tdf$offspring_generated <- NULL
 
-  return(tdf)
+  structure(
+    tree_df,
+    chain_type = "chains_tree",
+    rownames = NULL,
+    track_pop = TRUE,
+    class = c("epichains", "tbl", "data.frame")
+  )
 }
 

From 70fc9494ad24cf04ee2158c709a558f5b3f91228 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:28:11 +0100
Subject: [PATCH 265/828] Moved the offspring function definition to the helper
 script

---
 R/simulate.r | 27 +++------------------------
 1 file changed, 3 insertions(+), 24 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 8ec8e5b2..4a8fee01 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -329,14 +329,8 @@ simulate_tree_tracked <- function(pop = 100,
 
     ## using a right truncated poisson distribution
     ## to avoid more cases than susceptibles
-    offspring_fun <- function(n, susc) {
-      truncdist::rtrunc(
-        n,
-        spec = "pois",
-        lambda = mn_offspring * susc / pop,
-        b = susc
-      )
-    }
+    offspring_fun <- get_offspring_func(offspring_sampler)
+
   } else if (offspring_sampler == "nbinom") {
     if (missing(disp_offspring)) {
       stop(sprintf("%s", "Argument 'disp_offspring' must be specified."))
@@ -347,22 +341,7 @@ simulate_tree_tracked <- function(pop = 100,
                    "Use 'pois' if there is no overdispersion."
       ))
     }
-    offspring_fun <- function(n, susc) {
-      ## get distribution params from mean and dispersion
-      ## see ?rnbinom for parameter definition
-      new_mn <- mn_offspring * susc / pop ## apply susceptibility
-      size <- new_mn / (disp_offspring - 1)
-
-      ## using a right truncated nbinom distribution
-      ## to avoid more cases than susceptibles
-      truncdist::rtrunc(
-        n,
-        spec = "nbinom",
-        b = susc,
-        mu = new_mn,
-        size = size
-      )
-    }
+    offspring_fun <- get_offspring_func(offspring_sampler)
   }
 
   ## initializations

From a0883b0d569d868dc63021070c4ffe9fee23d188 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:29:52 +0100
Subject: [PATCH 266/828] Documented the simulation function

---
 R/simulate.r | 56 ++++++++++++++++++++++++++++++++--------------------
 1 file changed, 35 insertions(+), 21 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 4a8fee01..ba0fd982 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -277,37 +277,51 @@ simulate_vect <- function(nchains, offspring_sampler,
 #' Simulate a tree of infections from an initial susceptible population
 #' with initial immunity
 #'
-#' @param offspring_sampler offspring distribution sampler: a character string
+#' @param pop The susceptible population.
+#' @param offspring_sampler Offspring distribution sampler: a character string
 #' corresponding to the R distribution function. Currently only "pois" &
 #' "nbinom" are supported. Internally truncated distributions are used to
 #' avoid infecting more people than susceptibles available.
-#' @param mn_offspring the average number of secondary cases for each case
-#' @param disp_offspring the dispersion coefficient (var/mean) of the number of
-#'      secondary cases. Ignored if offspring == "pois". Must be > 1.
-#' @param serial_sampler the serial interval. A function that takes one
-#' parameter (`n`), the number of serial intervals to randomly sample.
-#'     Value must be >= 0.
-#' @param t0 start time
-#' @param tf end time
-#' @param pop the population
-#' @param initial_immune the number of initial immunes in the population
+#' @param mean_offspring The average number of secondary cases for each case.
+#' Same as R0.
+#' @param disp_offspring The dispersion parameter of the number of
+#' secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
+#' avoid division by 0 when calculating the size. See details and
+#'  \code{?rnbinom} for details on the parameterisation in Ecology.
+#' @param serial_sampler The serial interval. A function that takes one
+#' parameter (`n`), the number of serial intervals to randomly sample. Value
+#' must be >= 0.
+#' @param initial_immune The number of initial immunes in the population.
+#' @param t0 Start time; Defaults to 0.
+#' @param tf End time; Defaults to `Inf`.
 #' @return a data frame with columns `time`, `id` (a unique ID for each
-#'     individual element of the chain), `ancestor` (the ID of the ancestor
-#'      of each element), and `generation`.
+#' individual element of the chain), `ancestor` (the ID of the ancestor
+#' of each element), and `generation`.
+#' @details
+#'
+#' # Offspring models
+#'
+#' The poisson model is parametrised so that:
+#'
+#' lamda = mean_offspring * pop - initial_immune / pop
+#'
+#' The negative binomial model is parametrised as:
 #'
-#' @details This function has a couple of key differences with chain_sim:
-#'     it can only simulate one chain at a time,
-#'     it can only handle implemented offspring distributions
-#'         ("pois" and "nbinom"),
-#'     it always tracks and returns a data frame containing the entire tree,
-#'     the maximal length of chains is limited with pop instead of infinite.
+#' mu = mean_offspring * pop - initial immune / pop, and
+#' size = mu / (disp_offspring - 1). This is why disp_offspring must be greater
+#' than 1.
 #'
+#' simulate_tree_from_pop() has a couple of key different from simulate_tree():
+#'  * the maximal chain statistic is limited by `pop` instead of
+#'  `chain_stat_max` (in `simulate_tree()`),
+#'  * it can only handle implemented offspring distributions ("pois" and
+#' "nbinom").
 #' @author Flavio Finger
 #' @author James M. Azam
 #' @export
 #' @examples
-#' chain_sim_susc(pop = 100, offspring_sampler = "pois", mn_offspring = 0.5,
-#' serial_sampler = function(x) 3)
+#' # Simulate with poisson offspring
+#' simulate_tree_from_pop(pop = 100, offspring_sampler = "pois",
 simulate_tree_tracked <- function(pop = 100,
                           offspring_sampler = c("pois", "nbinom"),
                           mn_offspring,

From bf4a15bf961d352265c8ec33dab6932c46509768 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:30:45 +0100
Subject: [PATCH 267/828] Reworded an error and warning

---
 R/simulate.r | 9 +++++----
 1 file changed, 5 insertions(+), 4 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index ba0fd982..50b52d27 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -334,9 +334,10 @@ simulate_tree_tracked <- function(pop = 100,
 
   if (offspring_sampler == "pois") {
     if (!missing(disp_offspring)) {
-      warning(sprintf("%s %s",
-                      "Argument 'disp_offspring' not used for",
-                      "poisson offspring distribution."
+      warning(sprintf("%s %s %s",
+                      "'disp_offspring' is not used for",
+                      "poisson offspring distribution.",
+                      "Will be ignored."
                       )
               )
     }
@@ -347,7 +348,7 @@ simulate_tree_tracked <- function(pop = 100,
 
   } else if (offspring_sampler == "nbinom") {
     if (missing(disp_offspring)) {
-      stop(sprintf("%s", "Argument 'disp_offspring' must be specified."))
+      stop(sprintf("%s", "'disp_offspring' must be specified."))
     } else if (disp_offspring <= 1) { ## dispersion coefficient
       stop(sprintf("%s %s %s",
                    "Offspring distribution 'nbinom' requires",

From 9c9df7f4e4b8be631c87cbba6a74305b4c0f6511 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:31:22 +0100
Subject: [PATCH 268/828] Renamed the function

---
 R/simulate.r | 17 +++++++++--------
 1 file changed, 9 insertions(+), 8 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 50b52d27..51e3c336 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -322,14 +322,15 @@ simulate_vect <- function(nchains, offspring_sampler,
 #' @examples
 #' # Simulate with poisson offspring
 #' simulate_tree_from_pop(pop = 100, offspring_sampler = "pois",
-simulate_tree_tracked <- function(pop = 100,
-                          offspring_sampler = c("pois", "nbinom"),
-                          mn_offspring,
-                          disp_offspring,
-                          serial_sampler,
-                          t0 = 0,
-                          tf = Inf,
-                          initial_immune = 0) {
+#' mean_offspring = 0.5, serial_sampler = function(x) 3)
+simulate_tree_from_pop <- function(pop,
+                                   offspring_sampler = c("pois", "nbinom"),
+                                   mean_offspring,
+                                   disp_offspring,
+                                   serial_sampler,
+                                   initial_immune = 0,
+                                   t0 = 0,
+                                   tf = Inf) {
   offspring_sampler <- match.arg(offspring_sampler)
 
   if (offspring_sampler == "pois") {

From 0c0dc3beb6a1aea6ee1de7f9851f28a48e5b8cf6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:31:48 +0100
Subject: [PATCH 269/828] Added an example for negative binomial offspring

---
 R/simulate.r | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/R/simulate.r b/R/simulate.r
index 51e3c336..a39a9e80 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -323,6 +323,10 @@ simulate_vect <- function(nchains, offspring_sampler,
 #' # Simulate with poisson offspring
 #' simulate_tree_from_pop(pop = 100, offspring_sampler = "pois",
 #' mean_offspring = 0.5, serial_sampler = function(x) 3)
+#'
+#' #' # Simulate with negative binomial offspring
+#' simulate_tree_from_pop(pop = 100, offspring_sampler = "nbinom",
+#' mean_offspring = 0.5, disp_offspring = 1.1, serial_sampler = function(x) 3)
 simulate_tree_from_pop <- function(pop,
                                    offspring_sampler = c("pois", "nbinom"),
                                    mean_offspring,

From 5d5dc2a48a3f09323df735aee2eaba4ac48f0e08 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:32:20 +0100
Subject: [PATCH 270/828] Renamed some variables

---
 R/simulate.r | 47 ++++++++++++++++++++++-------------------------
 1 file changed, 22 insertions(+), 25 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index a39a9e80..00fe792e 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -365,63 +365,60 @@ simulate_tree_from_pop <- function(pop,
   }
 
   ## initializations
-  tdf <- data.frame(
-    id = 1L,
+  tree_df <- data.frame(
+    sim_id = 1L,
     ancestor = NA_integer_,
     generation = 1L,
     time = t0,
-    offspring_generated = FALSE
+    offspring_generated = FALSE #used to track simulation and dropped afterwards
   )
 
   susc <- pop - initial_immune - 1L
   t <- t0
 
-  ## continue if any unsimulated has t <= tf
+  ## continue if any unsimulated chains have t <= tf
   ## AND there is still susceptibles left
-  while (
-    any(tdf$time[!tdf$offspring_generated] <= tf) &&
-    susc > 0
-  ) {
+  while (any(tree_df$time[!tree_df$offspring_generated] <= tf) && susc > 0) {
 
     ## select from which case to generate offspring
-    t <- min(tdf$time[!tdf$offspring_generated]) # lowest unsimulated t
+    t <- min(tree_df$time[!tree_df$offspring_generated]) # lowest unsimulated t
 
     ## index of the first in df with t, extract vars
-    idx <- which(tdf$time == t & !tdf$offspring_generated)[1]
-    id_parent <- tdf$id[idx]
-    t_parent <- tdf$time[idx]
-    gen_parent <- tdf$generation[idx]
+    idx <- which(tree_df$time == t & !tree_df$offspring_generated)[1]
+    id_parent <- tree_df$sim_id[idx]
+    t_parent <- tree_df$time[idx]
+    gen_parent <- tree_df$generation[idx]
 
     ## generate it
-    current_max_id <- max(tdf$id)
-    n_offspring <- offspring_fun(1, susc)
+    current_max_id <- max(tree_df$sim_id)
+    n_offspring <- offspring_fun(1, susc, pop, mean_offspring, disp_offspring)
 
     if (n_offspring %% 1 > 0) {
       stop("Offspring distribution must return integers")
     }
 
     ## mark as done
-    tdf$offspring_generated[idx] <- TRUE
+    tree_df$offspring_generated[idx] <- TRUE
 
     ## add to df
     if (n_offspring > 0) {
-      ## draw times
-      new_times <- serial(n_offspring)
+      ## draw serial times
+      new_times <- serial_sampler(n_offspring)
 
       if (any(new_times < 0)) {
         stop("Serial interval must be >= 0.")
       }
 
       new_df <- data.frame(
-        id = current_max_id + seq_len(n_offspring),
-        time = new_times + t_parent,
+        sim_id = current_max_id + seq_len(n_offspring),
         ancestor = id_parent,
         generation = gen_parent + 1L,
+        time = new_times + t_parent,
         offspring_generated = FALSE
       )
 
-      ## add new cases to tdf
-      tdf <- rbind(tdf, new_df)
+      ## add new cases to tree_df
+      tree_df <- rbind(tree_df, new_df)
     }
 
     ## adjust susceptibles
@@ -430,11 +427,11 @@ simulate_tree_from_pop <- function(pop,
 
   ## remove cases with time > tf that could
   ## have been generated in the last generation
-  tdf <- tdf[tdf$time <= tf, ]
+  tree_df <- tree_df[tree_df$time <= tf, ]
 
   ## sort output and remove columns not needed
-  tdf <- tdf[order(tdf$time, tdf$id), ]
-  tdf$offspring_generated <- NULL
+  tree_df <- tree_df[order(tree_df$time, tree_df$sim_id), ]
+  tree_df$offspring_generated <- NULL
 
   structure(
     tree_df,

From 0354cffe446282efbc1b7005197fb1b9aff1e4c5 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:37:20 +0100
Subject: [PATCH 271/828] Linting: removed whitespaces

---
 R/helpers.R  | 2 +-
 R/simulate.r | 1 -
 2 files changed, 1 insertion(+), 2 deletions(-)

diff --git a/R/helpers.R b/R/helpers.R
index 403e98da..d835653e 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -52,7 +52,7 @@ get_offspring_func <- function(offspring_sampler) {
         b = susc
       )
     }
-  } else{
+  } else {
     stop("offspring_sampler must either be 'pois' or 'nbinom'")
   }
 }
diff --git a/R/simulate.r b/R/simulate.r
index 00fe792e..cef83fc2 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -441,4 +441,3 @@ simulate_tree_from_pop <- function(pop,
     class = c("epichains", "tbl", "data.frame")
   )
 }
-

From ef4a8703d80ccaecd5239fec3dab8a21783091ed Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 13 Jun 2023 09:52:55 +0100
Subject: [PATCH 272/828] Added methods for head() and tail()

---
 R/epichains.R | 20 ++++++++++++++++++++
 1 file changed, 20 insertions(+)

diff --git a/R/epichains.R b/R/epichains.R
index 55ce2043..6224ae39 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -172,3 +172,23 @@ validate_epichains <- function(x) {
 
   invisible(x)
 }
+
+#' `head` and `tail` methods for [`epichains`] class
+#'
+#' @param x An [`epichains`] object
+#' @param ... further arguments passed to or from other methods
+#'
+#' @return object of class `data.frame`
+#' @export
+#'
+#' @importFrom utils head
+#' @importFrom utils tail
+head.epichains <- function(x, ...) {
+  utils::head(as.data.frame(x), ...)
+}
+
+#' @rdname head.epichains
+#' @export
+tail.epichains <- function(x, ...) {
+  utils::tail(as.data.frame(x), ...)
+}

From 205de4614249c1624c514084da8dcce3ed843826 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 13 Jun 2023 10:23:43 +0100
Subject: [PATCH 273/828] Added a plotting method for epichains objects with
 chains_tree attribute

---
 R/epichains.R | 39 +++++++++++++++++++++++++++++++++++++++
 1 file changed, 39 insertions(+)

diff --git a/R/epichains.R b/R/epichains.R
index 6224ae39..bf59d969 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -192,3 +192,42 @@ head.epichains <- function(x, ...) {
 tail.epichains <- function(x, ...) {
   utils::tail(as.data.frame(x), ...)
 }
+
+#' Plot epichains tree objects
+#'
+#' @param x an [`epichains`] object with a chains_tree attribute
+#' @param ...
+#'
+#' @return
+#' @export
+#' @author James M. Azam
+#' @examples
+plot.epichains <- function(x, ...){
+  validate_epichains(x)
+
+  if (attributes(x)$chain_type != "chains_tree") {
+    stop("Object must be an epichains object with a chains_tree attribute.")
+  }
+
+  cases_per_generation <- aggregate(sim_id ~ generation, x = as.data.frame(x), FUN = NROW)
+
+  cases_per_time <- aggregate(sim_id ~ time, x = as.data.frame(x), FUN = NROW)
+
+  graphics::par(mfrow = c(1, 2), mar = c(4, 3, 3, 1), oma = c(0, 0, 0, 0))
+
+  plot(cases_per_generation$generation,
+       cases_per_generation$sim_id,
+       xlab = "Generation",
+       ylab = "Cases",
+       type = "b",
+       main = "Number of cases per generation"
+       )
+
+  plot(cases_per_time$time,
+       cases_per_time$sim_id,
+       xlab = "Time",
+       ylab = "Cases",
+       type = "b",
+       main = "Number of cases per time"
+  )
+}

From cc642f6976e132fd6ae70d84149611eea156f832 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 13 Jun 2023 11:57:20 +0100
Subject: [PATCH 274/828] Moved chain_ll and helpers here

---
 R/likelihood_estimation.R | 102 ++++++++++++++++++++++++++++++++++++++
 1 file changed, 102 insertions(+)
 create mode 100644 R/likelihood_estimation.R

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
new file mode 100644
index 00000000..3540efd5
--- /dev/null
+++ b/R/likelihood_estimation.R
@@ -0,0 +1,102 @@
+#' Likelihood for the outcome of a branching process
+#'
+#' @param x vector of sizes or lengths of transmission chains
+#' @param stat statistic given as \code{x} ("size" or "length" of chains)
+#' @param obs_prob observation probability (assumed constant)
+#' @param infinite any chains of this size/length will be treated as infinite
+#' @param exclude any sizes/lengths to exclude from the likelihood calculation
+#' @param individual if TRUE, a vector of individual log-likelihood
+#' contributions will be returned rather than the sum
+#' @param nsim_obs number of simulations if the likelihood is to be
+#'   approximated for imperfect observations
+#' @param ... parameters for the offspring distribution
+#' @return likelihood, or vector of likelihoods (if \code{obs_prob} < 1), or
+#'  a list of individual likelihood contributions (if \code{individual=TRUE})
+#' @inheritParams chain_sim
+#' @seealso pois_size_ll, nbinom_size_ll, gborel_size_ll, pois_length_ll,
+#'   geom_length_ll, offspring_ll
+#' @author Sebastian Funk
+#' @export
+#' @examples
+#' chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
+#' chain_ll(chain_sizes, "pois", "size", lambda = 0.5)
+chain_ll <- function(x, offspring, stat = c("size", "length"), obs_prob = 1,
+                     infinite = Inf, exclude = NULL, individual = FALSE,
+                     nsim_obs, ...) {
+  stat <- match.arg(stat)
+
+  ## checks
+  if (!is.character(offspring)) {
+    stop("Object passed as 'offspring' is not a character string.")
+  }
+  if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
+  if (obs_prob < 1) {
+    if (missing(nsim_obs)) {
+      stop("'nsim_obs' must be specified if 'obs_prob' is <1")
+    }
+    if (stat == "size") {
+      sample_func <- rbinom_size
+    } else if (stat == "length") {
+      sample_func <- rgen_length
+    }
+    sampled_x <-
+      replicate(nsim_obs, pmin(sample_func(length(x), x, obs_prob),
+                               infinite), simplify = FALSE)
+    size_x <- unlist(sampled_x)
+    if (!is.finite(infinite)) infinite <- max(size_x) + 1
+  } else {
+    x[x >= infinite] <- infinite
+    size_x <- x
+    sampled_x <- list(x)
+  }
+
+  ## determine for which sizes to calculate the likelihood (for true chain size)
+  if (any(size_x == infinite)) {
+    calc_sizes <- seq_len(infinite - 1)
+  } else {
+    calc_sizes <- unique(c(size_x, exclude))
+  }
+
+  ## get likelihood function as given by `offspring` and `stat``
+  likelihoods <- vector(mode = "numeric")
+  ll_func <- paste(offspring, stat, "ll", sep = "_")
+  pars <- as.list(unlist(list(...))) ## converts vectors to lists
+
+  ## calculate likelihoods
+  if (exists(ll_func, where = asNamespace("epichains"), mode = "function")) {
+    func <- get(ll_func)
+    likelihoods[calc_sizes] <- do.call(func, c(list(x = calc_sizes), pars))
+  } else {
+    likelihoods[calc_sizes] <-
+      do.call(
+        offspring_ll,
+        c(list(
+          x = calc_sizes, offspring = offspring,
+          stat = stat, infinite = infinite
+        ), pars)
+      )
+  }
+
+  ## assign probabilities to infinite outbreak sizes
+  if (any(size_x == infinite)) {
+    likelihoods[infinite] <- complementary_logprob(likelihoods)
+  }
+
+  if (!missing(exclude)) {
+    likelihoods <- likelihoods - log(-expm1(sum(likelihoods[exclude])))
+    likelihoods[exclude] <- -Inf
+
+    sampled_x <- lapply(sampled_x, function(y) {
+      y[!(y %in% exclude)]
+    })
+  }
+
+  ## assign likelihoods
+  chains_likelihood <- lapply(sampled_x, function(sx) {
+    likelihoods[sx[!(sx %in% exclude)]]
+  })
+
+  if (!individual) chains_likelihood <- vapply(chains_likelihood, sum, 0)
+
+  return(chains_likelihood)
+}

From a777fc6434c8907c185cbf3ae7c5834426b06a82 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 13 Jun 2023 11:57:36 +0100
Subject: [PATCH 275/828] Moved chain_ll from here

---
 R/likelihoods.R | 103 ------------------------------------------------
 1 file changed, 103 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index bfb9ee91..dba44ef4 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -115,106 +115,3 @@ offspring_ll <- function(x, offspring, stat, nsim_offspring = 100, ...) {
   lik[is.na(lik)] <- 0
   log(lik)
 }
-
-#' Likelihood for the outcome of a branching process
-#'
-#' @param x vector of sizes or lengths of transmission chains
-#' @param stat statistic given as \code{x} ("size" or "length" of chains)
-#' @param obs_prob observation probability (assumed constant)
-#' @param infinite any chains of this size/length will be treated as infinite
-#' @param exclude any sizes/lengths to exclude from the likelihood calculation
-#' @param individual if TRUE, a vector of individual log-likelihood
-#' contributions will be returned rather than the sum
-#' @param nsim_obs number of simulations if the likelihood is to be
-#'   approximated for imperfect observations
-#' @param ... parameters for the offspring distribution
-#' @return likelihood, or vector of likelihoods (if \code{obs_prob} < 1), or
-#'  a list of individual likelihood contributions (if \code{individual=TRUE})
-#' @inheritParams chain_sim
-#' @seealso pois_size_ll, nbinom_size_ll, gborel_size_ll, pois_length_ll,
-#'   geom_length_ll, offspring_ll
-#' @author Sebastian Funk
-#' @export
-#' @examples
-#' chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
-#' chain_ll(chain_sizes, "pois", "size", lambda = 0.5)
-chain_ll <- function(x, offspring, stat = c("size", "length"), obs_prob = 1,
-                     infinite = Inf, exclude = NULL, individual = FALSE,
-                     nsim_obs, ...) {
-  stat <- match.arg(stat)
-
-  ## checks
-  if (!is.character(offspring)) {
-    stop("Object passed as 'offspring' is not a character string.")
-  }
-  if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
-  if (obs_prob < 1) {
-    if (missing(nsim_obs)) {
-      stop("'nsim_obs' must be specified if 'obs_prob' is <1")
-    }
-    if (stat == "size") {
-      sample_func <- rbinom_size
-    } else if (stat == "length") {
-      sample_func <- rgen_length
-    }
-    sampled_x <-
-      replicate(nsim_obs, pmin(sample_func(length(x), x, obs_prob),
-                               infinite), simplify = FALSE)
-    size_x <- unlist(sampled_x)
-    if (!is.finite(infinite)) infinite <- max(size_x) + 1
-  } else {
-    x[x >= infinite] <- infinite
-    size_x <- x
-    sampled_x <- list(x)
-  }
-
-  ## determine for which sizes to calculate the likelihood (for true chain size)
-  if (any(size_x == infinite)) {
-    calc_sizes <- seq_len(infinite - 1)
-  } else {
-    calc_sizes <- unique(c(size_x, exclude))
-  }
-
-  ## get likelihood function as given by `offspring` and `stat``
-  likelihoods <- vector(mode = "numeric")
-  ll_func <- paste(offspring, stat, "ll", sep = "_")
-  pars <- as.list(unlist(list(...))) ## converts vectors to lists
-
-  ## calculate likelihoods
-  if (exists(ll_func, where = asNamespace("epichains"), mode = "function")) {
-    func <- get(ll_func)
-    likelihoods[calc_sizes] <- do.call(func, c(list(x = calc_sizes), pars))
-  } else {
-    likelihoods[calc_sizes] <-
-      do.call(
-        offspring_ll,
-        c(list(
-          x = calc_sizes, offspring = offspring,
-          stat = stat, infinite = infinite
-        ), pars)
-      )
-  }
-
-  ## assign probabilities to infinite outbreak sizes
-  if (any(size_x == infinite)) {
-    likelihoods[infinite] <- complementary_logprob(likelihoods)
-  }
-
-  if (!missing(exclude)) {
-    likelihoods <- likelihoods - log(-expm1(sum(likelihoods[exclude])))
-    likelihoods[exclude] <- -Inf
-
-    sampled_x <- lapply(sampled_x, function(y) {
-      y[!(y %in% exclude)]
-    })
-  }
-
-  ## assign likelihoods
-  chains_likelihood <- lapply(sampled_x, function(sx) {
-    likelihoods[sx[!(sx %in% exclude)]]
-  })
-
-  if (!individual) chains_likelihood <- vapply(chains_likelihood, sum, 0)
-
-  return(chains_likelihood)
-}

From 6ac8206b2a1e79bc5938ad3b0ee95a3c7335d4ff Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 13 Jun 2023 11:58:06 +0100
Subject: [PATCH 276/828] Added script for testing refactored functions

---
 R/test_refactoring.R | 39 +++++++++++++++++++++++++++++++++++++++
 1 file changed, 39 insertions(+)
 create mode 100644 R/test_refactoring.R

diff --git a/R/test_refactoring.R b/R/test_refactoring.R
new file mode 100644
index 00000000..5bf9fcf8
--- /dev/null
+++ b/R/test_refactoring.R
@@ -0,0 +1,39 @@
+
+source("./R/checks.R")
+source("./R/helpers.R")
+source("./R/epichains.R")
+source("./R/simulate.r")
+
+
+# try simulate_tree()
+chains_tree <- simulate_tree(nchains = 10,
+                                   serials_sampler = function(n) {rpois(n, 5)},
+                                   offspring_sampler = "pois",
+                                   lambda = 2,
+                                   chain_stat_max = 10
+                                   )
+
+
+chains_tree
+summary(chains_tree)
+plot(chains_tree)
+
+# try simulate_tree_from_pop()
+
+chains_tree_from_pop <- simulate_tree_from_pop(
+  pop = 100, offspring_sampler = "nbinom",
+  mean_offspring = 0.5, disp_offspring = 1.1,
+  serial_sampler = function(x) 3)
+
+chains_tree_from_pop
+summary(chains_tree_from_pop)
+plot(chains_tree_from_pop)
+
+# try chain_vec simulation
+chains_vec <- simulate_vect(nchains = 10, offspring_sampler = "pois",
+                             lambda = 2, chain_stat_max = 10
+                             )
+
+chains_vec
+summary(chains_vec)
+# plot(chains_vec) #expect error

From e172b46788703638e2568927050e386e20db3ef4 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 13 Jun 2023 11:58:28 +0100
Subject: [PATCH 277/828] Removed redundant roxygen tags

---
 R/checks.R | 18 ++++++------------
 1 file changed, 6 insertions(+), 12 deletions(-)

diff --git a/R/checks.R b/R/checks.R
index dea04268..69acb27d 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -1,11 +1,7 @@
 #' Check if offspring argument is specified as a character string
 #'
 #' @param offspring
-#'
-#' @return
-#' @export
 #' @keywords internal
-#' @examples
 check_offspring_valid <- function(offspring) {
   if (!is.character(offspring)) {
     stop(sprintf(
@@ -20,11 +16,7 @@ check_offspring_valid <- function(offspring) {
 #' Check if constructed random number generator for offspring exists
 #'
 #' @param roffspring_name
-#'
-#' @return
-#' @export
-#'
-#' @examples check_offspring_exists("rpois")
+#' @keywords internal
 check_offspring_func_valid <- function(roffspring_name) {
   if (!(exists(roffspring_name)) || !is.function(get(roffspring_name))) {
     stop("Function ", roffspring_name, " does not exist.")
@@ -36,10 +28,7 @@ check_offspring_func_valid <- function(roffspring_name) {
 #'
 #' @param serials_sampler
 #'
-#' @return
-#' @export
 #' @keywords internal
-#' @examples
 check_serial_valid <- function(serials_sampler) {
   if (!is.function(serials_sampler)) {
     stop(sprintf(
@@ -51,6 +40,11 @@ check_serial_valid <- function(serials_sampler) {
 }
 
 
+#' Check that nchains is greater than 0 and not infinite
+#'
+#' @param nchains
+#'
+#' @keywords internal
 check_nchains_valid <- function(nchains) {
   if (nchains < 1 || is.infinite(nchains)) {
     stop("`nchains` must be > 0 but less than `Inf`")

From 91bb7af53ffa4e6c8f37253f3117dc32f5279992 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:07:20 +0100
Subject: [PATCH 278/828] Broke the title into two lines

---
 DESCRIPTION | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index ff4097c8..b3ede00c 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -1,5 +1,6 @@
 Package: epichains
-Title: Analysing transmission chain statistics using branching process models
+Title: Analysing transmission chain statistics using branching process
+    models
 Version: 0.2.1
 Authors@R: c(
     person(

From 442d4f9b79e6ae95b8b5155a42e2472c32d707ba Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:07:43 +0100
Subject: [PATCH 279/828] Broke the URLs into two lines

---
 DESCRIPTION | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index b3ede00c..a007c9c6 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -37,7 +37,8 @@ Description: Provides methods to analyse and simulate the size and length
     or length of infectious disease outbreaks, as discussed in Farrington
     et al. (2003) <doi:10.1093/biostatistics/4.2.279>.
 License: MIT + file LICENSE
-URL: https://github.com/epiverse-trace/epichains, https://epiverse-trace.github.io/epichains/
+URL: https://github.com/epiverse-trace/epichains,
+    https://epiverse-trace.github.io/epichains/
 BugReports: https://github.com/epiverse-trace/epichains/issues
 Depends:
     R (>= 3.6.0)

From 76ca9b050030aa17a8ff59a1e34447cdd914e102 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:08:02 +0100
Subject: [PATCH 280/828] Added stats to imports

---
 DESCRIPTION | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/DESCRIPTION b/DESCRIPTION
index a007c9c6..0446a24b 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -42,6 +42,8 @@ URL: https://github.com/epiverse-trace/epichains,
 BugReports: https://github.com/epiverse-trace/epichains/issues
 Depends:
     R (>= 3.6.0)
+Imports: 
+    stats
 Suggests:
     bookdown,
     covr,

From dfcd3f62d5f6f5accb75e7bbf41485d43841622e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:09:07 +0100
Subject: [PATCH 281/828] Automatically moved testthat config to right position
 in DESCRIPTION

---
 DESCRIPTION | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 0446a24b..f062e458 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -57,11 +57,11 @@ Suggests:
     testthat,
     truncdist,
     usethis
-Config/testthat/edition: 3
 VignetteBuilder:
     knitr
 Remotes:
     github::epiverse-trace/epiparameter
+Config/testthat/edition: 3
 Encoding: UTF-8
 LazyData: true
 Roxygen: list(markdown = TRUE)

From 0fa33a53f96bae2dfa8d64c43969857519afe7d6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:09:33 +0100
Subject: [PATCH 282/828] Automatically tidied up the author list

---
 DESCRIPTION | 35 ++++++++---------------------------
 1 file changed, 8 insertions(+), 27 deletions(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index f062e458..157f7803 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -3,33 +3,14 @@ Title: Analysing transmission chain statistics using branching process
     models
 Version: 0.2.1
 Authors@R: c(
-    person(
-    given = "Sebastian",
-    family = "Funk",
-    email = "sebastian.funk@lshtm.ac.uk",
-    role = "aut",
-    comment = c(ORCHID = "https://orcid.org/0000-0002-2842-3406")
-    ),
-    person(
-    given = "Zhian N.",
-    family = "Kamvar",
-    email = "zkamvar@gmail.com",
-    role = "ctb",
-    comment = c(ORCHID = "https://orcid.org/0000-0003-1458-7108")
-    ),
-    person(
-    given = "Flavio",
-    family = "Finger",
-    email = "flavio.finger@epicentre.msf.org",
-    role = "aut",
-    comment = c(ORCHID = "https://orcid.org/0000-0002-8613-5170")
-    ),
-    person(
-    given = "James M.",
-    family = "Azam",
-    email = "james.azam@lshtm.ac.uk",
-    role = c("aut", "cre"),
-    comment = c(ORCHID = "https://orcid.org/0000-0001-5782-7330"))
+    person("Sebastian", "Funk", , "sebastian.funk@lshtm.ac.uk", role = "aut",
+           comment = c(ORCID = "https://orcid.org/0000-0002-2842-3406")),
+    person("Zhian N.", "Kamvar", , "zkamvar@gmail.com", role = "ctb",
+           comment = c(ORCID = "https://orcid.org/0000-0003-1458-7108")),
+    person("Flavio", "Finger", , "flavio.finger@epicentre.msf.org", role = "aut",
+           comment = c(ORCID = "https://orcid.org/0000-0002-8613-5170")),
+    person("James M.", "Azam", , "james.azam@lshtm.ac.uk", role = c("aut", "cre"),
+           comment = c(ORCID = "https://orcid.org/0000-0001-5782-7330"))
   )
 Description: Provides methods to analyse and simulate the size and length
     of branching processes with an arbitrary offspring distribution. These

From 9bdd02d0c5c809a067638289e369943e9b1e0249 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:10:20 +0100
Subject: [PATCH 283/828] Regenerated function docs

---
 man/chain_ll.Rd                   |  59 ------------
 man/chain_sim.Rd                  | 147 ------------------------------
 man/chain_sim_susc.Rd             |  64 -------------
 man/check_nchains_valid.Rd        |  15 +++
 man/check_offspring_func_valid.Rd |  17 ++++
 man/check_offspring_valid.Rd      |  18 ++++
 man/check_serial_valid.Rd         |  17 ++++
 man/dborel.Rd                     |   6 +-
 man/estimate_likelihood.Rd        |  75 +++++++++++++++
 man/format.epichains.Rd           |  19 ++++
 man/get_offspring_func.Rd         |  43 +++++++++
 man/head.epichains.Rd             |  22 +++++
 man/is_epichains.Rd               |  18 ++++
 man/offspring_ll.Rd               |  42 +++++++--
 man/plot.epichains.Rd             |  22 +++++
 man/print.epichains.Rd            |  19 ++++
 man/rborel.Rd                     |   8 +-
 man/simulate_tree.Rd              | 123 +++++++++++++++++++++++++
 man/simulate_tree_from_pop.Rd     |  87 ++++++++++++++++++
 man/simulate_vect.Rd              |  40 ++++++++
 man/summary.epichains.Rd          |  19 ++++
 man/tail.epichains.Rd             |  19 ++++
 man/update_chain_stat.Rd          |  22 +++++
 man/validate_epichains.Rd         |  22 +++++
 24 files changed, 656 insertions(+), 287 deletions(-)
 delete mode 100644 man/chain_ll.Rd
 delete mode 100644 man/chain_sim.Rd
 delete mode 100644 man/chain_sim_susc.Rd
 create mode 100644 man/check_nchains_valid.Rd
 create mode 100644 man/check_offspring_func_valid.Rd
 create mode 100644 man/check_offspring_valid.Rd
 create mode 100644 man/check_serial_valid.Rd
 create mode 100644 man/estimate_likelihood.Rd
 create mode 100644 man/format.epichains.Rd
 create mode 100644 man/get_offspring_func.Rd
 create mode 100644 man/head.epichains.Rd
 create mode 100644 man/is_epichains.Rd
 create mode 100644 man/plot.epichains.Rd
 create mode 100644 man/print.epichains.Rd
 create mode 100644 man/simulate_tree.Rd
 create mode 100644 man/simulate_tree_from_pop.Rd
 create mode 100644 man/simulate_vect.Rd
 create mode 100644 man/summary.epichains.Rd
 create mode 100644 man/tail.epichains.Rd
 create mode 100644 man/update_chain_stat.Rd
 create mode 100644 man/validate_epichains.Rd

diff --git a/man/chain_ll.Rd b/man/chain_ll.Rd
deleted file mode 100644
index cbd5e549..00000000
--- a/man/chain_ll.Rd
+++ /dev/null
@@ -1,59 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/likelihoods.R
-\name{chain_ll}
-\alias{chain_ll}
-\title{Likelihood for the outcome of a branching process}
-\usage{
-chain_ll(
-  x,
-  offspring,
-  stat = c("size", "length"),
-  obs_prob = 1,
-  infinite = Inf,
-  exclude = NULL,
-  individual = FALSE,
-  nsim_obs,
-  ...
-)
-}
-\arguments{
-\item{x}{vector of sizes or lengths of transmission chains}
-
-\item{offspring}{Offspring distribution: a character string corresponding to
-the R distribution function (e.g., "pois" for Poisson, where
-\code{\link{rpois}} is the R function to generate Poisson random numbers)}
-
-\item{stat}{statistic given as \code{x} ("size" or "length" of chains)}
-
-\item{obs_prob}{observation probability (assumed constant)}
-
-\item{infinite}{any chains of this size/length will be treated as infinite}
-
-\item{exclude}{any sizes/lengths to exclude from the likelihood calculation}
-
-\item{individual}{if TRUE, a vector of individual log-likelihood
-contributions will be returned rather than the sum}
-
-\item{nsim_obs}{number of simulations if the likelihood is to be
-approximated for imperfect observations}
-
-\item{...}{parameters for the offspring distribution}
-}
-\value{
-likelihood, or vector of likelihoods (if \code{obs_prob} < 1), or
-a list of individual likelihood contributions (if \code{individual=TRUE})
-}
-\description{
-Likelihood for the outcome of a branching process
-}
-\examples{
-chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
-chain_ll(chain_sizes, "pois", "size", lambda = 0.5)
-}
-\seealso{
-pois_size_ll, nbinom_size_ll, gborel_size_ll, pois_length_ll,
-geom_length_ll, offspring_ll
-}
-\author{
-Sebastian Funk
-}
diff --git a/man/chain_sim.Rd b/man/chain_sim.Rd
deleted file mode 100644
index ae3ae0c8..00000000
--- a/man/chain_sim.Rd
+++ /dev/null
@@ -1,147 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/simulate.r
-\name{chain_sim}
-\alias{chain_sim}
-\title{Simulate transmission chains using a branching process}
-\usage{
-chain_sim(
-  n,
-  offspring,
-  stat = c("size", "length"),
-  infinite = Inf,
-  tree = FALSE,
-  serial,
-  t0 = 0,
-  tf = Inf,
-  ...
-)
-}
-\arguments{
-\item{n}{Number of simulations to run.}
-
-\item{offspring}{Offspring distribution: a character string corresponding to
-the R distribution function (e.g., "pois" for Poisson, where
-\code{\link{rpois}} is the R function to generate Poisson random numbers)}
-
-\item{stat}{String; Statistic to calculate. Can be one of:
-\itemize{
-\item "size": the total number of offspring.
-\item "length": the total number of ancestors.
-}}
-
-\item{infinite}{A size or length above which the simulation results
-should be set to \code{Inf}. Defaults to \code{Inf}, resulting in no results
-ever set to \code{Inf}}
-
-\item{tree}{Logical. Should the transmission tree be returned? Defaults
-to \code{FALSE}.}
-
-\item{serial}{The serial interval generator function; the name of a
-user-defined named or anonymous function with only one argument \code{n},
-representing the number of serial intervals to generate.}
-
-\item{t0}{Start time (if serial interval is given); either a single value
-or a vector of length \code{n} (number of simulations) with initial times.
-Defaults to 0.}
-
-\item{tf}{End time (if serial interval is given).}
-
-\item{...}{Parameters of the offspring distribution as required by R.}
-}
-\value{
-Either:
-\itemize{
-\item{A vector of sizes/lengths (if \code{tree == FALSE} OR serial
-interval function not specified, since that implies
-\code{tree == FALSE})}, or
-\item {a data frame with
-columns \code{n} (simulation ID), \code{time} (if the serial interval is given) and
-(if \code{tree == TRUE}), \code{id} (a unique ID within each simulation for
-each individual element of the chain), \code{ancestor} (the ID of the
-ancestor of each element), and \code{generation}.}
-}
-}
-\description{
-\code{chain_sim()} is a stochastic simulator for generating
-transmission chain data with key inputs such as the offspring distribution
-and serial interval distribution.
-}
-\details{
-\code{chain_sim()} either returns a vector or a data.frame. The output is
-either a vector if \code{serial} is not provided, which automatically sets
-\code{tree = FALSE}, or a \code{data.frame}, which means that \code{serial} was
-provided as a function. When \code{serial} is provided, it means
-\code{tree = TRUE} automatically. However, setting \code{tree = TRUE}
-would require providing a function for \code{serial}.
-}
-\section{The serial interval (\code{serial}):}{
-\subsection{Assumptions/disambiguation}{
-
-In epidemiology, the generation interval is the duration between successive
-infectious events in a chain of transmission. Similarly, the serial
-interval is the duration between observed symptom onset times between
-successive cases in a transmission chain. The generation interval is
-often hard to observe because exact times of infection are hard to
-measure hence, the serial interval is often used instead. Here, we
-use the serial interval to represent what would normally be called the
-generation interval, that is, the time between successive cases.
-}
-
-\subsection{Specifying \code{serial} in \code{chain_sim()}}{
-
-\code{serial} must be specified as a named or
-\href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function} # nolint
-with one argument.
-
-If \code{serial} is specified, \code{chain_sim()} returns times of
-infection as a column in the output. Moreover, specifying a function
-for \code{serial} implies \code{tree = TRUE} and a tree of
-infectors (\code{ancestor}) and infectees (\code{id}) will be generated in the output.
-
-For example, assuming we want to specify the serial interval
-generator as a random log-normally distributed variable with
-\code{meanlog = 0.58} and \code{sdlog = 1.58}, we could define a named function,
-let's call it "serial_interval", with only one argument representing the
-number of serial intervals to sample:
-\code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
-and assign the name of the function to serial in \code{chain_sim()} like so
-\code{chain_sim(..., serial = serial_interval)},
-where \code{...} are the other arguments to \code{chain_sim()}. Alternatively, we
-could assign an anonymous function to serial in the \code{chain_sim()} call
-like so \code{chain_sim(..., serial = function(n){rlnorm(n, 0.58, 1.38)})},
-where \code{...} are the other arguments to \code{chain_sim()}.
-}
-}
-
-\examples{
-# Specifying no `serial` and `tree == FALSE` (default) returns a vector
-set.seed(123)
-chain_sim(n = 5, offspring = "pois", stat = "size", lambda = 0.5,
-tree = FALSE)
-
-# Specifying `serial` without specifying `tree` will set `tree = TRUE`
-# internally.
-
-# We'll first define the serial function
-set.seed(123)
-serial_interval <- function(n) {
-  rlnorm(n, meanlog = 0.58, sdlog = 1.58)
-}
-chain_sim(
-  n = 5, offspring = "pois", lambda = 0.5, stat = "length",
-  infinite = 100,
-  serial = serial_interval
-)
-
-# Specifying `serial` and `tree = FALSE` will throw an error
-set.seed(123)
-\dontrun{
-try(chain_sim(
-  n = 10, serial = function(x) 3, offspring = "pois", lambda = 2,
-  infinite = 10, tree = FALSE
-))
-}
-}
-\author{
-Sebastian Funk, James M. Azam
-}
diff --git a/man/chain_sim_susc.Rd b/man/chain_sim_susc.Rd
deleted file mode 100644
index 3d1c7832..00000000
--- a/man/chain_sim_susc.Rd
+++ /dev/null
@@ -1,64 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/simulate_susceptibles.R
-\name{chain_sim_susc}
-\alias{chain_sim_susc}
-\title{Simulate a single chain using a branching process while accounting
-for depletion of susceptibles.}
-\usage{
-chain_sim_susc(
-  offspring = c("pois", "nbinom"),
-  mn_offspring,
-  disp_offspring,
-  serial,
-  t0 = 0,
-  tf = Inf,
-  pop,
-  initial_immune = 0
-)
-}
-\arguments{
-\item{offspring}{offspring distribution: a character string corresponding to
-the R distribution function. Currently only "pois" & "nbinom" are
-supported. Internally truncated distributions are used to avoid infecting
-more people than susceptibles available.}
-
-\item{mn_offspring}{the average number of secondary cases for each case}
-
-\item{disp_offspring}{the dispersion coefficient (var/mean) of the number of
-secondary cases. Ignored if offspring == "pois". Must be > 1.}
-
-\item{serial}{the serial interval. A function that takes one parameter
-(\code{n}), the number of serial intervals to randomly sample.
-Value must be >= 0.}
-
-\item{t0}{start time}
-
-\item{tf}{end time}
-
-\item{pop}{the population}
-
-\item{initial_immune}{the number of initial immunes in the population}
-}
-\value{
-a data frame with columns \code{time}, \code{id} (a unique ID for each
-individual element of the chain), \code{ancestor} (the ID of the ancestor
-of each element), and \code{generation}.
-}
-\description{
-Simulate a single chain using a branching process while accounting
-for depletion of susceptibles.
-}
-\details{
-This function has a couple of key differences with chain_sim:
-it can only simulate one chain at a time,
-it can only handle implemented offspring distributions
-("pois" and "nbinom"),
-it always tracks and returns a data frame containing the entire tree,
-the maximal length of chains is limited with pop instead of infinite.
-}
-\examples{
-chain_sim_susc("pois", mn_offspring = 0.5, serial = function(x) 3, pop = 100)
-}
-\author{
-Flavio Finger
-}
diff --git a/man/check_nchains_valid.Rd b/man/check_nchains_valid.Rd
new file mode 100644
index 00000000..6e565502
--- /dev/null
+++ b/man/check_nchains_valid.Rd
@@ -0,0 +1,15 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/checks.R
+\name{check_nchains_valid}
+\alias{check_nchains_valid}
+\title{Check that nchains is greater than 0 and not infinite}
+\usage{
+check_nchains_valid(nchains)
+}
+\arguments{
+\item{nchains}{Number of chains to simulate.}
+}
+\description{
+Check that nchains is greater than 0 and not infinite
+}
+\keyword{internal}
diff --git a/man/check_offspring_func_valid.Rd b/man/check_offspring_func_valid.Rd
new file mode 100644
index 00000000..b0e5a860
--- /dev/null
+++ b/man/check_offspring_func_valid.Rd
@@ -0,0 +1,17 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/checks.R
+\name{check_offspring_func_valid}
+\alias{check_offspring_func_valid}
+\title{Check if constructed random number generator for offspring exists}
+\usage{
+check_offspring_func_valid(roffspring_name)
+}
+\arguments{
+\item{roffspring_name}{Constructed random offspring sampler: a character
+string corresponding to the R distribution function (e.g., "rpois" for
+Poisson.}
+}
+\description{
+Check if constructed random number generator for offspring exists
+}
+\keyword{internal}
diff --git a/man/check_offspring_valid.Rd b/man/check_offspring_valid.Rd
new file mode 100644
index 00000000..83359dce
--- /dev/null
+++ b/man/check_offspring_valid.Rd
@@ -0,0 +1,18 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/checks.R
+\name{check_offspring_valid}
+\alias{check_offspring_valid}
+\title{Check if offspring argument is specified as a character string}
+\usage{
+check_offspring_valid(offspring_sampler)
+}
+\arguments{
+\item{offspring_sampler}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers).}
+}
+\description{
+Check if offspring argument is specified as a character string
+}
+\keyword{internal}
diff --git a/man/check_serial_valid.Rd b/man/check_serial_valid.Rd
new file mode 100644
index 00000000..7a33c71f
--- /dev/null
+++ b/man/check_serial_valid.Rd
@@ -0,0 +1,17 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/checks.R
+\name{check_serial_valid}
+\alias{check_serial_valid}
+\title{Check if the serials_sampler argument is specified as a function}
+\usage{
+check_serial_valid(serials_sampler)
+}
+\arguments{
+\item{serials_sampler}{The serial interval generator function; the name of a
+user-defined named or anonymous function with only one argument \code{n},
+representing the number of serial intervals to generate.}
+}
+\description{
+Check if the serials_sampler argument is specified as a function
+}
+\keyword{internal}
diff --git a/man/dborel.Rd b/man/dborel.Rd
index 14d269d0..52c4db77 100644
--- a/man/dborel.Rd
+++ b/man/dborel.Rd
@@ -7,14 +7,14 @@
 dborel(x, mu, log = FALSE)
 }
 \arguments{
-\item{x}{vector of integers.}
+\item{x}{Vector of integers.}
 
 \item{mu}{mu parameter.}
 
-\item{log}{logical; if TRUE, probabilities p are given as log(p).}
+\item{log}{Logical; if TRUE, probabilities p are given as log(p).}
 }
 \value{
-probability mass.
+Probability mass.
 }
 \description{
 Density of the Borel distribution
diff --git a/man/estimate_likelihood.Rd b/man/estimate_likelihood.Rd
new file mode 100644
index 00000000..c0dc70e9
--- /dev/null
+++ b/man/estimate_likelihood.Rd
@@ -0,0 +1,75 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/likelihood_estimation.R
+\name{estimate_likelihood}
+\alias{estimate_likelihood}
+\title{Estimate the (log) likelihood for observed branching processes}
+\usage{
+estimate_likelihood(
+  chains_observed,
+  chain_statistic = c("size", "length"),
+  offspring_sampler,
+  nsim_obs,
+  log_trans = TRUE,
+  obs_prob = 1,
+  chain_stat_max = Inf,
+  exclude = NULL,
+  individual = FALSE,
+  ...
+)
+}
+\arguments{
+\item{chains_observed}{Vector of sizes/lengths of transmission chains.}
+
+\item{chain_statistic}{Statistic given as \code{chains_observed}
+("size" or "length" of chains).}
+
+\item{offspring_sampler}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers).}
+
+\item{nsim_obs}{Number of simulations if the likelihood is to be
+approximated for imperfect observations.}
+
+\item{log_trans}{Logical; Should the results be log-transformed? (Defaults
+to TRUE).}
+
+\item{obs_prob}{Observation probability (assumed constant)}
+
+\item{chain_stat_max}{Any chains of this size/length will be
+treated as infinite.}
+
+\item{exclude}{A vector of indices of the sizes/lengths to exclude from the
+likelihood calculation.}
+
+\item{individual}{If TRUE, a vector of individual (log)likelihood
+contributions will be returned rather than the sum.}
+
+\item{...}{Parameters for the offspring distribution.}
+}
+\value{
+\itemize{
+\item A log-likelihood, if \code{log_trans = TRUE} (the default)
+\item A vector of log-likelihoods, if \code{log_trans = TRUE} (the default) and
+\code{obs_prob < 1}, or
+\item A list of individual log-likelihood contributions, if
+\code{log_trans = TRUE} (the default) and \code{individual = TRUE}.
+else raw likelihoods, or vector of likelihoods
+}
+}
+\description{
+Estimate the (log) likelihood for observed branching processes
+}
+\examples{
+# example of observed chain sizes
+chain_sizes <- c(1, 1, 4, 7)
+estimate_likelihood(chains_observed = chain_sizes, chain_statistic = "size",
+ offspring_sampler = "pois", nsim_obs = 100, lambda = 0.5)
+}
+\seealso{
+offspring_ll, pois_size_ll, nbinom_size_ll, gborel_size_ll,
+pois_length_ll, geom_length_ll.
+}
+\author{
+Sebastian Funk
+}
diff --git a/man/format.epichains.Rd b/man/format.epichains.Rd
new file mode 100644
index 00000000..cb0bb0f1
--- /dev/null
+++ b/man/format.epichains.Rd
@@ -0,0 +1,19 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{format.epichains}
+\alias{format.epichains}
+\title{Format method for epichains class}
+\usage{
+\method{format}{epichains}(x, ...)
+}
+\arguments{
+\item{x}{epichains object}
+
+\item{...}{further arguments passed to or from other methods}
+}
+\value{
+Invisibly returns an \code{\link{epichains}}. Called for printing side-effects.
+}
+\description{
+Format method for epichains class
+}
diff --git a/man/get_offspring_func.Rd b/man/get_offspring_func.Rd
new file mode 100644
index 00000000..10c61254
--- /dev/null
+++ b/man/get_offspring_func.Rd
@@ -0,0 +1,43 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/helpers.R
+\name{get_offspring_func}
+\alias{get_offspring_func}
+\title{Get offspring sampling function}
+\usage{
+get_offspring_func(
+  offspring_sampler,
+  n,
+  susc,
+  pop,
+  mean_offspring,
+  disp_offspring = NULL
+)
+}
+\arguments{
+\item{offspring_sampler}{Offspring distribution sampler: a character string
+corresponding to the R distribution function. Currently only "pois" &
+"nbinom" are supported. Internally truncated distributions are used to
+avoid infecting more people than susceptibles available.}
+
+\item{n}{Number of items to sample}
+
+\item{susc}{Susceptible population size (calculated
+inside \code{\link{simulate_tree_from_pop}}  as pop - initial_immune)}
+
+\item{pop}{The susceptible population.}
+
+\item{mean_offspring}{The average number of secondary cases for each case.
+Same as R0.}
+
+\item{disp_offspring}{The dispersion parameter of the number of
+secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
+avoid division by 0 when calculating the size. See details and
+\code{?rnbinom} for details on the parameterisation in Ecology.}
+}
+\value{
+An offspring sampling function
+}
+\description{
+Get offspring sampling function
+}
+\keyword{internal}
diff --git a/man/head.epichains.Rd b/man/head.epichains.Rd
new file mode 100644
index 00000000..3ee70b58
--- /dev/null
+++ b/man/head.epichains.Rd
@@ -0,0 +1,22 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{head.epichains}
+\alias{head.epichains}
+\title{\code{head} method for \code{\link{epichains}} class}
+\usage{
+\method{head}{epichains}(x, ...)
+}
+\arguments{
+\item{x}{An \code{\link{epichains}} object}
+
+\item{...}{further arguments passed to or from other methods}
+}
+\value{
+object of class \code{data.frame}
+}
+\description{
+\code{head} method for \code{\link{epichains}} class
+}
+\author{
+James M. Azam
+}
diff --git a/man/is_epichains.Rd b/man/is_epichains.Rd
new file mode 100644
index 00000000..dd365904
--- /dev/null
+++ b/man/is_epichains.Rd
@@ -0,0 +1,18 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{is_epichains}
+\alias{is_epichains}
+\title{Checks whether the object is an \code{epichains}}
+\usage{
+is_epichains(x)
+}
+\arguments{
+\item{x}{An R object}
+}
+\value{
+logical, \code{TRUE} if the object is an \code{epichains} and \code{FALSE}
+otherwise
+}
+\description{
+Checks whether the object is an \code{epichains}
+}
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index 427eb61a..1280c21a 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -4,24 +4,37 @@
 \alias{offspring_ll}
 \title{Likelihood of the length of chains with generic offspring distribution}
 \usage{
-offspring_ll(x, offspring, stat, nsim_offspring = 100, ...)
+offspring_ll(
+  chains_observed,
+  offspring_sampler,
+  chain_statistic,
+  nsim_offspring = 100,
+  log_trans = TRUE,
+  ...
+)
 }
 \arguments{
-\item{x}{vector of sizes}
+\item{chains_observed}{Vector of sizes/lengths}
 
-\item{offspring}{Offspring distribution: a character string corresponding to
-the R distribution function (e.g., "pois" for Poisson, where
-\code{\link{rpois}} is the R function to generate Poisson random numbers)}
+\item{offspring_sampler}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers).}
 
-\item{stat}{statistic given as \code{x} ("size" or "length" of chains)}
+\item{chain_statistic}{Statistic given as \code{chains_observed}
+("size" or "length" of chains).}
 
-\item{nsim_offspring}{number of simulations of the offspring distribution
-for approximation the size/length distribution}
+\item{nsim_offspring}{Number of simulations of the offspring distribution
+for approximating the chain_statistic (size/length) distribution}
 
-\item{...}{any parameters to pass to \code{\link{chain_sim}}}
+\item{log_trans}{Logical; Should the results be log-transformed? (Defaults
+to TRUE).}
+
+\item{...}{any parameters to pass to \code{\link{simulate_tree}}}
 }
 \value{
-log-likelihood values
+If \code{log_trans = TRUE} (the default), log-likelihood values,
+else raw likelihoods
 }
 \description{
 The likelihoods are calculated with a crude approximation using simulated
@@ -31,4 +44,13 @@ cumulative distribution function (ecdf).
 \author{
 Sebastian Funk
 }
+\keyword{Compute}
+\keyword{Cumulative}
+\keyword{Distribution}
+\keyword{Function}
+\keyword{chains}
+\keyword{empirical}
 \keyword{internal}
+\keyword{of}
+\keyword{simulated}
+\keyword{the}
diff --git a/man/plot.epichains.Rd b/man/plot.epichains.Rd
new file mode 100644
index 00000000..7fa17943
--- /dev/null
+++ b/man/plot.epichains.Rd
@@ -0,0 +1,22 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{plot.epichains}
+\alias{plot.epichains}
+\title{Plot epichains tree objects}
+\usage{
+\method{plot}{epichains}(x, ...)
+}
+\arguments{
+\item{x}{An \code{\link{epichains}} object with a chains_tree attribute}
+
+\item{...}{Other arguments passed to plot}
+}
+\value{
+A plot of cases over time and generation
+}
+\description{
+Plot epichains tree objects
+}
+\author{
+James M. Azam
+}
diff --git a/man/print.epichains.Rd b/man/print.epichains.Rd
new file mode 100644
index 00000000..22c24de2
--- /dev/null
+++ b/man/print.epichains.Rd
@@ -0,0 +1,19 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{print.epichains}
+\alias{print.epichains}
+\title{Print an \code{\link{epichains}} object}
+\usage{
+\method{print}{epichains}(x, ...)
+}
+\arguments{
+\item{x}{An \code{\link{epichains}} object.}
+
+\item{...}{Other parameters passed to \code{\link[=print]{print()}}.}
+}
+\value{
+Invisibly returns an \code{\link{epichains}}. Called for side-effects.
+}
+\description{
+Print an \code{\link{epichains}} object
+}
diff --git a/man/rborel.Rd b/man/rborel.Rd
index e32484ed..70cd22fb 100644
--- a/man/rborel.Rd
+++ b/man/rborel.Rd
@@ -7,15 +7,15 @@
 rborel(n, mu, infinite = Inf)
 }
 \arguments{
-\item{n}{number of random variates to generate.}
+\item{n}{Number of random variates to generate.}
 
 \item{mu}{mu parameter.}
 
-\item{infinite}{any number to treat as infinite; simulations will be stopped
-if this number is reached}
+\item{infinite}{Any number to treat as infinite; simulations will be
+stopped if this number is reached}
 }
 \value{
-vector of random numbers
+Vector of random numbers
 }
 \description{
 Random numbers are generated by simulating from a Poisson branching process
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
new file mode 100644
index 00000000..a29a2748
--- /dev/null
+++ b/man/simulate_tree.Rd
@@ -0,0 +1,123 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/simulate.r
+\name{simulate_tree}
+\alias{simulate_tree}
+\title{Simulate a tree of infections with a serial and offspring distributions}
+\usage{
+simulate_tree(
+  nchains,
+  offspring_sampler,
+  chain_statistic = c("size", "length"),
+  chain_stat_max = Inf,
+  serials_sampler,
+  t0 = 0,
+  tf = Inf,
+  ...
+)
+}
+\arguments{
+\item{nchains}{Number of chains to simulate.}
+
+\item{offspring_sampler}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers)}
+
+\item{chain_statistic}{String; Statistic to calculate. Can be one of:
+\itemize{
+\item "size": the total number of offspring.
+\item "length": the total number of ancestors.
+}}
+
+\item{chain_stat_max}{A cut off for the chain statistic (size/length) being
+computed. Results above the specified value, are set to this value.
+Defaults to \code{Inf}.}
+
+\item{serials_sampler}{The serial interval generator function; the name of a
+user-defined named or anonymous function with only one argument \code{n},
+representing the number of serial intervals to generate.}
+
+\item{t0}{Start time (if serial interval is given); either a single value
+or a vector of same length as \code{nchains} (number of simulations) with
+initial times. Defaults to 0.}
+
+\item{tf}{End time (if serial interval is given).}
+
+\item{...}{Parameters of the offspring distribution as required by R.}
+}
+\value{
+an \code{epichains} object, which is basically a \code{data.frame} with
+columns \code{chain_id} (chain ID), \code{sim_id} (a unique ID within each simulation
+for each individual element of the chain), \code{ancestor}
+(the ID of the ancestor of each element), \code{generation}, and
+\code{time} (of infection)
+}
+\description{
+Simulate a tree of infections with a serial and offspring distributions
+}
+\details{
+\code{simulate_tree()} simulates a branching process of the form:
+WIP
+}
+\section{The serial interval (\code{serials_sampler}):}{
+\subsection{Assumptions/disambiguation}{
+
+In epidemiology, the generation interval is the duration between successive
+infectious events in a chain of transmission. Similarly, the serial
+interval is the duration between observed symptom onset times between
+successive cases in a transmission chain. The generation interval is
+often hard to observe because exact times of infection are hard to
+measure hence, the serial interval is often used instead . Here, we
+use the serial interval to represent what would normally be called the
+generation interval, that is, the time between successive cases.
+
+See References below for some literature on the subject.
+}
+
+\subsection{Specifying \code{serials_sampler} in \code{simulate_tree()}}{
+
+\code{serials_sampler} must be specified as a named or
+\href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function} # nolint
+with one argument.
+
+For example, assuming we want to specify the serial interval
+generator as a random log-normally distributed variable with
+\code{meanlog = 0.58} and \code{sdlog = 1.58}, we could define a named function,
+let's call it "serial_interval", with only one argument representing the
+number of serial intervals to sample:
+\code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
+and assign the name of the function to \code{serials_sampler} in
+\code{simulate_tree()} like so
+\code{simulate_tree(..., serials_sampler = serial_interval)},
+where \code{...} are the other arguments to \code{simulate_tree()}.
+
+Alternatively, we could assign an anonymous function to \code{serials_sampler}
+in the \code{simulate_tree()} call like so
+\code{simulate_tree(..., serials_sampler = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
+where \code{...} are the other arguments to \code{simulate_tree()}.
+}
+}
+
+\examples{
+set.seed(123)
+chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,
+offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
+chains
+}
+\references{
+Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
+between serial interval, infectiousness profile and generation time.
+J R Soc Interface. 2021 Jan;18(174):20200756.
+doi: 10.1098/rsif.2020.0756. Epub 2021 Jan 6.
+PMID: 33402022; PMCID: PMC7879757.
+
+Fine PE. The interval between successive cases of an
+infectious disease. Am J Epidemiol. 2003 Dec 1;158(11):1039-47.
+doi: 10.1093/aje/kwg251. PMID: 14630599.
+}
+\seealso{
+\code{\link[=simulate_vect]{simulate_vect()}} for simulating transmission chains as a vector
+}
+\author{
+James M. Azam, Sebastian Funk
+}
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
new file mode 100644
index 00000000..d2409fa4
--- /dev/null
+++ b/man/simulate_tree_from_pop.Rd
@@ -0,0 +1,87 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/simulate.r
+\name{simulate_tree_from_pop}
+\alias{simulate_tree_from_pop}
+\title{Simulate a tree of infections from an initial susceptible population
+with initial immunity}
+\usage{
+simulate_tree_from_pop(
+  pop,
+  offspring_sampler = c("pois", "nbinom"),
+  mean_offspring,
+  disp_offspring,
+  serial_sampler,
+  initial_immune = 0,
+  t0 = 0,
+  tf = Inf
+)
+}
+\arguments{
+\item{pop}{The susceptible population.}
+
+\item{offspring_sampler}{Offspring distribution sampler: a character string
+corresponding to the R distribution function. Currently only "pois" &
+"nbinom" are supported. Internally truncated distributions are used to
+avoid infecting more people than susceptibles available.}
+
+\item{mean_offspring}{The average number of secondary cases for each case.
+Same as R0.}
+
+\item{disp_offspring}{The dispersion parameter of the number of
+secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
+avoid division by 0 when calculating the size. See details and
+\code{?rnbinom} for details on the parameterisation in Ecology.}
+
+\item{serial_sampler}{The serial interval. A function that takes one
+parameter (\code{n}), the number of serial intervals to randomly sample. Value
+must be >= 0.}
+
+\item{initial_immune}{The number of initial immunes in the population.}
+
+\item{t0}{Start time; Defaults to 0.}
+
+\item{tf}{End time; Defaults to \code{Inf}.}
+}
+\value{
+a data frame with columns \code{time}, \code{id} (a unique ID for each
+individual element of the chain), \code{ancestor} (the ID of the ancestor
+of each element), and \code{generation}.
+}
+\description{
+Simulate a tree of infections from an initial susceptible population
+with initial immunity
+}
+\section{Offspring models}{
+The poisson model is parametrised so that:
+
+lamda = mean_offspring * pop - initial_immune / pop
+
+The negative binomial model is parametrised as:
+
+mu = mean_offspring * pop - initial immune / pop, and
+size = mu / (disp_offspring - 1). This is why disp_offspring must be greater
+than 1.
+
+simulate_tree_from_pop() has a couple of key different from simulate_tree():
+\itemize{
+\item the maximal chain statistic is limited by \code{pop} instead of
+\code{chain_stat_max} (in \code{simulate_tree()}),
+\item it can only handle implemented offspring distributions ("pois" and
+"nbinom").
+}
+}
+
+\examples{
+# Simulate with poisson offspring
+simulate_tree_from_pop(pop = 100, offspring_sampler = "pois",
+mean_offspring = 0.5, serial_sampler = function(x) 3)
+
+# Simulate with negative binomial offspring
+simulate_tree_from_pop(pop = 100, offspring_sampler = "nbinom",
+mean_offspring = 0.5, disp_offspring = 1.1, serial_sampler = function(x) 3)
+}
+\author{
+Flavio Finger
+
+James M. Azam
+}
diff --git a/man/simulate_vect.Rd b/man/simulate_vect.Rd
new file mode 100644
index 00000000..cd7fafc7
--- /dev/null
+++ b/man/simulate_vect.Rd
@@ -0,0 +1,40 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/simulate.r
+\name{simulate_vect}
+\alias{simulate_vect}
+\title{Simulate transmission chains without tree (as a vector)}
+\usage{
+simulate_vect(
+  nchains,
+  offspring_sampler,
+  chain_statistic = c("size", "length"),
+  chain_stat_max = Inf,
+  ...
+)
+}
+\arguments{
+\item{nchains}{Number of chains to simulate.}
+
+\item{offspring_sampler}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers)}
+
+\item{chain_statistic}{String; Statistic to calculate. Can be one of:
+\itemize{
+\item "size": the total number of offspring.
+\item "length": the total number of ancestors.
+}}
+
+\item{chain_stat_max}{A cut off for the chain statistic (size/length) being
+computed. Results above the specified value, are set to \code{Inf}.}
+
+\item{...}{Parameters of the offspring distribution as required by R.}
+}
+\description{
+Simulate transmission chains without tree (as a vector)
+}
+\examples{
+simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,
+chain_stat_max = 10)
+}
diff --git a/man/summary.epichains.Rd b/man/summary.epichains.Rd
new file mode 100644
index 00000000..f6b81976
--- /dev/null
+++ b/man/summary.epichains.Rd
@@ -0,0 +1,19 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{summary.epichains}
+\alias{summary.epichains}
+\title{Summary method for epichains class}
+\usage{
+\method{summary}{epichains}(object, ...)
+}
+\arguments{
+\item{object}{An \code{\link{epichains}} object}
+
+\item{...}{further arguments passed to or from other methods}
+}
+\value{
+data frame of information
+}
+\description{
+Summary method for epichains class
+}
diff --git a/man/tail.epichains.Rd b/man/tail.epichains.Rd
new file mode 100644
index 00000000..d63fc88e
--- /dev/null
+++ b/man/tail.epichains.Rd
@@ -0,0 +1,19 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{tail.epichains}
+\alias{tail.epichains}
+\title{\code{tail} method for \code{\link{epichains}} class}
+\usage{
+\method{tail}{epichains}(x, ...)
+}
+\arguments{
+\item{x}{An \code{\link{epichains}} object}
+
+\item{...}{further arguments passed to or from other methods}
+}
+\description{
+\code{tail} method for \code{\link{epichains}} class
+}
+\author{
+James M. Azam
+}
diff --git a/man/update_chain_stat.Rd b/man/update_chain_stat.Rd
new file mode 100644
index 00000000..886c723b
--- /dev/null
+++ b/man/update_chain_stat.Rd
@@ -0,0 +1,22 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/helpers.R
+\name{update_chain_stat}
+\alias{update_chain_stat}
+\title{Determine and update the chain statistic being tracked}
+\usage{
+update_chain_stat(stat_type, stat_latest, n_offspring)
+}
+\arguments{
+\item{stat_type}{Chain statistic (size/length) to update.}
+
+\item{stat_latest}{The latest chain statistic vector to be updated.}
+
+\item{n_offspring}{A vector of offspring per chain.}
+}
+\value{
+A vector of chain statistics (size/length).
+}
+\description{
+Determine and update the chain statistic being tracked
+}
+\keyword{internal}
diff --git a/man/validate_epichains.Rd b/man/validate_epichains.Rd
new file mode 100644
index 00000000..03953a59
--- /dev/null
+++ b/man/validate_epichains.Rd
@@ -0,0 +1,22 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{validate_epichains}
+\alias{validate_epichains}
+\title{\code{epichains} class validator}
+\usage{
+validate_epichains(x)
+}
+\arguments{
+\item{x}{An \code{epichains} object}
+}
+\value{
+Checks if an object is of class \code{epichains} and if so
+checks that it's in the right format as a "data.frame" or vector.
+}
+\description{
+\code{epichains} class validator
+}
+\author{
+James M. Azam
+}
+\keyword{internal}

From 0b4e2545f6c3484ae00c83976fda6e58426b2cd0 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:10:45 +0100
Subject: [PATCH 284/828] Regenerated NAMESPACE

---
 NAMESPACE | 18 +++++++++++++++---
 1 file changed, 15 insertions(+), 3 deletions(-)

diff --git a/NAMESPACE b/NAMESPACE
index 1b73b8d9..61a29bb9 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -1,6 +1,18 @@
 # Generated by roxygen2: do not edit by hand
 
-export(chain_ll)
-export(chain_sim)
-export(chain_sim_susc)
+S3method(format,epichains)
+S3method(head,epichains)
+S3method(plot,epichains)
+S3method(print,epichains)
+S3method(summary,epichains)
+S3method(tail,epichains)
+export(dborel)
+export(estimate_likelihood)
+export(is_epichains)
+export(rborel)
 export(rnbinom_mean_disp)
+export(simulate_tree)
+export(simulate_tree_from_pop)
+export(simulate_vect)
+importFrom(utils,head)
+importFrom(utils,tail)

From db95cad5d8fc75a95f3e02f067dfac01f53620f7 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:10:58 +0100
Subject: [PATCH 285/828] Deleted old script

---
 R/simulate_susceptibles.R | 163 --------------------------------------
 1 file changed, 163 deletions(-)
 delete mode 100644 R/simulate_susceptibles.R

diff --git a/R/simulate_susceptibles.R b/R/simulate_susceptibles.R
deleted file mode 100644
index 20fa9f83..00000000
--- a/R/simulate_susceptibles.R
+++ /dev/null
@@ -1,163 +0,0 @@
-#' Simulate a single chain using a branching process while accounting
-#' for depletion of susceptibles.
-#'
-#' @param offspring offspring distribution: a character string corresponding to
-#'   the R distribution function. Currently only "pois" & "nbinom" are
-#'   supported. Internally truncated distributions are used to avoid infecting
-#'   more people than susceptibles available.
-#' @param mn_offspring the average number of secondary cases for each case
-#' @param disp_offspring the dispersion coefficient (var/mean) of the number of
-#'      secondary cases. Ignored if offspring == "pois". Must be > 1.
-#' @param serial the serial interval. A function that takes one parameter
-#'     (`n`), the number of serial intervals to randomly sample.
-#'     Value must be >= 0.
-#' @param t0 start time
-#' @param tf end time
-#' @param pop the population
-#' @param initial_immune the number of initial immunes in the population
-#' @return a data frame with columns `time`, `id` (a unique ID for each
-#'     individual element of the chain), `ancestor` (the ID of the ancestor
-#'      of each element), and `generation`.
-#'
-#' @details This function has a couple of key differences with chain_sim:
-#'     it can only simulate one chain at a time,
-#'     it can only handle implemented offspring distributions
-#'         ("pois" and "nbinom"),
-#'     it always tracks and returns a data frame containing the entire tree,
-#'     the maximal length of chains is limited with pop instead of infinite.
-#'
-#' @author Flavio Finger
-#' @export
-#' @examples
-#' chain_sim_susc("pois", mn_offspring = 0.5, serial = function(x) 3, pop = 100)
-chain_sim_susc <- function(offspring = c("pois", "nbinom"),
-                           mn_offspring,
-                           disp_offspring,
-                           serial,
-                           t0 = 0,
-                           tf = Inf,
-                           pop,
-                           initial_immune = 0) {
-  offspring <- match.arg(offspring)
-
-  if (offspring == "pois") {
-    if (!missing(disp_offspring)) {
-      warning(sprintf("%s %s",
-                     "Argument 'disp_offspring' not used for",
-                    "poisson offspring distribution."
-                    )
-              )
-    }
-
-    ## using a right truncated poisson distribution
-    ## to avoid more cases than susceptibles
-    offspring_fun <- function(n, susc) {
-      truncdist::rtrunc(
-        n,
-        spec = "pois",
-        lambda = mn_offspring * susc / pop,
-        b = susc
-      )
-    }
-  } else if (offspring == "nbinom") {
-  if (missing(disp_offspring)) {
-    stop(sprintf("%s", "Argument 'disp_offspring' was not specified."))
-  } else if (disp_offspring <= 1) { ## dispersion index
-    stop(sprintf("%s %s %s",
-      "Offspring distribution 'nbinom' requires",
-      "argument 'disp_offspring' > 1.",
-      "Use 'pois' if there is no overdispersion."
-    ))
-  }
-    offspring_fun <- function(n, susc) {
-      ## get distribution params from mean and dispersion
-      ## see ?rnbinom for parameter definition
-      new_mn <- mn_offspring * susc / pop ## apply susceptibility
-      size <- new_mn / (disp_offspring - 1)
-
-      ## using a right truncated nbinom distribution
-      ## to avoid more cases than susceptibles
-      truncdist::rtrunc(
-        n,
-        spec = "nbinom",
-        b = susc,
-        mu = new_mn,
-        size = size
-      )
-    }
-  }
-
-  ## initializations
-  tdf <- data.frame(
-    id = 1L,
-    ancestor = NA_integer_,
-    generation = 1L,
-    time = t0,
-    offspring_generated = FALSE
-  )
-
-  susc <- pop - initial_immune - 1L
-  t <- t0
-
-  ## continue if any unsimulated has t <= tf
-  ## AND there is still susceptibles left
-  while (
-    any(tdf$time[!tdf$offspring_generated] <= tf) &&
-      susc > 0
-  ) {
-
-    ## select from which case to generate offspring
-    t <- min(tdf$time[!tdf$offspring_generated]) # lowest unsimulated t
-
-    ## index of the first in df with t, extract vars
-    idx <- which(tdf$time == t & !tdf$offspring_generated)[1]
-    id_parent <- tdf$id[idx]
-    t_parent <- tdf$time[idx]
-    gen_parent <- tdf$generation[idx]
-
-    ## generate it
-    current_max_id <- max(tdf$id)
-    n_offspring <- offspring_fun(1, susc)
-
-    if (n_offspring %% 1 > 0) {
-      stop("Offspring distribution must return integers")
-    }
-
-    ## mark as done
-    tdf$offspring_generated[idx] <- TRUE
-
-    ## add to df
-    if (n_offspring > 0) {
-      ## draw times
-      new_times <- serial(n_offspring)
-
-      if (any(new_times < 0)) {
-        stop("Serial interval must be >= 0.")
-      }
-
-      new_df <- data.frame(
-        id = current_max_id + seq_len(n_offspring),
-        time = new_times + t_parent,
-        ancestor = id_parent,
-        generation = gen_parent + 1L,
-        offspring_generated = FALSE
-      )
-
-      ## add new cases to tdf
-      tdf <- rbind(tdf, new_df)
-    }
-
-    ## adjust susceptibles
-    susc <- susc - n_offspring
-  }
-
-  ## remove cases with time > tf that could
-  ## have been generated in the last generation
-  tdf <- tdf[tdf$time <= tf, ]
-
-  ## sort output and remove columns not needed
-  tdf <- tdf[order(tdf$time, tdf$id), ]
-  tdf$offspring_generated <- NULL
-
-  return(tdf)
-}

From ff26f99c822b1d4c92d100b1d6506e2a4ced9276 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:13:08 +0100
Subject: [PATCH 286/828] Changed old argument documentation to sentence case

---
 R/borel.r    | 15 ++++++++-------
 R/simulate.r |  4 ++--
 2 files changed, 10 insertions(+), 9 deletions(-)

diff --git a/R/borel.r b/R/borel.r
index 9804cc55..05a93b46 100644
--- a/R/borel.r
+++ b/R/borel.r
@@ -1,10 +1,11 @@
 ##' Density of the Borel distribution
 ##'
-##' @param x vector of integers.
+##' @param x Vector of integers.
 ##' @param mu mu parameter.
-##' @param log logical; if TRUE, probabilities p are given as log(p).
-##' @return probability mass.
+##' @param log Logical; if TRUE, probabilities p are given as log(p).
+##' @return Probability mass.
 ##' @author Sebastian Funk
+##' @export
 dborel <- function(x, mu, log = FALSE) {
   if (x < 1) stop("'x' must be greater than 0")
   ld <- -mu * x + (x - 1) * log(mu * x) - lgamma(x + 1)
@@ -15,11 +16,11 @@ dborel <- function(x, mu, log = FALSE) {
 ##' Generate random numbers from the Borel distribution
 ##'
 ##' Random numbers are generated by simulating from a Poisson branching process
-##' @param n number of random variates to generate.
+##' @param n Number of random variates to generate.
 ##' @param mu mu parameter.
-##' @param infinite any number to treat as infinite; simulations will be stopped
-##'     if this number is reached
-##' @return vector of random numbers
+##' @param infinite Any number to treat as infinite; simulations will be
+##' stopped if this number is reached
+##' @return Vector of random numbers
 ##' @author Sebastian Funk
 rborel <- function(n, mu, infinite = Inf) {
   chain_sim(n, "pois", "size", infinite = infinite, lambda = mu)
diff --git a/R/simulate.r b/R/simulate.r
index cef83fc2..f9467900 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,6 +1,6 @@
 #' Simulate a tree of infections with a serial and offspring distributions
 #'
-#' @param nchains number of chains to simulate
+#' @param nchains Number of chains to simulate.
 #' @param offspring_sampler Offspring distribution: a character string
 #' corresponding to the R distribution function (e.g., "pois" for Poisson,
 #' where \code{\link{rpois}} is the R function to generate Poisson random
@@ -324,7 +324,7 @@ simulate_vect <- function(nchains, offspring_sampler,
 #' simulate_tree_from_pop(pop = 100, offspring_sampler = "pois",
 #' mean_offspring = 0.5, serial_sampler = function(x) 3)
 #'
-#' #' # Simulate with negative binomial offspring
+#' # Simulate with negative binomial offspring
 #' simulate_tree_from_pop(pop = 100, offspring_sampler = "nbinom",
 #' mean_offspring = 0.5, disp_offspring = 1.1, serial_sampler = function(x) 3)
 simulate_tree_from_pop <- function(pop,

From 4b6648dd2829a81dea7197db0dc55cefc016be91 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:13:43 +0100
Subject: [PATCH 287/828] Replaced call to chain_sim() with simulate_vect()

---
 R/borel.r | 8 +++++++-
 1 file changed, 7 insertions(+), 1 deletion(-)

diff --git a/R/borel.r b/R/borel.r
index 05a93b46..9d470d4b 100644
--- a/R/borel.r
+++ b/R/borel.r
@@ -22,6 +22,12 @@ dborel <- function(x, mu, log = FALSE) {
 ##' stopped if this number is reached
 ##' @return Vector of random numbers
 ##' @author Sebastian Funk
+##' @export
 rborel <- function(n, mu, infinite = Inf) {
-  chain_sim(n, "pois", "size", infinite = infinite, lambda = mu)
+  simulate_vect(nchains = n,
+                offspring_sampler = "pois",
+                chain_statistic = "size",
+                chain_stat_max = infinite,
+                lambda = mu
+                )
 }

From c66caa902a373af9d61804f2c65af00fa72addc7 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:14:43 +0100
Subject: [PATCH 288/828] Documented the print method

---
 R/epichains.R | 6 ++++++
 1 file changed, 6 insertions(+)

diff --git a/R/epichains.R b/R/epichains.R
index bf59d969..883aa708 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -1,3 +1,9 @@
+#' Print an [`epichains`] object
+#'
+#' @param x An [`epichains`] object.
+#' @param ... Other parameters passed to [print()].
+#' @return Invisibly returns an [`epichains`]. Called for side-effects.
+#' @export
 print.epichains <- function(x, ...) {
   format(x, ...)
 }

From 502d14df5b2940133126b8c8e1d613c9069165c2 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:15:09 +0100
Subject: [PATCH 289/828] Removed tibble import

---
 R/epichains.R | 1 -
 1 file changed, 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 883aa708..7d49ce1b 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -12,7 +12,6 @@ print.epichains <- function(x, ...) {
 #'
 #' @param x epichains object
 #' @param ... further arguments passed to or from other methods
-#' @importFrom tibble as_tibble
 #' @return Invisibly returns an [`epichains`]. Called for printing side-effects.
 #' @export
 #'

From c4b188555cb9ae5ad9b821bb72670a915cf75fe9 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:15:39 +0100
Subject: [PATCH 290/828] Replaced subset() call with [ call

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 7d49ce1b..9b885d4b 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -33,7 +33,7 @@ format.epichains <- function(x, ...) {
       )
 
     # print head of the simulation output
-    print(head(subset(as.data.frame(x), !is.na(ancestor))))
+    print(head(x[!is.na(x$ancestor), ]))
 
     cat("< tree tail >\n")
 

From 4eac2ff2d2225fc30b5574e6895eef9a99cf7cd1 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:16:55 +0100
Subject: [PATCH 291/828] Removed old chain_ll() tests

---
 tests/testthat/tests-ll.r | 43 +--------------------------------------
 1 file changed, 1 insertion(+), 42 deletions(-)

diff --git a/tests/testthat/tests-ll.r b/tests/testthat/tests-ll.r
index 73085038..a29bb9ac 100644
--- a/tests/testthat/tests-ll.r
+++ b/tests/testthat/tests-ll.r
@@ -1,24 +1,4 @@
 chains <- c(1, 1, 4, 7)
-
-test_that("Likelihoods can be calculated", {
-  expect_lt(chain_ll(chains, "pois", "size", lambda = 0.5), 0)
-  expect_lt(chain_ll(chains, "pois", "size", lambda = 0.5, exclude = 1), 0)
-  expect_lt(chain_ll(chains, "pois", "size", lambda = 0.5, infinite = 5), 0)
-  expect_lt(chain_ll(chains, "pois", "size",
-    lambda = 0.5, obs_prob = 0.5,
-    nsim_obs = 1
-  ), 0)
-  expect_lt(chain_ll(chains, "pois", "length",
-    lambda = 0.5, obs_prob = 0.5,
-    nsim_obs = 1
-  ), 0)
-  expect_lt(chain_ll(chains, "pois", "size",
-    lambda = 0.5, infinite = 5,
-    obs_prob = 0.5, nsim_obs = 1
-  ), 0)
-  expect_lt(chain_ll(chains, "binom", "size", size = 1, prob = 0.5), 0)
-})
-
 test_that("Analytical size or length distributions are implemented", {
   expect_true(all(pois_size_ll(chains, lambda = 0.5) < 0))
   expect_true(all(nbinom_size_ll(chains, mu = 0.5, size = 0.2) < 0))
@@ -29,25 +9,4 @@ test_that("Analytical size or length distributions are implemented", {
   expect_true(all(geom_length_ll(chains, prob = 0.5) < 0))
 })
 
-test_that("Errors are thrown", {
-  expect_error(
-    chain_ll(chains, list(), "size", lambda = 0.5),
-    "not a character"
-  )
-  expect_error(
-    chain_ll(chains, "pois", "size", lambda = 0.5, obs_prob = 3),
-    "must be within"
-  )
-  expect_error(
-    chain_ll(chains, "pois", "size", lambda = 0.5, obs_prob = 0.5),
-    "must be specified"
-  )
-  expect_error(
-    nbinom_size_ll(chains, mu = 0.5, size = 0.2, prob = 0.1),
-    "both specified"
-  )
-  expect_error(
-    gborel_size_ll(chains, mu = 0.5, size = 0.2, prob = 0.1),
-    "both specified"
-  )
-})
+

From 56f6941a15c53c2defcd1890f02abbdf1e784ba7 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:18:16 +0100
Subject: [PATCH 292/828] Replaced calls to sim_chain_tree() with
 simulate_tree()

---
 R/simulate.r | 16 +++++++---------
 1 file changed, 7 insertions(+), 9 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index f9467900..392ab074 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -29,7 +29,7 @@
 #' @author James M. Azam, Sebastian Funk
 #' @export
 #' @details
-#' `sim_chain_tree()` simulates a branching process of the form:
+#' `simulate_tree()` simulates a branching process of the form:
 #' WIP
 #' # The serial interval (`serials_sampler`):
 #'
@@ -46,7 +46,7 @@
 #'
 #' See References below for some literature on the subject.
 #'
-#' ## Specifying `serials_sampler` in `sim_chain_tree()`
+#' ## Specifying `serials_sampler` in `simulate_tree()`
 #'
 #' `serials_sampler` must be specified as a named or
 #' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R) # nolint
@@ -59,12 +59,12 @@
 #' number of serial intervals to sample:
 #' \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
 #' and assign the name of the function to `serials_sampler` in
-#' `sim_chain_tree()` like so
-#' \code{sim_chain_tree(..., serials_sampler = serial_interval)},
-#' where `...` are the other arguments to `sim_chain_tree()`.
+#' `simulate_tree()` like so
+#' \code{simulate_tree(..., serials_sampler = serial_interval)},
+#' where `...` are the other arguments to `simulate_tree()`.
 #'
 #' Alternatively, we could assign an anonymous function to `serials_sampler`
-#' in the `sim_chain_tree()` call like so
+#' in the `simulate_tree()` call like so
 #' \code{simulate_tree(..., serials_sampler = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
 #' where `...` are the other arguments to `simulate_tree()`.
 #' @seealso [simulate_vec()] for simulating transmission chains as a vector
@@ -81,11 +81,9 @@
 #' doi: 10.1098/rsif.2020.0756. Epub 2021 Jan 6.
 #' PMID: 33402022; PMCID: PMC7879757.
 #'
-#'
 #' Fine PE. The interval between successive cases of an
 #' infectious disease. Am J Epidemiol. 2003 Dec 1;158(11):1039-47.
 #' doi: 10.1093/aje/kwg251. PMID: 14630599.
-#'
 simulate_tree <- function(nchains, offspring_sampler,
                            chain_statistic = c("size", "length"),
                            chain_stat_max = Inf, serials_sampler, t0 = 0,
@@ -213,7 +211,7 @@ simulate_tree <- function(nchains, offspring_sampler,
 
 #' Simulate transmission chains without tree (as a vector)
 #'
-#' @inheritParams sim_chain_tree
+#' @inheritParams simulate_tree
 #' @param chain_stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to `Inf`.
 #' @examples #' simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,

From ea78cde63890a16d2bc03ecac4e5ef9207be967c Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:18:50 +0100
Subject: [PATCH 293/828] Fixed wrong call to simulate_vect() as simulate_vec()

---
 R/simulate.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index 392ab074..3305d70d 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -67,7 +67,7 @@
 #' in the `simulate_tree()` call like so
 #' \code{simulate_tree(..., serials_sampler = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
 #' where `...` are the other arguments to `simulate_tree()`.
-#' @seealso [simulate_vec()] for simulating transmission chains as a vector
+#' @seealso [simulate_vect()] for simulating transmission chains as a vector
 #' @examples
 #' set.seed(123)
 #' chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,

From 82eac7ebf74a26c52a91a2c0a7cc3b30b08b9124 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:19:35 +0100
Subject: [PATCH 294/828] Fixed examples with right function and argument names

---
 R/simulate.r | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 3305d70d..707bee1d 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -71,7 +71,7 @@
 #' @examples
 #' set.seed(123)
 #' chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,
-#' offspring = "pois", lambda = 2, infinite = 10)
+#' offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
 #' chains
 #' @references
 #'
@@ -214,7 +214,8 @@ simulate_tree <- function(nchains, offspring_sampler,
 #' @inheritParams simulate_tree
 #' @param chain_stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to `Inf`.
-#' @examples #' simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,
+#' @examples
+#' simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,
 #' chain_stat_max = 10)
 simulate_vect <- function(nchains, offspring_sampler,
                            chain_statistic = c("size", "length"),

From d15cc7721b095fd2f9fe3795db8ece4d2c24026d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:20:13 +0100
Subject: [PATCH 295/828] Added export tags

---
 R/simulate.r | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index 707bee1d..ff6fad0e 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -217,6 +217,7 @@ simulate_tree <- function(nchains, offspring_sampler,
 #' @examples
 #' simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,
 #' chain_stat_max = 10)
+#' @export
 simulate_vect <- function(nchains, offspring_sampler,
                            chain_statistic = c("size", "length"),
                            chain_stat_max = Inf, ...) {
@@ -317,7 +318,6 @@ simulate_vect <- function(nchains, offspring_sampler,
 #' "nbinom").
 #' @author Flavio Finger
 #' @author James M. Azam
-#' @export
 #' @examples
 #' # Simulate with poisson offspring
 #' simulate_tree_from_pop(pop = 100, offspring_sampler = "pois",
@@ -326,6 +326,7 @@ simulate_vect <- function(nchains, offspring_sampler,
 #' # Simulate with negative binomial offspring
 #' simulate_tree_from_pop(pop = 100, offspring_sampler = "nbinom",
 #' mean_offspring = 0.5, disp_offspring = 1.1, serial_sampler = function(x) 3)
+#' @export
 simulate_tree_from_pop <- function(pop,
                                    offspring_sampler = c("pois", "nbinom"),
                                    mean_offspring,

From 79bc4f626ede2fb0d528627c7742e7c711818b47 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:21:17 +0100
Subject: [PATCH 296/828] Fixed the documentation of plot()

---
 R/epichains.R | 9 ++++-----
 1 file changed, 4 insertions(+), 5 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 9b885d4b..17fd9b7e 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -200,13 +200,12 @@ tail.epichains <- function(x, ...) {
 
 #' Plot epichains tree objects
 #'
-#' @param x an [`epichains`] object with a chains_tree attribute
-#' @param ...
+#' @param x An [`epichains`] object with a chains_tree attribute
+#' @param ... Other arguments passed to plot
 #'
-#' @return
-#' @export
+#' @return A plot of cases over time and generation
 #' @author James M. Azam
-#' @examples
+#' @export
 plot.epichains <- function(x, ...){
   validate_epichains(x)
 

From 20e46d88e645ed8c4e534cc1b305c23cf104c410 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:22:09 +0100
Subject: [PATCH 297/828] Added explicit namespacing for imports

---
 R/epichains.R | 7 +++++--
 1 file changed, 5 insertions(+), 2 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 17fd9b7e..aa0bf437 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -216,10 +216,12 @@ plot.epichains <- function(x, ...){
   cases_per_generation <- aggregate(sim_id ~ generation, x = as.data.frame(x), FUN = NROW)
 
   cases_per_time <- aggregate(sim_id ~ time, x = as.data.frame(x), FUN = NROW)
+  cases_per_time <- stats::aggregate(sim_id ~ time, x = as.data.frame(x), FUN = NROW)
 
   graphics::par(mfrow = c(1, 2), mar = c(4, 3, 3, 1), oma = c(0, 0, 0, 0))
 
-  plot(cases_per_generation$generation,
+  # Make first plot
+  graphics::plot(cases_per_generation$generation,
        cases_per_generation$sim_id,
        xlab = "Generation",
        ylab = "Cases",
@@ -227,7 +229,8 @@ plot.epichains <- function(x, ...){
        main = "Number of cases per generation"
        )
 
-  plot(cases_per_time$time,
+  # Make second plot
+  graphics::plot(cases_per_time$time,
        cases_per_time$sim_id,
        xlab = "Time",
        ylab = "Cases",

From 7b9ee7d6646a655d30ee91a17e09b358c6a31d11 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:22:31 +0100
Subject: [PATCH 298/828] Removed example tag from format method

---
 R/epichains.R | 2 --
 1 file changed, 2 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index aa0bf437..f4f734fb 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -14,8 +14,6 @@ print.epichains <- function(x, ...) {
 #' @param ... further arguments passed to or from other methods
 #' @return Invisibly returns an [`epichains`]. Called for printing side-effects.
 #' @export
-#'
-#' @examples
 format.epichains <- function(x, ...) {
   # check that x is an epichains object
   validate_epichains(x)

From ce128e9b4343e27dd1ddf41b69a258a89c84e5d7 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:28:11 +0100
Subject: [PATCH 299/828] Added code to calculate cases per generation

---
 R/epichains.R | 10 +++++++---
 1 file changed, 7 insertions(+), 3 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index f4f734fb..1e2718ac 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -211,11 +211,15 @@ plot.epichains <- function(x, ...){
     stop("Object must be an epichains object with a chains_tree attribute.")
   }
 
-  cases_per_generation <- aggregate(sim_id ~ generation, x = as.data.frame(x), FUN = NROW)
-
-  cases_per_time <- aggregate(sim_id ~ time, x = as.data.frame(x), FUN = NROW)
+  # Count the number of cases per generation
+  cases_per_generation <- stats::aggregate(sim_id ~ generation,
+                                           x = as.data.frame(x),
+                                           FUN = NROW
+                                           )
+  # Count the number of cases per time
   cases_per_time <- stats::aggregate(sim_id ~ time, x = as.data.frame(x), FUN = NROW)
 
+  # Set up grid
   graphics::par(mfrow = c(1, 2), mar = c(4, 3, 3, 1), oma = c(0, 0, 0, 0))
 
   # Make first plot

From b91792db263509346359a77cad3493da639b00d6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:29:05 +0100
Subject: [PATCH 300/828] Cleaned up documentation of the head and tail methods

---
 R/epichains.R | 15 +++++++++------
 1 file changed, 9 insertions(+), 6 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 1e2718ac..368e2f79 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -145,6 +145,7 @@ is_epichains <- function(x) {
 #'
 #' @return Checks if an object is of class `epichains` and if so
 #' checks that it's in the right format as a "data.frame" or vector.
+#' @keywords internal
 validate_epichains <- function(x) {
   if (!is_epichains(x)) {
     stop("Object must have an epichains class")
@@ -176,21 +177,23 @@ validate_epichains <- function(x) {
   invisible(x)
 }
 
-#' `head` and `tail` methods for [`epichains`] class
+#' `head` method for [`epichains`] class
 #'
 #' @param x An [`epichains`] object
 #' @param ... further arguments passed to or from other methods
-#'
+#' @importFrom utils head
 #' @return object of class `data.frame`
+#' @author James M. Azam
 #' @export
-#'
-#' @importFrom utils head
-#' @importFrom utils tail
 head.epichains <- function(x, ...) {
   utils::head(as.data.frame(x), ...)
 }
 
-#' @rdname head.epichains
+#' `tail` method for [`epichains`] class
+#' @param x An [`epichains`] object
+#' @param ... further arguments passed to or from other methods
+#' @importFrom utils tail
+#' @author James M. Azam
 #' @export
 tail.epichains <- function(x, ...) {
   utils::tail(as.data.frame(x), ...)

From d69d9862828aeb50f6eb5ced32a531ce03a8d32b Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:30:11 +0100
Subject: [PATCH 301/828] Documented validate_epichains() and is_epichains()

---
 R/epichains.R | 3 +--
 1 file changed, 1 insertion(+), 2 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 368e2f79..f2afb071 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -133,8 +133,6 @@ summary.epichains <- function(x, ...) {
 #' @return logical, `TRUE` if the object is an `epichains` and `FALSE`
 #' otherwise
 #' @export
-#'
-#' @examples
 is_epichains <- function(x) {
   inherits(x, "epichains")
 }
@@ -146,6 +144,7 @@ is_epichains <- function(x) {
 #' @return Checks if an object is of class `epichains` and if so
 #' checks that it's in the right format as a "data.frame" or vector.
 #' @keywords internal
+#' @author James M. Azam
 validate_epichains <- function(x) {
   if (!is_epichains(x)) {
     stop("Object must have an epichains class")

From 8b9d1bbedfce3b402d91ccf4963c383e16c2c8da Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:31:13 +0100
Subject: [PATCH 302/828] Replaced "x" with "object" in summary method to align
 with generic

---
 R/epichains.R | 28 +++++++++++++---------------
 1 file changed, 13 insertions(+), 15 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index f2afb071..360580ec 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -78,29 +78,27 @@ format.epichains <- function(x, ...) {
 
 #' Summary method for epichains class
 #'
-#' @param object epichains object
+#' @param object An [`epichains`] object
 #' @param ... further arguments passed to or from other methods
 #'
 #' @return data frame of information
 #' @export
-#'
-#' @examples
-summary.epichains <- function(x, ...) {
-  validate_epichains(x)
+summary.epichains <- function(object, ...) {
+  validate_epichains(object)
 
-  if (attributes(x)$chain_type == "chains_tree") {
+  if (attributes(object)$chain_type == "chains_tree") {
 
-    chains_ran <- length(x$n)
+    chains_ran <- length(object$n)
 
-    max_time <- max(x$time)
+    max_time <- max(object$time)
 
     n_unique_ancestors <- length(
-      unique(x$ancestor[!is.na(x$ancestor)])
+      unique(object$ancestor[!is.na(object$ancestor)])
     )
 
-    num_generations <- length(unique(x$generation))
+    num_generations <- length(unique(object$generation))
 
-    max_generation <- max(x$generation)
+    max_generation <- max(object$generation)
 
     # out of summary
     res <- list(
@@ -111,10 +109,10 @@ summary.epichains <- function(x, ...) {
       num_generations = num_generations,
       max_generation = max_generation
     )
-  } else if (attributes(x)$chain_type == "chains_vec") {
-    chains_ran <- length(x)
-    max_chain_stat <- max(!is.infinite(x))
-    min_chain_stat <- min(!is.infinite(x))
+  } else if (attributes(object)$chain_type == "chains_vec") {
+    chains_ran <- length(object)
+    max_chain_stat <- max(!is.infinite(object))
+    min_chain_stat <- min(!is.infinite(object))
 
     res <- list(
       unique_chains = chains_ran,

From 4912e3a7b0cc842f21de67761ca33bd86c71a10d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:32:05 +0100
Subject: [PATCH 303/828] Cleaned up documentation of update_chain_stat()

---
 R/helpers.R | 10 ++++------
 1 file changed, 4 insertions(+), 6 deletions(-)

diff --git a/R/helpers.R b/R/helpers.R
index d835653e..94bc981a 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -1,12 +1,10 @@
 #' Determine and update the chain statistic being tracked
 #'
-#' @param stat_type
-#' @param noffspring
-#'
-#' @return
-#' @export
+#' @param stat_type Chain statistic (size/length) to update.
+#' @param stat_latest The latest chain statistic vector to be updated.
+#' @param n_offspring A vector of offspring per chain.
+#' @return A vector of chain statistics (size/length).
 #' @keywords internal
-#' @examples
 update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
   if (stat_type == "size") {
     stat_latest <- stat_latest + n_offspring

From 29d67756ab6b665711717b2875fa2293a948c9cd Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:33:03 +0100
Subject: [PATCH 304/828] Cleaned up documentation of get_offspring_func()

---
 R/helpers.R | 13 ++++++++-----
 1 file changed, 8 insertions(+), 5 deletions(-)

diff --git a/R/helpers.R b/R/helpers.R
index 94bc981a..7489c2e7 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -18,12 +18,15 @@ update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
 
 #' Get offspring sampling function
 #'
-#' @param offspring_sampler
+#' @param n Number of items to sample
+#' @param susc Susceptible population size (calculated
+#' inside \code{\link{simulate_tree_from_pop}}  as pop - initial_immune)
+#' @inheritParams simulate_tree_from_pop
 #'
-#' @return
-#' @export
-#'
-#' @examples
+#' @return An offspring sampling function
+#' @keywords internal
+get_offspring_func <- function(offspring_sampler, n, susc, pop,
+                               mean_offspring, disp_offspring = NULL) {
 get_offspring_func <- function(offspring_sampler) {
   if (offspring_sampler == "nbinom") {
     function(n, susc, pop, mean_offspring, disp_offspring) {

From cb455301107fca49ec1767d2f3c87f76b334c25b Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:34:18 +0100
Subject: [PATCH 305/828] Deleted old arguments of get_offspring_func()

---
 R/helpers.R | 1 -
 1 file changed, 1 deletion(-)

diff --git a/R/helpers.R b/R/helpers.R
index 7489c2e7..ce56da8e 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -27,7 +27,6 @@ update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
 #' @keywords internal
 get_offspring_func <- function(offspring_sampler, n, susc, pop,
                                mean_offspring, disp_offspring = NULL) {
-get_offspring_func <- function(offspring_sampler) {
   if (offspring_sampler == "nbinom") {
     function(n, susc, pop, mean_offspring, disp_offspring) {
       ## get distribution params from mean and dispersion

From 32319a34617b611f95470507c0b65abfc5411784 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:35:10 +0100
Subject: [PATCH 306/828] Added required arguments to truncated poisson
 function

---
 R/helpers.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/helpers.R b/R/helpers.R
index ce56da8e..bd4fc844 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -44,7 +44,7 @@ get_offspring_func <- function(offspring_sampler, n, susc, pop,
       )
     }
   } else if (offspring_sampler == "pois") {
-    function(n, susc, pop, mean_offspring) {
+    function(n, susc, pop, mean_offspring, disp_offspring) {
       truncdist::rtrunc(
         n,
         spec = "pois",

From 1637ba9081c0a303d3e13d303a46abdaeff6c4da Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:36:17 +0100
Subject: [PATCH 307/828] Replaced old tests with a minimal set of tests for
 the simulate_ family of functions

---
 tests/testthat/tests-sim.r | 187 ++++---------------------------------
 1 file changed, 20 insertions(+), 167 deletions(-)

diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index bb053a43..84a7b5e9 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -1,175 +1,28 @@
-test_that("Chains can be simulated", {
-  expect_length(chain_sim(n = 2, "pois", lambda = 0.5), 2)
-  expect_length(chain_sim(n = 10, "pois", "length", lambda = 0.9), 10)
+test_that("Simulators output epichains objects", {
   expect_s3_class(
-    chain_sim(n = 10,
-              "pois",
-              lambda = 2,
-              tree = TRUE,
-              infinite = 10
-              ),
-    "data.frame"
+    simulate_tree(nchains = 10,
+                  offspring_sampler = "pois",
+                  lambda = 2,
+                  chain_statistic = "size",
+                  chain_stat_max = 10
+                  ),
+    "epichains"
     )
-  expect_false(any(is.finite(chain_sim(
-    n = 2, "pois", "length", lambda = 0.5,
-    infinite = 1
-  ))))
-  expect_no_error(chain_sim(
-    n = 2, offspring = "pois", "size", lambda = 0.9,
-    tree = TRUE
-  ))
-})
-
-test_that("Errors are thrown", {
-  expect_error(chain_sim(n = 2, "dummy"), "does not exist")
-  expect_error(chain_sim(n = 2, "lnorm", meanlog = log(1.6)), "integer")
-  expect_error(
-    chain_sim(n = 2, offspring = pois, "length", lambda = 0.9),
-    "not found"
-  )
-  expect_error(chain_sim(
-    n = 2, offspring = "pois", "size", lambda = 0.9,
-    serial = c(1, 2), "must be a function"
-  ))
-  expect_error(
-    chain_sim(n = 2, offspring = c(1, 2), "length", lambda = 0.9),
-    "not a character string"
-  )
-  expect_error(
-    chain_sim(n = 2, offspring = list(1, 2), "length", lambda = 0.9),
-    "not a character string"
-  )
-  expect_error(
-    chain_sim(
-      n = 2,
-      offspring = "pois",
-      "size",
-      lambda = 0.9,
-      tf = 5,
-      tree = FALSE
-    ),
-    "If `tf` is specified, `serial` must be specified too."
-  )
-})
-
-test_that("Chains can be simulated", {
   expect_s3_class(
-      chain_sim_susc(
-        "pois",
-        mn_offspring = 2,
-        serial = function(x) 3,
-        pop = 100
-      ),
-      "data.frame"
-  )
-
-  expect_s3_class(
-      chain_sim_susc(
-        "nbinom",
-        mn_offspring = 2,
-        disp_offspring = 1.5,
-        serial = function(x) 3,
-        pop = 100
-      ),
-      "data.frame"
-  )
-
-  expect_identical(
-    nrow(
-      chain_sim_susc(
-        "pois",
-        mn_offspring = 2,
-        serial = function(x) 3,
-        pop = 1
-      )
-    ),
-    1L
-  )
-
-  expect_identical(
-    nrow(
-      chain_sim_susc(
-        "pois",
-        mn_offspring = 100,
-        tf = 2,
-        serial = function(x) 3,
-        pop = 999
-      )
-    ),
-    1L
-  )
-
-  expect_identical(
-    nrow(
-      chain_sim_susc(
-        "pois",
-        mn_offspring = 100,
-        serial = function(x) 3,
-        pop = 999,
-        initial_immune = 998
-      )
-    ),
-    1L
-  )
-})
-
-test_that("Errors are thrown", {
-  expect_error(
-    chain_sim_susc(
-      "dummy",
-      mn_offspring = 3,
-      serial = function(x) 3,
-      pop = 100
+    simulate_tree_from_pop(pop = 100,
+                           offspring_sampler = "nbinom",
+                           mean_offspring = 0.5,
+                           disp_offspring = 1.1,
+                           serial_sampler = function(x) 3
     ),
-    paste0("'arg' should be one of ", dQuote("pois"), ", ", dQuote("nbinom"))
+    "epichains"
   )
-  expect_error(
-    chain_sim_susc(
-      "nbinom",
-      mn_offspring = 3,
-      disp_offspring = 1,
-      serial = function(x) 3,
-      pop = 100
-    ),
-    paste("Offspring distribution 'nbinom'",
-          "requires argument 'disp_offspring' > 1.",
-          "Use 'pois' if there is no overdispersion."
-          )
-    )
-  expect_error(
-    chain_sim_susc(
-      "nbinom",
-      mn_offspring = 3,
-      serial = function(x) 3,
-      pop = 100
-    ),
-    "Argument 'disp_offspring' was not specified."
-  )
-})
-
-test_that("warnings work as expected", {
-  expect_warning(
-    chain_sim_susc(
-      "pois",
-      mn_offspring = 3,
-      disp_offspring = 1,
-      serial = function(x) 3,
-      pop = 100
-    ),
-    "Argument 'disp_offspring' not used for poisson offspring distribution."
-  )
-  expect_warning(
-    chain_sim(
-      n = 2,
-      offspring = "pois",
-      "size",
-      lambda = 0.9,
-      serial = function(x) rpois(x, 0.9),
-      tree = FALSE
+  expect_s3_class(
+    simulate_vect(n = 10,
+                  offspring_sampler = "pois",
+                  lambda = 2,
+                  chain_stat_max = 10
     ),
-    sprintf("%s %s",
-            "`serial` can't be used with `tree = FALSE`;",
-            "Setting `tree = TRUE` internally."
-    )
+    "epichains"
   )
 })

From 61b06b30ba456d9315212874b357cb32b0e872fe Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:37:29 +0100
Subject: [PATCH 308/828] Reworded title of estimate_likelihood

---
 R/likelihood_estimation.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index 3540efd5..d09c11ef 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -1,4 +1,4 @@
-#' Likelihood for the outcome of a branching process
+#' Estimate the (log) likelihood for observed branching processes
 #'
 #' @param x vector of sizes or lengths of transmission chains
 #' @param stat statistic given as \code{x} ("size" or "length" of chains)

From e3ffff1fc6be2def4d5d792b6ffc13f2ec40b279 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:41:01 +0100
Subject: [PATCH 309/828] Redocumented estimate_likelihood() owing to new and
 renamed arguments

---
 R/likelihood_estimation.R | 56 ++++++++++++++++++++++++---------------
 1 file changed, 34 insertions(+), 22 deletions(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index d09c11ef..a6c38603 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -1,29 +1,41 @@
 #' Estimate the (log) likelihood for observed branching processes
 #'
-#' @param x vector of sizes or lengths of transmission chains
-#' @param stat statistic given as \code{x} ("size" or "length" of chains)
-#' @param obs_prob observation probability (assumed constant)
-#' @param infinite any chains of this size/length will be treated as infinite
-#' @param exclude any sizes/lengths to exclude from the likelihood calculation
-#' @param individual if TRUE, a vector of individual log-likelihood
-#' contributions will be returned rather than the sum
-#' @param nsim_obs number of simulations if the likelihood is to be
-#'   approximated for imperfect observations
-#' @param ... parameters for the offspring distribution
-#' @return likelihood, or vector of likelihoods (if \code{obs_prob} < 1), or
-#'  a list of individual likelihood contributions (if \code{individual=TRUE})
-#' @inheritParams chain_sim
-#' @seealso pois_size_ll, nbinom_size_ll, gborel_size_ll, pois_length_ll,
-#'   geom_length_ll, offspring_ll
+#' @param chains_observed Vector of sizes/lengths of transmission chains.
+#' @param chain_statistic Statistic given as \code{chains_observed}
+#' ("size" or "length" of chains).
+#' @param offspring_sampler Offspring distribution: a character string
+#' corresponding to the R distribution function (e.g., "pois" for Poisson,
+#' where \code{\link{rpois}} is the R function to generate Poisson random
+#' numbers).
+#' @param nsim_obs Number of simulations if the likelihood is to be
+#' approximated for imperfect observations.
+#' @param log_trans Logical; Should the results be log-transformed? (Defaults
+#' to TRUE).
+#' @param obs_prob Observation probability (assumed constant)
+#' @param chain_stat_max Any chains of this size/length will be
+#' treated as infinite.
+#' @param exclude A vector of indices of the sizes/lengths to exclude from the
+#' likelihood calculation.
+#' @param individual If TRUE, a vector of individual (log)likelihood
+#' contributions will be returned rather than the sum.
+#' @param ... Parameters for the offspring distribution.
+#' @return
+#' * A log-likelihood, if \code{log_trans = TRUE} (the default)
+#' * A vector of log-likelihoods, if \code{log_trans = TRUE} (the default) and
+#' \code{obs_prob < 1}, or
+#' * A list of individual log-likelihood contributions, if
+#' \code{log_trans = TRUE} (the default) and \code{individual = TRUE}.
+#' else raw likelihoods, or vector of likelihoods
+#' @seealso offspring_ll, pois_size_ll, nbinom_size_ll, gborel_size_ll,
+#' pois_length_ll, geom_length_ll.
 #' @author Sebastian Funk
-#' @export
 #' @examples
-#' chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
-#' chain_ll(chain_sizes, "pois", "size", lambda = 0.5)
-chain_ll <- function(x, offspring, stat = c("size", "length"), obs_prob = 1,
-                     infinite = Inf, exclude = NULL, individual = FALSE,
-                     nsim_obs, ...) {
-  stat <- match.arg(stat)
+#' # example of observed chain sizes
+#' chain_sizes <- c(1, 1, 4, 7)
+#' estimate_likelihood(chains_observed = chain_sizes, chain_statistic = "size",
+#'  offspring_sampler = "pois", nsim_obs = 100, lambda = 0.5)
+#' @export
+estimate_likelihood <- function(chains_observed,
 
   ## checks
   if (!is.character(offspring)) {

From 3df01b2a7769980e0ceed06d6c9dac4195b45d51 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:42:29 +0100
Subject: [PATCH 310/828] Reset up the arguments to estimate_likelihood()

---
 R/likelihood_estimation.R | 7 +++++++
 1 file changed, 7 insertions(+)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index a6c38603..bacb5847 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -36,6 +36,13 @@
 #'  offspring_sampler = "pois", nsim_obs = 100, lambda = 0.5)
 #' @export
 estimate_likelihood <- function(chains_observed,
+                                chain_statistic = c("size", "length"),
+                                offspring_sampler,
+                                nsim_obs,
+                                log_trans = TRUE,
+                                obs_prob = 1, chain_stat_max = Inf,
+                                exclude = NULL, individual = FALSE, ...) {
+  chain_statistic <- match.arg(chain_statistic)
 
   ## checks
   if (!is.character(offspring)) {

From 61d79e8ee08e86d1d93a698eb08e2de99e8c875e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:44:59 +0100
Subject: [PATCH 311/828] Renamed infinite to chain_stat_max and offspring to
 offspring_sampler

---
 R/likelihood_estimation.R | 29 +++++++++++++++--------------
 1 file changed, 15 insertions(+), 14 deletions(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index bacb5847..76cbe3de 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -45,8 +45,8 @@ estimate_likelihood <- function(chains_observed,
   chain_statistic <- match.arg(chain_statistic)
 
   ## checks
-  if (!is.character(offspring)) {
-    stop("Object passed as 'offspring' is not a character string.")
+  if (!is.character(offspring_sampler)) {
+    stop("Object passed as 'offspring_sampler' is not a character string.")
   }
   if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
   if (obs_prob < 1) {
@@ -60,25 +60,26 @@ estimate_likelihood <- function(chains_observed,
     }
     sampled_x <-
       replicate(nsim_obs, pmin(sample_func(length(x), x, obs_prob),
-                               infinite), simplify = FALSE)
+                                           ),
+                               chain_stat_max), simplify = FALSE)
     size_x <- unlist(sampled_x)
-    if (!is.finite(infinite)) infinite <- max(size_x) + 1
+    if (!is.finite(chain_stat_max)) chain_stat_max <- max(size_x) + 1
   } else {
-    x[x >= infinite] <- infinite
+    chains_observed[chains_observed >= chain_stat_max] <- chain_stat_max
     size_x <- x
     sampled_x <- list(x)
   }
 
   ## determine for which sizes to calculate the likelihood (for true chain size)
-  if (any(size_x == infinite)) {
-    calc_sizes <- seq_len(infinite - 1)
+  if (any(size_x == chain_stat_max)) {
+    calc_sizes <- seq_len(chain_stat_max - 1)
   } else {
     calc_sizes <- unique(c(size_x, exclude))
   }
 
-  ## get likelihood function as given by `offspring` and `stat``
+  ## get likelihood function as given by offspring_sampler and chain_statistic
   likelihoods <- vector(mode = "numeric")
-  ll_func <- paste(offspring, stat, "ll", sep = "_")
+  ll_func <- paste(offspring_sampler, chain_statistic, "ll", sep = "_")
   pars <- as.list(unlist(list(...))) ## converts vectors to lists
 
   ## calculate likelihoods
@@ -90,15 +91,15 @@ estimate_likelihood <- function(chains_observed,
       do.call(
         offspring_ll,
         c(list(
-          x = calc_sizes, offspring = offspring,
-          stat = stat, infinite = infinite
+          chains_observed = calc_sizes, offspring_sampler = offspring_sampler,
+          chain_statistic = chain_statistic, chain_stat_max = chain_stat_max
         ), pars)
       )
   }
 
-  ## assign probabilities to infinite outbreak sizes
-  if (any(size_x == infinite)) {
-    likelihoods[infinite] <- complementary_logprob(likelihoods)
+  ## assign probabilities to chain_stat_max outbreak sizes
+  if (any(size_x == chain_stat_max)) {
+    likelihoods[chain_stat_max] <- complementary_logprob(likelihoods)
   }
 
   if (!missing(exclude)) {

From 7ef5c54bce62390deddd076cef81cf5a395fcb89 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:45:38 +0100
Subject: [PATCH 312/828] Minor styling

---
 R/likelihood_estimation.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index 76cbe3de..0f7d7cc3 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -51,7 +51,7 @@ estimate_likelihood <- function(chains_observed,
   if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
   if (obs_prob < 1) {
     if (missing(nsim_obs)) {
-      stop("'nsim_obs' must be specified if 'obs_prob' is <1")
+      stop("'nsim_obs' must be specified if 'obs_prob' is < 1")
     }
     if (stat == "size") {
       sample_func <- rbinom_size

From 54088e9ae986708d0d1fb560b35e02e14741d276 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:46:13 +0100
Subject: [PATCH 313/828] Renamed "stat" to "chain_statistic"

---
 R/likelihood_estimation.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index 0f7d7cc3..be4c37bb 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -53,9 +53,9 @@ estimate_likelihood <- function(chains_observed,
     if (missing(nsim_obs)) {
       stop("'nsim_obs' must be specified if 'obs_prob' is < 1")
     }
-    if (stat == "size") {
+    if (chain_statistic == "size") {
       sample_func <- rbinom_size
-    } else if (stat == "length") {
+    } else if (chain_statistic == "length") {
       sample_func <- rgen_length
     }
     sampled_x <-

From f317342f69d1b15e3285e7155502d8db9a48f802 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:46:55 +0100
Subject: [PATCH 314/828] Renamed "x" to "chains_observed"

---
 R/likelihood_estimation.R | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index be4c37bb..7b02a509 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -58,16 +58,16 @@ estimate_likelihood <- function(chains_observed,
     } else if (chain_statistic == "length") {
       sample_func <- rgen_length
     }
-    sampled_x <-
-      replicate(nsim_obs, pmin(sample_func(length(x), x, obs_prob),
+    sampled_x <- replicate(nsim_obs, pmin(sample_func(length(chains_observed),
+                                           chains_observed, obs_prob
                                            ),
                                chain_stat_max), simplify = FALSE)
     size_x <- unlist(sampled_x)
     if (!is.finite(chain_stat_max)) chain_stat_max <- max(size_x) + 1
   } else {
     chains_observed[chains_observed >= chain_stat_max] <- chain_stat_max
-    size_x <- x
-    sampled_x <- list(x)
+    size_x <- chains_observed
+    sampled_x <- list(chains_observed)
   }
 
   ## determine for which sizes to calculate the likelihood (for true chain size)

From 6e53f2b9a242ca35d5d709535e044bc173bedd01 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:47:54 +0100
Subject: [PATCH 315/828] Ignore test_refactoring script

---
 .gitignore           |  2 ++
 R/test_refactoring.R | 39 ---------------------------------------
 2 files changed, 2 insertions(+), 39 deletions(-)
 delete mode 100644 R/test_refactoring.R

diff --git a/.gitignore b/.gitignore
index 211a19d8..a2549bb3 100644
--- a/.gitignore
+++ b/.gitignore
@@ -31,3 +31,5 @@ rsconnect/
 /Meta/
 /docs/
 .DS_Store
+
+R/test_refactoring.R
diff --git a/R/test_refactoring.R b/R/test_refactoring.R
deleted file mode 100644
index 5bf9fcf8..00000000
--- a/R/test_refactoring.R
+++ /dev/null
@@ -1,39 +0,0 @@
-
-source("./R/checks.R")
-source("./R/helpers.R")
-source("./R/epichains.R")
-source("./R/simulate.r")
-
-
-# try simulate_tree()
-chains_tree <- simulate_tree(nchains = 10,
-                                   serials_sampler = function(n) {rpois(n, 5)},
-                                   offspring_sampler = "pois",
-                                   lambda = 2,
-                                   chain_stat_max = 10
-                                   )
-
-
-chains_tree
-summary(chains_tree)
-plot(chains_tree)
-
-# try simulate_tree_from_pop()
-
-chains_tree_from_pop <- simulate_tree_from_pop(
-  pop = 100, offspring_sampler = "nbinom",
-  mean_offspring = 0.5, disp_offspring = 1.1,
-  serial_sampler = function(x) 3)
-
-chains_tree_from_pop
-summary(chains_tree_from_pop)
-plot(chains_tree_from_pop)
-
-# try chain_vec simulation
-chains_vec <- simulate_vect(nchains = 10, offspring_sampler = "pois",
-                             lambda = 2, chain_stat_max = 10
-                             )
-
-chains_vec
-summary(chains_vec)
-# plot(chains_vec) #expect error

From 54b87030d959c2b77d7f15d1ca86582ecab28302 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:49:08 +0100
Subject: [PATCH 316/828] Added comments to offspring_ll

---
 R/likelihoods.R | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index dba44ef4..4b92184c 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -103,8 +103,11 @@ geom_length_ll <- function(x, prob) {
 #' @keywords internal
 offspring_ll <- function(x, offspring, stat, nsim_offspring = 100, ...) {
   dist <- chain_sim(nsim_offspring, offspring, stat, ...)
+  # Simulate the chains
+  # Compute the empirical Cumulative Distribution Function of the
+  # simulated chains
 
-  ## linear approximation
+  # Perform a lagged linear interpolation of the points
   f <- stats::ecdf(dist)
   acdf <-
     diff(c(0, stats::approx(

From f9c38497548e20a561d340ca0ad117b469b7649c Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:51:24 +0100
Subject: [PATCH 317/828] Renamed the arguments in offspring_ll

---
 R/likelihoods.R | 10 +++++-----
 1 file changed, 5 insertions(+), 5 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 4b92184c..cca45123 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -101,8 +101,8 @@ geom_length_ll <- function(x, prob) {
 #' @inheritParams chain_ll
 #' @inheritParams chain_sim
 #' @keywords internal
-offspring_ll <- function(x, offspring, stat, nsim_offspring = 100, ...) {
-  dist <- chain_sim(nsim_offspring, offspring, stat, ...)
+offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
+                         nsim_offspring = 100, log_trans = TRUE, ...) {
   # Simulate the chains
   # Compute the empirical Cumulative Distribution Function of the
   # simulated chains
@@ -111,10 +111,10 @@ offspring_ll <- function(x, offspring, stat, nsim_offspring = 100, ...) {
   f <- stats::ecdf(dist)
   acdf <-
     diff(c(0, stats::approx(
-      unique(dist), f(unique(dist)),
-      seq_len(max(dist[is.finite(dist)]))
+      unique(chains), chains_empirical_cdf(unique(chains)),
+      seq_len(max(chains[is.finite(chains)]))
     )$y))
-  lik <- acdf[x]
+  lik <- acdf[chains_observed]
   lik[is.na(lik)] <- 0
   log(lik)
 }

From 3db0c0036417622c73022eaf3402101fd811bed9 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:52:38 +0100
Subject: [PATCH 318/828] Replaced chain_sim() with simulate_tree() in
 offspring_ll()

---
 R/likelihoods.R | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index cca45123..06ef998c 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -104,6 +104,8 @@ geom_length_ll <- function(x, prob) {
 offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
                          nsim_offspring = 100, log_trans = TRUE, ...) {
   # Simulate the chains
+  chains <- simulate_tree(nsim_offspring, offspring_sampler,
+                          chain_statistic, ...)
   # Compute the empirical Cumulative Distribution Function of the
   # simulated chains
 

From d8cdd590f8c0534087269f459fcbaa1a34b05029 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:53:31 +0100
Subject: [PATCH 319/828] Replaced chain_ll() and chain_sim() with new versions

---
 R/likelihoods.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 06ef998c..b7f1b597 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -98,8 +98,8 @@ geom_length_ll <- function(x, prob) {
 #' @param ... any parameters to pass to \code{\link{chain_sim}}
 #' @return log-likelihood values
 #' @author Sebastian Funk
-#' @inheritParams chain_ll
-#' @inheritParams chain_sim
+#' @inheritParams estimate_likelihood
+#' @inheritParams simulate_vec
 #' @keywords internal
 offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
                          nsim_offspring = 100, log_trans = TRUE, ...) {

From cadd6afbdf18165a590ca83b98ef6a97bb68a4d8 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:53:59 +0100
Subject: [PATCH 320/828] Moved ecdf calculation under new comment

---
 R/likelihoods.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index b7f1b597..0984491f 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -108,9 +108,9 @@ offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
                           chain_statistic, ...)
   # Compute the empirical Cumulative Distribution Function of the
   # simulated chains
+  chains_empirical_cdf <- stats::ecdf(chains)
 
   # Perform a lagged linear interpolation of the points
-  f <- stats::ecdf(dist)
   acdf <-
     diff(c(0, stats::approx(
       unique(chains), chains_empirical_cdf(unique(chains)),

From c7b970dbb788039f309681b3396996df8a394c15 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:54:48 +0100
Subject: [PATCH 321/828] Introduced the log_trans argument to log transform
 output of offspring_ll()

---
 R/likelihoods.R | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 0984491f..8488a53b 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -118,5 +118,6 @@ offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
     )$y))
   lik <- acdf[chains_observed]
   lik[is.na(lik)] <- 0
-  log(lik)
+  out <- ifelse(base::isTRUE(log_trans), log(lik), lik)
+  return(out)
 }

From 62b7ed1bb0a3b88044a583da6de1d7c12777df62 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:55:35 +0100
Subject: [PATCH 322/828] Redocumented offspring_ll()

---
 R/likelihoods.R | 17 ++++++++++-------
 1 file changed, 10 insertions(+), 7 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 8488a53b..7bce529b 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -90,13 +90,16 @@ geom_length_ll <- function(x, prob) {
 #' Likelihood of the length of chains with generic offspring distribution
 #'
 #' The likelihoods are calculated with a crude approximation using simulated
-#'   chains by linearly approximating any missing values in the empirical
-#'   cumulative distribution function (ecdf).
-#' @param x vector of sizes
-#' @param nsim_offspring number of simulations of the offspring distribution
-#'   for approximation the size/length distribution
-#' @param ... any parameters to pass to \code{\link{chain_sim}}
-#' @return log-likelihood values
+#' chains by linearly approximating any missing values in the empirical
+#' cumulative distribution function (ecdf).
+#' @param chains_observed Vector of sizes/lengths
+#' @param nsim_offspring Number of simulations of the offspring distribution
+#' for approximating the chain_statistic (size/length) distribution
+#' @param log_trans Logical; Should the results be log-transformed? (Defaults
+#' to TRUE).
+#' @param ... any parameters to pass to \code{\link{simulate_tree}}
+#' @return If \code{log_trans = TRUE} (the default), log-likelihood values,
+#' else raw likelihoods
 #' @author Sebastian Funk
 #' @inheritParams estimate_likelihood
 #' @inheritParams simulate_vec

From 3ad94fe213bfa89549ea16c29fda6629931c15b5 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:55:51 +0100
Subject: [PATCH 323/828] Minor: added new lines

---
 R/likelihoods.R | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 7bce529b..c30d49a6 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -106,9 +106,11 @@ geom_length_ll <- function(x, prob) {
 #' @keywords internal
 offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
                          nsim_offspring = 100, log_trans = TRUE, ...) {
+
   # Simulate the chains
   chains <- simulate_tree(nsim_offspring, offspring_sampler,
                           chain_statistic, ...)
+
   # Compute the empirical Cumulative Distribution Function of the
   # simulated chains
   chains_empirical_cdf <- stats::ecdf(chains)

From b2f7626ac767d55ee8fbf3955bb09bd19b2be4ff Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:56:20 +0100
Subject: [PATCH 324/828] Documented check_offspring_valid()

---
 R/checks.R | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

diff --git a/R/checks.R b/R/checks.R
index 69acb27d..5e003aca 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -1,6 +1,9 @@
 #' Check if offspring argument is specified as a character string
 #'
-#' @param offspring
+#' @param offspring_sampler Offspring distribution: a character string
+#' corresponding to the R distribution function (e.g., "pois" for Poisson,
+#' where \code{\link{rpois}} is the R function to generate Poisson random
+#' numbers).
 #' @keywords internal
 check_offspring_valid <- function(offspring) {
   if (!is.character(offspring)) {

From 6a9fbedbe7e56409462ed23052b8063d994c2ecd Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:56:37 +0100
Subject: [PATCH 325/828] Documented check_offspring_func_valid()

---
 R/checks.R | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/R/checks.R b/R/checks.R
index 5e003aca..c5fb542b 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -18,7 +18,9 @@ check_offspring_valid <- function(offspring) {
 
 #' Check if constructed random number generator for offspring exists
 #'
-#' @param roffspring_name
+#' @param roffspring_name Constructed random offspring sampler: a character
+#' string corresponding to the R distribution function (e.g., "rpois" for
+#' Poisson.
 #' @keywords internal
 check_offspring_func_valid <- function(roffspring_name) {
   if (!(exists(roffspring_name)) || !is.function(get(roffspring_name))) {

From d4b7d27cc2ce39c0aa8b40ef356572a6bfdcc4a1 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:56:53 +0100
Subject: [PATCH 326/828] Documented check_nchains_valid()

---
 R/checks.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/checks.R b/R/checks.R
index c5fb542b..178a77e6 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -47,7 +47,7 @@ check_serial_valid <- function(serials_sampler) {
 
 #' Check that nchains is greater than 0 and not infinite
 #'
-#' @param nchains
+#' @param nchains Number of chains to simulate.
 #'
 #' @keywords internal
 check_nchains_valid <- function(nchains) {

From 5f745f6b3980ce67675e85861c0dcc628160fc01 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:57:06 +0100
Subject: [PATCH 327/828] Documented check_serial_valid()

---
 R/checks.R | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/R/checks.R b/R/checks.R
index 178a77e6..17051512 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -31,7 +31,9 @@ check_offspring_func_valid <- function(roffspring_name) {
 
 #' Check if the serials_sampler argument is specified as a function
 #'
-#' @param serials_sampler
+#' @param serials_sampler The serial interval generator function; the name of a
+#' user-defined named or anonymous function with only one argument `n`,
+#' representing the number of serial intervals to generate.
 #'
 #' @keywords internal
 check_serial_valid <- function(serials_sampler) {

From 651b7159f2501ada56c7e00b73c6cbcd0f577571 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:58:20 +0100
Subject: [PATCH 328/828] Renamed "offspring" to "offspring_sampler" in
 check_offspring_valid()

---
 R/checks.R | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/R/checks.R b/R/checks.R
index 17051512..967b7eee 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -5,11 +5,11 @@
 #' where \code{\link{rpois}} is the R function to generate Poisson random
 #' numbers).
 #' @keywords internal
-check_offspring_valid <- function(offspring) {
-  if (!is.character(offspring)) {
+check_offspring_valid <- function(offspring_sampler) {
+  if (!is.character(offspring_sampler)) {
     stop(sprintf(
       "%s %s",
-      "'offspring' must be specified as a character string.",
+      "'offspring_sampler' must be specified as a character string.",
       "Did you forget to enclose it in quotes?"
     ))
   }

From dd5dab817644182e8b4f9ee1cb9ee6e2eb389665 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 16:03:12 +0100
Subject: [PATCH 329/828] Fixed linting issues

---
 R/epichains.R             | 5 +++--
 tests/testthat/tests-ll.r | 2 --
 2 files changed, 3 insertions(+), 4 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 360580ec..10730f47 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -204,7 +204,7 @@ tail.epichains <- function(x, ...) {
 #' @return A plot of cases over time and generation
 #' @author James M. Azam
 #' @export
-plot.epichains <- function(x, ...){
+plot.epichains <- function(x, ...) {
   validate_epichains(x)
 
   if (attributes(x)$chain_type != "chains_tree") {
@@ -217,7 +217,8 @@ plot.epichains <- function(x, ...){
                                            FUN = NROW
                                            )
   # Count the number of cases per time
-  cases_per_time <- stats::aggregate(sim_id ~ time, x = as.data.frame(x), FUN = NROW)
+  cases_per_time <- stats::aggregate(sim_id ~ time, x = as.data.frame(x),
+                                     FUN = NROW)
 
   # Set up grid
   graphics::par(mfrow = c(1, 2), mar = c(4, 3, 3, 1), oma = c(0, 0, 0, 0))
diff --git a/tests/testthat/tests-ll.r b/tests/testthat/tests-ll.r
index a29bb9ac..13a7c339 100644
--- a/tests/testthat/tests-ll.r
+++ b/tests/testthat/tests-ll.r
@@ -8,5 +8,3 @@ test_that("Analytical size or length distributions are implemented", {
   expect_true(all(pois_length_ll(chains, lambda = 0.5) < 0))
   expect_true(all(geom_length_ll(chains, prob = 0.5) < 0))
 })
-
-

From 1c1c475c83b728fda4d24a171b7c301087f29dfd Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 16:03:32 +0100
Subject: [PATCH 330/828] Deleted forecasting vignette as belongs to bpmodels

---
 vignettes/projecting_incidence.Rmd | 370 -----------------------------
 1 file changed, 370 deletions(-)
 delete mode 100644 vignettes/projecting_incidence.Rmd

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
deleted file mode 100644
index fb36b764..00000000
--- a/vignettes/projecting_incidence.Rmd
+++ /dev/null
@@ -1,370 +0,0 @@
----
-title: "Projecting infectious disease incidence: a COVID-19 example"
-author: "James Azam, Sebastian Funk"
-output:
-  bookdown::html_vignette2:
-    fig_caption: yes
-    code_folding: show
-pkgdown:
-  as_is: true
-bibliography: references.json
-link-citations: true
-vignette: >
-  %\VignetteEncoding{UTF-8}
-  %\VignetteIndexEntry{Projecting infectious disease incidence: a COVID-19 example}
-  %\VignetteEngine{knitr::rmarkdown}
-editor_options: 
-  chunk_output_type: console
----
-
-```{r setup, include=FALSE}
-knitr::opts_chunk$set(echo = TRUE,
-                      message = FALSE,
-                      warning = FALSE,
-                      collapse = TRUE,
-                      comment = "#>"
-                      )
-
-```
-
-## Overview
-
-Branching processes can be used to project infectious disease trends in time 
-provided we can characterize the distribution of times between 
-successive cases (serial interval), and the distribution of secondary cases 
-produced by a single individual (offspring distribution). Such simulations can 
-be achieved in `epichains` with the `chain_sim()` function and @pearson2020, and 
-@abbott2020 illustrate its application to COVID-19. 
-
-The purpose of this vignette is to use early data on COVID-19 in South Africa 
-[@marivate2020] to illustrate how `epichains` can be used to forecast an 
-outbreak. 
-
-Let's load the required packages
-
-```{r packages, include=TRUE}
-library("epichains")
-library("dplyr")
-library("ggplot2")
-library("lubridate")
-library("epiparameter")
-```
-
-## Data
-
-Included in `epichains` is a cleaned time series of the first 15 days of 
-the COVID-19 outbreak in South Africa. This can be loaded into 
-memory as follows: 
-```{r}
-data("covid19_sa", package = "epichains")
-```
-
-Let us examine the first 6 entries of the dataset.
-```{r}
-head(covid19_sa)
-```
-
-## Setting up the inputs  
-
-### Onset times 
-
-`chain_sim()` requires a vector of onset times, `t0`, for each 
-chain/individual/simulation. 
-
-The `covid19_sa` dataset above is aggregated, so we will have to disaggregate
-it into a linelist with each row representing a case and their onset time. 
-
-To achieve this, we will first use the date of the index case as the reference 
-and find the difference between each date and the reference. 
-```{r linelist_gen, message=FALSE}
-days_since_index <- as.integer(covid19_sa$date - min(covid19_sa$date))
-days_since_index
-```
-
-Using the vector of start times for the time series, we will then 
-create the linelist by disaggregating the time series so 
-that each case has a corresponding start time.
-```{r}
-start_times <- unlist(mapply(
-  function(x, y) rep(x, times = ifelse(y == 0, 1, y)),
-  days_since_index,
-  covid19_sa$cases
-))
-
-start_times
-```
-
-### Serial interval
-
-The log-normal distribution is commonly used in epidemiology to characterise 
-quantities such as the serial interval because it has a large variance 
-and can only be positive-valued [@nishiura2007; @limpert2001]. 
-
-In this example, we will assume based on COVID-19 literature that the 
-serial interval, S, is log-normal distributed with parameters, 
-$\mu = 4.7$ and $\sigma = 2.9$ [@pearson2020]. Note that when the distribution
-is described this way, it means $\mu$ and $\sigma$ are the expected value 
-and standard deviation of the natural logarithm of the serial interval. Hence, 
-in order to sample the "back-transformed" measured serial interval with 
-expectation/mean, $E[S]$ and standard deviation, $SD [S]$, 
-we can use the following parametrisation:
-
-\begin{align}
-E[S] &= \ln \left( \dfrac{\mu^2}{(\sqrt{\mu^2 + \sigma^2}} \right) \\
-
-SD [S] &= \sqrt {\ln \left(1 + \dfrac{\sigma^2}{\mu^2} \right)}
- 
-\end{align}
-
-See [Wikipedia](https://en.wikipedia.org/wiki/Log-normal_distribution) for a 
-detailed explanation of this parametrisation.
-
-The [epiparameter](https://github.com/epiverse-trace/epiparameter) R package 
-provides the function `epiparameter::lnorm_meansd2meanlogsdlog()` for implementing 
-this parametrisation. It takes as inputs the mean, $\mu$ and standard 
-deviation, $\sigma$ and returns a list with the transformed mean and 
-standard deviation. Refer to `?epiparameter::lnorm_meansd2meanlogsdlog` 
-for more details.
-
-Let us set up the serial interval function with the appropriate inputs:
-```{r input_prep3, message=FALSE}
-mu <- 4.7
-sgma <- 2.9
-
-log_mean <- lnorm_meansd2meanlogsdlog(mu, sgma)[[1]]  # log mean
-log_sd <- lnorm_meansd2meanlogsdlog(mu, sgma)[[2]] # log sd
-
-#' serial interval function
-serial_interval <- function(sample_size) {
-  si <- rlnorm(sample_size, meanlog = log_mean, sdlog = log_sd)
-  return(si)
-}
-```
-
-### Offspring distribution
-
-The negative binomial distribution is commonly used in epidemiology to
-account for individual variation in transmissibility, 
-also known as superspreading [@lloyd-smith2005].
-
-For this example, we will assume that the offspring distribution is 
-characterised by a negative binomial with $R = 2.5$ [@abbott2020] and 
-$k = 0.58$ [@wang2020]. In this parameterization, $R$ 
-represents the $R_0$, which is defined as the average number of 
-cases produced by a single individual in an entirely susceptible population. 
-The parameter $k$ represents superspreading, that is, the degree of 
-heterogeneity in transmission by single individuals.
-
-### Simulation controls
-
-`chain_sim()` also requires the end time for the simulations. For this 
-example, we will simulate outbreaks that end 14 days after the last date 
-of observations in `covid19_sa`.   
-```{r input_prep2, message=FALSE}
-#' Date to end simulation (14 day projection in this case)
-projection_window <- 14 # 14 days/ 2-week ahead projection
-projection_end_day <- max(days_since_index) + projection_window
-projection_end_day
-```
-
-`chain_sim()` is stochastic, meaning the results are different every 
-time it is run for the same set of parameters, so we will run the simulations
-many times and summarise the results. 
-
-We will, therefore, run each simulation $100$ times.
-```{r}
-#' Number of simulations
-sim_rep <- 100
-```
-
-Lastly, `chain_sim()` requires the maximum size of each chain. 
-Above this value, the simulation is cut off. If this value is 
-not specified, it assumes a value of infinity. Here, we will
-assume a maximum chain size of $1000$.
-```{r}
-#' Maximum chain size allowed
-chain_threshold <- 1000
-```
-
-## Modelling assumptions
-
-`chain_sim()` makes the following simplifying assumptions:
-
-1. All cases are observed
-1. There is no reporting delay
-1. Reporting rate is constant through the course of the epidemic
-1. No interventions have been implemented
-1. Population is homogeneous and well-mixed
-
-To summarise the whole set up so far, we are going to simulate 
-each chain `r sim_rep` times, projecting COVID-19 cases over
-`r projection_window` days after the first $15$ days, and 
-assuming that no outbreak size exceeds `r chain_threshold` cases. 
-
-## Running the simulations
-
-We will use the function `lapply()` to run the simulations and bind them
-by rows with `dplyr::bind_rows()`.
-```{r simulations, message=FALSE}
-set.seed(1234)
-sim_chain_sizes <- lapply(
-  seq_len(sim_rep),
-  function(sim) {
-    chain_sim(
-      n = length(start_times),
-      offspring = "nbinom",
-      mu = 2.5,
-      size = 0.58,
-      stat = "size",
-      infinite = chain_threshold,
-      serial = serial_interval,
-      t0 = start_times,
-      tf = projection_end_day,
-      tree = TRUE
-    ) %>%
-      mutate(sim = sim)
-  }
-)
-
-sim_output <- bind_rows(sim_chain_sizes)
-```
-
-Let us view the first few rows of the simulation results.
-```{r sim_output_head}
-head(sim_output)
-```
-
-## Post-processing
-
-Now, we will summarise the simulation results. 
-
-We want to plot the individual simulated daily time series and show 
-the median cases per day aggregated over all simulations.
-
-First, we will create the daily time series per simulation by
-aggregating the number of cases per day of each simulation.
-```{r post_processing}
-# Daily number of cases for each simulation
-incidence_ts <- sim_output %>%
-  mutate(day = ceiling(time)) %>%
-  group_by(sim, day) %>%
-  summarise(cases = n()) %>%
-  ungroup()
-
-head(incidence_ts)
-```
-
-Next, we will add a date column to the results of each simulation 
-set. We will use the date of the first case in the observed data 
-as the reference start date.
-```{r}
-# Get start date from the observed data
-index_date <- min(covid19_sa$date)
-index_date
-
-# Add a dates column to each simulation result
-incidence_ts_by_date <- incidence_ts %>%
-  group_by(sim) %>%
-  mutate(date = index_date + days(seq(0, n() - 1))) %>%
-  ungroup()
-
-head(incidence_ts_by_date)
-```
-
-Now we will aggregate the simulations by day and evaluate the median 
-daily cases across all simulations.
-```{r}
-# Median daily number of cases aggregated across all simulations
-median_daily_cases <- incidence_ts %>%
-  group_by(day) %>%
-  summarise(median_cases = median(cases)) %>%
-  ungroup() %>%
-  arrange(day)
-
-head(median_daily_cases)
-```
-
-As was done for the individual simulations, we will add a date column in the
-same manner.
-```{r}
-# Add dates
-median_daily_cases <- median_daily_cases %>%
-  mutate(date = index_date + days(seq(0, projection_end_day))) %>%
-  ungroup()
-
-head(median_daily_cases)
-```
-
-## Visualization
-
-We will now plot the individual simulation results alongside the median
-of the aggregated results.
-```{r viz, fig.cap ="COVID-19 incidence projected over a two week window. The gray lines represent individual simulations, red connected dots represent the median daily cases across all simulations, and the black triangles represent the observed data.", fig.width=6.0, fig.height=6}
-
-ggplot(data = incidence_ts_by_date) +
-  geom_line(
-    aes(
-      x = date,
-      y = cases,
-      group = sim
-    ),
-    color = "grey",
-    linewidth = 0.2,
-    alpha = 0.25
-  ) +
-  geom_line(
-    data = median_daily_cases,
-    aes(
-      x = date,
-      y = median_cases
-    ),
-    color = "tomato3",
-    linewidth = 1.8
-  ) +
-  geom_point(
-    data = covid19_sa,
-    aes(
-      x = date,
-      y = cases
-    ),
-    color = "black",
-    size = 1.75,
-    shape = 21
-  ) +
-  geom_line(
-    data = covid19_sa,
-    aes(
-      x = date,
-      y = cases
-    ),
-    color = "black",
-    size = 1.75,
-    shape = 21
-  ) +
-  scale_x_continuous(
-    breaks = seq(
-      min(incidence_ts_by_date$date),
-      max(incidence_ts_by_date$date),
-      10
-    ),
-    labels = seq(
-      min(incidence_ts_by_date$date),
-      max(incidence_ts_by_date$date),
-      10
-    )
-  ) +
-  scale_y_continuous(
-    breaks = seq(
-      0,
-      max(incidence_ts_by_date$cases) + 200,
-      250
-    ),
-    labels = seq(
-      0,
-      max(incidence_ts_by_date$cases) + 200,
-      250
-    )
-  ) +
-  labs(x = "Date", y = "Daily cases (median)")
-```
-## References

From c20a3817110b1c1e799ec154eb8b2b18517f43c3 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 16:08:45 +0100
Subject: [PATCH 331/828] Deleted references file (for now)

---
 vignettes/references.json | 794 --------------------------------------
 1 file changed, 794 deletions(-)
 delete mode 100644 vignettes/references.json

diff --git a/vignettes/references.json b/vignettes/references.json
deleted file mode 100644
index dcbb4440..00000000
--- a/vignettes/references.json
+++ /dev/null
@@ -1,794 +0,0 @@
-[
-	{
-		"id": "abbott2020",
-		"type": "article-journal",
-		"container-title": "Wellcome open research",
-		"note": "publisher: The Wellcome Trust",
-		"title": "The transmissibility of novel Coronavirus in the early stages of the 2019-20 outbreak in Wuhan: Exploring initial point-source exposure sizes and durations using scenario analysis",
-		"volume": "5",
-		"author": [
-			{
-				"family": "Abbott",
-				"given": "Sam"
-			},
-			{
-				"family": "Hellewell",
-				"given": "Joel"
-			},
-			{
-				"family": "Munday",
-				"given": "James"
-			},
-			{
-				"family": "Funk",
-				"given": "Sebastian"
-			},
-			{
-				"family": "group",
-				"given": "CMMID",
-				"dropping-particle": "nCoV working"
-			},
-			{
-				"literal": "others"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2020"
-				]
-			]
-		}
-	},
-	{
-		"id": "alene2021",
-		"type": "article-journal",
-		"abstract": "Background: Understanding the epidemiological parameters that determine the transmission dynamics of COVID-19 is essential for public health intervention. Globally, a number of studies were conducted to estimate the average serial interval and incubation period of COVID-19. Combining findings of existing studies that estimate the average serial interval and incubation period of COVID-19 significantly improves the quality of evidence. Hence, this study aimed to determine the overall average serial interval and incubation period of COVID-19. Methods: We followed the PRISMA checklist to present this study. A comprehensive search strategy was carried out from international electronic databases (Google Scholar, PubMed, Science Direct, Web of Science, CINAHL, and Cochrane Library) by two experienced reviewers (MAA and DBK) authors between the 1st of June and the 31st of July 2020. All observational studies either reporting the serial interval or incubation period in persons diagnosed with COVID-19 were included in this study. Heterogeneity across studies was assessed using the I2 and Higgins test. The NOS adapted for cross-sectional studies was used to evaluate the quality of studies. A random effect Meta-analysis was employed to determine the pooled estimate with 95% (CI). Microsoft Excel was used for data extraction and R software was used for analysis. Results: We combined a total of 23 studies to estimate the overall mean serial interval of COVID-19. The mean serial interval of COVID-19 ranged from 4. 2 to 7.5 days. Our meta-analysis showed that the weighted pooled mean serial interval of COVID-19 was 5.2 (95%CI: 4.9–5.5) days. Additionally, to pool the mean incubation period of COVID-19, we included 14 articles. The mean incubation period of COVID-19 also ranged from 4.8 to 9 days. Accordingly, the weighted pooled mean incubation period of COVID-19 was 6.5 (95%CI: 5.9–7.1) days. Conclusions: This systematic review and meta-analysis showed that the weighted pooled mean serial interval and incubation period of COVID-19 were 5.2, and 6.5 days, respectively. In this study, the average serial interval of COVID-19 is shorter than the average incubation period, which suggests that substantial numbers of COVID-19 cases will be attributed to presymptomatic transmission.",
-		"container-title": "BMC Infectious Diseases",
-		"DOI": "10.1186/s12879-021-05950-x",
-		"ISSN": "14712334",
-		"issue": "1",
-		"note": "publisher: BMC Infectious Diseases\nPMID: 33706702",
-		"page": "1–9",
-		"title": "Serial interval and incubation period of COVID-19: a systematic review and meta-analysis",
-		"volume": "21",
-		"author": [
-			{
-				"family": "Alene",
-				"given": "Muluneh"
-			},
-			{
-				"family": "Yismaw",
-				"given": "Leltework"
-			},
-			{
-				"family": "Assemie",
-				"given": "Moges Agazhe"
-			},
-			{
-				"family": "Ketema",
-				"given": "Daniel Bekele"
-			},
-			{
-				"family": "Gietaneh",
-				"given": "Wodaje"
-			},
-			{
-				"family": "Birhan",
-				"given": "Tilahun Yemanu"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2021"
-				]
-			]
-		}
-	},
-	{
-		"id": "allen2012",
-		"type": "article-journal",
-		"abstract": "The basic reproduction number, ℛ(0), one of the most well-known thresholds in deterministic epidemic theory, predicts a disease outbreak if ℛ(0)>1. In stochastic epidemic theory, there are also thresholds that predict a major outbreak. In the case of a single infectious group, if ℛ(0)>1 and i infectious individuals are introduced into a susceptible population, then the probability of a major outbreak is approximately 1-(1/ℛ(0))( i ). With multiple infectious groups from which the disease could emerge, this result no longer holds. Stochastic thresholds for multiple groups depend on the number of individuals within each group, i ( j ), j=1, \\ldots, n, and on the probability of disease extinction for each group, q ( j ). It follows from multitype branching processes that the probability of a major outbreak is approximately [Formula: see text]. In this investigation, we summarize some of the deterministic and stochastic threshold theory, illustrate how to calculate the stochastic thresholds, and derive some new relationships between the deterministic and stochastic thresholds.",
-		"container-title": "Journal of Biological Dynamics",
-		"DOI": "10.1080/17513758.2012.665502",
-		"ISSN": "17513758",
-		"issue": "2",
-		"page": "590–611",
-		"title": "Extinction thresholds in deterministic and stochastic epidemic models",
-		"volume": "6",
-		"author": [
-			{
-				"family": "Allen",
-				"given": "Linda J.S."
-			},
-			{
-				"family": "Lahodny",
-				"given": "Glenn E."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2012"
-				]
-			]
-		}
-	},
-	{
-		"id": "blumberg2013",
-		"type": "article-journal",
-		"abstract": "Many diseases exhibit subcritical transmission (i.e. 0<R0<1) so that infections occur as self-limited 'stuttering chains'. Given an ensemble of stuttering chains, information about the number of cases in each chain can be used to infer R0, which is of crucial importance for monitoring the risk that a disease will emerge to establish endemic circulation. However, the challenge of imperfect case detection has led authors to adopt a variety of work-around measures when inferring R0, such as discarding data on isolated cases or aggregating intermediate-sized chains together. Each of these methods has the potential to introduce bias, but a quantitative comparison of these approaches has not been reported. By adapting a model based on a negative binomial offspring distribution that permits a variable degree of transmission heterogeneity, we present a unified analysis of existing R0 estimation methods. Simulation studies show that the degree of transmission heterogeneity, when improperly modeled, can significantly impact the bias of R0 estimation methods designed for imperfect observation. These studies also highlight the importance of isolated cases in assessing whether an estimation technique is consistent with observed data. Analysis of data from measles outbreaks shows that likelihood scores are highest for models that allow a flexible degree of transmission heterogeneity. Aggregating intermediate sized chains often has similar performance to analyzing a complete chain size distribution. However, truncating isolated cases is beneficial only when surveillance systems clearly favor full observation of large chains but not small chains. Meanwhile, if data on the type and proportion of cases that are unobserved were known, we demonstrate that maximum likelihood inference of R0 could be adjusted accordingly. This motivates the need for future empirical and theoretical work to quantify observation error and incorporate relevant mechanisms into stuttering chain models used to estimate transmission parameters. © 2013 Elsevier B.V.",
-		"container-title": "Epidemics",
-		"DOI": "10.1016/j.epidem.2013.05.002",
-		"ISSN": "17554365",
-		"issue": "3",
-		"note": "publisher: Elsevier B.V.\nPMID: 24021520",
-		"page": "131–145",
-		"title": "Comparing methods for estimating R0 from the size distribution of subcritical transmission chains",
-		"URL": "http://dx.doi.org/10.1016/j.epidem.2013.05.002",
-		"volume": "5",
-		"author": [
-			{
-				"family": "Blumberg",
-				"given": "S."
-			},
-			{
-				"family": "Lloyd-Smith",
-				"given": "J. O."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2013"
-				]
-			]
-		}
-	},
-	{
-		"id": "blumberg2013a",
-		"type": "article-journal",
-		"abstract": "For many infectious disease processes such as emerging zoonoses and vaccine-preventable diseases, 0<R0<1 and infections occur as self-limited stuttering transmission chains. A mechanistic understanding of transmission is essential for characterizing the risk of emerging diseases and monitoring spatio-temporal dynamics. Thus methods for inferring R0 and the degree of heterogeneity in transmission from stuttering chain data have important applications in disease surveillance and management. Previous researchers have used chain size distributions to infer R0, but estimation of the degree of individual-level variation in infectiousness (as quantified by the dispersion parameter, k) has typically required contact tracing data. Utilizing branching process theory along with a negative binomial offspring distribution, we demonstrate how maximum likelihood estimation can be applied to chain size data to infer both R0 and the dispersion parameter that characterizes heterogeneity. While the maximum likelihood value for R0 is a simple function of the average chain size, the associated confidence intervals are dependent on the inferred degree of transmission heterogeneity. As demonstrated for monkeypox data from the Democratic Republic of Congo, this impacts when a statistically significant change in R0 is detectable. In addition, by allowing for superspreading events, inference of k shifts the threshold above which a transmission chain should be considered anomalously large for a given value of R0 (thus reducing the probability of false alarms about pathogen adaptation). Our analysis of monkeypox also clarifies the various ways that imperfect observation can impact inference of transmission parameters, and highlights the need to quantitatively evaluate whether observation is likely to significantly bias results.",
-		"container-title": "PLoS Computational Biology",
-		"DOI": "10.1371/journal.pcbi.1002993",
-		"ISSN": "15537358",
-		"issue": "5",
-		"note": "PMID: 23658504",
-		"page": "1–17",
-		"title": "Inference of R0 and Transmission Heterogeneity from the Size Distribution of Stuttering Chains",
-		"volume": "9",
-		"author": [
-			{
-				"family": "Blumberg",
-				"given": "Seth"
-			},
-			{
-				"family": "Lloyd-Smith",
-				"given": "James O."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2013"
-				]
-			]
-		}
-	},
-	{
-		"id": "chen2022",
-		"type": "article-journal",
-		"abstract": "The generation time distribution, reflecting the time between successive infections in transmission chains, is a key epidemiological parameter for describing COVID-19 transmission dynamics. However, because exact infection times are rarely known, it is often approximated by the serial interval distribution. This approximation holds under the assumption that infectors and infectees share the same incubation period distribution, which may not always be true. We estimated incubation period and serial interval distributions using 629 transmission pairs reconstructed by investigating 2989 confirmed cases in China in January-February 2020, and developed an inferential framework to estimate the generation time distribution that accounts for variation over time due to changes in epidemiology, sampling biases and public health and social measures. We identified substantial reductions over time in the serial interval and generation time distributions. Our proposed method provides more reliable estimation of the temporal variation in the generation time distribution, improving assessment of transmission dynamics.",
-		"container-title": "Nature Communications",
-		"DOI": "10.1038/s41467-022-35496-8",
-		"ISSN": "20411723",
-		"issue": "1",
-		"note": "publisher: Springer US",
-		"title": "Inferring time-varying generation time, serial interval, and incubation period distributions for COVID-19",
-		"volume": "13",
-		"author": [
-			{
-				"family": "Chen",
-				"given": "Dongxuan"
-			},
-			{
-				"family": "Lau",
-				"given": "Yiu Chung"
-			},
-			{
-				"family": "Xu",
-				"given": "Xiao Ke"
-			},
-			{
-				"family": "Wang",
-				"given": "Lin"
-			},
-			{
-				"family": "Du",
-				"given": "Zhanwei"
-			},
-			{
-				"family": "Tsang",
-				"given": "Tim K."
-			},
-			{
-				"family": "Wu",
-				"given": "Peng"
-			},
-			{
-				"family": "Lau",
-				"given": "Eric H.Y."
-			},
-			{
-				"family": "Wallinga",
-				"given": "Jacco"
-			},
-			{
-				"family": "Cowling",
-				"given": "Benjamin J."
-			},
-			{
-				"family": "Ali",
-				"given": "Sheikh Taslim"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2022"
-				]
-			]
-		}
-	},
-	{
-		"id": "farrington1999",
-		"type": "article-journal",
-		"abstract": "We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.",
-		"container-title": "Journal of Applied Probability",
-		"DOI": "10.1239/jap/1032374633",
-		"ISSN": "00219002",
-		"issue": "3",
-		"page": "771–779",
-		"title": "The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease",
-		"volume": "36",
-		"author": [
-			{
-				"family": "Farrington",
-				"given": "C. P."
-			},
-			{
-				"family": "Grant",
-				"given": "A. D."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"1999"
-				]
-			]
-		}
-	},
-	{
-		"id": "farrington2003",
-		"type": "article-journal",
-		"abstract": "Mass vaccination programmes aim to maintain the effective reproduction number R of an infection below unity. We describe methods for monitoring the value of R using surveillance data. The models are based on branching processes in which R is identified with the offspring mean. We derive unconditional likelihoods for the offspring mean using data on outbreak size and outbreak duration. We also discuss Bayesian methods, implemented by Metropolis-Hastings sampling. We investigate by simulation the validity of the models with respect to depletion of susceptibles and under-ascertainment of cases. The methods are illustrated using surveillance data on measles in the USA.",
-		"container-title": "Biostatistics (Oxford, England)",
-		"DOI": "10.1093/biostatistics/4.2.279",
-		"ISSN": "14654644",
-		"issue": "2",
-		"page": "279–295",
-		"title": "Branching process models for surveillance of infectious diseases controlled by mass vaccination.",
-		"volume": "4",
-		"author": [
-			{
-				"family": "Farrington",
-				"given": "C. P."
-			},
-			{
-				"family": "Kanaan",
-				"given": "M. N."
-			},
-			{
-				"family": "Gay",
-				"given": "N. J."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2003"
-				]
-			]
-		}
-	},
-	{
-		"id": "fine2003",
-		"type": "article-journal",
-		"abstract": "The interval between successive cases of an infectious disease is determined by the time from infection to infectiousness, the duration of infectiousness, the time from infection to disease onset (incubation period), the duration of any extra-human phase of the infectious agent, and the proportion clinically affected among infected individuals. The interval is important in the interpretation of infectious disease surveillance and trend data, in the identification of outbreaks, and in the optimization of quarantine and contact tracing. This paper discusses the properties of these intervals, as measured between transmission events or between clinical onsets of successive infected individuals, noting the determinants of their ranges and frequency distributions, the circumstances under which secondary cases may arise before primaries, and under which the infection transmission interval will be different from the interval between clinical onsets of successive cases. It discusses the derivation of interval distribution statistics from descriptive data given in standard textbooks, with illustrations from published data on outbreaks, households, and epidemiologic tracing. Finally, it discusses the implications of such measures for studies of secondary attack rates, for the persistence of infection in human communities, for outbreak response, and for elimination or eradication programs.",
-		"container-title": "American Journal of Epidemiology",
-		"DOI": "10.1093/aje/kwg251",
-		"ISSN": "00029262",
-		"issue": "11",
-		"note": "ISBN: 0002-9262 (Print) 0002-9262 (Linking)\nPMID: 14630599",
-		"page": "1039–1047",
-		"title": "The Interval between Successive Cases of an Infectious Disease",
-		"volume": "158",
-		"author": [
-			{
-				"family": "Fine",
-				"given": "Paul E.M."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2003"
-				]
-			]
-		}
-	},
-	{
-		"id": "grassly2006",
-		"type": "article-journal",
-		"abstract": "Seasonal change in the incidence of infectious diseases is a common phenomenon in both temperate and tropical climates. However, the mechanisms responsible for seasonal disease incidence, and the epidemiological consequences of seasonality, are poorly understood with rare exception. Standard epidemiological theory and concepts such as the basic reproductive number R 0 no longer apply, and the implications for interventions that themselves may be periodic, such as pulse vaccination, have not been formally examined. This paper examines the causes and consequences of seasonality, and in so doing derives several new results concerning vaccination strategy and the interpretation of disease outbreak data. It begins with a brief review of published scientific studies in support of different causes of seasonality in infectious diseases of humans, identifying four principal mechanisms and their association with different routes of transmission. It then describes the consequences of seasonality for R 0 , disease outbreaks, endemic dynamics and persistence. Finally, a mathematical analysis of routine and pulse vaccination programmes for seasonal infections is presented. The synthesis of seasonal infectious disease epidemiology attempted by this paper highlights the need for further empirical and theoretical work. © 2006 The Royal Society.",
-		"container-title": "Proceedings of the Royal Society B: Biological Sciences",
-		"DOI": "10.1098/rspb.2006.3604",
-		"ISSN": "14712970",
-		"issue": "1600",
-		"page": "2541–2550",
-		"title": "Seasonal infectious disease epidemiology",
-		"volume": "273",
-		"author": [
-			{
-				"family": "Grassly",
-				"given": "Nicholas C."
-			},
-			{
-				"family": "Fraser",
-				"given": "Christophe"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2006"
-				]
-			]
-		}
-	},
-	{
-		"id": "griffin2020",
-		"type": "article-journal",
-		"abstract": "The serial interval is the time between symptom onsets in an infector-infectee pair. The generation time, also known as the generation interval, is the time between infection events in an infector-infectee pair. The serial interval and the generation time are key parameters for assessing the dynamics of a disease. A number of scientific papers reported information pertaining to the serial interval and/or generation time for COVID-19. Objective Conduct a review of available evidence to advise on appropriate parameter values for serial interval and generation time in national COVID-19 transmission models for Ireland and on methodological issues relating to those parameters. Methods We conducted a rapid review of the literature covering the period 1 January 2020 and 21 August 2020, following predefined eligibility criteria. Forty scientific papers met our inclusion criteria and were included in the review. Results The mean of the serial interval ranged from 3.03 to 7.6 days, based on 38 estimates, and the median from 1.0 to 6.0 days (based on 15 estimates). Only three estimates were provided for the mean of the generation time. These ranged from 3.95 to 5.20 days. One estimate of 5.0 days was provided for the median of the generation time. Discussion Estimates of the serial interval and the generation time are very dependent on the specific factors that apply at the time that the data are collected, including the level of social contact. Consequently, the estimates may not be entirely relevant to other environments. Therefore, local estimates should be obtained as soon as possible. Careful consideration should be given to the methodology that is used. Real-time estimations of the serial interval/generation time, allowing for variations over time, may provide more accurate estimates of reproduction numbers than using conventionally fixed serial interval/generation time distributions.",
-		"container-title": "BMJ Open",
-		"DOI": "10.1136/bmjopen-2020-040263",
-		"ISSN": "20446055",
-		"issue": "11",
-		"note": "ISBN: 9789241512763\nPMID: 33234640",
-		"page": "1–9",
-		"title": "Rapid review of available evidence on the serial interval and generation time of COVID-19",
-		"volume": "10",
-		"author": [
-			{
-				"family": "Griffin",
-				"given": "John"
-			},
-			{
-				"family": "Casey",
-				"given": "Miriam"
-			},
-			{
-				"family": "Collins",
-				"given": "Áine"
-			},
-			{
-				"family": "Hunt",
-				"given": "Kevin"
-			},
-			{
-				"family": "McEvoy",
-				"given": "David"
-			},
-			{
-				"family": "Byrne",
-				"given": "Andrew"
-			},
-			{
-				"family": "McAloon",
-				"given": "Conor"
-			},
-			{
-				"family": "Barber",
-				"given": "Ann"
-			},
-			{
-				"family": "Lane",
-				"given": "Elizabeth Ann"
-			},
-			{
-				"family": "More",
-				"given": "Simon"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2020"
-				]
-			]
-		}
-	},
-	{
-		"id": "jacob2010",
-		"type": "article-journal",
-		"abstract": "Branching processes are stochastic individual-based processes leading consequently to a bottom-up approach. In addition, since the state variables are random integer variables (representing population sizes), the extinction occurs at random finite time on the extinction set, thus leading to fine and realistic predictions. Starting from the simplest and well-known single-type Bienaymé-Galton-Watson branching process that was used by several authors for approximating the beginning of an epidemic, we then present a general branching model with age and population dependent individual transitions. However contrary to the classical Bienaymé-Galton-Watson or asymptotically Bienaymé-Galton-Watson setting, where the asymptotic behavior of the process, as time tends to infinity, is well understood, the asymptotic behavior of this general process is a new question. Here we give some solutions for dealing with this problem depending on whether the initial population size is large or small, and whether the disease is rare or non-rare when the initial population size is large.",
-		"container-title": "International Journal of Environmental Research and Public Health",
-		"DOI": "10.3390/ijerph7031204",
-		"ISSN": "16604601",
-		"issue": "3",
-		"page": "1186–1204",
-		"title": "Branching processes: Their role in epidemiology",
-		"volume": "7",
-		"author": [
-			{
-				"family": "Jacob",
-				"given": "Christine"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2010"
-				]
-			]
-		}
-	},
-	{
-		"id": "lehtinen2021",
-		"type": "article-journal",
-		"abstract": "The timing of transmission plays a key role in the dynamics and controllability of an epidemic. However, observing generation times - the time interval between the infection of an infector and an infectee in a transmission pair - requires data on infection times, which are generally unknown. The timing of symptom onset is more easily observed; generation times are therefore often estimated based on serial intervals - the time interval between symptom onset of an infector and an infectee. This estimation follows one of two approaches: (i) approximating the generation time distribution by the serial interval distribution or (ii) deriving the generation time distribution from the serial interval and incubation period - the time interval between infection and symptom onset in a single individual - distributions. These two approaches make different - and not always explicitly stated - assumptions about the relationship between infectiousness and symptoms, resulting in different generation time distributions with the same mean but unequal variances. Here, we clarify the assumptions that each approach makes and show that neither set of assumptions is plausible for most pathogens. However, the variances of the generation time distribution derived under each assumption can reasonably be considered as upper (approximation with serial interval) and lower (derivation from serial interval) bounds. Thus, we suggest a pragmatic solution is to use both approaches and treat these as edge cases in downstream analysis. We discuss the impact of the variance of the generation time distribution on the controllability of an epidemic through strategies based on contact tracing, and we show that underestimating this variance is likely to overestimate controllability.",
-		"container-title": "Journal of the Royal Society Interface",
-		"DOI": "10.1098/rsif.2020.0756",
-		"ISSN": "17425662",
-		"issue": "174",
-		"note": "PMID: 33402022",
-		"title": "On the relationship between serial interval, infectiousness profile and generation time: On the relationship between serial interval, infectiousness profile and generation time",
-		"volume": "18",
-		"author": [
-			{
-				"family": "Lehtinen",
-				"given": "Sonja"
-			},
-			{
-				"family": "Ashcroft",
-				"given": "Peter"
-			},
-			{
-				"family": "Bonhoeffer",
-				"given": "Sebastian"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2021"
-				]
-			]
-		}
-	},
-	{
-		"id": "limpert2001",
-		"type": "article-journal",
-		"abstract": "On the charms of statistics, and how mechanical models resembling gambling machines offer a link to a handy way to characterize log-normal distributions, which can provide deeper insight into variability and probability - Normal or log-normal: That is the question.",
-		"container-title": "BioScience",
-		"DOI": "10.1641/0006-3568(2001)051[0341:LNDATS]2.0.CO;2",
-		"ISSN": "00063568",
-		"issue": "5",
-		"page": "341–352",
-		"title": "Log-normal distributions across the sciences: Keys and clues",
-		"volume": "51",
-		"author": [
-			{
-				"family": "Limpert",
-				"given": "Eckhard"
-			},
-			{
-				"family": "Stahel",
-				"given": "Werner A."
-			},
-			{
-				"family": "Abbt",
-				"given": "Markus"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2001"
-				]
-			]
-		}
-	},
-	{
-		"id": "lloyd-smith2005",
-		"type": "article-journal",
-		"abstract": "Population-level analyses often use average quantities to describe heterogeneous systems, particularly when variation does not arise from identifiable groups. A prominent example, central to our current understanding of epidemic spread, is the basic reproductive number, R0, which is defined as the mean number of infections caused by an infected individual in a susceptible population. Population estimates of R0 can obscure considerable individual variation in infectiousness, as highlighted during the global emergence of severe acute respiratory syndrome (SARS) by numerous 'superspreading events' in which certain individuals infected unusually large numbers of secondary cases. For diseases transmitted by non-sexual direct contacts, such as SARS or smallpox, individual variation is difficult to measure empirically, and thus its importance for outbreak dynamics has been unclear. Here we present an integrated theoretical and statistical analysis of the influence of individual variation in infectiousness on disease emergence. Using contact tracing data from eight directly transmitted diseases, we show that the distribution of individual infectiousness around R0 is often highly skewed. Model predictions accounting for this variation differ sharply from average-based approaches, with disease extinction more likely and outbreaks rarer but more explosive. Using these models, we explore implications for outbreak control, showing that individual-specific control measures outperform population-wide measures. Moreover, the dramatic improvements achieved through targeted control policies emphasize the need to identify predictive correlates of higher infectiousness. Our findings indicate that superspreading is a normal feature of disease spread, and to frame ongoing discussion we propose a rigorous definition for superspreading events and a method to predict their frequency. © 2005 Nature Publishing Group.",
-		"container-title": "Nature",
-		"DOI": "10.1038/nature04153",
-		"ISSN": "14764687",
-		"issue": "7066",
-		"note": "PMID: 16292310",
-		"page": "355–359",
-		"title": "Superspreading and the effect of individual variation on disease emergence",
-		"volume": "438",
-		"author": [
-			{
-				"family": "Lloyd-Smith",
-				"given": "J. O."
-			},
-			{
-				"family": "Schreiber",
-				"given": "S. J."
-			},
-			{
-				"family": "Kopp",
-				"given": "P. E."
-			},
-			{
-				"family": "Getz",
-				"given": "W. M."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2005"
-				]
-			]
-		}
-	},
-	{
-		"id": "marivate2020",
-		"type": "article-journal",
-		"container-title": "arXiv preprint arXiv:2004.04813",
-		"title": "Use of available data to inform the COVID-19 outbreak in South Africa: a case study",
-		"author": [
-			{
-				"family": "Marivate",
-				"given": "Vukosi"
-			},
-			{
-				"family": "Combrink",
-				"given": "Herkulaas MvE"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2020"
-				]
-			]
-		}
-	},
-	{
-		"id": "nishiura2007",
-		"type": "article-journal",
-		"abstract": "The incubation period of infectious diseases, the time from infection with a microorganism to onset of disease, is directly relevant to prevention and control. Since explicit models of the incubation period enhance our understanding of the spread of disease, previous classic studies were revisited, focusing on the modeling methods employed and paying particular attention to relatively unknown historical efforts. The earliest study on the incubation period of pandemic influenza was published in 1919, providing estimates of the incubation period of Spanish flu using the daily incidence on ships departing from several ports in Australia. Although the study explicitly dealt with an unknown time of exposure, the assumed periods of exposure, which had an equal probability of infection, were too long, and thus, likely resulted in slight underestimates of the incubation period. After the suggestion that the incubation period follows lognormal distribution, Japanese epidemiologists extended this assumption to estimates of the time of exposure during a point source outbreak. Although the reason why the incubation period of acute infectious diseases tends to reveal a right-skewed distribution has been explored several times, the validity of the lognormal assumption is yet to be fully clarified. At present, various different distributions are assumed, and the lack of validity in assuming lognormal distribution is particularly apparent in the case of slowly progressing diseases. The present paper indicates that (1) analysis using well-defined short periods of exposure with appropriate statistical methods is critical when the exact time of exposure is unknown, and (2) when assuming a specific distribution for the incubation period, comparisons using different distributions are needed in addition to estimations using different datasets, analyses of the determinants of incubation period, and an understanding of the underlying disease mechanisms. © 2007 Nishiura; licensee BioMed Central Ltd.",
-		"container-title": "Emerging Themes in Epidemiology",
-		"DOI": "10.1186/1742-7622-4-2",
-		"ISSN": "17427622",
-		"page": "1–12",
-		"title": "Early efforts in modeling the incubation period of infectious diseases with an acute course of illness",
-		"volume": "4",
-		"author": [
-			{
-				"family": "Nishiura",
-				"given": "Hiroshi"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2007"
-				]
-			]
-		}
-	},
-	{
-		"id": "nishiura2012",
-		"type": "article-journal",
-		"abstract": "Use of the final size distribution of minor outbreaks for the estimation of the reproduction numbers of supercritical epidemic processes has yet to be considered. We used a branching process model to derive the final size distribution of minor outbreaks, assuming a reproduction number above unity, and applying the method to final size data for pneumonic plague. Pneumonic plague is a rare disease with only one documented major epidemic in a spatially limited setting. Because the final size distribution of a minor outbreak needs to be normalized by the probability of extinction, we assume that the dispersion parameter (k) of the negative-binomial offspring distribution is known, and examine the sensitivity of the reproduction number to variation in dispersion. Assuming a geometric offspring distribution with k=1, the reproduction number was estimated at 1.16 (95% confidence interval: 0.97-1.38). When less dispersed with k=2, the maximum likelihood estimate of the reproduction number was 1.14. These estimates agreed with those published from transmission network analysis, indicating that the human-to-human transmission potential of the pneumonic plague is not very high. Given only minor outbreaks, transmission potential is not sufficiently assessed by directly counting the number of offspring. Since the absence of a major epidemic does not guarantee a subcritical process, the proposed method allows us to conservatively regard epidemic data from minor outbreaks as supercritical, and yield estimates of threshold values above unity. © 2011.",
-		"container-title": "Journal of Theoretical Biology",
-		"DOI": "10.1016/j.jtbi.2011.10.039",
-		"ISSN": "00225193",
-		"note": "publisher: Elsevier\nPMID: 22079419",
-		"page": "48–55",
-		"title": "Estimating the transmission potential of supercritical processes based on the final size distribution of minor outbreaks",
-		"URL": "http://dx.doi.org/10.1016/j.jtbi.2011.10.039",
-		"volume": "294",
-		"author": [
-			{
-				"family": "Nishiura",
-				"given": "Hiroshi"
-			},
-			{
-				"family": "Yan",
-				"given": "Ping"
-			},
-			{
-				"family": "Sleeman",
-				"given": "Candace K."
-			},
-			{
-				"family": "Mode",
-				"given": "Charles J."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2012"
-				]
-			]
-		}
-	},
-	{
-		"id": "pearson2020",
-		"type": "article-journal",
-		"abstract": "For 45 African countries/territories already reporting COVID-19 cases before 23 March 2020, we estimate the dates of reporting 1,000 and 10,000 cases. Assuming early epidemic trends without interventions, all 45 were likely to exceed 1,000 confirmed cases by the end of April 2020, with most exceeding 10,000 a few weeks later.",
-		"container-title": "Eurosurveillance",
-		"DOI": "10.2807/1560-7917.ES.2020.25.18.2000543",
-		"ISSN": "15607917",
-		"issue": "18",
-		"note": "publisher: European Centre for Disease Prevention and Control (ECDC)\nPMID: 32400361",
-		"page": "1–6",
-		"title": "Projected early spread of COVID-19 in Africa through 1 June 2020",
-		"URL": "http://dx.doi.org/10.2807/1560-7917.ES.2020.25.18.2000543",
-		"volume": "25",
-		"author": [
-			{
-				"family": "Pearson",
-				"given": "Carl A.B."
-			},
-			{
-				"family": "Schalkwyk",
-				"given": "Cari",
-				"non-dropping-particle": "van"
-			},
-			{
-				"family": "Foss",
-				"given": "Anna M."
-			},
-			{
-				"family": "O'Reilly",
-				"given": "Kathleen M."
-			},
-			{
-				"family": "Pulliam",
-				"given": "Juliet R.C."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2020"
-				]
-			]
-		}
-	},
-	{
-		"id": "becker1977",
-		"type": "article-journal",
-		"container-title": "Biometrics",
-		"ISSN": "0006-341X",
-		"issue": "3",
-		"note": "publisher: JSTOR",
-		"page": "515–522",
-		"title": "Estimation for discrete time branching processes with application to epidemics",
-		"volume": "33",
-		"author": [
-			{
-				"family": "Becker",
-				"given": "Niels"
-			},
-			{
-				"family": "Society",
-				"given": "International Biometric"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"1977"
-				]
-			]
-		}
-	},
-	{
-		"id": "wang2020",
-		"type": "article-journal",
-		"abstract": "Coronavirus disease 2019 (COVID-19) was first identified in late 2019 in Wuhan, Hubei Province, China and spread globally in months, sparking worldwide concern. However, it is unclear whether super-spreading events occurred during the early outbreak phase, as has been observed for other emerging viruses. Here, we analyse 208 publicly available SARS-CoV-2 genome sequences collected during the early outbreak phase. We combine phylogenetic analysis with Bayesian inference under an epidemiological model to trace person-to-person transmission. The dispersion parameter of the offspring distribution in the inferred transmission chain was estimated to be 0.23 (95% CI: 0.13–0.38), indicating there are individuals who directly infected a disproportionately large number of people. Our results showed that super-spreading events played an important role in the early stage of the COVID-19 outbreak.",
-		"container-title": "Nature Communications",
-		"DOI": "10.1038/s41467-020-18836-4",
-		"ISSN": "20411723",
-		"issue": "1",
-		"note": "publisher: Springer US\nPMID: 33024095",
-		"page": "1–6",
-		"title": "Inference of person-to-person transmission of COVID-19 reveals hidden super-spreading events during the early outbreak phase",
-		"URL": "http://dx.doi.org/10.1038/s41467-020-18836-4",
-		"volume": "11",
-		"author": [
-			{
-				"family": "Wang",
-				"given": "Liang"
-			},
-			{
-				"family": "Didelot",
-				"given": "Xavier"
-			},
-			{
-				"family": "Yang",
-				"given": "Jing"
-			},
-			{
-				"family": "Wong",
-				"given": "Gary"
-			},
-			{
-				"family": "Shi",
-				"given": "Yi"
-			},
-			{
-				"family": "Liu",
-				"given": "Wenjun"
-			},
-			{
-				"family": "Gao",
-				"given": "George F."
-			},
-			{
-				"family": "Bi",
-				"given": "Yuhai"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2020"
-				]
-			]
-		}
-	},
-	{
-		"id": "yadav2021",
-		"type": "article-journal",
-		"abstract": "In this review, we have discussed the different statistical modeling and prediction techniques for various infectious diseases including the recent pandemic of COVID-19. The distribution fitting, time series modeling along with predictive monitoring approaches, and epidemiological modeling are illustrated. When the epidemiology data is sufficient to fit with the required sample size, the normal distribution in general or other theoretical distributions are fitted and the best-fitted distribution is chosen for the prediction of the spread of the disease. The infectious diseases develop over time and we have data on the single variable that is the number of infections that happened, therefore, time series models are fitted and the prediction is done based on the best-fitted model. Monitoring approaches may also be applied to time series models which could estimate the parameters more precisely. In epidemiological modeling, more biological parameters are incorporated in the models and the forecasting of the disease spread is carried out. We came up with, how to improve the existing modeling methods, the use of fuzzy variables, and detection of fraud in the available data. Ultimately, we have reviewed the results of recent statistical modeling efforts to predict the course of COVID-19 spread.",
-		"container-title": "Frontiers in Public Health",
-		"DOI": "10.3389/fpubh.2021.645405",
-		"ISSN": "22962565",
-		"issue": "June",
-		"note": "PMID: 34222166",
-		"page": "1–27",
-		"title": "Statistical Modeling for the Prediction of Infectious Disease Dissemination With Special Reference to COVID-19 Spread",
-		"volume": "9",
-		"author": [
-			{
-				"family": "Yadav",
-				"given": "Subhash Kumar"
-			},
-			{
-				"family": "Akhter",
-				"given": "Yusuf"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2021"
-				]
-			]
-		}
-	}
-]
\ No newline at end of file

From 986228b2a4b2c6ac5bfd40f00fb1e2ba29c488f6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:18:39 +0100
Subject: [PATCH 332/828] Regenerated roxygen docs

---
 man/construct_offspring_ll_name.Rd | 29 +++++++++++++++++++++++++++++
 man/get_chain_statistic_func.Rd    | 22 ++++++++++++++++++++++
 2 files changed, 51 insertions(+)
 create mode 100644 man/construct_offspring_ll_name.Rd
 create mode 100644 man/get_chain_statistic_func.Rd

diff --git a/man/construct_offspring_ll_name.Rd b/man/construct_offspring_ll_name.Rd
new file mode 100644
index 00000000..b6f5a91f
--- /dev/null
+++ b/man/construct_offspring_ll_name.Rd
@@ -0,0 +1,29 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/helpers.R
+\name{construct_offspring_ll_name}
+\alias{construct_offspring_ll_name}
+\title{Construct name of analytical function for estimating loglikelihood of
+offspring}
+\usage{
+construct_offspring_ll_name(offspring_sampler, chain_statistic)
+}
+\arguments{
+\item{offspring_sampler}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers)}
+
+\item{chain_statistic}{String; Statistic to calculate. Can be one of:
+\itemize{
+\item "size": the total number of offspring.
+\item "length": the total number of ancestors.
+}}
+}
+\value{
+an analytical offspring likelihood function
+}
+\description{
+Construct name of analytical function for estimating loglikelihood of
+offspring
+}
+\keyword{internal}
diff --git a/man/get_chain_statistic_func.Rd b/man/get_chain_statistic_func.Rd
new file mode 100644
index 00000000..3fad9d5f
--- /dev/null
+++ b/man/get_chain_statistic_func.Rd
@@ -0,0 +1,22 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/helpers.R
+\name{get_chain_statistic_func}
+\alias{get_chain_statistic_func}
+\title{Return a function for calculating chain statistics}
+\usage{
+get_chain_statistic_func(chain_statistic)
+}
+\arguments{
+\item{chain_statistic}{String; Statistic to calculate. Can be one of:
+\itemize{
+\item "size": the total number of offspring.
+\item "length": the total number of ancestors.
+}}
+}
+\value{
+a function for calculating chain statistics
+}
+\description{
+Return a function for calculating chain statistics
+}
+\keyword{internal}

From d30a8640ed5988a3134403113edef5389daa0349 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:19:10 +0100
Subject: [PATCH 333/828] Regenerated docs for offspring_ll

---
 man/offspring_ll.Rd | 9 ---------
 1 file changed, 9 deletions(-)

diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index 1280c21a..8556f5b1 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -44,13 +44,4 @@ cumulative distribution function (ecdf).
 \author{
 Sebastian Funk
 }
-\keyword{Compute}
-\keyword{Cumulative}
-\keyword{Distribution}
-\keyword{Function}
-\keyword{chains}
-\keyword{empirical}
 \keyword{internal}
-\keyword{of}
-\keyword{simulated}
-\keyword{the}

From 35fe0d612131f7e77091a23d5a647f9174e509f7 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:19:34 +0100
Subject: [PATCH 334/828] Added two more helper functions

---
 R/helpers.R | 29 +++++++++++++++++++++++++++++
 1 file changed, 29 insertions(+)

diff --git a/R/helpers.R b/R/helpers.R
index bd4fc844..f0d11f01 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -56,3 +56,32 @@ get_offspring_func <- function(offspring_sampler, n, susc, pop,
     stop("offspring_sampler must either be 'pois' or 'nbinom'")
   }
 }
+
+
+
+#' Return a function for calculating chain statistics
+#'
+#' @inheritParams simulate_tree
+#'
+#' @return a function for calculating chain statistics
+#' @keywords internal
+get_chain_statistic_func <- function(chain_statistic){
+  func <- if (chain_statistic == "size") {
+    rbinom_size
+  } else if (chain_statistic == "length") {
+    rgen_length
+  }
+  return(func)
+}
+
+#' Construct name of analytical function for estimating loglikelihood of
+#' offspring
+#'
+#' @inheritParams simulate_tree
+#'
+#' @return an analytical offspring likelihood function
+#' @keywords internal
+construct_offspring_ll_name <- function(offspring_sampler, chain_statistic){
+  ll_name <- paste(offspring_sampler, chain_statistic, "ll", sep = "_")
+  return(ll_name)
+}

From 66b2a93ffd2f5e05068060532773957ffa3bc13e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:20:08 +0100
Subject: [PATCH 335/828] Replaced a wrong call of simulate_tree() with
 simulate_vect()

---
 R/likelihoods.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index c30d49a6..521052c9 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -108,7 +108,7 @@ offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
                          nsim_offspring = 100, log_trans = TRUE, ...) {
 
   # Simulate the chains
-  chains <- simulate_tree(nsim_offspring, offspring_sampler,
+  chains <- simulate_vect(nsim_offspring, offspring_sampler,
                           chain_statistic, ...)
 
   # Compute the empirical Cumulative Distribution Function of the

From 54c72ca8758e6f0d395f1490ff239254ffbc3c72 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:20:53 +0100
Subject: [PATCH 336/828] Added a log_trans argument for log-transforming the
 likelihoods

---
 R/likelihood_estimation.R | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index 7b02a509..83eaab24 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -92,7 +92,8 @@ estimate_likelihood <- function(chains_observed,
         offspring_ll,
         c(list(
           chains_observed = calc_sizes, offspring_sampler = offspring_sampler,
-          chain_statistic = chain_statistic, chain_stat_max = chain_stat_max
+          chain_statistic = chain_statistic, chain_stat_max = chain_stat_max,
+          log_trans = log_trans
         ), pars)
       )
   }

From ed1fad35db23842c0b1c14912616c032f60ac5f9 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:21:20 +0100
Subject: [PATCH 337/828] Replaced input checking with a helper function

---
 R/likelihood_estimation.R | 5 ++---
 1 file changed, 2 insertions(+), 3 deletions(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index 83eaab24..83f121b9 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -45,9 +45,8 @@ estimate_likelihood <- function(chains_observed,
   chain_statistic <- match.arg(chain_statistic)
 
   ## checks
-  if (!is.character(offspring_sampler)) {
-    stop("Object passed as 'offspring_sampler' is not a character string.")
-  }
+  check_offspring_valid(offspring_sampler)
+
   if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
   if (obs_prob < 1) {
     if (missing(nsim_obs)) {

From a0420c982ce94ce890659c83d54a185309cccf9a Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:21:57 +0100
Subject: [PATCH 338/828] Added curly brackets to inline function to improve
 readability

---
 R/likelihood_estimation.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index 83f121b9..ffbebc17 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -62,7 +62,7 @@ estimate_likelihood <- function(chains_observed,
                                            ),
                                chain_stat_max), simplify = FALSE)
     size_x <- unlist(sampled_x)
-    if (!is.finite(chain_stat_max)) chain_stat_max <- max(size_x) + 1
+    if (!is.finite(chain_stat_max)) {chain_stat_max <- max(size_x) + 1}
   } else {
     chains_observed[chains_observed >= chain_stat_max] <- chain_stat_max
     size_x <- chains_observed
@@ -116,7 +116,7 @@ estimate_likelihood <- function(chains_observed,
     likelihoods[sx[!(sx %in% exclude)]]
   })
 
-  if (!individual) chains_likelihood <- vapply(chains_likelihood, sum, 0)
+  if (!individual) {chains_likelihood <- vapply(chains_likelihood, sum, 0)}
 
   return(chains_likelihood)
 }

From a5ccd65e6fa5978f11b6a31cae65bbcc7093c720 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:22:33 +0100
Subject: [PATCH 339/828] Replaced explicit code with helper functions

---
 R/likelihood_estimation.R | 10 ++++------
 1 file changed, 4 insertions(+), 6 deletions(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index ffbebc17..93e0e201 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -52,11 +52,9 @@ estimate_likelihood <- function(chains_observed,
     if (missing(nsim_obs)) {
       stop("'nsim_obs' must be specified if 'obs_prob' is < 1")
     }
-    if (chain_statistic == "size") {
-      sample_func <- rbinom_size
-    } else if (chain_statistic == "length") {
-      sample_func <- rgen_length
-    }
+
+    sample_func <- get_chain_statistic_func(chain_statistic)
+
     sampled_x <- replicate(nsim_obs, pmin(sample_func(length(chains_observed),
                                            chains_observed, obs_prob
                                            ),
@@ -78,7 +76,7 @@ estimate_likelihood <- function(chains_observed,
 
   ## get likelihood function as given by offspring_sampler and chain_statistic
   likelihoods <- vector(mode = "numeric")
-  ll_func <- paste(offspring_sampler, chain_statistic, "ll", sep = "_")
+  ll_func <- construct_offspring_ll_name(offspring_sampler, chain_statistic)
   pars <- as.list(unlist(list(...))) ## converts vectors to lists
 
   ## calculate likelihoods

From 6a350dabddd5b8c0f8476cf52c1b5c9e4abbfd86 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:31:55 +0100
Subject: [PATCH 340/828] Linting

---
 R/helpers.R               | 4 ++--
 R/likelihood_estimation.R | 8 ++++++--
 2 files changed, 8 insertions(+), 4 deletions(-)

diff --git a/R/helpers.R b/R/helpers.R
index f0d11f01..99f66da6 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -65,7 +65,7 @@ get_offspring_func <- function(offspring_sampler, n, susc, pop,
 #'
 #' @return a function for calculating chain statistics
 #' @keywords internal
-get_chain_statistic_func <- function(chain_statistic){
+get_chain_statistic_func <- function(chain_statistic) {
   func <- if (chain_statistic == "size") {
     rbinom_size
   } else if (chain_statistic == "length") {
@@ -81,7 +81,7 @@ get_chain_statistic_func <- function(chain_statistic){
 #'
 #' @return an analytical offspring likelihood function
 #' @keywords internal
-construct_offspring_ll_name <- function(offspring_sampler, chain_statistic){
+construct_offspring_ll_name <- function(offspring_sampler, chain_statistic) {
   ll_name <- paste(offspring_sampler, chain_statistic, "ll", sep = "_")
   return(ll_name)
 }
diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index 93e0e201..8f663805 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -60,7 +60,9 @@ estimate_likelihood <- function(chains_observed,
                                            ),
                                chain_stat_max), simplify = FALSE)
     size_x <- unlist(sampled_x)
-    if (!is.finite(chain_stat_max)) {chain_stat_max <- max(size_x) + 1}
+    if (!is.finite(chain_stat_max)) {
+      chain_stat_max <- max(size_x) + 1
+      }
   } else {
     chains_observed[chains_observed >= chain_stat_max] <- chain_stat_max
     size_x <- chains_observed
@@ -114,7 +116,9 @@ estimate_likelihood <- function(chains_observed,
     likelihoods[sx[!(sx %in% exclude)]]
   })
 
-  if (!individual) {chains_likelihood <- vapply(chains_likelihood, sum, 0)}
+  if (!individual) {
+    chains_likelihood <- vapply(chains_likelihood, sum, 0)
+    }
 
   return(chains_likelihood)
 }

From 6a1980fc7f11545f5493d02d4c90218a7db5ffc7 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 15 Jun 2023 17:00:12 +0100
Subject: [PATCH 341/828] Changed n to nchains to fix a partial matching issue

---
 R/simulate.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index ff6fad0e..f0601fe3 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -215,7 +215,7 @@ simulate_tree <- function(nchains, offspring_sampler,
 #' @param chain_stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to `Inf`.
 #' @examples
-#' simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,
+#' simulate_vect(nchains = 10, offspring_sampler = "pois", lambda = 2,
 #' chain_stat_max = 10)
 #' @export
 simulate_vect <- function(nchains, offspring_sampler,

From ff35d49333956d633786fc4b3102318c7a43860f Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 15 Jun 2023 17:00:58 +0100
Subject: [PATCH 342/828] Added a check for the chains_tree attribute

---
 R/checks.R | 11 +++++++++++
 1 file changed, 11 insertions(+)

diff --git a/R/checks.R b/R/checks.R
index 967b7eee..ed4338fe 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -57,3 +57,14 @@ check_nchains_valid <- function(nchains) {
     stop("`nchains` must be > 0 but less than `Inf`")
   }
 }
+
+#' Title
+#'
+#' @param x An [`epichains`] object
+#'
+#' @keywords internal
+check_chain_tree_attribute <- function(x){
+  if (attributes(x)$chain_type != "chains_tree") {
+    stop("Object must be an epichains object with a chains_tree attribute.")
+  }
+}

From 0a143fb80ca9ae6193f520e1d3131056f16d2267 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 15 Jun 2023 17:01:14 +0100
Subject: [PATCH 343/828] Added aggregate to NAMESPACE

---
 NAMESPACE | 1 +
 1 file changed, 1 insertion(+)

diff --git a/NAMESPACE b/NAMESPACE
index 61a29bb9..05a8ce35 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -1,5 +1,6 @@
 # Generated by roxygen2: do not edit by hand
 
+S3method(aggregate,epichains)
 S3method(format,epichains)
 S3method(head,epichains)
 S3method(plot,epichains)

From 4f13cdf3d56bff4f53d73f6f90669c4cc3d4b002 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 15 Jun 2023 17:01:54 +0100
Subject: [PATCH 344/828] Added a check for the epichains_aggregate_df class

---
 R/epichains.R | 11 +++++++++++
 1 file changed, 11 insertions(+)

diff --git a/R/epichains.R b/R/epichains.R
index 10730f47..ad5683b0 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -135,6 +135,17 @@ is_epichains <- function(x) {
   inherits(x, "epichains")
 }
 
+#' Check if an object is of class "epichains_aggregate_df"
+#'
+#' @param x An [`epichains`] object
+#'
+#' @keywords internal
+is_epichains_aggregate_df <- function(x) {
+  if (!inherits(x, "epichains_aggregate_df")) {
+    stop("Object must have class 'epichains_aggregate_df'")
+  }
+}
+
 #' `epichains` class validator
 #'
 #' @param x An `epichains` object

From 406d0177466c9f55e3a6e8992a941c319f589652 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 15 Jun 2023 17:07:31 +0100
Subject: [PATCH 345/828] Rewrote plot() to use objects aggregated through
 aggregate()

---
 R/epichains.R | 116 ++++++++++++++++++++++++++++++++++----------------
 1 file changed, 80 insertions(+), 36 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index ad5683b0..ad739422 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -209,46 +209,90 @@ tail.epichains <- function(x, ...) {
 
 #' Plot epichains tree objects
 #'
-#' @param x An [`epichains`] object with a chains_tree attribute
-#' @param ... Other arguments passed to plot
+#' This method accepts epichains aggregated through the `aggregate` method,
+#' which returns an object of class `epichains_aggregate_df` with an
+#' `aggregated_over` attribute that tells `plot()` which variable to plot.
 #'
-#' @return A plot of cases over time and generation
+#' @param x An [`epichains`] object with a chains_tree attribute.
+#' @param ... Other arguments passed to plot.
+#'
+#' @return A plot of cases over time and generation.
 #' @author James M. Azam
+#' @example
+#' # Generate chains with poisson offspring using `simulate_tree()`
+#' set.seed(123)
+#' chains <- simulate_tree(nchains = 10,
+#' serials_sampler = function(x) rpois(x, 2),
+#' offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
+#'
+#' # Aggregate cases per time and plot the results
+#' cases_per_time <- aggregate(chains, "time")
+#' plot(cases_per_time)
+#'
+#' # Aggregate cases per generation and plot the results
+#' cases_per_gen <- aggregate(chains, "generation")
+#' plot(cases_per_gen)
+#'
+#' # Aggregate cases per time and generation and plot the results
+#' cases_aggreg <- aggregate(chains, "both")
+#' plot(cases_aggreg)
+#'
+#' # Generate chains with negative
+#' # binomial offspring and from a fixed population size using
+#' # `simulate_tree_from_pop()`
+#' set.seed(123)
+#' chains_bn <- simulate_tree_from_pop(pop = 1000, offspring_sampler = "nbinom",
+#' mean_offspring = 0.5, disp_offspring = 1.1,
+#' serial_sampler = function(x) rpois(x, 2))
+#'
+#' # Plot them
+#' plot(aggregate(chains_bn, "time"))
 #' @export
+#' @author James M. Azam
 plot.epichains <- function(x, ...) {
-  validate_epichains(x)
 
-  if (attributes(x)$chain_type != "chains_tree") {
-    stop("Object must be an epichains object with a chains_tree attribute.")
-  }
+  # Object should have been aggregated using the aggregate.epichains method
+  is_epichains_aggregate_df(x)
+
+  check_chain_tree_attribute(x)
 
-  # Count the number of cases per generation
-  cases_per_generation <- stats::aggregate(sim_id ~ generation,
-                                           x = as.data.frame(x),
-                                           FUN = NROW
-                                           )
-  # Count the number of cases per time
-  cases_per_time <- stats::aggregate(sim_id ~ time, x = as.data.frame(x),
-                                     FUN = NROW)
-
-  # Set up grid
-  graphics::par(mfrow = c(1, 2), mar = c(4, 3, 3, 1), oma = c(0, 0, 0, 0))
-
-  # Make first plot
-  graphics::plot(cases_per_generation$generation,
-       cases_per_generation$sim_id,
-       xlab = "Generation",
-       ylab = "Cases",
-       type = "b",
-       main = "Number of cases per generation"
-       )
-
-  # Make second plot
-  graphics::plot(cases_per_time$time,
-       cases_per_time$sim_id,
-       xlab = "Time",
-       ylab = "Cases",
-       type = "b",
-       main = "Number of cases per time"
-  )
+  plotting_var <- attributes(x)$aggregated_over
+
+  if (plotting_var == "time") {
+    graphics::barplot(x$cases,
+      names.arg = x$time,
+      xlab = "Time",
+      ylab = "Cases",
+      type = "b", ,
+      col = "tomato3",
+      main = "Number of cases per time"
+    )
+  } else if (plotting_var == "generation") {
+    graphics::barplot(x$cases,
+      names.arg = x$generation,
+      xlab = "Generation",
+      ylab = "Cases", ,
+      col = "steelblue",
+      main = "Number of cases per generation"
+    )
+  } else if (plotting_var == "both") {
+    par(mfrow = c(1, 2))
+    # Make first plot
+    graphics::barplot(x[[1]]$cases,
+      names.arg = x$time,
+      xlab = "Time",
+      ylab = "Cases",
+      type = "b", ,
+      col = "tomato3",
+      main = "Number of cases per time"
+    )
+    # Make second plot
+    graphics::barplot(x[[2]]$cases,
+      names.arg = x$generation,
+      xlab = "Generation",
+      ylab = "Cases", ,
+      col = "steelblue",
+      main = "Number of cases per generation"
+    )
+  }
 }

From 81c2319e3ed4a559e1ac8b7c5b62d2148e00ecd3 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 15 Jun 2023 17:07:48 +0100
Subject: [PATCH 346/828] Added an aggregate method

---
 R/epichains.R | 76 +++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 76 insertions(+)

diff --git a/R/epichains.R b/R/epichains.R
index ad739422..cf0a68c2 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -296,3 +296,79 @@ plot.epichains <- function(x, ...) {
     )
   }
 }
+
+#' Aggregate cases in epichains objects according to a grouping variable
+#'
+#' @param x An [`epichains`] object.
+#' @param grouping_var The variable to group and count over. Options include
+#' "time", "generation", and "both".
+#' @param ... Other arguments passed to aggregate.
+#'
+#' @return If grouping_var is either "time" or "generation", a data.frame
+#' with cases aggregated over `grouping_var`; If
+#' \code{grouping_var = "both"}, a list of data.frames, the first being for
+#'  cases over time, and the second being for cases over generations.
+#' @export
+#'
+#' @examples
+#' set.seed(123)
+#' chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,
+#' offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
+#' chains
+#'
+#' # Aggregate cases per time
+#' aggregate(chains, grouping_var = "time")
+#'
+#' # Aggregate cases per generation
+#' aggregate(chains, grouping_var = "generation")
+#'
+#' # Aggregate cases per both time and generation
+#' aggregate(chains, grouping_var = "both")
+aggregate.epichains <- function(x,
+                                grouping_var = c("time",
+                                                 "generation",
+                                                 "both"
+                                                 ),
+                                ...) {
+  validate_epichains(x)
+  # Check that the object is of type "chains_tree"
+  if (attributes(x)$chain_type == "chains_vec") {
+    stop("object must be an epichains object with 'chains_tree' attribute.")
+  }
+
+  # Get grouping variable
+  grouping_var <- match.arg(grouping_var)
+
+  out <- if (grouping_var == "time") {
+    # Count the number of cases per generation
+    stats::aggregate(list(cases = x$sim_id),
+      list(time = x$time),
+      FUN = NROW
+    )
+  } else if (grouping_var == "generation") {
+    # Count the number of cases per time
+    stats::aggregate(list(cases = x$sim_id),
+      list(generation = x$generation),
+      FUN = NROW
+    )
+  } else if (grouping_var == "both") {
+    # Count the number of cases per time
+    list(
+      stats::aggregate(list(cases = x$sim_id),
+        list(time = x$time),
+        FUN = NROW
+      ),
+      # Count the number of cases per generation
+      stats::aggregate(list(cases = x$sim_id),
+        list(generation = x$generation),
+        FUN = NROW
+      )
+    )
+  }
+
+  structure(out,
+    class = c("epichains_aggregate_df", "tbl", "data.frame"),
+    chain_type = attributes(x)$chain_type,
+    aggregated_over = grouping_var
+  )
+}

From 054b200caf1862e88e554a37c09dc5a4523751b8 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 15 Jun 2023 17:08:06 +0100
Subject: [PATCH 347/828] Regenerated roxygen docs

---
 man/aggregate.epichains.Rd        | 40 +++++++++++++++++++++++++++++++
 man/check_chain_tree_attribute.Rd | 15 ++++++++++++
 man/is_epichains_aggregate_df.Rd  | 15 ++++++++++++
 man/plot.epichains.Rd             | 10 ++++----
 man/simulate_vect.Rd              |  2 +-
 5 files changed, 77 insertions(+), 5 deletions(-)
 create mode 100644 man/aggregate.epichains.Rd
 create mode 100644 man/check_chain_tree_attribute.Rd
 create mode 100644 man/is_epichains_aggregate_df.Rd

diff --git a/man/aggregate.epichains.Rd b/man/aggregate.epichains.Rd
new file mode 100644
index 00000000..df7ef62c
--- /dev/null
+++ b/man/aggregate.epichains.Rd
@@ -0,0 +1,40 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{aggregate.epichains}
+\alias{aggregate.epichains}
+\title{Aggregate cases in epichains objects according to a grouping variable}
+\usage{
+\method{aggregate}{epichains}(x, grouping_var = c("time", "generation", "both"), ...)
+}
+\arguments{
+\item{x}{An \code{\link{epichains}} object.}
+
+\item{grouping_var}{The variable to group and count over. Options include
+"time", "generation", and "both".}
+
+\item{...}{Other arguments passed to aggregate.}
+}
+\value{
+If grouping_var is either "time" or "generation", a data.frame
+with cases aggregated over \code{grouping_var}; If
+\code{grouping_var = "both"}, a list of data.frames, the first being for
+cases over time, and the second being for cases over generations.
+}
+\description{
+Aggregate cases in epichains objects according to a grouping variable
+}
+\examples{
+set.seed(123)
+chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,
+offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
+chains
+
+# Aggregate cases per time
+aggregate(chains, grouping_var = "time")
+
+# Aggregate cases per generation
+aggregate(chains, grouping_var = "generation")
+
+# Aggregate cases per both time and generation
+aggregate(chains, grouping_var = "both")
+}
diff --git a/man/check_chain_tree_attribute.Rd b/man/check_chain_tree_attribute.Rd
new file mode 100644
index 00000000..c0156936
--- /dev/null
+++ b/man/check_chain_tree_attribute.Rd
@@ -0,0 +1,15 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/checks.R
+\name{check_chain_tree_attribute}
+\alias{check_chain_tree_attribute}
+\title{Title}
+\usage{
+check_chain_tree_attribute(x)
+}
+\arguments{
+\item{x}{An \code{\link{epichains}} object}
+}
+\description{
+Title
+}
+\keyword{internal}
diff --git a/man/is_epichains_aggregate_df.Rd b/man/is_epichains_aggregate_df.Rd
new file mode 100644
index 00000000..ceeb73aa
--- /dev/null
+++ b/man/is_epichains_aggregate_df.Rd
@@ -0,0 +1,15 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{is_epichains_aggregate_df}
+\alias{is_epichains_aggregate_df}
+\title{Check if an object is of class "epichains_aggregate_df"}
+\usage{
+is_epichains_aggregate_df(x)
+}
+\arguments{
+\item{x}{An \code{\link{epichains}} object}
+}
+\description{
+Check if an object is of class "epichains_aggregate_df"
+}
+\keyword{internal}
diff --git a/man/plot.epichains.Rd b/man/plot.epichains.Rd
index 7fa17943..2c27e15e 100644
--- a/man/plot.epichains.Rd
+++ b/man/plot.epichains.Rd
@@ -7,15 +7,17 @@
 \method{plot}{epichains}(x, ...)
 }
 \arguments{
-\item{x}{An \code{\link{epichains}} object with a chains_tree attribute}
+\item{x}{An \code{\link{epichains}} object with a chains_tree attribute.}
 
-\item{...}{Other arguments passed to plot}
+\item{...}{Other arguments passed to plot.}
 }
 \value{
-A plot of cases over time and generation
+A plot of cases over time and generation.
 }
 \description{
-Plot epichains tree objects
+This method accepts epichains aggregated through the \code{aggregate} method,
+which returns an object of class \code{epichains_aggregate_df} with an
+\code{aggregated_over}.
 }
 \author{
 James M. Azam
diff --git a/man/simulate_vect.Rd b/man/simulate_vect.Rd
index cd7fafc7..cdef8113 100644
--- a/man/simulate_vect.Rd
+++ b/man/simulate_vect.Rd
@@ -35,6 +35,6 @@ computed. Results above the specified value, are set to \code{Inf}.}
 Simulate transmission chains without tree (as a vector)
 }
 \examples{
-simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,
+simulate_vect(nchains = 10, offspring_sampler = "pois", lambda = 2,
 chain_stat_max = 10)
 }

From 8e3dca465a706233d04b12e4c7197c6863e68726 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 13:19:30 +0100
Subject: [PATCH 348/828] Cleaned up documentation for the plot method

---
 R/epichains.R         | 9 ++++-----
 man/plot.epichains.Rd | 5 +++--
 2 files changed, 7 insertions(+), 7 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index cf0a68c2..02ae4386 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -216,10 +216,11 @@ tail.epichains <- function(x, ...) {
 #' @param x An [`epichains`] object with a chains_tree attribute.
 #' @param ... Other arguments passed to plot.
 #'
-#' @return A plot of cases over time and generation.
+#' @return A plot of cases over time, generation, or both, depending on what
+#' was aggregated over. See \code{?epichains::aggregate}.
 #' @author James M. Azam
 #' @example
-#' # Generate chains with poisson offspring using `simulate_tree()`
+#' # Generate chains with poisson offspring using simulate_tree()
 #' set.seed(123)
 #' chains <- simulate_tree(nchains = 10,
 #' serials_sampler = function(x) rpois(x, 2),
@@ -228,7 +229,6 @@ tail.epichains <- function(x, ...) {
 #' # Aggregate cases per time and plot the results
 #' cases_per_time <- aggregate(chains, "time")
 #' plot(cases_per_time)
-#'
 #' # Aggregate cases per generation and plot the results
 #' cases_per_gen <- aggregate(chains, "generation")
 #' plot(cases_per_gen)
@@ -239,7 +239,7 @@ tail.epichains <- function(x, ...) {
 #'
 #' # Generate chains with negative
 #' # binomial offspring and from a fixed population size using
-#' # `simulate_tree_from_pop()`
+#' # simulate_tree_from_pop()
 #' set.seed(123)
 #' chains_bn <- simulate_tree_from_pop(pop = 1000, offspring_sampler = "nbinom",
 #' mean_offspring = 0.5, disp_offspring = 1.1,
@@ -248,7 +248,6 @@ tail.epichains <- function(x, ...) {
 #' # Plot them
 #' plot(aggregate(chains_bn, "time"))
 #' @export
-#' @author James M. Azam
 plot.epichains <- function(x, ...) {
 
   # Object should have been aggregated using the aggregate.epichains method
diff --git a/man/plot.epichains.Rd b/man/plot.epichains.Rd
index 2c27e15e..c8e06d1b 100644
--- a/man/plot.epichains.Rd
+++ b/man/plot.epichains.Rd
@@ -12,12 +12,13 @@
 \item{...}{Other arguments passed to plot.}
 }
 \value{
-A plot of cases over time and generation.
+A plot of cases over time, generation, or both, depending on what
+was aggregated over. See \code{?epichains::aggregate}.
 }
 \description{
 This method accepts epichains aggregated through the \code{aggregate} method,
 which returns an object of class \code{epichains_aggregate_df} with an
-\code{aggregated_over}.
+\code{aggregated_over} attribute that tells \code{plot()} which variable to plot.
 }
 \author{
 James M. Azam

From 5ba5072246c6f168329470f0fbffca976ec84120 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:55:06 +0100
Subject: [PATCH 349/828] Added utils to imports

---
 DESCRIPTION | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 157f7803..b50545a1 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -24,7 +24,8 @@ BugReports: https://github.com/epiverse-trace/epichains/issues
 Depends:
     R (>= 3.6.0)
 Imports: 
-    stats
+    stats,
+    utils
 Suggests:
     bookdown,
     covr,

From 079ba24571083588c456ce98691e4b2c40c613e8 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:55:25 +0100
Subject: [PATCH 350/828] Unexported is_epichains()

---
 NAMESPACE | 1 -
 1 file changed, 1 deletion(-)

diff --git a/NAMESPACE b/NAMESPACE
index 05a8ce35..d7aa250a 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -9,7 +9,6 @@ S3method(summary,epichains)
 S3method(tail,epichains)
 export(dborel)
 export(estimate_likelihood)
-export(is_epichains)
 export(rborel)
 export(rnbinom_mean_disp)
 export(simulate_tree)

From 48a642a6014cb902d6a07b490503d7b8dbd0b473 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:55:57 +0100
Subject: [PATCH 351/828] Imported functions from graphics and stats

---
 NAMESPACE | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/NAMESPACE b/NAMESPACE
index d7aa250a..9f7d5e08 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -14,5 +14,8 @@ export(rnbinom_mean_disp)
 export(simulate_tree)
 export(simulate_tree_from_pop)
 export(simulate_vect)
+importFrom(graphics,barplot)
+importFrom(graphics,par)
+importFrom(stats,aggregate)
 importFrom(utils,head)
 importFrom(utils,tail)

From e0e56707b6490cb0c0b635edacb4e0bf362b5b9f Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:56:18 +0100
Subject: [PATCH 352/828] Add keyword internal to is_epichains

---
 man/is_epichains.Rd | 1 +
 1 file changed, 1 insertion(+)

diff --git a/man/is_epichains.Rd b/man/is_epichains.Rd
index dd365904..aa2d540d 100644
--- a/man/is_epichains.Rd
+++ b/man/is_epichains.Rd
@@ -16,3 +16,4 @@ otherwise
 \description{
 Checks whether the object is an \code{epichains}
 }
+\keyword{internal}

From d38ae76178bb039b0075f5480e2855587cd49a05 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:56:46 +0100
Subject: [PATCH 353/828] Redocumented the plot method

---
 man/plot.epichains.Rd | 36 +++++++++++++++++++++++++++++++++---
 1 file changed, 33 insertions(+), 3 deletions(-)

diff --git a/man/plot.epichains.Rd b/man/plot.epichains.Rd
index c8e06d1b..cc42095a 100644
--- a/man/plot.epichains.Rd
+++ b/man/plot.epichains.Rd
@@ -7,19 +7,49 @@
 \method{plot}{epichains}(x, ...)
 }
 \arguments{
-\item{x}{An \code{\link{epichains}} object with a chains_tree attribute.}
+\item{x}{An \code{epichains_aggregate_df} object with a \code{chains_tree} attribute.}
 
 \item{...}{Other arguments passed to plot.}
 }
 \value{
-A plot of cases over time, generation, or both, depending on what
-was aggregated over. See \code{?epichains::aggregate}.
+A plot of cases over time, generation, or both, depending on which
+of the options in the simulated dataset was aggregated over. See
+\code{?epichains::aggregate}.
 }
 \description{
 This method accepts epichains aggregated through the \code{aggregate} method,
 which returns an object of class \code{epichains_aggregate_df} with an
 \code{aggregated_over} attribute that tells \code{plot()} which variable to plot.
 }
+\examples{
+# Generate chains with poisson offspring using simulate_tree()
+set.seed(123)
+chains <- simulate_tree(nchains = 10,
+serials_sampler = function(x) rpois(x, 2),
+offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
+
+# Aggregate cases per time and plot the results
+cases_per_time <- aggregate(chains, "time")
+plot(cases_per_time)
+# Aggregate cases per generation and plot the results
+cases_per_gen <- aggregate(chains, "generation")
+plot(cases_per_gen)
+
+# Aggregate cases per time and generation and plot the results
+cases_aggreg <- aggregate(chains, "both")
+plot(cases_aggreg)
+
+# Generate chains with negative
+# binomial offspring and from a fixed population size using
+# simulate_tree_from_pop()
+set.seed(123)
+chains_bn <- simulate_tree_from_pop(pop = 1000, offspring_sampler = "nbinom",
+mean_offspring = 0.5, disp_offspring = 1.1,
+serial_sampler = function(x) rpois(x, 2))
+
+# Plot them
+plot(aggregate(chains_bn, "time"))
+}
 \author{
 James M. Azam
 }

From 8cddff8ebed929e686c825392dd708ad679c6fb8 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:57:13 +0100
Subject: [PATCH 354/828] Made is_epichains internal

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 02ae4386..646af886 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -130,7 +130,7 @@ summary.epichains <- function(object, ...) {
 #'
 #' @return logical, `TRUE` if the object is an `epichains` and `FALSE`
 #' otherwise
-#' @export
+#' @keywords internal
 is_epichains <- function(x) {
   inherits(x, "epichains")
 }

From 68697ca82f9d81733aff48a15d2baa7b1384dc0d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:57:55 +0100
Subject: [PATCH 355/828] Cleaned up plotting method roxygen docs

---
 R/epichains.R | 9 +++++----
 1 file changed, 5 insertions(+), 4 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 646af886..a8e1c9fe 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -213,11 +213,12 @@ tail.epichains <- function(x, ...) {
 #' which returns an object of class `epichains_aggregate_df` with an
 #' `aggregated_over` attribute that tells `plot()` which variable to plot.
 #'
-#' @param x An [`epichains`] object with a chains_tree attribute.
+#' @param x An `epichains_aggregate_df` object with a `chains_tree` attribute.
 #' @param ... Other arguments passed to plot.
-#'
-#' @return A plot of cases over time, generation, or both, depending on what
-#' was aggregated over. See \code{?epichains::aggregate}.
+#' @importFrom graphics barplot par
+#' @return A plot of cases over time, generation, or both, depending on which
+#' of the options in the simulated dataset was aggregated over. See
+#' \code{?epichains::aggregate}.
 #' @author James M. Azam
 #' @example
 #' # Generate chains with poisson offspring using simulate_tree()

From 5f5b7998e6f0060500c099b48c096d184f2abb31 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:58:22 +0100
Subject: [PATCH 356/828] Fixed example tag

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index a8e1c9fe..12504fa6 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -220,7 +220,7 @@ tail.epichains <- function(x, ...) {
 #' of the options in the simulated dataset was aggregated over. See
 #' \code{?epichains::aggregate}.
 #' @author James M. Azam
-#' @example
+#' @examples
 #' # Generate chains with poisson offspring using simulate_tree()
 #' set.seed(123)
 #' chains <- simulate_tree(nchains = 10,

From e4c263611e7fb3ae4cfe436f98a9bf87bbdebf7e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:59:08 +0100
Subject: [PATCH 357/828] Fixed the returned structure from the aggregate
 method

---
 R/epichains.R | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 12504fa6..bfcc8c38 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -367,8 +367,9 @@ aggregate.epichains <- function(x,
   }
 
   structure(out,
-    class = c("epichains_aggregate_df", "tbl", "data.frame"),
+    class = c("epichains_aggregate_df", class(out)),
     chain_type = attributes(x)$chain_type,
+    rownames = NULL,
     aggregated_over = grouping_var
   )
 }

From 4b544a194fb9aa1460f30a995f92fafc4b627ec3 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:59:52 +0100
Subject: [PATCH 358/828] Tightened the check for the chains_tree attribute in
 the aggregate method

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index bfcc8c38..cbd06d85 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -332,7 +332,7 @@ aggregate.epichains <- function(x,
                                 ...) {
   validate_epichains(x)
   # Check that the object is of type "chains_tree"
-  if (attributes(x)$chain_type == "chains_vec") {
+  if (attributes(x)$chain_type != "chains_tree") {
     stop("object must be an epichains object with 'chains_tree' attribute.")
   }
 

From f393f3fd870f272b29d740a2b1fc06e3bcf217b6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 17:00:14 +0100
Subject: [PATCH 359/828] Styled the code

---
 R/epichains.R | 10 ++++------
 1 file changed, 4 insertions(+), 6 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index cbd06d85..95e975cf 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -355,14 +355,12 @@ aggregate.epichains <- function(x,
     # Count the number of cases per time
     list(
       stats::aggregate(list(cases = x$sim_id),
-        list(time = x$time),
-        FUN = NROW
-      ),
+                       list(time = x$time),
+                       FUN = NROW),
       # Count the number of cases per generation
       stats::aggregate(list(cases = x$sim_id),
-        list(generation = x$generation),
-        FUN = NROW
-      )
+                       list(generation = x$generation),
+                       FUN = NROW)
     )
   }
 

From 22a63b181e248789be576a15e7e44e1fcebe0937 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 17:00:57 +0100
Subject: [PATCH 360/828] Removed the type argument passed to barplot

---
 R/epichains.R | 1 -
 1 file changed, 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 95e975cf..2298cba7 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -263,7 +263,6 @@ plot.epichains <- function(x, ...) {
       names.arg = x$time,
       xlab = "Time",
       ylab = "Cases",
-      type = "b", ,
       col = "tomato3",
       main = "Number of cases per time"
     )

From c0be73a375253a411c105acf0549c18b747aad6d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 17:16:25 +0100
Subject: [PATCH 361/828] Imported aggregate generic

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 2298cba7..f214858a 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -302,7 +302,7 @@ plot.epichains <- function(x, ...) {
 #' @param grouping_var The variable to group and count over. Options include
 #' "time", "generation", and "both".
 #' @param ... Other arguments passed to aggregate.
-#'
+#' @importFrom stats aggregate
 #' @return If grouping_var is either "time" or "generation", a data.frame
 #' with cases aggregated over `grouping_var`; If
 #' \code{grouping_var = "both"}, a list of data.frames, the first being for

From af36ab3f0a7dfac743bf4bf7ed1aeab2bb1b42b1 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 17:16:48 +0100
Subject: [PATCH 362/828] Removed plotting method

---
 NAMESPACE             |  3 --
 R/epichains.R         | 89 -------------------------------------------
 man/plot.epichains.Rd | 55 --------------------------
 3 files changed, 147 deletions(-)
 delete mode 100644 man/plot.epichains.Rd

diff --git a/NAMESPACE b/NAMESPACE
index 9f7d5e08..e31e4230 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -3,7 +3,6 @@
 S3method(aggregate,epichains)
 S3method(format,epichains)
 S3method(head,epichains)
-S3method(plot,epichains)
 S3method(print,epichains)
 S3method(summary,epichains)
 S3method(tail,epichains)
@@ -14,8 +13,6 @@ export(rnbinom_mean_disp)
 export(simulate_tree)
 export(simulate_tree_from_pop)
 export(simulate_vect)
-importFrom(graphics,barplot)
-importFrom(graphics,par)
 importFrom(stats,aggregate)
 importFrom(utils,head)
 importFrom(utils,tail)
diff --git a/R/epichains.R b/R/epichains.R
index f214858a..caa4bbd8 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -207,95 +207,6 @@ tail.epichains <- function(x, ...) {
   utils::tail(as.data.frame(x), ...)
 }
 
-#' Plot epichains tree objects
-#'
-#' This method accepts epichains aggregated through the `aggregate` method,
-#' which returns an object of class `epichains_aggregate_df` with an
-#' `aggregated_over` attribute that tells `plot()` which variable to plot.
-#'
-#' @param x An `epichains_aggregate_df` object with a `chains_tree` attribute.
-#' @param ... Other arguments passed to plot.
-#' @importFrom graphics barplot par
-#' @return A plot of cases over time, generation, or both, depending on which
-#' of the options in the simulated dataset was aggregated over. See
-#' \code{?epichains::aggregate}.
-#' @author James M. Azam
-#' @examples
-#' # Generate chains with poisson offspring using simulate_tree()
-#' set.seed(123)
-#' chains <- simulate_tree(nchains = 10,
-#' serials_sampler = function(x) rpois(x, 2),
-#' offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
-#'
-#' # Aggregate cases per time and plot the results
-#' cases_per_time <- aggregate(chains, "time")
-#' plot(cases_per_time)
-#' # Aggregate cases per generation and plot the results
-#' cases_per_gen <- aggregate(chains, "generation")
-#' plot(cases_per_gen)
-#'
-#' # Aggregate cases per time and generation and plot the results
-#' cases_aggreg <- aggregate(chains, "both")
-#' plot(cases_aggreg)
-#'
-#' # Generate chains with negative
-#' # binomial offspring and from a fixed population size using
-#' # simulate_tree_from_pop()
-#' set.seed(123)
-#' chains_bn <- simulate_tree_from_pop(pop = 1000, offspring_sampler = "nbinom",
-#' mean_offspring = 0.5, disp_offspring = 1.1,
-#' serial_sampler = function(x) rpois(x, 2))
-#'
-#' # Plot them
-#' plot(aggregate(chains_bn, "time"))
-#' @export
-plot.epichains <- function(x, ...) {
-
-  # Object should have been aggregated using the aggregate.epichains method
-  is_epichains_aggregate_df(x)
-
-  check_chain_tree_attribute(x)
-
-  plotting_var <- attributes(x)$aggregated_over
-
-  if (plotting_var == "time") {
-    graphics::barplot(x$cases,
-      names.arg = x$time,
-      xlab = "Time",
-      ylab = "Cases",
-      col = "tomato3",
-      main = "Number of cases per time"
-    )
-  } else if (plotting_var == "generation") {
-    graphics::barplot(x$cases,
-      names.arg = x$generation,
-      xlab = "Generation",
-      ylab = "Cases", ,
-      col = "steelblue",
-      main = "Number of cases per generation"
-    )
-  } else if (plotting_var == "both") {
-    par(mfrow = c(1, 2))
-    # Make first plot
-    graphics::barplot(x[[1]]$cases,
-      names.arg = x$time,
-      xlab = "Time",
-      ylab = "Cases",
-      type = "b", ,
-      col = "tomato3",
-      main = "Number of cases per time"
-    )
-    # Make second plot
-    graphics::barplot(x[[2]]$cases,
-      names.arg = x$generation,
-      xlab = "Generation",
-      ylab = "Cases", ,
-      col = "steelblue",
-      main = "Number of cases per generation"
-    )
-  }
-}
-
 #' Aggregate cases in epichains objects according to a grouping variable
 #'
 #' @param x An [`epichains`] object.
diff --git a/man/plot.epichains.Rd b/man/plot.epichains.Rd
deleted file mode 100644
index cc42095a..00000000
--- a/man/plot.epichains.Rd
+++ /dev/null
@@ -1,55 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/epichains.R
-\name{plot.epichains}
-\alias{plot.epichains}
-\title{Plot epichains tree objects}
-\usage{
-\method{plot}{epichains}(x, ...)
-}
-\arguments{
-\item{x}{An \code{epichains_aggregate_df} object with a \code{chains_tree} attribute.}
-
-\item{...}{Other arguments passed to plot.}
-}
-\value{
-A plot of cases over time, generation, or both, depending on which
-of the options in the simulated dataset was aggregated over. See
-\code{?epichains::aggregate}.
-}
-\description{
-This method accepts epichains aggregated through the \code{aggregate} method,
-which returns an object of class \code{epichains_aggregate_df} with an
-\code{aggregated_over} attribute that tells \code{plot()} which variable to plot.
-}
-\examples{
-# Generate chains with poisson offspring using simulate_tree()
-set.seed(123)
-chains <- simulate_tree(nchains = 10,
-serials_sampler = function(x) rpois(x, 2),
-offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
-
-# Aggregate cases per time and plot the results
-cases_per_time <- aggregate(chains, "time")
-plot(cases_per_time)
-# Aggregate cases per generation and plot the results
-cases_per_gen <- aggregate(chains, "generation")
-plot(cases_per_gen)
-
-# Aggregate cases per time and generation and plot the results
-cases_aggreg <- aggregate(chains, "both")
-plot(cases_aggreg)
-
-# Generate chains with negative
-# binomial offspring and from a fixed population size using
-# simulate_tree_from_pop()
-set.seed(123)
-chains_bn <- simulate_tree_from_pop(pop = 1000, offspring_sampler = "nbinom",
-mean_offspring = 0.5, disp_offspring = 1.1,
-serial_sampler = function(x) rpois(x, 2))
-
-# Plot them
-plot(aggregate(chains_bn, "time"))
-}
-\author{
-James M. Azam
-}

From cf06552d3879c25455b564fc246d8524db1c9ab4 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 17:26:55 +0100
Subject: [PATCH 363/828] Linting

---
 R/checks.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/checks.R b/R/checks.R
index ed4338fe..c4bacce9 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -63,7 +63,7 @@ check_nchains_valid <- function(nchains) {
 #' @param x An [`epichains`] object
 #'
 #' @keywords internal
-check_chain_tree_attribute <- function(x){
+check_chain_tree_attribute <- function(x) {
   if (attributes(x)$chain_type != "chains_tree") {
     stop("Object must be an epichains object with a chains_tree attribute.")
   }

From 4907a4688a58f43084bbd9eca22fdfd74592aa94 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 14:38:52 +0100
Subject: [PATCH 364/828] Added demo vignette

---
 vignettes/epichains_demo.Rmd | 114 +++++
 vignettes/references.json    | 794 +++++++++++++++++++++++++++++++++++
 2 files changed, 908 insertions(+)
 create mode 100644 vignettes/epichains_demo.Rmd
 create mode 100644 vignettes/references.json

diff --git a/vignettes/epichains_demo.Rmd b/vignettes/epichains_demo.Rmd
new file mode 100644
index 00000000..ae587436
--- /dev/null
+++ b/vignettes/epichains_demo.Rmd
@@ -0,0 +1,114 @@
+---
+title: "Getting started with epichains"
+author: "James Azam"
+output:
+  bookdown::html_vignette2:
+    fig_caption: yes
+    code_folding: show
+pkgdown:
+  as_is: true
+bibliography: references.json
+link-citations: true
+vignette: >
+  %\VignetteIndexEntry{Getting started with epichains}
+  %\VignetteEncoding{UTF-8}
+  %\VignetteEngine{knitr::rmarkdown}
+editor_options: 
+  chunk_output_type: console
+---
+
+```{r include=FALSE}
+knitr::opts_chunk$set(
+  echo = TRUE,
+  message = FALSE,
+  warning = FALSE,
+  collapse = TRUE,
+  comment = "#>"
+)
+```
+
+## Functionality
+
+`epichains` currently has 4 core functions:
+
+* `simulate_tree()`: simulate transmission trees from a given number of chains.
+* `simulate_tree_from_pop()`: simulate transmission trees from a given number 
+  population size and initial immunity.
+* `simulate_vect()`: simulate a vector of observed transmission chains 
+  sizes/lengths from a given number of chains.
+* `estimate_likelihood()`: estimate the likelihood/loglikelihood of observing
+  chains of given sizes/lengths.
+
+### Object-orientation
+
+#### Classes
+
+* An `epichains` class:
+  * superclass of `data.frame` with attributes for tracking `chain_type` as: 
+    * `chains_tree`, if returned from `simulate_tree()` or 
+    `simulate_tree_from_pop()`
+    * `chains_vec`, if returned from `simulate_vect()`.
+* An `epichains_aggregate_df` class:
+  * superclass of `data.frame` with attributes for tracking if aggregation is 
+  done over "time", "generation" or "both". Useful for `plot` method dispatch 
+  (see methods section below).
+
+#### Methods
+
+* `print()`
+* `summary()`
+* `aggregate()`
+
+## Demo
+
+### Printing and summary
+```{r include=TRUE,echo=TRUE}
+library(epichains)
+# simulate_tree()
+simulate_tree_eg <- simulate_tree(nchains = 10,
+                                  serials_sampler = function(x) 3,
+                                  offspring_sampler = "pois", 
+                                  lambda = 2, 
+                                  chain_stat_max = 10
+                                  )
+
+simulate_tree_eg # print the output
+
+# simulate_vect()
+simulate_vect_eg <- simulate_vect(nchains = 10, offspring_sampler = "pois", 
+                                  lambda = 2, chain_stat_max = 10)
+
+simulate_vect_eg # print the output
+
+# simulate_tree_from_pop()
+# Simulate with poisson offspring
+simulate_vect_eg_pois <- simulate_tree_from_pop(pop = 100,
+                                                offspring_sampler = "pois",
+                                                mean_offspring = 0.5,
+                                                serial_sampler = function(x) 3
+                                                )
+
+simulate_vect_eg_pois # print the output
+# Simulate with negative binomial offspring
+simulate_vect_eg_nbinom <- simulate_tree_from_pop(pop = 100,
+                                                  offspring_sampler = "nbinom",
+                                                  mean_offspring = 0.5,
+                                                  disp_offspring = 1.1,
+                                                  serial_sampler = function(x) 3
+                                                  )
+
+simulate_vect_eg_nbinom # print the output
+```
+
+### Aggregation
+```{r include=TRUE,echo=TRUE}
+
+# aggregate by time
+aggregate(simulate_vect_eg_pois, "time")
+
+# aggregate by generation
+aggregate(simulate_vect_eg_pois, "generation")
+
+# aggregate by both time and generation
+aggregate(simulate_vect_eg_pois, "both") 
+```
diff --git a/vignettes/references.json b/vignettes/references.json
new file mode 100644
index 00000000..dcbb4440
--- /dev/null
+++ b/vignettes/references.json
@@ -0,0 +1,794 @@
+[
+	{
+		"id": "abbott2020",
+		"type": "article-journal",
+		"container-title": "Wellcome open research",
+		"note": "publisher: The Wellcome Trust",
+		"title": "The transmissibility of novel Coronavirus in the early stages of the 2019-20 outbreak in Wuhan: Exploring initial point-source exposure sizes and durations using scenario analysis",
+		"volume": "5",
+		"author": [
+			{
+				"family": "Abbott",
+				"given": "Sam"
+			},
+			{
+				"family": "Hellewell",
+				"given": "Joel"
+			},
+			{
+				"family": "Munday",
+				"given": "James"
+			},
+			{
+				"family": "Funk",
+				"given": "Sebastian"
+			},
+			{
+				"family": "group",
+				"given": "CMMID",
+				"dropping-particle": "nCoV working"
+			},
+			{
+				"literal": "others"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "alene2021",
+		"type": "article-journal",
+		"abstract": "Background: Understanding the epidemiological parameters that determine the transmission dynamics of COVID-19 is essential for public health intervention. Globally, a number of studies were conducted to estimate the average serial interval and incubation period of COVID-19. Combining findings of existing studies that estimate the average serial interval and incubation period of COVID-19 significantly improves the quality of evidence. Hence, this study aimed to determine the overall average serial interval and incubation period of COVID-19. Methods: We followed the PRISMA checklist to present this study. A comprehensive search strategy was carried out from international electronic databases (Google Scholar, PubMed, Science Direct, Web of Science, CINAHL, and Cochrane Library) by two experienced reviewers (MAA and DBK) authors between the 1st of June and the 31st of July 2020. All observational studies either reporting the serial interval or incubation period in persons diagnosed with COVID-19 were included in this study. Heterogeneity across studies was assessed using the I2 and Higgins test. The NOS adapted for cross-sectional studies was used to evaluate the quality of studies. A random effect Meta-analysis was employed to determine the pooled estimate with 95% (CI). Microsoft Excel was used for data extraction and R software was used for analysis. Results: We combined a total of 23 studies to estimate the overall mean serial interval of COVID-19. The mean serial interval of COVID-19 ranged from 4. 2 to 7.5 days. Our meta-analysis showed that the weighted pooled mean serial interval of COVID-19 was 5.2 (95%CI: 4.9–5.5) days. Additionally, to pool the mean incubation period of COVID-19, we included 14 articles. The mean incubation period of COVID-19 also ranged from 4.8 to 9 days. Accordingly, the weighted pooled mean incubation period of COVID-19 was 6.5 (95%CI: 5.9–7.1) days. Conclusions: This systematic review and meta-analysis showed that the weighted pooled mean serial interval and incubation period of COVID-19 were 5.2, and 6.5 days, respectively. In this study, the average serial interval of COVID-19 is shorter than the average incubation period, which suggests that substantial numbers of COVID-19 cases will be attributed to presymptomatic transmission.",
+		"container-title": "BMC Infectious Diseases",
+		"DOI": "10.1186/s12879-021-05950-x",
+		"ISSN": "14712334",
+		"issue": "1",
+		"note": "publisher: BMC Infectious Diseases\nPMID: 33706702",
+		"page": "1–9",
+		"title": "Serial interval and incubation period of COVID-19: a systematic review and meta-analysis",
+		"volume": "21",
+		"author": [
+			{
+				"family": "Alene",
+				"given": "Muluneh"
+			},
+			{
+				"family": "Yismaw",
+				"given": "Leltework"
+			},
+			{
+				"family": "Assemie",
+				"given": "Moges Agazhe"
+			},
+			{
+				"family": "Ketema",
+				"given": "Daniel Bekele"
+			},
+			{
+				"family": "Gietaneh",
+				"given": "Wodaje"
+			},
+			{
+				"family": "Birhan",
+				"given": "Tilahun Yemanu"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2021"
+				]
+			]
+		}
+	},
+	{
+		"id": "allen2012",
+		"type": "article-journal",
+		"abstract": "The basic reproduction number, ℛ(0), one of the most well-known thresholds in deterministic epidemic theory, predicts a disease outbreak if ℛ(0)>1. In stochastic epidemic theory, there are also thresholds that predict a major outbreak. In the case of a single infectious group, if ℛ(0)>1 and i infectious individuals are introduced into a susceptible population, then the probability of a major outbreak is approximately 1-(1/ℛ(0))( i ). With multiple infectious groups from which the disease could emerge, this result no longer holds. Stochastic thresholds for multiple groups depend on the number of individuals within each group, i ( j ), j=1, \\ldots, n, and on the probability of disease extinction for each group, q ( j ). It follows from multitype branching processes that the probability of a major outbreak is approximately [Formula: see text]. In this investigation, we summarize some of the deterministic and stochastic threshold theory, illustrate how to calculate the stochastic thresholds, and derive some new relationships between the deterministic and stochastic thresholds.",
+		"container-title": "Journal of Biological Dynamics",
+		"DOI": "10.1080/17513758.2012.665502",
+		"ISSN": "17513758",
+		"issue": "2",
+		"page": "590–611",
+		"title": "Extinction thresholds in deterministic and stochastic epidemic models",
+		"volume": "6",
+		"author": [
+			{
+				"family": "Allen",
+				"given": "Linda J.S."
+			},
+			{
+				"family": "Lahodny",
+				"given": "Glenn E."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2012"
+				]
+			]
+		}
+	},
+	{
+		"id": "blumberg2013",
+		"type": "article-journal",
+		"abstract": "Many diseases exhibit subcritical transmission (i.e. 0<R0<1) so that infections occur as self-limited 'stuttering chains'. Given an ensemble of stuttering chains, information about the number of cases in each chain can be used to infer R0, which is of crucial importance for monitoring the risk that a disease will emerge to establish endemic circulation. However, the challenge of imperfect case detection has led authors to adopt a variety of work-around measures when inferring R0, such as discarding data on isolated cases or aggregating intermediate-sized chains together. Each of these methods has the potential to introduce bias, but a quantitative comparison of these approaches has not been reported. By adapting a model based on a negative binomial offspring distribution that permits a variable degree of transmission heterogeneity, we present a unified analysis of existing R0 estimation methods. Simulation studies show that the degree of transmission heterogeneity, when improperly modeled, can significantly impact the bias of R0 estimation methods designed for imperfect observation. These studies also highlight the importance of isolated cases in assessing whether an estimation technique is consistent with observed data. Analysis of data from measles outbreaks shows that likelihood scores are highest for models that allow a flexible degree of transmission heterogeneity. Aggregating intermediate sized chains often has similar performance to analyzing a complete chain size distribution. However, truncating isolated cases is beneficial only when surveillance systems clearly favor full observation of large chains but not small chains. Meanwhile, if data on the type and proportion of cases that are unobserved were known, we demonstrate that maximum likelihood inference of R0 could be adjusted accordingly. This motivates the need for future empirical and theoretical work to quantify observation error and incorporate relevant mechanisms into stuttering chain models used to estimate transmission parameters. © 2013 Elsevier B.V.",
+		"container-title": "Epidemics",
+		"DOI": "10.1016/j.epidem.2013.05.002",
+		"ISSN": "17554365",
+		"issue": "3",
+		"note": "publisher: Elsevier B.V.\nPMID: 24021520",
+		"page": "131–145",
+		"title": "Comparing methods for estimating R0 from the size distribution of subcritical transmission chains",
+		"URL": "http://dx.doi.org/10.1016/j.epidem.2013.05.002",
+		"volume": "5",
+		"author": [
+			{
+				"family": "Blumberg",
+				"given": "S."
+			},
+			{
+				"family": "Lloyd-Smith",
+				"given": "J. O."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2013"
+				]
+			]
+		}
+	},
+	{
+		"id": "blumberg2013a",
+		"type": "article-journal",
+		"abstract": "For many infectious disease processes such as emerging zoonoses and vaccine-preventable diseases, 0<R0<1 and infections occur as self-limited stuttering transmission chains. A mechanistic understanding of transmission is essential for characterizing the risk of emerging diseases and monitoring spatio-temporal dynamics. Thus methods for inferring R0 and the degree of heterogeneity in transmission from stuttering chain data have important applications in disease surveillance and management. Previous researchers have used chain size distributions to infer R0, but estimation of the degree of individual-level variation in infectiousness (as quantified by the dispersion parameter, k) has typically required contact tracing data. Utilizing branching process theory along with a negative binomial offspring distribution, we demonstrate how maximum likelihood estimation can be applied to chain size data to infer both R0 and the dispersion parameter that characterizes heterogeneity. While the maximum likelihood value for R0 is a simple function of the average chain size, the associated confidence intervals are dependent on the inferred degree of transmission heterogeneity. As demonstrated for monkeypox data from the Democratic Republic of Congo, this impacts when a statistically significant change in R0 is detectable. In addition, by allowing for superspreading events, inference of k shifts the threshold above which a transmission chain should be considered anomalously large for a given value of R0 (thus reducing the probability of false alarms about pathogen adaptation). Our analysis of monkeypox also clarifies the various ways that imperfect observation can impact inference of transmission parameters, and highlights the need to quantitatively evaluate whether observation is likely to significantly bias results.",
+		"container-title": "PLoS Computational Biology",
+		"DOI": "10.1371/journal.pcbi.1002993",
+		"ISSN": "15537358",
+		"issue": "5",
+		"note": "PMID: 23658504",
+		"page": "1–17",
+		"title": "Inference of R0 and Transmission Heterogeneity from the Size Distribution of Stuttering Chains",
+		"volume": "9",
+		"author": [
+			{
+				"family": "Blumberg",
+				"given": "Seth"
+			},
+			{
+				"family": "Lloyd-Smith",
+				"given": "James O."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2013"
+				]
+			]
+		}
+	},
+	{
+		"id": "chen2022",
+		"type": "article-journal",
+		"abstract": "The generation time distribution, reflecting the time between successive infections in transmission chains, is a key epidemiological parameter for describing COVID-19 transmission dynamics. However, because exact infection times are rarely known, it is often approximated by the serial interval distribution. This approximation holds under the assumption that infectors and infectees share the same incubation period distribution, which may not always be true. We estimated incubation period and serial interval distributions using 629 transmission pairs reconstructed by investigating 2989 confirmed cases in China in January-February 2020, and developed an inferential framework to estimate the generation time distribution that accounts for variation over time due to changes in epidemiology, sampling biases and public health and social measures. We identified substantial reductions over time in the serial interval and generation time distributions. Our proposed method provides more reliable estimation of the temporal variation in the generation time distribution, improving assessment of transmission dynamics.",
+		"container-title": "Nature Communications",
+		"DOI": "10.1038/s41467-022-35496-8",
+		"ISSN": "20411723",
+		"issue": "1",
+		"note": "publisher: Springer US",
+		"title": "Inferring time-varying generation time, serial interval, and incubation period distributions for COVID-19",
+		"volume": "13",
+		"author": [
+			{
+				"family": "Chen",
+				"given": "Dongxuan"
+			},
+			{
+				"family": "Lau",
+				"given": "Yiu Chung"
+			},
+			{
+				"family": "Xu",
+				"given": "Xiao Ke"
+			},
+			{
+				"family": "Wang",
+				"given": "Lin"
+			},
+			{
+				"family": "Du",
+				"given": "Zhanwei"
+			},
+			{
+				"family": "Tsang",
+				"given": "Tim K."
+			},
+			{
+				"family": "Wu",
+				"given": "Peng"
+			},
+			{
+				"family": "Lau",
+				"given": "Eric H.Y."
+			},
+			{
+				"family": "Wallinga",
+				"given": "Jacco"
+			},
+			{
+				"family": "Cowling",
+				"given": "Benjamin J."
+			},
+			{
+				"family": "Ali",
+				"given": "Sheikh Taslim"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2022"
+				]
+			]
+		}
+	},
+	{
+		"id": "farrington1999",
+		"type": "article-journal",
+		"abstract": "We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.",
+		"container-title": "Journal of Applied Probability",
+		"DOI": "10.1239/jap/1032374633",
+		"ISSN": "00219002",
+		"issue": "3",
+		"page": "771–779",
+		"title": "The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease",
+		"volume": "36",
+		"author": [
+			{
+				"family": "Farrington",
+				"given": "C. P."
+			},
+			{
+				"family": "Grant",
+				"given": "A. D."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"1999"
+				]
+			]
+		}
+	},
+	{
+		"id": "farrington2003",
+		"type": "article-journal",
+		"abstract": "Mass vaccination programmes aim to maintain the effective reproduction number R of an infection below unity. We describe methods for monitoring the value of R using surveillance data. The models are based on branching processes in which R is identified with the offspring mean. We derive unconditional likelihoods for the offspring mean using data on outbreak size and outbreak duration. We also discuss Bayesian methods, implemented by Metropolis-Hastings sampling. We investigate by simulation the validity of the models with respect to depletion of susceptibles and under-ascertainment of cases. The methods are illustrated using surveillance data on measles in the USA.",
+		"container-title": "Biostatistics (Oxford, England)",
+		"DOI": "10.1093/biostatistics/4.2.279",
+		"ISSN": "14654644",
+		"issue": "2",
+		"page": "279–295",
+		"title": "Branching process models for surveillance of infectious diseases controlled by mass vaccination.",
+		"volume": "4",
+		"author": [
+			{
+				"family": "Farrington",
+				"given": "C. P."
+			},
+			{
+				"family": "Kanaan",
+				"given": "M. N."
+			},
+			{
+				"family": "Gay",
+				"given": "N. J."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2003"
+				]
+			]
+		}
+	},
+	{
+		"id": "fine2003",
+		"type": "article-journal",
+		"abstract": "The interval between successive cases of an infectious disease is determined by the time from infection to infectiousness, the duration of infectiousness, the time from infection to disease onset (incubation period), the duration of any extra-human phase of the infectious agent, and the proportion clinically affected among infected individuals. The interval is important in the interpretation of infectious disease surveillance and trend data, in the identification of outbreaks, and in the optimization of quarantine and contact tracing. This paper discusses the properties of these intervals, as measured between transmission events or between clinical onsets of successive infected individuals, noting the determinants of their ranges and frequency distributions, the circumstances under which secondary cases may arise before primaries, and under which the infection transmission interval will be different from the interval between clinical onsets of successive cases. It discusses the derivation of interval distribution statistics from descriptive data given in standard textbooks, with illustrations from published data on outbreaks, households, and epidemiologic tracing. Finally, it discusses the implications of such measures for studies of secondary attack rates, for the persistence of infection in human communities, for outbreak response, and for elimination or eradication programs.",
+		"container-title": "American Journal of Epidemiology",
+		"DOI": "10.1093/aje/kwg251",
+		"ISSN": "00029262",
+		"issue": "11",
+		"note": "ISBN: 0002-9262 (Print) 0002-9262 (Linking)\nPMID: 14630599",
+		"page": "1039–1047",
+		"title": "The Interval between Successive Cases of an Infectious Disease",
+		"volume": "158",
+		"author": [
+			{
+				"family": "Fine",
+				"given": "Paul E.M."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2003"
+				]
+			]
+		}
+	},
+	{
+		"id": "grassly2006",
+		"type": "article-journal",
+		"abstract": "Seasonal change in the incidence of infectious diseases is a common phenomenon in both temperate and tropical climates. However, the mechanisms responsible for seasonal disease incidence, and the epidemiological consequences of seasonality, are poorly understood with rare exception. Standard epidemiological theory and concepts such as the basic reproductive number R 0 no longer apply, and the implications for interventions that themselves may be periodic, such as pulse vaccination, have not been formally examined. This paper examines the causes and consequences of seasonality, and in so doing derives several new results concerning vaccination strategy and the interpretation of disease outbreak data. It begins with a brief review of published scientific studies in support of different causes of seasonality in infectious diseases of humans, identifying four principal mechanisms and their association with different routes of transmission. It then describes the consequences of seasonality for R 0 , disease outbreaks, endemic dynamics and persistence. Finally, a mathematical analysis of routine and pulse vaccination programmes for seasonal infections is presented. The synthesis of seasonal infectious disease epidemiology attempted by this paper highlights the need for further empirical and theoretical work. © 2006 The Royal Society.",
+		"container-title": "Proceedings of the Royal Society B: Biological Sciences",
+		"DOI": "10.1098/rspb.2006.3604",
+		"ISSN": "14712970",
+		"issue": "1600",
+		"page": "2541–2550",
+		"title": "Seasonal infectious disease epidemiology",
+		"volume": "273",
+		"author": [
+			{
+				"family": "Grassly",
+				"given": "Nicholas C."
+			},
+			{
+				"family": "Fraser",
+				"given": "Christophe"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2006"
+				]
+			]
+		}
+	},
+	{
+		"id": "griffin2020",
+		"type": "article-journal",
+		"abstract": "The serial interval is the time between symptom onsets in an infector-infectee pair. The generation time, also known as the generation interval, is the time between infection events in an infector-infectee pair. The serial interval and the generation time are key parameters for assessing the dynamics of a disease. A number of scientific papers reported information pertaining to the serial interval and/or generation time for COVID-19. Objective Conduct a review of available evidence to advise on appropriate parameter values for serial interval and generation time in national COVID-19 transmission models for Ireland and on methodological issues relating to those parameters. Methods We conducted a rapid review of the literature covering the period 1 January 2020 and 21 August 2020, following predefined eligibility criteria. Forty scientific papers met our inclusion criteria and were included in the review. Results The mean of the serial interval ranged from 3.03 to 7.6 days, based on 38 estimates, and the median from 1.0 to 6.0 days (based on 15 estimates). Only three estimates were provided for the mean of the generation time. These ranged from 3.95 to 5.20 days. One estimate of 5.0 days was provided for the median of the generation time. Discussion Estimates of the serial interval and the generation time are very dependent on the specific factors that apply at the time that the data are collected, including the level of social contact. Consequently, the estimates may not be entirely relevant to other environments. Therefore, local estimates should be obtained as soon as possible. Careful consideration should be given to the methodology that is used. Real-time estimations of the serial interval/generation time, allowing for variations over time, may provide more accurate estimates of reproduction numbers than using conventionally fixed serial interval/generation time distributions.",
+		"container-title": "BMJ Open",
+		"DOI": "10.1136/bmjopen-2020-040263",
+		"ISSN": "20446055",
+		"issue": "11",
+		"note": "ISBN: 9789241512763\nPMID: 33234640",
+		"page": "1–9",
+		"title": "Rapid review of available evidence on the serial interval and generation time of COVID-19",
+		"volume": "10",
+		"author": [
+			{
+				"family": "Griffin",
+				"given": "John"
+			},
+			{
+				"family": "Casey",
+				"given": "Miriam"
+			},
+			{
+				"family": "Collins",
+				"given": "Áine"
+			},
+			{
+				"family": "Hunt",
+				"given": "Kevin"
+			},
+			{
+				"family": "McEvoy",
+				"given": "David"
+			},
+			{
+				"family": "Byrne",
+				"given": "Andrew"
+			},
+			{
+				"family": "McAloon",
+				"given": "Conor"
+			},
+			{
+				"family": "Barber",
+				"given": "Ann"
+			},
+			{
+				"family": "Lane",
+				"given": "Elizabeth Ann"
+			},
+			{
+				"family": "More",
+				"given": "Simon"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "jacob2010",
+		"type": "article-journal",
+		"abstract": "Branching processes are stochastic individual-based processes leading consequently to a bottom-up approach. In addition, since the state variables are random integer variables (representing population sizes), the extinction occurs at random finite time on the extinction set, thus leading to fine and realistic predictions. Starting from the simplest and well-known single-type Bienaymé-Galton-Watson branching process that was used by several authors for approximating the beginning of an epidemic, we then present a general branching model with age and population dependent individual transitions. However contrary to the classical Bienaymé-Galton-Watson or asymptotically Bienaymé-Galton-Watson setting, where the asymptotic behavior of the process, as time tends to infinity, is well understood, the asymptotic behavior of this general process is a new question. Here we give some solutions for dealing with this problem depending on whether the initial population size is large or small, and whether the disease is rare or non-rare when the initial population size is large.",
+		"container-title": "International Journal of Environmental Research and Public Health",
+		"DOI": "10.3390/ijerph7031204",
+		"ISSN": "16604601",
+		"issue": "3",
+		"page": "1186–1204",
+		"title": "Branching processes: Their role in epidemiology",
+		"volume": "7",
+		"author": [
+			{
+				"family": "Jacob",
+				"given": "Christine"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2010"
+				]
+			]
+		}
+	},
+	{
+		"id": "lehtinen2021",
+		"type": "article-journal",
+		"abstract": "The timing of transmission plays a key role in the dynamics and controllability of an epidemic. However, observing generation times - the time interval between the infection of an infector and an infectee in a transmission pair - requires data on infection times, which are generally unknown. The timing of symptom onset is more easily observed; generation times are therefore often estimated based on serial intervals - the time interval between symptom onset of an infector and an infectee. This estimation follows one of two approaches: (i) approximating the generation time distribution by the serial interval distribution or (ii) deriving the generation time distribution from the serial interval and incubation period - the time interval between infection and symptom onset in a single individual - distributions. These two approaches make different - and not always explicitly stated - assumptions about the relationship between infectiousness and symptoms, resulting in different generation time distributions with the same mean but unequal variances. Here, we clarify the assumptions that each approach makes and show that neither set of assumptions is plausible for most pathogens. However, the variances of the generation time distribution derived under each assumption can reasonably be considered as upper (approximation with serial interval) and lower (derivation from serial interval) bounds. Thus, we suggest a pragmatic solution is to use both approaches and treat these as edge cases in downstream analysis. We discuss the impact of the variance of the generation time distribution on the controllability of an epidemic through strategies based on contact tracing, and we show that underestimating this variance is likely to overestimate controllability.",
+		"container-title": "Journal of the Royal Society Interface",
+		"DOI": "10.1098/rsif.2020.0756",
+		"ISSN": "17425662",
+		"issue": "174",
+		"note": "PMID: 33402022",
+		"title": "On the relationship between serial interval, infectiousness profile and generation time: On the relationship between serial interval, infectiousness profile and generation time",
+		"volume": "18",
+		"author": [
+			{
+				"family": "Lehtinen",
+				"given": "Sonja"
+			},
+			{
+				"family": "Ashcroft",
+				"given": "Peter"
+			},
+			{
+				"family": "Bonhoeffer",
+				"given": "Sebastian"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2021"
+				]
+			]
+		}
+	},
+	{
+		"id": "limpert2001",
+		"type": "article-journal",
+		"abstract": "On the charms of statistics, and how mechanical models resembling gambling machines offer a link to a handy way to characterize log-normal distributions, which can provide deeper insight into variability and probability - Normal or log-normal: That is the question.",
+		"container-title": "BioScience",
+		"DOI": "10.1641/0006-3568(2001)051[0341:LNDATS]2.0.CO;2",
+		"ISSN": "00063568",
+		"issue": "5",
+		"page": "341–352",
+		"title": "Log-normal distributions across the sciences: Keys and clues",
+		"volume": "51",
+		"author": [
+			{
+				"family": "Limpert",
+				"given": "Eckhard"
+			},
+			{
+				"family": "Stahel",
+				"given": "Werner A."
+			},
+			{
+				"family": "Abbt",
+				"given": "Markus"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2001"
+				]
+			]
+		}
+	},
+	{
+		"id": "lloyd-smith2005",
+		"type": "article-journal",
+		"abstract": "Population-level analyses often use average quantities to describe heterogeneous systems, particularly when variation does not arise from identifiable groups. A prominent example, central to our current understanding of epidemic spread, is the basic reproductive number, R0, which is defined as the mean number of infections caused by an infected individual in a susceptible population. Population estimates of R0 can obscure considerable individual variation in infectiousness, as highlighted during the global emergence of severe acute respiratory syndrome (SARS) by numerous 'superspreading events' in which certain individuals infected unusually large numbers of secondary cases. For diseases transmitted by non-sexual direct contacts, such as SARS or smallpox, individual variation is difficult to measure empirically, and thus its importance for outbreak dynamics has been unclear. Here we present an integrated theoretical and statistical analysis of the influence of individual variation in infectiousness on disease emergence. Using contact tracing data from eight directly transmitted diseases, we show that the distribution of individual infectiousness around R0 is often highly skewed. Model predictions accounting for this variation differ sharply from average-based approaches, with disease extinction more likely and outbreaks rarer but more explosive. Using these models, we explore implications for outbreak control, showing that individual-specific control measures outperform population-wide measures. Moreover, the dramatic improvements achieved through targeted control policies emphasize the need to identify predictive correlates of higher infectiousness. Our findings indicate that superspreading is a normal feature of disease spread, and to frame ongoing discussion we propose a rigorous definition for superspreading events and a method to predict their frequency. © 2005 Nature Publishing Group.",
+		"container-title": "Nature",
+		"DOI": "10.1038/nature04153",
+		"ISSN": "14764687",
+		"issue": "7066",
+		"note": "PMID: 16292310",
+		"page": "355–359",
+		"title": "Superspreading and the effect of individual variation on disease emergence",
+		"volume": "438",
+		"author": [
+			{
+				"family": "Lloyd-Smith",
+				"given": "J. O."
+			},
+			{
+				"family": "Schreiber",
+				"given": "S. J."
+			},
+			{
+				"family": "Kopp",
+				"given": "P. E."
+			},
+			{
+				"family": "Getz",
+				"given": "W. M."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2005"
+				]
+			]
+		}
+	},
+	{
+		"id": "marivate2020",
+		"type": "article-journal",
+		"container-title": "arXiv preprint arXiv:2004.04813",
+		"title": "Use of available data to inform the COVID-19 outbreak in South Africa: a case study",
+		"author": [
+			{
+				"family": "Marivate",
+				"given": "Vukosi"
+			},
+			{
+				"family": "Combrink",
+				"given": "Herkulaas MvE"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "nishiura2007",
+		"type": "article-journal",
+		"abstract": "The incubation period of infectious diseases, the time from infection with a microorganism to onset of disease, is directly relevant to prevention and control. Since explicit models of the incubation period enhance our understanding of the spread of disease, previous classic studies were revisited, focusing on the modeling methods employed and paying particular attention to relatively unknown historical efforts. The earliest study on the incubation period of pandemic influenza was published in 1919, providing estimates of the incubation period of Spanish flu using the daily incidence on ships departing from several ports in Australia. Although the study explicitly dealt with an unknown time of exposure, the assumed periods of exposure, which had an equal probability of infection, were too long, and thus, likely resulted in slight underestimates of the incubation period. After the suggestion that the incubation period follows lognormal distribution, Japanese epidemiologists extended this assumption to estimates of the time of exposure during a point source outbreak. Although the reason why the incubation period of acute infectious diseases tends to reveal a right-skewed distribution has been explored several times, the validity of the lognormal assumption is yet to be fully clarified. At present, various different distributions are assumed, and the lack of validity in assuming lognormal distribution is particularly apparent in the case of slowly progressing diseases. The present paper indicates that (1) analysis using well-defined short periods of exposure with appropriate statistical methods is critical when the exact time of exposure is unknown, and (2) when assuming a specific distribution for the incubation period, comparisons using different distributions are needed in addition to estimations using different datasets, analyses of the determinants of incubation period, and an understanding of the underlying disease mechanisms. © 2007 Nishiura; licensee BioMed Central Ltd.",
+		"container-title": "Emerging Themes in Epidemiology",
+		"DOI": "10.1186/1742-7622-4-2",
+		"ISSN": "17427622",
+		"page": "1–12",
+		"title": "Early efforts in modeling the incubation period of infectious diseases with an acute course of illness",
+		"volume": "4",
+		"author": [
+			{
+				"family": "Nishiura",
+				"given": "Hiroshi"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2007"
+				]
+			]
+		}
+	},
+	{
+		"id": "nishiura2012",
+		"type": "article-journal",
+		"abstract": "Use of the final size distribution of minor outbreaks for the estimation of the reproduction numbers of supercritical epidemic processes has yet to be considered. We used a branching process model to derive the final size distribution of minor outbreaks, assuming a reproduction number above unity, and applying the method to final size data for pneumonic plague. Pneumonic plague is a rare disease with only one documented major epidemic in a spatially limited setting. Because the final size distribution of a minor outbreak needs to be normalized by the probability of extinction, we assume that the dispersion parameter (k) of the negative-binomial offspring distribution is known, and examine the sensitivity of the reproduction number to variation in dispersion. Assuming a geometric offspring distribution with k=1, the reproduction number was estimated at 1.16 (95% confidence interval: 0.97-1.38). When less dispersed with k=2, the maximum likelihood estimate of the reproduction number was 1.14. These estimates agreed with those published from transmission network analysis, indicating that the human-to-human transmission potential of the pneumonic plague is not very high. Given only minor outbreaks, transmission potential is not sufficiently assessed by directly counting the number of offspring. Since the absence of a major epidemic does not guarantee a subcritical process, the proposed method allows us to conservatively regard epidemic data from minor outbreaks as supercritical, and yield estimates of threshold values above unity. © 2011.",
+		"container-title": "Journal of Theoretical Biology",
+		"DOI": "10.1016/j.jtbi.2011.10.039",
+		"ISSN": "00225193",
+		"note": "publisher: Elsevier\nPMID: 22079419",
+		"page": "48–55",
+		"title": "Estimating the transmission potential of supercritical processes based on the final size distribution of minor outbreaks",
+		"URL": "http://dx.doi.org/10.1016/j.jtbi.2011.10.039",
+		"volume": "294",
+		"author": [
+			{
+				"family": "Nishiura",
+				"given": "Hiroshi"
+			},
+			{
+				"family": "Yan",
+				"given": "Ping"
+			},
+			{
+				"family": "Sleeman",
+				"given": "Candace K."
+			},
+			{
+				"family": "Mode",
+				"given": "Charles J."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2012"
+				]
+			]
+		}
+	},
+	{
+		"id": "pearson2020",
+		"type": "article-journal",
+		"abstract": "For 45 African countries/territories already reporting COVID-19 cases before 23 March 2020, we estimate the dates of reporting 1,000 and 10,000 cases. Assuming early epidemic trends without interventions, all 45 were likely to exceed 1,000 confirmed cases by the end of April 2020, with most exceeding 10,000 a few weeks later.",
+		"container-title": "Eurosurveillance",
+		"DOI": "10.2807/1560-7917.ES.2020.25.18.2000543",
+		"ISSN": "15607917",
+		"issue": "18",
+		"note": "publisher: European Centre for Disease Prevention and Control (ECDC)\nPMID: 32400361",
+		"page": "1–6",
+		"title": "Projected early spread of COVID-19 in Africa through 1 June 2020",
+		"URL": "http://dx.doi.org/10.2807/1560-7917.ES.2020.25.18.2000543",
+		"volume": "25",
+		"author": [
+			{
+				"family": "Pearson",
+				"given": "Carl A.B."
+			},
+			{
+				"family": "Schalkwyk",
+				"given": "Cari",
+				"non-dropping-particle": "van"
+			},
+			{
+				"family": "Foss",
+				"given": "Anna M."
+			},
+			{
+				"family": "O'Reilly",
+				"given": "Kathleen M."
+			},
+			{
+				"family": "Pulliam",
+				"given": "Juliet R.C."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "becker1977",
+		"type": "article-journal",
+		"container-title": "Biometrics",
+		"ISSN": "0006-341X",
+		"issue": "3",
+		"note": "publisher: JSTOR",
+		"page": "515–522",
+		"title": "Estimation for discrete time branching processes with application to epidemics",
+		"volume": "33",
+		"author": [
+			{
+				"family": "Becker",
+				"given": "Niels"
+			},
+			{
+				"family": "Society",
+				"given": "International Biometric"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"1977"
+				]
+			]
+		}
+	},
+	{
+		"id": "wang2020",
+		"type": "article-journal",
+		"abstract": "Coronavirus disease 2019 (COVID-19) was first identified in late 2019 in Wuhan, Hubei Province, China and spread globally in months, sparking worldwide concern. However, it is unclear whether super-spreading events occurred during the early outbreak phase, as has been observed for other emerging viruses. Here, we analyse 208 publicly available SARS-CoV-2 genome sequences collected during the early outbreak phase. We combine phylogenetic analysis with Bayesian inference under an epidemiological model to trace person-to-person transmission. The dispersion parameter of the offspring distribution in the inferred transmission chain was estimated to be 0.23 (95% CI: 0.13–0.38), indicating there are individuals who directly infected a disproportionately large number of people. Our results showed that super-spreading events played an important role in the early stage of the COVID-19 outbreak.",
+		"container-title": "Nature Communications",
+		"DOI": "10.1038/s41467-020-18836-4",
+		"ISSN": "20411723",
+		"issue": "1",
+		"note": "publisher: Springer US\nPMID: 33024095",
+		"page": "1–6",
+		"title": "Inference of person-to-person transmission of COVID-19 reveals hidden super-spreading events during the early outbreak phase",
+		"URL": "http://dx.doi.org/10.1038/s41467-020-18836-4",
+		"volume": "11",
+		"author": [
+			{
+				"family": "Wang",
+				"given": "Liang"
+			},
+			{
+				"family": "Didelot",
+				"given": "Xavier"
+			},
+			{
+				"family": "Yang",
+				"given": "Jing"
+			},
+			{
+				"family": "Wong",
+				"given": "Gary"
+			},
+			{
+				"family": "Shi",
+				"given": "Yi"
+			},
+			{
+				"family": "Liu",
+				"given": "Wenjun"
+			},
+			{
+				"family": "Gao",
+				"given": "George F."
+			},
+			{
+				"family": "Bi",
+				"given": "Yuhai"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "yadav2021",
+		"type": "article-journal",
+		"abstract": "In this review, we have discussed the different statistical modeling and prediction techniques for various infectious diseases including the recent pandemic of COVID-19. The distribution fitting, time series modeling along with predictive monitoring approaches, and epidemiological modeling are illustrated. When the epidemiology data is sufficient to fit with the required sample size, the normal distribution in general or other theoretical distributions are fitted and the best-fitted distribution is chosen for the prediction of the spread of the disease. The infectious diseases develop over time and we have data on the single variable that is the number of infections that happened, therefore, time series models are fitted and the prediction is done based on the best-fitted model. Monitoring approaches may also be applied to time series models which could estimate the parameters more precisely. In epidemiological modeling, more biological parameters are incorporated in the models and the forecasting of the disease spread is carried out. We came up with, how to improve the existing modeling methods, the use of fuzzy variables, and detection of fraud in the available data. Ultimately, we have reviewed the results of recent statistical modeling efforts to predict the course of COVID-19 spread.",
+		"container-title": "Frontiers in Public Health",
+		"DOI": "10.3389/fpubh.2021.645405",
+		"ISSN": "22962565",
+		"issue": "June",
+		"note": "PMID: 34222166",
+		"page": "1–27",
+		"title": "Statistical Modeling for the Prediction of Infectious Disease Dissemination With Special Reference to COVID-19 Spread",
+		"volume": "9",
+		"author": [
+			{
+				"family": "Yadav",
+				"given": "Subhash Kumar"
+			},
+			{
+				"family": "Akhter",
+				"given": "Yusuf"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2021"
+				]
+			]
+		}
+	}
+]
\ No newline at end of file

From 41f959fedca6074d83145c03324d67f63f1d5200 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 6 Jun 2023 20:13:08 +0100
Subject: [PATCH 365/828] Added print and summary methods for epichains class

---
 R/epichains.R | 201 +++++++++-----------------------------------------
 1 file changed, 35 insertions(+), 166 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index caa4bbd8..62920f32 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -1,9 +1,3 @@
-#' Print an [`epichains`] object
-#'
-#' @param x An [`epichains`] object.
-#' @param ... Other parameters passed to [print()].
-#' @return Invisibly returns an [`epichains`]. Called for side-effects.
-#' @export
 print.epichains <- function(x, ...) {
   format(x, ...)
 }
@@ -12,35 +6,21 @@ print.epichains <- function(x, ...) {
 #'
 #' @param x epichains object
 #' @param ... further arguments passed to or from other methods
+#' @importFrom tibble as_tibble
 #' @return Invisibly returns an [`epichains`]. Called for printing side-effects.
 #' @export
+#'
+#' @examples
 format.epichains <- function(x, ...) {
-  # check that x is an epichains object
-  validate_epichains(x)
-
-  # summarise the information stored in x
   chain_info <- summary(x)
-
   if (attributes(x)$chain_type == "chains_tree") {
+    cat("head starting from first known ancestor \n")
+    print(tibble::as_tibble(head(subset(x, !is.na(ancestor)))))
+    cat("--- \n")
+    print(tail(tibble::as_tibble(x)))
     writeLines(
       c(
-        sprintf("`epichains` object"),
-
-        "< tree head (from first known ancestor) >\n"
-        )
-      )
-
-    # print head of the simulation output
-    print(head(x[!is.na(x$ancestor), ]))
-
-    cat("< tree tail >\n")
-
-    # print tail of object
-    print(tail(as.data.frame(x)))
-
-    # print summary information
-    writeLines(
-      c(
+        sprintf("`epichains` `chains_tree` object"),
         sprintf("Chains simulated: %s", chain_info[["chains"]]),
         sprintf(
           "Unique number of ancestors: %s",
@@ -51,10 +31,8 @@ format.epichains <- function(x, ...) {
         )
       )
     )
-
-    # Offer more information to view the full dataset
     writeLines(sprintf("Use View(<object_name>) to view the full output."))
-
+    invisible(x)
   } else if (attributes(x)$chain_type == "chains_vec") {
     cat(sprintf("epichains object \n"))
     print(as.vector(x))
@@ -64,41 +42,38 @@ format.epichains <- function(x, ...) {
         )
     writeLines(
       c(
-        "\n Simulated chain stats: \n",
+        cat("\n Simulated chain stats: \n"),
         sprintf("Max: %s", chain_info[["max_chain_stat"]]),
         sprintf("Min: %s", chain_info[["min_chain_stat"]])
       )
     )
   }
-
-  invisible(x)
 }
 
 
 #' Summary method for epichains class
 #'
-#' @param object An [`epichains`] object
+#' @param object epichains object
 #' @param ... further arguments passed to or from other methods
 #'
 #' @return data frame of information
 #' @export
-summary.epichains <- function(object, ...) {
-  validate_epichains(object)
-
-  if (attributes(object)$chain_type == "chains_tree") {
+#'
+#' @examples
+summary.epichains <- function(x, ...) {
+  if (attributes(x)$chain_type == "chains_tree") {
+    is_epichains(x)
 
-    chains_ran <- length(object$n)
+    chains_ran <- length(x$n)
 
-    max_time <- max(object$time)
+    max_time <- max(x$time)
 
     n_unique_ancestors <- length(
-      unique(object$ancestor[!is.na(object$ancestor)])
+      unique(x$ancestor[!is.na(x$ancestor)])
     )
 
-    num_generations <- length(unique(object$generation))
-
-    max_generation <- max(object$generation)
+    num_generations <- length(unique(x$generations))
 
     # out of summary
     res <- list(
@@ -106,13 +81,13 @@ summary.epichains <- function(object, ...) {
       max_time = max_time,
       unique_ancestors = n_unique_ancestors,
       unique_generations = n_unique_ancestors,
-      num_generations = num_generations,
-      max_generation = max_generation
+      num_generations = num_generations
+      # WIP
     )
-  } else if (attributes(object)$chain_type == "chains_vec") {
-    chains_ran <- length(object)
-    max_chain_stat <- max(!is.infinite(object))
-    min_chain_stat <- min(!is.infinite(object))
+  } else if (attributes(x)$chain_type == "chains_vec") {
+    chains_ran <- length(x)
+    max_chain_stat <- max(!is.infinite(x))
+    min_chain_stat <- min(!is.infinite(x))
 
     res <- list(
       unique_chains = chains_ran,
@@ -130,30 +105,19 @@ summary.epichains <- function(object, ...) {
 #'
 #' @return logical, `TRUE` if the object is an `epichains` and `FALSE`
 #' otherwise
-#' @keywords internal
+#' @export
+#'
+#' @examples
 is_epichains <- function(x) {
   inherits(x, "epichains")
 }
 
-#' Check if an object is of class "epichains_aggregate_df"
-#'
-#' @param x An [`epichains`] object
-#'
-#' @keywords internal
-is_epichains_aggregate_df <- function(x) {
-  if (!inherits(x, "epichains_aggregate_df")) {
-    stop("Object must have class 'epichains_aggregate_df'")
-  }
-}
-
 #' `epichains` class validator
 #'
 #' @param x An `epichains` object
 #'
 #' @return Checks if an object is of class `epichains` and if so
 #' checks that it's in the right format as a "data.frame" or vector.
-#' @keywords internal
-#' @author James M. Azam
 validate_epichains <- function(x) {
   if (!is_epichains(x)) {
     stop("Object must have an epichains class")
@@ -161,13 +125,15 @@ validate_epichains <- function(x) {
 
   # check for class invariants
 
-  if (attributes(x)$chain_type == "chains_tree") {
+  if (attributes(x)$is_tree) {
     stopifnot(
       "object does not contain the correct columns" =
-        c("sim_id", "ancestor", "generation", "time") %in%
+        c("n", "id", "ancestor", "generation", "time") %in%
           colnames(x),
-      "column `sim_id` must be a numeric" =
-        is.numeric(x$sim_id),
+      "column `n` must be a numeric" =
+        is.numeric(x$n),
+      "column `id` must be a numeric" =
+        is.numeric(x$id),
       "column `ancestor` must be a numeric" =
         is.numeric(x$ancestor),
       "column `generation` must be a numeric" =
@@ -184,100 +150,3 @@ validate_epichains <- function(x) {
 
   invisible(x)
 }
-
-#' `head` method for [`epichains`] class
-#'
-#' @param x An [`epichains`] object
-#' @param ... further arguments passed to or from other methods
-#' @importFrom utils head
-#' @return object of class `data.frame`
-#' @author James M. Azam
-#' @export
-head.epichains <- function(x, ...) {
-  utils::head(as.data.frame(x), ...)
-}
-
-#' `tail` method for [`epichains`] class
-#' @param x An [`epichains`] object
-#' @param ... further arguments passed to or from other methods
-#' @importFrom utils tail
-#' @author James M. Azam
-#' @export
-tail.epichains <- function(x, ...) {
-  utils::tail(as.data.frame(x), ...)
-}
-
-#' Aggregate cases in epichains objects according to a grouping variable
-#'
-#' @param x An [`epichains`] object.
-#' @param grouping_var The variable to group and count over. Options include
-#' "time", "generation", and "both".
-#' @param ... Other arguments passed to aggregate.
-#' @importFrom stats aggregate
-#' @return If grouping_var is either "time" or "generation", a data.frame
-#' with cases aggregated over `grouping_var`; If
-#' \code{grouping_var = "both"}, a list of data.frames, the first being for
-#'  cases over time, and the second being for cases over generations.
-#' @export
-#'
-#' @examples
-#' set.seed(123)
-#' chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,
-#' offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
-#' chains
-#'
-#' # Aggregate cases per time
-#' aggregate(chains, grouping_var = "time")
-#'
-#' # Aggregate cases per generation
-#' aggregate(chains, grouping_var = "generation")
-#'
-#' # Aggregate cases per both time and generation
-#' aggregate(chains, grouping_var = "both")
-aggregate.epichains <- function(x,
-                                grouping_var = c("time",
-                                                 "generation",
-                                                 "both"
-                                                 ),
-                                ...) {
-  validate_epichains(x)
-  # Check that the object is of type "chains_tree"
-  if (attributes(x)$chain_type != "chains_tree") {
-    stop("object must be an epichains object with 'chains_tree' attribute.")
-  }
-
-  # Get grouping variable
-  grouping_var <- match.arg(grouping_var)
-
-  out <- if (grouping_var == "time") {
-    # Count the number of cases per generation
-    stats::aggregate(list(cases = x$sim_id),
-      list(time = x$time),
-      FUN = NROW
-    )
-  } else if (grouping_var == "generation") {
-    # Count the number of cases per time
-    stats::aggregate(list(cases = x$sim_id),
-      list(generation = x$generation),
-      FUN = NROW
-    )
-  } else if (grouping_var == "both") {
-    # Count the number of cases per time
-    list(
-      stats::aggregate(list(cases = x$sim_id),
-                       list(time = x$time),
-                       FUN = NROW),
-      # Count the number of cases per generation
-      stats::aggregate(list(cases = x$sim_id),
-                       list(generation = x$generation),
-                       FUN = NROW)
-    )
-  }
-
-  structure(out,
-    class = c("epichains_aggregate_df", class(out)),
-    chain_type = attributes(x)$chain_type,
-    rownames = NULL,
-    aggregated_over = grouping_var
-  )
-}

From 9a883477a5753e6f6be06561247593c737076dc0 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 6 Jun 2023 20:14:47 +0100
Subject: [PATCH 366/828] Added separate simulators for transmission chain
 trees and transmission chain vectors

---
 R/simulate.r | 351 +++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 351 insertions(+)

diff --git a/R/simulate.r b/R/simulate.r
index f0601fe3..3c16fd4b 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -441,3 +441,354 @@ simulate_tree_from_pop <- function(pop,
     class = c("epichains", "tbl", "data.frame")
   )
 }
+
+#' Simulate tree of infections
+#'
+#' @param nchains number of chains to simulate
+#' @param offspring_sampler Offspring distribution: a character string
+#' corresponding to the R distribution function (e.g., "pois" for Poisson,
+#' where \code{\link{rpois}} is the R function to generate Poisson random
+#' numbers)
+#' @param chain_statistic String; Statistic to calculate. Can be one of:
+#' \itemize{
+#'   \item "size": the total number of offspring.
+#'   \item "length": the total number of ancestors.
+#' }
+#' @param infinite A size or length above which the simulation results
+#' should be set to `Inf`. Defaults to `Inf`, resulting in no results
+#' ever set to `Inf`
+#' @param serials_sampler The serial interval generator function; the name of a
+#' user-defined named or anonymous function with only one argument `n`,
+#' representing the number of serial intervals to generate.
+#' @param t0 Start time (if serial interval is given); either a single value
+#' or a vector of same length as `nchains` (number of simulations) with
+#' initial times. Defaults to 0.
+#' @param tf End time (if serial interval is given).
+#' @param ... Parameters of the offspring distribution as required by R.
+#' @return an `epichains` object, which is basically a `data.frame` with
+#' columns `chain_id` (chain ID), `sim_id` (a unique ID within each simulation
+#' for each individual element of the chain), `ancestor`
+#' (the ID of the ancestor of each element), `generation`, and
+#' `time` (of infection)
+#' @author James M. Azam, Sebastian Funk
+#' @export
+#' @details
+#' `sim_chain_tree()` simulates a branching process of the form:
+#' WIP
+#' # The serial interval (`serials_sampler`):
+#'
+#' ## Assumptions/disambiguation
+#'
+#' In epidemiology, the generation interval is the duration between successive
+#' infectious events in a chain of transmission. Similarly, the serial
+#' interval is the duration between observed symptom onset times between
+#' successive cases in a transmission chain. The generation interval is
+#' often hard to observe because exact times of infection are hard to
+#' measure hence, the serial interval is often used instead . Here, we
+#' use the serial interval to represent what would normally be called the
+#' generation interval, that is, the time between successive cases.
+#'
+#' See References below for some literature on the subject.
+#'
+#' ## Specifying `serials_sampler` in `sim_chain_tree()`
+#'
+#' `serials_sampler` must be specified as a named or
+#' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R) # nolint
+#' with one argument.
+#'
+#' For example, assuming we want to specify the serial interval
+#' generator as a random log-normally distributed variable with
+#' `meanlog = 0.58` and `sdlog = 1.58`, we could define a named function,
+#' let's call it "serial_interval", with only one argument representing the
+#' number of serial intervals to sample:
+#' \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
+#' and assign the name of the function to `serials_sampler` in
+#' `sim_chain_tree()` like so
+#' \code{sim_chain_tree(..., serials_sampler = serial_interval)},
+#' where `...` are the other arguments to `sim_chain_tree()`.
+#'
+#' Alternatively, we could assign an anonymous function to `serials_sampler`
+#' in the `sim_chain_tree()` call like so
+#' \code{sim_chain_tree(..., serials_sampler = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
+#' where `...` are the other arguments to `sim_chain_tree()`.
+#' @seealso [sim_chain_vec()] for simulating transmission chains as a vector
+#' @examples
+#' set.seed(123)
+#' chains <- sim_chain_tree(nchains = 10, serials_sampler = function(x) 3,
+#' offspring = "pois", lambda = 2, infinite = 10)
+#' chains
+#' \references{Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
+#' between serial interval, infectiousness profile and generation time.
+#' J R Soc Interface. 2021 Jan;18(174):20200756.
+#' doi: 10.1098/rsif.2020.0756. Epub 2021 Jan 6.
+#' PMID: 33402022; PMCID: PMC7879757.
+#' }
+#'
+#' \references{Fine PE. The interval between successive cases of an
+#' infectious disease. Am J Epidemiol. 2003 Dec 1;158(11):1039-47.
+#' doi: 10.1093/aje/kwg251. PMID: 14630599.
+#' }
+sim_chain_tree <- function(nchains, offspring_sampler,
+                           chain_statistic = c("size", "length"),
+                           infinite = Inf, serials_sampler, t0 = 0,
+                           tf = Inf, ...) {
+  chain_statistic <- match.arg(chain_statistic)
+
+  check_nchains_valid(nchains = nchains)
+
+  # check that offspring is properly specified
+  check_offspring_valid(offspring_sampler)
+
+  # check that offspring function exists in base R
+  roffspring_name <- paste0("r", offspring_sampler)
+  check_offspring_func_valid(roffspring_name)
+
+  if (!missing(serials_sampler)) {
+    check_serial_valid(serials_sampler)
+  } else if (!missing(tf)) {
+    stop("If `tf` is specified, `serials_sampler` must be specified too.")
+  }
+
+  # Initialisations
+  stat_track <- rep(1, nchains) # track length or size (depending on `chain_statistic`) #nolint
+  n_offspring <- rep(1, nchains) # current number of offspring
+  sim <- seq_len(nchains) # track chains that are still being simulated
+  ancestor_ids <- rep(1, nchains) # all chains start in generation 1
+
+  # initialise data frame to hold the transmission trees
+  generation <- 1L
+  tdf <- data.frame(
+    n = seq_len(nchains),
+    id = 1L,
+    ancestor = NA_integer_,
+    generation = generation
+  )
+
+  if (!missing(serials_sampler)) {
+    tdf$time <- t0
+    times <- tdf$time
+  }
+
+  # next, simulate n chains
+  while (length(sim) > 0) {
+    # simulate next generation
+    next_gen <- get(roffspring_name)(n = sum(n_offspring[sim]), ...)
+    if (any(next_gen %% 1 > 0)) {
+      stop("Offspring distribution must return integers")
+    }
+
+    # record indices corresponding to the number of offspring
+    indices <- rep(sim, n_offspring[sim])
+
+    # initialise placeholder for the number of offspring
+    n_offspring <- rep(0, nchains)
+    # assign offspring sum to indices still being simulated
+    n_offspring[sim] <- tapply(next_gen, indices, sum)
+
+    # track size/length
+    stat_track <- update_chain_stat(stat_type = chain_statistic,
+                                    stat_latest = stat_track,
+                                    n_offspring = n_offspring)
+
+    # record times/ancestors
+    if (sum(n_offspring[sim]) > 0) {
+      ancestors <- rep(ancestor_ids, next_gen)
+      current_max_id <- unname(tapply(ancestor_ids, indices, max))
+      indices <- rep(sim, n_offspring[sim])
+
+      # create new ids
+      ids <- rep(current_max_id, n_offspring[sim]) +
+        unlist(lapply(n_offspring[sim], seq_len))
+
+      # increment the generation
+      generation <- generation + 1L
+
+      # store new simulation results
+      new_df <-
+        data.frame(
+          n = indices,
+          id = ids,
+          ancestor = ancestors,
+          generation = generation
+        )
+
+      # if a serial interval model/function was specified, use it
+      # to generate serial intervals for the cases
+      if (!missing(serials_sampler)) {
+        times <- rep(times, next_gen) + serials_sampler(sum(n_offspring))
+        current_min_time <- unname(tapply(times, indices, min))
+        new_df$time <- times
+      }
+      tdf <- rbind(tdf, new_df)
+    }
+
+    ## only continue to simulate chains that have offspring and aren't of
+    ## infinite size/length
+    sim <- which(n_offspring > 0 & stat_track < infinite)
+    if (length(sim) > 0) {
+      if (!missing(serials_sampler)) {
+        ## only continue to simulate chains that don't go beyond tf
+        sim <- intersect(sim, unique(indices)[current_min_time < tf])
+      }
+      if (!missing(serials_sampler)) {
+          times <- times[indices %in% sim]
+          }
+        ancestor_ids <- ids[indices %in% sim]
+    }
+    }
+
+  if (!missing(tf)) {
+    tdf <- tdf[tdf$time < tf, ]
+  }
+
+  structure(
+    tdf,
+    chain_type = "chains_tree",
+    chains = nchains,
+    rownames = NULL,
+    class = c("epichains", "tbl", "data.frame")
+  )
+}
+
+
+
+#' Simulate transmission chains without tree (as a vector)
+#'
+#' @inheritParams sim_chain_tree
+#'
+#' @examples #' sim_chain_vect(n = 10, offspring_sampler = "pois", lambda = 2,
+#' infinite = 10)
+sim_chain_vect <- function(nchains, offspring_sampler,
+                           chain_statistic = c("size", "length"),
+                           infinite = Inf, ...) {
+  chain_statistic <- match.arg(chain_statistic)
+
+  check_nchains_valid(nchains = nchains)
+
+  # check that offspring is properly specified
+  check_offspring_valid(offspring_sampler)
+
+  # check that offspring function exists in base R
+  roffspring_name <- paste0("r", offspring_sampler)
+  check_offspring_func_valid(roffspring_name)
+
+  # Initialisations
+  stat_track <- rep(1, nchains) ## track length or size (depending on `stat`)
+  n_offspring <- rep(1, nchains) ## current number of offspring
+  sim <- seq_len(nchains) ## track chains that are still being simulated
+
+  ## next, simulate nchains chains
+  while (length(sim) > 0) {
+    ## simulate next generation
+    next_gen <- get(roffspring_name)(n = sum(n_offspring[sim]), ...)
+    if (any(next_gen %% 1 > 0)) {
+      stop("Offspring distribution must return integers")
+    }
+
+    ## record indices corresponding to the number of offspring
+    indices <- rep(sim, n_offspring[sim])
+
+    ## initialise number of offspring
+    n_offspring <- rep(0, nchains)
+    ## assign offspring sum to indices still being simulated
+    n_offspring[sim] <- tapply(next_gen, indices, sum)
+
+    # track size/length
+    stat_track <- update_chain_stat(stat_type = chain_statistic,
+                                    stat_latest = stat_track,
+                                    n_offspring = n_offspring
+                                    )
+
+    ## only continue to simulate chains that offspring and aren't of
+    ## infinite size/length
+    sim <- which(n_offspring > 0 & stat_track < infinite)
+  }
+
+  stat_track[stat_track >= infinite] <- Inf
+
+  structure(
+    stat_track,
+    chain_type = "chains_vec",
+    chains = nchains,
+    class = c("epichains", class(stat_track))
+  )
+}
+
+
+#' Check if offspring argument is specified as a character string
+#'
+#' @param offspring
+#'
+#' @return
+#' @export
+#' @keywords internal
+#' @examples
+check_offspring_valid <- function(offspring) {
+  if (!is.character(offspring)) {
+    stop(sprintf(
+      "%s %s",
+      "'offspring' must be specified as a character string.",
+      "Did you forget to enclose it in quotes?"
+    ))
+  }
+}
+
+
+#' Check if constructed random number generator for offspring exists
+#'
+#' @param roffspring_name
+#'
+#' @return
+#' @export
+#'
+#' @examples check_offspring_exists("rpois")
+check_offspring_func_valid <- function(roffspring_name) {
+  if (!(exists(roffspring_name)) || !is.function(get(roffspring_name))) {
+    stop("Function ", roffspring_name, " does not exist.")
+  }
+}
+
+
+#' Check if the serials_sampler argument is specified as a function
+#'
+#' @param serials_sampler
+#'
+#' @return
+#' @export
+#' @keywords internal
+#' @examples
+check_serial_valid <- function(serials_sampler) {
+  if (!is.function(serials_sampler)) {
+    stop(sprintf(
+      "%s %s",
+      "The `serials_sampler` argument must be a function",
+      "(see details in ?sim_chain_tree)."
+    ))
+  }
+}
+
+
+check_nchains_valid <- function(nchains) {
+  if (nchains < 1 || is.infinite(nchains)) {
+    stop("`nchains` must be > 0 but less than `Inf`")
+  }
+}
+
+#' Determine and update the chain statistic being tracked
+#'
+#' @param stat_type
+#' @param noffspring
+#'
+#' @return
+#' @export
+#' @keywords internal
+#' @examples
+update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
+  if (stat_type == "size") {
+    stat_latest <- stat_latest + n_offspring
+  } else if (stat_type == "length") {
+    stat_latest <- stat_latest + pmin(1, n_offspring)
+  }
+
+  return(stat_latest)
+}

From 363f5af586ab9aa78f7d38e3dbcff253e7b02925 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 13:03:27 +0100
Subject: [PATCH 367/828] Added a script with input checking functions

---
 R/checks.R | 48 ++++++++++++++++++------------------------------
 1 file changed, 18 insertions(+), 30 deletions(-)

diff --git a/R/checks.R b/R/checks.R
index c4bacce9..dea04268 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -1,15 +1,16 @@
 #' Check if offspring argument is specified as a character string
 #'
-#' @param offspring_sampler Offspring distribution: a character string
-#' corresponding to the R distribution function (e.g., "pois" for Poisson,
-#' where \code{\link{rpois}} is the R function to generate Poisson random
-#' numbers).
+#' @param offspring
+#'
+#' @return
+#' @export
 #' @keywords internal
-check_offspring_valid <- function(offspring_sampler) {
-  if (!is.character(offspring_sampler)) {
+#' @examples
+check_offspring_valid <- function(offspring) {
+  if (!is.character(offspring)) {
     stop(sprintf(
       "%s %s",
-      "'offspring_sampler' must be specified as a character string.",
+      "'offspring' must be specified as a character string.",
       "Did you forget to enclose it in quotes?"
     ))
   }
@@ -18,10 +19,12 @@ check_offspring_valid <- function(offspring_sampler) {
 
 #' Check if constructed random number generator for offspring exists
 #'
-#' @param roffspring_name Constructed random offspring sampler: a character
-#' string corresponding to the R distribution function (e.g., "rpois" for
-#' Poisson.
-#' @keywords internal
+#' @param roffspring_name
+#'
+#' @return
+#' @export
+#'
+#' @examples check_offspring_exists("rpois")
 check_offspring_func_valid <- function(roffspring_name) {
   if (!(exists(roffspring_name)) || !is.function(get(roffspring_name))) {
     stop("Function ", roffspring_name, " does not exist.")
@@ -31,11 +34,12 @@ check_offspring_func_valid <- function(roffspring_name) {
 
 #' Check if the serials_sampler argument is specified as a function
 #'
-#' @param serials_sampler The serial interval generator function; the name of a
-#' user-defined named or anonymous function with only one argument `n`,
-#' representing the number of serial intervals to generate.
+#' @param serials_sampler
 #'
+#' @return
+#' @export
 #' @keywords internal
+#' @examples
 check_serial_valid <- function(serials_sampler) {
   if (!is.function(serials_sampler)) {
     stop(sprintf(
@@ -47,24 +51,8 @@ check_serial_valid <- function(serials_sampler) {
 }
 
 
-#' Check that nchains is greater than 0 and not infinite
-#'
-#' @param nchains Number of chains to simulate.
-#'
-#' @keywords internal
 check_nchains_valid <- function(nchains) {
   if (nchains < 1 || is.infinite(nchains)) {
     stop("`nchains` must be > 0 but less than `Inf`")
   }
 }
-
-#' Title
-#'
-#' @param x An [`epichains`] object
-#'
-#' @keywords internal
-check_chain_tree_attribute <- function(x) {
-  if (attributes(x)$chain_type != "chains_tree") {
-    stop("Object must be an epichains object with a chains_tree attribute.")
-  }
-}

From b582363af31d531aea713aa0dc57524258f76163 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 13:03:55 +0100
Subject: [PATCH 368/828] Added a script for helper functions

---
 R/helpers.R | 81 ++++-------------------------------------------------
 1 file changed, 6 insertions(+), 75 deletions(-)

diff --git a/R/helpers.R b/R/helpers.R
index 99f66da6..53c93dbd 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -1,10 +1,12 @@
 #' Determine and update the chain statistic being tracked
 #'
-#' @param stat_type Chain statistic (size/length) to update.
-#' @param stat_latest The latest chain statistic vector to be updated.
-#' @param n_offspring A vector of offspring per chain.
-#' @return A vector of chain statistics (size/length).
+#' @param stat_type
+#' @param noffspring
+#'
+#' @return
+#' @export
 #' @keywords internal
+#' @examples
 update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
   if (stat_type == "size") {
     stat_latest <- stat_latest + n_offspring
@@ -14,74 +16,3 @@ update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
 
   return(stat_latest)
 }
-
-
-#' Get offspring sampling function
-#'
-#' @param n Number of items to sample
-#' @param susc Susceptible population size (calculated
-#' inside \code{\link{simulate_tree_from_pop}}  as pop - initial_immune)
-#' @inheritParams simulate_tree_from_pop
-#'
-#' @return An offspring sampling function
-#' @keywords internal
-get_offspring_func <- function(offspring_sampler, n, susc, pop,
-                               mean_offspring, disp_offspring = NULL) {
-  if (offspring_sampler == "nbinom") {
-    function(n, susc, pop, mean_offspring, disp_offspring) {
-      ## get distribution params from mean and dispersion
-      new_mn <- mean_offspring * susc / pop ## apply susceptibility
-      size <- new_mn / (disp_offspring - 1)
-
-      ## using a right truncated nbinom distribution
-      ## to avoid more cases than susceptibles
-      truncdist::rtrunc(
-        n,
-        spec = "nbinom",
-        b = susc,
-        mu = new_mn,
-        size = size
-      )
-    }
-  } else if (offspring_sampler == "pois") {
-    function(n, susc, pop, mean_offspring, disp_offspring) {
-      truncdist::rtrunc(
-        n,
-        spec = "pois",
-        lambda = mean_offspring * susc / pop,
-        b = susc
-      )
-    }
-  } else {
-    stop("offspring_sampler must either be 'pois' or 'nbinom'")
-  }
-}
-
-
-
-#' Return a function for calculating chain statistics
-#'
-#' @inheritParams simulate_tree
-#'
-#' @return a function for calculating chain statistics
-#' @keywords internal
-get_chain_statistic_func <- function(chain_statistic) {
-  func <- if (chain_statistic == "size") {
-    rbinom_size
-  } else if (chain_statistic == "length") {
-    rgen_length
-  }
-  return(func)
-}
-
-#' Construct name of analytical function for estimating loglikelihood of
-#' offspring
-#'
-#' @inheritParams simulate_tree
-#'
-#' @return an analytical offspring likelihood function
-#' @keywords internal
-construct_offspring_ll_name <- function(offspring_sampler, chain_statistic) {
-  ll_name <- paste(offspring_sampler, chain_statistic, "ll", sep = "_")
-  return(ll_name)
-}

From 4362e2f384ce0c2ada2112a8176cdd06dff79017 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 15:18:57 +0100
Subject: [PATCH 369/828] Restructured the references

---
 R/simulate.r | 8 +++++---
 1 file changed, 5 insertions(+), 3 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 3c16fd4b..49c961d6 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -517,14 +517,16 @@ simulate_tree_from_pop <- function(pop,
 #' chains <- sim_chain_tree(nchains = 10, serials_sampler = function(x) 3,
 #' offspring = "pois", lambda = 2, infinite = 10)
 #' chains
-#' \references{Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
+#' @references
+#'
+#' Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
 #' between serial interval, infectiousness profile and generation time.
 #' J R Soc Interface. 2021 Jan;18(174):20200756.
 #' doi: 10.1098/rsif.2020.0756. Epub 2021 Jan 6.
 #' PMID: 33402022; PMCID: PMC7879757.
-#' }
 #'
-#' \references{Fine PE. The interval between successive cases of an
+#'
+#' Fine PE. The interval between successive cases of an
 #' infectious disease. Am J Epidemiol. 2003 Dec 1;158(11):1039-47.
 #' doi: 10.1093/aje/kwg251. PMID: 14630599.
 #' }

From fe19b56b4a95159769cf490f20437b2156ddad6f Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 15:19:29 +0100
Subject: [PATCH 370/828] Renamed infinite to chain_stat_max

---
 R/simulate.r | 12 ++++++------
 1 file changed, 6 insertions(+), 6 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 49c961d6..444374e9 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -454,9 +454,9 @@ simulate_tree_from_pop <- function(pop,
 #'   \item "size": the total number of offspring.
 #'   \item "length": the total number of ancestors.
 #' }
-#' @param infinite A size or length above which the simulation results
-#' should be set to `Inf`. Defaults to `Inf`, resulting in no results
-#' ever set to `Inf`
+#' @param chain_stat_max A cut off for the chain statistic (size/length) being
+#' computed. Results above the specified value, are set to this value.
+#' Defaults to `Inf`.
 #' @param serials_sampler The serial interval generator function; the name of a
 #' user-defined named or anonymous function with only one argument `n`,
 #' representing the number of serial intervals to generate.
@@ -529,10 +529,10 @@ simulate_tree_from_pop <- function(pop,
 #' Fine PE. The interval between successive cases of an
 #' infectious disease. Am J Epidemiol. 2003 Dec 1;158(11):1039-47.
 #' doi: 10.1093/aje/kwg251. PMID: 14630599.
-#' }
-sim_chain_tree <- function(nchains, offspring_sampler,
+#'
+simulate_tree <- function(nchains, offspring_sampler,
                            chain_statistic = c("size", "length"),
-                           infinite = Inf, serials_sampler, t0 = 0,
+                           chain_stat_max = Inf, serials_sampler, t0 = 0,
                            tf = Inf, ...) {
   chain_statistic <- match.arg(chain_statistic)
 

From 16f75d0a82e92a009318ecb01426e037e7a0dc61 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 15:20:02 +0100
Subject: [PATCH 371/828] Deleted chain_sim function

---
 R/simulate.r | 444 ---------------------------------------------------
 1 file changed, 444 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 444374e9..820f3cf2 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,447 +1,3 @@
-#' Simulate a tree of infections with a serial and offspring distributions
-#'
-#' @param nchains Number of chains to simulate.
-#' @param offspring_sampler Offspring distribution: a character string
-#' corresponding to the R distribution function (e.g., "pois" for Poisson,
-#' where \code{\link{rpois}} is the R function to generate Poisson random
-#' numbers)
-#' @param chain_statistic String; Statistic to calculate. Can be one of:
-#' \itemize{
-#'   \item "size": the total number of offspring.
-#'   \item "length": the total number of ancestors.
-#' }
-#' @param chain_stat_max A cut off for the chain statistic (size/length) being
-#' computed. Results above the specified value, are set to this value.
-#' Defaults to `Inf`.
-#' @param serials_sampler The serial interval generator function; the name of a
-#' user-defined named or anonymous function with only one argument `n`,
-#' representing the number of serial intervals to generate.
-#' @param t0 Start time (if serial interval is given); either a single value
-#' or a vector of same length as `nchains` (number of simulations) with
-#' initial times. Defaults to 0.
-#' @param tf End time (if serial interval is given).
-#' @param ... Parameters of the offspring distribution as required by R.
-#' @return an `epichains` object, which is basically a `data.frame` with
-#' columns `chain_id` (chain ID), `sim_id` (a unique ID within each simulation
-#' for each individual element of the chain), `ancestor`
-#' (the ID of the ancestor of each element), `generation`, and
-#' `time` (of infection)
-#' @author James M. Azam, Sebastian Funk
-#' @export
-#' @details
-#' `simulate_tree()` simulates a branching process of the form:
-#' WIP
-#' # The serial interval (`serials_sampler`):
-#'
-#' ## Assumptions/disambiguation
-#'
-#' In epidemiology, the generation interval is the duration between successive
-#' infectious events in a chain of transmission. Similarly, the serial
-#' interval is the duration between observed symptom onset times between
-#' successive cases in a transmission chain. The generation interval is
-#' often hard to observe because exact times of infection are hard to
-#' measure hence, the serial interval is often used instead . Here, we
-#' use the serial interval to represent what would normally be called the
-#' generation interval, that is, the time between successive cases.
-#'
-#' See References below for some literature on the subject.
-#'
-#' ## Specifying `serials_sampler` in `simulate_tree()`
-#'
-#' `serials_sampler` must be specified as a named or
-#' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R) # nolint
-#' with one argument.
-#'
-#' For example, assuming we want to specify the serial interval
-#' generator as a random log-normally distributed variable with
-#' `meanlog = 0.58` and `sdlog = 1.58`, we could define a named function,
-#' let's call it "serial_interval", with only one argument representing the
-#' number of serial intervals to sample:
-#' \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
-#' and assign the name of the function to `serials_sampler` in
-#' `simulate_tree()` like so
-#' \code{simulate_tree(..., serials_sampler = serial_interval)},
-#' where `...` are the other arguments to `simulate_tree()`.
-#'
-#' Alternatively, we could assign an anonymous function to `serials_sampler`
-#' in the `simulate_tree()` call like so
-#' \code{simulate_tree(..., serials_sampler = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
-#' where `...` are the other arguments to `simulate_tree()`.
-#' @seealso [simulate_vect()] for simulating transmission chains as a vector
-#' @examples
-#' set.seed(123)
-#' chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,
-#' offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
-#' chains
-#' @references
-#'
-#' Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
-#' between serial interval, infectiousness profile and generation time.
-#' J R Soc Interface. 2021 Jan;18(174):20200756.
-#' doi: 10.1098/rsif.2020.0756. Epub 2021 Jan 6.
-#' PMID: 33402022; PMCID: PMC7879757.
-#'
-#' Fine PE. The interval between successive cases of an
-#' infectious disease. Am J Epidemiol. 2003 Dec 1;158(11):1039-47.
-#' doi: 10.1093/aje/kwg251. PMID: 14630599.
-simulate_tree <- function(nchains, offspring_sampler,
-                           chain_statistic = c("size", "length"),
-                           chain_stat_max = Inf, serials_sampler, t0 = 0,
-                           tf = Inf, ...) {
-  chain_statistic <- match.arg(chain_statistic)
-
-  check_nchains_valid(nchains = nchains)
-
-  # check that offspring is properly specified
-  check_offspring_valid(offspring_sampler)
-
-  # check that offspring function exists in base R
-  roffspring_name <- paste0("r", offspring_sampler)
-  check_offspring_func_valid(roffspring_name)
-
-  if (!missing(serials_sampler)) {
-    check_serial_valid(serials_sampler)
-  } else if (!missing(tf)) {
-    stop("If `tf` is specified, `serials_sampler` must be specified too.")
-  }
-
-  # Initialisations
-  stat_track <- rep(1, nchains) # track length or size (depending on `chain_statistic`) #nolint
-  n_offspring <- rep(1, nchains) # current number of offspring
-  sim <- seq_len(nchains) # track chains that are still being simulated
-  ancestor_ids <- rep(1, nchains) # all chains start in generation 1
-
-  # initialise data frame to hold the transmission trees
-  generation <- 1L
-  tree_df <- data.frame(
-    chain_id = seq_len(nchains),
-    sim_id = 1L,
-    ancestor = NA_integer_,
-    generation = generation
-  )
-
-  if (!missing(serials_sampler)) {
-    tree_df$time <- t0
-    times <- tree_df$time
-  }
-
-  # next, simulate n chains
-  while (length(sim) > 0) {
-    # simulate next generation
-    next_gen <- get(roffspring_name)(n = sum(n_offspring[sim]), ...)
-    if (any(next_gen %% 1 > 0)) {
-      stop("Offspring distribution must return integers")
-    }
-
-    # record indices corresponding to the number of offspring
-    indices <- rep(sim, n_offspring[sim])
-
-    # initialise placeholder for the number of offspring
-    n_offspring <- rep(0, nchains)
-    # assign offspring sum to indices still being simulated
-    n_offspring[sim] <- tapply(next_gen, indices, sum)
-
-    # track size/length
-    stat_track <- update_chain_stat(stat_type = chain_statistic,
-                                    stat_latest = stat_track,
-                                    n_offspring = n_offspring)
-
-    # record times/ancestors
-    if (sum(n_offspring[sim]) > 0) {
-      ancestors <- rep(ancestor_ids, next_gen)
-      current_max_id <- unname(tapply(ancestor_ids, indices, max))
-      indices <- rep(sim, n_offspring[sim])
-
-      # create new ids
-      ids <- rep(current_max_id, n_offspring[sim]) +
-        unlist(lapply(n_offspring[sim], seq_len))
-
-      # increment the generation
-      generation <- generation + 1L
-
-      # store new simulation results
-      new_df <-
-        data.frame(
-          chain_id = indices,
-          sim_id = ids,
-          ancestor = ancestors,
-          generation = generation
-        )
-
-      # if a serial interval model/function was specified, use it
-      # to generate serial intervals for the cases
-      if (!missing(serials_sampler)) {
-        times <- rep(times, next_gen) + serials_sampler(sum(n_offspring))
-        current_min_time <- unname(tapply(times, indices, min))
-        new_df$time <- times
-      }
-      tree_df <- rbind(tree_df, new_df)
-    }
-
-    ## only continue to simulate chains that have offspring and aren't of
-    ## infinite size/length
-    sim <- which(n_offspring > 0 & stat_track < chain_stat_max)
-    if (length(sim) > 0) {
-      if (!missing(serials_sampler)) {
-        ## only continue to simulate chains that don't go beyond tf
-        sim <- intersect(sim, unique(indices)[current_min_time < tf])
-      }
-      if (!missing(serials_sampler)) {
-          times <- times[indices %in% sim]
-          }
-        ancestor_ids <- ids[indices %in% sim]
-    }
-    }
-
-  if (!missing(tf)) {
-    tree_df <- tree_df[tree_df$time < tf, ]
-  }
-
-  structure(
-    tree_df,
-    chains = nchains,
-    chain_type = "chains_tree",
-    rownames = NULL,
-    track_pop = FALSE,
-    class = c("epichains", "tbl", "data.frame")
-  )
-}
-
-
-
-#' Simulate transmission chains without tree (as a vector)
-#'
-#' @inheritParams simulate_tree
-#' @param chain_stat_max A cut off for the chain statistic (size/length) being
-#' computed. Results above the specified value, are set to `Inf`.
-#' @examples
-#' simulate_vect(nchains = 10, offspring_sampler = "pois", lambda = 2,
-#' chain_stat_max = 10)
-#' @export
-simulate_vect <- function(nchains, offspring_sampler,
-                           chain_statistic = c("size", "length"),
-                           chain_stat_max = Inf, ...) {
-  chain_statistic <- match.arg(chain_statistic)
-
-  check_nchains_valid(nchains = nchains)
-
-  # check that offspring is properly specified
-  check_offspring_valid(offspring_sampler)
-
-  # check that offspring function exists in base R
-  roffspring_name <- paste0("r", offspring_sampler)
-  check_offspring_func_valid(roffspring_name)
-
-  # Initialisations
-  stat_track <- rep(1, nchains) ## track length or size (depending on `stat`)
-  n_offspring <- rep(1, nchains) ## current number of offspring
-  sim <- seq_len(nchains) ## track chains that are still being simulated
-
-  ## next, simulate nchains chains
-  while (length(sim) > 0) {
-    ## simulate next generation
-    next_gen <- get(roffspring_name)(n = sum(n_offspring[sim]), ...)
-    if (any(next_gen %% 1 > 0)) {
-      stop("Offspring distribution must return integers")
-    }
-
-    ## record indices corresponding to the number of offspring
-    indices <- rep(sim, n_offspring[sim])
-
-    ## initialise number of offspring
-    n_offspring <- rep(0, nchains)
-    ## assign offspring sum to indices still being simulated
-    n_offspring[sim] <- tapply(next_gen, indices, sum)
-
-    # track size/length
-    stat_track <- update_chain_stat(stat_type = chain_statistic,
-                                    stat_latest = stat_track,
-                                    n_offspring = n_offspring
-                                    )
-
-    ## only continue to simulate chains that offspring and aren't of
-    ## chain_stat_max size/length
-    sim <- which(n_offspring > 0 & stat_track < chain_stat_max)
-  }
-
-  stat_track[stat_track >= chain_stat_max] <- Inf
-
-  structure(
-    stat_track,
-    chain_type = "chains_vec",
-    chains = nchains,
-    class = c("epichains", class(stat_track))
-  )
-}
-
-#' Simulate a tree of infections from an initial susceptible population
-#' with initial immunity
-#'
-#' @param pop The susceptible population.
-#' @param offspring_sampler Offspring distribution sampler: a character string
-#' corresponding to the R distribution function. Currently only "pois" &
-#' "nbinom" are supported. Internally truncated distributions are used to
-#' avoid infecting more people than susceptibles available.
-#' @param mean_offspring The average number of secondary cases for each case.
-#' Same as R0.
-#' @param disp_offspring The dispersion parameter of the number of
-#' secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
-#' avoid division by 0 when calculating the size. See details and
-#'  \code{?rnbinom} for details on the parameterisation in Ecology.
-#' @param serial_sampler The serial interval. A function that takes one
-#' parameter (`n`), the number of serial intervals to randomly sample. Value
-#' must be >= 0.
-#' @param initial_immune The number of initial immunes in the population.
-#' @param t0 Start time; Defaults to 0.
-#' @param tf End time; Defaults to `Inf`.
-#' @return a data frame with columns `time`, `id` (a unique ID for each
-#' individual element of the chain), `ancestor` (the ID of the ancestor
-#' of each element), and `generation`.
-#' @details
-#'
-#' # Offspring models
-#'
-#' The poisson model is parametrised so that:
-#'
-#' lamda = mean_offspring * pop - initial_immune / pop
-#'
-#' The negative binomial model is parametrised as:
-#'
-#' mu = mean_offspring * pop - initial immune / pop, and
-#' size = mu / (disp_offspring - 1). This is why disp_offspring must be greater
-#' than 1.
-#'
-#' simulate_tree_from_pop() has a couple of key different from simulate_tree():
-#'  * the maximal chain statistic is limited by `pop` instead of
-#'  `chain_stat_max` (in `simulate_tree()`),
-#'  * it can only handle implemented offspring distributions ("pois" and
-#' "nbinom").
-#' @author Flavio Finger
-#' @author James M. Azam
-#' @examples
-#' # Simulate with poisson offspring
-#' simulate_tree_from_pop(pop = 100, offspring_sampler = "pois",
-#' mean_offspring = 0.5, serial_sampler = function(x) 3)
-#'
-#' # Simulate with negative binomial offspring
-#' simulate_tree_from_pop(pop = 100, offspring_sampler = "nbinom",
-#' mean_offspring = 0.5, disp_offspring = 1.1, serial_sampler = function(x) 3)
-#' @export
-simulate_tree_from_pop <- function(pop,
-                                   offspring_sampler = c("pois", "nbinom"),
-                                   mean_offspring,
-                                   disp_offspring,
-                                   serial_sampler,
-                                   initial_immune = 0,
-                                   t0 = 0,
-                                   tf = Inf) {
-  offspring_sampler <- match.arg(offspring_sampler)
-
-  if (offspring_sampler == "pois") {
-    if (!missing(disp_offspring)) {
-      warning(sprintf("%s %s %s",
-                      "'disp_offspring' is not used for",
-                      "poisson offspring distribution.",
-                      "Will be ignored."
-                      )
-              )
-    }
-
-    ## using a right truncated poisson distribution
-    ## to avoid more cases than susceptibles
-    offspring_fun <- get_offspring_func(offspring_sampler)
-
-  } else if (offspring_sampler == "nbinom") {
-    if (missing(disp_offspring)) {
-      stop(sprintf("%s", "'disp_offspring' must be specified."))
-    } else if (disp_offspring <= 1) { ## dispersion coefficient
-      stop(sprintf("%s %s %s",
-                   "Offspring distribution 'nbinom' requires",
-                   "argument 'disp_offspring' > 1.",
-                   "Use 'pois' if there is no overdispersion."
-      ))
-    }
-    offspring_fun <- get_offspring_func(offspring_sampler)
-  }
-
-  ## initializations
-  tree_df <- data.frame(
-    sim_id = 1L,
-    ancestor = NA_integer_,
-    generation = 1L,
-    time = t0,
-    offspring_generated = FALSE #used to track simulation and dropped afterwards
-  )
-
-  susc <- pop - initial_immune - 1L
-  t <- t0
-
-  ## continue if any unsimulated chains have t <= tf
-  ## AND there is still susceptibles left
-  while (any(tree_df$time[!tree_df$offspring_generated] <= tf) && susc > 0) {
-
-    ## select from which case to generate offspring
-    t <- min(tree_df$time[!tree_df$offspring_generated]) # lowest unsimulated t
-
-    ## index of the first in df with t, extract vars
-    idx <- which(tree_df$time == t & !tree_df$offspring_generated)[1]
-    id_parent <- tree_df$sim_id[idx]
-    t_parent <- tree_df$time[idx]
-    gen_parent <- tree_df$generation[idx]
-
-    ## generate it
-    current_max_id <- max(tree_df$sim_id)
-    n_offspring <- offspring_fun(1, susc, pop, mean_offspring, disp_offspring)
-
-    if (n_offspring %% 1 > 0) {
-      stop("Offspring distribution must return integers")
-    }
-
-    ## mark as done
-    tree_df$offspring_generated[idx] <- TRUE
-
-    ## add to df
-    if (n_offspring > 0) {
-      ## draw serial times
-      new_times <- serial_sampler(n_offspring)
-
-      if (any(new_times < 0)) {
-        stop("Serial interval must be >= 0.")
-      }
-
-      new_df <- data.frame(
-        sim_id = current_max_id + seq_len(n_offspring),
-        ancestor = id_parent,
-        generation = gen_parent + 1L,
-        time = new_times + t_parent,
-        offspring_generated = FALSE
-      )
-
-      ## add new cases to tree_df
-      tree_df <- rbind(tree_df, new_df)
-    }
-
-    ## adjust susceptibles
-    susc <- susc - n_offspring
-  }
-
-  ## remove cases with time > tf that could
-  ## have been generated in the last generation
-  tree_df <- tree_df[tree_df$time <= tf, ]
-
-  ## sort output and remove columns not needed
-  tree_df <- tree_df[order(tree_df$time, tree_df$sim_id), ]
-  tree_df$offspring_generated <- NULL
-
-  structure(
-    tree_df,
-    chain_type = "chains_tree",
-    rownames = NULL,
-    track_pop = TRUE,
-    class = c("epichains", "tbl", "data.frame")
-  )
-}
-
 #' Simulate tree of infections
 #'
 #' @param nchains number of chains to simulate

From c89b989bd64e53fc811833aa3eb8c0528dedbc87 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 15:21:42 +0100
Subject: [PATCH 372/828] Renamed tdf to tree_df

---
 R/simulate.r | 17 ++++++++---------
 1 file changed, 8 insertions(+), 9 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 820f3cf2..d7951972 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -115,7 +115,7 @@ simulate_tree <- function(nchains, offspring_sampler,
 
   # initialise data frame to hold the transmission trees
   generation <- 1L
-  tdf <- data.frame(
+  tree_df <- data.frame(
     n = seq_len(nchains),
     id = 1L,
     ancestor = NA_integer_,
@@ -123,8 +123,8 @@ simulate_tree <- function(nchains, offspring_sampler,
   )
 
   if (!missing(serials_sampler)) {
-    tdf$time <- t0
-    times <- tdf$time
+    tree_df$time <- t0
+    times <- tree_df$time
   }
 
   # next, simulate n chains
@@ -177,12 +177,12 @@ simulate_tree <- function(nchains, offspring_sampler,
         current_min_time <- unname(tapply(times, indices, min))
         new_df$time <- times
       }
-      tdf <- rbind(tdf, new_df)
+      tree_df <- rbind(tree_df, new_df)
     }
 
     ## only continue to simulate chains that have offspring and aren't of
     ## infinite size/length
-    sim <- which(n_offspring > 0 & stat_track < infinite)
+    sim <- which(n_offspring > 0 & stat_track < chain_stat_max)
     if (length(sim) > 0) {
       if (!missing(serials_sampler)) {
         ## only continue to simulate chains that don't go beyond tf
@@ -196,15 +196,14 @@ simulate_tree <- function(nchains, offspring_sampler,
     }
 
   if (!missing(tf)) {
-    tdf <- tdf[tdf$time < tf, ]
+    tree_df <- tree_df[tree_df$time < tf, ]
   }
 
   structure(
-    tdf,
-    chain_type = "chains_tree",
+    tree_df,
     chains = nchains,
     rownames = NULL,
-    class = c("epichains", "tbl", "data.frame")
+    class = c("epichains_tree", "tbl", "data.frame")
   )
 }
 

From c6ae948ba76efc7f5bb9405d40bfe7f2eb06ae64 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 15:23:13 +0100
Subject: [PATCH 373/828] Renamed infinite to chain_stat_max

---
 R/simulate.r | 14 +++++++-------
 1 file changed, 7 insertions(+), 7 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index d7951972..99579f74 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -213,11 +213,11 @@ simulate_tree <- function(nchains, offspring_sampler,
 #'
 #' @inheritParams sim_chain_tree
 #'
-#' @examples #' sim_chain_vect(n = 10, offspring_sampler = "pois", lambda = 2,
-#' infinite = 10)
-sim_chain_vect <- function(nchains, offspring_sampler,
+#' @examples #' simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,
+#' chain_stat_max = 10)
+simulate_vect <- function(nchains, offspring_sampler,
                            chain_statistic = c("size", "length"),
-                           infinite = Inf, ...) {
+                           chain_stat_max = Inf, ...) {
   chain_statistic <- match.arg(chain_statistic)
 
   check_nchains_valid(nchains = nchains)
@@ -257,11 +257,11 @@ sim_chain_vect <- function(nchains, offspring_sampler,
                                     )
 
     ## only continue to simulate chains that offspring and aren't of
-    ## infinite size/length
-    sim <- which(n_offspring > 0 & stat_track < infinite)
+    ## chain_stat_max size/length
+    sim <- which(n_offspring > 0 & stat_track < chain_stat_max)
   }
 
-  stat_track[stat_track >= infinite] <- Inf
+  stat_track[stat_track >= chain_stat_max] <- Inf
 
   structure(
     stat_track,

From d4e806b2c4ba5f2231b58fa7bf7b35f5ca668407 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:05:53 +0100
Subject: [PATCH 374/828] Modified simulate_tree title

---
 R/simulate.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index 99579f74..a1661026 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,4 +1,4 @@
-#' Simulate tree of infections
+#' Simulate a tree of infections with a serial and offspring distributions
 #'
 #' @param nchains number of chains to simulate
 #' @param offspring_sampler Offspring distribution: a character string

From 08b509f95dc1c44065cb2e2180ada2449322af6e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:06:27 +0100
Subject: [PATCH 375/828] Changed the tree_df column names

---
 R/simulate.r | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index a1661026..cc9c2ee1 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -116,8 +116,8 @@ simulate_tree <- function(nchains, offspring_sampler,
   # initialise data frame to hold the transmission trees
   generation <- 1L
   tree_df <- data.frame(
-    n = seq_len(nchains),
-    id = 1L,
+    chain_id = seq_len(nchains),
+    sim_id = 1L,
     ancestor = NA_integer_,
     generation = generation
   )
@@ -164,8 +164,8 @@ simulate_tree <- function(nchains, offspring_sampler,
       # store new simulation results
       new_df <-
         data.frame(
-          n = indices,
-          id = ids,
+          chain_id = indices,
+          sim_id = ids,
           ancestor = ancestors,
           generation = generation
         )

From bcaf02b0abc7e650f59fb9dfcf2baee69bb785fc Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:08:25 +0100
Subject: [PATCH 376/828] FixModified epichains object attributes

---
 R/simulate.r | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index cc9c2ee1..fcc47686 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -202,8 +202,9 @@ simulate_tree <- function(nchains, offspring_sampler,
   structure(
     tree_df,
     chains = nchains,
+    chain_type = "chains_tree",
     rownames = NULL,
-    class = c("epichains_tree", "tbl", "data.frame")
+    class = c("epichains", "tbl", "data.frame")
   )
 }
 

From 73e44ddcc333922cbfb95ae1cff78b4957b36c13 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:09:08 +0100
Subject: [PATCH 377/828] Replaced old function names with new in function docs

---
 R/simulate.r | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index fcc47686..b867f28b 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -65,12 +65,12 @@
 #'
 #' Alternatively, we could assign an anonymous function to `serials_sampler`
 #' in the `sim_chain_tree()` call like so
-#' \code{sim_chain_tree(..., serials_sampler = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
-#' where `...` are the other arguments to `sim_chain_tree()`.
-#' @seealso [sim_chain_vec()] for simulating transmission chains as a vector
+#' \code{simulate_tree(..., serials_sampler = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
+#' where `...` are the other arguments to `simulate_tree()`.
+#' @seealso [simulate_vec()] for simulating transmission chains as a vector
 #' @examples
 #' set.seed(123)
-#' chains <- sim_chain_tree(nchains = 10, serials_sampler = function(x) 3,
+#' chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,
 #' offspring = "pois", lambda = 2, infinite = 10)
 #' chains
 #' @references

From 2b4a5cdab0278d8c36338b75b01917fd50576d35 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:09:32 +0100
Subject: [PATCH 378/828] Documented chain_stat_max in simulate_vec()

---
 R/simulate.r | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index b867f28b..0fe282c2 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -213,7 +213,8 @@ simulate_tree <- function(nchains, offspring_sampler,
 #' Simulate transmission chains without tree (as a vector)
 #'
 #' @inheritParams sim_chain_tree
-#'
+#' @param chain_stat_max A cut off for the chain statistic (size/length) being
+#' computed. Results above the specified value, are set to `Inf`.
 #' @examples #' simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,
 #' chain_stat_max = 10)
 simulate_vect <- function(nchains, offspring_sampler,

From 49ec41257c23772a122b10fed0165fdf282d848e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:23:13 +0100
Subject: [PATCH 379/828] Moved checking functions

---
 R/simulate.r | 41 -----------------------------------------
 1 file changed, 41 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 0fe282c2..b32ce592 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -274,13 +274,11 @@ simulate_vect <- function(nchains, offspring_sampler,
 }
 
 
-#' Check if offspring argument is specified as a character string
 #'
 #' @param offspring
 #'
 #' @return
 #' @export
-#' @keywords internal
 #' @examples
 check_offspring_valid <- function(offspring) {
   if (!is.character(offspring)) {
@@ -295,34 +293,12 @@ check_offspring_valid <- function(offspring) {
 
 #' Check if constructed random number generator for offspring exists
 #'
-#' @param roffspring_name
-#'
-#' @return
-#' @export
-#'
-#' @examples check_offspring_exists("rpois")
-check_offspring_func_valid <- function(roffspring_name) {
-  if (!(exists(roffspring_name)) || !is.function(get(roffspring_name))) {
-    stop("Function ", roffspring_name, " does not exist.")
   }
 }
 
 
 #' Check if the serials_sampler argument is specified as a function
 #'
-#' @param serials_sampler
-#'
-#' @return
-#' @export
-#' @keywords internal
-#' @examples
-check_serial_valid <- function(serials_sampler) {
-  if (!is.function(serials_sampler)) {
-    stop(sprintf(
-      "%s %s",
-      "The `serials_sampler` argument must be a function",
-      "(see details in ?sim_chain_tree)."
-    ))
   }
 }
 
@@ -330,24 +306,7 @@ check_serial_valid <- function(serials_sampler) {
 check_nchains_valid <- function(nchains) {
   if (nchains < 1 || is.infinite(nchains)) {
     stop("`nchains` must be > 0 but less than `Inf`")
-  }
-}
 
 #' Determine and update the chain statistic being tracked
-#'
-#' @param stat_type
-#' @param noffspring
-#'
-#' @return
-#' @export
-#' @keywords internal
-#' @examples
-update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
-  if (stat_type == "size") {
-    stat_latest <- stat_latest + n_offspring
-  } else if (stat_type == "length") {
-    stat_latest <- stat_latest + pmin(1, n_offspring)
   }
 
-  return(stat_latest)
-}

From d143ae1571e372339856c27d664b96e1dc75d3cc Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:24:47 +0100
Subject: [PATCH 380/828] Updated the column names for col-type validation

---
 R/epichains.R | 12 ++++++------
 1 file changed, 6 insertions(+), 6 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 62920f32..0bfc2593 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -125,15 +125,15 @@ validate_epichains <- function(x) {
 
   # check for class invariants
 
-  if (attributes(x)$is_tree) {
+  if (attributes(x)$chain_type == "chains_tree") {
     stopifnot(
       "object does not contain the correct columns" =
-        c("n", "id", "ancestor", "generation", "time") %in%
+        c("chain_id", "sim_id", "ancestor", "generation", "time") %in%
           colnames(x),
-      "column `n` must be a numeric" =
-        is.numeric(x$n),
-      "column `id` must be a numeric" =
-        is.numeric(x$id),
+      "column `chain_id` must be a numeric" =
+        is.numeric(x$chain_id),
+      "column `sim_id` must be a numeric" =
+        is.numeric(x$sim_id),
       "column `ancestor` must be a numeric" =
         is.numeric(x$ancestor),
       "column `generation` must be a numeric" =

From a68917abc3ba926dd677b8cceab735df1dd6bf51 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:25:21 +0100
Subject: [PATCH 381/828] Restructured the format method for epichains objects

---
 R/epichains.R | 35 ++++++++++++++++++++++++++++-------
 1 file changed, 28 insertions(+), 7 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 0bfc2593..02ef7bf6 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -12,15 +12,32 @@ print.epichains <- function(x, ...) {
 #'
 #' @examples
 format.epichains <- function(x, ...) {
+  # check that x is an epichains object
+  validate_epichains(x)
+
+  # summarise the information stored in x
   chain_info <- summary(x)
+
   if (attributes(x)$chain_type == "chains_tree") {
-    cat("head starting from first known ancestor \n")
-    print(tibble::as_tibble(head(subset(x, !is.na(ancestor)))))
-    cat("--- \n")
-    print(tail(tibble::as_tibble(x)))
     writeLines(
       c(
-        sprintf("`epichains` `chains_tree` object"),
+        sprintf("`epichains` object"),
+
+        "< tree head (from first known ancestor) >\n"
+        )
+      )
+
+    # print head of the simulation output
+    print(head(subset(as.data.frame(x), !is.na(ancestor))))
+
+    cat("< tree tail >\n")
+
+    # print tail of object
+    print(tail(as.data.frame(x)))
+
+    # print summary information
+    writeLines(
+      c(
         sprintf("Chains simulated: %s", chain_info[["chains"]]),
         sprintf(
           "Unique number of ancestors: %s",
@@ -31,8 +48,10 @@ format.epichains <- function(x, ...) {
         )
       )
     )
+
+    # Offer more information to view the full dataset
     writeLines(sprintf("Use View(<object_name>) to view the full output."))
-    invisible(x)
+
   } else if (attributes(x)$chain_type == "chains_vec") {
     cat(sprintf("epichains object \n"))
     print(as.vector(x))
@@ -42,12 +61,14 @@ format.epichains <- function(x, ...) {
         )
     writeLines(
       c(
-        cat("\n Simulated chain stats: \n"),
+        "\n Simulated chain stats: \n",
         sprintf("Max: %s", chain_info[["max_chain_stat"]]),
         sprintf("Min: %s", chain_info[["min_chain_stat"]])
       )
     )
   }
+
+  invisible(x)
 }
 
 
From e2dd762a1af64aabf015461308a7f8979d31e684 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 17:26:27 +0100
Subject: [PATCH 382/828] Added a functions for simulating infections with an
 initial susceptible pool

---
 R/simulate.r | 175 ++++++++++++++++++++++++++++++++++++++++++++-------
 1 file changed, 152 insertions(+), 23 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index b32ce592..19567632 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -273,40 +273,169 @@ simulate_vect <- function(nchains, offspring_sampler,
   )
 }
 
-
+#' Simulate a tree of infections from an initial susceptible population
+#' with initial immunity
+#'
+#' @param offspring_sampler offspring distribution sampler: a character string
+#' corresponding to the R distribution function. Currently only "pois" &
+#' "nbinom" are supported. Internally truncated distributions are used to
+#' avoid infecting more people than susceptibles available.
+#' @param mn_offspring the average number of secondary cases for each case
+#' @param disp_offspring the dispersion coefficient (var/mean) of the number of
+#'      secondary cases. Ignored if offspring == "pois". Must be > 1.
+#' @param serial_sampler the serial interval. A function that takes one
+#' parameter (`n`), the number of serial intervals to randomly sample.
+#'     Value must be >= 0.
+#' @param t0 start time
+#' @param tf end time
+#' @param pop the population
+#' @param initial_immune the number of initial immunes in the population
+#' @return a data frame with columns `time`, `id` (a unique ID for each
+#'     individual element of the chain), `ancestor` (the ID of the ancestor
+#'      of each element), and `generation`.
 #'
-#' @param offspring
+#' @details This function has a couple of key differences with chain_sim:
+#'     it can only simulate one chain at a time,
+#'     it can only handle implemented offspring distributions
+#'         ("pois" and "nbinom"),
+#'     it always tracks and returns a data frame containing the entire tree,
+#'     the maximal length of chains is limited with pop instead of infinite.
 #'
-#' @return
+#' @author Flavio Finger
+#' @author James M. Azam
 #' @export
 #' @examples
-check_offspring_valid <- function(offspring) {
-  if (!is.character(offspring)) {
-    stop(sprintf(
-      "%s %s",
-      "'offspring' must be specified as a character string.",
-      "Did you forget to enclose it in quotes?"
-    ))
+#' chain_sim_susc(pop = 100, offspring_sampler = "pois", mn_offspring = 0.5,
+#' serial_sampler = function(x) 3)
+simulate_tree_tracked <- function(pop = 100,
+                          offspring_sampler = c("pois", "nbinom"),
+                          mn_offspring,
+                          disp_offspring,
+                          serial_sampler,
+                          t0 = 0,
+                          tf = Inf,
+                          initial_immune = 0) {
+  offspring_sampler <- match.arg(offspring_sampler)
+
+  if (offspring_sampler == "pois") {
+    if (!missing(disp_offspring)) {
+      warning(sprintf("%s %s",
+                      "Argument 'disp_offspring' not used for",
+                      "poisson offspring distribution."
+                      )
+              )
+    }
+
+    ## using a right truncated poisson distribution
+    ## to avoid more cases than susceptibles
+    offspring_fun <- function(n, susc) {
+      truncdist::rtrunc(
+        n,
+        spec = "pois",
+        lambda = mn_offspring * susc / pop,
+        b = susc
+      )
+    }
+  } else if (offspring_sampler == "nbinom") {
+    if (missing(disp_offspring)) {
+      stop(sprintf("%s", "Argument 'disp_offspring' must be specified."))
+    } else if (disp_offspring <= 1) { ## dispersion coefficient
+      stop(sprintf("%s %s %s",
+                   "Offspring distribution 'nbinom' requires",
+                   "argument 'disp_offspring' > 1.",
+                   "Use 'pois' if there is no overdispersion."
+      ))
+    }
+    offspring_fun <- function(n, susc) {
+      ## get distribution params from mean and dispersion
+      ## see ?rnbinom for parameter definition
+      new_mn <- mn_offspring * susc / pop ## apply susceptibility
+      size <- new_mn / (disp_offspring - 1)
+
+      ## using a right truncated nbinom distribution
+      ## to avoid more cases than susceptibles
+      truncdist::rtrunc(
+        n,
+        spec = "nbinom",
+        b = susc,
+        mu = new_mn,
+        size = size
+      )
+    }
   }
-}
 
+  ## initializations
+  tdf <- data.frame(
+    id = 1L,
+    ancestor = NA_integer_,
+    generation = 1L,
+    time = t0,
+    offspring_generated = FALSE
+  )
 
-#' Check if constructed random number generator for offspring exists
-#'
-  }
-}
+  susc <- pop - initial_immune - 1L
+  t <- t0
 
+  ## continue if any unsimulated has t <= tf
+  ## AND there is still susceptibles left
+  while (
+    any(tdf$time[!tdf$offspring_generated] <= tf) &&
+    susc > 0
+  ) {
 
-#' Check if the serials_sampler argument is specified as a function
-#'
-  }
-}
+    ## select from which case to generate offspring
+    t <- min(tdf$time[!tdf$offspring_generated]) # lowest unsimulated t
+
+    ## index of the first in df with t, extract vars
+    idx <- which(tdf$time == t & !tdf$offspring_generated)[1]
+    id_parent <- tdf$id[idx]
+    t_parent <- tdf$time[idx]
+    gen_parent <- tdf$generation[idx]
+
+    ## generate it
+    current_max_id <- max(tdf$id)
+    n_offspring <- offspring_fun(1, susc)
+
+    if (n_offspring %% 1 > 0) {
+      stop("Offspring distribution must return integers")
+    }
 
+    ## mark as done
+    tdf$offspring_generated[idx] <- TRUE
 
-check_nchains_valid <- function(nchains) {
-  if (nchains < 1 || is.infinite(nchains)) {
-    stop("`nchains` must be > 0 but less than `Inf`")
+    ## add to df
+    if (n_offspring > 0) {
+      ## draw times
+      new_times <- serial(n_offspring)
+
+      if (any(new_times < 0)) {
+        stop("Serial interval must be >= 0.")
+      }
 
-#' Determine and update the chain statistic being tracked
+      new_df <- data.frame(
+        id = current_max_id + seq_len(n_offspring),
+        time = new_times + t_parent,
+        ancestor = id_parent,
+        generation = gen_parent + 1L,
+        offspring_generated = FALSE
+      )
+
+      ## add new cases to tdf
+      tdf <- rbind(tdf, new_df)
+    }
+
+    ## adjust susceptibles
+    susc <- susc - n_offspring
   }
 
+  ## remove cases with time > tf that could
+  ## have been generated in the last generation
+  tdf <- tdf[tdf$time <= tf, ]
+
+  ## sort output and remove columns not needed
+  tdf <- tdf[order(tdf$time, tdf$id), ]
+  tdf$offspring_generated <- NULL
+
+  return(tdf)
+}
+

From d0a86db568f5319b5ebaa42acf862aaa39204703 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:23:23 +0100
Subject: [PATCH 383/828] Added an epichains attribute to indicate if pop is
 tracked

---
 R/simulate.r | 1 +
 1 file changed, 1 insertion(+)

diff --git a/R/simulate.r b/R/simulate.r
index 19567632..c9f20cfc 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -204,6 +204,7 @@ simulate_tree <- function(nchains, offspring_sampler,
     chains = nchains,
     chain_type = "chains_tree",
     rownames = NULL,
+    track_pop = FALSE,
     class = c("epichains", "tbl", "data.frame")
   )
 }

From 85556fd32fb4a3706a6f7a1d5b1090281e5daba6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:23:54 +0100
Subject: [PATCH 384/828] Now summarising maximum generations

---
 R/epichains.R | 8 +++++---
 1 file changed, 5 insertions(+), 3 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 02ef7bf6..3ec42473 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -94,7 +94,9 @@ summary.epichains <- function(x, ...) {
       unique(x$ancestor[!is.na(x$ancestor)])
     )
 
-    num_generations <- length(unique(x$generations))
+    num_generations <- length(unique(x$generation))
+
+    max_generation <- max(x$generation)
 
     # out of summary
     res <- list(
@@ -102,8 +104,8 @@ summary.epichains <- function(x, ...) {
       max_time = max_time,
       unique_ancestors = n_unique_ancestors,
       unique_generations = n_unique_ancestors,
-      num_generations = num_generations
-      # WIP
+      num_generations = num_generations,
+      max_generation = max_generation
     )
   } else if (attributes(x)$chain_type == "chains_vec") {
     chains_ran <- length(x)

From 3405366a419a804a4cb624b9ecdf6579eeeb84af Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:24:13 +0100
Subject: [PATCH 385/828] Added epichains validation to summary method

---
 R/epichains.R | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 3ec42473..46ce1484 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -83,8 +83,9 @@ format.epichains <- function(x, ...) {
 #'
 #' @examples
 summary.epichains <- function(x, ...) {
+  validate_epichains(x)
+
   if (attributes(x)$chain_type == "chains_tree") {
-    is_epichains(x)
 
     chains_ran <- length(x$n)
 

From f73ba66cbb467eabd8c1c3f46991c2da8e5a7f11 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:24:37 +0100
Subject: [PATCH 386/828] Removed chain_id column as an invariant of epichains
 class

---
 R/epichains.R | 4 +---
 1 file changed, 1 insertion(+), 3 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 46ce1484..55ce2043 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -152,10 +152,8 @@ validate_epichains <- function(x) {
   if (attributes(x)$chain_type == "chains_tree") {
     stopifnot(
       "object does not contain the correct columns" =
-        c("chain_id", "sim_id", "ancestor", "generation", "time") %in%
+        c("sim_id", "ancestor", "generation", "time") %in%
           colnames(x),
-      "column `chain_id` must be a numeric" =
-        is.numeric(x$chain_id),
       "column `sim_id` must be a numeric" =
         is.numeric(x$sim_id),
       "column `ancestor` must be a numeric" =

From c8d75d39d41d025e9dc69a648c27982f06d37e78 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:25:10 +0100
Subject: [PATCH 387/828] Added a function to extract truncated poisson or
 nbinom function

---
 R/helpers.R | 40 ++++++++++++++++++++++++++++++++++++++++
 1 file changed, 40 insertions(+)

diff --git a/R/helpers.R b/R/helpers.R
index 53c93dbd..403e98da 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -16,3 +16,43 @@ update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
 
   return(stat_latest)
 }
+
+
+#' Get offspring sampling function
+#'
+#' @param offspring_sampler
+#'
+#' @return
+#' @export
+#'
+#' @examples
+get_offspring_func <- function(offspring_sampler) {
+  if (offspring_sampler == "nbinom") {
+    function(n, susc, pop, mean_offspring, disp_offspring) {
+      ## get distribution params from mean and dispersion
+      new_mn <- mean_offspring * susc / pop ## apply susceptibility
+      size <- new_mn / (disp_offspring - 1)
+
+      ## using a right truncated nbinom distribution
+      ## to avoid more cases than susceptibles
+      truncdist::rtrunc(
+        n,
+        spec = "nbinom",
+        b = susc,
+        mu = new_mn,
+        size = size
+      )
+    }
+  } else if (offspring_sampler == "pois") {
+    function(n, susc, pop, mean_offspring) {
+      truncdist::rtrunc(
+        n,
+        spec = "pois",
+        lambda = mean_offspring * susc / pop,
+        b = susc
+      )
+    }
+  } else{
+    stop("offspring_sampler must either be 'pois' or 'nbinom'")
+  }
+}

From b412abc4d19fe1f7806402e1b2975db840da86d3 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:26:04 +0100
Subject: [PATCH 388/828] Added epichains class to simulation function

---
 R/simulate.r | 8 +++++++-
 1 file changed, 7 insertions(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index c9f20cfc..8ec8e5b2 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -437,6 +437,12 @@ simulate_tree_tracked <- function(pop = 100,
   tdf <- tdf[order(tdf$time, tdf$id), ]
   tdf$offspring_generated <- NULL
 
-  return(tdf)
+  structure(
+    tree_df,
+    chain_type = "chains_tree",
+    rownames = NULL,
+    track_pop = TRUE,
+    class = c("epichains", "tbl", "data.frame")
+  )
 }
 

From a1564ecedca2a5e704644ef28857e8780b12e2a4 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:28:11 +0100
Subject: [PATCH 389/828] Moved the offspring function definition to the helper
 script

---
 R/simulate.r | 27 +++------------------------
 1 file changed, 3 insertions(+), 24 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 8ec8e5b2..4a8fee01 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -329,14 +329,8 @@ simulate_tree_tracked <- function(pop = 100,
 
     ## using a right truncated poisson distribution
     ## to avoid more cases than susceptibles
-    offspring_fun <- function(n, susc) {
-      truncdist::rtrunc(
-        n,
-        spec = "pois",
-        lambda = mn_offspring * susc / pop,
-        b = susc
-      )
-    }
+    offspring_fun <- get_offspring_func(offspring_sampler)
+
   } else if (offspring_sampler == "nbinom") {
     if (missing(disp_offspring)) {
       stop(sprintf("%s", "Argument 'disp_offspring' must be specified."))
@@ -347,22 +341,7 @@ simulate_tree_tracked <- function(pop = 100,
                    "Use 'pois' if there is no overdispersion."
       ))
     }
-    offspring_fun <- function(n, susc) {
-      ## get distribution params from mean and dispersion
-      ## see ?rnbinom for parameter definition
-      new_mn <- mn_offspring * susc / pop ## apply susceptibility
-      size <- new_mn / (disp_offspring - 1)
-
-      ## using a right truncated nbinom distribution
-      ## to avoid more cases than susceptibles
-      truncdist::rtrunc(
-        n,
-        spec = "nbinom",
-        b = susc,
-        mu = new_mn,
-        size = size
-      )
-    }
+    offspring_fun <- get_offspring_func(offspring_sampler)
   }
 
   ## initializations

From 780394ec4911f423006e30475c347a210596db62 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:29:52 +0100
Subject: [PATCH 390/828] Documented the simulation function

---
 R/simulate.r | 56 ++++++++++++++++++++++++++++++++--------------------
 1 file changed, 35 insertions(+), 21 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 4a8fee01..ba0fd982 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -277,37 +277,51 @@ simulate_vect <- function(nchains, offspring_sampler,
 #' Simulate a tree of infections from an initial susceptible population
 #' with initial immunity
 #'
-#' @param offspring_sampler offspring distribution sampler: a character string
+#' @param pop The susceptible population.
+#' @param offspring_sampler Offspring distribution sampler: a character string
 #' corresponding to the R distribution function. Currently only "pois" &
 #' "nbinom" are supported. Internally truncated distributions are used to
 #' avoid infecting more people than susceptibles available.
-#' @param mn_offspring the average number of secondary cases for each case
-#' @param disp_offspring the dispersion coefficient (var/mean) of the number of
-#'      secondary cases. Ignored if offspring == "pois". Must be > 1.
-#' @param serial_sampler the serial interval. A function that takes one
-#' parameter (`n`), the number of serial intervals to randomly sample.
-#'     Value must be >= 0.
-#' @param t0 start time
-#' @param tf end time
-#' @param pop the population
-#' @param initial_immune the number of initial immunes in the population
+#' @param mean_offspring The average number of secondary cases for each case.
+#' Same as R0.
+#' @param disp_offspring The dispersion parameter of the number of
+#' secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
+#' avoid division by 0 when calculating the size. See details and
+#'  \code{?rnbinom} for details on the parameterisation in Ecology.
+#' @param serial_sampler The serial interval. A function that takes one
+#' parameter (`n`), the number of serial intervals to randomly sample. Value
+#' must be >= 0.
+#' @param initial_immune The number of initial immunes in the population.
+#' @param t0 Start time; Defaults to 0.
+#' @param tf End time; Defaults to `Inf`.
 #' @return a data frame with columns `time`, `id` (a unique ID for each
-#'     individual element of the chain), `ancestor` (the ID of the ancestor
-#'      of each element), and `generation`.
+#' individual element of the chain), `ancestor` (the ID of the ancestor
+#' of each element), and `generation`.
+#' @details
+#'
+#' # Offspring models
+#'
+#' The poisson model is parametrised so that:
+#'
+#' lamda = mean_offspring * pop - initial_immune / pop
+#'
+#' The negative binomial model is parametrised as:
 #'
-#' @details This function has a couple of key differences with chain_sim:
-#'     it can only simulate one chain at a time,
-#'     it can only handle implemented offspring distributions
-#'         ("pois" and "nbinom"),
-#'     it always tracks and returns a data frame containing the entire tree,
-#'     the maximal length of chains is limited with pop instead of infinite.
+#' mu = mean_offspring * pop - initial immune / pop, and
+#' size = mu / (disp_offspring - 1). This is why disp_offspring must be greater
+#' than 1.
 #'
+#' simulate_tree_from_pop() has a couple of key different from simulate_tree():
+#'  * the maximal chain statistic is limited by `pop` instead of
+#'  `chain_stat_max` (in `simulate_tree()`),
+#'  * it can only handle implemented offspring distributions ("pois" and
+#' "nbinom").
 #' @author Flavio Finger
 #' @author James M. Azam
 #' @export
 #' @examples
-#' chain_sim_susc(pop = 100, offspring_sampler = "pois", mn_offspring = 0.5,
-#' serial_sampler = function(x) 3)
+#' # Simulate with poisson offspring
+#' simulate_tree_from_pop(pop = 100, offspring_sampler = "pois",
 simulate_tree_tracked <- function(pop = 100,
                           offspring_sampler = c("pois", "nbinom"),
                           mn_offspring,

From c1fb62847e18aec99618aaf60c765f76f9ee39ef Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:30:45 +0100
Subject: [PATCH 391/828] Reworded an error and warning

---
 R/simulate.r | 9 +++++----
 1 file changed, 5 insertions(+), 4 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index ba0fd982..50b52d27 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -334,9 +334,10 @@ simulate_tree_tracked <- function(pop = 100,
 
   if (offspring_sampler == "pois") {
     if (!missing(disp_offspring)) {
-      warning(sprintf("%s %s",
-                      "Argument 'disp_offspring' not used for",
-                      "poisson offspring distribution."
+      warning(sprintf("%s %s %s",
+                      "'disp_offspring' is not used for",
+                      "poisson offspring distribution.",
+                      "Will be ignored."
                       )
               )
     }
@@ -347,7 +348,7 @@ simulate_tree_tracked <- function(pop = 100,
 
   } else if (offspring_sampler == "nbinom") {
     if (missing(disp_offspring)) {
-      stop(sprintf("%s", "Argument 'disp_offspring' must be specified."))
+      stop(sprintf("%s", "'disp_offspring' must be specified."))
     } else if (disp_offspring <= 1) { ## dispersion coefficient
       stop(sprintf("%s %s %s",
                    "Offspring distribution 'nbinom' requires",

From e99cc707aecbabfe9b431a9a02e9ac33d758e83b Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:31:22 +0100
Subject: [PATCH 392/828] Renamed the function

---
 R/simulate.r | 17 +++++++++--------
 1 file changed, 9 insertions(+), 8 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 50b52d27..51e3c336 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -322,14 +322,15 @@ simulate_vect <- function(nchains, offspring_sampler,
 #' @examples
 #' # Simulate with poisson offspring
 #' simulate_tree_from_pop(pop = 100, offspring_sampler = "pois",
-simulate_tree_tracked <- function(pop = 100,
-                          offspring_sampler = c("pois", "nbinom"),
-                          mn_offspring,
-                          disp_offspring,
-                          serial_sampler,
-                          t0 = 0,
-                          tf = Inf,
-                          initial_immune = 0) {
+#' mean_offspring = 0.5, serial_sampler = function(x) 3)
+simulate_tree_from_pop <- function(pop,
+                                   offspring_sampler = c("pois", "nbinom"),
+                                   mean_offspring,
+                                   disp_offspring,
+                                   serial_sampler,
+                                   initial_immune = 0,
+                                   t0 = 0,
+                                   tf = Inf) {
   offspring_sampler <- match.arg(offspring_sampler)
 
   if (offspring_sampler == "pois") {

From 6c270689c8359fcfb1cbb3325eab2f9628b8d07a Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:31:48 +0100
Subject: [PATCH 393/828] Added an example for negative binomial offspring

---
 R/simulate.r | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/R/simulate.r b/R/simulate.r
index 51e3c336..a39a9e80 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -323,6 +323,10 @@ simulate_vect <- function(nchains, offspring_sampler,
 #' # Simulate with poisson offspring
 #' simulate_tree_from_pop(pop = 100, offspring_sampler = "pois",
 #' mean_offspring = 0.5, serial_sampler = function(x) 3)
+#'
+#' #' # Simulate with negative binomial offspring
+#' simulate_tree_from_pop(pop = 100, offspring_sampler = "nbinom",
+#' mean_offspring = 0.5, disp_offspring = 1.1, serial_sampler = function(x) 3)
 simulate_tree_from_pop <- function(pop,
                                    offspring_sampler = c("pois", "nbinom"),
                                    mean_offspring,

From 5451aa96b52d6663cfe8f361004aa9bbca9789f6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:32:20 +0100
Subject: [PATCH 394/828] Renamed some variables

---
 R/simulate.r | 47 ++++++++++++++++++++++-------------------------
 1 file changed, 22 insertions(+), 25 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index a39a9e80..00fe792e 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -365,63 +365,60 @@ simulate_tree_from_pop <- function(pop,
   }
 
   ## initializations
-  tdf <- data.frame(
-    id = 1L,
+  tree_df <- data.frame(
+    sim_id = 1L,
     ancestor = NA_integer_,
     generation = 1L,
     time = t0,
-    offspring_generated = FALSE
+    offspring_generated = FALSE #used to track simulation and dropped afterwards
   )
 
   susc <- pop - initial_immune - 1L
   t <- t0
 
-  ## continue if any unsimulated has t <= tf
+  ## continue if any unsimulated chains have t <= tf
   ## AND there is still susceptibles left
-  while (
-    any(tdf$time[!tdf$offspring_generated] <= tf) &&
-    susc > 0
-  ) {
+  while (any(tree_df$time[!tree_df$offspring_generated] <= tf) && susc > 0) {
 
     ## select from which case to generate offspring
-    t <- min(tdf$time[!tdf$offspring_generated]) # lowest unsimulated t
+    t <- min(tree_df$time[!tree_df$offspring_generated]) # lowest unsimulated t
 
     ## index of the first in df with t, extract vars
-    idx <- which(tdf$time == t & !tdf$offspring_generated)[1]
-    id_parent <- tdf$id[idx]
-    t_parent <- tdf$time[idx]
-    gen_parent <- tdf$generation[idx]
+    idx <- which(tree_df$time == t & !tree_df$offspring_generated)[1]
+    id_parent <- tree_df$sim_id[idx]
+    t_parent <- tree_df$time[idx]
+    gen_parent <- tree_df$generation[idx]
 
     ## generate it
-    current_max_id <- max(tdf$id)
-    n_offspring <- offspring_fun(1, susc)
+    current_max_id <- max(tree_df$sim_id)
+    n_offspring <- offspring_fun(1, susc, pop, mean_offspring, disp_offspring)
 
     if (n_offspring %% 1 > 0) {
       stop("Offspring distribution must return integers")
     }
 
     ## mark as done
-    tdf$offspring_generated[idx] <- TRUE
+    tree_df$offspring_generated[idx] <- TRUE
 
     ## add to df
     if (n_offspring > 0) {
-      ## draw times
-      new_times <- serial(n_offspring)
+      ## draw serial times
+      new_times <- serial_sampler(n_offspring)
 
       if (any(new_times < 0)) {
         stop("Serial interval must be >= 0.")
       }
 
       new_df <- data.frame(
-        id = current_max_id + seq_len(n_offspring),
-        time = new_times + t_parent,
+        sim_id = current_max_id + seq_len(n_offspring),
         ancestor = id_parent,
         generation = gen_parent + 1L,
+        time = new_times + t_parent,
         offspring_generated = FALSE
       )
 
-      ## add new cases to tdf
-      tdf <- rbind(tdf, new_df)
+      ## add new cases to tree_df
+      tree_df <- rbind(tree_df, new_df)
     }
 
     ## adjust susceptibles
@@ -430,11 +427,11 @@ simulate_tree_from_pop <- function(pop,
 
   ## remove cases with time > tf that could
   ## have been generated in the last generation
-  tdf <- tdf[tdf$time <= tf, ]
+  tree_df <- tree_df[tree_df$time <= tf, ]
 
   ## sort output and remove columns not needed
-  tdf <- tdf[order(tdf$time, tdf$id), ]
-  tdf$offspring_generated <- NULL
+  tree_df <- tree_df[order(tree_df$time, tree_df$sim_id), ]
+  tree_df$offspring_generated <- NULL
 
   structure(
     tree_df,

From e3f72f9c49139a10e0c16cd1e08f341e7e1a231e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 12 Jun 2023 23:37:20 +0100
Subject: [PATCH 395/828] Linting: removed whitespaces

---
 R/helpers.R  | 2 +-
 R/simulate.r | 1 -
 2 files changed, 1 insertion(+), 2 deletions(-)

diff --git a/R/helpers.R b/R/helpers.R
index 403e98da..d835653e 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -52,7 +52,7 @@ get_offspring_func <- function(offspring_sampler) {
         b = susc
       )
     }
-  } else{
+  } else {
     stop("offspring_sampler must either be 'pois' or 'nbinom'")
   }
 }
diff --git a/R/simulate.r b/R/simulate.r
index 00fe792e..cef83fc2 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -441,4 +441,3 @@ simulate_tree_from_pop <- function(pop,
     class = c("epichains", "tbl", "data.frame")
   )
 }
-

From b776bff8fefbd8f6d65aac0105709bbef697e790 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 13 Jun 2023 09:52:55 +0100
Subject: [PATCH 396/828] Added methods for head() and tail()

---
 R/epichains.R | 20 ++++++++++++++++++++
 1 file changed, 20 insertions(+)

diff --git a/R/epichains.R b/R/epichains.R
index 55ce2043..6224ae39 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -172,3 +172,23 @@ validate_epichains <- function(x) {
 
   invisible(x)
 }
+
+#' `head` and `tail` methods for [`epichains`] class
+#'
+#' @param x An [`epichains`] object
+#' @param ... further arguments passed to or from other methods
+#'
+#' @return object of class `data.frame`
+#' @export
+#'
+#' @importFrom utils head
+#' @importFrom utils tail
+head.epichains <- function(x, ...) {
+  utils::head(as.data.frame(x), ...)
+}
+
+#' @rdname head.epichains
+#' @export
+tail.epichains <- function(x, ...) {
+  utils::tail(as.data.frame(x), ...)
+}

From b44259cada94fc49aee852ced59a166bbfbeab3b Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 13 Jun 2023 10:23:43 +0100
Subject: [PATCH 397/828] Added a plotting method for epichains objects with
 chains_tree attribute

---
 R/epichains.R | 39 +++++++++++++++++++++++++++++++++++++++
 1 file changed, 39 insertions(+)

diff --git a/R/epichains.R b/R/epichains.R
index 6224ae39..bf59d969 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -192,3 +192,42 @@ head.epichains <- function(x, ...) {
 tail.epichains <- function(x, ...) {
   utils::tail(as.data.frame(x), ...)
 }
+
+#' Plot epichains tree objects
+#'
+#' @param x an [`epichains`] object with a chains_tree attribute
+#' @param ...
+#'
+#' @return
+#' @export
+#' @author James M. Azam
+#' @examples
+plot.epichains <- function(x, ...){
+  validate_epichains(x)
+
+  if (attributes(x)$chain_type != "chains_tree") {
+    stop("Object must be an epichains object with a chains_tree attribute.")
+  }
+
+  cases_per_generation <- aggregate(sim_id ~ generation, x = as.data.frame(x), FUN = NROW)
+
+  cases_per_time <- aggregate(sim_id ~ time, x = as.data.frame(x), FUN = NROW)
+
+  graphics::par(mfrow = c(1, 2), mar = c(4, 3, 3, 1), oma = c(0, 0, 0, 0))
+
+  plot(cases_per_generation$generation,
+       cases_per_generation$sim_id,
+       xlab = "Generation",
+       ylab = "Cases",
+       type = "b",
+       main = "Number of cases per generation"
+       )
+
+  plot(cases_per_time$time,
+       cases_per_time$sim_id,
+       xlab = "Time",
+       ylab = "Cases",
+       type = "b",
+       main = "Number of cases per time"
+  )
+}

From 69acc06401b9911505c4e0539ef52e44e910bcbf Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 13 Jun 2023 11:57:20 +0100
Subject: [PATCH 398/828] Moved chain_ll and helpers here

---
 R/likelihood_estimation.R | 120 ++++++++++++++++----------------------
 1 file changed, 49 insertions(+), 71 deletions(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index 8f663805..3540efd5 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -1,84 +1,65 @@
-#' Estimate the (log) likelihood for observed branching processes
+#' Likelihood for the outcome of a branching process
 #'
-#' @param chains_observed Vector of sizes/lengths of transmission chains.
-#' @param chain_statistic Statistic given as \code{chains_observed}
-#' ("size" or "length" of chains).
-#' @param offspring_sampler Offspring distribution: a character string
-#' corresponding to the R distribution function (e.g., "pois" for Poisson,
-#' where \code{\link{rpois}} is the R function to generate Poisson random
-#' numbers).
-#' @param nsim_obs Number of simulations if the likelihood is to be
-#' approximated for imperfect observations.
-#' @param log_trans Logical; Should the results be log-transformed? (Defaults
-#' to TRUE).
-#' @param obs_prob Observation probability (assumed constant)
-#' @param chain_stat_max Any chains of this size/length will be
-#' treated as infinite.
-#' @param exclude A vector of indices of the sizes/lengths to exclude from the
-#' likelihood calculation.
-#' @param individual If TRUE, a vector of individual (log)likelihood
-#' contributions will be returned rather than the sum.
-#' @param ... Parameters for the offspring distribution.
-#' @return
-#' * A log-likelihood, if \code{log_trans = TRUE} (the default)
-#' * A vector of log-likelihoods, if \code{log_trans = TRUE} (the default) and
-#' \code{obs_prob < 1}, or
-#' * A list of individual log-likelihood contributions, if
-#' \code{log_trans = TRUE} (the default) and \code{individual = TRUE}.
-#' else raw likelihoods, or vector of likelihoods
-#' @seealso offspring_ll, pois_size_ll, nbinom_size_ll, gborel_size_ll,
-#' pois_length_ll, geom_length_ll.
+#' @param x vector of sizes or lengths of transmission chains
+#' @param stat statistic given as \code{x} ("size" or "length" of chains)
+#' @param obs_prob observation probability (assumed constant)
+#' @param infinite any chains of this size/length will be treated as infinite
+#' @param exclude any sizes/lengths to exclude from the likelihood calculation
+#' @param individual if TRUE, a vector of individual log-likelihood
+#' contributions will be returned rather than the sum
+#' @param nsim_obs number of simulations if the likelihood is to be
+#'   approximated for imperfect observations
+#' @param ... parameters for the offspring distribution
+#' @return likelihood, or vector of likelihoods (if \code{obs_prob} < 1), or
+#'  a list of individual likelihood contributions (if \code{individual=TRUE})
+#' @inheritParams chain_sim
+#' @seealso pois_size_ll, nbinom_size_ll, gborel_size_ll, pois_length_ll,
+#'   geom_length_ll, offspring_ll
 #' @author Sebastian Funk
-#' @examples
-#' # example of observed chain sizes
-#' chain_sizes <- c(1, 1, 4, 7)
-#' estimate_likelihood(chains_observed = chain_sizes, chain_statistic = "size",
-#'  offspring_sampler = "pois", nsim_obs = 100, lambda = 0.5)
 #' @export
-estimate_likelihood <- function(chains_observed,
-                                chain_statistic = c("size", "length"),
-                                offspring_sampler,
-                                nsim_obs,
-                                log_trans = TRUE,
-                                obs_prob = 1, chain_stat_max = Inf,
-                                exclude = NULL, individual = FALSE, ...) {
-  chain_statistic <- match.arg(chain_statistic)
+#' @examples
+#' chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
+#' chain_ll(chain_sizes, "pois", "size", lambda = 0.5)
+chain_ll <- function(x, offspring, stat = c("size", "length"), obs_prob = 1,
+                     infinite = Inf, exclude = NULL, individual = FALSE,
+                     nsim_obs, ...) {
+  stat <- match.arg(stat)
 
   ## checks
-  check_offspring_valid(offspring_sampler)
-
+  if (!is.character(offspring)) {
+    stop("Object passed as 'offspring' is not a character string.")
+  }
   if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
   if (obs_prob < 1) {
     if (missing(nsim_obs)) {
-      stop("'nsim_obs' must be specified if 'obs_prob' is < 1")
+      stop("'nsim_obs' must be specified if 'obs_prob' is <1")
     }
-
-    sample_func <- get_chain_statistic_func(chain_statistic)
-
-    sampled_x <- replicate(nsim_obs, pmin(sample_func(length(chains_observed),
-                                           chains_observed, obs_prob
-                                           ),
-                               chain_stat_max), simplify = FALSE)
+    if (stat == "size") {
+      sample_func <- rbinom_size
+    } else if (stat == "length") {
+      sample_func <- rgen_length
+    }
+    sampled_x <-
+      replicate(nsim_obs, pmin(sample_func(length(x), x, obs_prob),
+                               infinite), simplify = FALSE)
     size_x <- unlist(sampled_x)
-    if (!is.finite(chain_stat_max)) {
-      chain_stat_max <- max(size_x) + 1
-      }
+    if (!is.finite(infinite)) infinite <- max(size_x) + 1
   } else {
-    chains_observed[chains_observed >= chain_stat_max] <- chain_stat_max
-    size_x <- chains_observed
-    sampled_x <- list(chains_observed)
+    x[x >= infinite] <- infinite
+    size_x <- x
+    sampled_x <- list(x)
   }
 
   ## determine for which sizes to calculate the likelihood (for true chain size)
-  if (any(size_x == chain_stat_max)) {
-    calc_sizes <- seq_len(chain_stat_max - 1)
+  if (any(size_x == infinite)) {
+    calc_sizes <- seq_len(infinite - 1)
   } else {
     calc_sizes <- unique(c(size_x, exclude))
   }
 
-  ## get likelihood function as given by offspring_sampler and chain_statistic
+  ## get likelihood function as given by `offspring` and `stat``
   likelihoods <- vector(mode = "numeric")
-  ll_func <- construct_offspring_ll_name(offspring_sampler, chain_statistic)
+  ll_func <- paste(offspring, stat, "ll", sep = "_")
   pars <- as.list(unlist(list(...))) ## converts vectors to lists
 
   ## calculate likelihoods
@@ -90,16 +71,15 @@ estimate_likelihood <- function(chains_observed,
       do.call(
         offspring_ll,
         c(list(
-          chains_observed = calc_sizes, offspring_sampler = offspring_sampler,
-          chain_statistic = chain_statistic, chain_stat_max = chain_stat_max,
-          log_trans = log_trans
+          x = calc_sizes, offspring = offspring,
+          stat = stat, infinite = infinite
         ), pars)
       )
   }
 
-  ## assign probabilities to chain_stat_max outbreak sizes
-  if (any(size_x == chain_stat_max)) {
-    likelihoods[chain_stat_max] <- complementary_logprob(likelihoods)
+  ## assign probabilities to infinite outbreak sizes
+  if (any(size_x == infinite)) {
+    likelihoods[infinite] <- complementary_logprob(likelihoods)
   }
 
   if (!missing(exclude)) {
@@ -116,9 +96,7 @@ estimate_likelihood <- function(chains_observed,
     likelihoods[sx[!(sx %in% exclude)]]
   })
 
-  if (!individual) {
-    chains_likelihood <- vapply(chains_likelihood, sum, 0)
-    }
+  if (!individual) chains_likelihood <- vapply(chains_likelihood, sum, 0)
 
   return(chains_likelihood)
 }

From f858c278d3255237d5f9b3c9c38a4b565bc0f9d3 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 13 Jun 2023 11:57:36 +0100
Subject: [PATCH 399/828] Moved chain_ll from here

---
 R/likelihoods.R | 3 +--
 1 file changed, 1 insertion(+), 2 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 521052c9..74a8af0e 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -123,6 +123,5 @@ offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
     )$y))
   lik <- acdf[chains_observed]
   lik[is.na(lik)] <- 0
-  out <- ifelse(base::isTRUE(log_trans), log(lik), lik)
-  return(out)
+  log(lik)
 }

From 77459c5f720b3a2f9ea0fd050a7b2799e9dd2ac6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 13 Jun 2023 11:58:06 +0100
Subject: [PATCH 400/828] Added script for testing refactored functions

---
 R/test_refactoring.R | 39 +++++++++++++++++++++++++++++++++++++++
 1 file changed, 39 insertions(+)
 create mode 100644 R/test_refactoring.R

diff --git a/R/test_refactoring.R b/R/test_refactoring.R
new file mode 100644
index 00000000..5bf9fcf8
--- /dev/null
+++ b/R/test_refactoring.R
@@ -0,0 +1,39 @@
+
+source("./R/checks.R")
+source("./R/helpers.R")
+source("./R/epichains.R")
+source("./R/simulate.r")
+
+
+# try simulate_tree()
+chains_tree <- simulate_tree(nchains = 10,
+                                   serials_sampler = function(n) {rpois(n, 5)},
+                                   offspring_sampler = "pois",
+                                   lambda = 2,
+                                   chain_stat_max = 10
+                                   )
+
+
+chains_tree
+summary(chains_tree)
+plot(chains_tree)
+
+# try simulate_tree_from_pop()
+
+chains_tree_from_pop <- simulate_tree_from_pop(
+  pop = 100, offspring_sampler = "nbinom",
+  mean_offspring = 0.5, disp_offspring = 1.1,
+  serial_sampler = function(x) 3)
+
+chains_tree_from_pop
+summary(chains_tree_from_pop)
+plot(chains_tree_from_pop)
+
+# try chain_vec simulation
+chains_vec <- simulate_vect(nchains = 10, offspring_sampler = "pois",
+                             lambda = 2, chain_stat_max = 10
+                             )
+
+chains_vec
+summary(chains_vec)
+# plot(chains_vec) #expect error

From 5e70fa09d461b168ddfca5a17a088b5a0c6f9f61 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 13 Jun 2023 11:58:28 +0100
Subject: [PATCH 401/828] Removed redundant roxygen tags

---
 R/checks.R | 18 ++++++------------
 1 file changed, 6 insertions(+), 12 deletions(-)

diff --git a/R/checks.R b/R/checks.R
index dea04268..69acb27d 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -1,11 +1,7 @@
 #' Check if offspring argument is specified as a character string
 #'
 #' @param offspring
-#'
-#' @return
-#' @export
 #' @keywords internal
-#' @examples
 check_offspring_valid <- function(offspring) {
   if (!is.character(offspring)) {
     stop(sprintf(
@@ -20,11 +16,7 @@ check_offspring_valid <- function(offspring) {
 #' Check if constructed random number generator for offspring exists
 #'
 #' @param roffspring_name
-#'
-#' @return
-#' @export
-#'
-#' @examples check_offspring_exists("rpois")
+#' @keywords internal
 check_offspring_func_valid <- function(roffspring_name) {
   if (!(exists(roffspring_name)) || !is.function(get(roffspring_name))) {
     stop("Function ", roffspring_name, " does not exist.")
@@ -36,10 +28,7 @@ check_offspring_func_valid <- function(roffspring_name) {
 #'
 #' @param serials_sampler
 #'
-#' @return
-#' @export
 #' @keywords internal
-#' @examples
 check_serial_valid <- function(serials_sampler) {
   if (!is.function(serials_sampler)) {
     stop(sprintf(
@@ -51,6 +40,11 @@ check_serial_valid <- function(serials_sampler) {
 }
 
 
+#' Check that nchains is greater than 0 and not infinite
+#'
+#' @param nchains
+#'
+#' @keywords internal
 check_nchains_valid <- function(nchains) {
   if (nchains < 1 || is.infinite(nchains)) {
     stop("`nchains` must be > 0 but less than `Inf`")

From 9daaea29b9f5d0ca95f537ca9a82ee62f2092e25 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:08:02 +0100
Subject: [PATCH 402/828] Added stats to imports

---
 DESCRIPTION | 3 +--
 1 file changed, 1 insertion(+), 2 deletions(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index b50545a1..157f7803 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -24,8 +24,7 @@ BugReports: https://github.com/epiverse-trace/epichains/issues
 Depends:
     R (>= 3.6.0)
 Imports: 
-    stats,
-    utils
+    stats
 Suggests:
     bookdown,
     covr,

From 3bf9e8336c3707d7a8f6c21cf5a5ce17b1bb6072 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:10:20 +0100
Subject: [PATCH 403/828] Regenerated function docs

---
 man/is_epichains.Rd   |  1 -
 man/offspring_ll.Rd   |  9 +++++++++
 man/plot.epichains.Rd | 22 ++++++++++++++++++++++
 man/simulate_vect.Rd  |  2 +-
 4 files changed, 32 insertions(+), 2 deletions(-)
 create mode 100644 man/plot.epichains.Rd

diff --git a/man/is_epichains.Rd b/man/is_epichains.Rd
index aa2d540d..dd365904 100644
--- a/man/is_epichains.Rd
+++ b/man/is_epichains.Rd
@@ -16,4 +16,3 @@ otherwise
 \description{
 Checks whether the object is an \code{epichains}
 }
-\keyword{internal}
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index 8556f5b1..1280c21a 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -44,4 +44,13 @@ cumulative distribution function (ecdf).
 \author{
 Sebastian Funk
 }
+\keyword{Compute}
+\keyword{Cumulative}
+\keyword{Distribution}
+\keyword{Function}
+\keyword{chains}
+\keyword{empirical}
 \keyword{internal}
+\keyword{of}
+\keyword{simulated}
+\keyword{the}
diff --git a/man/plot.epichains.Rd b/man/plot.epichains.Rd
new file mode 100644
index 00000000..7fa17943
--- /dev/null
+++ b/man/plot.epichains.Rd
@@ -0,0 +1,22 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{plot.epichains}
+\alias{plot.epichains}
+\title{Plot epichains tree objects}
+\usage{
+\method{plot}{epichains}(x, ...)
+}
+\arguments{
+\item{x}{An \code{\link{epichains}} object with a chains_tree attribute}
+
+\item{...}{Other arguments passed to plot}
+}
+\value{
+A plot of cases over time and generation
+}
+\description{
+Plot epichains tree objects
+}
+\author{
+James M. Azam
+}
diff --git a/man/simulate_vect.Rd b/man/simulate_vect.Rd
index cdef8113..cd7fafc7 100644
--- a/man/simulate_vect.Rd
+++ b/man/simulate_vect.Rd
@@ -35,6 +35,6 @@ computed. Results above the specified value, are set to \code{Inf}.}
 Simulate transmission chains without tree (as a vector)
 }
 \examples{
-simulate_vect(nchains = 10, offspring_sampler = "pois", lambda = 2,
+simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,
 chain_stat_max = 10)
 }

From 94bff6a36aba9955a39a46f6aeb00c00cd2d5c0b Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:10:45 +0100
Subject: [PATCH 404/828] Regenerated NAMESPACE

---
 NAMESPACE | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/NAMESPACE b/NAMESPACE
index e31e4230..61a29bb9 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -1,18 +1,18 @@
 # Generated by roxygen2: do not edit by hand
 
-S3method(aggregate,epichains)
 S3method(format,epichains)
 S3method(head,epichains)
+S3method(plot,epichains)
 S3method(print,epichains)
 S3method(summary,epichains)
 S3method(tail,epichains)
 export(dborel)
 export(estimate_likelihood)
+export(is_epichains)
 export(rborel)
 export(rnbinom_mean_disp)
 export(simulate_tree)
 export(simulate_tree_from_pop)
 export(simulate_vect)
-importFrom(stats,aggregate)
 importFrom(utils,head)
 importFrom(utils,tail)

From f3d75729a76a28b2faa8db1a6bdc5b6288b15cc5 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:13:08 +0100
Subject: [PATCH 405/828] Changed old argument documentation to sentence case

---
 R/simulate.r | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index cef83fc2..f9467900 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,6 +1,6 @@
 #' Simulate a tree of infections with a serial and offspring distributions
 #'
-#' @param nchains number of chains to simulate
+#' @param nchains Number of chains to simulate.
 #' @param offspring_sampler Offspring distribution: a character string
 #' corresponding to the R distribution function (e.g., "pois" for Poisson,
 #' where \code{\link{rpois}} is the R function to generate Poisson random
@@ -324,7 +324,7 @@ simulate_vect <- function(nchains, offspring_sampler,
 #' simulate_tree_from_pop(pop = 100, offspring_sampler = "pois",
 #' mean_offspring = 0.5, serial_sampler = function(x) 3)
 #'
-#' #' # Simulate with negative binomial offspring
+#' # Simulate with negative binomial offspring
 #' simulate_tree_from_pop(pop = 100, offspring_sampler = "nbinom",
 #' mean_offspring = 0.5, disp_offspring = 1.1, serial_sampler = function(x) 3)
 simulate_tree_from_pop <- function(pop,

From 252a11d76df9c93dc341618e9774fca1b927df3a Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:14:43 +0100
Subject: [PATCH 406/828] Documented the print method

---
 R/epichains.R | 6 ++++++
 1 file changed, 6 insertions(+)

diff --git a/R/epichains.R b/R/epichains.R
index bf59d969..883aa708 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -1,3 +1,9 @@
+#' Print an [`epichains`] object
+#'
+#' @param x An [`epichains`] object.
+#' @param ... Other parameters passed to [print()].
+#' @return Invisibly returns an [`epichains`]. Called for side-effects.
+#' @export
 print.epichains <- function(x, ...) {
   format(x, ...)
 }

From 50536e33215ebdd60a95246584fced77dd81fbbc Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:15:09 +0100
Subject: [PATCH 407/828] Removed tibble import

---
 R/epichains.R | 1 -
 1 file changed, 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 883aa708..7d49ce1b 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -12,7 +12,6 @@ print.epichains <- function(x, ...) {
 #'
 #' @param x epichains object
 #' @param ... further arguments passed to or from other methods
-#' @importFrom tibble as_tibble
 #' @return Invisibly returns an [`epichains`]. Called for printing side-effects.
 #' @export
 #'

From 59aa3c3d38238e57a8d8a3b580aad02695f927ae Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:15:39 +0100
Subject: [PATCH 408/828] Replaced subset() call with [ call

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 7d49ce1b..9b885d4b 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -33,7 +33,7 @@ format.epichains <- function(x, ...) {
       )
 
     # print head of the simulation output
-    print(head(subset(as.data.frame(x), !is.na(ancestor))))
+    print(head(x[!is.na(x$ancestor), ]))
 
     cat("< tree tail >\n")
 

From 631e53270c44fef429a3b45e68dd61c28decb551 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:16:55 +0100
Subject: [PATCH 409/828] Removed old chain_ll() tests

---
 tests/testthat/tests-ll.r | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/tests/testthat/tests-ll.r b/tests/testthat/tests-ll.r
index 13a7c339..a29bb9ac 100644
--- a/tests/testthat/tests-ll.r
+++ b/tests/testthat/tests-ll.r
@@ -8,3 +8,5 @@ test_that("Analytical size or length distributions are implemented", {
   expect_true(all(pois_length_ll(chains, lambda = 0.5) < 0))
   expect_true(all(geom_length_ll(chains, prob = 0.5) < 0))
 })
+
+

From 90f97589179ec723e1468f73446dea1effcb311c Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:18:16 +0100
Subject: [PATCH 410/828] Replaced calls to sim_chain_tree() with
 simulate_tree()

---
 R/simulate.r | 16 +++++++---------
 1 file changed, 7 insertions(+), 9 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index f9467900..392ab074 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -29,7 +29,7 @@
 #' @author James M. Azam, Sebastian Funk
 #' @export
 #' @details
-#' `sim_chain_tree()` simulates a branching process of the form:
+#' `simulate_tree()` simulates a branching process of the form:
 #' WIP
 #' # The serial interval (`serials_sampler`):
 #'
@@ -46,7 +46,7 @@
 #'
 #' See References below for some literature on the subject.
 #'
-#' ## Specifying `serials_sampler` in `sim_chain_tree()`
+#' ## Specifying `serials_sampler` in `simulate_tree()`
 #'
 #' `serials_sampler` must be specified as a named or
 #' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R) # nolint
@@ -59,12 +59,12 @@
 #' number of serial intervals to sample:
 #' \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
 #' and assign the name of the function to `serials_sampler` in
-#' `sim_chain_tree()` like so
-#' \code{sim_chain_tree(..., serials_sampler = serial_interval)},
-#' where `...` are the other arguments to `sim_chain_tree()`.
+#' `simulate_tree()` like so
+#' \code{simulate_tree(..., serials_sampler = serial_interval)},
+#' where `...` are the other arguments to `simulate_tree()`.
 #'
 #' Alternatively, we could assign an anonymous function to `serials_sampler`
-#' in the `sim_chain_tree()` call like so
+#' in the `simulate_tree()` call like so
 #' \code{simulate_tree(..., serials_sampler = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
 #' where `...` are the other arguments to `simulate_tree()`.
 #' @seealso [simulate_vec()] for simulating transmission chains as a vector
@@ -81,11 +81,9 @@
 #' doi: 10.1098/rsif.2020.0756. Epub 2021 Jan 6.
 #' PMID: 33402022; PMCID: PMC7879757.
 #'
-#'
 #' Fine PE. The interval between successive cases of an
 #' infectious disease. Am J Epidemiol. 2003 Dec 1;158(11):1039-47.
 #' doi: 10.1093/aje/kwg251. PMID: 14630599.
-#'
 simulate_tree <- function(nchains, offspring_sampler,
                            chain_statistic = c("size", "length"),
                            chain_stat_max = Inf, serials_sampler, t0 = 0,
@@ -213,7 +211,7 @@ simulate_tree <- function(nchains, offspring_sampler,
 
 #' Simulate transmission chains without tree (as a vector)
 #'
-#' @inheritParams sim_chain_tree
+#' @inheritParams simulate_tree
 #' @param chain_stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to `Inf`.
 #' @examples #' simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,

From 57107ae860dffb23183b674ab39185f0bda97ffa Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:18:50 +0100
Subject: [PATCH 411/828] Fixed wrong call to simulate_vect() as simulate_vec()

---
 R/simulate.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index 392ab074..3305d70d 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -67,7 +67,7 @@
 #' in the `simulate_tree()` call like so
 #' \code{simulate_tree(..., serials_sampler = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
 #' where `...` are the other arguments to `simulate_tree()`.
-#' @seealso [simulate_vec()] for simulating transmission chains as a vector
+#' @seealso [simulate_vect()] for simulating transmission chains as a vector
 #' @examples
 #' set.seed(123)
 #' chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,

From 5e03fbf00e9bf2c2d1ec3279623abafbc2233a96 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:19:35 +0100
Subject: [PATCH 412/828] Fixed examples with right function and argument names

---
 R/simulate.r | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 3305d70d..707bee1d 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -71,7 +71,7 @@
 #' @examples
 #' set.seed(123)
 #' chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,
-#' offspring = "pois", lambda = 2, infinite = 10)
+#' offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
 #' chains
 #' @references
 #'
@@ -214,7 +214,8 @@ simulate_tree <- function(nchains, offspring_sampler,
 #' @inheritParams simulate_tree
 #' @param chain_stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to `Inf`.
-#' @examples #' simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,
+#' @examples
+#' simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,
 #' chain_stat_max = 10)
 simulate_vect <- function(nchains, offspring_sampler,
                            chain_statistic = c("size", "length"),

From 8ff3754fdc5c63e62b6652fb12d35465f011a584 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:20:13 +0100
Subject: [PATCH 413/828] Added export tags

---
 R/simulate.r | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index 707bee1d..ff6fad0e 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -217,6 +217,7 @@ simulate_tree <- function(nchains, offspring_sampler,
 #' @examples
 #' simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,
 #' chain_stat_max = 10)
+#' @export
 simulate_vect <- function(nchains, offspring_sampler,
                            chain_statistic = c("size", "length"),
                            chain_stat_max = Inf, ...) {
@@ -317,7 +318,6 @@ simulate_vect <- function(nchains, offspring_sampler,
 #' "nbinom").
 #' @author Flavio Finger
 #' @author James M. Azam
-#' @export
 #' @examples
 #' # Simulate with poisson offspring
 #' simulate_tree_from_pop(pop = 100, offspring_sampler = "pois",
@@ -326,6 +326,7 @@ simulate_vect <- function(nchains, offspring_sampler,
 #' # Simulate with negative binomial offspring
 #' simulate_tree_from_pop(pop = 100, offspring_sampler = "nbinom",
 #' mean_offspring = 0.5, disp_offspring = 1.1, serial_sampler = function(x) 3)
+#' @export
 simulate_tree_from_pop <- function(pop,
                                    offspring_sampler = c("pois", "nbinom"),
                                    mean_offspring,

From 8e18032c528218b7da9b0a01eccdd1bddca40a43 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:21:17 +0100
Subject: [PATCH 414/828] Fixed the documentation of plot()

---
 R/epichains.R | 9 ++++-----
 1 file changed, 4 insertions(+), 5 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 9b885d4b..17fd9b7e 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -200,13 +200,12 @@ tail.epichains <- function(x, ...) {
 
 #' Plot epichains tree objects
 #'
-#' @param x an [`epichains`] object with a chains_tree attribute
-#' @param ...
+#' @param x An [`epichains`] object with a chains_tree attribute
+#' @param ... Other arguments passed to plot
 #'
-#' @return
-#' @export
+#' @return A plot of cases over time and generation
 #' @author James M. Azam
-#' @examples
+#' @export
 plot.epichains <- function(x, ...){
   validate_epichains(x)
 

From b1360a4fc59e2045b6d61caa2a08720b438ea03b Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:22:09 +0100
Subject: [PATCH 415/828] Added explicit namespacing for imports

---
 R/epichains.R | 7 +++++--
 1 file changed, 5 insertions(+), 2 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 17fd9b7e..aa0bf437 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -216,10 +216,12 @@ plot.epichains <- function(x, ...){
   cases_per_generation <- aggregate(sim_id ~ generation, x = as.data.frame(x), FUN = NROW)
 
   cases_per_time <- aggregate(sim_id ~ time, x = as.data.frame(x), FUN = NROW)
+  cases_per_time <- stats::aggregate(sim_id ~ time, x = as.data.frame(x), FUN = NROW)
 
   graphics::par(mfrow = c(1, 2), mar = c(4, 3, 3, 1), oma = c(0, 0, 0, 0))
 
-  plot(cases_per_generation$generation,
+  # Make first plot
+  graphics::plot(cases_per_generation$generation,
        cases_per_generation$sim_id,
        xlab = "Generation",
        ylab = "Cases",
@@ -227,7 +229,8 @@ plot.epichains <- function(x, ...){
        main = "Number of cases per generation"
        )
 
-  plot(cases_per_time$time,
+  # Make second plot
+  graphics::plot(cases_per_time$time,
        cases_per_time$sim_id,
        xlab = "Time",
        ylab = "Cases",

From 9acc289b6dd4d2b939f0496ec0e5a1837ca341bc Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:22:31 +0100
Subject: [PATCH 416/828] Removed example tag from format method

---
 R/epichains.R | 2 --
 1 file changed, 2 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index aa0bf437..f4f734fb 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -14,8 +14,6 @@ print.epichains <- function(x, ...) {
 #' @param ... further arguments passed to or from other methods
 #' @return Invisibly returns an [`epichains`]. Called for printing side-effects.
 #' @export
-#'
-#' @examples
 format.epichains <- function(x, ...) {
   # check that x is an epichains object
   validate_epichains(x)

From f98debce7a8f1e6a5768e13871ba67f5ba866515 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:28:11 +0100
Subject: [PATCH 417/828] Added code to calculate cases per generation

---
 R/epichains.R | 10 +++++++---
 1 file changed, 7 insertions(+), 3 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index f4f734fb..1e2718ac 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -211,11 +211,15 @@ plot.epichains <- function(x, ...){
     stop("Object must be an epichains object with a chains_tree attribute.")
   }
 
-  cases_per_generation <- aggregate(sim_id ~ generation, x = as.data.frame(x), FUN = NROW)
-
-  cases_per_time <- aggregate(sim_id ~ time, x = as.data.frame(x), FUN = NROW)
+  # Count the number of cases per generation
+  cases_per_generation <- stats::aggregate(sim_id ~ generation,
+                                           x = as.data.frame(x),
+                                           FUN = NROW
+                                           )
+  # Count the number of cases per time
   cases_per_time <- stats::aggregate(sim_id ~ time, x = as.data.frame(x), FUN = NROW)
 
+  # Set up grid
   graphics::par(mfrow = c(1, 2), mar = c(4, 3, 3, 1), oma = c(0, 0, 0, 0))
 
   # Make first plot

From f92b1610e2e412a480b8c03b5002b53532d293f4 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:29:05 +0100
Subject: [PATCH 418/828] Cleaned up documentation of the head and tail methods

---
 R/epichains.R | 15 +++++++++------
 1 file changed, 9 insertions(+), 6 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 1e2718ac..368e2f79 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -145,6 +145,7 @@ is_epichains <- function(x) {
 #'
 #' @return Checks if an object is of class `epichains` and if so
 #' checks that it's in the right format as a "data.frame" or vector.
+#' @keywords internal
 validate_epichains <- function(x) {
   if (!is_epichains(x)) {
     stop("Object must have an epichains class")
@@ -176,21 +177,23 @@ validate_epichains <- function(x) {
   invisible(x)
 }
 
-#' `head` and `tail` methods for [`epichains`] class
+#' `head` method for [`epichains`] class
 #'
 #' @param x An [`epichains`] object
 #' @param ... further arguments passed to or from other methods
-#'
+#' @importFrom utils head
 #' @return object of class `data.frame`
+#' @author James M. Azam
 #' @export
-#'
-#' @importFrom utils head
-#' @importFrom utils tail
 head.epichains <- function(x, ...) {
   utils::head(as.data.frame(x), ...)
 }
 
-#' @rdname head.epichains
+#' `tail` method for [`epichains`] class
+#' @param x An [`epichains`] object
+#' @param ... further arguments passed to or from other methods
+#' @importFrom utils tail
+#' @author James M. Azam
 #' @export
 tail.epichains <- function(x, ...) {
   utils::tail(as.data.frame(x), ...)

From 7020d5f9795394eceff6847101b742bce82e939a Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:30:11 +0100
Subject: [PATCH 419/828] Documented validate_epichains() and is_epichains()

---
 R/epichains.R | 3 +--
 1 file changed, 1 insertion(+), 2 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 368e2f79..f2afb071 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -133,8 +133,6 @@ summary.epichains <- function(x, ...) {
 #' @return logical, `TRUE` if the object is an `epichains` and `FALSE`
 #' otherwise
 #' @export
-#'
-#' @examples
 is_epichains <- function(x) {
   inherits(x, "epichains")
 }
@@ -146,6 +144,7 @@ is_epichains <- function(x) {
 #' @return Checks if an object is of class `epichains` and if so
 #' checks that it's in the right format as a "data.frame" or vector.
 #' @keywords internal
+#' @author James M. Azam
 validate_epichains <- function(x) {
   if (!is_epichains(x)) {
     stop("Object must have an epichains class")

From 1c2b7e016afa1895b5bc8179dfeab78b469d76f5 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:31:13 +0100
Subject: [PATCH 420/828] Replaced "x" with "object" in summary method to align
 with generic

---
 R/epichains.R | 28 +++++++++++++---------------
 1 file changed, 13 insertions(+), 15 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index f2afb071..360580ec 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -78,29 +78,27 @@ format.epichains <- function(x, ...) {
 
 #' Summary method for epichains class
 #'
-#' @param object epichains object
+#' @param object An [`epichains`] object
 #' @param ... further arguments passed to or from other methods
 #'
 #' @return data frame of information
 #' @export
-#'
-#' @examples
-summary.epichains <- function(x, ...) {
-  validate_epichains(x)
+summary.epichains <- function(object, ...) {
+  validate_epichains(object)
 
-  if (attributes(x)$chain_type == "chains_tree") {
+  if (attributes(object)$chain_type == "chains_tree") {
 
-    chains_ran <- length(x$n)
+    chains_ran <- length(object$n)
 
-    max_time <- max(x$time)
+    max_time <- max(object$time)
 
     n_unique_ancestors <- length(
-      unique(x$ancestor[!is.na(x$ancestor)])
+      unique(object$ancestor[!is.na(object$ancestor)])
     )
 
-    num_generations <- length(unique(x$generation))
+    num_generations <- length(unique(object$generation))
 
-    max_generation <- max(x$generation)
+    max_generation <- max(object$generation)
 
     # out of summary
     res <- list(
@@ -111,10 +109,10 @@ summary.epichains <- function(x, ...) {
       num_generations = num_generations,
       max_generation = max_generation
     )
-  } else if (attributes(x)$chain_type == "chains_vec") {
-    chains_ran <- length(x)
-    max_chain_stat <- max(!is.infinite(x))
-    min_chain_stat <- min(!is.infinite(x))
+  } else if (attributes(object)$chain_type == "chains_vec") {
+    chains_ran <- length(object)
+    max_chain_stat <- max(!is.infinite(object))
+    min_chain_stat <- min(!is.infinite(object))
 
     res <- list(
       unique_chains = chains_ran,

From 95d56ae0d3c571c7e43e80fab9cd70d040555119 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:32:05 +0100
Subject: [PATCH 421/828] Cleaned up documentation of update_chain_stat()

---
 R/helpers.R | 10 ++++------
 1 file changed, 4 insertions(+), 6 deletions(-)

diff --git a/R/helpers.R b/R/helpers.R
index d835653e..94bc981a 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -1,12 +1,10 @@
 #' Determine and update the chain statistic being tracked
 #'
-#' @param stat_type
-#' @param noffspring
-#'
-#' @return
-#' @export
+#' @param stat_type Chain statistic (size/length) to update.
+#' @param stat_latest The latest chain statistic vector to be updated.
+#' @param n_offspring A vector of offspring per chain.
+#' @return A vector of chain statistics (size/length).
 #' @keywords internal
-#' @examples
 update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
   if (stat_type == "size") {
     stat_latest <- stat_latest + n_offspring

From 01c0cab8bb50fc919321c27650de35a0463ae562 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:33:03 +0100
Subject: [PATCH 422/828] Cleaned up documentation of get_offspring_func()

---
 R/helpers.R | 13 ++++++++-----
 1 file changed, 8 insertions(+), 5 deletions(-)

diff --git a/R/helpers.R b/R/helpers.R
index 94bc981a..7489c2e7 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -18,12 +18,15 @@ update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
 
 #' Get offspring sampling function
 #'
-#' @param offspring_sampler
+#' @param n Number of items to sample
+#' @param susc Susceptible population size (calculated
+#' inside \code{\link{simulate_tree_from_pop}}  as pop - initial_immune)
+#' @inheritParams simulate_tree_from_pop
 #'
-#' @return
-#' @export
-#'
-#' @examples
+#' @return An offspring sampling function
+#' @keywords internal
+get_offspring_func <- function(offspring_sampler, n, susc, pop,
+                               mean_offspring, disp_offspring = NULL) {
 get_offspring_func <- function(offspring_sampler) {
   if (offspring_sampler == "nbinom") {
     function(n, susc, pop, mean_offspring, disp_offspring) {

From cbb328056d7d596a4dab6e78a0309cb3244c2fd2 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:34:18 +0100
Subject: [PATCH 423/828] Deleted old arguments of get_offspring_func()

---
 R/helpers.R | 1 -
 1 file changed, 1 deletion(-)

diff --git a/R/helpers.R b/R/helpers.R
index 7489c2e7..ce56da8e 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -27,7 +27,6 @@ update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
 #' @keywords internal
 get_offspring_func <- function(offspring_sampler, n, susc, pop,
                                mean_offspring, disp_offspring = NULL) {
-get_offspring_func <- function(offspring_sampler) {
   if (offspring_sampler == "nbinom") {
     function(n, susc, pop, mean_offspring, disp_offspring) {
       ## get distribution params from mean and dispersion

From 62079a4d6e07eb5146392f6a93486449138189d0 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:35:10 +0100
Subject: [PATCH 424/828] Added required arguments to truncated poisson
 function

---
 R/helpers.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/helpers.R b/R/helpers.R
index ce56da8e..bd4fc844 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -44,7 +44,7 @@ get_offspring_func <- function(offspring_sampler, n, susc, pop,
       )
     }
   } else if (offspring_sampler == "pois") {
-    function(n, susc, pop, mean_offspring) {
+    function(n, susc, pop, mean_offspring, disp_offspring) {
       truncdist::rtrunc(
         n,
         spec = "pois",

From 2f181af0cd4fd17d2894c843208c77044bf3e07c Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:37:29 +0100
Subject: [PATCH 425/828] Reworded title of estimate_likelihood

---
 R/likelihood_estimation.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index 3540efd5..d09c11ef 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -1,4 +1,4 @@
-#' Likelihood for the outcome of a branching process
+#' Estimate the (log) likelihood for observed branching processes
 #'
 #' @param x vector of sizes or lengths of transmission chains
 #' @param stat statistic given as \code{x} ("size" or "length" of chains)

From e5a8b8b26a62ce43abbd9dd72d3895d32184a564 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:41:01 +0100
Subject: [PATCH 426/828] Redocumented estimate_likelihood() owing to new and
 renamed arguments

---
 R/likelihood_estimation.R | 56 ++++++++++++++++++++++++---------------
 1 file changed, 34 insertions(+), 22 deletions(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index d09c11ef..a6c38603 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -1,29 +1,41 @@
 #' Estimate the (log) likelihood for observed branching processes
 #'
-#' @param x vector of sizes or lengths of transmission chains
-#' @param stat statistic given as \code{x} ("size" or "length" of chains)
-#' @param obs_prob observation probability (assumed constant)
-#' @param infinite any chains of this size/length will be treated as infinite
-#' @param exclude any sizes/lengths to exclude from the likelihood calculation
-#' @param individual if TRUE, a vector of individual log-likelihood
-#' contributions will be returned rather than the sum
-#' @param nsim_obs number of simulations if the likelihood is to be
-#'   approximated for imperfect observations
-#' @param ... parameters for the offspring distribution
-#' @return likelihood, or vector of likelihoods (if \code{obs_prob} < 1), or
-#'  a list of individual likelihood contributions (if \code{individual=TRUE})
-#' @inheritParams chain_sim
-#' @seealso pois_size_ll, nbinom_size_ll, gborel_size_ll, pois_length_ll,
-#'   geom_length_ll, offspring_ll
+#' @param chains_observed Vector of sizes/lengths of transmission chains.
+#' @param chain_statistic Statistic given as \code{chains_observed}
+#' ("size" or "length" of chains).
+#' @param offspring_sampler Offspring distribution: a character string
+#' corresponding to the R distribution function (e.g., "pois" for Poisson,
+#' where \code{\link{rpois}} is the R function to generate Poisson random
+#' numbers).
+#' @param nsim_obs Number of simulations if the likelihood is to be
+#' approximated for imperfect observations.
+#' @param log_trans Logical; Should the results be log-transformed? (Defaults
+#' to TRUE).
+#' @param obs_prob Observation probability (assumed constant)
+#' @param chain_stat_max Any chains of this size/length will be
+#' treated as infinite.
+#' @param exclude A vector of indices of the sizes/lengths to exclude from the
+#' likelihood calculation.
+#' @param individual If TRUE, a vector of individual (log)likelihood
+#' contributions will be returned rather than the sum.
+#' @param ... Parameters for the offspring distribution.
+#' @return
+#' * A log-likelihood, if \code{log_trans = TRUE} (the default)
+#' * A vector of log-likelihoods, if \code{log_trans = TRUE} (the default) and
+#' \code{obs_prob < 1}, or
+#' * A list of individual log-likelihood contributions, if
+#' \code{log_trans = TRUE} (the default) and \code{individual = TRUE}.
+#' else raw likelihoods, or vector of likelihoods
+#' @seealso offspring_ll, pois_size_ll, nbinom_size_ll, gborel_size_ll,
+#' pois_length_ll, geom_length_ll.
 #' @author Sebastian Funk
-#' @export
 #' @examples
-#' chain_sizes <- c(1, 1, 4, 7) # example of observed chain sizes
-#' chain_ll(chain_sizes, "pois", "size", lambda = 0.5)
-chain_ll <- function(x, offspring, stat = c("size", "length"), obs_prob = 1,
-                     infinite = Inf, exclude = NULL, individual = FALSE,
-                     nsim_obs, ...) {
-  stat <- match.arg(stat)
+#' # example of observed chain sizes
+#' chain_sizes <- c(1, 1, 4, 7)
+#' estimate_likelihood(chains_observed = chain_sizes, chain_statistic = "size",
+#'  offspring_sampler = "pois", nsim_obs = 100, lambda = 0.5)
+#' @export
+estimate_likelihood <- function(chains_observed,
 
   ## checks
   if (!is.character(offspring)) {

From d0011a69ac2ba73aeced5534f7407a9cff20a389 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:42:29 +0100
Subject: [PATCH 427/828] Reset up the arguments to estimate_likelihood()

---
 R/likelihood_estimation.R | 7 +++++++
 1 file changed, 7 insertions(+)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index a6c38603..bacb5847 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -36,6 +36,13 @@
 #'  offspring_sampler = "pois", nsim_obs = 100, lambda = 0.5)
 #' @export
 estimate_likelihood <- function(chains_observed,
+                                chain_statistic = c("size", "length"),
+                                offspring_sampler,
+                                nsim_obs,
+                                log_trans = TRUE,
+                                obs_prob = 1, chain_stat_max = Inf,
+                                exclude = NULL, individual = FALSE, ...) {
+  chain_statistic <- match.arg(chain_statistic)
 
   ## checks
   if (!is.character(offspring)) {

From 4ee4aec4773d89d322f5375bb755a344009b62f3 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:44:59 +0100
Subject: [PATCH 428/828] Renamed infinite to chain_stat_max and offspring to
 offspring_sampler

---
 R/likelihood_estimation.R | 29 +++++++++++++++--------------
 1 file changed, 15 insertions(+), 14 deletions(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index bacb5847..76cbe3de 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -45,8 +45,8 @@ estimate_likelihood <- function(chains_observed,
   chain_statistic <- match.arg(chain_statistic)
 
   ## checks
-  if (!is.character(offspring)) {
-    stop("Object passed as 'offspring' is not a character string.")
+  if (!is.character(offspring_sampler)) {
+    stop("Object passed as 'offspring_sampler' is not a character string.")
   }
   if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
   if (obs_prob < 1) {
@@ -60,25 +60,26 @@ estimate_likelihood <- function(chains_observed,
     }
     sampled_x <-
       replicate(nsim_obs, pmin(sample_func(length(x), x, obs_prob),
-                               infinite), simplify = FALSE)
+                                           ),
+                               chain_stat_max), simplify = FALSE)
     size_x <- unlist(sampled_x)
-    if (!is.finite(infinite)) infinite <- max(size_x) + 1
+    if (!is.finite(chain_stat_max)) chain_stat_max <- max(size_x) + 1
   } else {
-    x[x >= infinite] <- infinite
+    chains_observed[chains_observed >= chain_stat_max] <- chain_stat_max
     size_x <- x
     sampled_x <- list(x)
   }
 
   ## determine for which sizes to calculate the likelihood (for true chain size)
-  if (any(size_x == infinite)) {
-    calc_sizes <- seq_len(infinite - 1)
+  if (any(size_x == chain_stat_max)) {
+    calc_sizes <- seq_len(chain_stat_max - 1)
   } else {
     calc_sizes <- unique(c(size_x, exclude))
   }
 
-  ## get likelihood function as given by `offspring` and `stat``
+  ## get likelihood function as given by offspring_sampler and chain_statistic
   likelihoods <- vector(mode = "numeric")
-  ll_func <- paste(offspring, stat, "ll", sep = "_")
+  ll_func <- paste(offspring_sampler, chain_statistic, "ll", sep = "_")
   pars <- as.list(unlist(list(...))) ## converts vectors to lists
 
   ## calculate likelihoods
@@ -90,15 +91,15 @@ estimate_likelihood <- function(chains_observed,
       do.call(
         offspring_ll,
         c(list(
-          x = calc_sizes, offspring = offspring,
-          stat = stat, infinite = infinite
+          chains_observed = calc_sizes, offspring_sampler = offspring_sampler,
+          chain_statistic = chain_statistic, chain_stat_max = chain_stat_max
         ), pars)
       )
   }
 
-  ## assign probabilities to infinite outbreak sizes
-  if (any(size_x == infinite)) {
-    likelihoods[infinite] <- complementary_logprob(likelihoods)
+  ## assign probabilities to chain_stat_max outbreak sizes
+  if (any(size_x == chain_stat_max)) {
+    likelihoods[chain_stat_max] <- complementary_logprob(likelihoods)
   }
 
   if (!missing(exclude)) {

From 498284203ce38a971a186f98ee07dda979e3a8bc Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:45:38 +0100
Subject: [PATCH 429/828] Minor styling

---
 R/likelihood_estimation.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index 76cbe3de..0f7d7cc3 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -51,7 +51,7 @@ estimate_likelihood <- function(chains_observed,
   if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
   if (obs_prob < 1) {
     if (missing(nsim_obs)) {
-      stop("'nsim_obs' must be specified if 'obs_prob' is <1")
+      stop("'nsim_obs' must be specified if 'obs_prob' is < 1")
     }
     if (stat == "size") {
       sample_func <- rbinom_size

From 0c03f6e24266e1efc6b0b859cf3bef40bd139765 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:46:13 +0100
Subject: [PATCH 430/828] Renamed "stat" to "chain_statistic"

---
 R/likelihood_estimation.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index 0f7d7cc3..be4c37bb 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -53,9 +53,9 @@ estimate_likelihood <- function(chains_observed,
     if (missing(nsim_obs)) {
       stop("'nsim_obs' must be specified if 'obs_prob' is < 1")
     }
-    if (stat == "size") {
+    if (chain_statistic == "size") {
       sample_func <- rbinom_size
-    } else if (stat == "length") {
+    } else if (chain_statistic == "length") {
       sample_func <- rgen_length
     }
     sampled_x <-

From bd9b316cde56711f87cc13b64439e120926bd2d3 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:46:55 +0100
Subject: [PATCH 431/828] Renamed "x" to "chains_observed"

---
 R/likelihood_estimation.R | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index be4c37bb..7b02a509 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -58,16 +58,16 @@ estimate_likelihood <- function(chains_observed,
     } else if (chain_statistic == "length") {
       sample_func <- rgen_length
     }
-    sampled_x <-
-      replicate(nsim_obs, pmin(sample_func(length(x), x, obs_prob),
+    sampled_x <- replicate(nsim_obs, pmin(sample_func(length(chains_observed),
+                                           chains_observed, obs_prob
                                            ),
                                chain_stat_max), simplify = FALSE)
     size_x <- unlist(sampled_x)
     if (!is.finite(chain_stat_max)) chain_stat_max <- max(size_x) + 1
   } else {
     chains_observed[chains_observed >= chain_stat_max] <- chain_stat_max
-    size_x <- x
-    sampled_x <- list(x)
+    size_x <- chains_observed
+    sampled_x <- list(chains_observed)
   }
 
   ## determine for which sizes to calculate the likelihood (for true chain size)

From 62585995eb9a884c6420f0a9958bd971dcd39720 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:47:54 +0100
Subject: [PATCH 432/828] Ignore test_refactoring script

---
 R/test_refactoring.R | 39 ---------------------------------------
 1 file changed, 39 deletions(-)
 delete mode 100644 R/test_refactoring.R

diff --git a/R/test_refactoring.R b/R/test_refactoring.R
deleted file mode 100644
index 5bf9fcf8..00000000
--- a/R/test_refactoring.R
+++ /dev/null
@@ -1,39 +0,0 @@
-
-source("./R/checks.R")
-source("./R/helpers.R")
-source("./R/epichains.R")
-source("./R/simulate.r")
-
-
-# try simulate_tree()
-chains_tree <- simulate_tree(nchains = 10,
-                                   serials_sampler = function(n) {rpois(n, 5)},
-                                   offspring_sampler = "pois",
-                                   lambda = 2,
-                                   chain_stat_max = 10
-                                   )
-
-
-chains_tree
-summary(chains_tree)
-plot(chains_tree)
-
-# try simulate_tree_from_pop()
-
-chains_tree_from_pop <- simulate_tree_from_pop(
-  pop = 100, offspring_sampler = "nbinom",
-  mean_offspring = 0.5, disp_offspring = 1.1,
-  serial_sampler = function(x) 3)
-
-chains_tree_from_pop
-summary(chains_tree_from_pop)
-plot(chains_tree_from_pop)
-
-# try chain_vec simulation
-chains_vec <- simulate_vect(nchains = 10, offspring_sampler = "pois",
-                             lambda = 2, chain_stat_max = 10
-                             )
-
-chains_vec
-summary(chains_vec)
-# plot(chains_vec) #expect error

From f8867c2374fc64b9d1b146913bfaf5e5fd60824d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:49:08 +0100
Subject: [PATCH 433/828] Added comments to offspring_ll

---
 R/likelihoods.R | 10 +++-------
 1 file changed, 3 insertions(+), 7 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 74a8af0e..a19ade71 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -104,18 +104,14 @@ geom_length_ll <- function(x, prob) {
 #' @inheritParams estimate_likelihood
 #' @inheritParams simulate_vec
 #' @keywords internal
-offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
-                         nsim_offspring = 100, log_trans = TRUE, ...) {
-
+offspring_ll <- function(x, offspring, stat, nsim_offspring = 100, ...) {
+  dist <- chain_sim(nsim_offspring, offspring, stat, ...)
   # Simulate the chains
-  chains <- simulate_vect(nsim_offspring, offspring_sampler,
-                          chain_statistic, ...)
-
   # Compute the empirical Cumulative Distribution Function of the
   # simulated chains
-  chains_empirical_cdf <- stats::ecdf(chains)
 
   # Perform a lagged linear interpolation of the points
+  f <- stats::ecdf(dist)
   acdf <-
     diff(c(0, stats::approx(
       unique(chains), chains_empirical_cdf(unique(chains)),

From 6f2d8586fc426644460ff151108e3b90a43c0027 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:51:24 +0100
Subject: [PATCH 434/828] Renamed the arguments in offspring_ll

---
 R/likelihoods.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index a19ade71..0f541ee6 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -104,8 +104,8 @@ geom_length_ll <- function(x, prob) {
 #' @inheritParams estimate_likelihood
 #' @inheritParams simulate_vec
 #' @keywords internal
-offspring_ll <- function(x, offspring, stat, nsim_offspring = 100, ...) {
-  dist <- chain_sim(nsim_offspring, offspring, stat, ...)
+offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
+                         nsim_offspring = 100, log_trans = TRUE, ...) {
   # Simulate the chains
   # Compute the empirical Cumulative Distribution Function of the
   # simulated chains

From 758f799423d1ef6820cddfe70217f49d7c474258 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:52:38 +0100
Subject: [PATCH 435/828] Replaced chain_sim() with simulate_tree() in
 offspring_ll()

---
 R/likelihoods.R | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 0f541ee6..69064e3a 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -107,6 +107,8 @@ geom_length_ll <- function(x, prob) {
 offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
                          nsim_offspring = 100, log_trans = TRUE, ...) {
   # Simulate the chains
+  chains <- simulate_tree(nsim_offspring, offspring_sampler,
+                          chain_statistic, ...)
   # Compute the empirical Cumulative Distribution Function of the
   # simulated chains
 

From baa15a0d58a0737383bce49a9865618575ca138e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:53:59 +0100
Subject: [PATCH 436/828] Moved ecdf calculation under new comment

---
 R/likelihoods.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 69064e3a..2efa7e66 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -111,9 +111,9 @@ offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
                           chain_statistic, ...)
   # Compute the empirical Cumulative Distribution Function of the
   # simulated chains
+  chains_empirical_cdf <- stats::ecdf(chains)
 
   # Perform a lagged linear interpolation of the points
-  f <- stats::ecdf(dist)
   acdf <-
     diff(c(0, stats::approx(
       unique(chains), chains_empirical_cdf(unique(chains)),

From ddafa8a131452aa3e81272907d920691cd858072 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:54:48 +0100
Subject: [PATCH 437/828] Introduced the log_trans argument to log transform
 output of offspring_ll()

---
 R/likelihoods.R | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 2efa7e66..7bce529b 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -121,5 +121,6 @@ offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
     )$y))
   lik <- acdf[chains_observed]
   lik[is.na(lik)] <- 0
-  log(lik)
+  out <- ifelse(base::isTRUE(log_trans), log(lik), lik)
+  return(out)
 }

From 8809f27be52ed49f570bdb938d0ef484514495cc Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:55:51 +0100
Subject: [PATCH 438/828] Minor: added new lines

---
 R/likelihoods.R | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index 7bce529b..c30d49a6 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -106,9 +106,11 @@ geom_length_ll <- function(x, prob) {
 #' @keywords internal
 offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
                          nsim_offspring = 100, log_trans = TRUE, ...) {
+
   # Simulate the chains
   chains <- simulate_tree(nsim_offspring, offspring_sampler,
                           chain_statistic, ...)
+
   # Compute the empirical Cumulative Distribution Function of the
   # simulated chains
   chains_empirical_cdf <- stats::ecdf(chains)

From cba1cc9eecc96167f16b7d26cb4fca569c99b232 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:56:20 +0100
Subject: [PATCH 439/828] Documented check_offspring_valid()

---
 R/checks.R | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

diff --git a/R/checks.R b/R/checks.R
index 69acb27d..5e003aca 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -1,6 +1,9 @@
 #' Check if offspring argument is specified as a character string
 #'
-#' @param offspring
+#' @param offspring_sampler Offspring distribution: a character string
+#' corresponding to the R distribution function (e.g., "pois" for Poisson,
+#' where \code{\link{rpois}} is the R function to generate Poisson random
+#' numbers).
 #' @keywords internal
 check_offspring_valid <- function(offspring) {
   if (!is.character(offspring)) {

From 630b7cc0ece571da57803bfd43e7b98889cc0d3d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:56:37 +0100
Subject: [PATCH 440/828] Documented check_offspring_func_valid()

---
 R/checks.R | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/R/checks.R b/R/checks.R
index 5e003aca..c5fb542b 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -18,7 +18,9 @@ check_offspring_valid <- function(offspring) {
 
 #' Check if constructed random number generator for offspring exists
 #'
-#' @param roffspring_name
+#' @param roffspring_name Constructed random offspring sampler: a character
+#' string corresponding to the R distribution function (e.g., "rpois" for
+#' Poisson.
 #' @keywords internal
 check_offspring_func_valid <- function(roffspring_name) {
   if (!(exists(roffspring_name)) || !is.function(get(roffspring_name))) {

From 1e650551b2897f4972c9614e73d31682175bb1bf Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:56:53 +0100
Subject: [PATCH 441/828] Documented check_nchains_valid()

---
 R/checks.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/checks.R b/R/checks.R
index c5fb542b..178a77e6 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -47,7 +47,7 @@ check_serial_valid <- function(serials_sampler) {
 
 #' Check that nchains is greater than 0 and not infinite
 #'
-#' @param nchains
+#' @param nchains Number of chains to simulate.
 #'
 #' @keywords internal
 check_nchains_valid <- function(nchains) {

From 722df89e974a74381e9a7311ab595e38dce1d97c Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:57:06 +0100
Subject: [PATCH 442/828] Documented check_serial_valid()

---
 R/checks.R | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/R/checks.R b/R/checks.R
index 178a77e6..17051512 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -31,7 +31,9 @@ check_offspring_func_valid <- function(roffspring_name) {
 
 #' Check if the serials_sampler argument is specified as a function
 #'
-#' @param serials_sampler
+#' @param serials_sampler The serial interval generator function; the name of a
+#' user-defined named or anonymous function with only one argument `n`,
+#' representing the number of serial intervals to generate.
 #'
 #' @keywords internal
 check_serial_valid <- function(serials_sampler) {

From 7d0df7ab564bd696db8691aec0ca559d82e10081 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 15:58:20 +0100
Subject: [PATCH 443/828] Renamed "offspring" to "offspring_sampler" in
 check_offspring_valid()

---
 R/checks.R | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/R/checks.R b/R/checks.R
index 17051512..967b7eee 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -5,11 +5,11 @@
 #' where \code{\link{rpois}} is the R function to generate Poisson random
 #' numbers).
 #' @keywords internal
-check_offspring_valid <- function(offspring) {
-  if (!is.character(offspring)) {
+check_offspring_valid <- function(offspring_sampler) {
+  if (!is.character(offspring_sampler)) {
     stop(sprintf(
       "%s %s",
-      "'offspring' must be specified as a character string.",
+      "'offspring_sampler' must be specified as a character string.",
       "Did you forget to enclose it in quotes?"
     ))
   }

From a24f8bf3c4629e78a420b02393f9f0655f418f7a Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 16:03:12 +0100
Subject: [PATCH 444/828] Fixed linting issues

---
 R/epichains.R             | 5 +++--
 tests/testthat/tests-ll.r | 2 --
 2 files changed, 3 insertions(+), 4 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 360580ec..10730f47 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -204,7 +204,7 @@ tail.epichains <- function(x, ...) {
 #' @return A plot of cases over time and generation
 #' @author James M. Azam
 #' @export
-plot.epichains <- function(x, ...){
+plot.epichains <- function(x, ...) {
   validate_epichains(x)
 
   if (attributes(x)$chain_type != "chains_tree") {
@@ -217,7 +217,8 @@ plot.epichains <- function(x, ...){
                                            FUN = NROW
                                            )
   # Count the number of cases per time
-  cases_per_time <- stats::aggregate(sim_id ~ time, x = as.data.frame(x), FUN = NROW)
+  cases_per_time <- stats::aggregate(sim_id ~ time, x = as.data.frame(x),
+                                     FUN = NROW)
 
   # Set up grid
   graphics::par(mfrow = c(1, 2), mar = c(4, 3, 3, 1), oma = c(0, 0, 0, 0))
diff --git a/tests/testthat/tests-ll.r b/tests/testthat/tests-ll.r
index a29bb9ac..13a7c339 100644
--- a/tests/testthat/tests-ll.r
+++ b/tests/testthat/tests-ll.r
@@ -8,5 +8,3 @@ test_that("Analytical size or length distributions are implemented", {
   expect_true(all(pois_length_ll(chains, lambda = 0.5) < 0))
   expect_true(all(geom_length_ll(chains, prob = 0.5) < 0))
 })
-
-

From b3ae99024c3cdc64797bd646f13079af400335a3 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 16:08:45 +0100
Subject: [PATCH 445/828] Deleted references file (for now)

---
 vignettes/references.json | 794 --------------------------------------
 1 file changed, 794 deletions(-)
 delete mode 100644 vignettes/references.json

diff --git a/vignettes/references.json b/vignettes/references.json
deleted file mode 100644
index dcbb4440..00000000
--- a/vignettes/references.json
+++ /dev/null
@@ -1,794 +0,0 @@
-[
-	{
-		"id": "abbott2020",
-		"type": "article-journal",
-		"container-title": "Wellcome open research",
-		"note": "publisher: The Wellcome Trust",
-		"title": "The transmissibility of novel Coronavirus in the early stages of the 2019-20 outbreak in Wuhan: Exploring initial point-source exposure sizes and durations using scenario analysis",
-		"volume": "5",
-		"author": [
-			{
-				"family": "Abbott",
-				"given": "Sam"
-			},
-			{
-				"family": "Hellewell",
-				"given": "Joel"
-			},
-			{
-				"family": "Munday",
-				"given": "James"
-			},
-			{
-				"family": "Funk",
-				"given": "Sebastian"
-			},
-			{
-				"family": "group",
-				"given": "CMMID",
-				"dropping-particle": "nCoV working"
-			},
-			{
-				"literal": "others"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2020"
-				]
-			]
-		}
-	},
-	{
-		"id": "alene2021",
-		"type": "article-journal",
-		"abstract": "Background: Understanding the epidemiological parameters that determine the transmission dynamics of COVID-19 is essential for public health intervention. Globally, a number of studies were conducted to estimate the average serial interval and incubation period of COVID-19. Combining findings of existing studies that estimate the average serial interval and incubation period of COVID-19 significantly improves the quality of evidence. Hence, this study aimed to determine the overall average serial interval and incubation period of COVID-19. Methods: We followed the PRISMA checklist to present this study. A comprehensive search strategy was carried out from international electronic databases (Google Scholar, PubMed, Science Direct, Web of Science, CINAHL, and Cochrane Library) by two experienced reviewers (MAA and DBK) authors between the 1st of June and the 31st of July 2020. All observational studies either reporting the serial interval or incubation period in persons diagnosed with COVID-19 were included in this study. Heterogeneity across studies was assessed using the I2 and Higgins test. The NOS adapted for cross-sectional studies was used to evaluate the quality of studies. A random effect Meta-analysis was employed to determine the pooled estimate with 95% (CI). Microsoft Excel was used for data extraction and R software was used for analysis. Results: We combined a total of 23 studies to estimate the overall mean serial interval of COVID-19. The mean serial interval of COVID-19 ranged from 4. 2 to 7.5 days. Our meta-analysis showed that the weighted pooled mean serial interval of COVID-19 was 5.2 (95%CI: 4.9–5.5) days. Additionally, to pool the mean incubation period of COVID-19, we included 14 articles. The mean incubation period of COVID-19 also ranged from 4.8 to 9 days. Accordingly, the weighted pooled mean incubation period of COVID-19 was 6.5 (95%CI: 5.9–7.1) days. Conclusions: This systematic review and meta-analysis showed that the weighted pooled mean serial interval and incubation period of COVID-19 were 5.2, and 6.5 days, respectively. In this study, the average serial interval of COVID-19 is shorter than the average incubation period, which suggests that substantial numbers of COVID-19 cases will be attributed to presymptomatic transmission.",
-		"container-title": "BMC Infectious Diseases",
-		"DOI": "10.1186/s12879-021-05950-x",
-		"ISSN": "14712334",
-		"issue": "1",
-		"note": "publisher: BMC Infectious Diseases\nPMID: 33706702",
-		"page": "1–9",
-		"title": "Serial interval and incubation period of COVID-19: a systematic review and meta-analysis",
-		"volume": "21",
-		"author": [
-			{
-				"family": "Alene",
-				"given": "Muluneh"
-			},
-			{
-				"family": "Yismaw",
-				"given": "Leltework"
-			},
-			{
-				"family": "Assemie",
-				"given": "Moges Agazhe"
-			},
-			{
-				"family": "Ketema",
-				"given": "Daniel Bekele"
-			},
-			{
-				"family": "Gietaneh",
-				"given": "Wodaje"
-			},
-			{
-				"family": "Birhan",
-				"given": "Tilahun Yemanu"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2021"
-				]
-			]
-		}
-	},
-	{
-		"id": "allen2012",
-		"type": "article-journal",
-		"abstract": "The basic reproduction number, ℛ(0), one of the most well-known thresholds in deterministic epidemic theory, predicts a disease outbreak if ℛ(0)>1. In stochastic epidemic theory, there are also thresholds that predict a major outbreak. In the case of a single infectious group, if ℛ(0)>1 and i infectious individuals are introduced into a susceptible population, then the probability of a major outbreak is approximately 1-(1/ℛ(0))( i ). With multiple infectious groups from which the disease could emerge, this result no longer holds. Stochastic thresholds for multiple groups depend on the number of individuals within each group, i ( j ), j=1, \\ldots, n, and on the probability of disease extinction for each group, q ( j ). It follows from multitype branching processes that the probability of a major outbreak is approximately [Formula: see text]. In this investigation, we summarize some of the deterministic and stochastic threshold theory, illustrate how to calculate the stochastic thresholds, and derive some new relationships between the deterministic and stochastic thresholds.",
-		"container-title": "Journal of Biological Dynamics",
-		"DOI": "10.1080/17513758.2012.665502",
-		"ISSN": "17513758",
-		"issue": "2",
-		"page": "590–611",
-		"title": "Extinction thresholds in deterministic and stochastic epidemic models",
-		"volume": "6",
-		"author": [
-			{
-				"family": "Allen",
-				"given": "Linda J.S."
-			},
-			{
-				"family": "Lahodny",
-				"given": "Glenn E."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2012"
-				]
-			]
-		}
-	},
-	{
-		"id": "blumberg2013",
-		"type": "article-journal",
-		"abstract": "Many diseases exhibit subcritical transmission (i.e. 0<R0<1) so that infections occur as self-limited 'stuttering chains'. Given an ensemble of stuttering chains, information about the number of cases in each chain can be used to infer R0, which is of crucial importance for monitoring the risk that a disease will emerge to establish endemic circulation. However, the challenge of imperfect case detection has led authors to adopt a variety of work-around measures when inferring R0, such as discarding data on isolated cases or aggregating intermediate-sized chains together. Each of these methods has the potential to introduce bias, but a quantitative comparison of these approaches has not been reported. By adapting a model based on a negative binomial offspring distribution that permits a variable degree of transmission heterogeneity, we present a unified analysis of existing R0 estimation methods. Simulation studies show that the degree of transmission heterogeneity, when improperly modeled, can significantly impact the bias of R0 estimation methods designed for imperfect observation. These studies also highlight the importance of isolated cases in assessing whether an estimation technique is consistent with observed data. Analysis of data from measles outbreaks shows that likelihood scores are highest for models that allow a flexible degree of transmission heterogeneity. Aggregating intermediate sized chains often has similar performance to analyzing a complete chain size distribution. However, truncating isolated cases is beneficial only when surveillance systems clearly favor full observation of large chains but not small chains. Meanwhile, if data on the type and proportion of cases that are unobserved were known, we demonstrate that maximum likelihood inference of R0 could be adjusted accordingly. This motivates the need for future empirical and theoretical work to quantify observation error and incorporate relevant mechanisms into stuttering chain models used to estimate transmission parameters. © 2013 Elsevier B.V.",
-		"container-title": "Epidemics",
-		"DOI": "10.1016/j.epidem.2013.05.002",
-		"ISSN": "17554365",
-		"issue": "3",
-		"note": "publisher: Elsevier B.V.\nPMID: 24021520",
-		"page": "131–145",
-		"title": "Comparing methods for estimating R0 from the size distribution of subcritical transmission chains",
-		"URL": "http://dx.doi.org/10.1016/j.epidem.2013.05.002",
-		"volume": "5",
-		"author": [
-			{
-				"family": "Blumberg",
-				"given": "S."
-			},
-			{
-				"family": "Lloyd-Smith",
-				"given": "J. O."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2013"
-				]
-			]
-		}
-	},
-	{
-		"id": "blumberg2013a",
-		"type": "article-journal",
-		"abstract": "For many infectious disease processes such as emerging zoonoses and vaccine-preventable diseases, 0<R0<1 and infections occur as self-limited stuttering transmission chains. A mechanistic understanding of transmission is essential for characterizing the risk of emerging diseases and monitoring spatio-temporal dynamics. Thus methods for inferring R0 and the degree of heterogeneity in transmission from stuttering chain data have important applications in disease surveillance and management. Previous researchers have used chain size distributions to infer R0, but estimation of the degree of individual-level variation in infectiousness (as quantified by the dispersion parameter, k) has typically required contact tracing data. Utilizing branching process theory along with a negative binomial offspring distribution, we demonstrate how maximum likelihood estimation can be applied to chain size data to infer both R0 and the dispersion parameter that characterizes heterogeneity. While the maximum likelihood value for R0 is a simple function of the average chain size, the associated confidence intervals are dependent on the inferred degree of transmission heterogeneity. As demonstrated for monkeypox data from the Democratic Republic of Congo, this impacts when a statistically significant change in R0 is detectable. In addition, by allowing for superspreading events, inference of k shifts the threshold above which a transmission chain should be considered anomalously large for a given value of R0 (thus reducing the probability of false alarms about pathogen adaptation). Our analysis of monkeypox also clarifies the various ways that imperfect observation can impact inference of transmission parameters, and highlights the need to quantitatively evaluate whether observation is likely to significantly bias results.",
-		"container-title": "PLoS Computational Biology",
-		"DOI": "10.1371/journal.pcbi.1002993",
-		"ISSN": "15537358",
-		"issue": "5",
-		"note": "PMID: 23658504",
-		"page": "1–17",
-		"title": "Inference of R0 and Transmission Heterogeneity from the Size Distribution of Stuttering Chains",
-		"volume": "9",
-		"author": [
-			{
-				"family": "Blumberg",
-				"given": "Seth"
-			},
-			{
-				"family": "Lloyd-Smith",
-				"given": "James O."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2013"
-				]
-			]
-		}
-	},
-	{
-		"id": "chen2022",
-		"type": "article-journal",
-		"abstract": "The generation time distribution, reflecting the time between successive infections in transmission chains, is a key epidemiological parameter for describing COVID-19 transmission dynamics. However, because exact infection times are rarely known, it is often approximated by the serial interval distribution. This approximation holds under the assumption that infectors and infectees share the same incubation period distribution, which may not always be true. We estimated incubation period and serial interval distributions using 629 transmission pairs reconstructed by investigating 2989 confirmed cases in China in January-February 2020, and developed an inferential framework to estimate the generation time distribution that accounts for variation over time due to changes in epidemiology, sampling biases and public health and social measures. We identified substantial reductions over time in the serial interval and generation time distributions. Our proposed method provides more reliable estimation of the temporal variation in the generation time distribution, improving assessment of transmission dynamics.",
-		"container-title": "Nature Communications",
-		"DOI": "10.1038/s41467-022-35496-8",
-		"ISSN": "20411723",
-		"issue": "1",
-		"note": "publisher: Springer US",
-		"title": "Inferring time-varying generation time, serial interval, and incubation period distributions for COVID-19",
-		"volume": "13",
-		"author": [
-			{
-				"family": "Chen",
-				"given": "Dongxuan"
-			},
-			{
-				"family": "Lau",
-				"given": "Yiu Chung"
-			},
-			{
-				"family": "Xu",
-				"given": "Xiao Ke"
-			},
-			{
-				"family": "Wang",
-				"given": "Lin"
-			},
-			{
-				"family": "Du",
-				"given": "Zhanwei"
-			},
-			{
-				"family": "Tsang",
-				"given": "Tim K."
-			},
-			{
-				"family": "Wu",
-				"given": "Peng"
-			},
-			{
-				"family": "Lau",
-				"given": "Eric H.Y."
-			},
-			{
-				"family": "Wallinga",
-				"given": "Jacco"
-			},
-			{
-				"family": "Cowling",
-				"given": "Benjamin J."
-			},
-			{
-				"family": "Ali",
-				"given": "Sheikh Taslim"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2022"
-				]
-			]
-		}
-	},
-	{
-		"id": "farrington1999",
-		"type": "article-journal",
-		"abstract": "We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.",
-		"container-title": "Journal of Applied Probability",
-		"DOI": "10.1239/jap/1032374633",
-		"ISSN": "00219002",
-		"issue": "3",
-		"page": "771–779",
-		"title": "The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease",
-		"volume": "36",
-		"author": [
-			{
-				"family": "Farrington",
-				"given": "C. P."
-			},
-			{
-				"family": "Grant",
-				"given": "A. D."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"1999"
-				]
-			]
-		}
-	},
-	{
-		"id": "farrington2003",
-		"type": "article-journal",
-		"abstract": "Mass vaccination programmes aim to maintain the effective reproduction number R of an infection below unity. We describe methods for monitoring the value of R using surveillance data. The models are based on branching processes in which R is identified with the offspring mean. We derive unconditional likelihoods for the offspring mean using data on outbreak size and outbreak duration. We also discuss Bayesian methods, implemented by Metropolis-Hastings sampling. We investigate by simulation the validity of the models with respect to depletion of susceptibles and under-ascertainment of cases. The methods are illustrated using surveillance data on measles in the USA.",
-		"container-title": "Biostatistics (Oxford, England)",
-		"DOI": "10.1093/biostatistics/4.2.279",
-		"ISSN": "14654644",
-		"issue": "2",
-		"page": "279–295",
-		"title": "Branching process models for surveillance of infectious diseases controlled by mass vaccination.",
-		"volume": "4",
-		"author": [
-			{
-				"family": "Farrington",
-				"given": "C. P."
-			},
-			{
-				"family": "Kanaan",
-				"given": "M. N."
-			},
-			{
-				"family": "Gay",
-				"given": "N. J."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2003"
-				]
-			]
-		}
-	},
-	{
-		"id": "fine2003",
-		"type": "article-journal",
-		"abstract": "The interval between successive cases of an infectious disease is determined by the time from infection to infectiousness, the duration of infectiousness, the time from infection to disease onset (incubation period), the duration of any extra-human phase of the infectious agent, and the proportion clinically affected among infected individuals. The interval is important in the interpretation of infectious disease surveillance and trend data, in the identification of outbreaks, and in the optimization of quarantine and contact tracing. This paper discusses the properties of these intervals, as measured between transmission events or between clinical onsets of successive infected individuals, noting the determinants of their ranges and frequency distributions, the circumstances under which secondary cases may arise before primaries, and under which the infection transmission interval will be different from the interval between clinical onsets of successive cases. It discusses the derivation of interval distribution statistics from descriptive data given in standard textbooks, with illustrations from published data on outbreaks, households, and epidemiologic tracing. Finally, it discusses the implications of such measures for studies of secondary attack rates, for the persistence of infection in human communities, for outbreak response, and for elimination or eradication programs.",
-		"container-title": "American Journal of Epidemiology",
-		"DOI": "10.1093/aje/kwg251",
-		"ISSN": "00029262",
-		"issue": "11",
-		"note": "ISBN: 0002-9262 (Print) 0002-9262 (Linking)\nPMID: 14630599",
-		"page": "1039–1047",
-		"title": "The Interval between Successive Cases of an Infectious Disease",
-		"volume": "158",
-		"author": [
-			{
-				"family": "Fine",
-				"given": "Paul E.M."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2003"
-				]
-			]
-		}
-	},
-	{
-		"id": "grassly2006",
-		"type": "article-journal",
-		"abstract": "Seasonal change in the incidence of infectious diseases is a common phenomenon in both temperate and tropical climates. However, the mechanisms responsible for seasonal disease incidence, and the epidemiological consequences of seasonality, are poorly understood with rare exception. Standard epidemiological theory and concepts such as the basic reproductive number R 0 no longer apply, and the implications for interventions that themselves may be periodic, such as pulse vaccination, have not been formally examined. This paper examines the causes and consequences of seasonality, and in so doing derives several new results concerning vaccination strategy and the interpretation of disease outbreak data. It begins with a brief review of published scientific studies in support of different causes of seasonality in infectious diseases of humans, identifying four principal mechanisms and their association with different routes of transmission. It then describes the consequences of seasonality for R 0 , disease outbreaks, endemic dynamics and persistence. Finally, a mathematical analysis of routine and pulse vaccination programmes for seasonal infections is presented. The synthesis of seasonal infectious disease epidemiology attempted by this paper highlights the need for further empirical and theoretical work. © 2006 The Royal Society.",
-		"container-title": "Proceedings of the Royal Society B: Biological Sciences",
-		"DOI": "10.1098/rspb.2006.3604",
-		"ISSN": "14712970",
-		"issue": "1600",
-		"page": "2541–2550",
-		"title": "Seasonal infectious disease epidemiology",
-		"volume": "273",
-		"author": [
-			{
-				"family": "Grassly",
-				"given": "Nicholas C."
-			},
-			{
-				"family": "Fraser",
-				"given": "Christophe"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2006"
-				]
-			]
-		}
-	},
-	{
-		"id": "griffin2020",
-		"type": "article-journal",
-		"abstract": "The serial interval is the time between symptom onsets in an infector-infectee pair. The generation time, also known as the generation interval, is the time between infection events in an infector-infectee pair. The serial interval and the generation time are key parameters for assessing the dynamics of a disease. A number of scientific papers reported information pertaining to the serial interval and/or generation time for COVID-19. Objective Conduct a review of available evidence to advise on appropriate parameter values for serial interval and generation time in national COVID-19 transmission models for Ireland and on methodological issues relating to those parameters. Methods We conducted a rapid review of the literature covering the period 1 January 2020 and 21 August 2020, following predefined eligibility criteria. Forty scientific papers met our inclusion criteria and were included in the review. Results The mean of the serial interval ranged from 3.03 to 7.6 days, based on 38 estimates, and the median from 1.0 to 6.0 days (based on 15 estimates). Only three estimates were provided for the mean of the generation time. These ranged from 3.95 to 5.20 days. One estimate of 5.0 days was provided for the median of the generation time. Discussion Estimates of the serial interval and the generation time are very dependent on the specific factors that apply at the time that the data are collected, including the level of social contact. Consequently, the estimates may not be entirely relevant to other environments. Therefore, local estimates should be obtained as soon as possible. Careful consideration should be given to the methodology that is used. Real-time estimations of the serial interval/generation time, allowing for variations over time, may provide more accurate estimates of reproduction numbers than using conventionally fixed serial interval/generation time distributions.",
-		"container-title": "BMJ Open",
-		"DOI": "10.1136/bmjopen-2020-040263",
-		"ISSN": "20446055",
-		"issue": "11",
-		"note": "ISBN: 9789241512763\nPMID: 33234640",
-		"page": "1–9",
-		"title": "Rapid review of available evidence on the serial interval and generation time of COVID-19",
-		"volume": "10",
-		"author": [
-			{
-				"family": "Griffin",
-				"given": "John"
-			},
-			{
-				"family": "Casey",
-				"given": "Miriam"
-			},
-			{
-				"family": "Collins",
-				"given": "Áine"
-			},
-			{
-				"family": "Hunt",
-				"given": "Kevin"
-			},
-			{
-				"family": "McEvoy",
-				"given": "David"
-			},
-			{
-				"family": "Byrne",
-				"given": "Andrew"
-			},
-			{
-				"family": "McAloon",
-				"given": "Conor"
-			},
-			{
-				"family": "Barber",
-				"given": "Ann"
-			},
-			{
-				"family": "Lane",
-				"given": "Elizabeth Ann"
-			},
-			{
-				"family": "More",
-				"given": "Simon"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2020"
-				]
-			]
-		}
-	},
-	{
-		"id": "jacob2010",
-		"type": "article-journal",
-		"abstract": "Branching processes are stochastic individual-based processes leading consequently to a bottom-up approach. In addition, since the state variables are random integer variables (representing population sizes), the extinction occurs at random finite time on the extinction set, thus leading to fine and realistic predictions. Starting from the simplest and well-known single-type Bienaymé-Galton-Watson branching process that was used by several authors for approximating the beginning of an epidemic, we then present a general branching model with age and population dependent individual transitions. However contrary to the classical Bienaymé-Galton-Watson or asymptotically Bienaymé-Galton-Watson setting, where the asymptotic behavior of the process, as time tends to infinity, is well understood, the asymptotic behavior of this general process is a new question. Here we give some solutions for dealing with this problem depending on whether the initial population size is large or small, and whether the disease is rare or non-rare when the initial population size is large.",
-		"container-title": "International Journal of Environmental Research and Public Health",
-		"DOI": "10.3390/ijerph7031204",
-		"ISSN": "16604601",
-		"issue": "3",
-		"page": "1186–1204",
-		"title": "Branching processes: Their role in epidemiology",
-		"volume": "7",
-		"author": [
-			{
-				"family": "Jacob",
-				"given": "Christine"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2010"
-				]
-			]
-		}
-	},
-	{
-		"id": "lehtinen2021",
-		"type": "article-journal",
-		"abstract": "The timing of transmission plays a key role in the dynamics and controllability of an epidemic. However, observing generation times - the time interval between the infection of an infector and an infectee in a transmission pair - requires data on infection times, which are generally unknown. The timing of symptom onset is more easily observed; generation times are therefore often estimated based on serial intervals - the time interval between symptom onset of an infector and an infectee. This estimation follows one of two approaches: (i) approximating the generation time distribution by the serial interval distribution or (ii) deriving the generation time distribution from the serial interval and incubation period - the time interval between infection and symptom onset in a single individual - distributions. These two approaches make different - and not always explicitly stated - assumptions about the relationship between infectiousness and symptoms, resulting in different generation time distributions with the same mean but unequal variances. Here, we clarify the assumptions that each approach makes and show that neither set of assumptions is plausible for most pathogens. However, the variances of the generation time distribution derived under each assumption can reasonably be considered as upper (approximation with serial interval) and lower (derivation from serial interval) bounds. Thus, we suggest a pragmatic solution is to use both approaches and treat these as edge cases in downstream analysis. We discuss the impact of the variance of the generation time distribution on the controllability of an epidemic through strategies based on contact tracing, and we show that underestimating this variance is likely to overestimate controllability.",
-		"container-title": "Journal of the Royal Society Interface",
-		"DOI": "10.1098/rsif.2020.0756",
-		"ISSN": "17425662",
-		"issue": "174",
-		"note": "PMID: 33402022",
-		"title": "On the relationship between serial interval, infectiousness profile and generation time: On the relationship between serial interval, infectiousness profile and generation time",
-		"volume": "18",
-		"author": [
-			{
-				"family": "Lehtinen",
-				"given": "Sonja"
-			},
-			{
-				"family": "Ashcroft",
-				"given": "Peter"
-			},
-			{
-				"family": "Bonhoeffer",
-				"given": "Sebastian"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2021"
-				]
-			]
-		}
-	},
-	{
-		"id": "limpert2001",
-		"type": "article-journal",
-		"abstract": "On the charms of statistics, and how mechanical models resembling gambling machines offer a link to a handy way to characterize log-normal distributions, which can provide deeper insight into variability and probability - Normal or log-normal: That is the question.",
-		"container-title": "BioScience",
-		"DOI": "10.1641/0006-3568(2001)051[0341:LNDATS]2.0.CO;2",
-		"ISSN": "00063568",
-		"issue": "5",
-		"page": "341–352",
-		"title": "Log-normal distributions across the sciences: Keys and clues",
-		"volume": "51",
-		"author": [
-			{
-				"family": "Limpert",
-				"given": "Eckhard"
-			},
-			{
-				"family": "Stahel",
-				"given": "Werner A."
-			},
-			{
-				"family": "Abbt",
-				"given": "Markus"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2001"
-				]
-			]
-		}
-	},
-	{
-		"id": "lloyd-smith2005",
-		"type": "article-journal",
-		"abstract": "Population-level analyses often use average quantities to describe heterogeneous systems, particularly when variation does not arise from identifiable groups. A prominent example, central to our current understanding of epidemic spread, is the basic reproductive number, R0, which is defined as the mean number of infections caused by an infected individual in a susceptible population. Population estimates of R0 can obscure considerable individual variation in infectiousness, as highlighted during the global emergence of severe acute respiratory syndrome (SARS) by numerous 'superspreading events' in which certain individuals infected unusually large numbers of secondary cases. For diseases transmitted by non-sexual direct contacts, such as SARS or smallpox, individual variation is difficult to measure empirically, and thus its importance for outbreak dynamics has been unclear. Here we present an integrated theoretical and statistical analysis of the influence of individual variation in infectiousness on disease emergence. Using contact tracing data from eight directly transmitted diseases, we show that the distribution of individual infectiousness around R0 is often highly skewed. Model predictions accounting for this variation differ sharply from average-based approaches, with disease extinction more likely and outbreaks rarer but more explosive. Using these models, we explore implications for outbreak control, showing that individual-specific control measures outperform population-wide measures. Moreover, the dramatic improvements achieved through targeted control policies emphasize the need to identify predictive correlates of higher infectiousness. Our findings indicate that superspreading is a normal feature of disease spread, and to frame ongoing discussion we propose a rigorous definition for superspreading events and a method to predict their frequency. © 2005 Nature Publishing Group.",
-		"container-title": "Nature",
-		"DOI": "10.1038/nature04153",
-		"ISSN": "14764687",
-		"issue": "7066",
-		"note": "PMID: 16292310",
-		"page": "355–359",
-		"title": "Superspreading and the effect of individual variation on disease emergence",
-		"volume": "438",
-		"author": [
-			{
-				"family": "Lloyd-Smith",
-				"given": "J. O."
-			},
-			{
-				"family": "Schreiber",
-				"given": "S. J."
-			},
-			{
-				"family": "Kopp",
-				"given": "P. E."
-			},
-			{
-				"family": "Getz",
-				"given": "W. M."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2005"
-				]
-			]
-		}
-	},
-	{
-		"id": "marivate2020",
-		"type": "article-journal",
-		"container-title": "arXiv preprint arXiv:2004.04813",
-		"title": "Use of available data to inform the COVID-19 outbreak in South Africa: a case study",
-		"author": [
-			{
-				"family": "Marivate",
-				"given": "Vukosi"
-			},
-			{
-				"family": "Combrink",
-				"given": "Herkulaas MvE"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2020"
-				]
-			]
-		}
-	},
-	{
-		"id": "nishiura2007",
-		"type": "article-journal",
-		"abstract": "The incubation period of infectious diseases, the time from infection with a microorganism to onset of disease, is directly relevant to prevention and control. Since explicit models of the incubation period enhance our understanding of the spread of disease, previous classic studies were revisited, focusing on the modeling methods employed and paying particular attention to relatively unknown historical efforts. The earliest study on the incubation period of pandemic influenza was published in 1919, providing estimates of the incubation period of Spanish flu using the daily incidence on ships departing from several ports in Australia. Although the study explicitly dealt with an unknown time of exposure, the assumed periods of exposure, which had an equal probability of infection, were too long, and thus, likely resulted in slight underestimates of the incubation period. After the suggestion that the incubation period follows lognormal distribution, Japanese epidemiologists extended this assumption to estimates of the time of exposure during a point source outbreak. Although the reason why the incubation period of acute infectious diseases tends to reveal a right-skewed distribution has been explored several times, the validity of the lognormal assumption is yet to be fully clarified. At present, various different distributions are assumed, and the lack of validity in assuming lognormal distribution is particularly apparent in the case of slowly progressing diseases. The present paper indicates that (1) analysis using well-defined short periods of exposure with appropriate statistical methods is critical when the exact time of exposure is unknown, and (2) when assuming a specific distribution for the incubation period, comparisons using different distributions are needed in addition to estimations using different datasets, analyses of the determinants of incubation period, and an understanding of the underlying disease mechanisms. © 2007 Nishiura; licensee BioMed Central Ltd.",
-		"container-title": "Emerging Themes in Epidemiology",
-		"DOI": "10.1186/1742-7622-4-2",
-		"ISSN": "17427622",
-		"page": "1–12",
-		"title": "Early efforts in modeling the incubation period of infectious diseases with an acute course of illness",
-		"volume": "4",
-		"author": [
-			{
-				"family": "Nishiura",
-				"given": "Hiroshi"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2007"
-				]
-			]
-		}
-	},
-	{
-		"id": "nishiura2012",
-		"type": "article-journal",
-		"abstract": "Use of the final size distribution of minor outbreaks for the estimation of the reproduction numbers of supercritical epidemic processes has yet to be considered. We used a branching process model to derive the final size distribution of minor outbreaks, assuming a reproduction number above unity, and applying the method to final size data for pneumonic plague. Pneumonic plague is a rare disease with only one documented major epidemic in a spatially limited setting. Because the final size distribution of a minor outbreak needs to be normalized by the probability of extinction, we assume that the dispersion parameter (k) of the negative-binomial offspring distribution is known, and examine the sensitivity of the reproduction number to variation in dispersion. Assuming a geometric offspring distribution with k=1, the reproduction number was estimated at 1.16 (95% confidence interval: 0.97-1.38). When less dispersed with k=2, the maximum likelihood estimate of the reproduction number was 1.14. These estimates agreed with those published from transmission network analysis, indicating that the human-to-human transmission potential of the pneumonic plague is not very high. Given only minor outbreaks, transmission potential is not sufficiently assessed by directly counting the number of offspring. Since the absence of a major epidemic does not guarantee a subcritical process, the proposed method allows us to conservatively regard epidemic data from minor outbreaks as supercritical, and yield estimates of threshold values above unity. © 2011.",
-		"container-title": "Journal of Theoretical Biology",
-		"DOI": "10.1016/j.jtbi.2011.10.039",
-		"ISSN": "00225193",
-		"note": "publisher: Elsevier\nPMID: 22079419",
-		"page": "48–55",
-		"title": "Estimating the transmission potential of supercritical processes based on the final size distribution of minor outbreaks",
-		"URL": "http://dx.doi.org/10.1016/j.jtbi.2011.10.039",
-		"volume": "294",
-		"author": [
-			{
-				"family": "Nishiura",
-				"given": "Hiroshi"
-			},
-			{
-				"family": "Yan",
-				"given": "Ping"
-			},
-			{
-				"family": "Sleeman",
-				"given": "Candace K."
-			},
-			{
-				"family": "Mode",
-				"given": "Charles J."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2012"
-				]
-			]
-		}
-	},
-	{
-		"id": "pearson2020",
-		"type": "article-journal",
-		"abstract": "For 45 African countries/territories already reporting COVID-19 cases before 23 March 2020, we estimate the dates of reporting 1,000 and 10,000 cases. Assuming early epidemic trends without interventions, all 45 were likely to exceed 1,000 confirmed cases by the end of April 2020, with most exceeding 10,000 a few weeks later.",
-		"container-title": "Eurosurveillance",
-		"DOI": "10.2807/1560-7917.ES.2020.25.18.2000543",
-		"ISSN": "15607917",
-		"issue": "18",
-		"note": "publisher: European Centre for Disease Prevention and Control (ECDC)\nPMID: 32400361",
-		"page": "1–6",
-		"title": "Projected early spread of COVID-19 in Africa through 1 June 2020",
-		"URL": "http://dx.doi.org/10.2807/1560-7917.ES.2020.25.18.2000543",
-		"volume": "25",
-		"author": [
-			{
-				"family": "Pearson",
-				"given": "Carl A.B."
-			},
-			{
-				"family": "Schalkwyk",
-				"given": "Cari",
-				"non-dropping-particle": "van"
-			},
-			{
-				"family": "Foss",
-				"given": "Anna M."
-			},
-			{
-				"family": "O'Reilly",
-				"given": "Kathleen M."
-			},
-			{
-				"family": "Pulliam",
-				"given": "Juliet R.C."
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2020"
-				]
-			]
-		}
-	},
-	{
-		"id": "becker1977",
-		"type": "article-journal",
-		"container-title": "Biometrics",
-		"ISSN": "0006-341X",
-		"issue": "3",
-		"note": "publisher: JSTOR",
-		"page": "515–522",
-		"title": "Estimation for discrete time branching processes with application to epidemics",
-		"volume": "33",
-		"author": [
-			{
-				"family": "Becker",
-				"given": "Niels"
-			},
-			{
-				"family": "Society",
-				"given": "International Biometric"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"1977"
-				]
-			]
-		}
-	},
-	{
-		"id": "wang2020",
-		"type": "article-journal",
-		"abstract": "Coronavirus disease 2019 (COVID-19) was first identified in late 2019 in Wuhan, Hubei Province, China and spread globally in months, sparking worldwide concern. However, it is unclear whether super-spreading events occurred during the early outbreak phase, as has been observed for other emerging viruses. Here, we analyse 208 publicly available SARS-CoV-2 genome sequences collected during the early outbreak phase. We combine phylogenetic analysis with Bayesian inference under an epidemiological model to trace person-to-person transmission. The dispersion parameter of the offspring distribution in the inferred transmission chain was estimated to be 0.23 (95% CI: 0.13–0.38), indicating there are individuals who directly infected a disproportionately large number of people. Our results showed that super-spreading events played an important role in the early stage of the COVID-19 outbreak.",
-		"container-title": "Nature Communications",
-		"DOI": "10.1038/s41467-020-18836-4",
-		"ISSN": "20411723",
-		"issue": "1",
-		"note": "publisher: Springer US\nPMID: 33024095",
-		"page": "1–6",
-		"title": "Inference of person-to-person transmission of COVID-19 reveals hidden super-spreading events during the early outbreak phase",
-		"URL": "http://dx.doi.org/10.1038/s41467-020-18836-4",
-		"volume": "11",
-		"author": [
-			{
-				"family": "Wang",
-				"given": "Liang"
-			},
-			{
-				"family": "Didelot",
-				"given": "Xavier"
-			},
-			{
-				"family": "Yang",
-				"given": "Jing"
-			},
-			{
-				"family": "Wong",
-				"given": "Gary"
-			},
-			{
-				"family": "Shi",
-				"given": "Yi"
-			},
-			{
-				"family": "Liu",
-				"given": "Wenjun"
-			},
-			{
-				"family": "Gao",
-				"given": "George F."
-			},
-			{
-				"family": "Bi",
-				"given": "Yuhai"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2020"
-				]
-			]
-		}
-	},
-	{
-		"id": "yadav2021",
-		"type": "article-journal",
-		"abstract": "In this review, we have discussed the different statistical modeling and prediction techniques for various infectious diseases including the recent pandemic of COVID-19. The distribution fitting, time series modeling along with predictive monitoring approaches, and epidemiological modeling are illustrated. When the epidemiology data is sufficient to fit with the required sample size, the normal distribution in general or other theoretical distributions are fitted and the best-fitted distribution is chosen for the prediction of the spread of the disease. The infectious diseases develop over time and we have data on the single variable that is the number of infections that happened, therefore, time series models are fitted and the prediction is done based on the best-fitted model. Monitoring approaches may also be applied to time series models which could estimate the parameters more precisely. In epidemiological modeling, more biological parameters are incorporated in the models and the forecasting of the disease spread is carried out. We came up with, how to improve the existing modeling methods, the use of fuzzy variables, and detection of fraud in the available data. Ultimately, we have reviewed the results of recent statistical modeling efforts to predict the course of COVID-19 spread.",
-		"container-title": "Frontiers in Public Health",
-		"DOI": "10.3389/fpubh.2021.645405",
-		"ISSN": "22962565",
-		"issue": "June",
-		"note": "PMID: 34222166",
-		"page": "1–27",
-		"title": "Statistical Modeling for the Prediction of Infectious Disease Dissemination With Special Reference to COVID-19 Spread",
-		"volume": "9",
-		"author": [
-			{
-				"family": "Yadav",
-				"given": "Subhash Kumar"
-			},
-			{
-				"family": "Akhter",
-				"given": "Yusuf"
-			}
-		],
-		"issued": {
-			"date-parts": [
-				[
-					"2021"
-				]
-			]
-		}
-	}
-]
\ No newline at end of file

From bc013b7ceb1bc091f0508a128435cd0273253b59 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:19:10 +0100
Subject: [PATCH 446/828] Regenerated docs for offspring_ll

---
 man/offspring_ll.Rd | 9 ---------
 1 file changed, 9 deletions(-)

diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index 1280c21a..8556f5b1 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -44,13 +44,4 @@ cumulative distribution function (ecdf).
 \author{
 Sebastian Funk
 }
-\keyword{Compute}
-\keyword{Cumulative}
-\keyword{Distribution}
-\keyword{Function}
-\keyword{chains}
-\keyword{empirical}
 \keyword{internal}
-\keyword{of}
-\keyword{simulated}
-\keyword{the}

From a85de0871442495d8ead09b2710186c113db2fce Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:19:34 +0100
Subject: [PATCH 447/828] Added two more helper functions

---
 R/helpers.R | 29 +++++++++++++++++++++++++++++
 1 file changed, 29 insertions(+)

diff --git a/R/helpers.R b/R/helpers.R
index bd4fc844..f0d11f01 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -56,3 +56,32 @@ get_offspring_func <- function(offspring_sampler, n, susc, pop,
     stop("offspring_sampler must either be 'pois' or 'nbinom'")
   }
 }
+
+
+
+#' Return a function for calculating chain statistics
+#'
+#' @inheritParams simulate_tree
+#'
+#' @return a function for calculating chain statistics
+#' @keywords internal
+get_chain_statistic_func <- function(chain_statistic){
+  func <- if (chain_statistic == "size") {
+    rbinom_size
+  } else if (chain_statistic == "length") {
+    rgen_length
+  }
+  return(func)
+}
+
+#' Construct name of analytical function for estimating loglikelihood of
+#' offspring
+#'
+#' @inheritParams simulate_tree
+#'
+#' @return an analytical offspring likelihood function
+#' @keywords internal
+construct_offspring_ll_name <- function(offspring_sampler, chain_statistic){
+  ll_name <- paste(offspring_sampler, chain_statistic, "ll", sep = "_")
+  return(ll_name)
+}

From 0fb8a6d0dd1178a50d80cf304b213c5fd0ee81f4 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:20:08 +0100
Subject: [PATCH 448/828] Replaced a wrong call of simulate_tree() with
 simulate_vect()

---
 R/likelihoods.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/likelihoods.R b/R/likelihoods.R
index c30d49a6..521052c9 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -108,7 +108,7 @@ offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
                          nsim_offspring = 100, log_trans = TRUE, ...) {
 
   # Simulate the chains
-  chains <- simulate_tree(nsim_offspring, offspring_sampler,
+  chains <- simulate_vect(nsim_offspring, offspring_sampler,
                           chain_statistic, ...)
 
   # Compute the empirical Cumulative Distribution Function of the

From 120f3d866b671bd7cfa99955f118378d0ff41104 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:20:53 +0100
Subject: [PATCH 449/828] Added a log_trans argument for log-transforming the
 likelihoods

---
 R/likelihood_estimation.R | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index 7b02a509..83eaab24 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -92,7 +92,8 @@ estimate_likelihood <- function(chains_observed,
         offspring_ll,
         c(list(
           chains_observed = calc_sizes, offspring_sampler = offspring_sampler,
-          chain_statistic = chain_statistic, chain_stat_max = chain_stat_max
+          chain_statistic = chain_statistic, chain_stat_max = chain_stat_max,
+          log_trans = log_trans
         ), pars)
       )
   }

From 304ce9370fac3d9ba761067804e37d118fbd97fe Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:21:20 +0100
Subject: [PATCH 450/828] Replaced input checking with a helper function

---
 R/likelihood_estimation.R | 5 ++---
 1 file changed, 2 insertions(+), 3 deletions(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index 83eaab24..83f121b9 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -45,9 +45,8 @@ estimate_likelihood <- function(chains_observed,
   chain_statistic <- match.arg(chain_statistic)
 
   ## checks
-  if (!is.character(offspring_sampler)) {
-    stop("Object passed as 'offspring_sampler' is not a character string.")
-  }
+  check_offspring_valid(offspring_sampler)
+
   if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
   if (obs_prob < 1) {
     if (missing(nsim_obs)) {

From ec08ef3489df46411bec5f364044c82256b5048f Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:21:57 +0100
Subject: [PATCH 451/828] Added curly brackets to inline function to improve
 readability

---
 R/likelihood_estimation.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index 83f121b9..ffbebc17 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -62,7 +62,7 @@ estimate_likelihood <- function(chains_observed,
                                            ),
                                chain_stat_max), simplify = FALSE)
     size_x <- unlist(sampled_x)
-    if (!is.finite(chain_stat_max)) chain_stat_max <- max(size_x) + 1
+    if (!is.finite(chain_stat_max)) {chain_stat_max <- max(size_x) + 1}
   } else {
     chains_observed[chains_observed >= chain_stat_max] <- chain_stat_max
     size_x <- chains_observed
@@ -116,7 +116,7 @@ estimate_likelihood <- function(chains_observed,
     likelihoods[sx[!(sx %in% exclude)]]
   })
 
-  if (!individual) chains_likelihood <- vapply(chains_likelihood, sum, 0)
+  if (!individual) {chains_likelihood <- vapply(chains_likelihood, sum, 0)}
 
   return(chains_likelihood)
 }

From d559a177954de7d2625cada6c26ff2d46105dd59 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:22:33 +0100
Subject: [PATCH 452/828] Replaced explicit code with helper functions

---
 R/likelihood_estimation.R | 10 ++++------
 1 file changed, 4 insertions(+), 6 deletions(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index ffbebc17..93e0e201 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -52,11 +52,9 @@ estimate_likelihood <- function(chains_observed,
     if (missing(nsim_obs)) {
       stop("'nsim_obs' must be specified if 'obs_prob' is < 1")
     }
-    if (chain_statistic == "size") {
-      sample_func <- rbinom_size
-    } else if (chain_statistic == "length") {
-      sample_func <- rgen_length
-    }
+
+    sample_func <- get_chain_statistic_func(chain_statistic)
+
     sampled_x <- replicate(nsim_obs, pmin(sample_func(length(chains_observed),
                                            chains_observed, obs_prob
                                            ),
@@ -78,7 +76,7 @@ estimate_likelihood <- function(chains_observed,
 
   ## get likelihood function as given by offspring_sampler and chain_statistic
   likelihoods <- vector(mode = "numeric")
-  ll_func <- paste(offspring_sampler, chain_statistic, "ll", sep = "_")
+  ll_func <- construct_offspring_ll_name(offspring_sampler, chain_statistic)
   pars <- as.list(unlist(list(...))) ## converts vectors to lists
 
   ## calculate likelihoods

From d98eb7511bbe670a107ffbe2dbf059ffec03d1f1 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 14 Jun 2023 17:31:55 +0100
Subject: [PATCH 453/828] Linting

---
 R/helpers.R               | 4 ++--
 R/likelihood_estimation.R | 8 ++++++--
 2 files changed, 8 insertions(+), 4 deletions(-)

diff --git a/R/helpers.R b/R/helpers.R
index f0d11f01..99f66da6 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -65,7 +65,7 @@ get_offspring_func <- function(offspring_sampler, n, susc, pop,
 #'
 #' @return a function for calculating chain statistics
 #' @keywords internal
-get_chain_statistic_func <- function(chain_statistic){
+get_chain_statistic_func <- function(chain_statistic) {
   func <- if (chain_statistic == "size") {
     rbinom_size
   } else if (chain_statistic == "length") {
@@ -81,7 +81,7 @@ get_chain_statistic_func <- function(chain_statistic){
 #'
 #' @return an analytical offspring likelihood function
 #' @keywords internal
-construct_offspring_ll_name <- function(offspring_sampler, chain_statistic){
+construct_offspring_ll_name <- function(offspring_sampler, chain_statistic) {
   ll_name <- paste(offspring_sampler, chain_statistic, "ll", sep = "_")
   return(ll_name)
 }
diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index 93e0e201..8f663805 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -60,7 +60,9 @@ estimate_likelihood <- function(chains_observed,
                                            ),
                                chain_stat_max), simplify = FALSE)
     size_x <- unlist(sampled_x)
-    if (!is.finite(chain_stat_max)) {chain_stat_max <- max(size_x) + 1}
+    if (!is.finite(chain_stat_max)) {
+      chain_stat_max <- max(size_x) + 1
+      }
   } else {
     chains_observed[chains_observed >= chain_stat_max] <- chain_stat_max
     size_x <- chains_observed
@@ -114,7 +116,9 @@ estimate_likelihood <- function(chains_observed,
     likelihoods[sx[!(sx %in% exclude)]]
   })
 
-  if (!individual) {chains_likelihood <- vapply(chains_likelihood, sum, 0)}
+  if (!individual) {
+    chains_likelihood <- vapply(chains_likelihood, sum, 0)
+    }
 
   return(chains_likelihood)
 }

From d66a539748c533aaf12f3f94672bb0635e4ea6ab Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 15 Jun 2023 17:00:12 +0100
Subject: [PATCH 454/828] Changed n to nchains to fix a partial matching issue

---
 R/simulate.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index ff6fad0e..f0601fe3 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -215,7 +215,7 @@ simulate_tree <- function(nchains, offspring_sampler,
 #' @param chain_stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to `Inf`.
 #' @examples
-#' simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,
+#' simulate_vect(nchains = 10, offspring_sampler = "pois", lambda = 2,
 #' chain_stat_max = 10)
 #' @export
 simulate_vect <- function(nchains, offspring_sampler,

From c3cca09d14b74c15aa8df40aa23b1dfd33df1a54 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 15 Jun 2023 17:00:58 +0100
Subject: [PATCH 455/828] Added a check for the chains_tree attribute

---
 R/checks.R | 11 +++++++++++
 1 file changed, 11 insertions(+)

diff --git a/R/checks.R b/R/checks.R
index 967b7eee..ed4338fe 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -57,3 +57,14 @@ check_nchains_valid <- function(nchains) {
     stop("`nchains` must be > 0 but less than `Inf`")
   }
 }
+
+#' Title
+#'
+#' @param x An [`epichains`] object
+#'
+#' @keywords internal
+check_chain_tree_attribute <- function(x){
+  if (attributes(x)$chain_type != "chains_tree") {
+    stop("Object must be an epichains object with a chains_tree attribute.")
+  }
+}

From 9d055c187b6db6f69c4e0e683e8c375f471ecf9d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 15 Jun 2023 17:01:14 +0100
Subject: [PATCH 456/828] Added aggregate to NAMESPACE

---
 NAMESPACE | 1 +
 1 file changed, 1 insertion(+)

diff --git a/NAMESPACE b/NAMESPACE
index 61a29bb9..05a8ce35 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -1,5 +1,6 @@
 # Generated by roxygen2: do not edit by hand
 
+S3method(aggregate,epichains)
 S3method(format,epichains)
 S3method(head,epichains)
 S3method(plot,epichains)

From 47ca5d7e00b5a3dde6a655f1782a675c7ec7c08d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 15 Jun 2023 17:01:54 +0100
Subject: [PATCH 457/828] Added a check for the epichains_aggregate_df class

---
 R/epichains.R | 11 +++++++++++
 1 file changed, 11 insertions(+)

diff --git a/R/epichains.R b/R/epichains.R
index 10730f47..ad5683b0 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -135,6 +135,17 @@ is_epichains <- function(x) {
   inherits(x, "epichains")
 }
 
+#' Check if an object is of class "epichains_aggregate_df"
+#'
+#' @param x An [`epichains`] object
+#'
+#' @keywords internal
+is_epichains_aggregate_df <- function(x) {
+  if (!inherits(x, "epichains_aggregate_df")) {
+    stop("Object must have class 'epichains_aggregate_df'")
+  }
+}
+
 #' `epichains` class validator
 #'
 #' @param x An `epichains` object

From 72c0948caaff9ea3620fc9dc0b0892d28702c9cc Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 15 Jun 2023 17:07:31 +0100
Subject: [PATCH 458/828] Rewrote plot() to use objects aggregated through
 aggregate()

---
 R/epichains.R | 116 ++++++++++++++++++++++++++++++++++----------------
 1 file changed, 80 insertions(+), 36 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index ad5683b0..ad739422 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -209,46 +209,90 @@ tail.epichains <- function(x, ...) {
 
 #' Plot epichains tree objects
 #'
-#' @param x An [`epichains`] object with a chains_tree attribute
-#' @param ... Other arguments passed to plot
+#' This method accepts epichains aggregated through the `aggregate` method,
+#' which returns an object of class `epichains_aggregate_df` with an
+#' `aggregated_over` attribute that tells `plot()` which variable to plot.
 #'
-#' @return A plot of cases over time and generation
+#' @param x An [`epichains`] object with a chains_tree attribute.
+#' @param ... Other arguments passed to plot.
+#'
+#' @return A plot of cases over time and generation.
 #' @author James M. Azam
+#' @example
+#' # Generate chains with poisson offspring using `simulate_tree()`
+#' set.seed(123)
+#' chains <- simulate_tree(nchains = 10,
+#' serials_sampler = function(x) rpois(x, 2),
+#' offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
+#'
+#' # Aggregate cases per time and plot the results
+#' cases_per_time <- aggregate(chains, "time")
+#' plot(cases_per_time)
+#'
+#' # Aggregate cases per generation and plot the results
+#' cases_per_gen <- aggregate(chains, "generation")
+#' plot(cases_per_gen)
+#'
+#' # Aggregate cases per time and generation and plot the results
+#' cases_aggreg <- aggregate(chains, "both")
+#' plot(cases_aggreg)
+#'
+#' # Generate chains with negative
+#' # binomial offspring and from a fixed population size using
+#' # `simulate_tree_from_pop()`
+#' set.seed(123)
+#' chains_bn <- simulate_tree_from_pop(pop = 1000, offspring_sampler = "nbinom",
+#' mean_offspring = 0.5, disp_offspring = 1.1,
+#' serial_sampler = function(x) rpois(x, 2))
+#'
+#' # Plot them
+#' plot(aggregate(chains_bn, "time"))
 #' @export
+#' @author James M. Azam
 plot.epichains <- function(x, ...) {
-  validate_epichains(x)
 
-  if (attributes(x)$chain_type != "chains_tree") {
-    stop("Object must be an epichains object with a chains_tree attribute.")
-  }
+  # Object should have been aggregated using the aggregate.epichains method
+  is_epichains_aggregate_df(x)
+
+  check_chain_tree_attribute(x)
 
-  # Count the number of cases per generation
-  cases_per_generation <- stats::aggregate(sim_id ~ generation,
-                                           x = as.data.frame(x),
-                                           FUN = NROW
-                                           )
-  # Count the number of cases per time
-  cases_per_time <- stats::aggregate(sim_id ~ time, x = as.data.frame(x),
-                                     FUN = NROW)
-
-  # Set up grid
-  graphics::par(mfrow = c(1, 2), mar = c(4, 3, 3, 1), oma = c(0, 0, 0, 0))
-
-  # Make first plot
-  graphics::plot(cases_per_generation$generation,
-       cases_per_generation$sim_id,
-       xlab = "Generation",
-       ylab = "Cases",
-       type = "b",
-       main = "Number of cases per generation"
-       )
-
-  # Make second plot
-  graphics::plot(cases_per_time$time,
-       cases_per_time$sim_id,
-       xlab = "Time",
-       ylab = "Cases",
-       type = "b",
-       main = "Number of cases per time"
-  )
+  plotting_var <- attributes(x)$aggregated_over
+
+  if (plotting_var == "time") {
+    graphics::barplot(x$cases,
+      names.arg = x$time,
+      xlab = "Time",
+      ylab = "Cases",
+      type = "b", ,
+      col = "tomato3",
+      main = "Number of cases per time"
+    )
+  } else if (plotting_var == "generation") {
+    graphics::barplot(x$cases,
+      names.arg = x$generation,
+      xlab = "Generation",
+      ylab = "Cases", ,
+      col = "steelblue",
+      main = "Number of cases per generation"
+    )
+  } else if (plotting_var == "both") {
+    par(mfrow = c(1, 2))
+    # Make first plot
+    graphics::barplot(x[[1]]$cases,
+      names.arg = x$time,
+      xlab = "Time",
+      ylab = "Cases",
+      type = "b", ,
+      col = "tomato3",
+      main = "Number of cases per time"
+    )
+    # Make second plot
+    graphics::barplot(x[[2]]$cases,
+      names.arg = x$generation,
+      xlab = "Generation",
+      ylab = "Cases", ,
+      col = "steelblue",
+      main = "Number of cases per generation"
+    )
+  }
 }

From f7706e8ae67aba9fcd78651154ae34fdf541ce86 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 15 Jun 2023 17:07:48 +0100
Subject: [PATCH 459/828] Added an aggregate method

---
 R/epichains.R | 76 +++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 76 insertions(+)

diff --git a/R/epichains.R b/R/epichains.R
index ad739422..cf0a68c2 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -296,3 +296,79 @@ plot.epichains <- function(x, ...) {
     )
   }
 }
+
+#' Aggregate cases in epichains objects according to a grouping variable
+#'
+#' @param x An [`epichains`] object.
+#' @param grouping_var The variable to group and count over. Options include
+#' "time", "generation", and "both".
+#' @param ... Other arguments passed to aggregate.
+#'
+#' @return If grouping_var is either "time" or "generation", a data.frame
+#' with cases aggregated over `grouping_var`; If
+#' \code{grouping_var = "both"}, a list of data.frames, the first being for
+#'  cases over time, and the second being for cases over generations.
+#' @export
+#'
+#' @examples
+#' set.seed(123)
+#' chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,
+#' offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
+#' chains
+#'
+#' # Aggregate cases per time
+#' aggregate(chains, grouping_var = "time")
+#'
+#' # Aggregate cases per generation
+#' aggregate(chains, grouping_var = "generation")
+#'
+#' # Aggregate cases per both time and generation
+#' aggregate(chains, grouping_var = "both")
+aggregate.epichains <- function(x,
+                                grouping_var = c("time",
+                                                 "generation",
+                                                 "both"
+                                                 ),
+                                ...) {
+  validate_epichains(x)
+  # Check that the object is of type "chains_tree"
+  if (attributes(x)$chain_type == "chains_vec") {
+    stop("object must be an epichains object with 'chains_tree' attribute.")
+  }
+
+  # Get grouping variable
+  grouping_var <- match.arg(grouping_var)
+
+  out <- if (grouping_var == "time") {
+    # Count the number of cases per generation
+    stats::aggregate(list(cases = x$sim_id),
+      list(time = x$time),
+      FUN = NROW
+    )
+  } else if (grouping_var == "generation") {
+    # Count the number of cases per time
+    stats::aggregate(list(cases = x$sim_id),
+      list(generation = x$generation),
+      FUN = NROW
+    )
+  } else if (grouping_var == "both") {
+    # Count the number of cases per time
+    list(
+      stats::aggregate(list(cases = x$sim_id),
+        list(time = x$time),
+        FUN = NROW
+      ),
+      # Count the number of cases per generation
+      stats::aggregate(list(cases = x$sim_id),
+        list(generation = x$generation),
+        FUN = NROW
+      )
+    )
+  }
+
+  structure(out,
+    class = c("epichains_aggregate_df", "tbl", "data.frame"),
+    chain_type = attributes(x)$chain_type,
+    aggregated_over = grouping_var
+  )
+}

From 6e09b17bef838cfe29de51f91b1f4b20625e047d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 15 Jun 2023 17:08:06 +0100
Subject: [PATCH 460/828] Regenerated roxygen docs

---
 man/plot.epichains.Rd | 10 ++++++----
 man/simulate_vect.Rd  |  2 +-
 2 files changed, 7 insertions(+), 5 deletions(-)

diff --git a/man/plot.epichains.Rd b/man/plot.epichains.Rd
index 7fa17943..2c27e15e 100644
--- a/man/plot.epichains.Rd
+++ b/man/plot.epichains.Rd
@@ -7,15 +7,17 @@
 \method{plot}{epichains}(x, ...)
 }
 \arguments{
-\item{x}{An \code{\link{epichains}} object with a chains_tree attribute}
+\item{x}{An \code{\link{epichains}} object with a chains_tree attribute.}
 
-\item{...}{Other arguments passed to plot}
+\item{...}{Other arguments passed to plot.}
 }
 \value{
-A plot of cases over time and generation
+A plot of cases over time and generation.
 }
 \description{
-Plot epichains tree objects
+This method accepts epichains aggregated through the \code{aggregate} method,
+which returns an object of class \code{epichains_aggregate_df} with an
+\code{aggregated_over}.
 }
 \author{
 James M. Azam
diff --git a/man/simulate_vect.Rd b/man/simulate_vect.Rd
index cd7fafc7..cdef8113 100644
--- a/man/simulate_vect.Rd
+++ b/man/simulate_vect.Rd
@@ -35,6 +35,6 @@ computed. Results above the specified value, are set to \code{Inf}.}
 Simulate transmission chains without tree (as a vector)
 }
 \examples{
-simulate_vect(n = 10, offspring_sampler = "pois", lambda = 2,
+simulate_vect(nchains = 10, offspring_sampler = "pois", lambda = 2,
 chain_stat_max = 10)
 }

From 1964b2f0b7d3aef502d5b7878f93cf2c90285ed4 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 13:19:30 +0100
Subject: [PATCH 461/828] Cleaned up documentation for the plot method

---
 R/epichains.R         | 9 ++++-----
 man/plot.epichains.Rd | 5 +++--
 2 files changed, 7 insertions(+), 7 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index cf0a68c2..02ae4386 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -216,10 +216,11 @@ tail.epichains <- function(x, ...) {
 #' @param x An [`epichains`] object with a chains_tree attribute.
 #' @param ... Other arguments passed to plot.
 #'
-#' @return A plot of cases over time and generation.
+#' @return A plot of cases over time, generation, or both, depending on what
+#' was aggregated over. See \code{?epichains::aggregate}.
 #' @author James M. Azam
 #' @example
-#' # Generate chains with poisson offspring using `simulate_tree()`
+#' # Generate chains with poisson offspring using simulate_tree()
 #' set.seed(123)
 #' chains <- simulate_tree(nchains = 10,
 #' serials_sampler = function(x) rpois(x, 2),
@@ -228,7 +229,6 @@ tail.epichains <- function(x, ...) {
 #' # Aggregate cases per time and plot the results
 #' cases_per_time <- aggregate(chains, "time")
 #' plot(cases_per_time)
-#'
 #' # Aggregate cases per generation and plot the results
 #' cases_per_gen <- aggregate(chains, "generation")
 #' plot(cases_per_gen)
@@ -239,7 +239,7 @@ tail.epichains <- function(x, ...) {
 #'
 #' # Generate chains with negative
 #' # binomial offspring and from a fixed population size using
-#' # `simulate_tree_from_pop()`
+#' # simulate_tree_from_pop()
 #' set.seed(123)
 #' chains_bn <- simulate_tree_from_pop(pop = 1000, offspring_sampler = "nbinom",
 #' mean_offspring = 0.5, disp_offspring = 1.1,
@@ -248,7 +248,6 @@ tail.epichains <- function(x, ...) {
 #' # Plot them
 #' plot(aggregate(chains_bn, "time"))
 #' @export
-#' @author James M. Azam
 plot.epichains <- function(x, ...) {
 
   # Object should have been aggregated using the aggregate.epichains method
diff --git a/man/plot.epichains.Rd b/man/plot.epichains.Rd
index 2c27e15e..c8e06d1b 100644
--- a/man/plot.epichains.Rd
+++ b/man/plot.epichains.Rd
@@ -12,12 +12,13 @@
 \item{...}{Other arguments passed to plot.}
 }
 \value{
-A plot of cases over time and generation.
+A plot of cases over time, generation, or both, depending on what
+was aggregated over. See \code{?epichains::aggregate}.
 }
 \description{
 This method accepts epichains aggregated through the \code{aggregate} method,
 which returns an object of class \code{epichains_aggregate_df} with an
-\code{aggregated_over}.
+\code{aggregated_over} attribute that tells \code{plot()} which variable to plot.
 }
 \author{
 James M. Azam

From 090c1135aedae1aaf5c19051c8111ac1d6242f6c Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:55:06 +0100
Subject: [PATCH 462/828] Added utils to imports

---
 DESCRIPTION | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index 157f7803..b50545a1 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -24,7 +24,8 @@ BugReports: https://github.com/epiverse-trace/epichains/issues
 Depends:
     R (>= 3.6.0)
 Imports: 
-    stats
+    stats,
+    utils
 Suggests:
     bookdown,
     covr,

From c934380e5fefc943b83e0a9ff79d0eff9e7443a0 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:55:25 +0100
Subject: [PATCH 463/828] Unexported is_epichains()

---
 NAMESPACE | 1 -
 1 file changed, 1 deletion(-)

diff --git a/NAMESPACE b/NAMESPACE
index 05a8ce35..d7aa250a 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -9,7 +9,6 @@ S3method(summary,epichains)
 S3method(tail,epichains)
 export(dborel)
 export(estimate_likelihood)
-export(is_epichains)
 export(rborel)
 export(rnbinom_mean_disp)
 export(simulate_tree)

From 4f7b0051b192dff74a8129f1750a303cdd802bad Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:55:57 +0100
Subject: [PATCH 464/828] Imported functions from graphics and stats

---
 NAMESPACE | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/NAMESPACE b/NAMESPACE
index d7aa250a..9f7d5e08 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -14,5 +14,8 @@ export(rnbinom_mean_disp)
 export(simulate_tree)
 export(simulate_tree_from_pop)
 export(simulate_vect)
+importFrom(graphics,barplot)
+importFrom(graphics,par)
+importFrom(stats,aggregate)
 importFrom(utils,head)
 importFrom(utils,tail)

From 1b44e62663662cd704eacd5767258c44d636b0fa Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:56:18 +0100
Subject: [PATCH 465/828] Add keyword internal to is_epichains

---
 man/is_epichains.Rd | 1 +
 1 file changed, 1 insertion(+)

diff --git a/man/is_epichains.Rd b/man/is_epichains.Rd
index dd365904..aa2d540d 100644
--- a/man/is_epichains.Rd
+++ b/man/is_epichains.Rd
@@ -16,3 +16,4 @@ otherwise
 \description{
 Checks whether the object is an \code{epichains}
 }
+\keyword{internal}

From 88dc3ab4b22a313e2dfd30e344b6fde8e13bbd5f Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:56:46 +0100
Subject: [PATCH 466/828] Redocumented the plot method

---
 man/plot.epichains.Rd | 36 +++++++++++++++++++++++++++++++++---
 1 file changed, 33 insertions(+), 3 deletions(-)

diff --git a/man/plot.epichains.Rd b/man/plot.epichains.Rd
index c8e06d1b..cc42095a 100644
--- a/man/plot.epichains.Rd
+++ b/man/plot.epichains.Rd
@@ -7,19 +7,49 @@
 \method{plot}{epichains}(x, ...)
 }
 \arguments{
-\item{x}{An \code{\link{epichains}} object with a chains_tree attribute.}
+\item{x}{An \code{epichains_aggregate_df} object with a \code{chains_tree} attribute.}
 
 \item{...}{Other arguments passed to plot.}
 }
 \value{
-A plot of cases over time, generation, or both, depending on what
-was aggregated over. See \code{?epichains::aggregate}.
+A plot of cases over time, generation, or both, depending on which
+of the options in the simulated dataset was aggregated over. See
+\code{?epichains::aggregate}.
 }
 \description{
 This method accepts epichains aggregated through the \code{aggregate} method,
 which returns an object of class \code{epichains_aggregate_df} with an
 \code{aggregated_over} attribute that tells \code{plot()} which variable to plot.
 }
+\examples{
+# Generate chains with poisson offspring using simulate_tree()
+set.seed(123)
+chains <- simulate_tree(nchains = 10,
+serials_sampler = function(x) rpois(x, 2),
+offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
+
+# Aggregate cases per time and plot the results
+cases_per_time <- aggregate(chains, "time")
+plot(cases_per_time)
+# Aggregate cases per generation and plot the results
+cases_per_gen <- aggregate(chains, "generation")
+plot(cases_per_gen)
+
+# Aggregate cases per time and generation and plot the results
+cases_aggreg <- aggregate(chains, "both")
+plot(cases_aggreg)
+
+# Generate chains with negative
+# binomial offspring and from a fixed population size using
+# simulate_tree_from_pop()
+set.seed(123)
+chains_bn <- simulate_tree_from_pop(pop = 1000, offspring_sampler = "nbinom",
+mean_offspring = 0.5, disp_offspring = 1.1,
+serial_sampler = function(x) rpois(x, 2))
+
+# Plot them
+plot(aggregate(chains_bn, "time"))
+}
 \author{
 James M. Azam
 }

From d4d80f50182cee183be8754af3077ede0476c9b2 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:57:13 +0100
Subject: [PATCH 467/828] Made is_epichains internal

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 02ae4386..646af886 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -130,7 +130,7 @@ summary.epichains <- function(object, ...) {
 #'
 #' @return logical, `TRUE` if the object is an `epichains` and `FALSE`
 #' otherwise
-#' @export
+#' @keywords internal
 is_epichains <- function(x) {
   inherits(x, "epichains")
 }

From 5773a359078cba209e5b7a8f9e4e7fbaab3ec425 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:57:55 +0100
Subject: [PATCH 468/828] Cleaned up plotting method roxygen docs

---
 R/epichains.R | 9 +++++----
 1 file changed, 5 insertions(+), 4 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 646af886..a8e1c9fe 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -213,11 +213,12 @@ tail.epichains <- function(x, ...) {
 #' which returns an object of class `epichains_aggregate_df` with an
 #' `aggregated_over` attribute that tells `plot()` which variable to plot.
 #'
-#' @param x An [`epichains`] object with a chains_tree attribute.
+#' @param x An `epichains_aggregate_df` object with a `chains_tree` attribute.
 #' @param ... Other arguments passed to plot.
-#'
-#' @return A plot of cases over time, generation, or both, depending on what
-#' was aggregated over. See \code{?epichains::aggregate}.
+#' @importFrom graphics barplot par
+#' @return A plot of cases over time, generation, or both, depending on which
+#' of the options in the simulated dataset was aggregated over. See
+#' \code{?epichains::aggregate}.
 #' @author James M. Azam
 #' @example
 #' # Generate chains with poisson offspring using simulate_tree()

From 9f7d4adc68fbec037a0a27b87a78db7089860618 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:58:22 +0100
Subject: [PATCH 469/828] Fixed example tag

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index a8e1c9fe..12504fa6 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -220,7 +220,7 @@ tail.epichains <- function(x, ...) {
 #' of the options in the simulated dataset was aggregated over. See
 #' \code{?epichains::aggregate}.
 #' @author James M. Azam
-#' @example
+#' @examples
 #' # Generate chains with poisson offspring using simulate_tree()
 #' set.seed(123)
 #' chains <- simulate_tree(nchains = 10,

From b3cae424f3eb1c394204909851049bddad073e9b Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:59:08 +0100
Subject: [PATCH 470/828] Fixed the returned structure from the aggregate
 method

---
 R/epichains.R | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 12504fa6..bfcc8c38 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -367,8 +367,9 @@ aggregate.epichains <- function(x,
   }
 
   structure(out,
-    class = c("epichains_aggregate_df", "tbl", "data.frame"),
+    class = c("epichains_aggregate_df", class(out)),
     chain_type = attributes(x)$chain_type,
+    rownames = NULL,
     aggregated_over = grouping_var
   )
 }

From 9e91db8087412278174349ca412f5ad0ee4e3d90 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 16:59:52 +0100
Subject: [PATCH 471/828] Tightened the check for the chains_tree attribute in
 the aggregate method

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index bfcc8c38..cbd06d85 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -332,7 +332,7 @@ aggregate.epichains <- function(x,
                                 ...) {
   validate_epichains(x)
   # Check that the object is of type "chains_tree"
-  if (attributes(x)$chain_type == "chains_vec") {
+  if (attributes(x)$chain_type != "chains_tree") {
     stop("object must be an epichains object with 'chains_tree' attribute.")
   }
 

From eb868dd88e217c68fc5542091a8eb43fdf312408 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 17:00:14 +0100
Subject: [PATCH 472/828] Styled the code

---
 R/epichains.R | 10 ++++------
 1 file changed, 4 insertions(+), 6 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index cbd06d85..95e975cf 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -355,14 +355,12 @@ aggregate.epichains <- function(x,
     # Count the number of cases per time
     list(
       stats::aggregate(list(cases = x$sim_id),
-        list(time = x$time),
-        FUN = NROW
-      ),
+                       list(time = x$time),
+                       FUN = NROW),
       # Count the number of cases per generation
       stats::aggregate(list(cases = x$sim_id),
-        list(generation = x$generation),
-        FUN = NROW
-      )
+                       list(generation = x$generation),
+                       FUN = NROW)
     )
   }
 

From e721e77f506a8d19ae462f882abf2734caef20a0 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 17:00:57 +0100
Subject: [PATCH 473/828] Removed the type argument passed to barplot

---
 R/epichains.R | 1 -
 1 file changed, 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 95e975cf..2298cba7 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -263,7 +263,6 @@ plot.epichains <- function(x, ...) {
       names.arg = x$time,
       xlab = "Time",
       ylab = "Cases",
-      type = "b", ,
       col = "tomato3",
       main = "Number of cases per time"
     )

From d7171eadb714b4866d5274209b2fc6a7b696f480 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 17:16:25 +0100
Subject: [PATCH 474/828] Imported aggregate generic

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 2298cba7..f214858a 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -302,7 +302,7 @@ plot.epichains <- function(x, ...) {
 #' @param grouping_var The variable to group and count over. Options include
 #' "time", "generation", and "both".
 #' @param ... Other arguments passed to aggregate.
-#'
+#' @importFrom stats aggregate
 #' @return If grouping_var is either "time" or "generation", a data.frame
 #' with cases aggregated over `grouping_var`; If
 #' \code{grouping_var = "both"}, a list of data.frames, the first being for

From 803cb36cdd3308418ffa15b3ecda352bec3b85d4 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 17:16:48 +0100
Subject: [PATCH 475/828] Removed plotting method

---
 NAMESPACE             |  3 --
 R/epichains.R         | 89 -------------------------------------------
 man/plot.epichains.Rd | 55 --------------------------
 3 files changed, 147 deletions(-)
 delete mode 100644 man/plot.epichains.Rd

diff --git a/NAMESPACE b/NAMESPACE
index 9f7d5e08..e31e4230 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -3,7 +3,6 @@
 S3method(aggregate,epichains)
 S3method(format,epichains)
 S3method(head,epichains)
-S3method(plot,epichains)
 S3method(print,epichains)
 S3method(summary,epichains)
 S3method(tail,epichains)
@@ -14,8 +13,6 @@ export(rnbinom_mean_disp)
 export(simulate_tree)
 export(simulate_tree_from_pop)
 export(simulate_vect)
-importFrom(graphics,barplot)
-importFrom(graphics,par)
 importFrom(stats,aggregate)
 importFrom(utils,head)
 importFrom(utils,tail)
diff --git a/R/epichains.R b/R/epichains.R
index f214858a..caa4bbd8 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -207,95 +207,6 @@ tail.epichains <- function(x, ...) {
   utils::tail(as.data.frame(x), ...)
 }
 
-#' Plot epichains tree objects
-#'
-#' This method accepts epichains aggregated through the `aggregate` method,
-#' which returns an object of class `epichains_aggregate_df` with an
-#' `aggregated_over` attribute that tells `plot()` which variable to plot.
-#'
-#' @param x An `epichains_aggregate_df` object with a `chains_tree` attribute.
-#' @param ... Other arguments passed to plot.
-#' @importFrom graphics barplot par
-#' @return A plot of cases over time, generation, or both, depending on which
-#' of the options in the simulated dataset was aggregated over. See
-#' \code{?epichains::aggregate}.
-#' @author James M. Azam
-#' @examples
-#' # Generate chains with poisson offspring using simulate_tree()
-#' set.seed(123)
-#' chains <- simulate_tree(nchains = 10,
-#' serials_sampler = function(x) rpois(x, 2),
-#' offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
-#'
-#' # Aggregate cases per time and plot the results
-#' cases_per_time <- aggregate(chains, "time")
-#' plot(cases_per_time)
-#' # Aggregate cases per generation and plot the results
-#' cases_per_gen <- aggregate(chains, "generation")
-#' plot(cases_per_gen)
-#'
-#' # Aggregate cases per time and generation and plot the results
-#' cases_aggreg <- aggregate(chains, "both")
-#' plot(cases_aggreg)
-#'
-#' # Generate chains with negative
-#' # binomial offspring and from a fixed population size using
-#' # simulate_tree_from_pop()
-#' set.seed(123)
-#' chains_bn <- simulate_tree_from_pop(pop = 1000, offspring_sampler = "nbinom",
-#' mean_offspring = 0.5, disp_offspring = 1.1,
-#' serial_sampler = function(x) rpois(x, 2))
-#'
-#' # Plot them
-#' plot(aggregate(chains_bn, "time"))
-#' @export
-plot.epichains <- function(x, ...) {
-
-  # Object should have been aggregated using the aggregate.epichains method
-  is_epichains_aggregate_df(x)
-
-  check_chain_tree_attribute(x)
-
-  plotting_var <- attributes(x)$aggregated_over
-
-  if (plotting_var == "time") {
-    graphics::barplot(x$cases,
-      names.arg = x$time,
-      xlab = "Time",
-      ylab = "Cases",
-      col = "tomato3",
-      main = "Number of cases per time"
-    )
-  } else if (plotting_var == "generation") {
-    graphics::barplot(x$cases,
-      names.arg = x$generation,
-      xlab = "Generation",
-      ylab = "Cases", ,
-      col = "steelblue",
-      main = "Number of cases per generation"
-    )
-  } else if (plotting_var == "both") {
-    par(mfrow = c(1, 2))
-    # Make first plot
-    graphics::barplot(x[[1]]$cases,
-      names.arg = x$time,
-      xlab = "Time",
-      ylab = "Cases",
-      type = "b", ,
-      col = "tomato3",
-      main = "Number of cases per time"
-    )
-    # Make second plot
-    graphics::barplot(x[[2]]$cases,
-      names.arg = x$generation,
-      xlab = "Generation",
-      ylab = "Cases", ,
-      col = "steelblue",
-      main = "Number of cases per generation"
-    )
-  }
-}
-
 #' Aggregate cases in epichains objects according to a grouping variable
 #'
 #' @param x An [`epichains`] object.
diff --git a/man/plot.epichains.Rd b/man/plot.epichains.Rd
deleted file mode 100644
index cc42095a..00000000
--- a/man/plot.epichains.Rd
+++ /dev/null
@@ -1,55 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/epichains.R
-\name{plot.epichains}
-\alias{plot.epichains}
-\title{Plot epichains tree objects}
-\usage{
-\method{plot}{epichains}(x, ...)
-}
-\arguments{
-\item{x}{An \code{epichains_aggregate_df} object with a \code{chains_tree} attribute.}
-
-\item{...}{Other arguments passed to plot.}
-}
-\value{
-A plot of cases over time, generation, or both, depending on which
-of the options in the simulated dataset was aggregated over. See
-\code{?epichains::aggregate}.
-}
-\description{
-This method accepts epichains aggregated through the \code{aggregate} method,
-which returns an object of class \code{epichains_aggregate_df} with an
-\code{aggregated_over} attribute that tells \code{plot()} which variable to plot.
-}
-\examples{
-# Generate chains with poisson offspring using simulate_tree()
-set.seed(123)
-chains <- simulate_tree(nchains = 10,
-serials_sampler = function(x) rpois(x, 2),
-offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
-
-# Aggregate cases per time and plot the results
-cases_per_time <- aggregate(chains, "time")
-plot(cases_per_time)
-# Aggregate cases per generation and plot the results
-cases_per_gen <- aggregate(chains, "generation")
-plot(cases_per_gen)
-
-# Aggregate cases per time and generation and plot the results
-cases_aggreg <- aggregate(chains, "both")
-plot(cases_aggreg)
-
-# Generate chains with negative
-# binomial offspring and from a fixed population size using
-# simulate_tree_from_pop()
-set.seed(123)
-chains_bn <- simulate_tree_from_pop(pop = 1000, offspring_sampler = "nbinom",
-mean_offspring = 0.5, disp_offspring = 1.1,
-serial_sampler = function(x) rpois(x, 2))
-
-# Plot them
-plot(aggregate(chains_bn, "time"))
-}
-\author{
-James M. Azam
-}

From 0b37bd8815d0d2137be219fecc931bcce78a7a95 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 16 Jun 2023 17:26:55 +0100
Subject: [PATCH 476/828] Linting

---
 R/checks.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/checks.R b/R/checks.R
index ed4338fe..c4bacce9 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -63,7 +63,7 @@ check_nchains_valid <- function(nchains) {
 #' @param x An [`epichains`] object
 #'
 #' @keywords internal
-check_chain_tree_attribute <- function(x){
+check_chain_tree_attribute <- function(x) {
   if (attributes(x)$chain_type != "chains_tree") {
     stop("Object must be an epichains object with a chains_tree attribute.")
   }

From b11680ccecaea9fb83982e9658d0014e97e8d14f Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 14:48:12 +0100
Subject: [PATCH 477/828] Updated .lintr to match up with packagetemplate

---
 .lintr | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

diff --git a/.lintr b/.lintr
index 12d8a253..95404597 100644
--- a/.lintr
+++ b/.lintr
@@ -6,7 +6,10 @@ linters: linters_with_tags(
     extraction_operator_linter = NULL,
     todo_comment_linter = NULL,
     function_argument_linter = NULL,
-    cyclocomp_linter = NULL,
+    indentation_linter = NULL, # unstable as of lintr 3.1.0
     # Use minimum R declared in DESCRIPTION or fall back to current R version
     backport_linter(if (length(x <- etdev::extract_min_r_version())) x else getRversion())
     )
+    exclusions: list(
+    "tests/testthat.R" = list(unused_import_linter = Inf)
+  )

From 26fbba6ad44e1a5dba06247a81fcdf8d742eda1b Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 14:51:53 +0100
Subject: [PATCH 478/828] Fixed minor issues with the .lintr file

---
 .lintr | 7 ++++---
 1 file changed, 4 insertions(+), 3 deletions(-)

diff --git a/.lintr b/.lintr
index 95404597..e2ca0f34 100644
--- a/.lintr
+++ b/.lintr
@@ -7,9 +7,10 @@ linters: linters_with_tags(
     todo_comment_linter = NULL,
     function_argument_linter = NULL,
     indentation_linter = NULL, # unstable as of lintr 3.1.0
-    # Use minimum R declared in DESCRIPTION or fall back to current R version
+    # Use minimum R declared in DESCRIPTION or fall back to current R version.
+    # Install etdev package from https://github.com/epiverse-trace/etdev
     backport_linter(if (length(x <- etdev::extract_min_r_version())) x else getRversion())
-    )
-    exclusions: list(
+  )
+exclusions: list(
     "tests/testthat.R" = list(unused_import_linter = Inf)
   )

From 317c0abd9ca8030638b696ca63980b275f1388a2 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 14:52:55 +0100
Subject: [PATCH 479/828] Modified some comments to avoid lintr issues with
 commented code

---
 vignettes/epichains_demo.Rmd | 16 +++++++++-------
 1 file changed, 9 insertions(+), 7 deletions(-)

diff --git a/vignettes/epichains_demo.Rmd b/vignettes/epichains_demo.Rmd
index ae587436..a7b0f485 100644
--- a/vignettes/epichains_demo.Rmd
+++ b/vignettes/epichains_demo.Rmd
@@ -64,23 +64,24 @@ knitr::opts_chunk$set(
 ### Printing and summary
 ```{r include=TRUE,echo=TRUE}
 library(epichains)
-# simulate_tree()
+# Using `simulate_tree()`
 simulate_tree_eg <- simulate_tree(nchains = 10,
                                   serials_sampler = function(x) 3,
-                                  offspring_sampler = "pois", 
-                                  lambda = 2, 
+                                  offspring_sampler = "pois",
+                                  lambda = 2,
                                   chain_stat_max = 10
                                   )
 
 simulate_tree_eg # print the output
 
-# simulate_vect()
-simulate_vect_eg <- simulate_vect(nchains = 10, offspring_sampler = "pois", 
+# Using simulate_vect()
+simulate_vect_eg <- simulate_vect(nchains = 10, offspring_sampler = "pois",
                                   lambda = 2, chain_stat_max = 10)
 
 simulate_vect_eg # print the output
 
-# simulate_tree_from_pop()
+# Using `simulate_tree_from_pop()`
+
 # Simulate with poisson offspring
 simulate_vect_eg_pois <- simulate_tree_from_pop(pop = 100,
                                                 offspring_sampler = "pois",
@@ -89,6 +90,7 @@ simulate_vect_eg_pois <- simulate_tree_from_pop(pop = 100,
                                                 )
 
 simulate_vect_eg_pois # print the output
+
 # Simulate with negative binomial offspring
 simulate_vect_eg_nbinom <- simulate_tree_from_pop(pop = 100,
                                                   offspring_sampler = "nbinom",
@@ -110,5 +112,5 @@ aggregate(simulate_vect_eg_pois, "time")
 aggregate(simulate_vect_eg_pois, "generation")
 
 # aggregate by both time and generation
-aggregate(simulate_vect_eg_pois, "both") 
+aggregate(simulate_vect_eg_pois, "both")
 ```

From 5dd217b06831bddd1c776ad33637560a6d54c321 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 14:54:40 +0100
Subject: [PATCH 480/828] Removed line in gitignore

---
 .gitignore | 2 --
 1 file changed, 2 deletions(-)

diff --git a/.gitignore b/.gitignore
index a2549bb3..211a19d8 100644
--- a/.gitignore
+++ b/.gitignore
@@ -31,5 +31,3 @@ rsconnect/
 /Meta/
 /docs/
 .DS_Store
-
-R/test_refactoring.R

From 36a9178b47367bfeee3c43b099bdfb5edad507f6 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 14:59:05 +0100
Subject: [PATCH 481/828] Revised the package title and description

---
 DESCRIPTION | 6 +++---
 README.Rmd  | 2 +-
 2 files changed, 4 insertions(+), 4 deletions(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index b50545a1..eb4a2f1e 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -1,6 +1,6 @@
 Package: epichains
-Title: Analysing transmission chain statistics using branching process
-    models
+Title: Simulating and Analysing Transmission Chain Statistics Using Branching Process
+    Models
 Version: 0.2.1
 Authors@R: c(
     person("Sebastian", "Funk", , "sebastian.funk@lshtm.ac.uk", role = "aut",
@@ -12,7 +12,7 @@ Authors@R: c(
     person("James M.", "Azam", , "james.azam@lshtm.ac.uk", role = c("aut", "cre"),
            comment = c(ORCID = "https://orcid.org/0000-0001-5782-7330"))
   )
-Description: Provides methods to analyse and simulate the size and length
+Description: Provides methods to simulate and analyse the size and length
     of branching processes with an arbitrary offspring distribution. These
     can be used, for example, to analyse the distribution of chain sizes
     or length of infectious disease outbreaks, as discussed in Farrington
diff --git a/README.Rmd b/README.Rmd
index de86e1e6..7463d7d8 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -19,7 +19,7 @@ knitr::opts_chunk$set(
 )
 ```
 
-# _{{ packagename }}_: Methods for analysing the size and length of transmission chains from branching process models <img src="man/figures/epichains_logo.png" align="right" height="130" />
+# _{{ packagename }}_: Methods for simulating and analysing the size and length of transmission chains from branching process models <img src="man/figures/epichains_logo.png" align="right" height="130" />
 
 <!-- badges: start -->
 ![GitHub R package version](https://img.shields.io/github/r-package/v/epiverse-trace/epichains)

From 02c189a35245bccfec8bd1b9f1a3a271af8ae0e4 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 15:07:32 +0100
Subject: [PATCH 482/828] Added a workflow for keeping the CITATION.cff file up
 to date

---
 .github/workflows/update-citation-cff.yaml | 49 ++++++++++++++++++++++
 1 file changed, 49 insertions(+)
 create mode 100644 .github/workflows/update-citation-cff.yaml

diff --git a/.github/workflows/update-citation-cff.yaml b/.github/workflows/update-citation-cff.yaml
new file mode 100644
index 00000000..0aa84155
--- /dev/null
+++ b/.github/workflows/update-citation-cff.yaml
@@ -0,0 +1,49 @@
+# Workflow derived from https://github.com/r-lib/actions/tree/master/examples
+# The action runs when:
+# - A new release is published
+# - The DESCRIPTION or inst/CITATION are modified
+# - Can be run manually
+# For customizing the triggers, visit https://docs.github.com/en/actions/learn-github-actions/events-that-trigger-workflows
+on:
+  release:
+    types: [published]
+  push:
+    paths:
+      - DESCRIPTION
+      - inst/CITATION
+      - .github/workflows/update-citation-cff.yaml
+  workflow_dispatch:
+
+name: Update CITATION.cff
+
+jobs:
+  update-citation-cff:
+    runs-on: ubuntu-latest
+    env:
+      GITHUB_PAT: ${{ secrets.GITHUB_TOKEN }}
+    steps:
+      - uses: actions/checkout@v3
+      - uses: r-lib/actions/setup-r@v2
+      - uses: r-lib/actions/setup-r-dependencies@v2
+        with:
+          extra-packages: |
+            any::cffr
+            any::V8
+      - name: Update CITATION.cff
+        run: |
+          library(cffr)
+          # Customize with your own code
+          # See https://docs.ropensci.org/cffr/articles/cffr.html
+          # Write your own keys
+          mykeys <- list()
+          # Create your CITATION.cff file
+          cff_write(keys = mykeys)
+        shell: Rscript {0}
+
+      - name: Commit results
+        run: |
+          git config --local user.email "action@github.com"
+          git config --local user.name "GitHub Action"
+          git add CITATION.cff
+          git commit -m 'Update CITATION.cff' || echo "No changes to commit"
+          git push origin || echo "No changes to commit"

From 1ebe83d85170c211760174239525a4dcc47527d1 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 15:13:53 +0100
Subject: [PATCH 483/828] Changed the version to 0.1.0 and rearranged the
 author/contributor names

---
 DESCRIPTION | 11 +++++------
 1 file changed, 5 insertions(+), 6 deletions(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index eb4a2f1e..a1a3a293 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -1,17 +1,16 @@
 Package: epichains
 Title: Simulating and Analysing Transmission Chain Statistics Using Branching Process
     Models
-Version: 0.2.1
+Version: 0.1.0
 Authors@R: c(
-    person("Sebastian", "Funk", , "sebastian.funk@lshtm.ac.uk", role = "aut",
-           comment = c(ORCID = "https://orcid.org/0000-0002-2842-3406")),
+    person("James M.", "Azam", , "james.azam@lshtm.ac.uk", role = c("aut", "cre"),
+           comment = c(ORCID = "https://orcid.org/0000-0001-5782-7330"))),
     person("Zhian N.", "Kamvar", , "zkamvar@gmail.com", role = "ctb",
            comment = c(ORCID = "https://orcid.org/0000-0003-1458-7108")),
     person("Flavio", "Finger", , "flavio.finger@epicentre.msf.org", role = "aut",
            comment = c(ORCID = "https://orcid.org/0000-0002-8613-5170")),
-    person("James M.", "Azam", , "james.azam@lshtm.ac.uk", role = c("aut", "cre"),
-           comment = c(ORCID = "https://orcid.org/0000-0001-5782-7330"))
-  )
+    person("Sebastian", "Funk", , "sebastian.funk@lshtm.ac.uk", role = "aut",
+           comment = c(ORCID = "https://orcid.org/0000-0002-2842-3406"))
 Description: Provides methods to simulate and analyse the size and length
     of branching processes with an arbitrary offspring distribution. These
     can be used, for example, to analyse the distribution of chain sizes

From 1e8ee7784071ba1eff68cda5cc030b1334b7812e Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 15:15:14 +0100
Subject: [PATCH 484/828] Added a draft of NEWS for epichains v0.1.0

---
 NEWS.md | 36 ++++++++++++++++++++++++++++++++++++
 1 file changed, 36 insertions(+)

diff --git a/NEWS.md b/NEWS.md
index 2c1c5eef..e83021b4 100644
--- a/NEWS.md
+++ b/NEWS.md
@@ -1,3 +1,39 @@
+# epichains 0.1.0
+
+## Package name change
+
+* `epichains` is a re-implementation of `bpmodels` with a focus on providing
+a dedicated class of data structures for easy manipulation and interoperability
+with other new tools in the pipeline.
+
+### Features
+
+* `simulate_tree()`: simulate transmission trees from a given number of chains.
+* `simulate_tree_from_pop()`: simulate transmission trees from a given number 
+  population size and initial immunity.
+* `simulate_vect()`: simulate a vector of observed transmission chains 
+  sizes/lengths from a given number of chains.
+* `estimate_likelihood()`: estimate the likelihood/loglikelihood of observing
+  chains of given sizes/lengths.
+
+#### Classes
+
+* An `epichains` class:
+  * superclass of `data.frame` with attributes for tracking `chain_type` as: 
+    * `chains_tree`, if returned from `simulate_tree()` or 
+    `simulate_tree_from_pop()`
+    * `chains_vec`, if returned from `simulate_vect()`.
+* An `epichains_aggregate_df` class:
+  * superclass of `data.frame` with attributes for tracking if aggregation is 
+  done over "time", "generation" or "both". Useful for `plot` method dispatch 
+  (see methods section below).
+
+#### Methods
+
+* `print()`
+* `summary()`
+* `aggregate()`
+
 # bpmodels 0.2.1
 
 ## Minor functionality change

From 71a2db8ddce64dcb0bdc0e46f272179469b09426 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 16:12:42 +0100
Subject: [PATCH 485/828] Replaced custom tests with checkmate functions

---
 DESCRIPTION | 1 +
 R/checks.R  | 6 +++---
 2 files changed, 4 insertions(+), 3 deletions(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index a1a3a293..dc24e208 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -23,6 +23,7 @@ BugReports: https://github.com/epiverse-trace/epichains/issues
 Depends:
     R (>= 3.6.0)
 Imports: 
+    checkmate,
     stats,
     utils
 Suggests:
diff --git a/R/checks.R b/R/checks.R
index c4bacce9..218e8e01 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -37,7 +37,7 @@ check_offspring_func_valid <- function(roffspring_name) {
 #'
 #' @keywords internal
 check_serial_valid <- function(serials_sampler) {
-  if (!is.function(serials_sampler)) {
+  if (!test_function(serials_sampler)) {
     stop(sprintf(
       "%s %s",
       "The `serials_sampler` argument must be a function",
@@ -47,13 +47,13 @@ check_serial_valid <- function(serials_sampler) {
 }
 
 
-#' Check that nchains is greater than 0 and not infinite
+#' Check that nchains is greater than 0 and not infinity
 #'
 #' @param nchains Number of chains to simulate.
 #'
 #' @keywords internal
 check_nchains_valid <- function(nchains) {
-  if (nchains < 1 || is.infinite(nchains)) {
+  if (!test_count(nchains, positive = TRUE)) {
     stop("`nchains` must be > 0 but less than `Inf`")
   }
 }

From c8c017a11ab8c78591c6d104a4132b33f2c47840 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 16:50:27 +0100
Subject: [PATCH 486/828] Renamed the demo vignette

---
 vignettes/{epichains_demo.Rmd => epichains.Rmd} | 0
 1 file changed, 0 insertions(+), 0 deletions(-)
 rename vignettes/{epichains_demo.Rmd => epichains.Rmd} (100%)

diff --git a/vignettes/epichains_demo.Rmd b/vignettes/epichains.Rmd
similarity index 100%
rename from vignettes/epichains_demo.Rmd
rename to vignettes/epichains.Rmd

From f94cec049123a23c2271ac9af1da3bceda2c6e14 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 16:51:23 +0100
Subject: [PATCH 487/828] Ordered the tree data.frame by sim_id and ancestor
 for printing

---
 R/epichains.R | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/R/epichains.R b/R/epichains.R
index caa4bbd8..387130c7 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -30,6 +30,10 @@ format.epichains <- function(x, ...) {
         )
       )
 
+    #sort by ancestor first
+
+    x <- x[order(x$sim_id, x$ancestor), ]
+
     # print head of the simulation output
     print(head(x[!is.na(x$ancestor), ]))
 

From 2bd809d20ab63a6a96b93d1037ac7446a65078ac Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 16:54:22 +0100
Subject: [PATCH 488/828] Replaced `View()` with `as.data.frame()` in the
 printed comments

---
 R/epichains.R | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 387130c7..d4dec281 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -57,7 +57,9 @@ format.epichains <- function(x, ...) {
     )
 
     # Offer more information to view the full dataset
-    writeLines(sprintf("Use View(<object_name>) to view the full output."))
+    writeLines(sprintf("%s %s", "Use `as.data.frame(<object_name>)`",
+                       "to view the full output in the console.")
+               )
 
   } else if (attributes(x)$chain_type == "chains_vec") {
     cat(sprintf("epichains object \n"))

From 9c752a521cab26224f6855fee1ca2e06b80ebba9 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 17:59:09 +0100
Subject: [PATCH 489/828] Replaced duplicated checks for attributes with helper
 functions

---
 R/epichains.R | 31 +++++++++++++++++++++++++++----
 1 file changed, 27 insertions(+), 4 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index d4dec281..5415ad18 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -21,7 +21,7 @@ format.epichains <- function(x, ...) {
   # summarise the information stored in x
   chain_info <- summary(x)
 
-  if (attributes(x)$chain_type == "chains_tree") {
+  if (is_chains_tree(x)) {
     writeLines(
       c(
         sprintf("`epichains` object"),
@@ -61,7 +61,7 @@ format.epichains <- function(x, ...) {
                        "to view the full output in the console.")
                )
 
-  } else if (attributes(x)$chain_type == "chains_vec") {
+  } else if (is_chains_vec(x)) {
     cat(sprintf("epichains object \n"))
     print(as.vector(x))
     cat(sprintf("Number of chains simulated: %s",
@@ -92,7 +92,7 @@ format.epichains <- function(x, ...) {
 summary.epichains <- function(object, ...) {
   validate_epichains(object)
 
-  if (attributes(object)$chain_type == "chains_tree") {
+  if (is_chains_tree(object)) {
 
     chains_ran <- length(object$n)
 
@@ -115,7 +115,7 @@ summary.epichains <- function(object, ...) {
       num_generations = num_generations,
       max_generation = max_generation
     )
-  } else if (attributes(object)$chain_type == "chains_vec") {
+  } else if (is_chains_vec(object)) {
     chains_ran <- length(object)
     max_chain_stat <- max(!is.infinite(object))
     min_chain_stat <- min(!is.infinite(object))
@@ -191,6 +191,29 @@ validate_epichains <- function(x) {
   invisible(x)
 }
 
+#' Check if an epichains object has the `chains_tree` attribute
+#'
+#' @param x An [`epichains`] object
+#'
+#' @keywords internal
+#' @author James M. Azam
+is_chains_tree <- function(x) {
+  !is.null(attributes(x)$chain_type) &&
+    attributes(x)$chain_type == "chains_tree"
+}
+
+#' Check if an epichains object has the `chains_vec` attribute
+#'
+#' @param x An [`epichains`] object
+#'
+#' @keywords internal
+#' @author James M. Azam
+is_chains_vec <- function(x) {
+  !is.null(attributes(x)$chain_type) &&
+    attributes(x)$chain_type == "chains_vec"
+}
+
+
 #' `head` method for [`epichains`] class
 #'
 #' @param x An [`epichains`] object

From 1426966c0c1c92f7fff27b7514e889ede2ba00bd Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 18:00:21 +0100
Subject: [PATCH 490/828] Improved the summary method to deal with all-infinite
 chain stats

---
 R/epichains.R | 11 ++++++++---
 1 file changed, 8 insertions(+), 3 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 5415ad18..03c7d0ed 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -117,15 +117,20 @@ summary.epichains <- function(object, ...) {
     )
   } else if (is_chains_vec(object)) {
     chains_ran <- length(object)
-    max_chain_stat <- max(!is.infinite(object))
-    min_chain_stat <- min(!is.infinite(object))
+
+    if(!all(is.infinite(object))){
+    max_chain_stat <- max(object[!is.infinite(object)])
+    min_chain_stat <- min(object[!is.infinite(object)])
+    }else{
+    max_chain_stat <- min_chain_stat <- Inf
+    }
 
     res <- list(
       unique_chains = chains_ran,
       max_chain_stat = max_chain_stat,
       min_chain_stat = min_chain_stat
     )
-  }
+    }
 
   return(res)
 }

From a30a4260e922910d21d77d3374ef0faf4944dd26 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 18:06:44 +0100
Subject: [PATCH 491/828] Moved this function elsewhere

---
 R/checks.R | 11 -----------
 1 file changed, 11 deletions(-)

diff --git a/R/checks.R b/R/checks.R
index 218e8e01..b4b5e15f 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -57,14 +57,3 @@ check_nchains_valid <- function(nchains) {
     stop("`nchains` must be > 0 but less than `Inf`")
   }
 }
-
-#' Title
-#'
-#' @param x An [`epichains`] object
-#'
-#' @keywords internal
-check_chain_tree_attribute <- function(x) {
-  if (attributes(x)$chain_type != "chains_tree") {
-    stop("Object must be an epichains object with a chains_tree attribute.")
-  }
-}

From c8283660acd06e8d637e17b3a1d581c62a2da7a5 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 18:07:28 +0100
Subject: [PATCH 492/828] Replaced custom checks with equivalents from
 checkmate pkg

---
 R/checks.R | 9 +++++----
 1 file changed, 5 insertions(+), 4 deletions(-)

diff --git a/R/checks.R b/R/checks.R
index b4b5e15f..73d173ea 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -6,7 +6,7 @@
 #' numbers).
 #' @keywords internal
 check_offspring_valid <- function(offspring_sampler) {
-  if (!is.character(offspring_sampler)) {
+  if (!checkmate::test_string(offspring_sampler)) {
     stop(sprintf(
       "%s %s",
       "'offspring_sampler' must be specified as a character string.",
@@ -23,7 +23,8 @@ check_offspring_valid <- function(offspring_sampler) {
 #' Poisson.
 #' @keywords internal
 check_offspring_func_valid <- function(roffspring_name) {
-  if (!(exists(roffspring_name)) || !is.function(get(roffspring_name))) {
+  if (!(exists(roffspring_name)) ||
+      !checkmate::test_function(get(roffspring_name))) {
     stop("Function ", roffspring_name, " does not exist.")
   }
 }
@@ -37,7 +38,7 @@ check_offspring_func_valid <- function(roffspring_name) {
 #'
 #' @keywords internal
 check_serial_valid <- function(serials_sampler) {
-  if (!test_function(serials_sampler)) {
+  if (!checkmate::test_function(serials_sampler)) {
     stop(sprintf(
       "%s %s",
       "The `serials_sampler` argument must be a function",
@@ -53,7 +54,7 @@ check_serial_valid <- function(serials_sampler) {
 #'
 #' @keywords internal
 check_nchains_valid <- function(nchains) {
-  if (!test_count(nchains, positive = TRUE)) {
+  if (!checkmate::test_count(nchains, positive = TRUE)) {
     stop("`nchains` must be > 0 but less than `Inf`")
   }
 }

From 1a7d390802e4de4aeb3c07f02cda76de434a9819 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 18:08:30 +0100
Subject: [PATCH 493/828] Modified function to return a logical

---
 R/epichains.R | 4 +---
 1 file changed, 1 insertion(+), 3 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 03c7d0ed..d7609eaf 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -152,9 +152,7 @@ is_epichains <- function(x) {
 #'
 #' @keywords internal
 is_epichains_aggregate_df <- function(x) {
-  if (!inherits(x, "epichains_aggregate_df")) {
-    stop("Object must have class 'epichains_aggregate_df'")
-  }
+  inherits(x, "epichains_aggregate_df")
 }
 
 #' `epichains` class validator

From bfbe2b349021589b74eedc91370af576ae94816c Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 18:09:08 +0100
Subject: [PATCH 494/828] Replaced custom checks for attributes with helper
 functions

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index d7609eaf..0827f0f6 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -274,7 +274,7 @@ aggregate.epichains <- function(x,
                                 ...) {
   validate_epichains(x)
   # Check that the object is of type "chains_tree"
-  if (attributes(x)$chain_type != "chains_tree") {
+  if (!is_chains_tree(x)) {
     stop("object must be an epichains object with 'chains_tree' attribute.")
   }
 

From 4e8579a74371da61ce8051d722e4de37e2597f3e Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 18:09:22 +0100
Subject: [PATCH 495/828] Added author tags

---
 R/epichains.R | 7 ++++++-
 1 file changed, 6 insertions(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 0827f0f6..22456a56 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -3,6 +3,7 @@
 #' @param x An [`epichains`] object.
 #' @param ... Other parameters passed to [print()].
 #' @return Invisibly returns an [`epichains`]. Called for side-effects.
+#' @author James M. Azam
 #' @export
 print.epichains <- function(x, ...) {
   format(x, ...)
@@ -13,6 +14,7 @@ print.epichains <- function(x, ...) {
 #' @param x epichains object
 #' @param ... further arguments passed to or from other methods
 #' @return Invisibly returns an [`epichains`]. Called for printing side-effects.
+#' @author James M. Azam
 #' @export
 format.epichains <- function(x, ...) {
   # check that x is an epichains object
@@ -88,6 +90,7 @@ format.epichains <- function(x, ...) {
 #' @param ... further arguments passed to or from other methods
 #'
 #' @return data frame of information
+#' @author James M. Azam
 #' @export
 summary.epichains <- function(object, ...) {
   validate_epichains(object)
@@ -142,6 +145,7 @@ summary.epichains <- function(object, ...) {
 #' @return logical, `TRUE` if the object is an `epichains` and `FALSE`
 #' otherwise
 #' @keywords internal
+#' @author James M. Azam
 is_epichains <- function(x) {
   inherits(x, "epichains")
 }
@@ -151,6 +155,7 @@ is_epichains <- function(x) {
 #' @param x An [`epichains`] object
 #'
 #' @keywords internal
+#' @author James M. Azam
 is_epichains_aggregate_df <- function(x) {
   inherits(x, "epichains_aggregate_df")
 }
@@ -251,7 +256,7 @@ tail.epichains <- function(x, ...) {
 #' \code{grouping_var = "both"}, a list of data.frames, the first being for
 #'  cases over time, and the second being for cases over generations.
 #' @export
-#'
+#' @author James M. Azam
 #' @examples
 #' set.seed(123)
 #' chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,

From 5e59f02c7aaf7536a78b7060f8a7a28779766caf Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 18:24:50 +0100
Subject: [PATCH 496/828] Revised the function docs

---
 R/epichains.R | 10 +++++-----
 1 file changed, 5 insertions(+), 5 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 22456a56..339c76e1 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -138,7 +138,7 @@ summary.epichains <- function(object, ...) {
   return(res)
 }
 
-#' Checks whether the object is an `epichains`
+#' Reports whether x is an `epichains` object
 #'
 #' @param x An R object
 #'
@@ -150,10 +150,11 @@ is_epichains <- function(x) {
   inherits(x, "epichains")
 }
 
-#' Check if an object is of class "epichains_aggregate_df"
+#' Reports whether x is an "epichains_aggregate_df" object
 #'
 #' @param x An [`epichains`] object
-#'
+#' @return logical, `TRUE` if the object is an `epichains_aggregate_df` and
+#' `FALSE` otherwise
 #' @keywords internal
 #' @author James M. Azam
 is_epichains_aggregate_df <- function(x) {
@@ -164,8 +165,7 @@ is_epichains_aggregate_df <- function(x) {
 #'
 #' @param x An `epichains` object
 #'
-#' @return Checks if an object is of class `epichains` and if so
-#' checks that it's in the right format as a "data.frame" or vector.
+#' @return No return.
 #' @keywords internal
 #' @author James M. Azam
 validate_epichains <- function(x) {

From 36d7476c4a6876177254387a2bb4f92c022f1c69 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 18:39:38 +0100
Subject: [PATCH 497/828] Revised the README

---
 README.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index 7463d7d8..88ded55e 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -41,7 +41,7 @@ models are often used in infectious disease epidemiology, where the chains repre
 transmission, and the offspring distribution represents the distribution of 
 secondary infections caused by an infected individual. 
 
-_{{ packagename }}_ re-implements [bpmodels]("https://github.com/epiverse-trace/bpmodels/") by providing dedicated classes that allow easy manipulation and interoperability with other existing
+_{{ packagename }}_ re-implements [bpmodels]("https://github.com/epiverse-trace/bpmodels/") by providing dedicated data structures that allow easy manipulation and interoperability with other existing
 packages for handling transmission chain and contact-tracing data.
 
 _{{ packagename }}_ is developed at the [Centre for the Mathematical Modelling of Infectious Diseases](https://www.lshtm.ac.uk/research/centres/centre-mathematical-modelling-infectious-diseases) at the London School of Hygiene and Tropical Medicine as part of the [Epiverse Initiative](https://data.org/initiatives/epiverse/).

From 6920fe301a9e88241cd71c2a11b6c9d6a309b619 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 18:39:49 +0100
Subject: [PATCH 498/828] Added a references file

---
 vignettes/references.json | 794 ++++++++++++++++++++++++++++++++++++++
 1 file changed, 794 insertions(+)
 create mode 100644 vignettes/references.json

diff --git a/vignettes/references.json b/vignettes/references.json
new file mode 100644
index 00000000..dcbb4440
--- /dev/null
+++ b/vignettes/references.json
@@ -0,0 +1,794 @@
+[
+	{
+		"id": "abbott2020",
+		"type": "article-journal",
+		"container-title": "Wellcome open research",
+		"note": "publisher: The Wellcome Trust",
+		"title": "The transmissibility of novel Coronavirus in the early stages of the 2019-20 outbreak in Wuhan: Exploring initial point-source exposure sizes and durations using scenario analysis",
+		"volume": "5",
+		"author": [
+			{
+				"family": "Abbott",
+				"given": "Sam"
+			},
+			{
+				"family": "Hellewell",
+				"given": "Joel"
+			},
+			{
+				"family": "Munday",
+				"given": "James"
+			},
+			{
+				"family": "Funk",
+				"given": "Sebastian"
+			},
+			{
+				"family": "group",
+				"given": "CMMID",
+				"dropping-particle": "nCoV working"
+			},
+			{
+				"literal": "others"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "alene2021",
+		"type": "article-journal",
+		"abstract": "Background: Understanding the epidemiological parameters that determine the transmission dynamics of COVID-19 is essential for public health intervention. Globally, a number of studies were conducted to estimate the average serial interval and incubation period of COVID-19. Combining findings of existing studies that estimate the average serial interval and incubation period of COVID-19 significantly improves the quality of evidence. Hence, this study aimed to determine the overall average serial interval and incubation period of COVID-19. Methods: We followed the PRISMA checklist to present this study. A comprehensive search strategy was carried out from international electronic databases (Google Scholar, PubMed, Science Direct, Web of Science, CINAHL, and Cochrane Library) by two experienced reviewers (MAA and DBK) authors between the 1st of June and the 31st of July 2020. All observational studies either reporting the serial interval or incubation period in persons diagnosed with COVID-19 were included in this study. Heterogeneity across studies was assessed using the I2 and Higgins test. The NOS adapted for cross-sectional studies was used to evaluate the quality of studies. A random effect Meta-analysis was employed to determine the pooled estimate with 95% (CI). Microsoft Excel was used for data extraction and R software was used for analysis. Results: We combined a total of 23 studies to estimate the overall mean serial interval of COVID-19. The mean serial interval of COVID-19 ranged from 4. 2 to 7.5 days. Our meta-analysis showed that the weighted pooled mean serial interval of COVID-19 was 5.2 (95%CI: 4.9–5.5) days. Additionally, to pool the mean incubation period of COVID-19, we included 14 articles. The mean incubation period of COVID-19 also ranged from 4.8 to 9 days. Accordingly, the weighted pooled mean incubation period of COVID-19 was 6.5 (95%CI: 5.9–7.1) days. Conclusions: This systematic review and meta-analysis showed that the weighted pooled mean serial interval and incubation period of COVID-19 were 5.2, and 6.5 days, respectively. In this study, the average serial interval of COVID-19 is shorter than the average incubation period, which suggests that substantial numbers of COVID-19 cases will be attributed to presymptomatic transmission.",
+		"container-title": "BMC Infectious Diseases",
+		"DOI": "10.1186/s12879-021-05950-x",
+		"ISSN": "14712334",
+		"issue": "1",
+		"note": "publisher: BMC Infectious Diseases\nPMID: 33706702",
+		"page": "1–9",
+		"title": "Serial interval and incubation period of COVID-19: a systematic review and meta-analysis",
+		"volume": "21",
+		"author": [
+			{
+				"family": "Alene",
+				"given": "Muluneh"
+			},
+			{
+				"family": "Yismaw",
+				"given": "Leltework"
+			},
+			{
+				"family": "Assemie",
+				"given": "Moges Agazhe"
+			},
+			{
+				"family": "Ketema",
+				"given": "Daniel Bekele"
+			},
+			{
+				"family": "Gietaneh",
+				"given": "Wodaje"
+			},
+			{
+				"family": "Birhan",
+				"given": "Tilahun Yemanu"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2021"
+				]
+			]
+		}
+	},
+	{
+		"id": "allen2012",
+		"type": "article-journal",
+		"abstract": "The basic reproduction number, ℛ(0), one of the most well-known thresholds in deterministic epidemic theory, predicts a disease outbreak if ℛ(0)>1. In stochastic epidemic theory, there are also thresholds that predict a major outbreak. In the case of a single infectious group, if ℛ(0)>1 and i infectious individuals are introduced into a susceptible population, then the probability of a major outbreak is approximately 1-(1/ℛ(0))( i ). With multiple infectious groups from which the disease could emerge, this result no longer holds. Stochastic thresholds for multiple groups depend on the number of individuals within each group, i ( j ), j=1, \\ldots, n, and on the probability of disease extinction for each group, q ( j ). It follows from multitype branching processes that the probability of a major outbreak is approximately [Formula: see text]. In this investigation, we summarize some of the deterministic and stochastic threshold theory, illustrate how to calculate the stochastic thresholds, and derive some new relationships between the deterministic and stochastic thresholds.",
+		"container-title": "Journal of Biological Dynamics",
+		"DOI": "10.1080/17513758.2012.665502",
+		"ISSN": "17513758",
+		"issue": "2",
+		"page": "590–611",
+		"title": "Extinction thresholds in deterministic and stochastic epidemic models",
+		"volume": "6",
+		"author": [
+			{
+				"family": "Allen",
+				"given": "Linda J.S."
+			},
+			{
+				"family": "Lahodny",
+				"given": "Glenn E."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2012"
+				]
+			]
+		}
+	},
+	{
+		"id": "blumberg2013",
+		"type": "article-journal",
+		"abstract": "Many diseases exhibit subcritical transmission (i.e. 0<R0<1) so that infections occur as self-limited 'stuttering chains'. Given an ensemble of stuttering chains, information about the number of cases in each chain can be used to infer R0, which is of crucial importance for monitoring the risk that a disease will emerge to establish endemic circulation. However, the challenge of imperfect case detection has led authors to adopt a variety of work-around measures when inferring R0, such as discarding data on isolated cases or aggregating intermediate-sized chains together. Each of these methods has the potential to introduce bias, but a quantitative comparison of these approaches has not been reported. By adapting a model based on a negative binomial offspring distribution that permits a variable degree of transmission heterogeneity, we present a unified analysis of existing R0 estimation methods. Simulation studies show that the degree of transmission heterogeneity, when improperly modeled, can significantly impact the bias of R0 estimation methods designed for imperfect observation. These studies also highlight the importance of isolated cases in assessing whether an estimation technique is consistent with observed data. Analysis of data from measles outbreaks shows that likelihood scores are highest for models that allow a flexible degree of transmission heterogeneity. Aggregating intermediate sized chains often has similar performance to analyzing a complete chain size distribution. However, truncating isolated cases is beneficial only when surveillance systems clearly favor full observation of large chains but not small chains. Meanwhile, if data on the type and proportion of cases that are unobserved were known, we demonstrate that maximum likelihood inference of R0 could be adjusted accordingly. This motivates the need for future empirical and theoretical work to quantify observation error and incorporate relevant mechanisms into stuttering chain models used to estimate transmission parameters. © 2013 Elsevier B.V.",
+		"container-title": "Epidemics",
+		"DOI": "10.1016/j.epidem.2013.05.002",
+		"ISSN": "17554365",
+		"issue": "3",
+		"note": "publisher: Elsevier B.V.\nPMID: 24021520",
+		"page": "131–145",
+		"title": "Comparing methods for estimating R0 from the size distribution of subcritical transmission chains",
+		"URL": "http://dx.doi.org/10.1016/j.epidem.2013.05.002",
+		"volume": "5",
+		"author": [
+			{
+				"family": "Blumberg",
+				"given": "S."
+			},
+			{
+				"family": "Lloyd-Smith",
+				"given": "J. O."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2013"
+				]
+			]
+		}
+	},
+	{
+		"id": "blumberg2013a",
+		"type": "article-journal",
+		"abstract": "For many infectious disease processes such as emerging zoonoses and vaccine-preventable diseases, 0<R0<1 and infections occur as self-limited stuttering transmission chains. A mechanistic understanding of transmission is essential for characterizing the risk of emerging diseases and monitoring spatio-temporal dynamics. Thus methods for inferring R0 and the degree of heterogeneity in transmission from stuttering chain data have important applications in disease surveillance and management. Previous researchers have used chain size distributions to infer R0, but estimation of the degree of individual-level variation in infectiousness (as quantified by the dispersion parameter, k) has typically required contact tracing data. Utilizing branching process theory along with a negative binomial offspring distribution, we demonstrate how maximum likelihood estimation can be applied to chain size data to infer both R0 and the dispersion parameter that characterizes heterogeneity. While the maximum likelihood value for R0 is a simple function of the average chain size, the associated confidence intervals are dependent on the inferred degree of transmission heterogeneity. As demonstrated for monkeypox data from the Democratic Republic of Congo, this impacts when a statistically significant change in R0 is detectable. In addition, by allowing for superspreading events, inference of k shifts the threshold above which a transmission chain should be considered anomalously large for a given value of R0 (thus reducing the probability of false alarms about pathogen adaptation). Our analysis of monkeypox also clarifies the various ways that imperfect observation can impact inference of transmission parameters, and highlights the need to quantitatively evaluate whether observation is likely to significantly bias results.",
+		"container-title": "PLoS Computational Biology",
+		"DOI": "10.1371/journal.pcbi.1002993",
+		"ISSN": "15537358",
+		"issue": "5",
+		"note": "PMID: 23658504",
+		"page": "1–17",
+		"title": "Inference of R0 and Transmission Heterogeneity from the Size Distribution of Stuttering Chains",
+		"volume": "9",
+		"author": [
+			{
+				"family": "Blumberg",
+				"given": "Seth"
+			},
+			{
+				"family": "Lloyd-Smith",
+				"given": "James O."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2013"
+				]
+			]
+		}
+	},
+	{
+		"id": "chen2022",
+		"type": "article-journal",
+		"abstract": "The generation time distribution, reflecting the time between successive infections in transmission chains, is a key epidemiological parameter for describing COVID-19 transmission dynamics. However, because exact infection times are rarely known, it is often approximated by the serial interval distribution. This approximation holds under the assumption that infectors and infectees share the same incubation period distribution, which may not always be true. We estimated incubation period and serial interval distributions using 629 transmission pairs reconstructed by investigating 2989 confirmed cases in China in January-February 2020, and developed an inferential framework to estimate the generation time distribution that accounts for variation over time due to changes in epidemiology, sampling biases and public health and social measures. We identified substantial reductions over time in the serial interval and generation time distributions. Our proposed method provides more reliable estimation of the temporal variation in the generation time distribution, improving assessment of transmission dynamics.",
+		"container-title": "Nature Communications",
+		"DOI": "10.1038/s41467-022-35496-8",
+		"ISSN": "20411723",
+		"issue": "1",
+		"note": "publisher: Springer US",
+		"title": "Inferring time-varying generation time, serial interval, and incubation period distributions for COVID-19",
+		"volume": "13",
+		"author": [
+			{
+				"family": "Chen",
+				"given": "Dongxuan"
+			},
+			{
+				"family": "Lau",
+				"given": "Yiu Chung"
+			},
+			{
+				"family": "Xu",
+				"given": "Xiao Ke"
+			},
+			{
+				"family": "Wang",
+				"given": "Lin"
+			},
+			{
+				"family": "Du",
+				"given": "Zhanwei"
+			},
+			{
+				"family": "Tsang",
+				"given": "Tim K."
+			},
+			{
+				"family": "Wu",
+				"given": "Peng"
+			},
+			{
+				"family": "Lau",
+				"given": "Eric H.Y."
+			},
+			{
+				"family": "Wallinga",
+				"given": "Jacco"
+			},
+			{
+				"family": "Cowling",
+				"given": "Benjamin J."
+			},
+			{
+				"family": "Ali",
+				"given": "Sheikh Taslim"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2022"
+				]
+			]
+		}
+	},
+	{
+		"id": "farrington1999",
+		"type": "article-journal",
+		"abstract": "We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.",
+		"container-title": "Journal of Applied Probability",
+		"DOI": "10.1239/jap/1032374633",
+		"ISSN": "00219002",
+		"issue": "3",
+		"page": "771–779",
+		"title": "The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease",
+		"volume": "36",
+		"author": [
+			{
+				"family": "Farrington",
+				"given": "C. P."
+			},
+			{
+				"family": "Grant",
+				"given": "A. D."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"1999"
+				]
+			]
+		}
+	},
+	{
+		"id": "farrington2003",
+		"type": "article-journal",
+		"abstract": "Mass vaccination programmes aim to maintain the effective reproduction number R of an infection below unity. We describe methods for monitoring the value of R using surveillance data. The models are based on branching processes in which R is identified with the offspring mean. We derive unconditional likelihoods for the offspring mean using data on outbreak size and outbreak duration. We also discuss Bayesian methods, implemented by Metropolis-Hastings sampling. We investigate by simulation the validity of the models with respect to depletion of susceptibles and under-ascertainment of cases. The methods are illustrated using surveillance data on measles in the USA.",
+		"container-title": "Biostatistics (Oxford, England)",
+		"DOI": "10.1093/biostatistics/4.2.279",
+		"ISSN": "14654644",
+		"issue": "2",
+		"page": "279–295",
+		"title": "Branching process models for surveillance of infectious diseases controlled by mass vaccination.",
+		"volume": "4",
+		"author": [
+			{
+				"family": "Farrington",
+				"given": "C. P."
+			},
+			{
+				"family": "Kanaan",
+				"given": "M. N."
+			},
+			{
+				"family": "Gay",
+				"given": "N. J."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2003"
+				]
+			]
+		}
+	},
+	{
+		"id": "fine2003",
+		"type": "article-journal",
+		"abstract": "The interval between successive cases of an infectious disease is determined by the time from infection to infectiousness, the duration of infectiousness, the time from infection to disease onset (incubation period), the duration of any extra-human phase of the infectious agent, and the proportion clinically affected among infected individuals. The interval is important in the interpretation of infectious disease surveillance and trend data, in the identification of outbreaks, and in the optimization of quarantine and contact tracing. This paper discusses the properties of these intervals, as measured between transmission events or between clinical onsets of successive infected individuals, noting the determinants of their ranges and frequency distributions, the circumstances under which secondary cases may arise before primaries, and under which the infection transmission interval will be different from the interval between clinical onsets of successive cases. It discusses the derivation of interval distribution statistics from descriptive data given in standard textbooks, with illustrations from published data on outbreaks, households, and epidemiologic tracing. Finally, it discusses the implications of such measures for studies of secondary attack rates, for the persistence of infection in human communities, for outbreak response, and for elimination or eradication programs.",
+		"container-title": "American Journal of Epidemiology",
+		"DOI": "10.1093/aje/kwg251",
+		"ISSN": "00029262",
+		"issue": "11",
+		"note": "ISBN: 0002-9262 (Print) 0002-9262 (Linking)\nPMID: 14630599",
+		"page": "1039–1047",
+		"title": "The Interval between Successive Cases of an Infectious Disease",
+		"volume": "158",
+		"author": [
+			{
+				"family": "Fine",
+				"given": "Paul E.M."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2003"
+				]
+			]
+		}
+	},
+	{
+		"id": "grassly2006",
+		"type": "article-journal",
+		"abstract": "Seasonal change in the incidence of infectious diseases is a common phenomenon in both temperate and tropical climates. However, the mechanisms responsible for seasonal disease incidence, and the epidemiological consequences of seasonality, are poorly understood with rare exception. Standard epidemiological theory and concepts such as the basic reproductive number R 0 no longer apply, and the implications for interventions that themselves may be periodic, such as pulse vaccination, have not been formally examined. This paper examines the causes and consequences of seasonality, and in so doing derives several new results concerning vaccination strategy and the interpretation of disease outbreak data. It begins with a brief review of published scientific studies in support of different causes of seasonality in infectious diseases of humans, identifying four principal mechanisms and their association with different routes of transmission. It then describes the consequences of seasonality for R 0 , disease outbreaks, endemic dynamics and persistence. Finally, a mathematical analysis of routine and pulse vaccination programmes for seasonal infections is presented. The synthesis of seasonal infectious disease epidemiology attempted by this paper highlights the need for further empirical and theoretical work. © 2006 The Royal Society.",
+		"container-title": "Proceedings of the Royal Society B: Biological Sciences",
+		"DOI": "10.1098/rspb.2006.3604",
+		"ISSN": "14712970",
+		"issue": "1600",
+		"page": "2541–2550",
+		"title": "Seasonal infectious disease epidemiology",
+		"volume": "273",
+		"author": [
+			{
+				"family": "Grassly",
+				"given": "Nicholas C."
+			},
+			{
+				"family": "Fraser",
+				"given": "Christophe"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2006"
+				]
+			]
+		}
+	},
+	{
+		"id": "griffin2020",
+		"type": "article-journal",
+		"abstract": "The serial interval is the time between symptom onsets in an infector-infectee pair. The generation time, also known as the generation interval, is the time between infection events in an infector-infectee pair. The serial interval and the generation time are key parameters for assessing the dynamics of a disease. A number of scientific papers reported information pertaining to the serial interval and/or generation time for COVID-19. Objective Conduct a review of available evidence to advise on appropriate parameter values for serial interval and generation time in national COVID-19 transmission models for Ireland and on methodological issues relating to those parameters. Methods We conducted a rapid review of the literature covering the period 1 January 2020 and 21 August 2020, following predefined eligibility criteria. Forty scientific papers met our inclusion criteria and were included in the review. Results The mean of the serial interval ranged from 3.03 to 7.6 days, based on 38 estimates, and the median from 1.0 to 6.0 days (based on 15 estimates). Only three estimates were provided for the mean of the generation time. These ranged from 3.95 to 5.20 days. One estimate of 5.0 days was provided for the median of the generation time. Discussion Estimates of the serial interval and the generation time are very dependent on the specific factors that apply at the time that the data are collected, including the level of social contact. Consequently, the estimates may not be entirely relevant to other environments. Therefore, local estimates should be obtained as soon as possible. Careful consideration should be given to the methodology that is used. Real-time estimations of the serial interval/generation time, allowing for variations over time, may provide more accurate estimates of reproduction numbers than using conventionally fixed serial interval/generation time distributions.",
+		"container-title": "BMJ Open",
+		"DOI": "10.1136/bmjopen-2020-040263",
+		"ISSN": "20446055",
+		"issue": "11",
+		"note": "ISBN: 9789241512763\nPMID: 33234640",
+		"page": "1–9",
+		"title": "Rapid review of available evidence on the serial interval and generation time of COVID-19",
+		"volume": "10",
+		"author": [
+			{
+				"family": "Griffin",
+				"given": "John"
+			},
+			{
+				"family": "Casey",
+				"given": "Miriam"
+			},
+			{
+				"family": "Collins",
+				"given": "Áine"
+			},
+			{
+				"family": "Hunt",
+				"given": "Kevin"
+			},
+			{
+				"family": "McEvoy",
+				"given": "David"
+			},
+			{
+				"family": "Byrne",
+				"given": "Andrew"
+			},
+			{
+				"family": "McAloon",
+				"given": "Conor"
+			},
+			{
+				"family": "Barber",
+				"given": "Ann"
+			},
+			{
+				"family": "Lane",
+				"given": "Elizabeth Ann"
+			},
+			{
+				"family": "More",
+				"given": "Simon"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "jacob2010",
+		"type": "article-journal",
+		"abstract": "Branching processes are stochastic individual-based processes leading consequently to a bottom-up approach. In addition, since the state variables are random integer variables (representing population sizes), the extinction occurs at random finite time on the extinction set, thus leading to fine and realistic predictions. Starting from the simplest and well-known single-type Bienaymé-Galton-Watson branching process that was used by several authors for approximating the beginning of an epidemic, we then present a general branching model with age and population dependent individual transitions. However contrary to the classical Bienaymé-Galton-Watson or asymptotically Bienaymé-Galton-Watson setting, where the asymptotic behavior of the process, as time tends to infinity, is well understood, the asymptotic behavior of this general process is a new question. Here we give some solutions for dealing with this problem depending on whether the initial population size is large or small, and whether the disease is rare or non-rare when the initial population size is large.",
+		"container-title": "International Journal of Environmental Research and Public Health",
+		"DOI": "10.3390/ijerph7031204",
+		"ISSN": "16604601",
+		"issue": "3",
+		"page": "1186–1204",
+		"title": "Branching processes: Their role in epidemiology",
+		"volume": "7",
+		"author": [
+			{
+				"family": "Jacob",
+				"given": "Christine"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2010"
+				]
+			]
+		}
+	},
+	{
+		"id": "lehtinen2021",
+		"type": "article-journal",
+		"abstract": "The timing of transmission plays a key role in the dynamics and controllability of an epidemic. However, observing generation times - the time interval between the infection of an infector and an infectee in a transmission pair - requires data on infection times, which are generally unknown. The timing of symptom onset is more easily observed; generation times are therefore often estimated based on serial intervals - the time interval between symptom onset of an infector and an infectee. This estimation follows one of two approaches: (i) approximating the generation time distribution by the serial interval distribution or (ii) deriving the generation time distribution from the serial interval and incubation period - the time interval between infection and symptom onset in a single individual - distributions. These two approaches make different - and not always explicitly stated - assumptions about the relationship between infectiousness and symptoms, resulting in different generation time distributions with the same mean but unequal variances. Here, we clarify the assumptions that each approach makes and show that neither set of assumptions is plausible for most pathogens. However, the variances of the generation time distribution derived under each assumption can reasonably be considered as upper (approximation with serial interval) and lower (derivation from serial interval) bounds. Thus, we suggest a pragmatic solution is to use both approaches and treat these as edge cases in downstream analysis. We discuss the impact of the variance of the generation time distribution on the controllability of an epidemic through strategies based on contact tracing, and we show that underestimating this variance is likely to overestimate controllability.",
+		"container-title": "Journal of the Royal Society Interface",
+		"DOI": "10.1098/rsif.2020.0756",
+		"ISSN": "17425662",
+		"issue": "174",
+		"note": "PMID: 33402022",
+		"title": "On the relationship between serial interval, infectiousness profile and generation time: On the relationship between serial interval, infectiousness profile and generation time",
+		"volume": "18",
+		"author": [
+			{
+				"family": "Lehtinen",
+				"given": "Sonja"
+			},
+			{
+				"family": "Ashcroft",
+				"given": "Peter"
+			},
+			{
+				"family": "Bonhoeffer",
+				"given": "Sebastian"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2021"
+				]
+			]
+		}
+	},
+	{
+		"id": "limpert2001",
+		"type": "article-journal",
+		"abstract": "On the charms of statistics, and how mechanical models resembling gambling machines offer a link to a handy way to characterize log-normal distributions, which can provide deeper insight into variability and probability - Normal or log-normal: That is the question.",
+		"container-title": "BioScience",
+		"DOI": "10.1641/0006-3568(2001)051[0341:LNDATS]2.0.CO;2",
+		"ISSN": "00063568",
+		"issue": "5",
+		"page": "341–352",
+		"title": "Log-normal distributions across the sciences: Keys and clues",
+		"volume": "51",
+		"author": [
+			{
+				"family": "Limpert",
+				"given": "Eckhard"
+			},
+			{
+				"family": "Stahel",
+				"given": "Werner A."
+			},
+			{
+				"family": "Abbt",
+				"given": "Markus"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2001"
+				]
+			]
+		}
+	},
+	{
+		"id": "lloyd-smith2005",
+		"type": "article-journal",
+		"abstract": "Population-level analyses often use average quantities to describe heterogeneous systems, particularly when variation does not arise from identifiable groups. A prominent example, central to our current understanding of epidemic spread, is the basic reproductive number, R0, which is defined as the mean number of infections caused by an infected individual in a susceptible population. Population estimates of R0 can obscure considerable individual variation in infectiousness, as highlighted during the global emergence of severe acute respiratory syndrome (SARS) by numerous 'superspreading events' in which certain individuals infected unusually large numbers of secondary cases. For diseases transmitted by non-sexual direct contacts, such as SARS or smallpox, individual variation is difficult to measure empirically, and thus its importance for outbreak dynamics has been unclear. Here we present an integrated theoretical and statistical analysis of the influence of individual variation in infectiousness on disease emergence. Using contact tracing data from eight directly transmitted diseases, we show that the distribution of individual infectiousness around R0 is often highly skewed. Model predictions accounting for this variation differ sharply from average-based approaches, with disease extinction more likely and outbreaks rarer but more explosive. Using these models, we explore implications for outbreak control, showing that individual-specific control measures outperform population-wide measures. Moreover, the dramatic improvements achieved through targeted control policies emphasize the need to identify predictive correlates of higher infectiousness. Our findings indicate that superspreading is a normal feature of disease spread, and to frame ongoing discussion we propose a rigorous definition for superspreading events and a method to predict their frequency. © 2005 Nature Publishing Group.",
+		"container-title": "Nature",
+		"DOI": "10.1038/nature04153",
+		"ISSN": "14764687",
+		"issue": "7066",
+		"note": "PMID: 16292310",
+		"page": "355–359",
+		"title": "Superspreading and the effect of individual variation on disease emergence",
+		"volume": "438",
+		"author": [
+			{
+				"family": "Lloyd-Smith",
+				"given": "J. O."
+			},
+			{
+				"family": "Schreiber",
+				"given": "S. J."
+			},
+			{
+				"family": "Kopp",
+				"given": "P. E."
+			},
+			{
+				"family": "Getz",
+				"given": "W. M."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2005"
+				]
+			]
+		}
+	},
+	{
+		"id": "marivate2020",
+		"type": "article-journal",
+		"container-title": "arXiv preprint arXiv:2004.04813",
+		"title": "Use of available data to inform the COVID-19 outbreak in South Africa: a case study",
+		"author": [
+			{
+				"family": "Marivate",
+				"given": "Vukosi"
+			},
+			{
+				"family": "Combrink",
+				"given": "Herkulaas MvE"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "nishiura2007",
+		"type": "article-journal",
+		"abstract": "The incubation period of infectious diseases, the time from infection with a microorganism to onset of disease, is directly relevant to prevention and control. Since explicit models of the incubation period enhance our understanding of the spread of disease, previous classic studies were revisited, focusing on the modeling methods employed and paying particular attention to relatively unknown historical efforts. The earliest study on the incubation period of pandemic influenza was published in 1919, providing estimates of the incubation period of Spanish flu using the daily incidence on ships departing from several ports in Australia. Although the study explicitly dealt with an unknown time of exposure, the assumed periods of exposure, which had an equal probability of infection, were too long, and thus, likely resulted in slight underestimates of the incubation period. After the suggestion that the incubation period follows lognormal distribution, Japanese epidemiologists extended this assumption to estimates of the time of exposure during a point source outbreak. Although the reason why the incubation period of acute infectious diseases tends to reveal a right-skewed distribution has been explored several times, the validity of the lognormal assumption is yet to be fully clarified. At present, various different distributions are assumed, and the lack of validity in assuming lognormal distribution is particularly apparent in the case of slowly progressing diseases. The present paper indicates that (1) analysis using well-defined short periods of exposure with appropriate statistical methods is critical when the exact time of exposure is unknown, and (2) when assuming a specific distribution for the incubation period, comparisons using different distributions are needed in addition to estimations using different datasets, analyses of the determinants of incubation period, and an understanding of the underlying disease mechanisms. © 2007 Nishiura; licensee BioMed Central Ltd.",
+		"container-title": "Emerging Themes in Epidemiology",
+		"DOI": "10.1186/1742-7622-4-2",
+		"ISSN": "17427622",
+		"page": "1–12",
+		"title": "Early efforts in modeling the incubation period of infectious diseases with an acute course of illness",
+		"volume": "4",
+		"author": [
+			{
+				"family": "Nishiura",
+				"given": "Hiroshi"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2007"
+				]
+			]
+		}
+	},
+	{
+		"id": "nishiura2012",
+		"type": "article-journal",
+		"abstract": "Use of the final size distribution of minor outbreaks for the estimation of the reproduction numbers of supercritical epidemic processes has yet to be considered. We used a branching process model to derive the final size distribution of minor outbreaks, assuming a reproduction number above unity, and applying the method to final size data for pneumonic plague. Pneumonic plague is a rare disease with only one documented major epidemic in a spatially limited setting. Because the final size distribution of a minor outbreak needs to be normalized by the probability of extinction, we assume that the dispersion parameter (k) of the negative-binomial offspring distribution is known, and examine the sensitivity of the reproduction number to variation in dispersion. Assuming a geometric offspring distribution with k=1, the reproduction number was estimated at 1.16 (95% confidence interval: 0.97-1.38). When less dispersed with k=2, the maximum likelihood estimate of the reproduction number was 1.14. These estimates agreed with those published from transmission network analysis, indicating that the human-to-human transmission potential of the pneumonic plague is not very high. Given only minor outbreaks, transmission potential is not sufficiently assessed by directly counting the number of offspring. Since the absence of a major epidemic does not guarantee a subcritical process, the proposed method allows us to conservatively regard epidemic data from minor outbreaks as supercritical, and yield estimates of threshold values above unity. © 2011.",
+		"container-title": "Journal of Theoretical Biology",
+		"DOI": "10.1016/j.jtbi.2011.10.039",
+		"ISSN": "00225193",
+		"note": "publisher: Elsevier\nPMID: 22079419",
+		"page": "48–55",
+		"title": "Estimating the transmission potential of supercritical processes based on the final size distribution of minor outbreaks",
+		"URL": "http://dx.doi.org/10.1016/j.jtbi.2011.10.039",
+		"volume": "294",
+		"author": [
+			{
+				"family": "Nishiura",
+				"given": "Hiroshi"
+			},
+			{
+				"family": "Yan",
+				"given": "Ping"
+			},
+			{
+				"family": "Sleeman",
+				"given": "Candace K."
+			},
+			{
+				"family": "Mode",
+				"given": "Charles J."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2012"
+				]
+			]
+		}
+	},
+	{
+		"id": "pearson2020",
+		"type": "article-journal",
+		"abstract": "For 45 African countries/territories already reporting COVID-19 cases before 23 March 2020, we estimate the dates of reporting 1,000 and 10,000 cases. Assuming early epidemic trends without interventions, all 45 were likely to exceed 1,000 confirmed cases by the end of April 2020, with most exceeding 10,000 a few weeks later.",
+		"container-title": "Eurosurveillance",
+		"DOI": "10.2807/1560-7917.ES.2020.25.18.2000543",
+		"ISSN": "15607917",
+		"issue": "18",
+		"note": "publisher: European Centre for Disease Prevention and Control (ECDC)\nPMID: 32400361",
+		"page": "1–6",
+		"title": "Projected early spread of COVID-19 in Africa through 1 June 2020",
+		"URL": "http://dx.doi.org/10.2807/1560-7917.ES.2020.25.18.2000543",
+		"volume": "25",
+		"author": [
+			{
+				"family": "Pearson",
+				"given": "Carl A.B."
+			},
+			{
+				"family": "Schalkwyk",
+				"given": "Cari",
+				"non-dropping-particle": "van"
+			},
+			{
+				"family": "Foss",
+				"given": "Anna M."
+			},
+			{
+				"family": "O'Reilly",
+				"given": "Kathleen M."
+			},
+			{
+				"family": "Pulliam",
+				"given": "Juliet R.C."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "becker1977",
+		"type": "article-journal",
+		"container-title": "Biometrics",
+		"ISSN": "0006-341X",
+		"issue": "3",
+		"note": "publisher: JSTOR",
+		"page": "515–522",
+		"title": "Estimation for discrete time branching processes with application to epidemics",
+		"volume": "33",
+		"author": [
+			{
+				"family": "Becker",
+				"given": "Niels"
+			},
+			{
+				"family": "Society",
+				"given": "International Biometric"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"1977"
+				]
+			]
+		}
+	},
+	{
+		"id": "wang2020",
+		"type": "article-journal",
+		"abstract": "Coronavirus disease 2019 (COVID-19) was first identified in late 2019 in Wuhan, Hubei Province, China and spread globally in months, sparking worldwide concern. However, it is unclear whether super-spreading events occurred during the early outbreak phase, as has been observed for other emerging viruses. Here, we analyse 208 publicly available SARS-CoV-2 genome sequences collected during the early outbreak phase. We combine phylogenetic analysis with Bayesian inference under an epidemiological model to trace person-to-person transmission. The dispersion parameter of the offspring distribution in the inferred transmission chain was estimated to be 0.23 (95% CI: 0.13–0.38), indicating there are individuals who directly infected a disproportionately large number of people. Our results showed that super-spreading events played an important role in the early stage of the COVID-19 outbreak.",
+		"container-title": "Nature Communications",
+		"DOI": "10.1038/s41467-020-18836-4",
+		"ISSN": "20411723",
+		"issue": "1",
+		"note": "publisher: Springer US\nPMID: 33024095",
+		"page": "1–6",
+		"title": "Inference of person-to-person transmission of COVID-19 reveals hidden super-spreading events during the early outbreak phase",
+		"URL": "http://dx.doi.org/10.1038/s41467-020-18836-4",
+		"volume": "11",
+		"author": [
+			{
+				"family": "Wang",
+				"given": "Liang"
+			},
+			{
+				"family": "Didelot",
+				"given": "Xavier"
+			},
+			{
+				"family": "Yang",
+				"given": "Jing"
+			},
+			{
+				"family": "Wong",
+				"given": "Gary"
+			},
+			{
+				"family": "Shi",
+				"given": "Yi"
+			},
+			{
+				"family": "Liu",
+				"given": "Wenjun"
+			},
+			{
+				"family": "Gao",
+				"given": "George F."
+			},
+			{
+				"family": "Bi",
+				"given": "Yuhai"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "yadav2021",
+		"type": "article-journal",
+		"abstract": "In this review, we have discussed the different statistical modeling and prediction techniques for various infectious diseases including the recent pandemic of COVID-19. The distribution fitting, time series modeling along with predictive monitoring approaches, and epidemiological modeling are illustrated. When the epidemiology data is sufficient to fit with the required sample size, the normal distribution in general or other theoretical distributions are fitted and the best-fitted distribution is chosen for the prediction of the spread of the disease. The infectious diseases develop over time and we have data on the single variable that is the number of infections that happened, therefore, time series models are fitted and the prediction is done based on the best-fitted model. Monitoring approaches may also be applied to time series models which could estimate the parameters more precisely. In epidemiological modeling, more biological parameters are incorporated in the models and the forecasting of the disease spread is carried out. We came up with, how to improve the existing modeling methods, the use of fuzzy variables, and detection of fraud in the available data. Ultimately, we have reviewed the results of recent statistical modeling efforts to predict the course of COVID-19 spread.",
+		"container-title": "Frontiers in Public Health",
+		"DOI": "10.3389/fpubh.2021.645405",
+		"ISSN": "22962565",
+		"issue": "June",
+		"note": "PMID: 34222166",
+		"page": "1–27",
+		"title": "Statistical Modeling for the Prediction of Infectious Disease Dissemination With Special Reference to COVID-19 Spread",
+		"volume": "9",
+		"author": [
+			{
+				"family": "Yadav",
+				"given": "Subhash Kumar"
+			},
+			{
+				"family": "Akhter",
+				"given": "Yusuf"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2021"
+				]
+			]
+		}
+	}
+]
\ No newline at end of file

From 291aa29751fc329f69a78b9aea063636d8361deb Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 18:40:05 +0100
Subject: [PATCH 499/828] Generated the MD version of the README

---
 README.md | 34 +++++++++++++++++-----------------
 1 file changed, 17 insertions(+), 17 deletions(-)

diff --git a/README.md b/README.md
index 5af3535b..bd188d1b 100644
--- a/README.md
+++ b/README.md
@@ -5,7 +5,7 @@
 <!-- `packagename` is extracted from the DESCRIPTION file -->
 <!-- `gh_repo` is extracted via a special environment variable in GitHub Actions -->
 
-# epichains: Methods for analysing the size and length of transmission chains from branching process models <img src="man/figures/epichains_logo.png" align="right" height="130" />
+# *{{ packagename }}*: Methods for simulating and analysing the size and length of transmission chains from branching process models <img src="man/figures/epichains_logo.png" align="right" height="130" />
 
 <!-- badges: start -->
 
@@ -21,34 +21,34 @@ MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.or
 experimental](https://img.shields.io/badge/lifecycle-experimental-orange.svg)](https://lifecycle.r-lib.org/articles/stages.html#experimental)
 <!-- badges: end -->
 
-epichains is an R package to simulate, analyse, and visualize the size
-and length of branching processes with a given offspring distribution.
-These models are often used in infectious disease epidemiology, where
-the chains represent chains of transmission, and the offspring
-distribution represents the distribution of secondary infections caused
-by an infected individual.
+*{{ packagename }}* is an R package to simulate, analyse, and visualize
+the size and length of branching processes with a given offspring
+distribution. These models are often used in infectious disease
+epidemiology, where the chains represent chains of transmission, and the
+offspring distribution represents the distribution of secondary
+infections caused by an infected individual.
 
-epichains re-implements
+*{{ packagename }}* re-implements
 [bpmodels](%22https://github.com/epiverse-trace/bpmodels/%22) by
-providing dedicated classes that allow easy manipulation and
+providing dedicated data structures that allow easy manipulation and
 interoperability with other existing packages for handling transmission
 chain and contact-tracing data.
 
-epichains is developed at the [Centre for the Mathematical Modelling of
-Infectious
+*{{ packagename }}* is developed at the [Centre for the Mathematical
+Modelling of Infectious
 Diseases](https://www.lshtm.ac.uk/research/centres/centre-mathematical-modelling-infectious-diseases)
 at the London School of Hygiene and Tropical Medicine as part of the
 [Epiverse Initiative](https://data.org/initiatives/epiverse/).
 
 # Installation
 
-The latest development version of the epichains package can be installed
-via
+The latest development version of the *{{ packagename }}* package can be
+installed via
 
 ``` r
 # check whether {pak} is installed
 if(!require("pak")) install.packages("pak")
-pak::pak("epiverse-trace/epichains")
+pak::pak("{{ gh_repo }}")
 ```
 
 To load the package, use
@@ -63,7 +63,7 @@ Work in progress
 
 ## Package vignettes
 
-Specific use cases of epichains can be found in the [online
+Specific use cases of *{{ packagename }}* can be found in the [online
 documentation as package
 vignettes](https://epiverse-trace.github.io/epichains/), under
 “Articles”.
@@ -81,8 +81,8 @@ guide](https://github.com/epiverse-trace/epichains/blob/main/.github/CONTRIBUTIN
 
 ## Code of conduct
 
-Please note that the epichains project is released with a [Contributor
-Code of
+Please note that the *{{ packagename }}* project is released with a
+[Contributor Code of
 Conduct](https://github.com/epiverse-trace/.github/blob/main/CODE_OF_CONDUCT.md).
 By contributing to this project, you agree to abide by its terms.
 

From a40af0f6874555f47cf4391ce0aaaaa1b87a95c7 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 18:40:26 +0100
Subject: [PATCH 500/828] Fixed an error resulting from the author list

---
 DESCRIPTION | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/DESCRIPTION b/DESCRIPTION
index dc24e208..f915520d 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -4,13 +4,14 @@ Title: Simulating and Analysing Transmission Chain Statistics Using Branching Pr
 Version: 0.1.0
 Authors@R: c(
     person("James M.", "Azam", , "james.azam@lshtm.ac.uk", role = c("aut", "cre"),
-           comment = c(ORCID = "https://orcid.org/0000-0001-5782-7330"))),
+           comment = c(ORCID = "https://orcid.org/0000-0001-5782-7330")),
     person("Zhian N.", "Kamvar", , "zkamvar@gmail.com", role = "ctb",
            comment = c(ORCID = "https://orcid.org/0000-0003-1458-7108")),
     person("Flavio", "Finger", , "flavio.finger@epicentre.msf.org", role = "aut",
            comment = c(ORCID = "https://orcid.org/0000-0002-8613-5170")),
     person("Sebastian", "Funk", , "sebastian.funk@lshtm.ac.uk", role = "aut",
            comment = c(ORCID = "https://orcid.org/0000-0002-2842-3406"))
+           )
 Description: Provides methods to simulate and analyse the size and length
     of branching processes with an arbitrary offspring distribution. These
     can be used, for example, to analyse the distribution of chain sizes

From 47e63667af3fa9ee7a1811dd9f7ac89613723cb4 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 18:41:17 +0100
Subject: [PATCH 501/828] Now exporting the "is_" family of internal functions

---
 R/epichains.R | 10 +++++-----
 1 file changed, 5 insertions(+), 5 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 339c76e1..c8d92653 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -144,7 +144,7 @@ summary.epichains <- function(object, ...) {
 #'
 #' @return logical, `TRUE` if the object is an `epichains` and `FALSE`
 #' otherwise
-#' @keywords internal
+#' @export
 #' @author James M. Azam
 is_epichains <- function(x) {
   inherits(x, "epichains")
@@ -155,7 +155,7 @@ is_epichains <- function(x) {
 #' @param x An [`epichains`] object
 #' @return logical, `TRUE` if the object is an `epichains_aggregate_df` and
 #' `FALSE` otherwise
-#' @keywords internal
+#' @export
 #' @author James M. Azam
 is_epichains_aggregate_df <- function(x) {
   inherits(x, "epichains_aggregate_df")
@@ -166,7 +166,7 @@ is_epichains_aggregate_df <- function(x) {
 #' @param x An `epichains` object
 #'
 #' @return No return.
-#' @keywords internal
+#' @export
 #' @author James M. Azam
 validate_epichains <- function(x) {
   if (!is_epichains(x)) {
@@ -203,7 +203,7 @@ validate_epichains <- function(x) {
 #'
 #' @param x An [`epichains`] object
 #'
-#' @keywords internal
+#' @export
 #' @author James M. Azam
 is_chains_tree <- function(x) {
   !is.null(attributes(x)$chain_type) &&
@@ -214,7 +214,7 @@ is_chains_tree <- function(x) {
 #'
 #' @param x An [`epichains`] object
 #'
-#' @keywords internal
+#' @export
 #' @author James M. Azam
 is_chains_vec <- function(x) {
   !is.null(attributes(x)$chain_type) &&

From 2567a58799754d854ead0477b8d14d18be19be8f Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 18:41:32 +0100
Subject: [PATCH 502/828] Fixed some linting issues

---
 R/epichains.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index c8d92653..e0f653a8 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -121,10 +121,10 @@ summary.epichains <- function(object, ...) {
   } else if (is_chains_vec(object)) {
     chains_ran <- length(object)
 
-    if(!all(is.infinite(object))){
+    if (!all(is.infinite(object))) {
     max_chain_stat <- max(object[!is.infinite(object)])
     min_chain_stat <- min(object[!is.infinite(object)])
-    }else{
+    } else {
     max_chain_stat <- min_chain_stat <- Inf
     }
 

From a02d5ffe6e6817569cfe6098d8a97e62e33e96d9 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 26 Jul 2023 18:42:04 +0100
Subject: [PATCH 503/828] Regenerated the function .Rd files and NAMESPACE

---
 NAMESPACE                         |  5 +++++
 man/aggregate.epichains.Rd        |  3 +++
 man/check_chain_tree_attribute.Rd | 15 ---------------
 man/check_nchains_valid.Rd        |  4 ++--
 man/epichains-package.Rd          |  6 +++---
 man/format.epichains.Rd           |  3 +++
 man/is_chains_tree.Rd             | 17 +++++++++++++++++
 man/is_chains_vec.Rd              | 17 +++++++++++++++++
 man/is_epichains.Rd               |  8 +++++---
 man/is_epichains_aggregate_df.Rd  | 12 +++++++++---
 man/print.epichains.Rd            |  3 +++
 man/summary.epichains.Rd          |  3 +++
 man/validate_epichains.Rd         |  4 +---
 13 files changed, 71 insertions(+), 29 deletions(-)
 delete mode 100644 man/check_chain_tree_attribute.Rd
 create mode 100644 man/is_chains_tree.Rd
 create mode 100644 man/is_chains_vec.Rd

diff --git a/NAMESPACE b/NAMESPACE
index e31e4230..6e29ae0b 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -8,11 +8,16 @@ S3method(summary,epichains)
 S3method(tail,epichains)
 export(dborel)
 export(estimate_likelihood)
+export(is_chains_tree)
+export(is_chains_vec)
+export(is_epichains)
+export(is_epichains_aggregate_df)
 export(rborel)
 export(rnbinom_mean_disp)
 export(simulate_tree)
 export(simulate_tree_from_pop)
 export(simulate_vect)
+export(validate_epichains)
 importFrom(stats,aggregate)
 importFrom(utils,head)
 importFrom(utils,tail)
diff --git a/man/aggregate.epichains.Rd b/man/aggregate.epichains.Rd
index df7ef62c..a960a7e9 100644
--- a/man/aggregate.epichains.Rd
+++ b/man/aggregate.epichains.Rd
@@ -38,3 +38,6 @@ aggregate(chains, grouping_var = "generation")
 # Aggregate cases per both time and generation
 aggregate(chains, grouping_var = "both")
 }
+\author{
+James M. Azam
+}
diff --git a/man/check_chain_tree_attribute.Rd b/man/check_chain_tree_attribute.Rd
deleted file mode 100644
index c0156936..00000000
--- a/man/check_chain_tree_attribute.Rd
+++ /dev/null
@@ -1,15 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/checks.R
-\name{check_chain_tree_attribute}
-\alias{check_chain_tree_attribute}
-\title{Title}
-\usage{
-check_chain_tree_attribute(x)
-}
-\arguments{
-\item{x}{An \code{\link{epichains}} object}
-}
-\description{
-Title
-}
-\keyword{internal}
diff --git a/man/check_nchains_valid.Rd b/man/check_nchains_valid.Rd
index 6e565502..1a20e8b5 100644
--- a/man/check_nchains_valid.Rd
+++ b/man/check_nchains_valid.Rd
@@ -2,7 +2,7 @@
 % Please edit documentation in R/checks.R
 \name{check_nchains_valid}
 \alias{check_nchains_valid}
-\title{Check that nchains is greater than 0 and not infinite}
+\title{Check that nchains is greater than 0 and not infinity}
 \usage{
 check_nchains_valid(nchains)
 }
@@ -10,6 +10,6 @@ check_nchains_valid(nchains)
 \item{nchains}{Number of chains to simulate.}
 }
 \description{
-Check that nchains is greater than 0 and not infinite
+Check that nchains is greater than 0 and not infinity
 }
 \keyword{internal}
diff --git a/man/epichains-package.Rd b/man/epichains-package.Rd
index ab3d4d31..5e66ab10 100644
--- a/man/epichains-package.Rd
+++ b/man/epichains-package.Rd
@@ -4,9 +4,9 @@
 \name{epichains-package}
 \alias{epichains}
 \alias{epichains-package}
-\title{epichains: Analysing transmission chain statistics using branching process models}
+\title{epichains: Simulating and Analysing Transmission Chain Statistics Using Branching Process Models}
 \description{
-Provides methods to analyse and simulate the size and length of branching processes with an arbitrary offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks, as discussed in Farrington et al. (2003) \doi{10.1093/biostatistics/4.2.279}.
+Provides methods to simulate and analyse the size and length of branching processes with an arbitrary offspring distribution. These can be used, for example, to analyse the distribution of chain sizes or length of infectious disease outbreaks, as discussed in Farrington et al. (2003) \doi{10.1093/biostatistics/4.2.279}.
 }
 \seealso{
 Useful links:
@@ -22,8 +22,8 @@ Useful links:
 
 Authors:
 \itemize{
-  \item Sebastian Funk \email{sebastian.funk@lshtm.ac.uk} (\href{https://orcid.org/0000-0002-2842-3406}{ORCID})
   \item Flavio Finger \email{flavio.finger@epicentre.msf.org} (\href{https://orcid.org/0000-0002-8613-5170}{ORCID})
+  \item Sebastian Funk \email{sebastian.funk@lshtm.ac.uk} (\href{https://orcid.org/0000-0002-2842-3406}{ORCID})
 }
 
 Other contributors:
diff --git a/man/format.epichains.Rd b/man/format.epichains.Rd
index cb0bb0f1..6b46c5ca 100644
--- a/man/format.epichains.Rd
+++ b/man/format.epichains.Rd
@@ -17,3 +17,6 @@ Invisibly returns an \code{\link{epichains}}. Called for printing side-effects.
 \description{
 Format method for epichains class
 }
+\author{
+James M. Azam
+}
diff --git a/man/is_chains_tree.Rd b/man/is_chains_tree.Rd
new file mode 100644
index 00000000..951a2bcd
--- /dev/null
+++ b/man/is_chains_tree.Rd
@@ -0,0 +1,17 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{is_chains_tree}
+\alias{is_chains_tree}
+\title{Check if an epichains object has the \code{chains_tree} attribute}
+\usage{
+is_chains_tree(x)
+}
+\arguments{
+\item{x}{An \code{\link{epichains}} object}
+}
+\description{
+Check if an epichains object has the \code{chains_tree} attribute
+}
+\author{
+James M. Azam
+}
diff --git a/man/is_chains_vec.Rd b/man/is_chains_vec.Rd
new file mode 100644
index 00000000..316f6f53
--- /dev/null
+++ b/man/is_chains_vec.Rd
@@ -0,0 +1,17 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{is_chains_vec}
+\alias{is_chains_vec}
+\title{Check if an epichains object has the \code{chains_vec} attribute}
+\usage{
+is_chains_vec(x)
+}
+\arguments{
+\item{x}{An \code{\link{epichains}} object}
+}
+\description{
+Check if an epichains object has the \code{chains_vec} attribute
+}
+\author{
+James M. Azam
+}
diff --git a/man/is_epichains.Rd b/man/is_epichains.Rd
index aa2d540d..5b327eb7 100644
--- a/man/is_epichains.Rd
+++ b/man/is_epichains.Rd
@@ -2,7 +2,7 @@
 % Please edit documentation in R/epichains.R
 \name{is_epichains}
 \alias{is_epichains}
-\title{Checks whether the object is an \code{epichains}}
+\title{Reports whether x is an \code{epichains} object}
 \usage{
 is_epichains(x)
 }
@@ -14,6 +14,8 @@ logical, \code{TRUE} if the object is an \code{epichains} and \code{FALSE}
 otherwise
 }
 \description{
-Checks whether the object is an \code{epichains}
+Reports whether x is an \code{epichains} object
+}
+\author{
+James M. Azam
 }
-\keyword{internal}
diff --git a/man/is_epichains_aggregate_df.Rd b/man/is_epichains_aggregate_df.Rd
index ceeb73aa..98d779c3 100644
--- a/man/is_epichains_aggregate_df.Rd
+++ b/man/is_epichains_aggregate_df.Rd
@@ -2,14 +2,20 @@
 % Please edit documentation in R/epichains.R
 \name{is_epichains_aggregate_df}
 \alias{is_epichains_aggregate_df}
-\title{Check if an object is of class "epichains_aggregate_df"}
+\title{Reports whether x is an "epichains_aggregate_df" object}
 \usage{
 is_epichains_aggregate_df(x)
 }
 \arguments{
 \item{x}{An \code{\link{epichains}} object}
 }
+\value{
+logical, \code{TRUE} if the object is an \code{epichains_aggregate_df} and
+\code{FALSE} otherwise
+}
 \description{
-Check if an object is of class "epichains_aggregate_df"
+Reports whether x is an "epichains_aggregate_df" object
+}
+\author{
+James M. Azam
 }
-\keyword{internal}
diff --git a/man/print.epichains.Rd b/man/print.epichains.Rd
index 22c24de2..ad9c2347 100644
--- a/man/print.epichains.Rd
+++ b/man/print.epichains.Rd
@@ -17,3 +17,6 @@ Invisibly returns an \code{\link{epichains}}. Called for side-effects.
 \description{
 Print an \code{\link{epichains}} object
 }
+\author{
+James M. Azam
+}
diff --git a/man/summary.epichains.Rd b/man/summary.epichains.Rd
index f6b81976..83e28801 100644
--- a/man/summary.epichains.Rd
+++ b/man/summary.epichains.Rd
@@ -17,3 +17,6 @@ data frame of information
 \description{
 Summary method for epichains class
 }
+\author{
+James M. Azam
+}
diff --git a/man/validate_epichains.Rd b/man/validate_epichains.Rd
index 03953a59..8cddc077 100644
--- a/man/validate_epichains.Rd
+++ b/man/validate_epichains.Rd
@@ -10,8 +10,7 @@ validate_epichains(x)
 \item{x}{An \code{epichains} object}
 }
 \value{
-Checks if an object is of class \code{epichains} and if so
-checks that it's in the right format as a "data.frame" or vector.
+No return.
 }
 \description{
 \code{epichains} class validator
@@ -19,4 +18,3 @@ checks that it's in the right format as a "data.frame" or vector.
 \author{
 James M. Azam
 }
-\keyword{internal}

From 359e71c0148df50e9ec5f0e21f7fc91d87a42aa5 Mon Sep 17 00:00:00 2001
From: GitHub Action <action@github.com>
Date: Wed, 26 Jul 2023 18:01:58 +0000
Subject: [PATCH 504/828] Update CITATION.cff

---
 CITATION.cff | 91 ++++++++++++++++++++++++++++++++++++++++++++--------
 1 file changed, 77 insertions(+), 14 deletions(-)

diff --git a/CITATION.cff b/CITATION.cff
index a4b1966a..6f2cf95b 100644
--- a/CITATION.cff
+++ b/CITATION.cff
@@ -7,26 +7,26 @@ cff-version: 1.2.0
 message: 'To cite package "epichains" in publications use:'
 type: software
 license: MIT
-title: 'epichains: Analysing transmission chain statistics using branching process
-  models'
-version: 0.2.1
-abstract: Provides methods to analyse and simulate the size and length of branching
+title: 'epichains: Simulating and Analysing Transmission Chain Statistics Using Branching
+  Process Models'
+version: 0.1.0
+abstract: Provides methods to simulate and analyse the size and length of branching
   processes with an arbitrary offspring distribution. These can be used, for example,
   to analyse the distribution of chain sizes or length of infectious disease outbreaks,
   as discussed in Farrington et al. (2003) <doi:10.1093/biostatistics/4.2.279>.
 authors:
-- family-names: Funk
-  given-names: Sebastian
-  email: sebastian.funk@lshtm.ac.uk
-  orcid: https://orcid.org/0000-0002-2842-3406
-- family-names: Finger
-  given-names: Flavio
-  email: flavio.finger@epicentre.msf.org
-  orcid: https://orcid.org/0000-0002-8613-5170
 - family-names: Azam
   given-names: James M.
   email: james.azam@lshtm.ac.uk
   orcid: https://orcid.org/0000-0001-5782-7330
+- family-names: Finger
+  given-names: Flavio
+  email: flavio.finger@epicentre.msf.org
+  orcid: https://orcid.org/0000-0002-8613-5170
+- family-names: Funk
+  given-names: Sebastian
+  email: sebastian.funk@lshtm.ac.uk
+  orcid: https://orcid.org/0000-0002-2842-3406
 preferred-citation:
   type: manual
   title: 'epichains: Analysing transmission chain statistics using branching process
@@ -44,6 +44,16 @@ contact:
   given-names: James M.
   email: james.azam@lshtm.ac.uk
   orcid: https://orcid.org/0000-0001-5782-7330
+keywords:
+- branching-processes
+- epidemic-dynamics
+- epidemic-modelling
+- epidemic-simulations
+- outbreak-simulator
+- r-package
+- r-stats
+- transmission-chain
+- transmission-chain-reconstruction
 references:
 - type: software
   title: 'R: A Language and Environment for Statistical Computing'
@@ -57,6 +67,40 @@ references:
   institution:
     name: R Foundation for Statistical Computing
   version: '>= 3.6.0'
+- type: software
+  title: checkmate
+  abstract: 'checkmate: Fast and Versatile Argument Checks'
+  notes: Imports
+  url: https://mllg.github.io/checkmate/
+  repository: https://CRAN.R-project.org/package=checkmate
+  authors:
+  - family-names: Lang
+    given-names: Michel
+    email: michellang@gmail.com
+    orcid: https://orcid.org/0000-0001-9754-0393
+  year: '2023'
+- type: software
+  title: stats
+  abstract: 'R: A Language and Environment for Statistical Computing'
+  notes: Imports
+  authors:
+  - name: R Core Team
+  location:
+    name: Vienna, Austria
+  year: '2023'
+  institution:
+    name: R Foundation for Statistical Computing
+- type: software
+  title: utils
+  abstract: 'R: A Language and Environment for Statistical Computing'
+  notes: Imports
+  authors:
+  - name: R Core Team
+  location:
+    name: Vienna, Austria
+  year: '2023'
+  institution:
+    name: R Foundation for Statistical Computing
 - type: software
   title: bookdown
   abstract: 'bookdown: Authoring Books and Technical Documents with R Markdown'
@@ -224,6 +268,21 @@ references:
     email: rich@posit.co
     orcid: https://orcid.org/0000-0003-3925-190X
   year: '2023'
+- type: software
+  title: spelling
+  abstract: 'spelling: Tools for Spell Checking in R'
+  notes: Suggests
+  url: https://docs.ropensci.org/spelling/
+  repository: https://CRAN.R-project.org/package=spelling
+  authors:
+  - family-names: Ooms
+    given-names: Jeroen
+    email: jeroen@berkeley.edu
+    orcid: https://orcid.org/0000-0002-4035-0289
+  - family-names: Hester
+    given-names: Jim
+    email: james.hester@rstudio.com
+  year: '2023'
 - type: software
   title: testthat
   abstract: 'testthat: Unit Testing for R'
@@ -257,14 +316,18 @@ references:
   authors:
   - family-names: Wickham
     given-names: Hadley
-    email: hadley@rstudio.com
+    email: hadley@posit.co
     orcid: https://orcid.org/0000-0003-4757-117X
   - family-names: Bryan
     given-names: Jennifer
-    email: jenny@rstudio.com
+    email: jenny@posit.co
     orcid: https://orcid.org/0000-0002-6983-2759
   - family-names: Barrett
     given-names: Malcolm
     email: malcolmbarrett@gmail.com
     orcid: https://orcid.org/0000-0003-0299-5825
+  - family-names: Teucher
+    given-names: Andy
+    email: andy.teucher@posit.co
+    orcid: https://orcid.org/0000-0002-7840-692X
   year: '2023'

From ed267927baa5a34b768aac9d51614ec9e88fea9d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 27 Jul 2023 14:59:59 +0100
Subject: [PATCH 505/828] Used is_ function to check for attributes

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index e0f653a8..9a100f3d 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -175,7 +175,7 @@ validate_epichains <- function(x) {
 
   # check for class invariants
 
-  if (attributes(x)$chain_type == "chains_tree") {
+  if (is_chains_tree(x)) {
     stopifnot(
       "object does not contain the correct columns" =
         c("sim_id", "ancestor", "generation", "time") %in%

From 6421e450fb50f63ec6d9e2504763e4271cabefb4 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 27 Jul 2023 15:00:24 +0100
Subject: [PATCH 506/828] Removed the "time" column from the invariants

---
 R/epichains.R | 6 ++----
 1 file changed, 2 insertions(+), 4 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 9a100f3d..17361bf6 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -178,16 +178,14 @@ validate_epichains <- function(x) {
   if (is_chains_tree(x)) {
     stopifnot(
       "object does not contain the correct columns" =
-        c("sim_id", "ancestor", "generation", "time") %in%
+        c("sim_id", "ancestor", "generation") %in%
           colnames(x),
       "column `sim_id` must be a numeric" =
         is.numeric(x$sim_id),
       "column `ancestor` must be a numeric" =
         is.numeric(x$ancestor),
       "column `generation` must be a numeric" =
-        is.numeric(x$generation),
-      "column `time` must be a numeric" =
-        is.numeric(x$time)
+        is.numeric(x$generation)
     )
   } else {
     stopifnot(

From 4523934072ae7ec8a078d57b6e7b7620fbdccb08 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 27 Jul 2023 16:36:59 +0100
Subject: [PATCH 507/828] Customized the head() and tail() to sort the object
 before printing the top and bottom 5.

---
 R/epichains.R | 27 ++++++++++++++++++++-------
 1 file changed, 20 insertions(+), 7 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 17361bf6..d1b0413e 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -32,17 +32,13 @@ format.epichains <- function(x, ...) {
         )
       )
 
-    #sort by ancestor first
-
-    x <- x[order(x$sim_id, x$ancestor), ]
-
     # print head of the simulation output
-    print(head(x[!is.na(x$ancestor), ]))
+    print(head(x))
 
     cat("< tree tail >\n")
 
     # print tail of object
-    print(tail(as.data.frame(x)))
+    print(tail(x))
 
     # print summary information
     writeLines(
@@ -228,18 +224,35 @@ is_chains_vec <- function(x) {
 #' @return object of class `data.frame`
 #' @author James M. Azam
 #' @export
+#' @details
+#' This returns the first 5 rows of an `epichains` object after
+#' its rows have first been sorted by `sim_id` and `ancestor` and the first
+#' unknown ancestors (NA) have been dropped. To view the full output,
+#' use `as.data.frame(<object_name>)`.
+#'
 head.epichains <- function(x, ...) {
+  #sort by ancestor first
+  x <- x[order(x$sim_id, x$ancestor), ]
+  # print head of the simulation output
+  x <- x[!is.na(x$ancestor), ]
   utils::head(as.data.frame(x), ...)
 }
 
 #' `tail` method for [`epichains`] class
+#'
 #' @param x An [`epichains`] object
 #' @param ... further arguments passed to or from other methods
 #' @importFrom utils tail
 #' @author James M. Azam
 #' @export
+#' @details
+#' This returns the last 5 rows of an `epichains` object after
+#' its rows have first been sorted by `sim_id` and `ancestor`. To
+#' view the full output, use `as.data.frame(<object_name>)`.
+#'
 tail.epichains <- function(x, ...) {
-  utils::tail(as.data.frame(x), ...)
+  x <- x[order(x$sim_id, x$ancestor), ]
+  utils::tail(as.data.frame(x), n = 5L, ...)
 }
 
 #' Aggregate cases in epichains objects according to a grouping variable

From 643065eacbaeb79747167b319af1fe6b7e479352 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 27 Jul 2023 17:52:57 +0100
Subject: [PATCH 508/828] Updated the title for get_offspring_func()

---
 R/helpers.R | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/R/helpers.R b/R/helpers.R
index 99f66da6..e1882fe5 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -16,7 +16,8 @@ update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
 }
 
 
-#' Get offspring sampling function
+#' Get offspring sampling function that takes into account susceptible
+#' depletion
 #'
 #' @param n Number of items to sample
 #' @param susc Susceptible population size (calculated

From d45d389f1f8103c07977571b807d81972a2d3fbb Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Thu, 27 Jul 2023 18:04:36 +0100
Subject: [PATCH 509/828] Removed an indentation.R

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 R/likelihood_estimation.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index 8f663805..c17b2584 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -118,7 +118,7 @@ estimate_likelihood <- function(chains_observed,
 
   if (!individual) {
     chains_likelihood <- vapply(chains_likelihood, sum, 0)
-    }
+  }
 
   return(chains_likelihood)
 }

From 4d8d6d675921fb121852525657754967d44d5adc Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 1 Aug 2023 12:11:05 +0100
Subject: [PATCH 510/828] Remove "tbl" from the list of classes

---
 R/simulate.r | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index f0601fe3..3cf51d64 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -203,7 +203,7 @@ simulate_tree <- function(nchains, offspring_sampler,
     chain_type = "chains_tree",
     rownames = NULL,
     track_pop = FALSE,
-    class = c("epichains", "tbl", "data.frame")
+    class = c("epichains", "data.frame")
   )
 }
 
@@ -438,6 +438,6 @@ simulate_tree_from_pop <- function(pop,
     chain_type = "chains_tree",
     rownames = NULL,
     track_pop = TRUE,
-    class = c("epichains", "tbl", "data.frame")
+    class = c("epichains", "data.frame")
   )
 }

From 0790c2538c25f253be9bbd7396953178877add4f Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 1 Aug 2023 12:12:17 +0100
Subject: [PATCH 511/828] Now sorting tree_df by sim_id and ancestor in the
 simulation functions

---
 R/simulate.r | 7 +++++--
 1 file changed, 5 insertions(+), 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 3cf51d64..e0d6c1f6 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -197,6 +197,9 @@ simulate_tree <- function(nchains, offspring_sampler,
     tree_df <- tree_df[tree_df$time < tf, ]
   }
 
+  #sort by sim_id and ancestor
+  tree_df <- tree_df[order(tree_df$sim_id, tree_df$ancestor), ]
+
   structure(
     tree_df,
     chains = nchains,
@@ -429,8 +432,8 @@ simulate_tree_from_pop <- function(pop,
   ## have been generated in the last generation
   tree_df <- tree_df[tree_df$time <= tf, ]
 
-  ## sort output and remove columns not needed
-  tree_df <- tree_df[order(tree_df$time, tree_df$sim_id), ]
+  #sort by sim_id and ancestor
+  tree_df <- tree_df[order(tree_df$sim_id, tree_df$ancestor), ]
   tree_df$offspring_generated <- NULL
 
   structure(

From ab4dd5d841acc4a876981d8f81100906ea27a1a7 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 1 Aug 2023 12:13:41 +0100
Subject: [PATCH 512/828] Modified the epichains vector printing method

---
 R/epichains.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index d1b0413e..93dc9370 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -60,9 +60,9 @@ format.epichains <- function(x, ...) {
                )
 
   } else if (is_chains_vec(x)) {
-    cat(sprintf("epichains object \n"))
+    writeLines(sprintf("`epichains` object \n"))
     print(as.vector(x))
-    cat(sprintf("Number of chains simulated: %s",
+    writeLines(sprintf("\n Number of chains simulated: %s",
                 chain_info[["unique_chains"]]
                 )
         )

From a65a93417dabc60bc421c590e54ada836db38d1d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 1 Aug 2023 12:36:54 +0100
Subject: [PATCH 513/828] Moved the sorting operation to the simulation
 functions

---
 R/epichains.R | 7 ++-----
 1 file changed, 2 insertions(+), 5 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 93dc9370..472e100f 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -231,9 +231,7 @@ is_chains_vec <- function(x) {
 #' use `as.data.frame(<object_name>)`.
 #'
 head.epichains <- function(x, ...) {
-  #sort by ancestor first
-  x <- x[order(x$sim_id, x$ancestor), ]
-  # print head of the simulation output
+  # print head of the simulation output from the first known ancestor
   x <- x[!is.na(x$ancestor), ]
   utils::head(as.data.frame(x), ...)
 }
@@ -251,8 +249,7 @@ head.epichains <- function(x, ...) {
 #' view the full output, use `as.data.frame(<object_name>)`.
 #'
 tail.epichains <- function(x, ...) {
-  x <- x[order(x$sim_id, x$ancestor), ]
-  utils::tail(as.data.frame(x), n = 5L, ...)
+  utils::tail(as.data.frame(x), ...)
 }
 
 #' Aggregate cases in epichains objects according to a grouping variable

From a1fbd8fd5e16b011e64e842b9c20dfcf5c11f86d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 1 Aug 2023 14:33:45 +0100
Subject: [PATCH 514/828] Moved the headers from the format method to the head
 and tail methods

---
 R/epichains.R | 16 ++++------------
 1 file changed, 4 insertions(+), 12 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 472e100f..0982a2f6 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -24,19 +24,9 @@ format.epichains <- function(x, ...) {
   chain_info <- summary(x)
 
   if (is_chains_tree(x)) {
-    writeLines(
-      c(
-        sprintf("`epichains` object"),
-
-        "< tree head (from first known ancestor) >\n"
-        )
-      )
-
-    # print head of the simulation output
+    writeLines(sprintf("`epichains` object\n"))
+    # print head of the object
     print(head(x))
-
-    cat("< tree tail >\n")
-
     # print tail of object
     print(tail(x))
 
@@ -231,6 +221,7 @@ is_chains_vec <- function(x) {
 #' use `as.data.frame(<object_name>)`.
 #'
 head.epichains <- function(x, ...) {
+  writeLines("< tree head (from first known ancestor) >\n")
   # print head of the simulation output from the first known ancestor
   x <- x[!is.na(x$ancestor), ]
   utils::head(as.data.frame(x), ...)
@@ -249,6 +240,7 @@ head.epichains <- function(x, ...) {
 #' view the full output, use `as.data.frame(<object_name>)`.
 #'
 tail.epichains <- function(x, ...) {
+  writeLines("\n< tree tail >\n")
   utils::tail(as.data.frame(x), ...)
 }
 

From ade6dec58874e210b0239532a76f82d75e602b91 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 1 Aug 2023 14:33:59 +0100
Subject: [PATCH 515/828] Edited the details

---
 R/epichains.R | 15 ++++++++-------
 1 file changed, 8 insertions(+), 7 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 0982a2f6..420e9cf2 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -215,10 +215,10 @@ is_chains_vec <- function(x) {
 #' @author James M. Azam
 #' @export
 #' @details
-#' This returns the first 5 rows of an `epichains` object after
-#' its rows have first been sorted by `sim_id` and `ancestor` and the first
-#' unknown ancestors (NA) have been dropped. To view the full output,
-#' use `as.data.frame(<object_name>)`.
+#' This returns the top rows of an `epichains` object. Note that the object
+#' is originally sorted by `sim_id` and `ancestor` and the first
+#' unknown ancestors (NA) have been dropped from
+#' printing method. To view the full output, use `as.data.frame(<object_name>)`.
 #'
 head.epichains <- function(x, ...) {
   writeLines("< tree head (from first known ancestor) >\n")
@@ -235,9 +235,10 @@ head.epichains <- function(x, ...) {
 #' @author James M. Azam
 #' @export
 #' @details
-#' This returns the last 5 rows of an `epichains` object after
-#' its rows have first been sorted by `sim_id` and `ancestor`. To
-#' view the full output, use `as.data.frame(<object_name>)`.
+#' This returns the top rows of an `epichains` object. Note that the object
+#' is originally sorted by `sim_id` and `ancestor` and the first
+#' unknown ancestors (NA) have been dropped from
+#' printing method. To view the full output, use `as.data.frame(<object_name>)`.
 #'
 tail.epichains <- function(x, ...) {
   writeLines("\n< tree tail >\n")

From 804e99680fdf09491037916e83220e101c272f40 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 1 Aug 2023 17:52:24 +0100
Subject: [PATCH 516/828] Deleted unused function

---
 R/utils.r | 18 ------------------
 1 file changed, 18 deletions(-)

diff --git a/R/utils.r b/R/utils.r
index 4ff2e0e0..c4d135a5 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -40,24 +40,6 @@ rgen_length <- function(n, x, prob) {
     ceiling(log(stats::runif(n, 0, 1)) / log(1 - prob) - 1)
 }
 
-#' Finds the name of a function passed as an argument
-#'
-#' This works even when a function is passed multiple times (e.g., when used
-#' inside an \code{\link{optim}} call).
-#' See https://stackoverflow.com/a/46740314/10886760
-#' @param fun function of which the name is to be determined
-#' @return function name
-#' @author Sebastian Funk
-#' @keywords internal
-find_function_name <- function(fun) {
-  objects <- ls(envir = environment(fun))
-  for (i in objects) {
-    if (identical(fun, get(i, envir = environment(fun)))) {
-      return(i)
-    }
-  }
-}
-
 #' Negative binomial random numbers parametrized
 #' in terms of mean and dispersion coefficient
 #' @param n number of samples to draw

From 537e33eccd574a05737f992f7a233b11435b6482 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 1 Aug 2023 17:57:51 +0100
Subject: [PATCH 517/828] Removed "chain" from argument and function names

---
 R/borel.r                                     |   6 +-
 R/checks.R                                    |   8 +-
 R/epichains.R                                 |   5 +-
 R/helpers.R                                   |   2 +-
 R/likelihood_estimation.R                     |  75 ++++++------
 R/likelihoods.R                               |  24 ++--
 R/simulate.r                                  | 114 +++++++++---------
 man/aggregate.epichains.Rd                    |   5 +-
 man/check_offspring_valid.Rd                  |   4 +-
 man/construct_offspring_ll_name.Rd            |  12 --
 man/estimate_likelihood.Rd                    |  37 +++---
 man/get_offspring_func.Rd                     |  19 +--
 ...tatistic_func.Rd => get_statistic_func.Rd} |  13 +-
 man/offspring_ll.Rd                           |  25 ++--
 man/simulate_tree.Rd                          |  26 ++--
 man/simulate_tree_from_pop.Rd                 |  28 ++---
 man/simulate_vect.Rd                          |  24 ++--
 tests/testthat/tests-sim.r                    |  16 +--
 18 files changed, 207 insertions(+), 236 deletions(-)
 rename man/{get_chain_statistic_func.Rd => get_statistic_func.Rd} (50%)

diff --git a/R/borel.r b/R/borel.r
index 9d470d4b..9e2747f8 100644
--- a/R/borel.r
+++ b/R/borel.r
@@ -25,9 +25,9 @@ dborel <- function(x, mu, log = FALSE) {
 ##' @export
 rborel <- function(n, mu, infinite = Inf) {
   simulate_vect(nchains = n,
-                offspring_sampler = "pois",
-                chain_statistic = "size",
-                chain_stat_max = infinite,
+                offspring_dist = "pois",
+                statistic = "size",
+                stat_max = infinite,
                 lambda = mu
                 )
 }
diff --git a/R/checks.R b/R/checks.R
index 73d173ea..53123296 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -1,15 +1,15 @@
 #' Check if offspring argument is specified as a character string
 #'
-#' @param offspring_sampler Offspring distribution: a character string
+#' @param offspring_dist Offspring distribution: a character string
 #' corresponding to the R distribution function (e.g., "pois" for Poisson,
 #' where \code{\link{rpois}} is the R function to generate Poisson random
 #' numbers).
 #' @keywords internal
-check_offspring_valid <- function(offspring_sampler) {
-  if (!checkmate::test_string(offspring_sampler)) {
+check_offspring_valid <- function(offspring_dist) {
+  if (!checkmate::test_string(offspring_dist)) {
     stop(sprintf(
       "%s %s",
-      "'offspring_sampler' must be specified as a character string.",
+      "'offspring_dist' must be specified as a character string.",
       "Did you forget to enclose it in quotes?"
     ))
   }
diff --git a/R/epichains.R b/R/epichains.R
index 420e9cf2..9ca5b310 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -260,8 +260,9 @@ tail.epichains <- function(x, ...) {
 #' @author James M. Azam
 #' @examples
 #' set.seed(123)
-#' chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,
-#' offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
+#' chains <- simulate_tree(nchains = 10, statistic = "size",
+#' offspring_dist = "pois", stat_max = 10, serials_sampler = function(x) 3,
+#' lambda = 2)
 #' chains
 #'
 #' # Aggregate cases per time
diff --git a/R/helpers.R b/R/helpers.R
index e1882fe5..9770bb64 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -66,7 +66,7 @@ get_offspring_func <- function(offspring_sampler, n, susc, pop,
 #'
 #' @return a function for calculating chain statistics
 #' @keywords internal
-get_chain_statistic_func <- function(chain_statistic) {
+get_statistic_func <- function(chain_statistic) {
   func <- if (chain_statistic == "size") {
     rbinom_size
   } else if (chain_statistic == "length") {
diff --git a/R/likelihood_estimation.R b/R/likelihood_estimation.R
index c17b2584..42675ff1 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood_estimation.R
@@ -1,30 +1,23 @@
 #' Estimate the (log) likelihood for observed branching processes
 #'
-#' @param chains_observed Vector of sizes/lengths of transmission chains.
-#' @param chain_statistic Statistic given as \code{chains_observed}
-#' ("size" or "length" of chains).
-#' @param offspring_sampler Offspring distribution: a character string
-#' corresponding to the R distribution function (e.g., "pois" for Poisson,
-#' where \code{\link{rpois}} is the R function to generate Poisson random
-#' numbers).
+#' @inheritParams simulate_vect
+#' @param chains Vector of sizes/lengths of transmission chains.
 #' @param nsim_obs Number of simulations if the likelihood is to be
 #' approximated for imperfect observations.
-#' @param log_trans Logical; Should the results be log-transformed? (Defaults
+#' @param log Logical; Should the results be log-transformed? (Defaults
 #' to TRUE).
 #' @param obs_prob Observation probability (assumed constant)
-#' @param chain_stat_max Any chains of this size/length will be
-#' treated as infinite.
 #' @param exclude A vector of indices of the sizes/lengths to exclude from the
 #' likelihood calculation.
 #' @param individual If TRUE, a vector of individual (log)likelihood
 #' contributions will be returned rather than the sum.
 #' @param ... Parameters for the offspring distribution.
 #' @return
-#' * A log-likelihood, if \code{log_trans = TRUE} (the default)
-#' * A vector of log-likelihoods, if \code{log_trans = TRUE} (the default) and
+#' * A log-likelihood, if \code{log = TRUE} (the default)
+#' * A vector of log-likelihoods, if \code{log = TRUE} (the default) and
 #' \code{obs_prob < 1}, or
 #' * A list of individual log-likelihood contributions, if
-#' \code{log_trans = TRUE} (the default) and \code{individual = TRUE}.
+#' \code{log = TRUE} (the default) and \code{individual = TRUE}.
 #' else raw likelihoods, or vector of likelihoods
 #' @seealso offspring_ll, pois_size_ll, nbinom_size_ll, gborel_size_ll,
 #' pois_length_ll, geom_length_ll.
@@ -32,20 +25,20 @@
 #' @examples
 #' # example of observed chain sizes
 #' chain_sizes <- c(1, 1, 4, 7)
-#' estimate_likelihood(chains_observed = chain_sizes, chain_statistic = "size",
-#'  offspring_sampler = "pois", nsim_obs = 100, lambda = 0.5)
+#' estimate_likelihood(chains = chain_sizes, statistic = "size",
+#'  offspring_dist = "pois", nsim_obs = 100, lambda = 0.5)
 #' @export
-estimate_likelihood <- function(chains_observed,
-                                chain_statistic = c("size", "length"),
-                                offspring_sampler,
+estimate_likelihood <- function(chains,
+                                statistic = c("size", "length"),
+                                offspring_dist,
                                 nsim_obs,
-                                log_trans = TRUE,
-                                obs_prob = 1, chain_stat_max = Inf,
+                                log = TRUE,
+                                obs_prob = 1, stat_max = Inf,
                                 exclude = NULL, individual = FALSE, ...) {
-  chain_statistic <- match.arg(chain_statistic)
+  statistic <- match.arg(statistic)
 
   ## checks
-  check_offspring_valid(offspring_sampler)
+  check_offspring_valid(offspring_dist)
 
   if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
   if (obs_prob < 1) {
@@ -53,32 +46,32 @@ estimate_likelihood <- function(chains_observed,
       stop("'nsim_obs' must be specified if 'obs_prob' is < 1")
     }
 
-    sample_func <- get_chain_statistic_func(chain_statistic)
+    sample_func <- get_statistic_func(statistic)
 
-    sampled_x <- replicate(nsim_obs, pmin(sample_func(length(chains_observed),
-                                           chains_observed, obs_prob
+    sampled_x <- replicate(nsim_obs, pmin(sample_func(length(chains),
+                                           chains, obs_prob
                                            ),
-                               chain_stat_max), simplify = FALSE)
+                               stat_max), simplify = FALSE)
     size_x <- unlist(sampled_x)
-    if (!is.finite(chain_stat_max)) {
-      chain_stat_max <- max(size_x) + 1
+    if (!is.finite(stat_max)) {
+      stat_max <- max(size_x) + 1
       }
   } else {
-    chains_observed[chains_observed >= chain_stat_max] <- chain_stat_max
-    size_x <- chains_observed
-    sampled_x <- list(chains_observed)
+    chains[chains >= stat_max] <- stat_max
+    size_x <- chains
+    sampled_x <- list(chains)
   }
 
   ## determine for which sizes to calculate the likelihood (for true chain size)
-  if (any(size_x == chain_stat_max)) {
-    calc_sizes <- seq_len(chain_stat_max - 1)
+  if (any(size_x == stat_max)) {
+    calc_sizes <- seq_len(stat_max - 1)
   } else {
     calc_sizes <- unique(c(size_x, exclude))
   }
 
-  ## get likelihood function as given by offspring_sampler and chain_statistic
+  ## get likelihood function as given by offspring_dist and statistic
   likelihoods <- vector(mode = "numeric")
-  ll_func <- construct_offspring_ll_name(offspring_sampler, chain_statistic)
+  ll_func <- construct_offspring_ll_name(offspring_dist, statistic)
   pars <- as.list(unlist(list(...))) ## converts vectors to lists
 
   ## calculate likelihoods
@@ -90,16 +83,16 @@ estimate_likelihood <- function(chains_observed,
       do.call(
         offspring_ll,
         c(list(
-          chains_observed = calc_sizes, offspring_sampler = offspring_sampler,
-          chain_statistic = chain_statistic, chain_stat_max = chain_stat_max,
-          log_trans = log_trans
+          chains = calc_sizes, offspring_dist = offspring_dist,
+          statistic = statistic, stat_max = stat_max,
+          log = log
         ), pars)
       )
   }
 
-  ## assign probabilities to chain_stat_max outbreak sizes
-  if (any(size_x == chain_stat_max)) {
-    likelihoods[chain_stat_max] <- complementary_logprob(likelihoods)
+  ## assign probabilities to stat_max outbreak sizes
+  if (any(size_x == stat_max)) {
+    likelihoods[stat_max] <- complementary_logprob(likelihoods)
   }
 
   if (!missing(exclude)) {
diff --git a/R/likelihoods.R b/R/likelihoods.R
index 521052c9..477d8bae 100644
--- a/R/likelihoods.R
+++ b/R/likelihoods.R
@@ -92,24 +92,24 @@ geom_length_ll <- function(x, prob) {
 #' The likelihoods are calculated with a crude approximation using simulated
 #' chains by linearly approximating any missing values in the empirical
 #' cumulative distribution function (ecdf).
-#' @param chains_observed Vector of sizes/lengths
+#' @inheritParams estimate_likelihood
+#' @inheritParams simulate_vec
+#' @param chains Vector of sizes/lengths
 #' @param nsim_offspring Number of simulations of the offspring distribution
-#' for approximating the chain_statistic (size/length) distribution
-#' @param log_trans Logical; Should the results be log-transformed? (Defaults
+#' for approximating the statistic (size/length) distribution
+#' @param log Logical; Should the results be log-transformed? (Defaults
 #' to TRUE).
 #' @param ... any parameters to pass to \code{\link{simulate_tree}}
-#' @return If \code{log_trans = TRUE} (the default), log-likelihood values,
+#' @return If \code{log = TRUE} (the default), log-likelihood values,
 #' else raw likelihoods
 #' @author Sebastian Funk
-#' @inheritParams estimate_likelihood
-#' @inheritParams simulate_vec
 #' @keywords internal
-offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
-                         nsim_offspring = 100, log_trans = TRUE, ...) {
+offspring_ll <- function(chains, offspring_dist, statistic,
+                         nsim_offspring = 100, log = TRUE, ...) {
 
   # Simulate the chains
-  chains <- simulate_vect(nsim_offspring, offspring_sampler,
-                          chain_statistic, ...)
+  chains <- simulate_vect(nsim_offspring, offspring_dist,
+                          statistic, ...)
 
   # Compute the empirical Cumulative Distribution Function of the
   # simulated chains
@@ -121,8 +121,8 @@ offspring_ll <- function(chains_observed, offspring_sampler, chain_statistic,
       unique(chains), chains_empirical_cdf(unique(chains)),
       seq_len(max(chains[is.finite(chains)]))
     )$y))
-  lik <- acdf[chains_observed]
+  lik <- acdf[chains]
   lik[is.na(lik)] <- 0
-  out <- ifelse(base::isTRUE(log_trans), log(lik), lik)
+  out <- ifelse(base::isTRUE(log), log(lik), lik)
   return(out)
 }
diff --git a/R/simulate.r b/R/simulate.r
index e0d6c1f6..f03ebf66 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,16 +1,16 @@
 #' Simulate a tree of infections with a serial and offspring distributions
 #'
 #' @param nchains Number of chains to simulate.
-#' @param offspring_sampler Offspring distribution: a character string
+#' @param offspring_dist Offspring distribution: a character string
 #' corresponding to the R distribution function (e.g., "pois" for Poisson,
 #' where \code{\link{rpois}} is the R function to generate Poisson random
-#' numbers)
-#' @param chain_statistic String; Statistic to calculate. Can be one of:
+#' numbers).
+#' @param statistic String; Statistic to calculate. Can be one of:
 #' \itemize{
 #'   \item "size": the total number of offspring.
 #'   \item "length": the total number of ancestors.
 #' }
-#' @param chain_stat_max A cut off for the chain statistic (size/length) being
+#' @param stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to this value.
 #' Defaults to `Inf`.
 #' @param serials_sampler The serial interval generator function; the name of a
@@ -70,9 +70,9 @@
 #' @seealso [simulate_vect()] for simulating transmission chains as a vector
 #' @examples
 #' set.seed(123)
-#' chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,
-#' offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
-#' chains
+#' chains <- simulate_tree(nchains = 10, statistic = "size",
+#' offspring_dist = "pois", stat_max = 10, serials_sampler = function(x) 3,
+#' lambda = 2)
 #' @references
 #'
 #' Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
@@ -84,19 +84,19 @@
 #' Fine PE. The interval between successive cases of an
 #' infectious disease. Am J Epidemiol. 2003 Dec 1;158(11):1039-47.
 #' doi: 10.1093/aje/kwg251. PMID: 14630599.
-simulate_tree <- function(nchains, offspring_sampler,
-                           chain_statistic = c("size", "length"),
-                           chain_stat_max = Inf, serials_sampler, t0 = 0,
-                           tf = Inf, ...) {
-  chain_statistic <- match.arg(chain_statistic)
+simulate_tree <- function(nchains, statistic = c("size", "length"),
+                          offspring_dist, stat_max = Inf,
+                          serials_sampler, t0 = 0,
+                          tf = Inf, ...) {
+  statistic <- match.arg(statistic)
 
   check_nchains_valid(nchains = nchains)
 
   # check that offspring is properly specified
-  check_offspring_valid(offspring_sampler)
+  check_offspring_valid(offspring_dist)
 
   # check that offspring function exists in base R
-  roffspring_name <- paste0("r", offspring_sampler)
+  roffspring_name <- paste0("r", offspring_dist)
   check_offspring_func_valid(roffspring_name)
 
   if (!missing(serials_sampler)) {
@@ -106,7 +106,7 @@ simulate_tree <- function(nchains, offspring_sampler,
   }
 
   # Initialisations
-  stat_track <- rep(1, nchains) # track length or size (depending on `chain_statistic`) #nolint
+  stat_track <- rep(1, nchains) # track length or size (depending on `statistic`) #nolint
   n_offspring <- rep(1, nchains) # current number of offspring
   sim <- seq_len(nchains) # track chains that are still being simulated
   ancestor_ids <- rep(1, nchains) # all chains start in generation 1
@@ -142,7 +142,7 @@ simulate_tree <- function(nchains, offspring_sampler,
     n_offspring[sim] <- tapply(next_gen, indices, sum)
 
     # track size/length
-    stat_track <- update_chain_stat(stat_type = chain_statistic,
+    stat_track <- update_chain_stat(stat_type = statistic,
                                     stat_latest = stat_track,
                                     n_offspring = n_offspring)
 
@@ -179,8 +179,8 @@ simulate_tree <- function(nchains, offspring_sampler,
     }
 
     ## only continue to simulate chains that have offspring and aren't of
-    ## infinite size/length
-    sim <- which(n_offspring > 0 & stat_track < chain_stat_max)
+    ## the specified maximum size/length
+    sim <- which(n_offspring > 0 & stat_track < stat_max)
     if (length(sim) > 0) {
       if (!missing(serials_sampler)) {
         ## only continue to simulate chains that don't go beyond tf
@@ -215,24 +215,24 @@ simulate_tree <- function(nchains, offspring_sampler,
 #' Simulate transmission chains without tree (as a vector)
 #'
 #' @inheritParams simulate_tree
-#' @param chain_stat_max A cut off for the chain statistic (size/length) being
+#' @param stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to `Inf`.
 #' @examples
-#' simulate_vect(nchains = 10, offspring_sampler = "pois", lambda = 2,
-#' chain_stat_max = 10)
+#' simulate_vect(nchains = 10, statistic = "size", offspring_dist = "pois",
+#' stat_max = 10, lambda = 2)
 #' @export
-simulate_vect <- function(nchains, offspring_sampler,
-                           chain_statistic = c("size", "length"),
-                           chain_stat_max = Inf, ...) {
-  chain_statistic <- match.arg(chain_statistic)
+simulate_vect <- function(nchains, statistic = c("size", "length"),
+                          offspring_dist,
+                          stat_max = Inf, ...) {
+  statistic <- match.arg(statistic)
 
   check_nchains_valid(nchains = nchains)
 
   # check that offspring is properly specified
-  check_offspring_valid(offspring_sampler)
+  check_offspring_valid(offspring_dist)
 
   # check that offspring function exists in base R
-  roffspring_name <- paste0("r", offspring_sampler)
+  roffspring_name <- paste0("r", offspring_dist)
   check_offspring_func_valid(roffspring_name)
 
   # Initialisations
@@ -257,17 +257,17 @@ simulate_vect <- function(nchains, offspring_sampler,
     n_offspring[sim] <- tapply(next_gen, indices, sum)
 
     # track size/length
-    stat_track <- update_chain_stat(stat_type = chain_statistic,
+    stat_track <- update_chain_stat(stat_type = statistic,
                                     stat_latest = stat_track,
                                     n_offspring = n_offspring
                                     )
 
     ## only continue to simulate chains that offspring and aren't of
-    ## chain_stat_max size/length
-    sim <- which(n_offspring > 0 & stat_track < chain_stat_max)
+    ## stat_max size/length
+    sim <- which(n_offspring > 0 & stat_track < stat_max)
   }
 
-  stat_track[stat_track >= chain_stat_max] <- Inf
+  stat_track[stat_track >= stat_max] <- Inf
 
   structure(
     stat_track,
@@ -281,13 +281,13 @@ simulate_vect <- function(nchains, offspring_sampler,
 #' with initial immunity
 #'
 #' @param pop The susceptible population.
-#' @param offspring_sampler Offspring distribution sampler: a character string
+#' @param offspring_dist Offspring distribution sampler: a character string
 #' corresponding to the R distribution function. Currently only "pois" &
 #' "nbinom" are supported. Internally truncated distributions are used to
 #' avoid infecting more people than susceptibles available.
-#' @param mean_offspring The average number of secondary cases for each case.
+#' @param offspring_mean The average number of secondary cases for each case.
 #' Same as R0.
-#' @param disp_offspring The dispersion parameter of the number of
+#' @param offspring_disp The dispersion parameter of the number of
 #' secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
 #' avoid division by 0 when calculating the size. See details and
 #'  \code{?rnbinom} for details on the parameterisation in Ecology.
@@ -306,44 +306,44 @@ simulate_vect <- function(nchains, offspring_sampler,
 #'
 #' The poisson model is parametrised so that:
 #'
-#' lamda = mean_offspring * pop - initial_immune / pop
+#' lamda = offspring_mean * pop - initial_immune / pop
 #'
 #' The negative binomial model is parametrised as:
 #'
-#' mu = mean_offspring * pop - initial immune / pop, and
-#' size = mu / (disp_offspring - 1). This is why disp_offspring must be greater
+#' mu = offspring_mean * pop - initial immune / pop, and
+#' size = mu / (offspring_disp - 1). This is why offspring_disp must be greater
 #' than 1.
 #'
 #' simulate_tree_from_pop() has a couple of key different from simulate_tree():
 #'  * the maximal chain statistic is limited by `pop` instead of
-#'  `chain_stat_max` (in `simulate_tree()`),
+#'  `stat_max` (in `simulate_tree()`),
 #'  * it can only handle implemented offspring distributions ("pois" and
 #' "nbinom").
 #' @author Flavio Finger
 #' @author James M. Azam
 #' @examples
 #' # Simulate with poisson offspring
-#' simulate_tree_from_pop(pop = 100, offspring_sampler = "pois",
-#' mean_offspring = 0.5, serial_sampler = function(x) 3)
+#' simulate_tree_from_pop(pop = 100, offspring_dist = "pois",
+#' offspring_mean = 0.5, serial_sampler = function(x) 3)
 #'
 #' # Simulate with negative binomial offspring
-#' simulate_tree_from_pop(pop = 100, offspring_sampler = "nbinom",
-#' mean_offspring = 0.5, disp_offspring = 1.1, serial_sampler = function(x) 3)
+#' simulate_tree_from_pop(pop = 100, offspring_dist = "nbinom",
+#' offspring_mean = 0.5, offspring_disp = 1.1, serial_sampler = function(x) 3)
 #' @export
 simulate_tree_from_pop <- function(pop,
-                                   offspring_sampler = c("pois", "nbinom"),
-                                   mean_offspring,
-                                   disp_offspring,
+                                   offspring_dist = c("pois", "nbinom"),
+                                   offspring_mean,
+                                   offspring_disp,
                                    serial_sampler,
                                    initial_immune = 0,
                                    t0 = 0,
                                    tf = Inf) {
-  offspring_sampler <- match.arg(offspring_sampler)
+  offspring_dist <- match.arg(offspring_dist)
 
-  if (offspring_sampler == "pois") {
-    if (!missing(disp_offspring)) {
+  if (offspring_dist == "pois") {
+    if (!missing(offspring_disp)) {
       warning(sprintf("%s %s %s",
-                      "'disp_offspring' is not used for",
+                      "'offspring_disp' is not used for",
                       "poisson offspring distribution.",
                       "Will be ignored."
                       )
@@ -352,19 +352,19 @@ simulate_tree_from_pop <- function(pop,
 
     ## using a right truncated poisson distribution
     ## to avoid more cases than susceptibles
-    offspring_fun <- get_offspring_func(offspring_sampler)
+    offspring_fun <- get_offspring_func(offspring_dist)
 
-  } else if (offspring_sampler == "nbinom") {
-    if (missing(disp_offspring)) {
-      stop(sprintf("%s", "'disp_offspring' must be specified."))
-    } else if (disp_offspring <= 1) { ## dispersion coefficient
+  } else if (offspring_dist == "nbinom") {
+    if (missing(offspring_disp)) {
+      stop(sprintf("%s", "'offspring_disp' must be specified."))
+    } else if (offspring_disp <= 1) { ## dispersion coefficient
       stop(sprintf("%s %s %s",
                    "Offspring distribution 'nbinom' requires",
-                   "argument 'disp_offspring' > 1.",
+                   "argument 'offspring_disp' > 1.",
                    "Use 'pois' if there is no overdispersion."
       ))
     }
-    offspring_fun <- get_offspring_func(offspring_sampler)
+    offspring_fun <- get_offspring_func(offspring_dist)
   }
 
   ## initializations
@@ -394,7 +394,7 @@ simulate_tree_from_pop <- function(pop,
 
     ## generate it
     current_max_id <- max(tree_df$sim_id)
-    n_offspring <- offspring_fun(1, susc, pop, mean_offspring, disp_offspring)
+    n_offspring <- offspring_fun(1, susc, pop, offspring_mean, offspring_disp)
 
     if (n_offspring %% 1 > 0) {
       stop("Offspring distribution must return integers")
diff --git a/man/aggregate.epichains.Rd b/man/aggregate.epichains.Rd
index a960a7e9..7d0bc0b2 100644
--- a/man/aggregate.epichains.Rd
+++ b/man/aggregate.epichains.Rd
@@ -25,8 +25,9 @@ Aggregate cases in epichains objects according to a grouping variable
 }
 \examples{
 set.seed(123)
-chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,
-offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
+chains <- simulate_tree(nchains = 10, statistic = "size",
+offspring_dist = "pois", stat_max = 10, serials_sampler = function(x) 3,
+lambda = 2)
 chains
 
 # Aggregate cases per time
diff --git a/man/check_offspring_valid.Rd b/man/check_offspring_valid.Rd
index 83359dce..cd9bc32d 100644
--- a/man/check_offspring_valid.Rd
+++ b/man/check_offspring_valid.Rd
@@ -4,10 +4,10 @@
 \alias{check_offspring_valid}
 \title{Check if offspring argument is specified as a character string}
 \usage{
-check_offspring_valid(offspring_sampler)
+check_offspring_valid(offspring_dist)
 }
 \arguments{
-\item{offspring_sampler}{Offspring distribution: a character string
+\item{offspring_dist}{Offspring distribution: a character string
 corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers).}
diff --git a/man/construct_offspring_ll_name.Rd b/man/construct_offspring_ll_name.Rd
index b6f5a91f..2218c4b1 100644
--- a/man/construct_offspring_ll_name.Rd
+++ b/man/construct_offspring_ll_name.Rd
@@ -7,18 +7,6 @@ offspring}
 \usage{
 construct_offspring_ll_name(offspring_sampler, chain_statistic)
 }
-\arguments{
-\item{offspring_sampler}{Offspring distribution: a character string
-corresponding to the R distribution function (e.g., "pois" for Poisson,
-where \code{\link{rpois}} is the R function to generate Poisson random
-numbers)}
-
-\item{chain_statistic}{String; Statistic to calculate. Can be one of:
-\itemize{
-\item "size": the total number of offspring.
-\item "length": the total number of ancestors.
-}}
-}
 \value{
 an analytical offspring likelihood function
 }
diff --git a/man/estimate_likelihood.Rd b/man/estimate_likelihood.Rd
index c0dc70e9..b82f3857 100644
--- a/man/estimate_likelihood.Rd
+++ b/man/estimate_likelihood.Rd
@@ -5,25 +5,28 @@
 \title{Estimate the (log) likelihood for observed branching processes}
 \usage{
 estimate_likelihood(
-  chains_observed,
-  chain_statistic = c("size", "length"),
-  offspring_sampler,
+  chains,
+  statistic = c("size", "length"),
+  offspring_dist,
   nsim_obs,
-  log_trans = TRUE,
+  log = TRUE,
   obs_prob = 1,
-  chain_stat_max = Inf,
+  stat_max = Inf,
   exclude = NULL,
   individual = FALSE,
   ...
 )
 }
 \arguments{
-\item{chains_observed}{Vector of sizes/lengths of transmission chains.}
+\item{chains}{Vector of sizes/lengths of transmission chains.}
 
-\item{chain_statistic}{Statistic given as \code{chains_observed}
-("size" or "length" of chains).}
+\item{statistic}{String; Statistic to calculate. Can be one of:
+\itemize{
+\item "size": the total number of offspring.
+\item "length": the total number of ancestors.
+}}
 
-\item{offspring_sampler}{Offspring distribution: a character string
+\item{offspring_dist}{Offspring distribution: a character string
 corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers).}
@@ -31,13 +34,13 @@ numbers).}
 \item{nsim_obs}{Number of simulations if the likelihood is to be
 approximated for imperfect observations.}
 
-\item{log_trans}{Logical; Should the results be log-transformed? (Defaults
+\item{log}{Logical; Should the results be log-transformed? (Defaults
 to TRUE).}
 
 \item{obs_prob}{Observation probability (assumed constant)}
 
-\item{chain_stat_max}{Any chains of this size/length will be
-treated as infinite.}
+\item{stat_max}{A cut off for the chain statistic (size/length) being
+computed. Results above the specified value, are set to \code{Inf}.}
 
 \item{exclude}{A vector of indices of the sizes/lengths to exclude from the
 likelihood calculation.}
@@ -49,11 +52,11 @@ contributions will be returned rather than the sum.}
 }
 \value{
 \itemize{
-\item A log-likelihood, if \code{log_trans = TRUE} (the default)
-\item A vector of log-likelihoods, if \code{log_trans = TRUE} (the default) and
+\item A log-likelihood, if \code{log = TRUE} (the default)
+\item A vector of log-likelihoods, if \code{log = TRUE} (the default) and
 \code{obs_prob < 1}, or
 \item A list of individual log-likelihood contributions, if
-\code{log_trans = TRUE} (the default) and \code{individual = TRUE}.
+\code{log = TRUE} (the default) and \code{individual = TRUE}.
 else raw likelihoods, or vector of likelihoods
 }
 }
@@ -63,8 +66,8 @@ Estimate the (log) likelihood for observed branching processes
 \examples{
 # example of observed chain sizes
 chain_sizes <- c(1, 1, 4, 7)
-estimate_likelihood(chains_observed = chain_sizes, chain_statistic = "size",
- offspring_sampler = "pois", nsim_obs = 100, lambda = 0.5)
+estimate_likelihood(chains = chain_sizes, statistic = "size",
+ offspring_dist = "pois", nsim_obs = 100, lambda = 0.5)
 }
 \seealso{
 offspring_ll, pois_size_ll, nbinom_size_ll, gborel_size_ll,
diff --git a/man/get_offspring_func.Rd b/man/get_offspring_func.Rd
index 10c61254..a8f0757b 100644
--- a/man/get_offspring_func.Rd
+++ b/man/get_offspring_func.Rd
@@ -2,7 +2,8 @@
 % Please edit documentation in R/helpers.R
 \name{get_offspring_func}
 \alias{get_offspring_func}
-\title{Get offspring sampling function}
+\title{Get offspring sampling function that takes into account susceptible
+depletion}
 \usage{
 get_offspring_func(
   offspring_sampler,
@@ -14,30 +15,18 @@ get_offspring_func(
 )
 }
 \arguments{
-\item{offspring_sampler}{Offspring distribution sampler: a character string
-corresponding to the R distribution function. Currently only "pois" &
-"nbinom" are supported. Internally truncated distributions are used to
-avoid infecting more people than susceptibles available.}
-
 \item{n}{Number of items to sample}
 
 \item{susc}{Susceptible population size (calculated
 inside \code{\link{simulate_tree_from_pop}}  as pop - initial_immune)}
 
 \item{pop}{The susceptible population.}
-
-\item{mean_offspring}{The average number of secondary cases for each case.
-Same as R0.}
-
-\item{disp_offspring}{The dispersion parameter of the number of
-secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
-avoid division by 0 when calculating the size. See details and
-\code{?rnbinom} for details on the parameterisation in Ecology.}
 }
 \value{
 An offspring sampling function
 }
 \description{
-Get offspring sampling function
+Get offspring sampling function that takes into account susceptible
+depletion
 }
 \keyword{internal}
diff --git a/man/get_chain_statistic_func.Rd b/man/get_statistic_func.Rd
similarity index 50%
rename from man/get_chain_statistic_func.Rd
rename to man/get_statistic_func.Rd
index 3fad9d5f..fe37a9a2 100644
--- a/man/get_chain_statistic_func.Rd
+++ b/man/get_statistic_func.Rd
@@ -1,17 +1,10 @@
 % Generated by roxygen2: do not edit by hand
 % Please edit documentation in R/helpers.R
-\name{get_chain_statistic_func}
-\alias{get_chain_statistic_func}
+\name{get_statistic_func}
+\alias{get_statistic_func}
 \title{Return a function for calculating chain statistics}
 \usage{
-get_chain_statistic_func(chain_statistic)
-}
-\arguments{
-\item{chain_statistic}{String; Statistic to calculate. Can be one of:
-\itemize{
-\item "size": the total number of offspring.
-\item "length": the total number of ancestors.
-}}
+get_statistic_func(chain_statistic)
 }
 \value{
 a function for calculating chain statistics
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index 8556f5b1..b3ebfda6 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -5,35 +5,38 @@
 \title{Likelihood of the length of chains with generic offspring distribution}
 \usage{
 offspring_ll(
-  chains_observed,
-  offspring_sampler,
-  chain_statistic,
+  chains,
+  offspring_dist,
+  statistic,
   nsim_offspring = 100,
-  log_trans = TRUE,
+  log = TRUE,
   ...
 )
 }
 \arguments{
-\item{chains_observed}{Vector of sizes/lengths}
+\item{chains}{Vector of sizes/lengths}
 
-\item{offspring_sampler}{Offspring distribution: a character string
+\item{offspring_dist}{Offspring distribution: a character string
 corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers).}
 
-\item{chain_statistic}{Statistic given as \code{chains_observed}
-("size" or "length" of chains).}
+\item{statistic}{String; Statistic to calculate. Can be one of:
+\itemize{
+\item "size": the total number of offspring.
+\item "length": the total number of ancestors.
+}}
 
 \item{nsim_offspring}{Number of simulations of the offspring distribution
-for approximating the chain_statistic (size/length) distribution}
+for approximating the statistic (size/length) distribution}
 
-\item{log_trans}{Logical; Should the results be log-transformed? (Defaults
+\item{log}{Logical; Should the results be log-transformed? (Defaults
 to TRUE).}
 
 \item{...}{any parameters to pass to \code{\link{simulate_tree}}}
 }
 \value{
-If \code{log_trans = TRUE} (the default), log-likelihood values,
+If \code{log = TRUE} (the default), log-likelihood values,
 else raw likelihoods
 }
 \description{
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index a29a2748..d5e84878 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -6,9 +6,9 @@
 \usage{
 simulate_tree(
   nchains,
-  offspring_sampler,
-  chain_statistic = c("size", "length"),
-  chain_stat_max = Inf,
+  statistic = c("size", "length"),
+  offspring_dist,
+  stat_max = Inf,
   serials_sampler,
   t0 = 0,
   tf = Inf,
@@ -18,18 +18,18 @@ simulate_tree(
 \arguments{
 \item{nchains}{Number of chains to simulate.}
 
-\item{offspring_sampler}{Offspring distribution: a character string
-corresponding to the R distribution function (e.g., "pois" for Poisson,
-where \code{\link{rpois}} is the R function to generate Poisson random
-numbers)}
-
-\item{chain_statistic}{String; Statistic to calculate. Can be one of:
+\item{statistic}{String; Statistic to calculate. Can be one of:
 \itemize{
 \item "size": the total number of offspring.
 \item "length": the total number of ancestors.
 }}
 
-\item{chain_stat_max}{A cut off for the chain statistic (size/length) being
+\item{offspring_dist}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers).}
+
+\item{stat_max}{A cut off for the chain statistic (size/length) being
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
 
@@ -100,9 +100,9 @@ where \code{...} are the other arguments to \code{simulate_tree()}.
 
 \examples{
 set.seed(123)
-chains <- simulate_tree(nchains = 10, serials_sampler = function(x) 3,
-offspring_sampler = "pois", lambda = 2, chain_stat_max = 10)
-chains
+chains <- simulate_tree(nchains = 10, statistic = "size",
+offspring_dist = "pois", stat_max = 10, serials_sampler = function(x) 3,
+lambda = 2)
 }
 \references{
 Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index d2409fa4..58e844bc 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -7,9 +7,9 @@ with initial immunity}
 \usage{
 simulate_tree_from_pop(
   pop,
-  offspring_sampler = c("pois", "nbinom"),
-  mean_offspring,
-  disp_offspring,
+  offspring_dist = c("pois", "nbinom"),
+  offspring_mean,
+  offspring_disp,
   serial_sampler,
   initial_immune = 0,
   t0 = 0,
@@ -19,15 +19,15 @@ simulate_tree_from_pop(
 \arguments{
 \item{pop}{The susceptible population.}
 
-\item{offspring_sampler}{Offspring distribution sampler: a character string
+\item{offspring_dist}{Offspring distribution sampler: a character string
 corresponding to the R distribution function. Currently only "pois" &
 "nbinom" are supported. Internally truncated distributions are used to
 avoid infecting more people than susceptibles available.}
 
-\item{mean_offspring}{The average number of secondary cases for each case.
+\item{offspring_mean}{The average number of secondary cases for each case.
 Same as R0.}
 
-\item{disp_offspring}{The dispersion parameter of the number of
+\item{offspring_disp}{The dispersion parameter of the number of
 secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
 avoid division by 0 when calculating the size. See details and
 \code{?rnbinom} for details on the parameterisation in Ecology.}
@@ -54,18 +54,18 @@ with initial immunity
 \section{Offspring models}{
 The poisson model is parametrised so that:
 
-lamda = mean_offspring * pop - initial_immune / pop
+lamda = offspring_mean * pop - initial_immune / pop
 
 The negative binomial model is parametrised as:
 
-mu = mean_offspring * pop - initial immune / pop, and
-size = mu / (disp_offspring - 1). This is why disp_offspring must be greater
+mu = offspring_mean * pop - initial immune / pop, and
+size = mu / (offspring_disp - 1). This is why offspring_disp must be greater
 than 1.
 
 simulate_tree_from_pop() has a couple of key different from simulate_tree():
 \itemize{
 \item the maximal chain statistic is limited by \code{pop} instead of
-\code{chain_stat_max} (in \code{simulate_tree()}),
+\code{stat_max} (in \code{simulate_tree()}),
 \item it can only handle implemented offspring distributions ("pois" and
 "nbinom").
 }
@@ -73,12 +73,12 @@ simulate_tree_from_pop() has a couple of key different from simulate_tree():
 
 \examples{
 # Simulate with poisson offspring
-simulate_tree_from_pop(pop = 100, offspring_sampler = "pois",
-mean_offspring = 0.5, serial_sampler = function(x) 3)
+simulate_tree_from_pop(pop = 100, offspring_dist = "pois",
+offspring_mean = 0.5, serial_sampler = function(x) 3)
 
 # Simulate with negative binomial offspring
-simulate_tree_from_pop(pop = 100, offspring_sampler = "nbinom",
-mean_offspring = 0.5, disp_offspring = 1.1, serial_sampler = function(x) 3)
+simulate_tree_from_pop(pop = 100, offspring_dist = "nbinom",
+offspring_mean = 0.5, offspring_disp = 1.1, serial_sampler = function(x) 3)
 }
 \author{
 Flavio Finger
diff --git a/man/simulate_vect.Rd b/man/simulate_vect.Rd
index cdef8113..4f2a050f 100644
--- a/man/simulate_vect.Rd
+++ b/man/simulate_vect.Rd
@@ -6,27 +6,27 @@
 \usage{
 simulate_vect(
   nchains,
-  offspring_sampler,
-  chain_statistic = c("size", "length"),
-  chain_stat_max = Inf,
+  statistic = c("size", "length"),
+  offspring_dist,
+  stat_max = Inf,
   ...
 )
 }
 \arguments{
 \item{nchains}{Number of chains to simulate.}
 
-\item{offspring_sampler}{Offspring distribution: a character string
-corresponding to the R distribution function (e.g., "pois" for Poisson,
-where \code{\link{rpois}} is the R function to generate Poisson random
-numbers)}
-
-\item{chain_statistic}{String; Statistic to calculate. Can be one of:
+\item{statistic}{String; Statistic to calculate. Can be one of:
 \itemize{
 \item "size": the total number of offspring.
 \item "length": the total number of ancestors.
 }}
 
-\item{chain_stat_max}{A cut off for the chain statistic (size/length) being
+\item{offspring_dist}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers).}
+
+\item{stat_max}{A cut off for the chain statistic (size/length) being
 computed. Results above the specified value, are set to \code{Inf}.}
 
 \item{...}{Parameters of the offspring distribution as required by R.}
@@ -35,6 +35,6 @@ computed. Results above the specified value, are set to \code{Inf}.}
 Simulate transmission chains without tree (as a vector)
 }
 \examples{
-simulate_vect(nchains = 10, offspring_sampler = "pois", lambda = 2,
-chain_stat_max = 10)
+simulate_vect(nchains = 10, statistic = "size", offspring_dist = "pois",
+stat_max = 10, lambda = 2)
 }
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 84a7b5e9..0911e758 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -1,27 +1,27 @@
 test_that("Simulators output epichains objects", {
   expect_s3_class(
     simulate_tree(nchains = 10,
-                  offspring_sampler = "pois",
+                  offspring_dist = "pois",
                   lambda = 2,
-                  chain_statistic = "size",
-                  chain_stat_max = 10
+                  statistic = "size",
+                  stat_max = 10
                   ),
     "epichains"
     )
   expect_s3_class(
     simulate_tree_from_pop(pop = 100,
-                           offspring_sampler = "nbinom",
-                           mean_offspring = 0.5,
-                           disp_offspring = 1.1,
+                           offspring_dist = "nbinom",
+                           offspring_mean = 0.5,
+                           offspring_disp = 1.1,
                            serial_sampler = function(x) 3
     ),
     "epichains"
   )
   expect_s3_class(
     simulate_vect(n = 10,
-                  offspring_sampler = "pois",
+                  offspring_dist = "pois",
                   lambda = 2,
-                  chain_stat_max = 10
+                  stat_max = 10
     ),
     "epichains"
   )

From b1a5eb404ddf6f1fffc4f011896e98bf26da93b1 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 1 Aug 2023 17:58:11 +0100
Subject: [PATCH 518/828] Removed doc file of deleted function

---
 man/find_function_name.Rd | 23 -----------------------
 1 file changed, 23 deletions(-)
 delete mode 100644 man/find_function_name.Rd

diff --git a/man/find_function_name.Rd b/man/find_function_name.Rd
deleted file mode 100644
index d330baed..00000000
--- a/man/find_function_name.Rd
+++ /dev/null
@@ -1,23 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/utils.r
-\name{find_function_name}
-\alias{find_function_name}
-\title{Finds the name of a function passed as an argument}
-\usage{
-find_function_name(fun)
-}
-\arguments{
-\item{fun}{function of which the name is to be determined}
-}
-\value{
-function name
-}
-\description{
-This works even when a function is passed multiple times (e.g., when used
-inside an \code{\link{optim}} call).
-See https://stackoverflow.com/a/46740314/10886760
-}
-\author{
-Sebastian Funk
-}
-\keyword{internal}

From d1b8a3d6a39c939d6ec598845e49a5261285a8a6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 1 Aug 2023 17:58:31 +0100
Subject: [PATCH 519/828] Added details to the head() and tail() methods

---
 man/head.epichains.Rd | 6 ++++++
 man/tail.epichains.Rd | 6 ++++++
 2 files changed, 12 insertions(+)

diff --git a/man/head.epichains.Rd b/man/head.epichains.Rd
index 3ee70b58..7b06d4b4 100644
--- a/man/head.epichains.Rd
+++ b/man/head.epichains.Rd
@@ -17,6 +17,12 @@ object of class \code{data.frame}
 \description{
 \code{head} method for \code{\link{epichains}} class
 }
+\details{
+This returns the top rows of an \code{epichains} object. Note that the object
+is originally sorted by \code{sim_id} and \code{ancestor} and the first
+unknown ancestors (NA) have been dropped from
+printing method. To view the full output, use \verb{as.data.frame(<object_name>)}.
+}
 \author{
 James M. Azam
 }
diff --git a/man/tail.epichains.Rd b/man/tail.epichains.Rd
index d63fc88e..21502c04 100644
--- a/man/tail.epichains.Rd
+++ b/man/tail.epichains.Rd
@@ -14,6 +14,12 @@
 \description{
 \code{tail} method for \code{\link{epichains}} class
 }
+\details{
+This returns the top rows of an \code{epichains} object. Note that the object
+is originally sorted by \code{sim_id} and \code{ancestor} and the first
+unknown ancestors (NA) have been dropped from
+printing method. To view the full output, use \verb{as.data.frame(<object_name>)}.
+}
 \author{
 James M. Azam
 }

From d78607753846d0dc1884ca1d27c21a6910f046a0 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 1 Aug 2023 17:59:21 +0100
Subject: [PATCH 520/828] Replaced the old argument names in the vignette

---
 vignettes/epichains.Rmd | 14 +++++++-------
 1 file changed, 7 insertions(+), 7 deletions(-)

diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index a7b0f485..cecd7863 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -66,26 +66,26 @@ knitr::opts_chunk$set(
 library(epichains)
 # Using `simulate_tree()`
 simulate_tree_eg <- simulate_tree(nchains = 10,
+                                  offspring_dist = "pois",
                                   serials_sampler = function(x) 3,
-                                  offspring_sampler = "pois",
                                   lambda = 2,
-                                  chain_stat_max = 10
+                                  stat_max = 10
                                   )
 
 simulate_tree_eg # print the output
 
 # Using simulate_vect()
-simulate_vect_eg <- simulate_vect(nchains = 10, offspring_sampler = "pois",
-                                  lambda = 2, chain_stat_max = 10)
+simulate_vect_eg <- simulate_vect(nchains = 10, offspring_dist = "pois",
+                                  lambda = 2, stat_max = 10)
 
 simulate_vect_eg # print the output
 
 # Using `simulate_tree_from_pop()`
 
 # Simulate with poisson offspring
-simulate_vect_eg_pois <- simulate_tree_from_pop(pop = 100,
-                                                offspring_sampler = "pois",
-                                                mean_offspring = 0.5,
+simulate_vect_eg_pois <- simulate_tree_from_pop(pop = 10000,
+                                                offspring_dist = "pois",
+                                                offspring_mean = 0.5,
                                                 serial_sampler = function(x) 3
                                                 )
 

From 35038a7752a65376e9c7ea6ddde62cf97cd90b8f Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 1 Aug 2023 18:00:06 +0100
Subject: [PATCH 521/828] Added an example of likelihood estimation to the
 vignette

---
 vignettes/epichains.Rmd | 7 +++++++
 1 file changed, 7 insertions(+)

diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index cecd7863..6f99debd 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -100,6 +100,13 @@ simulate_vect_eg_nbinom <- simulate_tree_from_pop(pop = 100,
                                                   )
 
 simulate_vect_eg_nbinom # print the output
+
+# Likelihoods
+
+chain_sizes <- c(1, 1, 4, 7)
+estimate_likelihood(chains = chain_sizes, statistic = "size",
+                    offspring_dist = "pois", nsim_obs = 100,
+                    lambda = 0.5)
 ```
 
 ### Aggregation

From eec0a08199c0b92154361b6cb2f81f81d1ea3352 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 1 Aug 2023 18:00:21 +0100
Subject: [PATCH 522/828] Replaced the renamed arguments

---
 vignettes/epichains.Rmd | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 6f99debd..4bc177c8 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -93,9 +93,9 @@ simulate_vect_eg_pois # print the output
 
 # Simulate with negative binomial offspring
 simulate_vect_eg_nbinom <- simulate_tree_from_pop(pop = 100,
-                                                  offspring_sampler = "nbinom",
-                                                  mean_offspring = 0.5,
-                                                  disp_offspring = 1.1,
+                                                  offspring_dist = "nbinom",
+                                                  offspring_mean = 0.5,
+                                                  offspring_disp = 1.1,
                                                   serial_sampler = function(x) 3
                                                   )
 

From 0bb63b0f417a4e5d73b853ccd9f563ff1f8093f4 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 7 Aug 2023 17:46:27 +0100
Subject: [PATCH 523/828] Renamed chains_vec/vect to chains_summary

---
 NAMESPACE                                     |  5 ++---
 R/borel.r                                     |  2 +-
 R/epichains.R                                 | 10 +++++-----
 R/simulate.r                                  | 11 ++++++-----
 man/is_chains_summary.Rd                      | 17 +++++++++++++++++
 man/is_chains_vec.Rd                          | 17 -----------------
 man/{simulate_vect.Rd => simulate_summary.Rd} | 12 ++++++------
 man/simulate_tree.Rd                          |  3 ++-
 tests/testthat/tests-sim.r                    |  2 +-
 vignettes/epichains.Rmd                       |  6 +++---
 10 files changed, 43 insertions(+), 42 deletions(-)
 create mode 100644 man/is_chains_summary.Rd
 delete mode 100644 man/is_chains_vec.Rd
 rename man/{simulate_vect.Rd => simulate_summary.Rd} (77%)

diff --git a/NAMESPACE b/NAMESPACE
index 6e29ae0b..708fbf0a 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -7,16 +7,15 @@ S3method(print,epichains)
 S3method(summary,epichains)
 S3method(tail,epichains)
 export(dborel)
-export(estimate_likelihood)
+export(is_chains_summary)
 export(is_chains_tree)
-export(is_chains_vec)
 export(is_epichains)
 export(is_epichains_aggregate_df)
 export(rborel)
 export(rnbinom_mean_disp)
+export(simulate_summary)
 export(simulate_tree)
 export(simulate_tree_from_pop)
-export(simulate_vect)
 export(validate_epichains)
 importFrom(stats,aggregate)
 importFrom(utils,head)
diff --git a/R/borel.r b/R/borel.r
index 9e2747f8..4d67333d 100644
--- a/R/borel.r
+++ b/R/borel.r
@@ -24,7 +24,7 @@ dborel <- function(x, mu, log = FALSE) {
 ##' @author Sebastian Funk
 ##' @export
 rborel <- function(n, mu, infinite = Inf) {
-  simulate_vect(nchains = n,
+  simulate_summary(nchains = n,
                 offspring_dist = "pois",
                 statistic = "size",
                 stat_max = infinite,
diff --git a/R/epichains.R b/R/epichains.R
index 9ca5b310..2cb12368 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -49,7 +49,7 @@ format.epichains <- function(x, ...) {
                        "to view the full output in the console.")
                )
 
-  } else if (is_chains_vec(x)) {
+  } else if (is_chains_summary(x)) {
     writeLines(sprintf("`epichains` object \n"))
     print(as.vector(x))
     writeLines(sprintf("\n Number of chains simulated: %s",
@@ -104,7 +104,7 @@ summary.epichains <- function(object, ...) {
       num_generations = num_generations,
       max_generation = max_generation
     )
-  } else if (is_chains_vec(object)) {
+  } else if (is_chains_summary(object)) {
     chains_ran <- length(object)
 
     if (!all(is.infinite(object))) {
@@ -194,15 +194,15 @@ is_chains_tree <- function(x) {
     attributes(x)$chain_type == "chains_tree"
 }
 
-#' Check if an epichains object has the `chains_vec` attribute
+#' Check if an epichains object has the `chains_summary` attribute
 #'
 #' @param x An [`epichains`] object
 #'
 #' @export
 #' @author James M. Azam
-is_chains_vec <- function(x) {
+is_chains_summary <- function(x) {
   !is.null(attributes(x)$chain_type) &&
-    attributes(x)$chain_type == "chains_vec"
+    attributes(x)$chain_type == "chains_summary"
 }
 
 
diff --git a/R/simulate.r b/R/simulate.r
index f03ebf66..a2c72f78 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -67,7 +67,8 @@
 #' in the `simulate_tree()` call like so
 #' \code{simulate_tree(..., serials_sampler = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
 #' where `...` are the other arguments to `simulate_tree()`.
-#' @seealso [simulate_vect()] for simulating transmission chains as a vector
+#' @seealso [simulate_summary()] for simulating the transmission chains
+#' statistic without the tree of infections.
 #' @examples
 #' set.seed(123)
 #' chains <- simulate_tree(nchains = 10, statistic = "size",
@@ -212,16 +213,16 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 
 
-#' Simulate transmission chains without tree (as a vector)
+#' Simulate a summary of the transmission chain statistic
 #'
 #' @inheritParams simulate_tree
 #' @param stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to `Inf`.
 #' @examples
-#' simulate_vect(nchains = 10, statistic = "size", offspring_dist = "pois",
+#' simulate_summary(nchains = 10, statistic = "size", offspring_dist = "pois",
 #' stat_max = 10, lambda = 2)
 #' @export
-simulate_vect <- function(nchains, statistic = c("size", "length"),
+simulate_summary <- function(nchains, statistic = c("size", "length"),
                           offspring_dist,
                           stat_max = Inf, ...) {
   statistic <- match.arg(statistic)
@@ -271,7 +272,7 @@ simulate_vect <- function(nchains, statistic = c("size", "length"),
 
   structure(
     stat_track,
-    chain_type = "chains_vec",
+    chain_type = "chains_summary",
     chains = nchains,
     class = c("epichains", class(stat_track))
   )
diff --git a/man/is_chains_summary.Rd b/man/is_chains_summary.Rd
new file mode 100644
index 00000000..6a7e0adb
--- /dev/null
+++ b/man/is_chains_summary.Rd
@@ -0,0 +1,17 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{is_chains_summary}
+\alias{is_chains_summary}
+\title{Check if an epichains object has the \code{chains_summary} attribute}
+\usage{
+is_chains_summary(x)
+}
+\arguments{
+\item{x}{An \code{\link{epichains}} object}
+}
+\description{
+Check if an epichains object has the \code{chains_summary} attribute
+}
+\author{
+James M. Azam
+}
diff --git a/man/is_chains_vec.Rd b/man/is_chains_vec.Rd
deleted file mode 100644
index 316f6f53..00000000
--- a/man/is_chains_vec.Rd
+++ /dev/null
@@ -1,17 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/epichains.R
-\name{is_chains_vec}
-\alias{is_chains_vec}
-\title{Check if an epichains object has the \code{chains_vec} attribute}
-\usage{
-is_chains_vec(x)
-}
-\arguments{
-\item{x}{An \code{\link{epichains}} object}
-}
-\description{
-Check if an epichains object has the \code{chains_vec} attribute
-}
-\author{
-James M. Azam
-}
diff --git a/man/simulate_vect.Rd b/man/simulate_summary.Rd
similarity index 77%
rename from man/simulate_vect.Rd
rename to man/simulate_summary.Rd
index 4f2a050f..ee63dcab 100644
--- a/man/simulate_vect.Rd
+++ b/man/simulate_summary.Rd
@@ -1,10 +1,10 @@
 % Generated by roxygen2: do not edit by hand
 % Please edit documentation in R/simulate.r
-\name{simulate_vect}
-\alias{simulate_vect}
-\title{Simulate transmission chains without tree (as a vector)}
+\name{simulate_summary}
+\alias{simulate_summary}
+\title{Simulate a summary of the transmission chain statistic}
 \usage{
-simulate_vect(
+simulate_summary(
   nchains,
   statistic = c("size", "length"),
   offspring_dist,
@@ -32,9 +32,9 @@ computed. Results above the specified value, are set to \code{Inf}.}
 \item{...}{Parameters of the offspring distribution as required by R.}
 }
 \description{
-Simulate transmission chains without tree (as a vector)
+Simulate a summary of the transmission chain statistic
 }
 \examples{
-simulate_vect(nchains = 10, statistic = "size", offspring_dist = "pois",
+simulate_summary(nchains = 10, statistic = "size", offspring_dist = "pois",
 stat_max = 10, lambda = 2)
 }
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index d5e84878..94da6229 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -116,7 +116,8 @@ infectious disease. Am J Epidemiol. 2003 Dec 1;158(11):1039-47.
 doi: 10.1093/aje/kwg251. PMID: 14630599.
 }
 \seealso{
-\code{\link[=simulate_vect]{simulate_vect()}} for simulating transmission chains as a vector
+\code{\link[=simulate_summary]{simulate_summary()}} for simulating the transmission chains
+statistic without the tree of infections.
 }
 \author{
 James M. Azam, Sebastian Funk
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 0911e758..664932cf 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -18,7 +18,7 @@ test_that("Simulators output epichains objects", {
     "epichains"
   )
   expect_s3_class(
-    simulate_vect(n = 10,
+    simulate_summary(n = 10,
                   offspring_dist = "pois",
                   lambda = 2,
                   stat_max = 10
diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 4bc177c8..80466747 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -34,9 +34,9 @@ knitr::opts_chunk$set(
 * `simulate_tree()`: simulate transmission trees from a given number of chains.
 * `simulate_tree_from_pop()`: simulate transmission trees from a given number 
   population size and initial immunity.
-* `simulate_vect()`: simulate a vector of observed transmission chains 
+* `simulate_summary()`: simulate a vector of observed transmission chains 
   sizes/lengths from a given number of chains.
-* `estimate_likelihood()`: estimate the likelihood/loglikelihood of observing
+* `likelihood()`: estimate the likelihood/loglikelihood of observing
   chains of given sizes/lengths.
 
 ### Object-orientation
@@ -47,7 +47,7 @@ knitr::opts_chunk$set(
   * superclass of `data.frame` with attributes for tracking `chain_type` as: 
     * `chains_tree`, if returned from `simulate_tree()` or 
     `simulate_tree_from_pop()`
-    * `chains_vec`, if returned from `simulate_vect()`.
+    * `chains_vec`, if returned from `simulate_summary()`.
 * An `epichains_aggregate_df` class:
   * superclass of `data.frame` with attributes for tracking if aggregation is 
   done over "time", "generation" or "both". Useful for `plot` method dispatch 

From b437d0941bf5526ddb31af03646921d6e34270c9 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 7 Aug 2023 17:47:32 +0100
Subject: [PATCH 524/828] Renamed data to covid19_sa

---
 R/{data.R => covid19_sa.R} | 0
 man/covid19_sa.Rd          | 2 +-
 2 files changed, 1 insertion(+), 1 deletion(-)
 rename R/{data.R => covid19_sa.R} (100%)

diff --git a/R/data.R b/R/covid19_sa.R
similarity index 100%
rename from R/data.R
rename to R/covid19_sa.R
diff --git a/man/covid19_sa.Rd b/man/covid19_sa.Rd
index 1bc989c4..7eeceb3f 100644
--- a/man/covid19_sa.Rd
+++ b/man/covid19_sa.Rd
@@ -1,5 +1,5 @@
 % Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/data.R
+% Please edit documentation in R/covid19_sa.R
 \docType{data}
 \name{covid19_sa}
 \alias{covid19_sa}

From fa853ef9dd5f4d8dd10c7ddd4f2c8890e8d145e9 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 7 Aug 2023 17:49:19 +0100
Subject: [PATCH 525/828] Renamed likelihoods.R to stat_likelihoods.R and
 estimate_likelihood() to likelihood()

---
 NAMESPACE                                     |  2 ++
 R/{likelihood_estimation.R => likelihood.R}   | 14 +++++---------
 R/{likelihoods.R => stat_likelihoods.R}       |  6 +++---
 man/gborel_size_ll.Rd                         |  2 +-
 man/geom_length_ll.Rd                         |  2 +-
 man/{estimate_likelihood.Rd => likelihood.Rd} | 10 +++++-----
 man/nbinom_size_ll.Rd                         |  2 +-
 man/offspring_ll.Rd                           |  3 +--
 man/pois_length_ll.Rd                         |  2 +-
 man/pois_size_ll.Rd                           |  2 +-
 10 files changed, 21 insertions(+), 24 deletions(-)
 rename R/{likelihood_estimation.R => likelihood.R} (88%)
 rename R/{likelihoods.R => stat_likelihoods.R} (97%)
 rename man/{estimate_likelihood.Rd => likelihood.Rd} (91%)

diff --git a/NAMESPACE b/NAMESPACE
index 708fbf0a..ab92359f 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -11,6 +11,8 @@ export(is_chains_summary)
 export(is_chains_tree)
 export(is_epichains)
 export(is_epichains_aggregate_df)
+export(likelihood)
+export(offspring_ll)
 export(rborel)
 export(rnbinom_mean_disp)
 export(simulate_summary)
diff --git a/R/likelihood_estimation.R b/R/likelihood.R
similarity index 88%
rename from R/likelihood_estimation.R
rename to R/likelihood.R
index 42675ff1..22bfc518 100644
--- a/R/likelihood_estimation.R
+++ b/R/likelihood.R
@@ -1,6 +1,6 @@
 #' Estimate the (log) likelihood for observed branching processes
 #'
-#' @inheritParams simulate_vect
+#' @inheritParams simulate_summary
 #' @param chains Vector of sizes/lengths of transmission chains.
 #' @param nsim_obs Number of simulations if the likelihood is to be
 #' approximated for imperfect observations.
@@ -25,16 +25,12 @@
 #' @examples
 #' # example of observed chain sizes
 #' chain_sizes <- c(1, 1, 4, 7)
-#' estimate_likelihood(chains = chain_sizes, statistic = "size",
+#' likelihood(chains = chain_sizes, statistic = "size",
 #'  offspring_dist = "pois", nsim_obs = 100, lambda = 0.5)
 #' @export
-estimate_likelihood <- function(chains,
-                                statistic = c("size", "length"),
-                                offspring_dist,
-                                nsim_obs,
-                                log = TRUE,
-                                obs_prob = 1, stat_max = Inf,
-                                exclude = NULL, individual = FALSE, ...) {
+likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
+                       nsim_obs, log = TRUE, obs_prob = 1, stat_max = Inf,
+                       exclude = NULL, individual = FALSE, ...) {
   statistic <- match.arg(statistic)
 
   ## checks
diff --git a/R/likelihoods.R b/R/stat_likelihoods.R
similarity index 97%
rename from R/likelihoods.R
rename to R/stat_likelihoods.R
index 477d8bae..b1cf9bee 100644
--- a/R/likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -92,7 +92,7 @@ geom_length_ll <- function(x, prob) {
 #' The likelihoods are calculated with a crude approximation using simulated
 #' chains by linearly approximating any missing values in the empirical
 #' cumulative distribution function (ecdf).
-#' @inheritParams estimate_likelihood
+#' @inheritParams likelihood
 #' @inheritParams simulate_vec
 #' @param chains Vector of sizes/lengths
 #' @param nsim_offspring Number of simulations of the offspring distribution
@@ -103,12 +103,12 @@ geom_length_ll <- function(x, prob) {
 #' @return If \code{log = TRUE} (the default), log-likelihood values,
 #' else raw likelihoods
 #' @author Sebastian Funk
-#' @keywords internal
+#' @export
 offspring_ll <- function(chains, offspring_dist, statistic,
                          nsim_offspring = 100, log = TRUE, ...) {
 
   # Simulate the chains
-  chains <- simulate_vect(nsim_offspring, offspring_dist,
+  chains <- simulate_summary(nsim_offspring, offspring_dist,
                           statistic, ...)
 
   # Compute the empirical Cumulative Distribution Function of the
diff --git a/man/gborel_size_ll.Rd b/man/gborel_size_ll.Rd
index 221bf270..618659f2 100644
--- a/man/gborel_size_ll.Rd
+++ b/man/gborel_size_ll.Rd
@@ -1,5 +1,5 @@
 % Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/likelihoods.R
+% Please edit documentation in R/stat_likelihoods.R
 \name{gborel_size_ll}
 \alias{gborel_size_ll}
 \title{Likelihood of the size of chains with gamma-Borel offspring distribution}
diff --git a/man/geom_length_ll.Rd b/man/geom_length_ll.Rd
index bdc6082d..f200df93 100644
--- a/man/geom_length_ll.Rd
+++ b/man/geom_length_ll.Rd
@@ -1,5 +1,5 @@
 % Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/likelihoods.R
+% Please edit documentation in R/stat_likelihoods.R
 \name{geom_length_ll}
 \alias{geom_length_ll}
 \title{Likelihood of the length of chains with geometric offspring distribution}
diff --git a/man/estimate_likelihood.Rd b/man/likelihood.Rd
similarity index 91%
rename from man/estimate_likelihood.Rd
rename to man/likelihood.Rd
index b82f3857..7e6d3dec 100644
--- a/man/estimate_likelihood.Rd
+++ b/man/likelihood.Rd
@@ -1,10 +1,10 @@
 % Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/likelihood_estimation.R
-\name{estimate_likelihood}
-\alias{estimate_likelihood}
+% Please edit documentation in R/likelihood.R
+\name{likelihood}
+\alias{likelihood}
 \title{Estimate the (log) likelihood for observed branching processes}
 \usage{
-estimate_likelihood(
+likelihood(
   chains,
   statistic = c("size", "length"),
   offspring_dist,
@@ -66,7 +66,7 @@ Estimate the (log) likelihood for observed branching processes
 \examples{
 # example of observed chain sizes
 chain_sizes <- c(1, 1, 4, 7)
-estimate_likelihood(chains = chain_sizes, statistic = "size",
+likelihood(chains = chain_sizes, statistic = "size",
  offspring_dist = "pois", nsim_obs = 100, lambda = 0.5)
 }
 \seealso{
diff --git a/man/nbinom_size_ll.Rd b/man/nbinom_size_ll.Rd
index 363ecd30..14003322 100644
--- a/man/nbinom_size_ll.Rd
+++ b/man/nbinom_size_ll.Rd
@@ -1,5 +1,5 @@
 % Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/likelihoods.R
+% Please edit documentation in R/stat_likelihoods.R
 \name{nbinom_size_ll}
 \alias{nbinom_size_ll}
 \title{Likelihood of the size of chains with Negative-Binomial offspring
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index b3ebfda6..65763d76 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -1,5 +1,5 @@
 % Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/likelihoods.R
+% Please edit documentation in R/stat_likelihoods.R
 \name{offspring_ll}
 \alias{offspring_ll}
 \title{Likelihood of the length of chains with generic offspring distribution}
@@ -47,4 +47,3 @@ cumulative distribution function (ecdf).
 \author{
 Sebastian Funk
 }
-\keyword{internal}
diff --git a/man/pois_length_ll.Rd b/man/pois_length_ll.Rd
index 4a767a99..63f6088e 100644
--- a/man/pois_length_ll.Rd
+++ b/man/pois_length_ll.Rd
@@ -1,5 +1,5 @@
 % Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/likelihoods.R
+% Please edit documentation in R/stat_likelihoods.R
 \name{pois_length_ll}
 \alias{pois_length_ll}
 \title{Likelihood of the length of chains with Poisson offspring distribution}
diff --git a/man/pois_size_ll.Rd b/man/pois_size_ll.Rd
index 931b1430..00e662d0 100644
--- a/man/pois_size_ll.Rd
+++ b/man/pois_size_ll.Rd
@@ -1,5 +1,5 @@
 % Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/likelihoods.R
+% Please edit documentation in R/stat_likelihoods.R
 \name{pois_size_ll}
 \alias{pois_size_ll}
 \title{Likelihood of the size of chains with Poisson offspring distribution}

From 53e7823c14787487c66b27a9e6ee3eb068cdc6a5 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 7 Aug 2023 17:50:17 +0100
Subject: [PATCH 526/828] Linting

---
 R/epichains.R | 7 ++++++-
 1 file changed, 6 insertions(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 2cb12368..57493c99 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -221,6 +221,9 @@ is_chains_summary <- function(x) {
 #' printing method. To view the full output, use `as.data.frame(<object_name>)`.
 #'
 head.epichains <- function(x, ...) {
+  if (nrow(x) < 6) {
+    NextMethod()
+    }
   writeLines("< tree head (from first known ancestor) >\n")
   # print head of the simulation output from the first known ancestor
   x <- x[!is.na(x$ancestor), ]
@@ -239,8 +242,10 @@ head.epichains <- function(x, ...) {
 #' is originally sorted by `sim_id` and `ancestor` and the first
 #' unknown ancestors (NA) have been dropped from
 #' printing method. To view the full output, use `as.data.frame(<object_name>)`.
-#'
 tail.epichains <- function(x, ...) {
+  if (nrow(x) < 6) {
+    NextMethod()
+    }
   writeLines("\n< tree tail >\n")
   utils::tail(as.data.frame(x), ...)
 }

From 70659a254298eae4b2343242b7f4a32224680579 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 7 Aug 2023 17:51:26 +0100
Subject: [PATCH 527/828] Removed unnecessary summaries

---
 R/epichains.R | 8 --------
 1 file changed, 8 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 57493c99..ec83ea0a 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -83,16 +83,12 @@ summary.epichains <- function(object, ...) {
 
   if (is_chains_tree(object)) {
 
-    chains_ran <- length(object$n)
-
     max_time <- max(object$time)
 
     n_unique_ancestors <- length(
       unique(object$ancestor[!is.na(object$ancestor)])
     )
 
-    num_generations <- length(unique(object$generation))
-
     max_generation <- max(object$generation)
 
     # out of summary
@@ -100,13 +96,9 @@ summary.epichains <- function(object, ...) {
       unique_chains = chains_ran,
       max_time = max_time,
       unique_ancestors = n_unique_ancestors,
-      unique_generations = n_unique_ancestors,
-      num_generations = num_generations,
       max_generation = max_generation
     )
   } else if (is_chains_summary(object)) {
-    chains_ran <- length(object)
-
     if (!all(is.infinite(object))) {
     max_chain_stat <- max(object[!is.infinite(object)])
     min_chain_stat <- min(object[!is.infinite(object)])

From cc601d30fa80ac5d433e1d0258a2fda961419a93 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 7 Aug 2023 17:52:16 +0100
Subject: [PATCH 528/828] Cleaned up number of chains calculation

---
 R/epichains.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index ec83ea0a..aff3f57b 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -93,7 +93,7 @@ summary.epichains <- function(object, ...) {
 
     # out of summary
     res <- list(
-      unique_chains = chains_ran,
+      chains_ran =  attr(object, "chains", exact = TRUE),
       max_time = max_time,
       unique_ancestors = n_unique_ancestors,
       max_generation = max_generation
@@ -107,7 +107,7 @@ summary.epichains <- function(object, ...) {
     }
 
     res <- list(
-      unique_chains = chains_ran,
+      chain_ran = attr(object, "chains"),
       max_chain_stat = max_chain_stat,
       min_chain_stat = min_chain_stat
     )

From bdcb174d5f6616c71bd820d934e99d31833264d7 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 7 Aug 2023 17:52:51 +0100
Subject: [PATCH 529/828] Revised object names in the vignette

---
 vignettes/epichains.Rmd | 28 ++++++++++++++--------------
 1 file changed, 14 insertions(+), 14 deletions(-)

diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 80466747..3eecfc56 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -63,48 +63,48 @@ knitr::opts_chunk$set(
 
 ### Printing and summary
 ```{r include=TRUE,echo=TRUE}
-library(epichains)
+devtools::load_all()
 # Using `simulate_tree()`
-simulate_tree_eg <- simulate_tree(nchains = 10,
+tree_from_pois_offspring <- simulate_tree(nchains = 10,
                                   offspring_dist = "pois",
                                   serials_sampler = function(x) 3,
                                   lambda = 2,
                                   stat_max = 10
                                   )
 
-simulate_tree_eg # print the output
+tree_from_pois_offspring # print the output
 
-# Using simulate_vect()
-simulate_vect_eg <- simulate_vect(nchains = 10, offspring_dist = "pois",
+# Using simulate_summary()
+summary_sim <- simulate_summary(nchains = 10, offspring_dist = "pois",
                                   lambda = 2, stat_max = 10)
 
-simulate_vect_eg # print the output
+summary_sim # print the output
 
 # Using `simulate_tree_from_pop()`
 
 # Simulate with poisson offspring
-simulate_vect_eg_pois <- simulate_tree_from_pop(pop = 10000,
+tree_from_pop_pois <- simulate_tree_from_pop(pop = 1000,
                                                 offspring_dist = "pois",
                                                 offspring_mean = 0.5,
                                                 serial_sampler = function(x) 3
                                                 )
 
-simulate_vect_eg_pois # print the output
+tree_from_pop_pois # print the output
 
 # Simulate with negative binomial offspring
-simulate_vect_eg_nbinom <- simulate_tree_from_pop(pop = 100,
+tree_from_pop_nbinom <- simulate_tree_from_pop(pop = 1000,
                                                   offspring_dist = "nbinom",
                                                   offspring_mean = 0.5,
                                                   offspring_disp = 1.1,
                                                   serial_sampler = function(x) 3
                                                   )
 
-simulate_vect_eg_nbinom # print the output
+tree_from_pop_nbinom # print the output
 
 # Likelihoods
 
 chain_sizes <- c(1, 1, 4, 7)
-estimate_likelihood(chains = chain_sizes, statistic = "size",
+likelihood(chains = chain_sizes, statistic = "size",
                     offspring_dist = "pois", nsim_obs = 100,
                     lambda = 0.5)
 ```
@@ -113,11 +113,11 @@ estimate_likelihood(chains = chain_sizes, statistic = "size",
 ```{r include=TRUE,echo=TRUE}
 
 # aggregate by time
-aggregate(simulate_vect_eg_pois, "time")
+aggregate(tree_from_pop_pois, "time")
 
 # aggregate by generation
-aggregate(simulate_vect_eg_pois, "generation")
+aggregate(tree_from_pop_pois, "generation")
 
 # aggregate by both time and generation
-aggregate(simulate_vect_eg_pois, "both")
+aggregate(tree_from_pop_pois, "both")
 ```

From 66df96192e139356e8914dbb7e1b17098497d65a Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 7 Aug 2023 18:14:51 +0100
Subject: [PATCH 530/828] Used explicit arguments in the vignette

---
 vignettes/epichains.Rmd | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 3eecfc56..91c49899 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -76,7 +76,8 @@ tree_from_pois_offspring # print the output
 
 # Using simulate_summary()
 summary_sim <- simulate_summary(nchains = 10, offspring_dist = "pois",
-                                  lambda = 2, stat_max = 10)
+                                statistic = "length", lambda = 2, 
+                                stat_max = 10)
 
 summary_sim # print the output
 

From 8cd57b702785dc9948a7888fb09533778fbdc1d4 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 7 Aug 2023 18:15:04 +0100
Subject: [PATCH 531/828] Loaded epichains in the vignette

---
 vignettes/epichains.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 91c49899..d16358ce 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -63,7 +63,7 @@ knitr::opts_chunk$set(
 
 ### Printing and summary
 ```{r include=TRUE,echo=TRUE}
-devtools::load_all()
+library(epichains)
 # Using `simulate_tree()`
 tree_from_pois_offspring <- simulate_tree(nchains = 10,
                                   offspring_dist = "pois",

From 9603af408df3d96f8e10085eb3cbc99277d5853f Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 7 Aug 2023 18:16:04 +0100
Subject: [PATCH 532/828] Fixed the duplicated chains_ran calculation

---
 R/epichains.R | 6 ++++--
 1 file changed, 4 insertions(+), 2 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index aff3f57b..518cb8c9 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -81,6 +81,8 @@ format.epichains <- function(x, ...) {
 summary.epichains <- function(object, ...) {
   validate_epichains(object)
 
+  chains_ran <- attr(object, "chains", exact = TRUE)
+
   if (is_chains_tree(object)) {
 
     max_time <- max(object$time)
@@ -93,7 +95,7 @@ summary.epichains <- function(object, ...) {
 
     # out of summary
     res <- list(
-      chains_ran =  attr(object, "chains", exact = TRUE),
+      chains_ran =  chains_ran,
       max_time = max_time,
       unique_ancestors = n_unique_ancestors,
       max_generation = max_generation
@@ -107,7 +109,7 @@ summary.epichains <- function(object, ...) {
     }
 
     res <- list(
-      chain_ran = attr(object, "chains"),
+      chain_ran = chains_ran,
       max_chain_stat = max_chain_stat,
       min_chain_stat = min_chain_stat
     )

From 842a8877ca28f33d2e3d7e5ab0319adf3364a48c Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 7 Aug 2023 18:16:42 +0100
Subject: [PATCH 533/828] Cleaned up the summary method

---
 R/epichains.R | 8 +++++---
 1 file changed, 5 insertions(+), 3 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 518cb8c9..35682764 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -35,11 +35,11 @@ format.epichains <- function(x, ...) {
       c(
         sprintf("Chains simulated: %s", chain_info[["chains"]]),
         sprintf(
-          "Unique number of ancestors: %s",
+          "Number of ancestors (known): %s",
           chain_info[["unique_ancestors"]]
         ),
         sprintf(
-          "Unique number of generations: %s", chain_info[["unique_generations"]]
+          "Number of generations: %s", chain_info[["max_generation"]]
         )
       )
     )
@@ -58,7 +58,9 @@ format.epichains <- function(x, ...) {
         )
     writeLines(
       c(
-        "\n Simulated chain stats: \n",
+        sprintf("\n Simulated chain %ss: \n",
+                attr(x, "statistic", exact = TRUE)
+                ),
         sprintf("Max: %s", chain_info[["max_chain_stat"]]),
         sprintf("Min: %s", chain_info[["min_chain_stat"]])
       )

From d926d01f58469fc8c42488f0ee68cd0f37ab1925 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 7 Aug 2023 18:17:41 +0100
Subject: [PATCH 534/828] Added the statistic argument to the attributes
 returned by simulate_summary

---
 R/simulate.r | 1 +
 1 file changed, 1 insertion(+)

diff --git a/R/simulate.r b/R/simulate.r
index a2c72f78..4de6d2e5 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -273,6 +273,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
   structure(
     stat_track,
     chain_type = "chains_summary",
+    statistic = statistic,
     chains = nchains,
     class = c("epichains", class(stat_track))
   )

From f68af54eb9cd1a82d11179250334df330e545c50 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 7 Aug 2023 18:21:43 +0100
Subject: [PATCH 535/828] Changed the chains simulated to 50 from 10

---
 vignettes/epichains.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index d16358ce..5becabaf 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -75,7 +75,7 @@ tree_from_pois_offspring <- simulate_tree(nchains = 10,
 tree_from_pois_offspring # print the output
 
 # Using simulate_summary()
-summary_sim <- simulate_summary(nchains = 10, offspring_dist = "pois",
+summary_sim <- simulate_summary(nchains = 50, offspring_dist = "pois",
                                 statistic = "length", lambda = 2, 
                                 stat_max = 10)
 

From 647127853a8bbcf6c87bdd9744c9e198429cec59 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 7 Aug 2023 18:22:27 +0100
Subject: [PATCH 536/828] Fixed an error in the chains simulated summary
 extraction

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 35682764..a2b7e656 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -33,7 +33,7 @@ format.epichains <- function(x, ...) {
     # print summary information
     writeLines(
       c(
-        sprintf("Chains simulated: %s", chain_info[["chains"]]),
+        sprintf("Chains simulated: %s", chain_info[["chains_ran"]]),
         sprintf(
           "Number of ancestors (known): %s",
           chain_info[["unique_ancestors"]]

From 8ed354bdfc23f93a9a427b44bf00209593de4d9d Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Tue, 8 Aug 2023 18:11:04 +0100
Subject: [PATCH 537/828] Remove NextMethod() calls

---
 R/epichains.R | 6 ------
 1 file changed, 6 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index a2b7e656..0148f0da 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -217,9 +217,6 @@ is_chains_summary <- function(x) {
 #' printing method. To view the full output, use `as.data.frame(<object_name>)`.
 #'
 head.epichains <- function(x, ...) {
-  if (nrow(x) < 6) {
-    NextMethod()
-    }
   writeLines("< tree head (from first known ancestor) >\n")
   # print head of the simulation output from the first known ancestor
   x <- x[!is.na(x$ancestor), ]
@@ -239,9 +236,6 @@ head.epichains <- function(x, ...) {
 #' unknown ancestors (NA) have been dropped from
 #' printing method. To view the full output, use `as.data.frame(<object_name>)`.
 tail.epichains <- function(x, ...) {
-  if (nrow(x) < 6) {
-    NextMethod()
-    }
   writeLines("\n< tree tail >\n")
   utils::tail(as.data.frame(x), ...)
 }

From 6edb2d76aedf1bf760e8eb67fa6c911d8d8a7dd5 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Thu, 31 Aug 2023 16:33:46 +0100
Subject: [PATCH 538/828] Improved error message

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 R/epichains.R | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 0148f0da..b9b117be 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -277,7 +277,10 @@ aggregate.epichains <- function(x,
   validate_epichains(x)
   # Check that the object is of type "chains_tree"
   if (!is_chains_tree(x)) {
-    stop("object must be an epichains object with 'chains_tree' attribute.")
+    stop(
+      "object must be an epichains object with 'chains_tree' attribute, ",
+      "which can be generated using the `simulate_tree()` function."
+    )
   }
 
   # Get grouping variable

From 8188b6380d3e955f9c7f151003c7c07ff4f21f2c Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Thu, 31 Aug 2023 17:09:25 +0100
Subject: [PATCH 539/828] Cleaned up the changelog

---
 NEWS.md | 43 ++++++++++++++++++++++++-------------------
 1 file changed, 24 insertions(+), 19 deletions(-)

diff --git a/NEWS.md b/NEWS.md
index e83021b4..228aab65 100644
--- a/NEWS.md
+++ b/NEWS.md
@@ -3,32 +3,37 @@
 ## Package name change
 
 * `epichains` is a re-implementation of `bpmodels` with a focus on providing
-a dedicated class of data structures for easy manipulation and interoperability
-with other new tools in the pipeline.
+  a dedicated class of data structures for easy manipulation and interoperability
+  with other new tools in the pipeline.
 
-### Features
+### Functions
 
 * `simulate_tree()`: simulate transmission trees from a given number of chains.
-* `simulate_tree_from_pop()`: simulate transmission trees from a given number 
+* `simulate_tree_from_pop()`: simulate transmission trees from a given 
   population size and initial immunity.
-* `simulate_vect()`: simulate a vector of observed transmission chains 
+* `simulate_summary()`: simulate a vector of observed transmission chains 
   sizes/lengths from a given number of chains.
-* `estimate_likelihood()`: estimate the likelihood/loglikelihood of observing
+* `likelihood()`: estimate the likelihood/loglikelihood of observing
   chains of given sizes/lengths.
 
-#### Classes
-
-* An `epichains` class:
-  * superclass of `data.frame` with attributes for tracking `chain_type` as: 
-    * `chains_tree`, if returned from `simulate_tree()` or 
-    `simulate_tree_from_pop()`
-    * `chains_vec`, if returned from `simulate_vect()`.
-* An `epichains_aggregate_df` class:
-  * superclass of `data.frame` with attributes for tracking if aggregation is 
-  done over "time", "generation" or "both". Useful for `plot` method dispatch 
-  (see methods section below).
-
-#### Methods
+### Classes
+
+* An `epichains` class, which inherits from `data.frame` with attributes for
+  tracking:
+  - `chains`: number of chains simulated
+  - `chain_type`:
+    - `chains_tree`, if returned from `simulate_tree()` or 
+      `simulate_tree_from_pop()`
+    - `chains_summary`, if returned from `simulate_summary()`.
+  - `track_pop`: whether the susceptible population is tracked or not.
+* An `epichains_aggregate_df` class, which inherits from `data.frame` with
+  attributes for tracking:
+  - `chain_type`: as defined above, and
+  - `aggregated_over`: the variable(s) over which aggregation was done: "time",
+  "generation" or "both". Useful for easy plotting with the `plot` method (see
+  methods section below).
+
+### Methods
 
 * `print()`
 * `summary()`

From bcd1bf0bfcb30ae496602410d522f0438a20df27 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Sep 2023 14:29:00 +0100
Subject: [PATCH 540/828] Enable indentation_linter

---
 .lintr | 1 -
 1 file changed, 1 deletion(-)

diff --git a/.lintr b/.lintr
index e2ca0f34..a98cf968 100644
--- a/.lintr
+++ b/.lintr
@@ -6,7 +6,6 @@ linters: linters_with_tags(
     extraction_operator_linter = NULL,
     todo_comment_linter = NULL,
     function_argument_linter = NULL,
-    indentation_linter = NULL, # unstable as of lintr 3.1.0
     # Use minimum R declared in DESCRIPTION or fall back to current R version.
     # Install etdev package from https://github.com/epiverse-trace/etdev
     backport_linter(if (length(x <- etdev::extract_min_r_version())) x else getRversion())

From 9bb4c957eb121e5d639ec8f1b807857d3711202a Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Sep 2023 14:29:38 +0100
Subject: [PATCH 541/828] Run styler to align with tidyverse style guide

---
 R/borel.r                  | 13 +++---
 R/checks.R                 |  2 +-
 R/epichains.R              | 78 ++++++++++++++++++++---------------
 R/likelihood.R             | 19 +++++----
 R/simulate.r               | 84 +++++++++++++++++++++-----------------
 R/stat_likelihoods.R       |  8 ++--
 README.Rmd                 |  2 +-
 tests/spelling.R           |  9 ++--
 tests/testthat/tests-sim.r | 35 ++++++++--------
 vignettes/epichains.Rmd    | 54 +++++++++++++-----------
 10 files changed, 171 insertions(+), 133 deletions(-)

diff --git a/R/borel.r b/R/borel.r
index 4d67333d..1ec154d9 100644
--- a/R/borel.r
+++ b/R/borel.r
@@ -24,10 +24,11 @@ dborel <- function(x, mu, log = FALSE) {
 ##' @author Sebastian Funk
 ##' @export
 rborel <- function(n, mu, infinite = Inf) {
-  simulate_summary(nchains = n,
-                offspring_dist = "pois",
-                statistic = "size",
-                stat_max = infinite,
-                lambda = mu
-                )
+  simulate_summary(
+    nchains = n,
+    offspring_dist = "pois",
+    statistic = "size",
+    stat_max = infinite,
+    lambda = mu
+  )
 }
diff --git a/R/checks.R b/R/checks.R
index 53123296..a78a5516 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -24,7 +24,7 @@ check_offspring_valid <- function(offspring_dist) {
 #' @keywords internal
 check_offspring_func_valid <- function(roffspring_name) {
   if (!(exists(roffspring_name)) ||
-      !checkmate::test_function(get(roffspring_name))) {
+        !checkmate::test_function(get(roffspring_name))) {
     stop("Function ", roffspring_name, " does not exist.")
   }
 }
diff --git a/R/epichains.R b/R/epichains.R
index b9b117be..45254c7e 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -45,22 +45,23 @@ format.epichains <- function(x, ...) {
     )
 
     # Offer more information to view the full dataset
-    writeLines(sprintf("%s %s", "Use `as.data.frame(<object_name>)`",
-                       "to view the full output in the console.")
-               )
-
+    writeLines(sprintf(
+      "%s %s", "Use `as.data.frame(<object_name>)`",
+      "to view the full output in the console."
+    ))
   } else if (is_chains_summary(x)) {
     writeLines(sprintf("`epichains` object \n"))
     print(as.vector(x))
-    writeLines(sprintf("\n Number of chains simulated: %s",
-                chain_info[["unique_chains"]]
-                )
-        )
+    writeLines(sprintf(
+      "\n Number of chains simulated: %s",
+      chain_info[["unique_chains"]]
+    ))
     writeLines(
       c(
-        sprintf("\n Simulated chain %ss: \n",
-                attr(x, "statistic", exact = TRUE)
-                ),
+        sprintf(
+          "\n Simulated chain %ss: \n",
+          attr(x, "statistic", exact = TRUE)
+        ),
         sprintf("Max: %s", chain_info[["max_chain_stat"]]),
         sprintf("Min: %s", chain_info[["min_chain_stat"]])
       )
@@ -86,7 +87,6 @@ summary.epichains <- function(object, ...) {
   chains_ran <- attr(object, "chains", exact = TRUE)
 
   if (is_chains_tree(object)) {
-
     max_time <- max(object$time)
 
     n_unique_ancestors <- length(
@@ -97,17 +97,17 @@ summary.epichains <- function(object, ...) {
 
     # out of summary
     res <- list(
-      chains_ran =  chains_ran,
+      chains_ran = chains_ran,
       max_time = max_time,
       unique_ancestors = n_unique_ancestors,
       max_generation = max_generation
     )
   } else if (is_chains_summary(object)) {
     if (!all(is.infinite(object))) {
-    max_chain_stat <- max(object[!is.infinite(object)])
-    min_chain_stat <- min(object[!is.infinite(object)])
+      max_chain_stat <- max(object[!is.infinite(object)])
+      min_chain_stat <- min(object[!is.infinite(object)])
     } else {
-    max_chain_stat <- min_chain_stat <- Inf
+      max_chain_stat <- min_chain_stat <- Inf
     }
 
     res <- list(
@@ -115,7 +115,7 @@ summary.epichains <- function(object, ...) {
       max_chain_stat = max_chain_stat,
       min_chain_stat = min_chain_stat
     )
-    }
+  }
 
   return(res)
 }
@@ -161,7 +161,7 @@ validate_epichains <- function(x) {
     stopifnot(
       "object does not contain the correct columns" =
         c("sim_id", "ancestor", "generation") %in%
-          colnames(x),
+        colnames(x),
       "column `sim_id` must be a numeric" =
         is.numeric(x$sim_id),
       "column `ancestor` must be a numeric" =
@@ -255,9 +255,11 @@ tail.epichains <- function(x, ...) {
 #' @author James M. Azam
 #' @examples
 #' set.seed(123)
-#' chains <- simulate_tree(nchains = 10, statistic = "size",
-#' offspring_dist = "pois", stat_max = 10, serials_sampler = function(x) 3,
-#' lambda = 2)
+#' chains <- simulate_tree(
+#'   nchains = 10, statistic = "size",
+#'   offspring_dist = "pois", stat_max = 10, serials_sampler = function(x) 3,
+#'   lambda = 2
+#' )
 #' chains
 #'
 #' # Aggregate cases per time
@@ -269,10 +271,11 @@ tail.epichains <- function(x, ...) {
 #' # Aggregate cases per both time and generation
 #' aggregate(chains, grouping_var = "both")
 aggregate.epichains <- function(x,
-                                grouping_var = c("time",
-                                                 "generation",
-                                                 "both"
-                                                 ),
+                                grouping_var = c(
+                                  "time",
+                                  "generation",
+                                  "both"
+                                ),
                                 ...) {
   validate_epichains(x)
   # Check that the object is of type "chains_tree"
@@ -288,30 +291,37 @@ aggregate.epichains <- function(x,
 
   out <- if (grouping_var == "time") {
     # Count the number of cases per generation
-    stats::aggregate(list(cases = x$sim_id),
+    stats::aggregate(
+      list(cases = x$sim_id),
       list(time = x$time),
       FUN = NROW
     )
   } else if (grouping_var == "generation") {
     # Count the number of cases per time
-    stats::aggregate(list(cases = x$sim_id),
+    stats::aggregate(
+      list(cases = x$sim_id),
       list(generation = x$generation),
       FUN = NROW
     )
   } else if (grouping_var == "both") {
     # Count the number of cases per time
     list(
-      stats::aggregate(list(cases = x$sim_id),
-                       list(time = x$time),
-                       FUN = NROW),
+      stats::aggregate(
+        list(cases = x$sim_id),
+        list(time = x$time),
+        FUN = NROW
+      ),
       # Count the number of cases per generation
-      stats::aggregate(list(cases = x$sim_id),
-                       list(generation = x$generation),
-                       FUN = NROW)
+      stats::aggregate(
+        list(cases = x$sim_id),
+        list(generation = x$generation),
+        FUN = NROW
+      )
     )
   }
 
-  structure(out,
+  structure(
+    out,
     class = c("epichains_aggregate_df", class(out)),
     chain_type = attributes(x)$chain_type,
     rownames = NULL,
diff --git a/R/likelihood.R b/R/likelihood.R
index 22bfc518..a9f566f0 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -25,8 +25,10 @@
 #' @examples
 #' # example of observed chain sizes
 #' chain_sizes <- c(1, 1, 4, 7)
-#' likelihood(chains = chain_sizes, statistic = "size",
-#'  offspring_dist = "pois", nsim_obs = 100, lambda = 0.5)
+#' likelihood(
+#'   chains = chain_sizes, statistic = "size",
+#'   offspring_dist = "pois", nsim_obs = 100, lambda = 0.5
+#' )
 #' @export
 likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
                        nsim_obs, log = TRUE, obs_prob = 1, stat_max = Inf,
@@ -44,14 +46,17 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
 
     sample_func <- get_statistic_func(statistic)
 
-    sampled_x <- replicate(nsim_obs, pmin(sample_func(length(chains),
-                                           chains, obs_prob
-                                           ),
-                               stat_max), simplify = FALSE)
+    sampled_x <- replicate(nsim_obs, pmin(
+      sample_func(
+        length(chains),
+        chains, obs_prob
+      ),
+      stat_max
+    ), simplify = FALSE)
     size_x <- unlist(sampled_x)
     if (!is.finite(stat_max)) {
       stat_max <- max(size_x) + 1
-      }
+    }
   } else {
     chains[chains >= stat_max] <- stat_max
     size_x <- chains
diff --git a/R/simulate.r b/R/simulate.r
index 4de6d2e5..ea109275 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -71,9 +71,11 @@
 #' statistic without the tree of infections.
 #' @examples
 #' set.seed(123)
-#' chains <- simulate_tree(nchains = 10, statistic = "size",
-#' offspring_dist = "pois", stat_max = 10, serials_sampler = function(x) 3,
-#' lambda = 2)
+#' chains <- simulate_tree(
+#'   nchains = 10, statistic = "size",
+#'   offspring_dist = "pois", stat_max = 10, serials_sampler = function(x) 3,
+#'   lambda = 2
+#' )
 #' @references
 #'
 #' Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
@@ -143,9 +145,11 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
     n_offspring[sim] <- tapply(next_gen, indices, sum)
 
     # track size/length
-    stat_track <- update_chain_stat(stat_type = statistic,
-                                    stat_latest = stat_track,
-                                    n_offspring = n_offspring)
+    stat_track <- update_chain_stat(
+      stat_type = statistic,
+      stat_latest = stat_track,
+      n_offspring = n_offspring
+    )
 
     # record times/ancestors
     if (sum(n_offspring[sim]) > 0) {
@@ -188,17 +192,17 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
         sim <- intersect(sim, unique(indices)[current_min_time < tf])
       }
       if (!missing(serials_sampler)) {
-          times <- times[indices %in% sim]
-          }
-        ancestor_ids <- ids[indices %in% sim]
-    }
+        times <- times[indices %in% sim]
+      }
+      ancestor_ids <- ids[indices %in% sim]
     }
+  }
 
   if (!missing(tf)) {
     tree_df <- tree_df[tree_df$time < tf, ]
   }
 
-  #sort by sim_id and ancestor
+  # sort by sim_id and ancestor
   tree_df <- tree_df[order(tree_df$sim_id, tree_df$ancestor), ]
 
   structure(
@@ -219,12 +223,14 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 #' @param stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to `Inf`.
 #' @examples
-#' simulate_summary(nchains = 10, statistic = "size", offspring_dist = "pois",
-#' stat_max = 10, lambda = 2)
+#' simulate_summary(
+#'   nchains = 10, statistic = "size", offspring_dist = "pois",
+#'   stat_max = 10, lambda = 2
+#' )
 #' @export
 simulate_summary <- function(nchains, statistic = c("size", "length"),
-                          offspring_dist,
-                          stat_max = Inf, ...) {
+                             offspring_dist,
+                             stat_max = Inf, ...) {
   statistic <- match.arg(statistic)
 
   check_nchains_valid(nchains = nchains)
@@ -258,10 +264,11 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
     n_offspring[sim] <- tapply(next_gen, indices, sum)
 
     # track size/length
-    stat_track <- update_chain_stat(stat_type = statistic,
-                                    stat_latest = stat_track,
-                                    n_offspring = n_offspring
-                                    )
+    stat_track <- update_chain_stat(
+      stat_type = statistic,
+      stat_latest = stat_track,
+      n_offspring = n_offspring
+    )
 
     ## only continue to simulate chains that offspring and aren't of
     ## stat_max size/length
@@ -325,12 +332,16 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' @author James M. Azam
 #' @examples
 #' # Simulate with poisson offspring
-#' simulate_tree_from_pop(pop = 100, offspring_dist = "pois",
-#' offspring_mean = 0.5, serial_sampler = function(x) 3)
+#' simulate_tree_from_pop(
+#'   pop = 100, offspring_dist = "pois",
+#'   offspring_mean = 0.5, serial_sampler = function(x) 3
+#' )
 #'
 #' # Simulate with negative binomial offspring
-#' simulate_tree_from_pop(pop = 100, offspring_dist = "nbinom",
-#' offspring_mean = 0.5, offspring_disp = 1.1, serial_sampler = function(x) 3)
+#' simulate_tree_from_pop(
+#'   pop = 100, offspring_dist = "nbinom",
+#'   offspring_mean = 0.5, offspring_disp = 1.1, serial_sampler = function(x) 3
+#' )
 #' @export
 simulate_tree_from_pop <- function(pop,
                                    offspring_dist = c("pois", "nbinom"),
@@ -344,26 +355,26 @@ simulate_tree_from_pop <- function(pop,
 
   if (offspring_dist == "pois") {
     if (!missing(offspring_disp)) {
-      warning(sprintf("%s %s %s",
-                      "'offspring_disp' is not used for",
-                      "poisson offspring distribution.",
-                      "Will be ignored."
-                      )
-              )
+      warning(sprintf(
+        "%s %s %s",
+        "'offspring_disp' is not used for",
+        "poisson offspring distribution.",
+        "Will be ignored."
+      ))
     }
 
     ## using a right truncated poisson distribution
     ## to avoid more cases than susceptibles
     offspring_fun <- get_offspring_func(offspring_dist)
-
   } else if (offspring_dist == "nbinom") {
     if (missing(offspring_disp)) {
       stop(sprintf("%s", "'offspring_disp' must be specified."))
     } else if (offspring_disp <= 1) { ## dispersion coefficient
-      stop(sprintf("%s %s %s",
-                   "Offspring distribution 'nbinom' requires",
-                   "argument 'offspring_disp' > 1.",
-                   "Use 'pois' if there is no overdispersion."
+      stop(sprintf(
+        "%s %s %s",
+        "Offspring distribution 'nbinom' requires",
+        "argument 'offspring_disp' > 1.",
+        "Use 'pois' if there is no overdispersion."
       ))
     }
     offspring_fun <- get_offspring_func(offspring_dist)
@@ -375,7 +386,7 @@ simulate_tree_from_pop <- function(pop,
     ancestor = NA_integer_,
     generation = 1L,
     time = t0,
-    offspring_generated = FALSE #used to track simulation and dropped afterwards
+    offspring_generated = FALSE # tracks simulation and dropped afterwards
   )
 
   susc <- pop - initial_immune - 1L
@@ -384,7 +395,6 @@ simulate_tree_from_pop <- function(pop,
   ## continue if any unsimulated chains have t <= tf
   ## AND there is still susceptibles left
   while (any(tree_df$time[!tree_df$offspring_generated] <= tf) && susc > 0) {
-
     ## select from which case to generate offspring
     t <- min(tree_df$time[!tree_df$offspring_generated]) # lowest unsimulated t
 
@@ -434,7 +444,7 @@ simulate_tree_from_pop <- function(pop,
   ## have been generated in the last generation
   tree_df <- tree_df[tree_df$time <= tf, ]
 
-  #sort by sim_id and ancestor
+  # sort by sim_id and ancestor
   tree_df <- tree_df[order(tree_df$sim_id, tree_df$ancestor), ]
   tree_df$offspring_generated <- NULL
 
diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index b1cf9bee..f54b7736 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -60,7 +60,6 @@ gborel_size_ll <- function(x, size, prob, mu) {
 #' @author Sebastian Funk
 #' @keywords internal
 pois_length_ll <- function(x, lambda) {
-
   ## iterated exponential function
   arg <- exp(lambda * exp(-lambda))
   itex <- 1
@@ -106,10 +105,11 @@ geom_length_ll <- function(x, prob) {
 #' @export
 offspring_ll <- function(chains, offspring_dist, statistic,
                          nsim_offspring = 100, log = TRUE, ...) {
-
   # Simulate the chains
-  chains <- simulate_summary(nsim_offspring, offspring_dist,
-                          statistic, ...)
+  chains <- simulate_summary(
+    nsim_offspring, offspring_dist,
+    statistic, ...
+  )
 
   # Compute the empirical Cumulative Distribution Function of the
   # simulated chains
diff --git a/README.Rmd b/README.Rmd
index 88ded55e..4ce2ff85 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -52,7 +52,7 @@ The latest development version of the _{{ packagename }}_ package can be install
 
 ```{r include=TRUE,eval=FALSE}
 # check whether {pak} is installed
-if(!require("pak")) install.packages("pak")
+if (!require("pak")) install.packages("pak")
 pak::pak("{{ gh_repo }}")
 ```
 
diff --git a/tests/spelling.R b/tests/spelling.R
index 33ef2c73..13f77d96 100644
--- a/tests/spelling.R
+++ b/tests/spelling.R
@@ -1,3 +1,6 @@
-if (requireNamespace("spelling", quietly = TRUE))
-  spelling::spell_check_test(vignettes = TRUE, error = FALSE,
-                             skip_on_cran = TRUE)
+if (requireNamespace("spelling", quietly = TRUE)) {
+  spelling::spell_check_test(
+    vignettes = TRUE, error = FALSE,
+    skip_on_cran = TRUE
+  )
+}
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 664932cf..30dca41a 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -1,27 +1,30 @@
 test_that("Simulators output epichains objects", {
   expect_s3_class(
-    simulate_tree(nchains = 10,
-                  offspring_dist = "pois",
-                  lambda = 2,
-                  statistic = "size",
-                  stat_max = 10
-                  ),
+    simulate_tree(
+      nchains = 10,
+      offspring_dist = "pois",
+      lambda = 2,
+      statistic = "size",
+      stat_max = 10
+    ),
     "epichains"
-    )
+  )
   expect_s3_class(
-    simulate_tree_from_pop(pop = 100,
-                           offspring_dist = "nbinom",
-                           offspring_mean = 0.5,
-                           offspring_disp = 1.1,
-                           serial_sampler = function(x) 3
+    simulate_tree_from_pop(
+      pop = 100,
+      offspring_dist = "nbinom",
+      offspring_mean = 0.5,
+      offspring_disp = 1.1,
+      serial_sampler = function(x) 3
     ),
     "epichains"
   )
   expect_s3_class(
-    simulate_summary(n = 10,
-                  offspring_dist = "pois",
-                  lambda = 2,
-                  stat_max = 10
+    simulate_summary(
+      n = 10,
+      offspring_dist = "pois",
+      lambda = 2,
+      stat_max = 10
     ),
     "epichains"
   )
diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 5becabaf..1089f456 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -65,54 +65,60 @@ knitr::opts_chunk$set(
 ```{r include=TRUE,echo=TRUE}
 library(epichains)
 # Using `simulate_tree()`
-tree_from_pois_offspring <- simulate_tree(nchains = 10,
-                                  offspring_dist = "pois",
-                                  serials_sampler = function(x) 3,
-                                  lambda = 2,
-                                  stat_max = 10
-                                  )
+tree_from_pois_offspring <- simulate_tree(
+  nchains = 10,
+  offspring_dist = "pois",
+  serials_sampler = function(x) 3,
+  lambda = 2,
+  stat_max = 10
+)
 
 tree_from_pois_offspring # print the output
 
 # Using simulate_summary()
-summary_sim <- simulate_summary(nchains = 50, offspring_dist = "pois",
-                                statistic = "length", lambda = 2, 
-                                stat_max = 10)
+summary_sim <- simulate_summary(
+  nchains = 50, offspring_dist = "pois",
+  statistic = "length", lambda = 2,
+  stat_max = 10
+)
 
 summary_sim # print the output
 
 # Using `simulate_tree_from_pop()`
 
 # Simulate with poisson offspring
-tree_from_pop_pois <- simulate_tree_from_pop(pop = 1000,
-                                                offspring_dist = "pois",
-                                                offspring_mean = 0.5,
-                                                serial_sampler = function(x) 3
-                                                )
+tree_from_pop_pois <- simulate_tree_from_pop(
+  pop = 1000,
+  offspring_dist = "pois",
+  offspring_mean = 0.5,
+  serial_sampler = function(x) 3
+)
 
 tree_from_pop_pois # print the output
 
 # Simulate with negative binomial offspring
-tree_from_pop_nbinom <- simulate_tree_from_pop(pop = 1000,
-                                                  offspring_dist = "nbinom",
-                                                  offspring_mean = 0.5,
-                                                  offspring_disp = 1.1,
-                                                  serial_sampler = function(x) 3
-                                                  )
+tree_from_pop_nbinom <- simulate_tree_from_pop(
+  pop = 1000,
+  offspring_dist = "nbinom",
+  offspring_mean = 0.5,
+  offspring_disp = 1.1,
+  serial_sampler = function(x) 3
+)
 
 tree_from_pop_nbinom # print the output
 
 # Likelihoods
 
 chain_sizes <- c(1, 1, 4, 7)
-likelihood(chains = chain_sizes, statistic = "size",
-                    offspring_dist = "pois", nsim_obs = 100,
-                    lambda = 0.5)
+likelihood(
+  chains = chain_sizes, statistic = "size",
+  offspring_dist = "pois", nsim_obs = 100,
+  lambda = 0.5
+)
 ```
 
 ### Aggregation
 ```{r include=TRUE,echo=TRUE}
-
 # aggregate by time
 aggregate(tree_from_pop_pois, "time")
 

From 4c89189a08e8de7dcd3f373545c18547ab69749b Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Sep 2023 17:06:13 +0100
Subject: [PATCH 542/828] Regenerated docs

---
 man/aggregate.epichains.Rd    |  8 +++++---
 man/likelihood.Rd             |  6 ++++--
 man/simulate_summary.Rd       |  6 ++++--
 man/simulate_tree.Rd          |  8 +++++---
 man/simulate_tree_from_pop.Rd | 12 ++++++++----
 5 files changed, 26 insertions(+), 14 deletions(-)

diff --git a/man/aggregate.epichains.Rd b/man/aggregate.epichains.Rd
index 7d0bc0b2..ab761c6e 100644
--- a/man/aggregate.epichains.Rd
+++ b/man/aggregate.epichains.Rd
@@ -25,9 +25,11 @@ Aggregate cases in epichains objects according to a grouping variable
 }
 \examples{
 set.seed(123)
-chains <- simulate_tree(nchains = 10, statistic = "size",
-offspring_dist = "pois", stat_max = 10, serials_sampler = function(x) 3,
-lambda = 2)
+chains <- simulate_tree(
+  nchains = 10, statistic = "size",
+  offspring_dist = "pois", stat_max = 10, serials_sampler = function(x) 3,
+  lambda = 2
+)
 chains
 
 # Aggregate cases per time
diff --git a/man/likelihood.Rd b/man/likelihood.Rd
index 7e6d3dec..3bd9e76e 100644
--- a/man/likelihood.Rd
+++ b/man/likelihood.Rd
@@ -66,8 +66,10 @@ Estimate the (log) likelihood for observed branching processes
 \examples{
 # example of observed chain sizes
 chain_sizes <- c(1, 1, 4, 7)
-likelihood(chains = chain_sizes, statistic = "size",
- offspring_dist = "pois", nsim_obs = 100, lambda = 0.5)
+likelihood(
+  chains = chain_sizes, statistic = "size",
+  offspring_dist = "pois", nsim_obs = 100, lambda = 0.5
+)
 }
 \seealso{
 offspring_ll, pois_size_ll, nbinom_size_ll, gborel_size_ll,
diff --git a/man/simulate_summary.Rd b/man/simulate_summary.Rd
index ee63dcab..00ea66c2 100644
--- a/man/simulate_summary.Rd
+++ b/man/simulate_summary.Rd
@@ -35,6 +35,8 @@ computed. Results above the specified value, are set to \code{Inf}.}
 Simulate a summary of the transmission chain statistic
 }
 \examples{
-simulate_summary(nchains = 10, statistic = "size", offspring_dist = "pois",
-stat_max = 10, lambda = 2)
+simulate_summary(
+  nchains = 10, statistic = "size", offspring_dist = "pois",
+  stat_max = 10, lambda = 2
+)
 }
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index 94da6229..1228087e 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -100,9 +100,11 @@ where \code{...} are the other arguments to \code{simulate_tree()}.
 
 \examples{
 set.seed(123)
-chains <- simulate_tree(nchains = 10, statistic = "size",
-offspring_dist = "pois", stat_max = 10, serials_sampler = function(x) 3,
-lambda = 2)
+chains <- simulate_tree(
+  nchains = 10, statistic = "size",
+  offspring_dist = "pois", stat_max = 10, serials_sampler = function(x) 3,
+  lambda = 2
+)
 }
 \references{
 Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 58e844bc..c24aca50 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -73,12 +73,16 @@ simulate_tree_from_pop() has a couple of key different from simulate_tree():
 
 \examples{
 # Simulate with poisson offspring
-simulate_tree_from_pop(pop = 100, offspring_dist = "pois",
-offspring_mean = 0.5, serial_sampler = function(x) 3)
+simulate_tree_from_pop(
+  pop = 100, offspring_dist = "pois",
+  offspring_mean = 0.5, serial_sampler = function(x) 3
+)
 
 # Simulate with negative binomial offspring
-simulate_tree_from_pop(pop = 100, offspring_dist = "nbinom",
-offspring_mean = 0.5, offspring_disp = 1.1, serial_sampler = function(x) 3)
+simulate_tree_from_pop(
+  pop = 100, offspring_dist = "nbinom",
+  offspring_mean = 0.5, offspring_disp = 1.1, serial_sampler = function(x) 3
+)
 }
 \author{
 Flavio Finger

From 22112d443e345ae2e46dfc68ce23590746e7bbe9 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Sep 2023 17:45:26 +0100
Subject: [PATCH 543/828] Rename serial_sampler argument to serial_dist

---
 R/simulate.r                  | 10 +++++-----
 man/simulate_tree_from_pop.Rd |  8 ++++----
 tests/testthat/tests-sim.r    |  2 +-
 vignettes/epichains.Rmd       |  4 ++--
 4 files changed, 12 insertions(+), 12 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index ea109275..49a294c7 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -300,7 +300,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
 #' avoid division by 0 when calculating the size. See details and
 #'  \code{?rnbinom} for details on the parameterisation in Ecology.
-#' @param serial_sampler The serial interval. A function that takes one
+#' @param serial_dist The serial interval. A function that takes one
 #' parameter (`n`), the number of serial intervals to randomly sample. Value
 #' must be >= 0.
 #' @param initial_immune The number of initial immunes in the population.
@@ -334,20 +334,20 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' # Simulate with poisson offspring
 #' simulate_tree_from_pop(
 #'   pop = 100, offspring_dist = "pois",
-#'   offspring_mean = 0.5, serial_sampler = function(x) 3
+#'   offspring_mean = 0.5, serial_dist = function(x) 3
 #' )
 #'
 #' # Simulate with negative binomial offspring
 #' simulate_tree_from_pop(
 #'   pop = 100, offspring_dist = "nbinom",
-#'   offspring_mean = 0.5, offspring_disp = 1.1, serial_sampler = function(x) 3
+#'   offspring_mean = 0.5, offspring_disp = 1.1, serial_dist = function(x) 3
 #' )
 #' @export
 simulate_tree_from_pop <- function(pop,
                                    offspring_dist = c("pois", "nbinom"),
                                    offspring_mean,
                                    offspring_disp,
-                                   serial_sampler,
+                                   serial_dist,
                                    initial_immune = 0,
                                    t0 = 0,
                                    tf = Inf) {
@@ -418,7 +418,7 @@ simulate_tree_from_pop <- function(pop,
     ## add to df
     if (n_offspring > 0) {
       ## draw serial times
-      new_times <- serial_sampler(n_offspring)
+      new_times <- serial_dist(n_offspring)
 
       if (any(new_times < 0)) {
         stop("Serial interval must be >= 0.")
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index c24aca50..019efc27 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -10,7 +10,7 @@ simulate_tree_from_pop(
   offspring_dist = c("pois", "nbinom"),
   offspring_mean,
   offspring_disp,
-  serial_sampler,
+  serial_dist,
   initial_immune = 0,
   t0 = 0,
   tf = Inf
@@ -32,7 +32,7 @@ secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
 avoid division by 0 when calculating the size. See details and
 \code{?rnbinom} for details on the parameterisation in Ecology.}
 
-\item{serial_sampler}{The serial interval. A function that takes one
+\item{serial_dist}{The serial interval. A function that takes one
 parameter (\code{n}), the number of serial intervals to randomly sample. Value
 must be >= 0.}
 
@@ -75,13 +75,13 @@ simulate_tree_from_pop() has a couple of key different from simulate_tree():
 # Simulate with poisson offspring
 simulate_tree_from_pop(
   pop = 100, offspring_dist = "pois",
-  offspring_mean = 0.5, serial_sampler = function(x) 3
+  offspring_mean = 0.5, serial_dist = function(x) 3
 )
 
 # Simulate with negative binomial offspring
 simulate_tree_from_pop(
   pop = 100, offspring_dist = "nbinom",
-  offspring_mean = 0.5, offspring_disp = 1.1, serial_sampler = function(x) 3
+  offspring_mean = 0.5, offspring_disp = 1.1, serial_dist = function(x) 3
 )
 }
 \author{
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 30dca41a..9b5868d0 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -15,7 +15,7 @@ test_that("Simulators output epichains objects", {
       offspring_dist = "nbinom",
       offspring_mean = 0.5,
       offspring_disp = 1.1,
-      serial_sampler = function(x) 3
+      serial_dist = function(x) 3
     ),
     "epichains"
   )
diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 1089f456..68c21807 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -91,7 +91,7 @@ tree_from_pop_pois <- simulate_tree_from_pop(
   pop = 1000,
   offspring_dist = "pois",
   offspring_mean = 0.5,
-  serial_sampler = function(x) 3
+  serial_dist = function(x) 3
 )
 
 tree_from_pop_pois # print the output
@@ -102,7 +102,7 @@ tree_from_pop_nbinom <- simulate_tree_from_pop(
   offspring_dist = "nbinom",
   offspring_mean = 0.5,
   offspring_disp = 1.1,
-  serial_sampler = function(x) 3
+  serial_dist = function(x) 3
 )
 
 tree_from_pop_nbinom # print the output

From 11da0822e163757af6be5c101023445ef1bac0c1 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Sep 2023 18:14:40 +0100
Subject: [PATCH 544/828] Rename offspring_sampler to offspring_dist

---
 R/helpers.R                        | 12 ++++++------
 man/construct_offspring_ll_name.Rd |  8 +++++++-
 man/get_offspring_func.Rd          |  7 ++++++-
 3 files changed, 19 insertions(+), 8 deletions(-)

diff --git a/R/helpers.R b/R/helpers.R
index 9770bb64..6d42765c 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -26,9 +26,9 @@ update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
 #'
 #' @return An offspring sampling function
 #' @keywords internal
-get_offspring_func <- function(offspring_sampler, n, susc, pop,
+get_offspring_func <- function(offspring_dist, n, susc, pop,
                                mean_offspring, disp_offspring = NULL) {
-  if (offspring_sampler == "nbinom") {
+  if (offspring_dist == "nbinom") {
     function(n, susc, pop, mean_offspring, disp_offspring) {
       ## get distribution params from mean and dispersion
       new_mn <- mean_offspring * susc / pop ## apply susceptibility
@@ -44,7 +44,7 @@ get_offspring_func <- function(offspring_sampler, n, susc, pop,
         size = size
       )
     }
-  } else if (offspring_sampler == "pois") {
+  } else if (offspring_dist == "pois") {
     function(n, susc, pop, mean_offspring, disp_offspring) {
       truncdist::rtrunc(
         n,
@@ -54,7 +54,7 @@ get_offspring_func <- function(offspring_sampler, n, susc, pop,
       )
     }
   } else {
-    stop("offspring_sampler must either be 'pois' or 'nbinom'")
+    stop("offspring_dist must either be 'pois' or 'nbinom'")
   }
 }
 
@@ -82,7 +82,7 @@ get_statistic_func <- function(chain_statistic) {
 #'
 #' @return an analytical offspring likelihood function
 #' @keywords internal
-construct_offspring_ll_name <- function(offspring_sampler, chain_statistic) {
-  ll_name <- paste(offspring_sampler, chain_statistic, "ll", sep = "_")
+construct_offspring_ll_name <- function(offspring_dist, chain_statistic) {
+  ll_name <- paste(offspring_dist, chain_statistic, "ll", sep = "_")
   return(ll_name)
 }
diff --git a/man/construct_offspring_ll_name.Rd b/man/construct_offspring_ll_name.Rd
index 2218c4b1..47e9863e 100644
--- a/man/construct_offspring_ll_name.Rd
+++ b/man/construct_offspring_ll_name.Rd
@@ -5,7 +5,13 @@
 \title{Construct name of analytical function for estimating loglikelihood of
 offspring}
 \usage{
-construct_offspring_ll_name(offspring_sampler, chain_statistic)
+construct_offspring_ll_name(offspring_dist, chain_statistic)
+}
+\arguments{
+\item{offspring_dist}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers).}
 }
 \value{
 an analytical offspring likelihood function
diff --git a/man/get_offspring_func.Rd b/man/get_offspring_func.Rd
index a8f0757b..b5ff6ba5 100644
--- a/man/get_offspring_func.Rd
+++ b/man/get_offspring_func.Rd
@@ -6,7 +6,7 @@
 depletion}
 \usage{
 get_offspring_func(
-  offspring_sampler,
+  offspring_dist,
   n,
   susc,
   pop,
@@ -15,6 +15,11 @@ get_offspring_func(
 )
 }
 \arguments{
+\item{offspring_dist}{Offspring distribution sampler: a character string
+corresponding to the R distribution function. Currently only "pois" &
+"nbinom" are supported. Internally truncated distributions are used to
+avoid infecting more people than susceptibles available.}
+
 \item{n}{Number of items to sample}
 
 \item{susc}{Susceptible population size (calculated

From c7a1fe2c313ed29214075ed04cf56806668a3184 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Sep 2023 22:17:29 +0100
Subject: [PATCH 545/828] Redocument offspring_dist by inheriting the argument

---
 man/get_offspring_func.Rd     | 8 ++++----
 man/simulate_tree_from_pop.Rd | 8 ++++----
 2 files changed, 8 insertions(+), 8 deletions(-)

diff --git a/man/get_offspring_func.Rd b/man/get_offspring_func.Rd
index b5ff6ba5..15a28e90 100644
--- a/man/get_offspring_func.Rd
+++ b/man/get_offspring_func.Rd
@@ -15,10 +15,10 @@ get_offspring_func(
 )
 }
 \arguments{
-\item{offspring_dist}{Offspring distribution sampler: a character string
-corresponding to the R distribution function. Currently only "pois" &
-"nbinom" are supported. Internally truncated distributions are used to
-avoid infecting more people than susceptibles available.}
+\item{offspring_dist}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers).}
 
 \item{n}{Number of items to sample}
 
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 019efc27..c7cecd3b 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -19,10 +19,10 @@ simulate_tree_from_pop(
 \arguments{
 \item{pop}{The susceptible population.}
 
-\item{offspring_dist}{Offspring distribution sampler: a character string
-corresponding to the R distribution function. Currently only "pois" &
-"nbinom" are supported. Internally truncated distributions are used to
-avoid infecting more people than susceptibles available.}
+\item{offspring_dist}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers).}
 
 \item{offspring_mean}{The average number of secondary cases for each case.
 Same as R0.}

From 59254c3ec67a29c2a2d2ad15e3ea7632d319edb2 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Sep 2023 22:18:11 +0100
Subject: [PATCH 546/828] Inherit the offspring argument from simulate_tree()

---
 R/simulate.r | 5 +----
 1 file changed, 1 insertion(+), 4 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 49a294c7..21a3c555 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -290,10 +290,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' with initial immunity
 #'
 #' @param pop The susceptible population.
-#' @param offspring_dist Offspring distribution sampler: a character string
-#' corresponding to the R distribution function. Currently only "pois" &
-#' "nbinom" are supported. Internally truncated distributions are used to
-#' avoid infecting more people than susceptibles available.
+#' @inheritParams simulate_tree
 #' @param offspring_mean The average number of secondary cases for each case.
 #' Same as R0.
 #' @param offspring_disp The dispersion parameter of the number of

From 37c8db2a740f6a3cc7eb314d8bef3d772a78a396 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Sep 2023 22:18:38 +0100
Subject: [PATCH 547/828] Add details about which offspring distributions are
 supported

---
 R/simulate.r                  | 5 ++++-
 man/simulate_tree_from_pop.Rd | 6 +++++-
 2 files changed, 9 insertions(+), 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 21a3c555..6f7105b6 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -308,7 +308,10 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' of each element), and `generation`.
 #' @details
 #'
-#' # Offspring models
+#' # Offspring distributions
+#' Currently only "pois" & "nbinom" are supported. Internally truncated
+#' distributions are used to avoid infecting more people than susceptibles
+#' available.
 #'
 #' The poisson model is parametrised so that:
 #'
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index c7cecd3b..17b5ad2f 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -51,7 +51,11 @@ of each element), and \code{generation}.
 Simulate a tree of infections from an initial susceptible population
 with initial immunity
 }
-\section{Offspring models}{
+\section{Offspring distributions}{
+Currently only "pois" & "nbinom" are supported. Internally truncated
+distributions are used to avoid infecting more people than susceptibles
+available.
+
 The poisson model is parametrised so that:
 
 lamda = offspring_mean * pop - initial_immune / pop

From 46365ffe34fa4059aba70537a17873049b222d58 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 15:57:46 +0100
Subject: [PATCH 548/828] Set up pkgdown website

---
 .Rbuildignore                  |  1 +
 .github/workflows/pkgdown.yaml | 14 ++++++++++++--
 .gitignore                     |  1 +
 _pkgdown.yml                   |  4 ++++
 4 files changed, 18 insertions(+), 2 deletions(-)
 create mode 100644 _pkgdown.yml

diff --git a/.Rbuildignore b/.Rbuildignore
index 6bb079a6..b6f3df67 100644
--- a/.Rbuildignore
+++ b/.Rbuildignore
@@ -14,3 +14,4 @@
 ^pkgdown$
 ^data-raw$
 ^CITATION\.cff$
+^_pkgdown\.yml$
diff --git a/.github/workflows/pkgdown.yaml b/.github/workflows/pkgdown.yaml
index 269728a4..9c32a3dd 100644
--- a/.github/workflows/pkgdown.yaml
+++ b/.github/workflows/pkgdown.yaml
@@ -1,5 +1,11 @@
 # Workflow derived from https://github.com/r-lib/actions/tree/v2/examples
 # Need help debugging build failures? Start at https://github.com/r-lib/actions#where-to-find-help
+#
+# Reproduce locally by running:
+# ```r
+# pak::pak(c("any::pkgdown", "."), dependencies = "Config/Needs/website")
+# pkgdown::build_site()
+# ```
 on:
   push:
     branches: [main, master]
@@ -16,7 +22,6 @@ on:
       - '.Rbuildignore'
       - '.github/**'
   pull_request:
-    branches: [main, master]
     paths:
       - 'README.Rmd'
       - 'README.md'
@@ -68,6 +73,11 @@ jobs:
         if: github.event_name != 'pull_request'
         uses: JamesIves/github-pages-deploy-action@4.1.4
         with:
-          clean: false
+          # We clean on releases because we want to remove old vignettes,
+          # figures, etc. that have been deleted from the `main` branch.
+          # But we clean ONLY on releases because we want to be able to keep
+          # both the 'stable' and 'dev' websites.
+          # Also discussed in https://github.com/r-lib/actions/issues/484
+          clean: ${{ github.event_name == 'release' }}
           branch: gh-pages
           folder: docs
diff --git a/.gitignore b/.gitignore
index 211a19d8..574be8f8 100644
--- a/.gitignore
+++ b/.gitignore
@@ -31,3 +31,4 @@ rsconnect/
 /Meta/
 /docs/
 .DS_Store
+docs
diff --git a/_pkgdown.yml b/_pkgdown.yml
new file mode 100644
index 00000000..e92a4ce5
--- /dev/null
+++ b/_pkgdown.yml
@@ -0,0 +1,4 @@
+url: ~
+template:
+  package: epiversetheme
+

From 8561fc0101e22559b9f06ffeb184137a2461d124 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 16:12:22 +0100
Subject: [PATCH 549/828] Add epiversetheme to DESCRIPTION config

---
 DESCRIPTION | 1 +
 1 file changed, 1 insertion(+)

diff --git a/DESCRIPTION b/DESCRIPTION
index f915520d..48e3be49 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -44,6 +44,7 @@ VignetteBuilder:
     knitr
 Remotes:
     github::epiverse-trace/epiparameter
+Config/Needs/website:epiverse-trace/epiversetheme
 Config/testthat/edition: 3
 Encoding: UTF-8
 LazyData: true

From d65c87846f340b58623e180dcbf8e05b037b8642 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Thu, 7 Sep 2023 11:20:54 +0100
Subject: [PATCH 550/828] Configure the pkgdown yaml file

---
 _pkgdown.yml | 13 ++++++++++++-
 1 file changed, 12 insertions(+), 1 deletion(-)

diff --git a/_pkgdown.yml b/_pkgdown.yml
index e92a4ce5..1a1a21f3 100644
--- a/_pkgdown.yml
+++ b/_pkgdown.yml
@@ -1,4 +1,15 @@
-url: ~
+url: https://epiverse-trace.github.io/epichains/
 template:
   package: epiversetheme
+  bslib:
+    font_weight_base : 300
+development:
+  mode: auto
+articles:
+- title: Package vignettes
+  navbar: Package vignettes
+  contents:
+- title: Modelling guides and background
+  navbar: Modelling guides and background
+  contents:
 

From 1c4e438a27fb0e6ef51ab69c1a74c56699c5eb16 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 7 Sep 2023 11:23:31 +0100
Subject: [PATCH 551/828] Delete .md version of the README

---
 README.md | 107 ------------------------------------------------------
 1 file changed, 107 deletions(-)
 delete mode 100644 README.md

diff --git a/README.md b/README.md
deleted file mode 100644
index bd188d1b..00000000
--- a/README.md
+++ /dev/null
@@ -1,107 +0,0 @@
-
-<!-- README.md is generated from README.Rmd. Please edit that file. -->
-<!-- The code to render this README is stored in .github/workflows/render-readme.yaml -->
-<!-- Variables marked with double curly braces will be transformed beforehand: -->
-<!-- `packagename` is extracted from the DESCRIPTION file -->
-<!-- `gh_repo` is extracted via a special environment variable in GitHub Actions -->
-
-# *{{ packagename }}*: Methods for simulating and analysing the size and length of transmission chains from branching process models <img src="man/figures/epichains_logo.png" align="right" height="130" />
-
-<!-- badges: start -->
-
-![GitHub R package
-version](https://img.shields.io/github/r-package/v/epiverse-trace/epichains)
-[![R-CMD-check](https://github.com/epiverse-trace/epichains/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/epichains/actions/workflows/R-CMD-check.yaml)
-[![codecov](https://codecov.io/github/epiverse-trace/epichains/branch/main/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/epichains)
-![GitHub
-contributors](https://img.shields.io/github/contributors/epiverse-trace/epichains)
-[![License:
-MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
-[![Lifecycle:
-experimental](https://img.shields.io/badge/lifecycle-experimental-orange.svg)](https://lifecycle.r-lib.org/articles/stages.html#experimental)
-<!-- badges: end -->
-
-*{{ packagename }}* is an R package to simulate, analyse, and visualize
-the size and length of branching processes with a given offspring
-distribution. These models are often used in infectious disease
-epidemiology, where the chains represent chains of transmission, and the
-offspring distribution represents the distribution of secondary
-infections caused by an infected individual.
-
-*{{ packagename }}* re-implements
-[bpmodels](%22https://github.com/epiverse-trace/bpmodels/%22) by
-providing dedicated data structures that allow easy manipulation and
-interoperability with other existing packages for handling transmission
-chain and contact-tracing data.
-
-*{{ packagename }}* is developed at the [Centre for the Mathematical
-Modelling of Infectious
-Diseases](https://www.lshtm.ac.uk/research/centres/centre-mathematical-modelling-infectious-diseases)
-at the London School of Hygiene and Tropical Medicine as part of the
-[Epiverse Initiative](https://data.org/initiatives/epiverse/).
-
-# Installation
-
-The latest development version of the *{{ packagename }}* package can be
-installed via
-
-``` r
-# check whether {pak} is installed
-if(!require("pak")) install.packages("pak")
-pak::pak("{{ gh_repo }}")
-```
-
-To load the package, use
-
-``` r
-library("epichains")
-```
-
-# Quick start
-
-Work in progress
-
-## Package vignettes
-
-Specific use cases of *{{ packagename }}* can be found in the [online
-documentation as package
-vignettes](https://epiverse-trace.github.io/epichains/), under
-“Articles”.
-
-## Reporting bugs
-
-To report a bug please open an
-[issue](https://github.com/epiverse-trace/epichains/issues/new/choose).
-
-## Contribute
-
-We welcome contributions to enhance the package’s functionalities. If
-you wish to do so, please follow the [package contributing
-guide](https://github.com/epiverse-trace/epichains/blob/main/.github/CONTRIBUTING.md).
-
-## Code of conduct
-
-Please note that the *{{ packagename }}* project is released with a
-[Contributor Code of
-Conduct](https://github.com/epiverse-trace/.github/blob/main/CODE_OF_CONDUCT.md).
-By contributing to this project, you agree to abide by its terms.
-
-## Citing this package
-
-``` r
-citation("epichains")
-#> To cite package epichains in publications use:
-#> 
-#>   Sebastian Funk, Flavio Finger, and James M. Azam (2023). epichains:
-#>   Analysing transmission chain statistics using branching process
-#>   models, website: https://github.com/epiverse-trace/epichains/
-#> 
-#> A BibTeX entry for LaTeX users is
-#> 
-#>   @Manual{,
-#>     title = {epichains: Analysing transmission chain statistics using branching process models},
-#>     author = {{Sebastian Funk} and {Flavio Finger} and {James M. Azam}},
-#>     year = {2023},
-#>     url = {https://github.com/epiverse-trace/epichains/},
-#>   }
-```

From dcb4931512c3f1ffac71c4b624c3c7a41a9ab7a9 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 7 Sep 2023 11:54:11 +0100
Subject: [PATCH 552/828] Touch README.Rmd to trigger render-readme workflow.

---
 README.Rmd | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index 4ce2ff85..fba9a261 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -41,7 +41,8 @@ models are often used in infectious disease epidemiology, where the chains repre
 transmission, and the offspring distribution represents the distribution of 
 secondary infections caused by an infected individual. 
 
-_{{ packagename }}_ re-implements [bpmodels]("https://github.com/epiverse-trace/bpmodels/") by providing dedicated data structures that allow easy manipulation and interoperability with other existing
+_{{ packagename }}_ re-implements [bpmodels]("https://github.com/epiverse-trace/bpmodels/")
+by providing dedicated data structures that allow easy manipulation and interoperability with other existing
 packages for handling transmission chain and contact-tracing data.
 
 _{{ packagename }}_ is developed at the [Centre for the Mathematical Modelling of Infectious Diseases](https://www.lshtm.ac.uk/research/centres/centre-mathematical-modelling-infectious-diseases) at the London School of Hygiene and Tropical Medicine as part of the [Epiverse Initiative](https://data.org/initiatives/epiverse/).

From 6ab0c640caa7c643c71d55b10696102629046f5c Mon Sep 17 00:00:00 2001
From: Hugo Gruson <Bisaloo@users.noreply.github.com>
Date: Thu, 7 Sep 2023 12:12:52 +0000
Subject: [PATCH 553/828] Fix render_readme.yml syntax

---
 .github/workflows/render_readme.yml | 4 ----
 1 file changed, 4 deletions(-)

diff --git a/.github/workflows/render_readme.yml b/.github/workflows/render_readme.yml
index 0c427323..8e82a2ee 100644
--- a/.github/workflows/render_readme.yml
+++ b/.github/workflows/render_readme.yml
@@ -17,10 +17,6 @@ concurrency:
   group: ${{ github.workflow }}-${{ github.ref }}
   cancel-in-progress: true
 
-concurrency:
-  group: ${{ github.workflow }}-${{ github.ref }}
-  cancel-in-progress: true
-
 # A workflow run is made up of one or more jobs that can run sequentially or in parallel
 jobs:
   render-readme:

From a6dde106a1503b6223cf6e74533c1b10c39c5e3b Mon Sep 17 00:00:00 2001
From: GitHub Action <action@github.com>
Date: Thu, 7 Sep 2023 12:15:46 +0000
Subject: [PATCH 554/828] Automatic readme update

---
 README.md | 107 ++++++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 107 insertions(+)
 create mode 100644 README.md

diff --git a/README.md b/README.md
new file mode 100644
index 00000000..cfe04da6
--- /dev/null
+++ b/README.md
@@ -0,0 +1,107 @@
+
+<!-- README.md is generated from README.Rmd. Please edit that file. -->
+<!-- The code to render this README is stored in .github/workflows/render-readme.yaml -->
+<!-- Variables marked with double curly braces will be transformed beforehand: -->
+<!-- `packagename` is extracted from the DESCRIPTION file -->
+<!-- `gh_repo` is extracted via a special environment variable in GitHub Actions -->
+
+# *epichains*: Methods for simulating and analysing the size and length of transmission chains from branching process models <img src="man/figures/epichains_logo.png" align="right" height="130" />
+
+<!-- badges: start -->
+
+![GitHub R package
+version](https://img.shields.io/github/r-package/v/epiverse-trace/epichains)
+[![R-CMD-check](https://github.com/epiverse-trace/epichains/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/epichains/actions/workflows/R-CMD-check.yaml)
+[![codecov](https://codecov.io/github/epiverse-trace/epichains/branch/main/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/epichains)
+![GitHub
+contributors](https://img.shields.io/github/contributors/epiverse-trace/epichains)
+[![License:
+MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
+[![Lifecycle:
+experimental](https://img.shields.io/badge/lifecycle-experimental-orange.svg)](https://lifecycle.r-lib.org/articles/stages.html#experimental)
+<!-- badges: end -->
+
+*epichains* is an R package to simulate, analyse, and visualize the size
+and length of branching processes with a given offspring distribution.
+These models are often used in infectious disease epidemiology, where
+the chains represent chains of transmission, and the offspring
+distribution represents the distribution of secondary infections caused
+by an infected individual.
+
+*epichains* re-implements
+[bpmodels](%22https://github.com/epiverse-trace/bpmodels/%22) by
+providing dedicated data structures that allow easy manipulation and
+interoperability with other existing packages for handling transmission
+chain and contact-tracing data.
+
+*epichains* is developed at the [Centre for the Mathematical Modelling
+of Infectious
+Diseases](https://www.lshtm.ac.uk/research/centres/centre-mathematical-modelling-infectious-diseases)
+at the London School of Hygiene and Tropical Medicine as part of the
+[Epiverse Initiative](https://data.org/initiatives/epiverse/).
+
+# Installation
+
+The latest development version of the *epichains* package can be
+installed via
+
+``` r
+# check whether {pak} is installed
+if (!require("pak")) install.packages("pak")
+pak::pak("epiverse-trace/epichains")
+```
+
+To load the package, use
+
+``` r
+library("epichains")
+```
+
+# Quick start
+
+Work in progress
+
+## Package vignettes
+
+Specific use cases of *epichains* can be found in the [online
+documentation as package
+vignettes](https://epiverse-trace.github.io/epichains/), under
+“Articles”.
+
+## Reporting bugs
+
+To report a bug please open an
+[issue](https://github.com/epiverse-trace/epichains/issues/new/choose).
+
+## Contribute
+
+We welcome contributions to enhance the package’s functionalities. If
+you wish to do so, please follow the [package contributing
+guide](https://github.com/epiverse-trace/epichains/blob/main/.github/CONTRIBUTING.md).
+
+## Code of conduct
+
+Please note that the *epichains* project is released with a [Contributor
+Code of
+Conduct](https://github.com/epiverse-trace/.github/blob/main/CODE_OF_CONDUCT.md).
+By contributing to this project, you agree to abide by its terms.
+
+## Citing this package
+
+``` r
+citation("epichains")
+#> To cite package epichains in publications use:
+#> 
+#>   Sebastian Funk, Flavio Finger, and James M. Azam (2023). epichains:
+#>   Analysing transmission chain statistics using branching process
+#>   models, website: https://github.com/epiverse-trace/epichains/
+#> 
+#> A BibTeX entry for LaTeX users is
+#> 
+#>   @Manual{,
+#>     title = {epichains: Analysing transmission chain statistics using branching process models},
+#>     author = {{Sebastian Funk} and {Flavio Finger} and {James M. Azam}},
+#>     year = {2023},
+#>     url = {https://github.com/epiverse-trace/epichains/},
+#>   }
+```

From 66cf9d048acd7c0b2793be3a65bd4ecb68a71613 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Sun, 10 Sep 2023 21:17:56 +0100
Subject: [PATCH 555/828] Replace all occurrences of the "serial_sampler"
 argument with "serial_dist"

---
 R/checks.R                 | 10 +++++-----
 R/epichains.R              |  2 +-
 R/simulate.r               | 38 +++++++++++++++++++-------------------
 man/aggregate.epichains.Rd |  2 +-
 man/check_serial_valid.Rd  |  8 ++++----
 man/simulate_tree.Rd       | 22 +++++++++++-----------
 vignettes/epichains.Rmd    |  2 +-
 7 files changed, 42 insertions(+), 42 deletions(-)

diff --git a/R/checks.R b/R/checks.R
index a78a5516..957d2dab 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -30,18 +30,18 @@ check_offspring_func_valid <- function(roffspring_name) {
 }
 
 
-#' Check if the serials_sampler argument is specified as a function
+#' Check if the serials_dist argument is specified as a function
 #'
-#' @param serials_sampler The serial interval generator function; the name of a
+#' @param serials_dist The serial interval distribution function; the name of a
 #' user-defined named or anonymous function with only one argument `n`,
 #' representing the number of serial intervals to generate.
 #'
 #' @keywords internal
-check_serial_valid <- function(serials_sampler) {
-  if (!checkmate::test_function(serials_sampler)) {
+check_serial_valid <- function(serials_dist) {
+  if (!checkmate::test_function(serials_dist)) {
     stop(sprintf(
       "%s %s",
-      "The `serials_sampler` argument must be a function",
+      "The `serials_dist` argument must be a function",
       "(see details in ?sim_chain_tree)."
     ))
   }
diff --git a/R/epichains.R b/R/epichains.R
index 45254c7e..eb2a6ea7 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -257,7 +257,7 @@ tail.epichains <- function(x, ...) {
 #' set.seed(123)
 #' chains <- simulate_tree(
 #'   nchains = 10, statistic = "size",
-#'   offspring_dist = "pois", stat_max = 10, serials_sampler = function(x) 3,
+#'   offspring_dist = "pois", stat_max = 10, serials_dist = function(x) 3,
 #'   lambda = 2
 #' )
 #' chains
diff --git a/R/simulate.r b/R/simulate.r
index 6f7105b6..88499333 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -13,8 +13,8 @@
 #' @param stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to this value.
 #' Defaults to `Inf`.
-#' @param serials_sampler The serial interval generator function; the name of a
-#' user-defined named or anonymous function with only one argument `n`,
+#' @param serials_dist The serial interval distribution function; the name
+#' of a user-defined named or anonymous function with only one argument `n`,
 #' representing the number of serial intervals to generate.
 #' @param t0 Start time (if serial interval is given); either a single value
 #' or a vector of same length as `nchains` (number of simulations) with
@@ -31,7 +31,7 @@
 #' @details
 #' `simulate_tree()` simulates a branching process of the form:
 #' WIP
-#' # The serial interval (`serials_sampler`):
+#' # The serial interval (`serials_dist`):
 #'
 #' ## Assumptions/disambiguation
 #'
@@ -46,9 +46,9 @@
 #'
 #' See References below for some literature on the subject.
 #'
-#' ## Specifying `serials_sampler` in `simulate_tree()`
+#' ## Specifying `serials_dist` in `simulate_tree()`
 #'
-#' `serials_sampler` must be specified as a named or
+#' `serials_dist` must be specified as a named or
 #' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R) # nolint
 #' with one argument.
 #'
@@ -58,14 +58,14 @@
 #' let's call it "serial_interval", with only one argument representing the
 #' number of serial intervals to sample:
 #' \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
-#' and assign the name of the function to `serials_sampler` in
+#' and assign the name of the function to `serials_dist` in
 #' `simulate_tree()` like so
-#' \code{simulate_tree(..., serials_sampler = serial_interval)},
+#' \code{simulate_tree(..., serials_dist = serial_interval)},
 #' where `...` are the other arguments to `simulate_tree()`.
 #'
-#' Alternatively, we could assign an anonymous function to `serials_sampler`
+#' Alternatively, we could assign an anonymous function to `serials_dist`
 #' in the `simulate_tree()` call like so
-#' \code{simulate_tree(..., serials_sampler = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
+#' \code{simulate_tree(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
 #' where `...` are the other arguments to `simulate_tree()`.
 #' @seealso [simulate_summary()] for simulating the transmission chains
 #' statistic without the tree of infections.
@@ -73,7 +73,7 @@
 #' set.seed(123)
 #' chains <- simulate_tree(
 #'   nchains = 10, statistic = "size",
-#'   offspring_dist = "pois", stat_max = 10, serials_sampler = function(x) 3,
+#'   offspring_dist = "pois", stat_max = 10, serials_dist = function(x) 3,
 #'   lambda = 2
 #' )
 #' @references
@@ -89,7 +89,7 @@
 #' doi: 10.1093/aje/kwg251. PMID: 14630599.
 simulate_tree <- function(nchains, statistic = c("size", "length"),
                           offspring_dist, stat_max = Inf,
-                          serials_sampler, t0 = 0,
+                          serials_dist, t0 = 0,
                           tf = Inf, ...) {
   statistic <- match.arg(statistic)
 
@@ -102,10 +102,10 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
   roffspring_name <- paste0("r", offspring_dist)
   check_offspring_func_valid(roffspring_name)
 
-  if (!missing(serials_sampler)) {
-    check_serial_valid(serials_sampler)
+  if (!missing(serials_dist)) {
+    check_serial_valid(serials_dist)
   } else if (!missing(tf)) {
-    stop("If `tf` is specified, `serials_sampler` must be specified too.")
+    stop("If `tf` is specified, `serials_dist` must be specified too.")
   }
 
   # Initialisations
@@ -123,7 +123,7 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
     generation = generation
   )
 
-  if (!missing(serials_sampler)) {
+  if (!missing(serials_dist)) {
     tree_df$time <- t0
     times <- tree_df$time
   }
@@ -175,8 +175,8 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 
       # if a serial interval model/function was specified, use it
       # to generate serial intervals for the cases
-      if (!missing(serials_sampler)) {
-        times <- rep(times, next_gen) + serials_sampler(sum(n_offspring))
+      if (!missing(serials_dist)) {
+        times <- rep(times, next_gen) + serials_dist(sum(n_offspring))
         current_min_time <- unname(tapply(times, indices, min))
         new_df$time <- times
       }
@@ -187,11 +187,11 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
     ## the specified maximum size/length
     sim <- which(n_offspring > 0 & stat_track < stat_max)
     if (length(sim) > 0) {
-      if (!missing(serials_sampler)) {
+      if (!missing(serials_dist)) {
         ## only continue to simulate chains that don't go beyond tf
         sim <- intersect(sim, unique(indices)[current_min_time < tf])
       }
-      if (!missing(serials_sampler)) {
+      if (!missing(serials_dist)) {
         times <- times[indices %in% sim]
       }
       ancestor_ids <- ids[indices %in% sim]
diff --git a/man/aggregate.epichains.Rd b/man/aggregate.epichains.Rd
index ab761c6e..eaf83f6d 100644
--- a/man/aggregate.epichains.Rd
+++ b/man/aggregate.epichains.Rd
@@ -27,7 +27,7 @@ Aggregate cases in epichains objects according to a grouping variable
 set.seed(123)
 chains <- simulate_tree(
   nchains = 10, statistic = "size",
-  offspring_dist = "pois", stat_max = 10, serials_sampler = function(x) 3,
+  offspring_dist = "pois", stat_max = 10, serials_dist = function(x) 3,
   lambda = 2
 )
 chains
diff --git a/man/check_serial_valid.Rd b/man/check_serial_valid.Rd
index 7a33c71f..3aef91b5 100644
--- a/man/check_serial_valid.Rd
+++ b/man/check_serial_valid.Rd
@@ -2,16 +2,16 @@
 % Please edit documentation in R/checks.R
 \name{check_serial_valid}
 \alias{check_serial_valid}
-\title{Check if the serials_sampler argument is specified as a function}
+\title{Check if the serials_dist argument is specified as a function}
 \usage{
-check_serial_valid(serials_sampler)
+check_serial_valid(serials_dist)
 }
 \arguments{
-\item{serials_sampler}{The serial interval generator function; the name of a
+\item{serials_dist}{The serial interval distribution function; the name of a
 user-defined named or anonymous function with only one argument \code{n},
 representing the number of serial intervals to generate.}
 }
 \description{
-Check if the serials_sampler argument is specified as a function
+Check if the serials_dist argument is specified as a function
 }
 \keyword{internal}
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index 1228087e..1d2239cc 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -9,7 +9,7 @@ simulate_tree(
   statistic = c("size", "length"),
   offspring_dist,
   stat_max = Inf,
-  serials_sampler,
+  serials_dist,
   t0 = 0,
   tf = Inf,
   ...
@@ -33,8 +33,8 @@ numbers).}
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
 
-\item{serials_sampler}{The serial interval generator function; the name of a
-user-defined named or anonymous function with only one argument \code{n},
+\item{serials_dist}{The serial interval distribution function; the name
+of a user-defined named or anonymous function with only one argument \code{n},
 representing the number of serial intervals to generate.}
 
 \item{t0}{Start time (if serial interval is given); either a single value
@@ -59,7 +59,7 @@ Simulate a tree of infections with a serial and offspring distributions
 \code{simulate_tree()} simulates a branching process of the form:
 WIP
 }
-\section{The serial interval (\code{serials_sampler}):}{
+\section{The serial interval (\code{serials_dist}):}{
 \subsection{Assumptions/disambiguation}{
 
 In epidemiology, the generation interval is the duration between successive
@@ -74,9 +74,9 @@ generation interval, that is, the time between successive cases.
 See References below for some literature on the subject.
 }
 
-\subsection{Specifying \code{serials_sampler} in \code{simulate_tree()}}{
+\subsection{Specifying \code{serials_dist} in \code{simulate_tree()}}{
 
-\code{serials_sampler} must be specified as a named or
+\code{serials_dist} must be specified as a named or
 \href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function} # nolint
 with one argument.
 
@@ -86,14 +86,14 @@ generator as a random log-normally distributed variable with
 let's call it "serial_interval", with only one argument representing the
 number of serial intervals to sample:
 \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
-and assign the name of the function to \code{serials_sampler} in
+and assign the name of the function to \code{serials_dist} in
 \code{simulate_tree()} like so
-\code{simulate_tree(..., serials_sampler = serial_interval)},
+\code{simulate_tree(..., serials_dist = serial_interval)},
 where \code{...} are the other arguments to \code{simulate_tree()}.
 
-Alternatively, we could assign an anonymous function to \code{serials_sampler}
+Alternatively, we could assign an anonymous function to \code{serials_dist}
 in the \code{simulate_tree()} call like so
-\code{simulate_tree(..., serials_sampler = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
+\code{simulate_tree(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
 where \code{...} are the other arguments to \code{simulate_tree()}.
 }
 }
@@ -102,7 +102,7 @@ where \code{...} are the other arguments to \code{simulate_tree()}.
 set.seed(123)
 chains <- simulate_tree(
   nchains = 10, statistic = "size",
-  offspring_dist = "pois", stat_max = 10, serials_sampler = function(x) 3,
+  offspring_dist = "pois", stat_max = 10, serials_dist = function(x) 3,
   lambda = 2
 )
 }
diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 68c21807..2e52afe1 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -68,7 +68,7 @@ library(epichains)
 tree_from_pois_offspring <- simulate_tree(
   nchains = 10,
   offspring_dist = "pois",
-  serials_sampler = function(x) 3,
+  serials_dist = function(x) 3,
   lambda = 2,
   stat_max = 10
 )

From 28a5918dc88d35a0182e10ee56d79f166e074e10 Mon Sep 17 00:00:00 2001
From: GitHub Action <action@github.com>
Date: Sun, 10 Sep 2023 20:35:47 +0000
Subject: [PATCH 556/828] Update CITATION.cff

---
 CITATION.cff | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/CITATION.cff b/CITATION.cff
index 6f2cf95b..3dac125f 100644
--- a/CITATION.cff
+++ b/CITATION.cff
@@ -49,6 +49,8 @@ keywords:
 - epidemic-dynamics
 - epidemic-modelling
 - epidemic-simulations
+- epidemiology
+- epidemiology-models
 - outbreak-simulator
 - r-package
 - r-stats

From 7acf05f64b3920b2df590a6fc1662603cd9ee981 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 11 Sep 2023 13:37:18 +0100
Subject: [PATCH 557/828] Replace all occurences of the argument `serial_dist`
 with `serials_dist`

---
 R/simulate.r                  | 10 +++++-----
 man/simulate_tree_from_pop.Rd |  8 ++++----
 tests/testthat/tests-sim.r    |  2 +-
 vignettes/epichains.Rmd       |  4 ++--
 4 files changed, 12 insertions(+), 12 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 88499333..72fb6ab1 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -297,7 +297,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
 #' avoid division by 0 when calculating the size. See details and
 #'  \code{?rnbinom} for details on the parameterisation in Ecology.
-#' @param serial_dist The serial interval. A function that takes one
+#' @param serials_dist The serial interval. A function that takes one
 #' parameter (`n`), the number of serial intervals to randomly sample. Value
 #' must be >= 0.
 #' @param initial_immune The number of initial immunes in the population.
@@ -334,20 +334,20 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' # Simulate with poisson offspring
 #' simulate_tree_from_pop(
 #'   pop = 100, offspring_dist = "pois",
-#'   offspring_mean = 0.5, serial_dist = function(x) 3
+#'   offspring_mean = 0.5, serials_dist = function(x) 3
 #' )
 #'
 #' # Simulate with negative binomial offspring
 #' simulate_tree_from_pop(
 #'   pop = 100, offspring_dist = "nbinom",
-#'   offspring_mean = 0.5, offspring_disp = 1.1, serial_dist = function(x) 3
+#'   offspring_mean = 0.5, offspring_disp = 1.1, serials_dist = function(x) 3
 #' )
 #' @export
 simulate_tree_from_pop <- function(pop,
                                    offspring_dist = c("pois", "nbinom"),
                                    offspring_mean,
                                    offspring_disp,
-                                   serial_dist,
+                                   serials_dist,
                                    initial_immune = 0,
                                    t0 = 0,
                                    tf = Inf) {
@@ -418,7 +418,7 @@ simulate_tree_from_pop <- function(pop,
     ## add to df
     if (n_offspring > 0) {
       ## draw serial times
-      new_times <- serial_dist(n_offspring)
+      new_times <- serials_dist(n_offspring)
 
       if (any(new_times < 0)) {
         stop("Serial interval must be >= 0.")
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 17b5ad2f..57c5e83a 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -10,7 +10,7 @@ simulate_tree_from_pop(
   offspring_dist = c("pois", "nbinom"),
   offspring_mean,
   offspring_disp,
-  serial_dist,
+  serials_dist,
   initial_immune = 0,
   t0 = 0,
   tf = Inf
@@ -32,7 +32,7 @@ secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
 avoid division by 0 when calculating the size. See details and
 \code{?rnbinom} for details on the parameterisation in Ecology.}
 
-\item{serial_dist}{The serial interval. A function that takes one
+\item{serials_dist}{The serial interval. A function that takes one
 parameter (\code{n}), the number of serial intervals to randomly sample. Value
 must be >= 0.}
 
@@ -79,13 +79,13 @@ simulate_tree_from_pop() has a couple of key different from simulate_tree():
 # Simulate with poisson offspring
 simulate_tree_from_pop(
   pop = 100, offspring_dist = "pois",
-  offspring_mean = 0.5, serial_dist = function(x) 3
+  offspring_mean = 0.5, serials_dist = function(x) 3
 )
 
 # Simulate with negative binomial offspring
 simulate_tree_from_pop(
   pop = 100, offspring_dist = "nbinom",
-  offspring_mean = 0.5, offspring_disp = 1.1, serial_dist = function(x) 3
+  offspring_mean = 0.5, offspring_disp = 1.1, serials_dist = function(x) 3
 )
 }
 \author{
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 9b5868d0..67386a95 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -15,7 +15,7 @@ test_that("Simulators output epichains objects", {
       offspring_dist = "nbinom",
       offspring_mean = 0.5,
       offspring_disp = 1.1,
-      serial_dist = function(x) 3
+      serials_dist = function(x) 3
     ),
     "epichains"
   )
diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 2e52afe1..2dcb1f0f 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -91,7 +91,7 @@ tree_from_pop_pois <- simulate_tree_from_pop(
   pop = 1000,
   offspring_dist = "pois",
   offspring_mean = 0.5,
-  serial_dist = function(x) 3
+  serials_dist = function(x) 3
 )
 
 tree_from_pop_pois # print the output
@@ -102,7 +102,7 @@ tree_from_pop_nbinom <- simulate_tree_from_pop(
   offspring_dist = "nbinom",
   offspring_mean = 0.5,
   offspring_disp = 1.1,
-  serial_dist = function(x) 3
+  serials_dist = function(x) 3
 )
 
 tree_from_pop_nbinom # print the output

From caa28929b2bcb95a722f331baa43eb7e616f8718 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Sep 2023 17:54:33 +0100
Subject: [PATCH 558/828] Delete duplicated return value in function doc

---
 R/likelihood.R | 1 -
 1 file changed, 1 deletion(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index a9f566f0..07b2aec4 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -13,7 +13,6 @@
 #' contributions will be returned rather than the sum.
 #' @param ... Parameters for the offspring distribution.
 #' @return
-#' * A log-likelihood, if \code{log = TRUE} (the default)
 #' * A vector of log-likelihoods, if \code{log = TRUE} (the default) and
 #' \code{obs_prob < 1}, or
 #' * A list of individual log-likelihood contributions, if

From 5b0892a7106c4d9dba925763868c8e8eb9860d1a Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 17:11:27 +0100
Subject: [PATCH 559/828] Replace "likelihood" with "log-likelihood" to clarify
 that the latter is being calculated

---
 R/stat_likelihoods.R  | 17 +++++++++--------
 man/gborel_size_ll.Rd |  4 ++--
 man/geom_length_ll.Rd |  4 ++--
 man/likelihood.Rd     |  1 -
 man/nbinom_size_ll.Rd |  4 ++--
 man/offspring_ll.Rd   |  7 ++++---
 man/pois_length_ll.Rd |  4 ++--
 man/pois_size_ll.Rd   |  4 ++--
 8 files changed, 23 insertions(+), 22 deletions(-)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index f54b7736..028fba92 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -1,4 +1,4 @@
-#' Likelihood of the size of chains with Poisson offspring distribution
+#' Log-likelihood of the size of chains with Poisson offspring distribution
 #'
 #' @param x vector of sizes
 #' @param lambda rate of the Poisson distribution
@@ -9,7 +9,7 @@ pois_size_ll <- function(x, lambda) {
   (x - 1) * log(lambda) - lambda * x + (x - 2) * log(x) - lgamma(x)
 }
 
-#' Likelihood of the size of chains with Negative-Binomial offspring
+#' Log-likelihood of the size of chains with Negative-Binomial offspring
 #' distribution
 #'
 #' @param x vector of sizes
@@ -31,7 +31,7 @@ nbinom_size_ll <- function(x, size, prob, mu) {
     (size * x + (x - 1)) * log(1 + mu / size)
 }
 
-#' Likelihood of the size of chains with gamma-Borel offspring distribution
+#' Log-likelihood of the size of chains with gamma-Borel offspring distribution
 #'
 #' @param x vector of sizes
 #' @param size the dispersion parameter (often called \code{k} in ecological
@@ -52,7 +52,7 @@ gborel_size_ll <- function(x, size, prob, mu) {
     (x - 1) * log(x) - (size + x - 1) * log(x + size / mu)
 }
 
-#' Likelihood of the length of chains with Poisson offspring distribution
+#' Log-likelihood of the length of chains with Poisson offspring distribution
 #'
 #' @param x vector of sizes
 #' @param lambda rate of the Poisson distribution
@@ -70,7 +70,7 @@ pois_length_ll <- function(x, lambda) {
   log(Gk[x + 1] - Gk[x])
 }
 
-#' Likelihood of the length of chains with geometric offspring distribution
+#' Log-likelihood of the length of chains with geometric offspring distribution
 #'
 #' @param x vector of sizes
 #' @param prob probability of the geometric distribution with mean
@@ -86,10 +86,11 @@ geom_length_ll <- function(x, prob) {
   log(GkmGkm1)
 }
 
-#' Likelihood of the length of chains with generic offspring distribution
+#' Log-likelihood of the summary (size/length) of chains with generic offspring
+#' distribution
 #'
-#' The likelihoods are calculated with a crude approximation using simulated
-#' chains by linearly approximating any missing values in the empirical
+#' The log-likelihoods are calculated with a crude approximation using simulated
+#' chain summaries by linearly approximating any missing values in the empirical
 #' cumulative distribution function (ecdf).
 #' @inheritParams likelihood
 #' @inheritParams simulate_vec
diff --git a/man/gborel_size_ll.Rd b/man/gborel_size_ll.Rd
index 618659f2..752aa56a 100644
--- a/man/gborel_size_ll.Rd
+++ b/man/gborel_size_ll.Rd
@@ -2,7 +2,7 @@
 % Please edit documentation in R/stat_likelihoods.R
 \name{gborel_size_ll}
 \alias{gborel_size_ll}
-\title{Likelihood of the size of chains with gamma-Borel offspring distribution}
+\title{Log-likelihood of the size of chains with gamma-Borel offspring distribution}
 \usage{
 gborel_size_ll(x, size, prob, mu)
 }
@@ -21,7 +21,7 @@ applications)}
 log-likelihood values
 }
 \description{
-Likelihood of the size of chains with gamma-Borel offspring distribution
+Log-likelihood of the size of chains with gamma-Borel offspring distribution
 }
 \author{
 Sebastian Funk
diff --git a/man/geom_length_ll.Rd b/man/geom_length_ll.Rd
index f200df93..6c7dc6ad 100644
--- a/man/geom_length_ll.Rd
+++ b/man/geom_length_ll.Rd
@@ -2,7 +2,7 @@
 % Please edit documentation in R/stat_likelihoods.R
 \name{geom_length_ll}
 \alias{geom_length_ll}
-\title{Likelihood of the length of chains with geometric offspring distribution}
+\title{Log-likelihood of the length of chains with geometric offspring distribution}
 \usage{
 geom_length_ll(x, prob)
 }
@@ -16,7 +16,7 @@ geom_length_ll(x, prob)
 log-likelihood values
 }
 \description{
-Likelihood of the length of chains with geometric offspring distribution
+Log-likelihood of the length of chains with geometric offspring distribution
 }
 \author{
 Sebastian Funk
diff --git a/man/likelihood.Rd b/man/likelihood.Rd
index 3bd9e76e..b5b77844 100644
--- a/man/likelihood.Rd
+++ b/man/likelihood.Rd
@@ -52,7 +52,6 @@ contributions will be returned rather than the sum.}
 }
 \value{
 \itemize{
-\item A log-likelihood, if \code{log = TRUE} (the default)
 \item A vector of log-likelihoods, if \code{log = TRUE} (the default) and
 \code{obs_prob < 1}, or
 \item A list of individual log-likelihood contributions, if
diff --git a/man/nbinom_size_ll.Rd b/man/nbinom_size_ll.Rd
index 14003322..6bbd2475 100644
--- a/man/nbinom_size_ll.Rd
+++ b/man/nbinom_size_ll.Rd
@@ -2,7 +2,7 @@
 % Please edit documentation in R/stat_likelihoods.R
 \name{nbinom_size_ll}
 \alias{nbinom_size_ll}
-\title{Likelihood of the size of chains with Negative-Binomial offspring
+\title{Log-likelihood of the size of chains with Negative-Binomial offspring
 distribution}
 \usage{
 nbinom_size_ll(x, size, prob, mu)
@@ -22,7 +22,7 @@ applications)}
 log-likelihood values
 }
 \description{
-Likelihood of the size of chains with Negative-Binomial offspring
+Log-likelihood of the size of chains with Negative-Binomial offspring
 distribution
 }
 \author{
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index 65763d76..d0edde23 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -2,7 +2,8 @@
 % Please edit documentation in R/stat_likelihoods.R
 \name{offspring_ll}
 \alias{offspring_ll}
-\title{Likelihood of the length of chains with generic offspring distribution}
+\title{Log-likelihood of the summary (size/length) of chains with generic offspring
+distribution}
 \usage{
 offspring_ll(
   chains,
@@ -36,11 +37,11 @@ to TRUE).}
 \item{...}{any parameters to pass to \code{\link{simulate_tree}}}
 }
 \value{
-If \code{log = TRUE} (the default), log-likelihood values,
+log-likelihood values
 else raw likelihoods
 }
 \description{
-The likelihoods are calculated with a crude approximation using simulated
+The log-likelihoods are calculated with a crude approximation using simulated
 chains by linearly approximating any missing values in the empirical
 cumulative distribution function (ecdf).
 }
diff --git a/man/pois_length_ll.Rd b/man/pois_length_ll.Rd
index 63f6088e..bf1f47ba 100644
--- a/man/pois_length_ll.Rd
+++ b/man/pois_length_ll.Rd
@@ -2,7 +2,7 @@
 % Please edit documentation in R/stat_likelihoods.R
 \name{pois_length_ll}
 \alias{pois_length_ll}
-\title{Likelihood of the length of chains with Poisson offspring distribution}
+\title{Log-likelihood of the length of chains with Poisson offspring distribution}
 \usage{
 pois_length_ll(x, lambda)
 }
@@ -15,7 +15,7 @@ pois_length_ll(x, lambda)
 log-likelihood values
 }
 \description{
-Likelihood of the length of chains with Poisson offspring distribution
+Log-likelihood of the length of chains with Poisson offspring distribution
 }
 \author{
 Sebastian Funk
diff --git a/man/pois_size_ll.Rd b/man/pois_size_ll.Rd
index 00e662d0..5e0645f3 100644
--- a/man/pois_size_ll.Rd
+++ b/man/pois_size_ll.Rd
@@ -2,7 +2,7 @@
 % Please edit documentation in R/stat_likelihoods.R
 \name{pois_size_ll}
 \alias{pois_size_ll}
-\title{Likelihood of the size of chains with Poisson offspring distribution}
+\title{Log-likelihood of the size of chains with Poisson offspring distribution}
 \usage{
 pois_size_ll(x, lambda)
 }
@@ -15,7 +15,7 @@ pois_size_ll(x, lambda)
 log-likelihood values
 }
 \description{
-Likelihood of the size of chains with Poisson offspring distribution
+Log-likelihood of the size of chains with Poisson offspring distribution
 }
 \author{
 Sebastian Funk

From 1eba519878743996c97ef0623ebf0cae4045174e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 17:13:11 +0100
Subject: [PATCH 560/828] Reword the docs of "offspring_ll"

---
 R/stat_likelihoods.R | 14 ++++++--------
 man/offspring_ll.Rd  | 20 ++++++++++++--------
 2 files changed, 18 insertions(+), 16 deletions(-)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index 028fba92..befa7c3e 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -93,15 +93,13 @@ geom_length_ll <- function(x, prob) {
 #' chain summaries by linearly approximating any missing values in the empirical
 #' cumulative distribution function (ecdf).
 #' @inheritParams likelihood
-#' @inheritParams simulate_vec
-#' @param chains Vector of sizes/lengths
+#' @inheritParams simulate_summary
+#' @param chains Vector of chain summaries (sizes/lengths)
 #' @param nsim_offspring Number of simulations of the offspring distribution
-#' for approximating the statistic (size/length) distribution
-#' @param log Logical; Should the results be log-transformed? (Defaults
-#' to TRUE).
-#' @param ... any parameters to pass to \code{\link{simulate_tree}}
-#' @return If \code{log = TRUE} (the default), log-likelihood values,
-#' else raw likelihoods
+#' for approximating the distribution of the chain statistic summary
+#' (size/length)
+#' @param ... any parameters to pass to \code{\link{simulate_summary}}
+#' @return log-likelihood values
 #' @author Sebastian Funk
 #' @export
 offspring_ll <- function(chains, offspring_dist, statistic,
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index d0edde23..58381ff2 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -15,7 +15,7 @@ offspring_ll(
 )
 }
 \arguments{
-\item{chains}{Vector of sizes/lengths}
+\item{chains}{Vector of chain summaries (sizes/lengths)}
 
 \item{offspring_dist}{Offspring distribution: a character string
 corresponding to the R distribution function (e.g., "pois" for Poisson,
@@ -29,22 +29,26 @@ numbers).}
 }}
 
 \item{nsim_offspring}{Number of simulations of the offspring distribution
-for approximating the statistic (size/length) distribution}
+for approximating the distribution of the chain statistic summary
+(size/length)}
 
-\item{log}{Logical; Should the results be log-transformed? (Defaults
-to TRUE).}
-
-\item{...}{any parameters to pass to \code{\link{simulate_tree}}}
+\item{...}{any parameters to pass to \code{\link{simulate_summary}}}
 }
 \value{
 log-likelihood values
-else raw likelihoods
 }
 \description{
 The log-likelihoods are calculated with a crude approximation using simulated
-chains by linearly approximating any missing values in the empirical
+chain summaries by linearly approximating any missing values in the empirical
 cumulative distribution function (ecdf).
 }
+\examples{
+set.seed(123)
+}
+\seealso{
+\code{\link[=simulate_summary]{simulate_summary()}} for simulating a summary of the transmission
+chains statistic (without the tree of infections)
+}
 \author{
 Sebastian Funk
 }

From d73e4be116d5dfcfcc33b904457f49b790d46e4f Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 17:14:01 +0100
Subject: [PATCH 561/828] Revert to returning log values to comform with
 function name

---
 R/stat_likelihoods.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index befa7c3e..81e88ee1 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -122,6 +122,6 @@ offspring_ll <- function(chains, offspring_dist, statistic,
     )$y))
   lik <- acdf[chains]
   lik[is.na(lik)] <- 0
-  out <- ifelse(base::isTRUE(log), log(lik), lik)
+  out <- log(lik)
   return(out)
 }

From e1c62f68837f8e6be10c5ea66efb1d111e247b86 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 17:14:26 +0100
Subject: [PATCH 562/828] Remove log argument

---
 R/stat_likelihoods.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index 81e88ee1..f6e38e09 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -103,7 +103,7 @@ geom_length_ll <- function(x, prob) {
 #' @author Sebastian Funk
 #' @export
 offspring_ll <- function(chains, offspring_dist, statistic,
-                         nsim_offspring = 100, log = TRUE, ...) {
+                         nsim_offspring = 100, ...) {
   # Simulate the chains
   chains <- simulate_summary(
     nsim_offspring, offspring_dist,

From 3459c7c11d78e7deb6b88264a1a3a8280ffe5855 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 17:15:02 +0100
Subject: [PATCH 563/828] Explicitly assign arguments to avoid positioning
 matching

---
 R/stat_likelihoods.R | 6 ++++--
 1 file changed, 4 insertions(+), 2 deletions(-)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index f6e38e09..ad435a35 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -106,8 +106,10 @@ offspring_ll <- function(chains, offspring_dist, statistic,
                          nsim_offspring = 100, ...) {
   # Simulate the chains
   chains <- simulate_summary(
-    nsim_offspring, offspring_dist,
-    statistic, ...
+    nchains = nsim_offspring,
+    offspring_dist = offspring_dist,
+    statistic = statistic,
+    ...
   )
 
   # Compute the empirical Cumulative Distribution Function of the

From 8886f3b7bd6f720225e5374cb6ab2c36f11f91b8 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 17:16:36 +0100
Subject: [PATCH 564/828] Add a seealso tag and clean up examples

---
 R/stat_likelihoods.R | 10 ++++++++++
 man/offspring_ll.Rd  |  9 +--------
 2 files changed, 11 insertions(+), 8 deletions(-)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index ad435a35..07bce3ab 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -102,6 +102,16 @@ geom_length_ll <- function(x, prob) {
 #' @return log-likelihood values
 #' @author Sebastian Funk
 #' @export
+#' @seealso [simulate_summary()] for simulating a summary of the transmission
+#' chains statistic (without the tree of infections)
+#' @examples
+#' set.seed(123)
+# chain_size_ll <- offspring_ll(
+#   chains = c(1, 5, 6, 8, 7, 8, 10),
+#   offspring_dist = "pois",
+#   statistic = "size",
+#   lambda = 2
+# )
 offspring_ll <- function(chains, offspring_dist, statistic,
                          nsim_offspring = 100, ...) {
   # Simulate the chains
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index 58381ff2..14b602f7 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -5,14 +5,7 @@
 \title{Log-likelihood of the summary (size/length) of chains with generic offspring
 distribution}
 \usage{
-offspring_ll(
-  chains,
-  offspring_dist,
-  statistic,
-  nsim_offspring = 100,
-  log = TRUE,
-  ...
-)
+offspring_ll(chains, offspring_dist, statistic, nsim_offspring = 100, ...)
 }
 \arguments{
 \item{chains}{Vector of chain summaries (sizes/lengths)}

From 3f03422b29b6fe227ea3f2818a5e51a25dbb01b5 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 19:12:04 +0100
Subject: [PATCH 565/828] Fixed the comment tags in the examples

---
 R/stat_likelihoods.R | 12 ++++++------
 1 file changed, 6 insertions(+), 6 deletions(-)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index 07bce3ab..2fda56e4 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -106,12 +106,12 @@ geom_length_ll <- function(x, prob) {
 #' chains statistic (without the tree of infections)
 #' @examples
 #' set.seed(123)
-# chain_size_ll <- offspring_ll(
-#   chains = c(1, 5, 6, 8, 7, 8, 10),
-#   offspring_dist = "pois",
-#   statistic = "size",
-#   lambda = 2
-# )
+#' chain_size_ll <- offspring_ll(
+#'   chains = c(1, 5, 6, 8, 7, 8, 10),
+#'   offspring_dist = "pois",
+#'   statistic = "size",
+#'   lambda = 2
+#' )
 offspring_ll <- function(chains, offspring_dist, statistic,
                          nsim_offspring = 100, ...) {
   # Simulate the chains

From f1afcffe71d81452cdaae57c4be0933b81c493be Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 19:32:26 +0100
Subject: [PATCH 566/828] Give nsim_obs a default value

---
 R/likelihood.R    | 2 +-
 man/likelihood.Rd | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 07b2aec4..a50afc42 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -30,7 +30,7 @@
 #' )
 #' @export
 likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
-                       nsim_obs, log = TRUE, obs_prob = 1, stat_max = Inf,
+                       nsim_obs = 100, log = TRUE, obs_prob = 1, stat_max = Inf,
                        exclude = NULL, individual = FALSE, ...) {
   statistic <- match.arg(statistic)
 
diff --git a/man/likelihood.Rd b/man/likelihood.Rd
index b5b77844..57b64b5c 100644
--- a/man/likelihood.Rd
+++ b/man/likelihood.Rd
@@ -8,7 +8,7 @@ likelihood(
   chains,
   statistic = c("size", "length"),
   offspring_dist,
-  nsim_obs,
+  nsim_obs = 100,
   log = TRUE,
   obs_prob = 1,
   stat_max = Inf,

From fc348744915e68458ac7022e0ba47401dc617606 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 19:33:59 +0100
Subject: [PATCH 567/828] Assign lambda a value less than 1 to prevent example
 outbreak from overshooting

---
 R/stat_likelihoods.R | 2 +-
 man/offspring_ll.Rd  | 6 ++++++
 2 files changed, 7 insertions(+), 1 deletion(-)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index 2fda56e4..94428885 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -110,7 +110,7 @@ geom_length_ll <- function(x, prob) {
 #'   chains = c(1, 5, 6, 8, 7, 8, 10),
 #'   offspring_dist = "pois",
 #'   statistic = "size",
-#'   lambda = 2
+#'   lambda = 0.82
 #' )
 offspring_ll <- function(chains, offspring_dist, statistic,
                          nsim_offspring = 100, ...) {
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index 14b602f7..4aa8d874 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -37,6 +37,12 @@ cumulative distribution function (ecdf).
 }
 \examples{
 set.seed(123)
+chain_size_ll <- offspring_ll(
+  chains = c(1, 5, 6, 8, 7, 8, 10),
+  offspring_dist = "pois",
+  statistic = "size",
+  lambda = 0.82
+)
 }
 \seealso{
 \code{\link[=simulate_summary]{simulate_summary()}} for simulating a summary of the transmission

From 60521f5b6519eb7dc18663d18db1377d5ef66ae4 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 19:35:38 +0100
Subject: [PATCH 568/828] Make documentation of likelihood function more
 consistent by using "log-likelihood" all through

---
 R/likelihood.R    | 24 ++++++++++++------------
 man/likelihood.Rd | 27 ++++++++++++++-------------
 2 files changed, 26 insertions(+), 25 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index a50afc42..287c96dd 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -1,25 +1,25 @@
-#' Estimate the (log) likelihood for observed branching processes
+#' Estimate the log-likelihood/likelihood for observed branching processes
 #'
 #' @inheritParams simulate_summary
-#' @param chains Vector of sizes/lengths of transmission chains.
-#' @param nsim_obs Number of simulations if the likelihood is to be
-#' approximated for imperfect observations.
-#' @param log Logical; Should the results be log-transformed? (Defaults
-#' to TRUE).
+#' @inheritParams offspring_ll
+#' @param nsim_obs Number of simulations if the log-likelihood/likelihood is to
+#' be approximated for imperfect observations.
+#' @param log Logical; Should the log-likelihoods be transformed to
+#' likelihoods? (Defaults to TRUE).
 #' @param obs_prob Observation probability (assumed constant)
 #' @param exclude A vector of indices of the sizes/lengths to exclude from the
-#' likelihood calculation.
-#' @param individual If TRUE, a vector of individual (log)likelihood
+#' log-likelihood calculation.
+#' @param individual If TRUE, a vector of individual log-likelihood/likelihood
 #' contributions will be returned rather than the sum.
-#' @param ... Parameters for the offspring distribution.
 #' @return
 #' * A vector of log-likelihoods, if \code{log = TRUE} (the default) and
 #' \code{obs_prob < 1}, or
 #' * A list of individual log-likelihood contributions, if
 #' \code{log = TRUE} (the default) and \code{individual = TRUE}.
-#' else raw likelihoods, or vector of likelihoods
-#' @seealso offspring_ll, pois_size_ll, nbinom_size_ll, gborel_size_ll,
-#' pois_length_ll, geom_length_ll.
+#' The interpretation follows for the other combinations of `log` and
+#' `individual`.
+#' @seealso offspring_ll(), pois_size_ll(), nbinom_size_ll(), gborel_size_ll(),
+#' pois_length_ll(), geom_length_ll()
 #' @author Sebastian Funk
 #' @examples
 #' # example of observed chain sizes
diff --git a/man/likelihood.Rd b/man/likelihood.Rd
index 57b64b5c..c8d8ff2e 100644
--- a/man/likelihood.Rd
+++ b/man/likelihood.Rd
@@ -2,7 +2,7 @@
 % Please edit documentation in R/likelihood.R
 \name{likelihood}
 \alias{likelihood}
-\title{Estimate the (log) likelihood for observed branching processes}
+\title{Estimate the log-likelihood/likelihood for observed branching processes}
 \usage{
 likelihood(
   chains,
@@ -18,7 +18,7 @@ likelihood(
 )
 }
 \arguments{
-\item{chains}{Vector of sizes/lengths of transmission chains.}
+\item{chains}{Vector of chain summaries (sizes/lengths)}
 
 \item{statistic}{String; Statistic to calculate. Can be one of:
 \itemize{
@@ -31,11 +31,11 @@ corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers).}
 
-\item{nsim_obs}{Number of simulations if the likelihood is to be
-approximated for imperfect observations.}
+\item{nsim_obs}{Number of simulations if the log-likelihood/likelihood is to
+be approximated for imperfect observations.}
 
-\item{log}{Logical; Should the results be log-transformed? (Defaults
-to TRUE).}
+\item{log}{Logical; Should the log-likelihoods be transformed to
+likelihoods? (Defaults to TRUE).}
 
 \item{obs_prob}{Observation probability (assumed constant)}
 
@@ -43,12 +43,12 @@ to TRUE).}
 computed. Results above the specified value, are set to \code{Inf}.}
 
 \item{exclude}{A vector of indices of the sizes/lengths to exclude from the
-likelihood calculation.}
+log-likelihood calculation.}
 
-\item{individual}{If TRUE, a vector of individual (log)likelihood
+\item{individual}{If TRUE, a vector of individual log-likelihood/likelihood
 contributions will be returned rather than the sum.}
 
-\item{...}{Parameters for the offspring distribution.}
+\item{...}{Parameters of the offspring distribution as required by R.}
 }
 \value{
 \itemize{
@@ -56,11 +56,12 @@ contributions will be returned rather than the sum.}
 \code{obs_prob < 1}, or
 \item A list of individual log-likelihood contributions, if
 \code{log = TRUE} (the default) and \code{individual = TRUE}.
-else raw likelihoods, or vector of likelihoods
+The interpretation follows for the other combinations of \code{log} and
+\code{individual}.
 }
 }
 \description{
-Estimate the (log) likelihood for observed branching processes
+Estimate the log-likelihood/likelihood for observed branching processes
 }
 \examples{
 # example of observed chain sizes
@@ -71,8 +72,8 @@ likelihood(
 )
 }
 \seealso{
-offspring_ll, pois_size_ll, nbinom_size_ll, gborel_size_ll,
-pois_length_ll, geom_length_ll.
+offspring_ll(), pois_size_ll(), nbinom_size_ll(), gborel_size_ll(),
+pois_length_ll(), geom_length_ll()
 }
 \author{
 Sebastian Funk

From 5f052c6adea3eca327cd4fd97a8dec13dc35ddf2 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 19:36:00 +0100
Subject: [PATCH 569/828] Improve error message

---
 R/likelihood.R | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 287c96dd..47501a99 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -37,7 +37,9 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
   ## checks
   check_offspring_valid(offspring_dist)
 
-  if (obs_prob <= 0 || obs_prob > 1) stop("'obs_prob' must be within (0,1]")
+  if (obs_prob <= 0 || obs_prob > 1) {
+    stop("'obs_prob' is a probability and must be between 0 and 1 inclusive")
+    }
   if (obs_prob < 1) {
     if (missing(nsim_obs)) {
       stop("'nsim_obs' must be specified if 'obs_prob' is < 1")

From d91f5c6bf9f8419ce5e6a7a29b7f1ed2bd98129f Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 19:36:20 +0100
Subject: [PATCH 570/828] Rename a variable

---
 R/likelihood.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 47501a99..7af5e924 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -45,10 +45,10 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
       stop("'nsim_obs' must be specified if 'obs_prob' is < 1")
     }
 
-    sample_func <- get_statistic_func(statistic)
+    statistic_func <- get_statistic_func(statistic)
 
     sampled_x <- replicate(nsim_obs, pmin(
-      sample_func(
+      statistic_func(
         length(chains),
         chains, obs_prob
       ),

From 1075c2736125eca0966d588296eb04de537c25bb Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 19:36:59 +0100
Subject: [PATCH 571/828] Replace "likelihood" with "log-likelihood"

---
 R/likelihood.R | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 7af5e924..63e56bef 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -64,19 +64,19 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
     sampled_x <- list(chains)
   }
 
-  ## determine for which sizes to calculate the likelihood (for true chain size)
+  ## determine for which sizes to calculate the log-likelihood (for true chain size)
   if (any(size_x == stat_max)) {
     calc_sizes <- seq_len(stat_max - 1)
   } else {
     calc_sizes <- unique(c(size_x, exclude))
   }
 
-  ## get likelihood function as given by offspring_dist and statistic
+  ## get log-likelihood function as given by offspring_dist and statistic
   likelihoods <- vector(mode = "numeric")
   ll_func <- construct_offspring_ll_name(offspring_dist, statistic)
   pars <- as.list(unlist(list(...))) ## converts vectors to lists
 
-  ## calculate likelihoods
+  ## calculate log-likelihoods
   if (exists(ll_func, where = asNamespace("epichains"), mode = "function")) {
     func <- get(ll_func)
     likelihoods[calc_sizes] <- do.call(func, c(list(x = calc_sizes), pars))

From dcd3d7ed35d36a0e4ad67e93186a916ace4c42be Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 19:37:24 +0100
Subject: [PATCH 572/828] Lint

---
 R/likelihood.R | 22 ++++++++++++----------
 1 file changed, 12 insertions(+), 10 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 63e56bef..5b07b1bc 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -55,9 +55,7 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
       stat_max
     ), simplify = FALSE)
     size_x <- unlist(sampled_x)
-    if (!is.finite(stat_max)) {
-      stat_max <- max(size_x) + 1
-    }
+    stat_max <- max(size_x) + 1
   } else {
     chains[chains >= stat_max] <- stat_max
     size_x <- chains
@@ -82,14 +80,18 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
     likelihoods[calc_sizes] <- do.call(func, c(list(x = calc_sizes), pars))
   } else {
     likelihoods[calc_sizes] <-
-      do.call(
-        offspring_ll,
-        c(list(
-          chains = calc_sizes, offspring_dist = offspring_dist,
-          statistic = statistic, stat_max = stat_max,
-          log = log
-        ), pars)
+    do.call(
+      offspring_ll,
+      c(
+        list(
+          chains = calc_sizes,
+          offspring_dist = offspring_dist,
+          statistic = statistic,
+          stat_max = stat_max
+        ),
+        pars
       )
+    )
   }
 
   ## assign probabilities to stat_max outbreak sizes

From 26c7831f227ddc4d1ec59e8b503b051ca8b0b653 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 19:38:25 +0100
Subject: [PATCH 573/828] Add exp transformation for when log=FALSE

---
 R/likelihood.R | 5 +++++
 1 file changed, 5 insertions(+)

diff --git a/R/likelihood.R b/R/likelihood.R
index 5b07b1bc..4a028f2b 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -113,6 +113,11 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
     likelihoods[sx[!(sx %in% exclude)]]
   })
 
+  ## transform log-likelihoods into likelihoods if required
+  if (!log) {
+    chains_likelihood <- lapply(chains_likelihood, function(ll) exp(ll))
+  }
+
   if (!individual) {
     chains_likelihood <- vapply(chains_likelihood, sum, 0)
   }

From 27500d8d6fcecf9af49d3c664a3fb6a1770a6a59 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 19:39:42 +0100
Subject: [PATCH 574/828] Add joint likelihood calculation for where
 individual=TRUE and depending on log=T/F

---
 R/likelihood.R | 9 ++++++++-
 1 file changed, 8 insertions(+), 1 deletion(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 4a028f2b..be4b2928 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -118,8 +118,15 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
     chains_likelihood <- lapply(chains_likelihood, function(ll) exp(ll))
   }
 
+  ## if individual == FALSE, return the joint log-likelihood
+  ## (sum of the log-likelihoods), if log == TRUE, else
+  ## multiply the likelihoods
   if (!individual) {
-    chains_likelihood <- vapply(chains_likelihood, sum, 0)
+    if (log) {
+      chains_likelihood <- vapply(chains_likelihood, sum, 0)
+    } else{
+      chains_likelihood <- vapply(chains_likelihood, prod, 0)
+    }
   }
 
   return(chains_likelihood)

From c3a7e173d1c90bf7c5a0072a181669e421334ddc Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 17:11:27 +0100
Subject: [PATCH 575/828] Replace "likelihood" with "log-likelihood" to clarify
 that the latter is being calculated

---
 man/offspring_ll.Rd | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index 4aa8d874..23d58eb2 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -29,10 +29,11 @@ for approximating the distribution of the chain statistic summary
 }
 \value{
 log-likelihood values
+else raw likelihoods
 }
 \description{
 The log-likelihoods are calculated with a crude approximation using simulated
-chain summaries by linearly approximating any missing values in the empirical
+chains by linearly approximating any missing values in the empirical
 cumulative distribution function (ecdf).
 }
 \examples{

From 26b4f2e36296ec30902ed9010d1bfdecc72d232f Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 17:13:11 +0100
Subject: [PATCH 576/828] Reword the docs of "offspring_ll"

---
 man/offspring_ll.Rd | 3 +--
 1 file changed, 1 insertion(+), 2 deletions(-)

diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index 23d58eb2..4aa8d874 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -29,11 +29,10 @@ for approximating the distribution of the chain statistic summary
 }
 \value{
 log-likelihood values
-else raw likelihoods
 }
 \description{
 The log-likelihoods are calculated with a crude approximation using simulated
-chains by linearly approximating any missing values in the empirical
+chain summaries by linearly approximating any missing values in the empirical
 cumulative distribution function (ecdf).
 }
 \examples{

From 0e6705ba379a94f4e39a2ffd86f13c5eac0c0f89 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 17:16:36 +0100
Subject: [PATCH 577/828] Add a seealso tag and clean up examples

---
 R/stat_likelihoods.R | 12 ++++++------
 1 file changed, 6 insertions(+), 6 deletions(-)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index 94428885..07bce3ab 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -106,12 +106,12 @@ geom_length_ll <- function(x, prob) {
 #' chains statistic (without the tree of infections)
 #' @examples
 #' set.seed(123)
-#' chain_size_ll <- offspring_ll(
-#'   chains = c(1, 5, 6, 8, 7, 8, 10),
-#'   offspring_dist = "pois",
-#'   statistic = "size",
-#'   lambda = 0.82
-#' )
+# chain_size_ll <- offspring_ll(
+#   chains = c(1, 5, 6, 8, 7, 8, 10),
+#   offspring_dist = "pois",
+#   statistic = "size",
+#   lambda = 2
+# )
 offspring_ll <- function(chains, offspring_dist, statistic,
                          nsim_offspring = 100, ...) {
   # Simulate the chains

From 4ac5f954493aae3af53b13eed533791e898bd921 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 19:12:04 +0100
Subject: [PATCH 578/828] Fixed the comment tags in the examples

---
 R/stat_likelihoods.R | 12 ++++++------
 1 file changed, 6 insertions(+), 6 deletions(-)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index 07bce3ab..2fda56e4 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -106,12 +106,12 @@ geom_length_ll <- function(x, prob) {
 #' chains statistic (without the tree of infections)
 #' @examples
 #' set.seed(123)
-# chain_size_ll <- offspring_ll(
-#   chains = c(1, 5, 6, 8, 7, 8, 10),
-#   offspring_dist = "pois",
-#   statistic = "size",
-#   lambda = 2
-# )
+#' chain_size_ll <- offspring_ll(
+#'   chains = c(1, 5, 6, 8, 7, 8, 10),
+#'   offspring_dist = "pois",
+#'   statistic = "size",
+#'   lambda = 2
+#' )
 offspring_ll <- function(chains, offspring_dist, statistic,
                          nsim_offspring = 100, ...) {
   # Simulate the chains

From ea00c3589948028b575803aa7e0b256a5b09780a Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 19:33:59 +0100
Subject: [PATCH 579/828] Assign lambda a value less than 1 to prevent example
 outbreak from overshooting

---
 R/stat_likelihoods.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index 2fda56e4..94428885 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -110,7 +110,7 @@ geom_length_ll <- function(x, prob) {
 #'   chains = c(1, 5, 6, 8, 7, 8, 10),
 #'   offspring_dist = "pois",
 #'   statistic = "size",
-#'   lambda = 2
+#'   lambda = 0.82
 #' )
 offspring_ll <- function(chains, offspring_dist, statistic,
                          nsim_offspring = 100, ...) {

From afacd873e861c6a662e2196a710befb4ba7e5931 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 19:38:25 +0100
Subject: [PATCH 580/828] Add exp transformation for when log=FALSE

---
 R/likelihood.R | 3 ---
 1 file changed, 3 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index be4b2928..43481066 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -118,9 +118,6 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
     chains_likelihood <- lapply(chains_likelihood, function(ll) exp(ll))
   }
 
-  ## if individual == FALSE, return the joint log-likelihood
-  ## (sum of the log-likelihoods), if log == TRUE, else
-  ## multiply the likelihoods
   if (!individual) {
     if (log) {
       chains_likelihood <- vapply(chains_likelihood, sum, 0)

From a68bb7d2e94ef59703d0c68aa4dd019c48be10b4 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 19:39:42 +0100
Subject: [PATCH 581/828] Add joint likelihood calculation for where
 individual=TRUE and depending on log=T/F

---
 R/likelihood.R | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/R/likelihood.R b/R/likelihood.R
index 43481066..be4b2928 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -118,6 +118,9 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
     chains_likelihood <- lapply(chains_likelihood, function(ll) exp(ll))
   }
 
+  ## if individual == FALSE, return the joint log-likelihood
+  ## (sum of the log-likelihoods), if log == TRUE, else
+  ## multiply the likelihoods
   if (!individual) {
     if (log) {
       chains_likelihood <- vapply(chains_likelihood, sum, 0)

From 15e38407ed58718f423ae6ddce2cbe1ffae7b734 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 19:48:24 +0100
Subject: [PATCH 582/828] Linting

---
 R/likelihood.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index be4b2928..9694143a 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -39,7 +39,7 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
 
   if (obs_prob <= 0 || obs_prob > 1) {
     stop("'obs_prob' is a probability and must be between 0 and 1 inclusive")
-    }
+  }
   if (obs_prob < 1) {
     if (missing(nsim_obs)) {
       stop("'nsim_obs' must be specified if 'obs_prob' is < 1")

From d7071873e9ce557418c1fd169226b0bccf34cff2 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 6 Sep 2023 19:49:17 +0100
Subject: [PATCH 583/828] Linting

---
 R/likelihood.R | 29 +++++++++++++++--------------
 1 file changed, 15 insertions(+), 14 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 9694143a..d381cbe7 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -62,7 +62,8 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
     sampled_x <- list(chains)
   }
 
-  ## determine for which sizes to calculate the log-likelihood (for true chain size)
+  ## determine for which sizes to calculate the log-likelihood
+  ## (for true chain size)
   if (any(size_x == stat_max)) {
     calc_sizes <- seq_len(stat_max - 1)
   } else {
@@ -80,18 +81,18 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
     likelihoods[calc_sizes] <- do.call(func, c(list(x = calc_sizes), pars))
   } else {
     likelihoods[calc_sizes] <-
-    do.call(
-      offspring_ll,
-      c(
-        list(
-          chains = calc_sizes,
-          offspring_dist = offspring_dist,
-          statistic = statistic,
-          stat_max = stat_max
-        ),
-        pars
+      do.call(
+        offspring_ll,
+        c(
+          list(
+            chains = calc_sizes,
+            offspring_dist = offspring_dist,
+            statistic = statistic,
+            stat_max = stat_max
+          ),
+          pars
+        )
       )
-    )
   }
 
   ## assign probabilities to stat_max outbreak sizes
@@ -115,7 +116,7 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
 
   ## transform log-likelihoods into likelihoods if required
   if (!log) {
-    chains_likelihood <- lapply(chains_likelihood, function(ll) exp(ll))
+    chains_likelihood <- lapply(chains_likelihood, exp)
   }
 
   ## if individual == FALSE, return the joint log-likelihood
@@ -124,7 +125,7 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
   if (!individual) {
     if (log) {
       chains_likelihood <- vapply(chains_likelihood, sum, 0)
-    } else{
+    } else {
       chains_likelihood <- vapply(chains_likelihood, prod, 0)
     }
   }

From a83eeef69c0377d7d95a7492656075edee17e278 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 7 Sep 2023 17:23:01 +0100
Subject: [PATCH 584/828] Change type coersion to use explicit logical checks

---
 R/likelihood.R | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index d381cbe7..6e9fe7f7 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -115,15 +115,15 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
   })
 
   ## transform log-likelihoods into likelihoods if required
-  if (!log) {
+  if (!isTRUE(log)) {
     chains_likelihood <- lapply(chains_likelihood, exp)
   }
 
   ## if individual == FALSE, return the joint log-likelihood
   ## (sum of the log-likelihoods), if log == TRUE, else
   ## multiply the likelihoods
-  if (!individual) {
-    if (log) {
+  if (!isTRUE(individual)) {
+    if (isTRUE(log)) {
       chains_likelihood <- vapply(chains_likelihood, sum, 0)
     } else {
       chains_likelihood <- vapply(chains_likelihood, prod, 0)

From a5baf84ebb93f5b43d4d7ac3cd046c5973201681 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 7 Sep 2023 17:24:27 +0100
Subject: [PATCH 585/828] Correct length distribution function doc

---
 R/stat_likelihoods.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index 94428885..8937811c 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -54,7 +54,7 @@ gborel_size_ll <- function(x, size, prob, mu) {
 
 #' Log-likelihood of the length of chains with Poisson offspring distribution
 #'
-#' @param x vector of sizes
+#' @param x vector of lengths
 #' @param lambda rate of the Poisson distribution
 #' @return log-likelihood values
 #' @author Sebastian Funk
@@ -72,7 +72,7 @@ pois_length_ll <- function(x, lambda) {
 
 #' Log-likelihood of the length of chains with geometric offspring distribution
 #'
-#' @param x vector of sizes
+#' @param x vector of lengths
 #' @param prob probability of the geometric distribution with mean
 #' \code{1/prob}
 #' @return log-likelihood values

From d98c72cea844ec391e74f1d0c9bac0b83a05ace9 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 7 Sep 2023 17:25:26 +0100
Subject: [PATCH 586/828] Generate length distribution function man files

---
 man/geom_length_ll.Rd | 2 +-
 man/pois_length_ll.Rd | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/man/geom_length_ll.Rd b/man/geom_length_ll.Rd
index 6c7dc6ad..6397d938 100644
--- a/man/geom_length_ll.Rd
+++ b/man/geom_length_ll.Rd
@@ -7,7 +7,7 @@
 geom_length_ll(x, prob)
 }
 \arguments{
-\item{x}{vector of sizes}
+\item{x}{vector of lengths}
 
 \item{prob}{probability of the geometric distribution with mean
 \code{1/prob}}
diff --git a/man/pois_length_ll.Rd b/man/pois_length_ll.Rd
index bf1f47ba..1aa38707 100644
--- a/man/pois_length_ll.Rd
+++ b/man/pois_length_ll.Rd
@@ -7,7 +7,7 @@
 pois_length_ll(x, lambda)
 }
 \arguments{
-\item{x}{vector of sizes}
+\item{x}{vector of lengths}
 
 \item{lambda}{rate of the Poisson distribution}
 }

From 8a3eb339d1c6c6acceb68ab325b3ac6af68dc608 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 7 Sep 2023 17:26:10 +0100
Subject: [PATCH 587/828] Use a bigger vector of chains for example

---
 R/likelihood.R    | 3 ++-
 man/likelihood.Rd | 3 ++-
 2 files changed, 4 insertions(+), 2 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 6e9fe7f7..b7ec5dbf 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -23,7 +23,8 @@
 #' @author Sebastian Funk
 #' @examples
 #' # example of observed chain sizes
-#' chain_sizes <- c(1, 1, 4, 7)
+#' set.seed(121)
+#' chain_sizes <- sample(1:10, 20, replace = TRUE)
 #' likelihood(
 #'   chains = chain_sizes, statistic = "size",
 #'   offspring_dist = "pois", nsim_obs = 100, lambda = 0.5
diff --git a/man/likelihood.Rd b/man/likelihood.Rd
index c8d8ff2e..66213269 100644
--- a/man/likelihood.Rd
+++ b/man/likelihood.Rd
@@ -65,7 +65,8 @@ Estimate the log-likelihood/likelihood for observed branching processes
 }
 \examples{
 # example of observed chain sizes
-chain_sizes <- c(1, 1, 4, 7)
+set.seed(121)
+chain_sizes <- sample(1:10, 20, replace = TRUE)
 likelihood(
   chains = chain_sizes, statistic = "size",
   offspring_dist = "pois", nsim_obs = 100, lambda = 0.5

From d5efee1a10f6740fbb995aba4e2d3b9e0438eccf Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 7 Sep 2023 17:26:23 +0100
Subject: [PATCH 588/828] Remove old tests

---
 tests/testthat/tests-ll.r | 10 ----------
 1 file changed, 10 deletions(-)
 delete mode 100644 tests/testthat/tests-ll.r

diff --git a/tests/testthat/tests-ll.r b/tests/testthat/tests-ll.r
deleted file mode 100644
index 13a7c339..00000000
--- a/tests/testthat/tests-ll.r
+++ /dev/null
@@ -1,10 +0,0 @@
-chains <- c(1, 1, 4, 7)
-test_that("Analytical size or length distributions are implemented", {
-  expect_true(all(pois_size_ll(chains, lambda = 0.5) < 0))
-  expect_true(all(nbinom_size_ll(chains, mu = 0.5, size = 0.2) < 0))
-  expect_true(all(nbinom_size_ll(chains, prob = 0.5, size = 0.2) < 0))
-  expect_true(all(gborel_size_ll(chains, prob = 0.5, size = 0.2) < 0))
-  expect_true(all(gborel_size_ll(chains, prob = 0.5, size = 0.2) < 0))
-  expect_true(all(pois_length_ll(chains, lambda = 0.5) < 0))
-  expect_true(all(geom_length_ll(chains, prob = 0.5) < 0))
-})

From 931d55801a49e3ecf8a583a899f79bbe365dd1a8 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 7 Sep 2023 17:26:55 +0100
Subject: [PATCH 589/828] Add tests for likelihood function

---
 tests/testthat/test-likelihood.R | 150 +++++++++++++++++++++++++++++++
 1 file changed, 150 insertions(+)
 create mode 100644 tests/testthat/test-likelihood.R

diff --git a/tests/testthat/test-likelihood.R b/tests/testthat/test-likelihood.R
new file mode 100644
index 00000000..c277ad89
--- /dev/null
+++ b/tests/testthat/test-likelihood.R
@@ -0,0 +1,150 @@
+chains <- c(1, 1, 4, 7)
+test_that(
+  "Likelihoods can be calculated",
+  {
+    expect_lt(
+      likelihood(
+        chains = chains,
+        statistic = "size",
+        offspring_dist = "pois",
+        lambda = 0.5
+      ),
+      0
+    )
+    expect_lt(
+      likelihood(
+        chains = chains,
+        statistic = "size",
+        offspring_dist = "pois",
+        lambda = 0.5,
+        exclude = 1
+      ),
+      0
+    )
+    expect_lt(
+      likelihood(
+        chains = chains,
+        statistic = "size",
+        offspring_dist = "pois",
+        lambda = 0.5,
+        stat_max = 5
+      ),
+      0
+    )
+    expect_lt(
+      likelihood(
+        chains = chains,
+        statistic = "size",
+        offspring_dist = "pois",
+        lambda = 0.5,
+        obs_prob = 0.5,
+        nsim_obs = 1
+      ),
+      0
+    )
+    expect_lt(
+      likelihood(
+        chains = chains,
+        statistic = "size",
+        offspring_dist = "pois",
+        lambda = 0.5,
+        stat_max = 5,
+        obs_prob = 0.5,
+        nsim_obs = 1
+      ),
+      0
+    )
+  }
+)
+
+test_that("Likelihoods are numerically correct", {
+  expect_identical(
+    round(
+      likelihood(
+        chains = chains,
+        statistic = "size",
+        offspring_dist = "pois",
+        lambda = 0.5
+      ), 5
+    ),
+    -8.6072
+  )
+  expect_identical(
+    round(
+      likelihood(
+        chains = chains,
+        statistic = "size",
+        offspring_dist = "nbinom",
+        mu = 0.5,
+        size = 0.2
+      ), 5
+    ),
+    -9.13437
+  )
+  expect_identical(
+    round(
+      likelihood(
+        chains = chains,
+        statistic = "size",
+        offspring_dist = "gborel",
+        prob = 0.5,
+        size = 0.2
+      ), 5
+    ),
+    -11.21929
+  )
+  expect_identical(
+    round(
+      likelihood(
+        chains = chains,
+        statistic = "length",
+        offspring_dist = "pois",
+        lambda = 0.5
+      ), 5
+    ),
+    -9.39945
+  )
+  expect_identical(
+    round(
+      likelihood(
+        chains = chains,
+        statistic = "length",
+        offspring_dist = "geom",
+        prob = 0.5
+      ), 5
+    ),
+    -12.48639
+  )
+})
+
+test_that("Errors are thrown", {
+  expect_error(
+    likelihood(
+      chains = chains,
+      offspring_dist = list(),
+      statistic = "size",
+      lambda = 0.5
+    ),
+    "must be specified as a character string"
+  )
+  expect_error(
+    likelihood(
+      chains = chains,
+      offspring_dist = "pois",
+      statistic = "size",
+      lambda = 0.5,
+      obs_prob = 3
+    ),
+    "must be between 0 and 1"
+  )
+  expect_error(
+    likelihood(
+      chains = chains,
+      offspring_dist = "pois",
+      statistic = "size",
+      lambda = 0.5,
+      obs_prob = 0.5
+    ),
+    "must be specified"
+  )
+})

From d996ea882499c13d77bda851231be92974d87ed6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 7 Sep 2023 17:27:32 +0100
Subject: [PATCH 590/828] Add tests for stat log-likelihood functions

---
 tests/testthat/test-stat_likelihoods.R | 116 +++++++++++++++++++++++++
 1 file changed, 116 insertions(+)
 create mode 100644 tests/testthat/test-stat_likelihoods.R

diff --git a/tests/testthat/test-stat_likelihoods.R b/tests/testthat/test-stat_likelihoods.R
new file mode 100644
index 00000000..2f9783d9
--- /dev/null
+++ b/tests/testthat/test-stat_likelihoods.R
@@ -0,0 +1,116 @@
+set.seed(1231)
+chains <- c(1, 1, 4, 7)
+test_that("Analytical chain size distributions are numerically correct", {
+  expect_identical(
+    round(
+      nbinom_size_ll(
+        x = chains,
+        mu = 0.5,
+        size = 0.2
+      ),
+      5
+    ),
+    c(-0.25055, -0.25055, -3.79542, -4.83785)
+  )
+  expect_identical(
+    round(
+      nbinom_size_ll(
+        x = chains,
+        prob = 0.5,
+        size = 0.2
+      ),
+      5
+    ),
+    c(-0.13863, -0.13863, -4.41775, -6.19443)
+  )
+  expect_identical(
+    round(
+      gborel_size_ll(
+        x = chains,
+        mu = 0.5,
+        size = 0.2
+      ),
+      5
+    ),
+    c(-0.25055, -0.25055, -4.58222, -5.83390)
+  )
+  expect_identical(
+    round(
+      gborel_size_ll(
+        x = chains,
+        prob = 0.5,
+        size = 0.2
+      ),
+      5
+    ),
+    c(-0.13863, -0.13863, -4.80803, -6.13400)
+  )
+})
+
+test_that("Analytical chain lengths distributions are numerically correct", {
+  expect_identical(
+    round(
+      pois_length_ll(
+        x = chains,
+        lambda = 0.5
+      ),
+      5
+    ),
+    c(-0.50000, -0.50000, -3.13243, -5.26702)
+  )
+  expect_identical(
+    round(
+      geom_length_ll(
+        x = chains,
+        prob = 0.5
+      ),
+      5
+    ),
+    c(-1.09861, -1.09861, -4.06260, -6.22657)
+  )
+})
+
+test_that("Generic offspring log-likelihoods are calculated", {
+  expect_true(
+    all(
+      offspring_ll(
+        chains = chains,
+        offspring_dist = "pois",
+        nsim_offspring = 100,
+        statistic = "size",
+        lambda = 0.82
+      ) < 0
+    )
+  )
+  expect_length(
+    offspring_ll(
+      chains = chains,
+      offspring_dist = "pois",
+      nsim_offspring = 100,
+      statistic = "size",
+      lambda = 0.82
+    ),
+    100
+  )
+})
+
+test_that("Errors are thrown", {
+  expect_error(
+    nbinom_size_ll(
+      x = chains,
+      mu = 0.5,
+      size = 0.2,
+      prob = 0.1
+    ),
+    "both specified"
+  )
+  expect_error(
+    gborel_size_ll(
+      x = chains,
+      mu = 0.5,
+      size = 0.2,
+      prob = 0.1
+    ),
+    "both specified"
+  )
+})

From a4c492212afb7fa2d7332a3171ea6f598bad6255 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 7 Sep 2023 20:25:19 +0100
Subject: [PATCH 591/828] Use the right stat vector for the log-likelihood
 calculation

---
 R/stat_likelihoods.R | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index 8937811c..d23fdb84 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -115,7 +115,7 @@ geom_length_ll <- function(x, prob) {
 offspring_ll <- function(chains, offspring_dist, statistic,
                          nsim_offspring = 100, ...) {
   # Simulate the chains
-  chains <- simulate_summary(
+  dist <- simulate_summary(
     nchains = nsim_offspring,
     offspring_dist = offspring_dist,
     statistic = statistic,
@@ -124,13 +124,13 @@ offspring_ll <- function(chains, offspring_dist, statistic,
 
   # Compute the empirical Cumulative Distribution Function of the
   # simulated chains
-  chains_empirical_cdf <- stats::ecdf(chains)
+  chains_empirical_cdf <- stats::ecdf(dist)
 
   # Perform a lagged linear interpolation of the points
   acdf <-
     diff(c(0, stats::approx(
-      unique(chains), chains_empirical_cdf(unique(chains)),
-      seq_len(max(chains[is.finite(chains)]))
+      unique(dist), chains_empirical_cdf(unique(dist)),
+      seq_len(max(dist[is.finite(dist)]))
     )$y))
   lik <- acdf[chains]
   lik[is.na(lik)] <- 0

From 50de09191421ec4a597c33519afed51012de4896 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 7 Sep 2023 20:36:57 +0100
Subject: [PATCH 592/828] Lint

---
 R/stat_likelihoods.R | 17 ++++++++++++-----
 1 file changed, 12 insertions(+), 5 deletions(-)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index d23fdb84..cb0ba039 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -127,11 +127,18 @@ offspring_ll <- function(chains, offspring_dist, statistic,
   chains_empirical_cdf <- stats::ecdf(dist)
 
   # Perform a lagged linear interpolation of the points
-  acdf <-
-    diff(c(0, stats::approx(
-      unique(dist), chains_empirical_cdf(unique(dist)),
-      seq_len(max(dist[is.finite(dist)]))
-    )$y))
+  acdf <- diff(
+    c(
+      0,
+      stats::approx(
+        unique(dist),
+        chains_empirical_cdf(unique(dist)),
+        seq_len(
+          max(dist[is.finite(dist)])
+        )
+      )$y
+    )
+  )
   lik <- acdf[chains]
   lik[is.na(lik)] <- 0
   out <- log(lik)

From bf1f15d06269cf2f56976f236b35fea665f6a49a Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 7 Sep 2023 20:37:10 +0100
Subject: [PATCH 593/828] Fix tests

---
 tests/testthat/test-stat_likelihoods.R | 2 +-
 tests/testthat/tests-sim.r             | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/tests/testthat/test-stat_likelihoods.R b/tests/testthat/test-stat_likelihoods.R
index 2f9783d9..a63a53df 100644
--- a/tests/testthat/test-stat_likelihoods.R
+++ b/tests/testthat/test-stat_likelihoods.R
@@ -90,7 +90,7 @@ test_that("Generic offspring log-likelihoods are calculated", {
       statistic = "size",
       lambda = 0.82
     ),
-    100
+    4
   )
 })
 
diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
index 67386a95..05b7b813 100644
--- a/tests/testthat/tests-sim.r
+++ b/tests/testthat/tests-sim.r
@@ -21,7 +21,7 @@ test_that("Simulators output epichains objects", {
   )
   expect_s3_class(
     simulate_summary(
-      n = 10,
+      nchains = 10,
       offspring_dist = "pois",
       lambda = 2,
       stat_max = 10

From 41ed6a4d96c0e94e50fe7c4b1574e8da3a53b3fa Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 11 Sep 2023 19:25:11 +0100
Subject: [PATCH 594/828] Validate the log argument

---
 R/likelihood.R | 6 ++++++
 1 file changed, 6 insertions(+)

diff --git a/R/likelihood.R b/R/likelihood.R
index b7ec5dbf..8e5ff0e8 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -37,6 +37,12 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
 
   ## checks
   check_offspring_valid(offspring_dist)
+  checkmate::assert_logical(
+    log,
+    any.missing = FALSE,
+    all.missing = FALSE,
+    len = 1
+  )
 
   if (obs_prob <= 0 || obs_prob > 1) {
     stop("'obs_prob' is a probability and must be between 0 and 1 inclusive")

From 3154035f37a575be25eab8a3cc0148a4d1e4d586 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 11 Sep 2023 19:25:46 +0100
Subject: [PATCH 595/828] Add more tests of the likelihood() function to
 improve coverage

---
 tests/testthat/test-likelihood.R | 44 ++++++++++++++++++++++++++++++++
 1 file changed, 44 insertions(+)

diff --git a/tests/testthat/test-likelihood.R b/tests/testthat/test-likelihood.R
index c277ad89..ac2750ec 100644
--- a/tests/testthat/test-likelihood.R
+++ b/tests/testthat/test-likelihood.R
@@ -54,6 +54,39 @@ test_that(
       ),
       0
     )
+    expect_lt(
+      likelihood(
+        chains = chains,
+        statistic = "length",
+        offspring_dist = "binom",
+        size = 1,
+        prob = 0.5
+      ),
+      0
+    )
+    expect_gte(
+      likelihood(
+        chains = chains,
+        statistic = "length",
+        offspring_dist = "binom",
+        size = 1,
+        prob = 0.5,
+        log = FALSE
+      ),
+      0
+    )
+    expect_gte(
+      likelihood(
+        chains = chains,
+        statistic = "length",
+        offspring_dist = "binom",
+        size = 1,
+        prob = 0.5,
+        individual = FALSE,
+        log = FALSE
+      ),
+      0
+    )
   }
 )
 
@@ -147,4 +180,15 @@ test_that("Errors are thrown", {
     ),
     "must be specified"
   )
+  expect_error(
+    likelihood(
+      chains = chains,
+      statistic = "size",
+      offspring_dist = "pois",
+      nsim_obs = 100,
+      lambda = 0.5,
+      log = "s"
+      ),
+  "Must be of type 'logical'"
+  )
 })

From d36373481f2bb68dfaa98638cdd998b18063479f Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 11 Sep 2023 19:26:07 +0100
Subject: [PATCH 596/828] Add a test for construct_offspring_ll_name()

---
 tests/testthat/test-helpers.R | 9 +++++++++
 1 file changed, 9 insertions(+)
 create mode 100644 tests/testthat/test-helpers.R

diff --git a/tests/testthat/test-helpers.R b/tests/testthat/test-helpers.R
new file mode 100644
index 00000000..3745e5f9
--- /dev/null
+++ b/tests/testthat/test-helpers.R
@@ -0,0 +1,9 @@
+test_that("Helper functions work correctly", {
+  expect_equal(
+    construct_offspring_ll_name(
+      offspring_dist = "pois",
+      chain_statistic = "size"
+      ),
+    "pois_size_ll"
+  )
+})

From 6cb3ec2556e0d6f0217f840b4b64c9240edbdf19 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 11 Sep 2023 19:47:10 +0100
Subject: [PATCH 597/828] Fixed linting issues with the tests

---
 tests/testthat/test-helpers.R    | 4 ++--
 tests/testthat/test-likelihood.R | 4 ++--
 2 files changed, 4 insertions(+), 4 deletions(-)

diff --git a/tests/testthat/test-helpers.R b/tests/testthat/test-helpers.R
index 3745e5f9..1fbc99d3 100644
--- a/tests/testthat/test-helpers.R
+++ b/tests/testthat/test-helpers.R
@@ -1,9 +1,9 @@
 test_that("Helper functions work correctly", {
-  expect_equal(
+  expect_identical(
     construct_offspring_ll_name(
       offspring_dist = "pois",
       chain_statistic = "size"
-      ),
+    ),
     "pois_size_ll"
   )
 })
diff --git a/tests/testthat/test-likelihood.R b/tests/testthat/test-likelihood.R
index ac2750ec..cc41654b 100644
--- a/tests/testthat/test-likelihood.R
+++ b/tests/testthat/test-likelihood.R
@@ -188,7 +188,7 @@ test_that("Errors are thrown", {
       nsim_obs = 100,
       lambda = 0.5,
       log = "s"
-      ),
-  "Must be of type 'logical'"
+    ),
+    "Must be of type"
   )
 })

From 12c6506e0e1804f1d5a8d53682d64b13a01e05db Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 12 Sep 2023 13:12:09 +0100
Subject: [PATCH 598/828] Remove default value of nsim_obs argument

---
 R/likelihood.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 8e5ff0e8..d4ddef79 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -31,7 +31,7 @@
 #' )
 #' @export
 likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
-                       nsim_obs = 100, log = TRUE, obs_prob = 1, stat_max = Inf,
+                       nsim_obs, log = TRUE, obs_prob = 1, stat_max = Inf,
                        exclude = NULL, individual = FALSE, ...) {
   statistic <- match.arg(statistic)
 

From 0280b13c7d98f1591a872f19535b763b43a94f01 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 12 Sep 2023 14:16:07 +0100
Subject: [PATCH 599/828] Reinstate control to overwrite stat_max when
 specified as Inf

---
 R/likelihood.R | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/R/likelihood.R b/R/likelihood.R
index d4ddef79..5976f038 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -62,7 +62,9 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
       stat_max
     ), simplify = FALSE)
     size_x <- unlist(sampled_x)
+    if (!is.finite(stat_max)) {
     stat_max <- max(size_x) + 1
+    }
   } else {
     chains[chains >= stat_max] <- stat_max
     size_x <- chains

From 6c6f7f48d0f425903a811c4bf4c9880b98f3aee1 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 12 Sep 2023 17:59:14 +0100
Subject: [PATCH 600/828] Inherit dot params from offspring_ll instead of
 simulate_summary

---
 R/likelihood.R    | 2 +-
 man/likelihood.Rd | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 5976f038..45dd8585 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -1,7 +1,7 @@
 #' Estimate the log-likelihood/likelihood for observed branching processes
 #'
-#' @inheritParams simulate_summary
 #' @inheritParams offspring_ll
+#' @inheritParams simulate_summary
 #' @param nsim_obs Number of simulations if the log-likelihood/likelihood is to
 #' be approximated for imperfect observations.
 #' @param log Logical; Should the log-likelihoods be transformed to
diff --git a/man/likelihood.Rd b/man/likelihood.Rd
index 66213269..2e25c133 100644
--- a/man/likelihood.Rd
+++ b/man/likelihood.Rd
@@ -48,7 +48,7 @@ log-likelihood calculation.}
 \item{individual}{If TRUE, a vector of individual log-likelihood/likelihood
 contributions will be returned rather than the sum.}
 
-\item{...}{Parameters of the offspring distribution as required by R.}
+\item{...}{any parameters to pass to \code{\link{simulate_summary}}}
 }
 \value{
 \itemize{

From 46f93fb88acb97d0c52f9e792924c9abdfe64cc1 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Tue, 12 Sep 2023 13:13:27 +0100
Subject: [PATCH 601/828] Add a comment to example

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 R/likelihood.R | 1 +
 1 file changed, 1 insertion(+)

diff --git a/R/likelihood.R b/R/likelihood.R
index 45dd8585..100940d3 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -24,6 +24,7 @@
 #' @examples
 #' # example of observed chain sizes
 #' set.seed(121)
+#' ## randomly generate 20 chains of size 1 to 10
 #' chain_sizes <- sample(1:10, 20, replace = TRUE)
 #' likelihood(
 #'   chains = chain_sizes, statistic = "size",

From 5c3601e91767d1779d99e9814cdc62e485685b16 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Tue, 12 Sep 2023 14:19:06 +0100
Subject: [PATCH 602/828] Replace !isTRUE to isFALSE

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 R/likelihood.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 100940d3..50704b37 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -125,14 +125,14 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
   })
 
   ## transform log-likelihoods into likelihoods if required
-  if (!isTRUE(log)) {
+  if (isFALSE(log)) {
     chains_likelihood <- lapply(chains_likelihood, exp)
   }
 
   ## if individual == FALSE, return the joint log-likelihood
   ## (sum of the log-likelihoods), if log == TRUE, else
   ## multiply the likelihoods
-  if (!isTRUE(individual)) {
+  if (isFALSE(individual)) {
     if (isTRUE(log)) {
       chains_likelihood <- vapply(chains_likelihood, sum, 0)
     } else {

From a507250bf03602ace42ceaefda5ec744025fcee6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 12 Sep 2023 20:47:52 +0100
Subject: [PATCH 603/828] Generate doc for removed default value of nsim_obs

---
 man/likelihood.Rd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/man/likelihood.Rd b/man/likelihood.Rd
index 2e25c133..0a736029 100644
--- a/man/likelihood.Rd
+++ b/man/likelihood.Rd
@@ -8,7 +8,7 @@ likelihood(
   chains,
   statistic = c("size", "length"),
   offspring_dist,
-  nsim_obs = 100,
+  nsim_obs,
   log = TRUE,
   obs_prob = 1,
   stat_max = Inf,

From 1d073c84580128a75de39e2a7640c73ae7a66c84 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 12 Sep 2023 20:48:25 +0100
Subject: [PATCH 604/828] Rename variables

---
 R/likelihood.R | 20 ++++++++++----------
 1 file changed, 10 insertions(+), 10 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 50704b37..be3cb0e5 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -55,29 +55,29 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
 
     statistic_func <- get_statistic_func(statistic)
 
-    sampled_x <- replicate(nsim_obs, pmin(
+    stat_rep_list <- replicate(nsim_obs, pmin(
       statistic_func(
         length(chains),
         chains, obs_prob
       ),
       stat_max
     ), simplify = FALSE)
-    size_x <- unlist(sampled_x)
+    stat_rep_vect <- unlist(stat_rep_list)
     if (!is.finite(stat_max)) {
-    stat_max <- max(size_x) + 1
+    stat_max <- max(stat_rep_vect) + 1
     }
   } else {
     chains[chains >= stat_max] <- stat_max
-    size_x <- chains
-    sampled_x <- list(chains)
+    stat_rep_vect <- chains
+    stat_rep_list <- list(chains)
   }
 
   ## determine for which sizes to calculate the log-likelihood
   ## (for true chain size)
-  if (any(size_x == stat_max)) {
+  if (any(stat_rep_vect == stat_max)) {
     calc_sizes <- seq_len(stat_max - 1)
   } else {
-    calc_sizes <- unique(c(size_x, exclude))
+    calc_sizes <- unique(c(stat_rep_vect, exclude))
   }
 
   ## get log-likelihood function as given by offspring_dist and statistic
@@ -106,7 +106,7 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
   }
 
   ## assign probabilities to stat_max outbreak sizes
-  if (any(size_x == stat_max)) {
+  if (any(stat_rep_vect == stat_max)) {
     likelihoods[stat_max] <- complementary_logprob(likelihoods)
   }
 
@@ -114,13 +114,13 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
     likelihoods <- likelihoods - log(-expm1(sum(likelihoods[exclude])))
     likelihoods[exclude] <- -Inf
 
-    sampled_x <- lapply(sampled_x, function(y) {
+    stat_rep_list <- lapply(stat_rep_list, function(y) {
       y[!(y %in% exclude)]
     })
   }
 
   ## assign likelihoods
-  chains_likelihood <- lapply(sampled_x, function(sx) {
+  chains_likelihood <- lapply(stat_rep_list, function(sx) {
     likelihoods[sx[!(sx %in% exclude)]]
   })
 

From f649286900938f0791750575d6b9f046191fd116 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 12 Sep 2023 20:49:27 +0100
Subject: [PATCH 605/828] Revise documentation of likelihood function

---
 R/likelihood.R    | 24 ++++++++++++++++--------
 man/likelihood.Rd | 22 +++++++++++++++-------
 2 files changed, 31 insertions(+), 15 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index be3cb0e5..bb5c4404 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -10,21 +10,29 @@
 #' @param exclude A vector of indices of the sizes/lengths to exclude from the
 #' log-likelihood calculation.
 #' @param individual If TRUE, a vector of individual log-likelihood/likelihood
-#' contributions will be returned rather than the sum.
+#' contributions will be returned rather than the sum/product.
 #' @return
-#' * A vector of log-likelihoods, if \code{log = TRUE} (the default) and
-#' \code{obs_prob < 1}, or
-#' * A list of individual log-likelihood contributions, if
-#' \code{log = TRUE} (the default) and \code{individual = TRUE}.
-#' The interpretation follows for the other combinations of `log` and
-#' `individual`.
+#' If log = TRUE:
+#'
+#' * A joint log-likelihood (sum of individual log-likelihoods), if
+#' \code{individual == FALSE} (default) and \code{obs_prob = 1} (default), or
+#' * A list of individual log-likelihoods, if \code{individual == TRUE} and
+#' \code{obs_prob = 1} (default), or
+#' * A list of individual log-likelihoods (same length as `nsim_obs`), if
+#' \code{individual == TRUE} and \code{0 <= obs_prob < 1}, or
+#' * A vector of joint log-likelihoods (same length as `nsim_obs`), if
+#' individual == FALSE and \code{0 <= obs_prob < 1} (imperfect observation).
+#'
+#' If \code{log = FALSE}, likelihoods, instead of log-likelihoods, are returned
+#' and the joint likelihoods are the product, instead of the sum, of the
+#' individual likelihoods.
 #' @seealso offspring_ll(), pois_size_ll(), nbinom_size_ll(), gborel_size_ll(),
 #' pois_length_ll(), geom_length_ll()
 #' @author Sebastian Funk
 #' @examples
 #' # example of observed chain sizes
 #' set.seed(121)
-#' ## randomly generate 20 chains of size 1 to 10
+#' # randomly generate 20 chains of size 1 to 10
 #' chain_sizes <- sample(1:10, 20, replace = TRUE)
 #' likelihood(
 #'   chains = chain_sizes, statistic = "size",
diff --git a/man/likelihood.Rd b/man/likelihood.Rd
index 0a736029..4173e3a4 100644
--- a/man/likelihood.Rd
+++ b/man/likelihood.Rd
@@ -46,19 +46,26 @@ computed. Results above the specified value, are set to \code{Inf}.}
 log-likelihood calculation.}
 
 \item{individual}{If TRUE, a vector of individual log-likelihood/likelihood
-contributions will be returned rather than the sum.}
+contributions will be returned rather than the sum/product.}
 
 \item{...}{any parameters to pass to \code{\link{simulate_summary}}}
 }
 \value{
+If log = TRUE:
 \itemize{
-\item A vector of log-likelihoods, if \code{log = TRUE} (the default) and
-\code{obs_prob < 1}, or
-\item A list of individual log-likelihood contributions, if
-\code{log = TRUE} (the default) and \code{individual = TRUE}.
-The interpretation follows for the other combinations of \code{log} and
-\code{individual}.
+\item A joint log-likelihood (sum of individual log-likelihoods), if
+\code{individual == FALSE} (default) and \code{obs_prob = 1} (default), or
+\item A list of individual log-likelihoods, if \code{individual == TRUE} and
+\code{obs_prob = 1} (default), or
+\item A list of individual log-likelihoods (same length as \code{nsim_obs}), if
+\code{individual == TRUE} and \code{0 <= obs_prob < 1}, or
+\item A vector of joint log-likelihoods (same length as \code{nsim_obs}), if
+individual == FALSE and \code{0 <= obs_prob < 1} (imperfect observation).
 }
+
+If \code{log = FALSE}, likelihoods, instead of log-likelihoods, are returned
+and the joint likelihoods are the product, instead of the sum, of the
+individual likelihoods.
 }
 \description{
 Estimate the log-likelihood/likelihood for observed branching processes
@@ -66,6 +73,7 @@ Estimate the log-likelihood/likelihood for observed branching processes
 \examples{
 # example of observed chain sizes
 set.seed(121)
+# randomly generate 20 chains of size 1 to 10
 chain_sizes <- sample(1:10, 20, replace = TRUE)
 likelihood(
   chains = chain_sizes, statistic = "size",

From b185764fe1873ed22ba1cdaeb6d5b95862cd39b5 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 12 Sep 2023 21:01:48 +0100
Subject: [PATCH 606/828] Rename chains to x

---
 R/stat_likelihoods.R | 8 ++++----
 man/offspring_ll.Rd  | 6 +++---
 2 files changed, 7 insertions(+), 7 deletions(-)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index cb0ba039..148ffbf6 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -94,7 +94,7 @@ geom_length_ll <- function(x, prob) {
 #' cumulative distribution function (ecdf).
 #' @inheritParams likelihood
 #' @inheritParams simulate_summary
-#' @param chains Vector of chain summaries (sizes/lengths)
+#' @param x Vector of chain summaries (sizes/lengths)
 #' @param nsim_offspring Number of simulations of the offspring distribution
 #' for approximating the distribution of the chain statistic summary
 #' (size/length)
@@ -107,12 +107,12 @@ geom_length_ll <- function(x, prob) {
 #' @examples
 #' set.seed(123)
 #' chain_size_ll <- offspring_ll(
-#'   chains = c(1, 5, 6, 8, 7, 8, 10),
+#'   x = c(1, 5, 6, 8, 7, 8, 10),
 #'   offspring_dist = "pois",
 #'   statistic = "size",
 #'   lambda = 0.82
 #' )
-offspring_ll <- function(chains, offspring_dist, statistic,
+offspring_ll <- function(x, offspring_dist, statistic,
                          nsim_offspring = 100, ...) {
   # Simulate the chains
   dist <- simulate_summary(
@@ -139,7 +139,7 @@ offspring_ll <- function(chains, offspring_dist, statistic,
       )$y
     )
   )
-  lik <- acdf[chains]
+  lik <- acdf[x]
   lik[is.na(lik)] <- 0
   out <- log(lik)
   return(out)
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index 4aa8d874..53632fee 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -5,10 +5,10 @@
 \title{Log-likelihood of the summary (size/length) of chains with generic offspring
 distribution}
 \usage{
-offspring_ll(chains, offspring_dist, statistic, nsim_offspring = 100, ...)
+offspring_ll(x, offspring_dist, statistic, nsim_offspring = 100, ...)
 }
 \arguments{
-\item{chains}{Vector of chain summaries (sizes/lengths)}
+\item{x}{Vector of chain summaries (sizes/lengths)}
 
 \item{offspring_dist}{Offspring distribution: a character string
 corresponding to the R distribution function (e.g., "pois" for Poisson,
@@ -38,7 +38,7 @@ cumulative distribution function (ecdf).
 \examples{
 set.seed(123)
 chain_size_ll <- offspring_ll(
-  chains = c(1, 5, 6, 8, 7, 8, 10),
+  x = c(1, 5, 6, 8, 7, 8, 10),
   offspring_dist = "pois",
   statistic = "size",
   lambda = 0.82

From 984b29733d81190f828c803e59afcd699ffaff03 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 12 Sep 2023 21:02:26 +0100
Subject: [PATCH 607/828] Document chains argument due to loss of inheritance

---
 R/likelihood.R | 1 +
 1 file changed, 1 insertion(+)

diff --git a/R/likelihood.R b/R/likelihood.R
index bb5c4404..01f1f78a 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -2,6 +2,7 @@
 #'
 #' @inheritParams offspring_ll
 #' @inheritParams simulate_summary
+#' @param chains Vector of chain summaries (sizes/lengths)
 #' @param nsim_obs Number of simulations if the log-likelihood/likelihood is to
 #' be approximated for imperfect observations.
 #' @param log Logical; Should the log-likelihoods be transformed to

From 1416315f2e18f30b87915e80860337a3e9c024b9 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 12 Sep 2023 21:02:37 +0100
Subject: [PATCH 608/828] Linting

---
 R/likelihood.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 01f1f78a..634ecf1c 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -73,7 +73,7 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
     ), simplify = FALSE)
     stat_rep_vect <- unlist(stat_rep_list)
     if (!is.finite(stat_max)) {
-    stat_max <- max(stat_rep_vect) + 1
+      stat_max <- max(stat_rep_vect) + 1
     }
   } else {
     chains[chains >= stat_max] <- stat_max

From b5ae5b4bd08cde2010c595654b662fe09470eb25 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 12 Sep 2023 21:02:57 +0100
Subject: [PATCH 609/828] Revise return value documentation

---
 R/likelihood.R    | 9 +++++----
 man/likelihood.Rd | 9 +++++----
 2 files changed, 10 insertions(+), 8 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 634ecf1c..23c5552d 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -13,7 +13,7 @@
 #' @param individual If TRUE, a vector of individual log-likelihood/likelihood
 #' contributions will be returned rather than the sum/product.
 #' @return
-#' If log = TRUE:
+#' If \code{log = TRUE}
 #'
 #' * A joint log-likelihood (sum of individual log-likelihoods), if
 #' \code{individual == FALSE} (default) and \code{obs_prob = 1} (default), or
@@ -24,9 +24,10 @@
 #' * A vector of joint log-likelihoods (same length as `nsim_obs`), if
 #' individual == FALSE and \code{0 <= obs_prob < 1} (imperfect observation).
 #'
-#' If \code{log = FALSE}, likelihoods, instead of log-likelihoods, are returned
-#' and the joint likelihoods are the product, instead of the sum, of the
-#' individual likelihoods.
+#' If \code{log = FALSE}, the same structure of outputs as above are returned,
+#' except that likelihoods, instead of log-likelihoods, are calculated in all
+#' cases. Moreover, the joint likelihoods are the product, instead of the sum,
+#' of the individual likelihoods.
 #' @seealso offspring_ll(), pois_size_ll(), nbinom_size_ll(), gborel_size_ll(),
 #' pois_length_ll(), geom_length_ll()
 #' @author Sebastian Funk
diff --git a/man/likelihood.Rd b/man/likelihood.Rd
index 4173e3a4..6c69b974 100644
--- a/man/likelihood.Rd
+++ b/man/likelihood.Rd
@@ -51,7 +51,7 @@ contributions will be returned rather than the sum/product.}
 \item{...}{any parameters to pass to \code{\link{simulate_summary}}}
 }
 \value{
-If log = TRUE:
+If \code{log = TRUE}
 \itemize{
 \item A joint log-likelihood (sum of individual log-likelihoods), if
 \code{individual == FALSE} (default) and \code{obs_prob = 1} (default), or
@@ -63,9 +63,10 @@ If log = TRUE:
 individual == FALSE and \code{0 <= obs_prob < 1} (imperfect observation).
 }
 
-If \code{log = FALSE}, likelihoods, instead of log-likelihoods, are returned
-and the joint likelihoods are the product, instead of the sum, of the
-individual likelihoods.
+If \code{log = FALSE}, the same structure of outputs as above are returned,
+except that likelihoods, instead of log-likelihoods, are calculated in all
+cases. Moreover, the joint likelihoods are the product, instead of the sum,
+of the individual likelihoods.
 }
 \description{
 Estimate the log-likelihood/likelihood for observed branching processes

From 658904e2c6388b70b12982c727fdacc5ee570fb9 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 12 Sep 2023 21:43:06 +0100
Subject: [PATCH 610/828] Update offspring_ll calls to use x instead of chains

---
 R/likelihood.R                         | 2 +-
 tests/testthat/test-stat_likelihoods.R | 4 ++--
 2 files changed, 3 insertions(+), 3 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 23c5552d..4bd41ce1 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -105,7 +105,7 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
         offspring_ll,
         c(
           list(
-            chains = calc_sizes,
+            x = calc_sizes,
             offspring_dist = offspring_dist,
             statistic = statistic,
             stat_max = stat_max
diff --git a/tests/testthat/test-stat_likelihoods.R b/tests/testthat/test-stat_likelihoods.R
index a63a53df..5de37753 100644
--- a/tests/testthat/test-stat_likelihoods.R
+++ b/tests/testthat/test-stat_likelihoods.R
@@ -74,7 +74,7 @@ test_that("Generic offspring log-likelihoods are calculated", {
   expect_true(
     all(
       offspring_ll(
-        chains = chains,
+        x = chains,
         offspring_dist = "pois",
         nsim_offspring = 100,
         statistic = "size",
@@ -84,7 +84,7 @@ test_that("Generic offspring log-likelihoods are calculated", {
   )
   expect_length(
     offspring_ll(
-      chains = chains,
+      x = chains,
       offspring_dist = "pois",
       nsim_offspring = 100,
       statistic = "size",

From 855280591fac671e8c2f1dc7b96d5bf26901fa21 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 12 Sep 2023 21:43:22 +0100
Subject: [PATCH 611/828] Add more tests

---
 tests/testthat/test-stat_likelihoods.R | 61 ++++++++++++++++++++++++++
 1 file changed, 61 insertions(+)

diff --git a/tests/testthat/test-stat_likelihoods.R b/tests/testthat/test-stat_likelihoods.R
index 5de37753..bfc81e8d 100644
--- a/tests/testthat/test-stat_likelihoods.R
+++ b/tests/testthat/test-stat_likelihoods.R
@@ -45,6 +45,16 @@ test_that("Analytical chain size distributions are numerically correct", {
     ),
     c(-0.13863, -0.13863, -4.80803, -6.13400)
   )
+  expect_identical(
+    round(
+      pois_size_ll(
+        x = chains,
+        lambda = 0.2
+      ),
+      5
+    ),
+    c(-0.20000, -0.20000, -4.64748, -7.90633)
+  )
 })
 
 test_that("Analytical chain lengths distributions are numerically correct", {
@@ -114,3 +124,54 @@ test_that("Errors are thrown", {
     "both specified"
   )
 })
+
+test_that("Likelihood function returns the right object classes", {
+  expect_type(
+    likelihood(
+      chains = chains,
+      statistic = "size",
+      offspring_dist = "pois",
+      nsim_obs = 100,
+      lambda = 0.5,
+      obs_prob = 0.5,
+      individual = TRUE
+    ),
+    "list"
+  )
+  expect_type(
+    likelihood(
+      chains = chains,
+      statistic = "size",
+      offspring_dist = "pois",
+      nsim_obs = 3,
+      lambda = 0.5,
+      obs_prob = 0.5,
+      individual = FALSE
+    ),
+    "double"
+  )
+  expect_type(
+    likelihood(
+      chains = chains,
+      statistic = "size",
+      offspring_dist = "pois",
+      nsim_obs = 3,
+      lambda = 0.5,
+      obs_prob = 1,
+      individual = TRUE
+    ),
+    "list"
+  )
+  expect_type(
+    likelihood(
+      chains = chains,
+      statistic = "size",
+      offspring_dist = "pois",
+      nsim_obs = 100,
+      lambda = 0.5,
+      obs_prob = 1,
+      individual = FALSE
+    ),
+    "double"
+  )
+})

From cfa9a65fff6780b4adb15683d3031d00c4e3898e Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Tue, 12 Sep 2023 22:22:23 +0100
Subject: [PATCH 612/828] Abbreviate code block with ifelse + vapply construct

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 R/likelihood.R | 7 ++-----
 1 file changed, 2 insertions(+), 5 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 4bd41ce1..79ed452a 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -143,11 +143,8 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
   ## (sum of the log-likelihoods), if log == TRUE, else
   ## multiply the likelihoods
   if (isFALSE(individual)) {
-    if (isTRUE(log)) {
-      chains_likelihood <- vapply(chains_likelihood, sum, 0)
-    } else {
-      chains_likelihood <- vapply(chains_likelihood, prod, 0)
-    }
+    summarise_func <- ifelse(log, sum, prod)
+    vapply(chains_likelihood, summarise_func, 0)
   }
 
   return(chains_likelihood)

From 1ccfe3846026c78bc1410abeedb53e05bfa8d957 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 12 Sep 2023 23:13:48 +0100
Subject: [PATCH 613/828] Assign final value to return the right copy

---
 R/likelihood.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 79ed452a..ab72099f 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -144,7 +144,7 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
   ## multiply the likelihoods
   if (isFALSE(individual)) {
     summarise_func <- ifelse(log, sum, prod)
-    vapply(chains_likelihood, summarise_func, 0)
+    chains_likelihood <- vapply(chains_likelihood, summarise_func, 0)
   }
 
   return(chains_likelihood)

From af04206acabda76c3d4a2536d0defcf35447f36c Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 13 Sep 2023 10:16:56 +0100
Subject: [PATCH 614/828] Replace = with == for consitency.

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 R/likelihood.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index ab72099f..63b33af1 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -16,9 +16,9 @@
 #' If \code{log = TRUE}
 #'
 #' * A joint log-likelihood (sum of individual log-likelihoods), if
-#' \code{individual == FALSE} (default) and \code{obs_prob = 1} (default), or
+#' \code{individual == FALSE} (default) and \code{obs_prob == 1} (default), or
 #' * A list of individual log-likelihoods, if \code{individual == TRUE} and
-#' \code{obs_prob = 1} (default), or
+#' \code{obs_prob == 1} (default), or
 #' * A list of individual log-likelihoods (same length as `nsim_obs`), if
 #' \code{individual == TRUE} and \code{0 <= obs_prob < 1}, or
 #' * A vector of joint log-likelihoods (same length as `nsim_obs`), if

From d95584f79f716608bd6964a95085f992f1c4a264 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 13 Sep 2023 21:05:41 +0100
Subject: [PATCH 615/828] Fix bug in summary method for when time col doesn't
 exist

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index eb2a6ea7..c91cc4de 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -87,7 +87,7 @@ summary.epichains <- function(object, ...) {
   chains_ran <- attr(object, "chains", exact = TRUE)
 
   if (is_chains_tree(object)) {
-    max_time <- max(object$time)
+    max_time <- ifelse(("time" %in% names(object)), max(object$time), NA)
 
     n_unique_ancestors <- length(
       unique(object$ancestor[!is.na(object$ancestor)])

From c97f12f04c65affd8abfe0be144d5acd7387f838 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 13 Sep 2023 21:25:13 +0100
Subject: [PATCH 616/828] Restructure ifelse construct to fix lint issue

---
 R/epichains.R | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index c91cc4de..642f002c 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -103,11 +103,11 @@ summary.epichains <- function(object, ...) {
       max_generation = max_generation
     )
   } else if (is_chains_summary(object)) {
-    if (!all(is.infinite(object))) {
+    if (all(is.infinite(object))) {
+      max_chain_stat <- min_chain_stat <- Inf
+    } else {
       max_chain_stat <- max(object[!is.infinite(object)])
       min_chain_stat <- min(object[!is.infinite(object)])
-    } else {
-      max_chain_stat <- min_chain_stat <- Inf
     }
 
     res <- list(

From 45bf1b3d131fe6eae6891fe07a7ee0d9ebf5fd44 Mon Sep 17 00:00:00 2001
From: Hugo Gruson <Bisaloo@users.noreply.github.com>
Date: Mon, 18 Sep 2023 13:18:45 +0200
Subject: [PATCH 617/828] Remove duplicated lint_changed_files workflow This is
 already handled at the organization level

---
 .github/workflows/lint_changed_files.yaml | 45 -----------------------
 1 file changed, 45 deletions(-)
 delete mode 100644 .github/workflows/lint_changed_files.yaml

diff --git a/.github/workflows/lint_changed_files.yaml b/.github/workflows/lint_changed_files.yaml
deleted file mode 100644
index 5f16f852..00000000
--- a/.github/workflows/lint_changed_files.yaml
+++ /dev/null
@@ -1,45 +0,0 @@
-# Workflow derived from https://github.com/r-lib/actions/tree/v2/examples
-# Need help debugging build failures? Start at https://github.com/r-lib/actions#where-to-find-help
-on:
-  pull_request:
-    branches: [main, master]
-
-name: lint-changed-files
-
-jobs:
-  lint-changed-files:
-    runs-on: ubuntu-latest
-    env:
-      GITHUB_PAT: ${{ secrets.GITHUB_TOKEN }}
-    steps:
-      - uses: actions/checkout@v3
-
-      - uses: r-lib/actions/setup-r@v2
-
-      - uses: r-lib/actions/setup-r-dependencies@v2
-        with:
-          extra-packages: |
-            any::gh
-            any::lintr
-            any::purrr
-            epiverse-trace/etdev
-          needs: check
-
-      - name: Add lintr options
-        run: |
-          cat('\noptions(lintr.linter_file = ".lintr")\n', file = "~/.Rprofile", append = TRUE)
-        shell: Rscript {0}
-
-      - name: Install package
-        run: R CMD INSTALL .
-
-      - name: Extract and lint files changed by this PR
-        run: |
-          files <- gh::gh("GET https://api.github.com/repos/${{ github.repository }}/pulls/${{ github.event.pull_request.number }}/files")
-          changed_files <- purrr::map_chr(files, "filename")
-          all_files <- list.files(recursive = TRUE)
-          exclusions_list <- as.list(setdiff(all_files, changed_files))
-          lintr::lint_package(exclusions = exclusions_list)
-        shell: Rscript {0}
-        env:
-          LINTR_ERROR_ON_LINT: true
\ No newline at end of file

From 9c27f3fc061f0bf6013d47d3ce8d7908bbe6d339 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 18 Sep 2023 14:27:57 +0100
Subject: [PATCH 618/828] Remove get_offspring_func() helper and reinstate code
 in function body

---
 R/helpers.R               | 45 ---------------------------------------
 R/simulate.r              | 31 ++++++++++++++++++++++++---
 man/get_offspring_func.Rd | 37 --------------------------------
 3 files changed, 28 insertions(+), 85 deletions(-)
 delete mode 100644 man/get_offspring_func.Rd

diff --git a/R/helpers.R b/R/helpers.R
index 6d42765c..a194ddd9 100644
--- a/R/helpers.R
+++ b/R/helpers.R
@@ -15,51 +15,6 @@ update_chain_stat <- function(stat_type, stat_latest, n_offspring) {
   return(stat_latest)
 }
 
-
-#' Get offspring sampling function that takes into account susceptible
-#' depletion
-#'
-#' @param n Number of items to sample
-#' @param susc Susceptible population size (calculated
-#' inside \code{\link{simulate_tree_from_pop}}  as pop - initial_immune)
-#' @inheritParams simulate_tree_from_pop
-#'
-#' @return An offspring sampling function
-#' @keywords internal
-get_offspring_func <- function(offspring_dist, n, susc, pop,
-                               mean_offspring, disp_offspring = NULL) {
-  if (offspring_dist == "nbinom") {
-    function(n, susc, pop, mean_offspring, disp_offspring) {
-      ## get distribution params from mean and dispersion
-      new_mn <- mean_offspring * susc / pop ## apply susceptibility
-      size <- new_mn / (disp_offspring - 1)
-
-      ## using a right truncated nbinom distribution
-      ## to avoid more cases than susceptibles
-      truncdist::rtrunc(
-        n,
-        spec = "nbinom",
-        b = susc,
-        mu = new_mn,
-        size = size
-      )
-    }
-  } else if (offspring_dist == "pois") {
-    function(n, susc, pop, mean_offspring, disp_offspring) {
-      truncdist::rtrunc(
-        n,
-        spec = "pois",
-        lambda = mean_offspring * susc / pop,
-        b = susc
-      )
-    }
-  } else {
-    stop("offspring_dist must either be 'pois' or 'nbinom'")
-  }
-}
-
-
-
 #' Return a function for calculating chain statistics
 #'
 #' @inheritParams simulate_tree
diff --git a/R/simulate.r b/R/simulate.r
index 72fb6ab1..e675a733 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -365,7 +365,14 @@ simulate_tree_from_pop <- function(pop,
 
     ## using a right truncated poisson distribution
     ## to avoid more cases than susceptibles
-    offspring_fun <- get_offspring_func(offspring_dist)
+    offspring_func <- function(n, susc) {
+      truncdist::rtrunc(
+        n,
+        spec = "pois",
+        lambda = offspring_mean * susc / pop,
+        b = susc
+      )
+    }
   } else if (offspring_dist == "nbinom") {
     if (missing(offspring_disp)) {
       stop(sprintf("%s", "'offspring_disp' must be specified."))
@@ -377,7 +384,25 @@ simulate_tree_from_pop <- function(pop,
         "Use 'pois' if there is no overdispersion."
       ))
     }
-    offspring_fun <- get_offspring_func(offspring_dist)
+    ## get distribution params from mean and dispersion
+    offspring_func <- function(n, susc) {
+      ## get distribution params from mean and dispersion
+      ## see ?rnbinom for parameter definition
+      new_mn <- offspring_mean * susc / pop ## apply susceptibility
+      size <- new_mn / (offspring_disp - 1)
+
+      ## using a right truncated nbinom distribution
+      ## to avoid more cases than susceptibles
+      truncdist::rtrunc(
+        n,
+        spec = "nbinom",
+        b = susc,
+        mu = new_mn,
+        size = size
+      )
+    }
+  } else {
+    stop("offspring_dist must either be 'pois' or 'nbinom'")
   }
 
   ## initializations
@@ -406,7 +431,7 @@ simulate_tree_from_pop <- function(pop,
 
     ## generate it
     current_max_id <- max(tree_df$sim_id)
-    n_offspring <- offspring_fun(1, susc, pop, offspring_mean, offspring_disp)
+    n_offspring <- offspring_func(1, susc)
 
     if (n_offspring %% 1 > 0) {
       stop("Offspring distribution must return integers")
diff --git a/man/get_offspring_func.Rd b/man/get_offspring_func.Rd
deleted file mode 100644
index 15a28e90..00000000
--- a/man/get_offspring_func.Rd
+++ /dev/null
@@ -1,37 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/helpers.R
-\name{get_offspring_func}
-\alias{get_offspring_func}
-\title{Get offspring sampling function that takes into account susceptible
-depletion}
-\usage{
-get_offspring_func(
-  offspring_dist,
-  n,
-  susc,
-  pop,
-  mean_offspring,
-  disp_offspring = NULL
-)
-}
-\arguments{
-\item{offspring_dist}{Offspring distribution: a character string
-corresponding to the R distribution function (e.g., "pois" for Poisson,
-where \code{\link{rpois}} is the R function to generate Poisson random
-numbers).}
-
-\item{n}{Number of items to sample}
-
-\item{susc}{Susceptible population size (calculated
-inside \code{\link{simulate_tree_from_pop}}  as pop - initial_immune)}
-
-\item{pop}{The susceptible population.}
-}
-\value{
-An offspring sampling function
-}
-\description{
-Get offspring sampling function that takes into account susceptible
-depletion
-}
-\keyword{internal}

From 88df8566ce4ca2aca0f441a1108ee68a9cd3a8a9 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 18 Sep 2023 18:04:41 +0100
Subject: [PATCH 619/828] Remove redundant else block

---
 R/simulate.r | 2 --
 1 file changed, 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index e675a733..9cdbfd02 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -401,8 +401,6 @@ simulate_tree_from_pop <- function(pop,
         size = size
       )
     }
-  } else {
-    stop("offspring_dist must either be 'pois' or 'nbinom'")
   }
 
   ## initializations

From 9a6695994cd59f08defab469ab901f4e41743779 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 13 Sep 2023 21:44:08 +0100
Subject: [PATCH 620/828] Move tests to dedicated script

---
 tests/testthat/tests-sim.r | 31 -------------------------------
 1 file changed, 31 deletions(-)
 delete mode 100644 tests/testthat/tests-sim.r

diff --git a/tests/testthat/tests-sim.r b/tests/testthat/tests-sim.r
deleted file mode 100644
index 05b7b813..00000000
--- a/tests/testthat/tests-sim.r
+++ /dev/null
@@ -1,31 +0,0 @@
-test_that("Simulators output epichains objects", {
-  expect_s3_class(
-    simulate_tree(
-      nchains = 10,
-      offspring_dist = "pois",
-      lambda = 2,
-      statistic = "size",
-      stat_max = 10
-    ),
-    "epichains"
-  )
-  expect_s3_class(
-    simulate_tree_from_pop(
-      pop = 100,
-      offspring_dist = "nbinom",
-      offspring_mean = 0.5,
-      offspring_disp = 1.1,
-      serials_dist = function(x) 3
-    ),
-    "epichains"
-  )
-  expect_s3_class(
-    simulate_summary(
-      nchains = 10,
-      offspring_dist = "pois",
-      lambda = 2,
-      stat_max = 10
-    ),
-    "epichains"
-  )
-})

From 366cb21b23a19824c7c69392554e60e97a71fa2d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 13 Sep 2023 21:50:28 +0100
Subject: [PATCH 621/828] Add test for checks

---
 tests/testthat/test-checks.R | 18 ++++++++++++++++++
 1 file changed, 18 insertions(+)
 create mode 100644 tests/testthat/test-checks.R

diff --git a/tests/testthat/test-checks.R b/tests/testthat/test-checks.R
new file mode 100644
index 00000000..fde23d57
--- /dev/null
+++ b/tests/testthat/test-checks.R
@@ -0,0 +1,18 @@
+test_that("Checks work", {
+  expect_error(
+    check_offspring_valid(1),
+    "character string"
+  )
+  expect_error(
+    check_offspring_func_valid("rrpois"),
+    "does not exist"
+  )
+  expect_error(
+    check_serial_valid("a"),
+    "must be a function"
+  )
+  expect_error(
+    check_nchains_valid(1.1),
+    "less than"
+  )
+})

From 76cc441677019936e1c9ad15f7d4cffb79bf6a3e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 13 Sep 2023 21:51:30 +0100
Subject: [PATCH 622/828] Add test for simulation functions

---
 tests/testthat/tests-simulate.R | 326 ++++++++++++++++++++++++++++++++
 1 file changed, 326 insertions(+)
 create mode 100644 tests/testthat/tests-simulate.R

diff --git a/tests/testthat/tests-simulate.R b/tests/testthat/tests-simulate.R
new file mode 100644
index 00000000..dcf8449c
--- /dev/null
+++ b/tests/testthat/tests-simulate.R
@@ -0,0 +1,326 @@
+# Define global variables and options
+set.seed(12)
+serial_func <- function(n) {
+  rlnorm(n, meanlog = 0.58, sdlog = 1.58)
+}
+
+test_that("Simulators return epichains objects", {
+  expect_s3_class(
+    simulate_tree(
+      nchains = 10,
+      offspring_dist = "pois",
+      lambda = 2,
+      statistic = "size",
+      stat_max = 10
+    ),
+    "epichains"
+  )
+  expect_s3_class(
+    simulate_tree_from_pop(
+      pop = 100,
+      offspring_dist = "nbinom",
+      offspring_mean = 0.5,
+      offspring_disp = 1.1,
+      serials_dist = function(x) 3
+    ),
+    "epichains"
+  )
+  expect_s3_class(
+    simulate_summary(
+      nchains = 10,
+      offspring_dist = "pois",
+      lambda = 2,
+      stat_max = 10
+    ),
+    "epichains"
+  )
+})
+
+test_that("Simulators work", {
+  expect_length(
+    simulate_summary(
+      nchains = 2,
+      statistic = "size",
+      offspring_dist = "pois",
+      lambda = 0.5
+    ),
+    2
+  )
+  expect_gte(
+    nrow(
+      simulate_tree(
+        nchains = 2,
+        offspring_dist = "pois",
+        statistic = "length",
+        lambda = 0.9
+      )
+    ),
+    2
+  )
+  expect_gte(
+    nrow(
+      simulate_tree_from_pop(
+        pop = 100,
+        offspring_dist = "pois",
+        offspring_mean = 0.9,
+        serials_dist = serial_func
+      )
+    ),
+    1
+  )
+})
+
+test_that("simulate_tree throws errors", {
+  expect_error(
+    simulate_tree(
+      nchains = 2,
+      offspring_dist = "s",
+      statistic = "length",
+      lambda = 0.9
+    ),
+    "does not exist"
+    )
+  expect_error(
+    simulate_tree(
+      nchains = 2,
+      offspring_dist = "lnorm",
+      statistic = "length",
+      meanlog = 0.9,
+      sdlog = 0.9
+    ),
+    "must return integers"
+    )
+  expect_error(
+    simulate_tree(
+      nchains = 2,
+      offspring_dist = s,
+      statistic = "length",
+      meanlog = 0.9,
+      sdlog = 0.9
+    ),
+    "not found"
+  )
+  expect_error(
+    simulate_tree(
+      nchains = 2,
+      offspring_dist = "pois",
+      statistic = "size",
+      lambda = 0.9,
+      serials_dist = c(1, 2)
+      ),
+      "must be a function"
+      )
+  expect_error(
+    simulate_tree(
+      nchains = 2,
+      offspring_dist = c(1, 2),
+      statistic = "length",
+      lambda = 0.9
+      ),
+    "character string"
+  )
+  expect_error(
+    simulate_tree(
+      nchains = 2,
+      offspring_dist = "pois",
+      statistic = "size",
+      lambda = 0.9,
+      tf = 5
+    ),
+    "must be specified"
+  )
+})
+
+test_that("simulate_summary throws errors", {
+  expect_error(
+    simulate_summary(
+      nchains = 2,
+      offspring_dist = "s",
+      statistic = "length",
+      lambda = 0.9
+    ),
+    "does not exist"
+  )
+  expect_error(
+    simulate_summary(
+      nchains = 2,
+      offspring_dist = "lnorm",
+      statistic = "length",
+      meanlog = 0.9,
+      sdlog = 0.9
+    ),
+    "must return integers"
+  )
+  expect_error(
+    simulate_summary(
+      nchains = 2,
+      offspring_dist = s,
+      statistic = "length",
+      meanlog = 0.9,
+      sdlog = 0.9
+    ),
+    "not found"
+  )
+  expect_error(
+    simulate_summary(
+      nchains = 2,
+      offspring_dist = c(1, 2),
+      statistic = "length",
+      lambda = 0.9
+    ),
+    "character string"
+  )
+})
+
+test_that("simulate_tree_from_pop throws errors", {
+expect_error(
+  simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "binom",
+    offspring_mean = 0.5,
+    serials_dist = serial_func
+  ),
+  "should be one of"
+)
+  expect_error(
+    simulate_tree_from_pop(
+      pop = 100,
+      offspring_dist = "nbinom",
+      offspring_mean = 0.5,
+      offspring_disp = 0.9,
+      serials_dist = serial_func
+    ),
+    "> 1"
+  )
+  expect_error(
+    simulate_tree_from_pop(
+      pop = 100,
+      offspring_dist = p,
+      offspring_mean = 0.5,
+      offspring_disp = 0.9,
+      serials_dist = serial_func
+    ),
+    "not found"
+  )
+})
+
+test_that("simulate_tree_from_pop throws warnings", {
+  expect_warning(
+    simulate_tree_from_pop(
+      pop = 100,
+      offspring_dist = "pois",
+      offspring_mean = 3,
+      offspring_disp = 1,
+      serials_dist = serial_func
+    ),
+    "not used for poisson offspring"
+  )
+})
+
+test_that("simulate_tree is numerically correct", {
+  expect_equal(
+    summary(
+      simulate_tree(
+        nchains = 2,
+        offspring_dist = "pois",
+        statistic = "length",
+        lambda = 0.9
+      )
+    )$chains_ran,
+    2
+  )
+   expect_equal(
+    summary(
+      simulate_tree(
+      nchains = 2,
+      offspring_dist = "pois",
+      statistic = "length",
+      lambda = 0.9
+    )
+    )$unique_ancestors,
+    2
+  )
+  expect_equal(
+    summary(
+      simulate_tree(
+        nchains = 2,
+        offspring_dist = "pois",
+        statistic = "length",
+        lambda = 0.9
+      )
+    )$max_generation,
+    3
+  )
+})
+
+test_that("simulate_summary is numerically correct", {
+  expect_equal(
+    summary(
+      simulate_summary(
+        nchains = 2,
+        offspring_dist = "pois",
+        statistic = "length",
+        lambda = 0.9
+      )
+    )$max_chain_stat,
+    3
+  )
+  expect_equal(
+    summary(
+      simulate_summary(
+        nchains = 2,
+        offspring_dist = "pois",
+        statistic = "length",
+        lambda = 0.9
+      )
+    )$min_chain_stat,
+    1
+  )
+})
+
+test_that("simulate_tree_from_pop is numerically correct", {
+  expect_equal(
+    summary(
+      simulate_tree_from_pop(
+        pop = 100,
+        offspring_dist = "pois",
+        offspring_mean = 0.9,
+        serials_dist = serial_func
+      )
+    )$unique_ancestors,
+    0
+  )
+  expect_equal(
+    summary(
+      simulate_tree_from_pop(
+        pop = 100,
+        offspring_dist = "pois",
+        offspring_mean = 0.9,
+        serials_dist = serial_func
+      )
+    )$max_time,
+    0
+  )
+  expect_equal(
+    summary(
+     simulate_tree_from_pop(
+       pop = 100,
+       offspring_dist = "pois",
+       offspring_mean = 0.9,
+       serials_dist = serial_func
+       )
+     )$max_generation,
+    1
+  )
+  expect_equal(
+    summary(
+      simulate_tree_from_pop(
+        pop = 100,
+        offspring_dist = "pois",
+        offspring_mean = 0.9,
+        serials_dist = serial_func
+      )
+    )$chains_ran,
+    NULL
+  )
+})

From 9c475234845fcb13e5143ab37181a6ef124dd720 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 13 Sep 2023 22:02:57 +0100
Subject: [PATCH 623/828] Clean up the tests

---
 tests/testthat/tests-simulate.R | 138 ++++++++++++--------------------
 1 file changed, 51 insertions(+), 87 deletions(-)

diff --git a/tests/testthat/tests-simulate.R b/tests/testthat/tests-simulate.R
index dcf8449c..7d4e2f54 100644
--- a/tests/testthat/tests-simulate.R
+++ b/tests/testthat/tests-simulate.R
@@ -1,4 +1,4 @@
-# Define global variables and options
+# Define global variables and options for simulations
 set.seed(12)
 serial_func <- function(n) {
   rlnorm(n, meanlog = 0.58, sdlog = 1.58)
@@ -79,7 +79,7 @@ test_that("simulate_tree throws errors", {
       lambda = 0.9
     ),
     "does not exist"
-    )
+  )
   expect_error(
     simulate_tree(
       nchains = 2,
@@ -89,7 +89,7 @@ test_that("simulate_tree throws errors", {
       sdlog = 0.9
     ),
     "must return integers"
-    )
+  )
   expect_error(
     simulate_tree(
       nchains = 2,
@@ -107,16 +107,16 @@ test_that("simulate_tree throws errors", {
       statistic = "size",
       lambda = 0.9,
       serials_dist = c(1, 2)
-      ),
-      "must be a function"
-      )
+    ),
+    "must be a function"
+  )
   expect_error(
     simulate_tree(
       nchains = 2,
       offspring_dist = c(1, 2),
       statistic = "length",
       lambda = 0.9
-      ),
+    ),
     "character string"
   )
   expect_error(
@@ -173,15 +173,15 @@ test_that("simulate_summary throws errors", {
 })
 
 test_that("simulate_tree_from_pop throws errors", {
-expect_error(
-  simulate_tree_from_pop(
-    pop = 100,
-    offspring_dist = "binom",
-    offspring_mean = 0.5,
-    serials_dist = serial_func
-  ),
-  "should be one of"
-)
+  expect_error(
+    simulate_tree_from_pop(
+      pop = 100,
+      offspring_dist = "binom",
+      offspring_mean = 0.5,
+      serials_dist = serial_func
+    ),
+    "should be one of"
+  )
   expect_error(
     simulate_tree_from_pop(
       pop = 100,
@@ -218,109 +218,73 @@ test_that("simulate_tree_from_pop throws warnings", {
 })
 
 test_that("simulate_tree is numerically correct", {
-  expect_equal(
-    summary(
-      simulate_tree(
-        nchains = 2,
-        offspring_dist = "pois",
-        statistic = "length",
-        lambda = 0.9
-      )
-    )$chains_ran,
-    2
-  )
-   expect_equal(
-    summary(
-      simulate_tree(
+  set.seed(12)
+  tree_sim_summary <- summary(
+    simulate_tree(
       nchains = 2,
       offspring_dist = "pois",
       statistic = "length",
       lambda = 0.9
     )
-    )$unique_ancestors,
+  )
+  expect_equal(
+    tree_sim_summary$chains_ran,
     2
   )
   expect_equal(
-    summary(
-      simulate_tree(
-        nchains = 2,
-        offspring_dist = "pois",
-        statistic = "length",
-        lambda = 0.9
-      )
-    )$max_generation,
+    tree_sim_summary$unique_ancestors,
+    2
+  )
+  expect_equal(
+    tree_sim_summary$max_generation,
     3
   )
 })
 
 test_that("simulate_summary is numerically correct", {
+  set.seed(12)
+  chain_summary_sim <- summary(
+    simulate_summary(
+      nchains = 2,
+      offspring_dist = "pois",
+      statistic = "length",
+      lambda = 0.9
+    )
+  )
   expect_equal(
-    summary(
-      simulate_summary(
-        nchains = 2,
-        offspring_dist = "pois",
-        statistic = "length",
-        lambda = 0.9
-      )
-    )$max_chain_stat,
+    chain_summary_sim$max_chain_stat,
     3
   )
   expect_equal(
-    summary(
-      simulate_summary(
-        nchains = 2,
-        offspring_dist = "pois",
-        statistic = "length",
-        lambda = 0.9
-      )
-    )$min_chain_stat,
+    chain_summary_sim$min_chain_stat,
     1
   )
 })
 
 test_that("simulate_tree_from_pop is numerically correct", {
+  set.seed(12)
+  susc_outbreak_summary <- summary(
+    simulate_tree_from_pop(
+      pop = 100,
+      offspring_dist = "pois",
+      offspring_mean = 0.9,
+      serials_dist = serial_func
+    )
+  )
   expect_equal(
-    summary(
-      simulate_tree_from_pop(
-        pop = 100,
-        offspring_dist = "pois",
-        offspring_mean = 0.9,
-        serials_dist = serial_func
-      )
-    )$unique_ancestors,
+    susc_outbreak_summary$unique_ancestors,
     0
   )
   expect_equal(
-    summary(
-      simulate_tree_from_pop(
-        pop = 100,
-        offspring_dist = "pois",
-        offspring_mean = 0.9,
-        serials_dist = serial_func
-      )
-    )$max_time,
+    susc_outbreak_summary$max_time,
     0
   )
   expect_equal(
-    summary(
-     simulate_tree_from_pop(
-       pop = 100,
-       offspring_dist = "pois",
-       offspring_mean = 0.9,
-       serials_dist = serial_func
-       )
-     )$max_generation,
+    susc_outbreak_summary$max_generation,
     1
   )
   expect_equal(
-    summary(
-      simulate_tree_from_pop(
-        pop = 100,
-        offspring_dist = "pois",
-        offspring_mean = 0.9,
-        serials_dist = serial_func
-      )
-    )$chains_ran,
+    susc_outbreak_summary$chains_ran,
     NULL
   )
 })

From 36bfebbd74e5fa238d1febb0266ec1e3cead16ba Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 13 Sep 2023 22:05:34 +0100
Subject: [PATCH 624/828] Generate likelihood doc file

---
 man/likelihood.Rd | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/man/likelihood.Rd b/man/likelihood.Rd
index 6c69b974..213a62f6 100644
--- a/man/likelihood.Rd
+++ b/man/likelihood.Rd
@@ -54,9 +54,9 @@ contributions will be returned rather than the sum/product.}
 If \code{log = TRUE}
 \itemize{
 \item A joint log-likelihood (sum of individual log-likelihoods), if
-\code{individual == FALSE} (default) and \code{obs_prob = 1} (default), or
+\code{individual == FALSE} (default) and \code{obs_prob == 1} (default), or
 \item A list of individual log-likelihoods, if \code{individual == TRUE} and
-\code{obs_prob = 1} (default), or
+\code{obs_prob == 1} (default), or
 \item A list of individual log-likelihoods (same length as \code{nsim_obs}), if
 \code{individual == TRUE} and \code{0 <= obs_prob < 1}, or
 \item A vector of joint log-likelihoods (same length as \code{nsim_obs}), if

From d2886a6d37e508cf00683ac9c11b6e55c4771a96 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 14 Sep 2023 19:06:28 +0100
Subject: [PATCH 625/828] Use expect_identical instead of expect_equal

---
 tests/testthat/tests-simulate.R | 16 ++++++++--------
 1 file changed, 8 insertions(+), 8 deletions(-)

diff --git a/tests/testthat/tests-simulate.R b/tests/testthat/tests-simulate.R
index 7d4e2f54..f83c1534 100644
--- a/tests/testthat/tests-simulate.R
+++ b/tests/testthat/tests-simulate.R
@@ -227,15 +227,15 @@ test_that("simulate_tree is numerically correct", {
       lambda = 0.9
     )
   )
-  expect_equal(
+  expect_identical(
     tree_sim_summary$chains_ran,
     2
   )
-  expect_equal(
+  expect_identical(
     tree_sim_summary$unique_ancestors,
     2
   )
-  expect_equal(
+  expect_identical(
     tree_sim_summary$max_generation,
     3
   )
@@ -251,11 +251,11 @@ test_that("simulate_summary is numerically correct", {
       lambda = 0.9
     )
   )
-  expect_equal(
+  expect_identical(
     chain_summary_sim$max_chain_stat,
     3
   )
-  expect_equal(
+  expect_identical(
     chain_summary_sim$min_chain_stat,
     1
   )
@@ -271,15 +271,15 @@ test_that("simulate_tree_from_pop is numerically correct", {
       serials_dist = serial_func
     )
   )
-  expect_equal(
+  expect_identical(
     susc_outbreak_summary$unique_ancestors,
     0
   )
-  expect_equal(
+  expect_identical(
     susc_outbreak_summary$max_time,
     0
   )
-  expect_equal(
+  expect_identical(
     susc_outbreak_summary$max_generation,
     1
   )

From d70fbc373e2ce4c022a94d8966c1af6ef11c9f0a Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 14 Sep 2023 19:06:54 +0100
Subject: [PATCH 626/828] Use expect_null instead of expect_equal

---
 tests/testthat/tests-simulate.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tests/testthat/tests-simulate.R b/tests/testthat/tests-simulate.R
index f83c1534..e590e09e 100644
--- a/tests/testthat/tests-simulate.R
+++ b/tests/testthat/tests-simulate.R
@@ -283,7 +283,7 @@ test_that("simulate_tree_from_pop is numerically correct", {
     susc_outbreak_summary$max_generation,
     1
   )
-  expect_equal(
+  expect_null(susc_outbreak_summary$chains_ran)
     susc_outbreak_summary$chains_ran,
     NULL
   )

From 5b3a3a84a3d3a7233e1ed546b850e745b54b10e8 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 14 Sep 2023 19:07:08 +0100
Subject: [PATCH 627/828] Fix expected data types

---
 tests/testthat/tests-simulate.R | 19 ++++++++-----------
 1 file changed, 8 insertions(+), 11 deletions(-)

diff --git a/tests/testthat/tests-simulate.R b/tests/testthat/tests-simulate.R
index e590e09e..bd29f7d5 100644
--- a/tests/testthat/tests-simulate.R
+++ b/tests/testthat/tests-simulate.R
@@ -229,15 +229,15 @@ test_that("simulate_tree is numerically correct", {
   )
   expect_identical(
     tree_sim_summary$chains_ran,
-    2
+    2.00
   )
   expect_identical(
     tree_sim_summary$unique_ancestors,
-    2
+    2L
   )
   expect_identical(
     tree_sim_summary$max_generation,
-    3
+    3L
   )
 })
 
@@ -253,11 +253,11 @@ test_that("simulate_summary is numerically correct", {
   )
   expect_identical(
     chain_summary_sim$max_chain_stat,
-    3
+    3.00
   )
   expect_identical(
     chain_summary_sim$min_chain_stat,
-    1
+    1.00
   )
 })
 
@@ -273,18 +273,15 @@ test_that("simulate_tree_from_pop is numerically correct", {
   )
   expect_identical(
     susc_outbreak_summary$unique_ancestors,
-    0
+    0L
   )
   expect_identical(
     susc_outbreak_summary$max_time,
-    0
+    0.00
   )
   expect_identical(
     susc_outbreak_summary$max_generation,
-    1
+    1L
   )
   expect_null(susc_outbreak_summary$chains_ran)
-    susc_outbreak_summary$chains_ran,
-    NULL
-  )
 })

From 1f713dc4fca314a4ec526378491e1e980ffc71a0 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 14 Sep 2023 20:10:13 +0100
Subject: [PATCH 628/828] Add tests for utils.R

---
 tests/testthat/test-utils.R | 120 ++++++++++++++++++++++++++++++++++++
 1 file changed, 120 insertions(+)
 create mode 100644 tests/testthat/test-utils.R

diff --git a/tests/testthat/test-utils.R b/tests/testthat/test-utils.R
new file mode 100644
index 00000000..c271e47d
--- /dev/null
+++ b/tests/testthat/test-utils.R
@@ -0,0 +1,120 @@
+test_that("Reparametrized distributions work", {
+  expect_length(
+    rnbinom_mean_disp(
+      n = 5,
+      mn = 4,
+      disp = 2
+      ),
+    5
+  )
+})
+
+test_that("Log-probabilities work", {
+  expect_length(
+    complementary_logprob(x = 0),
+    1
+  )
+  expect_length(
+    complementary_logprob(x = -Inf),
+    1
+  )
+  expect_length(
+    complementary_logprob(x = -0.1),
+    1
+  )
+})
+
+test_that("Chain lengths sampler works", {
+  expect_length(
+    rgen_length(
+      n = 1,
+      x = c(1, 2, 3),
+      prob = 0.3
+    ),
+    3
+  )
+})
+
+test_that("Chain sizes sampler works", {
+  expect_length(
+    rbinom_size(
+      n = 1,
+      x = c(1, 2, 3),
+      prob = 0.3
+    ),
+    3
+  )
+})
+
+test_that("Reparametrized distributions are numerically correct", {
+  set.seed(12)
+  expect_identical(
+    rnbinom_mean_disp(
+      n = 5,
+      mn = 4,
+      disp = 2
+      ),
+    c(0, 2, 5, 2, 3)
+  )
+})
+
+test_that("Log-probabilities are numerically correct", {
+  expect_identical(
+    complementary_logprob(x = 0),
+    -Inf
+  )
+  expect_identical(
+    complementary_logprob(x = -Inf),
+    0
+  )
+  expect_lt(
+    complementary_logprob(x = -0.1),
+    0
+  )
+})
+
+test_that("Chain lengths sampler is numerically correct", {
+  set.seed(12)
+  expect_identical(
+    rgen_length(
+      n = 1,
+      x = c(1, 2, 3),
+      prob = 0.3
+    ),
+    c(8, 9, 10)
+  )
+})
+
+test_that("Chain sizes sampler is numerically correct", {
+  set.seed(12)
+  expect_identical(
+    rbinom_size(
+      n = 1,
+      x = c(1, 2, 3),
+      prob = 0.3
+    ),
+    c(1, 2, 3)
+  )
+})
+
+test_that("Reparametrized distributions throw warnings", {
+  expect_warning(
+    rnbinom_mean_disp(
+      n = 5,
+      mn = 4,
+      disp = 0.9
+    ),
+    "NAs produced"
+  )
+})
+
+test_that("Log-probabilities throw warnings", {
+  expect_warning(
+    complementary_logprob(0.1),
+    "NaNs produced"
+  )
+  expect_warning(
+    complementary_logprob(Inf),
+    "NaNs produced"
+  )
+})

From 06c33cc5fcb21d75c89b20b6023f2921e6f8fd47 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 14 Sep 2023 23:19:34 +0100
Subject: [PATCH 629/828] Add tests for epichains classes and methods

---
 tests/testthat/_snaps/epichains.md | 138 +++++++++++++++++++
 tests/testthat/test-epichains.R    | 206 +++++++++++++++++++++++++++++
 2 files changed, 344 insertions(+)
 create mode 100644 tests/testthat/_snaps/epichains.md
 create mode 100644 tests/testthat/test-epichains.R

diff --git a/tests/testthat/_snaps/epichains.md b/tests/testthat/_snaps/epichains.md
new file mode 100644
index 00000000..c31d1707
--- /dev/null
+++ b/tests/testthat/_snaps/epichains.md
@@ -0,0 +1,138 @@
+# print.epichains works for simulate_summary output
+
+    Code
+      epichains_summary
+    Output
+      `epichains` object 
+      
+       [1]   1 Inf Inf Inf Inf   1   2 Inf   1   1
+      
+       Simulated chain sizes: 
+      
+      Max: 2
+      Min: 1
+
+# print.epichains works for simulate_tree output
+
+    Code
+      epichains_tree
+    Output
+      `epichains` object
+      
+      < tree head (from first known ancestor) >
+      
+         chain_id sim_id ancestor generation
+      11        1      2        1          2
+      13        2      2        1          2
+      18        3      2        1          2
+      19        4      2        1          2
+      22        6      2        1          2
+      23        8      2        1          2
+      
+      < tree tail >
+      
+         chain_id sim_id ancestor generation
+      41        2     17        6          3
+      85        6     17        6          4
+      42        2     18        6          3
+      86        6     18        7          4
+      87        6     19        7          4
+      88        6     20        7          4
+      Chains simulated: 10
+      Number of ancestors (known): 9
+      Number of generations: 5
+      Use `as.data.frame(<object_name>)` to view the full output in the console.
+
+---
+
+    Code
+      epichains_tree2
+    Output
+      `epichains` object
+      
+      < tree head (from first known ancestor) >
+      
+         chain_id sim_id ancestor generation time
+      11        1      2        1          2    3
+      13        2      2        1          2    3
+      16        3      2        1          2    3
+      17        4      2        1          2    3
+      18        5      2        1          2    3
+      19        6      2        1          2    3
+      
+      < tree tail >
+      
+          chain_id sim_id ancestor generation time
+      116        7     20        9          4    9
+      128        8     20        9          4    9
+      117        7     21        9          4    9
+      129        8     21        9          4    9
+      130        8     22        9          4    9
+      131        8     23        9          4    9
+      Chains simulated: 10
+      Number of ancestors (known): 9
+      Number of generations: 4
+      Use `as.data.frame(<object_name>)` to view the full output in the console.
+
+# head and tail methods work
+
+    Code
+      head(epichains_tree)
+    Output
+      < tree head (from first known ancestor) >
+      
+         chain_id sim_id ancestor generation
+      11        1      2        1          2
+      13        2      2        1          2
+      18        3      2        1          2
+      19        4      2        1          2
+      22        6      2        1          2
+      23        8      2        1          2
+
+---
+
+    Code
+      head(epichains_tree2)
+    Output
+      < tree head (from first known ancestor) >
+      
+         chain_id sim_id ancestor generation time
+      11        1      2        1          2    3
+      13        2      2        1          2    3
+      16        3      2        1          2    3
+      17        4      2        1          2    3
+      18        5      2        1          2    3
+      19        6      2        1          2    3
+
+---
+
+    Code
+      tail(epichains_tree)
+    Output
+      
+      < tree tail >
+      
+         chain_id sim_id ancestor generation
+      41        2     17        6          3
+      85        6     17        6          4
+      42        2     18        6          3
+      86        6     18        7          4
+      87        6     19        7          4
+      88        6     20        7          4
+
+---
+
+    Code
+      tail(epichains_tree2)
+    Output
+      
+      < tree tail >
+      
+          chain_id sim_id ancestor generation time
+      116        7     20        9          4    9
+      128        8     20        9          4    9
+      117        7     21        9          4    9
+      129        8     21        9          4    9
+      130        8     22        9          4    9
+      131        8     23        9          4    9
+
diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
new file mode 100644
index 00000000..624151ae
--- /dev/null
+++ b/tests/testthat/test-epichains.R
@@ -0,0 +1,206 @@
+set.seed(12)
+epichains_summary <- simulate_summary(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  lambda = 2
+)
+epichains_tree <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  lambda = 2
+)
+epichains_tree2 <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 2
+)
+
+aggreg_by_gen <- aggregate(
+  epichains_tree,
+  grouping_var = "generation"
+)
+aggreg_by_time <- aggregate(
+  epichains_tree2,
+  grouping_var = "time"
+)
+
+aggreg_by_both <- aggregate(
+  epichains_tree2,
+  grouping_var = "both"
+)
+
+set.seed(11223)
+epichains_summary_all_infs <- simulate_summary(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  lambda = 3
+)
+
+test_that("print.epichains works for simulate_summary output", {
+  expect_snapshot(epichains_summary)
+})
+
+test_that("print.epichains works for simulate_tree output", {
+  expect_snapshot(epichains_tree)
+})
+
+test_that("print.epichains works for simulate_tree output", {
+  expect_snapshot(epichains_tree2)
+})
+
+test_that("summary.epichains works as expected", {
+  expect_named(
+    summary(epichains_summary),
+    c(
+      "chain_ran",
+      "max_chain_stat",
+      "min_chain_stat"
+    )
+  )
+  expect_named(
+    summary(epichains_tree2),
+    c(
+      "chains_ran",
+      "max_time",
+      "unique_ancestors",
+      "max_generation"
+    )
+  )
+  expect_named(
+    summary(epichains_tree2),
+    c(
+      "chains_ran",
+      "max_time",
+      "unique_ancestors",
+      "max_generation"
+    )
+  )
+  expect_true(
+    is.infinite(
+      summary(epichains_summary_all_infs)$min_chain_stat
+    )
+  )
+  expect_true(
+    is.infinite(
+      summary(epichains_summary_all_infs)$max_chain_stat
+    )
+  )
+})
+
+test_that("validate_epichains works", {
+  expect_invisible(
+    validate_epichains(epichains_summary)
+  )
+  expect_invisible(
+    validate_epichains(epichains_tree)
+  )
+  expect_invisible(
+    validate_epichains(epichains_tree2)
+  )
+})
+
+test_that("is_chains_tree works", {
+  expect_true(
+    is_chains_tree(epichains_tree)
+  )
+  expect_true(
+    is_chains_tree(epichains_tree2)
+  )
+  expect_false(
+    is_chains_tree(epichains_summary)
+  )
+})
+
+test_that("is_chains_summary works", {
+  expect_true(
+    is_chains_tree(epichains_tree)
+  )
+  expect_true(
+    is_chains_tree(epichains_tree2)
+  )
+  expect_false(
+    is_chains_tree(epichains_summary)
+  )
+})
+
+test_that("is_epichains_aggregate_df works", {
+  expect_true(
+    is_epichains_aggregate_df(aggreg_by_gen)
+  )
+  expect_true(
+    is_epichains_aggregate_df(aggreg_by_time)
+  )
+  expect_true(
+    is_epichains_aggregate_df(aggreg_by_both)
+  )
+  expect_false(
+    is_epichains_aggregate_df(epichains_tree)
+  )
+})
+
+test_that("validate_epichains throws errors", {
+  expect_error(
+    validate_epichains(mtcars),
+    "must have an epichains class"
+  )
+})
+
+test_that("head and tail methods work", {
+  expect_snapshot(head(epichains_tree))
+  expect_snapshot(head(epichains_tree2))
+  expect_snapshot(tail(epichains_tree))
+  expect_snapshot(tail(epichains_tree2))
+})
+
+test_that("aggregate method work", {
+  expect_named(
+    aggreg_by_gen,
+    c("generation", "cases")
+  )
+  expect_named(
+    aggreg_by_time,
+    c("time", "cases")
+  )
+  expect_identical(
+    as.vector(
+      vapply(aggreg_by_both, names, FUN.VALUE = character(2))
+    ),
+    c("time", "cases", "generation", "cases")
+  )
+  expect_s3_class(
+    aggreg_by_gen,
+    "epichains_aggregate_df"
+  )
+  expect_s3_class(
+    aggreg_by_time,
+    "epichains_aggregate_df"
+  )
+  expect_s3_class(
+    aggreg_by_both,
+    "epichains_aggregate_df"
+  )
+  expect_error(
+    aggregate(epichains_summary),
+    "attribute"
+  )
+})
+
+test_that("aggregate method is numerically correct", {
+  expect_identical(
+    aggreg_by_gen$cases,
+    c(10L, 17L, 38L, 38L, 12L)
+  )
+  expect_identical(
+    aggreg_by_time$cases,
+    c(10L, 21L, 48L, 60L)
+  )
+})

From 60b1fff1144fc3e7034d6021c4566d8ae01e1dce Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 14 Sep 2023 23:19:47 +0100
Subject: [PATCH 630/828] Add tests for the helper functions

---
 tests/testthat/_snaps/helpers.md | 22 ++++++++++
 tests/testthat/test-helpers.R    | 69 +++++++++++++++++++++++++++++++-
 2 files changed, 90 insertions(+), 1 deletion(-)
 create mode 100644 tests/testthat/_snaps/helpers.md

diff --git a/tests/testthat/_snaps/helpers.md b/tests/testthat/_snaps/helpers.md
new file mode 100644
index 00000000..538123be
--- /dev/null
+++ b/tests/testthat/_snaps/helpers.md
@@ -0,0 +1,22 @@
+# get_offspring_func works correctly
+
+    Code
+      body(pois_offspring_func)
+    Output
+      {
+          truncdist::rtrunc(n, spec = "pois", lambda = mean_offspring * 
+              susc/pop, b = susc)
+      }
+
+---
+
+    Code
+      body(nbinom_offspring_func)
+    Output
+      {
+          new_mn <- mean_offspring * susc/pop
+          size <- new_mn/(disp_offspring - 1)
+          truncdist::rtrunc(n, spec = "nbinom", b = susc, mu = new_mn, 
+              size = size)
+      }
+
diff --git a/tests/testthat/test-helpers.R b/tests/testthat/test-helpers.R
index 1fbc99d3..9c7b3417 100644
--- a/tests/testthat/test-helpers.R
+++ b/tests/testthat/test-helpers.R
@@ -1,4 +1,4 @@
-test_that("Helper functions work correctly", {
+test_that("construct_offspring_ll_name works correctly", {
   expect_identical(
     construct_offspring_ll_name(
       offspring_dist = "pois",
@@ -7,3 +7,70 @@ test_that("Helper functions work correctly", {
     "pois_size_ll"
   )
 })
+
+test_that("update_chain_stat works correctly", {
+  stat_latest <- 1
+  n_offspring <- 2
+  expect_identical(
+    update_chain_stat(
+      stat_type = "size",
+      stat_latest = stat_latest,
+      n_offspring = n_offspring
+    ),
+    stat_latest + n_offspring
+  )
+  expect_identical(
+    update_chain_stat(
+      stat_type = "length",
+      stat_latest = stat_latest,
+      n_offspring = n_offspring
+    ),
+    stat_latest + pmin(1, n_offspring)
+  )
+})
+
+test_that("get_offspring_func works correctly", {
+  pois_offspring_func <- get_offspring_func(
+    offspring_dist = "pois",
+    n = n,
+    susc = susc,
+    pop = pop,
+    mean_offspring = mean_offspring,
+    disp_offspring = disp_offspring
+  )
+  expect_snapshot(body(pois_offspring_func))
+  nbinom_offspring_func <- get_offspring_func(
+    offspring_dist = "nbinom",
+    n = n,
+    susc = susc,
+    pop = pop,
+    mean_offspring = mean_offspring,
+    disp_offspring = disp_offspring
+  )
+  expect_snapshot(body(nbinom_offspring_func))
+})
+
+test_that("get_offspring_func throws errors", {
+  expect_error(
+    get_offspring_func(
+      offspring_dist = "ss",
+      n = n,
+      susc = susc,
+      pop = pop,
+      mean_offspring = mean_offspring,
+      disp_offspring = disp_offspring
+    ),
+    "must either be"
+  )
+})
+
+test_that("get_statistic_func works correctly", {
+  expect_identical(
+    get_statistic_func(chain_statistic = "size"),
+    rbinom_size
+  )
+  expect_identical(
+    get_statistic_func(chain_statistic = "length"),
+    rgen_length
+  )
+})

From 3caa99c30c986ab700694e287f5910bb10b90282 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 14 Sep 2023 23:20:02 +0100
Subject: [PATCH 631/828] Linting

---
 tests/testthat/test-utils.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/tests/testthat/test-utils.R b/tests/testthat/test-utils.R
index c271e47d..4118e7cb 100644
--- a/tests/testthat/test-utils.R
+++ b/tests/testthat/test-utils.R
@@ -4,7 +4,7 @@ test_that("Reparametrized distributions work", {
       n = 5,
       mn = 4,
       disp = 2
-      ),
+    ),
     5
   )
 })
@@ -53,7 +53,7 @@ test_that("Reparametrized distributions are numerically correct", {
       n = 5,
       mn = 4,
       disp = 2
-      ),
+    ),
     c(0, 2, 5, 2, 3)
   )
 })

From 172ca2eaf31305b1f0cc3411569f671393d99c44 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 14 Sep 2023 23:20:23 +0100
Subject: [PATCH 632/828] Add more tests for the simulation functions

---
 tests/testthat/tests-simulate.R | 23 +++++++++++++++++++++++
 1 file changed, 23 insertions(+)

diff --git a/tests/testthat/tests-simulate.R b/tests/testthat/tests-simulate.R
index bd29f7d5..ab3f2219 100644
--- a/tests/testthat/tests-simulate.R
+++ b/tests/testthat/tests-simulate.R
@@ -68,6 +68,19 @@ test_that("Simulators work", {
     ),
     1
   )
+  expect_true(
+    all(
+      simulate_tree(
+        nchains = 10,
+        statistic = "size",
+        offspring_dist = "pois",
+        stat_max = 10,
+        serials_dist = function(x) 3,
+        lambda = 2,
+        tf = 5
+      )$time < 5
+    )
+  )
 })
 
 test_that("simulate_tree throws errors", {
@@ -173,6 +186,7 @@ test_that("simulate_summary throws errors", {
 })
 
 test_that("simulate_tree_from_pop throws errors", {
+  set.seed(123)
   expect_error(
     simulate_tree_from_pop(
       pop = 100,
@@ -202,6 +216,15 @@ test_that("simulate_tree_from_pop throws errors", {
     ),
     "not found"
   )
+  expect_error(
+    simulate_tree_from_pop(
+      pop = 100,
+      offspring_dist = "nbinom",
+      offspring_mean = 0.5,
+      serials_dist = serial_func
+    ),
+    "must be specified"
+  )
 })
 
 test_that("simulate_tree_from_pop throws warnings", {

From e81dad0ea8a87ac585fe95096a5f1f5d7ff71cf8 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 15 Sep 2023 12:17:54 +0100
Subject: [PATCH 633/828] Replace snapshot test of returned functions with a
 check for the required argument specification

---
 tests/testthat/_snaps/helpers.md | 22 ----------------------
 tests/testthat/test-helpers.R    | 18 ++++++++++++++++--
 2 files changed, 16 insertions(+), 24 deletions(-)
 delete mode 100644 tests/testthat/_snaps/helpers.md

diff --git a/tests/testthat/_snaps/helpers.md b/tests/testthat/_snaps/helpers.md
deleted file mode 100644
index 538123be..00000000
--- a/tests/testthat/_snaps/helpers.md
+++ /dev/null
@@ -1,22 +0,0 @@
-# get_offspring_func works correctly
-
-    Code
-      body(pois_offspring_func)
-    Output
-      {
-          truncdist::rtrunc(n, spec = "pois", lambda = mean_offspring * 
-              susc/pop, b = susc)
-      }
-
----
-
-    Code
-      body(nbinom_offspring_func)
-    Output
-      {
-          new_mn <- mean_offspring * susc/pop
-          size <- new_mn/(disp_offspring - 1)
-          truncdist::rtrunc(n, spec = "nbinom", b = susc, mu = new_mn, 
-              size = size)
-      }
-
diff --git a/tests/testthat/test-helpers.R b/tests/testthat/test-helpers.R
index 9c7b3417..02501d58 100644
--- a/tests/testthat/test-helpers.R
+++ b/tests/testthat/test-helpers.R
@@ -38,7 +38,14 @@ test_that("get_offspring_func works correctly", {
     mean_offspring = mean_offspring,
     disp_offspring = disp_offspring
   )
-  expect_snapshot(body(pois_offspring_func))
+  expect_true(
+    any(
+      grepl(
+        "spec = \"pois\"",
+        deparse(body(pois_offspring_func))
+        )
+      )
+    )
   nbinom_offspring_func <- get_offspring_func(
     offspring_dist = "nbinom",
     n = n,
@@ -47,7 +54,14 @@ test_that("get_offspring_func works correctly", {
     mean_offspring = mean_offspring,
     disp_offspring = disp_offspring
   )
-  expect_snapshot(body(nbinom_offspring_func))
+  expect_true(
+    any(
+      grepl(
+        "spec = \"nbinom\"",
+        deparse(body(nbinom_offspring_func))
+      )
+    )
+  )
 })
 
 test_that("get_offspring_func throws errors", {

From 38f357d2550aa4f8458e87fece13a7c30faa61b7 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 15 Sep 2023 14:27:37 +0100
Subject: [PATCH 634/828] Reinstate the snapshot tests of functions returning
 functions

---
 tests/testthat/_snaps/helpers.md | 22 ++++++++++++++++++++++
 tests/testthat/test-helpers.R    | 22 ++++++++++++++++++++++
 2 files changed, 44 insertions(+)
 create mode 100644 tests/testthat/_snaps/helpers.md

diff --git a/tests/testthat/_snaps/helpers.md b/tests/testthat/_snaps/helpers.md
new file mode 100644
index 00000000..83e69e93
--- /dev/null
+++ b/tests/testthat/_snaps/helpers.md
@@ -0,0 +1,22 @@
+# get_statistic_func snapshots look right
+
+    Code
+      body(pois_offspring_func)
+    Output
+      {
+          truncdist::rtrunc(n, spec = "pois", lambda = mean_offspring * 
+              susc/pop, b = susc)
+      }
+
+---
+
+    Code
+      body(nbinom_offspring_func)
+    Output
+      {
+          new_mn <- mean_offspring * susc/pop
+          size <- new_mn/(disp_offspring - 1)
+          truncdist::rtrunc(n, spec = "nbinom", b = susc, mu = new_mn, 
+              size = size)
+      }
+
diff --git a/tests/testthat/test-helpers.R b/tests/testthat/test-helpers.R
index 02501d58..35fa89bb 100644
--- a/tests/testthat/test-helpers.R
+++ b/tests/testthat/test-helpers.R
@@ -64,6 +64,28 @@ test_that("get_offspring_func works correctly", {
   )
 })
 
+test_that("get_statistic_func snapshots look right", {
+  pois_offspring_func <- get_offspring_func(
+    offspring_dist = "pois",
+    n = n,
+    susc = susc,
+    pop = pop,
+    mean_offspring = mean_offspring,
+    disp_offspring = disp_offspring
+  )
+  expect_snapshot(body(pois_offspring_func))
+
+  nbinom_offspring_func <- get_offspring_func(
+    offspring_dist = "nbinom",
+    n = n,
+    susc = susc,
+    pop = pop,
+    mean_offspring = mean_offspring,
+    disp_offspring = disp_offspring
+  )
+  expect_snapshot(body(nbinom_offspring_func))
+})
+
 test_that("get_offspring_func throws errors", {
   expect_error(
     get_offspring_func(

From 80a6cd906e1879d14475e90217c808a04dfc58d9 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 15 Sep 2023 14:29:59 +0100
Subject: [PATCH 635/828] Move simulations within tests to top of script

---
 tests/testthat/tests-simulate.R | 115 ++++++++++++++------------------
 1 file changed, 51 insertions(+), 64 deletions(-)

diff --git a/tests/testthat/tests-simulate.R b/tests/testthat/tests-simulate.R
index ab3f2219..330722a3 100644
--- a/tests/testthat/tests-simulate.R
+++ b/tests/testthat/tests-simulate.R
@@ -4,34 +4,55 @@ serial_func <- function(n) {
   rlnorm(n, meanlog = 0.58, sdlog = 1.58)
 }
 
+# simulate_tree()
+tree_sim_raw <- simulate_tree(
+  nchains = 2,
+  offspring_dist = "pois",
+  statistic = "length",
+  lambda = 0.9
+)
+
+tree_sim_summary <- summary(tree_sim_raw)
+
+# simulate_summary()
+chain_summary_raw <- simulate_summary(
+  nchains = 2,
+  offspring_dist = "pois",
+  statistic = "length",
+  lambda = 0.9
+)
+
+chain_summary_sim <- summary(chain_summary_raw)
+
+# simulate_tree_from_pop()
+susc_outbreak_raw <- simulate_tree_from_pop(
+  pop = 100,
+  offspring_dist = "pois",
+  offspring_mean = 0.9,
+  serials_dist = serial_func
+)
+
+susc_outbreak_raw2 <- simulate_tree_from_pop(
+  pop = 100,
+  offspring_dist = "nbinom",
+  offspring_mean = 1,
+  offspring_disp = 1.1,
+  serials_dist = serial_func
+)
+
+susc_outbreak_summary <- summary(susc_outbreak_raw)
+
 test_that("Simulators return epichains objects", {
   expect_s3_class(
-    simulate_tree(
-      nchains = 10,
-      offspring_dist = "pois",
-      lambda = 2,
-      statistic = "size",
-      stat_max = 10
-    ),
+    tree_sim_raw,
     "epichains"
   )
   expect_s3_class(
-    simulate_tree_from_pop(
-      pop = 100,
-      offspring_dist = "nbinom",
-      offspring_mean = 0.5,
-      offspring_disp = 1.1,
-      serials_dist = function(x) 3
-    ),
+    susc_outbreak_raw,
     "epichains"
   )
   expect_s3_class(
-    simulate_summary(
-      nchains = 10,
-      offspring_dist = "pois",
-      lambda = 2,
-      stat_max = 10
-    ),
+    chain_summary_raw,
     "epichains"
   )
 })
@@ -47,25 +68,14 @@ test_that("Simulators work", {
     2
   )
   expect_gte(
-    nrow(
-      simulate_tree(
-        nchains = 2,
-        offspring_dist = "pois",
-        statistic = "length",
-        lambda = 0.9
-      )
-    ),
-    2
+    nrow(tree_sim_raw),
+    2)
+  expect_gte(
+    nrow(susc_outbreak_raw),
+    1
   )
   expect_gte(
-    nrow(
-      simulate_tree_from_pop(
-        pop = 100,
-        offspring_dist = "pois",
-        offspring_mean = 0.9,
-        serials_dist = serial_func
-      )
-    ),
+    nrow(susc_outbreak_raw2),
     1
   )
   expect_true(
@@ -241,15 +251,6 @@ test_that("simulate_tree_from_pop throws warnings", {
 })
 
 test_that("simulate_tree is numerically correct", {
-  set.seed(12)
-  tree_sim_summary <- summary(
-    simulate_tree(
-      nchains = 2,
-      offspring_dist = "pois",
-      statistic = "length",
-      lambda = 0.9
-    )
-  )
   expect_identical(
     tree_sim_summary$chains_ran,
     2.00
@@ -265,15 +266,6 @@ test_that("simulate_tree is numerically correct", {
 })
 
 test_that("simulate_summary is numerically correct", {
-  set.seed(12)
-  chain_summary_sim <- summary(
-    simulate_summary(
-      nchains = 2,
-      offspring_dist = "pois",
-      statistic = "length",
-      lambda = 0.9
-    )
-  )
   expect_identical(
     chain_summary_sim$max_chain_stat,
     3.00
@@ -282,18 +274,13 @@ test_that("simulate_summary is numerically correct", {
     chain_summary_sim$min_chain_stat,
     1.00
   )
+  expect_identical(
+    as.vector(chain_summary_raw),
+    c(2.00, 1.00)
+  )
 })
 
 test_that("simulate_tree_from_pop is numerically correct", {
-  set.seed(12)
-  susc_outbreak_summary <- summary(
-    simulate_tree_from_pop(
-      pop = 100,
-      offspring_dist = "pois",
-      offspring_mean = 0.9,
-      serials_dist = serial_func
-    )
-  )
   expect_identical(
     susc_outbreak_summary$unique_ancestors,
     0L

From 711a5cb36337d1ebb8da2167f9a1469b6247781d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 15 Sep 2023 14:30:35 +0100
Subject: [PATCH 636/828] Add tests for simulated outcomes

---
 tests/testthat/tests-simulate.R | 34 ++++++++++++++++++++++++++++++++-
 1 file changed, 33 insertions(+), 1 deletion(-)

diff --git a/tests/testthat/tests-simulate.R b/tests/testthat/tests-simulate.R
index 330722a3..11bfb1df 100644
--- a/tests/testthat/tests-simulate.R
+++ b/tests/testthat/tests-simulate.R
@@ -263,12 +263,28 @@ test_that("simulate_tree is numerically correct", {
     tree_sim_summary$max_generation,
     3L
   )
+  expect_identical(
+    tree_sim_raw$chain_id,
+    c(1L, 2L, 2L, 2L, 2L, 2L, 2L)
+  )
+  expect_identical(
+    tree_sim_raw$sim_id,
+    c(1, 1, 2, 3, 4, 5, 6)
+  )
+  expect_identical(
+    tree_sim_raw$ancestor,
+    c(NA, NA,  1,  1,  2,  2,  2)
+  )
+  expect_identical(
+    tree_sim_raw$generation,
+    c(1L, 1L, 2L, 2L, 3L, 3L, 3L)
+  )
 })
 
 test_that("simulate_summary is numerically correct", {
   expect_identical(
     chain_summary_sim$max_chain_stat,
-    3.00
+    2.00
   )
   expect_identical(
     chain_summary_sim$min_chain_stat,
@@ -294,4 +310,20 @@ test_that("simulate_tree_from_pop is numerically correct", {
     1L
   )
   expect_null(susc_outbreak_summary$chains_ran)
+  expect_identical(
+    susc_outbreak_raw$sim_id,
+    1L
+  )
+  expect_identical(
+    susc_outbreak_raw$ancestor,
+    NA_integer_
+  )
+  expect_identical(
+    susc_outbreak_raw$generation,
+    1L
+  )
+  expect_identical(
+    susc_outbreak_raw$time,
+    0.00
+  )
 })

From 1b5eda7fa8de4a660f2973fcf386dfe914940417 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 15 Sep 2023 15:06:09 +0100
Subject: [PATCH 637/828] Add fix=TRUE to fix the pattern to be matched

---
 tests/testthat/test-helpers.R | 8 +++++---
 1 file changed, 5 insertions(+), 3 deletions(-)

diff --git a/tests/testthat/test-helpers.R b/tests/testthat/test-helpers.R
index 35fa89bb..c968ada5 100644
--- a/tests/testthat/test-helpers.R
+++ b/tests/testthat/test-helpers.R
@@ -42,10 +42,11 @@ test_that("get_offspring_func works correctly", {
     any(
       grepl(
         "spec = \"pois\"",
-        deparse(body(pois_offspring_func))
-        )
+        deparse(body(pois_offspring_func)),
+        fixed = TRUE
       )
     )
+  )
   nbinom_offspring_func <- get_offspring_func(
     offspring_dist = "nbinom",
     n = n,
@@ -58,7 +59,8 @@ test_that("get_offspring_func works correctly", {
     any(
       grepl(
         "spec = \"nbinom\"",
-        deparse(body(nbinom_offspring_func))
+        deparse(body(nbinom_offspring_func)),
+        fixed = TRUE
       )
     )
   )

From 052d9da638c2ee90e5e6df451309fdd883218908 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 15 Sep 2023 15:06:42 +0100
Subject: [PATCH 638/828] Fix comment tags

---
 tests/testthat/tests-simulate.R | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/tests/testthat/tests-simulate.R b/tests/testthat/tests-simulate.R
index 11bfb1df..e93777f2 100644
--- a/tests/testthat/tests-simulate.R
+++ b/tests/testthat/tests-simulate.R
@@ -1,10 +1,10 @@
-# Define global variables and options for simulations
+#' Define global variables and options for simulations
 set.seed(12)
 serial_func <- function(n) {
   rlnorm(n, meanlog = 0.58, sdlog = 1.58)
 }
 
-# simulate_tree()
+#' simulate_tree()
 tree_sim_raw <- simulate_tree(
   nchains = 2,
   offspring_dist = "pois",
@@ -14,7 +14,7 @@ tree_sim_raw <- simulate_tree(
 
 tree_sim_summary <- summary(tree_sim_raw)
 
-# simulate_summary()
+#' simulate_summary()
 chain_summary_raw <- simulate_summary(
   nchains = 2,
   offspring_dist = "pois",
@@ -24,7 +24,7 @@ chain_summary_raw <- simulate_summary(
 
 chain_summary_sim <- summary(chain_summary_raw)
 
-# simulate_tree_from_pop()
+#' simulate_tree_from_pop()
 susc_outbreak_raw <- simulate_tree_from_pop(
   pop = 100,
   offspring_dist = "pois",

From 7028c69d122240c92b95945c3b3ff5e320275311 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 15 Sep 2023 15:06:49 +0100
Subject: [PATCH 639/828] Lint

---
 tests/testthat/tests-simulate.R | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/tests/testthat/tests-simulate.R b/tests/testthat/tests-simulate.R
index e93777f2..1e36baaf 100644
--- a/tests/testthat/tests-simulate.R
+++ b/tests/testthat/tests-simulate.R
@@ -69,7 +69,8 @@ test_that("Simulators work", {
   )
   expect_gte(
     nrow(tree_sim_raw),
-    2)
+    2
+  )
   expect_gte(
     nrow(susc_outbreak_raw),
     1
@@ -273,7 +274,7 @@ test_that("simulate_tree is numerically correct", {
   )
   expect_identical(
     tree_sim_raw$ancestor,
-    c(NA, NA,  1,  1,  2,  2,  2)
+    c(NA, NA, 1, 1, 2, 2, 2)
   )
   expect_identical(
     tree_sim_raw$generation,

From b15b6c586a199d4ac56f56058fe2d5191780834c Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 18 Sep 2023 09:42:15 +0100
Subject: [PATCH 640/828] Remove snapshot tests

---
 tests/testthat/_snaps/helpers.md | 22 ----------------------
 tests/testthat/test-helpers.R    | 22 ----------------------
 2 files changed, 44 deletions(-)
 delete mode 100644 tests/testthat/_snaps/helpers.md

diff --git a/tests/testthat/_snaps/helpers.md b/tests/testthat/_snaps/helpers.md
deleted file mode 100644
index 83e69e93..00000000
--- a/tests/testthat/_snaps/helpers.md
+++ /dev/null
@@ -1,22 +0,0 @@
-# get_statistic_func snapshots look right
-
-    Code
-      body(pois_offspring_func)
-    Output
-      {
-          truncdist::rtrunc(n, spec = "pois", lambda = mean_offspring * 
-              susc/pop, b = susc)
-      }
-
----
-
-    Code
-      body(nbinom_offspring_func)
-    Output
-      {
-          new_mn <- mean_offspring * susc/pop
-          size <- new_mn/(disp_offspring - 1)
-          truncdist::rtrunc(n, spec = "nbinom", b = susc, mu = new_mn, 
-              size = size)
-      }
-
diff --git a/tests/testthat/test-helpers.R b/tests/testthat/test-helpers.R
index c968ada5..fe68e27e 100644
--- a/tests/testthat/test-helpers.R
+++ b/tests/testthat/test-helpers.R
@@ -66,28 +66,6 @@ test_that("get_offspring_func works correctly", {
   )
 })
 
-test_that("get_statistic_func snapshots look right", {
-  pois_offspring_func <- get_offspring_func(
-    offspring_dist = "pois",
-    n = n,
-    susc = susc,
-    pop = pop,
-    mean_offspring = mean_offspring,
-    disp_offspring = disp_offspring
-  )
-  expect_snapshot(body(pois_offspring_func))
-
-  nbinom_offspring_func <- get_offspring_func(
-    offspring_dist = "nbinom",
-    n = n,
-    susc = susc,
-    pop = pop,
-    mean_offspring = mean_offspring,
-    disp_offspring = disp_offspring
-  )
-  expect_snapshot(body(nbinom_offspring_func))
-})
-
 test_that("get_offspring_func throws errors", {
   expect_error(
     get_offspring_func(

From 03062a6e28b28cc77f653b92f822e5bb07eae5b6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 18 Sep 2023 18:02:34 +0100
Subject: [PATCH 641/828] Remove tests for the get_offspring_func() helper

---
 tests/testthat/test-helpers.R | 51 -----------------------------------
 1 file changed, 51 deletions(-)

diff --git a/tests/testthat/test-helpers.R b/tests/testthat/test-helpers.R
index fe68e27e..28d66f0b 100644
--- a/tests/testthat/test-helpers.R
+++ b/tests/testthat/test-helpers.R
@@ -29,57 +29,6 @@ test_that("update_chain_stat works correctly", {
   )
 })
 
-test_that("get_offspring_func works correctly", {
-  pois_offspring_func <- get_offspring_func(
-    offspring_dist = "pois",
-    n = n,
-    susc = susc,
-    pop = pop,
-    mean_offspring = mean_offspring,
-    disp_offspring = disp_offspring
-  )
-  expect_true(
-    any(
-      grepl(
-        "spec = \"pois\"",
-        deparse(body(pois_offspring_func)),
-        fixed = TRUE
-      )
-    )
-  )
-  nbinom_offspring_func <- get_offspring_func(
-    offspring_dist = "nbinom",
-    n = n,
-    susc = susc,
-    pop = pop,
-    mean_offspring = mean_offspring,
-    disp_offspring = disp_offspring
-  )
-  expect_true(
-    any(
-      grepl(
-        "spec = \"nbinom\"",
-        deparse(body(nbinom_offspring_func)),
-        fixed = TRUE
-      )
-    )
-  )
-})
-
-test_that("get_offspring_func throws errors", {
-  expect_error(
-    get_offspring_func(
-      offspring_dist = "ss",
-      n = n,
-      susc = susc,
-      pop = pop,
-      mean_offspring = mean_offspring,
-      disp_offspring = disp_offspring
-    ),
-    "must either be"
-  )
-})
-
 test_that("get_statistic_func works correctly", {
   expect_identical(
     get_statistic_func(chain_statistic = "size"),

From d7aa39c4fe3b7e63428da5a18eb2bc8debecf6a2 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 19 Sep 2023 16:45:40 +0100
Subject: [PATCH 642/828] Restructure tests-epichains by moving simulations
 into individual contexts

---
 tests/testthat/_snaps/epichains.md | 190 ++++++----
 tests/testthat/test-epichains.R    | 551 +++++++++++++++++++++++------
 2 files changed, 565 insertions(+), 176 deletions(-)

diff --git a/tests/testthat/_snaps/epichains.md b/tests/testthat/_snaps/epichains.md
index c31d1707..66a761ec 100644
--- a/tests/testthat/_snaps/epichains.md
+++ b/tests/testthat/_snaps/epichains.md
@@ -1,52 +1,77 @@
-# print.epichains works for simulate_summary output
+# print.epichains works for simulation functions
 
     Code
-      epichains_summary
+      susc_outbreak_raw
     Output
-      `epichains` object 
+      `epichains` object
+      
+      < tree head (from first known ancestor) >
       
-       [1]   1 Inf Inf Inf Inf   1   2 Inf   1   1
+      [1] sim_id     ancestor   generation time      
+      <0 rows> (or 0-length row.names)
       
-       Simulated chain sizes: 
+      < tree tail >
       
-      Max: 2
-      Min: 1
+        sim_id ancestor generation time
+      1      1       NA          1    0
+      Number of ancestors (known): 0
+      Number of generations: 1
+      Use `as.data.frame(<object_name>)` to view the full output in the console.
 
-# print.epichains works for simulate_tree output
+---
 
     Code
-      epichains_tree
+      susc_outbreak_raw2
     Output
       `epichains` object
       
       < tree head (from first known ancestor) >
       
-         chain_id sim_id ancestor generation
-      11        1      2        1          2
-      13        2      2        1          2
-      18        3      2        1          2
-      19        4      2        1          2
-      22        6      2        1          2
-      23        8      2        1          2
+        sim_id ancestor generation       time
+      2      2        1          2 21.5834705
+      3      3        1          2  0.3939008
+      4      4        2          3 21.6595273
       
       < tree tail >
       
-         chain_id sim_id ancestor generation
-      41        2     17        6          3
-      85        6     17        6          4
-      42        2     18        6          3
-      86        6     18        7          4
-      87        6     19        7          4
-      88        6     20        7          4
-      Chains simulated: 10
-      Number of ancestors (known): 9
-      Number of generations: 5
+        sim_id ancestor generation       time
+      1      1       NA          1  0.0000000
+      2      2        1          2 21.5834705
+      3      3        1          2  0.3939008
+      4      4        2          3 21.6595273
+      Number of ancestors (known): 2
+      Number of generations: 3
       Use `as.data.frame(<object_name>)` to view the full output in the console.
 
 ---
 
     Code
-      epichains_tree2
+      tree_sim_raw
+    Output
+      `epichains` object
+      
+      < tree head (from first known ancestor) >
+      
+        chain_id sim_id ancestor generation
+      3        1      2        1          2
+      4        1      3        1          2
+      
+      < tree tail >
+      
+        chain_id sim_id ancestor generation
+      1        1      1       NA          1
+      2        2      1       NA          1
+      3        1      2        1          2
+      4        1      3        1          2
+      Chains simulated: 2
+      Number of ancestors (known): 1
+      Number of generations: 2
+      Use `as.data.frame(<object_name>)` to view the full output in the console.
+
+---
+
+    Code
+      tree_sim_raw2
     Output
       `epichains` object
       
@@ -55,84 +80,125 @@
          chain_id sim_id ancestor generation time
       11        1      2        1          2    3
       13        2      2        1          2    3
-      16        3      2        1          2    3
+      15        3      2        1          2    3
       17        4      2        1          2    3
-      18        5      2        1          2    3
       19        6      2        1          2    3
+      20        7      2        1          2    3
       
       < tree tail >
       
           chain_id sim_id ancestor generation time
-      116        7     20        9          4    9
-      128        8     20        9          4    9
-      117        7     21        9          4    9
-      129        8     21        9          4    9
-      130        8     22        9          4    9
-      131        8     23        9          4    9
+      92         9     19        8          4    9
+      109        6     19        8          5   12
+      93         9     20        9          4    9
+      110        6     20        9          5   12
+      94         9     21        9          4    9
+      111        6     21        9          5   12
       Chains simulated: 10
       Number of ancestors (known): 9
-      Number of generations: 4
+      Number of generations: 5
       Use `as.data.frame(<object_name>)` to view the full output in the console.
 
 # head and tail methods work
 
     Code
-      head(epichains_tree)
+      head(susc_outbreak_raw)
     Output
       < tree head (from first known ancestor) >
       
-         chain_id sim_id ancestor generation
-      11        1      2        1          2
-      13        2      2        1          2
-      18        3      2        1          2
-      19        4      2        1          2
-      22        6      2        1          2
-      23        8      2        1          2
+      [1] sim_id     ancestor   generation time      
+      <0 rows> (or 0-length row.names)
 
 ---
 
     Code
-      head(epichains_tree2)
+      head(susc_outbreak_raw2)
+    Output
+      < tree head (from first known ancestor) >
+      
+        sim_id ancestor generation       time
+      2      2        1          2 21.5834705
+      3      3        1          2  0.3939008
+      4      4        2          3 21.6595273
+
+---
+
+    Code
+      head(tree_sim_raw)
+    Output
+      < tree head (from first known ancestor) >
+      
+        chain_id sim_id ancestor generation
+      3        1      2        1          2
+      4        1      3        1          2
+
+---
+
+    Code
+      head(tree_sim_raw2)
     Output
       < tree head (from first known ancestor) >
       
          chain_id sim_id ancestor generation time
       11        1      2        1          2    3
       13        2      2        1          2    3
-      16        3      2        1          2    3
+      15        3      2        1          2    3
       17        4      2        1          2    3
-      18        5      2        1          2    3
       19        6      2        1          2    3
+      20        7      2        1          2    3
+
+---
+
+    Code
+      tail(susc_outbreak_raw)
+    Output
+      
+      < tree tail >
+      
+        sim_id ancestor generation time
+      1      1       NA          1    0
+
+---
+
+    Code
+      tail(susc_outbreak_raw2)
+    Output
+      
+      < tree tail >
+      
+        sim_id ancestor generation       time
+      1      1       NA          1  0.0000000
+      2      2        1          2 21.5834705
+      3      3        1          2  0.3939008
+      4      4        2          3 21.6595273
 
 ---
 
     Code
-      tail(epichains_tree)
+      tail(tree_sim_raw)
     Output
       
       < tree tail >
       
-         chain_id sim_id ancestor generation
-      41        2     17        6          3
-      85        6     17        6          4
-      42        2     18        6          3
-      86        6     18        7          4
-      87        6     19        7          4
-      88        6     20        7          4
+        chain_id sim_id ancestor generation
+      1        1      1       NA          1
+      2        2      1       NA          1
+      3        1      2        1          2
+      4        1      3        1          2
 
 ---
 
     Code
-      tail(epichains_tree2)
+      tail(tree_sim_raw2)
     Output
       
       < tree tail >
       
           chain_id sim_id ancestor generation time
-      116        7     20        9          4    9
-      128        8     20        9          4    9
-      117        7     21        9          4    9
-      129        8     21        9          4    9
-      130        8     22        9          4    9
-      131        8     23        9          4    9
+      92         9     19        8          4    9
+      109        6     19        8          5   12
+      93         9     20        9          4    9
+      110        6     20        9          5   12
+      94         9     21        9          4    9
+      111        6     21        9          5   12
 
diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index 624151ae..3ad1c810 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -1,73 +1,189 @@
-set.seed(12)
-epichains_summary <- simulate_summary(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  lambda = 2
-)
-epichains_tree <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  lambda = 2
-)
-epichains_tree2 <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 2
-)
+#' Define global variables and options for simulations
+serial_func <- function(n) {
+  rlnorm(n, meanlog = 0.58, sdlog = 1.58)
+}
 
-aggreg_by_gen <- aggregate(
-  epichains_tree,
-  grouping_var = "generation"
-)
-aggreg_by_time <- aggregate(
-  epichains_tree2,
-  grouping_var = "time"
-)
-
-aggreg_by_both <- aggregate(
-  epichains_tree2,
-  grouping_var = "both"
-)
-
-set.seed(11223)
-epichains_summary_all_infs <- simulate_summary(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  lambda = 3
-)
-
-test_that("print.epichains works for simulate_summary output", {
-  expect_snapshot(epichains_summary)
-})
-
-test_that("print.epichains works for simulate_tree output", {
-  expect_snapshot(epichains_tree)
+test_that("Simulators return epichains objects", {
+  set.seed(12)
+  #' Simulate an outbreak from a susceptible population (pois)
+  susc_outbreak_raw <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "pois",
+    offspring_mean = 0.9,
+    serials_dist = serial_func
+  )
+  #' Simulate an outbreak from a susceptible population (nbinom)
+  susc_outbreak_raw2 <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "nbinom",
+    offspring_mean = 1,
+    offspring_disp = 1.1,
+    serials_dist = serial_func
+  )
+  #' Simulate a tree of infections without serials
+  tree_sim_raw <- simulate_tree(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+  #' Simulate a tree of infections with serials
+  tree_sim_raw2 <- simulate_tree(
+    nchains = 10,
+    statistic = "size",
+    offspring_dist = "pois",
+    stat_max = 10,
+    serials_dist = function(x) 3,
+    lambda = 2
+  )
+  #' Simulate chain statistics
+  chain_summary_raw <- simulate_summary(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+  #' Expectations
+  expect_s3_class(
+    tree_sim_raw,
+    "epichains"
+  )
+  expect_s3_class(
+    tree_sim_raw2,
+    "epichains"
+  )
+  expect_s3_class(
+    susc_outbreak_raw,
+    "epichains"
+  )
+  expect_s3_class(
+    susc_outbreak_raw2,
+    "epichains"
+  )
+  expect_s3_class(
+    chain_summary_raw,
+    "epichains"
+  )
 })
 
-test_that("print.epichains works for simulate_tree output", {
-  expect_snapshot(epichains_tree2)
+test_that("print.epichains works for simulation functions", {
+  set.seed(12)
+  #' Simulate an outbreak from a susceptible population (pois)
+  susc_outbreak_raw <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "pois",
+    offspring_mean = 0.9,
+    serials_dist = serial_func
+  )
+  #' Simulate an outbreak from a susceptible population (nbinom)
+  susc_outbreak_raw2 <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "nbinom",
+    offspring_mean = 1,
+    offspring_disp = 1.1,
+    serials_dist = serial_func
+  )
+  #' Simulate a tree of infections without serials
+  tree_sim_raw <- simulate_tree(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+  #' Simulate a tree of infections with serials
+  tree_sim_raw2 <- simulate_tree(
+    nchains = 10,
+    statistic = "size",
+    offspring_dist = "pois",
+    stat_max = 10,
+    serials_dist = function(x) 3,
+    lambda = 2
+  )
+  #' Simulate chain statistics
+  chain_summary_raw <- simulate_summary(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+  #' Expectations
+  expect_snapshot(susc_outbreak_raw)
+  expect_snapshot(susc_outbreak_raw2)
+  expect_snapshot(tree_sim_raw)
+  expect_snapshot(tree_sim_raw2)
+  expect_snapshot(chain_summary_raw)
 })
 
 test_that("summary.epichains works as expected", {
+  set.seed(12)
+  #' Simulate an outbreak from a susceptible population (pois)
+  susc_outbreak_raw <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "pois",
+    offspring_mean = 0.9,
+    serials_dist = serial_func
+  )
+  #' Simulate an outbreak from a susceptible population (nbinom)
+  susc_outbreak_raw2 <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "nbinom",
+    offspring_mean = 1,
+    offspring_disp = 1.1,
+    serials_dist = serial_func
+  )
+  #' Simulate a tree of infections without serials
+  tree_sim_raw <- simulate_tree(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+  #' Simulate a tree of infections with serials
+  tree_sim_raw2 <- simulate_tree(
+    nchains = 10,
+    statistic = "size",
+    offspring_dist = "pois",
+    stat_max = 10,
+    serials_dist = function(x) 3,
+    lambda = 2
+  )
+  #' Simulate chain statistics
+  chain_summary_raw <- simulate_summary(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+   #' Simulate case where all the chain statistics are Inf
+  set.seed(11223)
+  epichains_summary_all_infs <- simulate_summary(
+    nchains = 10,
+    statistic = "size",
+    offspring_dist = "pois",
+    stat_max = 10,
+    lambda = 3
+  )
+  #' Expectations
   expect_named(
-    summary(epichains_summary),
+    summary(tree_sim_raw),
     c(
-      "chain_ran",
-      "max_chain_stat",
-      "min_chain_stat"
+      "chains_ran",
+      "max_time",
+      "unique_ancestors",
+      "max_generation"
+    )
+  )
+  expect_named(
+    summary(tree_sim_raw2),
+    c(
+      "chains_ran",
+      "max_time",
+      "unique_ancestors",
+      "max_generation"
     )
   )
   expect_named(
-    summary(epichains_tree2),
+    summary(susc_outbreak_raw),
     c(
       "chains_ran",
       "max_time",
@@ -76,7 +192,7 @@ test_that("summary.epichains works as expected", {
     )
   )
   expect_named(
-    summary(epichains_tree2),
+    summary(susc_outbreak_raw2),
     c(
       "chains_ran",
       "max_time",
@@ -84,6 +200,14 @@ test_that("summary.epichains works as expected", {
       "max_generation"
     )
   )
+  expect_named(
+    summary(chain_summary_raw),
+    c(
+      "chain_ran",
+      "max_chain_stat",
+      "min_chain_stat"
+    )
+  )
   expect_true(
     is.infinite(
       summary(epichains_summary_all_infs)$min_chain_stat
@@ -97,42 +221,208 @@ test_that("summary.epichains works as expected", {
 })
 
 test_that("validate_epichains works", {
+  set.seed(12)
+  #' Simulate an outbreak from a susceptible population (pois)
+  susc_outbreak_raw <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "pois",
+    offspring_mean = 0.9,
+    serials_dist = serial_func
+  )
+  #' Simulate an outbreak from a susceptible population (nbinom)
+  susc_outbreak_raw2 <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "nbinom",
+    offspring_mean = 1,
+    offspring_disp = 1.1,
+    serials_dist = serial_func
+  )
+  #' Simulate a tree of infections without serials
+  tree_sim_raw <- simulate_tree(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+  #' Simulate a tree of infections with serials
+  tree_sim_raw2 <- simulate_tree(
+    nchains = 10,
+    statistic = "size",
+    offspring_dist = "pois",
+    stat_max = 10,
+    serials_dist = function(x) 3,
+    lambda = 2
+  )
+  #' Simulate chain statistics
+  chain_summary_raw <- simulate_summary(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+  #' Expectations
+  expect_invisible(
+    validate_epichains(susc_outbreak_raw)
+  )
+  expect_invisible(
+    validate_epichains(susc_outbreak_raw2)
+  )
   expect_invisible(
-    validate_epichains(epichains_summary)
+    validate_epichains(tree_sim_raw)
   )
   expect_invisible(
-    validate_epichains(epichains_tree)
+    validate_epichains(tree_sim_raw2)
   )
   expect_invisible(
-    validate_epichains(epichains_tree2)
+    validate_epichains(chain_summary_raw)
+  )
+  expect_error(
+      validate_epichains(mtcars),
+      "must have an epichains class"
   )
 })
 
 test_that("is_chains_tree works", {
+  set.seed(12)
+  #' Simulate an outbreak from a susceptible population
+  susc_outbreak_raw <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "pois",
+    offspring_mean = 0.9,
+    serials_dist = serial_func
+  )
+  #' Simulate an outbreak from a susceptible population (nbinom)
+  susc_outbreak_raw2 <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "nbinom",
+    offspring_mean = 1,
+    offspring_disp = 1.1,
+    serials_dist = serial_func
+  )
+  #' Simulate a tree of infections without serials
+  tree_sim_raw <- simulate_tree(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+  #' Simulate a tree of infections with serials
+  tree_sim_raw2 <- simulate_tree(
+    nchains = 10,
+    statistic = "size",
+    offspring_dist = "pois",
+    stat_max = 10,
+    serials_dist = function(x) 3,
+    lambda = 2
+  )
+  #' Simulate chain statistics
+  chain_summary_raw <- simulate_summary(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+  #' Expectations
   expect_true(
-    is_chains_tree(epichains_tree)
+    is_chains_tree(susc_outbreak_raw)
   )
   expect_true(
-    is_chains_tree(epichains_tree2)
+    is_chains_tree(susc_outbreak_raw2)
+  )
+  expect_true(
+    is_chains_tree(tree_sim_raw)
+  )
+  expect_true(
+    is_chains_tree(tree_sim_raw2)
   )
   expect_false(
-    is_chains_tree(epichains_summary)
+    is_chains_tree(chain_summary_raw)
   )
 })
 
 test_that("is_chains_summary works", {
-  expect_true(
-    is_chains_tree(epichains_tree)
+  set.seed(12)
+  #' Simulate an outbreak from a susceptible population
+  susc_outbreak_raw <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "pois",
+    offspring_mean = 0.9,
+    serials_dist = serial_func
+  )
+  #' Simulate an outbreak from a susceptible population (nbinom)
+  susc_outbreak_raw2 <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "nbinom",
+    offspring_mean = 1,
+    offspring_disp = 1.1,
+    serials_dist = serial_func
+  )
+  #' Simulate a tree of infections without serials
+  tree_sim_raw <- simulate_tree(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+  #' Simulate a tree of infections with serials
+  tree_sim_raw2 <- simulate_tree(
+    nchains = 10,
+    statistic = "size",
+    offspring_dist = "pois",
+    stat_max = 10,
+    serials_dist = function(x) 3,
+    lambda = 2
   )
+  #' Simulate chain statistics
+  chain_summary_raw <- simulate_summary(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+  #' Expectations
   expect_true(
-    is_chains_tree(epichains_tree2)
+    is_chains_summary(chain_summary_raw)
+  )
+  expect_false(
+    is_chains_summary(susc_outbreak_raw)
+  )
+  expect_false(
+    is_chains_summary(susc_outbreak_raw2)
+  )
+  expect_false(
+    is_chains_summary(tree_sim_raw)
   )
   expect_false(
-    is_chains_tree(epichains_summary)
+    is_chains_summary(tree_sim_raw2)
   )
 })
 
-test_that("is_epichains_aggregate_df works", {
+test_that("aggregate.epichains method returns correct objects", {
+  set.seed(12)
+  #' Simulate a tree of infections with serials
+  tree_sim_raw2 <- simulate_tree(
+    nchains = 10,
+    statistic = "size",
+    offspring_dist = "pois",
+    stat_max = 10,
+    serials_dist = function(x) 3,
+    lambda = 2
+  )
+  #' Create aggregates
+  aggreg_by_gen <- aggregate(
+    tree_sim_raw2,
+    grouping_var = "generation"
+  )
+  aggreg_by_time <- aggregate(
+    tree_sim_raw2,
+    grouping_var = "time"
+  )
+  aggreg_by_both <- aggregate(
+    tree_sim_raw2,
+    grouping_var = "both"
+  )
+  #' Expectations for <epichains> class inheritance
   expect_true(
     is_epichains_aggregate_df(aggreg_by_gen)
   )
@@ -142,65 +432,98 @@ test_that("is_epichains_aggregate_df works", {
   expect_true(
     is_epichains_aggregate_df(aggreg_by_both)
   )
-  expect_false(
-    is_epichains_aggregate_df(epichains_tree)
-  )
-})
-
-test_that("validate_epichains throws errors", {
-  expect_error(
-    validate_epichains(mtcars),
-    "must have an epichains class"
-  )
-})
-
-test_that("head and tail methods work", {
-  expect_snapshot(head(epichains_tree))
-  expect_snapshot(head(epichains_tree2))
-  expect_snapshot(tail(epichains_tree))
-  expect_snapshot(tail(epichains_tree2))
-})
-
-test_that("aggregate method work", {
-  expect_named(
-    aggreg_by_gen,
-    c("generation", "cases")
-  )
-  expect_named(
-    aggreg_by_time,
-    c("time", "cases")
-  )
-  expect_identical(
-    as.vector(
-      vapply(aggreg_by_both, names, FUN.VALUE = character(2))
-    ),
-    c("time", "cases", "generation", "cases")
-  )
+  #' Expectations for <base> class inheritance
   expect_s3_class(
     aggreg_by_gen,
-    "epichains_aggregate_df"
+    "data.frame"
   )
   expect_s3_class(
     aggreg_by_time,
-    "epichains_aggregate_df"
+    "data.frame"
   )
   expect_s3_class(
     aggreg_by_both,
-    "epichains_aggregate_df"
-  )
-  expect_error(
-    aggregate(epichains_summary),
-    "attribute"
+    "list"
   )
 })
 
 test_that("aggregate method is numerically correct", {
+  set.seed(12)
+  #' Simulate a tree of infections without serials
+  tree_sim_raw <- simulate_tree(
+    nchains = 10,
+    statistic = "size",
+    offspring_dist = "pois",
+    stat_max = 10,
+    lambda = 2
+  )
+  #' Simulate a tree of infections with serials
+  tree_sim_raw2 <- simulate_tree(
+    nchains = 10,
+    statistic = "size",
+    offspring_dist = "pois",
+    stat_max = 10,
+    serials_dist = function(x) 3,
+    lambda = 2
+  )
+  #' Create aggregates
+  aggreg_by_gen <- aggregate(
+    tree_sim_raw,
+    grouping_var = "generation"
+  )
+  aggreg_by_time <- aggregate(
+    tree_sim_raw2,
+    grouping_var = "time"
+  )
   expect_identical(
     aggreg_by_gen$cases,
-    c(10L, 17L, 38L, 38L, 12L)
+    c(10L, 12L, 19L, 26L, 14L)
   )
   expect_identical(
     aggreg_by_time$cases,
-    c(10L, 21L, 48L, 60L)
+    c(10L, 17L, 38L, 38L, 12L)
+  )
+})
+
+test_that("head and tail methods work", {
+  set.seed(12)
+  #' Simulate an outbreak from a susceptible population
+  susc_outbreak_raw <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "pois",
+    offspring_mean = 0.9,
+    serials_dist = serial_func
+  )
+  #' Simulate an outbreak from a susceptible population (nbinom)
+  susc_outbreak_raw2 <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "nbinom",
+    offspring_mean = 1,
+    offspring_disp = 1.1,
+    serials_dist = serial_func
+  )
+  #' Simulate a tree of infections without serials
+  tree_sim_raw <- simulate_tree(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+  #' Simulate a tree of infections with serials
+  tree_sim_raw2 <- simulate_tree(
+    nchains = 10,
+    statistic = "size",
+    offspring_dist = "pois",
+    stat_max = 10,
+    serials_dist = function(x) 3,
+    lambda = 2
   )
+  expect_snapshot(head(susc_outbreak_raw))
+  expect_snapshot(head(susc_outbreak_raw2))
+  expect_snapshot(head(tree_sim_raw))
+  expect_snapshot(head(tree_sim_raw2))
+  expect_snapshot(tail(susc_outbreak_raw))
+  expect_snapshot(tail(susc_outbreak_raw2))
+  expect_snapshot(tail(tree_sim_raw))
+  expect_snapshot(tail(tree_sim_raw2))
 })

From cd7da02e6e47e696b3127b8b29d1c9af1b5f0ba0 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 19 Sep 2023 17:39:29 +0100
Subject: [PATCH 643/828] Restructure test-simulate by moving simulations into
 individual contexts

---
 tests/testthat/tests-simulate.R | 149 ++++++++++++++++++--------------
 1 file changed, 85 insertions(+), 64 deletions(-)

diff --git a/tests/testthat/tests-simulate.R b/tests/testthat/tests-simulate.R
index 1e36baaf..768be755 100644
--- a/tests/testthat/tests-simulate.R
+++ b/tests/testthat/tests-simulate.R
@@ -1,76 +1,61 @@
 #' Define global variables and options for simulations
-set.seed(12)
 serial_func <- function(n) {
   rlnorm(n, meanlog = 0.58, sdlog = 1.58)
 }
 
-#' simulate_tree()
-tree_sim_raw <- simulate_tree(
-  nchains = 2,
-  offspring_dist = "pois",
-  statistic = "length",
-  lambda = 0.9
-)
-
-tree_sim_summary <- summary(tree_sim_raw)
-
-#' simulate_summary()
-chain_summary_raw <- simulate_summary(
-  nchains = 2,
-  offspring_dist = "pois",
-  statistic = "length",
-  lambda = 0.9
-)
-
-chain_summary_sim <- summary(chain_summary_raw)
-
-#' simulate_tree_from_pop()
-susc_outbreak_raw <- simulate_tree_from_pop(
-  pop = 100,
-  offspring_dist = "pois",
-  offspring_mean = 0.9,
-  serials_dist = serial_func
-)
-
-susc_outbreak_raw2 <- simulate_tree_from_pop(
-  pop = 100,
-  offspring_dist = "nbinom",
-  offspring_mean = 1,
-  offspring_disp = 1.1,
-  serials_dist = serial_func
-)
-
-susc_outbreak_summary <- summary(susc_outbreak_raw)
-
-test_that("Simulators return epichains objects", {
-  expect_s3_class(
-    tree_sim_raw,
-    "epichains"
-  )
-  expect_s3_class(
-    susc_outbreak_raw,
-    "epichains"
-  )
-  expect_s3_class(
-    chain_summary_raw,
-    "epichains"
-  )
-})
-
 test_that("Simulators work", {
+  set.seed(12)
+  #' Simulate an outbreak from a susceptible population (pois)
+  susc_outbreak_raw <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "pois",
+    offspring_mean = 0.9,
+    serials_dist = serial_func
+  )
+  #' Simulate an outbreak from a susceptible population (nbinom)
+  susc_outbreak_raw2 <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "nbinom",
+    offspring_mean = 1,
+    offspring_disp = 1.1,
+    serials_dist = serial_func
+  )
+  #' Simulate a tree of infections without serials
+  tree_sim_raw <- simulate_tree(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+  #' Simulate a tree of infections with serials
+  tree_sim_raw2 <- simulate_tree(
+    nchains = 10,
+    statistic = "size",
+    offspring_dist = "pois",
+    stat_max = 10,
+    serials_dist = function(x) 3,
+    lambda = 2
+  )
+  #' Simulate chain statistics
+  chain_summary_raw <- simulate_summary(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+ #' Expectations
   expect_length(
-    simulate_summary(
-      nchains = 2,
-      statistic = "size",
-      offspring_dist = "pois",
-      lambda = 0.5
-    ),
+    chain_summary_raw,
     2
   )
   expect_gte(
     nrow(tree_sim_raw),
     2
   )
+  expect_gte(
+    nrow(tree_sim_raw2),
+    2
+  )
   expect_gte(
     nrow(susc_outbreak_raw),
     1
@@ -197,7 +182,6 @@ test_that("simulate_summary throws errors", {
 })
 
 test_that("simulate_tree_from_pop throws errors", {
-  set.seed(123)
   expect_error(
     simulate_tree_from_pop(
       pop = 100,
@@ -252,6 +236,17 @@ test_that("simulate_tree_from_pop throws warnings", {
 })
 
 test_that("simulate_tree is numerically correct", {
+  set.seed(12)
+  #' Simulate a tree of infections without serials
+  tree_sim_raw <- simulate_tree(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+  #' summarise the results
+  tree_sim_summary <- summary(tree_sim_raw)
+  #' Expectations
   expect_identical(
     tree_sim_summary$chains_ran,
     2.00
@@ -283,21 +278,47 @@ test_that("simulate_tree is numerically correct", {
 })
 
 test_that("simulate_summary is numerically correct", {
+  set.seed(12)
+  #' Simulate chain statistics
+  chain_summary_raw <- simulate_summary(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+  #' Summarise the results
+  chain_summary_summaries <- summary(chain_summary_raw)
+  #' Expectations
   expect_identical(
-    chain_summary_sim$max_chain_stat,
+  chain_summary_summaries$chain_ran,
     2.00
   )
   expect_identical(
-    chain_summary_sim$min_chain_stat,
+    chain_summary_summaries$max_chain_stat,
+    3.00
+  )
+  expect_identical(
+    chain_summary_summaries$min_chain_stat,
     1.00
   )
   expect_identical(
     as.vector(chain_summary_raw),
-    c(2.00, 1.00)
+    c(1.00, 3.00)
   )
 })
 
 test_that("simulate_tree_from_pop is numerically correct", {
+  set.seed(12)
+  #' Simulate an outbreak from a susceptible population
+  susc_outbreak_raw <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "pois",
+    offspring_mean = 0.9,
+    serials_dist = serial_func
+  )
+  #' Summarise the results
+  susc_outbreak_summary <- summary(susc_outbreak_raw)
+  #' Expectations
   expect_identical(
     susc_outbreak_summary$unique_ancestors,
     0L

From 74e3ea643cd7a89c8b072c3a388919c6396734b7 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 19 Sep 2023 17:42:00 +0100
Subject: [PATCH 644/828] Rename file

---
 tests/testthat/{tests-simulate.R => test-simulate.R} | 0
 1 file changed, 0 insertions(+), 0 deletions(-)
 rename tests/testthat/{tests-simulate.R => test-simulate.R} (100%)

diff --git a/tests/testthat/tests-simulate.R b/tests/testthat/test-simulate.R
similarity index 100%
rename from tests/testthat/tests-simulate.R
rename to tests/testthat/test-simulate.R

From e13cfd194d70ef37be670a5854095e41ee4ff91e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 19 Sep 2023 17:44:38 +0100
Subject: [PATCH 645/828] Linting

---
 tests/testthat/test-epichains.R | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index 3ad1c810..e842cd83 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -154,7 +154,7 @@ test_that("summary.epichains works as expected", {
     statistic = "length",
     lambda = 0.9
   )
-   #' Simulate case where all the chain statistics are Inf
+  #' Simulate case where all the chain statistics are Inf
   set.seed(11223)
   epichains_summary_all_infs <- simulate_summary(
     nchains = 10,
@@ -277,8 +277,8 @@ test_that("validate_epichains works", {
     validate_epichains(chain_summary_raw)
   )
   expect_error(
-      validate_epichains(mtcars),
-      "must have an epichains class"
+    validate_epichains(mtcars),
+    "must have an epichains class"
   )
 })
 

From 0f6f432034c66f2986009ab688414db7f0ed2a31 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 19 Sep 2023 17:44:54 +0100
Subject: [PATCH 646/828] Linting

---
 tests/testthat/test-simulate.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index 768be755..4f454712 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -43,7 +43,7 @@ test_that("Simulators work", {
     statistic = "length",
     lambda = 0.9
   )
- #' Expectations
+  #' Expectations
   expect_length(
     chain_summary_raw,
     2
@@ -290,7 +290,7 @@ test_that("simulate_summary is numerically correct", {
   chain_summary_summaries <- summary(chain_summary_raw)
   #' Expectations
   expect_identical(
-  chain_summary_summaries$chain_ran,
+    chain_summary_summaries$chain_ran,
     2.00
   )
   expect_identical(

From 6516df9b1001bbeb9953790bd4a06b8caeeb35e4 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 19 Sep 2023 18:15:49 +0100
Subject: [PATCH 647/828] Add tests for the class of the head and tail methods

---
 tests/testthat/test-epichains.R | 70 ++++++++++++++++++++++++++++++++-
 1 file changed, 69 insertions(+), 1 deletion(-)

diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index e842cd83..98f58523 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -485,7 +485,7 @@ test_that("aggregate method is numerically correct", {
   )
 })
 
-test_that("head and tail methods work", {
+test_that("head and tail print output as expected", {
   set.seed(12)
   #' Simulate an outbreak from a susceptible population
   susc_outbreak_raw <- simulate_tree_from_pop(
@@ -527,3 +527,71 @@ test_that("head and tail methods work", {
   expect_snapshot(tail(tree_sim_raw))
   expect_snapshot(tail(tree_sim_raw2))
 })
+
+test_that("head and tail return data.frames", {
+  set.seed(12)
+  #' Simulate an outbreak from a susceptible population
+  susc_outbreak_raw <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "pois",
+    offspring_mean = 0.9,
+    serials_dist = serial_func
+  )
+  #' Simulate an outbreak from a susceptible population (nbinom)
+  susc_outbreak_raw2 <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "nbinom",
+    offspring_mean = 1,
+    offspring_disp = 1.1,
+    serials_dist = serial_func
+  )
+  #' Simulate a tree of infections without serials
+  tree_sim_raw <- simulate_tree(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9
+  )
+  #' Simulate a tree of infections with serials
+  tree_sim_raw2 <- simulate_tree(
+    nchains = 10,
+    statistic = "size",
+    offspring_dist = "pois",
+    stat_max = 10,
+    serials_dist = function(x) 3,
+    lambda = 2
+  )
+  #' Expectations
+  expect_s3_class(
+    head(susc_outbreak_raw),
+    "data.frame"
+  )
+  expect_s3_class(
+    head(susc_outbreak_raw2),
+    "data.frame"
+  )
+  expect_s3_class(
+    head(tree_sim_raw),
+    "data.frame"
+  )
+  expect_s3_class(
+    head(tree_sim_raw2),
+    "data.frame"
+  )
+  expect_s3_class(
+    tail(susc_outbreak_raw),
+    "data.frame"
+  )
+  expect_s3_class(
+    tail(susc_outbreak_raw2),
+    "data.frame"
+  )
+  expect_s3_class(
+    tail(tree_sim_raw),
+    "data.frame"
+  )
+  expect_s3_class(
+    tail(tree_sim_raw2),
+    "data.frame"
+  )
+})

From 021ed8673df641cb021d1b39a24170fb60ae2b4e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 19 Sep 2023 18:16:02 +0100
Subject: [PATCH 648/828] Re-generate snaps

---
 tests/testthat/_snaps/epichains.md | 16 +++++++++++++++-
 1 file changed, 15 insertions(+), 1 deletion(-)

diff --git a/tests/testthat/_snaps/epichains.md b/tests/testthat/_snaps/epichains.md
index 66a761ec..bb23c3c1 100644
--- a/tests/testthat/_snaps/epichains.md
+++ b/tests/testthat/_snaps/epichains.md
@@ -99,7 +99,21 @@
       Number of generations: 5
       Use `as.data.frame(<object_name>)` to view the full output in the console.
 
-# head and tail methods work
+---
+
+    Code
+      chain_summary_raw
+    Output
+      `epichains` object 
+      
+      [1] 4 1
+      
+       Simulated chain lengths: 
+      
+      Max: 4
+      Min: 1
+
+# head and tail print output as expected
 
     Code
       head(susc_outbreak_raw)

From 46a697ee92e5c7ffa885f1a6401b533b3c823fba Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 20 Sep 2023 14:12:11 +0100
Subject: [PATCH 649/828] Remove aggregate by "both" variable

---
 R/epichains.R                   | 24 +++---------------------
 man/aggregate.epichains.Rd      | 11 +++++------
 tests/testthat/test-epichains.R |  6 ------
 3 files changed, 8 insertions(+), 33 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 642f002c..83a722c6 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -266,15 +266,12 @@ tail.epichains <- function(x, ...) {
 #' aggregate(chains, grouping_var = "time")
 #'
 #' # Aggregate cases per generation
-#' aggregate(chains, grouping_var = "generation")
-#'
-#' # Aggregate cases per both time and generation
-#' aggregate(chains, grouping_var = "both")
+#' cases_per_gen <- aggregate(chains, grouping_var = "generation")
+#' head(cases_per_gen)
 aggregate.epichains <- function(x,
                                 grouping_var = c(
                                   "time",
-                                  "generation",
-                                  "both"
+                                  "generation"
                                 ),
                                 ...) {
   validate_epichains(x)
@@ -303,21 +300,6 @@ aggregate.epichains <- function(x,
       list(generation = x$generation),
       FUN = NROW
     )
-  } else if (grouping_var == "both") {
-    # Count the number of cases per time
-    list(
-      stats::aggregate(
-        list(cases = x$sim_id),
-        list(time = x$time),
-        FUN = NROW
-      ),
-      # Count the number of cases per generation
-      stats::aggregate(
-        list(cases = x$sim_id),
-        list(generation = x$generation),
-        FUN = NROW
-      )
-    )
   }
 
   structure(
diff --git a/man/aggregate.epichains.Rd b/man/aggregate.epichains.Rd
index eaf83f6d..9fff4db7 100644
--- a/man/aggregate.epichains.Rd
+++ b/man/aggregate.epichains.Rd
@@ -4,21 +4,20 @@
 \alias{aggregate.epichains}
 \title{Aggregate cases in epichains objects according to a grouping variable}
 \usage{
-\method{aggregate}{epichains}(x, grouping_var = c("time", "generation", "both"), ...)
+\method{aggregate}{epichains}(x, grouping_var = c("time", "generation"), ...)
 }
 \arguments{
 \item{x}{An \code{\link{epichains}} object.}
 
 \item{grouping_var}{The variable to group and count over. Options include
-"time", "generation", and "both".}
+"time" and "generation".}
 
 \item{...}{Other arguments passed to aggregate.}
 }
 \value{
-If grouping_var is either "time" or "generation", a data.frame
-with cases aggregated over \code{grouping_var}; If
-\code{grouping_var = "both"}, a list of data.frames, the first being for
-cases over time, and the second being for cases over generations.
+An \verb{<epichains_aggregate_df>} object, which is basically a
+\verb{<data.frame>}. The object stores the \code{chain_type = chains_tree} and
+\code{grouping_var} attributes.
 }
 \description{
 Aggregate cases in epichains objects according to a grouping variable
diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index 98f58523..f7d51d8f 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -419,10 +419,6 @@ test_that("aggregate.epichains method returns correct objects", {
     grouping_var = "time"
   )
   aggreg_by_both <- aggregate(
-    tree_sim_raw2,
-    grouping_var = "both"
-  )
-  #' Expectations for <epichains> class inheritance
   expect_true(
     is_epichains_aggregate_df(aggreg_by_gen)
   )
@@ -430,8 +426,6 @@ test_that("aggregate.epichains method returns correct objects", {
     is_epichains_aggregate_df(aggreg_by_time)
   )
   expect_true(
-    is_epichains_aggregate_df(aggreg_by_both)
-  )
   #' Expectations for <base> class inheritance
   expect_s3_class(
     aggreg_by_gen,

From 2a675313b8f34a735f26bfaf2aecce77d3389859 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 20 Sep 2023 14:14:14 +0100
Subject: [PATCH 650/828] Remove tests for base types

---
 tests/testthat/test-epichains.R | 6 ------
 1 file changed, 6 deletions(-)

diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index f7d51d8f..bcc4b760 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -426,8 +426,6 @@ test_that("aggregate.epichains method returns correct objects", {
     is_epichains_aggregate_df(aggreg_by_time)
   )
   expect_true(
-  #' Expectations for <base> class inheritance
-  expect_s3_class(
     aggreg_by_gen,
     "data.frame"
   )
@@ -435,10 +433,6 @@ test_that("aggregate.epichains method returns correct objects", {
     aggreg_by_time,
     "data.frame"
   )
-  expect_s3_class(
-    aggreg_by_both,
-    "list"
-  )
 })
 
 test_that("aggregate method is numerically correct", {

From aea337efc8937025220bc10e2c4068ead7e3d994 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 20 Sep 2023 14:14:36 +0100
Subject: [PATCH 651/828] Fix a comment in the tests

---
 tests/testthat/test-epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index bcc4b760..95c70fdf 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -435,7 +435,7 @@ test_that("aggregate.epichains method returns correct objects", {
   )
 })
 
-test_that("aggregate method is numerically correct", {
+test_that("aggregate.epichains method is numerically correct", {
   set.seed(12)
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(

From a5255c4c459c9f4566610a1a6516452b8255adbf Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 20 Sep 2023 14:15:37 +0100
Subject: [PATCH 652/828] Add tests for epichains_aggregate_df class

---
 tests/testthat/test-epichains.R | 10 +++++-----
 1 file changed, 5 insertions(+), 5 deletions(-)

diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index 95c70fdf..dc8211ea 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -418,20 +418,20 @@ test_that("aggregate.epichains method returns correct objects", {
     tree_sim_raw2,
     grouping_var = "time"
   )
-  aggreg_by_both <- aggregate(
+  #' Expectations for <epichains_aggregate_df> class inheritance
   expect_true(
     is_epichains_aggregate_df(aggreg_by_gen)
   )
   expect_true(
     is_epichains_aggregate_df(aggreg_by_time)
   )
-  expect_true(
+  expect_named(
     aggreg_by_gen,
-    "data.frame"
+    c("generation", "cases")
   )
-  expect_s3_class(
+  expect_named(
     aggreg_by_time,
-    "data.frame"
+    c("time", "cases")
   )
 })
 

From e71f2738193cc5290ede43096a3250c9d5b39f2f Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 20 Sep 2023 14:16:05 +0100
Subject: [PATCH 653/828] Clean up documentation of aggregate method

---
 R/epichains.R              | 23 +++++++++++++----------
 man/aggregate.epichains.Rd | 22 ++++++++++++----------
 2 files changed, 25 insertions(+), 20 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 83a722c6..03145767 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -240,30 +240,33 @@ tail.epichains <- function(x, ...) {
   utils::tail(as.data.frame(x), ...)
 }
 
-#' Aggregate cases in epichains objects according to a grouping variable
+#' Aggregate cases in `<epichains>` objects by "time" or "generation"
 #'
-#' @param x An [`epichains`] object.
+#' @param x An `<epichains>` object.
 #' @param grouping_var The variable to group and count over. Options include
-#' "time", "generation", and "both".
+#' "time" and "generation".
 #' @param ... Other arguments passed to aggregate.
 #' @importFrom stats aggregate
-#' @return If grouping_var is either "time" or "generation", a data.frame
-#' with cases aggregated over `grouping_var`; If
-#' \code{grouping_var = "both"}, a list of data.frames, the first being for
-#'  cases over time, and the second being for cases over generations.
+#' @return An `<epichains_aggregate_df>` object, which is basically a
+#' `<data.frame>`. The object stores the `chain_type = chains_tree` and
+#' `grouping_var` attributes.
 #' @export
 #' @author James M. Azam
 #' @examples
 #' set.seed(123)
 #' chains <- simulate_tree(
-#'   nchains = 10, statistic = "size",
-#'   offspring_dist = "pois", stat_max = 10, serials_dist = function(x) 3,
+#'   nchains = 10,
+#'   statistic = "size",
+#'   offspring_dist = "pois",
+#'   stat_max = 10,
+#'   serials_dist = function(x) 3,
 #'   lambda = 2
 #' )
 #' chains
 #'
 #' # Aggregate cases per time
-#' aggregate(chains, grouping_var = "time")
+#' cases_per_time <- aggregate(chains, grouping_var = "time")
+#' head(cases_per_time)
 #'
 #' # Aggregate cases per generation
 #' cases_per_gen <- aggregate(chains, grouping_var = "generation")
diff --git a/man/aggregate.epichains.Rd b/man/aggregate.epichains.Rd
index 9fff4db7..27edc079 100644
--- a/man/aggregate.epichains.Rd
+++ b/man/aggregate.epichains.Rd
@@ -2,12 +2,12 @@
 % Please edit documentation in R/epichains.R
 \name{aggregate.epichains}
 \alias{aggregate.epichains}
-\title{Aggregate cases in epichains objects according to a grouping variable}
+\title{Aggregate cases in \verb{<epichains>} objects by "time" or "generation"}
 \usage{
 \method{aggregate}{epichains}(x, grouping_var = c("time", "generation"), ...)
 }
 \arguments{
-\item{x}{An \code{\link{epichains}} object.}
+\item{x}{An \verb{<epichains>} object.}
 
 \item{grouping_var}{The variable to group and count over. Options include
 "time" and "generation".}
@@ -20,25 +20,27 @@ An \verb{<epichains_aggregate_df>} object, which is basically a
 \code{grouping_var} attributes.
 }
 \description{
-Aggregate cases in epichains objects according to a grouping variable
+Aggregate cases in \verb{<epichains>} objects by "time" or "generation"
 }
 \examples{
 set.seed(123)
 chains <- simulate_tree(
-  nchains = 10, statistic = "size",
-  offspring_dist = "pois", stat_max = 10, serials_dist = function(x) 3,
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
   lambda = 2
 )
 chains
 
 # Aggregate cases per time
-aggregate(chains, grouping_var = "time")
+cases_per_time <- aggregate(chains, grouping_var = "time")
+head(cases_per_time)
 
 # Aggregate cases per generation
-aggregate(chains, grouping_var = "generation")
-
-# Aggregate cases per both time and generation
-aggregate(chains, grouping_var = "both")
+cases_per_gen <- aggregate(chains, grouping_var = "generation")
+head(cases_per_gen)
 }
 \author{
 James M. Azam

From 7de80466cd1facde458b99cbd84203c9b9880da0 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 20 Sep 2023 14:16:22 +0100
Subject: [PATCH 654/828] Make the superclass a data.frame

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 03145767..db8c89cd 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -307,7 +307,7 @@ aggregate.epichains <- function(x,
 
   structure(
     out,
-    class = c("epichains_aggregate_df", class(out)),
+    class = c("epichains_aggregate_df", "data.frame"),
     chain_type = attributes(x)$chain_type,
     rownames = NULL,
     aggregated_over = grouping_var

From 872c6e188f54bd8344de04fd3e7b406396d88941 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 20 Sep 2023 14:25:29 +0100
Subject: [PATCH 655/828] Remove example that no longer applies

---
 vignettes/epichains.Rmd | 3 ---
 1 file changed, 3 deletions(-)

diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 2dcb1f0f..38d4503d 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -124,7 +124,4 @@ aggregate(tree_from_pop_pois, "time")
 
 # aggregate by generation
 aggregate(tree_from_pop_pois, "generation")
-
-# aggregate by both time and generation
-aggregate(tree_from_pop_pois, "both")
 ```

From a9f4b00c70d2bc871b9faf1db921d1a80677784e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 20 Sep 2023 16:13:08 +0100
Subject: [PATCH 656/828] Throw an error when the time column does not exist

---
 R/epichains.R | 7 +++++++
 1 file changed, 7 insertions(+)

diff --git a/R/epichains.R b/R/epichains.R
index db8c89cd..20d1f5fc 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -290,6 +290,13 @@ aggregate.epichains <- function(x,
   grouping_var <- match.arg(grouping_var)
 
   out <- if (grouping_var == "time") {
+    if (is.null(x$time)) {
+      stop(
+        "Object must have a time column. ",
+        "To simulate time, specify `serials_dist` ",
+        "in the `simulate_tree()` setup."
+      )
+    }
     # Count the number of cases per generation
     stats::aggregate(
       list(cases = x$sim_id),

From a3cbf4a5495e2da795265f3bf31978c08a43cac6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 20 Sep 2023 16:13:46 +0100
Subject: [PATCH 657/828] Test for case when time is specified but not present
 in the data

---
 tests/testthat/test-epichains.R | 16 ++++++++++++++++
 1 file changed, 16 insertions(+)

diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index dc8211ea..878d22ea 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -435,6 +435,22 @@ test_that("aggregate.epichains method returns correct objects", {
   )
 })
 
+test_that("aggregate.epichains method throws errors", {
+  expect_error(
+    aggregate(
+      simulate_tree(
+        nchains = 10,
+        statistic = "size",
+        offspring_dist = "pois",
+        stat_max = 10,
+        lambda = 2
+      ),
+      grouping_var = "time"
+    ),
+    "Object must have a time column"
+  )
+})
+
 test_that("aggregate.epichains method is numerically correct", {
   set.seed(12)
   #' Simulate a tree of infections without serials

From c9d26ee700a0d9ed27c5c4184d21337eaa216504 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 25 Sep 2023 11:39:59 +0100
Subject: [PATCH 658/828] Add vignette with bibliography of branching process
 applications in epidemiology

---
 _pkgdown.yml                                |   1 +
 vignettes/branching_process_literature.Rmd  |  42 ++
 vignettes/branching_process_literature.json | 663 ++++++++++++++++++++
 3 files changed, 706 insertions(+)
 create mode 100644 vignettes/branching_process_literature.Rmd
 create mode 100644 vignettes/branching_process_literature.json

diff --git a/_pkgdown.yml b/_pkgdown.yml
index 1a1a21f3..de329223 100644
--- a/_pkgdown.yml
+++ b/_pkgdown.yml
@@ -12,4 +12,5 @@ articles:
 - title: Modelling guides and background
   navbar: Modelling guides and background
   contents:
+  - branching_process_literature
 
diff --git a/vignettes/branching_process_literature.Rmd b/vignettes/branching_process_literature.Rmd
new file mode 100644
index 00000000..b06dd34c
--- /dev/null
+++ b/vignettes/branching_process_literature.Rmd
@@ -0,0 +1,42 @@
+---
+title: "Literature on branching process applications"
+output:
+  bookdown::html_vignette2:
+    fig_caption: yes
+    code_folding: show
+pkgdown:
+  as_is: true
+bibliography: branching_process_literature.json
+link-citations: true
+vignette: >
+  %\VignetteEncoding{UTF-8}
+  %\VignetteIndexEntry{Literature on branching process applications}
+  %\VignetteEngine{knitr::rmarkdown}
+editor_options: 
+  chunk_output_type: console
+nocite: '@*'
+---
+
+```{r setup, include=FALSE}
+knitr::opts_chunk$set(
+  echo = TRUE,
+  message = FALSE,
+  warning = FALSE,
+  collapse = TRUE,
+  comment = "#>"
+)
+```
+
+Below, we provide a bibliography on the application of branching processes to 
+infectious disease modelling. 
+
+It is our intention to grow this list to serve as a point of reference for 
+budding modellers with an interest in the subject. 
+
+If you would like to extend this list, the easiest way would be to [file an issue](https://github.com/epiverse-trace/epichains/issues/new/choose), listing the
+new additions and we'll take it from there,
+or [submit a pull request](https://github.com/epiverse-trace/epichains/pulls) 
+with an updated version of the bibliography file found in 
+"vignettes/branching_process_literature.json".
+
+# Bibliography
diff --git a/vignettes/branching_process_literature.json b/vignettes/branching_process_literature.json
new file mode 100644
index 00000000..5828008d
--- /dev/null
+++ b/vignettes/branching_process_literature.json
@@ -0,0 +1,663 @@
+[
+	{
+		"id": "abbott2020",
+		"type": "article-journal",
+		"container-title": "Wellcome open research",
+		"note": "publisher: The Wellcome Trust",
+		"title": "The transmissibility of novel Coronavirus in the early stages of the 2019-20 outbreak in Wuhan: Exploring initial point-source exposure sizes and durations using scenario analysis",
+		"volume": "5",
+		"author": [
+			{
+				"family": "Abbott",
+				"given": "Sam"
+			},
+			{
+				"family": "Hellewell",
+				"given": "Joel"
+			},
+			{
+				"family": "Munday",
+				"given": "James"
+			},
+			{
+				"family": "Funk",
+				"given": "Sebastian"
+			},
+			{
+				"family": "group",
+				"given": "CMMID",
+				"dropping-particle": "nCoV working"
+			},
+			{
+				"literal": "others"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "blumberg2013",
+		"type": "article-journal",
+		"abstract": "Many diseases exhibit subcritical transmission (i.e. 0<R0<1) so that infections occur as self-limited 'stuttering chains'. Given an ensemble of stuttering chains, information about the number of cases in each chain can be used to infer R0, which is of crucial importance for monitoring the risk that a disease will emerge to establish endemic circulation. However, the challenge of imperfect case detection has led authors to adopt a variety of work-around measures when inferring R0, such as discarding data on isolated cases or aggregating intermediate-sized chains together. Each of these methods has the potential to introduce bias, but a quantitative comparison of these approaches has not been reported. By adapting a model based on a negative binomial offspring distribution that permits a variable degree of transmission heterogeneity, we present a unified analysis of existing R0 estimation methods. Simulation studies show that the degree of transmission heterogeneity, when improperly modeled, can significantly impact the bias of R0 estimation methods designed for imperfect observation. These studies also highlight the importance of isolated cases in assessing whether an estimation technique is consistent with observed data. Analysis of data from measles outbreaks shows that likelihood scores are highest for models that allow a flexible degree of transmission heterogeneity. Aggregating intermediate sized chains often has similar performance to analyzing a complete chain size distribution. However, truncating isolated cases is beneficial only when surveillance systems clearly favor full observation of large chains but not small chains. Meanwhile, if data on the type and proportion of cases that are unobserved were known, we demonstrate that maximum likelihood inference of R0 could be adjusted accordingly. This motivates the need for future empirical and theoretical work to quantify observation error and incorporate relevant mechanisms into stuttering chain models used to estimate transmission parameters. © 2013 Elsevier B.V.",
+		"container-title": "Epidemics",
+		"DOI": "10.1016/j.epidem.2013.05.002",
+		"ISSN": "17554365",
+		"issue": "3",
+		"note": "publisher: Elsevier B.V.\nPMID: 24021520",
+		"page": "131–145",
+		"title": "Comparing methods for estimating R0 from the size distribution of subcritical transmission chains",
+		"URL": "http://dx.doi.org/10.1016/j.epidem.2013.05.002",
+		"volume": "5",
+		"author": [
+			{
+				"family": "Blumberg",
+				"given": "S."
+			},
+			{
+				"family": "Lloyd-Smith",
+				"given": "J. O."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2013"
+				]
+			]
+		}
+	},
+	{
+		"id": "blumberg2013a",
+		"type": "article-journal",
+		"abstract": "For many infectious disease processes such as emerging zoonoses and vaccine-preventable diseases, 0<R0<1 and infections occur as self-limited stuttering transmission chains. A mechanistic understanding of transmission is essential for characterizing the risk of emerging diseases and monitoring spatio-temporal dynamics. Thus methods for inferring R0 and the degree of heterogeneity in transmission from stuttering chain data have important applications in disease surveillance and management. Previous researchers have used chain size distributions to infer R0, but estimation of the degree of individual-level variation in infectiousness (as quantified by the dispersion parameter, k) has typically required contact tracing data. Utilizing branching process theory along with a negative binomial offspring distribution, we demonstrate how maximum likelihood estimation can be applied to chain size data to infer both R0 and the dispersion parameter that characterizes heterogeneity. While the maximum likelihood value for R0 is a simple function of the average chain size, the associated confidence intervals are dependent on the inferred degree of transmission heterogeneity. As demonstrated for monkeypox data from the Democratic Republic of Congo, this impacts when a statistically significant change in R0 is detectable. In addition, by allowing for superspreading events, inference of k shifts the threshold above which a transmission chain should be considered anomalously large for a given value of R0 (thus reducing the probability of false alarms about pathogen adaptation). Our analysis of monkeypox also clarifies the various ways that imperfect observation can impact inference of transmission parameters, and highlights the need to quantitatively evaluate whether observation is likely to significantly bias results.",
+		"container-title": "PLoS Computational Biology",
+		"DOI": "10.1371/journal.pcbi.1002993",
+		"ISSN": "15537358",
+		"issue": "5",
+		"note": "PMID: 23658504",
+		"page": "1–17",
+		"title": "Inference of R0 and Transmission Heterogeneity from the Size Distribution of Stuttering Chains",
+		"volume": "9",
+		"author": [
+			{
+				"family": "Blumberg",
+				"given": "Seth"
+			},
+			{
+				"family": "Lloyd-Smith",
+				"given": "James O."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2013"
+				]
+			]
+		}
+	},
+	{
+		"id": "farrington1999",
+		"type": "article-journal",
+		"abstract": "We consider the distribution of the number of generations to extinction in subcritical branching processes, with particular emphasis on applications to the spread of infectious diseases. We derive the generation distributions for processes with Bernoulli, geometric and Poisson offspring, and discuss some of their distributional and inferential properties. We present applications to the spread of infection in highly vaccinated populations, outbreaks of enteric fever, and person-to-person transmission of human monkeypox.",
+		"container-title": "Journal of Applied Probability",
+		"DOI": "10.1239/jap/1032374633",
+		"ISSN": "00219002",
+		"issue": "3",
+		"page": "771–779",
+		"title": "The distribution of time to extinction in subcritical branching processes: Applications to outbreaks of infectious disease",
+		"volume": "36",
+		"author": [
+			{
+				"family": "Farrington",
+				"given": "C. P."
+			},
+			{
+				"family": "Grant",
+				"given": "A. D."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"1999"
+				]
+			]
+		}
+	},
+	{
+		"id": "farrington2003",
+		"type": "article-journal",
+		"abstract": "Mass vaccination programmes aim to maintain the effective reproduction number R of an infection below unity. We describe methods for monitoring the value of R using surveillance data. The models are based on branching processes in which R is identified with the offspring mean. We derive unconditional likelihoods for the offspring mean using data on outbreak size and outbreak duration. We also discuss Bayesian methods, implemented by Metropolis-Hastings sampling. We investigate by simulation the validity of the models with respect to depletion of susceptibles and under-ascertainment of cases. The methods are illustrated using surveillance data on measles in the USA.",
+		"container-title": "Biostatistics (Oxford, England)",
+		"DOI": "10.1093/biostatistics/4.2.279",
+		"ISSN": "14654644",
+		"issue": "2",
+		"page": "279–295",
+		"title": "Branching process models for surveillance of infectious diseases controlled by mass vaccination.",
+		"volume": "4",
+		"author": [
+			{
+				"family": "Farrington",
+				"given": "C. P."
+			},
+			{
+				"family": "Kanaan",
+				"given": "M. N."
+			},
+			{
+				"family": "Gay",
+				"given": "N. J."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2003"
+				]
+			]
+		}
+	},
+	{
+		"id": "jacob2010",
+		"type": "article-journal",
+		"abstract": "Branching processes are stochastic individual-based processes leading consequently to a bottom-up approach. In addition, since the state variables are random integer variables (representing population sizes), the extinction occurs at random finite time on the extinction set, thus leading to fine and realistic predictions. Starting from the simplest and well-known single-type Bienaymé-Galton-Watson branching process that was used by several authors for approximating the beginning of an epidemic, we then present a general branching model with age and population dependent individual transitions. However contrary to the classical Bienaymé-Galton-Watson or asymptotically Bienaymé-Galton-Watson setting, where the asymptotic behavior of the process, as time tends to infinity, is well understood, the asymptotic behavior of this general process is a new question. Here we give some solutions for dealing with this problem depending on whether the initial population size is large or small, and whether the disease is rare or non-rare when the initial population size is large.",
+		"container-title": "International Journal of Environmental Research and Public Health",
+		"DOI": "10.3390/ijerph7031204",
+		"ISSN": "16604601",
+		"issue": "3",
+		"page": "1186–1204",
+		"title": "Branching processes: Their role in epidemiology",
+		"volume": "7",
+		"author": [
+			{
+				"family": "Jacob",
+				"given": "Christine"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2010"
+				]
+			]
+		}
+	},
+	{
+		"id": "lloyd-smith2005",
+		"type": "article-journal",
+		"abstract": "Population-level analyses often use average quantities to describe heterogeneous systems, particularly when variation does not arise from identifiable groups. A prominent example, central to our current understanding of epidemic spread, is the basic reproductive number, R0, which is defined as the mean number of infections caused by an infected individual in a susceptible population. Population estimates of R0 can obscure considerable individual variation in infectiousness, as highlighted during the global emergence of severe acute respiratory syndrome (SARS) by numerous 'superspreading events' in which certain individuals infected unusually large numbers of secondary cases. For diseases transmitted by non-sexual direct contacts, such as SARS or smallpox, individual variation is difficult to measure empirically, and thus its importance for outbreak dynamics has been unclear. Here we present an integrated theoretical and statistical analysis of the influence of individual variation in infectiousness on disease emergence. Using contact tracing data from eight directly transmitted diseases, we show that the distribution of individual infectiousness around R0 is often highly skewed. Model predictions accounting for this variation differ sharply from average-based approaches, with disease extinction more likely and outbreaks rarer but more explosive. Using these models, we explore implications for outbreak control, showing that individual-specific control measures outperform population-wide measures. Moreover, the dramatic improvements achieved through targeted control policies emphasize the need to identify predictive correlates of higher infectiousness. Our findings indicate that superspreading is a normal feature of disease spread, and to frame ongoing discussion we propose a rigorous definition for superspreading events and a method to predict their frequency. © 2005 Nature Publishing Group.",
+		"container-title": "Nature",
+		"DOI": "10.1038/nature04153",
+		"ISSN": "14764687",
+		"issue": "7066",
+		"note": "PMID: 16292310",
+		"page": "355–359",
+		"title": "Superspreading and the effect of individual variation on disease emergence",
+		"volume": "438",
+		"author": [
+			{
+				"family": "Lloyd-Smith",
+				"given": "J. O."
+			},
+			{
+				"family": "Schreiber",
+				"given": "S. J."
+			},
+			{
+				"family": "Kopp",
+				"given": "P. E."
+			},
+			{
+				"family": "Getz",
+				"given": "W. M."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2005"
+				]
+			]
+		}
+	},
+	{
+		"id": "nishiura2012",
+		"type": "article-journal",
+		"abstract": "Use of the final size distribution of minor outbreaks for the estimation of the reproduction numbers of supercritical epidemic processes has yet to be considered. We used a branching process model to derive the final size distribution of minor outbreaks, assuming a reproduction number above unity, and applying the method to final size data for pneumonic plague. Pneumonic plague is a rare disease with only one documented major epidemic in a spatially limited setting. Because the final size distribution of a minor outbreak needs to be normalized by the probability of extinction, we assume that the dispersion parameter (k) of the negative-binomial offspring distribution is known, and examine the sensitivity of the reproduction number to variation in dispersion. Assuming a geometric offspring distribution with k=1, the reproduction number was estimated at 1.16 (95% confidence interval: 0.97-1.38). When less dispersed with k=2, the maximum likelihood estimate of the reproduction number was 1.14. These estimates agreed with those published from transmission network analysis, indicating that the human-to-human transmission potential of the pneumonic plague is not very high. Given only minor outbreaks, transmission potential is not sufficiently assessed by directly counting the number of offspring. Since the absence of a major epidemic does not guarantee a subcritical process, the proposed method allows us to conservatively regard epidemic data from minor outbreaks as supercritical, and yield estimates of threshold values above unity. © 2011.",
+		"container-title": "Journal of Theoretical Biology",
+		"DOI": "10.1016/j.jtbi.2011.10.039",
+		"ISSN": "00225193",
+		"note": "publisher: Elsevier\nPMID: 22079419",
+		"page": "48–55",
+		"title": "Estimating the transmission potential of supercritical processes based on the final size distribution of minor outbreaks",
+		"URL": "http://dx.doi.org/10.1016/j.jtbi.2011.10.039",
+		"volume": "294",
+		"author": [
+			{
+				"family": "Nishiura",
+				"given": "Hiroshi"
+			},
+			{
+				"family": "Yan",
+				"given": "Ping"
+			},
+			{
+				"family": "Sleeman",
+				"given": "Candace K."
+			},
+			{
+				"family": "Mode",
+				"given": "Charles J."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2012"
+				]
+			]
+		}
+	},
+	{
+		"id": "pearson2020",
+		"type": "article-journal",
+		"abstract": "For 45 African countries/territories already reporting COVID-19 cases before 23 March 2020, we estimate the dates of reporting 1,000 and 10,000 cases. Assuming early epidemic trends without interventions, all 45 were likely to exceed 1,000 confirmed cases by the end of April 2020, with most exceeding 10,000 a few weeks later.",
+		"container-title": "Eurosurveillance",
+		"DOI": "10.2807/1560-7917.ES.2020.25.18.2000543",
+		"ISSN": "15607917",
+		"issue": "18",
+		"note": "publisher: European Centre for Disease Prevention and Control (ECDC)\nPMID: 32400361",
+		"page": "1–6",
+		"title": "Projected early spread of COVID-19 in Africa through 1 June 2020",
+		"URL": "http://dx.doi.org/10.2807/1560-7917.ES.2020.25.18.2000543",
+		"volume": "25",
+		"author": [
+			{
+				"family": "Pearson",
+				"given": "Carl A.B."
+			},
+			{
+				"family": "Schalkwyk",
+				"given": "Cari",
+				"non-dropping-particle": "van"
+			},
+			{
+				"family": "Foss",
+				"given": "Anna M."
+			},
+			{
+				"family": "O'Reilly",
+				"given": "Kathleen M."
+			},
+			{
+				"family": "Pulliam",
+				"given": "Juliet R.C."
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"2020"
+				]
+			]
+		}
+	},
+	{
+		"id": "becker1977",
+		"type": "article-journal",
+		"container-title": "Biometrics",
+		"ISSN": "0006-341X",
+		"issue": "3",
+		"note": "publisher: JSTOR",
+		"page": "515–522",
+		"title": "Estimation for discrete time branching processes with application to epidemics",
+		"volume": "33",
+		"author": [
+			{
+				"family": "Becker",
+				"given": "Niels"
+			},
+			{
+				"family": "Society",
+				"given": "International Biometric"
+			}
+		],
+		"issued": {
+			"date-parts": [
+				[
+					"1977"
+				]
+			]
+		}
+	},
+    {
+        "id": "blumberg2013",
+        "type": "article-journal",
+        "abstract": "Many diseases exhibit subcritical transmission (i.e. 0<R0<1) so that infections occur as self-limited ‘stuttering chains’. Given an ensemble of stuttering chains, information about the number of cases in each chain can be used to infer R0, which is of crucial importance for monitoring the risk that a disease will emerge to establish endemic circulation. However, the challenge of imperfect case detection has led authors to adopt a variety of work-around measures when inferring R0, such as discarding data on isolated cases or aggregating intermediate-sized chains together. Each of these methods has the potential to introduce bias, but a quantitative comparison of these approaches has not been reported. By adapting a model based on a negative binomial offspring distribution that permits a variable degree of transmission heterogeneity, we present a unified analysis of existing R0 estimation methods. Simulation studies show that the degree of transmission heterogeneity, when improperly modeled, can significantly impact the bias of R0 estimation methods designed for imperfect observation. These studies also highlight the importance of isolated cases in assessing whether an estimation technique is consistent with observed data. Analysis of data from measles outbreaks shows that likelihood scores are highest for models that allow a flexible degree of transmission heterogeneity. Aggregating intermediate sized chains often has similar performance to analyzing a complete chain size distribution. However, truncating isolated cases is beneficial only when surveillance systems clearly favor full observation of large chains but not small chains. Meanwhile, if data on the type and proportion of cases that are unobserved were known, we demonstrate that maximum likelihood inference of R0 could be adjusted accordingly. This motivates the need for future empirical and theoretical work to quantify observation error and incorporate relevant mechanisms into stuttering chain models used to estimate transmission parameters.",
+        "container-title": "Epidemics",
+        "DOI": "10.1016/j.epidem.2013.05.002",
+        "ISSN": "1755-4365",
+        "issue": 3,
+        "journalAbbreviation": "Epidemics",
+        "language": "en",
+        "page": "131-145",
+        "source": "ScienceDirect",
+        "title": "Comparing methods for estimating R0 from the size distribution of subcritical transmission chains",
+        "URL": "https://www.sciencedirect.com/science/article/pii/S1755436513000236",
+        "volume": 5,
+        "author": [
+            {
+                "family": "Blumberg",
+                "given": "S."
+            },
+            {
+                "family": "Lloyd-Smith",
+                "given": "J. O."
+            }
+        ],
+        "accessed": {
+            "date-parts": [
+                [
+                    2023,
+                    5,
+                    23
+                ]
+            ]
+        },
+        "issued": {
+            "date-parts": [
+                [
+                    2013,
+                    9,
+                    1
+                ]
+            ]
+        }
+    },
+    {
+        "id": "kucharski2016",
+        "type": "article-journal",
+        "abstract": "Using an Ebola virus disease transmission model, we found that addition of ring vaccination at the outset of the West Africa epidemic might not have led to containment of this disease. However, in later stages of the epidemic or in outbreaks with less intense transmission or more effective control, this strategy could help eliminate the disease.",
+        "container-title": "Emerging Infectious Diseases",
+        "DOI": "10.3201/eid2201.151410",
+        "ISSN": "1080-6040",
+        "issue": 1,
+        "journalAbbreviation": "Emerg Infect Dis",
+        "note": "PMID: 26691346\nPMCID: PMC4696719",
+        "page": "105-108",
+        "source": "PubMed Central",
+        "title": "Effectiveness of Ring Vaccination as Control Strategy for Ebola Virus Disease",
+        "URL": "https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4696719/",
+        "volume": 22,
+        "author": [
+            {
+                "family": "Kucharski",
+                "given": "Adam J."
+            },
+            {
+                "family": "Eggo",
+                "given": "Rosalind M."
+            },
+            {
+                "family": "Watson",
+                "given": "Conall H."
+            },
+            {
+                "family": "Camacho",
+                "given": "Anton"
+            },
+            {
+                "family": "Funk",
+                "given": "Sebastian"
+            },
+            {
+                "family": "Edmunds",
+                "given": "W. John"
+            }
+        ],
+        "accessed": {
+            "date-parts": [
+                [
+                    2023,
+                    5,
+                    23
+                ]
+            ]
+        },
+        "issued": {
+            "date-parts": [
+                [
+                    2016,
+                    1
+                ]
+            ]
+        }
+    },
+    {
+        "id": "kucharski2015",
+        "type": "article-journal",
+        "abstract": "The transmission potential of a novel infection depends on both the inherent transmissibility of a pathogen, and the level of susceptibility in the host population. However, distinguishing between these pathogen- and population-specific properties typically requires detailed serological studies, which are rarely available in the early stages of an outbreak. Using a simple transmission model that incorporates age-stratified social mixing patterns, we present a novel method for characterizing the transmission potential of subcritical infections, which have effective reproduction number R<1, from readily available data on the size of outbreaks. We show that the model can identify the extent to which outbreaks are driven by inherent pathogen transmissibility and pre-existing population immunity, and can generate unbiased estimates of the effective reproduction number. Applying the method to real-life infections, we obtained accurate estimates for the degree of age-specific immunity against monkeypox, influenza A(H5N1) and A(H7N9), and refined existing estimates of the reproduction number. Our results also suggest minimal pre-existing immunity to MERS-CoV in humans. The approach we describe can therefore provide crucial information about novel infections before serological surveys and other detailed analyses are available. The methods would also be applicable to data stratified by factors such as profession or location, which would make it possible to measure the transmission potential of emerging infections in a wide range of settings.",
+        "container-title": "PLOS Computational Biology",
+        "DOI": "10.1371/journal.pcbi.1004154",
+        "ISSN": "1553-7358",
+        "issue": 4,
+        "journalAbbreviation": "PLOS Computational Biology",
+        "language": "en",
+        "note": "publisher: Public Library of Science",
+        "page": "e1004154",
+        "source": "PLoS Journals",
+        "title": "Characterizing the Transmission Potential of Zoonotic Infections from Minor Outbreaks",
+        "URL": "https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1004154",
+        "volume": 11,
+        "author": [
+            {
+                "family": "Kucharski",
+                "given": "Adam J."
+            },
+            {
+                "family": "Edmunds",
+                "given": "W. John"
+            }
+        ],
+        "accessed": {
+            "date-parts": [
+                [
+                    2023,
+                    5,
+                    23
+                ]
+            ]
+        },
+        "issued": {
+            "date-parts": [
+                [
+                    2015,
+                    4,
+                    10
+                ]
+            ]
+        }
+    },
+    {
+        "id": "hellewell2020",
+        "type": "article-journal",
+        "container-title": "The Lancet Global Health",
+        "DOI": "10.1016/S2214-109X(20)30074-7",
+        "ISSN": "2214-109X",
+        "issue": 4,
+        "journalAbbreviation": "The Lancet Global Health",
+        "language": "English",
+        "note": "publisher: Elsevier\nPMID: 32119825",
+        "page": "e488-e496",
+        "source": "www.thelancet.com",
+        "title": "Feasibility of controlling COVID-19 outbreaks by isolation of cases and contacts",
+        "URL": "https://www.thelancet.com/article/S2214-109X(20)30074-7/fulltext",
+        "volume": 8,
+        "author": [
+            {
+                "family": "Hellewell",
+                "given": "Joel"
+            },
+            {
+                "family": "Abbott",
+                "given": "Sam"
+            },
+            {
+                "family": "Gimma",
+                "given": "Amy"
+            },
+            {
+                "family": "Bosse",
+                "given": "Nikos I."
+            },
+            {
+                "family": "Jarvis",
+                "given": "Christopher I."
+            },
+            {
+                "family": "Russell",
+                "given": "Timothy W."
+            },
+            {
+                "family": "Munday",
+                "given": "James D."
+            },
+            {
+                "family": "Kucharski",
+                "given": "Adam J."
+            },
+            {
+                "family": "Edmunds",
+                "given": "W. John"
+            },
+            {
+                "family": "Sun",
+                "given": "Fiona"
+            },
+            {
+                "family": "Flasche",
+                "given": "Stefan"
+            },
+            {
+                "family": "Quilty",
+                "given": "Billy J."
+            },
+            {
+                "family": "Davies",
+                "given": "Nicholas"
+            },
+            {
+                "family": "Liu",
+                "given": "Yang"
+            },
+            {
+                "family": "Clifford",
+                "given": "Samuel"
+            },
+            {
+                "family": "Klepac",
+                "given": "Petra"
+            },
+            {
+                "family": "Jit",
+                "given": "Mark"
+            },
+            {
+                "family": "Diamond",
+                "given": "Charlie"
+            },
+            {
+                "family": "Gibbs",
+                "given": "Hamish"
+            },
+            {
+                "family": "Zandvoort",
+                "given": "Kevin",
+                "dropping-particle": "van"
+            },
+            {
+                "family": "Funk",
+                "given": "Sebastian"
+            },
+            {
+                "family": "Eggo",
+                "given": "Rosalind M."
+            }
+        ],
+        "accessed": {
+            "date-parts": [
+                [
+                    2023,
+                    5,
+                    23
+                ]
+            ]
+        },
+        "issued": {
+            "date-parts": [
+                [
+                    2020,
+                    4,
+                    1
+                ]
+            ]
+        }
+    },
+    {
+        "id": "ratnayake2022",
+        "DOI": "10.1371/journal.pntd.0010163",
+        "ISSN": "1935-2735",
+        "URL": "http://dx.doi.org/10.1371/journal.pntd.0010163",
+        "author": [
+            {
+                "family": "Ratnayake",
+                "given": "Ruwan"
+            },
+            {
+                "family": "Checchi",
+                "given": "Francesco"
+            },
+            {
+                "family": "Jarvis",
+                "given": "Christopher I."
+            },
+            {
+                "family": "Edmunds",
+                "given": "W. John"
+            },
+            {
+                "family": "Finger",
+                "given": "Flavio"
+            }
+        ],
+        "container-title": "PLOS Neglected Tropical Diseases",
+        "editor": [
+            {
+                "family": "Yang",
+                "given": "Ruifu"
+            }
+        ],
+        "issue": "2",
+        "issued": {
+            "date-parts": [
+                [
+                    2022,
+                    2
+                ]
+            ]
+        },
+        "page": "e0010163",
+        "publisher": "Public Library of Science (PLoS)",
+        "title": "Inference is bliss: Simulation for power estimation for an observational study of a cholera outbreak intervention",
+        "title-short": "Inference is bliss",
+        "type": "article-journal",
+        "volume": "16"
+    }
+
+]

From a7fe2c3c9d52afa05b326dbfbf781a378282b7cd Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 25 Sep 2023 11:44:41 +0100
Subject: [PATCH 659/828] Add vignette with theoretical underpinnings

---
 _pkgdown.yml                         |   1 +
 vignettes/theoretical_background.Rmd | 148 +++++++++++++++++++++++++++
 2 files changed, 149 insertions(+)
 create mode 100644 vignettes/theoretical_background.Rmd

diff --git a/_pkgdown.yml b/_pkgdown.yml
index de329223..30fc41fb 100644
--- a/_pkgdown.yml
+++ b/_pkgdown.yml
@@ -13,4 +13,5 @@ articles:
   navbar: Modelling guides and background
   contents:
   - branching_process_literature
+  - theoretical_background
 
diff --git a/vignettes/theoretical_background.Rmd b/vignettes/theoretical_background.Rmd
new file mode 100644
index 00000000..2cf551e2
--- /dev/null
+++ b/vignettes/theoretical_background.Rmd
@@ -0,0 +1,148 @@
+---
+title: "Theoretical background for bpmodels"
+author: "Sebastian Funk and James Azam"
+output:
+  bookdown::html_vignette2:
+    fig_caption: yes
+    code_folding: show
+pkgdown:
+  as_is: true
+bibliography: references.json
+link-citations: true
+vignette: >
+  %\VignetteEncoding{UTF-8}
+  %\VignetteIndexEntry{Theoretical background for bpmodels}
+  %\VignetteEngine{knitr::rmarkdown}
+editor_options:
+  chunk_output_type: console
+---
+
+```{r setup, include=FALSE}
+knitr::opts_chunk$set(
+  echo = TRUE,
+  message = FALSE,
+  warning = FALSE,
+  collapse = TRUE,
+  comment = "#>"
+)
+```
+
+_bpmodels_ provides methods to analyse and simulate the size and length of branching processes with an arbitrary offspring distribution.
+In this vignette we lay out the mathematical concepts behind the functionality available
+in the package.
+
+# Branching processes
+
+[Branching processes](https://en.wikipedia.org/wiki/Branching_process) are a class of models that are used to model the growth of populations. They assume that each member of the population produces a number of offspring, $Z$, that is a random variable with probability mass function $p(Z = z | \theta)$, called the _offspring distribution_.
+Their use has a long history in epidemiology, where the population is interpreted as a pathogen, and the offspring as new hosts that it infects [@farrington2003].
+Below we will call these infected individuals _cases_ but the methods could be applied in other contexts where branching processes are to be used.
+
+# Simulation
+
+To simulate from a branching process, we start with a single case and proceed in discrete steps or generations, drawing from the offspring distribution $p(Z=z | \theta)$ to generate new cases from each case.
+
+Given an infector $i$ and infectee $j$, we can additionally assign them a distribution of times $T$ that approximates when the infection event occurred. If we define $T$ as a random variable with distribution $f(T = t; \theta)$ we can assign each case $j$ a time $t_{j}$ which, if case $j$ has been affected by case $i$ is given by $f(t_{j} - t_{i} | \theta)$.
+If we identify the timing of cases by the time of their symptom onset this is the [serial interval](https://en.wikipedia.org/wiki/Serial_interval), but depending on case definitions this could be another interval.
+
+## Summary statistics
+
+Branching process simulations end when they have gone extinct, that is, no more offspring are being produced, or because of some stopping criterion. To summarise the simulations, we either study the _size_ or _length_ of the resulting _chain_ of cases.
+The size $S$ of a chain is the number of cases that have occurred over the course of the simulation including the initial case so that $S \geq 1$.
+The length $L$ of a chain is the number of generations that have been simulated including the initial case so that $L \geq 1$.
+
+# Inference
+
+By characterising a chains of cases by their size of length we can conduct inference to learn about the underlying parameters $\theta$ from observation of chain sizes or chain lengths [@blumberg2013].
+In general this is only possible for _subcritical_ branching processes, i.e. ones where the mean number of offspring is less than 1, as otherwise the branching process could grow forever.
+However, we can expand the theory to _supercritical_ branching processes, i.e. ones where the mean number of offspring is greater than 1, by defining a cutoff of chain size or length beyond which we treat the chain as if it had infinite size or length, respectively.
+
+## Size and length distributions for some offspring distributions
+
+We show the equations for the size and length distributions for some offspring distributions where they can be derived analytically:
+[Poisson](https://en.wikipedia.org/wiki/Poisson_distribution),
+[negative binomial](https://en.wikipedia.org/wiki/Negative_binomial_distribution),
+[geometric](https://en.wikipedia.org/wiki/Geometric_distribution),
+and a [gamma](https://en.wikipedia.org/wiki/Gamma_distribution)-[Borel](https://en.wikipedia.org/wiki/Borel_distribution) mixture.
+
+### Negative binomial and special cases
+
+If the offspring distribution is a Poisson distribution, we can interpret its rate parameter $\lambda$ as the basic reproduction number $R_{0}$ of the pathogen.
+In the more general case where the offspring definition can be _overdispersed_ leading to _superspreading_ we can use a negative binomial offspring distribution with mean $\mu$ and overdispersion $k$ [@lloyd-smith2005, @blumberg2013].
+In that case, the mean parameter $\mu$ is interpreted as the basic reproduction number $R_0$ of the pathogen.
+The negative binomial distribution arises from a Poisson-gamma mixture and thus a branching process with negative binomial distributed offspring can be interpreted as one with Poisson distributed offspring where the basic reproduction number $R_0$ itself varies according to a gamma distribution.
+The amount of variation in $R_0$ is then interpreted as individual-level variation in transmission representing overdispersion or superspreading, and the degree to which this happens is given by the overdispersion parameter $k$.
+
+#### Size distributions
+
+The probability $p$ of a chain of size $S$ given $R_0$ and $k$ in a branching process with negative binomial offspring distribution is given in Eq. 9 of @blumberg2013
+
+$$
+p(S|R_0, k) = \frac{\Gamma(kS + S - 1)}{\Gamma(kS)\Gamma(S + 1)} \frac{\left(\frac{R_0}{k}\right)^{S - 1}}{\left( 1 + \frac{R_0}{k} \right)^{kS + S - 1}}
+$$
+
+where $\Gamma$ is the gamma function.
+In order to estimate $S$ from a given $R_0$ and $k$ we can define a likelihood function $L(S) = p(S|R_0, k)$.
+The corresponding log-likelihood is
+
+\begin{align}
+\mathrm{LL}(S) = &\log\Gamma(kS + S  - 1) - \left(\log\Gamma(kS) + \log\Gamma(S - 1) \right) \\
+& + (S-1) \log \frac{R_0}{k} - (SR_0 + (S - 1)) \log \left(1 + \frac{R_0}{k}\right)
+\end{align}
+
+The log-likelihood for Poisson distributed offspring follows from this where $k$ tends to infinity (corresponding to Eq. 2.2 in @farrington2003)
+
+$$
+\mathrm{LL}(S) = (S - 1) \log R_0 - S R_0 + (S - 2) \log S - \log\Gamma(S)
+$$
+
+In all cases the point estimate for the basic reproduction number $\hat{R_0}$ is related to the mean chain size $\bar{S}$ by
+
+$$
+\hat{R_0} = 1 - \frac{1}{\bar{S}}
+$$
+
+#### Length distributions
+
+The cumulative mass function $F(L)$ of observing a chain of length $L$ when offspring is Poisson distributed is given by Eq. (2.5) in @farrington2003 (there called "outbreak duration"):
+
+$$
+F(L) = e^{-R_0} E_L \left( e^{R_0 e^{-R_0} } \right)
+$$
+
+where $E_L(x)$ is the iterated exponential function, $E_0(x) = 1$, $E_{L + 1}(x) = x^{E_L(x)}$.
+
+For geometric distributed offspring (corresponding to a negative Binomial with $k=1$) this function is given by
+
+$$
+F(L) = \frac{ 1- R_0^{L + 1} } {1 - R_0^{L - 2}}
+$$
+
+In both cases $f(L)$ denotes cumulative mass functions and therefore the probability of observing a chain of length $L$ is therefore $f(L) - f(L - 1)$.
+
+### Gamma-Borel mixture
+
+The probability distribution of outbreak sizes from a branching process with a Poisson offspring distribution (Eq. 2.2 in @farrington2003) is a special case of the [Borel-Tanner distribution](https://en.wikipedia.org/wiki/Borel_distribution#Borel%E2%80%93Tanner_distribution) starting with 1 individual.
+An alternative to the negative binomial offspring distribution which represents a Poisson-gamma mixture is a Borel-gamma mixture.
+This could represent situations where the variation is not at the _individual level_ but at the _chain level_, i.e. transmission chains is homogeneous but there is heterogeneity between chains.
+In that case, it can be shown that the resulting log-likelihood of chain sizes is
+
+\begin{align}
+\mathrm{LL}(S) = &\log\Gamma(k + S - 1) - \left(\log\Gamma(k) + \log\Gamma(S + 1) \right) \\
+& + (S-1) \log S - k \log \left(S + \frac{R_0}{k}\right)
+\end{align}
+
+## Numerical approximations of chain size and length distributions
+
+When analytic likelihoods are not available a numerical approximation is used to derive the distributions.
+In order to do this, the simulation functionality is be used to generate $n$ simulated chains and the value of the cumulative mass function $P(S|\theta)$ at the observed $S$ approximated by the empirical cumulative distribution function:
+$$
+P(S|\theta) \approx \sum_i \mathbf{1}(x_i <= S)
+$$
+where $\mathbf{1}$ is the indicator function and $x_i$ the i-th observed chain size (or length, if the interest is in $L$).
+In order to improve this approximation a linear approximation is applied to the values of the empirical distribution function (at the expense of normalisation to 1).
+
+The (unnormalised) probability of observing $S$ is then given by
+$$
+p(S|\theta) = P(S|\theta) - P(S - 1|\theta)
+$$
+and a an equivalent relationship is used for $L$.

From 78955710c5893f6e817c6d7129377171c0f37b23 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 20 Sep 2023 21:59:33 +0100
Subject: [PATCH 660/828] Improve documentation of simulate family of functions

---
 R/simulate.r                  | 107 ++++++++++++++++++++++------------
 man/simulate_summary.Rd       |  13 +++++
 man/simulate_tree.Rd          |  28 +++++----
 man/simulate_tree_from_pop.Rd |  79 +++++++++++++++----------
 4 files changed, 149 insertions(+), 78 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 9cdbfd02..60f47e88 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -15,23 +15,23 @@
 #' Defaults to `Inf`.
 #' @param serials_dist The serial interval distribution function; the name
 #' of a user-defined named or anonymous function with only one argument `n`,
-#' representing the number of serial intervals to generate.
+#' representing the number of serial intervals to generate. See details.
 #' @param t0 Start time (if serial interval is given); either a single value
 #' or a vector of same length as `nchains` (number of simulations) with
 #' initial times. Defaults to 0.
 #' @param tf End time (if serial interval is given).
 #' @param ... Parameters of the offspring distribution as required by R.
-#' @return an `epichains` object, which is basically a `data.frame` with
+#' @return An `<epichains>` object, which is basically a `<data.frame>` with
 #' columns `chain_id` (chain ID), `sim_id` (a unique ID within each simulation
-#' for each individual element of the chain), `ancestor`
-#' (the ID of the ancestor of each element), `generation`, and
+#' for each individual), `ancestor`
+#' (the ID of the ancestor of each individual), `generation`, and
 #' `time` (of infection)
 #' @author James M. Azam, Sebastian Funk
 #' @export
 #' @details
 #' `simulate_tree()` simulates a branching process of the form:
 #' WIP
-#' # The serial interval (`serials_dist`):
+#' # The serial interval (`serials_dist`)
 #'
 #' ## Assumptions/disambiguation
 #'
@@ -46,7 +46,7 @@
 #'
 #' See References below for some literature on the subject.
 #'
-#' ## Specifying `serials_dist` in `simulate_tree()`
+#' ## Specifying `serials_dist`
 #'
 #' `serials_dist` must be specified as a named or
 #' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R) # nolint
@@ -67,17 +67,23 @@
 #' in the `simulate_tree()` call like so
 #' \code{simulate_tree(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
 #' where `...` are the other arguments to `simulate_tree()`.
-#' @seealso [simulate_summary()] for simulating the transmission chains
-#' statistic without the tree of infections.
+#' @seealso [simulate_summary()] for simulating transmission chains
+#' statistic without the full tree information.
+#' @seealso [simulate_tree_from_pop()] for simulating transmission chains
+#' from an initial susceptible population with initial immunity,
+#' returning the full tree information ("sim_id",
+#' "ancestor", "generation", and "time").
 #' @examples
 #' set.seed(123)
 #' chains <- simulate_tree(
-#'   nchains = 10, statistic = "size",
-#'   offspring_dist = "pois", stat_max = 10, serials_dist = function(x) 3,
+#'   nchains = 10,
+#'   statistic = "size",
+#'   offspring_dist = "pois",
+#'   stat_max = 10,
+#'   serials_dist = function(x) 3,
 #'   lambda = 2
 #' )
 #' @references
-#'
 #' Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
 #' between serial interval, infectiousness profile and generation time.
 #' J R Soc Interface. 2021 Jan;18(174):20200756.
@@ -222,6 +228,14 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 #' @inheritParams simulate_tree
 #' @param stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to `Inf`.
+#' @author James M. Azam, Sebastian Funk
+#' @seealso [simulate_tree()] for simulating the transmission chains,
+#' returning the full tree information ("sim_id", "chain_id",
+#' "ancestor", "generation", and optionally, "time").
+#' @seealso [simulate_tree_from_pop()] for simulating transmission chains
+#' from an initial susceptible population with initial immunity,
+#' returning the full tree information ("sim_id",
+#' "ancestor", "generation", and "time").
 #' @examples
 #' simulate_summary(
 #'   nchains = 10, statistic = "size", offspring_dist = "pois",
@@ -289,58 +303,75 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' Simulate a tree of infections from an initial susceptible population
 #' with initial immunity
 #'
-#' @param pop The susceptible population.
 #' @inheritParams simulate_tree
+#' @param pop The susceptible population size.
+#' @param offspring_dist Offspring distribution: a character string
+#' corresponding to the R distribution function (e.g., "pois" for Poisson,
+#' where \code{\link{rpois}} is the R function to generate Poisson random
+#' numbers). Only supports "pois" and "nbinom".
 #' @param offspring_mean The average number of secondary cases for each case.
-#' Same as R0.
+#' Same as \code{R0}.
 #' @param offspring_disp The dispersion parameter of the number of
 #' secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
 #' avoid division by 0 when calculating the size. See details and
 #'  \code{?rnbinom} for details on the parameterisation in Ecology.
-#' @param serials_dist The serial interval. A function that takes one
-#' parameter (`n`), the number of serial intervals to randomly sample. Value
-#' must be >= 0.
 #' @param initial_immune The number of initial immunes in the population.
+#' Must be less than `pop` - 1.
 #' @param t0 Start time; Defaults to 0.
 #' @param tf End time; Defaults to `Inf`.
-#' @return a data frame with columns `time`, `id` (a unique ID for each
-#' individual element of the chain), `ancestor` (the ID of the ancestor
-#' of each element), and `generation`.
+#' @return An `<epichains>` object, which is basically a `<data.frame>` with
+#' columns `sim_id` (a unique ID within each simulation for each individual
+#' of the chain), `ancestor` (the ID of the ancestor of each individual),
+#' `generation`, and `time` (of infection).
 #' @details
 #'
 #' # Offspring distributions
-#' Currently only "pois" & "nbinom" are supported. Internally truncated
-#' distributions are used to avoid infecting more people than susceptibles
-#' available.
+#' Currently, `offspring_dist` only supports "pois" & "nbinom".
+#' Internally, the respective truncated poisson and negative binomial
+#' distributions are used to avoid the situation where there are more cases
+#' than susceptibles at any point.
 #'
-#' The poisson model is parametrised so that:
+#' The poisson model has mean, lambda, parametrised as:
+#' \deqn{lambda = \dfrac{offspring\_mean \times (pop -
+#' initial\_immune - 1)}{pop}}
 #'
-#' lamda = offspring_mean * pop - initial_immune / pop
+#' The negative binomial model, has mean, mu, parametrised as:
+#' \deqn{mu = \dfrac{offspring\_mean \times (pop - initial\_immune - 1)}{pop},}
+#' and dispersion, size, parametrised as:
+#' \deqn{size = \dfrac{mu}{offspring\_disp - 1}.}
+#' This is why `offspring_disp` must be greater than 1.
 #'
-#' The negative binomial model is parametrised as:
+#' # Specifying `serials_dist`
+#' See the details section of [`simulate_tree()`] for details on how to specify
+#' `serials_dist`.
 #'
-#' mu = offspring_mean * pop - initial immune / pop, and
-#' size = mu / (offspring_disp - 1). This is why offspring_disp must be greater
-#' than 1.
-#'
-#' simulate_tree_from_pop() has a couple of key different from simulate_tree():
+#' `simulate_tree_from_pop()` has a couple of key differences from
+#' `simulate_tree()`:
 #'  * the maximal chain statistic is limited by `pop` instead of
 #'  `stat_max` (in `simulate_tree()`),
-#'  * it can only handle implemented offspring distributions ("pois" and
-#' "nbinom").
-#' @author Flavio Finger
-#' @author James M. Azam
+#'  * `offspring_dist` can only handle "pois" and "nbinom".
+#' @author Flavio Finger, James M. Azam, Sebastian Funk
+#' @seealso [simulate_tree()] for simulating the transmission chains,
+#' returning the full tree information ("sim_id", "chain_id",
+#' "ancestor", "generation", and optionally, "time").
+#' @seealso [simulate_summary()] for simulating the transmission chains
+#' statistic (size or length) without the full tree information.
 #' @examples
 #' # Simulate with poisson offspring
 #' simulate_tree_from_pop(
-#'   pop = 100, offspring_dist = "pois",
-#'   offspring_mean = 0.5, serials_dist = function(x) 3
+#'   pop = 100,
+#'   offspring_dist = "pois",
+#'   offspring_mean = 0.5,
+#'   serials_dist = function(x) 3
 #' )
 #'
 #' # Simulate with negative binomial offspring
 #' simulate_tree_from_pop(
-#'   pop = 100, offspring_dist = "nbinom",
-#'   offspring_mean = 0.5, offspring_disp = 1.1, serials_dist = function(x) 3
+#'   pop = 100,
+#'   offspring_dist = "nbinom",
+#'   offspring_mean = 0.5,
+#'   offspring_disp = 1.1,
+#'   serials_dist = function(x) 3
 #' )
 #' @export
 simulate_tree_from_pop <- function(pop,
diff --git a/man/simulate_summary.Rd b/man/simulate_summary.Rd
index 00ea66c2..b16e48b5 100644
--- a/man/simulate_summary.Rd
+++ b/man/simulate_summary.Rd
@@ -40,3 +40,16 @@ simulate_summary(
   stat_max = 10, lambda = 2
 )
 }
+\seealso{
+\code{\link[=simulate_tree]{simulate_tree()}} for simulating the transmission chains,
+returning the full tree information ("sim_id", "chain_id",
+"ancestor", "generation", and optionally, "time").
+
+\code{\link[=simulate_tree_from_pop]{simulate_tree_from_pop()}} for simulating transmission chains
+from an initial susceptible population with initial immunity,
+returning the full tree information ("sim_id",
+"ancestor", "generation", and "time").
+}
+\author{
+James M. Azam, Sebastian Funk
+}
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index 1d2239cc..d9983b9d 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -35,7 +35,7 @@ Defaults to \code{Inf}.}
 
 \item{serials_dist}{The serial interval distribution function; the name
 of a user-defined named or anonymous function with only one argument \code{n},
-representing the number of serial intervals to generate.}
+representing the number of serial intervals to generate. See details.}
 
 \item{t0}{Start time (if serial interval is given); either a single value
 or a vector of same length as \code{nchains} (number of simulations) with
@@ -46,10 +46,10 @@ initial times. Defaults to 0.}
 \item{...}{Parameters of the offspring distribution as required by R.}
 }
 \value{
-an \code{epichains} object, which is basically a \code{data.frame} with
+An \verb{<epichains>} object, which is basically a \verb{<data.frame>} with
 columns \code{chain_id} (chain ID), \code{sim_id} (a unique ID within each simulation
-for each individual element of the chain), \code{ancestor}
-(the ID of the ancestor of each element), \code{generation}, and
+for each individual), \code{ancestor}
+(the ID of the ancestor of each individual), \code{generation}, and
 \code{time} (of infection)
 }
 \description{
@@ -59,7 +59,7 @@ Simulate a tree of infections with a serial and offspring distributions
 \code{simulate_tree()} simulates a branching process of the form:
 WIP
 }
-\section{The serial interval (\code{serials_dist}):}{
+\section{The serial interval (\code{serials_dist})}{
 \subsection{Assumptions/disambiguation}{
 
 In epidemiology, the generation interval is the duration between successive
@@ -74,7 +74,7 @@ generation interval, that is, the time between successive cases.
 See References below for some literature on the subject.
 }
 
-\subsection{Specifying \code{serials_dist} in \code{simulate_tree()}}{
+\subsection{Specifying \code{serials_dist}}{
 
 \code{serials_dist} must be specified as a named or
 \href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function} # nolint
@@ -101,8 +101,11 @@ where \code{...} are the other arguments to \code{simulate_tree()}.
 \examples{
 set.seed(123)
 chains <- simulate_tree(
-  nchains = 10, statistic = "size",
-  offspring_dist = "pois", stat_max = 10, serials_dist = function(x) 3,
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
   lambda = 2
 )
 }
@@ -118,8 +121,13 @@ infectious disease. Am J Epidemiol. 2003 Dec 1;158(11):1039-47.
 doi: 10.1093/aje/kwg251. PMID: 14630599.
 }
 \seealso{
-\code{\link[=simulate_summary]{simulate_summary()}} for simulating the transmission chains
-statistic without the tree of infections.
+\code{\link[=simulate_summary]{simulate_summary()}} for simulating transmission chains
+statistic without the full tree information.
+
+\code{\link[=simulate_tree_from_pop]{simulate_tree_from_pop()}} for simulating transmission chains
+from an initial susceptible population with initial immunity,
+returning the full tree information ("sim_id",
+"ancestor", "generation", and "time").
 }
 \author{
 James M. Azam, Sebastian Funk
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 57c5e83a..6a2e3a45 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -17,79 +17,98 @@ simulate_tree_from_pop(
 )
 }
 \arguments{
-\item{pop}{The susceptible population.}
+\item{pop}{The susceptible population size.}
 
 \item{offspring_dist}{Offspring distribution: a character string
 corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
-numbers).}
+numbers). Only supports "pois" and "nbinom".}
 
 \item{offspring_mean}{The average number of secondary cases for each case.
-Same as R0.}
+Same as \code{R0}.}
 
 \item{offspring_disp}{The dispersion parameter of the number of
 secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
 avoid division by 0 when calculating the size. See details and
 \code{?rnbinom} for details on the parameterisation in Ecology.}
 
-\item{serials_dist}{The serial interval. A function that takes one
-parameter (\code{n}), the number of serial intervals to randomly sample. Value
-must be >= 0.}
+\item{serials_dist}{The serial interval distribution function; the name
+of a user-defined named or anonymous function with only one argument \code{n},
+representing the number of serial intervals to generate. See details.}
 
-\item{initial_immune}{The number of initial immunes in the population.}
+\item{initial_immune}{The number of initial immunes in the population.
+Must be less than \code{pop} - 1.}
 
 \item{t0}{Start time; Defaults to 0.}
 
 \item{tf}{End time; Defaults to \code{Inf}.}
 }
 \value{
-a data frame with columns \code{time}, \code{id} (a unique ID for each
-individual element of the chain), \code{ancestor} (the ID of the ancestor
-of each element), and \code{generation}.
+An \verb{<epichains>} object, which is basically a \verb{<data.frame>} with
+columns \code{sim_id} (a unique ID within each simulation for each individual
+of the chain), \code{ancestor} (the ID of the ancestor of each individual),
+\code{generation}, and \code{time} (of infection).
 }
 \description{
 Simulate a tree of infections from an initial susceptible population
 with initial immunity
 }
 \section{Offspring distributions}{
-Currently only "pois" & "nbinom" are supported. Internally truncated
-distributions are used to avoid infecting more people than susceptibles
-available.
+Currently, \code{offspring_dist} only supports "pois" & "nbinom".
+Internally, the respective truncated poisson and negative binomial
+distributions are used to avoid the situation where there are more cases
+than susceptibles at any point.
 
-The poisson model is parametrised so that:
+The poisson model has mean, lambda, parametrised as:
+\deqn{lambda = \dfrac{offspring\_mean \times (pop -
+initial\_immune - 1)}{pop}}
 
-lamda = offspring_mean * pop - initial_immune / pop
-
-The negative binomial model is parametrised as:
+The negative binomial model, has mean, mu, parametrised as:
+\deqn{mu = \dfrac{offspring\_mean \times (pop - initial\_immune - 1)}{pop},}
+and dispersion, size, parametrised as:
+\deqn{size = \dfrac{mu}{offspring\_disp - 1}.}
+This is why \code{offspring_disp} must be greater than 1.
+}
 
-mu = offspring_mean * pop - initial immune / pop, and
-size = mu / (offspring_disp - 1). This is why offspring_disp must be greater
-than 1.
+\section{Specifying \code{serials_dist}}{
+See the details section of \code{\link[=simulate_tree]{simulate_tree()}} for details on how to specify
+\code{serials_dist}.
 
-simulate_tree_from_pop() has a couple of key different from simulate_tree():
+\code{simulate_tree_from_pop()} has a couple of key differences from
+\code{simulate_tree()}:
 \itemize{
 \item the maximal chain statistic is limited by \code{pop} instead of
 \code{stat_max} (in \code{simulate_tree()}),
-\item it can only handle implemented offspring distributions ("pois" and
-"nbinom").
+\item \code{offspring_dist} can only handle "pois" and "nbinom".
 }
 }
 
 \examples{
 # Simulate with poisson offspring
 simulate_tree_from_pop(
-  pop = 100, offspring_dist = "pois",
-  offspring_mean = 0.5, serials_dist = function(x) 3
+  pop = 100,
+  offspring_dist = "pois",
+  offspring_mean = 0.5,
+  serials_dist = function(x) 3
 )
 
 # Simulate with negative binomial offspring
 simulate_tree_from_pop(
-  pop = 100, offspring_dist = "nbinom",
-  offspring_mean = 0.5, offspring_disp = 1.1, serials_dist = function(x) 3
+  pop = 100,
+  offspring_dist = "nbinom",
+  offspring_mean = 0.5,
+  offspring_disp = 1.1,
+  serials_dist = function(x) 3
 )
 }
-\author{
-Flavio Finger
+\seealso{
+\code{\link[=simulate_tree]{simulate_tree()}} for simulating the transmission chains,
+returning the full tree information ("sim_id", "chain_id",
+"ancestor", "generation", and optionally, "time").
 
-James M. Azam
+\code{\link[=simulate_summary]{simulate_summary()}} for simulating the transmission chains
+statistic (size or length) without the full tree information.
+}
+\author{
+Flavio Finger, James M. Azam, Sebastian Funk
 }

From 4348ede9a6343892dda28680de8026878d9bfd36 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 21 Sep 2023 16:06:26 +0100
Subject: [PATCH 661/828] Reword the titles

---
 R/simulate.r                  | 15 +++++++++------
 man/simulate_summary.Rd       |  2 +-
 man/simulate_tree.Rd          | 20 +++++++++++++++-----
 man/simulate_tree_from_pop.Rd |  8 ++++----
 4 files changed, 29 insertions(+), 16 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 60f47e88..df4ab1ab 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,4 +1,4 @@
-#' Simulate a tree of infections with a serial and offspring distributions
+#' Simulate transmission trees from an initial number of infections
 #'
 #' @param nchains Number of chains to simulate.
 #' @param offspring_dist Offspring distribution: a character string
@@ -223,19 +223,22 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 
 
-#' Simulate a summary of the transmission chain statistic
+#' Simulate transmission chains sizes/lengths without infection tree
 #'
 #' @inheritParams simulate_tree
 #' @param stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to `Inf`.
+#' @inheritSection simulate_tree Calculating chain sizes and lengths
+#' @inheritSection simulate_tree The serial interval (`serials_dist`)
 #' @author James M. Azam, Sebastian Funk
 #' @seealso [simulate_tree()] for simulating the transmission chains,
 #' returning the full tree information ("sim_id", "chain_id",
 #' "ancestor", "generation", and optionally, "time").
-#' @seealso [simulate_tree_from_pop()] for simulating transmission chains
-#' from an initial susceptible population with initial immunity,
-#' returning the full tree information ("sim_id",
-#' "ancestor", "generation", and "time").
+#' @seealso
+#' * [simulate_tree()] for simulating transmission trees from an
+#'   initial number of infections.
+#' * [simulate_tree_from_pop()] for simulating transmission trees from a
+#'   susceptible or partially immune population.
 #' @examples
 #' simulate_summary(
 #'   nchains = 10, statistic = "size", offspring_dist = "pois",
diff --git a/man/simulate_summary.Rd b/man/simulate_summary.Rd
index b16e48b5..c3be57c2 100644
--- a/man/simulate_summary.Rd
+++ b/man/simulate_summary.Rd
@@ -2,7 +2,7 @@
 % Please edit documentation in R/simulate.r
 \name{simulate_summary}
 \alias{simulate_summary}
-\title{Simulate a summary of the transmission chain statistic}
+\title{Simulate transmission chains sizes/lengths without infection tree}
 \usage{
 simulate_summary(
   nchains,
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index d9983b9d..02a696fe 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -2,7 +2,7 @@
 % Please edit documentation in R/simulate.r
 \name{simulate_tree}
 \alias{simulate_tree}
-\title{Simulate a tree of infections with a serial and offspring distributions}
+\title{Simulate transmission trees from an initial number of infections}
 \usage{
 simulate_tree(
   nchains,
@@ -53,12 +53,22 @@ for each individual), \code{ancestor}
 \code{time} (of infection)
 }
 \description{
-Simulate a tree of infections with a serial and offspring distributions
+Simulate transmission trees from an initial number of infections
 }
-\details{
-\code{simulate_tree()} simulates a branching process of the form:
-WIP
+\section{Calculating chain sizes and lengths}{
+The function simulates the chain size for individual \eqn{i} at time
+\eqn{t}, \eqn{I_{t, i}}, as:
+\deqn{I_{t, i} = \sum_{i}^{I_{t-1}}X_{t, i},}
+and the chain length/duration for individual \eqn{i} at time \eqn{t},
+\eqn{L_{t, i}}, as:
+\deqn{L_{t, i} = {\sf min}(1, X_{t, i}), }
+where \eqn{X_{t, i}} is the secondary cases generated by individual \eqn{i}
+at time \eqn{t}, and \eqn{I_{0, i} = L_{0, i} = 1}.
+
+The distribution of secondary cases, \eqn{X_{t, i}} is modelled by the
+offspring distribution (\code{offspring_dist}).
 }
+
 \section{The serial interval (\code{serials_dist})}{
 \subsection{Assumptions/disambiguation}{
 
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 6a2e3a45..3bedd90b 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -2,8 +2,8 @@
 % Please edit documentation in R/simulate.r
 \name{simulate_tree_from_pop}
 \alias{simulate_tree_from_pop}
-\title{Simulate a tree of infections from an initial susceptible population
-with initial immunity}
+\title{Simulate transmission trees from a susceptible or partially immune
+population}
 \usage{
 simulate_tree_from_pop(
   pop,
@@ -50,8 +50,8 @@ of the chain), \code{ancestor} (the ID of the ancestor of each individual),
 \code{generation}, and \code{time} (of infection).
 }
 \description{
-Simulate a tree of infections from an initial susceptible population
-with initial immunity
+Simulate transmission trees from a susceptible or partially immune
+population
 }
 \section{Offspring distributions}{
 Currently, \code{offspring_dist} only supports "pois" & "nbinom".

From f870ef314d93ef0f6f4d17a003effeff0b76696a Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 21 Sep 2023 16:08:19 +0100
Subject: [PATCH 662/828] Add model descriptions, improve, and inherit serial
 interval and offspring dist details from simulate_tree

---
 R/simulate.r                  | 82 ++++++++++++++++++++---------------
 man/simulate_summary.Rd       | 66 +++++++++++++++++++++++++---
 man/simulate_tree.Rd          | 38 ++++++++--------
 man/simulate_tree_from_pop.Rd | 66 ++++++++++++++++++++++------
 4 files changed, 181 insertions(+), 71 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index df4ab1ab..3a09198a 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -29,10 +29,20 @@
 #' @author James M. Azam, Sebastian Funk
 #' @export
 #' @details
-#' `simulate_tree()` simulates a branching process of the form:
-#' WIP
-#' # The serial interval (`serials_dist`)
+#' # Calculating chain sizes and lengths
+#' The function simulates the chain size for individual \eqn{i} at time
+#' \eqn{t}, \eqn{I_{t, i}}, as:
+#' \deqn{I_{t, i} = \sum_{i}^{I_{t-1}}X_{t, i},}
+#' and the chain length/duration for individual \eqn{i} at time \eqn{t},
+#' \eqn{L_{t, i}}, as:
+#' \deqn{L_{t, i} = {\sf min}(1, X_{t, i}), }
+#' where \eqn{X_{t, i}} is the secondary cases generated by individual \eqn{i}
+#' at time \eqn{t}, and \eqn{I_{0, i} = L_{0, i} = 1}.
+#'
+#' The distribution of secondary cases, \eqn{X_{t, i}} is modelled by the
+#' offspring distribution (`offspring_dist`).
 #'
+#' # The serial interval (`serials_dist`)
 #' ## Assumptions/disambiguation
 #'
 #' In epidemiology, the generation interval is the duration between successive
@@ -49,30 +59,30 @@
 #' ## Specifying `serials_dist`
 #'
 #' `serials_dist` must be specified as a named or
-#' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R) # nolint
+#' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R) #nolint
 #' with one argument.
 #'
 #' For example, assuming we want to specify the serial interval
-#' generator as a random log-normally distributed variable with
+#' distribution as a random log-normally distributed variable with
 #' `meanlog = 0.58` and `sdlog = 1.58`, we could define a named function,
 #' let's call it "serial_interval", with only one argument representing the
 #' number of serial intervals to sample:
 #' \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
 #' and assign the name of the function to `serials_dist` in
-#' `simulate_tree()` like so
-#' \code{simulate_tree(..., serials_dist = serial_interval)},
-#' where `...` are the other arguments to `simulate_tree()`.
+#' the simulation function like so
+#' \code{`simulate_*`(..., serials_dist = serial_interval)},
+#' where `...` are the other arguments to `simulate_*()` and * is a placeholder
+#' for the rest of simulation function's name.
 #'
 #' Alternatively, we could assign an anonymous function to `serials_dist`
-#' in the `simulate_tree()` call like so
-#' \code{simulate_tree(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
-#' where `...` are the other arguments to `simulate_tree()`.
-#' @seealso [simulate_summary()] for simulating transmission chains
-#' statistic without the full tree information.
-#' @seealso [simulate_tree_from_pop()] for simulating transmission chains
-#' from an initial susceptible population with initial immunity,
-#' returning the full tree information ("sim_id",
-#' "ancestor", "generation", and "time").
+#' in the `simulate_*()` call like so
+#' \code{simulate_*(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})},
+#' where `...` are the other arguments to `simulate_*()`.
+#' @seealso
+#' * [simulate_summary()] for simulating transmission chains
+#'   statistics (sizes or lengths) without the infection tree.
+#' * [simulate_tree_from_pop()] for simulating transmission trees from a
+#'   susceptible or partially immune population.
 #' @examples
 #' set.seed(123)
 #' chains <- simulate_tree(
@@ -87,12 +97,16 @@
 #' Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
 #' between serial interval, infectiousness profile and generation time.
 #' J R Soc Interface. 2021 Jan;18(174):20200756.
-#' doi: 10.1098/rsif.2020.0756. Epub 2021 Jan 6.
+#' \doi{10.1098/rsif.2020.0756}. Epub 2021 Jan 6.
 #' PMID: 33402022; PMCID: PMC7879757.
 #'
 #' Fine PE. The interval between successive cases of an
 #' infectious disease. Am J Epidemiol. 2003 Dec 1;158(11):1039-47.
-#' doi: 10.1093/aje/kwg251. PMID: 14630599.
+#' \doi{10.1093/aje/kwg251. PMID: 14630599}
+#'
+#' Jacob C. (2010). Branching processes: their role in epidemiology.
+#' International journal of environmental research and public health, 7(3),
+#' 1186–1204. \doi{https://doi.org/10.3390/ijerph7031204}
 simulate_tree <- function(nchains, statistic = c("size", "length"),
                           offspring_dist, stat_max = Inf,
                           serials_dist, t0 = 0,
@@ -303,8 +317,8 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
   )
 }
 
-#' Simulate a tree of infections from an initial susceptible population
-#' with initial immunity
+#' Simulate transmission trees from a susceptible or partially immune
+#' population
 #'
 #' @inheritParams simulate_tree
 #' @param pop The susceptible population size.
@@ -327,7 +341,6 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' of the chain), `ancestor` (the ID of the ancestor of each individual),
 #' `generation`, and `time` (of infection).
 #' @details
-#'
 #' # Offspring distributions
 #' Currently, `offspring_dist` only supports "pois" & "nbinom".
 #' Internally, the respective truncated poisson and negative binomial
@@ -335,30 +348,29 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' than susceptibles at any point.
 #'
 #' The poisson model has mean, lambda, parametrised as:
-#' \deqn{lambda = \dfrac{offspring\_mean \times (pop -
-#' initial\_immune - 1)}{pop}}
+#' \deqn{{\sf lambda} = \dfrac{{\sf offspring\_mean} \times ({\sf pop} -
+#' {\sf initial\_immune} - 1)}{{\sf pop}}}
 #'
 #' The negative binomial model, has mean, mu, parametrised as:
-#' \deqn{mu = \dfrac{offspring\_mean \times (pop - initial\_immune - 1)}{pop},}
+#' \deqn{{\sf mu} = \dfrac{{\sf offspring\_mean} \times ({\sf pop} -
+#' {\sf initial\_immune} - 1)}{{\sf pop}},}
 #' and dispersion, size, parametrised as:
-#' \deqn{size = \dfrac{mu}{offspring\_disp - 1}.}
+#' \deqn{{\sf size} = \dfrac{{\sf mu}}{{\sf offspring\_disp} - 1}.}
 #' This is why `offspring_disp` must be greater than 1.
 #'
-#' # Specifying `serials_dist`
-#' See the details section of [`simulate_tree()`] for details on how to specify
-#' `serials_dist`.
-#'
+#' # Differences with `simulate_tree()`
 #' `simulate_tree_from_pop()` has a couple of key differences from
 #' `simulate_tree()`:
 #'  * the maximal chain statistic is limited by `pop` instead of
 #'  `stat_max` (in `simulate_tree()`),
 #'  * `offspring_dist` can only handle "pois" and "nbinom".
+#' @inheritSection simulate_tree The serial interval (`serials_dist`)
 #' @author Flavio Finger, James M. Azam, Sebastian Funk
-#' @seealso [simulate_tree()] for simulating the transmission chains,
-#' returning the full tree information ("sim_id", "chain_id",
-#' "ancestor", "generation", and optionally, "time").
-#' @seealso [simulate_summary()] for simulating the transmission chains
-#' statistic (size or length) without the full tree information.
+#' @seealso
+#' * [simulate_tree()] for simulating transmission trees from an
+#'   initial number of infections.
+#' * [simulate_summary()] for simulating transmission chains
+#'   statistics (sizes or lengths) without the infection tree.
 #' @examples
 #' # Simulate with poisson offspring
 #' simulate_tree_from_pop(
diff --git a/man/simulate_summary.Rd b/man/simulate_summary.Rd
index c3be57c2..a59ec763 100644
--- a/man/simulate_summary.Rd
+++ b/man/simulate_summary.Rd
@@ -32,8 +32,62 @@ computed. Results above the specified value, are set to \code{Inf}.}
 \item{...}{Parameters of the offspring distribution as required by R.}
 }
 \description{
-Simulate a summary of the transmission chain statistic
+Simulate transmission chains sizes/lengths without infection tree
 }
+\section{Calculating chain sizes and lengths}{
+The function simulates the chain size for individual \eqn{i} at time
+\eqn{t}, \eqn{I_{t, i}}, as:
+\deqn{I_{t, i} = \sum_{i}^{I_{t-1}}X_{t, i},}
+and the chain length/duration for individual \eqn{i} at time \eqn{t},
+\eqn{L_{t, i}}, as:
+\deqn{L_{t, i} = {\sf min}(1, X_{t, i}), }
+where \eqn{X_{t, i}} is the secondary cases generated by individual \eqn{i}
+at time \eqn{t}, and \eqn{I_{0, i} = L_{0, i} = 1}.
+
+The distribution of secondary cases, \eqn{X_{t, i}} is modelled by the
+offspring distribution (\code{offspring_dist}).
+}
+
+\section{The serial interval (\code{serials_dist})}{
+\subsection{Assumptions/disambiguation}{
+
+In epidemiology, the generation interval is the duration between successive
+infectious events in a chain of transmission. Similarly, the serial
+interval is the duration between observed symptom onset times between
+successive cases in a transmission chain. The generation interval is
+often hard to observe because exact times of infection are hard to
+measure hence, the serial interval is often used instead . Here, we
+use the serial interval to represent what would normally be called the
+generation interval, that is, the time between successive cases.
+
+See References below for some literature on the subject.
+}
+
+\subsection{Specifying \code{serials_dist}}{
+
+\code{serials_dist} must be specified as a named or
+\href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function} #nolint
+with one argument.
+
+For example, assuming we want to specify the serial interval
+distribution as a random log-normally distributed variable with
+\code{meanlog = 0.58} and \code{sdlog = 1.58}, we could define a named function,
+let's call it "serial_interval", with only one argument representing the
+number of serial intervals to sample:
+\code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
+and assign the name of the function to \code{serials_dist} in
+the simulation function like so
+\code{`simulate_*`(..., serials_dist = serial_interval)},
+where \code{...} are the other arguments to \verb{simulate_*()} and * is a placeholder
+for the rest of simulation function's name.
+
+Alternatively, we could assign an anonymous function to \code{serials_dist}
+in the \verb{simulate_*()} call like so
+\code{simulate_*(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})},
+where \code{...} are the other arguments to \verb{simulate_*()}.
+}
+}
+
 \examples{
 simulate_summary(
   nchains = 10, statistic = "size", offspring_dist = "pois",
@@ -45,10 +99,12 @@ simulate_summary(
 returning the full tree information ("sim_id", "chain_id",
 "ancestor", "generation", and optionally, "time").
 
-\code{\link[=simulate_tree_from_pop]{simulate_tree_from_pop()}} for simulating transmission chains
-from an initial susceptible population with initial immunity,
-returning the full tree information ("sim_id",
-"ancestor", "generation", and "time").
+\itemize{
+\item \code{\link[=simulate_tree]{simulate_tree()}} for simulating transmission trees from an
+initial number of infections.
+\item \code{\link[=simulate_tree_from_pop]{simulate_tree_from_pop()}} for simulating transmission trees from a
+susceptible or partially immune population.
+}
 }
 \author{
 James M. Azam, Sebastian Funk
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index 02a696fe..e814276d 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -87,24 +87,25 @@ See References below for some literature on the subject.
 \subsection{Specifying \code{serials_dist}}{
 
 \code{serials_dist} must be specified as a named or
-\href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function} # nolint
+\href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function} #nolint
 with one argument.
 
 For example, assuming we want to specify the serial interval
-generator as a random log-normally distributed variable with
+distribution as a random log-normally distributed variable with
 \code{meanlog = 0.58} and \code{sdlog = 1.58}, we could define a named function,
 let's call it "serial_interval", with only one argument representing the
 number of serial intervals to sample:
 \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
 and assign the name of the function to \code{serials_dist} in
-\code{simulate_tree()} like so
-\code{simulate_tree(..., serials_dist = serial_interval)},
-where \code{...} are the other arguments to \code{simulate_tree()}.
+the simulation function like so
+\code{`simulate_*`(..., serials_dist = serial_interval)},
+where \code{...} are the other arguments to \verb{simulate_*()} and * is a placeholder
+for the rest of simulation function's name.
 
 Alternatively, we could assign an anonymous function to \code{serials_dist}
-in the \code{simulate_tree()} call like so
-\code{simulate_tree(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})}, #nolint
-where \code{...} are the other arguments to \code{simulate_tree()}.
+in the \verb{simulate_*()} call like so
+\code{simulate_*(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})},
+where \code{...} are the other arguments to \verb{simulate_*()}.
 }
 }
 
@@ -123,21 +124,24 @@ chains <- simulate_tree(
 Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
 between serial interval, infectiousness profile and generation time.
 J R Soc Interface. 2021 Jan;18(174):20200756.
-doi: 10.1098/rsif.2020.0756. Epub 2021 Jan 6.
+\doi{10.1098/rsif.2020.0756}. Epub 2021 Jan 6.
 PMID: 33402022; PMCID: PMC7879757.
 
 Fine PE. The interval between successive cases of an
 infectious disease. Am J Epidemiol. 2003 Dec 1;158(11):1039-47.
-doi: 10.1093/aje/kwg251. PMID: 14630599.
+\doi{10.1093/aje/kwg251. PMID: 14630599}
+
+Jacob C. (2010). Branching processes: their role in epidemiology.
+International journal of environmental research and public health, 7(3),
+1186–1204. \doi{https://doi.org/10.3390/ijerph7031204}
 }
 \seealso{
-\code{\link[=simulate_summary]{simulate_summary()}} for simulating transmission chains
-statistic without the full tree information.
-
-\code{\link[=simulate_tree_from_pop]{simulate_tree_from_pop()}} for simulating transmission chains
-from an initial susceptible population with initial immunity,
-returning the full tree information ("sim_id",
-"ancestor", "generation", and "time").
+\itemize{
+\item \code{\link[=simulate_summary]{simulate_summary()}} for simulating transmission chains
+statistics (sizes or lengths) without the infection tree.
+\item \code{\link[=simulate_tree_from_pop]{simulate_tree_from_pop()}} for simulating transmission trees from a
+susceptible or partially immune population.
+}
 }
 \author{
 James M. Azam, Sebastian Funk
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 3bedd90b..28c4a507 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -60,20 +60,18 @@ distributions are used to avoid the situation where there are more cases
 than susceptibles at any point.
 
 The poisson model has mean, lambda, parametrised as:
-\deqn{lambda = \dfrac{offspring\_mean \times (pop -
-initial\_immune - 1)}{pop}}
+\deqn{{\sf lambda} = \dfrac{{\sf offspring\_mean} \times ({\sf pop} -
+{\sf initial\_immune} - 1)}{{\sf pop}}}
 
 The negative binomial model, has mean, mu, parametrised as:
-\deqn{mu = \dfrac{offspring\_mean \times (pop - initial\_immune - 1)}{pop},}
+\deqn{{\sf mu} = \dfrac{{\sf offspring\_mean} \times ({\sf pop} -
+{\sf initial\_immune} - 1)}{{\sf pop}},}
 and dispersion, size, parametrised as:
-\deqn{size = \dfrac{mu}{offspring\_disp - 1}.}
+\deqn{{\sf size} = \dfrac{{\sf mu}}{{\sf offspring\_disp} - 1}.}
 This is why \code{offspring_disp} must be greater than 1.
 }
 
-\section{Specifying \code{serials_dist}}{
-See the details section of \code{\link[=simulate_tree]{simulate_tree()}} for details on how to specify
-\code{serials_dist}.
-
+\section{Differences with \code{simulate_tree()}}{
 \code{simulate_tree_from_pop()} has a couple of key differences from
 \code{simulate_tree()}:
 \itemize{
@@ -83,6 +81,46 @@ See the details section of \code{\link[=simulate_tree]{simulate_tree()}} for det
 }
 }
 
+\section{The serial interval (\code{serials_dist})}{
+\subsection{Assumptions/disambiguation}{
+
+In epidemiology, the generation interval is the duration between successive
+infectious events in a chain of transmission. Similarly, the serial
+interval is the duration between observed symptom onset times between
+successive cases in a transmission chain. The generation interval is
+often hard to observe because exact times of infection are hard to
+measure hence, the serial interval is often used instead . Here, we
+use the serial interval to represent what would normally be called the
+generation interval, that is, the time between successive cases.
+
+See References below for some literature on the subject.
+}
+
+\subsection{Specifying \code{serials_dist}}{
+
+\code{serials_dist} must be specified as a named or
+\href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function} #nolint
+with one argument.
+
+For example, assuming we want to specify the serial interval
+distribution as a random log-normally distributed variable with
+\code{meanlog = 0.58} and \code{sdlog = 1.58}, we could define a named function,
+let's call it "serial_interval", with only one argument representing the
+number of serial intervals to sample:
+\code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
+and assign the name of the function to \code{serials_dist} in
+the simulation function like so
+\code{`simulate_*`(..., serials_dist = serial_interval)},
+where \code{...} are the other arguments to \verb{simulate_*()} and * is a placeholder
+for the rest of simulation function's name.
+
+Alternatively, we could assign an anonymous function to \code{serials_dist}
+in the \verb{simulate_*()} call like so
+\code{simulate_*(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})},
+where \code{...} are the other arguments to \verb{simulate_*()}.
+}
+}
+
 \examples{
 # Simulate with poisson offspring
 simulate_tree_from_pop(
@@ -102,12 +140,12 @@ simulate_tree_from_pop(
 )
 }
 \seealso{
-\code{\link[=simulate_tree]{simulate_tree()}} for simulating the transmission chains,
-returning the full tree information ("sim_id", "chain_id",
-"ancestor", "generation", and optionally, "time").
-
-\code{\link[=simulate_summary]{simulate_summary()}} for simulating the transmission chains
-statistic (size or length) without the full tree information.
+\itemize{
+\item \code{\link[=simulate_tree]{simulate_tree()}} for simulating transmission trees from an
+initial number of infections.
+\item \code{\link[=simulate_summary]{simulate_summary()}} for simulating transmission chains
+statistics (sizes or lengths) without the infection tree.
+}
 }
 \author{
 Flavio Finger, James M. Azam, Sebastian Funk

From 4bb19b3aa206f585f3dbd1ae2bed85bf5b8f5734 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 21 Sep 2023 16:19:00 +0100
Subject: [PATCH 663/828] Use the \eqn{} instead \code{} to render R0.

---
 R/simulate.r                  | 2 +-
 man/simulate_tree_from_pop.Rd | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 3a09198a..25bc186d 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -327,7 +327,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' where \code{\link{rpois}} is the R function to generate Poisson random
 #' numbers). Only supports "pois" and "nbinom".
 #' @param offspring_mean The average number of secondary cases for each case.
-#' Same as \code{R0}.
+#' Same as \eqn{R0}.
 #' @param offspring_disp The dispersion parameter of the number of
 #' secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
 #' avoid division by 0 when calculating the size. See details and
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 28c4a507..0f0e1464 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -25,7 +25,7 @@ where \code{\link{rpois}} is the R function to generate Poisson random
 numbers). Only supports "pois" and "nbinom".}
 
 \item{offspring_mean}{The average number of secondary cases for each case.
-Same as \code{R0}.}
+Same as \eqn{R0}.}
 
 \item{offspring_disp}{The dispersion parameter of the number of
 secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to

From d10eb5962b842695edb2c952c134560d7b662a72 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 25 Sep 2023 15:19:28 +0100
Subject: [PATCH 664/828] Remove duplicated seealso

---
 R/simulate.r | 3 ---
 1 file changed, 3 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 25bc186d..3cfe33c4 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -245,9 +245,6 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 #' @inheritSection simulate_tree Calculating chain sizes and lengths
 #' @inheritSection simulate_tree The serial interval (`serials_dist`)
 #' @author James M. Azam, Sebastian Funk
-#' @seealso [simulate_tree()] for simulating the transmission chains,
-#' returning the full tree information ("sim_id", "chain_id",
-#' "ancestor", "generation", and optionally, "time").
 #' @seealso
 #' * [simulate_tree()] for simulating transmission trees from an
 #'   initial number of infections.

From 7f1215de7c43f8b185157d2c82ebc5bdd0e5a63a Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 25 Sep 2023 15:27:49 +0100
Subject: [PATCH 665/828] Exclude the details block from linting

---
 R/simulate.r | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index 3cfe33c4..16a43df3 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -28,6 +28,7 @@
 #' `time` (of infection)
 #' @author James M. Azam, Sebastian Funk
 #' @export
+#nolint start
 #' @details
 #' # Calculating chain sizes and lengths
 #' The function simulates the chain size for individual \eqn{i} at time
@@ -59,7 +60,7 @@
 #' ## Specifying `serials_dist`
 #'
 #' `serials_dist` must be specified as a named or
-#' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R) #nolint
+#' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R)
 #' with one argument.
 #'
 #' For example, assuming we want to specify the serial interval
@@ -78,6 +79,7 @@
 #' in the `simulate_*()` call like so
 #' \code{simulate_*(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})},
 #' where `...` are the other arguments to `simulate_*()`.
+#nolint end
 #' @seealso
 #' * [simulate_summary()] for simulating transmission chains
 #'   statistics (sizes or lengths) without the infection tree.

From 286e144d830bdebcc58951960356ad94844cce9e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 20 Sep 2023 21:59:33 +0100
Subject: [PATCH 666/828] Improve documentation of simulate family of functions

---
 man/simulate_tree_from_pop.Rd | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 0f0e1464..238b9484 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -150,3 +150,6 @@ statistics (sizes or lengths) without the infection tree.
 \author{
 Flavio Finger, James M. Azam, Sebastian Funk
 }
+\author{
+Flavio Finger, James M. Azam, Sebastian Funk
+}

From e26ee35d014332e9067af6b1c2d0978d8b027a85 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 25 Sep 2023 15:14:58 +0100
Subject: [PATCH 667/828] Improve wording

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 R/simulate.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index 16a43df3..eb78e7b3 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -70,7 +70,7 @@
 #' number of serial intervals to sample:
 #' \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
 #' and assign the name of the function to `serials_dist` in
-#' the simulation function like so
+#' the simulation function, i.e.
 #' \code{`simulate_*`(..., serials_dist = serial_interval)},
 #' where `...` are the other arguments to `simulate_*()` and * is a placeholder
 #' for the rest of simulation function's name.

From 451733813126196af245da0229eadaac44856dfa Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 25 Sep 2023 15:15:25 +0100
Subject: [PATCH 668/828] Improve wording

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 R/simulate.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index eb78e7b3..a6840fb9 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -76,7 +76,7 @@
 #' for the rest of simulation function's name.
 #'
 #' Alternatively, we could assign an anonymous function to `serials_dist`
-#' in the `simulate_*()` call like so
+#' in the `simulate_*()` call, i.e.
 #' \code{simulate_*(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})},
 #' where `...` are the other arguments to `simulate_*()`.
 #nolint end

From 695c63eb457446876c405cba68621cad9900a83e Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 25 Sep 2023 15:15:51 +0100
Subject: [PATCH 669/828] Improve function title

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 R/simulate.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index a6840fb9..a404b928 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -239,7 +239,7 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 
 
-#' Simulate transmission chains sizes/lengths without infection tree
+#' Simulate transmission chains sizes/lengths
 #'
 #' @inheritParams simulate_tree
 #' @param stat_max A cut off for the chain statistic (size/length) being

From b5014dc42eb0e92361c7b9240445ad9fa59feaa2 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 25 Sep 2023 15:17:49 +0100
Subject: [PATCH 670/828] Change R0 to R_0

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 R/simulate.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index a404b928..e9eed573 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -326,7 +326,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' where \code{\link{rpois}} is the R function to generate Poisson random
 #' numbers). Only supports "pois" and "nbinom".
 #' @param offspring_mean The average number of secondary cases for each case.
-#' Same as \eqn{R0}.
+#' Same as \eqn{R_0}.
 #' @param offspring_disp The dispersion parameter of the number of
 #' secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
 #' avoid division by 0 when calculating the size. See details and

From e9027c47b793b2eac08803c11b87bbca37a4acef Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 25 Sep 2023 15:45:01 +0100
Subject: [PATCH 671/828] Style example to have one arg per line for
 readability

---
 R/simulate.r | 7 +++++--
 1 file changed, 5 insertions(+), 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index e9eed573..ab93dae8 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -254,8 +254,11 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 #'   susceptible or partially immune population.
 #' @examples
 #' simulate_summary(
-#'   nchains = 10, statistic = "size", offspring_dist = "pois",
-#'   stat_max = 10, lambda = 2
+#'   nchains = 10,
+#'   statistic = "size",
+#'   offspring_dist = "pois",
+#'   stat_max = 10,
+#'   lambda = 2
 #' )
 #' @export
 simulate_summary <- function(nchains, statistic = c("size", "length"),

From 081ff1946cb5ea224098d36f871aef89a941b99c Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 25 Sep 2023 15:45:20 +0100
Subject: [PATCH 672/828] Render man files

---
 man/simulate_summary.Rd       | 19 +++++++++----------
 man/simulate_tree.Rd          |  4 ++--
 man/simulate_tree_from_pop.Rd |  6 +++---
 3 files changed, 14 insertions(+), 15 deletions(-)

diff --git a/man/simulate_summary.Rd b/man/simulate_summary.Rd
index a59ec763..2e75d901 100644
--- a/man/simulate_summary.Rd
+++ b/man/simulate_summary.Rd
@@ -2,7 +2,7 @@
 % Please edit documentation in R/simulate.r
 \name{simulate_summary}
 \alias{simulate_summary}
-\title{Simulate transmission chains sizes/lengths without infection tree}
+\title{Simulate transmission chains sizes/lengths}
 \usage{
 simulate_summary(
   nchains,
@@ -32,7 +32,7 @@ computed. Results above the specified value, are set to \code{Inf}.}
 \item{...}{Parameters of the offspring distribution as required by R.}
 }
 \description{
-Simulate transmission chains sizes/lengths without infection tree
+Simulate transmission chains sizes/lengths
 }
 \section{Calculating chain sizes and lengths}{
 The function simulates the chain size for individual \eqn{i} at time
@@ -66,7 +66,7 @@ See References below for some literature on the subject.
 \subsection{Specifying \code{serials_dist}}{
 
 \code{serials_dist} must be specified as a named or
-\href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function} #nolint
+\href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function}
 with one argument.
 
 For example, assuming we want to specify the serial interval
@@ -76,7 +76,7 @@ let's call it "serial_interval", with only one argument representing the
 number of serial intervals to sample:
 \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
 and assign the name of the function to \code{serials_dist} in
-the simulation function like so
+the simulation function, i.e.
 \code{`simulate_*`(..., serials_dist = serial_interval)},
 where \code{...} are the other arguments to \verb{simulate_*()} and * is a placeholder
 for the rest of simulation function's name.
@@ -90,15 +90,14 @@ where \code{...} are the other arguments to \verb{simulate_*()}.
 
 \examples{
 simulate_summary(
-  nchains = 10, statistic = "size", offspring_dist = "pois",
-  stat_max = 10, lambda = 2
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  lambda = 2
 )
 }
 \seealso{
-\code{\link[=simulate_tree]{simulate_tree()}} for simulating the transmission chains,
-returning the full tree information ("sim_id", "chain_id",
-"ancestor", "generation", and optionally, "time").
-
 \itemize{
 \item \code{\link[=simulate_tree]{simulate_tree()}} for simulating transmission trees from an
 initial number of infections.
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index e814276d..3fab538a 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -87,7 +87,7 @@ See References below for some literature on the subject.
 \subsection{Specifying \code{serials_dist}}{
 
 \code{serials_dist} must be specified as a named or
-\href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function} #nolint
+\href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function}
 with one argument.
 
 For example, assuming we want to specify the serial interval
@@ -97,7 +97,7 @@ let's call it "serial_interval", with only one argument representing the
 number of serial intervals to sample:
 \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
 and assign the name of the function to \code{serials_dist} in
-the simulation function like so
+the simulation function, i.e.
 \code{`simulate_*`(..., serials_dist = serial_interval)},
 where \code{...} are the other arguments to \verb{simulate_*()} and * is a placeholder
 for the rest of simulation function's name.
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 238b9484..cda4e234 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -25,7 +25,7 @@ where \code{\link{rpois}} is the R function to generate Poisson random
 numbers). Only supports "pois" and "nbinom".}
 
 \item{offspring_mean}{The average number of secondary cases for each case.
-Same as \eqn{R0}.}
+Same as \eqn{R_0}.}
 
 \item{offspring_disp}{The dispersion parameter of the number of
 secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
@@ -99,7 +99,7 @@ See References below for some literature on the subject.
 \subsection{Specifying \code{serials_dist}}{
 
 \code{serials_dist} must be specified as a named or
-\href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function} #nolint
+\href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function}
 with one argument.
 
 For example, assuming we want to specify the serial interval
@@ -109,7 +109,7 @@ let's call it "serial_interval", with only one argument representing the
 number of serial intervals to sample:
 \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
 and assign the name of the function to \code{serials_dist} in
-the simulation function like so
+the simulation function, i.e.
 \code{`simulate_*`(..., serials_dist = serial_interval)},
 where \code{...} are the other arguments to \verb{simulate_*()} and * is a placeholder
 for the rest of simulation function's name.

From f445a2ebc53d853c45cbad832d91b54bf57f50db Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Mon, 25 Sep 2023 17:07:59 +0100
Subject: [PATCH 673/828] Clarify purpose of "statistic" argument

---
 R/simulate.r                  | 4 +++-
 man/likelihood.Rd             | 4 +++-
 man/offspring_ll.Rd           | 4 +++-
 man/simulate_summary.Rd       | 6 ++++--
 man/simulate_tree.Rd          | 6 ++++--
 man/simulate_tree_from_pop.Rd | 5 +----
 6 files changed, 18 insertions(+), 11 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index ab93dae8..b966cf2d 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -5,7 +5,9 @@
 #' corresponding to the R distribution function (e.g., "pois" for Poisson,
 #' where \code{\link{rpois}} is the R function to generate Poisson random
 #' numbers).
-#' @param statistic String; Statistic to calculate. Can be one of:
+#' @param statistic String; Statistic (size/length) to calculate. Used to
+#' determine stopping criteria for simulations when `stat_max` is finite.
+#' Can be one of:
 #' \itemize{
 #'   \item "size": the total number of offspring.
 #'   \item "length": the total number of ancestors.
diff --git a/man/likelihood.Rd b/man/likelihood.Rd
index 213a62f6..cfc5cb1e 100644
--- a/man/likelihood.Rd
+++ b/man/likelihood.Rd
@@ -20,7 +20,9 @@ likelihood(
 \arguments{
 \item{chains}{Vector of chain summaries (sizes/lengths)}
 
-\item{statistic}{String; Statistic to calculate. Can be one of:
+\item{statistic}{String; Statistic (size/length) to calculate. Used to
+determine stopping criteria for simulations when \code{stat_max} is finite.
+Can be one of:
 \itemize{
 \item "size": the total number of offspring.
 \item "length": the total number of ancestors.
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index 53632fee..bd8caeb2 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -15,7 +15,9 @@ corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers).}
 
-\item{statistic}{String; Statistic to calculate. Can be one of:
+\item{statistic}{String; Statistic (size/length) to calculate. Used to
+determine stopping criteria for simulations when \code{stat_max} is finite.
+Can be one of:
 \itemize{
 \item "size": the total number of offspring.
 \item "length": the total number of ancestors.
diff --git a/man/simulate_summary.Rd b/man/simulate_summary.Rd
index 2e75d901..a453d680 100644
--- a/man/simulate_summary.Rd
+++ b/man/simulate_summary.Rd
@@ -15,7 +15,9 @@ simulate_summary(
 \arguments{
 \item{nchains}{Number of chains to simulate.}
 
-\item{statistic}{String; Statistic to calculate. Can be one of:
+\item{statistic}{String; Statistic (size/length) to calculate. Used to
+determine stopping criteria for simulations when \code{stat_max} is finite.
+Can be one of:
 \itemize{
 \item "size": the total number of offspring.
 \item "length": the total number of ancestors.
@@ -82,7 +84,7 @@ where \code{...} are the other arguments to \verb{simulate_*()} and * is a place
 for the rest of simulation function's name.
 
 Alternatively, we could assign an anonymous function to \code{serials_dist}
-in the \verb{simulate_*()} call like so
+in the \verb{simulate_*()} call, i.e.
 \code{simulate_*(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})},
 where \code{...} are the other arguments to \verb{simulate_*()}.
 }
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index 3fab538a..1d6722dc 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -18,7 +18,9 @@ simulate_tree(
 \arguments{
 \item{nchains}{Number of chains to simulate.}
 
-\item{statistic}{String; Statistic to calculate. Can be one of:
+\item{statistic}{String; Statistic (size/length) to calculate. Used to
+determine stopping criteria for simulations when \code{stat_max} is finite.
+Can be one of:
 \itemize{
 \item "size": the total number of offspring.
 \item "length": the total number of ancestors.
@@ -103,7 +105,7 @@ where \code{...} are the other arguments to \verb{simulate_*()} and * is a place
 for the rest of simulation function's name.
 
 Alternatively, we could assign an anonymous function to \code{serials_dist}
-in the \verb{simulate_*()} call like so
+in the \verb{simulate_*()} call, i.e.
 \code{simulate_*(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})},
 where \code{...} are the other arguments to \verb{simulate_*()}.
 }
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index cda4e234..86621422 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -115,7 +115,7 @@ where \code{...} are the other arguments to \verb{simulate_*()} and * is a place
 for the rest of simulation function's name.
 
 Alternatively, we could assign an anonymous function to \code{serials_dist}
-in the \verb{simulate_*()} call like so
+in the \verb{simulate_*()} call, i.e.
 \code{simulate_*(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})},
 where \code{...} are the other arguments to \verb{simulate_*()}.
 }
@@ -150,6 +150,3 @@ statistics (sizes or lengths) without the infection tree.
 \author{
 Flavio Finger, James M. Azam, Sebastian Funk
 }
-\author{
-Flavio Finger, James M. Azam, Sebastian Funk
-}

From a756ab82e05661e02dbeef4d38cb6aaf1e91a209 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 26 Sep 2023 11:32:34 +0100
Subject: [PATCH 674/828] Add license

---
 LICENSE    | 2 +-
 LICENSE.md | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/LICENSE b/LICENSE
index bad553b7..c9f94f7f 100644
--- a/LICENSE
+++ b/LICENSE
@@ -1,2 +1,2 @@
 YEAR: 2023
-COPYRIGHT HOLDER: bpmodels authors
+COPYRIGHT HOLDER: epichains authors
diff --git a/LICENSE.md b/LICENSE.md
index 9293f3eb..e8d943b5 100644
--- a/LICENSE.md
+++ b/LICENSE.md
@@ -1,6 +1,6 @@
 # MIT License
 
-Copyright (c) 2023 bpmodels authors
+Copyright (c) 2023 epichains authors
 
 Permission is hereby granted, free of charge, to any person obtaining a copy
 of this software and associated documentation files (the "Software"), to deal

From 59a4bbb2352827c79c795d9692271d5a43de30e3 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 25 Sep 2023 12:48:16 +0100
Subject: [PATCH 675/828] Add vignette to model/forecast covid-19 with
 epichains

---
 _pkgdown.yml                       |   1 +
 vignettes/projecting_incidence.Rmd | 365 +++++++++++++++++++++++++++++
 2 files changed, 366 insertions(+)
 create mode 100644 vignettes/projecting_incidence.Rmd

diff --git a/_pkgdown.yml b/_pkgdown.yml
index 30fc41fb..278c1519 100644
--- a/_pkgdown.yml
+++ b/_pkgdown.yml
@@ -14,4 +14,5 @@ articles:
   contents:
   - branching_process_literature
   - theoretical_background
+  - projecting_incidence
 
diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
new file mode 100644
index 00000000..6a1fc28a
--- /dev/null
+++ b/vignettes/projecting_incidence.Rmd
@@ -0,0 +1,365 @@
+---
+title: "Projecting infectious disease incidence: a COVID-19 example"
+author: "James Azam, Sebastian Funk"
+output:
+  bookdown::html_vignette2:
+    fig_caption: yes
+    code_folding: show
+pkgdown:
+  as_is: true
+bibliography: references.json
+link-citations: true
+vignette: >
+  %\VignetteIndexEntry{Projecting infectious disease incidence: a COVID-19 example}
+  %\VignetteEncoding{UTF-8}
+  %\VignetteEngine{knitr::rmarkdown}
+editor_options: 
+  chunk_output_type: console
+---
+
+```{r setup, include=FALSE}
+knitr::opts_chunk$set(
+  echo = TRUE,
+  message = FALSE,
+  warning = FALSE,
+  collapse = TRUE,
+  comment = "#>"
+)
+
+```
+
+## Overview
+
+Branching processes can be used to project infectious disease trends in time
+provided we can characterize the distribution of times between
+successive cases (serial interval), and the distribution of secondary cases
+produced by a single individual (offspring distribution). Such simulations can
+be achieved in _epichains_ with the `simulate_tree()` function and
+@pearson2020, and @abbott2020 illustrate its application to COVID-19.
+
+The purpose of this vignette is to use early data on COVID-19 in South Africa 
+[@marivate2020] to illustrate how _epichains_ can be used to forecast an 
+outbreak. 
+
+Let's load the required packages
+
+```{r packages, include=TRUE}
+library("epichains")
+library("dplyr")
+library("ggplot2")
+library("lubridate")
+```
+
+## Data
+
+Included in _epichains_ is a cleaned time series of the first 15 days of 
+the COVID-19 outbreak in South Africa. This can be loaded into 
+memory as follows: 
+```{r}
+data("covid19_sa", package = "epichains")
+```
+
+Let us examine the first 6 entries of the dataset.
+```{r}
+head(covid19_sa)
+```
+
+## Setting up the inputs  
+
+### Onset times 
+
+`simulate_tree()` requires a vector of onset times, `t0`, for each 
+chain/individual/simulation. 
+
+The `covid19_sa` dataset above is aggregated, so we will have to disaggregate
+it into a linelist with each row representing a case and their onset time. 
+
+To achieve this, we will first use the date of the index case as the reference 
+and find the difference between each date and the reference. 
+```{r linelist_gen, message=FALSE}
+days_since_index <- as.integer(covid19_sa$date - min(covid19_sa$date))
+days_since_index
+```
+
+Using the vector of start times for the time series, we will then 
+create the linelist by disaggregating the time series so 
+that each case has a corresponding start time.
+```{r}
+start_times <- rep(days_since_index, covid19_sa$cases)
+start_times
+```
+
+### Serial interval
+
+The log-normal distribution is commonly used in epidemiology to characterise 
+quantities such as the serial interval because it has a large variance 
+and can only be positive-valued [@nishiura2007; @limpert2001]. 
+
+In this example, we will assume based on COVID-19 literature that the 
+serial interval, S, is log-normal distributed with parameters, 
+$\mu = 4.7$ and $\sigma = 2.9$ [@pearson2020]. Note that when the distribution
+is described this way, it means $\mu$ and $\sigma$ are the expected value 
+and standard deviation of the natural logarithm of the serial interval. Hence, 
+in order to sample the "back-transformed" measured serial interval with 
+expectation/mean, $E[S]$ and standard deviation, $SD [S]$, 
+we can use the following parametrisation:
+
+\begin{align}
+E[S] &= \ln \left( \dfrac{\mu^2}{(\sqrt{\mu^2 + \sigma^2}} \right) \\
+
+SD [S] &= \sqrt {\ln \left(1 + \dfrac{\sigma^2}{\mu^2} \right)}
+ 
+\end{align}
+
+See ["log-normal_distribution" on Wikipedia](https://en.wikipedia.org/wiki/Log-normal_distribution) for a
+detailed explanation of this parametrisation.
+
+We will now set up the serial interval function with the appropriate inputs.
+We adopt R's random lognormal distribution generator (`rlnorm()`) that
+takes `meanlog` and `sdlog` as arguments, which we define with the
+parametrisation above as `log_mean()` and `log_sd()` respectively and wrap it in 
+the `serial_interval()` function. Moreover, `serial_interval()` takes one
+argument `sample_size` as is required by _epichains_ 
+(See `?epichains::simulate_tree`), which is further passed to `rlnorm()` as the 
+first argument to determine the number of observations to sample
+(See `?rlnorm`).
+```{r input_prep3, message=FALSE}
+mu <- 4.7
+sgma <- 2.9
+
+log_mean <- log((mu^2) / (sqrt(sgma^2 + mu^2)))  # log mean
+log_sd <- sqrt(log(1 + (sgma / mu)^2)) # log sd
+
+#' serial interval function
+serial_interval <- function(sample_size) {
+  si <- rlnorm(sample_size, meanlog = log_mean, sdlog = log_sd)
+  return(si)
+}
+```
+
+### Offspring distribution
+
+The negative binomial distribution is commonly used in epidemiology to
+account for individual variation in transmissibility, 
+also known as superspreading [@lloyd-smith2005].
+
+For this example, we will assume that the offspring distribution is 
+characterised by a negative binomial with $mu = 2.5$ [@abbott2020] and 
+$size = 0.58$ [@wang2020]. In this parameterization, $mu$ 
+represents the $R_0$, which is defined as the average number of 
+cases produced by a single individual in an entirely susceptible population. 
+The parameter $size$ represents superspreading, that is, the degree of 
+heterogeneity in transmission by single individuals.
+
+### Simulation controls
+
+Since, we have specified $R0 > 1$, it means the epidemic could potentially grow
+without end. Hence, we must specify an end time for the simulation.
+`simulate_tree()` provides the `tf` argument for this purpose. For this 
+example, we will simulate outbreaks that end $14$ days after the last date 
+of observations in the `covid19_sa` dataset.
+```{r input_prep2, message=FALSE}
+#' Date to end simulation (14 day projection in this case)
+projection_window <- 14 # 14 days/ 2-week ahead projection
+projection_end_day <- max(days_since_index) + projection_window
+projection_end_day
+```
+
+`simulate_tree()` is stochastic, meaning the results are different every
+time it is run for the same set of parameters. We will, therefore, run the
+simulations $100$ times and aggregate the results. 
+
+Let us specify that.
+```{r}
+#' Number of simulations
+sim_rep <- 100
+```
+
+Lastly, `simulate_tree()` requires a maximum chain statistic for each chain,
+above which, the simulation is cut off. If this value is 
+not specified, it assumes a value of infinity. Here, we will
+assume a maximum chain size of $1000$.
+
+Let's call it `chain_threhold`.
+```{r}
+#' Maximum chain size allowed
+chain_threshold <- 1000
+```
+
+## Modelling assumptions
+
+`simulate_tree()` makes the following simplifying assumptions:
+
+1. All cases are observed.
+1. There is no reporting delay.
+1. Reporting rate is constant through the course of the epidemic.
+1. No interventions have been implemented.
+1. Population is homogeneous and well-mixed.
+
+To summarise the whole set up so far, we are going to simulate 
+each chain `r sim_rep` times, projecting cases over
+`r projection_window` days after the first `r max(start_times)` days, and 
+assuming that no outbreak size exceeds `r chain_threshold` cases.
+
+## Running the simulations
+
+We will use the function `lapply()` to run the simulations and bind them
+by rows with `dplyr::bind_rows()`.
+```{r simulations, message=FALSE}
+set.seed(1234)
+sim_chain_sizes <- lapply(
+  seq_len(sim_rep),
+  function(sim) {
+    simulate_tree(
+      nchains = length(start_times),
+      offspring_dist = "nbinom",
+      mu = 2.5,
+      size = 0.58,
+      statistic = "size",
+      stat_max = chain_threshold,
+      serials_dist = serial_interval,
+      t0 = start_times,
+      tf = projection_end_day
+    ) %>%
+      mutate(sim = sim)
+  }
+)
+
+sim_output <- bind_rows(sim_chain_sizes)
+```
+
+Let us view the first few rows of the simulation results.
+```{r sim_output_head}
+head(sim_output)
+```
+
+## Post-processing
+
+Now, we will summarise the simulation results. 
+
+We want to plot the individual simulated daily time series and show 
+the median cases per day aggregated over all simulations.
+
+First, we will create the daily time series per simulation by
+aggregating the number of cases per day of each simulation.
+```{r post_processing}
+# Daily number of cases for each simulation
+incidence_ts <- sim_output %>%
+  mutate(day = ceiling(time)) %>%
+  group_by(sim, day) %>%
+  summarise(cases = n()) %>%
+  ungroup()
+
+head(incidence_ts)
+```
+
+Next, we will add a date column to the results of each simulation 
+set. We will use the date of the first case in the observed data 
+as the reference start date.
+```{r}
+# Get start date from the observed data
+index_date <- min(covid19_sa$date)
+index_date
+
+# Add a dates column to each simulation result
+incidence_ts_by_date <- incidence_ts %>%
+  group_by(sim) %>%
+  mutate(date = index_date + days(seq(0, n() - 1))) %>%
+  ungroup()
+
+head(incidence_ts_by_date)
+```
+
+Now we will aggregate the simulations by day and evaluate the median 
+daily cases across all simulations.
+```{r}
+# Median daily number of cases aggregated across all simulations
+median_daily_cases <- incidence_ts_by_date %>%
+  group_by(date) %>%
+  summarise(median_cases = median(cases)) %>%
+  ungroup() %>%
+  arrange(date)
+
+head(median_daily_cases)
+```
+
+## Visualization
+
+We will now plot the individual simulation results alongside the median
+of the aggregated results.
+```{r viz, fig.cap ="COVID-19 incidence in South Africa projected over a two week window in 2020. The light gray lines represent the individual simulations, the red line represents the median daily cases across all simulations, the black connected dots represent the observed data, and the dashed vertical line marks the beginning of the projection.", fig.width=6.0, fig.height=6}
+
+ggplot(data = incidence_ts_by_date) +
+  geom_line(
+    aes(
+      x = date,
+      y = cases,
+      group = sim
+    ),
+    color = "grey",
+    linewidth = 0.2,
+    alpha = 0.25
+  ) +
+  geom_line(
+    data = median_daily_cases,
+    aes(
+      x = date,
+      y = median_cases
+    ),
+    color = "tomato3",
+    linewidth = 1.8
+  ) +
+  geom_point(
+    data = covid19_sa,
+    aes(
+      x = date,
+      y = cases
+    ),
+    color = "black",
+    size = 1.75,
+    shape = 21
+  ) +
+  geom_line(
+    data = covid19_sa,
+    aes(
+      x = date,
+      y = cases
+    ),
+    color = "black",
+    linewidth = 1
+  ) +
+  scale_x_continuous(
+    breaks = seq(
+      min(incidence_ts_by_date$date),
+      max(incidence_ts_by_date$date),
+      5
+    ),
+    labels = seq(
+      min(incidence_ts_by_date$date),
+      max(incidence_ts_by_date$date),
+      5
+    ),
+    limits = c(
+      min(incidence_ts_by_date$date),
+      max(incidence_ts_by_date$date) - 4 # for a better visual look
+    )
+  ) +
+  scale_y_continuous(
+    breaks = seq(
+      0,
+      max(incidence_ts_by_date$cases) + 200,
+      250
+    ),
+    labels = seq(
+      0,
+      max(incidence_ts_by_date$cases) + 200,
+      250
+    )
+  ) +
+  geom_vline(
+    mapping = aes(xintercept = max(covid19_sa$date)),
+    linetype = "dashed"
+  ) +
+  labs(x = "Date", y = "Daily cases")
+```
+## References

From b04a7ab79728a8003c77f65e66854f1dfa59e54e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 25 Sep 2023 13:42:48 +0100
Subject: [PATCH 676/828] Replace forecast with project

---
 vignettes/projecting_incidence.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 6a1fc28a..215aced2 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -38,7 +38,7 @@ be achieved in _epichains_ with the `simulate_tree()` function and
 @pearson2020, and @abbott2020 illustrate its application to COVID-19.
 
 The purpose of this vignette is to use early data on COVID-19 in South Africa 
-[@marivate2020] to illustrate how _epichains_ can be used to forecast an 
+[@marivate2020] to illustrate how _epichains_ can be used to project an 
 outbreak. 
 
 Let's load the required packages

From 2ccda4a860e36dd6c58bf88f9c44c5f7d73a82c7 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 25 Sep 2023 13:43:20 +0100
Subject: [PATCH 677/828] Use first five observations

---
 vignettes/projecting_incidence.Rmd | 8 ++++++--
 1 file changed, 6 insertions(+), 2 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 215aced2..17888562 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -59,9 +59,13 @@ memory as follows:
 data("covid19_sa", package = "epichains")
 ```
 
-Let us examine the first 6 entries of the dataset.
+We will use the first $5$ observations for this demonstration. We will assume
+that all the cases in that subset are imported and did not infect each other. 
+
+Let us subset and view that aspect of the data.
 ```{r}
-head(covid19_sa)
+seed_cases <- covid19_sa[1:5, ]
+head(seed_cases)
 ```
 
 ## Setting up the inputs  

From 7874147fa4124c89aeb8c35419dd50c23f113827 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 25 Sep 2023 13:44:15 +0100
Subject: [PATCH 678/828] Add/revise leading paragraphs

---
 vignettes/projecting_incidence.Rmd | 17 ++++++++++++-----
 1 file changed, 12 insertions(+), 5 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 17888562..9c47d28a 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -95,6 +95,9 @@ start_times
 
 ### Serial interval
 
+Next, we will set up the serial interval distribution, that is, the time
+between successive onsets of cases in a transmission chain.
+
 The log-normal distribution is commonly used in epidemiology to characterise 
 quantities such as the serial interval because it has a large variance 
 and can only be positive-valued [@nishiura2007; @limpert2001]. 
@@ -143,9 +146,12 @@ serial_interval <- function(sample_size) {
 
 ### Offspring distribution
 
-The negative binomial distribution is commonly used in epidemiology to
-account for individual variation in transmissibility, 
-also known as superspreading [@lloyd-smith2005].
+Let us now set up the offspring distribution, that is the distribution that
+drives the mechanism behind how individual cases infect other cases. The
+appropriate way to model the offspring distribution is to capture both the
+population-level transmissibility ($R0$) and the individual-level heterogeneity
+in transmission ("superspreading"). The negative binomial distribution is
+commonly used in this case [@lloyd-smith2005].
 
 For this example, we will assume that the offspring distribution is 
 characterised by a negative binomial with $mu = 2.5$ [@abbott2020] and 
@@ -179,8 +185,9 @@ Let us specify that.
 sim_rep <- 100
 ```
 
-Lastly, `simulate_tree()` requires a maximum chain statistic for each chain,
-above which, the simulation is cut off. If this value is 
+Lastly, since, we have specified that $R0 > 1$, it means the epidemic could
+potentially grow without end. Hence, we must specify an end point for the
+simulations. 
 not specified, it assumes a value of infinity. Here, we will
 assume a maximum chain size of $1000$.
 

From bf1661af7bac2381c97d787096dd059a24dab22e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 25 Sep 2023 13:44:51 +0100
Subject: [PATCH 679/828] Clean up plotting code

---
 vignettes/projecting_incidence.Rmd | 28 +++++++++++++++++++---------
 1 file changed, 19 insertions(+), 9 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 9c47d28a..86c50f5b 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -300,6 +300,20 @@ We will now plot the individual simulation results alongside the median
 of the aggregated results.
 ```{r viz, fig.cap ="COVID-19 incidence in South Africa projected over a two week window in 2020. The light gray lines represent the individual simulations, the red line represents the median daily cases across all simulations, the black connected dots represent the observed data, and the dashed vertical line marks the beginning of the projection.", fig.width=6.0, fig.height=6}
 
+# since all simulations may end at a different date, we will find the minimum
+# final date for all simulations for the purposes of visualisation.
+final_date <- incidence_ts_by_date %>%
+  group_by(sim) %>%
+  summarise(final_date = max(date), .groups = "drop") %>%
+  summarise(min_final_date = min(final_date)) %>%
+  pull(min_final_date)
+
+incidence_ts_by_date <- incidence_ts_by_date %>%
+  filter(date <= final_date)
+
+median_daily_cases <- median_daily_cases %>%
+  filter(date <= final_date)
+
 ggplot(data = incidence_ts_by_date) +
   geom_line(
     aes(
@@ -349,26 +363,22 @@ ggplot(data = incidence_ts_by_date) +
       min(incidence_ts_by_date$date),
       max(incidence_ts_by_date$date),
       5
-    ),
-    limits = c(
-      min(incidence_ts_by_date$date),
-      max(incidence_ts_by_date$date) - 4 # for a better visual look
     )
   ) +
   scale_y_continuous(
     breaks = seq(
       0,
-      max(incidence_ts_by_date$cases) + 200,
-      250
+      max(incidence_ts_by_date$cases),
+      30
     ),
     labels = seq(
       0,
-      max(incidence_ts_by_date$cases) + 200,
-      250
+      max(incidence_ts_by_date$cases),
+      30
     )
   ) +
   geom_vline(
-    mapping = aes(xintercept = max(covid19_sa$date)),
+    mapping = aes(xintercept = max(seed_cases$date)),
     linetype = "dashed"
   ) +
   labs(x = "Date", y = "Daily cases")

From 7697df4414df239813cb4e9b972ea34029299dde Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 25 Sep 2023 13:45:38 +0100
Subject: [PATCH 680/828] Clarify that we're using first 5 observations to
 determine simulation seeds

---
 vignettes/projecting_incidence.Rmd | 27 +++++++++++++--------------
 1 file changed, 13 insertions(+), 14 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 86c50f5b..31f17b55 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -70,27 +70,26 @@ head(seed_cases)
 
 ## Setting up the inputs  
 
-### Onset times 
+We will now proceed to set up `simulate_tree()` for the simulations.
 
-`simulate_tree()` requires a vector of onset times, `t0`, for each 
-chain/individual/simulation. 
+### Onset times 
 
-The `covid19_sa` dataset above is aggregated, so we will have to disaggregate
-it into a linelist with each row representing a case and their onset time. 
+`simulate_tree()` requires a vector of seeding times, `t0`, for each
+chain/individual/simulation.
 
-To achieve this, we will first use the date of the index case as the reference 
-and find the difference between each date and the reference. 
+To get this, we will use the observation date of the index case as the
+reference and find the difference between the other observed dates and the reference. 
 ```{r linelist_gen, message=FALSE}
-days_since_index <- as.integer(covid19_sa$date - min(covid19_sa$date))
+days_since_index <- as.integer(seed_cases$date - min(seed_cases$date))
 days_since_index
 ```
 
-Using the vector of start times for the time series, we will then 
-create the linelist by disaggregating the time series so 
-that each case has a corresponding start time.
+Using the vector of start times from the time series, we will then 
+create a corresponding seeding time for each individual, which we'll call
+`tf`.
 ```{r}
-start_times <- rep(days_since_index, covid19_sa$cases)
-start_times
+t0 <- rep(days_since_index, seed_cases$cases)
+t0
 ```
 
 ### Serial interval
@@ -269,7 +268,7 @@ set. We will use the date of the first case in the observed data
 as the reference start date.
 ```{r}
 # Get start date from the observed data
-index_date <- min(covid19_sa$date)
+index_date <- min(seed_cases$date)
 index_date
 
 # Add a dates column to each simulation result

From 756c381543f6f181f6137a7b3285a5afb85a7270 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 25 Sep 2023 13:46:24 +0100
Subject: [PATCH 681/828] Use simulate_tree's argument names for variable names

---
 vignettes/projecting_incidence.Rmd | 50 +++++++++++++++++-------------
 1 file changed, 28 insertions(+), 22 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 31f17b55..939f8cd1 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -137,7 +137,7 @@ log_mean <- log((mu^2) / (sqrt(sgma^2 + mu^2)))  # log mean
 log_sd <- sqrt(log(1 + (sgma / mu)^2)) # log sd
 
 #' serial interval function
-serial_interval <- function(sample_size) {
+serials_dist <- function(sample_size) {
   si <- rlnorm(sample_size, meanlog = log_mean, sdlog = log_sd)
   return(si)
 }
@@ -154,7 +154,14 @@ commonly used in this case [@lloyd-smith2005].
 
 For this example, we will assume that the offspring distribution is 
 characterised by a negative binomial with $mu = 2.5$ [@abbott2020] and 
-$size = 0.58$ [@wang2020]. In this parameterization, $mu$ 
+$size = 0.58$ [@wang2020]. 
+
+```{r nbinom_args, message=FALSE}
+mu <- 2.5
+size <- 0.58
+```
+
+In this parameterization, $mu$ 
 represents the $R_0$, which is defined as the average number of 
 cases produced by a single individual in an entirely susceptible population. 
 The parameter $size$ represents superspreading, that is, the degree of 
@@ -162,16 +169,13 @@ heterogeneity in transmission by single individuals.
 
 ### Simulation controls
 
-Since, we have specified $R0 > 1$, it means the epidemic could potentially grow
-without end. Hence, we must specify an end time for the simulation.
-`simulate_tree()` provides the `tf` argument for this purpose. For this 
-example, we will simulate outbreaks that end $14$ days after the last date 
-of observations in the `covid19_sa` dataset.
+For this example, we will simulate outbreaks that end $21$ days after the last
+date of observations in the `seed_cases` dataset.
 ```{r input_prep2, message=FALSE}
-#' Date to end simulation (14 day projection in this case)
-projection_window <- 14 # 14 days/ 2-week ahead projection
-projection_end_day <- max(days_since_index) + projection_window
-projection_end_day
+#' Date to end simulation
+projection_window <- 21 
+tf <- max(days_since_index) + projection_window
+tf
 ```
 
 `simulate_tree()` is stochastic, meaning the results are different every
@@ -187,13 +191,15 @@ sim_rep <- 100
 Lastly, since, we have specified that $R0 > 1$, it means the epidemic could
 potentially grow without end. Hence, we must specify an end point for the
 simulations. 
+
+`simulate_tree()` provides the `stat_max` argument for this purpose.
+Above `stat_max`, the simulation is cut off. If this value is 
 not specified, it assumes a value of infinity. Here, we will
 assume a maximum chain size of $1000$.
 
-Let's call it `chain_threhold`.
 ```{r}
 #' Maximum chain size allowed
-chain_threshold <- 1000
+stat_max <- 1000
 ```
 
 ## Modelling assumptions
@@ -208,8 +214,8 @@ chain_threshold <- 1000
 
 To summarise the whole set up so far, we are going to simulate 
 each chain `r sim_rep` times, projecting cases over
-`r projection_window` days after the first `r max(start_times)` days, and 
-assuming that no outbreak size exceeds `r chain_threshold` cases.
+`r projection_window` days after the first `r max(t0)` days, and 
+assuming that no outbreak size exceeds `r stat_max` cases.
 
 ## Running the simulations
 
@@ -221,15 +227,15 @@ sim_chain_sizes <- lapply(
   seq_len(sim_rep),
   function(sim) {
     simulate_tree(
-      nchains = length(start_times),
+      nchains = length(t0),
       offspring_dist = "nbinom",
-      mu = 2.5,
-      size = 0.58,
+      mu = mu,
+      size = size,
       statistic = "size",
-      stat_max = chain_threshold,
-      serials_dist = serial_interval,
-      t0 = start_times,
-      tf = projection_end_day
+      stat_max = stat_max,
+      serials_dist = serials_dist,
+      t0 = t0,
+      tf = tf
     ) %>%
       mutate(sim = sim)
   }

From f8d136e6edea51b203befe57ca6ff721d47b87db Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 25 Sep 2023 13:50:33 +0100
Subject: [PATCH 682/828] Linting

---
 vignettes/projecting_incidence.Rmd | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 939f8cd1..63f9eb53 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -173,14 +173,14 @@ For this example, we will simulate outbreaks that end $21$ days after the last
 date of observations in the `seed_cases` dataset.
 ```{r input_prep2, message=FALSE}
 #' Date to end simulation
-projection_window <- 21 
+projection_window <- 21
 tf <- max(days_since_index) + projection_window
 tf
 ```
 
 `simulate_tree()` is stochastic, meaning the results are different every
 time it is run for the same set of parameters. We will, therefore, run the
-simulations $100$ times and aggregate the results. 
+simulations $100$ times and aggregate the results.
 
 Let us specify that.
 ```{r}

From 0c8ada6a56bb9156bc4bb37030de12c3fbe29dbc Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 25 Sep 2023 15:06:04 +0100
Subject: [PATCH 683/828] Move modelling vignette to package vignette section

---
 _pkgdown.yml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/_pkgdown.yml b/_pkgdown.yml
index 278c1519..cc8cc70d 100644
--- a/_pkgdown.yml
+++ b/_pkgdown.yml
@@ -9,10 +9,10 @@ articles:
 - title: Package vignettes
   navbar: Package vignettes
   contents:
+  - projecting_incidence
 - title: Modelling guides and background
   navbar: Modelling guides and background
   contents:
   - branching_process_literature
   - theoretical_background
-  - projecting_incidence
 

From ac2a5ef4f2d4e882281c02120c43da056faa7629 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 26 Sep 2023 20:36:17 +0100
Subject: [PATCH 684/828] Add intervention function

---
 NAMESPACE             |  1 +
 R/intervention.R      | 41 +++++++++++++++++++++++++++++++++++++++++
 man/intvn_scale_r0.Rd | 42 ++++++++++++++++++++++++++++++++++++++++++
 3 files changed, 84 insertions(+)
 create mode 100644 R/intervention.R
 create mode 100644 man/intvn_scale_r0.Rd

diff --git a/NAMESPACE b/NAMESPACE
index ab92359f..c0dfa59e 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -7,6 +7,7 @@ S3method(print,epichains)
 S3method(summary,epichains)
 S3method(tail,epichains)
 export(dborel)
+export(intvn_scale_r0)
 export(is_chains_summary)
 export(is_chains_tree)
 export(is_epichains)
diff --git a/R/intervention.R b/R/intervention.R
new file mode 100644
index 00000000..3d78e6f2
--- /dev/null
+++ b/R/intervention.R
@@ -0,0 +1,41 @@
+#' Set up intervention for simulation
+#'
+#' @description
+#' `intvn_scale_r0()` is a helper for the \code{simulation_*} functions. It
+#' modifies the relevant arguments of the offspring distribution in order to
+#' mimic the impact of an intervention. In particular, it scales the mean of
+#' the offspring distribution. Currently, it can only handle the poisson and
+#' negative binomial distributions and errors when other offspring
+#' distributions are specified alongside `intvn_scale_r0`.
+#'
+#' @inheritParams simulate_tree
+#' @param r0_reduction The intervention impact. A scalar between 0 and 1.
+#' Scales the mean of `offspring_dist`. `r0_reduction` = 0 implies
+#' no intervention impact and `r0_reduction` = 1 implies full impact.
+#' @param pars_list Parameter(s) for poisson or negative binomial offspring
+#' distribution.
+#' @return List of the offspring distribution parameter(s) with the mean
+#' scaled by \code{1 - intvn_scale_r0}.
+#' @details
+#' `intvn_scale_r0()` scales the mean of the offspring distribution
+#' by \eqn{1 - r0\_reduction} so that the new mean is given as:
+#' \deqn{(1 - r0\_reduction) \times R_0,} where \eqn{R_0} is the
+#' mean of the poisson and negative binomial distribution.
+#'
+#' @author James M. Azam
+#' @export
+intvn_scale_r0 <- function(r0_reduction, offspring_dist, pars_list) {
+  # Intervention only works for pois and nbinom
+  if (!offspring_dist %in% c("pois", "nbinom")) {
+    stop(
+      "`offspring_dist` must be one of c(\"pois\", \"nbinom\"), ",
+      "if r0_reduction is specified."
+    )
+  }
+  if (offspring_dist == "pois") {
+    pars_list$lambda <- (1 - r0_reduction) * pars_list$lambda
+  } else {
+    pars_list$mu <- (1 - r0_reduction) * pars_list$mu
+  }
+  return(pars_list)
+}
diff --git a/man/intvn_scale_r0.Rd b/man/intvn_scale_r0.Rd
new file mode 100644
index 00000000..28619a69
--- /dev/null
+++ b/man/intvn_scale_r0.Rd
@@ -0,0 +1,42 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/intervention.R
+\name{intvn_scale_r0}
+\alias{intvn_scale_r0}
+\title{Set up intervention for simulation}
+\usage{
+intvn_scale_r0(r0_reduction, offspring_dist, pars_list)
+}
+\arguments{
+\item{r0_reduction}{The intervention impact. A scalar between 0 and 1.
+Scales the mean of \code{offspring_dist}. \code{r0_reduction} = 0 implies
+no intervention impact and \code{r0_reduction} = 1 implies full impact.}
+
+\item{offspring_dist}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers).}
+
+\item{pars_list}{Parameter(s) for poisson or negative binomial offspring
+distribution.}
+}
+\value{
+List of the offspring distribution parameter(s) with the mean
+scaled by \code{1 - intvn_scale_r0}.
+}
+\description{
+\code{intvn_scale_r0()} is a helper for the \code{simulation_*} functions. It
+modifies the relevant arguments of the offspring distribution in order to
+mimic the impact of an intervention. In particular, it scales the mean of
+the offspring distribution. Currently, it can only handle the poisson and
+negative binomial distributions and errors when other offspring
+distributions are specified alongside \code{intvn_scale_r0}.
+}
+\details{
+\code{intvn_scale_r0()} scales the mean of the offspring distribution
+by \eqn{1 - r0\_reduction} so that the new mean is given as:
+\deqn{(1 - r0\_reduction) \times R_0,} where \eqn{R_0} is the
+mean of the poisson and negative binomial distribution.
+}
+\author{
+James M. Azam
+}

From 3cea60ce8a930360fa4d6da912e7a8407c97bfd4 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 26 Sep 2023 20:38:53 +0100
Subject: [PATCH 685/828] Incorporate intervention into simulation functions

---
 R/simulate.r                  | 72 ++++++++++++++++++++++++++++++-----
 man/simulate_summary.Rd       |  5 +++
 man/simulate_tree.Rd          |  5 +++
 man/simulate_tree_from_pop.Rd |  8 ++--
 4 files changed, 76 insertions(+), 14 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index b966cf2d..764a7e17 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,5 +1,6 @@
 #' Simulate transmission trees from an initial number of infections
 #'
+#' @inheritParams intvn_scale_r0
 #' @param nchains Number of chains to simulate.
 #' @param offspring_dist Offspring distribution: a character string
 #' corresponding to the R distribution function (e.g., "pois" for Poisson,
@@ -113,6 +114,7 @@
 #' 1186–1204. \doi{https://doi.org/10.3390/ijerph7031204}
 simulate_tree <- function(nchains, statistic = c("size", "length"),
                           offspring_dist, stat_max = Inf,
+                          r0_reduction = 0,
                           serials_dist, t0 = 0,
                           tf = Inf, ...) {
   statistic <- match.arg(statistic)
@@ -122,10 +124,29 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
   # check that offspring is properly specified
   check_offspring_valid(offspring_dist)
 
+  # Check that the r0_reduction is well specified
+  checkmate::assert_number(
+    r0_reduction,
+    lower = 0,
+    upper = 1
+  )
+
   # check that offspring function exists in base R
   roffspring_name <- paste0("r", offspring_dist)
   check_offspring_func_valid(roffspring_name)
 
+  # Gather offspring distribution parameters
+  pars <- list(...)
+
+  # Prepare interventions if specified
+  if (r0_reduction > 0) {
+    pars <- intvn_scale_r0(
+      r0_reduction = r0_reduction,
+      offspring_dist = offspring_dist,
+      pars_list = pars
+    )
+  }
+
   if (!missing(serials_dist)) {
     check_serial_valid(serials_dist)
   } else if (!missing(tf)) {
@@ -244,6 +265,7 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 #' Simulate transmission chains sizes/lengths
 #'
 #' @inheritParams simulate_tree
+#' @inheritParams intvn_scale_r0
 #' @param stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to `Inf`.
 #' @inheritSection simulate_tree Calculating chain sizes and lengths
@@ -265,6 +287,7 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 #' @export
 simulate_summary <- function(nchains, statistic = c("size", "length"),
                              offspring_dist,
+                             r0_reduction = 0,
                              stat_max = Inf, ...) {
   statistic <- match.arg(statistic)
 
@@ -273,10 +296,29 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
   # check that offspring is properly specified
   check_offspring_valid(offspring_dist)
 
+  # Check that the r0_reduction is well specified
+  checkmate::assert_number(
+    r0_reduction,
+    lower = 0,
+    upper = 1
+  )
+
   # check that offspring function exists in base R
   roffspring_name <- paste0("r", offspring_dist)
   check_offspring_func_valid(roffspring_name)
 
+  # Gather offspring distribution parameters
+  pars <- list(...)
+
+  # Prepare interventions if specified
+  if (r0_reduction > 0) {
+    pars <- intvn_scale_r0(
+      r0_reduction = r0_reduction,
+      offspring_dist = offspring_dist,
+      pars_list = pars
+    )
+  }
+
   # Initialisations
   stat_track <- rep(1, nchains) ## track length or size (depending on `stat`)
   n_offspring <- rep(1, nchains) ## current number of offspring
@@ -325,6 +367,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' population
 #'
 #' @inheritParams simulate_tree
+#' @inheritParams intvn_scale_r0
 #' @param pop The susceptible population size.
 #' @param offspring_dist Offspring distribution: a character string
 #' corresponding to the R distribution function (e.g., "pois" for Poisson,
@@ -395,7 +438,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' @export
 simulate_tree_from_pop <- function(pop,
                                    offspring_dist = c("pois", "nbinom"),
-                                   offspring_mean,
+                                   r0_reduction = 0,
                                    offspring_disp,
                                    serials_dist,
                                    initial_immune = 0,
@@ -403,15 +446,24 @@ simulate_tree_from_pop <- function(pop,
                                    tf = Inf) {
   offspring_dist <- match.arg(offspring_dist)
 
-  if (offspring_dist == "pois") {
-    if (!missing(offspring_disp)) {
-      warning(sprintf(
-        "%s %s %s",
-        "'offspring_disp' is not used for",
-        "poisson offspring distribution.",
-        "Will be ignored."
-      ))
-    }
+  # Check that the r0_reduction is well specified
+  checkmate::assert_number(
+    r0_reduction,
+    lower = 0,
+    upper = 1
+  )
+
+  # Gather offspring distribution parameters
+  pars <- list(...)
+
+  # Prepare interventions if specified
+  if (r0_reduction > 0) {
+    pars <- intvn_scale_r0(
+      r0_reduction = r0_reduction,
+      offspring_dist = offspring_dist,
+      pars_list = pars
+    )
+  }
 
     ## using a right truncated poisson distribution
     ## to avoid more cases than susceptibles
diff --git a/man/simulate_summary.Rd b/man/simulate_summary.Rd
index a453d680..8b8d6cd3 100644
--- a/man/simulate_summary.Rd
+++ b/man/simulate_summary.Rd
@@ -8,6 +8,7 @@ simulate_summary(
   nchains,
   statistic = c("size", "length"),
   offspring_dist,
+  r0_reduction = 0,
   stat_max = Inf,
   ...
 )
@@ -28,6 +29,10 @@ corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers).}
 
+\item{r0_reduction}{The intervention impact. A scalar between 0 and 1.
+Scales the mean of \code{offspring_dist}. \code{r0_reduction} = 0 implies
+no intervention impact and \code{r0_reduction} = 1 implies full impact.}
+
 \item{stat_max}{A cut off for the chain statistic (size/length) being
 computed. Results above the specified value, are set to \code{Inf}.}
 
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index 1d6722dc..947ebd00 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -9,6 +9,7 @@ simulate_tree(
   statistic = c("size", "length"),
   offspring_dist,
   stat_max = Inf,
+  r0_reduction = 0,
   serials_dist,
   t0 = 0,
   tf = Inf,
@@ -35,6 +36,10 @@ numbers).}
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
 
+\item{r0_reduction}{The intervention impact. A scalar between 0 and 1.
+Scales the mean of \code{offspring_dist}. \code{r0_reduction} = 0 implies
+no intervention impact and \code{r0_reduction} = 1 implies full impact.}
+
 \item{serials_dist}{The serial interval distribution function; the name
 of a user-defined named or anonymous function with only one argument \code{n},
 representing the number of serial intervals to generate. See details.}
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 86621422..04ce0684 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -8,7 +8,7 @@ population}
 simulate_tree_from_pop(
   pop,
   offspring_dist = c("pois", "nbinom"),
-  offspring_mean,
+  r0_reduction = 0,
   offspring_disp,
   serials_dist,
   initial_immune = 0,
@@ -24,9 +24,9 @@ corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers). Only supports "pois" and "nbinom".}
 
-\item{offspring_mean}{The average number of secondary cases for each case.
-Same as \eqn{R_0}.}
-
+\item{r0_reduction}{The intervention impact. A scalar between 0 and 1.
+Scales the mean of \code{offspring_dist}. \code{r0_reduction} = 0 implies
+no intervention impact and \code{r0_reduction} = 1 implies full impact.}
 \item{offspring_disp}{The dispersion parameter of the number of
 secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
 avoid division by 0 when calculating the size. See details and

From 37ffdacb64d2c6c18292f561f0e1090e6c439ce1 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 26 Sep 2023 20:50:44 +0100
Subject: [PATCH 686/828] Add examples of intervention simulations

---
 R/simulate.r                  | 33 +++++++++++++++++++++++++++++++++
 man/simulate_summary.Rd       | 12 ++++++++++++
 man/simulate_tree.Rd          | 13 +++++++++++++
 man/simulate_tree_from_pop.Rd |  8 ++++++--
 4 files changed, 64 insertions(+), 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 764a7e17..7aa11aae 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -98,6 +98,19 @@
 #'   serials_dist = function(x) 3,
 #'   lambda = 2
 #' )
+#'
+#' # Run model with intervention a 50% reduction in R0.
+#' chains_with_intvn <- simulate_tree(
+#'   nchains = 10,
+#'   statistic = "size",
+#'   offspring_dist = "pois",
+#'   r0_reduction = 0.5,
+#'   stat_max = 10,
+#'   serials_dist = function(x) 3,
+#'   lambda = 2
+#' )
+#'
+#' chains_with_intvn
 #' @references
 #' Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
 #' between serial interval, infectiousness profile and generation time.
@@ -284,6 +297,18 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 #'   stat_max = 10,
 #'   lambda = 2
 #' )
+#'
+#' # Run model with intervention a 50% reduction in R0.
+#' chain_summary_with_intvn <- simulate_summary(
+#'   nchains = 10,
+#'   statistic = "size",
+#'   offspring_dist = "pois",
+#'   r0_reduction = 0.5,
+#'   stat_max = 10,
+#'   lambda = 2
+#' )
+#'
+#' chain_summary_with_intvn
 #' @export
 simulate_summary <- function(nchains, statistic = c("size", "length"),
                              offspring_dist,
@@ -434,6 +459,14 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #'   offspring_mean = 0.5,
 #'   offspring_disp = 1.1,
 #'   serials_dist = function(x) 3
+#' # Simulate with negative binomial offspring with intervention
+#' simulate_tree_from_pop(
+#' pop = 100,
+#' offspring_dist = "nbinom",
+#' r0_reduction = 0.5,
+#' mu = 0.5,
+#' size = 1.1,
+#' serials_dist = function(x) 3
 #' )
 #' @export
 simulate_tree_from_pop <- function(pop,
diff --git a/man/simulate_summary.Rd b/man/simulate_summary.Rd
index 8b8d6cd3..7313b8dc 100644
--- a/man/simulate_summary.Rd
+++ b/man/simulate_summary.Rd
@@ -103,6 +103,18 @@ simulate_summary(
   stat_max = 10,
   lambda = 2
 )
+
+# Run model with intervention a 50\% reduction in R0.
+chain_summary_with_intvn <- simulate_summary(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  r0_reduction = 0.5,
+  stat_max = 10,
+  lambda = 2
+)
+
+chain_summary_with_intvn
 }
 \seealso{
 \itemize{
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index 947ebd00..a2f405c1 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -126,6 +126,19 @@ chains <- simulate_tree(
   serials_dist = function(x) 3,
   lambda = 2
 )
+
+# Run model with intervention a 50\% reduction in R0.
+chains_with_intvn <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  r0_reduction = 0.5,
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 2
+)
+
+chains_with_intvn
 }
 \references{
 Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 04ce0684..af691343 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -131,11 +131,15 @@ simulate_tree_from_pop(
 )
 
 # Simulate with negative binomial offspring
+simulate_tree_from_pop(
+# Simulate with negative binomial offspring with intervention (50\%
+reduction in R0)
 simulate_tree_from_pop(
   pop = 100,
   offspring_dist = "nbinom",
-  offspring_mean = 0.5,
-  offspring_disp = 1.1,
+  r0_reduction = 0.5,
+  mu = 0.5,
+  size = 1.1,
   serials_dist = function(x) 3
 )
 }

From 93111d3e1193f07d17a775ccc10d439b8860fe98 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 26 Sep 2023 20:54:29 +0100
Subject: [PATCH 687/828] Refactor to use do.call() with list of dot-dot-dot
 args

---
 R/simulate.r | 16 ++++++++++++++--
 1 file changed, 14 insertions(+), 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 7aa11aae..a57609e3 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -189,7 +189,13 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
   # next, simulate n chains
   while (length(sim) > 0) {
     # simulate next generation
-    next_gen <- get(roffspring_name)(n = sum(n_offspring[sim]), ...)
+    next_gen <- do.call(
+      get(roffspring_name),
+      c(
+        list(n = sum(n_offspring[sim])),
+        pars
+      )
+    )
     if (any(next_gen %% 1 > 0)) {
       stop("Offspring distribution must return integers")
     }
@@ -352,7 +358,13 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
   ## next, simulate nchains chains
   while (length(sim) > 0) {
     ## simulate next generation
-    next_gen <- get(roffspring_name)(n = sum(n_offspring[sim]), ...)
+    next_gen <- do.call(
+      get(roffspring_name),
+      c(
+        list(n = sum(n_offspring[sim])),
+        pars
+      )
+      )
     if (any(next_gen %% 1 > 0)) {
       stop("Offspring distribution must return integers")
     }

From 6863c79185212e209c764c4385f827ad20dc8a9c Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 26 Sep 2023 20:56:34 +0100
Subject: [PATCH 688/828] Format example

---
 R/simulate.r | 16 ++++++----------
 1 file changed, 6 insertions(+), 10 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index a57609e3..5cf3e420 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -466,19 +466,15 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #'
 #' # Simulate with negative binomial offspring
 #' simulate_tree_from_pop(
+#' # Simulate with negative binomial offspring with intervention (50%
+#' reduction in R0)
+#' simulate_tree_from_pop(
 #'   pop = 100,
 #'   offspring_dist = "nbinom",
-#'   offspring_mean = 0.5,
-#'   offspring_disp = 1.1,
+#'   r0_reduction = 0.5,
+#'   mu = 0.5,
+#'   size = 1.1,
 #'   serials_dist = function(x) 3
-#' # Simulate with negative binomial offspring with intervention
-#' simulate_tree_from_pop(
-#' pop = 100,
-#' offspring_dist = "nbinom",
-#' r0_reduction = 0.5,
-#' mu = 0.5,
-#' size = 1.1,
-#' serials_dist = function(x) 3
 #' )
 #' @export
 simulate_tree_from_pop <- function(pop,

From 99e4502fbaefedd7224f0e36e857c7498364a2db Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 26 Sep 2023 20:58:25 +0100
Subject: [PATCH 689/828] Restructure to use dot-dot-dot to pass args to pois
 and binom

---
 R/simulate.r                    | 43 ++++++++++++++---------------
 man/simulate_tree_from_pop.Rd   | 26 ++++++++++--------
 tests/testthat/test-epichains.R | 48 ++++++++++++++++-----------------
 tests/testthat/test-simulate.R  | 25 +++++------------
 vignettes/epichains.Rmd         |  6 ++---
 5 files changed, 70 insertions(+), 78 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 5cf3e420..4d76d593 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -410,12 +410,6 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' corresponding to the R distribution function (e.g., "pois" for Poisson,
 #' where \code{\link{rpois}} is the R function to generate Poisson random
 #' numbers). Only supports "pois" and "nbinom".
-#' @param offspring_mean The average number of secondary cases for each case.
-#' Same as \eqn{R_0}.
-#' @param offspring_disp The dispersion parameter of the number of
-#' secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
-#' avoid division by 0 when calculating the size. See details and
-#'  \code{?rnbinom} for details on the parameterisation in Ecology.
 #' @param initial_immune The number of initial immunes in the population.
 #' Must be less than `pop` - 1.
 #' @param t0 Start time; Defaults to 0.
@@ -432,15 +426,15 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' than susceptibles at any point.
 #'
 #' The poisson model has mean, lambda, parametrised as:
-#' \deqn{{\sf lambda} = \dfrac{{\sf offspring\_mean} \times ({\sf pop} -
+#' \deqn{{\sf lambda} = \dfrac{{\sf lambda} \times ({\sf pop} -
 #' {\sf initial\_immune} - 1)}{{\sf pop}}}
 #'
 #' The negative binomial model, has mean, mu, parametrised as:
-#' \deqn{{\sf mu} = \dfrac{{\sf offspring\_mean} \times ({\sf pop} -
+#' \deqn{{\sf mu} = \dfrac{{\sf mu} \times ({\sf pop} -
 #' {\sf initial\_immune} - 1)}{{\sf pop}},}
 #' and dispersion, size, parametrised as:
-#' \deqn{{\sf size} = \dfrac{{\sf mu}}{{\sf offspring\_disp} - 1}.}
-#' This is why `offspring_disp` must be greater than 1.
+#' \deqn{{\sf size} = \dfrac{{\sf mu}}{{\sf size} - 1}.}
+#' This is why `size` must be greater than 1.
 #'
 #' # Differences with `simulate_tree()`
 #' `simulate_tree_from_pop()` has a couple of key differences from
@@ -460,12 +454,18 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' simulate_tree_from_pop(
 #'   pop = 100,
 #'   offspring_dist = "pois",
-#'   offspring_mean = 0.5,
+#'   lambda = 0.5,
 #'   serials_dist = function(x) 3
 #' )
 #'
 #' # Simulate with negative binomial offspring
 #' simulate_tree_from_pop(
+#' pop = 100, offspring_dist = "nbinom",
+#' mu = 0.5,
+#' size = 1.1,
+#' serials_dist = function(x) 3
+#' )
+#'
 #' # Simulate with negative binomial offspring with intervention (50%
 #' reduction in R0)
 #' simulate_tree_from_pop(
@@ -480,11 +480,11 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 simulate_tree_from_pop <- function(pop,
                                    offspring_dist = c("pois", "nbinom"),
                                    r0_reduction = 0,
-                                   offspring_disp,
                                    serials_dist,
                                    initial_immune = 0,
                                    t0 = 0,
-                                   tf = Inf) {
+                                   tf = Inf,
+                                   ...) {
   offspring_dist <- match.arg(offspring_dist)
 
   # Check that the r0_reduction is well specified
@@ -506,24 +506,25 @@ simulate_tree_from_pop <- function(pop,
     )
   }
 
-    ## using a right truncated poisson distribution
+  if (offspring_dist == "pois") {
+    ## Use a right truncated poisson distribution
     ## to avoid more cases than susceptibles
     offspring_func <- function(n, susc) {
       truncdist::rtrunc(
         n,
         spec = "pois",
-        lambda = offspring_mean * susc / pop,
+        lambda = pars$lambda * susc / pop,
         b = susc
       )
     }
   } else if (offspring_dist == "nbinom") {
-    if (missing(offspring_disp)) {
-      stop(sprintf("%s", "'offspring_disp' must be specified."))
-    } else if (offspring_disp <= 1) { ## dispersion coefficient
+    if (is.null(pars$size)) {
+      stop(sprintf("%s", "'size' must be specified."))
+    } else if (pars$size <= 1) { ## dispersion coefficient
       stop(sprintf(
         "%s %s %s",
         "Offspring distribution 'nbinom' requires",
-        "argument 'offspring_disp' > 1.",
+        "argument 'size' > 1.",
         "Use 'pois' if there is no overdispersion."
       ))
     }
@@ -531,8 +532,8 @@ simulate_tree_from_pop <- function(pop,
     offspring_func <- function(n, susc) {
       ## get distribution params from mean and dispersion
       ## see ?rnbinom for parameter definition
-      new_mn <- offspring_mean * susc / pop ## apply susceptibility
-      size <- new_mn / (offspring_disp - 1)
+      new_mn <- pars$mu * susc / pop ## apply susceptibility
+      size <- new_mn / (pars$size - 1)
 
       ## using a right truncated nbinom distribution
       ## to avoid more cases than susceptibles
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index af691343..26e8230e 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -9,11 +9,11 @@ simulate_tree_from_pop(
   pop,
   offspring_dist = c("pois", "nbinom"),
   r0_reduction = 0,
-  offspring_disp,
   serials_dist,
   initial_immune = 0,
   t0 = 0,
-  tf = Inf
+  tf = Inf,
+  ...
 )
 }
 \arguments{
@@ -27,10 +27,6 @@ numbers). Only supports "pois" and "nbinom".}
 \item{r0_reduction}{The intervention impact. A scalar between 0 and 1.
 Scales the mean of \code{offspring_dist}. \code{r0_reduction} = 0 implies
 no intervention impact and \code{r0_reduction} = 1 implies full impact.}
-\item{offspring_disp}{The dispersion parameter of the number of
-secondary cases. Ignored if \code{offspring == "pois"}. Must be > 1 to
-avoid division by 0 when calculating the size. See details and
-\code{?rnbinom} for details on the parameterisation in Ecology.}
 
 \item{serials_dist}{The serial interval distribution function; the name
 of a user-defined named or anonymous function with only one argument \code{n},
@@ -42,6 +38,8 @@ Must be less than \code{pop} - 1.}
 \item{t0}{Start time; Defaults to 0.}
 
 \item{tf}{End time; Defaults to \code{Inf}.}
+
+\item{...}{Parameters of the offspring distribution as required by R.}
 }
 \value{
 An \verb{<epichains>} object, which is basically a \verb{<data.frame>} with
@@ -60,15 +58,15 @@ distributions are used to avoid the situation where there are more cases
 than susceptibles at any point.
 
 The poisson model has mean, lambda, parametrised as:
-\deqn{{\sf lambda} = \dfrac{{\sf offspring\_mean} \times ({\sf pop} -
+\deqn{{\sf lambda} = \dfrac{{\sf lambda} \times ({\sf pop} -
 {\sf initial\_immune} - 1)}{{\sf pop}}}
 
 The negative binomial model, has mean, mu, parametrised as:
-\deqn{{\sf mu} = \dfrac{{\sf offspring\_mean} \times ({\sf pop} -
+\deqn{{\sf mu} = \dfrac{{\sf mu} \times ({\sf pop} -
 {\sf initial\_immune} - 1)}{{\sf pop}},}
 and dispersion, size, parametrised as:
-\deqn{{\sf size} = \dfrac{{\sf mu}}{{\sf offspring\_disp} - 1}.}
-This is why \code{offspring_disp} must be greater than 1.
+\deqn{{\sf size} = \dfrac{{\sf mu}}{{\sf size} - 1}.}
+This is why \code{size} must be greater than 1.
 }
 
 \section{Differences with \code{simulate_tree()}}{
@@ -126,12 +124,18 @@ where \code{...} are the other arguments to \verb{simulate_*()}.
 simulate_tree_from_pop(
   pop = 100,
   offspring_dist = "pois",
-  offspring_mean = 0.5,
+  lambda = 0.5,
   serials_dist = function(x) 3
 )
 
 # Simulate with negative binomial offspring
 simulate_tree_from_pop(
+pop = 100, offspring_dist = "nbinom",
+mu = 0.5,
+size = 1.1,
+serials_dist = function(x) 3
+)
+
 # Simulate with negative binomial offspring with intervention (50\%
 reduction in R0)
 simulate_tree_from_pop(
diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index 878d22ea..ed80ccad 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -9,15 +9,15 @@ test_that("Simulators return epichains objects", {
   susc_outbreak_raw <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "pois",
-    offspring_mean = 0.9,
+    lambda = 0.9,
     serials_dist = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "nbinom",
-    offspring_mean = 1,
-    offspring_disp = 1.1,
+    mu = 1,
+    size = 1.1,
     serials_dist = serial_func
   )
   #' Simulate a tree of infections without serials
@@ -72,15 +72,15 @@ test_that("print.epichains works for simulation functions", {
   susc_outbreak_raw <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "pois",
-    offspring_mean = 0.9,
+    lambda = 0.9,
     serials_dist = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "nbinom",
-    offspring_mean = 1,
-    offspring_disp = 1.1,
+    mu = 1,
+    size = 1.1,
     serials_dist = serial_func
   )
   #' Simulate a tree of infections without serials
@@ -120,15 +120,15 @@ test_that("summary.epichains works as expected", {
   susc_outbreak_raw <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "pois",
-    offspring_mean = 0.9,
+    lambda = 0.9,
     serials_dist = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "nbinom",
-    offspring_mean = 1,
-    offspring_disp = 1.1,
+    mu = 1,
+    size = 1.1,
     serials_dist = serial_func
   )
   #' Simulate a tree of infections without serials
@@ -226,15 +226,15 @@ test_that("validate_epichains works", {
   susc_outbreak_raw <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "pois",
-    offspring_mean = 0.9,
+    lambda = 0.9,
     serials_dist = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "nbinom",
-    offspring_mean = 1,
-    offspring_disp = 1.1,
+    mu = 1,
+    size = 1.1,
     serials_dist = serial_func
   )
   #' Simulate a tree of infections without serials
@@ -288,15 +288,15 @@ test_that("is_chains_tree works", {
   susc_outbreak_raw <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "pois",
-    offspring_mean = 0.9,
+    lambda = 0.9,
     serials_dist = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "nbinom",
-    offspring_mean = 1,
-    offspring_disp = 1.1,
+    mu = 1,
+    size = 1.1,
     serials_dist = serial_func
   )
   #' Simulate a tree of infections without serials
@@ -346,15 +346,15 @@ test_that("is_chains_summary works", {
   susc_outbreak_raw <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "pois",
-    offspring_mean = 0.9,
+    lambda = 0.9,
     serials_dist = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "nbinom",
-    offspring_mean = 1,
-    offspring_disp = 1.1,
+    mu = 1,
+    size = 1.1,
     serials_dist = serial_func
   )
   #' Simulate a tree of infections without serials
@@ -495,15 +495,15 @@ test_that("head and tail print output as expected", {
   susc_outbreak_raw <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "pois",
-    offspring_mean = 0.9,
+    lambda = 0.9,
     serials_dist = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "nbinom",
-    offspring_mean = 1,
-    offspring_disp = 1.1,
+    mu = 1,
+    size = 1.1,
     serials_dist = serial_func
   )
   #' Simulate a tree of infections without serials
@@ -538,15 +538,15 @@ test_that("head and tail return data.frames", {
   susc_outbreak_raw <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "pois",
-    offspring_mean = 0.9,
+    lambda = 0.9,
     serials_dist = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "nbinom",
-    offspring_mean = 1,
-    offspring_disp = 1.1,
+    mu = 1,
+    size = 1.1,
     serials_dist = serial_func
   )
   #' Simulate a tree of infections without serials
diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index 4f454712..bcabdcf8 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -9,15 +9,15 @@ test_that("Simulators work", {
   susc_outbreak_raw <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "pois",
-    offspring_mean = 0.9,
+    lambda = 0.9,
     serials_dist = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "nbinom",
-    offspring_mean = 1,
-    offspring_disp = 1.1,
+    mu = 1,
+    size = 1.1,
     serials_dist = serial_func
   )
   #' Simulate a tree of infections without serials
@@ -195,8 +195,8 @@ test_that("simulate_tree_from_pop throws errors", {
     simulate_tree_from_pop(
       pop = 100,
       offspring_dist = "nbinom",
-      offspring_mean = 0.5,
-      offspring_disp = 0.9,
+      mu = 0.5,
+      size = 0.9,
       serials_dist = serial_func
     ),
     "> 1"
@@ -222,19 +222,6 @@ test_that("simulate_tree_from_pop throws errors", {
   )
 })
 
-test_that("simulate_tree_from_pop throws warnings", {
-  expect_warning(
-    simulate_tree_from_pop(
-      pop = 100,
-      offspring_dist = "pois",
-      offspring_mean = 3,
-      offspring_disp = 1,
-      serials_dist = serial_func
-    ),
-    "not used for poisson offspring"
-  )
-})
-
 test_that("simulate_tree is numerically correct", {
   set.seed(12)
   #' Simulate a tree of infections without serials
@@ -313,7 +300,7 @@ test_that("simulate_tree_from_pop is numerically correct", {
   susc_outbreak_raw <- simulate_tree_from_pop(
     pop = 100,
     offspring_dist = "pois",
-    offspring_mean = 0.9,
+    lambda = 0.9,
     serials_dist = serial_func
   )
   #' Summarise the results
diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 38d4503d..f33411c3 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -90,7 +90,7 @@ summary_sim # print the output
 tree_from_pop_pois <- simulate_tree_from_pop(
   pop = 1000,
   offspring_dist = "pois",
-  offspring_mean = 0.5,
+  lambda = 0.5,
   serials_dist = function(x) 3
 )
 
@@ -100,8 +100,8 @@ tree_from_pop_pois # print the output
 tree_from_pop_nbinom <- simulate_tree_from_pop(
   pop = 1000,
   offspring_dist = "nbinom",
-  offspring_mean = 0.5,
-  offspring_disp = 1.1,
+  mu = 0.5,
+  size = 1.1,
   serials_dist = function(x) 3
 )
 

From a744d92535527cba95eef3336a39b89597fb2639 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 26 Sep 2023 21:04:27 +0100
Subject: [PATCH 690/828] Linting

---
 R/simulate.r                  | 4 ++--
 man/simulate_tree_from_pop.Rd | 2 +-
 2 files changed, 3 insertions(+), 3 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 4d76d593..6d6089c4 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -364,7 +364,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
         list(n = sum(n_offspring[sim])),
         pars
       )
-      )
+    )
     if (any(next_gen %% 1 > 0)) {
       stop("Offspring distribution must return integers")
     }
@@ -467,7 +467,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' )
 #'
 #' # Simulate with negative binomial offspring with intervention (50%
-#' reduction in R0)
+#' # reduction in R0)
 #' simulate_tree_from_pop(
 #'   pop = 100,
 #'   offspring_dist = "nbinom",
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 26e8230e..05894243 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -137,7 +137,7 @@ serials_dist = function(x) 3
 )
 
 # Simulate with negative binomial offspring with intervention (50\%
-reduction in R0)
+# reduction in R0)
 simulate_tree_from_pop(
   pop = 100,
   offspring_dist = "nbinom",

From b3f718e14ca1b8bfb03b22127df5bcb11b73d0a5 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 27 Sep 2023 17:00:40 +0100
Subject: [PATCH 691/828] Add tests of simulations with intervention

---
 tests/testthat/test-simulate.R | 156 +++++++++++++++++++++++++++++++++
 1 file changed, 156 insertions(+)

diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index bcabdcf8..cb8e4790 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -20,6 +20,15 @@ test_that("Simulators work", {
     size = 1.1,
     serials_dist = serial_func
   )
+  #' Simulate an outbreak from a susceptible population (pois) with
+  #' 50% R0 reduction
+  susc_outbreak_raw_intvn <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "pois",
+    lambda = 1.5,
+    serials_dist = serial_func,
+    r0_reduction = 0.5
+  )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
     nchains = 2,
@@ -36,6 +45,15 @@ test_that("Simulators work", {
     serials_dist = function(x) 3,
     lambda = 2
   )
+  #' Simulate a tree of infections without serials and with 50% reduction
+  #' in R0
+  tree_sim_raw_intvn <- simulate_tree(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9,
+    r0_reduction = 0.5
+  )
   #' Simulate chain statistics
   chain_summary_raw <- simulate_summary(
     nchains = 2,
@@ -43,11 +61,23 @@ test_that("Simulators work", {
     statistic = "length",
     lambda = 0.9
   )
+  #' Simulate chain statistics and with a 50% reduction in R0
+  chain_summary_raw_intvn <- simulate_summary(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9,
+    r0_reduction = 0.5
+  )
   #' Expectations
   expect_length(
     chain_summary_raw,
     2
   )
+  expect_length(
+    chain_summary_raw_intvn,
+    2
+  )
   expect_gte(
     nrow(tree_sim_raw),
     2
@@ -56,6 +86,10 @@ test_that("Simulators work", {
     nrow(tree_sim_raw2),
     2
   )
+  expect_gte(
+    nrow(tree_sim_raw2),
+    5
+  )
   expect_gte(
     nrow(susc_outbreak_raw),
     1
@@ -64,6 +98,10 @@ test_that("Simulators work", {
     nrow(susc_outbreak_raw2),
     1
   )
+  expect_gte(
+    nrow(susc_outbreak_raw_intvn),
+    1
+  )
   expect_true(
     all(
       simulate_tree(
@@ -138,6 +176,17 @@ test_that("simulate_tree throws errors", {
     ),
     "must be specified"
   )
+  expect_error(
+    simulate_tree(
+      nchains = 2,
+      offspring_dist = "binom",
+      statistic = "length",
+      size = 1,
+      prob = 0.5,
+      r0_reduction = 0.5
+    ),
+    "must be one of"
+  )
 })
 
 test_that("simulate_summary throws errors", {
@@ -179,6 +228,17 @@ test_that("simulate_summary throws errors", {
     ),
     "character string"
   )
+  expect_error(
+    simulate_summary(
+      nchains = 2,
+      offspring_dist = "binom",
+      statistic = "length",
+      size = 1,
+      prob = 0.5,
+      r0_reduction = 0.5
+    ),
+    "must be one of"
+  )
 })
 
 test_that("simulate_tree_from_pop throws errors", {
@@ -231,8 +291,18 @@ test_that("simulate_tree is numerically correct", {
     statistic = "length",
     lambda = 0.9
   )
+  #' Simulate a tree of infections without serials and with 50% reduction
+  #' in R0
+  tree_sim_raw_intvn <- simulate_tree(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9,
+    r0_reduction = 0.5
+  )
   #' summarise the results
   tree_sim_summary <- summary(tree_sim_raw)
+  tree_sim_intvn_summary <- summary(tree_sim_raw_intvn)
   #' Expectations
   expect_identical(
     tree_sim_summary$chains_ran,
@@ -262,6 +332,35 @@ test_that("simulate_tree is numerically correct", {
     tree_sim_raw$generation,
     c(1L, 1L, 2L, 2L, 3L, 3L, 3L)
   )
+  #' Expectations for intervention simulation
+  expect_identical(
+    tree_sim_summary$chains_ran,
+    2.0
+  )
+  expect_identical(
+    tree_sim_summary$unique_ancestors,
+    2L
+  )
+  expect_identical(
+    tree_sim_summary$max_generation,
+    3L
+  )
+  expect_identical(
+    tree_sim_raw$chain_id,
+    c(1L, 2L, 2L, 2L, 2L, 2L, 2L)
+  )
+  expect_identical(
+    tree_sim_raw$sim_id,
+    c(1, 1, 2, 3, 4, 5, 6)
+  )
+  expect_identical(
+    tree_sim_raw$ancestor,
+    c(NA, NA, 1, 1, 2, 2, 2)
+  )
+  expect_identical(
+    tree_sim_raw$generation,
+    c(1L, 1L, 2L, 2L, 3L, 3L, 3L)
+  )
 })
 
 test_that("simulate_summary is numerically correct", {
@@ -273,8 +372,17 @@ test_that("simulate_summary is numerically correct", {
     statistic = "length",
     lambda = 0.9
   )
+  #' Simulate chain statistics and with a 50% reduction in R0
+  chain_summary_raw_intvn <- simulate_summary(
+    nchains = 2,
+    offspring_dist = "pois",
+    statistic = "length",
+    lambda = 0.9,
+    r0_reduction = 0.5
+  )
   #' Summarise the results
   chain_summary_summaries <- summary(chain_summary_raw)
+  chain_summary_intvn_summaries <- summary(chain_summary_raw_intvn)
   #' Expectations
   expect_identical(
     chain_summary_summaries$chain_ran,
@@ -292,6 +400,22 @@ test_that("simulate_summary is numerically correct", {
     as.vector(chain_summary_raw),
     c(1.00, 3.00)
   )
+  expect_identical(
+    chain_summary_intvn_summaries$chain_ran,
+    2.00
+  )
+  expect_identical(
+    chain_summary_intvn_summaries$max_chain_stat,
+    2.00
+  )
+  expect_identical(
+    chain_summary_intvn_summaries$min_chain_stat,
+    1.00
+  )
+  expect_identical(
+    as.vector(chain_summary_raw_intvn),
+    c(2.00, 1.00)
+  )
 })
 
 test_that("simulate_tree_from_pop is numerically correct", {
@@ -303,8 +427,19 @@ test_that("simulate_tree_from_pop is numerically correct", {
     lambda = 0.9,
     serials_dist = serial_func
   )
+  #' Simulate an outbreak from a susceptible population (pois) with
+  #' 50% R0 reduction
+  set.seed(7)
+  susc_outbreak_raw_intvn <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "pois",
+    lambda = 1.5,
+    serials_dist = serial_func,
+    r0_reduction = 0.5
+  )
   #' Summarise the results
   susc_outbreak_summary <- summary(susc_outbreak_raw)
+  susc_outbreak_summary_intvn <- summary(susc_outbreak_raw_intvn)
   #' Expectations
   expect_identical(
     susc_outbreak_summary$unique_ancestors,
@@ -335,4 +470,25 @@ test_that("simulate_tree_from_pop is numerically correct", {
     susc_outbreak_raw$time,
     0.00
   )
+  #' Expectations for intervention simulation
+  expect_identical(
+    susc_outbreak_summary_intvn$unique_ancestors,
+    12L
+  )
+  expect_identical(
+    round(
+      susc_outbreak_summary_intvn$max_time,
+      1
+    ),
+    72.1
+  )
+  expect_identical(
+    susc_outbreak_summary_intvn$max_generation,
+    10L
+  )
+  expect_null(susc_outbreak_summary_intvn$chains_ran)
+  expect_identical(
+    sum(aggregate(susc_outbreak_raw_intvn, "time")$cases),
+    20L
+  )
 })

From fbbcbaab0687dce8090cfd89b4c1579c2b359fa1 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 27 Sep 2023 17:19:36 +0100
Subject: [PATCH 692/828] Add tests to cover nbinom offspring intervention case

---
 tests/testthat/test-simulate.R | 39 ++++++++++++++++++++++++++++++++++
 1 file changed, 39 insertions(+)

diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index cb8e4790..465ed9e6 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -29,6 +29,16 @@ test_that("Simulators work", {
     serials_dist = serial_func,
     r0_reduction = 0.5
   )
+  #' Simulate an outbreak from a susceptible population (nbinom) with
+  #' 50% R0 reduction
+  susc_outbreak_raw_intvn2 <- simulate_tree_from_pop(
+    pop = 100,
+    offspring_dist = "nbinom",
+    mu = 1.5,
+    size = 1.1,
+    serials_dist = serial_func,
+    r0_reduction = 0.5
+  )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
     nchains = 2,
@@ -54,6 +64,16 @@ test_that("Simulators work", {
     lambda = 0.9,
     r0_reduction = 0.5
   )
+  #' Simulate a tree of infections with nbinom offspring and with 50% reduction
+  #' in R0
+  tree_sim_raw_intvn2 <- simulate_tree(
+    nchains = 2,
+    offspring_dist = "nbinom",
+    statistic = "length",
+    mu = 0.9,
+    size = 1.1,
+    r0_reduction = 0.5
+  )
   #' Simulate chain statistics
   chain_summary_raw <- simulate_summary(
     nchains = 2,
@@ -69,6 +89,15 @@ test_that("Simulators work", {
     lambda = 0.9,
     r0_reduction = 0.5
   )
+  #' Simulate chain statistics with nbinom offspring and with a 50% reduction in R0
+  chain_summary_raw_intvn2 <- simulate_summary(
+    nchains = 2,
+    offspring_dist = "nbinom",
+    statistic = "length",
+    mu = 1.9,
+    size = 1.1,
+    r0_reduction = 0.5
+  )
   #' Expectations
   expect_length(
     chain_summary_raw,
@@ -78,6 +107,10 @@ test_that("Simulators work", {
     chain_summary_raw_intvn,
     2
   )
+  expect_length(
+    chain_summary_raw_intvn2,
+    2
+  )
   expect_gte(
     nrow(tree_sim_raw),
     2
@@ -89,6 +122,9 @@ test_that("Simulators work", {
   expect_gte(
     nrow(tree_sim_raw2),
     5
+  expect_identical(
+    nrow(tree_sim_raw_intvn2),
+    2L
   )
   expect_gte(
     nrow(susc_outbreak_raw),
@@ -101,6 +137,9 @@ test_that("Simulators work", {
   expect_gte(
     nrow(susc_outbreak_raw_intvn),
     1
+  expect_identical(
+    nrow(susc_outbreak_raw_intvn2),
+    1L
   )
   expect_true(
     all(

From 0d6602d080dfecba97a20eea3858192d8401ee45 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 27 Sep 2023 17:19:56 +0100
Subject: [PATCH 693/828] Fixed some tests

---
 tests/testthat/test-simulate.R | 12 +++++++-----
 1 file changed, 7 insertions(+), 5 deletions(-)

diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index 465ed9e6..a284c390 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -119,9 +119,10 @@ test_that("Simulators work", {
     nrow(tree_sim_raw2),
     2
   )
-  expect_gte(
-    nrow(tree_sim_raw2),
-    5
+  expect_identical(
+    nrow(tree_sim_raw_intvn),
+    3L
+  )
   expect_identical(
     nrow(tree_sim_raw_intvn2),
     2L
@@ -134,9 +135,10 @@ test_that("Simulators work", {
     nrow(susc_outbreak_raw2),
     1
   )
-  expect_gte(
+  expect_identical(
     nrow(susc_outbreak_raw_intvn),
-    1
+    2L
+  )
   expect_identical(
     nrow(susc_outbreak_raw_intvn2),
     1L

From 8511f5e43fdefcbf2563365a059af5bc1ea37ead Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 27 Sep 2023 17:30:01 +0100
Subject: [PATCH 694/828] Linting

---
 tests/testthat/test-simulate.R | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index a284c390..e87007c1 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -89,7 +89,8 @@ test_that("Simulators work", {
     lambda = 0.9,
     r0_reduction = 0.5
   )
-  #' Simulate chain statistics with nbinom offspring and with a 50% reduction in R0
+  #' Simulate chain statistics with nbinom offspring and with a 50% reduction
+  #' in R0
   chain_summary_raw_intvn2 <- simulate_summary(
     nchains = 2,
     offspring_dist = "nbinom",

From 4d6e86572874bdb1f65220b91cb9b3b9b0610d96 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 2 Oct 2023 12:42:06 +0100
Subject: [PATCH 695/828] Don't expose intervention function

---
 NAMESPACE             | 1 -
 R/intervention.R      | 2 +-
 man/intvn_scale_r0.Rd | 1 +
 3 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/NAMESPACE b/NAMESPACE
index c0dfa59e..ab92359f 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -7,7 +7,6 @@ S3method(print,epichains)
 S3method(summary,epichains)
 S3method(tail,epichains)
 export(dborel)
-export(intvn_scale_r0)
 export(is_chains_summary)
 export(is_chains_tree)
 export(is_epichains)
diff --git a/R/intervention.R b/R/intervention.R
index 3d78e6f2..d116f119 100644
--- a/R/intervention.R
+++ b/R/intervention.R
@@ -23,7 +23,7 @@
 #' mean of the poisson and negative binomial distribution.
 #'
 #' @author James M. Azam
-#' @export
+#' @keywords internal
 intvn_scale_r0 <- function(r0_reduction, offspring_dist, pars_list) {
   # Intervention only works for pois and nbinom
   if (!offspring_dist %in% c("pois", "nbinom")) {
diff --git a/man/intvn_scale_r0.Rd b/man/intvn_scale_r0.Rd
index 28619a69..c2410212 100644
--- a/man/intvn_scale_r0.Rd
+++ b/man/intvn_scale_r0.Rd
@@ -40,3 +40,4 @@ mean of the poisson and negative binomial distribution.
 \author{
 James M. Azam
 }
+\keyword{internal}

From 458f844ce0b0724902fe13999d5fb42a4143e9ec Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 2 Oct 2023 14:47:56 +0100
Subject: [PATCH 696/828] Rename R0_reduction argument to intvn_mean_reduction

---
 R/intervention.R               | 15 ++++++------
 R/simulate.r                   | 42 +++++++++++++++++-----------------
 man/simulate_summary.Rd        | 11 +++++----
 man/simulate_tree.Rd           | 11 +++++----
 man/simulate_tree_from_pop.Rd  | 11 +++++----
 tests/testthat/test-simulate.R | 22 +++++++++---------
 6 files changed, 58 insertions(+), 54 deletions(-)

diff --git a/R/intervention.R b/R/intervention.R
index d116f119..3eb64689 100644
--- a/R/intervention.R
+++ b/R/intervention.R
@@ -9,13 +9,14 @@
 #' distributions are specified alongside `intvn_scale_r0`.
 #'
 #' @inheritParams simulate_tree
-#' @param r0_reduction The intervention impact. A scalar between 0 and 1.
-#' Scales the mean of `offspring_dist`. `r0_reduction` = 0 implies
-#' no intervention impact and `r0_reduction` = 1 implies full impact.
+#' @param intvn_mean_reduction Amount of reduction in the mean. A scalar
+#' between 0 and 1.
+#' It scales the mean of `offspring_dist`. `intvn_mean_reduction` = 0 implies
+#' no intervention impact and `intvn_mean_reduction` = 1 implies full impact.
 #' @param pars_list Parameter(s) for poisson or negative binomial offspring
 #' distribution.
 #' @return List of the offspring distribution parameter(s) with the mean
-#' scaled by \code{1 - intvn_scale_r0}.
+#' scaled by \code{1 - intvn_mean_reduction}.
 #' @details
 #' `intvn_scale_r0()` scales the mean of the offspring distribution
 #' by \eqn{1 - r0\_reduction} so that the new mean is given as:
@@ -29,13 +30,13 @@ intvn_scale_r0 <- function(r0_reduction, offspring_dist, pars_list) {
   if (!offspring_dist %in% c("pois", "nbinom")) {
     stop(
       "`offspring_dist` must be one of c(\"pois\", \"nbinom\"), ",
-      "if r0_reduction is specified."
+      "if intvn_mean_reduction is specified."
     )
   }
   if (offspring_dist == "pois") {
-    pars_list$lambda <- (1 - r0_reduction) * pars_list$lambda
+    pars_list$lambda <- (1 - intvn_mean_reduction) * pars_list$lambda
   } else {
-    pars_list$mu <- (1 - r0_reduction) * pars_list$mu
+    pars_list$mu <- (1 - intvn_mean_reduction) * pars_list$mu
   }
   return(pars_list)
 }
diff --git a/R/simulate.r b/R/simulate.r
index 6d6089c4..9887d729 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,6 +1,6 @@
 #' Simulate transmission trees from an initial number of infections
 #'
-#' @inheritParams intvn_scale_r0
+#' @inheritParams intvn_reduce_mean
 #' @param nchains Number of chains to simulate.
 #' @param offspring_dist Offspring distribution: a character string
 #' corresponding to the R distribution function (e.g., "pois" for Poisson,
@@ -104,7 +104,7 @@
 #'   nchains = 10,
 #'   statistic = "size",
 #'   offspring_dist = "pois",
-#'   r0_reduction = 0.5,
+#'   intvn_mean_reduction = 0.5,
 #'   stat_max = 10,
 #'   serials_dist = function(x) 3,
 #'   lambda = 2
@@ -127,7 +127,7 @@
 #' 1186–1204. \doi{https://doi.org/10.3390/ijerph7031204}
 simulate_tree <- function(nchains, statistic = c("size", "length"),
                           offspring_dist, stat_max = Inf,
-                          r0_reduction = 0,
+                          intvn_mean_reduction = 0,
                           serials_dist, t0 = 0,
                           tf = Inf, ...) {
   statistic <- match.arg(statistic)
@@ -137,9 +137,9 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
   # check that offspring is properly specified
   check_offspring_valid(offspring_dist)
 
-  # Check that the r0_reduction is well specified
+  # Check that the intvn_mean_reduction is well specified
   checkmate::assert_number(
-    r0_reduction,
+    intvn_mean_reduction,
     lower = 0,
     upper = 1
   )
@@ -152,9 +152,9 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
   pars <- list(...)
 
   # Prepare interventions if specified
-  if (r0_reduction > 0) {
+  if (intvn_mean_reduction > 0) {
     pars <- intvn_scale_r0(
-      r0_reduction = r0_reduction,
+      intvn_mean_reduction = intvn_mean_reduction,
       offspring_dist = offspring_dist,
       pars_list = pars
     )
@@ -284,7 +284,7 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 #' Simulate transmission chains sizes/lengths
 #'
 #' @inheritParams simulate_tree
-#' @inheritParams intvn_scale_r0
+#' @inheritParams intvn_reduce_mean
 #' @param stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to `Inf`.
 #' @inheritSection simulate_tree Calculating chain sizes and lengths
@@ -309,7 +309,7 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 #'   nchains = 10,
 #'   statistic = "size",
 #'   offspring_dist = "pois",
-#'   r0_reduction = 0.5,
+#'   intvn_mean_reduction = 0.5,
 #'   stat_max = 10,
 #'   lambda = 2
 #' )
@@ -318,7 +318,7 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 #' @export
 simulate_summary <- function(nchains, statistic = c("size", "length"),
                              offspring_dist,
-                             r0_reduction = 0,
+                             intvn_mean_reduction = 0,
                              stat_max = Inf, ...) {
   statistic <- match.arg(statistic)
 
@@ -327,9 +327,9 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
   # check that offspring is properly specified
   check_offspring_valid(offspring_dist)
 
-  # Check that the r0_reduction is well specified
+  # Check that the intvn_mean_reduction is well specified
   checkmate::assert_number(
-    r0_reduction,
+    intvn_mean_reduction,
     lower = 0,
     upper = 1
   )
@@ -342,9 +342,9 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
   pars <- list(...)
 
   # Prepare interventions if specified
-  if (r0_reduction > 0) {
+  if (intvn_mean_reduction > 0) {
     pars <- intvn_scale_r0(
-      r0_reduction = r0_reduction,
+      intvn_mean_reduction = intvn_mean_reduction,
       offspring_dist = offspring_dist,
       pars_list = pars
     )
@@ -404,7 +404,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' population
 #'
 #' @inheritParams simulate_tree
-#' @inheritParams intvn_scale_r0
+#' @inheritParams intvn_mean_reduction
 #' @param pop The susceptible population size.
 #' @param offspring_dist Offspring distribution: a character string
 #' corresponding to the R distribution function (e.g., "pois" for Poisson,
@@ -471,7 +471,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' simulate_tree_from_pop(
 #'   pop = 100,
 #'   offspring_dist = "nbinom",
-#'   r0_reduction = 0.5,
+#'   intvn_mean_reduction = 0.5,
 #'   mu = 0.5,
 #'   size = 1.1,
 #'   serials_dist = function(x) 3
@@ -479,7 +479,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' @export
 simulate_tree_from_pop <- function(pop,
                                    offspring_dist = c("pois", "nbinom"),
-                                   r0_reduction = 0,
+                                   intvn_mean_reduction = 0,
                                    serials_dist,
                                    initial_immune = 0,
                                    t0 = 0,
@@ -487,9 +487,9 @@ simulate_tree_from_pop <- function(pop,
                                    ...) {
   offspring_dist <- match.arg(offspring_dist)
 
-  # Check that the r0_reduction is well specified
+  # Check that the intvn_mean_reduction is well specified
   checkmate::assert_number(
-    r0_reduction,
+    intvn_mean_reduction,
     lower = 0,
     upper = 1
   )
@@ -498,9 +498,9 @@ simulate_tree_from_pop <- function(pop,
   pars <- list(...)
 
   # Prepare interventions if specified
-  if (r0_reduction > 0) {
+  if (intvn_mean_reduction > 0) {
     pars <- intvn_scale_r0(
-      r0_reduction = r0_reduction,
+      intvn_mean_reduction = intvn_mean_reduction,
       offspring_dist = offspring_dist,
       pars_list = pars
     )
diff --git a/man/simulate_summary.Rd b/man/simulate_summary.Rd
index 7313b8dc..d17a15b2 100644
--- a/man/simulate_summary.Rd
+++ b/man/simulate_summary.Rd
@@ -8,7 +8,7 @@ simulate_summary(
   nchains,
   statistic = c("size", "length"),
   offspring_dist,
-  r0_reduction = 0,
+  intvn_mean_reduction = 0,
   stat_max = Inf,
   ...
 )
@@ -29,9 +29,10 @@ corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers).}
 
-\item{r0_reduction}{The intervention impact. A scalar between 0 and 1.
-Scales the mean of \code{offspring_dist}. \code{r0_reduction} = 0 implies
-no intervention impact and \code{r0_reduction} = 1 implies full impact.}
+\item{intvn_mean_reduction}{Amount of reduction in the mean. A scalar
+between 0 and 1.
+It scales the mean of \code{offspring_dist}. \code{intvn_mean_reduction} = 0 implies
+no intervention impact and \code{intvn_mean_reduction} = 1 implies full impact.}
 
 \item{stat_max}{A cut off for the chain statistic (size/length) being
 computed. Results above the specified value, are set to \code{Inf}.}
@@ -109,7 +110,7 @@ chain_summary_with_intvn <- simulate_summary(
   nchains = 10,
   statistic = "size",
   offspring_dist = "pois",
-  r0_reduction = 0.5,
+  intvn_mean_reduction = 0.5,
   stat_max = 10,
   lambda = 2
 )
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index a2f405c1..699d75fd 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -9,7 +9,7 @@ simulate_tree(
   statistic = c("size", "length"),
   offspring_dist,
   stat_max = Inf,
-  r0_reduction = 0,
+  intvn_mean_reduction = 0,
   serials_dist,
   t0 = 0,
   tf = Inf,
@@ -36,9 +36,10 @@ numbers).}
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
 
-\item{r0_reduction}{The intervention impact. A scalar between 0 and 1.
-Scales the mean of \code{offspring_dist}. \code{r0_reduction} = 0 implies
-no intervention impact and \code{r0_reduction} = 1 implies full impact.}
+\item{intvn_mean_reduction}{Amount of reduction in the mean. A scalar
+between 0 and 1.
+It scales the mean of \code{offspring_dist}. \code{intvn_mean_reduction} = 0 implies
+no intervention impact and \code{intvn_mean_reduction} = 1 implies full impact.}
 
 \item{serials_dist}{The serial interval distribution function; the name
 of a user-defined named or anonymous function with only one argument \code{n},
@@ -132,7 +133,7 @@ chains_with_intvn <- simulate_tree(
   nchains = 10,
   statistic = "size",
   offspring_dist = "pois",
-  r0_reduction = 0.5,
+  intvn_mean_reduction = 0.5,
   stat_max = 10,
   serials_dist = function(x) 3,
   lambda = 2
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 05894243..84cc079d 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -8,7 +8,7 @@ population}
 simulate_tree_from_pop(
   pop,
   offspring_dist = c("pois", "nbinom"),
-  r0_reduction = 0,
+  intvn_mean_reduction = 0,
   serials_dist,
   initial_immune = 0,
   t0 = 0,
@@ -24,9 +24,10 @@ corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers). Only supports "pois" and "nbinom".}
 
-\item{r0_reduction}{The intervention impact. A scalar between 0 and 1.
-Scales the mean of \code{offspring_dist}. \code{r0_reduction} = 0 implies
-no intervention impact and \code{r0_reduction} = 1 implies full impact.}
+\item{intvn_mean_reduction}{Amount of reduction in the mean. A scalar
+between 0 and 1.
+It scales the mean of \code{offspring_dist}. \code{intvn_mean_reduction} = 0 implies
+no intervention impact and \code{intvn_mean_reduction} = 1 implies full impact.}
 
 \item{serials_dist}{The serial interval distribution function; the name
 of a user-defined named or anonymous function with only one argument \code{n},
@@ -141,7 +142,7 @@ serials_dist = function(x) 3
 simulate_tree_from_pop(
   pop = 100,
   offspring_dist = "nbinom",
-  r0_reduction = 0.5,
+  intvn_mean_reduction = 0.5,
   mu = 0.5,
   size = 1.1,
   serials_dist = function(x) 3
diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index e87007c1..7200b3b6 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -27,7 +27,7 @@ test_that("Simulators work", {
     offspring_dist = "pois",
     lambda = 1.5,
     serials_dist = serial_func,
-    r0_reduction = 0.5
+    intvn_mean_reduction = 0.5
   )
   #' Simulate an outbreak from a susceptible population (nbinom) with
   #' 50% R0 reduction
@@ -37,7 +37,7 @@ test_that("Simulators work", {
     mu = 1.5,
     size = 1.1,
     serials_dist = serial_func,
-    r0_reduction = 0.5
+    intvn_mean_reduction = 0.5
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -62,7 +62,7 @@ test_that("Simulators work", {
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9,
-    r0_reduction = 0.5
+    intvn_mean_reduction = 0.5
   )
   #' Simulate a tree of infections with nbinom offspring and with 50% reduction
   #' in R0
@@ -72,7 +72,7 @@ test_that("Simulators work", {
     statistic = "length",
     mu = 0.9,
     size = 1.1,
-    r0_reduction = 0.5
+    intvn_mean_reduction = 0.5
   )
   #' Simulate chain statistics
   chain_summary_raw <- simulate_summary(
@@ -87,7 +87,7 @@ test_that("Simulators work", {
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9,
-    r0_reduction = 0.5
+    intvn_mean_reduction = 0.5
   )
   #' Simulate chain statistics with nbinom offspring and with a 50% reduction
   #' in R0
@@ -97,7 +97,7 @@ test_that("Simulators work", {
     statistic = "length",
     mu = 1.9,
     size = 1.1,
-    r0_reduction = 0.5
+    intvn_mean_reduction = 0.5
   )
   #' Expectations
   expect_length(
@@ -225,7 +225,7 @@ test_that("simulate_tree throws errors", {
       statistic = "length",
       size = 1,
       prob = 0.5,
-      r0_reduction = 0.5
+      intvn_mean_reduction = 0.5
     ),
     "must be one of"
   )
@@ -277,7 +277,7 @@ test_that("simulate_summary throws errors", {
       statistic = "length",
       size = 1,
       prob = 0.5,
-      r0_reduction = 0.5
+      intvn_mean_reduction = 0.5
     ),
     "must be one of"
   )
@@ -340,7 +340,7 @@ test_that("simulate_tree is numerically correct", {
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9,
-    r0_reduction = 0.5
+    intvn_mean_reduction = 0.5
   )
   #' summarise the results
   tree_sim_summary <- summary(tree_sim_raw)
@@ -420,7 +420,7 @@ test_that("simulate_summary is numerically correct", {
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9,
-    r0_reduction = 0.5
+    intvn_mean_reduction = 0.5
   )
   #' Summarise the results
   chain_summary_summaries <- summary(chain_summary_raw)
@@ -477,7 +477,7 @@ test_that("simulate_tree_from_pop is numerically correct", {
     offspring_dist = "pois",
     lambda = 1.5,
     serials_dist = serial_func,
-    r0_reduction = 0.5
+    intvn_mean_reduction = 0.5
   )
   #' Summarise the results
   susc_outbreak_summary <- summary(susc_outbreak_raw)

From bb3092360084834472e7f3f454df6b378b2bb8f6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 2 Oct 2023 14:50:26 +0100
Subject: [PATCH 697/828] Rename intvn_scale_R0() to intvn_reduce_mean()

---
 R/intervention.R         |  5 ++--
 R/simulate.r             |  6 ++---
 man/intvn_reduce_mean.Rd | 54 ++++++++++++++++++++++++++++++++++++++++
 man/intvn_scale_r0.Rd    | 43 --------------------------------
 4 files changed, 60 insertions(+), 48 deletions(-)
 create mode 100644 man/intvn_reduce_mean.Rd
 delete mode 100644 man/intvn_scale_r0.Rd

diff --git a/R/intervention.R b/R/intervention.R
index 3eb64689..a31b3cb6 100644
--- a/R/intervention.R
+++ b/R/intervention.R
@@ -1,7 +1,7 @@
 #' Set up intervention for simulation
 #'
 #' @description
-#' `intvn_scale_r0()` is a helper for the \code{simulation_*} functions. It
+#' `intvn_reduce_mean()` is a helper for the \code{simulate_*()} functions. It
 #' modifies the relevant arguments of the offspring distribution in order to
 #' mimic the impact of an intervention. In particular, it scales the mean of
 #' the offspring distribution. Currently, it can only handle the poisson and
@@ -18,7 +18,7 @@
 #' @return List of the offspring distribution parameter(s) with the mean
 #' scaled by \code{1 - intvn_mean_reduction}.
 #' @details
-#' `intvn_scale_r0()` scales the mean of the offspring distribution
+#' `intvn_reduce_mean()` scales the mean of the offspring distribution
 #' by \eqn{1 - r0\_reduction} so that the new mean is given as:
 #' \deqn{(1 - r0\_reduction) \times R_0,} where \eqn{R_0} is the
 #' mean of the poisson and negative binomial distribution.
@@ -26,6 +26,7 @@
 #' @author James M. Azam
 #' @keywords internal
 intvn_scale_r0 <- function(r0_reduction, offspring_dist, pars_list) {
+intvn_reduce_mean <- function(intvn_mean_reduction, offspring_dist, pars_list) {
   # Intervention only works for pois and nbinom
   if (!offspring_dist %in% c("pois", "nbinom")) {
     stop(
diff --git a/R/simulate.r b/R/simulate.r
index 9887d729..11f34afe 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -153,7 +153,7 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 
   # Prepare interventions if specified
   if (intvn_mean_reduction > 0) {
-    pars <- intvn_scale_r0(
+    pars <- intvn_reduce_mean(
       intvn_mean_reduction = intvn_mean_reduction,
       offspring_dist = offspring_dist,
       pars_list = pars
@@ -343,7 +343,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 
   # Prepare interventions if specified
   if (intvn_mean_reduction > 0) {
-    pars <- intvn_scale_r0(
+    pars <- intvn_reduce_mean(
       intvn_mean_reduction = intvn_mean_reduction,
       offspring_dist = offspring_dist,
       pars_list = pars
@@ -499,7 +499,7 @@ simulate_tree_from_pop <- function(pop,
 
   # Prepare interventions if specified
   if (intvn_mean_reduction > 0) {
-    pars <- intvn_scale_r0(
+    pars <- intvn_reduce_mean(
       intvn_mean_reduction = intvn_mean_reduction,
       offspring_dist = offspring_dist,
       pars_list = pars
diff --git a/man/intvn_reduce_mean.Rd b/man/intvn_reduce_mean.Rd
new file mode 100644
index 00000000..9ae20db7
--- /dev/null
+++ b/man/intvn_reduce_mean.Rd
@@ -0,0 +1,54 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/intervention.R
+\name{intvn_reduce_mean}
+\alias{intvn_reduce_mean}
+\title{Reduce the mean of the offspring distribution}
+\usage{
+intvn_reduce_mean(intvn_mean_reduction, offspring_dist, pars_list)
+}
+\arguments{
+\item{intvn_mean_reduction}{Amount of reduction in the mean. A scalar
+between 0 and 1.
+It scales the mean of \code{offspring_dist}. \code{intvn_mean_reduction} = 0 implies
+no intervention impact and \code{intvn_mean_reduction} = 1 implies full impact.}
+
+\item{offspring_dist}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers).}
+
+\item{pars_list}{Parameter(s) for poisson or negative binomial offspring
+distribution.}
+}
+\value{
+List of the offspring distribution parameter(s) with the mean
+scaled by \code{1 - intvn_mean_reduction}.
+}
+\description{
+\code{intvn_reduce_mean()} is a helper for the \code{simulate_*()} functions. It
+reduces/scales the mean of the offspring distribution in order to
+mimic the impact of a population-level intervention. Currently, it can only
+handle the poisson and negative binomial distributions and errors when other
+offspring distributions are specified alongside the \code{intvn_mean_reduction}
+argument.
+}
+\details{
+\code{intvn_reduce_mean()} scales the mean of the offspring distribution
+by \eqn{1 - {\sf intvn\_mean\_reduction}} so that the new mean is given as:
+\deqn{(1 - {\sf intvn\_mean\_reduction}) \times {\sf mean,}} This
+scaling when applied to the poisson and negative binomial offspring
+distributions corresponds to the population-level reduction of R0 as
+described in Lloyd-Smith et al, (2005). \code{intvn_reduce_mean()} is therefore
+only implemented for the aforementioned distributions and errors when other
+offspring distributions are specified along with the \code{intvn_mean_reduction}
+argument in the \code{simulate_*()} functions.
+}
+\references{
+Lloyd-Smith, J., Schreiber, S., Kopp, P. et al. Superspreading
+and the effect of individual variation on disease emergence. Nature 438,
+355–359 (2005). \doi{10.1038/nature04153}
+}
+\author{
+James M. Azam
+}
+\keyword{internal}
diff --git a/man/intvn_scale_r0.Rd b/man/intvn_scale_r0.Rd
deleted file mode 100644
index c2410212..00000000
--- a/man/intvn_scale_r0.Rd
+++ /dev/null
@@ -1,43 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/intervention.R
-\name{intvn_scale_r0}
-\alias{intvn_scale_r0}
-\title{Set up intervention for simulation}
-\usage{
-intvn_scale_r0(r0_reduction, offspring_dist, pars_list)
-}
-\arguments{
-\item{r0_reduction}{The intervention impact. A scalar between 0 and 1.
-Scales the mean of \code{offspring_dist}. \code{r0_reduction} = 0 implies
-no intervention impact and \code{r0_reduction} = 1 implies full impact.}
-
-\item{offspring_dist}{Offspring distribution: a character string
-corresponding to the R distribution function (e.g., "pois" for Poisson,
-where \code{\link{rpois}} is the R function to generate Poisson random
-numbers).}
-
-\item{pars_list}{Parameter(s) for poisson or negative binomial offspring
-distribution.}
-}
-\value{
-List of the offspring distribution parameter(s) with the mean
-scaled by \code{1 - intvn_scale_r0}.
-}
-\description{
-\code{intvn_scale_r0()} is a helper for the \code{simulation_*} functions. It
-modifies the relevant arguments of the offspring distribution in order to
-mimic the impact of an intervention. In particular, it scales the mean of
-the offspring distribution. Currently, it can only handle the poisson and
-negative binomial distributions and errors when other offspring
-distributions are specified alongside \code{intvn_scale_r0}.
-}
-\details{
-\code{intvn_scale_r0()} scales the mean of the offspring distribution
-by \eqn{1 - r0\_reduction} so that the new mean is given as:
-\deqn{(1 - r0\_reduction) \times R_0,} where \eqn{R_0} is the
-mean of the poisson and negative binomial distribution.
-}
-\author{
-James M. Azam
-}
-\keyword{internal}

From 994d1af99c60cd90c21eefd46df520ad1bba32b3 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 2 Oct 2023 14:51:20 +0100
Subject: [PATCH 698/828] Reword function title and description

---
 R/intervention.R | 12 ++++++------
 1 file changed, 6 insertions(+), 6 deletions(-)

diff --git a/R/intervention.R b/R/intervention.R
index a31b3cb6..af973145 100644
--- a/R/intervention.R
+++ b/R/intervention.R
@@ -1,12 +1,12 @@
-#' Set up intervention for simulation
+#' Reduce the mean of the offspring distribution
 #'
 #' @description
 #' `intvn_reduce_mean()` is a helper for the \code{simulate_*()} functions. It
-#' modifies the relevant arguments of the offspring distribution in order to
-#' mimic the impact of an intervention. In particular, it scales the mean of
-#' the offspring distribution. Currently, it can only handle the poisson and
-#' negative binomial distributions and errors when other offspring
-#' distributions are specified alongside `intvn_scale_r0`.
+#' reduces/scales the mean of the offspring distribution in order to
+#' mimic the impact of a population-level intervention. Currently, it can only
+#' handle the poisson and negative binomial distributions and errors when other
+#' offspring distributions are specified alongside the `intvn_mean_reduction`
+#' argument.
 #'
 #' @inheritParams simulate_tree
 #' @param intvn_mean_reduction Amount of reduction in the mean. A scalar

From b0a0c63acf4223ea25e04a7801135671f4cb5991 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 2 Oct 2023 14:52:20 +0100
Subject: [PATCH 699/828] Reword details for clarity

---
 R/intervention.R | 11 ++++++++---
 1 file changed, 8 insertions(+), 3 deletions(-)

diff --git a/R/intervention.R b/R/intervention.R
index af973145..7e483236 100644
--- a/R/intervention.R
+++ b/R/intervention.R
@@ -19,9 +19,14 @@
 #' scaled by \code{1 - intvn_mean_reduction}.
 #' @details
 #' `intvn_reduce_mean()` scales the mean of the offspring distribution
-#' by \eqn{1 - r0\_reduction} so that the new mean is given as:
-#' \deqn{(1 - r0\_reduction) \times R_0,} where \eqn{R_0} is the
-#' mean of the poisson and negative binomial distribution.
+#' by \eqn{1 - {\sf intvn\_mean\_reduction}} so that the new mean is given as:
+#' \deqn{(1 - {\sf intvn\_mean\_reduction}) \times {\sf mean,}} This
+#' scaling when applied to the poisson and negative binomial offspring
+#' distributions corresponds to the population-level reduction of R0 as
+#' described in Lloyd-Smith et al, (2005). `intvn_reduce_mean()` is therefore
+#' only implemented for the aforementioned distributions and errors when other
+#' offspring distributions are specified along with the `intvn_mean_reduction`
+#' argument in the \code{simulate_*()} functions.
 #'
 #' @author James M. Azam
 #' @keywords internal

From 8f600f0b00634cae37be8a3d2c587b32b73cc6ba Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 2 Oct 2023 14:52:53 +0100
Subject: [PATCH 700/828] Add a reference

---
 R/intervention.R | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/R/intervention.R b/R/intervention.R
index 7e483236..d14e3547 100644
--- a/R/intervention.R
+++ b/R/intervention.R
@@ -30,7 +30,9 @@
 #'
 #' @author James M. Azam
 #' @keywords internal
-intvn_scale_r0 <- function(r0_reduction, offspring_dist, pars_list) {
+#' @references Lloyd-Smith, J., Schreiber, S., Kopp, P. et al. Superspreading
+#' and the effect of individual variation on disease emergence. Nature 438,
+#' 355–359 (2005). \doi{10.1038/nature04153}
 intvn_reduce_mean <- function(intvn_mean_reduction, offspring_dist, pars_list) {
   # Intervention only works for pois and nbinom
   if (!offspring_dist %in% c("pois", "nbinom")) {

From f164e125e229505429bf859ab78b500f730f238d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 2 Oct 2023 15:21:34 +0100
Subject: [PATCH 701/828] Reword intvn_mean_reduction parameter documentation

---
 R/intervention.R              | 9 +++++----
 man/intvn_reduce_mean.Rd      | 9 +++++----
 man/simulate_summary.Rd       | 9 +++++----
 man/simulate_tree.Rd          | 9 +++++----
 man/simulate_tree_from_pop.Rd | 9 +++++----
 5 files changed, 25 insertions(+), 20 deletions(-)

diff --git a/R/intervention.R b/R/intervention.R
index d14e3547..3ab7339c 100644
--- a/R/intervention.R
+++ b/R/intervention.R
@@ -9,10 +9,11 @@
 #' argument.
 #'
 #' @inheritParams simulate_tree
-#' @param intvn_mean_reduction Amount of reduction in the mean. A scalar
-#' between 0 and 1.
-#' It scales the mean of `offspring_dist`. `intvn_mean_reduction` = 0 implies
-#' no intervention impact and `intvn_mean_reduction` = 1 implies full impact.
+#' @param intvn_mean_reduction A number between 0
+#' and 1 for scaling/reducing the mean of `offspring_dist`. Serves as
+#' population-level intervention. `intvn_mean_reduction` = 0
+#' implies no intervention impact and `intvn_mean_reduction` = 1 implies full
+#' impact.
 #' @param pars_list Parameter(s) for poisson or negative binomial offspring
 #' distribution.
 #' @return List of the offspring distribution parameter(s) with the mean
diff --git a/man/intvn_reduce_mean.Rd b/man/intvn_reduce_mean.Rd
index 9ae20db7..080ec2a9 100644
--- a/man/intvn_reduce_mean.Rd
+++ b/man/intvn_reduce_mean.Rd
@@ -7,10 +7,11 @@
 intvn_reduce_mean(intvn_mean_reduction, offspring_dist, pars_list)
 }
 \arguments{
-\item{intvn_mean_reduction}{Amount of reduction in the mean. A scalar
-between 0 and 1.
-It scales the mean of \code{offspring_dist}. \code{intvn_mean_reduction} = 0 implies
-no intervention impact and \code{intvn_mean_reduction} = 1 implies full impact.}
+\item{intvn_mean_reduction}{A number between 0
+and 1 for scaling/reducing the mean of \code{offspring_dist}. Serves as
+population-level intervention. \code{intvn_mean_reduction} = 0
+implies no intervention impact and \code{intvn_mean_reduction} = 1 implies full
+impact.}
 
 \item{offspring_dist}{Offspring distribution: a character string
 corresponding to the R distribution function (e.g., "pois" for Poisson,
diff --git a/man/simulate_summary.Rd b/man/simulate_summary.Rd
index d17a15b2..9c1283af 100644
--- a/man/simulate_summary.Rd
+++ b/man/simulate_summary.Rd
@@ -29,10 +29,11 @@ corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers).}
 
-\item{intvn_mean_reduction}{Amount of reduction in the mean. A scalar
-between 0 and 1.
-It scales the mean of \code{offspring_dist}. \code{intvn_mean_reduction} = 0 implies
-no intervention impact and \code{intvn_mean_reduction} = 1 implies full impact.}
+\item{intvn_mean_reduction}{A number between 0
+and 1 for scaling/reducing the mean of \code{offspring_dist}. Serves as
+population-level intervention. \code{intvn_mean_reduction} = 0
+implies no intervention impact and \code{intvn_mean_reduction} = 1 implies full
+impact.}
 
 \item{stat_max}{A cut off for the chain statistic (size/length) being
 computed. Results above the specified value, are set to \code{Inf}.}
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index 699d75fd..cb3d0629 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -36,10 +36,11 @@ numbers).}
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
 
-\item{intvn_mean_reduction}{Amount of reduction in the mean. A scalar
-between 0 and 1.
-It scales the mean of \code{offspring_dist}. \code{intvn_mean_reduction} = 0 implies
-no intervention impact and \code{intvn_mean_reduction} = 1 implies full impact.}
+\item{intvn_mean_reduction}{A number between 0
+and 1 for scaling/reducing the mean of \code{offspring_dist}. Serves as
+population-level intervention. \code{intvn_mean_reduction} = 0
+implies no intervention impact and \code{intvn_mean_reduction} = 1 implies full
+impact.}
 
 \item{serials_dist}{The serial interval distribution function; the name
 of a user-defined named or anonymous function with only one argument \code{n},
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 84cc079d..030f9fe8 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -24,10 +24,11 @@ corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers). Only supports "pois" and "nbinom".}
 
-\item{intvn_mean_reduction}{Amount of reduction in the mean. A scalar
-between 0 and 1.
-It scales the mean of \code{offspring_dist}. \code{intvn_mean_reduction} = 0 implies
-no intervention impact and \code{intvn_mean_reduction} = 1 implies full impact.}
+\item{intvn_mean_reduction}{A number between 0
+and 1 for scaling/reducing the mean of \code{offspring_dist}. Serves as
+population-level intervention. \code{intvn_mean_reduction} = 0
+implies no intervention impact and \code{intvn_mean_reduction} = 1 implies full
+impact.}
 
 \item{serials_dist}{The serial interval distribution function; the name
 of a user-defined named or anonymous function with only one argument \code{n},

From 5a59132afbab30ce7dfac5965b96499f743cff71 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 6 Oct 2023 19:05:10 +0100
Subject: [PATCH 702/828] Change chains_ran to chains_run

---
 R/epichains.R                   |  8 ++++----
 tests/testthat/test-epichains.R | 10 +++++-----
 tests/testthat/test-simulate.R  | 12 ++++++------
 3 files changed, 15 insertions(+), 15 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 20d1f5fc..0dd0ab70 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -33,7 +33,7 @@ format.epichains <- function(x, ...) {
     # print summary information
     writeLines(
       c(
-        sprintf("Chains simulated: %s", chain_info[["chains_ran"]]),
+        sprintf("Chains simulated: %s", chain_info[["chains_run"]]),
         sprintf(
           "Number of ancestors (known): %s",
           chain_info[["unique_ancestors"]]
@@ -84,7 +84,7 @@ format.epichains <- function(x, ...) {
 summary.epichains <- function(object, ...) {
   validate_epichains(object)
 
-  chains_ran <- attr(object, "chains", exact = TRUE)
+  chains_run <- attr(object, "chains", exact = TRUE)
 
   if (is_chains_tree(object)) {
     max_time <- ifelse(("time" %in% names(object)), max(object$time), NA)
@@ -97,7 +97,7 @@ summary.epichains <- function(object, ...) {
 
     # out of summary
     res <- list(
-      chains_ran = chains_ran,
+      chains_run = chains_run,
       max_time = max_time,
       unique_ancestors = n_unique_ancestors,
       max_generation = max_generation
@@ -111,7 +111,7 @@ summary.epichains <- function(object, ...) {
     }
 
     res <- list(
-      chain_ran = chains_ran,
+      chains_run = chains_run,
       max_chain_stat = max_chain_stat,
       min_chain_stat = min_chain_stat
     )
diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index ed80ccad..9b1d4f75 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -167,7 +167,7 @@ test_that("summary.epichains works as expected", {
   expect_named(
     summary(tree_sim_raw),
     c(
-      "chains_ran",
+      "chains_run",
       "max_time",
       "unique_ancestors",
       "max_generation"
@@ -176,7 +176,7 @@ test_that("summary.epichains works as expected", {
   expect_named(
     summary(tree_sim_raw2),
     c(
-      "chains_ran",
+      "chains_run",
       "max_time",
       "unique_ancestors",
       "max_generation"
@@ -185,7 +185,7 @@ test_that("summary.epichains works as expected", {
   expect_named(
     summary(susc_outbreak_raw),
     c(
-      "chains_ran",
+      "chains_run",
       "max_time",
       "unique_ancestors",
       "max_generation"
@@ -194,7 +194,7 @@ test_that("summary.epichains works as expected", {
   expect_named(
     summary(susc_outbreak_raw2),
     c(
-      "chains_ran",
+      "chains_run",
       "max_time",
       "unique_ancestors",
       "max_generation"
@@ -203,7 +203,7 @@ test_that("summary.epichains works as expected", {
   expect_named(
     summary(chain_summary_raw),
     c(
-      "chain_ran",
+      "chains_run",
       "max_chain_stat",
       "min_chain_stat"
     )
diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index 7200b3b6..b6fd787e 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -347,7 +347,7 @@ test_that("simulate_tree is numerically correct", {
   tree_sim_intvn_summary <- summary(tree_sim_raw_intvn)
   #' Expectations
   expect_identical(
-    tree_sim_summary$chains_ran,
+    tree_sim_summary$chains_run,
     2.00
   )
   expect_identical(
@@ -376,7 +376,7 @@ test_that("simulate_tree is numerically correct", {
   )
   #' Expectations for intervention simulation
   expect_identical(
-    tree_sim_summary$chains_ran,
+    tree_sim_summary$chains_run,
     2.0
   )
   expect_identical(
@@ -427,7 +427,7 @@ test_that("simulate_summary is numerically correct", {
   chain_summary_intvn_summaries <- summary(chain_summary_raw_intvn)
   #' Expectations
   expect_identical(
-    chain_summary_summaries$chain_ran,
+    chain_summary_summaries$chains_run,
     2.00
   )
   expect_identical(
@@ -443,7 +443,7 @@ test_that("simulate_summary is numerically correct", {
     c(1.00, 3.00)
   )
   expect_identical(
-    chain_summary_intvn_summaries$chain_ran,
+    chain_summary_intvn_summaries$chains_run,
     2.00
   )
   expect_identical(
@@ -495,7 +495,7 @@ test_that("simulate_tree_from_pop is numerically correct", {
     susc_outbreak_summary$max_generation,
     1L
   )
-  expect_null(susc_outbreak_summary$chains_ran)
+  expect_null(susc_outbreak_summary$chains_run)
   expect_identical(
     susc_outbreak_raw$sim_id,
     1L
@@ -528,7 +528,7 @@ test_that("simulate_tree_from_pop is numerically correct", {
     susc_outbreak_summary_intvn$max_generation,
     10L
   )
-  expect_null(susc_outbreak_summary_intvn$chains_ran)
+  expect_null(susc_outbreak_summary_intvn$chains_run)
   expect_identical(
     sum(aggregate(susc_outbreak_raw_intvn, "time")$cases),
     20L

From 5ccde1af062033f94dcae195cac19ed0809b739c Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 6 Oct 2023 18:00:37 +0100
Subject: [PATCH 703/828] Add Quick Start section to README

---
 README.Rmd | 217 ++++++++++++++++++++++++++++++++++++++++++++++++++++-
 1 file changed, 216 insertions(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index fba9a261..5f3af150 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -65,7 +65,222 @@ library("epichains")
 
 # Quick start
 
-Work in progress
+## Core functionality
+
+_{{ packagename }}_ provides four main functions: 
+
+### [likelihood()](https://epiverse-trace.github.io/epichains/reference/likelihood.html)
+
+This function calculates the likelihood/loglikelihood of observing a vector of outbreak summaries obtained from transmission chains. Summaries here refer to transmission chain sizes or lengths/durations.
+
+`likelihood()` requires a vector of chain summaries (sizes or lengths),
+`chains`, the corresponding statistic to calculate, `statistic`, and the offspring distribution,
+`offspring_dist` its associated parameters. It also requires `nsim_obs`, which is the number of simulations to run if the likelihoods do not have a closed-form solution and must be simulated. This argument will be explained further in the ["Getting Started"](https://epiverse-trace.github.io/epichains/articles/epichains.html) vignette.
+
+Let's look at the following example where we estimate the loglikelihood of observing `chain_sizes`.
+```{r}
+set.seed(121)
+# example of observed chain sizes
+# randomly generate 20 chains of size between 1 to 10
+chain_sizes <- sample(1:10, 20, replace = TRUE)
+```
+
+```{r}
+# estimate loglikelihood of the observed chain sizes
+likelihood_eg <- likelihood(
+  chains = chain_sizes,
+  statistic = "size",
+  offspring_dist = "pois",
+  nsim_obs = 100,
+  lambda = 0.5
+)
+# Print the estimate
+likelihood_eg
+```
+
+
+### [simulate_tree()](https://epiverse-trace.github.io/epichains/reference/simulate_tree.html) 
+
+`simulate_tree()` simulates an outbreak from a given number of infections.
+It retains and returns information on infectors (ancestors), infectees, the generation of infection, and the time, if a serial distribution is specified.
+
+Let's look at an example where we simulate the transmission trees of $10$ initial infections/chains. We 
+assume a poisson offspring distribution with mean, $\text{lambda} = 0.9$, and a serial interval of $3$ days:
+```{r}
+set.seed(123)
+
+sim_tree_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+head(sim_tree_eg)
+```
+
+`simulate_tree()` can model population-level intervention by reducing the $R_0$,
+using the `intvn_mean_reduction` argument.
+
+To illustrate this, we will use the previous example and specify
+a population-level intervention that reduces $R_0$ by $50\%$.
+
+```{r}
+set.seed(123)
+
+sim_tree_intvn_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  intvn_mean_reduction = 0.5,
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+head(sim_tree_intvn_eg)
+```
+
+### [simulate_summary()](https://epiverse-trace.github.io/epichains/reference/simulate_summary.html)
+
+`simulate_summary()` is basically `simulate_tree()` except that it does not retain
+information on each infector and infectee. It returns the eventual size or length/duration of each transmission chain.
+
+Here is an example to simulate the previous examples without intervention,
+returning the size of each of the $10$ chains. It assumes a poisson offspring distribution with
+mean of $0.9$.
+```{r}
+set.seed(123)
+
+simulate_summary_eg <- simulate_summary(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  lambda = 0.9
+)
+
+# Print the results
+simulate_summary_eg
+```
+
+Here is an example with an intervention that reduces $R_0$ by $50\%$.
+
+```{r}
+simulate_summary_intvn_eg <- simulate_summary(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  intvn_mean_reduction = 0.5,
+  stat_max = 10,
+  lambda = 0.9
+)
+
+# Print the results
+simulate_summary_intvn_eg
+```
+
+
+### [simulate_tree_from_pop()](https://epiverse-trace.github.io/epichains/reference/simulate_tree_from_pop.html)
+
+`simulate_tree_from_pop()` simulates outbreaks based on a specified population size and pre-existing immunity until the susceptible pool runs out.
+  
+Here is a quick example where we simulate an outbreak in a population of size $1000$. We assume individuals have a poisson offspring distribution with mean, $\text{lambda} = 1$, and serial interval of $3$:
+```{r}
+set.seed(7)
+
+sim_tree_from_pop_eg <- simulate_tree_from_pop(
+  pop = 1000,
+  offspring_dist = "pois",
+  lambda = 1,
+  serials_dist = function(x) {3}
+  )
+
+head(sim_tree_from_pop_eg)
+```
+
+## Other functionalities
+
+### Summarising
+
+You can run `summary()` on `<epichains>` objects to get useful summaries.
+```{r include=TRUE,echo=TRUE}
+# Example with simulate_tree()
+set.seed(123)
+
+sim_tree_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+summary(sim_tree_eg)
+
+# Example with simulate_summary()
+set.seed(123)
+
+simulate_summary_eg <- simulate_summary(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  lambda = 0.9
+)
+
+# Get summaries
+summary(simulate_summary_eg)
+```
+
+### Aggregating
+
+You can aggregate `<epichains>` objects returned by the `simulate_*()` functions into a time series, which is a `<data.frame>` with columns "cases"  and either "generation" or "time", depending on the value of `grouping_var`.
+
+To aggregate over "time", you must have specified a serial interval distribution in the simulation step.
+```{r include=TRUE,echo=TRUE}
+# Example with simulate_tree()
+set.seed(123)
+
+sim_tree_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+aggregate(sim_tree_eg, grouping_var = "time")
+```
+
+### Plotting
+
+Aggregated `<epichains>` objects can easily be plotted using base R or `ggplot2` with little to no data manipulation.
+
+Here is an end-to-end example from simulation through aggregation to plotting.
+```{r}
+# Run simulation with simulate_tree()
+set.seed(123)
+
+sim_tree_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+# Aggregate cases over time
+sim_aggreg <- aggregate(sim_tree_eg, grouping_var = "time")
+
+# Plot cases over time
+plot(sim_aggreg, type = "b")
+```
 
 ## Package vignettes
 

From aa3234eab700ac7c84b5d3624785d277e5cc2b26 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 6 Oct 2023 18:01:15 +0100
Subject: [PATCH 704/828] Remove contents of Getting Started vignette

---
 vignettes/epichains.Rmd | 99 +----------------------------------------
 1 file changed, 1 insertion(+), 98 deletions(-)

diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index f33411c3..2acfa562 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -27,101 +27,4 @@ knitr::opts_chunk$set(
 )
 ```
 
-## Functionality
-
-`epichains` currently has 4 core functions:
-
-* `simulate_tree()`: simulate transmission trees from a given number of chains.
-* `simulate_tree_from_pop()`: simulate transmission trees from a given number 
-  population size and initial immunity.
-* `simulate_summary()`: simulate a vector of observed transmission chains 
-  sizes/lengths from a given number of chains.
-* `likelihood()`: estimate the likelihood/loglikelihood of observing
-  chains of given sizes/lengths.
-
-### Object-orientation
-
-#### Classes
-
-* An `epichains` class:
-  * superclass of `data.frame` with attributes for tracking `chain_type` as: 
-    * `chains_tree`, if returned from `simulate_tree()` or 
-    `simulate_tree_from_pop()`
-    * `chains_vec`, if returned from `simulate_summary()`.
-* An `epichains_aggregate_df` class:
-  * superclass of `data.frame` with attributes for tracking if aggregation is 
-  done over "time", "generation" or "both". Useful for `plot` method dispatch 
-  (see methods section below).
-
-#### Methods
-
-* `print()`
-* `summary()`
-* `aggregate()`
-
-## Demo
-
-### Printing and summary
-```{r include=TRUE,echo=TRUE}
-library(epichains)
-# Using `simulate_tree()`
-tree_from_pois_offspring <- simulate_tree(
-  nchains = 10,
-  offspring_dist = "pois",
-  serials_dist = function(x) 3,
-  lambda = 2,
-  stat_max = 10
-)
-
-tree_from_pois_offspring # print the output
-
-# Using simulate_summary()
-summary_sim <- simulate_summary(
-  nchains = 50, offspring_dist = "pois",
-  statistic = "length", lambda = 2,
-  stat_max = 10
-)
-
-summary_sim # print the output
-
-# Using `simulate_tree_from_pop()`
-
-# Simulate with poisson offspring
-tree_from_pop_pois <- simulate_tree_from_pop(
-  pop = 1000,
-  offspring_dist = "pois",
-  lambda = 0.5,
-  serials_dist = function(x) 3
-)
-
-tree_from_pop_pois # print the output
-
-# Simulate with negative binomial offspring
-tree_from_pop_nbinom <- simulate_tree_from_pop(
-  pop = 1000,
-  offspring_dist = "nbinom",
-  mu = 0.5,
-  size = 1.1,
-  serials_dist = function(x) 3
-)
-
-tree_from_pop_nbinom # print the output
-
-# Likelihoods
-
-chain_sizes <- c(1, 1, 4, 7)
-likelihood(
-  chains = chain_sizes, statistic = "size",
-  offspring_dist = "pois", nsim_obs = 100,
-  lambda = 0.5
-)
-```
-
-### Aggregation
-```{r include=TRUE,echo=TRUE}
-# aggregate by time
-aggregate(tree_from_pop_pois, "time")
-
-# aggregate by generation
-aggregate(tree_from_pop_pois, "generation")
-```
+WIP

From 4d8514dc4ad184866432227f10d3befe651e9c62 Mon Sep 17 00:00:00 2001
From: GitHub Action <action@github.com>
Date: Fri, 6 Oct 2023 17:04:20 +0000
Subject: [PATCH 705/828] Automatic readme update

---
 README.md                                 | 314 +++++++++++++++++++++-
 man/figures/README-unnamed-chunk-13-1.png | Bin 0 -> 23036 bytes
 2 files changed, 313 insertions(+), 1 deletion(-)
 create mode 100644 man/figures/README-unnamed-chunk-13-1.png

diff --git a/README.md b/README.md
index cfe04da6..3aa36fa5 100644
--- a/README.md
+++ b/README.md
@@ -59,7 +59,319 @@ library("epichains")
 
 # Quick start
 
-Work in progress
+## Core functionality
+
+*epichains* provides four main functions:
+
+### [likelihood()](https://epiverse-trace.github.io/epichains/reference/likelihood.html)
+
+This function calculates the likelihood/loglikelihood of observing a
+vector of outbreak summaries obtained from transmission chains.
+Summaries here refer to transmission chain sizes or lengths/durations.
+
+`likelihood()` requires a vector of chain summaries (sizes or lengths),
+`chains`, the corresponding statistic to calculate, `statistic`, and the
+offspring distribution, `offspring_dist` its associated parameters. It
+also requires `nsim_obs`, which is the number of simulations to run if
+the likelihoods do not have a closed-form solution and must be
+simulated. This argument will be explained further in the [“Getting
+Started”](https://epiverse-trace.github.io/epichains/articles/epichains.html)
+vignette.
+
+Let’s look at the following example where we estimate the loglikelihood
+of observing `chain_sizes`.
+
+``` r
+set.seed(121)
+# example of observed chain sizes
+# randomly generate 20 chains of size between 1 to 10
+chain_sizes <- sample(1:10, 20, replace = TRUE)
+```
+
+``` r
+# estimate loglikelihood of the observed chain sizes
+likelihood_eg <- likelihood(
+  chains = chain_sizes,
+  statistic = "size",
+  offspring_dist = "pois",
+  nsim_obs = 100,
+  lambda = 0.5
+)
+# Print the estimate
+likelihood_eg
+#> [1] -67.82879
+```
+
+### [simulate_tree()](https://epiverse-trace.github.io/epichains/reference/simulate_tree.html)
+
+`simulate_tree()` simulates an outbreak from a given number of
+infections. It retains and returns information on infectors (ancestors),
+infectees, the generation of infection, and the time, if a serial
+distribution is specified.
+
+Let’s look at an example where we simulate the transmission trees of
+$10$ initial infections/chains. We assume a poisson offspring
+distribution with mean, $\text{lambda} = 0.9$, and a serial interval of
+$3$ days:
+
+``` r
+set.seed(123)
+
+sim_tree_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+head(sim_tree_eg)
+#> < tree head (from first known ancestor) >
+#>    chain_id sim_id ancestor generation time
+#> 11        2      2        1          2    3
+#> 13        3      2        1          2    3
+#> 14        4      2        1          2    3
+#> 16        5      2        1          2    3
+#> 19        7      2        1          2    3
+#> 20        8      2        1          2    3
+```
+
+`simulate_tree()` can model population-level intervention by reducing
+the $R_0$, using the `intvn_mean_reduction` argument.
+
+To illustrate this, we will use the previous example and specify a
+population-level intervention that reduces $R_0$ by $50\%$.
+
+``` r
+set.seed(123)
+
+sim_tree_intvn_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  intvn_mean_reduction = 0.5,
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+head(sim_tree_intvn_eg)
+#> < tree head (from first known ancestor) >
+#>    chain_id sim_id ancestor generation time
+#> 11        2      2        1          2    3
+#> 12        4      2        1          2    3
+#> 13        5      2        1          2    3
+#> 15        8      2        1          2    3
+#> 14        5      3        1          2    3
+#> 16        2      3        2          3    6
+```
+
+### [simulate_summary()](https://epiverse-trace.github.io/epichains/reference/simulate_summary.html)
+
+`simulate_summary()` is basically `simulate_tree()` except that it does
+not retain information on each infector and infectee. It returns the
+eventual size or length/duration of each transmission chain.
+
+Here is an example to simulate the previous examples without
+intervention, returning the size of each of the $10$ chains. It assumes
+a poisson offspring distribution with mean of $0.9$.
+
+``` r
+set.seed(123)
+
+simulate_summary_eg <- simulate_summary(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  lambda = 0.9
+)
+
+# Print the results
+simulate_summary_eg
+#> `epichains` object 
+#> 
+#>  [1]   1 Inf   4   4 Inf   1   2 Inf   5   3
+#> 
+#>  Simulated chain sizes: 
+#> 
+#> Max: 5
+#> Min: 1
+```
+
+Here is an example with an intervention that reduces $R_0$ by $50\%$.
+
+``` r
+simulate_summary_intvn_eg <- simulate_summary(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  intvn_mean_reduction = 0.5,
+  stat_max = 10,
+  lambda = 0.9
+)
+
+# Print the results
+simulate_summary_intvn_eg
+#> `epichains` object 
+#> 
+#>  [1] 1 1 1 1 1 2 1 1 2 1
+#> 
+#>  Simulated chain sizes: 
+#> 
+#> Max: 2
+#> Min: 1
+```
+
+### [simulate_tree_from_pop()](https://epiverse-trace.github.io/epichains/reference/simulate_tree_from_pop.html)
+
+`simulate_tree_from_pop()` simulates outbreaks based on a specified
+population size and pre-existing immunity until the susceptible pool
+runs out.
+
+Here is a quick example where we simulate an outbreak in a population of
+size $1000$. We assume individuals have a poisson offspring distribution
+with mean, $\text{lambda} = 1$, and serial interval of $3$:
+
+``` r
+set.seed(7)
+
+sim_tree_from_pop_eg <- simulate_tree_from_pop(
+  pop = 1000,
+  offspring_dist = "pois",
+  lambda = 1,
+  serials_dist = function(x) {3}
+  )
+
+head(sim_tree_from_pop_eg)
+#> < tree head (from first known ancestor) >
+#>   sim_id ancestor generation time
+#> 2      2        1          2    3
+#> 3      3        1          2    3
+#> 4      4        1          2    3
+#> 5      5        1          2    3
+#> 6      6        2          3    6
+#> 7      7        6          4    9
+```
+
+## Other functionalities
+
+### Summarising
+
+You can run `summary()` on `<epichains>` objects to get useful
+summaries.
+
+``` r
+# Example with simulate_tree()
+set.seed(123)
+
+sim_tree_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+summary(sim_tree_eg)
+#> $chains_ran
+#> [1] 10
+#> 
+#> $max_time
+#> [1] 12
+#> 
+#> $unique_ancestors
+#> [1] 9
+#> 
+#> $max_generation
+#> [1] 5
+
+# Example with simulate_summary()
+set.seed(123)
+
+simulate_summary_eg <- simulate_summary(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  lambda = 0.9
+)
+
+# Get summaries
+summary(simulate_summary_eg)
+#> $chain_ran
+#> [1] 10
+#> 
+#> $max_chain_stat
+#> [1] 5
+#> 
+#> $min_chain_stat
+#> [1] 1
+```
+
+### Aggregating
+
+You can aggregate `<epichains>` objects returned by the `simulate_*()`
+functions into a time series, which is a `<data.frame>` with columns
+“cases” and either “generation” or “time”, depending on the value of
+`grouping_var`.
+
+To aggregate over “time”, you must have specified a serial interval
+distribution in the simulation step.
+
+``` r
+# Example with simulate_tree()
+set.seed(123)
+
+sim_tree_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+aggregate(sim_tree_eg, grouping_var = "time")
+#>   time cases
+#> 1    0    10
+#> 2    3    13
+#> 3    6    15
+#> 4    9    18
+#> 5   12     2
+```
+
+### Plotting
+
+Aggregated `<epichains>` objects can easily be plotted using base R or
+`ggplot2` with little to no data manipulation.
+
+Here is an end-to-end example from simulation through aggregation to
+plotting.
+
+``` r
+# Run simulation with simulate_tree()
+set.seed(123)
+
+sim_tree_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+# Aggregate cases over time
+sim_aggreg <- aggregate(sim_tree_eg, grouping_var = "time")
+
+# Plot cases over time
+plot(sim_aggreg, type = "b")
+```
+
+<img src="man/figures/README-unnamed-chunk-13-1.png" width="100%" />
 
 ## Package vignettes
 
diff --git a/man/figures/README-unnamed-chunk-13-1.png b/man/figures/README-unnamed-chunk-13-1.png
new file mode 100644
index 0000000000000000000000000000000000000000..8d066582938536c053a0e338fd4866be2276b246
GIT binary patch
literal 23036
zcmeFZc{G*p+cv)KC_`n8PeNp#Lo&08%tM4ERA!1|o2M;JQW7#x$xMjMbCWSsWF9hP
z9y4cn&s%)H&-45J_5Sg$^}cI8f3U1=@B6;)>pF+yJkI0X{#qJ}Cx~f@Q7F_2WhHrS
z6bhYyLg5V)9)(xjpATI?q3};yU%I4adr9$<{T+J;oqM-T%@xh<%pI&vwH4)1DA5-$
zbd9X&bg87HOY*r{n*BObQi4zJ3Wt8UquutfcXHP}|874|$ej<eLG`l(vDR!tquWW=
zoYGXXWHo8SA;WuyHSA0_nR5ZWi668Q@%=b9TDKJ6w9^pHlnCXQg%sOXH1==)idpcU
z7<8N;Bnc|!F$^1IMf1>R^NxQX$*=wO`Lj5-#;f&&S1T`f;A^heY<OozKUVe5Eiir^
z>{=^Y-?N;_)>6cKI`F7FOSIwKh&gKa)R#OX9KP(II-;)I8$_Qa?!+*TP<$#t>HiZ)
zx|r7z`AYDqi2@Ht#`E3JeNT@~Ft1(5KjygD7bq#5_Kx#hqeA_~%qR`!?-hQ3d<qo?
zY(Mw;X}-)q??p9qq<Afy%OULN6S8>Sv3~5+96sy(FJyb>CDues%qXL>@<$HDOVXcB
zbv?@ICb8)mB5l>&sU4A)WJ7+gPBoGZU+3kw3j|DJh@HH(V~+nKWk3y&aPj84uQM~`
zs#|kCZG53`b+LB&=|=G{Y74a2Jg=p-cvq`fvc)r5p80^$Wpce#>CgJ<ap7`1CcH&d
zUs<Id8^;{S)}43SUfpumkBuZ82^OOvp?%KEuUD`nSLdT&z@9{AQi%0=InrOJ-WSq<
z7he@mJFXRSHe~LG^ygC%FX}Nh58Y-@oxe#bAH+y%?P+-PZ7MPCq}7|rWUDtDox`(3
zA2~O7QAJm6cI##)+s@0S>@hF4W_(<*(q%3`?(Jrk>C{SOagVt}K$%}T*l&aGT*%n@
z(@bv}gW2nK%$}RIiL0Cxt_?fqX!g6Lt-}qESvy`NpgdsGMfU5irCG%Ahn6oY@gij1
zE!{1)4X?ZGjK#g|^aIK+>&2J-aGb)jSkl_?V(WT4VjWucKAKE)<(f>0TtB{h%=?JX
zOrKtn@XUoKJz}p?d&AF)zoj(Hq?37d@7&6Nn`j`cw}n32i1I3NEhDr%YPdbNUM~}D
zTfZKBmd$5$a7_AVvtztN{QV(ZozXC#gNcL5{j-koiZ&f?3m0tXbKbru{$(A0r9WQo
zdN;2mhuI^kkDor8scKH1di;>+7Qef*NG9bA8l$qKwYm0YnO<2qnsmomyR7^lKV`07
z4m-;3nKHT{bVS2$^dYmd%;|HF?v}YfoLfzC_EweZ&GG%}xO7L7;I^lU=LKmhnUDAU
zP7eiCUYT9$aG1(#JUQ~*I`%_B)bz7Y^<$&AiE{+GsE?0VCveP^PL|QNrJO7Xw;D&~
zdI*Ysv{&BUq3Lr=Hg}9aD!R=sv3|W!jC45bNb4V~#)~-1(TW9BmUN2inzk$Z=i5VQ
z(N4clHTs&gEtzL$Rt;V4SO-zvo)yM>e|Frxejn)=LKP?M9ob`bCHZ3dqY(mxsQEQz
z3w3oA7yM0#!h2$k!iT@`;D;7|kXV8~d<C75aOCiBykX>@6efNBC=>>zEH8W26>n~U
zDCWiLZuJuX<KvHKJ=R!Xc`4_yC^O}K=ndxm6Fgt=q2HnG6>CG((;KV}yop!WSo2s$
z8nTnflz#~BZSKzdqzKGJMovgh7)ZCwe2Ed8aPw~b<6_b5;@x7eOmo_a@EDqq5r_Kw
z>mpH75$D-FSri^V8D{@iH6a;p`SRbtkr#4xai|VGJHbo;zJ(DrE_L{MLdKh!D7>_H
z{br93u1Ls;K{a^%`({kF9v)iDu%!O@zjwf)Lf8L&GY<C-jh|5H5OVrItHR@0`1j2y
zJVOFP%-wM|p~D5D;Ch2c{__Zh#ALYM26c=7yn>%le(d0BU>$c(F`^1yM>qZF6+(;~
z>AyQb*v*AO*>H<QGX48ZumKGJvjP7e;s02`|G2>ab1~o+S(3)hk<b2u<_%Q!ub365
zDTcWYBM2iQI#S3o?4|UU(z$cz9t4#NE2jiJ7tc~|FiG#oGSD&hTv9-5`3OrCDC|2y
z^%D98t#Gh<^hM?IhS&6ldhQsVo(s2b-3pfSa8B1r)!@<@7CgLowYzVgdE@o?v@e{3
zM>}q71^*{-DqY8+E?A#6{(DvA(z(}QWB<3_!>Hvev$w6|y}M~C=DL`0qr`!=wNaE5
zuZx5XS6WGwz8_d|X9!Nxi82b?bOaHf;;J)p888o9jTSI{d?r&al3eLM3eW0Ueequh
z_x(T=BkgrvC7h4<ZeQW+5O&QdhSqn6MJns_1LNc2CK`q#H(uaK@Ch+Stx8nnNMNou
zL?1bJGBl80qG9ppCsw0UY>GoKrTg~A{mrhbUDm4Y<$xkCwYy|bnK3BZV-M}g_k$^6
zVRdhT?Rd#VTtK*b%>#mD_tojdiZ^tGt_H9Pjqg+k_9HYMXAsk+qb2UEHV@3d`zud<
z|GgB$cv92-GR~#gGAj7le&8-vNVxUsv7q<%hU=Gs3Xh_!>jf${KeA3Q2*+WnRnfBq
z3&}r{`1UhGHks?sM>I+D{uIFyE_SB{Za*|@j_JXVLrzp7fLeKNw#VoSDX~AshF}s;
z{T*G`-#ifXusYkQEXieIFwd+$*wu3CT8ofnb6ps(p&A)20|Q@$`)cs!?}^4Eq>^L{
zo^V%N^a{@XokBV(lE*N0_*enAxtbVBw>*RE%Hg{0!qSxD7bb61dX{W2)t>z|bV8)R
zy1V6Jf@^aK<iEH(dR&i?qq~_}4aRCV-H|Tn_)9JQMoHdr+6zk83-54>_i&_N3cuM9
zA@FuHyu+>hF|G^0m|&rRI$3)EISUDIlsT)1@fmS_ixvsD>CA|HcR}P%HzT)RR?v@k
zhU{lNd$pAWWvk!7I>kTS;-E&>i4!8gPMn*Y>R@Qu-uxZ?KAMDv?*gTNjKOV{N4Vek
zlXy<X(OT#BP5T&emSK?wgV)LxPMwtJvqM!TgFbs+o6C*-$&M4vVMpVioTL{F387QM
zyduk0!!c+zawrq-TZ=8aYG*CFDON&1MJ@6+bzoQIw0I4NVj!JtX2x2^j|^Sai+251
zYXPL!FS9(y5W+E>V3IaqM$)mU&TzDSr9jaS`_d`4eCdhL{*>kmgB9&nK6?pA<1=2d
zs#1rncfU7{^}m!(mWu&9Ldys5Tid;=>hZ>K^TBdA?Hpq%>mON>R9sh;;INrVsCmeY
z7zdl*YQ{@1H%lzkfd|hvg_V=#s>A0m+6c-aj&Uy$7g)00H?+$JM)F$B@mC#~rF^(9
zt(Zd+6^9lhi$ZfafBXmgA!B2Yxq_fE%@jg>LvZizk4Z0)gB%vcGm=ww;V4}n^!Qde
zQ~FOiTy88xGf6j7FYub2_2O`iMfud102=xUMPD+dOjJlvJ6GN6Uy~)@*qn$Te<3n>
zUX^2tca1&XQ(Jmt_(+@4(-`MaUPHa<AMd#HEn6;{%EjtQH~3+$!%3+JFm#<m_N-X$
z=C>%}<;72=0Y{ylke&@AmD$$1c(<2@Ucwn`ws`u*R)>zJzsBWu2E&v1<t!M~^uN9$
zfSzcM4e2uRF|sQer=IG`Pn}YibYDqRi<dUaxSCGk7#DK3-}}#8u!y~}LAmSukO1W#
zvi}xNs_W4vK05^~lWmuto#JNe{hdDJ3?C?JFS0kOkCQ4335dJd^MMB26d^!V`89;F
z7)h<~coj?bW5jKIED@{GdE;8O>A~4j@6_zR&H1tg6~Ch=ZZ|uQ9*Lz&?(@kusTzFo
zCAEo=g`=7!3E#k#V&BM4ppTuT3$p&6QlDjzpLF>Jt%Vrt3C#D@b&b7eKNKuK+w!Lk
z22WtBNzj9vM5}VPTE`Hp_BA8e-Ttj9<gY{%P&HTB*R)y6{s+Q$l6hlT9I^CU0c_Rg
zi8B}LD!u+B#*c8Lv#Aq~b##7H{D~}gk8peCyQT#Z>)aBLcG6RWC?)m@(fdTKnTD*A
zlKniVlZ8eKcIgO<jlda-iWtWg!LB6vav`$0E!G@MhR~gu-z~E>$^89H_nS+{Oz`8|
zzGe0~P8TMWhH)N4+tDzhoIFUhy^)CPhc12Co}?U-B=?9oWg^FWXUk!>Tj2X2{SpUr
z(jTImT;ug2wOQsFeEau9b<FW9Z6a$VCAu?IX;kX6U}y1uCZZNhy<vW!EE&QY1v&Z8
zH&H3sT(G->Y&3i~l#I|f9^o!Qpr>yqqdEe+x|ms=tQrpCQcOElgFc_sV`=2eH?wF6
zdjV^n4%@6+rC4jz<zLbEyQ?fO$?m;@LV)DVi*tzQ8k4oMhCV)i`ZgO?#qVYb!P}n+
z#juk>$Io0;TOMzmc*J!y`Q)W1Cxzj0ZBVf0|7r#_#8$0ILS4}p@7lOae){-vEHkGv
z_@&HlHxJ`N@~VVir(}U`SJ=l#B#nO-s?KKdDAD*PkmMfMY4xw)Pie!oXMLU<$SK>u
zU!U%5Y_ezx<)jo%Cu^M(Q+z?oI@epE`la*61I#26XuQs7p^^Q~M|10EUyu!Mwvh4O
zE;iut9w>EMo_za~xW;li+K8rK#$zq3XGVH8{T?g&4Q0Zy3)r=>=iqFv+QZ;3uY2+>
z8$&sDSWLe@w>Zr@z$Ii@cqcB)o`dT(pOLtbw_eeNS)v}tb)4jPE)1#Fb%GLxeWNA3
zGf=7<=Q={fsh1UTXRy4d-(e57vuV!xf#%E*ORTt)V6(^C?0f0^+VYHbkSAo1vbiD4
zZA8=X8(*l16&FZ>q@CD^@3lVQB4|G-nXaD`7oN$mTslRaR4Nf{*HZ6`>w_a5_;EXw
z1_^68+|fwCW29k;3IPqUh2xQyapXpNQ}#Z8qQ|~Pcj2+FD3W7+cOmH2`%|)SU%6#7
zbs7q;mJ6(>kRtH`cY-R17nm;?Bjxw9^pK}3g@1xsRmUp-%u|MG5oi0zrB5`xR%P&*
z6^InQe_M|KHyt|g)J=+D)9A+!kks0YSVoFwyxrO@e%*}F$g3SG>Srw8OT2ZHFPg}@
z^7irZv(h*6EWS+#X<s>x9}Et62$?*&7a=T0-;V`XXp!0DK{6XyLc=Fg8MVc!1Q!K=
z3f83iqhBrB!~LEqTp?4^#xb<AxVa*jWih-OAm7-#qQT($5O!5p*W$>UY*j9SfPtQc
zBSsO%?7mY;NykqMik9Vup7iCwbH})y*+&`?bLY?cY~SZJ$V+m|lAVCV=9ThTW7)vg
z{CR8fEh<q$tpXxE9|y03{9#=?YzU!$>YK<A)d^#)YAx7%G;8Rf_n*7zhDF)-M*Y|C
z&k=M!ni3<s=ZBI?u(M)qcnX$ujbm|WZ&&~5NjeUNXH>5sGc?NVE|DT2?>N2q)#b5z
z(_0!W=Bc=B`FlS<612@}!Uamf<-Qv}T}C#Rnh-%278P}6^QEe^7pxc!^2}b%^%XTf
zIqS^<8G&-3_tmEX^!uU-tcyBF#aXb5oz{j9*d5qd*u%Q~7HM1R=UoNooHCnCYXj@k
zuDl&fiMCsQ7og8Q2mz|6@NV?%U`1c1ZjvzrIU{OS>5}#dutvt~jDmvN(tl=!+=hLR
zC21$C04S)cqcQPh%GA#Zd&1zZBxup}j~ADT6k9^{k(HS)O=Gw5Q!hK8PrIsD3pJjf
zDh`J|MUwM39*wX!i@k=gw*W&qplPk9XICL+MH$X&N^d3sgb4pqAL94Bdj{Ya#lZRD
zjipiTG%co=oq^M?3e`?h39#<h0HBN~_S#kX{p}P}BxI3<+3=d1J>Gk3%4uTp1d^O}
zeE}q?hVvPzwZ1p4J((Yv>OD7DAyJd6e@4pReY`1BXGKr4&ymgdvnmtKJtI`9<Ag^3
z`n?bRkX+2Vb8lIgjDR^c!0lOG=KI?-bh+D1s_76wMMJQX{c^WC)3~CvCsN6UySK`x
zswq-P(XMRnBEMdwcrpf4G|@4q(-JQe{~%NJ6y6{F8#K7D$&rB0JJNM(pS!dxgdRKo
z52K0qX0kzx-N%PS_cbRx5U1}%d#umD-re~F=7c@^M`mwRlhSpXHfXC><QSSuE9;to
zb?=9CA~|x|YJYS#DW)Y(iu<M1+BL~OyMfY3Bgv79YD#2@e)w2*je)gg0QVO!Jpq&(
zW;J!a$c`5fv7qU|s$Hwk?(egdX1Ncw6U{2UW%oGR${v*B(;P!<P~I~~Wt6$xsBrJR
z2Iy}mvGMt5sY1Jfh6fGzuWUd}qhn3*$nF9}cOgR3YwJT~K{h34&W)1bn_?0!^E`?#
z8D7&EyVk7W*jy<z&n@nuJ2P}cphBxFTc`{mLI+a9R%iH3<p7$8!mNqk?!*PBgaW>_
ztlIS<AAa#8T9)30_CtgBhp7-`spX4Qa2FD`k&v0S+V)m4dda=bwA+a@x%kO$%{K~_
z`??#_oOp#3YAR2<<T!^*QCE-Y+UBM=KA*(=X$<FAy>=~vPV`>2D4$VjM|n!oaGCQQ
zt%~9DWe9W>r&@82P^BzU`^k82<Qgb#!x~jrt{Y<*weSYURTlo4F3qJQ6jiQvn!!z<
zE7Q%mDzNuqopI|a!)SY=0!iD>J*StW9drv7>v5AN&+F;k&!~mF-!8Ov&@K-|p^1pB
z65Ie6)p{r|EF(4N1yrZ`xF`TY43<C5?o)#=P6sgzwxg*P3GPWwZ37+)6<z(knmqB7
z6HfIZo}0@GK8^Z__bH&a{#^2FedujDrM3<zK9xgC<1(XwuOfrnXCD2uPf}P*o|E`7
zjKh(r4z4CJ;G-YzIc}{?eS7DVxy(uru`v!exYeOBLODI<36cGqgUXistiddlco!PE
z2Ha=f4IX)OnTfnr;z=pCD?-4uN&Gz%Cw}BPGDX0<CkclT)M-uLU1T5cxm?d~@icJ0
zB3t6al=SR<(hA;Ei*K}TG8RJ+{@Mm?!*~qN>7=N=f-3Xb(Gzrp*H~NgEj2l>BtDEx
zi|%Ez(R=*}OQwa_r7qWQ@(bb@(WHsoj9>dKVz#f*rPNE1vc46DLvRfL`r=Hq@KYyl
z`HC-bT8rtMwmGRsV@ji6@p0_++U7*^K62WdmvmdcW+2}#hjAFIyQOF3y4tDVhN&h;
zXV2huDme9hur~cse+DYTDJdnjV7zVr`b9}JVUpf3WS3B=_9GcIVxJb<cJZZ~R8{sv
zicKtPv9H|xMaB9~qIgQ?B>n^5goPG}Esb@rn7>&Zt!aL_LawAz?SwZ;@6yjS^W&XR
z{_v{bO(-3dJ70Doc=G~@mF6$V2<olrQc<j`VHfZwAj~EiSGf1{Y383N>SogVV0ER<
zzwmpCI*C;QjYNYy!|o}>E!2~xqDv|399yK=H6@qu|D64^zN7k$obXw-ebtVIU92r>
z>Ml50Oh3Hi*L*;>Y!mT6vbWf;oWcL_J;m+!FTtDDsEfk6HPJj2G>98_ww+}duu$`m
z6+_nbfpBtn#>8t?x9RRGAPQxGKkEDw01_Uap0aKmzs^oHDc&otX12S%+5X;Ck^R(*
zybZPL0z50gcu7~@UY54y%LqWB^@!*soO#%WZ@j<#h>Gt<H-5~gC#2y=PtYp;YKnZ<
z@C-|KFBG@-V4CYVenOxP_0XxW1$b!-*PjtH<1kV9d^d_2GxgpH%BJ##V5d7ct=p6Q
zC-ojtvZ)0<ARtNM^y0f*&5MSE5&q4Nb@kVX8^thE0V;Z<#jc4@Mp%k5GfSH7e$UJ{
z+(9tp6rj3O9V=oVclF)%)l!0CKVwEgOtmZ;Q0Hq16_hyfH1%ygDzsKrf#|A2^(QpD
zM$A(p(bYot>>DLb$7Y9X{E~rVz?*aUb&Db6G3I&oB~mFCpS?dr-_BZ#^MxM4D+6MB
zj+$KtXdJcS_GHyO=egd9*eBQ<6F|7Ix(6&JhZ_e++m+Vml`9eA@e`RfMYO5?>YNi>
z4MZJ7I|E_U%d6o{$QZ<iXS5;Zp4kYMM#Q56;o%s_1tnjy4GkS23ofIj+*g8be|c69
ztKKx_b>nOPFRYO*0X9!-#XV+A7)p+ndu!93s)HU2QqI`-VbL9)EM(;5Oh#qS!Xb;`
zE)#E~badH)-WPv!isJ~H@G~J2$Y-6Y>9O-r&d-<s)({HQDI(*6Sk_b`iC6{EWK{1d
zZoN>;c&{I`H1UNU#k9WF@YqL+M$}-(gR7DKi0GR}^1Pv4$rt>XoXV{X$YY8hKW^xr
zVsQ^%Ex~2`^dSL}zB0~x%R1gx%S(b_<DFgVMK>lKj$G97*K^Oc9!se4o$6WWYkSgl
z*drxevd5YtDLG+hl`%BQUGY`EM~<lgdvK@W^_^`#C|TECP^*g7M}1{>H!eYKOGSBa
z9+1iG#UxvYX$obi{EOaDnfW4d_G-q}`a2?-`9I6BQ^BpAl235sU@~DL!Hj6awCWna
zW2SbQK1*L{w4s=BG&BlV%m4^IDR>^T`S5YPfw+91y<Pl-W~-AXp$||UG~d)%$q6f~
zOB^TUkEq8=CN;85{RnqA%z<6D@K1t(gI`591_3dhqIK1A_2ow>BqQgnbzT$FFa@FU
z(5+NA?{lFEw^{Ga^$PdonX9T*Z?1NksQ&dgB=HL6ujS;sJw%~3Xr2zn<s0N%Xz+85
z|Hc8*J4XA8i}7YSj_(@%z$bv-&!~BA&>BV`#{(tig8*do=~68NNDm<BEwZ<__-Nl(
zsGgylX>os=<w^A_Jam`GB0U^C#m3J1kU6AY0aG!b&2bvU%?t|70nuwGV0^Dx{wBl2
z@8?Nqj6DI=)8&^8ybI;O1EVwLdW@j8GyULl;#0zI)`Ui;yF-;y;A)1m1vZ_6reYH<
z@c`(rzQRtm_vq3g3yCHub(&%0FJF8>C+6_^&Ub6aHM0q3s3y(|U({7#Wby@ME$}y3
zOqwylX4)RPc&A$nA>3NcC7+<s3{*h4MPnBRjj*;F1B4Hb_}(NU!yQ!(!;;@iLuylb
zmov`>r<VcYo-gWjQg047@$Pv(gSBeo1bVBRF`-dFH%&`6&SPHe;-lP}z8nKG+*ZMO
zdQlF(^L+nl%6oR*xixC%A=5*6(tdmSgmS!eMUG?|51NpLPb>GPtaWF`Yqdy0g~un(
zgl+hjKfureYW|L^O(qo89UiL>5x`5$u0Ksqe#;^g=u$&!-s@(SE;c=!G-$$y#+WC=
z{caPn7O8n>?24II2omw*C;>I!2+?MeA%^N-0cbfIN-TruJq)dg-JJ&-6)*G><%wkS
zv@G$-l#+3S4{`uSBD@)ti2heoKAOX-FsP{F6FH1G18{44(|4DCPqaiEj*1L<I$}1V
zuotmdZjBPQZTh(2zHiDpsexvYeG;&uL;#qSlmL4oIXc<4D?7{k6&YDevNF!s#Cl}(
zHhw!>LQ$>iYcAIBshZ(PwS9cM>q~-D_?w3(=^N84$V*Y>%~ywjYwao91ja4z{^(`5
z-OY*dwLW|H4K>1-6Z=pE!H}6Zd5lUgD0JNyMEer99-)0oF~LZNk09<wsr7zlsOy7)
z2#%(MJ!dZc2fK2-qqt}KB<pK#ed+<hoI5>v_f3B7172-BdJ=J{9IOY+OOmI8{{iL_
z8gq>D=oZP}&&$_(Z3SvlsV~__&$z<YWs}I(N{Iu?^!l^Ly78Rk(>fs|8I>=@<ochV
zo+yYU4`ZrT!1lO>l}UW52q*t(2KGQ^@YHD`i%>ArhN2<wGYEYQ*l(7qRp!|-QieN)
z6CZ?3mX|BR_tyI&XvME$!Y^+@4NA!~fB;f4hcTr^;W+ufA5m5^@mzXDCyuRazEj^g
z3jW4me=}p^hA)(f^`kFk7;<os)I)%UQv%-`t@yeR#s|>g;jiPQJQ`qA=bEFoC3(Qf
zetu21X+~nBTKUK4w8ARQaZ<KnB5m&ksy2U}af4{}jel$B&pOnxuI09_XgFKKN0=(_
z9W&rN9RS;DnQ6u`P=T@Ce5=iBNqD39o~%Vv#H}A|YF4PKz*R{oDkcBS=3A7Nf$Lnm
zu-IQ<8*UI)f4U%n3V~Wmkmi#i&kDLNn~9R;$}<|7b=*=`kye$jU3hUuL<9EgSALYv
zz!*N%Y^G~k2*Z~w>vyz0Q=dPd`HDl=k~mUjnc>BXrdNH(J_>~3BvF!9$sf(vmD;#9
zrJu?GBy@o}M|dWFws8d)P_jYbVo^IG_T@{L+Z0A+fbD#ssb9eINCD;h8NVT%pC1V_
zzUsFp0HXO4=aP9PXvaLkpwXpAUTM&}kG>FT8d>t%T6v}S?t0{xF-p<cRDd;j`XrMd
zS4z)B5AHy*7n?aF_wy1r5HMCd^#hO4;x^`+%i|U&TDWvGBBc7AC|>dDv=`nre4t5F
z$f<A{<y(zE-Ic@d{$v8G<KwgFW;sT3AO<LXe_G>@5XXo+sldsyLi^`+jF@B8`&jTr
z5f9c{sqh7adCMV|w{MATy-aKTPG~jH7&yQgb>%iw^cfL5q;sMYDQMm_FHej022h$9
zAY@q|{Kd2&Ssb<>(~cHDvkM0k1V_N?buQ;z&_fiS6@_T|n2GO@&z=#mCw%S!6>#TY
znS7B+QVHUO@qhU`44l0=IQ*TKsV)(m!YUi8fH2ssoGIZ?&WwEGIfZ8t_v3QPm!EB~
zSM9xq1B>jNA53E{K#abftu<Md#-Pmkok5xCUC#RjCXnZMdY3ySE@Q&A+$UmPnzo!>
z2Askn>?t}g4Dv2i?Nv^$H?2H>GlB=FeG?zM@hba%(^^b9lQuU{vmFihV`m7k0_GLp
zvQ;@CGbm#-#pth|rT+Tz(af1}VtCM2f)%?fB@W|bk7$g~`xGCOx+J(53Y<usjRsdp
zOf`@@W{@M1aw;7n;iPk<`<W?VDTMl($UWKE>t<lHs`u~#PO}5g5#D9&ae@M0n+Kz>
z3Ei#g7YJnvh6i<^d%#XTd;A3nnnzEbl)V=ZJq-?X?;%d(at1Imk~=HkKQ8DfUhuai
zmL*NIje7PV<?_v)t<~jj)0fIq?aAgtm0o%Ge#*g80wz)J-Uca`Lv0bzT!oKyEA6wd
zGO}oi+vy6Clw$;KLMtjr^otQ1vZ07nEX1@hU?XqXbfhr>6@O7G!M!gQIVw!GI37Yf
zF;9`PpTtLlMB&8MXTRzAbD`RBw5_meOSDj%D84`1M4@uz=<)N0wPKZMD#**gvf|co
zzrIE5Q}Y|oer(qn(}IlnSS8Mrm%P|v{Mq)Op}jDP`Ql4a7w>Wr5L2BmJ2mxg4Zxpa
z@G<{=LW_l$tU}7N-ExTw?E@d(HLqc&A)4e}t=ijJu&8mR_edi7a#qUyYMk4z)123G
z69R1Ht|0cjPV&K~Q$-_Al2d#>?uai265)|6BXo*{lN*VUiB*38`u43@V{WHE_m-dM
z)>LYHsgrGe5R)HsgEAB1id96DYcUA(C}8?^<K1+J$ZIsur%)A9zI;v0UPhp?9H-Cj
zw#E7hDCfi+E&dS8YQX8D(8|Qv`F?{quQh}G5hXZ*5Xo>F@`Pid>o`*aq^<Q*a{1|`
zAQVFjB8H~ko8#e$RbzzG^nG`zKyc#8m7v<U3W*uvo8pPJ=0XAerpO04`tdEoxz&Ry
zfBO*u<<vt2vSJ(62Ke%?7ud+Q#2-hNjIYg~fHw(+H#@Xk9j7UOs+hp%w-O^`P3mJM
zT%rtTL<X=SNbiwNBZdRW8R0+q6LqgXZJtVaxW9M{%?%{JvZ+TsJPJ=jhGVP$6i`D|
z?75lO6fM#+n|Oc7YhC$`+#@sKYAgbpplv}QWZRDL4*;&gKBEN9pF!|uQW6WwqJbT+
zXnn?)`RUE2Cry<#r8l=B?wC!szOhJ61kNkUNzzD<{uxwZkoZ&+uBL0ZC&(Ug`~6MC
zRI2e3em1ldDhB-)9<U$`?n;}Y15gtxzzcp0yNkpmAlT&RtM<Hs#0{3HM{4>vWUYY&
z1988h8a}SDHa3OE3%lhY%Un|ORAy&I2Q2du3C+b(NhDU6{Cs?-{|0XZFdI<MJj4LW
zuHZ9KwYU4H9rz`d<}ubP;?^V6U16jS6<t{d%;IDfz+n7Z+k)-k=pF5WlJY?dV_yiA
z-;Bffz}R5rxJ8SgNNm1)rt9@=e~IHV6!rDIMg8Z1DMa#h)IN#K2Vv#Dkr&5#_#6&1
zO@K5?EnDJF%>@l(7_%|ecvfa+W=#d9m0bw5-UNQsG;Y+_hoEwI%XX^m?Gq|)JvHo-
zt5NN5Fo)OB<xzozwnK918)ETiNs?5<HK4%#xE3SrP0FIXibHFi;NZHNcC%gV-Y_nd
zTVDgLg;hMj4f2R`a*Hj)Dd4t$HOJbzi#gdl8&&Ca&3Ph$b5`-CUbM}AV>t<v0C79d
z?YBk#XJcZ%(W_?(S<c|*`-(KT))(v_5YG|N3t2pW_~=nRB*dmLLnsV><5B89r*4(O
zV4Y20ol_VCnz(3t&fZ6;s0U&SU#X_VfugGoE&lWB!sGW+mLCXB-Vy|#z&I`3HTrG~
z-Q>i9snXD9XjU^84LC;=NmuUedWhX0k;l4wt|s94-o6QWI$tuu@O}Qef|m)8VWArK
zx#reNedP<uG`ZLNzJMlo1?2ecj_d75m=D{I8hMT9V?j5t{Og;oJhtrfXzfc?fH&XP
z-OuQGT*^1AA2};<w)1MZRumMoYS9<J9WQD%IVZjr&Kl~u!j_PMoEI&D!*~NLP?Qlp
zw>RVWwk>IOMjdhpS^AuGp{=z!ota~{I^Ah;rDt*R-aZ%bd;+Kj|LoN-@{~E{^t<|`
z0|7fV$If0b^(uJc0g>9G?$IuqX=6ptQ0EL1V^8`bIj;FRB+Y@q*}6{z==wF>bX>o9
zeG-)RDdHh)>cLN+61o9#{%y|H==PIOJLsz2t;`;39rhT=qcuGL?0r3BA5rM+Gna-W
zIUI(AKqZue2|y6_&W$!VRyySLxY~f8Xvz}@Ja|4tbERak!lUsa5#{90)#Eg0`-<#E
z8rhI0L3@^gP@4?LeDUZ>ny#mbe;eFaFzmV0bivYITe&-VlxOOG39@>S7&pERd5OKh
z1pfUj@Q9SpZu!-$Yp;gC|N5H8RR&0=-q5~+)oHpzpwap@kAXJyUL+!Ae57kqb?cJ+
z5Er}`ivc7q?;%(4$?u3Sm@WOWZa_BmP)$unXc?JpPo>6RBxy1(zK5p}P6*BXp%0OK
zZ;~o{nJ+&N0*0g>Y-~Nk--0il_zMDWk>*~XV`*;NgR5g^M_Qp*7Ln|Xm1<oYt*vcx
z?dtm=9%fpN!;|Hrt-sC$@rBp8ygTbgf}!bj27!3G8awq=w{e(Oo?PUG16wLysQe=@
zk*$|~<8<9r%;+s&vs#=t&;ZK-wVobL5I?%oV;L_@cenS0K+&*M_z|N7WBc!pqKcRz
z<xUwgEK%~vmw;$xh0@2izzW3ZD5*R^9($PEw;MW&ES4wth*|oMWSAat+D<(hfBz-a
zL{U4#q-Uk+)>->1yN1^x@*_=(?y%Glq4p}Sz5-jzLXA#gL#Owy1Mxd;if36junwQD
zTtuvh9ZfHe4FXoI9x5y^ZPSd%AWR-!+R1;6tGp|SmPgnU{&>&u4gfV->&2NiWcxBJ
zP_{EpYOW0^=J>JZ*gpJv@ykv~!$0w=7Rk?q&rU?i`X*kw2&x5`5hAfrt^<gW4IxHV
zUUP}1B~cOVLxctNR>`|~T>x_nE^>KA!@?15%G_W`Ig@gEUD-xLLMf}x*>)|Pb<8ij
zd-I*g>d2AyU&(lml$W@lp^HJQofmYPl!Zam5sEbQCgOeU3dvV(DD4MIJMkm0o6zgO
zE$Z)lybnlFc<dxA&@c6ro{24mF0)ukS}3m)x*iVW_l_T_bWn4wMD$Ia#y1iplWoxO
z`2O3X^8Hj5Wb=qPP5S7l?rh1@&hTt&G0SE;q%ge86)Dv-NukpFJB#G&OsgM*lzW5>
zVPcVLK`PbQ{%Iir8*5Mtj~?~U2%<FdlDjc=-Pi1h*1H~P)Y({Q*uIPoB<h@Zq^EI*
zG}dRl1jW-+4Nn!GjSjRquxa>BrM4kw(5*mk)&mOswwBVFsUHquJW5!f@?$5{QS*k-
zR}*m_8c1xjc+dXuv*UhV4y5@<^j-oGgSwXk>8!)tu%5o03CZaCK)e^t>h||%l9(&T
z;c&f#xQ#8Z_jyff%ZalImy8DRSLS-%v2ie0nRFUBE!Xw4d?vdr32JGi7#2!m6fmUM
z%QdT~5Mp`28cIPiE(X?v!V@8cEYrk8UKciKXT>|y_>UH8+TeWW;AhBWb--)My9kLQ
z-DMS3Kc+eV{FzqNHNRG9R@FDnLV630#3j{41mbm`-t2w50SPQ%FvVqAz<Kss8xujP
zo+b7?!XmXLtbkC0dhUK;*`46P+HnNDfaQ{)x7@k_JqvLvvkN<JC^Gi^_^~PjSjpaf
z0$a~~B4R~7@7AZILe%dKi|j1E78sd9J{9Qmo$gkwIF%EUIKt2yy#b*DQDBTzhR&`d
zUFRAn2X>oETG5p8)_43){z;R=LgMd`<_qe2cv@>8CDH~1XLP-Skr{0W<K+fV&z`!v
ze49R9=ObcHNMkE9Jw<46wY6t~RB{p5@7#CSLa-IIJ>&3$a8M5&2|wlosJtlD?-ZW$
zR`F$&aSKq4QIXi*OKGZt$!i<XuvDl`W^u7f$S@2|8P^gq`o;C;lI8l$EC${B5bJt!
z|M|JmrBbOU6~3=#h;1_m&HmF0+(zvzWhHd;D5-?+_I3~&ReJWUJhrG0W@WkU?s47k
z%G^?i!GgFfJ8w;@-UyOgoH~a2(|w>wKSf^^9QM|=Lp-IF*Fx?IA7hRz$3fyXrfzMq
zpObgk$`#`;lI55VyMxM*jXkOiYo?~p{|Dq^>QNU2gbES;0TeYC4ILgQF50y?LdbO3
z+gXnuNV?sn;AgdyO48`Kzj=(qPI%zDOx|?6<VtJQ<@24r<!+AF-MMeBwACw-lanuR
zjJ}NSTWT^^-HA>{Bmmx}g!Z6}8i#a66<cAFeM5|Lai-zO;b9nI?5jx6Tx@DuYE3>X
zR?Z)Ehcu0X=UOfX_z`oqGpkeIYtyW%lY~aV5bkCMH&=HV?)q*A7@r^dflfIu6#F^3
zP&^dVb&+1$vlv=PU$e}kJ|>CMBfAV$Hu2=vfcVeIIWeOi&vY(qo>W3q#@gqpkH6oO
z8l5+K3Boy+QY-h_-|yTdy|-OKr1N4DKZ%_A^5<w-28pw=1{04j-5uR3ebXPhuP;R5
zW#YT9&04eln#MN?qx+!JSu%xBg`2KMv=&AhvErxlSAzItvCsz3XROe;oHtWE8Yt1!
zu24!B5_Sv9UuSdmzoIxm!YsZ(J}N=0*9=;zD-Vy>cPrPkd64V^-I5sacBVAGS0P*I
zr&B$v(Nkl4`l^+7)xs`CkBcU*KM4?`MZt3F;7D+|KLl8ZdC;R-vx%2gUF=}w^fmLq
zIK9CdVU21%Y+jlI`C<-YnZ!mifGw&g86+dFZpnJ64<!NL6EOAa1fp+4(iFHyAt4p-
z^^QBjQg?eVv_&q_4W3deJa%=Rr7JlM<VS41QpUhOwtbm$x-kg_FSn3I)A<>f9`#U=
zMz)>a0=StkVY_^wEcIn1nrtnFj3UhPULhz$K92|iCDR&GpvZexa<wbv2IQ$tY0KuA
zU$)m>@SiEi9Q*5{jKj8gv7`4Pv<BGefBR{Kgx&!MomvkdM|&WxFRW6R2NHBjHLiwv
zE~U|XiS8$F<QF&owTMAt1<TY|VCj^B7R@o7knBV}Drbr{=_xZLSfuv0f49F?Bz2S+
z?CDIx9^Q}I7rLl@pQ)i!=x_K;O-zR)XqJUVeV;vVRHAoD-z&V^#|FX<G9!}Eodz9_
z=Ba~^24z8G)CwWcq{1ot*%ABI;{@3OHQx<HN+=-Q!ON_leZ!jC&&A|5r@-G`CKNRq
z{`ur=ScS)W!`fV5tnt-`aAE`w?aW-N#l?x<|7i9GZwk?LK$GJF$=8%~CVYs>W&b%T
zP7#ujP#QVBniHdBbyy{V&;`|6Y?g6FFORrMk)4r_fzbnH%a1S5h&0a{W%u2NZq32z
z>Jd*$h5+-Av~1p~g?*r747!6th(lFCWh`&JI?sYUCZC?&{`j#EED8s~+7u0b-Ckg;
zKVRscWmF&z_bZ*BfKp$fXzaNMk3rsNH5D!>2K&R2PW#`ZP|C**K8_c*4cp$D$+6Z5
zz>i^uJ6%WATI(Kg{9}-^$c(Vwd%HUpzTMz)5vP}b{`}dTPR@GxoGpY_?D~yiypSq6
z<I*Wl5rN<a=x`)2Ks9ZG2x=1ld<f&zxy;};MxL)Q0nq9GJ|;xg{eUf@&ru^r^i`+J
z1GYOpAMxHD$&gK*_e`pYmvEjvUo^1#qulO$J#=FJo+G|a0_Z~=X5JMwl)L?g`iifn
z>Dh<76oee87ksKv-N^6V|92amiGjTXge24`H*F(HH&3nPd_1izQ*jEZ&i>{%@#E(K
zHdz?3@psfNI051LM*_FCkE|JoDSRg(B5B`v7WojhPoF_kWq%dT<W@tK6$bv6eT8?M
zIjLBcU&H@1x{LSas&M65YS4-uJ3-3=nWqr^EdRf&w0(z(uroi@^|G!ZoX$yOwj??%
z9=bJwGGxC*fSX)n4dOMt*v5$6-dNIORTgI=1AYfj+yJSnVFiGDpxEPwU48|jF-e*{
z`*=F6Kxwm{?%-gxx%Rsc1EmGIXG~SbV;rUN(j0pq1Rrb-1_z040_1>WfO_zZYpgXW
zKr5(it3`ca7lcX)0O7PN+?}A8W`~(@49*AkAhwes^ytAI&{VC)i1tMV3S^xAy(k^z
z|Lqje*8F=3ZI;%wE6k7=0}jZ1he_ZAFAW_HJ@gN4&hywwE$S4_WFroKX4HvfX}7-{
zkF;`o3-5A+TfX8oEKK60=PT3)nWwS6!=V<Qgh0@&juF_P6q*h>9f{y`sDew8k_X|1
z(FzUSwAa9$H9&jLFYFkRAvcWZykk%QE6v8?zU`CdfGF#LBD>QZIjaqCh`KJ8eR%y^
zLPTW*zQdlX8SktX$)Pj?e7t|(BJqJ87RxgN-P8`!GvFGOJ;-sTNpCpV4*to)n?Ult
z3i=BUP*Wypj&%k>ul8{{5y$@NA?M4aRq5%LZd6uSzWT#Zk42GI7RdG!=lGNlb^up_
zzSClz<6`5;jQWqJ9ETg-XOu+-UEg|c{CpC~eTHC56^B~kCt)~zv}5?)ao;yzD50je
zFDCwFk$_P_`yOL(#RQ&};e+mpN{w!eO)B&xVtdCP9V|(fMhRM1lh;}y;vAFW7gUM`
zgPQIyCO$Y$I-Wyz7b*5?l2`28fZgp|>o~Bu7Gh|+DS#rM6%<sfcjyOh2%xn@iWtlh
z(JGvauROul+8j4f+~Gi9u)3e*e|Q<x9~b7wo*k}$x-MEIM{;A>&%z~%(NbZV6ShR1
z>7UfYE%Cs|3bxPWRPh0b#A?3r($a=T-~~b5zp<n`2aSMrSOFs7-BuZ{bB7jvo&g8d
z!J%AZO{hp1{rVyV@V3EgmZ_vGaQ_#$8~c<K_*FA?@y=SGLaiSk)Sf*At5>E7F<u?r
zS5N)jfhuCj)eap*hgPLGbRURuryC(!j0f(Ldp!YU*$&HN$_9`sJC#F>xxgKCe7~|D
zWS?O^gp<+(I}y*6;y@&1h$rJK?{hf-{(C^)9OS^*gF$-ZuSX;gu6wTl14_e{0|HXN
z>l4W%cGRjIxAwmH7#0S83M%xxn>FZ7$PP!u<3k=GGZS?B+J`eD$B3bVG(<vg<OS>C
zBdQbe=g4tesqfM`4;CR$beuu*8dSmOK*L+jEDv_h^nl&)U<HIPFzMRKNkD@jI^D-9
zs4*^>jw3qov}%`qIRu1(hIpUdsj^auHy`m6G^_|34l`LHaYwe1;WH}Ea{<)6dBP@w
zf8tRvo`-*M`NiaFki}mM=9K5vMA4pe%{y4OZ!7VKJ3ZRaw&J$^kwtV@)-yr^FOBp}
zEZ3nmD3xNT#?n*cg27-v-O2}R{x)m|E#HFxJ=y*KEp!4EfQ}oDk=smQ!B3F3GRr`O
zlE~H*k@VmDbOI5Te|^B#MESvz2;+BI_2c1b9rZ;d^T>Pxa9j`1Je%<sq`<i+o=(R<
z*t8@)MgR*4!Cwz0hfXwpSF#qE)1#X)hvqa$`OgKbAVm1y951s_(IA$489(9V9ja%C
zNo0*Ul;esVBASR4b4;l?{pW%RS}XpdWamD63m!8=&>i>N{pC$>MHH=?(O*&*@>UmN
zN7|K3Dq%dF(Dk~f9yxGb3$0aDNH>1)IlM;r+;@j|d`cFSde9^G!(8_g4HDmYq>~4v
z$yDuhpPnUJt1qqoFloi`K_<50p9FzcjPX_d@Rz86h3arv)i%B->BZ{+U710nT8~}b
zeUBgJG!5|%;se=J)*wzDwyB{inTmw>YFXa@SU5d+qoP@BJKK}5m}l8yXOW9EmaZx=
znH(B-38FAe2A<ggCVx1urUf<Hj+T#<Wgo+W2uBbkS2HXo?o+I7cRye^9|EdXK{`F(
zHDxO1=_(J=2edxwB85`n6IYDARy#5LKg+2Ii}_FkC61{ejuuXNmT&Pb7|1$hP<%Gc
z69CN^4s5V0$b`&*IS`mwI!s4oO8z*z7BSPW$2aGShO`|{PaI_&-ZEU=`kDq^@O_C%
zC;j=GxT~3Z>TGHej4(eD=q4R>46;-SU$y#d6i7+@c5YLC`iUs4;i{61_r9x(C<EkF
zC(Eeec;a-ZsRT3`$lm_WP$W{?%e{Oe#A5`sLLj~_AJY)ZN!45^JvoM({-S(fjEgde
zhy>`D_8b!#A`(Zh?y(39r{povu#Hc5W(EtGUM>Nid>okYr^hmirhbprQ!Z?d#aVTv
zy_&2V3+%HiQ?~&bLccYl8{EGrA9dRGf{aL$WvKZZ-oBKc9*GjooXnp2_?_Uk{=4g}
zfFs#Z9T-O#f6Mdw<IMaA^d739B`}-sFKMsz^02hlp*=jr*lHb3sV~85)J0J<XswEO
z*)_kQ7oy<7#BWl0r}Q%^o#;&~7%H+c{%*JkS-Bu0VC*n>*rTC)Amn0f+9I+P?c2HP
zG{Ea>L{|tD)-My7BQcf4Y6mfhYg}5t&#RrJ)MN{aq62GdKSwq|;@)NClFWKKd&dW_
zYZb@#!=denpVRwm4_ZElx*h^tZRUhg8KmdkHqI#4&Ic043v*OShrx-Eg(rbB-YK#c
zNGKLhbW4g-wNufs&Q%GGLtnz@p_l}kAMsS|M<Ex(?~heTt$MeMX%80Sc`k@q@g!vV
zfS$>(?7Anf3;*{(B8*pY0#W2QfZnHEPuOtg(tjNl#zhl=1cwV@<Pe7YTPLAg4-w|l
zAU@UMgZ4wSMNmEBd`;Gg=@bWv^el+3HYXbX(qAZaAWa%fcwGgF%CDCAg^fJ%0vx>I
z?&*fZv>tfssd2>MdjhO$4UA<BVyH+D79PWX55j>e5=28ZrSBX6yrFdttwnQzLg5ga
zPY|_|L~UJULaA5K>mLA5+)Dxm?~PkMWiB}|7BWyvoSrHE-(Z3#9~m{ZMd5_!_-lPn
zzE{iB`KK%!z7-<^gZ;xh54h9v{p~Mfu&WN>9d9TSpk0e6cc1nkuriYAGoCvqhs-pb
zmhtw4T1A=&58cXg&lM4%>|aAkBLO5;f*xyk5am;d#Otm-6Ohgqp${(+bkrBPRMmr;
z#9HZ#NjL>=E3+b<_uw7)vY2o)cj%kLG$yyK(tw9DME4x^bn=7$ck^hyzpc>5D?afp
z+S0Lkz9|sZ5j%G|{V<f?Lidtns0u1vzAT<)*35kpFYR~lLyAKLMnV8;93E)T2SWu{
zlDo1-4Z_*~f?Y{Ta-}S9zEDNcMo?NFd<ccECq`z$kf}>xP}J9M$i#z;W8z2Z{6QGF
znT|xKd&3yen><gRa_0qnl~cg6dJqVGgNXmu6c;}~8$zK@86e*U@VCCO7K@1y!%GV>
ze3Ey##2I{qBp4`j35V*eBzxBRR0OJn&l=HO?QinXU-;V`%MC^Ul6tI;_7!w%sMUSG
z!!0JmVUP(-r<orh@fbrjuMKhHC+ylf+8pu|7YSh)LLM5IEaChng-jwY=CY{i?6t%X
zhbflA7bZ)Ufam!3&NZB>r|>ofuJ^L~<->oAAgb%)FGLth=br`#2ajR=h6afvRqUxM
z0Gk(5$CHtcW=B-VKHCjTr(>^Pi&=_(h`LZo)bei?7)JErZl7KtgA`Y^h<zcCdJ(%k
zYI==2Y5VYAR=%*-x&&Fj?l>VmZgSk$kHbv=KIm~@Xh1fAos2_}u2C9vvZlb5X4n7i
z7(zTPR%Zl+f>0HO6We=_nNd!C5)sRX{=!Xw^=L-e<*DzfMh4#8B<d)eWH#%6Pu-tz
ztR>zAD~I6PxVqE3ZsJWcy2&XAE5l)|2oPYFsu4?kmx(S&LMq{;75=ls@U=?@1Vad#
zfOz9=o^nSp>H-m+@&4@LzMJG4L&@Z>qkZVlAm5V12_z^FFH-#ba6G<9GYFJR+{lC_
z^oXuFOf=JHWzmK6NhJi@QV<`46=RwW=%yvXwu4TU#_d<6C5vX9B#cq|<mgMJc(35T
zYAqhdEHaQc0xg{c;ub-lT~D{kH&4_agB`sB$(p~0Hm!v7okbGz5jvQjx_=}{>R>S&
zsyCq%E$F$i7%{A48!u$3AqHaPrYK=mU<ZD>k--R&h`_1ChhZo31aiT`u({a$+0&;%
zAO~v%XfjecR<b~Zk?Kmi`Y)>CN%-?|=JVT!fsn5OG#OC1*Xy*9<J>dE(vKh9TGS{(
zIfN~-=;tUyyw_Ufo!$@1py6a#XKnFnVPR<z<~(({tL!A@!0#bb+laCu3nmGl?`^}B
zR}$1TY^YAdC1V+|$ia^thhY}x8Ss8VA)2J0W89v5>yjAKii5TRsEo|N;>0s@Wx0;y
zSq0rxrv5j&sWHM3*fP+tENgia@s`KaCZZ#V59|r56r@ns^UylXznJRZ0NCT~Z*v_&
zTS8H&2s;esWUUYKEDEm$%1`%7|8&N~gCFL>v#3eUlj1X}<bhsXD2!{T2tJzOmw>J`
zf||iKu{$K5;cR$LyfC<g&@Adx_m)i1(6|rjN&ZAS)3KvFGc3IYz+fEcukneY+ZNut
zyhmTC1jhT!sIN}Q0JSz>v2NT*u?Hj&k1F>yB63`cwlj<&A(5MiB>(XuFrlvy83gH@
ztJMCD6lC2X4xBtof}cP%oV1({@*ra*N<pz}3iI)K#pqJjc<-B5WDU1sQ8If!3@)dB
zZt?l>(8*+V!er0d(qC5*SgK?YI_t60gkswj6b4?&Do4ueuplj!$cuNQCN&7@eh@u;
z@}v=FZ&-jm`N2B<J5=0xmZS8=4IsNb`ED7lZf+Je3p%T8Z`c0F{UreR5&f$1B}ZoO
zIo^vMzTrb&5IC4tDAoKl)Q%qdDhmA`UNN;@{O9l}F^rRNHnQw&M-BrR%C{R{c`xjG
z7$$*V{{K)vaOFFA=+<sp$$v2jDAQ4RR)^Dx`)<Zqh$3mL{XzKCbb#RJ%G7u5a@QrY
zu#X-F3oxCC%-%Z7<>gwxgr;EW33dr4gC`9wRZd~=bw(y+3Lgu<gopep;LPQ{v&oM7
z|2<YQ!CCBW!TlJr|LZ9)RFsB(wf}igS&)L95EnT8Z-^!QOwEXT@vg@7KM$G!Hg_~O
z;;#;7-x6=bD_?*7pWDJ?ak9Wg4YNC1V9?OrB$<@dkYnsXZ*Ln8!LQVT6=W&{teqv2
zY+-=KjlvVAQ3Hsi&+a6h9W297XnRMWm;<V-@YHzk>%e6)L$wi%uqGPu()5Y#uFQRi
zbnYlA1J5JkQ^C2p=aHF9L|eJLF-mHGwH2>mvMVPZ!ub#$>wOpUl}o?in+igJtW=hl
ze<-=qdMVGQlgmY)>vM7!ZtCpb#*@v>7MR6HL?FO3(Tr%92zhw<6k4^(!L%(y4%*ZC
z3p3G%KbWdYoTS@G>_S;fc3Ra9Q8|v*BKx5_h{KBTB@Pp>fX;Z41-CH+FvI-Td-2jW
zE*HxIkoeR=_y2Mw#H&}(F*6@~O3r-lE&9su&H-V^%TrLjo{ogEr8<z&h`q(|@hL#_
zhdJN8k$Urop}j)3VG*C;G-+#Hnv6WN&!0IFnl7#n3VZPF4CaIJZ8Cc>QqXFs`U*IE
z(z`V`!u~+7?Od7DjK!?D(==7i%Kq9X{h?@4uz;-;l$4Yl-1Po48|;dfycJyy5rZCT
zpRGEmVQR-pEL#|UAE|=46EWdRR$L9UurH>dsBjukfc8Dp%~A~(ry;AGR}<y^>tIHJ
zprwQS>kTKUP3jO?w8M`q1A!JJ&^QTZh;-m531}|EL=&(vFUp~Ddpp~|^f$Rd@x7l)
zJNG6C)rMgd_*wbpuV`jODEixxK?F6P&wakO+j7eCJM&A)+tNdC-N#1>61=x(N2oo^
zHSYb1dpf-unpX8N30impkzxEM#z<2VndxkDlq9?XeKB<qcmwYQ>%AfyB2S<>^5nj7
zmu~-v{o|ALT&K9M20`n<FeYy=*f95|ul%aFO2-oqweh#^3T6<nU>c?to;yeokG=tv
zY9$c?i4Rah+iWnlE6}pUXzdH}?^5CeXJmLXl*d4DupZf5<u#&$=cSEIXFV6;iVdnA
zMoBZ-OQV)Fc#}@{cpg>A*gFeUlx&1AV)Fxy5V^)}nJ=R#f|XB-wcOU#r@gFNH{rVr
zl-^x0V9BTz+p?}q(Za}JcXt+i8lisEab-<qXSugBc_sbY`^3owUqplmvjA)mufw2*
zX2*)R5O5^;cwSx|u7~gNFo#Nx!s(0w_M_q~My5K+(<3TS)fj-jQ<b8r>0-|*L+0Gt
zt=|>)L((9L*L^^<v6RovBTE$bK>_12wxUk6{*<;#2}Mi8dSd7$otU=w#?NDJpf76=
zdRE2Xxo!jgjC_|!7${IT&U{MjwM}sQ)pXjrHNiLXArJ_xpkAp&M1y%A>vnBJN9Z_d
zp3>SeL)SSNbX)Pi;#0_kn`Ge&J|dStd%7z)P#)i$RoOhl47EFFW!+4?5%yT}2Jfu_
zg42R#O1X*}WxqlU*FKhNyGkS@`UKK9FpGKxR8XUH4M*QQmuBGBxY#e8x%jllU3?0@
zu0R+1U7R2u7AX3a$aW4yGKOQYUsPU04oMmqdwU2V16iJL;y`d6o>p&cVWlyxJ(f|7
zc=)OJyB;#0&g8i3dzD0@UqA*F#Sm0XBY)yD^;hmMy2_jDTU~Dh4Lrfo&0%bmQxnJi
zyc!vd6=zfPz`WHvg-Cqs5iL0Y4B?l;H%F98aBG57XQSQwq_&EITxPX-5Dt^!Vq2V*
zDHx1LsFQy*fU~IhO<G}_IuIE(Y0F*7AJGY-NM&QKw$#@axAytQA8&-8VC|EWB>o7{
zCG!0iBkIrFmg2!(HJwUta3so9m#(FHZ%F=b5(-L^#ngVEk`GKX`0UVkw<yvF={5RD
zPOZaOnKP~S`a-h!4++&Up2V}ipqpD1R`2j$0qs)`q-R?_-Tm600=t3hh<XJQFtMaS
zv)WZ$OSZgq`}Tu)=`xs=*mhR-LcTqR!l+5aW~U`WK9ElA_E(>&;`4zWL~1djZTRuM
zl!-+{qA-6T6)(;vuqIP+HwKy(zhg8Kekzq<K0Q96Koj=^C9~C9FL-4pD~iamtz}90
z9$sw8yC)jK{$2=-JMN-lcgJ+r)-|RW6>sF<mo$E=%F8+VyHiFx`fGtTWh*Ju2|3_L
zIUooK3kgGCbNA!(GPO&+4$HNj?h@C+-#@$lN#p48f5>v$OFe_;13$mIh&N#U3kp=r
zyc@m$Am3HPM%YQ@Ag(NHd3I)5?aakHWEJ!+uLSXSL^EMJaB8Y!C|x{8wrcvHUoclN
zq;^@w;91N&7cs0zIs`<MCpDH`PIR1G7SIu*D3E=&+4Hip^<}(~Cxv2>_tVN|m=I?0
zT$B~6xm5Jx`&k=$d&R}b(xYvp4ksp^mTQ3Hq2+%CX-25>VjQvNLlsQdpH5+!x6U#`
z3a!F4!Z*)ts2K%JE_`zt^3sRSIVTi$!v2%(w!Nw{&a>SetPQ$E#EU(z8C5glS0Ktr
zC8T?oUGw+}qworr)NCJkLJGuQK?yIajtMVlGg#ueZ*J5$c{Z_hJ1Q{!TZ~w~6H!RJ
z6BxWvZH)*gG_%z4s8j#Nw1i1Uj9$GVj0FFp2RwW<0x|lZ2_2N&0D5@fV+E*JEdQd~
z|9Q{<o%;jdi2@P}R)ilF%K=d{bP-o07Tor$Rz`}W`!o&CX@4p)fQDZMm_BvzU+*$x
z-m~<$62kK=;(*WJpH{=+9Pyz5XG179PAw>eZ!`SfN5I2A=>7*2vB*`gv9tm;YEWSF
z8u^AHL@Ezd>{|qyiDM_pLM}lXKuE4vFeIabuofU1-Df^Qeyo;bjf|>5g`NkmM%b>@
zz|F=Wd4@r;stg<sGOX_aEX7Ro&wWKGFW`4OYt9CLgnR>){UgZ9+RteC`H<NWkb;7Q
z^Qk9Hvj~_Ey*GUVqikRBQvr9DL)8(zAY9zB^p?MJBN6%Dmqb&=w=aPTR)G{e1`vL^
zfjb8wA<TL|SPK+wzHJ%hhH`C~8qcn#EIFBj9lXmJsZ)`e8i1xM!8Nyh(FA5~i3$`z
zxv_Cr+ouPf>5S8nva49tgOD9K3z}dAsz8>z@W0mh>etuTs=zMKvpK+Nb1&dNb@1E?
zXyeNBAE3@^K{qhOqocZtlo}e+S%BvQWjFb+4Or?m)n_ko?W0)me&BTQ%;TW-3!DzX
zsz&$g>+92j<44EmOxBOL1MPatxv`;<!Njc<xHJUlX5fs(7LljG-uiitBlAS(16Q2{
zuNPAFI&#Io+HEnvtaKG<stkAl&`<HdHDAK_9z745;NRi<VEz4~JZ;OTpjn_8EdlET
zKI^!w+(PnqgAQWp;qR@kJJuul*zPEBYZh=WlyyeSlOJk}8h|zecfdAzPEXqO>cSV`
zs>u8*d*A|MrHlW8LkEidwqGWI1}j&s@n8pf6zPyAdybUT(|mz5M9RQ6w5f3SniZS%
zc9y)jp!o9c?(M*R{SGBzHNb|)*~h?!Pt-)#Oi*_RaaiPo(neslbqd%%KeJ>ZaBo_4
zQ0&~d>o&gzZDRyBho=F@+&zJ9Q_zB*L%J2-9iA+}qa?*VfCp&an4^9ZGy*hF>_Yr%
zsg_rZK8DH7dIxL=x=c^Fv=Mlq8PIc}Lo4szDw-k<bTYUXR<M#``-9k91tJI$(7~C}
z(S1eGDKQ2nJ>WKw(#*qwIk1BC0O)MYyj;suy};lF_ttiV00#$_^15$?7p&WX+X{AV
zeX3y&9~=k*M&EQZnWOLy=pEn*m#?l(TYLxB1#IvJI&UO8rtv@XYdH%Oxt$Nb1J7M$
N@O1TaS?83{1OS?DEt3EM

literal 0
HcmV?d00001


From 8817e0de68f92c68885d641d655d4d8c51019f1d Mon Sep 17 00:00:00 2001
From: GitHub Action <action@github.com>
Date: Fri, 6 Oct 2023 18:26:55 +0000
Subject: [PATCH 706/828] Automatic readme update

---
 README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 3aa36fa5..205a2e22 100644
--- a/README.md
+++ b/README.md
@@ -276,7 +276,7 @@ sim_tree_eg <- simulate_tree(
 )
 
 summary(sim_tree_eg)
-#> $chains_ran
+#> $chains_run
 #> [1] 10
 #> 
 #> $max_time
@@ -301,7 +301,7 @@ simulate_summary_eg <- simulate_summary(
 
 # Get summaries
 summary(simulate_summary_eg)
-#> $chain_ran
+#> $chains_run
 #> [1] 10
 #> 
 #> $max_chain_stat

From df9ab02db3aeb97b859e6a13c33008aeaa6aadd8 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Sat, 7 Oct 2023 21:36:00 +0100
Subject: [PATCH 707/828] Delete README.md file

---
 README.md | 419 ------------------------------------------------------
 1 file changed, 419 deletions(-)
 delete mode 100644 README.md

diff --git a/README.md b/README.md
deleted file mode 100644
index 205a2e22..00000000
--- a/README.md
+++ /dev/null
@@ -1,419 +0,0 @@
-
-<!-- README.md is generated from README.Rmd. Please edit that file. -->
-<!-- The code to render this README is stored in .github/workflows/render-readme.yaml -->
-<!-- Variables marked with double curly braces will be transformed beforehand: -->
-<!-- `packagename` is extracted from the DESCRIPTION file -->
-<!-- `gh_repo` is extracted via a special environment variable in GitHub Actions -->
-
-# *epichains*: Methods for simulating and analysing the size and length of transmission chains from branching process models <img src="man/figures/epichains_logo.png" align="right" height="130" />
-
-<!-- badges: start -->
-
-![GitHub R package
-version](https://img.shields.io/github/r-package/v/epiverse-trace/epichains)
-[![R-CMD-check](https://github.com/epiverse-trace/epichains/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/epichains/actions/workflows/R-CMD-check.yaml)
-[![codecov](https://codecov.io/github/epiverse-trace/epichains/branch/main/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/epichains)
-![GitHub
-contributors](https://img.shields.io/github/contributors/epiverse-trace/epichains)
-[![License:
-MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
-[![Lifecycle:
-experimental](https://img.shields.io/badge/lifecycle-experimental-orange.svg)](https://lifecycle.r-lib.org/articles/stages.html#experimental)
-<!-- badges: end -->
-
-*epichains* is an R package to simulate, analyse, and visualize the size
-and length of branching processes with a given offspring distribution.
-These models are often used in infectious disease epidemiology, where
-the chains represent chains of transmission, and the offspring
-distribution represents the distribution of secondary infections caused
-by an infected individual.
-
-*epichains* re-implements
-[bpmodels](%22https://github.com/epiverse-trace/bpmodels/%22) by
-providing dedicated data structures that allow easy manipulation and
-interoperability with other existing packages for handling transmission
-chain and contact-tracing data.
-
-*epichains* is developed at the [Centre for the Mathematical Modelling
-of Infectious
-Diseases](https://www.lshtm.ac.uk/research/centres/centre-mathematical-modelling-infectious-diseases)
-at the London School of Hygiene and Tropical Medicine as part of the
-[Epiverse Initiative](https://data.org/initiatives/epiverse/).
-
-# Installation
-
-The latest development version of the *epichains* package can be
-installed via
-
-``` r
-# check whether {pak} is installed
-if (!require("pak")) install.packages("pak")
-pak::pak("epiverse-trace/epichains")
-```
-
-To load the package, use
-
-``` r
-library("epichains")
-```
-
-# Quick start
-
-## Core functionality
-
-*epichains* provides four main functions:
-
-### [likelihood()](https://epiverse-trace.github.io/epichains/reference/likelihood.html)
-
-This function calculates the likelihood/loglikelihood of observing a
-vector of outbreak summaries obtained from transmission chains.
-Summaries here refer to transmission chain sizes or lengths/durations.
-
-`likelihood()` requires a vector of chain summaries (sizes or lengths),
-`chains`, the corresponding statistic to calculate, `statistic`, and the
-offspring distribution, `offspring_dist` its associated parameters. It
-also requires `nsim_obs`, which is the number of simulations to run if
-the likelihoods do not have a closed-form solution and must be
-simulated. This argument will be explained further in the [“Getting
-Started”](https://epiverse-trace.github.io/epichains/articles/epichains.html)
-vignette.
-
-Let’s look at the following example where we estimate the loglikelihood
-of observing `chain_sizes`.
-
-``` r
-set.seed(121)
-# example of observed chain sizes
-# randomly generate 20 chains of size between 1 to 10
-chain_sizes <- sample(1:10, 20, replace = TRUE)
-```
-
-``` r
-# estimate loglikelihood of the observed chain sizes
-likelihood_eg <- likelihood(
-  chains = chain_sizes,
-  statistic = "size",
-  offspring_dist = "pois",
-  nsim_obs = 100,
-  lambda = 0.5
-)
-# Print the estimate
-likelihood_eg
-#> [1] -67.82879
-```
-
-### [simulate_tree()](https://epiverse-trace.github.io/epichains/reference/simulate_tree.html)
-
-`simulate_tree()` simulates an outbreak from a given number of
-infections. It retains and returns information on infectors (ancestors),
-infectees, the generation of infection, and the time, if a serial
-distribution is specified.
-
-Let’s look at an example where we simulate the transmission trees of
-$10$ initial infections/chains. We assume a poisson offspring
-distribution with mean, $\text{lambda} = 0.9$, and a serial interval of
-$3$ days:
-
-``` r
-set.seed(123)
-
-sim_tree_eg <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 0.9
-)
-
-head(sim_tree_eg)
-#> < tree head (from first known ancestor) >
-#>    chain_id sim_id ancestor generation time
-#> 11        2      2        1          2    3
-#> 13        3      2        1          2    3
-#> 14        4      2        1          2    3
-#> 16        5      2        1          2    3
-#> 19        7      2        1          2    3
-#> 20        8      2        1          2    3
-```
-
-`simulate_tree()` can model population-level intervention by reducing
-the $R_0$, using the `intvn_mean_reduction` argument.
-
-To illustrate this, we will use the previous example and specify a
-population-level intervention that reduces $R_0$ by $50\%$.
-
-``` r
-set.seed(123)
-
-sim_tree_intvn_eg <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  intvn_mean_reduction = 0.5,
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 0.9
-)
-
-head(sim_tree_intvn_eg)
-#> < tree head (from first known ancestor) >
-#>    chain_id sim_id ancestor generation time
-#> 11        2      2        1          2    3
-#> 12        4      2        1          2    3
-#> 13        5      2        1          2    3
-#> 15        8      2        1          2    3
-#> 14        5      3        1          2    3
-#> 16        2      3        2          3    6
-```
-
-### [simulate_summary()](https://epiverse-trace.github.io/epichains/reference/simulate_summary.html)
-
-`simulate_summary()` is basically `simulate_tree()` except that it does
-not retain information on each infector and infectee. It returns the
-eventual size or length/duration of each transmission chain.
-
-Here is an example to simulate the previous examples without
-intervention, returning the size of each of the $10$ chains. It assumes
-a poisson offspring distribution with mean of $0.9$.
-
-``` r
-set.seed(123)
-
-simulate_summary_eg <- simulate_summary(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  lambda = 0.9
-)
-
-# Print the results
-simulate_summary_eg
-#> `epichains` object 
-#> 
-#>  [1]   1 Inf   4   4 Inf   1   2 Inf   5   3
-#> 
-#>  Simulated chain sizes: 
-#> 
-#> Max: 5
-#> Min: 1
-```
-
-Here is an example with an intervention that reduces $R_0$ by $50\%$.
-
-``` r
-simulate_summary_intvn_eg <- simulate_summary(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  intvn_mean_reduction = 0.5,
-  stat_max = 10,
-  lambda = 0.9
-)
-
-# Print the results
-simulate_summary_intvn_eg
-#> `epichains` object 
-#> 
-#>  [1] 1 1 1 1 1 2 1 1 2 1
-#> 
-#>  Simulated chain sizes: 
-#> 
-#> Max: 2
-#> Min: 1
-```
-
-### [simulate_tree_from_pop()](https://epiverse-trace.github.io/epichains/reference/simulate_tree_from_pop.html)
-
-`simulate_tree_from_pop()` simulates outbreaks based on a specified
-population size and pre-existing immunity until the susceptible pool
-runs out.
-
-Here is a quick example where we simulate an outbreak in a population of
-size $1000$. We assume individuals have a poisson offspring distribution
-with mean, $\text{lambda} = 1$, and serial interval of $3$:
-
-``` r
-set.seed(7)
-
-sim_tree_from_pop_eg <- simulate_tree_from_pop(
-  pop = 1000,
-  offspring_dist = "pois",
-  lambda = 1,
-  serials_dist = function(x) {3}
-  )
-
-head(sim_tree_from_pop_eg)
-#> < tree head (from first known ancestor) >
-#>   sim_id ancestor generation time
-#> 2      2        1          2    3
-#> 3      3        1          2    3
-#> 4      4        1          2    3
-#> 5      5        1          2    3
-#> 6      6        2          3    6
-#> 7      7        6          4    9
-```
-
-## Other functionalities
-
-### Summarising
-
-You can run `summary()` on `<epichains>` objects to get useful
-summaries.
-
-``` r
-# Example with simulate_tree()
-set.seed(123)
-
-sim_tree_eg <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 0.9
-)
-
-summary(sim_tree_eg)
-#> $chains_run
-#> [1] 10
-#> 
-#> $max_time
-#> [1] 12
-#> 
-#> $unique_ancestors
-#> [1] 9
-#> 
-#> $max_generation
-#> [1] 5
-
-# Example with simulate_summary()
-set.seed(123)
-
-simulate_summary_eg <- simulate_summary(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  lambda = 0.9
-)
-
-# Get summaries
-summary(simulate_summary_eg)
-#> $chains_run
-#> [1] 10
-#> 
-#> $max_chain_stat
-#> [1] 5
-#> 
-#> $min_chain_stat
-#> [1] 1
-```
-
-### Aggregating
-
-You can aggregate `<epichains>` objects returned by the `simulate_*()`
-functions into a time series, which is a `<data.frame>` with columns
-“cases” and either “generation” or “time”, depending on the value of
-`grouping_var`.
-
-To aggregate over “time”, you must have specified a serial interval
-distribution in the simulation step.
-
-``` r
-# Example with simulate_tree()
-set.seed(123)
-
-sim_tree_eg <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 0.9
-)
-
-aggregate(sim_tree_eg, grouping_var = "time")
-#>   time cases
-#> 1    0    10
-#> 2    3    13
-#> 3    6    15
-#> 4    9    18
-#> 5   12     2
-```
-
-### Plotting
-
-Aggregated `<epichains>` objects can easily be plotted using base R or
-`ggplot2` with little to no data manipulation.
-
-Here is an end-to-end example from simulation through aggregation to
-plotting.
-
-``` r
-# Run simulation with simulate_tree()
-set.seed(123)
-
-sim_tree_eg <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 0.9
-)
-
-# Aggregate cases over time
-sim_aggreg <- aggregate(sim_tree_eg, grouping_var = "time")
-
-# Plot cases over time
-plot(sim_aggreg, type = "b")
-```
-
-<img src="man/figures/README-unnamed-chunk-13-1.png" width="100%" />
-
-## Package vignettes
-
-Specific use cases of *epichains* can be found in the [online
-documentation as package
-vignettes](https://epiverse-trace.github.io/epichains/), under
-“Articles”.
-
-## Reporting bugs
-
-To report a bug please open an
-[issue](https://github.com/epiverse-trace/epichains/issues/new/choose).
-
-## Contribute
-
-We welcome contributions to enhance the package’s functionalities. If
-you wish to do so, please follow the [package contributing
-guide](https://github.com/epiverse-trace/epichains/blob/main/.github/CONTRIBUTING.md).
-
-## Code of conduct
-
-Please note that the *epichains* project is released with a [Contributor
-Code of
-Conduct](https://github.com/epiverse-trace/.github/blob/main/CODE_OF_CONDUCT.md).
-By contributing to this project, you agree to abide by its terms.
-
-## Citing this package
-
-``` r
-citation("epichains")
-#> To cite package epichains in publications use:
-#> 
-#>   Sebastian Funk, Flavio Finger, and James M. Azam (2023). epichains:
-#>   Analysing transmission chain statistics using branching process
-#>   models, website: https://github.com/epiverse-trace/epichains/
-#> 
-#> A BibTeX entry for LaTeX users is
-#> 
-#>   @Manual{,
-#>     title = {epichains: Analysing transmission chain statistics using branching process models},
-#>     author = {{Sebastian Funk} and {Flavio Finger} and {James M. Azam}},
-#>     year = {2023},
-#>     url = {https://github.com/epiverse-trace/epichains/},
-#>   }
-```

From 882fe299751d4aec194ca4081737f94e464f52cd Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Sun, 8 Oct 2023 15:38:19 +0100
Subject: [PATCH 708/828] Revise README

---
 README.Rmd | 83 ++++++++++++++++++++++++++++++++++--------------------
 1 file changed, 53 insertions(+), 30 deletions(-)

diff --git a/README.Rmd b/README.Rmd
index 5f3af150..5e6d057b 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -65,11 +65,11 @@ library("epichains")
 
 # Quick start
 
-## Core functionality
+## Chain likelihoods
 
 _{{ packagename }}_ provides four main functions: 
 
-### [likelihood()](https://epiverse-trace.github.io/epichains/reference/likelihood.html)
+### [`likelihood()`](https://epiverse-trace.github.io/epichains/reference/likelihood.html)
 
 This function calculates the likelihood/loglikelihood of observing a vector of outbreak summaries obtained from transmission chains. Summaries here refer to transmission chain sizes or lengths/durations.
 
@@ -98,8 +98,12 @@ likelihood_eg <- likelihood(
 likelihood_eg
 ```
 
+## Chain simulation
 
-### [simulate_tree()](https://epiverse-trace.github.io/epichains/reference/simulate_tree.html) 
+There are three simulation functions, herein referred to colelctively as the `simulate_*()` functions.
+``
+
+### [`simulate_tree()`](https://epiverse-trace.github.io/epichains/reference/simulate_tree.html) 
 
 `simulate_tree()` simulates an outbreak from a given number of infections.
 It retains and returns information on infectors (ancestors), infectees, the generation of infection, and the time, if a serial distribution is specified.
@@ -121,53 +125,76 @@ sim_tree_eg <- simulate_tree(
 head(sim_tree_eg)
 ```
 
-`simulate_tree()` can model population-level intervention by reducing the $R_0$,
-using the `intvn_mean_reduction` argument.
+### [`simulate_summary()`](https://epiverse-trace.github.io/epichains/reference/simulate_summary.html)
 
-To illustrate this, we will use the previous example and specify
-a population-level intervention that reduces $R_0$ by $50\%$.
+`simulate_summary()` is basically `simulate_tree()` except that it does not retain
+information on each infector and infectee. It returns the eventual size or length/duration of each transmission chain.
 
+Here is an example to simulate the previous examples without intervention,
+returning the size of each of the $10$ chains. It assumes a poisson offspring distribution with
+mean of $0.9$.
 ```{r}
 set.seed(123)
 
-sim_tree_intvn_eg <- simulate_tree(
+simulate_summary_eg <- simulate_summary(
   nchains = 10,
   statistic = "size",
   offspring_dist = "pois",
-  intvn_mean_reduction = 0.5,
   stat_max = 10,
-  serials_dist = function(x) 3,
   lambda = 0.9
 )
 
-head(sim_tree_intvn_eg)
+# Print the results
+simulate_summary_eg
 ```
 
-### [simulate_summary()](https://epiverse-trace.github.io/epichains/reference/simulate_summary.html)
+### [`simulate_tree_from_pop()`](https://epiverse-trace.github.io/epichains/reference/simulate_tree_from_pop.html)
 
-`simulate_summary()` is basically `simulate_tree()` except that it does not retain
-information on each infector and infectee. It returns the eventual size or length/duration of each transmission chain.
+`simulate_tree_from_pop()` simulates outbreaks based on a specified population size and pre-existing immunity until the susceptible pool runs out.
+  
+Here is a quick example where we simulate an outbreak in a population of size $1000$. We assume individuals have a poisson offspring distribution with mean, $\text{lambda} = 1$, and serial interval of $3$:
+```{r}
+set.seed(7)
+
+sim_tree_from_pop_eg <- simulate_tree_from_pop(
+  pop = 1000,
+  offspring_dist = "pois",
+  lambda = 1,
+  serials_dist = function(x) {3}
+  )
+
+head(sim_tree_from_pop_eg)
+```
+
+#### Simulating interventions
+
+All the `simulate_*()` functions can model interventions that reduce the $R_0$,
+using the `intvn_mean_reduction` argument. In general, these can be
+interpreted as population-level interventions.
+
+To illustrate this, we will use the previous examples for each function and specify
+a population-level intervention that reduces $R_0$ by $50\%$.
+
+Using `simulate_tree()`, we can specify an initial number of cases
+and a population level intervention, `intvn_mean_reduction`, that reduces $R_0$ by $50\%$.
 
-Here is an example to simulate the previous examples without intervention,
-returning the size of each of the $10$ chains. It assumes a poisson offspring distribution with
-mean of $0.9$.
 ```{r}
 set.seed(123)
 
-simulate_summary_eg <- simulate_summary(
+sim_tree_intvn_eg <- simulate_tree(
   nchains = 10,
   statistic = "size",
   offspring_dist = "pois",
+  intvn_mean_reduction = 0.5,
   stat_max = 10,
+  serials_dist = function(x) 3,
   lambda = 0.9
 )
 
-# Print the results
-simulate_summary_eg
+head(sim_tree_intvn_eg)
 ```
 
-Here is an example with an intervention that reduces $R_0$ by $50\%$.
-
+Here is an example with `simulate_summary()`, modelling an intervention that reduces $R_0$ by $50\%$.
 ```{r}
 simulate_summary_intvn_eg <- simulate_summary(
   nchains = 10,
@@ -182,23 +209,19 @@ simulate_summary_intvn_eg <- simulate_summary(
 simulate_summary_intvn_eg
 ```
 
-
-### [simulate_tree_from_pop()](https://epiverse-trace.github.io/epichains/reference/simulate_tree_from_pop.html)
-
-`simulate_tree_from_pop()` simulates outbreaks based on a specified population size and pre-existing immunity until the susceptible pool runs out.
-  
-Here is a quick example where we simulate an outbreak in a population of size $1000$. We assume individuals have a poisson offspring distribution with mean, $\text{lambda} = 1$, and serial interval of $3$:
+Finally, let's use `simulate_tree_from_pop()`.
 ```{r}
 set.seed(7)
 
-sim_tree_from_pop_eg <- simulate_tree_from_pop(
+sim_tree_from_pop_intvn_eg <- simulate_tree_from_pop(
   pop = 1000,
   offspring_dist = "pois",
+  intvn_mean_reduction = 0.5,
   lambda = 1,
   serials_dist = function(x) {3}
   )
 
-head(sim_tree_from_pop_eg)
+head(sim_tree_from_pop_intvn_eg)
 ```
 
 ## Other functionalities

From 2cc5886a9d05631f42f34c3144db774578068919 Mon Sep 17 00:00:00 2001
From: GitHub Action <action@github.com>
Date: Sun, 8 Oct 2023 14:41:12 +0000
Subject: [PATCH 709/828] Automatic readme update

---
 README.md                                 | 454 ++++++++++++++++++++++
 man/figures/README-unnamed-chunk-14-1.png | Bin 0 -> 23036 bytes
 2 files changed, 454 insertions(+)
 create mode 100644 README.md
 create mode 100644 man/figures/README-unnamed-chunk-14-1.png

diff --git a/README.md b/README.md
new file mode 100644
index 00000000..3f622a03
--- /dev/null
+++ b/README.md
@@ -0,0 +1,454 @@
+
+<!-- README.md is generated from README.Rmd. Please edit that file. -->
+<!-- The code to render this README is stored in .github/workflows/render-readme.yaml -->
+<!-- Variables marked with double curly braces will be transformed beforehand: -->
+<!-- `packagename` is extracted from the DESCRIPTION file -->
+<!-- `gh_repo` is extracted via a special environment variable in GitHub Actions -->
+
+# *epichains*: Methods for simulating and analysing the size and length of transmission chains from branching process models <img src="man/figures/epichains_logo.png" align="right" height="130" />
+
+<!-- badges: start -->
+
+![GitHub R package
+version](https://img.shields.io/github/r-package/v/epiverse-trace/epichains)
+[![R-CMD-check](https://github.com/epiverse-trace/epichains/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/epiverse-trace/epichains/actions/workflows/R-CMD-check.yaml)
+[![codecov](https://codecov.io/github/epiverse-trace/epichains/branch/main/graphs/badge.svg)](https://codecov.io/github/epiverse-trace/epichains)
+![GitHub
+contributors](https://img.shields.io/github/contributors/epiverse-trace/epichains)
+[![License:
+MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
+[![Lifecycle:
+experimental](https://img.shields.io/badge/lifecycle-experimental-orange.svg)](https://lifecycle.r-lib.org/articles/stages.html#experimental)
+<!-- badges: end -->
+
+*epichains* is an R package to simulate, analyse, and visualize the size
+and length of branching processes with a given offspring distribution.
+These models are often used in infectious disease epidemiology, where
+the chains represent chains of transmission, and the offspring
+distribution represents the distribution of secondary infections caused
+by an infected individual.
+
+*epichains* re-implements
+[bpmodels](%22https://github.com/epiverse-trace/bpmodels/%22) by
+providing dedicated data structures that allow easy manipulation and
+interoperability with other existing packages for handling transmission
+chain and contact-tracing data.
+
+*epichains* is developed at the [Centre for the Mathematical Modelling
+of Infectious
+Diseases](https://www.lshtm.ac.uk/research/centres/centre-mathematical-modelling-infectious-diseases)
+at the London School of Hygiene and Tropical Medicine as part of the
+[Epiverse Initiative](https://data.org/initiatives/epiverse/).
+
+# Installation
+
+The latest development version of the *epichains* package can be
+installed via
+
+``` r
+# check whether {pak} is installed
+if (!require("pak")) install.packages("pak")
+pak::pak("epiverse-trace/epichains")
+```
+
+To load the package, use
+
+``` r
+library("epichains")
+```
+
+# Quick start
+
+## Chain likelihoods
+
+*epichains* provides four main functions:
+
+### [`likelihood()`](https://epiverse-trace.github.io/epichains/reference/likelihood.html)
+
+This function calculates the likelihood/loglikelihood of observing a
+vector of outbreak summaries obtained from transmission chains.
+Summaries here refer to transmission chain sizes or lengths/durations.
+
+`likelihood()` requires a vector of chain summaries (sizes or lengths),
+`chains`, the corresponding statistic to calculate, `statistic`, and the
+offspring distribution, `offspring_dist` its associated parameters. It
+also requires `nsim_obs`, which is the number of simulations to run if
+the likelihoods do not have a closed-form solution and must be
+simulated. This argument will be explained further in the [“Getting
+Started”](https://epiverse-trace.github.io/epichains/articles/epichains.html)
+vignette.
+
+Let’s look at the following example where we estimate the loglikelihood
+of observing `chain_sizes`.
+
+``` r
+set.seed(121)
+# example of observed chain sizes
+# randomly generate 20 chains of size between 1 to 10
+chain_sizes <- sample(1:10, 20, replace = TRUE)
+```
+
+``` r
+# estimate loglikelihood of the observed chain sizes
+likelihood_eg <- likelihood(
+  chains = chain_sizes,
+  statistic = "size",
+  offspring_dist = "pois",
+  nsim_obs = 100,
+  lambda = 0.5
+)
+# Print the estimate
+likelihood_eg
+#> [1] -67.82879
+```
+
+## Chain simulation
+
+There are three simulation functions, herein referred to colelctively as
+the `simulate_*()` functions. \`\`
+
+### [`simulate_tree()`](https://epiverse-trace.github.io/epichains/reference/simulate_tree.html)
+
+`simulate_tree()` simulates an outbreak from a given number of
+infections. It retains and returns information on infectors (ancestors),
+infectees, the generation of infection, and the time, if a serial
+distribution is specified.
+
+Let’s look at an example where we simulate the transmission trees of
+$10$ initial infections/chains. We assume a poisson offspring
+distribution with mean, $\text{lambda} = 0.9$, and a serial interval of
+$3$ days:
+
+``` r
+set.seed(123)
+
+sim_tree_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+head(sim_tree_eg)
+#> < tree head (from first known ancestor) >
+#>    chain_id sim_id ancestor generation time
+#> 11        2      2        1          2    3
+#> 13        3      2        1          2    3
+#> 14        4      2        1          2    3
+#> 16        5      2        1          2    3
+#> 19        7      2        1          2    3
+#> 20        8      2        1          2    3
+```
+
+### [`simulate_summary()`](https://epiverse-trace.github.io/epichains/reference/simulate_summary.html)
+
+`simulate_summary()` is basically `simulate_tree()` except that it does
+not retain information on each infector and infectee. It returns the
+eventual size or length/duration of each transmission chain.
+
+Here is an example to simulate the previous examples without
+intervention, returning the size of each of the $10$ chains. It assumes
+a poisson offspring distribution with mean of $0.9$.
+
+``` r
+set.seed(123)
+
+simulate_summary_eg <- simulate_summary(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  lambda = 0.9
+)
+
+# Print the results
+simulate_summary_eg
+#> `epichains` object 
+#> 
+#>  [1]   1 Inf   4   4 Inf   1   2 Inf   5   3
+#> 
+#>  Simulated chain sizes: 
+#> 
+#> Max: 5
+#> Min: 1
+```
+
+### [`simulate_tree_from_pop()`](https://epiverse-trace.github.io/epichains/reference/simulate_tree_from_pop.html)
+
+`simulate_tree_from_pop()` simulates outbreaks based on a specified
+population size and pre-existing immunity until the susceptible pool
+runs out.
+
+Here is a quick example where we simulate an outbreak in a population of
+size $1000$. We assume individuals have a poisson offspring distribution
+with mean, $\text{lambda} = 1$, and serial interval of $3$:
+
+``` r
+set.seed(7)
+
+sim_tree_from_pop_eg <- simulate_tree_from_pop(
+  pop = 1000,
+  offspring_dist = "pois",
+  lambda = 1,
+  serials_dist = function(x) {3}
+  )
+
+head(sim_tree_from_pop_eg)
+#> < tree head (from first known ancestor) >
+#>   sim_id ancestor generation time
+#> 2      2        1          2    3
+#> 3      3        1          2    3
+#> 4      4        1          2    3
+#> 5      5        1          2    3
+#> 6      6        2          3    6
+#> 7      7        6          4    9
+```
+
+#### Simulating interventions
+
+All the `simulate_*()` functions can model interventions that reduce the
+$R_0$, using the `intvn_mean_reduction` argument. In general, these can
+be interpreted as population-level interventions.
+
+To illustrate this, we will use the previous examples for each function
+and specify a population-level intervention that reduces $R_0$ by
+$50\%$.
+
+Using `simulate_tree()`, we can specify an initial number of cases and a
+population level intervention, `intvn_mean_reduction`, that reduces
+$R_0$ by $50\%$.
+
+``` r
+set.seed(123)
+
+sim_tree_intvn_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  intvn_mean_reduction = 0.5,
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+head(sim_tree_intvn_eg)
+#> < tree head (from first known ancestor) >
+#>    chain_id sim_id ancestor generation time
+#> 11        2      2        1          2    3
+#> 12        4      2        1          2    3
+#> 13        5      2        1          2    3
+#> 15        8      2        1          2    3
+#> 14        5      3        1          2    3
+#> 16        2      3        2          3    6
+```
+
+Here is an example with `simulate_summary()`, modelling an intervention
+that reduces $R_0$ by $50\%$.
+
+``` r
+simulate_summary_intvn_eg <- simulate_summary(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  intvn_mean_reduction = 0.5,
+  stat_max = 10,
+  lambda = 0.9
+)
+
+# Print the results
+simulate_summary_intvn_eg
+#> `epichains` object 
+#> 
+#>  [1] 5 3 3 3 5 2 2 1 1 1
+#> 
+#>  Simulated chain sizes: 
+#> 
+#> Max: 5
+#> Min: 1
+```
+
+Finally, let’s use `simulate_tree_from_pop()`.
+
+``` r
+set.seed(7)
+
+sim_tree_from_pop_intvn_eg <- simulate_tree_from_pop(
+  pop = 1000,
+  offspring_dist = "pois",
+  intvn_mean_reduction = 0.5,
+  lambda = 1,
+  serials_dist = function(x) {3}
+  )
+
+head(sim_tree_from_pop_intvn_eg)
+#> < tree head (from first known ancestor) >
+#>   sim_id ancestor generation time
+#> 2      2        1          2    3
+#> 3      3        1          2    3
+#> 4      4        1          2    3
+```
+
+## Other functionalities
+
+### Summarising
+
+You can run `summary()` on `<epichains>` objects to get useful
+summaries.
+
+``` r
+# Example with simulate_tree()
+set.seed(123)
+
+sim_tree_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+summary(sim_tree_eg)
+#> $chains_run
+#> [1] 10
+#> 
+#> $max_time
+#> [1] 12
+#> 
+#> $unique_ancestors
+#> [1] 9
+#> 
+#> $max_generation
+#> [1] 5
+
+# Example with simulate_summary()
+set.seed(123)
+
+simulate_summary_eg <- simulate_summary(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  lambda = 0.9
+)
+
+# Get summaries
+summary(simulate_summary_eg)
+#> $chains_run
+#> [1] 10
+#> 
+#> $max_chain_stat
+#> [1] 5
+#> 
+#> $min_chain_stat
+#> [1] 1
+```
+
+### Aggregating
+
+You can aggregate `<epichains>` objects returned by the `simulate_*()`
+functions into a time series, which is a `<data.frame>` with columns
+“cases” and either “generation” or “time”, depending on the value of
+`grouping_var`.
+
+To aggregate over “time”, you must have specified a serial interval
+distribution in the simulation step.
+
+``` r
+# Example with simulate_tree()
+set.seed(123)
+
+sim_tree_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+aggregate(sim_tree_eg, grouping_var = "time")
+#>   time cases
+#> 1    0    10
+#> 2    3    13
+#> 3    6    15
+#> 4    9    18
+#> 5   12     2
+```
+
+### Plotting
+
+Aggregated `<epichains>` objects can easily be plotted using base R or
+`ggplot2` with little to no data manipulation.
+
+Here is an end-to-end example from simulation through aggregation to
+plotting.
+
+``` r
+# Run simulation with simulate_tree()
+set.seed(123)
+
+sim_tree_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+# Aggregate cases over time
+sim_aggreg <- aggregate(sim_tree_eg, grouping_var = "time")
+
+# Plot cases over time
+plot(sim_aggreg, type = "b")
+```
+
+<img src="man/figures/README-unnamed-chunk-14-1.png" width="100%" />
+
+## Package vignettes
+
+Specific use cases of *epichains* can be found in the [online
+documentation as package
+vignettes](https://epiverse-trace.github.io/epichains/), under
+“Articles”.
+
+## Reporting bugs
+
+To report a bug please open an
+[issue](https://github.com/epiverse-trace/epichains/issues/new/choose).
+
+## Contribute
+
+We welcome contributions to enhance the package’s functionalities. If
+you wish to do so, please follow the [package contributing
+guide](https://github.com/epiverse-trace/epichains/blob/main/.github/CONTRIBUTING.md).
+
+## Code of conduct
+
+Please note that the *epichains* project is released with a [Contributor
+Code of
+Conduct](https://github.com/epiverse-trace/.github/blob/main/CODE_OF_CONDUCT.md).
+By contributing to this project, you agree to abide by its terms.
+
+## Citing this package
+
+``` r
+citation("epichains")
+#> To cite package epichains in publications use:
+#> 
+#>   Sebastian Funk, Flavio Finger, and James M. Azam (2023). epichains:
+#>   Analysing transmission chain statistics using branching process
+#>   models, website: https://github.com/epiverse-trace/epichains/
+#> 
+#> A BibTeX entry for LaTeX users is
+#> 
+#>   @Manual{,
+#>     title = {epichains: Analysing transmission chain statistics using branching process models},
+#>     author = {{Sebastian Funk} and {Flavio Finger} and {James M. Azam}},
+#>     year = {2023},
+#>     url = {https://github.com/epiverse-trace/epichains/},
+#>   }
+```
diff --git a/man/figures/README-unnamed-chunk-14-1.png b/man/figures/README-unnamed-chunk-14-1.png
new file mode 100644
index 0000000000000000000000000000000000000000..8d066582938536c053a0e338fd4866be2276b246
GIT binary patch
literal 23036
zcmeFZc{G*p+cv)KC_`n8PeNp#Lo&08%tM4ERA!1|o2M;JQW7#x$xMjMbCWSsWF9hP
z9y4cn&s%)H&-45J_5Sg$^}cI8f3U1=@B6;)>pF+yJkI0X{#qJ}Cx~f@Q7F_2WhHrS
z6bhYyLg5V)9)(xjpATI?q3};yU%I4adr9$<{T+J;oqM-T%@xh<%pI&vwH4)1DA5-$
zbd9X&bg87HOY*r{n*BObQi4zJ3Wt8UquutfcXHP}|874|$ej<eLG`l(vDR!tquWW=
zoYGXXWHo8SA;WuyHSA0_nR5ZWi668Q@%=b9TDKJ6w9^pHlnCXQg%sOXH1==)idpcU
z7<8N;Bnc|!F$^1IMf1>R^NxQX$*=wO`Lj5-#;f&&S1T`f;A^heY<OozKUVe5Eiir^
z>{=^Y-?N;_)>6cKI`F7FOSIwKh&gKa)R#OX9KP(II-;)I8$_Qa?!+*TP<$#t>HiZ)
zx|r7z`AYDqi2@Ht#`E3JeNT@~Ft1(5KjygD7bq#5_Kx#hqeA_~%qR`!?-hQ3d<qo?
zY(Mw;X}-)q??p9qq<Afy%OULN6S8>Sv3~5+96sy(FJyb>CDues%qXL>@<$HDOVXcB
zbv?@ICb8)mB5l>&sU4A)WJ7+gPBoGZU+3kw3j|DJh@HH(V~+nKWk3y&aPj84uQM~`
zs#|kCZG53`b+LB&=|=G{Y74a2Jg=p-cvq`fvc)r5p80^$Wpce#>CgJ<ap7`1CcH&d
zUs<Id8^;{S)}43SUfpumkBuZ82^OOvp?%KEuUD`nSLdT&z@9{AQi%0=InrOJ-WSq<
z7he@mJFXRSHe~LG^ygC%FX}Nh58Y-@oxe#bAH+y%?P+-PZ7MPCq}7|rWUDtDox`(3
zA2~O7QAJm6cI##)+s@0S>@hF4W_(<*(q%3`?(Jrk>C{SOagVt}K$%}T*l&aGT*%n@
z(@bv}gW2nK%$}RIiL0Cxt_?fqX!g6Lt-}qESvy`NpgdsGMfU5irCG%Ahn6oY@gij1
zE!{1)4X?ZGjK#g|^aIK+>&2J-aGb)jSkl_?V(WT4VjWucKAKE)<(f>0TtB{h%=?JX
zOrKtn@XUoKJz}p?d&AF)zoj(Hq?37d@7&6Nn`j`cw}n32i1I3NEhDr%YPdbNUM~}D
zTfZKBmd$5$a7_AVvtztN{QV(ZozXC#gNcL5{j-koiZ&f?3m0tXbKbru{$(A0r9WQo
zdN;2mhuI^kkDor8scKH1di;>+7Qef*NG9bA8l$qKwYm0YnO<2qnsmomyR7^lKV`07
z4m-;3nKHT{bVS2$^dYmd%;|HF?v}YfoLfzC_EweZ&GG%}xO7L7;I^lU=LKmhnUDAU
zP7eiCUYT9$aG1(#JUQ~*I`%_B)bz7Y^<$&AiE{+GsE?0VCveP^PL|QNrJO7Xw;D&~
zdI*Ysv{&BUq3Lr=Hg}9aD!R=sv3|W!jC45bNb4V~#)~-1(TW9BmUN2inzk$Z=i5VQ
z(N4clHTs&gEtzL$Rt;V4SO-zvo)yM>e|Frxejn)=LKP?M9ob`bCHZ3dqY(mxsQEQz
z3w3oA7yM0#!h2$k!iT@`;D;7|kXV8~d<C75aOCiBykX>@6efNBC=>>zEH8W26>n~U
zDCWiLZuJuX<KvHKJ=R!Xc`4_yC^O}K=ndxm6Fgt=q2HnG6>CG((;KV}yop!WSo2s$
z8nTnflz#~BZSKzdqzKGJMovgh7)ZCwe2Ed8aPw~b<6_b5;@x7eOmo_a@EDqq5r_Kw
z>mpH75$D-FSri^V8D{@iH6a;p`SRbtkr#4xai|VGJHbo;zJ(DrE_L{MLdKh!D7>_H
z{br93u1Ls;K{a^%`({kF9v)iDu%!O@zjwf)Lf8L&GY<C-jh|5H5OVrItHR@0`1j2y
zJVOFP%-wM|p~D5D;Ch2c{__Zh#ALYM26c=7yn>%le(d0BU>$c(F`^1yM>qZF6+(;~
z>AyQb*v*AO*>H<QGX48ZumKGJvjP7e;s02`|G2>ab1~o+S(3)hk<b2u<_%Q!ub365
zDTcWYBM2iQI#S3o?4|UU(z$cz9t4#NE2jiJ7tc~|FiG#oGSD&hTv9-5`3OrCDC|2y
z^%D98t#Gh<^hM?IhS&6ldhQsVo(s2b-3pfSa8B1r)!@<@7CgLowYzVgdE@o?v@e{3
zM>}q71^*{-DqY8+E?A#6{(DvA(z(}QWB<3_!>Hvev$w6|y}M~C=DL`0qr`!=wNaE5
zuZx5XS6WGwz8_d|X9!Nxi82b?bOaHf;;J)p888o9jTSI{d?r&al3eLM3eW0Ueequh
z_x(T=BkgrvC7h4<ZeQW+5O&QdhSqn6MJns_1LNc2CK`q#H(uaK@Ch+Stx8nnNMNou
zL?1bJGBl80qG9ppCsw0UY>GoKrTg~A{mrhbUDm4Y<$xkCwYy|bnK3BZV-M}g_k$^6
zVRdhT?Rd#VTtK*b%>#mD_tojdiZ^tGt_H9Pjqg+k_9HYMXAsk+qb2UEHV@3d`zud<
z|GgB$cv92-GR~#gGAj7le&8-vNVxUsv7q<%hU=Gs3Xh_!>jf${KeA3Q2*+WnRnfBq
z3&}r{`1UhGHks?sM>I+D{uIFyE_SB{Za*|@j_JXVLrzp7fLeKNw#VoSDX~AshF}s;
z{T*G`-#ifXusYkQEXieIFwd+$*wu3CT8ofnb6ps(p&A)20|Q@$`)cs!?}^4Eq>^L{
zo^V%N^a{@XokBV(lE*N0_*enAxtbVBw>*RE%Hg{0!qSxD7bb61dX{W2)t>z|bV8)R
zy1V6Jf@^aK<iEH(dR&i?qq~_}4aRCV-H|Tn_)9JQMoHdr+6zk83-54>_i&_N3cuM9
zA@FuHyu+>hF|G^0m|&rRI$3)EISUDIlsT)1@fmS_ixvsD>CA|HcR}P%HzT)RR?v@k
zhU{lNd$pAWWvk!7I>kTS;-E&>i4!8gPMn*Y>R@Qu-uxZ?KAMDv?*gTNjKOV{N4Vek
zlXy<X(OT#BP5T&emSK?wgV)LxPMwtJvqM!TgFbs+o6C*-$&M4vVMpVioTL{F387QM
zyduk0!!c+zawrq-TZ=8aYG*CFDON&1MJ@6+bzoQIw0I4NVj!JtX2x2^j|^Sai+251
zYXPL!FS9(y5W+E>V3IaqM$)mU&TzDSr9jaS`_d`4eCdhL{*>kmgB9&nK6?pA<1=2d
zs#1rncfU7{^}m!(mWu&9Ldys5Tid;=>hZ>K^TBdA?Hpq%>mON>R9sh;;INrVsCmeY
z7zdl*YQ{@1H%lzkfd|hvg_V=#s>A0m+6c-aj&Uy$7g)00H?+$JM)F$B@mC#~rF^(9
zt(Zd+6^9lhi$ZfafBXmgA!B2Yxq_fE%@jg>LvZizk4Z0)gB%vcGm=ww;V4}n^!Qde
zQ~FOiTy88xGf6j7FYub2_2O`iMfud102=xUMPD+dOjJlvJ6GN6Uy~)@*qn$Te<3n>
zUX^2tca1&XQ(Jmt_(+@4(-`MaUPHa<AMd#HEn6;{%EjtQH~3+$!%3+JFm#<m_N-X$
z=C>%}<;72=0Y{ylke&@AmD$$1c(<2@Ucwn`ws`u*R)>zJzsBWu2E&v1<t!M~^uN9$
zfSzcM4e2uRF|sQer=IG`Pn}YibYDqRi<dUaxSCGk7#DK3-}}#8u!y~}LAmSukO1W#
zvi}xNs_W4vK05^~lWmuto#JNe{hdDJ3?C?JFS0kOkCQ4335dJd^MMB26d^!V`89;F
z7)h<~coj?bW5jKIED@{GdE;8O>A~4j@6_zR&H1tg6~Ch=ZZ|uQ9*Lz&?(@kusTzFo
zCAEo=g`=7!3E#k#V&BM4ppTuT3$p&6QlDjzpLF>Jt%Vrt3C#D@b&b7eKNKuK+w!Lk
z22WtBNzj9vM5}VPTE`Hp_BA8e-Ttj9<gY{%P&HTB*R)y6{s+Q$l6hlT9I^CU0c_Rg
zi8B}LD!u+B#*c8Lv#Aq~b##7H{D~}gk8peCyQT#Z>)aBLcG6RWC?)m@(fdTKnTD*A
zlKniVlZ8eKcIgO<jlda-iWtWg!LB6vav`$0E!G@MhR~gu-z~E>$^89H_nS+{Oz`8|
zzGe0~P8TMWhH)N4+tDzhoIFUhy^)CPhc12Co}?U-B=?9oWg^FWXUk!>Tj2X2{SpUr
z(jTImT;ug2wOQsFeEau9b<FW9Z6a$VCAu?IX;kX6U}y1uCZZNhy<vW!EE&QY1v&Z8
zH&H3sT(G->Y&3i~l#I|f9^o!Qpr>yqqdEe+x|ms=tQrpCQcOElgFc_sV`=2eH?wF6
zdjV^n4%@6+rC4jz<zLbEyQ?fO$?m;@LV)DVi*tzQ8k4oMhCV)i`ZgO?#qVYb!P}n+
z#juk>$Io0;TOMzmc*J!y`Q)W1Cxzj0ZBVf0|7r#_#8$0ILS4}p@7lOae){-vEHkGv
z_@&HlHxJ`N@~VVir(}U`SJ=l#B#nO-s?KKdDAD*PkmMfMY4xw)Pie!oXMLU<$SK>u
zU!U%5Y_ezx<)jo%Cu^M(Q+z?oI@epE`la*61I#26XuQs7p^^Q~M|10EUyu!Mwvh4O
zE;iut9w>EMo_za~xW;li+K8rK#$zq3XGVH8{T?g&4Q0Zy3)r=>=iqFv+QZ;3uY2+>
z8$&sDSWLe@w>Zr@z$Ii@cqcB)o`dT(pOLtbw_eeNS)v}tb)4jPE)1#Fb%GLxeWNA3
zGf=7<=Q={fsh1UTXRy4d-(e57vuV!xf#%E*ORTt)V6(^C?0f0^+VYHbkSAo1vbiD4
zZA8=X8(*l16&FZ>q@CD^@3lVQB4|G-nXaD`7oN$mTslRaR4Nf{*HZ6`>w_a5_;EXw
z1_^68+|fwCW29k;3IPqUh2xQyapXpNQ}#Z8qQ|~Pcj2+FD3W7+cOmH2`%|)SU%6#7
zbs7q;mJ6(>kRtH`cY-R17nm;?Bjxw9^pK}3g@1xsRmUp-%u|MG5oi0zrB5`xR%P&*
z6^InQe_M|KHyt|g)J=+D)9A+!kks0YSVoFwyxrO@e%*}F$g3SG>Srw8OT2ZHFPg}@
z^7irZv(h*6EWS+#X<s>x9}Et62$?*&7a=T0-;V`XXp!0DK{6XyLc=Fg8MVc!1Q!K=
z3f83iqhBrB!~LEqTp?4^#xb<AxVa*jWih-OAm7-#qQT($5O!5p*W$>UY*j9SfPtQc
zBSsO%?7mY;NykqMik9Vup7iCwbH})y*+&`?bLY?cY~SZJ$V+m|lAVCV=9ThTW7)vg
z{CR8fEh<q$tpXxE9|y03{9#=?YzU!$>YK<A)d^#)YAx7%G;8Rf_n*7zhDF)-M*Y|C
z&k=M!ni3<s=ZBI?u(M)qcnX$ujbm|WZ&&~5NjeUNXH>5sGc?NVE|DT2?>N2q)#b5z
z(_0!W=Bc=B`FlS<612@}!Uamf<-Qv}T}C#Rnh-%278P}6^QEe^7pxc!^2}b%^%XTf
zIqS^<8G&-3_tmEX^!uU-tcyBF#aXb5oz{j9*d5qd*u%Q~7HM1R=UoNooHCnCYXj@k
zuDl&fiMCsQ7og8Q2mz|6@NV?%U`1c1ZjvzrIU{OS>5}#dutvt~jDmvN(tl=!+=hLR
zC21$C04S)cqcQPh%GA#Zd&1zZBxup}j~ADT6k9^{k(HS)O=Gw5Q!hK8PrIsD3pJjf
zDh`J|MUwM39*wX!i@k=gw*W&qplPk9XICL+MH$X&N^d3sgb4pqAL94Bdj{Ya#lZRD
zjipiTG%co=oq^M?3e`?h39#<h0HBN~_S#kX{p}P}BxI3<+3=d1J>Gk3%4uTp1d^O}
zeE}q?hVvPzwZ1p4J((Yv>OD7DAyJd6e@4pReY`1BXGKr4&ymgdvnmtKJtI`9<Ag^3
z`n?bRkX+2Vb8lIgjDR^c!0lOG=KI?-bh+D1s_76wMMJQX{c^WC)3~CvCsN6UySK`x
zswq-P(XMRnBEMdwcrpf4G|@4q(-JQe{~%NJ6y6{F8#K7D$&rB0JJNM(pS!dxgdRKo
z52K0qX0kzx-N%PS_cbRx5U1}%d#umD-re~F=7c@^M`mwRlhSpXHfXC><QSSuE9;to
zb?=9CA~|x|YJYS#DW)Y(iu<M1+BL~OyMfY3Bgv79YD#2@e)w2*je)gg0QVO!Jpq&(
zW;J!a$c`5fv7qU|s$Hwk?(egdX1Ncw6U{2UW%oGR${v*B(;P!<P~I~~Wt6$xsBrJR
z2Iy}mvGMt5sY1Jfh6fGzuWUd}qhn3*$nF9}cOgR3YwJT~K{h34&W)1bn_?0!^E`?#
z8D7&EyVk7W*jy<z&n@nuJ2P}cphBxFTc`{mLI+a9R%iH3<p7$8!mNqk?!*PBgaW>_
ztlIS<AAa#8T9)30_CtgBhp7-`spX4Qa2FD`k&v0S+V)m4dda=bwA+a@x%kO$%{K~_
z`??#_oOp#3YAR2<<T!^*QCE-Y+UBM=KA*(=X$<FAy>=~vPV`>2D4$VjM|n!oaGCQQ
zt%~9DWe9W>r&@82P^BzU`^k82<Qgb#!x~jrt{Y<*weSYURTlo4F3qJQ6jiQvn!!z<
zE7Q%mDzNuqopI|a!)SY=0!iD>J*StW9drv7>v5AN&+F;k&!~mF-!8Ov&@K-|p^1pB
z65Ie6)p{r|EF(4N1yrZ`xF`TY43<C5?o)#=P6sgzwxg*P3GPWwZ37+)6<z(knmqB7
z6HfIZo}0@GK8^Z__bH&a{#^2FedujDrM3<zK9xgC<1(XwuOfrnXCD2uPf}P*o|E`7
zjKh(r4z4CJ;G-YzIc}{?eS7DVxy(uru`v!exYeOBLODI<36cGqgUXistiddlco!PE
z2Ha=f4IX)OnTfnr;z=pCD?-4uN&Gz%Cw}BPGDX0<CkclT)M-uLU1T5cxm?d~@icJ0
zB3t6al=SR<(hA;Ei*K}TG8RJ+{@Mm?!*~qN>7=N=f-3Xb(Gzrp*H~NgEj2l>BtDEx
zi|%Ez(R=*}OQwa_r7qWQ@(bb@(WHsoj9>dKVz#f*rPNE1vc46DLvRfL`r=Hq@KYyl
z`HC-bT8rtMwmGRsV@ji6@p0_++U7*^K62WdmvmdcW+2}#hjAFIyQOF3y4tDVhN&h;
zXV2huDme9hur~cse+DYTDJdnjV7zVr`b9}JVUpf3WS3B=_9GcIVxJb<cJZZ~R8{sv
zicKtPv9H|xMaB9~qIgQ?B>n^5goPG}Esb@rn7>&Zt!aL_LawAz?SwZ;@6yjS^W&XR
z{_v{bO(-3dJ70Doc=G~@mF6$V2<olrQc<j`VHfZwAj~EiSGf1{Y383N>SogVV0ER<
zzwmpCI*C;QjYNYy!|o}>E!2~xqDv|399yK=H6@qu|D64^zN7k$obXw-ebtVIU92r>
z>Ml50Oh3Hi*L*;>Y!mT6vbWf;oWcL_J;m+!FTtDDsEfk6HPJj2G>98_ww+}duu$`m
z6+_nbfpBtn#>8t?x9RRGAPQxGKkEDw01_Uap0aKmzs^oHDc&otX12S%+5X;Ck^R(*
zybZPL0z50gcu7~@UY54y%LqWB^@!*soO#%WZ@j<#h>Gt<H-5~gC#2y=PtYp;YKnZ<
z@C-|KFBG@-V4CYVenOxP_0XxW1$b!-*PjtH<1kV9d^d_2GxgpH%BJ##V5d7ct=p6Q
zC-ojtvZ)0<ARtNM^y0f*&5MSE5&q4Nb@kVX8^thE0V;Z<#jc4@Mp%k5GfSH7e$UJ{
z+(9tp6rj3O9V=oVclF)%)l!0CKVwEgOtmZ;Q0Hq16_hyfH1%ygDzsKrf#|A2^(QpD
zM$A(p(bYot>>DLb$7Y9X{E~rVz?*aUb&Db6G3I&oB~mFCpS?dr-_BZ#^MxM4D+6MB
zj+$KtXdJcS_GHyO=egd9*eBQ<6F|7Ix(6&JhZ_e++m+Vml`9eA@e`RfMYO5?>YNi>
z4MZJ7I|E_U%d6o{$QZ<iXS5;Zp4kYMM#Q56;o%s_1tnjy4GkS23ofIj+*g8be|c69
ztKKx_b>nOPFRYO*0X9!-#XV+A7)p+ndu!93s)HU2QqI`-VbL9)EM(;5Oh#qS!Xb;`
zE)#E~badH)-WPv!isJ~H@G~J2$Y-6Y>9O-r&d-<s)({HQDI(*6Sk_b`iC6{EWK{1d
zZoN>;c&{I`H1UNU#k9WF@YqL+M$}-(gR7DKi0GR}^1Pv4$rt>XoXV{X$YY8hKW^xr
zVsQ^%Ex~2`^dSL}zB0~x%R1gx%S(b_<DFgVMK>lKj$G97*K^Oc9!se4o$6WWYkSgl
z*drxevd5YtDLG+hl`%BQUGY`EM~<lgdvK@W^_^`#C|TECP^*g7M}1{>H!eYKOGSBa
z9+1iG#UxvYX$obi{EOaDnfW4d_G-q}`a2?-`9I6BQ^BpAl235sU@~DL!Hj6awCWna
zW2SbQK1*L{w4s=BG&BlV%m4^IDR>^T`S5YPfw+91y<Pl-W~-AXp$||UG~d)%$q6f~
zOB^TUkEq8=CN;85{RnqA%z<6D@K1t(gI`591_3dhqIK1A_2ow>BqQgnbzT$FFa@FU
z(5+NA?{lFEw^{Ga^$PdonX9T*Z?1NksQ&dgB=HL6ujS;sJw%~3Xr2zn<s0N%Xz+85
z|Hc8*J4XA8i}7YSj_(@%z$bv-&!~BA&>BV`#{(tig8*do=~68NNDm<BEwZ<__-Nl(
zsGgylX>os=<w^A_Jam`GB0U^C#m3J1kU6AY0aG!b&2bvU%?t|70nuwGV0^Dx{wBl2
z@8?Nqj6DI=)8&^8ybI;O1EVwLdW@j8GyULl;#0zI)`Ui;yF-;y;A)1m1vZ_6reYH<
z@c`(rzQRtm_vq3g3yCHub(&%0FJF8>C+6_^&Ub6aHM0q3s3y(|U({7#Wby@ME$}y3
zOqwylX4)RPc&A$nA>3NcC7+<s3{*h4MPnBRjj*;F1B4Hb_}(NU!yQ!(!;;@iLuylb
zmov`>r<VcYo-gWjQg047@$Pv(gSBeo1bVBRF`-dFH%&`6&SPHe;-lP}z8nKG+*ZMO
zdQlF(^L+nl%6oR*xixC%A=5*6(tdmSgmS!eMUG?|51NpLPb>GPtaWF`Yqdy0g~un(
zgl+hjKfureYW|L^O(qo89UiL>5x`5$u0Ksqe#;^g=u$&!-s@(SE;c=!G-$$y#+WC=
z{caPn7O8n>?24II2omw*C;>I!2+?MeA%^N-0cbfIN-TruJq)dg-JJ&-6)*G><%wkS
zv@G$-l#+3S4{`uSBD@)ti2heoKAOX-FsP{F6FH1G18{44(|4DCPqaiEj*1L<I$}1V
zuotmdZjBPQZTh(2zHiDpsexvYeG;&uL;#qSlmL4oIXc<4D?7{k6&YDevNF!s#Cl}(
zHhw!>LQ$>iYcAIBshZ(PwS9cM>q~-D_?w3(=^N84$V*Y>%~ywjYwao91ja4z{^(`5
z-OY*dwLW|H4K>1-6Z=pE!H}6Zd5lUgD0JNyMEer99-)0oF~LZNk09<wsr7zlsOy7)
z2#%(MJ!dZc2fK2-qqt}KB<pK#ed+<hoI5>v_f3B7172-BdJ=J{9IOY+OOmI8{{iL_
z8gq>D=oZP}&&$_(Z3SvlsV~__&$z<YWs}I(N{Iu?^!l^Ly78Rk(>fs|8I>=@<ochV
zo+yYU4`ZrT!1lO>l}UW52q*t(2KGQ^@YHD`i%>ArhN2<wGYEYQ*l(7qRp!|-QieN)
z6CZ?3mX|BR_tyI&XvME$!Y^+@4NA!~fB;f4hcTr^;W+ufA5m5^@mzXDCyuRazEj^g
z3jW4me=}p^hA)(f^`kFk7;<os)I)%UQv%-`t@yeR#s|>g;jiPQJQ`qA=bEFoC3(Qf
zetu21X+~nBTKUK4w8ARQaZ<KnB5m&ksy2U}af4{}jel$B&pOnxuI09_XgFKKN0=(_
z9W&rN9RS;DnQ6u`P=T@Ce5=iBNqD39o~%Vv#H}A|YF4PKz*R{oDkcBS=3A7Nf$Lnm
zu-IQ<8*UI)f4U%n3V~Wmkmi#i&kDLNn~9R;$}<|7b=*=`kye$jU3hUuL<9EgSALYv
zz!*N%Y^G~k2*Z~w>vyz0Q=dPd`HDl=k~mUjnc>BXrdNH(J_>~3BvF!9$sf(vmD;#9
zrJu?GBy@o}M|dWFws8d)P_jYbVo^IG_T@{L+Z0A+fbD#ssb9eINCD;h8NVT%pC1V_
zzUsFp0HXO4=aP9PXvaLkpwXpAUTM&}kG>FT8d>t%T6v}S?t0{xF-p<cRDd;j`XrMd
zS4z)B5AHy*7n?aF_wy1r5HMCd^#hO4;x^`+%i|U&TDWvGBBc7AC|>dDv=`nre4t5F
z$f<A{<y(zE-Ic@d{$v8G<KwgFW;sT3AO<LXe_G>@5XXo+sldsyLi^`+jF@B8`&jTr
z5f9c{sqh7adCMV|w{MATy-aKTPG~jH7&yQgb>%iw^cfL5q;sMYDQMm_FHej022h$9
zAY@q|{Kd2&Ssb<>(~cHDvkM0k1V_N?buQ;z&_fiS6@_T|n2GO@&z=#mCw%S!6>#TY
znS7B+QVHUO@qhU`44l0=IQ*TKsV)(m!YUi8fH2ssoGIZ?&WwEGIfZ8t_v3QPm!EB~
zSM9xq1B>jNA53E{K#abftu<Md#-Pmkok5xCUC#RjCXnZMdY3ySE@Q&A+$UmPnzo!>
z2Askn>?t}g4Dv2i?Nv^$H?2H>GlB=FeG?zM@hba%(^^b9lQuU{vmFihV`m7k0_GLp
zvQ;@CGbm#-#pth|rT+Tz(af1}VtCM2f)%?fB@W|bk7$g~`xGCOx+J(53Y<usjRsdp
zOf`@@W{@M1aw;7n;iPk<`<W?VDTMl($UWKE>t<lHs`u~#PO}5g5#D9&ae@M0n+Kz>
z3Ei#g7YJnvh6i<^d%#XTd;A3nnnzEbl)V=ZJq-?X?;%d(at1Imk~=HkKQ8DfUhuai
zmL*NIje7PV<?_v)t<~jj)0fIq?aAgtm0o%Ge#*g80wz)J-Uca`Lv0bzT!oKyEA6wd
zGO}oi+vy6Clw$;KLMtjr^otQ1vZ07nEX1@hU?XqXbfhr>6@O7G!M!gQIVw!GI37Yf
zF;9`PpTtLlMB&8MXTRzAbD`RBw5_meOSDj%D84`1M4@uz=<)N0wPKZMD#**gvf|co
zzrIE5Q}Y|oer(qn(}IlnSS8Mrm%P|v{Mq)Op}jDP`Ql4a7w>Wr5L2BmJ2mxg4Zxpa
z@G<{=LW_l$tU}7N-ExTw?E@d(HLqc&A)4e}t=ijJu&8mR_edi7a#qUyYMk4z)123G
z69R1Ht|0cjPV&K~Q$-_Al2d#>?uai265)|6BXo*{lN*VUiB*38`u43@V{WHE_m-dM
z)>LYHsgrGe5R)HsgEAB1id96DYcUA(C}8?^<K1+J$ZIsur%)A9zI;v0UPhp?9H-Cj
zw#E7hDCfi+E&dS8YQX8D(8|Qv`F?{quQh}G5hXZ*5Xo>F@`Pid>o`*aq^<Q*a{1|`
zAQVFjB8H~ko8#e$RbzzG^nG`zKyc#8m7v<U3W*uvo8pPJ=0XAerpO04`tdEoxz&Ry
zfBO*u<<vt2vSJ(62Ke%?7ud+Q#2-hNjIYg~fHw(+H#@Xk9j7UOs+hp%w-O^`P3mJM
zT%rtTL<X=SNbiwNBZdRW8R0+q6LqgXZJtVaxW9M{%?%{JvZ+TsJPJ=jhGVP$6i`D|
z?75lO6fM#+n|Oc7YhC$`+#@sKYAgbpplv}QWZRDL4*;&gKBEN9pF!|uQW6WwqJbT+
zXnn?)`RUE2Cry<#r8l=B?wC!szOhJ61kNkUNzzD<{uxwZkoZ&+uBL0ZC&(Ug`~6MC
zRI2e3em1ldDhB-)9<U$`?n;}Y15gtxzzcp0yNkpmAlT&RtM<Hs#0{3HM{4>vWUYY&
z1988h8a}SDHa3OE3%lhY%Un|ORAy&I2Q2du3C+b(NhDU6{Cs?-{|0XZFdI<MJj4LW
zuHZ9KwYU4H9rz`d<}ubP;?^V6U16jS6<t{d%;IDfz+n7Z+k)-k=pF5WlJY?dV_yiA
z-;Bffz}R5rxJ8SgNNm1)rt9@=e~IHV6!rDIMg8Z1DMa#h)IN#K2Vv#Dkr&5#_#6&1
zO@K5?EnDJF%>@l(7_%|ecvfa+W=#d9m0bw5-UNQsG;Y+_hoEwI%XX^m?Gq|)JvHo-
zt5NN5Fo)OB<xzozwnK918)ETiNs?5<HK4%#xE3SrP0FIXibHFi;NZHNcC%gV-Y_nd
zTVDgLg;hMj4f2R`a*Hj)Dd4t$HOJbzi#gdl8&&Ca&3Ph$b5`-CUbM}AV>t<v0C79d
z?YBk#XJcZ%(W_?(S<c|*`-(KT))(v_5YG|N3t2pW_~=nRB*dmLLnsV><5B89r*4(O
zV4Y20ol_VCnz(3t&fZ6;s0U&SU#X_VfugGoE&lWB!sGW+mLCXB-Vy|#z&I`3HTrG~
z-Q>i9snXD9XjU^84LC;=NmuUedWhX0k;l4wt|s94-o6QWI$tuu@O}Qef|m)8VWArK
zx#reNedP<uG`ZLNzJMlo1?2ecj_d75m=D{I8hMT9V?j5t{Og;oJhtrfXzfc?fH&XP
z-OuQGT*^1AA2};<w)1MZRumMoYS9<J9WQD%IVZjr&Kl~u!j_PMoEI&D!*~NLP?Qlp
zw>RVWwk>IOMjdhpS^AuGp{=z!ota~{I^Ah;rDt*R-aZ%bd;+Kj|LoN-@{~E{^t<|`
z0|7fV$If0b^(uJc0g>9G?$IuqX=6ptQ0EL1V^8`bIj;FRB+Y@q*}6{z==wF>bX>o9
zeG-)RDdHh)>cLN+61o9#{%y|H==PIOJLsz2t;`;39rhT=qcuGL?0r3BA5rM+Gna-W
zIUI(AKqZue2|y6_&W$!VRyySLxY~f8Xvz}@Ja|4tbERak!lUsa5#{90)#Eg0`-<#E
z8rhI0L3@^gP@4?LeDUZ>ny#mbe;eFaFzmV0bivYITe&-VlxOOG39@>S7&pERd5OKh
z1pfUj@Q9SpZu!-$Yp;gC|N5H8RR&0=-q5~+)oHpzpwap@kAXJyUL+!Ae57kqb?cJ+
z5Er}`ivc7q?;%(4$?u3Sm@WOWZa_BmP)$unXc?JpPo>6RBxy1(zK5p}P6*BXp%0OK
zZ;~o{nJ+&N0*0g>Y-~Nk--0il_zMDWk>*~XV`*;NgR5g^M_Qp*7Ln|Xm1<oYt*vcx
z?dtm=9%fpN!;|Hrt-sC$@rBp8ygTbgf}!bj27!3G8awq=w{e(Oo?PUG16wLysQe=@
zk*$|~<8<9r%;+s&vs#=t&;ZK-wVobL5I?%oV;L_@cenS0K+&*M_z|N7WBc!pqKcRz
z<xUwgEK%~vmw;$xh0@2izzW3ZD5*R^9($PEw;MW&ES4wth*|oMWSAat+D<(hfBz-a
zL{U4#q-Uk+)>->1yN1^x@*_=(?y%Glq4p}Sz5-jzLXA#gL#Owy1Mxd;if36junwQD
zTtuvh9ZfHe4FXoI9x5y^ZPSd%AWR-!+R1;6tGp|SmPgnU{&>&u4gfV->&2NiWcxBJ
zP_{EpYOW0^=J>JZ*gpJv@ykv~!$0w=7Rk?q&rU?i`X*kw2&x5`5hAfrt^<gW4IxHV
zUUP}1B~cOVLxctNR>`|~T>x_nE^>KA!@?15%G_W`Ig@gEUD-xLLMf}x*>)|Pb<8ij
zd-I*g>d2AyU&(lml$W@lp^HJQofmYPl!Zam5sEbQCgOeU3dvV(DD4MIJMkm0o6zgO
zE$Z)lybnlFc<dxA&@c6ro{24mF0)ukS}3m)x*iVW_l_T_bWn4wMD$Ia#y1iplWoxO
z`2O3X^8Hj5Wb=qPP5S7l?rh1@&hTt&G0SE;q%ge86)Dv-NukpFJB#G&OsgM*lzW5>
zVPcVLK`PbQ{%Iir8*5Mtj~?~U2%<FdlDjc=-Pi1h*1H~P)Y({Q*uIPoB<h@Zq^EI*
zG}dRl1jW-+4Nn!GjSjRquxa>BrM4kw(5*mk)&mOswwBVFsUHquJW5!f@?$5{QS*k-
zR}*m_8c1xjc+dXuv*UhV4y5@<^j-oGgSwXk>8!)tu%5o03CZaCK)e^t>h||%l9(&T
z;c&f#xQ#8Z_jyff%ZalImy8DRSLS-%v2ie0nRFUBE!Xw4d?vdr32JGi7#2!m6fmUM
z%QdT~5Mp`28cIPiE(X?v!V@8cEYrk8UKciKXT>|y_>UH8+TeWW;AhBWb--)My9kLQ
z-DMS3Kc+eV{FzqNHNRG9R@FDnLV630#3j{41mbm`-t2w50SPQ%FvVqAz<Kss8xujP
zo+b7?!XmXLtbkC0dhUK;*`46P+HnNDfaQ{)x7@k_JqvLvvkN<JC^Gi^_^~PjSjpaf
z0$a~~B4R~7@7AZILe%dKi|j1E78sd9J{9Qmo$gkwIF%EUIKt2yy#b*DQDBTzhR&`d
zUFRAn2X>oETG5p8)_43){z;R=LgMd`<_qe2cv@>8CDH~1XLP-Skr{0W<K+fV&z`!v
ze49R9=ObcHNMkE9Jw<46wY6t~RB{p5@7#CSLa-IIJ>&3$a8M5&2|wlosJtlD?-ZW$
zR`F$&aSKq4QIXi*OKGZt$!i<XuvDl`W^u7f$S@2|8P^gq`o;C;lI8l$EC${B5bJt!
z|M|JmrBbOU6~3=#h;1_m&HmF0+(zvzWhHd;D5-?+_I3~&ReJWUJhrG0W@WkU?s47k
z%G^?i!GgFfJ8w;@-UyOgoH~a2(|w>wKSf^^9QM|=Lp-IF*Fx?IA7hRz$3fyXrfzMq
zpObgk$`#`;lI55VyMxM*jXkOiYo?~p{|Dq^>QNU2gbES;0TeYC4ILgQF50y?LdbO3
z+gXnuNV?sn;AgdyO48`Kzj=(qPI%zDOx|?6<VtJQ<@24r<!+AF-MMeBwACw-lanuR
zjJ}NSTWT^^-HA>{Bmmx}g!Z6}8i#a66<cAFeM5|Lai-zO;b9nI?5jx6Tx@DuYE3>X
zR?Z)Ehcu0X=UOfX_z`oqGpkeIYtyW%lY~aV5bkCMH&=HV?)q*A7@r^dflfIu6#F^3
zP&^dVb&+1$vlv=PU$e}kJ|>CMBfAV$Hu2=vfcVeIIWeOi&vY(qo>W3q#@gqpkH6oO
z8l5+K3Boy+QY-h_-|yTdy|-OKr1N4DKZ%_A^5<w-28pw=1{04j-5uR3ebXPhuP;R5
zW#YT9&04eln#MN?qx+!JSu%xBg`2KMv=&AhvErxlSAzItvCsz3XROe;oHtWE8Yt1!
zu24!B5_Sv9UuSdmzoIxm!YsZ(J}N=0*9=;zD-Vy>cPrPkd64V^-I5sacBVAGS0P*I
zr&B$v(Nkl4`l^+7)xs`CkBcU*KM4?`MZt3F;7D+|KLl8ZdC;R-vx%2gUF=}w^fmLq
zIK9CdVU21%Y+jlI`C<-YnZ!mifGw&g86+dFZpnJ64<!NL6EOAa1fp+4(iFHyAt4p-
z^^QBjQg?eVv_&q_4W3deJa%=Rr7JlM<VS41QpUhOwtbm$x-kg_FSn3I)A<>f9`#U=
zMz)>a0=StkVY_^wEcIn1nrtnFj3UhPULhz$K92|iCDR&GpvZexa<wbv2IQ$tY0KuA
zU$)m>@SiEi9Q*5{jKj8gv7`4Pv<BGefBR{Kgx&!MomvkdM|&WxFRW6R2NHBjHLiwv
zE~U|XiS8$F<QF&owTMAt1<TY|VCj^B7R@o7knBV}Drbr{=_xZLSfuv0f49F?Bz2S+
z?CDIx9^Q}I7rLl@pQ)i!=x_K;O-zR)XqJUVeV;vVRHAoD-z&V^#|FX<G9!}Eodz9_
z=Ba~^24z8G)CwWcq{1ot*%ABI;{@3OHQx<HN+=-Q!ON_leZ!jC&&A|5r@-G`CKNRq
z{`ur=ScS)W!`fV5tnt-`aAE`w?aW-N#l?x<|7i9GZwk?LK$GJF$=8%~CVYs>W&b%T
zP7#ujP#QVBniHdBbyy{V&;`|6Y?g6FFORrMk)4r_fzbnH%a1S5h&0a{W%u2NZq32z
z>Jd*$h5+-Av~1p~g?*r747!6th(lFCWh`&JI?sYUCZC?&{`j#EED8s~+7u0b-Ckg;
zKVRscWmF&z_bZ*BfKp$fXzaNMk3rsNH5D!>2K&R2PW#`ZP|C**K8_c*4cp$D$+6Z5
zz>i^uJ6%WATI(Kg{9}-^$c(Vwd%HUpzTMz)5vP}b{`}dTPR@GxoGpY_?D~yiypSq6
z<I*Wl5rN<a=x`)2Ks9ZG2x=1ld<f&zxy;};MxL)Q0nq9GJ|;xg{eUf@&ru^r^i`+J
z1GYOpAMxHD$&gK*_e`pYmvEjvUo^1#qulO$J#=FJo+G|a0_Z~=X5JMwl)L?g`iifn
z>Dh<76oee87ksKv-N^6V|92amiGjTXge24`H*F(HH&3nPd_1izQ*jEZ&i>{%@#E(K
zHdz?3@psfNI051LM*_FCkE|JoDSRg(B5B`v7WojhPoF_kWq%dT<W@tK6$bv6eT8?M
zIjLBcU&H@1x{LSas&M65YS4-uJ3-3=nWqr^EdRf&w0(z(uroi@^|G!ZoX$yOwj??%
z9=bJwGGxC*fSX)n4dOMt*v5$6-dNIORTgI=1AYfj+yJSnVFiGDpxEPwU48|jF-e*{
z`*=F6Kxwm{?%-gxx%Rsc1EmGIXG~SbV;rUN(j0pq1Rrb-1_z040_1>WfO_zZYpgXW
zKr5(it3`ca7lcX)0O7PN+?}A8W`~(@49*AkAhwes^ytAI&{VC)i1tMV3S^xAy(k^z
z|Lqje*8F=3ZI;%wE6k7=0}jZ1he_ZAFAW_HJ@gN4&hywwE$S4_WFroKX4HvfX}7-{
zkF;`o3-5A+TfX8oEKK60=PT3)nWwS6!=V<Qgh0@&juF_P6q*h>9f{y`sDew8k_X|1
z(FzUSwAa9$H9&jLFYFkRAvcWZykk%QE6v8?zU`CdfGF#LBD>QZIjaqCh`KJ8eR%y^
zLPTW*zQdlX8SktX$)Pj?e7t|(BJqJ87RxgN-P8`!GvFGOJ;-sTNpCpV4*to)n?Ult
z3i=BUP*Wypj&%k>ul8{{5y$@NA?M4aRq5%LZd6uSzWT#Zk42GI7RdG!=lGNlb^up_
zzSClz<6`5;jQWqJ9ETg-XOu+-UEg|c{CpC~eTHC56^B~kCt)~zv}5?)ao;yzD50je
zFDCwFk$_P_`yOL(#RQ&};e+mpN{w!eO)B&xVtdCP9V|(fMhRM1lh;}y;vAFW7gUM`
zgPQIyCO$Y$I-Wyz7b*5?l2`28fZgp|>o~Bu7Gh|+DS#rM6%<sfcjyOh2%xn@iWtlh
z(JGvauROul+8j4f+~Gi9u)3e*e|Q<x9~b7wo*k}$x-MEIM{;A>&%z~%(NbZV6ShR1
z>7UfYE%Cs|3bxPWRPh0b#A?3r($a=T-~~b5zp<n`2aSMrSOFs7-BuZ{bB7jvo&g8d
z!J%AZO{hp1{rVyV@V3EgmZ_vGaQ_#$8~c<K_*FA?@y=SGLaiSk)Sf*At5>E7F<u?r
zS5N)jfhuCj)eap*hgPLGbRURuryC(!j0f(Ldp!YU*$&HN$_9`sJC#F>xxgKCe7~|D
zWS?O^gp<+(I}y*6;y@&1h$rJK?{hf-{(C^)9OS^*gF$-ZuSX;gu6wTl14_e{0|HXN
z>l4W%cGRjIxAwmH7#0S83M%xxn>FZ7$PP!u<3k=GGZS?B+J`eD$B3bVG(<vg<OS>C
zBdQbe=g4tesqfM`4;CR$beuu*8dSmOK*L+jEDv_h^nl&)U<HIPFzMRKNkD@jI^D-9
zs4*^>jw3qov}%`qIRu1(hIpUdsj^auHy`m6G^_|34l`LHaYwe1;WH}Ea{<)6dBP@w
zf8tRvo`-*M`NiaFki}mM=9K5vMA4pe%{y4OZ!7VKJ3ZRaw&J$^kwtV@)-yr^FOBp}
zEZ3nmD3xNT#?n*cg27-v-O2}R{x)m|E#HFxJ=y*KEp!4EfQ}oDk=smQ!B3F3GRr`O
zlE~H*k@VmDbOI5Te|^B#MESvz2;+BI_2c1b9rZ;d^T>Pxa9j`1Je%<sq`<i+o=(R<
z*t8@)MgR*4!Cwz0hfXwpSF#qE)1#X)hvqa$`OgKbAVm1y951s_(IA$489(9V9ja%C
zNo0*Ul;esVBASR4b4;l?{pW%RS}XpdWamD63m!8=&>i>N{pC$>MHH=?(O*&*@>UmN
zN7|K3Dq%dF(Dk~f9yxGb3$0aDNH>1)IlM;r+;@j|d`cFSde9^G!(8_g4HDmYq>~4v
z$yDuhpPnUJt1qqoFloi`K_<50p9FzcjPX_d@Rz86h3arv)i%B->BZ{+U710nT8~}b
zeUBgJG!5|%;se=J)*wzDwyB{inTmw>YFXa@SU5d+qoP@BJKK}5m}l8yXOW9EmaZx=
znH(B-38FAe2A<ggCVx1urUf<Hj+T#<Wgo+W2uBbkS2HXo?o+I7cRye^9|EdXK{`F(
zHDxO1=_(J=2edxwB85`n6IYDARy#5LKg+2Ii}_FkC61{ejuuXNmT&Pb7|1$hP<%Gc
z69CN^4s5V0$b`&*IS`mwI!s4oO8z*z7BSPW$2aGShO`|{PaI_&-ZEU=`kDq^@O_C%
zC;j=GxT~3Z>TGHej4(eD=q4R>46;-SU$y#d6i7+@c5YLC`iUs4;i{61_r9x(C<EkF
zC(Eeec;a-ZsRT3`$lm_WP$W{?%e{Oe#A5`sLLj~_AJY)ZN!45^JvoM({-S(fjEgde
zhy>`D_8b!#A`(Zh?y(39r{povu#Hc5W(EtGUM>Nid>okYr^hmirhbprQ!Z?d#aVTv
zy_&2V3+%HiQ?~&bLccYl8{EGrA9dRGf{aL$WvKZZ-oBKc9*GjooXnp2_?_Uk{=4g}
zfFs#Z9T-O#f6Mdw<IMaA^d739B`}-sFKMsz^02hlp*=jr*lHb3sV~85)J0J<XswEO
z*)_kQ7oy<7#BWl0r}Q%^o#;&~7%H+c{%*JkS-Bu0VC*n>*rTC)Amn0f+9I+P?c2HP
zG{Ea>L{|tD)-My7BQcf4Y6mfhYg}5t&#RrJ)MN{aq62GdKSwq|;@)NClFWKKd&dW_
zYZb@#!=denpVRwm4_ZElx*h^tZRUhg8KmdkHqI#4&Ic043v*OShrx-Eg(rbB-YK#c
zNGKLhbW4g-wNufs&Q%GGLtnz@p_l}kAMsS|M<Ex(?~heTt$MeMX%80Sc`k@q@g!vV
zfS$>(?7Anf3;*{(B8*pY0#W2QfZnHEPuOtg(tjNl#zhl=1cwV@<Pe7YTPLAg4-w|l
zAU@UMgZ4wSMNmEBd`;Gg=@bWv^el+3HYXbX(qAZaAWa%fcwGgF%CDCAg^fJ%0vx>I
z?&*fZv>tfssd2>MdjhO$4UA<BVyH+D79PWX55j>e5=28ZrSBX6yrFdttwnQzLg5ga
zPY|_|L~UJULaA5K>mLA5+)Dxm?~PkMWiB}|7BWyvoSrHE-(Z3#9~m{ZMd5_!_-lPn
zzE{iB`KK%!z7-<^gZ;xh54h9v{p~Mfu&WN>9d9TSpk0e6cc1nkuriYAGoCvqhs-pb
zmhtw4T1A=&58cXg&lM4%>|aAkBLO5;f*xyk5am;d#Otm-6Ohgqp${(+bkrBPRMmr;
z#9HZ#NjL>=E3+b<_uw7)vY2o)cj%kLG$yyK(tw9DME4x^bn=7$ck^hyzpc>5D?afp
z+S0Lkz9|sZ5j%G|{V<f?Lidtns0u1vzAT<)*35kpFYR~lLyAKLMnV8;93E)T2SWu{
zlDo1-4Z_*~f?Y{Ta-}S9zEDNcMo?NFd<ccECq`z$kf}>xP}J9M$i#z;W8z2Z{6QGF
znT|xKd&3yen><gRa_0qnl~cg6dJqVGgNXmu6c;}~8$zK@86e*U@VCCO7K@1y!%GV>
ze3Ey##2I{qBp4`j35V*eBzxBRR0OJn&l=HO?QinXU-;V`%MC^Ul6tI;_7!w%sMUSG
z!!0JmVUP(-r<orh@fbrjuMKhHC+ylf+8pu|7YSh)LLM5IEaChng-jwY=CY{i?6t%X
zhbflA7bZ)Ufam!3&NZB>r|>ofuJ^L~<->oAAgb%)FGLth=br`#2ajR=h6afvRqUxM
z0Gk(5$CHtcW=B-VKHCjTr(>^Pi&=_(h`LZo)bei?7)JErZl7KtgA`Y^h<zcCdJ(%k
zYI==2Y5VYAR=%*-x&&Fj?l>VmZgSk$kHbv=KIm~@Xh1fAos2_}u2C9vvZlb5X4n7i
z7(zTPR%Zl+f>0HO6We=_nNd!C5)sRX{=!Xw^=L-e<*DzfMh4#8B<d)eWH#%6Pu-tz
ztR>zAD~I6PxVqE3ZsJWcy2&XAE5l)|2oPYFsu4?kmx(S&LMq{;75=ls@U=?@1Vad#
zfOz9=o^nSp>H-m+@&4@LzMJG4L&@Z>qkZVlAm5V12_z^FFH-#ba6G<9GYFJR+{lC_
z^oXuFOf=JHWzmK6NhJi@QV<`46=RwW=%yvXwu4TU#_d<6C5vX9B#cq|<mgMJc(35T
zYAqhdEHaQc0xg{c;ub-lT~D{kH&4_agB`sB$(p~0Hm!v7okbGz5jvQjx_=}{>R>S&
zsyCq%E$F$i7%{A48!u$3AqHaPrYK=mU<ZD>k--R&h`_1ChhZo31aiT`u({a$+0&;%
zAO~v%XfjecR<b~Zk?Kmi`Y)>CN%-?|=JVT!fsn5OG#OC1*Xy*9<J>dE(vKh9TGS{(
zIfN~-=;tUyyw_Ufo!$@1py6a#XKnFnVPR<z<~(({tL!A@!0#bb+laCu3nmGl?`^}B
zR}$1TY^YAdC1V+|$ia^thhY}x8Ss8VA)2J0W89v5>yjAKii5TRsEo|N;>0s@Wx0;y
zSq0rxrv5j&sWHM3*fP+tENgia@s`KaCZZ#V59|r56r@ns^UylXznJRZ0NCT~Z*v_&
zTS8H&2s;esWUUYKEDEm$%1`%7|8&N~gCFL>v#3eUlj1X}<bhsXD2!{T2tJzOmw>J`
zf||iKu{$K5;cR$LyfC<g&@Adx_m)i1(6|rjN&ZAS)3KvFGc3IYz+fEcukneY+ZNut
zyhmTC1jhT!sIN}Q0JSz>v2NT*u?Hj&k1F>yB63`cwlj<&A(5MiB>(XuFrlvy83gH@
ztJMCD6lC2X4xBtof}cP%oV1({@*ra*N<pz}3iI)K#pqJjc<-B5WDU1sQ8If!3@)dB
zZt?l>(8*+V!er0d(qC5*SgK?YI_t60gkswj6b4?&Do4ueuplj!$cuNQCN&7@eh@u;
z@}v=FZ&-jm`N2B<J5=0xmZS8=4IsNb`ED7lZf+Je3p%T8Z`c0F{UreR5&f$1B}ZoO
zIo^vMzTrb&5IC4tDAoKl)Q%qdDhmA`UNN;@{O9l}F^rRNHnQw&M-BrR%C{R{c`xjG
z7$$*V{{K)vaOFFA=+<sp$$v2jDAQ4RR)^Dx`)<Zqh$3mL{XzKCbb#RJ%G7u5a@QrY
zu#X-F3oxCC%-%Z7<>gwxgr;EW33dr4gC`9wRZd~=bw(y+3Lgu<gopep;LPQ{v&oM7
z|2<YQ!CCBW!TlJr|LZ9)RFsB(wf}igS&)L95EnT8Z-^!QOwEXT@vg@7KM$G!Hg_~O
z;;#;7-x6=bD_?*7pWDJ?ak9Wg4YNC1V9?OrB$<@dkYnsXZ*Ln8!LQVT6=W&{teqv2
zY+-=KjlvVAQ3Hsi&+a6h9W297XnRMWm;<V-@YHzk>%e6)L$wi%uqGPu()5Y#uFQRi
zbnYlA1J5JkQ^C2p=aHF9L|eJLF-mHGwH2>mvMVPZ!ub#$>wOpUl}o?in+igJtW=hl
ze<-=qdMVGQlgmY)>vM7!ZtCpb#*@v>7MR6HL?FO3(Tr%92zhw<6k4^(!L%(y4%*ZC
z3p3G%KbWdYoTS@G>_S;fc3Ra9Q8|v*BKx5_h{KBTB@Pp>fX;Z41-CH+FvI-Td-2jW
zE*HxIkoeR=_y2Mw#H&}(F*6@~O3r-lE&9su&H-V^%TrLjo{ogEr8<z&h`q(|@hL#_
zhdJN8k$Urop}j)3VG*C;G-+#Hnv6WN&!0IFnl7#n3VZPF4CaIJZ8Cc>QqXFs`U*IE
z(z`V`!u~+7?Od7DjK!?D(==7i%Kq9X{h?@4uz;-;l$4Yl-1Po48|;dfycJyy5rZCT
zpRGEmVQR-pEL#|UAE|=46EWdRR$L9UurH>dsBjukfc8Dp%~A~(ry;AGR}<y^>tIHJ
zprwQS>kTKUP3jO?w8M`q1A!JJ&^QTZh;-m531}|EL=&(vFUp~Ddpp~|^f$Rd@x7l)
zJNG6C)rMgd_*wbpuV`jODEixxK?F6P&wakO+j7eCJM&A)+tNdC-N#1>61=x(N2oo^
zHSYb1dpf-unpX8N30impkzxEM#z<2VndxkDlq9?XeKB<qcmwYQ>%AfyB2S<>^5nj7
zmu~-v{o|ALT&K9M20`n<FeYy=*f95|ul%aFO2-oqweh#^3T6<nU>c?to;yeokG=tv
zY9$c?i4Rah+iWnlE6}pUXzdH}?^5CeXJmLXl*d4DupZf5<u#&$=cSEIXFV6;iVdnA
zMoBZ-OQV)Fc#}@{cpg>A*gFeUlx&1AV)Fxy5V^)}nJ=R#f|XB-wcOU#r@gFNH{rVr
zl-^x0V9BTz+p?}q(Za}JcXt+i8lisEab-<qXSugBc_sbY`^3owUqplmvjA)mufw2*
zX2*)R5O5^;cwSx|u7~gNFo#Nx!s(0w_M_q~My5K+(<3TS)fj-jQ<b8r>0-|*L+0Gt
zt=|>)L((9L*L^^<v6RovBTE$bK>_12wxUk6{*<;#2}Mi8dSd7$otU=w#?NDJpf76=
zdRE2Xxo!jgjC_|!7${IT&U{MjwM}sQ)pXjrHNiLXArJ_xpkAp&M1y%A>vnBJN9Z_d
zp3>SeL)SSNbX)Pi;#0_kn`Ge&J|dStd%7z)P#)i$RoOhl47EFFW!+4?5%yT}2Jfu_
zg42R#O1X*}WxqlU*FKhNyGkS@`UKK9FpGKxR8XUH4M*QQmuBGBxY#e8x%jllU3?0@
zu0R+1U7R2u7AX3a$aW4yGKOQYUsPU04oMmqdwU2V16iJL;y`d6o>p&cVWlyxJ(f|7
zc=)OJyB;#0&g8i3dzD0@UqA*F#Sm0XBY)yD^;hmMy2_jDTU~Dh4Lrfo&0%bmQxnJi
zyc!vd6=zfPz`WHvg-Cqs5iL0Y4B?l;H%F98aBG57XQSQwq_&EITxPX-5Dt^!Vq2V*
zDHx1LsFQy*fU~IhO<G}_IuIE(Y0F*7AJGY-NM&QKw$#@axAytQA8&-8VC|EWB>o7{
zCG!0iBkIrFmg2!(HJwUta3so9m#(FHZ%F=b5(-L^#ngVEk`GKX`0UVkw<yvF={5RD
zPOZaOnKP~S`a-h!4++&Up2V}ipqpD1R`2j$0qs)`q-R?_-Tm600=t3hh<XJQFtMaS
zv)WZ$OSZgq`}Tu)=`xs=*mhR-LcTqR!l+5aW~U`WK9ElA_E(>&;`4zWL~1djZTRuM
zl!-+{qA-6T6)(;vuqIP+HwKy(zhg8Kekzq<K0Q96Koj=^C9~C9FL-4pD~iamtz}90
z9$sw8yC)jK{$2=-JMN-lcgJ+r)-|RW6>sF<mo$E=%F8+VyHiFx`fGtTWh*Ju2|3_L
zIUooK3kgGCbNA!(GPO&+4$HNj?h@C+-#@$lN#p48f5>v$OFe_;13$mIh&N#U3kp=r
zyc@m$Am3HPM%YQ@Ag(NHd3I)5?aakHWEJ!+uLSXSL^EMJaB8Y!C|x{8wrcvHUoclN
zq;^@w;91N&7cs0zIs`<MCpDH`PIR1G7SIu*D3E=&+4Hip^<}(~Cxv2>_tVN|m=I?0
zT$B~6xm5Jx`&k=$d&R}b(xYvp4ksp^mTQ3Hq2+%CX-25>VjQvNLlsQdpH5+!x6U#`
z3a!F4!Z*)ts2K%JE_`zt^3sRSIVTi$!v2%(w!Nw{&a>SetPQ$E#EU(z8C5glS0Ktr
zC8T?oUGw+}qworr)NCJkLJGuQK?yIajtMVlGg#ueZ*J5$c{Z_hJ1Q{!TZ~w~6H!RJ
z6BxWvZH)*gG_%z4s8j#Nw1i1Uj9$GVj0FFp2RwW<0x|lZ2_2N&0D5@fV+E*JEdQd~
z|9Q{<o%;jdi2@P}R)ilF%K=d{bP-o07Tor$Rz`}W`!o&CX@4p)fQDZMm_BvzU+*$x
z-m~<$62kK=;(*WJpH{=+9Pyz5XG179PAw>eZ!`SfN5I2A=>7*2vB*`gv9tm;YEWSF
z8u^AHL@Ezd>{|qyiDM_pLM}lXKuE4vFeIabuofU1-Df^Qeyo;bjf|>5g`NkmM%b>@
zz|F=Wd4@r;stg<sGOX_aEX7Ro&wWKGFW`4OYt9CLgnR>){UgZ9+RteC`H<NWkb;7Q
z^Qk9Hvj~_Ey*GUVqikRBQvr9DL)8(zAY9zB^p?MJBN6%Dmqb&=w=aPTR)G{e1`vL^
zfjb8wA<TL|SPK+wzHJ%hhH`C~8qcn#EIFBj9lXmJsZ)`e8i1xM!8Nyh(FA5~i3$`z
zxv_Cr+ouPf>5S8nva49tgOD9K3z}dAsz8>z@W0mh>etuTs=zMKvpK+Nb1&dNb@1E?
zXyeNBAE3@^K{qhOqocZtlo}e+S%BvQWjFb+4Or?m)n_ko?W0)me&BTQ%;TW-3!DzX
zsz&$g>+92j<44EmOxBOL1MPatxv`;<!Njc<xHJUlX5fs(7LljG-uiitBlAS(16Q2{
zuNPAFI&#Io+HEnvtaKG<stkAl&`<HdHDAK_9z745;NRi<VEz4~JZ;OTpjn_8EdlET
zKI^!w+(PnqgAQWp;qR@kJJuul*zPEBYZh=WlyyeSlOJk}8h|zecfdAzPEXqO>cSV`
zs>u8*d*A|MrHlW8LkEidwqGWI1}j&s@n8pf6zPyAdybUT(|mz5M9RQ6w5f3SniZS%
zc9y)jp!o9c?(M*R{SGBzHNb|)*~h?!Pt-)#Oi*_RaaiPo(neslbqd%%KeJ>ZaBo_4
zQ0&~d>o&gzZDRyBho=F@+&zJ9Q_zB*L%J2-9iA+}qa?*VfCp&an4^9ZGy*hF>_Yr%
zsg_rZK8DH7dIxL=x=c^Fv=Mlq8PIc}Lo4szDw-k<bTYUXR<M#``-9k91tJI$(7~C}
z(S1eGDKQ2nJ>WKw(#*qwIk1BC0O)MYyj;suy};lF_ttiV00#$_^15$?7p&WX+X{AV
zeX3y&9~=k*M&EQZnWOLy=pEn*m#?l(TYLxB1#IvJI&UO8rtv@XYdH%Oxt$Nb1J7M$
N@O1TaS?83{1OS?DEt3EM

literal 0
HcmV?d00001


From 2d93e764d147ecdb4310788a502962589478565e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 9 Oct 2023 12:14:00 +0100
Subject: [PATCH 710/828] Fix some sections.

---
 README.Rmd | 14 ++++++++------
 1 file changed, 8 insertions(+), 6 deletions(-)

diff --git a/README.Rmd b/README.Rmd
index 5e6d057b..dcc28847 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -42,8 +42,8 @@ transmission, and the offspring distribution represents the distribution of
 secondary infections caused by an infected individual. 
 
 _{{ packagename }}_ re-implements [bpmodels]("https://github.com/epiverse-trace/bpmodels/")
-by providing dedicated data structures that allow easy manipulation and interoperability with other existing
-packages for handling transmission chain and contact-tracing data.
+by providing bespoke functions and data structures that allow easy
+manipulation and interoperability with other Epiverse packages, for example, [superspreading]("https://github.com/epiverse-trace/superspreading/") and [epiparameter]("https://github.com/epiverse-trace/epiparameter/"), and potentially some existing packages for handling transmission chains, for example, [epicontacts](https://github.com/reconhub/epicontacts).
 
 _{{ packagename }}_ is developed at the [Centre for the Mathematical Modelling of Infectious Diseases](https://www.lshtm.ac.uk/research/centres/centre-mathematical-modelling-infectious-diseases) at the London School of Hygiene and Tropical Medicine as part of the [Epiverse Initiative](https://data.org/initiatives/epiverse/).
 
@@ -65,9 +65,12 @@ library("epichains")
 
 # Quick start
 
-## Chain likelihoods
+_{{ packagename }}_ provides functionalities for estimating the likelihood of observing a given transmission chain, `likelihood()`, and functions for simulating transmission chains: `simulate_tree()`, `simulate_tree_from_pop()`, and `simulate_summary()`. 
+
+The objects returned by these functions play nicely with `summary()` and `aggregate()`. Aggregated results also play nicely with `plot()`.
+Each functionality is briefly demonstrated below. 
 
-_{{ packagename }}_ provides four main functions: 
+## Chain likelihoods
 
 ### [`likelihood()`](https://epiverse-trace.github.io/epichains/reference/likelihood.html)
 
@@ -100,8 +103,7 @@ likelihood_eg
 
 ## Chain simulation
 
-There are three simulation functions, herein referred to colelctively as the `simulate_*()` functions.
-``
+There are three simulation functions, herein referred to collectively as the `simulate_*()` functions.
 
 ### [`simulate_tree()`](https://epiverse-trace.github.io/epichains/reference/simulate_tree.html) 
 

From dd1f734b907f2e7b77942150d41a257f3c5f1a89 Mon Sep 17 00:00:00 2001
From: GitHub Action <action@github.com>
Date: Mon, 9 Oct 2023 11:17:17 +0000
Subject: [PATCH 711/828] Automatic readme update

---
 README.md | 26 +++++++++++++++++++-------
 1 file changed, 19 insertions(+), 7 deletions(-)

diff --git a/README.md b/README.md
index 3f622a03..af38196e 100644
--- a/README.md
+++ b/README.md
@@ -30,9 +30,14 @@ by an infected individual.
 
 *epichains* re-implements
 [bpmodels](%22https://github.com/epiverse-trace/bpmodels/%22) by
-providing dedicated data structures that allow easy manipulation and
-interoperability with other existing packages for handling transmission
-chain and contact-tracing data.
+providing bespoke functions and data structures that allow easy
+manipulation and interoperability with other Epiverse packages, for
+example,
+[superspreading](%22https://github.com/epiverse-trace/superspreading/%22)
+and
+[epiparameter](%22https://github.com/epiverse-trace/epiparameter/%22),
+and potentially some existing packages for handling transmission chains,
+for example, [epicontacts](https://github.com/reconhub/epicontacts).
 
 *epichains* is developed at the [Centre for the Mathematical Modelling
 of Infectious
@@ -59,9 +64,16 @@ library("epichains")
 
 # Quick start
 
-## Chain likelihoods
+*epichains* provides functionalities for estimating the likelihood of
+observing a given transmission chain, `likelihood()`, and functions for
+simulating transmission chains: `simulate_tree()`,
+`simulate_tree_from_pop()`, and `simulate_summary()`.
+
+The objects returned by these functions play nicely with `summary()` and
+`aggregate()`. Aggregated results also play nicely with `plot()`. Each
+functionality is briefly demonstrated below.
 
-*epichains* provides four main functions:
+## Chain likelihoods
 
 ### [`likelihood()`](https://epiverse-trace.github.io/epichains/reference/likelihood.html)
 
@@ -104,8 +116,8 @@ likelihood_eg
 
 ## Chain simulation
 
-There are three simulation functions, herein referred to colelctively as
-the `simulate_*()` functions. \`\`
+There are three simulation functions, herein referred to collectively as
+the `simulate_*()` functions.
 
 ### [`simulate_tree()`](https://epiverse-trace.github.io/epichains/reference/simulate_tree.html)
 

From f5506fb1ad1051fd388dd4431d4830f3cf1baf08 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 9 Oct 2023 17:02:34 +0100
Subject: [PATCH 712/828] Simplify README

---
 README.Rmd | 268 +++++++----------------------------------------------
 1 file changed, 31 insertions(+), 237 deletions(-)

diff --git a/README.Rmd b/README.Rmd
index dcc28847..c78d6067 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -41,13 +41,13 @@ models are often used in infectious disease epidemiology, where the chains repre
 transmission, and the offspring distribution represents the distribution of 
 secondary infections caused by an infected individual. 
 
-_{{ packagename }}_ re-implements [bpmodels]("https://github.com/epiverse-trace/bpmodels/")
+_{{ packagename }}_ re-implements [epichains]("https://github.com/epiverse-trace/epichains/")
 by providing bespoke functions and data structures that allow easy
 manipulation and interoperability with other Epiverse packages, for example, [superspreading]("https://github.com/epiverse-trace/superspreading/") and [epiparameter]("https://github.com/epiverse-trace/epiparameter/"), and potentially some existing packages for handling transmission chains, for example, [epicontacts](https://github.com/reconhub/epicontacts).
 
 _{{ packagename }}_ is developed at the [Centre for the Mathematical Modelling of Infectious Diseases](https://www.lshtm.ac.uk/research/centres/centre-mathematical-modelling-infectious-diseases) at the London School of Hygiene and Tropical Medicine as part of the [Epiverse Initiative](https://data.org/initiatives/epiverse/).
 
-# Installation
+## Installation
 
 The latest development version of the _{{ packagename }}_ package can be installed via
 
@@ -63,251 +63,45 @@ To load the package, use
 library("epichains")
 ```
 
-# Quick start
+## Quick start
 
-_{{ packagename }}_ provides functionalities for estimating the likelihood of observing a given transmission chain, `likelihood()`, and functions for simulating transmission chains: `simulate_tree()`, `simulate_tree_from_pop()`, and `simulate_summary()`. 
+_{{ packagename }}_ provides four main functions:
 
-The objects returned by these functions play nicely with `summary()` and `aggregate()`. Aggregated results also play nicely with `plot()`.
-Each functionality is briefly demonstrated below. 
+* `simulate_tree()`: simulates transmission chains using an initial number of
+cases and information on the offspring distribution. This function returns
+an object with columns that track information on who infected whom, the
+generation of infection, and optionally, the time of infection.
 
-## Chain likelihoods
+* `simulate_summary()`: simulates a vector of transmission chain sizes or
+lengths using an initial number of cases and information on the offspring
+distribution. This function only returns a vector of realized chain size or
+length.
 
-### [`likelihood()`](https://epiverse-trace.github.io/epichains/reference/likelihood.html)
+* `simulate_tree_from_pop()`: simulates transmission chains given an initial
+population size and information on the offspring distribution. You can also
+specify a given level of pre-existing immunity. This function returns
+an object with columns that track information on who infected whom, the
+generation of infection, and the time of infection.
 
-This function calculates the likelihood/loglikelihood of observing a vector of outbreak summaries obtained from transmission chains. Summaries here refer to transmission chain sizes or lengths/durations.
+* `likelihood()`: calculates the loglikelihood (or likelihood, depending
+on the value of `log`) of observing a vector of transmission chain sizes or
+lengths.
 
-`likelihood()` requires a vector of chain summaries (sizes or lengths),
-`chains`, the corresponding statistic to calculate, `statistic`, and the offspring distribution,
-`offspring_dist` its associated parameters. It also requires `nsim_obs`, which is the number of simulations to run if the likelihoods do not have a closed-form solution and must be simulated. This argument will be explained further in the ["Getting Started"](https://epiverse-trace.github.io/epichains/articles/epichains.html) vignette.
+The objects returned by the `simulate_*()` functions can be summarised with
+`summary()` and aggregated into a `<data.frame>` of cases per time or generation
+with `aggregate()`. Aggregated results can also be passed on to `plot()` with
+its own arguments to customize the resulting plots. 
 
-Let's look at the following example where we estimate the loglikelihood of observing `chain_sizes`.
-```{r}
-set.seed(121)
-# example of observed chain sizes
-# randomly generate 20 chains of size between 1 to 10
-chain_sizes <- sample(1:10, 20, replace = TRUE)
-```
-
-```{r}
-# estimate loglikelihood of the observed chain sizes
-likelihood_eg <- likelihood(
-  chains = chain_sizes,
-  statistic = "size",
-  offspring_dist = "pois",
-  nsim_obs = 100,
-  lambda = 0.5
-)
-# Print the estimate
-likelihood_eg
-```
-
-## Chain simulation
-
-There are three simulation functions, herein referred to collectively as the `simulate_*()` functions.
-
-### [`simulate_tree()`](https://epiverse-trace.github.io/epichains/reference/simulate_tree.html) 
-
-`simulate_tree()` simulates an outbreak from a given number of infections.
-It retains and returns information on infectors (ancestors), infectees, the generation of infection, and the time, if a serial distribution is specified.
-
-Let's look at an example where we simulate the transmission trees of $10$ initial infections/chains. We 
-assume a poisson offspring distribution with mean, $\text{lambda} = 0.9$, and a serial interval of $3$ days:
-```{r}
-set.seed(123)
-
-sim_tree_eg <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 0.9
-)
-
-head(sim_tree_eg)
-```
-
-### [`simulate_summary()`](https://epiverse-trace.github.io/epichains/reference/simulate_summary.html)
-
-`simulate_summary()` is basically `simulate_tree()` except that it does not retain
-information on each infector and infectee. It returns the eventual size or length/duration of each transmission chain.
-
-Here is an example to simulate the previous examples without intervention,
-returning the size of each of the $10$ chains. It assumes a poisson offspring distribution with
-mean of $0.9$.
-```{r}
-set.seed(123)
-
-simulate_summary_eg <- simulate_summary(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  lambda = 0.9
-)
-
-# Print the results
-simulate_summary_eg
-```
-
-### [`simulate_tree_from_pop()`](https://epiverse-trace.github.io/epichains/reference/simulate_tree_from_pop.html)
-
-`simulate_tree_from_pop()` simulates outbreaks based on a specified population size and pre-existing immunity until the susceptible pool runs out.
-  
-Here is a quick example where we simulate an outbreak in a population of size $1000$. We assume individuals have a poisson offspring distribution with mean, $\text{lambda} = 1$, and serial interval of $3$:
-```{r}
-set.seed(7)
-
-sim_tree_from_pop_eg <- simulate_tree_from_pop(
-  pop = 1000,
-  offspring_dist = "pois",
-  lambda = 1,
-  serials_dist = function(x) {3}
-  )
-
-head(sim_tree_from_pop_eg)
-```
-
-#### Simulating interventions
-
-All the `simulate_*()` functions can model interventions that reduce the $R_0$,
-using the `intvn_mean_reduction` argument. In general, these can be
-interpreted as population-level interventions.
-
-To illustrate this, we will use the previous examples for each function and specify
-a population-level intervention that reduces $R_0$ by $50\%$.
-
-Using `simulate_tree()`, we can specify an initial number of cases
-and a population level intervention, `intvn_mean_reduction`, that reduces $R_0$ by $50\%$.
-
-```{r}
-set.seed(123)
-
-sim_tree_intvn_eg <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  intvn_mean_reduction = 0.5,
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 0.9
-)
-
-head(sim_tree_intvn_eg)
-```
+Each of the listed functionalities is demonstrated in detail
+in the ["Getting Started" vignette](https://epiverse-trace.github.io/epichains/articles/epichains.html).
 
-Here is an example with `simulate_summary()`, modelling an intervention that reduces $R_0$ by $50\%$.
-```{r}
-simulate_summary_intvn_eg <- simulate_summary(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  intvn_mean_reduction = 0.5,
-  stat_max = 10,
-  lambda = 0.9
-)
-
-# Print the results
-simulate_summary_intvn_eg
-```
-
-Finally, let's use `simulate_tree_from_pop()`.
-```{r}
-set.seed(7)
-
-sim_tree_from_pop_intvn_eg <- simulate_tree_from_pop(
-  pop = 1000,
-  offspring_dist = "pois",
-  intvn_mean_reduction = 0.5,
-  lambda = 1,
-  serials_dist = function(x) {3}
-  )
-
-head(sim_tree_from_pop_intvn_eg)
-```
-
-## Other functionalities
-
-### Summarising
-
-You can run `summary()` on `<epichains>` objects to get useful summaries.
-```{r include=TRUE,echo=TRUE}
-# Example with simulate_tree()
-set.seed(123)
-
-sim_tree_eg <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 0.9
-)
-
-summary(sim_tree_eg)
-
-# Example with simulate_summary()
-set.seed(123)
-
-simulate_summary_eg <- simulate_summary(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  lambda = 0.9
-)
-
-# Get summaries
-summary(simulate_summary_eg)
-```
-
-### Aggregating
-
-You can aggregate `<epichains>` objects returned by the `simulate_*()` functions into a time series, which is a `<data.frame>` with columns "cases"  and either "generation" or "time", depending on the value of `grouping_var`.
-
-To aggregate over "time", you must have specified a serial interval distribution in the simulation step.
-```{r include=TRUE,echo=TRUE}
-# Example with simulate_tree()
-set.seed(123)
-
-sim_tree_eg <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 0.9
-)
-
-aggregate(sim_tree_eg, grouping_var = "time")
-```
-
-### Plotting
-
-Aggregated `<epichains>` objects can easily be plotted using base R or `ggplot2` with little to no data manipulation.
-
-Here is an end-to-end example from simulation through aggregation to plotting.
-```{r}
-# Run simulation with simulate_tree()
-set.seed(123)
-
-sim_tree_eg <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 0.9
-)
-
-# Aggregate cases over time
-sim_aggreg <- aggregate(sim_tree_eg, grouping_var = "time")
+## Package vignettes
 
-# Plot cases over time
-plot(sim_aggreg, type = "b")
-```
+The theory behind the models provided here can be
+found in the [theory vignette](https://epiverse-trace.github.io/epichains/articles/theoretical_background.html).
 
-## Package vignettes
+We have also collated a bibliography of branching process applications in 
+epidemiology. These can be found in the [literature vignette](https://epiverse-trace.github.io/epichains/articles/branching_process_literature.html).
 
 Specific use cases of _{{ packagename }}_ can be found in 
 the [online documentation as package vignettes](https://epiverse-trace.github.io/epichains/), under "Articles".

From 25dfc8792fcbf39d04faf5cccf9ee81acea18b6c Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 9 Oct 2023 17:02:49 +0100
Subject: [PATCH 713/828] Add Getting Started content

---
 vignettes/epichains.Rmd | 363 +++++++++++++++++++++++++++++++++++++++-
 1 file changed, 361 insertions(+), 2 deletions(-)

diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 2acfa562..9b96b448 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -1,6 +1,6 @@
 ---
 title: "Getting started with epichains"
-author: "James Azam"
+author: "James M. Azam and Sebastian Funk"
 output:
   bookdown::html_vignette2:
     fig_caption: yes
@@ -27,4 +27,363 @@ knitr::opts_chunk$set(
 )
 ```
 
-WIP
+_epichains_ provides methods to analyse and simulate the size and length
+of branching processes with an arbitrary offspring distribution. These
+can be used, for example, to analyse the distribution of chain sizes
+or length of infectious disease outbreaks, as discussed in @farrington2003 and 
+@blumberg2013.
+
+```{r}
+library("epichains")
+```
+
+## Chain likelihoods
+
+### [`likelihood()`](https://epiverse-trace.github.io/epichains/reference/likelihood.html)
+
+This function calculates the likelihood/loglikelihood of observing a vector of outbreak summaries obtained from transmission chains. "Outbreak summaries" here refer to transmission chain sizes or lengths/durations.
+
+`likelihood()` requires a vector of chain summaries (sizes or lengths),
+`chains`, the corresponding statistic to calculate, `statistic`, the offspring
+distribution, `offspring_dist` and its associated parameters. `offspring_dist`
+is specified as the "base name" of the inbuilt distributions in R, for example,
+"pois", "nbinom", etc. `likelihood()` also requires `nsim_obs`, which is the
+number of simulations to run if the likelihoods do not have a closed-form
+solution and must be simulated. This argument will be explained further in
+the next section.
+
+By default, the result is a log-likelihood but if `log = FALSE`, then
+likelihoods are returned. 
+
+Let's look at the following example where we estimate the log-likelihood of
+observing `chain_sizes`.
+```{r}
+set.seed(121)
+# example of observed chain sizes
+# randomly generate 20 chains of size between 1 to 10
+chain_sizes <- sample(1:10, 20, replace = TRUE)
+chain_sizes
+```
+
+```{r}
+# estimate loglikelihood of the observed chain sizes
+likelihood_eg <- likelihood(
+  chains = chain_sizes,
+  statistic = "size",
+  offspring_dist = "pois",
+  nsim_obs = 100,
+  lambda = 0.5
+)
+# Print the estimate
+likelihood_eg
+```
+
+### Joint and individual log-likelihoods
+
+`likelihood()`, by default, returns the joint log-likelihood. If instead,
+the individual log-likelihoods are required, then the `individual` argument
+must be set to `TRUE`. To return likelihoods instead, set `log = TRUE`.
+```{r}
+set.seed(121)
+# example of observed chain sizes
+# randomly generate 20 chains of size between 1 to 10
+chain_sizes <- sample(1:10, 20, replace = TRUE)
+chain_sizes
+```
+
+```{r}
+# estimate loglikelihood of the observed chain sizes
+likelihood_ind_eg <- likelihood(
+  chains = chain_sizes,
+  statistic = "size",
+  offspring_dist = "pois",
+  nsim_obs = 100,
+  lambda = 0.5,
+  individual = TRUE
+)
+# Print the estimate
+likelihood_ind_eg
+```
+
+### How `likelihood()` works
+
+*epichains* ships with functions for the analytical solutions of some
+transmission chain "size" and "length" distributions. For the size
+distributions, we provide the `poisson`, `negative binomial`, and `gamma-borel`
+mixture. For the length distribution, we provide the `poisson` and `geometric`
+distributions. These can be used with `likelihood()` based on what is specified
+for `offspring_dist` and `statistic`.
+
+If an analytical solution does not exist, we provide `offspring_ll()`, which
+uses simulations to approximate the probability distributions
+([using a linear approximation to the cumulative 
+distribution](https://en.wikipedia.org/wiki/Empirical_distribution_function) 
+for unobserved sizes/lengths). In this case, an extra argument `nsim_offspring` 
+must be passed to `likelihood()` to specify the number of simulations to be 
+used for this approximation.
+
+For example, let's look at an example where `chain_sizes` is observed and we
+want to calculate the likelihood of this being drawn from a binomial
+distribution with probability `prob = 0.9`.
+```{r}
+set.seed(121)
+# example of observed chain sizes; randomly generate 20 chains of size 1 to 10
+chain_sizes <- sample(1:10, 20, replace = TRUE)
+# get their likelihood
+liks <- likelihood(
+  chains = chain_sizes,
+  offspring_dist = "binom",
+  statistic = "size",
+  size = 1,
+  prob = 0.9,
+  nsim_offspring = 250
+)
+liks
+```
+
+### Observation probability
+
+`likelihood()` uses the argument `obs_prob` to model the observation
+probability.
+
+By default, it assumes perfect observation, where `obs_prob = 1`
+(See `?likelihood`), meaning that all transmission events are observed and
+recorded in the data.
+
+If observations are imperfect, the `obs_prob` must be
+less than 1. In the case of imperfect observation, "true" chain sizes
+or lengths are simulated `nsim_obs` times, and the likelihood calculated for
+each of the simulations.
+
+For example, if the probability of observing each case is `obs_prob = 0.30`,
+we use
+
+```{r}
+set.seed(121)
+# example of observed chain sizes; randomly generate 20 chains of size 1 to 10
+chain_sizes <- sample(1:10, 20, replace = TRUE)
+# get their likelihood
+liks <- likelihood(
+  chains = chain_sizes,
+  statistic = "size",
+  offspring_dist = "pois",
+  obs_prob = 0.3,
+  lambda = 0.5,
+  nsim_obs = 10
+)
+liks
+```
+
+This returns `10` likelihood values (because `nsim_obs = 10`), which can be
+averaged to come up with an overall likelihood estimate.
+
+To find out about the usage of the `likelihood()` function, you can run
+`?likelihood` to access its `R` help file.
+
+## Chain simulation
+
+There are three simulation functions, herein referred to collectively as the `simulate_*()` functions.
+
+### [`simulate_tree()`](https://epiverse-trace.github.io/epichains/reference/simulate_tree.html) 
+
+`simulate_tree()` simulates an outbreak from a given number of infections.
+It retains and returns information on infectors (ancestors), infectees, the generation of infection, and the time, if a serial distribution is specified.
+
+Let's look at an example where we simulate the transmission trees of $10$ initial infections/chains. We 
+assume a poisson offspring distribution with mean, $\text{lambda} = 0.9$, and a serial interval of $3$ days:
+```{r}
+set.seed(123)
+
+sim_tree_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+head(sim_tree_eg)
+```
+
+### [`simulate_summary()`](https://epiverse-trace.github.io/epichains/reference/simulate_summary.html)
+
+`simulate_summary()` is basically `simulate_tree()` except that it does not retain
+information on each infector and infectee. It returns the eventual size or length/duration of each transmission chain.
+
+Here is an example to simulate the previous examples without intervention,
+returning the size of each of the $10$ chains. It assumes a poisson offspring distribution with
+mean of $0.9$.
+```{r}
+set.seed(123)
+
+simulate_summary_eg <- simulate_summary(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  lambda = 0.9
+)
+
+# Print the results
+simulate_summary_eg
+```
+
+### [`simulate_tree_from_pop()`](https://epiverse-trace.github.io/epichains/reference/simulate_tree_from_pop.html)
+
+`simulate_tree_from_pop()` simulates outbreaks based on a specified population size and pre-existing immunity until the susceptible pool runs out.
+  
+Here is a quick example where we simulate an outbreak in a population of size $1000$. We assume individuals have a poisson offspring distribution with mean, $\text{lambda} = 1$, and serial interval of $3$:
+```{r}
+set.seed(7)
+
+sim_tree_from_pop_eg <- simulate_tree_from_pop(
+  pop = 1000,
+  offspring_dist = "pois",
+  lambda = 1,
+  serials_dist = function(x) {3}
+  )
+
+head(sim_tree_from_pop_eg)
+```
+
+#### Simulating chains with interventions
+
+All the `simulate_*()` functions can model interventions that reduce the $R_0$,
+using the `intvn_mean_reduction` argument. In general, these can be
+interpreted as population-level interventions.
+
+To illustrate this, we will use the previous examples for each function and specify
+a population-level intervention that reduces $R_0$ by $50\%$.
+
+Using `simulate_tree()`, we can specify an initial number of cases
+and a population level intervention, `intvn_mean_reduction`, that reduces $R_0$ by $50\%$.
+
+```{r}
+set.seed(123)
+
+sim_tree_intvn_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  intvn_mean_reduction = 0.5,
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+head(sim_tree_intvn_eg)
+```
+
+Here is an example with `simulate_summary()`, modelling an intervention that reduces $R_0$ by $50\%$.
+```{r}
+simulate_summary_intvn_eg <- simulate_summary(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  intvn_mean_reduction = 0.5,
+  stat_max = 10,
+  lambda = 0.9
+)
+
+# Print the results
+simulate_summary_intvn_eg
+```
+
+Finally, let's use `simulate_tree_from_pop()`.
+```{r}
+set.seed(7)
+
+sim_tree_from_pop_intvn_eg <- simulate_tree_from_pop(
+  pop = 1000,
+  offspring_dist = "pois",
+  intvn_mean_reduction = 0.5,
+  lambda = 1,
+  serials_dist = function(x) {3}
+  )
+
+head(sim_tree_from_pop_intvn_eg)
+```
+
+## Other functionalities
+
+### Summarising
+
+You can run `summary()` on `<epichains>` objects to get useful summaries.
+```{r include=TRUE,echo=TRUE}
+# Example with simulate_tree()
+set.seed(123)
+
+sim_tree_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+summary(sim_tree_eg)
+
+# Example with simulate_summary()
+set.seed(123)
+
+simulate_summary_eg <- simulate_summary(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  lambda = 0.9
+)
+
+# Get summaries
+summary(simulate_summary_eg)
+```
+
+### Aggregating
+
+You can aggregate `<epichains>` objects returned by the `simulate_*()` functions into a time series, which is a `<data.frame>` with columns "cases"  and either "generation" or "time", depending on the value of `grouping_var`.
+
+To aggregate over "time", you must have specified a serial interval distribution in the simulation step.
+```{r include=TRUE,echo=TRUE}
+# Example with simulate_tree()
+set.seed(123)
+
+sim_tree_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+aggregate(sim_tree_eg, grouping_var = "time")
+```
+
+### Plotting
+
+Aggregated `<epichains>` objects can easily be plotted using base R or `ggplot2` with little to no data manipulation.
+
+Here is an end-to-end example from simulation through aggregation to plotting.
+```{r}
+# Run simulation with simulate_tree()
+set.seed(123)
+
+sim_tree_eg <- simulate_tree(
+  nchains = 10,
+  statistic = "size",
+  offspring_dist = "pois",
+  stat_max = 10,
+  serials_dist = function(x) 3,
+  lambda = 0.9
+)
+
+# Aggregate cases over time
+sim_aggreg <- aggregate(sim_tree_eg, grouping_var = "time")
+
+# Plot cases over time
+plot(sim_aggreg, type = "b")
+```
+
+## References

From 1979085f178b5a52c7d1bf446c947c907655b6cf Mon Sep 17 00:00:00 2001
From: GitHub Action <action@github.com>
Date: Mon, 9 Oct 2023 16:05:41 +0000
Subject: [PATCH 714/828] Automatic readme update

---
 README.md | 385 +++++-------------------------------------------------
 1 file changed, 34 insertions(+), 351 deletions(-)

diff --git a/README.md b/README.md
index af38196e..f72fd545 100644
--- a/README.md
+++ b/README.md
@@ -29,7 +29,7 @@ distribution represents the distribution of secondary infections caused
 by an infected individual.
 
 *epichains* re-implements
-[bpmodels](%22https://github.com/epiverse-trace/bpmodels/%22) by
+[epichains](%22https://github.com/epiverse-trace/epichains/%22) by
 providing bespoke functions and data structures that allow easy
 manipulation and interoperability with other Epiverse packages, for
 example,
@@ -45,7 +45,7 @@ Diseases](https://www.lshtm.ac.uk/research/centres/centre-mathematical-modelling
 at the London School of Hygiene and Tropical Medicine as part of the
 [Epiverse Initiative](https://data.org/initiatives/epiverse/).
 
-# Installation
+## Installation
 
 The latest development version of the *epichains* package can be
 installed via
@@ -62,365 +62,48 @@ To load the package, use
 library("epichains")
 ```
 
-# Quick start
+## Quick start
 
-*epichains* provides functionalities for estimating the likelihood of
-observing a given transmission chain, `likelihood()`, and functions for
-simulating transmission chains: `simulate_tree()`,
-`simulate_tree_from_pop()`, and `simulate_summary()`.
+*epichains* provides four main functions:
 
-The objects returned by these functions play nicely with `summary()` and
-`aggregate()`. Aggregated results also play nicely with `plot()`. Each
-functionality is briefly demonstrated below.
+- `simulate_tree()`: simulates transmission chains using an initial
+  number of cases and information on the offspring distribution. This
+  function returns an object with columns that track information on who
+  infected whom, the generation of infection, and optionally, the time
+  of infection.
 
-## Chain likelihoods
+- `simulate_summary()`: simulates a vector of transmission chain sizes
+  or lengths using an initial number of cases and information on the
+  offspring distribution. This function only returns a vector of
+  realized chain size or length.
 
-### [`likelihood()`](https://epiverse-trace.github.io/epichains/reference/likelihood.html)
+- `simulate_tree_from_pop()`: simulates transmission chains given an
+  initial population size and information on the offspring distribution.
+  You can also specify a given level of pre-existing immunity. This
+  function returns an object with columns that track information on who
+  infected whom, the generation of infection, and the time of infection.
 
-This function calculates the likelihood/loglikelihood of observing a
-vector of outbreak summaries obtained from transmission chains.
-Summaries here refer to transmission chain sizes or lengths/durations.
+- `likelihood()`: calculates the loglikelihood (or likelihood, depending
+  on the value of `log`) of observing a vector of transmission chain
+  sizes or lengths.
 
-`likelihood()` requires a vector of chain summaries (sizes or lengths),
-`chains`, the corresponding statistic to calculate, `statistic`, and the
-offspring distribution, `offspring_dist` its associated parameters. It
-also requires `nsim_obs`, which is the number of simulations to run if
-the likelihoods do not have a closed-form solution and must be
-simulated. This argument will be explained further in the [“Getting
-Started”](https://epiverse-trace.github.io/epichains/articles/epichains.html)
-vignette.
+The objects returned by the `simulate_*()` functions can be summarised
+with `summary()` and aggregated into a `<data.frame>` of cases per time
+or generation with `aggregate()`. Aggregated results can also be passed
+on to `plot()` with its own arguments to customize the resulting plots.
 
-Let’s look at the following example where we estimate the loglikelihood
-of observing `chain_sizes`.
+Each of the listed functionalities is demonstrated in detail in the
+[“Getting Started”
+vignette](https://epiverse-trace.github.io/epichains/articles/epichains.html).
 
-``` r
-set.seed(121)
-# example of observed chain sizes
-# randomly generate 20 chains of size between 1 to 10
-chain_sizes <- sample(1:10, 20, replace = TRUE)
-```
-
-``` r
-# estimate loglikelihood of the observed chain sizes
-likelihood_eg <- likelihood(
-  chains = chain_sizes,
-  statistic = "size",
-  offspring_dist = "pois",
-  nsim_obs = 100,
-  lambda = 0.5
-)
-# Print the estimate
-likelihood_eg
-#> [1] -67.82879
-```
-
-## Chain simulation
-
-There are three simulation functions, herein referred to collectively as
-the `simulate_*()` functions.
-
-### [`simulate_tree()`](https://epiverse-trace.github.io/epichains/reference/simulate_tree.html)
-
-`simulate_tree()` simulates an outbreak from a given number of
-infections. It retains and returns information on infectors (ancestors),
-infectees, the generation of infection, and the time, if a serial
-distribution is specified.
-
-Let’s look at an example where we simulate the transmission trees of
-$10$ initial infections/chains. We assume a poisson offspring
-distribution with mean, $\text{lambda} = 0.9$, and a serial interval of
-$3$ days:
-
-``` r
-set.seed(123)
-
-sim_tree_eg <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 0.9
-)
-
-head(sim_tree_eg)
-#> < tree head (from first known ancestor) >
-#>    chain_id sim_id ancestor generation time
-#> 11        2      2        1          2    3
-#> 13        3      2        1          2    3
-#> 14        4      2        1          2    3
-#> 16        5      2        1          2    3
-#> 19        7      2        1          2    3
-#> 20        8      2        1          2    3
-```
-
-### [`simulate_summary()`](https://epiverse-trace.github.io/epichains/reference/simulate_summary.html)
-
-`simulate_summary()` is basically `simulate_tree()` except that it does
-not retain information on each infector and infectee. It returns the
-eventual size or length/duration of each transmission chain.
-
-Here is an example to simulate the previous examples without
-intervention, returning the size of each of the $10$ chains. It assumes
-a poisson offspring distribution with mean of $0.9$.
-
-``` r
-set.seed(123)
-
-simulate_summary_eg <- simulate_summary(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  lambda = 0.9
-)
-
-# Print the results
-simulate_summary_eg
-#> `epichains` object 
-#> 
-#>  [1]   1 Inf   4   4 Inf   1   2 Inf   5   3
-#> 
-#>  Simulated chain sizes: 
-#> 
-#> Max: 5
-#> Min: 1
-```
-
-### [`simulate_tree_from_pop()`](https://epiverse-trace.github.io/epichains/reference/simulate_tree_from_pop.html)
-
-`simulate_tree_from_pop()` simulates outbreaks based on a specified
-population size and pre-existing immunity until the susceptible pool
-runs out.
-
-Here is a quick example where we simulate an outbreak in a population of
-size $1000$. We assume individuals have a poisson offspring distribution
-with mean, $\text{lambda} = 1$, and serial interval of $3$:
-
-``` r
-set.seed(7)
-
-sim_tree_from_pop_eg <- simulate_tree_from_pop(
-  pop = 1000,
-  offspring_dist = "pois",
-  lambda = 1,
-  serials_dist = function(x) {3}
-  )
-
-head(sim_tree_from_pop_eg)
-#> < tree head (from first known ancestor) >
-#>   sim_id ancestor generation time
-#> 2      2        1          2    3
-#> 3      3        1          2    3
-#> 4      4        1          2    3
-#> 5      5        1          2    3
-#> 6      6        2          3    6
-#> 7      7        6          4    9
-```
-
-#### Simulating interventions
-
-All the `simulate_*()` functions can model interventions that reduce the
-$R_0$, using the `intvn_mean_reduction` argument. In general, these can
-be interpreted as population-level interventions.
-
-To illustrate this, we will use the previous examples for each function
-and specify a population-level intervention that reduces $R_0$ by
-$50\%$.
-
-Using `simulate_tree()`, we can specify an initial number of cases and a
-population level intervention, `intvn_mean_reduction`, that reduces
-$R_0$ by $50\%$.
-
-``` r
-set.seed(123)
-
-sim_tree_intvn_eg <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  intvn_mean_reduction = 0.5,
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 0.9
-)
-
-head(sim_tree_intvn_eg)
-#> < tree head (from first known ancestor) >
-#>    chain_id sim_id ancestor generation time
-#> 11        2      2        1          2    3
-#> 12        4      2        1          2    3
-#> 13        5      2        1          2    3
-#> 15        8      2        1          2    3
-#> 14        5      3        1          2    3
-#> 16        2      3        2          3    6
-```
-
-Here is an example with `simulate_summary()`, modelling an intervention
-that reduces $R_0$ by $50\%$.
-
-``` r
-simulate_summary_intvn_eg <- simulate_summary(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  intvn_mean_reduction = 0.5,
-  stat_max = 10,
-  lambda = 0.9
-)
-
-# Print the results
-simulate_summary_intvn_eg
-#> `epichains` object 
-#> 
-#>  [1] 5 3 3 3 5 2 2 1 1 1
-#> 
-#>  Simulated chain sizes: 
-#> 
-#> Max: 5
-#> Min: 1
-```
-
-Finally, let’s use `simulate_tree_from_pop()`.
-
-``` r
-set.seed(7)
-
-sim_tree_from_pop_intvn_eg <- simulate_tree_from_pop(
-  pop = 1000,
-  offspring_dist = "pois",
-  intvn_mean_reduction = 0.5,
-  lambda = 1,
-  serials_dist = function(x) {3}
-  )
-
-head(sim_tree_from_pop_intvn_eg)
-#> < tree head (from first known ancestor) >
-#>   sim_id ancestor generation time
-#> 2      2        1          2    3
-#> 3      3        1          2    3
-#> 4      4        1          2    3
-```
-
-## Other functionalities
-
-### Summarising
-
-You can run `summary()` on `<epichains>` objects to get useful
-summaries.
-
-``` r
-# Example with simulate_tree()
-set.seed(123)
-
-sim_tree_eg <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 0.9
-)
-
-summary(sim_tree_eg)
-#> $chains_run
-#> [1] 10
-#> 
-#> $max_time
-#> [1] 12
-#> 
-#> $unique_ancestors
-#> [1] 9
-#> 
-#> $max_generation
-#> [1] 5
-
-# Example with simulate_summary()
-set.seed(123)
-
-simulate_summary_eg <- simulate_summary(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  lambda = 0.9
-)
-
-# Get summaries
-summary(simulate_summary_eg)
-#> $chains_run
-#> [1] 10
-#> 
-#> $max_chain_stat
-#> [1] 5
-#> 
-#> $min_chain_stat
-#> [1] 1
-```
-
-### Aggregating
-
-You can aggregate `<epichains>` objects returned by the `simulate_*()`
-functions into a time series, which is a `<data.frame>` with columns
-“cases” and either “generation” or “time”, depending on the value of
-`grouping_var`.
-
-To aggregate over “time”, you must have specified a serial interval
-distribution in the simulation step.
-
-``` r
-# Example with simulate_tree()
-set.seed(123)
-
-sim_tree_eg <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 0.9
-)
-
-aggregate(sim_tree_eg, grouping_var = "time")
-#>   time cases
-#> 1    0    10
-#> 2    3    13
-#> 3    6    15
-#> 4    9    18
-#> 5   12     2
-```
-
-### Plotting
-
-Aggregated `<epichains>` objects can easily be plotted using base R or
-`ggplot2` with little to no data manipulation.
-
-Here is an end-to-end example from simulation through aggregation to
-plotting.
-
-``` r
-# Run simulation with simulate_tree()
-set.seed(123)
-
-sim_tree_eg <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 0.9
-)
-
-# Aggregate cases over time
-sim_aggreg <- aggregate(sim_tree_eg, grouping_var = "time")
-
-# Plot cases over time
-plot(sim_aggreg, type = "b")
-```
+## Package vignettes
 
-<img src="man/figures/README-unnamed-chunk-14-1.png" width="100%" />
+The theory behind the models provided here can be found in the [theory
+vignette](https://epiverse-trace.github.io/epichains/articles/theoretical_background.html).
 
-## Package vignettes
+We have also collated a bibliography of branching process applications
+in epidemiology. These can be found in the [literature
+vignette](https://epiverse-trace.github.io/epichains/articles/branching_process_literature.html).
 
 Specific use cases of *epichains* can be found in the [online
 documentation as package

From 7ec6b97be832047cda1d4d94da180899b2224f61 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 9 Oct 2023 17:28:13 +0100
Subject: [PATCH 715/828] Linting: define inline function outside of main
 function call to fix braces issue

---
 vignettes/epichains.Rmd | 45 +++++++++++++++++++++++++++++++++--------
 1 file changed, 37 insertions(+), 8 deletions(-)

diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 9b96b448..4dee9cac 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -193,6 +193,10 @@ Let's look at an example where we simulate the transmission trees of $10$ initia
 assume a poisson offspring distribution with mean, $\text{lambda} = 0.9$, and a serial interval of $3$ days:
 ```{r}
 set.seed(123)
+# Define serial distribution
+serial_func <- function(x) {
+  return(3)
+}
 
 sim_tree_eg <- simulate_tree(
   nchains = 10,
@@ -236,13 +240,17 @@ simulate_summary_eg
 Here is a quick example where we simulate an outbreak in a population of size $1000$. We assume individuals have a poisson offspring distribution with mean, $\text{lambda} = 1$, and serial interval of $3$:
 ```{r}
 set.seed(7)
+# Define serial distribution
+serial_func <- function(x) {
+  return(3)
+}
 
 sim_tree_from_pop_eg <- simulate_tree_from_pop(
   pop = 1000,
   offspring_dist = "pois",
   lambda = 1,
-  serials_dist = function(x) {3}
-  )
+  serials_dist = serial_func
+)
 
 head(sim_tree_from_pop_eg)
 ```
@@ -261,6 +269,10 @@ and a population level intervention, `intvn_mean_reduction`, that reduces $R_0$
 
 ```{r}
 set.seed(123)
+# Define serial distribution
+serial_func <- function(x) {
+  return(3)
+}
 
 sim_tree_intvn_eg <- simulate_tree(
   nchains = 10,
@@ -268,7 +280,7 @@ sim_tree_intvn_eg <- simulate_tree(
   offspring_dist = "pois",
   intvn_mean_reduction = 0.5,
   stat_max = 10,
-  serials_dist = function(x) 3,
+  serials_dist = serial_func,
   lambda = 0.9
 )
 
@@ -293,14 +305,18 @@ simulate_summary_intvn_eg
 Finally, let's use `simulate_tree_from_pop()`.
 ```{r}
 set.seed(7)
+# Define serial distribution
+serial_func <- function(x) {
+  return(3)
+}
 
 sim_tree_from_pop_intvn_eg <- simulate_tree_from_pop(
   pop = 1000,
   offspring_dist = "pois",
   intvn_mean_reduction = 0.5,
   lambda = 1,
-  serials_dist = function(x) {3}
-  )
+  serials_dist = serial_func
+)
 
 head(sim_tree_from_pop_intvn_eg)
 ```
@@ -313,13 +329,17 @@ You can run `summary()` on `<epichains>` objects to get useful summaries.
 ```{r include=TRUE,echo=TRUE}
 # Example with simulate_tree()
 set.seed(123)
+# Define serial distribution
+serial_func <- function(x) {
+  return(3)
+}
 
 sim_tree_eg <- simulate_tree(
   nchains = 10,
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  serials_dist = function(x) 3,
+  serials_dist = serial_func,
   lambda = 0.9
 )
 
@@ -349,12 +369,17 @@ To aggregate over "time", you must have specified a serial interval distribution
 # Example with simulate_tree()
 set.seed(123)
 
+# Define serial distribution
+serial_func <- function(x) {
+  return(3)
+}
+
 sim_tree_eg <- simulate_tree(
   nchains = 10,
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  serials_dist = function(x) 3,
+  serials_dist = serial_func,
   lambda = 0.9
 )
 
@@ -369,13 +394,17 @@ Here is an end-to-end example from simulation through aggregation to plotting.
 ```{r}
 # Run simulation with simulate_tree()
 set.seed(123)
+# Define serial distribution
+serial_func <- function(x) {
+  return(3)
+}
 
 sim_tree_eg <- simulate_tree(
   nchains = 10,
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  serials_dist = function(x) 3,
+  serials_dist = serial_func,
   lambda = 0.9
 )
 

From 439a80cd037e4ed78e6df9d4487e5fb88aa6fbea Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 9 Oct 2023 18:04:25 +0100
Subject: [PATCH 716/828] Replace epichains with bpmodels where relevant

---
 README.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index c78d6067..439d9665 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -41,7 +41,7 @@ models are often used in infectious disease epidemiology, where the chains repre
 transmission, and the offspring distribution represents the distribution of 
 secondary infections caused by an infected individual. 
 
-_{{ packagename }}_ re-implements [epichains]("https://github.com/epiverse-trace/epichains/")
+_{{ packagename }}_ re-implements [bpmodels]("https://github.com/epiverse-trace/bpmodels/")
 by providing bespoke functions and data structures that allow easy
 manipulation and interoperability with other Epiverse packages, for example, [superspreading]("https://github.com/epiverse-trace/superspreading/") and [epiparameter]("https://github.com/epiverse-trace/epiparameter/"), and potentially some existing packages for handling transmission chains, for example, [epicontacts](https://github.com/reconhub/epicontacts).
 

From 130e37153ce3d63d7c1710d1adc4e46aeb74f2c1 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 9 Oct 2023 18:04:44 +0100
Subject: [PATCH 717/828] Correct Epiverse to Epiverse-TRACE

---
 README.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index 439d9665..3891c6ef 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -43,7 +43,7 @@ secondary infections caused by an infected individual.
 
 _{{ packagename }}_ re-implements [bpmodels]("https://github.com/epiverse-trace/bpmodels/")
 by providing bespoke functions and data structures that allow easy
-manipulation and interoperability with other Epiverse packages, for example, [superspreading]("https://github.com/epiverse-trace/superspreading/") and [epiparameter]("https://github.com/epiverse-trace/epiparameter/"), and potentially some existing packages for handling transmission chains, for example, [epicontacts](https://github.com/reconhub/epicontacts).
+manipulation and interoperability with other Epiverse-TRACE packages, for example, [superspreading]("https://github.com/epiverse-trace/superspreading/") and [epiparameter]("https://github.com/epiverse-trace/epiparameter/"), and potentially some existing packages for handling transmission chains, for example, [epicontacts](https://github.com/reconhub/epicontacts).
 
 _{{ packagename }}_ is developed at the [Centre for the Mathematical Modelling of Infectious Diseases](https://www.lshtm.ac.uk/research/centres/centre-mathematical-modelling-infectious-diseases) at the London School of Hygiene and Tropical Medicine as part of the [Epiverse Initiative](https://data.org/initiatives/epiverse/).
 

From bde83e999815259422ff1ac8c329e8cfaefe5c80 Mon Sep 17 00:00:00 2001
From: GitHub Action <action@github.com>
Date: Mon, 9 Oct 2023 17:10:28 +0000
Subject: [PATCH 718/828] Automatic readme update

---
 README.md | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/README.md b/README.md
index f72fd545..363bf833 100644
--- a/README.md
+++ b/README.md
@@ -29,10 +29,10 @@ distribution represents the distribution of secondary infections caused
 by an infected individual.
 
 *epichains* re-implements
-[epichains](%22https://github.com/epiverse-trace/epichains/%22) by
+[bpmodels](%22https://github.com/epiverse-trace/bpmodels/%22) by
 providing bespoke functions and data structures that allow easy
-manipulation and interoperability with other Epiverse packages, for
-example,
+manipulation and interoperability with other Epiverse-TRACE packages,
+for example,
 [superspreading](%22https://github.com/epiverse-trace/superspreading/%22)
 and
 [epiparameter](%22https://github.com/epiverse-trace/epiparameter/%22),

From 2f901ed9ffa9b0e51e42d9eca5c87040cc2481a9 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 11 Oct 2023 16:56:34 +0100
Subject: [PATCH 719/828] Improve wording

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 README.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index 3891c6ef..91a410ef 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -70,7 +70,7 @@ _{{ packagename }}_ provides four main functions:
 * `simulate_tree()`: simulates transmission chains using an initial number of
 cases and information on the offspring distribution. This function returns
 an object with columns that track information on who infected whom, the
-generation of infection, and optionally, the time of infection.
+generation of infection and, if a serial interval is given, the time of infection.
 
 * `simulate_summary()`: simulates a vector of transmission chain sizes or
 lengths using an initial number of cases and information on the offspring

From a65cf8a6c2d3343004fb60b292de6af08e6e9200 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 11 Oct 2023 16:56:49 +0100
Subject: [PATCH 720/828] Improve wording

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 README.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index 91a410ef..c98b8d6b 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -81,7 +81,7 @@ length.
 population size and information on the offspring distribution. You can also
 specify a given level of pre-existing immunity. This function returns
 an object with columns that track information on who infected whom, the
-generation of infection, and the time of infection.
+generation of infection and, if a serial interval is given, the time of infection.
 
 * `likelihood()`: calculates the loglikelihood (or likelihood, depending
 on the value of `log`) of observing a vector of transmission chain sizes or

From ced49aeb46e09fd2acb740ad0bc799c1a55dd533 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 11 Oct 2023 17:00:36 +0100
Subject: [PATCH 721/828] Remove backticks.

---
 vignettes/epichains.Rmd | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 4dee9cac..cefb1ceb 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -109,8 +109,8 @@ likelihood_ind_eg
 
 *epichains* ships with functions for the analytical solutions of some
 transmission chain "size" and "length" distributions. For the size
-distributions, we provide the `poisson`, `negative binomial`, and `gamma-borel`
-mixture. For the length distribution, we provide the `poisson` and `geometric`
+distributions, we provide the poisson, negative binomial, and gamma-borel
+mixture. For the length distribution, we provide the poisson and geometric
 distributions. These can be used with `likelihood()` based on what is specified
 for `offspring_dist` and `statistic`.
 

From 0691e0b6e488f406d1e7457de162af90e9432805 Mon Sep 17 00:00:00 2001
From: GitHub Action <action@github.com>
Date: Wed, 11 Oct 2023 16:00:54 +0000
Subject: [PATCH 722/828] Automatic readme update

---
 README.md | 7 ++++---
 1 file changed, 4 insertions(+), 3 deletions(-)

diff --git a/README.md b/README.md
index 363bf833..bb9a3d71 100644
--- a/README.md
+++ b/README.md
@@ -69,8 +69,8 @@ library("epichains")
 - `simulate_tree()`: simulates transmission chains using an initial
   number of cases and information on the offspring distribution. This
   function returns an object with columns that track information on who
-  infected whom, the generation of infection, and optionally, the time
-  of infection.
+  infected whom, the generation of infection and, if a serial interval
+  is given, the time of infection.
 
 - `simulate_summary()`: simulates a vector of transmission chain sizes
   or lengths using an initial number of cases and information on the
@@ -81,7 +81,8 @@ library("epichains")
   initial population size and information on the offspring distribution.
   You can also specify a given level of pre-existing immunity. This
   function returns an object with columns that track information on who
-  infected whom, the generation of infection, and the time of infection.
+  infected whom, the generation of infection and, if a serial interval
+  is given, the time of infection.
 
 - `likelihood()`: calculates the loglikelihood (or likelihood, depending
   on the value of `log`) of observing a vector of transmission chain

From 848b56029a3f21fa87afe3e8a42128b37e23e3b4 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 11 Oct 2023 17:01:31 +0100
Subject: [PATCH 723/828] Correct wording

---
 vignettes/epichains.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index cefb1ceb..3f8c0b90 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -82,7 +82,7 @@ likelihood_eg
 
 `likelihood()`, by default, returns the joint log-likelihood. If instead,
 the individual log-likelihoods are required, then the `individual` argument
-must be set to `TRUE`. To return likelihoods instead, set `log = TRUE`.
+must be set to `TRUE`. To return likelihoods instead, set `log = FALSE`.
 ```{r}
 set.seed(121)
 # example of observed chain sizes

From 7e35de600706204f252249ea808821078d39f18f Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 8 Nov 2023 14:29:52 +0000
Subject: [PATCH 724/828] remove `intvn_reduce_mean` argument

---
 R/intervention.R               |  51 ----------
 R/simulate.r                   |  90 ------------------
 man/intvn_reduce_mean.Rd       |  55 -----------
 man/simulate_summary.Rd        |  19 ----
 man/simulate_tree.Rd           |  20 ----
 man/simulate_tree_from_pop.Rd  |  18 ----
 tests/testthat/test-simulate.R | 169 ---------------------------------
 vignettes/epichains.Rmd        |  66 -------------
 8 files changed, 488 deletions(-)
 delete mode 100644 R/intervention.R
 delete mode 100644 man/intvn_reduce_mean.Rd

diff --git a/R/intervention.R b/R/intervention.R
deleted file mode 100644
index 3ab7339c..00000000
--- a/R/intervention.R
+++ /dev/null
@@ -1,51 +0,0 @@
-#' Reduce the mean of the offspring distribution
-#'
-#' @description
-#' `intvn_reduce_mean()` is a helper for the \code{simulate_*()} functions. It
-#' reduces/scales the mean of the offspring distribution in order to
-#' mimic the impact of a population-level intervention. Currently, it can only
-#' handle the poisson and negative binomial distributions and errors when other
-#' offspring distributions are specified alongside the `intvn_mean_reduction`
-#' argument.
-#'
-#' @inheritParams simulate_tree
-#' @param intvn_mean_reduction A number between 0
-#' and 1 for scaling/reducing the mean of `offspring_dist`. Serves as
-#' population-level intervention. `intvn_mean_reduction` = 0
-#' implies no intervention impact and `intvn_mean_reduction` = 1 implies full
-#' impact.
-#' @param pars_list Parameter(s) for poisson or negative binomial offspring
-#' distribution.
-#' @return List of the offspring distribution parameter(s) with the mean
-#' scaled by \code{1 - intvn_mean_reduction}.
-#' @details
-#' `intvn_reduce_mean()` scales the mean of the offspring distribution
-#' by \eqn{1 - {\sf intvn\_mean\_reduction}} so that the new mean is given as:
-#' \deqn{(1 - {\sf intvn\_mean\_reduction}) \times {\sf mean,}} This
-#' scaling when applied to the poisson and negative binomial offspring
-#' distributions corresponds to the population-level reduction of R0 as
-#' described in Lloyd-Smith et al, (2005). `intvn_reduce_mean()` is therefore
-#' only implemented for the aforementioned distributions and errors when other
-#' offspring distributions are specified along with the `intvn_mean_reduction`
-#' argument in the \code{simulate_*()} functions.
-#'
-#' @author James M. Azam
-#' @keywords internal
-#' @references Lloyd-Smith, J., Schreiber, S., Kopp, P. et al. Superspreading
-#' and the effect of individual variation on disease emergence. Nature 438,
-#' 355–359 (2005). \doi{10.1038/nature04153}
-intvn_reduce_mean <- function(intvn_mean_reduction, offspring_dist, pars_list) {
-  # Intervention only works for pois and nbinom
-  if (!offspring_dist %in% c("pois", "nbinom")) {
-    stop(
-      "`offspring_dist` must be one of c(\"pois\", \"nbinom\"), ",
-      "if intvn_mean_reduction is specified."
-    )
-  }
-  if (offspring_dist == "pois") {
-    pars_list$lambda <- (1 - intvn_mean_reduction) * pars_list$lambda
-  } else {
-    pars_list$mu <- (1 - intvn_mean_reduction) * pars_list$mu
-  }
-  return(pars_list)
-}
diff --git a/R/simulate.r b/R/simulate.r
index 11f34afe..c5b81d1b 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,6 +1,5 @@
 #' Simulate transmission trees from an initial number of infections
 #'
-#' @inheritParams intvn_reduce_mean
 #' @param nchains Number of chains to simulate.
 #' @param offspring_dist Offspring distribution: a character string
 #' corresponding to the R distribution function (e.g., "pois" for Poisson,
@@ -98,19 +97,6 @@
 #'   serials_dist = function(x) 3,
 #'   lambda = 2
 #' )
-#'
-#' # Run model with intervention a 50% reduction in R0.
-#' chains_with_intvn <- simulate_tree(
-#'   nchains = 10,
-#'   statistic = "size",
-#'   offspring_dist = "pois",
-#'   intvn_mean_reduction = 0.5,
-#'   stat_max = 10,
-#'   serials_dist = function(x) 3,
-#'   lambda = 2
-#' )
-#'
-#' chains_with_intvn
 #' @references
 #' Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
 #' between serial interval, infectiousness profile and generation time.
@@ -127,7 +113,6 @@
 #' 1186–1204. \doi{https://doi.org/10.3390/ijerph7031204}
 simulate_tree <- function(nchains, statistic = c("size", "length"),
                           offspring_dist, stat_max = Inf,
-                          intvn_mean_reduction = 0,
                           serials_dist, t0 = 0,
                           tf = Inf, ...) {
   statistic <- match.arg(statistic)
@@ -137,13 +122,6 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
   # check that offspring is properly specified
   check_offspring_valid(offspring_dist)
 
-  # Check that the intvn_mean_reduction is well specified
-  checkmate::assert_number(
-    intvn_mean_reduction,
-    lower = 0,
-    upper = 1
-  )
-
   # check that offspring function exists in base R
   roffspring_name <- paste0("r", offspring_dist)
   check_offspring_func_valid(roffspring_name)
@@ -151,15 +129,6 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
   # Gather offspring distribution parameters
   pars <- list(...)
 
-  # Prepare interventions if specified
-  if (intvn_mean_reduction > 0) {
-    pars <- intvn_reduce_mean(
-      intvn_mean_reduction = intvn_mean_reduction,
-      offspring_dist = offspring_dist,
-      pars_list = pars
-    )
-  }
-
   if (!missing(serials_dist)) {
     check_serial_valid(serials_dist)
   } else if (!missing(tf)) {
@@ -284,7 +253,6 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 #' Simulate transmission chains sizes/lengths
 #'
 #' @inheritParams simulate_tree
-#' @inheritParams intvn_reduce_mean
 #' @param stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to `Inf`.
 #' @inheritSection simulate_tree Calculating chain sizes and lengths
@@ -303,22 +271,9 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 #'   stat_max = 10,
 #'   lambda = 2
 #' )
-#'
-#' # Run model with intervention a 50% reduction in R0.
-#' chain_summary_with_intvn <- simulate_summary(
-#'   nchains = 10,
-#'   statistic = "size",
-#'   offspring_dist = "pois",
-#'   intvn_mean_reduction = 0.5,
-#'   stat_max = 10,
-#'   lambda = 2
-#' )
-#'
-#' chain_summary_with_intvn
 #' @export
 simulate_summary <- function(nchains, statistic = c("size", "length"),
                              offspring_dist,
-                             intvn_mean_reduction = 0,
                              stat_max = Inf, ...) {
   statistic <- match.arg(statistic)
 
@@ -327,13 +282,6 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
   # check that offspring is properly specified
   check_offspring_valid(offspring_dist)
 
-  # Check that the intvn_mean_reduction is well specified
-  checkmate::assert_number(
-    intvn_mean_reduction,
-    lower = 0,
-    upper = 1
-  )
-
   # check that offspring function exists in base R
   roffspring_name <- paste0("r", offspring_dist)
   check_offspring_func_valid(roffspring_name)
@@ -341,15 +289,6 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
   # Gather offspring distribution parameters
   pars <- list(...)
 
-  # Prepare interventions if specified
-  if (intvn_mean_reduction > 0) {
-    pars <- intvn_reduce_mean(
-      intvn_mean_reduction = intvn_mean_reduction,
-      offspring_dist = offspring_dist,
-      pars_list = pars
-    )
-  }
-
   # Initialisations
   stat_track <- rep(1, nchains) ## track length or size (depending on `stat`)
   n_offspring <- rep(1, nchains) ## current number of offspring
@@ -404,7 +343,6 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' population
 #'
 #' @inheritParams simulate_tree
-#' @inheritParams intvn_mean_reduction
 #' @param pop The susceptible population size.
 #' @param offspring_dist Offspring distribution: a character string
 #' corresponding to the R distribution function (e.g., "pois" for Poisson,
@@ -465,21 +403,9 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' size = 1.1,
 #' serials_dist = function(x) 3
 #' )
-#'
-#' # Simulate with negative binomial offspring with intervention (50%
-#' # reduction in R0)
-#' simulate_tree_from_pop(
-#'   pop = 100,
-#'   offspring_dist = "nbinom",
-#'   intvn_mean_reduction = 0.5,
-#'   mu = 0.5,
-#'   size = 1.1,
-#'   serials_dist = function(x) 3
-#' )
 #' @export
 simulate_tree_from_pop <- function(pop,
                                    offspring_dist = c("pois", "nbinom"),
-                                   intvn_mean_reduction = 0,
                                    serials_dist,
                                    initial_immune = 0,
                                    t0 = 0,
@@ -487,25 +413,9 @@ simulate_tree_from_pop <- function(pop,
                                    ...) {
   offspring_dist <- match.arg(offspring_dist)
 
-  # Check that the intvn_mean_reduction is well specified
-  checkmate::assert_number(
-    intvn_mean_reduction,
-    lower = 0,
-    upper = 1
-  )
-
   # Gather offspring distribution parameters
   pars <- list(...)
 
-  # Prepare interventions if specified
-  if (intvn_mean_reduction > 0) {
-    pars <- intvn_reduce_mean(
-      intvn_mean_reduction = intvn_mean_reduction,
-      offspring_dist = offspring_dist,
-      pars_list = pars
-    )
-  }
-
   if (offspring_dist == "pois") {
     ## Use a right truncated poisson distribution
     ## to avoid more cases than susceptibles
diff --git a/man/intvn_reduce_mean.Rd b/man/intvn_reduce_mean.Rd
deleted file mode 100644
index 080ec2a9..00000000
--- a/man/intvn_reduce_mean.Rd
+++ /dev/null
@@ -1,55 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/intervention.R
-\name{intvn_reduce_mean}
-\alias{intvn_reduce_mean}
-\title{Reduce the mean of the offspring distribution}
-\usage{
-intvn_reduce_mean(intvn_mean_reduction, offspring_dist, pars_list)
-}
-\arguments{
-\item{intvn_mean_reduction}{A number between 0
-and 1 for scaling/reducing the mean of \code{offspring_dist}. Serves as
-population-level intervention. \code{intvn_mean_reduction} = 0
-implies no intervention impact and \code{intvn_mean_reduction} = 1 implies full
-impact.}
-
-\item{offspring_dist}{Offspring distribution: a character string
-corresponding to the R distribution function (e.g., "pois" for Poisson,
-where \code{\link{rpois}} is the R function to generate Poisson random
-numbers).}
-
-\item{pars_list}{Parameter(s) for poisson or negative binomial offspring
-distribution.}
-}
-\value{
-List of the offspring distribution parameter(s) with the mean
-scaled by \code{1 - intvn_mean_reduction}.
-}
-\description{
-\code{intvn_reduce_mean()} is a helper for the \code{simulate_*()} functions. It
-reduces/scales the mean of the offspring distribution in order to
-mimic the impact of a population-level intervention. Currently, it can only
-handle the poisson and negative binomial distributions and errors when other
-offspring distributions are specified alongside the \code{intvn_mean_reduction}
-argument.
-}
-\details{
-\code{intvn_reduce_mean()} scales the mean of the offspring distribution
-by \eqn{1 - {\sf intvn\_mean\_reduction}} so that the new mean is given as:
-\deqn{(1 - {\sf intvn\_mean\_reduction}) \times {\sf mean,}} This
-scaling when applied to the poisson and negative binomial offspring
-distributions corresponds to the population-level reduction of R0 as
-described in Lloyd-Smith et al, (2005). \code{intvn_reduce_mean()} is therefore
-only implemented for the aforementioned distributions and errors when other
-offspring distributions are specified along with the \code{intvn_mean_reduction}
-argument in the \code{simulate_*()} functions.
-}
-\references{
-Lloyd-Smith, J., Schreiber, S., Kopp, P. et al. Superspreading
-and the effect of individual variation on disease emergence. Nature 438,
-355–359 (2005). \doi{10.1038/nature04153}
-}
-\author{
-James M. Azam
-}
-\keyword{internal}
diff --git a/man/simulate_summary.Rd b/man/simulate_summary.Rd
index 9c1283af..a453d680 100644
--- a/man/simulate_summary.Rd
+++ b/man/simulate_summary.Rd
@@ -8,7 +8,6 @@ simulate_summary(
   nchains,
   statistic = c("size", "length"),
   offspring_dist,
-  intvn_mean_reduction = 0,
   stat_max = Inf,
   ...
 )
@@ -29,12 +28,6 @@ corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers).}
 
-\item{intvn_mean_reduction}{A number between 0
-and 1 for scaling/reducing the mean of \code{offspring_dist}. Serves as
-population-level intervention. \code{intvn_mean_reduction} = 0
-implies no intervention impact and \code{intvn_mean_reduction} = 1 implies full
-impact.}
-
 \item{stat_max}{A cut off for the chain statistic (size/length) being
 computed. Results above the specified value, are set to \code{Inf}.}
 
@@ -105,18 +98,6 @@ simulate_summary(
   stat_max = 10,
   lambda = 2
 )
-
-# Run model with intervention a 50\% reduction in R0.
-chain_summary_with_intvn <- simulate_summary(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  intvn_mean_reduction = 0.5,
-  stat_max = 10,
-  lambda = 2
-)
-
-chain_summary_with_intvn
 }
 \seealso{
 \itemize{
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index cb3d0629..1d6722dc 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -9,7 +9,6 @@ simulate_tree(
   statistic = c("size", "length"),
   offspring_dist,
   stat_max = Inf,
-  intvn_mean_reduction = 0,
   serials_dist,
   t0 = 0,
   tf = Inf,
@@ -36,12 +35,6 @@ numbers).}
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
 
-\item{intvn_mean_reduction}{A number between 0
-and 1 for scaling/reducing the mean of \code{offspring_dist}. Serves as
-population-level intervention. \code{intvn_mean_reduction} = 0
-implies no intervention impact and \code{intvn_mean_reduction} = 1 implies full
-impact.}
-
 \item{serials_dist}{The serial interval distribution function; the name
 of a user-defined named or anonymous function with only one argument \code{n},
 representing the number of serial intervals to generate. See details.}
@@ -128,19 +121,6 @@ chains <- simulate_tree(
   serials_dist = function(x) 3,
   lambda = 2
 )
-
-# Run model with intervention a 50\% reduction in R0.
-chains_with_intvn <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  intvn_mean_reduction = 0.5,
-  stat_max = 10,
-  serials_dist = function(x) 3,
-  lambda = 2
-)
-
-chains_with_intvn
 }
 \references{
 Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 030f9fe8..4f42d429 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -8,7 +8,6 @@ population}
 simulate_tree_from_pop(
   pop,
   offspring_dist = c("pois", "nbinom"),
-  intvn_mean_reduction = 0,
   serials_dist,
   initial_immune = 0,
   t0 = 0,
@@ -24,12 +23,6 @@ corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers). Only supports "pois" and "nbinom".}
 
-\item{intvn_mean_reduction}{A number between 0
-and 1 for scaling/reducing the mean of \code{offspring_dist}. Serves as
-population-level intervention. \code{intvn_mean_reduction} = 0
-implies no intervention impact and \code{intvn_mean_reduction} = 1 implies full
-impact.}
-
 \item{serials_dist}{The serial interval distribution function; the name
 of a user-defined named or anonymous function with only one argument \code{n},
 representing the number of serial intervals to generate. See details.}
@@ -137,17 +130,6 @@ mu = 0.5,
 size = 1.1,
 serials_dist = function(x) 3
 )
-
-# Simulate with negative binomial offspring with intervention (50\%
-# reduction in R0)
-simulate_tree_from_pop(
-  pop = 100,
-  offspring_dist = "nbinom",
-  intvn_mean_reduction = 0.5,
-  mu = 0.5,
-  size = 1.1,
-  serials_dist = function(x) 3
-)
 }
 \seealso{
 \itemize{
diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index b6fd787e..4852fd9c 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -20,25 +20,6 @@ test_that("Simulators work", {
     size = 1.1,
     serials_dist = serial_func
   )
-  #' Simulate an outbreak from a susceptible population (pois) with
-  #' 50% R0 reduction
-  susc_outbreak_raw_intvn <- simulate_tree_from_pop(
-    pop = 100,
-    offspring_dist = "pois",
-    lambda = 1.5,
-    serials_dist = serial_func,
-    intvn_mean_reduction = 0.5
-  )
-  #' Simulate an outbreak from a susceptible population (nbinom) with
-  #' 50% R0 reduction
-  susc_outbreak_raw_intvn2 <- simulate_tree_from_pop(
-    pop = 100,
-    offspring_dist = "nbinom",
-    mu = 1.5,
-    size = 1.1,
-    serials_dist = serial_func,
-    intvn_mean_reduction = 0.5
-  )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
     nchains = 2,
@@ -55,25 +36,6 @@ test_that("Simulators work", {
     serials_dist = function(x) 3,
     lambda = 2
   )
-  #' Simulate a tree of infections without serials and with 50% reduction
-  #' in R0
-  tree_sim_raw_intvn <- simulate_tree(
-    nchains = 2,
-    offspring_dist = "pois",
-    statistic = "length",
-    lambda = 0.9,
-    intvn_mean_reduction = 0.5
-  )
-  #' Simulate a tree of infections with nbinom offspring and with 50% reduction
-  #' in R0
-  tree_sim_raw_intvn2 <- simulate_tree(
-    nchains = 2,
-    offspring_dist = "nbinom",
-    statistic = "length",
-    mu = 0.9,
-    size = 1.1,
-    intvn_mean_reduction = 0.5
-  )
   #' Simulate chain statistics
   chain_summary_raw <- simulate_summary(
     nchains = 2,
@@ -81,37 +43,11 @@ test_that("Simulators work", {
     statistic = "length",
     lambda = 0.9
   )
-  #' Simulate chain statistics and with a 50% reduction in R0
-  chain_summary_raw_intvn <- simulate_summary(
-    nchains = 2,
-    offspring_dist = "pois",
-    statistic = "length",
-    lambda = 0.9,
-    intvn_mean_reduction = 0.5
-  )
-  #' Simulate chain statistics with nbinom offspring and with a 50% reduction
-  #' in R0
-  chain_summary_raw_intvn2 <- simulate_summary(
-    nchains = 2,
-    offspring_dist = "nbinom",
-    statistic = "length",
-    mu = 1.9,
-    size = 1.1,
-    intvn_mean_reduction = 0.5
-  )
   #' Expectations
   expect_length(
     chain_summary_raw,
     2
   )
-  expect_length(
-    chain_summary_raw_intvn,
-    2
-  )
-  expect_length(
-    chain_summary_raw_intvn2,
-    2
-  )
   expect_gte(
     nrow(tree_sim_raw),
     2
@@ -120,14 +56,6 @@ test_that("Simulators work", {
     nrow(tree_sim_raw2),
     2
   )
-  expect_identical(
-    nrow(tree_sim_raw_intvn),
-    3L
-  )
-  expect_identical(
-    nrow(tree_sim_raw_intvn2),
-    2L
-  )
   expect_gte(
     nrow(susc_outbreak_raw),
     1
@@ -136,14 +64,6 @@ test_that("Simulators work", {
     nrow(susc_outbreak_raw2),
     1
   )
-  expect_identical(
-    nrow(susc_outbreak_raw_intvn),
-    2L
-  )
-  expect_identical(
-    nrow(susc_outbreak_raw_intvn2),
-    1L
-  )
   expect_true(
     all(
       simulate_tree(
@@ -218,17 +138,6 @@ test_that("simulate_tree throws errors", {
     ),
     "must be specified"
   )
-  expect_error(
-    simulate_tree(
-      nchains = 2,
-      offspring_dist = "binom",
-      statistic = "length",
-      size = 1,
-      prob = 0.5,
-      intvn_mean_reduction = 0.5
-    ),
-    "must be one of"
-  )
 })
 
 test_that("simulate_summary throws errors", {
@@ -270,17 +179,6 @@ test_that("simulate_summary throws errors", {
     ),
     "character string"
   )
-  expect_error(
-    simulate_summary(
-      nchains = 2,
-      offspring_dist = "binom",
-      statistic = "length",
-      size = 1,
-      prob = 0.5,
-      intvn_mean_reduction = 0.5
-    ),
-    "must be one of"
-  )
 })
 
 test_that("simulate_tree_from_pop throws errors", {
@@ -333,18 +231,8 @@ test_that("simulate_tree is numerically correct", {
     statistic = "length",
     lambda = 0.9
   )
-  #' Simulate a tree of infections without serials and with 50% reduction
-  #' in R0
-  tree_sim_raw_intvn <- simulate_tree(
-    nchains = 2,
-    offspring_dist = "pois",
-    statistic = "length",
-    lambda = 0.9,
-    intvn_mean_reduction = 0.5
-  )
   #' summarise the results
   tree_sim_summary <- summary(tree_sim_raw)
-  tree_sim_intvn_summary <- summary(tree_sim_raw_intvn)
   #' Expectations
   expect_identical(
     tree_sim_summary$chains_run,
@@ -414,17 +302,8 @@ test_that("simulate_summary is numerically correct", {
     statistic = "length",
     lambda = 0.9
   )
-  #' Simulate chain statistics and with a 50% reduction in R0
-  chain_summary_raw_intvn <- simulate_summary(
-    nchains = 2,
-    offspring_dist = "pois",
-    statistic = "length",
-    lambda = 0.9,
-    intvn_mean_reduction = 0.5
-  )
   #' Summarise the results
   chain_summary_summaries <- summary(chain_summary_raw)
-  chain_summary_intvn_summaries <- summary(chain_summary_raw_intvn)
   #' Expectations
   expect_identical(
     chain_summary_summaries$chains_run,
@@ -442,22 +321,6 @@ test_that("simulate_summary is numerically correct", {
     as.vector(chain_summary_raw),
     c(1.00, 3.00)
   )
-  expect_identical(
-    chain_summary_intvn_summaries$chains_run,
-    2.00
-  )
-  expect_identical(
-    chain_summary_intvn_summaries$max_chain_stat,
-    2.00
-  )
-  expect_identical(
-    chain_summary_intvn_summaries$min_chain_stat,
-    1.00
-  )
-  expect_identical(
-    as.vector(chain_summary_raw_intvn),
-    c(2.00, 1.00)
-  )
 })
 
 test_that("simulate_tree_from_pop is numerically correct", {
@@ -469,19 +332,8 @@ test_that("simulate_tree_from_pop is numerically correct", {
     lambda = 0.9,
     serials_dist = serial_func
   )
-  #' Simulate an outbreak from a susceptible population (pois) with
-  #' 50% R0 reduction
-  set.seed(7)
-  susc_outbreak_raw_intvn <- simulate_tree_from_pop(
-    pop = 100,
-    offspring_dist = "pois",
-    lambda = 1.5,
-    serials_dist = serial_func,
-    intvn_mean_reduction = 0.5
-  )
   #' Summarise the results
   susc_outbreak_summary <- summary(susc_outbreak_raw)
-  susc_outbreak_summary_intvn <- summary(susc_outbreak_raw_intvn)
   #' Expectations
   expect_identical(
     susc_outbreak_summary$unique_ancestors,
@@ -512,25 +364,4 @@ test_that("simulate_tree_from_pop is numerically correct", {
     susc_outbreak_raw$time,
     0.00
   )
-  #' Expectations for intervention simulation
-  expect_identical(
-    susc_outbreak_summary_intvn$unique_ancestors,
-    12L
-  )
-  expect_identical(
-    round(
-      susc_outbreak_summary_intvn$max_time,
-      1
-    ),
-    72.1
-  )
-  expect_identical(
-    susc_outbreak_summary_intvn$max_generation,
-    10L
-  )
-  expect_null(susc_outbreak_summary_intvn$chains_run)
-  expect_identical(
-    sum(aggregate(susc_outbreak_raw_intvn, "time")$cases),
-    20L
-  )
 })
diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 3f8c0b90..7d361795 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -255,72 +255,6 @@ sim_tree_from_pop_eg <- simulate_tree_from_pop(
 head(sim_tree_from_pop_eg)
 ```
 
-#### Simulating chains with interventions
-
-All the `simulate_*()` functions can model interventions that reduce the $R_0$,
-using the `intvn_mean_reduction` argument. In general, these can be
-interpreted as population-level interventions.
-
-To illustrate this, we will use the previous examples for each function and specify
-a population-level intervention that reduces $R_0$ by $50\%$.
-
-Using `simulate_tree()`, we can specify an initial number of cases
-and a population level intervention, `intvn_mean_reduction`, that reduces $R_0$ by $50\%$.
-
-```{r}
-set.seed(123)
-# Define serial distribution
-serial_func <- function(x) {
-  return(3)
-}
-
-sim_tree_intvn_eg <- simulate_tree(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  intvn_mean_reduction = 0.5,
-  stat_max = 10,
-  serials_dist = serial_func,
-  lambda = 0.9
-)
-
-head(sim_tree_intvn_eg)
-```
-
-Here is an example with `simulate_summary()`, modelling an intervention that reduces $R_0$ by $50\%$.
-```{r}
-simulate_summary_intvn_eg <- simulate_summary(
-  nchains = 10,
-  statistic = "size",
-  offspring_dist = "pois",
-  intvn_mean_reduction = 0.5,
-  stat_max = 10,
-  lambda = 0.9
-)
-
-# Print the results
-simulate_summary_intvn_eg
-```
-
-Finally, let's use `simulate_tree_from_pop()`.
-```{r}
-set.seed(7)
-# Define serial distribution
-serial_func <- function(x) {
-  return(3)
-}
-
-sim_tree_from_pop_intvn_eg <- simulate_tree_from_pop(
-  pop = 1000,
-  offspring_dist = "pois",
-  intvn_mean_reduction = 0.5,
-  lambda = 1,
-  serials_dist = serial_func
-)
-
-head(sim_tree_from_pop_intvn_eg)
-```
-
 ## Other functionalities
 
 ### Summarising

From dedfd71c2511fee81a84076770b0bab255807342 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 8 Nov 2023 15:25:40 +0000
Subject: [PATCH 725/828] add interventions vignette

---
 vignettes/interventions.Rmd | 162 ++++++++++++++++++++++++++++++++++++
 1 file changed, 162 insertions(+)
 create mode 100644 vignettes/interventions.Rmd

diff --git a/vignettes/interventions.Rmd b/vignettes/interventions.Rmd
new file mode 100644
index 00000000..f642fc5d
--- /dev/null
+++ b/vignettes/interventions.Rmd
@@ -0,0 +1,162 @@
+---
+title: "Modelling disease control interventions"
+author: "Sebastian Funk and James M. Azam"
+output:
+  bookdown::html_vignette2:
+    fig_caption: yes
+    code_folding: show
+pkgdown:
+  as_is: true
+bibliography: references.json
+link-citations: true
+vignette: >
+  %\VignetteIndexEntry{Modelling disease control interventions}
+  %\VignetteEncoding{UTF-8}
+  %\VignetteEngine{knitr::rmarkdown}
+editor_options: 
+  chunk_output_type: console
+---
+
+```{r include=FALSE}
+knitr::opts_chunk$set(
+  echo = TRUE,
+  message = FALSE,
+  warning = FALSE,
+  collapse = TRUE,
+  comment = "#>"
+)
+```
+
+_epichains_ does not provide any direct functionality for studying public health interventions.
+However, the flexible simulation functionality that it includes can be used to consider some specific changes to the parameters that can be interpreted as the result of control measures.
+Here we investigate the effect on outbreak sizes, but the same approaches could be used for investigating chain lengths (using the `statistic` argument to `simulate_summary`) or the time progression of outbreaks (using the `simulate_tree` function).
+
+```{r}
+library("epichains")
+```
+
+As a base case we consider the spread of an infection with a negative binomial offspring distribution with mean 1.2 and overdispersion parameter 0.5.
+We simulate 200 chains tracking up to 99 infections:
+
+```{r simulate_chains}
+sims <- simulate_summary(
+  nchains = 200, offspring_dist = "nbinom", stat_max = 99, mu = 1.2, size = 0.5
+)
+```
+
+We then plot the resulting distribution of chain sizes
+```{r uncontrolled_chains_plot}
+library("ggplot2")
+sims[is.infinite(sims)] <- 100
+ggplot(data.frame(x = sims), aes(x = x)) +
+  geom_histogram(breaks = seq(0, 100, by = 5), closed = "left") +
+  scale_x_continuous(breaks = c(0, 25, 50, 75, 100),
+                     labels = c(0, 25, 50, 75, ">99")) +
+  theme_bw()
+```
+
+# Reductions in the strength of transmission
+
+Following [@lloyd-smith2005] we consider two ways in which disease control interventions can reduce the reproduction number: _population-wide_ and _individual-specific_ control.
+
+## Population-wide control
+
+By population-level control we mean an intervention that reduces the mean number of offspring (i.e. the reproduction number) by a fixed proportion.
+For example, to reduce R by 25% at the population level we scale the `mu` parameter from 1.2 to 0.9:
+
+```{r simulate_chains_pop_control}
+sims <- simulate_summary(
+  nchains = 200, offspring_dist = "nbinom", stat_max = 99, mu = 0.9, size = 0.5
+)
+sims[is.infinite(sims)] <- 100
+ggplot(data.frame(x = sims), aes(x = x)) +
+  geom_histogram(breaks = seq(0, 100, by = 5), closed = "left") +
+  scale_x_continuous(breaks = c(0, 25, 50, 75, 100),
+                     labels = c(0, 25, 50, 75, ">99")) +
+  theme_bw()
+```
+
+## Individual-level control.
+
+In simulating population-level control we now apply the same reduction as before (25%) but instead of assuming that the mean is reduced we apply this such that 25% of individuals do not transmit further at all, whereas the remaining 75% generate offspring as in the uncontrolled case.
+
+To do this, we can no longer use the standard negative binomial distribution that comes with R.
+Instead, we define a random generator from a modified negative binomial distribution that includes our individual-level control as a `control` argument indicating the level of individual-level control (0: no control; 1: full control):
+
+```{r nbinom_ind_control}
+rnbinom_ind <- function(n, ..., control = 0) {
+  ## initialise number of offspring to 0
+  offspring <- rep(0L, n)
+  ## for each individual, decide whether they transmit further
+  transmits <- rbinom(n = n, prob = 1 - control, size = 1)
+  ## check if anyone transmits further
+  if (sum(transmits) > 0L) {
+    ## for those that transmit, sample from negative binomial with given
+    ## parameters
+    offspring[which(transmits == 1L)] <- rnbinom(n = n, ...)
+  }
+  return(offspring)
+}
+```
+
+Having defined this, we can generate simulations as before:
+
+```{r simulate_chains_ind_control}
+sims <- simulate_summary(
+  nchains = 200, offspring_dist = "nbinom_ind", stat_max = 99, mu = 1.2, 
+  size = 0.5, control = 0.25
+)
+sims[is.infinite(sims)] <- 100
+ggplot(data.frame(x = sims), aes(x = x)) +
+  geom_histogram(breaks = seq(0, 100, by = 5), closed = "left") +
+  scale_x_continuous(breaks = c(0, 25, 50, 75, 100),
+                     labels = c(0, 25, 50, 75, ">99")) +
+  theme_bw()
+```
+
+# Preventing superspreading events
+
+Another way of controlling a disease would be to prevent individuals from spreading to a large number of others, for example by preventing mass gatherings or, more generally, settings where superspreading events can occur.
+
+We can model this by truncating the offspring distribution at a certain size.
+This can be done, for example, using the [truncdist](https://cran.r-project.org/package=truncdist) R package.
+We use this to define a truncated negative binomial offspring distribution:
+
+```{r negbin_truncated}
+library("truncdist")
+rnbinom_truncated <- function(n, ..., max = Inf) {
+  return(rtrunc(n = n, spec = "nbinom", b = max, ...))
+}
+```
+
+We use this to simulate chains in a situation where the maximum of secondary cases that each infected person can generate is 10.
+
+```{r simulate_chains_truncated}
+sims <- simulate_summary(
+  nchains = 200, offspring_dist = "nbinom_truncated", stat_max = 99, mu = 1.2, 
+  size = 0.5, max =  10
+)
+sims[is.infinite(sims)] <- 100
+ggplot(data.frame(x = sims), aes(x = x)) +
+  geom_histogram(breaks = seq(0, 100, by = 5), closed = "left") +
+  scale_x_continuous(breaks = c(0, 25, 50, 75, 100),
+                     labels = c(0, 25, 50, 75, ">99")) +
+  theme_bw()
+```
+
+# Truncating the generation interval
+
+Lastly, we consider a situation where the generation interval is shortened.
+We do not model this explicitly but instead consider the effect on the offspring distribution.
+
+For example, if our generation interval is from a gamma distribution with shape = 25 and rate = 5 (corresponding to a mean of 5 and standard deviation of 1), and we stop all transmission that would normally occur more than 6 days after infection, we can calculate the proportion of transmissions that are prevented as
+
+```{r truncate_gen_int}
+control <- 1 - pgamma(6, shape = 25, rate = 5)
+signif(control, 2)
+```
+
+# References
+
+In other words, this would prevent `r round(100 * control)`% of infections in this example.
+The value of `control` can be used in the examples above to study the effect on outbreak sizes.

From 84a38f74366f3ee98e4c03da5bab83d6930c82a2 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 8 Nov 2023 15:25:53 +0000
Subject: [PATCH 726/828] update pkgdown

---
 _pkgdown.yml | 1 +
 1 file changed, 1 insertion(+)

diff --git a/_pkgdown.yml b/_pkgdown.yml
index cc8cc70d..a84a41a0 100644
--- a/_pkgdown.yml
+++ b/_pkgdown.yml
@@ -10,6 +10,7 @@ articles:
   navbar: Package vignettes
   contents:
   - projecting_incidence
+  - interventions
 - title: Modelling guides and background
   navbar: Modelling guides and background
   contents:

From 0c9068486ebf23484dd0b3ce052d073e0c08a30b Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Wed, 8 Nov 2023 15:26:34 +0000
Subject: [PATCH 727/828] add news item

---
 NEWS.md | 5 +++++
 1 file changed, 5 insertions(+)

diff --git a/NEWS.md b/NEWS.md
index 228aab65..0a4161fe 100644
--- a/NEWS.md
+++ b/NEWS.md
@@ -1,3 +1,8 @@
+# epichains 0.1.9999
+
+## Documentation
+* A vignette outlining how to simulate interventions has been added
+
 # epichains 0.1.0
 
 ## Package name change

From 9e391282b390f0ea45c7ec3278385a7a2c0a5b7f Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 9 Nov 2023 09:55:00 +0000
Subject: [PATCH 728/828] Apply suggestions from code review

Co-authored-by: James Azam <james.m.azam@gmail.com>
---
 vignettes/interventions.Rmd | 16 +++++++++-------
 1 file changed, 9 insertions(+), 7 deletions(-)

diff --git a/vignettes/interventions.Rmd b/vignettes/interventions.Rmd
index f642fc5d..0696a4ad 100644
--- a/vignettes/interventions.Rmd
+++ b/vignettes/interventions.Rmd
@@ -47,7 +47,7 @@ sims <- simulate_summary(
 We then plot the resulting distribution of chain sizes
 ```{r uncontrolled_chains_plot}
 library("ggplot2")
-sims[is.infinite(sims)] <- 100
+sims[is.infinite(sims)] <- 100 # Replace infections > 99 with 100 for plotting.
 ggplot(data.frame(x = sims), aes(x = x)) +
   geom_histogram(breaks = seq(0, 100, by = 5), closed = "left") +
   scale_x_continuous(breaks = c(0, 25, 50, 75, 100),
@@ -55,7 +55,7 @@ ggplot(data.frame(x = sims), aes(x = x)) +
   theme_bw()
 ```
 
-# Reductions in the strength of transmission
+# Reducing the strength of transmission
 
 Following [@lloyd-smith2005] we consider two ways in which disease control interventions can reduce the reproduction number: _population-wide_ and _individual-specific_ control.
 
@@ -68,7 +68,7 @@ For example, to reduce R by 25% at the population level we scale the `mu` parame
 sims <- simulate_summary(
   nchains = 200, offspring_dist = "nbinom", stat_max = 99, mu = 0.9, size = 0.5
 )
-sims[is.infinite(sims)] <- 100
+sims[is.infinite(sims)] <- 100 # Replace infections > 99 with 100 for plotting.
 ggplot(data.frame(x = sims), aes(x = x)) +
   geom_histogram(breaks = seq(0, 100, by = 5), closed = "left") +
   scale_x_continuous(breaks = c(0, 25, 50, 75, 100),
@@ -90,7 +90,7 @@ rnbinom_ind <- function(n, ..., control = 0) {
   ## for each individual, decide whether they transmit further
   transmits <- rbinom(n = n, prob = 1 - control, size = 1)
   ## check if anyone transmits further
-  if (sum(transmits) > 0L) {
+  if (any(transmits == 1L)) {
     ## for those that transmit, sample from negative binomial with given
     ## parameters
     offspring[which(transmits == 1L)] <- rnbinom(n = n, ...)
@@ -106,7 +106,7 @@ sims <- simulate_summary(
   nchains = 200, offspring_dist = "nbinom_ind", stat_max = 99, mu = 1.2, 
   size = 0.5, control = 0.25
 )
-sims[is.infinite(sims)] <- 100
+sims[is.infinite(sims)] <- 100 # Replace infections > 99 with 100 for plotting.
 ggplot(data.frame(x = sims), aes(x = x)) +
   geom_histogram(breaks = seq(0, 100, by = 5), closed = "left") +
   scale_x_continuous(breaks = c(0, 25, 50, 75, 100),
@@ -130,13 +130,14 @@ rnbinom_truncated <- function(n, ..., max = Inf) {
 ```
 
 We use this to simulate chains in a situation where the maximum of secondary cases that each infected person can generate is 10.
+This can be likened to a disease control strategy where gatherings are limited to 10 people.
 
 ```{r simulate_chains_truncated}
 sims <- simulate_summary(
   nchains = 200, offspring_dist = "nbinom_truncated", stat_max = 99, mu = 1.2, 
   size = 0.5, max =  10
 )
-sims[is.infinite(sims)] <- 100
+sims[is.infinite(sims)] <- 100 # Replace infections > 99 with 100 for plotting.
 ggplot(data.frame(x = sims), aes(x = x)) +
   geom_histogram(breaks = seq(0, 100, by = 5), closed = "left") +
   scale_x_continuous(breaks = c(0, 25, 50, 75, 100),
@@ -156,7 +157,8 @@ control <- 1 - pgamma(6, shape = 25, rate = 5)
 signif(control, 2)
 ```
 
-# References
 
 In other words, this would prevent `r round(100 * control)`% of infections in this example.
 The value of `control` can be used in the examples above to study the effect on outbreak sizes.
+
+# References

From ee2b201ae91f55f7cb0d8b3219a234863d50c478 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 9 Nov 2023 12:46:10 +0000
Subject: [PATCH 729/828] make lintr happy

---
 vignettes/interventions.Rmd | 13 ++++++++-----
 1 file changed, 8 insertions(+), 5 deletions(-)

diff --git a/vignettes/interventions.Rmd b/vignettes/interventions.Rmd
index 0696a4ad..2176283f 100644
--- a/vignettes/interventions.Rmd
+++ b/vignettes/interventions.Rmd
@@ -31,8 +31,13 @@ _epichains_ does not provide any direct functionality for studying public health
 However, the flexible simulation functionality that it includes can be used to consider some specific changes to the parameters that can be interpreted as the result of control measures.
 Here we investigate the effect on outbreak sizes, but the same approaches could be used for investigating chain lengths (using the `statistic` argument to `simulate_summary`) or the time progression of outbreaks (using the `simulate_tree` function).
 
-```{r}
+```{r load_libraries}
+## main package
 library("epichains")
+## for plotting
+library("ggplot2")
+## for truncating the offspring distribution later
+library("truncdist")
 ```
 
 As a base case we consider the spread of an infection with a negative binomial offspring distribution with mean 1.2 and overdispersion parameter 0.5.
@@ -46,7 +51,6 @@ sims <- simulate_summary(
 
 We then plot the resulting distribution of chain sizes
 ```{r uncontrolled_chains_plot}
-library("ggplot2")
 sims[is.infinite(sims)] <- 100 # Replace infections > 99 with 100 for plotting.
 ggplot(data.frame(x = sims), aes(x = x)) +
   geom_histogram(breaks = seq(0, 100, by = 5), closed = "left") +
@@ -103,7 +107,7 @@ Having defined this, we can generate simulations as before:
 
 ```{r simulate_chains_ind_control}
 sims <- simulate_summary(
-  nchains = 200, offspring_dist = "nbinom_ind", stat_max = 99, mu = 1.2, 
+  nchains = 200, offspring_dist = "nbinom_ind", stat_max = 99, mu = 1.2,
   size = 0.5, control = 0.25
 )
 sims[is.infinite(sims)] <- 100 # Replace infections > 99 with 100 for plotting.
@@ -123,7 +127,6 @@ This can be done, for example, using the [truncdist](https://cran.r-project.org/
 We use this to define a truncated negative binomial offspring distribution:
 
 ```{r negbin_truncated}
-library("truncdist")
 rnbinom_truncated <- function(n, ..., max = Inf) {
   return(rtrunc(n = n, spec = "nbinom", b = max, ...))
 }
@@ -134,7 +137,7 @@ This can be likened to a disease control strategy where gatherings are limited t
 
 ```{r simulate_chains_truncated}
 sims <- simulate_summary(
-  nchains = 200, offspring_dist = "nbinom_truncated", stat_max = 99, mu = 1.2, 
+  nchains = 200, offspring_dist = "nbinom_truncated", stat_max = 99, mu = 1.2,
   size = 0.5, max =  10
 )
 sims[is.infinite(sims)] <- 100 # Replace infections > 99 with 100 for plotting.

From fd813987499b6922565cbf2e2ab73eb521f35e1a Mon Sep 17 00:00:00 2001
From: Hugo Gruson <Bisaloo@users.noreply.github.com>
Date: Mon, 6 Nov 2023 17:35:36 +0100
Subject: [PATCH 730/828] Sum loglikelihoods before exponentiating

---
 R/likelihood.R | 20 +++++++++++---------
 1 file changed, 11 insertions(+), 9 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 63b33af1..68f0f24c 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -134,17 +134,19 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
     likelihoods[sx[!(sx %in% exclude)]]
   })
 
-  ## transform log-likelihoods into likelihoods if required
-  if (isFALSE(log)) {
-    chains_likelihood <- lapply(chains_likelihood, exp)
+  ## if individual == FALSE, return the joint log-likelihood
+  ## (sum of the log-likelihoods)
+  if (!individual) {
+    chains_likelihood <- vapply(chains_likelihood, sum, 0)
   }
 
-  ## if individual == FALSE, return the joint log-likelihood
-  ## (sum of the log-likelihoods), if log == TRUE, else
-  ## multiply the likelihoods
-  if (isFALSE(individual)) {
-    summarise_func <- ifelse(log, sum, prod)
-    chains_likelihood <- vapply(chains_likelihood, summarise_func, 0)
+  ## transform log-likelihoods into likelihoods if required
+  if (!log) {
+    if (individual) {
+      chains_likelihood <- lapply(chains_likelihood, exp)
+    } else {
+      chains_likelihood <- exp(chains_likelihood)
+    }
   }
 
   return(chains_likelihood)

From c5e26f5d33f57beb5ded4cc9e13ee7c33b661185 Mon Sep 17 00:00:00 2001
From: Hugo Gruson <Bisaloo@users.noreply.github.com>
Date: Wed, 22 Nov 2023 09:38:08 +0100
Subject: [PATCH 731/828] Use svg logo with correct name To allow
 auto-detection by pkgdown and r-universe

---
 man/figures/epichains_logo.png | Bin 31438 -> 0 bytes
 man/figures/logo.svg           | 833 +++++++++++++++++++++++++++++++++
 2 files changed, 833 insertions(+)
 delete mode 100644 man/figures/epichains_logo.png
 create mode 100644 man/figures/logo.svg

diff --git a/man/figures/epichains_logo.png b/man/figures/epichains_logo.png
deleted file mode 100644
index 7963de5bca351f5bf27316d333b322ad1204637c..0000000000000000000000000000000000000000
GIT binary patch
literal 0
HcmV?d00001

literal 31438
zcmXt8V~{3Iw|vIj+2M|D+qP}nwr%ftc5K_WZQHha=lvq?kLrr(=#GjyQFTt9%nX;4
z5&Z>?0Sy2Eeu;|-Dg4Z9KjQ-u{O5^q(suJRLD`9^JN``3{|z85+VGd3m*`HyYEFu_
zCQh#U4#ogiS63Qy8%sw+eLG_sTL;thD=rKG01qH8#INL*aiMMMhP;UJHI<mTnYk%l
z{TCDxs`I%_Nw^I<j{q-%3^D+00Q@N+Vx=_<3=E|(%<o~#6Y4LYfQKxL@%e(IZ;&1^
zc{?Wv`H^?l`-Z0=gQLk&$FbKj<#j0C(2QRjP_<{a70vj7Cm;xS$SuZsxDJMyy9m4m
ze1WfFEq2F~q?YBMI}~6N0#pM48U}zeYycMfVjfSvcC_?}`c;DQOQ?L14-((yW?>i}
z9v7I84gkytF#b*_5ibXQc{C86NpYJNs1<?O@ROM3YtVq6&nfK$LXf2AQ{_8(lSMdb
zz>ta~Qp1{3Xad3m^#}=X=4?Gpw%)#|s^w?RQDB0vOy=h`b0CB+d8Q8tmm8xC0^oxZ
zzB?gqW%5kv&spAbQhqP_biV=!M;Q`{^ZXaDT-XMRp_{$3<Q!#Z{nGzi6L`V1W2(f^
z)R3<l7;pa<5(0pODfTZ<#_QjF+4d8MqTSW<(F4itidpPVr|32x6aXk6d{BX5>NRLF
z9j)*9@u2IAJcb+)MwkkW9>&9q7<#`#P>wlNs{|y{$v$!S$BNi765;hxIA;4&<%-h@
z75BBT@0mjV<Ulv5aL&91G(w1~BZs?pBq~kf^~^p~4)D*ez<altTaozlrU5y=>JXtJ
zdHQ!ZJ{x(YSFUMloszgCQ!ppxjhxKc6{}T^?CwfW9$OVxWzqY^X{<gjH!M6dAZ2_Y
zvlf7&-h|JZux&t)39x|CsH|ideSxOu?*3{lM_MqrM*~RVAqu>0#Y)bn<F@%PaM;2l
zQUb^ulSj>ateDfM=<LFhN2*_TzkUg|2l8nb&r?}u`@`Ro;N4KX>O&!<vBD*9Wxv+k
zZ{c8rGG94!vBYIGU&wBjwjh^Xc$=jmN>#X+JBS>eyG{R=VZK`QJNpSXCuqOlXdpr|
z4+!2GpeH)~+S=$=5=oT9l*{h7Xk1X)^8i}^G-r(YpU)QgbWJ1Tv90AC;|atbNp@eK
zht4uqYrbGJyfy2P((MDYe~CQ&XwhWn;ss>TI5=yezUQU(aK5(l)}N3pq_I5e^?|Qm
zn=~Ovg>yTY0AR=)maJ*=2HCo=&(YR(_K<H5j4+q<9mXAdTmB<m5xxna9x3%>q3)Q<
zfmjpnGc!zdn#~wb7&H)V{}cuu>QJ?oJKN2n{^QfSeSW~o`iKpgorA3A67^Iow&7cK
zzP}j-NG>Z>({)lfWUm1Lk_uDGW{pkE$NMZW#y-%a%mMBmxqW&-ftGeVr#&N0IC+w0
z*21hfYt}23K<;{%8Rj(&%g|d8WL7Dt_M6ZZD5TIYM{pIY!jS1@^|rs|HDQ<oJ*{k&
zUni7o^rnBQKh0IucJUB(U2YYucff*?7>YNu@u)MJnPbEw2t2P8;5{84O|IebyZ^DF
z^qd5pAJ0uskM`@+6xSK_)nj=K)taU$*Hy)O@vG(&l2C#nk(t~(`VGmX+&jX!e>la$
zE!7$4myjZ0!8vg&XMPeS>Ig`n?VJ~k50I(Bl)#h5Jvw_|)KI$SXQe|Fz@*g5rfns`
z!tVR(ae3vadRixg8Bb)4OZcuB*Uz)uaTJ(`fo9YBb0WC_0+8q)+%I?n2L@4gLO*we
z#H6OX`r|EOKQnf7Ia^e@K)MtR9R!|Y%Vd5Q=HcU-^-Q4-W-JH<-VA+AB<>(x-C(2G
zZ8B|rZ=iJA3@<HR&oky)K&}Rv?t$FW<g@|S$KeA0D`^<l3FBdn^GU%SJy1lt2nawc
zadTM|IPu<Xy{us?aeT*oQoYJ<V{w)C{8Z{O?-2u@*9zJ64T2vO_6I136e#yR?(p1P
zk(&c5yNhjcmS~{<XRi@bjtI9;WtVHp?XngJ45wOt@GoeO@6>}aqhROTvSq#aOXxpW
z7bB-?-aL6M+aJ;m2Jlb9M~z{3O_Vu~$IT`pU$=Ov@d~R56txm-R!DN3O1Um8qa|Ko
zO1D<tbx*?L)rlqo8P?;Hp;%kzZ@o!w;}EzO#0*~ph2GiYju@;P6d;1?9Pj3qVGorw
z+Ih;$j7=2rtVp`_mrgQ&ejghB(9LnL<HI>+^U2(O<oM-DMKU=N)Lmhe?v|QACW+F|
z)zJW>d-SOx9=R(<%hf05CI2VKB;1=QAJ`f^kL%)rzRGdC2Q3tLlS{TJrGZJBBfE5^
z&$C?<yDNmAr0v%cZB?;(>c;im>>5+%-2D~+o|okB+Ks>y&qoc6X955yvxKd8TWa3!
zx5{#8Fk5cy*n{{bZU`96PHQ&$VsR9cd%6e$R<_4)Tx0s*5}hC0SD87-)&jxvfB+bD
zeheHP#Rjm~fB?TbBvJ+{As^c%<YDj6wk5MQWPqP`GQTd2l(Ibd3pJ%KBmF|`wkLj8
zTDL*{m4XF}x{H3EdHk~Z2{C#rJIrnJbSnn79d(ayme~+aJ~%K54+!YxSX*r;ER~PV
zp4qWuY|JiGBa!u)3tlK{IA1iwyuuKg&#JdZz1<><woYxO7F&<ejLCfm7pd#wd3Pz}
zUR(G#)oe#wlnqV74GtiRl9L`1u@@xp{SJ4^yANS!U-?>h$~Zm2YS2QNJ`|?HrNQlz
z=d#W>hM|LqfAtwd&p`HY%{qa)e7mh=sPhct%TWmC%I>%2yjfNG7lHLpW+oBZp0XbN
zeJtbL5Noq!2dC)Q(xA0>n9XwTk|o_joH@Rp^S){2I6<S+Nmnfb;U0RMjgKJ!S*tC|
z6#^J5`AU}Z3lr(R>odi1j>25)9-7vd_&XgLlr6w0N1Z|K^4EQNZsr5`dscgNgmPAx
zT>5xU&V@SPowBm!bl7dO<$ECSDH{)C-CK<nI6)Ffu-c)ln}gctDNcT)qjaCWQbh4v
zHwa?HE<&he$?*<49(3D^ea!3Tn5S}ZG)YmF<@6s@xnhP&!&_seMRapho(@v<B^Oho
z&;tZY=@$=M1Ml}%M(LEE@3uxQ$OddERbbU0H#FC@$kN^}lEHtpuY)WPSGb;y+t*yq
zk!^L^sGc=7J@ln^la}EU*NU3*wBaEhvCHV0ORH$T)Mee78q6s?Ubp0;)o_oL)?+9o
znWxuKGQ>uI?|nZ+T`w#&*qx2jh=c>PUYGDks}HJDO-onh%VIr5nVQDE2@!@Q5d@?q
zeH8BS13$bESxuxIcvu{6u*r2xquJE-3g6z)ECzw=Plc#H=VUdV&D{ruR)cGZGGTNk
zUVOHmTZB^UbaJ-fqc7cbNrMii0l8lQK%vo`1(VyFAmIzxJdQnvw$i7y-+9eAv6UY8
z3WarsU)Pp{(*3P&XoHIkS*~U&Y&O5|;1N8|^tbz$zA}WCV9C7@<0IMysiRT&87<)T
zS{}Cd@0NCu?9rqX(=V8+q+>ebk`jqFk{a&nus?J?HvrmL?xxaL;eo!z_C{ZvaPsV_
zV8BPz#Sm68-R4`rq`F0PD~X(Gcu1Q|p?a??IG&UC<Ma0K1)8?fhuuh7&No+UUCN1q
zHelvDp~Rh)BqoQivCFslWrndrM|tGl`Y03eG(GhoUo|z-#iVnhS=h@^Sa4u(aDerg
z3GPd3)Jq{LDVr$|N7GVD^@2#nG|f!Zue(zDf9&XnYK`9+CbPDl&aJG4j_A%(EU|~|
zt=pYy$8)V`$g@(|5TfH~JV1PeW27|-bcbL18+oS1Y@JadZTEb}a+({j%^<BEVxTRp
ziW-?*8g;jH^-lNnAt?n8x2(7-t_<5}85W<$U%~72TkHND{7l(0ARS=gZs_o7AONY=
zikPoWj)=%(?tL?)#RuLv{T&LIbm|2@UBl?~DHZt;dsD>dZ>PwI5o2Fw#%wpoNuA9h
zBvms&OPsa<yx5aTWA`DawIkIw?TE7DL^DWZni*uLc#ZXwG$^n)D1hJCwble7`tbRP
z=MarPM>SI>y-ZT9#!;vst2B;rW9ojn1g!gFVbC~jyp)oW$#@z5ATR2rkJxRx4?|at
zLa>rJt@g90oi5}D)$RWZzy~D(i9D9_8DT%Zy?t!+DrDIU9H6?~xWOY8T#~|4_-piH
zKdnC5Zu$FcLPhlFNaxEW9PJdI9^qzfD-Z^jgEz*Qy}^NCAR&7EhNnqk5r&7vX>rB-
z-j$qY1OD`~SkAcySVVGmv-DWm=eaIjEq~QKV-b!nFSbgwr(-m{M`Z3T^r1LbE#A<L
z=Ptqja=t8MwHtCr+jj?usj_J1xTZYq?Y%N$V`~UAXY?!e#h$uPB#L}B=H0ap&$7)&
zEFm;CkvyO6O_3NU-1}@5a(Xalw$bat1H}Kv%LrEQRJ;^^=$fExXUVsRVvznMD|C(R
zM?7$MdOU)?H(4NfJ~O$TkF^Q%DRIM4W+#?Xkg!?~aS;9;5|2sjO?TGJHi>E7PR*1w
zQ}UFlLc<dQ@nqdmTdf9IS}nEwCBA6HXzRsl82A%L&=T8GGYMVzKvQY-=}FHsdYSQs
z={uCh%~unCfi5QX{A&wik*=Z8q8l+Vb%LzEsmPP^N>b)*8=k60ZP{arOj(l7CyCnV
zY>kio3_({9t-=z7yxPk5P<Tk%J~VD1NE^2}c>bOde(&JtWA5`X$X^z0u`Q6WfZ_-&
zBURc>->rrzvKsj6&o15eC$$Rj{_ZJ6Awa6r#GiCwuc43mi$qTDDozQeg+|Ir4QEC_
zzh*#>(i8l8t#abSZop~`k2c0!kp;+>%*dP!?CoLtmOf(*f!E*EN^W(d!mkUo(>jd1
z@E|FNtuyX6#pSw^y89?1`01I2;Z>O@>yQ3K&g3cQ;jp2mGgd@+od7zaWan{ku$R&M
z?0*+t4j*J$CB^>`viuT^_0Cq`$q<R0)(nclUobWqsFAF=;x3#JK|VJq`3BK;QUdAI
zgne=5jx;WWR$tO}1dr_lUJQC&>wO)L)3WxOkR})EY+=w3hyiA5t8S?wMf!JvgsrT{
zQjTe7W@`(VLF({Ox*FK7T|q!J>exn{hLdT@?Gt~_mZj>C+F1~uINf#sgA}Z|raapo
zA-h5dNs;3g%%srReN5*O>c=+6Xf~L$9qZ5c2=4X>01Ej}>I4?%247kITS`Z)ID}}+
z4XW165ptJ2<oaMcXhi6H<3l^$d@UsA;yhTpt-Tx$&r>DFaYr|VQz&3qWO>s+qc0c0
z2D90rPMET!iIT?xel<Fx-{vT{^~ttbKd}8-CDY8T2OyEXn;HS=1OWgUPY#rrAk4kr
zo3LMZwtp2#vx7pZV6{wqI6RacDTv^!KWz(iKHi<kNC%^aG`NDNU1|MpT=#FTHa6F+
ziDKdzDIV%32KRfHB}L=0SzWo3h*-IbBwq;FembYwPMj*t730KnW96kiHtaCTEyE{P
z4mYt{HN;DvMdX|5x&U!Yk8;~$RLo!`<sOJL`s8p;9DHr?x|=<0Up?@$O!_^_!1Q&o
zyY&>>#&T~K56uvQT+Ao51S}f7_CnbH7YFs=a>=t7K=+~4u{)ov|CC%|*_xV1)&Kk5
zfa&_vSu4d?kt2m)w0IS(?;-DyQCad>A=pf3vYjbK?2NnTP}!F}K1g(EYq<Ndj}gvy
z`z@2jnSee8>-JS8P!L|50S*D!?b{zG2HmyUg_BdT4=qASy|G2THfVP_%FDz<ZPFwH
zIh^6lfoRF8R<@kr2(A<oWrgkG>5Xs=_qUdTka|en{c_JtC{@5m;^F#fO5&C%!mL~<
zTt#n{Hxx*h@ME*?GtJ)K@)5^clbO!7A!q9!(zC}ea6_5e{>sZV0>~H3CGI63`Cev>
zIe25ggh7|G;%1#j`vMMkfkZ%BowMO`^dwWlz3+PUrXzW0MjHb^JfdL2I2ca1mCln=
zT=DXAm>DGgfQXAXEW8(_UqJOYJOGd;5;ST%;XX!h^D=|cym^g$^x>oS{OQj&l<?Ck
ztf+FZ`8E`GK{-;3kLgq8MQxekXi*kGFk}$ePZ^Mgf~A&aq@+&~&C!0_^QFYQZhmxs
z=`#@-#{s(KQ;+Y1NzcY?kfNh6wxapm$iYvPlV3eL;dnU@^=-Z;NHd~BTwJl7|FGx?
zNgr3TunUhGmVG>4a4$7Y0vo%Q|D_ZpA%#z|mT|~Qzb$L?LyJs^nq!P_FXTv|C8~X&
zV4*mzVp+VB=pgH@<3>$=X3bjAaL-*>1^59TV7*sqZssGqeUAJ%pI!f%S-Bm{74YK%
zM38`be^EmO;O+YF3cmdQGhq)3_OjUKh&BZ9++!#KQDrQBxiFa9L&lpa=)c5#7yIpG
zLlNXwA3^fGeH?!(nsRNDq-A*2)3ova?CMC?Dx$6j*fEB+$aI7jLhr;%FVbCN*(e$V
zL&1CNA+9cs_G(Q@Z*5k*tAkzb5DPNbW;JGnu^wp0jJ5P-$0{bT93szr=J?eZcpO{|
z-Q~_Pnq;Xg$%MS5Ijk_X%9o=G)T8!Pvr_17dD_3MQ}cYW(WR2r@1i49tfTI#g_^Fu
zHK_v~fv4xyEcVs;-p=l&K9n!KmvDKU%F3FxM2>zu!xA0Ln~koppU^@<wB{}_=~N-5
zLu5eZ;t;00ShO*@qol>BHCQ@;<*;=f>O>k}H(u-*2P%s&fGXxdGxlmBl60!C+@726
zj5azK6X~}D0-#I&;v%Bs`+5spzVuxFGApel^0XaCzYNs9rc)AO@IyqsJ~LEh?%dwi
zi<&I^crFVVBq#ei8;m)tj@vn~V4}%$ncCl84SpC7Z`E-9zLH6R1>{K1^O+xi)GndO
z!qx@9eOLmC$5-qMoq`FOBY9_O)1%{;IN70GnS~T(asc-z@XdU`RsV2d9Mplu(pb~o
z44!H8Jxg0Ar@EJ|15|3B6P4N|JvsfMf1>I@Jx@2&<Ro2n^mMlE>gGH*YuL(q!k!TN
z&WgcRTJy<EYr5jKg+A;V_#JtE`n?R`v3=o#<KeyHb|HuU^-w4!?y=GVIR%D;WBwDv
znOiqQQ`*dy7U6t}NjFZ~JlUo!6P|5*|6QR{tq<w)+$jeFpSF|n;n>X&*Fz!Jj=r@w
z3FKDzowhUEf8-1gRa<?<wlsE3P%xE?S8J53!UK8TGHsm-M;h2>LuG<(&Wsz3yBr~}
zs;&9$bze<32ild&J5Zx_k(XvywxW;ev_$jG6~*N4b|+E{D0_t|yco;#^lgSg!!Zd=
z#e5sSM`^om)qj|ECb@3C5IqZKM6}O!JLmUJ{+1l@D)U$8-kj&Ragsh;57MsLX}uqh
zTBgnUIzlUmf}V)&WF%Q$^zfQ~UW{JM`3j=#$KNGs>4W2`fkJkSn>aVvZVJ4eJi5np
zadRw(-z5$Et`zHfo^6M^qH<>8+MLc(%@{kX{WS{nICQ10pHv?uE%TA(e7Pf&d>`^D
zOWQe~yBnp~wPHO`<*2$#WiFw9zEYdUW+0czf^_YlmBY4*hozjm`?rb!WCJ#fbctoZ
zdPH>HE;v~1X~<XtW*K`zQ-qqRs!9Sy3jQIFueRNvJ-s&)>#N49w*Qv}5Fic^)AX3R
ze|+kz=H;D#ofNfQvwWvffySiG`&SNkDy;sr^4HW1@yZ<O=q^tH5t2<@1ft4Fg<w?q
zQ7)vxhH3(5LB(EV2=^aY8q0G1J<@$dxMzwLvx>}@rac#Zs6d!s7Y_JzCz*43dn_t@
z$*bzz<8%mwT@WU}2yR-Re8p4D<TA;Zh`U9%idSkjKaUG_sM8GrFdrUB5uJ$ZKe5e%
zX4Y(RQf3{0C_g#9(d?v>FP_^Gx2xFe<h8fwGZIARO}X)38RAofzHA5TJ1xioMERs|
zO-_`#4UR&74VMNKmo_G0;^n=(yzD(krgB+&Sh>q1Y3HVu!(HqB9(x<?9@Ob=&DyCo
z%R{r6j*%xm6isf3^x&?LNLBfV#{|He1gn`n?va5H^Kva0E${*}_!GKmbH&8822oQ)
z$8iipkN|%oL)xI376&9lxo@4Rg!N?<T6a(Xe43>&;`DNvBWE!7LAd7biY$`bGjkAY
z1RwAeRRwD4U0j^*@juuHg=1%yc#<xOg14&+UlOj%W%yzYUUipPjQ9hgzJrr&%E~Qz
z_J5_HwI5W@JtY2`9fyT=(r;xG8Mk}K#5Qw=d&&%9;&Nsim7HvMK4N@atz5-`(8e}B
zMM9G|izTI}+D>OHBCj&#$)tv(_~&xn&7=fg=wF!zWy^0p?um{6U5YN}#uS2Z7w6Pz
zndT^v-luSdT@B3raN*k8zb6OuB;S`}U|wq&G?bf}E{M0c3zx19B7yXLP@c9?$=F%i
zTNf;jNIyg^#5>Eb!2$3D#z#Otqxuwgzm#bNDh0g@z@D8cT(>G!6}t7i#_G6!s^)7t
zSC>eReB|;Jn!!41iukA3Lz&5U5-de|$v}n?8RGULfsj{?B%{98vI&=Zw0Kr86etj&
z7xr?=O@{cGT*KN;Jfxod)hPRjKZyod1u9TFegm3Ss5Yu>3a1_~S9cC3^$9%l!K|cu
zyr?ULuco@)w4fLYw0w>AmaH~M2cc5Xkax6XQ7<y#*-iLrmP+on^`Pnx{+>3qgj8sZ
z?U#95vL*fzob_<0gDz>xWUU-4luE<}!Shr&3V!~G&f)tz1-odv-f`G><1ba|c)Z@;
zggB1qaifQP@eH0E>s*cJ?#&Y3I0>>_0M7>D-2RvvFn;O+=UY=)<5npnH$mdFca=@a
zeL5KXBGD=er|1JWhUo;#fCfFTuhjH%6t81tpJOUzSfc|GY0|UTas-4T()4fhVfS;c
zam|iyM@4?>V<B>hX?0M79-2`-q{UXRfK1Xk3bt$^a5tN`<UHY;w(A@Yp>y<Ct7{cu
zAc}B=q`mtCEfYnX-1S1DZWCm>(1?q~{y;&8pZ~zKJ1n$PG?lKp9%IaMw<<mc^Sv(?
zGyN(0!V9n%@@I<9ILW2$1pzsvP<rHzG_m}$TNv$UD5Vn*kH(EH#&(bYD3o`tY7;h0
zn_8HA1JNX$X`hQn+1H+{jN3DBI!h%B4z;e62RVKp$+aXWXG7mRy0nVhS8O5t0_Zsn
zL+ZaCv|DMQ29`*FMj?vB|IM%^4)N_;{98(}Dgoig*D9|1C(c9UsglpXOqx(vJpA9A
z+_PAgO38U0&w7P=rh1Nr{2uN<lV8_IaP`HSiI3G&lbbGHGoZk}zwaf?fqOCI@TX4F
zJJKHG%>aPkF6I<i?#@w%TTS``#_gJZ{GqzIv#a(NyPwI0zn`0sAo&Gk=;=T1E3H)G
zw3A#FF{t^PU!@IUZw9Z>j!u&x)MOXQz0|3F?9Ds}Z|hm}1+d50N>0dXZwD?`oIZ6|
z!EAzQ($1I^daTbx|BgZlW$k#>+IHS?Y(<R#5;~4fdz%)3DQj6-5+bI_353uIlXLB2
zi}($;OYmx$o>)p%k!cOlk3KwvaI+PqVqPsU+s&8g%DZddDt^g>mrM{)MUS_}@BZC8
zfWj8{Yd1~OibUdmo#*+?T7ks?;0<WfA|XSay8lTzt;B~?<0;Li&Ive9q6-n?lx1?2
z*O#yss>5*CjXl{jOKOiET6Z(>^g-Yz-01X@KnHPR$FejaN7TJsUMoB}^pI{)2zo_3
z81@iWK9|mzYX^&^@7dUnIiWR1jwkG1u~=d9OkJFoYji9uGgMt=)NsBbB6pRw0%x{9
z^zk--Tdrx+?`w{IY&N~+qJJ*_Z0o;WUGy<bPzg+tJ~OSR^6Q$mD&&=#FllnOzN&lj
zR#r<+8$N(R^e9ySJKkQrMg#(Ids3kAoU$Ml1@(Zs>hJ^xVwz`kfR}FjVEu;5YYiNJ
zZ`d)8+I`)QJ)t#0{!7^XGF)T+L|0ynvQo%L@%~JV-tVa?W68Ve!%sPih87XTD&p+I
z#hF7L3h(#3JnqZ=is#%Fg4U8}8$G{L%rZV#jH^u%Nq@G4W@?Bh3E~~%4L#W3=j=4L
zz!8{2C8t&aVM}~uAsXu8XuS>J^UM5P<VpK=Fp(5yqZen^CJB@oz7m*gwwd^PNBH6F
z4XG{l0oY;SSo2r9>F@ftnyY7vc6$*;doT7u8K%eKPAhX6LqlDaFa5F2ru1=7c;M8f
zYLsuIZIzGP0jxVI3m#+p8$NizEdQ3n^@v_=+vc0gPUR7P8HM4h5qg)zs3GD7)~z7t
zG9#--y+&yxM5$U<j=x#b&Z@ul4~OPv{VA?3)%FkTeUZgw|H9Lng!*!6U|=~j+QP~V
z0;XbNH3-XLbj?tzYET>ifVIV?@nWpa-qh56CGdmii#73AP$%f}rk1p9UfaP3$#*M`
zXtp01HQ;2%A1D^#sILPBN^fIoZct#KgaPPj+8}IBrlWY98}EjCoI5Ll)I&7xZ{f@@
z0s-teh+{E62U(W1qyPGXERCg-j4@oWBa3!E&HHHotWAd;zTMfKzQ0@yzFJ10ZtO-w
zFv8xq&3$?~W>>c&QaNqUOQ(gd6Ct@A&7QfVVj_Z?ATw7^#oEo@oSKE0&=<p<d19X|
zM*gFn{>sAKi1BoELWKbIlpbhVOr)%<Q66)w+OP>CzdS3#R19=Ve$i8HdFW6Y!p8ZN
zxF1LMSJs1j=+IdZ#98#-L*aLQTw|w|74030Cai=iu=;(@B{^-5MR_&%fp~YS*9PlQ
zawDEqx)s~dVQcqn;9ve1XUBc>&G3T%t+x8pDp=evwd;u(D2bRf85r3C%c~>a7CDkX
zOejM<g1Nu}0k~Eg(Z65&JF&r!mOLHnU_tS6&gE)tgzZ%8XHhDq$n*rzHL{S;jv355
z%n+|Hb7kCR`(us>bkNrPJ#Vw(^{##nhsEBVcEx6}pYbVrB}Ea`T%{L^G5Xe)lLY5&
z`=*tDrr_Fn(cZJ-Rh_C31Hez(eej{`!$4W8MRc09cDvw+qD+N$aP0zkvH(UrLTE(t
zdHMgn047NWvdtxXm)e`ti1IMMIMXoQ42@8Wqv*ZbJ_B677o{vdSUFdCv(OC|XOmL=
zKF^&iGRzTJ%}I(<D+dy<pU-4T4u3of^87k;v5MDC&6Wq6hx*_Ce9lu^003`OWKA5!
ztAur4R0T!by=ayU9TB_Roj|q<4W+j5c9Guuz<u~dFED3gxn={gg08!{;?K@W{zRXh
zWiS?0A_4TilcdgaZCx?21P<@NMNGmXZ9Vo^joPN)ebFVwpY)bZKE7V_D;DFVtY;wo
zHU69wi1=)@$yBj=RKsz1J{p+jdt}-I09#Shb22|)OFM*th8idCIjOt`W4bW?zF9i4
zkx>VK@luU)vDE@+)?bxznVb)_HZDZQiid@REs?^i9J|LNm8j9%wBL*Xsf8T^c>Idy
z^)4yikHeXKm2MS&fJ8Z%pHjyn0tCQ`9M<N$LCA*8Bo^o}2b{Y#{%f;m1Ndb#PDDw^
z!^hH_ZQQ_LpW_&lBbHyYZt*ewzMWdJkwFJ}b8v;NCuIrDs<A?SUw_~yHbNryv*t8k
zX^Q+Z-i9hb0J~xTY-j1#mek;^>CVA*&-Q!xK0+or)(jh^d^(dYK9C0xrxEaS)AwwV
zEQkUF%nB9cm`8h&&<vHso(5ixylrt+ym&d3>>Xp=?aki=!CcPdWS{G(DFUSvbJdc=
z;aM4Oe9u=F(uSdeIEg^W5VR^|i~v7UZjkG+i{lCyP!})!@I_mOjO?oqZF@(he|^_S
zk&cu!@eis?`l{;;vok6}in(k**&CFsea5`MY?FV6I+r1xtyjgJL)A^ub*r&t{2CX1
zV|z&R-CVDFFm?gU2U$cy*_`Jd%S$$2oh(45yn%3!@+YlIV_?q`Yi-XQSAfu#=78#X
ze4uufFAn?3AsMpF3F^Ww0Vzgd(0B3qfgMDJkBV8q_<qFuo}6?;H~qbou${(~+Bdx{
z7|M#p1c&MJUyk>yhL}&{vFDoc004N%kR-{23)e!WdbE*h3k)+M<i3E{t37@d%!N{@
zDCVo3X#`f;)a7^K5k2KR3=?nT?`HS9n?<NR;)6d(l1;iZU-owx-j1<myeg5y&X!}{
zPO_q=!o)Zh`$zF;K>!}llGyy!LNtSm<TG)VKip4N__)^leyKizg}B9A*7r&wdb^9T
zo9Xgn(k<O}Fy@g&`QbJt63QXC^LME+Iu$Fk(L~#0cq3?7bx~3@f9|H>ejfn%7JW9h
z3lAAfWwKt#`zvmVRO^>KvAa3p%TFJ-bcd2etIQ}Y2C{4&^K74WOxil@w;Eo(#;k3b
zHFj6GO@#NGe?Y-+3YueWc2LdX#^+Qc06?}~T~-+#UVyK(L9tSdcE+kzoV8yOme+Z5
z<yNdhJlYZnaKWlsT1d`rGYHvjj%4FWwvw(pv+y{VRadgexrW+qn5w#*X(xW)Yvq{t
zUqeNczOl_xB{|Qzrm*xh3?7}Bus@q848sM<`#IK=4#Tw-C%~0h)Q+C7#MO^2N1hK1
z*ol+Fy{enc)xv(}Fwv1QzIQ{2zJ2~<6Qi#ZLPL_dFo=9)zvx!EISV&vs#slJ-EplZ
z3~!8XQpr)!ou=p{nmzvY%p0L%GkjDTu@HRDS;TaA$QxDFVfwG#jgVo$KD%2SQnj`<
zPdZtD@0IAO_~8TEbxcFD0d*N`)}3J5`WURiuAf}E(q=P&DiGdNc-DgQFg-3Y&V9W3
zg}!nx_J~F#D}uFTchU1(BiUu?P^t5d5{SzkKm3zKI1{-VNtneUcJ-{znT6&jG_y2)
zMJ6IArs3>s+OL1!CN0%bEZ&wS^0386ifBxb-MHMp!^LwImCW3Bp`q_#pf2n>xfP86
z-cL4b8A=%PDo^Q@b22q5FY8UCII)}-@N893pren%)e2}!$leSLBD*LMN0V~((*84<
zd|?a+Pd|7Id3H18?Lr?PiDfN&NnadeT$>s<j~F5zMN7VD>3Oihq0-2}iZ_DIz{7{2
z1p?<2j)_lAibmx_rr#q*zgU*kjmQA*V*oEE^H)TUqXJ1BYPrY=0~H77@6$V$^wFgZ
zx6?QU0Rj9i!{)c~moMBhets!;4OySn?gZVY)nslWU2$8~*@1vkZ}Z|8=r&FLSE~O-
zL+z3aOO1P`<KEM$f_N#RQoEPZw8uwPzsFZWQ6#Qx7mr+UE<H0+g;(lN<R4+>A?k>J
zj`a49Hu%Mpqj9xQT8XjFYJ@t6g!x+bhoB@$2+8;nvGNyev^cPT2tkdE;$2H2$|y%f
zAFMDiKOR{UnO-AP5frc)w0>vN!0>J*B|DQPGn1Kl(9iesnt6@-24-}2$ILa3#lvt2
zRmzl5^jFnT^|Xa&ytbEFq*T*;LZk_fr9a4=06t&@K>ONCRsXgAgwUkJ11@Zo{U>1}
ztDa+YpF8Qf?#h8^YGGKC)MrbhmFHkC5sPGYDlWUx>8EP$n<8G_GM1jTRwW5Bo^H@i
zcB8|DxH{3G9eF(eA>PT(q&OzSTeDPRb3eRv9-_}A2!h&7znAx~GT&|&tWy+sKM2*F
zws6&w$S+Js_H9HC@0J`h6$o*4V7X=W=op){M!Xo=Vf(4%e80#cloec$X}ZLw7T125
zQ{2swD#ymCJm`<VHcM2plZ#(7P2rWnz08@23i~K6B}(i^;iAK<0l=S@8D9T{T^c?O
z(g;#s>p|A+<SAGyN??ub;FA^RA$RNEw}vCXqIW)%UHQAB3E_rK6~EVA|EYUY#L+n1
zkTLWG{j!MwQV#n4#a`_Fb_{ZnbU{!@AX-Ic=tQ3H_i=ioEFIR2(`<X8E-vF2(l@Cz
zHlBfYHvas~TxJ0G^uQqY`4J`H87Pxf%k54dEbV+JluMS2kTig!IH6*&So?y=2$s7h
z-oTdtN4t$X?zZMVpR%|oOZrf!NKv6v6>z$#?R3Q{Hs1RBqldu4C2Llz>$jF`O8OH)
zhD}qGfQpN}vc#<T5+{6FOcR?77cxbp5d9R0<jqU7v2nCN)@-GpZtLuol-EDn=)-id
z632)nsVnqzH-D27sq6`4ka#e#U<Xy8G`OCUSmFc%kq{8@Sg<AY+?)d?3g(DteN^J!
zLHKV?76+x|KQ4^37KNU?6T^7CGlTaBYsFU<qouPZ%h)4Sk=r<o+8lehyK~p`l^S9|
z<=yob1bqix^1XX0u}`(S?>oDBi}T>C98HjMOhiBoLv{q@DIUAe`EMgBzRcV=k8n@N
zgsRLgsG8ewTV7)=PO4FwooU8V`dhfe5A>%;n6x)v^~<o?B*$%=_pc^Wk>dKCxNQf8
z4;(D&Au<rO?wgJjTO|(~06<{uFlIY$G$RxZ%>{^8^xn`^+|rPZ(ZRyYl`=YyTTZ*7
zt`>OGsY!h}OD@k`<g-%?p=rH>S_ZJkMLYM~X6@<2H1}2Lt3eSH@hQ`YeREp8lA7V_
z&52u#_Z~f*{Go1l42ix_sNH`fcV;TTdzyG2bBbuT-EiJqCUGW4(){7S5zsh%JJS>$
z{LaaYa%H`&R$_LXOdryxD2`)|ok7-njwxfYJdI@m0$`i1FQjlbxE5t~$!J_buJ)ti
z=ldazSLV}|2Nf}|?czJ4a{2zs<%qPtEwvgzv{vVMty2tyG!GUQ?e9)n)~z?2yRU2S
zRqYA<ulI3jQc~^WV0vU7(>#W)4{@WsHol}l{2?gqw=ayUjZotNfY^tyy>7RH=1dYO
z_yk3$-9BLj8Zv>A9Edh)%5a-4DF^lC9gaC@tARJh@Q09j@rUqpM2JGjr@2u=4k5^A
zR||@VXy`8cqE&bD%(n0IOB@J$sGyyTK8T1UHqkTa_zSK$TCtMI6plwT+QGteD|X?N
zl<nB0f~CbJMh)@+DGsh54jMjPS&ZmVokAN50T^w)6{_3h4L-eF-gTI$Dpq?OU(~mD
zGgU*67#`T|bn9tqaNmPNVqFMfOD8qPhQ^YkA?ZYgnrLns)ssNoYR>$JPw<~R*qZ%F
zm5~O`Ob*E~fRHykj!QzCJw4#ESgduMoEELNV1T|9O0DMqI=4dfrlKF-w++uNSk?%o
z>$V`>&IP74Mj%EmnUPF2Iu5UV6?W1Lyo34{O;#5qM0__d`@Q5jPXwpjd#V&+7N{~s
z+mOF$njq=I9Ar^_sulssoBy#~|GWaHHJPV^9le|(K1$38Ejcktr>k;>f=<M<RSdTM
z#1gr@p%vP6<%Ib8Fb(k_!!HCYyhn?BscAiXBupv#G7AbyInrc{vPbyp0@*+Q-rQtZ
zkK)6NE^Et`kT=@MA(yr);#Er-@@k+8yML(fxm(u~uY5AM)K14Ed0dQdTu4wxt1Z5a
zOVOZ!6p*yoUuX|Z|L)MXl}&OR+*Q@6Tn5`gC0HJq(n$nsSF2l2umomV5UxV#%94co
zyX;YjQn<TBuT~d*{zN99YP4$wyT7CBt3YN5?T7o(VYbexl)cZf7+xC6i8+4{CRw)3
z^Lm(JWRUwX@}eEyeX%-g-iJzaE=Dq^B+Vg!C|+qK9a|~1ytpWm;aGgO?zbie78sie
z%RTh>5#f*SkOBf^)J4&W^o%p~A_j?W*G$1zvhXDuyXj9NZ2pF*rUL+|13C+Mvv3T`
zPaR?X;eU?YpLY?k$jIU6Anla7Q?T>c)A;*DVigFqYpZ>JLgBqG9`L~fK0r-b=ef3V
z8L%Y0U4v<)0Xo2oY&v~)S>oBpI%tqQ=f7(pDZa!@A+&M(0clA-y1;M)BP2h|Iy+0W
zvB@HZw!|o54;?4#hpKaJ!nqz%^V4?gWTaNIan%rZr($@&rRB<j(JztG64j4uDH*`c
zTO5Q@c$8q%X_k0{B~!3g{@5b@8h|nFr{gC>v{UBI0e+#LOvlT-oC_LFk&<xLQFw(b
zNve+!j$q_)S?~fK@cDR_yepEV$LS3NZB~a4pqLKOr`MF^VNMXdh3WtHg;~z5;<V%3
zTgSDetfwf8eh4IasV}wi1T0^gQMz5N_+5kA`Z;^%)R@H`Hv_oWn{SueA05~qz9;gz
z!L^GgZ=Bk`4Ud%xw7f5{cg!=Yk%jIq9!~nQ#kA<^J<uLU`sNtC`Pf7Ryga_Mt3$d)
zTSb3dceN}0G9rmkgM#ZE?v6P;Ht6MsyECyI8<Pw%MWQ&JNvKeE^~Ymbvnz%8xiE&A
zCH!(1$Md7Ii!MR2v-u&$E&uHNG-B;cC0Hkpe~;0%SwUG?;+H!IFd#9$0y(?Es-lGg
z0LZCLPDZ-6P=o|1yWE=7VN>@0hpoxCuZ=SU!n__}b$A_1l%Yng>R|Jr6%Vv(`w!s`
zf4Ugo>9%EzBTKv2DeV5w0_J&$buIIZ)c#1bv0x&VQ=7(NtK5)mDLM}1Gis|*^1OAd
zj*CWO(SNO$e~1<?#WGIg?!#%!Z|8*`E9-vz%t%D&{TbVE=Q`T10;Gnp(x1_HO{gK7
zZxpc92loW>ROGzF*pzkhaI9yKX)l<E-n7vT8@ke0WYUOqhJ2Vd>l?d&Xi+)COxVHp
z)Z#Mh#Ktm^>{%e(mn=x@Q6w<2$m`@y_=Q3vw3dIDLZ&-Jo-mQhUQ9NZk?5+GE7sUR
zOt8RVOOF)oIZ7th1w?sla8iiD-qVk$$nYhoTr6I9C@JL4(^9BuF;CVnH8vvDss|sG
z5vralj=C#J_$T4P2bg@3l!_YpF}12P;d($QF595esAm{i26uwd0k~Y*g|}A+A`~%N
zw_0kb;B+Plt~K9e#pfKgM_}b)R2?J2Sid<6*InqNj|Mcm1yjNR^xH>x@m0fFBu=(d
zzK>?Op3<Yy+i(rAPsv+2>y<i^BxJ<seBxMF_bOwQ?c;f06L}OjpSfz%*Uw)JHe4xM
z_@e-cH$~DzbtpbSHSG1t$991aO@h>~SPTGPK&Ijc_Z?{5izZm`4)YJ32M;#I2KYft
zX)%6e6;l04Ar&i`%pKR^EjT2vYjR1CEFJQsb(?NU%@bpwU8R9X@HFgL$BtlbeLFqp
zo=v{&o|yQ&49jw49tj7UgSoKDWLLV?2H^`3Y=z}Rpyh8<Y>m>)2qdjqp@@ZPjR58s
zA(zpab-RcUx`>DUx12Q0C0vn2NDTXTiX3y=y*&DWK7>mL?}A2RZt<LP?Z-#rFf*JU
zopfoStpA$WZuQ&Q+C{W*5Yz4*!O;G%bri>7DRDN5u_d8%6c@)xdmrP*q-ldkXMX47
zvw0=ye`K!VCsbydh1RbUY%|SYqgZvY9WdLUuO$3aV`3@@XVkL_KL~J`cpA)yA@X9N
zDV0g1@?JF=nq0$N#=k{R-e$@>DQ(<{!l_V`7(FB_iuDiW1m_14z2^G9S;n3WeP_(j
zKMm}TgTezx;uSXni*MrVWC%l)aq5jI56K2^`+td{YdowqoGjYR)fJTGM$R=63~EBl
z4+3QZJhb@jeLHl(04Na1l}3fMF+ZTdFnTg?&bvo?cZw5GfL@9*5eT^W=|Cyqaj7#q
zQMbYkq4h@}S~#;p*AaedE+-S`Rmci>5HYGTkVZ)+(o2DMHlt<v<<hmH^y(H<inq2l
zRgx6*WF?Q4qp)S6zOarx**hLVj+mfN!kQ_W3&xs;&6M*b;G|y+y=-w%CZu9r2U{F@
zVPh-zm~0m9on*>xgvuT!ntW}Lulu@r{!bU~ta+$l&I_!dzEtne(G5ZF!Y??$jj;qx
zLiG)o8kxjNoTGS!0?1kZfYoKmJXAzm#|U8NZO6FU-n5FmZK}DnRqAC6?Rph@UFMO)
zVUX#=DPZ?fVvSKb#KV%q1?pK<3wYOdW{;~J>2N*6Y;m?cP5B2hG%y9f3xh=(axfzd
zIav^(H8QCfHX<53PuXLDUPV}K0t17{LHa8s^zb37ohm~D^u&S!(TtC<!r{RIgVRbL
zi8&j{>SfLLOi@5<v~T?fHsNCAKeYUPBn}zNx_iUb1M^!o<ET?<ffjiKa`?M&XxM*C
zi8zPBYe^4T^KAwTCgb<g{IxEh-T0P0{E_<6jAX-~%%+dD!dNLo-RfsZ8X_Ow!f2)}
zTL%Y@V95Um*G3Pt#?(a?oSv#fa~xZ`*@_&I^tDn<1Zq;c)==06t%&3AD}%^J;nB_F
z`!~r<&HdhLMi~m$f+=7mok4*X<0=Ri#<71H`muN8w&CaP(-442-}%4hQ~jh~*71oX
z0`_eIE3OnN+xEXAxH2O}W8lqDK{=8_P^bTei0O`gL|rBha#5jfbtu>PotB;#x3x^1
zaeYjieL#{{_5^2?m$e1^czlZjwVAs*^~`K;+#kBrKz87#pstIoB&wzQ8l^FxJ4T!=
zLK7dg;f6B`*G3fxNgby|c}+^HGCcf-@Vw7%TS+S=aANaXP3#XJh)<5bKoUq4-&%af
zzBGy7TxQWCbFw?J6KIm0UZd(EvrT1RuxOzH*FgA5P7frezpTb5F;<et>)B>`=}R9%
z7{wp9T3|K)8nMMCN@j|U75-NXP_Yk)f1RV@3%y|+G_z(d9cg5s=rGrh$WX1?^S9sK
zC?7?NzO9HlLDHrG7cIG<<{A<e9!vQCq7E&aQ5lsL9P@{Eq5%SUnM2R}^~9>UV1tMY
z(1h*e2T;9CmZ);BO%h?w@(8?~hPPXT3fz40nl|WOwGFILic1j#^bD%azAouEv3M$u
zO)G&hV_7m~i<$=u<~V5z72$g80gS@%H;)5dPf*Tc9JxGveE^Gx_bzwQa#o@Ce?~t(
z6d1~EG(AeqGlM4klmBp;;H|n+-Ks!XJZ$bMdC6lE+8&Uv#Kpros|xe5(XnvDy94;+
z{VZjguYrQOEA#2kS$v+rG`B`NQ>?Zexk6@O_lJ~k@%*`V9A-^3&#HuZO0)?P^x&!Q
z@4FP@;DDWR2Dc%<rH!&5r61xrwh7DIxZ>~_hkvSgKUQGpbRXHpqv%RgNxUmtM{pMb
zyHBzW$h}6CLl~;F<EdpTv)i4UM=+&~4o8(t?Bt6P3cT=XUh`Icjej_)QKb(6@4XYh
ze8M=kXP-@CV#$dc!_4i%GfIs8cZ_c51!aC>-2CmwVX^r}<D6wyNJN>WTpneV<bS%X
z2ny)o`u%a6fJ1zNF*^2$^wV(*h(FJLsxp*F{sW_lEpJDqe4}tmmJ=F9mnsEA>HEv3
zRcr;p4}kA8JD>~ZlqaO7Mk*0%lo)k}U2bF$WWLIjJ#SzXA|C|+N{Xb7D42x)Ffu?L
z_Yxw{cgL>#UzOeA|70lz2}Esr@D*qFObhJ2v{yEUoIRE5s~*1i7hmhdlsj1%n4O|v
ze}mOF@6DuRB_8Cxao&QN1W15gfoP;bJsyECD>jL<i)cTZ-c_2fI7^GO;qGv$YrAlr
z0$pcYH~_G*yI)p{lSIH$E@!VUDb;q)x#o+42wqc~bL`qOBcaozq7xy62XGY!mlF+l
z2{eL`5MwD(Cfxl|oYuIki^m^m=y6|H*nXNms%BpDPnV7yuGW9;JYkY55Utx4QWC!a
zJ4ux2ZTEv?ZB59~h-V%|ONpg))pwOIRMzvKsGW)N1)I5&I+ee5?pzwVZG_p>&5Ci@
z63nn11TvF5jpdKSR1ije(s@P<<ADHlpB_+t@P0%ft_N<d;aciglgemHW_HJww5=h(
zrFQ|rrYL@hg(Z4FGk3V1WTy&)`^r!WvNdcJR-vJh%5J31xJ4m6HFW}X9Xh7D`9FLp
zt31k-rzd3Rv#1dIo_mTPJD4Br&9IY^qIg)hh(l<MeA|0SSrN)yp|~F;AVPrTvWit5
zR^N<WD0pXF2xW=_Bfw9WgUmMifv@KGw;%K;_>z-F5gQXyUg2TEmo+joO$Ds8i6|xU
zYQ#Aa=86J#d{V&asx~OV4u`$yjNMvpTuxrpGs`>&9|~<TchpyhNbDE8Gr(_~5r*kC
zz(GAcWpB<ISIe;zML82sLzw9O)BI5k9MJP{i4P123m*+i{CBwISz1>7k9K@V)RXzy
zZP2I2k_cCCp-pPR(j~@?6o0-FO;*_L)uQbx6ZF}libCMMzqaT{x`?poQOMvz=%O9+
zOzzXD8w_7vW3M%TgP3k9D%C|_GC?u(8Kq;COmJ*4>1Ccw)DUH=j~0ZeHj&4iCYkp%
zU?i(g5~cmHKWK`F1@NsG7l)M~N~UqT4NxbvwaNUA6mfPAd2h}Bhif!9dL46lWOr;E
zfUbeFW2w{u@^r@#w%PFz!KeWKTDOPin~--4YDUJMW?b+E-R{4Ix-;oHkBoC-D#w)7
z{wWo}D!pfyDe{!7ea^3HsZg_&DU<ize2wV%pds_xf+Rq!iLt>GpG08#01D#!MNo*#
zo$>s65h$@Q_Vnm$`7(bj-k*@bMnNz^46HpthBxfh8a+wl($di4eva1RKGnH2cdrg|
z_QygSS|u)1)UbPPO6Jw)w<jC^`_hihzi-)i&=s~r3b`cy_4WZTrz>8iQ+8=73H%A#
zj<V*jJ50rwvvrzu25SNDf=pW2=BHTbQWxc`B_h5JUf+VlZXpI8Fl}CWaJ5YN95pcU
zJ^<2>jA%PV$=WL^pD#q|0&TW3N6o6Sq!Nmas(31|Yr~ApnItZ#3SJ~D&LvGlCbYL0
zgt>0A8pvkwh*ou75>(whzWq>_MA#wi(Q(@9S$S>(+Zc1X5ky;ViTD%i{2x}umxpnn
z?LOGo({MX#P<&N`*@JD{lVQtDl`PY)ZA3;GyXD|e2D3EcMpy!F-bgp<87#QAv$eML
z4Rvqs;T|Om>U)g3!1Yk`#JF^<;cGlkWb?gV1$Ut}{n75nr^wU(mSc0c095Ak4gkzN
zoKO%@2i7JCK0T(JeFo;!?d}JX<6dKqS7xM({_X0jTdp{j*<6vQS$~_QB1}ovg|*of
z!`f(z6FrWW2ioc=jo6S-y;W-IR;-rOB3dg!8V2{cRW2224dCx0xi!nxHp{I+nphC=
z`EZX+@M2ofm0w$i^rhYPE#j&z)XBWc12DUZaAIXkZ@#boiDj%~<z<~%lht?ZaeybT
zsZ_5ff7?OIw5t7TH^c*Vx;KgUnXucUyT#{mopV}xc|gjz+UoswPw@niup690fRh)&
zmTJR<5w2v<_pX1AClIJ+Lpye%#M|97uG<+@eT<R*PGnRLUZFI43bP*yP~BcgAw&lN
zhVZnc@go2DR*hdc#zOu0#Q#ok>&c16Pg9AVoG)x>GUIFn1<`Xk@WeTj!l|JDXJ3A<
zr7ZH49mbqi)_+@qoHdHl|IufKP0nS0OUO>#>nyJ?*(@APtM`nVF#ae!ve|@=xM>FP
zdsQlyePb$=do)7|G18j0t*(EHxgendHCp<==>UF=!Y*gz!Q4Fn0FUIDu?XdYF(|<8
z!wYf!sK<EF<7rx3X0aPPyHK;bW_#_Q0ck7gwuEnHe157Kk-WlKAi%GpFQ_VxQKM?m
zANnE-MLjuhWi-EnfO9uuO^JNO51{VHhA59r<MyT8ssFa0;XVc2sbyvFO;rbbgsNi<
zlFnK`L!ED%3f&7!Y)BdkFT81&>=q!N+Z8b`vG&htk5W1i#TMT%urugS#hv5Wy>vi^
z>w9bUl_%pL;Sja}CS;sHCmyIN5i^UDq|{-s4nYiK#%qC0iCFX1%C(B%kfnKWH6mdq
z%A>;9$QzuT@7|BKo2{O_h34BLd(*l^_r=rt#*vsr?Qj$KaxZV3ZGOBeTO-wuSKNGu
zyo@Qt9Ub;NZ@LvmVWvo#@?}2I#CuuN4`ZbAK`MlzB{PgUo#Ld8w2nTvKOzpH??zoM
zg-p%ru0IW}Stv}Ul0RPVZqmY=a!L(APO!+ZCIzRR{6PcX>*4C(d6;pn%>kW%6cgSb
z<p1{qK&Jfr@)n|U>!NcYotEo6WK)n5C$#FWTtx1;?ZV+?Co(og+SoO8rcl=;cS8^C
zJ7NL={$3n}Un|pP8pG1PyeRL()Q)X@6r#DZ%+gVl1V_uyFqDgg4Ox(F3{>y2#g&`k
zn8hcOfjEkCEklwEUj+^Sa#ze^xiOti2>HL3zBxRP?)m#plg4I~#<q>dR%0iPZ99!^
z+i2X_w(X>`ZNK}x-``%>{<pjLoZ)BY%$fVTfa{M!X~prh>O5k5d2bTK<{(NLb$!P>
zEmz8lUT!<R_02P>i@RpkP8i>bvcLV&NO`zr^TCkuk#0wNnG#MreKM)j$N_27K8>$9
z2&5k=#;!1Ggl%E9k2hBa(`7XDH3q4mZ#9U<;T$ey2mR2VDT^o6hu_PwK)L_v?2dXO
z?tnVe6$HwLz1^*sc*c3wIA!FKrmXJJ+@$X%FOOZhHZC8GW*KnNEvQjT<`CWS1)5<X
z%6`HDu%$l5lP-^5BWrJlSnIh}j16)Jrug1=;M3NZ)^K-y>SkJz)W=1@QdADys*ajn
zw>Bbe?Lr7$R6<^wW~k854^&|)jnhtvz?#w<qKN+l34$)q4MjJEFQcN`hdkLmjffI6
zEPA|K`gw!wn?N0*0L3vG_}wA=$V!f|ID`t$DP?Es><jl;Pm<>Y4vD5AfvS(Uk?G->
zFTNvB&e;Z3E0<7vQiCyZdk2O*vyt-pN~sVj1v%;?BCdMX#@XSPm*IDI1Cgly%Zm8G
zdWhc}$GN@sxH$7owk@V^4j*HaVvqc^^6)CUZmXBe7=~&L9rYq#4~Ofpl+<BBkk8nG
z2OgCM`SO-$Q=bY)lvrQ>%xDh*=RFRx@s<}WRrXp0t+VD{8XR7QxgPNf9R`0GP=4Ie
z(D~8_qgRnp`?$zvsP3!EHW-)#2@m=}m=x2ClYgH2SB8ODwClkjwIqt=3x@g`+tS3+
zjI+IwWoi+p^`1~r!t{nTgV-MdX1{o;r%7_}mIB!*&!Nq>u8acb(wO>R=S(_r0o6_;
zcg77*l}tK#2_B;E^*ERvpJj5PZBjR;3_sWNND>?6ZR4hoYtHQ4tUi*+iBgEw^^vL-
zBN6U)Qotw?i$4~)6fySqbvE)N8mNzqV(vMveIHazZc>P!s~UqWR?fE?qwJ*P^{wT=
zIyi~MAN3=|9@@Ng%HzuIfy2o%XDgWm8%#n^+dnHzUI(1G$vgb&?Y}S|o3)6>Hu2^g
z23|=#9<(=oVIY|+3!v=RAuFq4T=`tOtduAKTQLz#zVy8wJ{1cJ9DbIcTKJCd8OkD)
zF)n$G4Pl@`81Us(=aj4i$KY-$PpOQmnMZ9UR!O9UldCstSwrQClu0cGFwCiV$9mUi
z9Bs{Ui?1^vP^2c#6#jg42486wHN&onTi0uAKo;xolHNX}LaN$v8Ans@b1j!y#bnRf
zuM*WmA6>L{Uxa?Xt#GW4>jre}^%4iNvLb3XjY>7yXq21`1I=hN%0t6aTy}-iQdv!$
zBggt5xS|Sf{YNwV6)^`hGxS|4c@$MRYu_4XQNLQJN?m(drEe0Am}wgN%9$5|fx5+K
z@*c<}S5}d<0$X_-HHGWf5dA@|DiIC@IN0#ZpF`EwEok^0!$G3-vMl{(HCv_x%Vj3A
z#vssE5y9$2-Qn@M?`J^|ZcLzbt&y|Hnq}ip*H;`2HGE!QF4<3Wn*B+?p~;e(oO<Zm
z#~UU}O~S@Kj5#Pct9j@Ao_L0?J2xviY(IgM;ZAR4j2($xh(bsptpN`5_ILiMIyWk6
zCK2i5MKU;3Q5>Bh+1oJ)tk;xr^VH{VZ4r5T{&U)rV2oTv9;+xtO{8TFee9HacR!$9
zP8n9(-5}Ex8vljk&0-|ynwLg9yz<6QHYjAQlc2+R>F!7|npBrz!evxcH1z7aXS_o=
zuz6XXnSur`ZNkJqpYm+~Jc=ZpHLkC>4_GK00GzMP%LgCw^NG<NKX^5Mv6EmMUOhez
z&CqE|e1L(`cP;ht71+eB)6-H4*w@w?DJY;gET2o1EEzJ`+OD_CelKIjdS9Cm-k9Ym
zgMIaU;~|e^>eua*F)1uUQO!WWR>uqp3?8M^YD$L28fNMI;@<J44qGQIf&>m0Hnz`3
zoK@Cv@t5DfSY-E*#T3m9(;2HHT=V<9d7{^Il62Yt!&~;%hgXka9_RE5TDSlU9HQe(
zJmQ)Jf`fAIEl$AV$LkI^elp}25EuFJ`*zm;k-$z1+#lPI^uAS<6_xA_%@>$!2{3cB
z`cm@UCkAA!y`U_TT46q;ou3{!=HP<3BbP<@sEb_CGe|-1ICz-tlANApEEvp|yqldl
zT&dX-&$U<=Qr29$voXU@D1guCR`YaZ7-Mg}+MrbLdA3>`p!+ewF}^ZaLM2J<OXW|P
z>%7)DTVImng7}?DNPxa|zmAzDOoc|xqS1Oqh=PnfzWK|7UOZ`@%IV^402;a7F1gG<
zHjH*85GZ;#S2}}zpZP)0V;|#iuT04n7eV#&t?nQJ=|}T+P`l+nf$cel&gNRUgTwv2
z{Vks{tEXFea(@2L<dCcD`+>e-q(fl>cWt@Fqp-*P?<$gaflqD*_oGSUljm#4k#~30
zisw|Z`K#US+kN+Xe7Av>e@&gBnu5%FpCbyNpTBw#5!GrwmZzr5VEW$TnQPQ4TBhGd
z_HlYsLt#`Z`iG99N;Iz6zS#~9($zoj2YH;?sd5+8P=0ZJLm$B77iHS|B!2N}>z9Ls
z99ER0-(If`*^$dB@o%i8NmiU4uE;YSEkZMYeL#gFu+>HxQB+b4#WI;%%;9qWv+A0b
zk|F(eeOjH)@AIz0;<YF75!JNi^7QiD(qI7V4}#P6QuthIbx!i})<`BgcyqsL!Dd;G
z`pyo`^5dm{Wpk=c(Lg#MdaUy!{n_GJ5pPQEd_IX3#(-X%(2!nx`m@=q<>A(N%&Jne
zsdInzn_<8GK9*YEctaa*7+X-`{tj?`?=o!qe3eDQ;}qy%w8Ydx>5)?S#a^($|2dx9
zaN5mvs^Z4*QVSbKE{RI(^XYvqPorukI`h>~2>o}v7vh8N4{@Y->!Rb1fr#^t*N4g*
zq=YkGrX87fCe7-kMHRw>KOVRf*N^;8lTVV6Y7(lLsfT$-1x2lI*i&_cEf>v>Hd~^H
zQ##$MqJJ}bvbe<eH~D9}bNJ+qk+7|4-OjB$ilxO-m@s$<kB`j4cs$-PiShWnQgL{^
znrATLHzFr|KP#~vx4Z`v_l%l4d*JF!R3!gq)LOo}yo{Kd{7o{Jkc@|LG@YDh&}eVg
z1YRgCB;DD5z2wjibL7uD*X2(@uAEbr_%IZYQJf?2ZI_Bq`r5eW_r8a~>3w4qw6(pD
zr8`}%-Z;({zY%y>&uXx`Hxj(T<EpCh*<9WY^iHC3Ybe`u#s1;J@jXK~_PJ)mOnLql
zm;!SevNguS*A{N%W|%g8D%i*)Fgi~B1kCEl)>qKcBG7xyWHF`IDWVGX(+M+(sKxqO
zFo-{4Y1=YL=@Qj3L@CDFQcId{DVuLLzl`B|mcA^AjE-EGa$Iu$2o669Pr}RzP1$vL
z9nd#k_S!YJTE70AGK<SizoVoChm-MO6i8`0U{TiB%VnE+Y1n{cJgGqoUT~x{Iwwr{
zhZ9bM8oWZ0h@L47`7fMpiycFJXvIoIp-?m`*q3h1PD8(neKN|XOU%<roqSTfH3y)t
zt)L(*Y<F-Z%F_sZL0jjm9zTE2<R(6xKL|ill;qs$^Q8;qTe5Oe#KGPlEjvfYb5f!x
zRg#d1@6Qz8Z$}Ei22I}2f35eYw}{cvsicy_<HG_o^4YOmm=`3cC!#x7zu%p5KHlM|
z%=hETW_O38F;J^QN=p7Im1F7^S*Xa&u@)8;=hx3G6W?4L&bs~iqgp<5H<Dl+BA@va
z(={B(c=8)V9(=LLcDlxYER`;pnMj0=Gvc&*x<NO1dAqj=fq8V`>ynZp*UIO_`)l;G
z57{#s-WaWvMPNBnov76P@O_|lw)^x+cunBsG-y@PN?jt6P+Ay(;o;;qNKy=&ot+I$
z85ucRPh-hn<FLiV4b=aZ%3~OO*5BgT5-lf(xix1dk3}A`9jYql5Mj}nHeI8OgiTau
zJV>12Vl%~EsZ&OTdj_YcEg-DBS_x;#ZyitXc`a4;5~J2p?=WO*u+rTb*^)Gkdrtsm
zMnv-+B^q(r=FKe2O29U_LbGVPWtoDh@Wt2DY=Pi)U@+e(r$08nk)TdM;TON*-wWr%
znQU&FUK|7|0u}`Y>n+5hQgaZ$^=zRK@hL8Pf3QqWRUK=Rfd=MeCO6rO&jR&vo#_-T
zcXA;XLnzyp4~@oO2je(AIAG=qJRI8V{GWlTw}bCGB~>~a@vL|f5Q2V=C@A?D5tx!_
zsHiexqk8PNm>F<%)Xeeg9flzzj2GwUbQSOj3-x9TqXq$zfD)L<Wl^{J>`Mu*g?ZhD
z@;dJXQZ{=&Ho2g@dN0v)b5VsI0c6Q#ut}@6$>I5kD+WIq6o<bR<uyTq2>17;4f-Cp
zA|5{;GDaFG0GWk+z9n36*!}A9#)n7ZKuN2MfHP712uvt+5vg0ecVmE)gVgi6Px86U
z=DTZOse|jaVrFKZ&Ut;9DHx1tYJLv{{YsBHwOosxD2_fk?+SZPO!VKKiTZ9gnbA9s
zDQe>N8<olB#idqk_Fz|yAq-34d2UM7hSabD(i{;FL!rUqrfTSs=A1loQ419t<X75A
zaM|pTx41v_lPz^Yepi<?PNngqlwNV($|s}mTa9W&H2x1-kGrcpNg4t+mkOV`#Y)U2
z8W4y$4!YI%g{tkcx+~G=<AZ#Q3mIRoTt$?JYpcsOCQ+)W=JHBkWyxkkM&v=Dhyv^;
zk}#{y#rTucNkU^{VWALAl+Q;do#**tt1X2cKg%o(g1hoP%sp!ubR(8Zx7Jvuqpd9y
z?fVjF?hz8rk)23sbugXdJKHB7HEX-ts-n{-x5>j5mn<!6R0<n?d2&Erq28g%qYtPF
z!s}kGA^8#t0fFsJ_Nwzy_s2?JnuY6bI<+)03*%7)LuyPqJDWSAzHe46?^0rgRqc6T
z<E4B;cAwAc61ND3W48$O8F~5xdTWH5NOwo)xg$&%#4Ba^pTEWd=}wMxi}`Y^9W>=n
zUVbC)&CQ1puUGbQmhk4)HWlxQFx-ci$z~$fKMs7h>V^u@uF6y*a`=3GYGu}<qobGK
z+PEt*b=vLnhC}`{P>=goXB;S~5TUG*nR=^)L6rsz72c2XYS{aB*YU-8j%pp2XWs((
zRfh#jx6W&g(LINCGR|eAyVln!e)6I8^IemMvd@_Ss6y59pcXi+^fwoIzZmW+?`>*^
zgkrgc7UePnhmOy59Vp}Ebor76Vq0uFCkOXiOzC$>1R0!MYPPS|)cSAPvVj0-La~yu
z5;P>+bD(|RGb~rl-Z7Eo6zOuPMRiCDjf$qy1odQLk1h>Sr%((W<pts7et66b!V3}i
zJP_#}8wI>DaW~++8#W#R$coYMM-)wX<N2z6q6Wnsc|oY_M7q$1$35)>ZylYC2DdX+
zW*N=>zI1{D3mMtK-ly!Fcgr?D{lMQ)x1|d6n{}4iIoari%0C?o$sv1%r1@Ljzu`5;
z5j9Cs1=|quWF`}m(!x3;e28FL)38D{tM!EwOkL=S{keS~tZL;L^=C=nKS;zAUt@3;
zs#e?d&^UduITl{;=lb$%MsT9suZ9bgfG*Fk^tGYh;Y$L`RDM&#v`_pw@!=x(C+!cf
z@h+8>K52>7F#@fea9(02J@*}w(}TQWk)-35juYt|O()zE;r>6~Us(n%dDYaGh)b4#
zt7aDjr=$odpnua^O*DB{s9Nu|RuL#*DqODbjjA>Our}Mw4^nxJ_|agI=0#u#ZyG;a
zI`l>I=G(93P?YgaKF|K(!`!9QnPwCcbil(u1Nymtx>z~px%7u@{_c9aqrKznF5>jW
zhK`-1D-_6Y5ai|ShZ^#@EMg|qT85n*;VssA_Eh$M+Nq~q;~p)_GVCP}XGyf7U0a(Q
zaq9CG6q}r^S_*!BbyrV`3SaH&^UlsXPyJY!{$nv}9ptXWyE}0!(UASN?VfHF@48^{
zIQq>{LRywgx6><3mz)BPAt50QKqPlvjww=PGgYQhI~WK|zf9HKsG6H<n=~)}g-RwK
z>OEVVm6~i;XfW?0GCv$E{dXrIV#cAnHMha&Zf^SLbn$;G{32W`w$_?l?U0jR7Biz>
zyE8dJ##|&J_f3(Oz5~V1*K-V}?=Vo%-me?jZ<7i}5(+@lAYZV<X2p#dEn-n}tUWy@
zhX%^)oYOswu4rf+<?y(czJ7j4i;0THQ9bvLO2|`4{ryx{+=C5jc+@z0g$6~=5#1ec
z<&-Vuf_=`t-0#$xPKG*hk9k(5PDNPx#-&PVzrWgA9DaUKV1|iIvvH=T@c|P&3Ow$G
zE}5hDL!qSTX08!zoY_%uwvDKv<Mik@eQtcESX0=z<VeOadBIip4Ou^QqRTWsWRn#P
zGc&W`_^gT)!!k2vz1hwEL9W+p3$E<NoIkv6g@g#Tr47=XNQr)At9KX#-Hn$i<-1aq
zE-7SUy~y6d9dF>$xmTp=)#7=c&R33AZ#63vES+v&@PGc4&(9A9(g0@QO}DsMf&PIZ
zG)=fND2?PGL-FY86b2ns>#aKRg&KRq1or3S^OA4~I^l{x=Xu!M&oB3tvVmghJdlR)
zv%0ojB{$^FB~#GX6qOY(2nU@1P!nV{E7{Buv^Umay&uhG;E(sut&>XAIc&~ivyxP<
z42I-Wb2GSHFSK0c7OyU|c=l5D`_PlwH!xgWmM8jzudc3;*Voq}fI*^X7spi107$vO
z$M2034mm>D^8H-wDI+5a*;l|BODR1>i5Qs9SD~pAE$=-i=i&Lz=Y13Cb$5i~e)42L
zb6>7T70_jBBZrudf=bNe^H4D|=;cZA)iZM}FJl!$r+$W9s33#=z>l!I27CO6ryT4G
zSEwDW$BAj=$EY*)V+b8N!=(F$l&nIr=lKY<<`1tL6*%RCRaw{!QI$C`gzyf~W3^6(
ztBZY4n%IRP)>Kvjx66h0Su`0yNt&l!v%6dSJ@-E&k7&w}KE~mR>!_vPTi)?h-W8*{
zjgb{>`E+iTB{IrToy-a8ABqxBFuE#qHq+!3WJwx0kvS)x7@ppq^u$7UPus%8aznAO
z@%^1Dna_D7V(L~bfbZw55rv}%Zky%dxEH-9E0k*O#)|IaXMp4GmlrzB{|aJYSI5%1
zvsc-i!(B`uYvSB<>|6%xuY(yF{7f!$R9kpiTrU$|EICU3eV1$%1HEU27Pg*4=Ce9}
zzTv+4gId0`W1IvXCl53op+JCHuVoxq^EUhPzcX{SEKf1NJ#LvhPCczUZ~ahbEe!`>
z+gfm7*KxGjXqV*USW{HL-f&IVClNo{;NuOPxVJRZrC`pI3z^LRVeNb#@b}-#Liv0v
zn`I%Cue$st?5pnEZmaXaq{gi4bgq!U7v^`Zd|op4)1<vV$c*;Hbpj6Qa3#(!gj3$q
zcs|1-sl<8&_#@%;@$?86y(V(mbaPoh9tTsR?>&ZJOloC(Z=**j>1`G=3)*2B93l+X
z17SRNy;+i^g2*GWN#xmXBHi(P^;$)&)te?o&H&{Sib$s0^7tJH@}n*8+J<Anq62|G
zxkcOB!S*zWQs}iQ?H{iw8xj#%3Q&zZtbS^PgL0G8p7}{g!1+9`a@Eh7+4<_GA4zB9
zvHw|7)Ps3{i`y#CKux4hWVdS;>bON}Esg9R-*`1z{igs+|ByB!I}+maF-VnHVakD7
zM=M3ZuaW&RpdMs$36>reP9Bu09jJjBI4;w_{t}}2(JY!d+Yr;S+DmE4uOGQ^n$PWg
zdIQ8;=3{mI^)C5`quBwQiP5C^H4dAf!F)84Or#WKC?)cl$g&?d(kn-K!Wfz?W5&aG
z$J~~<4%T2KRGD<IH$nNPP*e<hs6=Sw!@a(rI&b&$W#m&6kk3kRc6p-y-(^?!LIpS4
z>JF}rh|nlTFFsyv;|!!_Wyz8mQf+66;z?nmq`rxp@mzYxgHmMmPv$>_&*4@11j~dp
z;8Mo^s(Hs82j9E>X$|o+a)`v{LuF=0d3}NzA;ZXa2-9LeL^mNR98hX)gALynz8lqR
zjBr+~R%$I@+;30u-IQmrd6Wzjiys(Eu`DRaANIX$$02h3TQ`#&^_@(M-G=tT-sA3U
z<gSTPw6?zzmo9ChwlpH*UKi`2{i;kj7X52`T+#KK@mS%1KCbb6%@J~`xT1n~2QUo#
z-Bq@ell?LDK1Q>Du|$zAp7C_AVO*k|hnf)u31}ZvOM0h~2pm%eh9h7d9xLwd?l5fL
z-tL~Bo>r!&AzabITwGj~Oh2+iz22IC1`60cUPO|i`P$c*r^4@_k|?h{Bv2dfz_-}<
zey^Cxeaju=z!#ov#W?)yp0#RlRr2-BG{`IsDn{bHAwzW|qTW(`(SBqdHDq&+6`|{F
zIWL>tJpOE_g+pG4?pmYEXWZ3t3DZ-5Yg}*?e7V@?`7YRp+vLuy-g+$AKgu*%|6h|%
zTYuNNhqp=1qV=j>L<+-_QDI3&d#)=FzSfXh0>vpR8fqx`>q8p6_4W0&NPAkr{b}h|
zE?Ae>{RwIlR)$veQ88HiX|1=^Gmu{uh3+j0s@kJuCQmLYuK?YS70dM^#3;%uosvaQ
z2C<v#EL*67+R=E$-j$9k=*2!dxu*8^c6QxeR$EoNHBM!V>sp<Z7B5fG{5C83h&o-m
zO4?nWv%ePCTULJy2Qv<&HA)pK@&agTqcMv|#rFFuKO3{c%Sx7dMnz9JxMX~d#kuB0
zQ(hj9<t7@)PlULqU@&^&)BY;=PpQ*^ZNBm*^Y&h7mMX)-@>FW!96mSs>{h!h6B4l`
z5e*vlqGezJUvawyv{QgbW}y;~+-xnL*6bIb(BdBZfiIN?^Pk5HU6)v=)m!<?sU;#l
z`T~TRnGg!r{DQL|XsV0VSW3!5ghsX@#vPuiGW!x!OAK!)LX2MXl<%>5!uJ%>_!v4Z
zUK&n^i*(FvWI;doLpnJ*xp#1Mb$i8S)9tcaU5wZ2Er&1`n$Pj@uDg4R*D%>CoH#ry
zX5+Fv?oA{Q7U0_z1pG#vVn`*`TRa*}x^o3zI=bG;JH2nolaLAIuXvH5iAoh4{`>?|
zzRiZ`q28hwI*~blElio?Lgut%He#b28LIH!j+Z;87N~&QB2zX`cELW_59gDsgGZ|T
z%CU`OwYJ7zKJe0bU_#K4ULRa@CBD7~vq5nAp-LsCT^}vt<$9#Vij@#V{Mr)4Nd`zr
zNIhM0czMnl>{d<qeCpEl>QVIk4usHOwybrU)Iahs+tqm&k5S^)^@7!1585fc*GXHh
zO{>Hos6`QIJ?p{uG;p?D=BW`gK>`|5j=g^c;*DxMxc_p3QDNh|k4Wj_4|jVfWut=y
zx59=kM7TzY1&r%-vbKZ%8jOFFis&oj%C}KD*H{gX=#fb#p*JZT`LTej4ulA^^uYmd
zwR?aT5Xf!4*eCA_&#!l2XY0AWh?JP-x@GhCFo-q+o{%B34=0oI%Nliu21!tJ|Eojt
z8}*o?5w)Bj1Hr(U>4MXOT%~!2RUZg6Ni21wG$9BY&LGyjT@$xH!P-K@8*LsICEJIZ
z3~n=+%SSERupMZ3))xYY4fhi!c>Y7Kkv99PogCkqRo);;1n*dZr$!T`D(Id&ANP2x
z^sSwa3SJt%<+~}%Zod_Q+WQH2V;Rt=A5;+OxiP-&9O)ZNNZ|b0@;0%>D%uBUM<lF~
zzml!n&A4Zi`F3fQG~#s7CkdmA_!I7nABeE2aX`IWT<q2ZY|W@=M=B!FTh_};(S6X4
zS8cV`{I={)o4Ry#?Gg#+R$7>$p%6j=Dd1v|>SE8Vl!oqfZ!vL3HR>KmSKyPnL7B}8
zr$yfZ!%?j&i7(1iH@iuMn$^L-CTti4+9L&<BwkZKqqTJ4#E)wK)V}^{zGkiq;Gv~i
zL6ZhI(|M<bdq+p33Jf5LPYCUAvuG{Lx6ODFNms`ML;i^#7J_(|(dKZWT+>v?4P8C|
z>yn?;@$<DVOgeJ69b5CS*oeysKdXZ}KG(8DYWc1rDaT(GWjD07c|95`4P0?fg+`c)
zhOZhHe6W1ZYJZ>KNSONIe%8bbL&vU{D_{u#U7HA?pC4PuoXczWEL}L|X{;`(0?By;
z$HHVu+t7%HSIAu*WjSb|1769*pw(P>Yxmy+-`c16?SPr`vK%GJ=h}@@{(eknoezP!
zPFIbrIz^&`++W~l{C?q#7?t!-t~K17#-+=PKzqYO8-bwjJc2o2ADyrB&`_Fnt@V1^
zTE#UsrGr4i*^ZTT7Uv_9U)EMiQ+a-a{QoQi6JspFLzEmvoc9{3bl_sKivE!jUw^u*
z>Q_+N3>&}`=!E^nLIM4L5QVZ#CjkkhWH7l%Wed~c?(u@UTZXabu`=XGnm*q}D)hPf
zkRkmDzx%0IkgAZ}@q0ag!yXdlnm0QdSK1>6v7&$^cBzbq*emDxD^iCRfag(U$u5z;
zfR4U`S*ggM6=e&b+wj)LSXgdsDw*f*Gu1ca8}l!ehx<6)Q>$8IQF4Ku=z1JIg%$Y2
z-B;yncwGgPzNuPueurgtQ-$-JvH6W@@~3yO_?u4zZPGz)Ikq|yi4Ptjx|J7lSSqF%
z5_Ya8=Q+PRQHmOD74G%VB9iZk-EjS=6f6#+YRn7ssJ^Zl3p}2BKTo*OU2dY?`i@@>
z_=ZS!d!Uh9qmfbKZ+A;i54!3oS;22K!8(wUPr+hiuEBK5x$EFcWakkHmbKiL?3FL!
zGC{fMR!yn=X>rnD*9g6ek+B^wK_DEC|NbMO-Hu0G`k@oe5NoYduyI+8<aEuCN#Lo{
zY{nmasYr6gqIY>3>>Db|?}PmhRu_uFpgISzc0pj+N~ylEW)-F^$X^{2<T41?F?erj
z*`>!DVZOkc>)`b2rw;*!SZp+xOl_Hr4tC>^>r}1SJ`WFhabPhL8mX<I@Rg!sUsao$
z>olgTs42VX=ioU3xEZn#zXWm{?|qjZhWT&rf@%2Sj&u)Xbj?5%95`s`J{TDPz$t$o
zYGJH#WbN`!@B9%dDS{3^qKsW=wU=|cbP&kiA~+R0ixTd$)DEoEb)tS)WTj^wSHY^S
zx8-F^<Pgx53j?X;h2~;lpbC|=XvN!UvC&czr6%QYuq84ME|KKPX$8R$FT-SQ1RX25
z!xrWQOyH8CUkscaJP5TcFo)Bi#^j>VzI|D$+D~e8Jjn`UVO2ptp=>(~7r|M+h)mq0
zb$MK{iB~~!>@w7IYHy(*U3dGmYQ^iK(`|n{1C`CtKoSD}2(g!v)b;jhY!P?OA`c8i
z7#Q2jU<ZP%#{?}5|2KKi!maf*MJ6YgV4kIU-HJ3BqZ%%U3Wf_e2G`2yf&uA`fl0WX
z{)dWJlXC~^;U@z^^cxyQdfjC_>kplz#{=CJa%N<qF$b0`-+oUhO=~CWxM^4)Tx3Eu
zGg20&e=B}JJD0?=Z}<`6e}>Sdk8qUkKK8&><ri!-oq47a;cuH~akW!&f)R=pP@GWr
zt2V@_<4HG1Pzp+|E<!(YQYB=*zmgq44}~RO%OD^dF#U@n&c!sb!MDA4n(x+vneZEL
zO2ZP28w2ZO+#Mhmjeok`>^Z@x>pUpu)S{Qv3_G8>`LcJ@KC$>~g*Il9Pd<tEn#R-7
zx1fvUJU|CMvq6_02r8Cb)T6w$3dK?($pr1>&-b&!Ot$Kw`%QWfc4CCX$os_%1Rdwj
zYa(WlXK$L${Sbj*{)2-D@;pJA8GQPMCbPptNf+AHL{ZzJf9kDe-l^cl+vn{myEoZi
zE%sr^hDg4OYT`9AvM~!ba&Tq%%+xBdgFqC}SEoTxci{mXg=rR*$3EY{6eRr2x%lOI
zP0CLj%cemfWTC)zmxw%&2qj3FtrLvk^5lihQTT3SwW%souJf?MGnx8>LrvIvs_9=6
z8&LbG2qptKDAeEQc9OA0FF~Sb5=Z{35A-TcyN;gSIlEmFXIQI%68)#)IoWMa`;S{z
z{HB#bxl!86xrV0)C-)I1b`9;+8!e;Y)IV$qtV%=9A9_GHI2<~N>=_6mUE{>bXP;Ug
zPm^=Jb(})646I<#%Tbau@cV<Gg~@|0(y{<|0K!Q?pM0R1f?rO;&?4wTA@`?cx=g$L
z?O8kT_J^?W@bxV8-8S7D8#<gmZ*FlNuLQntM6hshXj54{YFp)M)k`+>)b3!}<{CHF
zH~!xWuppXKi}G8OYfUBTrwTk$@VP`9%W@29LM%X_q-@dC=uKo>PJ8kELpMg75YSPk
zUm(?=9N~+V6R%`S=REA6^Uuw2vGb7K&DqyZfB!_F@tF2XY&7@eTsM}2sV+XfzOG3Z
zDX#Zch*T&gQTL{+l!xD`Hrww)x}VHbDNJSat;9r9C#MDw*;X2S;-|eLkRE-@5AJ{Q
z8|yBQdnkY7tJjIjM~8^if7}nb+wu05o>Rs#;*}Kv!GIX~O)ZBJGjzOA+F1ar%Qcx0
zaY(wOA^b%JTK=x(hO=<JP??3*TMrBbnLg$4_4V~6ARzddFH;H<tF~OKimNl3h?mP?
zm-W5e<h$7$Nk~+t4$9{9{F^{7J4?Xp_Osr8SL}K?jwCI|_an{4-o9shd)q=7iq-qY
z>3AZ8V`O^zXQgJPSO$k}@$SwJG!_=tdYh}Q%X+i@c&*`(h~;7h%6~j>GM&Qb=H^a6
zT5X_daynMrXtZ8Cy1KGc@bKW?{`b$1$LXkhGLuV5evVbU%_ZvXdWc|viHRw&pnwDs
z7IueRF0&M?i#%c3|A@4#BoOmMsn3P{vq|4$9`1NPec^85cK6YypERNFAv0k!%u{37
zI$WTJT+L29$QSQ<a%Xwdd=lB~6cY|~x_`fRPKK--$rK3&LIwTvnOBm$8m5ZB!26*Z
zGKsL=v0gbwsg*4uBSQ*_gm0xNnlBoG=`~j@``q6zV(@se;iRdFa|4u@h)GCB5WcgA
zo}b_Ti5?srL?x9p(9_e~etW)E77>A1Xs}eHCnEaI>Gf1IJT@l%O9M0NJBQrE2t|%v
zUoe7U0kGY-P_8D;ii6<b=!k}jh8A|d)-;!$o&7XZAZ~V8rQ_s+p|&HjtocVWf!wRh
z=RLKwl+pnxz#62<Kne;9u3v5U8H0g=y-nu{uK={&ZGC>IWu~QtB{S&R)H2cEK2ka=
zbun4qS|*a2P=^J1GDtn>#T3?>S5ZeI2P)w*kvP8YJ-K>&8#AjZ@wcW{c)1-*!%u$!
zM{oZ7D{_0eh32L^n*=>#=~L$TP9_K*;U6a=896!X;OFez9NdK{dBXPX;Z*s<`C7?J
zn`<===wE*r8hO^HA4-8b$PbUl1@J69t{3Rnyhw&a(F-9VArT-DxARFq7dQ9L!9>O_
zP=0z^t~LDHX?wofuqz}i+z<eTWP%K8W@+^=1CnvLkGnbf6^%NR<JCs%u^`}m&S+ua
zyL-5Jc=iAbzjAPJY(HJ^vfmf+#3z%yjEn83j?XaCYC1lpH?MH4o%L3W&XS|C!8ZvA
zSUWb#VJl-2gH!I(s-K8sT@nVaQk^sr_z9|m`?Z6C>OJ^SUdHeTssc)uzb1$si`|_n
z<UIsfELND?pSXRzxx<3G$0$2o?@tz(1a$GNb}t{ywvw&ao0+`#M=?&e`$CM|+&Is7
zcKl!OPlvL(ouwc^R{(AP(ae;Tp-CgU2>FWUx=fM!eZfqC>@u60n*Qm{X>mFpz&q~^
zqs#!DgaOL4LPA1!TwLzgsetB_X63}US+6zX3PCdt{wn`jim!sVT<qddzO{d@jpxpz
zCL2|4v%+yZd~P;Vv0+*ihAq|GUk*-)&8S;(r|{vAMMph{t(aDl(dQn~?_H`<R{-0w
zK^u8ZJIs)F!^|hlYOhdgHq?W*dU+^k0)C<j?^*>EI1B(;kVkxnf`U3bOmTfboXRQj
z`FJ*MwOIJ`1gr-gBoF|J0K2hPs!)(xrc@%;^>!m|?&87@3j;F~F<@XeUy9<p-TRfA
zkkIdPIuD!x4^K7{k4IIrT9>b9YHF&GsI9ftKut~U2R(iCa*cjKv-w=Hm%4gtvi`TD
zwWjK~-XJ*TYTb@yKo!C*mZ~OvA687Yy}Wo&cswfB=VrisBVs2OnSdT!iHfbnuIkxX
zwZF<ifi4seWL%!Efp;EGsK(@skl?E#b@^Yh+sKpP^hRAR;HeP*`b96m;8#KfhF~u1
zqkQofnC2)AAa|NvvQjzR!E*Io`p-pcF<*Lmc)*S&)6GTU^J%iXUHv*3O`=K6Gm0%K
zDXFnsqL&3MyTR+(VI&-brd~K0Uf(@CBpQ#08Zcj&FcJSxpv0Awkbt&m#%ea6$~1er
zq_+@2a(<>iR2+5E+{|h4@%9X4(^w~B(YP8l4CK&XWO9D!csyO{$8BfPpu@I%+*?>L
zRq3?Appi>V0p5^NrM|AN4+vL_kWf%5ZNFyWiy$kPeKl@QpWA}gi$QrL5R$=0wpplJ
zgDK`hEokXrZG%~>hVc5JtGMgVkSw9e5O{*ZEcO@2Xw0N%Kkd-HJ)_=P7BWdxexJ8g
zTt1I_tGB1?m+J&_rbnAiQpxzh?rs5Hz_fwqc8=#t0=v5SFHcUw!EP+vj7?2R(qvQ_
ziE-SXu87iNWBqV(aqWOG0RgOB57?B+=5-f(d3}YuxVQ)si+Or-+gYhI4FrlX2so_B
zdty{n`GC(`URD>%`zas*951cO_re4T9Mz$R(a6H|@oHPOUR(R^e_eLENp@T4p+PND
zwM`ZS5u<=9Ir?O|=d(_}HzsQR2Q2FYV>y}`(V^>cK-y6RRuecE<E|_eNBq{G_CXq5
zS^U1)%+S_AFxl^=wUGv@1+2gQ{yNB}l$t`jYLcR>_`Z&Js9Q`<vdmI2uuN<eGoy$}
zJEF0D4jf`8waqLEv|y%#m5}4%X=JB@L(77JYHNAaPRXO{PY>mHPM#NOAN_$MrYMKa
zdajqZ_p`_SNyoj1Bf!R4Dy3q^`_m-`r%T$gUcj4=s8(ufc+qR2&JGlmw;feth?-A7
zn?!D)@^e^!FX{bVsQ?_&%+x6<LV3(iYm>G2n`W1}zD-)|l=N&;v*N=LGK0iThj_mA
zzv2U(m}l#_i)`7X8xt7<N#}6ZSSorjiMF{EG6WdCPOFGJptyQo*Fo)?4HiOfZq4L;
ze0YGnCI*82_VevwKM=-9<>cg^)@-{L;z(hDtJLLu)jvW(flm<m-+!Veg#LGUNO5%U
z(a#9a$;tWe=*X;hWd*yWsEDXRGs?igAhag{s;h22R)ZQ3md{J$@AIBDQ{U|`?(n?2
zFTqrQbm}g<%1_<3s1v7@CizS9Ra~d;zQF5%{ma{Sj6R%MyfVD;;0qpGx`f8M77AjN
zOOH)TN?MY2#N9hMz{D}pz7;l0rqv+EX0u47WoC|_KfN`{`40h<mzUe!>`5vBc7dU*
zt1G0ejIL22hCoC?0oBsh1_y<Ri|l;5D67vnf8@l?YBo*y-0A%)G899Ab$>J?jE{vS
z4)OIX6#zHH#KpxE@VK3RLi_t+(y9|mN=gzPh*L<`sdp-e2!I(byKn1p+YOSx5=84_
zv1E?;;Rc|<k8W_aIf|lU3ve-Bra&xo%$feIGm+h^xK;D}#RTH~0@|Zom~XGCoWJNI
zzT2954mU?zE}bt)KtVx47ghMTyGydSw@2tt*gY^Hno9}$x2`Tu9XN`q?+37(vnn^8
zypJ)o*jz>d-N@LODA_pAcr3YHw_1FuT8{z`53e>W3$IG6sS25ZK>pjqW;6l692X#(
z|L~b~3Vjj?#L2_6tJCH(!<~hwOpP%*JWP2cfQO5FHJQa@o@}fjCI&e=F(LY&;E!es
z=CgQQhyVTir3y3(0n)tzkb^P+zrw@9h+19FM*-23qK4I|(*_3yLKqZPXEd^QTu*Jp
z6m_n+Yrar2O;EZs04Xv`Z(a*}8|-Imsmz>jN+K*<K6ZfZhJlK!d?f|f9^5x}kihR!
zd_HgTiyuCX8>VQ7VL4<)i6?wOs~unyhgI4wLhKgvp{5#`iBe?w<~YD&!~n^KMn;mt
z`1_@%e(Rl^Q<}DDkSLfDFfk#Ql9LmY1zHy|!qpJbSY?HO{UXfC$cV7n>XgNiza&G2
z`6k9p<hzH@X_tF;vQS=_k?|dJ__k&S=9^o#`1{L!!d8c8N`-1=Iw<xKpWR_kV&%VJ
z3$)4~{yw}Fqv|8)0Qq%v@ZmEVK-~W80spI1BDW40ixU}2S~yzn?L>YA>R`zsa)IxY
z(xT_S%|Jmv>n=6zH_#=z2ys_tmLry$avYfQuRQEwcROOG`^ynzP#!i&oAqOD#qv`k
z<MA_QC&QV4j=G^rxtLe85|hvCDfVo>4DAv4F-79IU%(b6ZJdN%6b>5%0|P@IV3h4r
zMM7*A^WDs8@HlK1{RyNnKwHZnwd&~0%?{~!MJh<pFoEDzrB>7Mf6ls6mBiNe(!b>j
z!0z|xqCGG7rvv{*Fb`ng!N<o3fD~V5ilnisY_~dTg`i0xzW5W`Rcbao0pTcLKIexv
z0BKkL^VK)|V~IeNn=X+jcmN!Rh?NyB3NG$AfbW`qAtVDF|IdP^QkjHtFfqw5W(q>q
z{|Oo(0Hw~*10N}AU!_xx`nwp%_+q6`%cjdw@H*JbqhiXMkBaq;+u(AatmoO)+hh*v
zM5aQ{VQ|kop>NgV(9e6`@l9#RCuigInVj1E6|Pv5$ty`pN=iUO(1O%HkqHoSBAGb3
zI(}@0lVMRsGW};t>BV%51*+u>!!^FebH}lS(1ZyiSA&OTJ4A@AhEE$|XcUyuyRe9x
zo!*|4lam*fml%fd+iLjXm}gvc-uegZLC7Azk+U<}OnWTrK2$|EdHze976S@hxEG8f
zf8m4vviQV--M`&gn)-~?TWMFX+d{R+tVg;FPQKiy-Yw1*dc6w#Rtv!d-y}?uR4ug5
zGPZ}P7;aAyVj;A}*%Kt5p^nV0;ZZweProdVG1SqaHgTa3Xz8k`-WAcTq7<FVM_Gt?
zfRr2^CriHMp;+<0(9`T*=F}tp6&U@KH$Ah%)n}cjd5?L!1BgQI5f>9MGF4FU1mC-U
zl_?X;;Lb{On2!G{W^Qvh?zZE#;`Whkci3_d)0|2b;X+fiZ_k}UQu?SpGMEuBd-6gS
z0v{f$qP1Og{PyJu5wsrJA>F>NlQw`B@)tZ{HOaiTb9OM<d?2DP?t6?#ohxq2V9CKZ
z<M#>Ij6XZ|qBCt9m4*MvU$++S6Uhi7r{SBwLX0i2@}Yq+Myu9aCBe^Ek9V7cKlZn>
z%6V@&e<WpgyqDgscPtBO_<~J~g$qXf{m@R-NnU!Ydj1z^(p{^er<aY~`PJ_?`AQPi
z7vIgw0+;8dzGZ4TMB@j$diILuXZAChQDmfBN&qR#6hYE`zDF9Z9cvljA7V-SlGU>a
z^q}u&7a>LP4ttJ5(XfKM(0RIk`K=Bm;^VUa7OY<#dtE1nFzd}%NFkjZ0I8R~?FQz<
z=M6Mbdj_)3#af@JgcX=|_w(*|lD&?;aF4|!>Hb&pGZY_{JLSYNR7ZsVgTt_Fsp4v?
zDvvh-(PX1(Bg_a-daThaY?a~lIzYB#!^&p5reJPl@JDjOF&7%stzEHb@)?t0?P@a!
zLPNS}I?~Q+Z7$5TYyi}+1`@#L$dsc4uh(Z8toyM*W8(rKfviL^+D&IgkEa*mI1R5<
zKhQa*8Lv`<si>5~&{J$FdivM13}00!2?so}QqN5acO@be(d}(M9?KNjH~vLA-R0eU
zVh1uU;RcsP49|Mi<8OEGzi)RCLg=OLXl>ge!6T=gQSBt;o<@ZRX8E0{gH$0N2#}6~
zz^cIoItLN{q}P#fn9hUVQ#%D|72;>(F_3|<hhZO~+kP6zN~dP6_!SLAW)OK$_Nm>1
zrp8Vgp$vPDkZYnBzNb8>7V1|78vb$F*Y2`7TwP4>v})GqEU@AF7cUnDA~xx_{(;!M
zqr(A1NTI8N?p}**N4laVBN+cM^vKP8N5p8mkF$HRqS=7RW>ptYPu>8?44yENVDvW#
z1=_=FR_1jh5v8I@YvIx2Jvla8mUh)g2G!@8KI?Gn)q*Y<3=c&=HSzwiX=?Nel?FaZ
zs1DV=T<|vpj6P_3a5<7LE;Ohb%6VxLe<dejI^W^;r}Pe5*&@qPiEzr%^c8eiu)gfX
zJ;)V#P{{$Qs#6(f>9tdlsvgx3StvXP3SJ;oI{@W@-R021(cS<n@_Rr0XT5Nc(;>IR
zd8gTb3kow|DYj99W@hYWZa7+ral71<n!=_jLtoYMOTSO1DxB5uo1qn;<mct8ezaYw
ziGFx3BWWJQ=+8QJapL8AO$nGijmgon%OGYCH>+oKy!?ZTCH*;FhMtZP+km5)g?dfH
zv=_3w$Ds`5B~8LK6X~(~RtwBdht%Pb+8CO(uZ2cyBjr2og<>WE6+($n!dtECFClV8
z(^jE6Z49g6!Hfb|akbmsa`gKp@E2tn%zH|UfG7P5-)#WaPgW!H7Z>UKXDg-d{i|e{
zqG9$umM9Fgyb^)7Wu3t>0XM)+CTKdMJ?m_EQBXNpnz7nL>0rf+2j%TCh3x2j*6DcG
z(EJ%4RFS-HAv$$(G|0Ooae1x%e-uc;ltdTcYWz+M@Js#TH!!I5cQNYJtgkqF@8sF7
zbM9VC*$bBG@DO~K<x?blyg2lWkQqnuF!m_8rBu=XeF6Kmk;DRyv<p*zer|bKK!Fth
z+T8Pe_BufQRng^eRPoPDI=`GW`=_(u_!XqI@9*hJL&D<H4^7V|*6MBH5=FC1Xwe}Q
z9pr;PP$(qmXARyzv+94QUNcIM_uaoTLUy6Q^;OL+9gBA1#}|KJFNRmrh(u)syH)WG
z>PYzC`aMJbzm7Xhs6poq_{AD-hPWeib;>)xug2=jWDGi6F=z2U%9FC0j)5H)9F7!T
zyMu@j2-OP|_p5pjeJX2UBx!JxglNEVHDjDH%}h`TS!b5B;E)A|x*^jMl*H}WHlz<G
zF>=x+X_sAoI{fR!Teaec2hX+37GYrx=^HJKz73HnnjS?K9a1153S0sg;6ZbxPsbd*
zyU0!B&u6HP4U4$<pXcV&d>x{?4t_5g@JQyc&kzDNff15EE+hG-Y;Gtkq5~8_ExY0`
zNO#xZ6J3ocg%Gv<1#kPMtapB8V;#6;tTy;f_$g>D|Em^~gkGWXcjI?a5HuNy-@s_H
z%IN=za}=iZp0F%S`3E*Qb)rs7TuzG1#@?r)*5V@@%Gks(r^Di4$E=pZ)cD^IRCth8
zK1wa&?sBk$0`2>Mg4l<5pZ<mvAJV&$gyKK73Q4^rgG+nm@0Wb}B>newexX;x7lu;^
z1v%b(uf##+|K34=oh;A-Ctfu4GZ_?$AM5930L?PN>+dZ5-u?lSQOI6*YpkehiC<Ep
zs&{}Ff*xs$#1Hmk9Ib-?b+8Z@ghH!ux-sTldxw-Im8Sw)_YWCc@aN=jW0trH8U)L=
z7GS<j8fr{>{hy%A1vs6)pHe{gYl33GfGH(WG<wBq_|g!6B7J154kCU)ihzJiQdCZ)
JO6ZsW{{!A)PU-*v

diff --git a/man/figures/logo.svg b/man/figures/logo.svg
new file mode 100644
index 00000000..3d9f2d4a
--- /dev/null
+++ b/man/figures/logo.svg
@@ -0,0 +1,833 @@
+<svg xml:space="preserve" id="epichains_svg__Layer_1" width="336.9" height="389.2" x="0" y="0" version="1.1" xmlns="http://www.w3.org/2000/svg">
+  <style id="style1566" type="text/css">
+    .epichains_svg__st0{fill:#18bed2}
+  </style>
+  <path id="epichains_svg__path4333" d="M97.177 342.914c-39.88-23.202-76.929-44.734-82.33-47.847l-9.82-5.66-.01-96.073-.008-96.071 81.694-47.46c44.932-26.104 82.18-47.646 82.774-47.872.725-.276 27.999 15.245 83.305 47.408l82.227 47.819V289.42l-81.726 47.495c-44.95 26.122-82.147 47.65-82.661 47.84-.526.195-32.66-18.112-73.445-41.84z" style="display:inline;opacity:.75;fill:#fff;fill-opacity:1;stroke:none;stroke-width:4.8885"/>
+  <path id="epichains_svg__path1568" d="M261.1 203.8c0-.8.5-1.4 1.3-1.4s1.4.5 1.4 1.3-.5 1.4-1.3 1.4h-.1c-.6 0-1.2-.6-1.3-1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1570" d="M265.1 179.5c-.1.7-.7 1.1-1.3 1.1h-.2c-.7-.1-1.3-.8-1.1-1.6.1-.7.8-1.3 1.6-1.1.6.2 1.1.9 1 1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1572" d="M265.1 215.7c.1.7-.4 1.4-1.1 1.6h-.2c-.7 0-1.2-.5-1.3-1.1-.1-.7.4-1.4 1.1-1.6.7-.1 1.4.4 1.5 1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1574" d="M262.4 255.6c.6-.3 1.4-.1 1.7.5.3.6.1 1.4-.5 1.7-.2.1-.4.1-.6.1-.5 0-.9-.2-1.1-.7-.3-.6-.1-1.3.5-1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1576" d="M263.6 323.6c-.2.2-.4.3-.7.3-.2 0-.4-.1-.6-.2-.4-.3-.4-.9-.1-1.3.3-.4.9-.4 1.3-.1.5.4.5.9.1 1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1578" d="M264.4 284.3c-.2.2-.4.2-.7.2-.3 0-.7-.2-.9-.5-.4-.5-.3-1.2.2-1.6.5-.4 1.2-.3 1.6.2.4.6.3 1.3-.2 1.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1580" d="M265.1 228.3c-.2-.7.2-1.5 1-1.7.7-.2 1.5.2 1.7 1 .2.7-.2 1.5-1 1.7h-.4c-.6 0-1.2-.4-1.3-1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1582" d="M267.8 308.8c-.2.2-.5.3-.7.3-.3 0-.5-.1-.7-.3-.4-.4-.4-1 0-1.4.4-.4 1-.4 1.4 0 .4.3.4 1 0 1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1584" d="M269.8 268.3c-.2.1-.4.2-.7.2-.4 0-.8-.2-1.1-.6-.4-.6-.2-1.4.4-1.7.6-.4 1.4-.2 1.7.4.5.6.3 1.4-.3 1.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1586" d="M269.7 238.3c.7-.3 1.5.1 1.8.8.3.7-.1 1.5-.8 1.8-.2.1-.3.1-.5.1-.5 0-1.1-.3-1.3-.9-.2-.7.1-1.5.8-1.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1588" d="M271.5 156.1c-.2.5-.7.9-1.3.9-.2 0-.3 0-.5-.1-.7-.3-1.1-1.1-.8-1.8.3-.7 1.1-1.1 1.8-.8.7.3 1 1.1.8 1.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1590" d="M274.1 251.4c-.3-.7-.1-1.5.6-1.8.7-.3 1.5-.1 1.8.6.3.7.1 1.5-.6 1.8-.2.1-.4.2-.6.2-.5-.1-1-.4-1.2-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1592" d="M278.2 185.5c-.1.7-.7 1.3-1.5 1.3h-.2c-.8-.1-1.4-.8-1.3-1.6.1-.8.8-1.4 1.6-1.3.9.1 1.5.8 1.4 1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1594" d="M278.2 209.7c.1.8-.5 1.5-1.3 1.6h-.2c-.7 0-1.4-.6-1.5-1.3-.1-.8.5-1.5 1.3-1.6.9-.1 1.6.5 1.7 1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1596" d="M276 196.1c.8 0 1.5.7 1.5 1.5s-.7 1.5-1.5 1.5-1.5-.7-1.5-1.5.7-1.5 1.5-1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1598" d="M276.9 317c-.2.2-.5.4-.8.4-.2 0-.5-.1-.7-.2-.4-.4-.5-1-.1-1.4.4-.4 1-.5 1.4-.1.5.3.5.9.2 1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1600" d="M277 278.3c-.2.2-.5.3-.8.3-.4 0-.7-.2-1-.5-.4-.5-.3-1.3.2-1.8.5-.4 1.3-.3 1.8.2s.4 1.3-.2 1.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1602" d="M277.5 222.3c-.2-.8.3-1.6 1.1-1.8.8-.2 1.6.3 1.8 1.1.2.8-.3 1.6-1.1 1.8h-.4c-.6 0-1.2-.5-1.4-1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1604" d="M280.9 302.4c-.2.2-.5.4-.8.4-.3 0-.6-.1-.8-.3-.5-.4-.5-1.2-.1-1.6.4-.4 1.1-.5 1.6-.1s.5 1.1.1 1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1606" d="M282.3 262.4c-.2.2-.5.2-.8.2-.4 0-.9-.2-1.1-.6-.4-.6-.2-1.5.4-1.9.6-.4 1.5-.2 1.9.4.4.7.2 1.5-.4 1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1608" d="M283.9 162.1c-.2.6-.8 1-1.4 1-.2 0-.4 0-.5-.1-.8-.3-1.1-1.1-.9-1.9.3-.8 1.1-1.1 1.9-.9.8.3 1.2 1.1.9 1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1610" d="M283.1 235c-.2.1-.4.1-.5.1-.6 0-1.2-.4-1.4-1-.3-.8.1-1.6.9-1.9.8-.3 1.6.1 1.9.9.3.8-.1 1.6-.9 1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1612" d="M286.3 245.5c-.4-.7-.1-1.6.6-2 .7-.4 1.6-.1 2 .6.4.7.1 1.6-.6 2-.2.1-.5.2-.7.2-.5 0-1-.3-1.3-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1614" d="M289.9 193.1h-.1c-.9-.1-1.5-.8-1.5-1.7.1-.9.8-1.5 1.7-1.5.9.1 1.5.8 1.5 1.7-.1.8-.8 1.5-1.6 1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1616" d="M291.5 203.6c.1.9-.6 1.6-1.5 1.7h-.1c-.8 0-1.5-.6-1.6-1.5-.1-.9.6-1.6 1.5-1.7.8 0 1.6.6 1.7 1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1618" d="M288 270.1c.6-.5 1.4-.4 1.9.2.5.6.4 1.4-.2 1.9-.3.2-.6.3-.9.3-.4 0-.8-.2-1-.5-.5-.6-.4-1.4.2-1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1620" d="M290.3 310.3c-.2.3-.6.4-.9.4-.2 0-.5-.1-.7-.2-.5-.4-.6-1.1-.2-1.6.4-.5 1.1-.6 1.6-.2.5.4.6 1.1.2 1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1622" d="M291.6 217.5c-.7 0-1.4-.5-1.6-1.3-.2-.9.4-1.7 1.2-1.9.9-.2 1.7.4 1.9 1.2.2.9-.4 1.7-1.2 1.9-.1.1-.2.1-.3.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1624" d="M294.1 295.8c-.2.3-.6.4-.9.4-.3 0-.6-.1-.8-.3-.5-.5-.6-1.2-.1-1.8.5-.6 1.2-.6 1.8-.1.4.5.5 1.3 0 1.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1626" d="M294.8 256.5c-.3.2-.6.3-.8.3-.5 0-.9-.2-1.2-.6-.5-.7-.3-1.6.4-2.1s1.6-.3 2.1.4c.4.6.2 1.5-.5 2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1628" d="M295 229.3c-.6 0-1.3-.4-1.5-1-.3-.8.1-1.7 1-2 .8-.3 1.7.1 2 1 .3.8-.1 1.7-1 2-.2-.1-.4 0-.5 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1630" d="M298.2 281.1c-.3.3-.6.4-1 .4-.3 0-.7-.1-1-.4-.5-.5-.6-1.4 0-1.9.5-.5 1.4-.6 1.9 0 .6.5.6 1.3.1 1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1632" d="M298.5 239.6c-.4-.8-.1-1.7.7-2.2.8-.4 1.7-.1 2.2.7.4.8.1 1.7-.7 2.2-.2.1-.5.2-.7.2-.6 0-1.2-.4-1.5-.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1634" d="M300.4 265.9c-.5-.6-.5-1.5.1-2.1.6-.5 1.5-.5 2.1.1.5.6.5 1.5-.1 2.1-.3.2-.6.4-1 .4s-.8-.2-1.1-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1636" d="M301.6 197.6c0-.9.8-1.7 1.7-1.7.9 0 1.7.8 1.7 1.7 0 .9-.8 1.7-1.7 1.7-.9 0-1.7-.7-1.7-1.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1638" d="M303.8 303.3c-.2.3-.6.5-1 .5-.3 0-.5-.1-.7-.2-.6-.4-.7-1.2-.3-1.7.4-.6 1.2-.7 1.7-.3.6.4.7 1.1.3 1.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1640" d="M304.3 211.5c-.8 0-1.5-.6-1.7-1.4-.2-.9.5-1.8 1.4-2 .9-.2 1.8.5 2 1.4.2.9-.5 1.8-1.4 2h-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1642" d="M307.5 289.1c-.3.3-.7.5-1.1.5-.3 0-.6-.1-.8-.3-.6-.5-.7-1.3-.2-1.9.5-.6 1.3-.7 1.9-.2.6.4.7 1.3.2 1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1644" d="M307.4 250.5c-.3.2-.6.3-.9.3-.5 0-1-.2-1.3-.7-.5-.7-.4-1.7.3-2.2.7-.5 1.7-.4 2.2.4.6.7.4 1.7-.3 2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1646" d="M307.9 223.3c-.2.1-.4.1-.6.1-.7 0-1.4-.4-1.6-1.2-.3-.9.2-1.9 1.1-2.2.9-.3 1.9.2 2.2 1.1.3 1-.2 1.9-1.1 2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1648" d="M311.3 274.6c-.3.3-.7.5-1.1.5-.3 0-.7-.1-1-.4-.6-.5-.7-1.5-.1-2.1.5-.6 1.5-.7 2.1-.1.6.5.7 1.5.1 2.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1650" d="M310.8 233.7c-.4-.8-.1-1.9.7-2.3.8-.5 1.9-.1 2.3.7.5.8.1 1.9-.7 2.3-.3.1-.5.2-.8.2-.6 0-1.2-.3-1.5-.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1652" d="M313.2 259.7c-.6-.6-.6-1.6 0-2.3.6-.6 1.6-.6 2.2 0 .6.6.6 1.6 0 2.3-.3.3-.7.4-1.1.4-.4.1-.8 0-1.1-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1654" d="M317.3 193.3h-.2c-1-.1-1.7-1-1.6-2 .1-1 1-1.7 2-1.6 1 .1 1.7 1 1.6 2-.1.9-.9 1.6-1.8 1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1656" d="M319.1 203.6c.1 1-.6 1.9-1.6 2h-.2c-.9 0-1.7-.7-1.8-1.6-.1-1 .6-1.9 1.6-2 1-.2 1.9.6 2 1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1658" d="M317.6 296c-.3.4-.7.6-1.2.6-.2 0-.5-.1-.7-.2-.6-.4-.8-1.2-.4-1.9.4-.6 1.2-.8 1.9-.4.6.4.8 1.3.4 1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1660" d="M317.7 244.2c-.6-.7-.4-1.8.3-2.4.7-.6 1.8-.4 2.4.3.6.7.4 1.8-.3 2.4-.3.2-.7.4-1 .4-.6 0-1.1-.2-1.4-.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1662" d="M321.1 282c-.3.4-.7.6-1.2.6-.3 0-.6-.1-.8-.3-.7-.5-.8-1.4-.4-2.1.5-.7 1.4-.8 2.1-.4.6.6.8 1.5.3 2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1664" d="M324.7 267.8c-.3.4-.8.6-1.3.6-.3 0-.7-.1-1-.3-.7-.5-.8-1.5-.3-2.2.5-.7 1.5-.8 2.2-.3.8.5 1 1.5.4 2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1666" d="M324.6 228.8c-.6 0-1.3-.3-1.6-.9-.5-.9-.2-2 .7-2.5.9-.5 2-.2 2.5.7.5.9.2 2-.7 2.5-.3.1-.6.2-.9.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1668" d="M328.6 253.3c-.3.4-.8.6-1.3.6-.4 0-.8-.1-1.2-.4-.7-.6-.7-1.7-.1-2.4s1.7-.7 2.4-.1c.8.6.8 1.6.2 2.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1670" d="M326.1 141.7c.7-.6 1.8-.6 2.4.1.6.7.6 1.8-.1 2.4-.3.3-.7.5-1.2.5s-.9-.2-1.3-.6c-.6-.6-.5-1.7.2-2.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1672" d="M326.1 169.1c-.3.6-1 .9-1.6.9-.3 0-.6-.1-.9-.2-.9-.5-1.2-1.6-.7-2.5.5-.9 1.6-1.2 2.5-.7.9.5 1.2 1.6.7 2.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1674" d="M322.4 127.1c.7-.5 1.7-.4 2.2.3.5.7.4 1.7-.3 2.2-.3.2-.6.3-1 .3-.5 0-.9-.2-1.3-.6-.4-.7-.3-1.7.4-2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1676" d="M320.3 217.5c-.2.1-.4.1-.5.1-.8 0-1.5-.5-1.7-1.3-.3-1 .2-2 1.2-2.3 1-.3 2 .2 2.3 1.2.2 1-.3 2-1.3 2.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1678" d="M321.5 180c-.2.8-1 1.3-1.7 1.3-.2 0-.4 0-.5-.1-1-.3-1.5-1.3-1.2-2.3.3-1 1.3-1.5 2.3-1.2.8.4 1.4 1.4 1.1 2.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1680" d="M319 112.8c.7-.5 1.6-.3 2.1.4s.3 1.6-.4 2.1c-.3.2-.6.3-.8.3-.5 0-.9-.2-1.2-.6-.6-.8-.4-1.8.3-2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1682" d="M320 150.7c.7.6.9 1.7.3 2.4-.3.4-.8.7-1.3.7-.4 0-.7-.1-1-.4-.7-.6-.9-1.7-.3-2.4.5-.8 1.5-.9 2.3-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1684" d="M315.6 98.7c.6-.4 1.5-.2 1.9.4.4.6.2 1.5-.4 1.9-.2.1-.5.2-.7.2-.5 0-.9-.2-1.2-.6-.4-.6-.2-1.5.4-1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1686" d="M313.1 135.4c.6-.6 1.6-.7 2.3 0 .6.6.7 1.6 0 2.3-.3.3-.7.5-1.1.5-.4 0-.8-.1-1.1-.4-.7-.7-.7-1.7-.1-2.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1688" d="M313 160.8c.8.4 1.1 1.5.7 2.3-.3.6-.9.9-1.5.9-.3 0-.6-.1-.8-.2-.8-.4-1.1-1.5-.7-2.3.5-.8 1.5-1.2 2.3-.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1690" d="M309.2 120.5c.6-.5 1.5-.5 2.1.1.5.6.5 1.5-.1 2.1-.3.3-.6.4-1 .4s-.8-.2-1.1-.5c-.6-.7-.5-1.6.1-2.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1692" d="M308.9 174.1c-.2.7-.9 1.2-1.6 1.2-.2 0-.4 0-.6-.1-.9-.3-1.4-1.3-1.1-2.2.3-.9 1.3-1.4 2.2-1.1.9.3 1.4 1.3 1.1 2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1694" d="M305.5 105.9c.6-.5 1.5-.4 1.9.2.5.6.4 1.4-.2 1.9-.3.2-.6.3-.8.3-.4 0-.8-.2-1.1-.5-.5-.6-.4-1.4.2-1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1696" d="M305.1 145c.5-.7 1.5-.9 2.2-.4.7.5.9 1.5.4 2.2-.3.4-.8.7-1.3.7-.3 0-.7-.1-.9-.3-.7-.5-.9-1.5-.4-2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1698" d="M304.6 183.7c.9.2 1.6 1 1.4 2-.1.8-.9 1.4-1.7 1.4h-.3c-.9-.1-1.6-1-1.4-2 .2-.9 1.1-1.5 2-1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1700" d="M302 91.6c.6-.4 1.3-.3 1.7.3.4.6.3 1.3-.3 1.7-.2.2-.5.2-.7.2-.4 0-.8-.2-1-.5-.4-.5-.3-1.3.3-1.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1702" d="M300.4 129.3c.5-.6 1.5-.7 2.1-.1.6.5.7 1.5.1 2.1-.3.3-.7.5-1.1.5-.3 0-.7-.1-1-.4-.6-.6-.7-1.5-.1-2.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1704" d="M300.7 154.9c.8.4 1.1 1.4.7 2.2-.3.5-.8.8-1.4.8-.3 0-.5-.1-.7-.2-.8-.4-1.1-1.4-.7-2.2.3-.7 1.3-1 2.1-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1706" d="M296.2 114.1c.5-.5 1.4-.5 1.9 0s.5 1.4 0 1.9c-.3.3-.6.4-1 .4s-.7-.1-1-.4c-.5-.5-.5-1.4.1-1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1708" d="M296.4 168.1c-.2.6-.8 1-1.5 1-.2 0-.4 0-.5-.1-.8-.3-1.3-1.2-1-2 .3-.8 1.2-1.3 2-1 .9.3 1.3 1.3 1 2.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1710" d="M295.1 140.7c-.3.4-.7.6-1.2.6-.3 0-.6-.1-.8-.3-.7-.5-.8-1.4-.4-2.1.5-.7 1.4-.8 2.1-.4.6.6.8 1.6.3 2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1712" d="M292.3 99.3c.5-.5 1.3-.4 1.8.1s.4 1.3-.1 1.8c-.2.2-.5.3-.8.3-.4 0-.7-.1-.9-.4-.6-.6-.6-1.4 0-1.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1714" d="M291.9 177.8c.9.2 1.4 1 1.2 1.9-.2.8-.8 1.3-1.6 1.3h-.3c-.9-.2-1.4-1-1.2-1.9.2-1 1-1.5 1.9-1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1716" d="M288.6 84.7c.5-.4 1.2-.3 1.6.2.4.5.3 1.2-.2 1.6-.2.2-.5.2-.7.2-.3 0-.7-.1-.9-.4-.4-.5-.3-1.2.2-1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1718" d="M287.7 123.2c.5-.6 1.3-.7 1.9-.2.6.5.7 1.3.2 1.9-.3.3-.7.5-1.1.5-.3 0-.6-.1-.9-.3-.5-.5-.5-1.3-.1-1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1720" d="M286.3 149.7c.4-.7 1.3-1 2-.6s1 1.3.6 2c-.3.5-.8.8-1.3.8-.2 0-.5-.1-.7-.2-.7-.4-1-1.3-.6-2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1722" d="M285.2 287.4c-.2.2-.6.3-.9.3-.3 0-.7-.1-.9-.4-.5-.5-.5-1.3 0-1.8s1.3-.5 1.8 0c.5.7.5 1.5 0 1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1724" d="M283.3 107.8c.5-.5 1.3-.5 1.8 0s.5 1.3 0 1.8c-.2.3-.6.4-.9.4-.3 0-.6-.1-.9-.3-.5-.6-.5-1.4 0-1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1726" d="M282.2 132.8c.6.4.8 1.3.4 1.9-.3.4-.7.6-1.1.6-.3 0-.5-.1-.8-.2-.6-.4-.8-1.3-.4-1.9.5-.7 1.3-.8 1.9-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1728" d="M279.2 92.8c.5-.4 1.2-.4 1.6 0 .4.5.4 1.2 0 1.6-.2.2-.5.3-.8.3-.3 0-.6-.1-.8-.4-.5-.4-.4-1.1 0-1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1730" d="M277.5 173c.2-.8 1-1.3 1.8-1.1.8.2 1.3 1 1.1 1.8-.2.7-.8 1.1-1.4 1.1h-.4c-.8-.3-1.3-1.1-1.1-1.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1732" d="M275.4 78c.4-.4 1.1-.3 1.4.1.4.4.3 1.1-.1 1.4-.2.2-.4.2-.7.2-.3 0-.6-.1-.8-.4-.3-.3-.3-.9.2-1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1734" d="M275.2 117.2c.4-.5 1.2-.6 1.8-.2.5.4.6 1.2.2 1.8-.2.3-.6.5-1 .5-.3 0-.5-.1-.8-.3-.5-.5-.6-1.3-.2-1.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1736" d="M274 143.8c.3-.7 1.2-.9 1.8-.6.7.3.9 1.2.6 1.8-.2.5-.7.7-1.2.7-.2 0-.4 0-.6-.2-.6-.2-.9-1-.6-1.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1738" d="M272.3 293.7c-.2.2-.5.3-.8.3-.3 0-.6-.1-.8-.4-.4-.5-.4-1.2.1-1.6.5-.4 1.2-.4 1.6.1.4.4.3 1.1-.1 1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1740" d="M270.6 101.6c.4-.5 1.1-.5 1.6-.1s.5 1.1.1 1.6c-.2.2-.5.4-.8.4-.3 0-.5-.1-.8-.3-.5-.4-.5-1.1-.1-1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1742" d="M269.7 126.9c.6.4.8 1.1.4 1.7-.2.4-.6.6-1.1.6-.2 0-.5-.1-.7-.2-.6-.4-.8-1.1-.4-1.7.5-.6 1.3-.8 1.8-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1744" d="M266.3 86.4c.4-.4 1-.4 1.4 0 .4.4.4 1.1 0 1.4-.2.2-.5.3-.7.3-.2 0-.5-.1-.7-.3-.4-.3-.4-1 0-1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1746" d="M265 166.9c.2-.7 1-1.2 1.7-1 .7.2 1.2.9 1 1.7-.2.6-.7 1-1.3 1h-.4c-.7-.2-1.2-.9-1-1.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1748" d="M264.5 112.5c-.2.3-.6.5-.9.5-.2 0-.5-.1-.7-.2-.5-.4-.6-1.1-.2-1.6.4-.5 1.1-.6 1.6-.2.5.3.6 1 .2 1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1750" d="M262.3 71.5c.4-.3.9-.3 1.3.1.3.4.3.9-.1 1.3-.2.2-.4.2-.6.2-.2 0-.5-.1-.7-.3-.3-.4-.3-1 .1-1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1752" d="M261.8 138c.3-.6 1.1-.9 1.7-.5.6.3.9 1.1.5 1.7-.2.4-.7.7-1.1.7-.2 0-.4 0-.6-.1-.6-.4-.8-1.2-.5-1.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1754" d="M262.6 190.1c.8 0 1.3.7 1.3 1.4s-.6 1.3-1.4 1.3h-.1c-.8 0-1.3-.7-1.3-1.4.1-.7.7-1.3 1.5-1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1756" d="M257.9 95.5c.4-.4 1-.5 1.4-.1.4.4.5 1 .1 1.4-.2.2-.5.4-.8.4-.2 0-.5-.1-.7-.2-.3-.5-.3-1.1 0-1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1758" d="M257.4 244.4c.6-.3 1.4.1 1.6.7.3.6-.1 1.4-.7 1.6-.2.1-.3.1-.5.1-.5 0-1-.3-1.2-.8-.1-.7.2-1.4.8-1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1760" d="M256.7 149.2c.3-.6 1-1 1.6-.7.6.3 1 1 .7 1.6-.2.5-.7.8-1.2.8-.2 0-.3 0-.5-.1-.6-.2-.9-.9-.6-1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1762" d="M253.5 80.1c.3-.4.9-.4 1.3 0 .4.3.4.9 0 1.3-.2.2-.4.3-.7.3-.2 0-.4-.1-.6-.2-.3-.4-.4-1 0-1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1764" d="M251.1 131.6c.6.3.8 1 .5 1.5-.2.4-.6.6-1 .6-.2 0-.4 0-.5-.1-.6-.3-.8-1-.5-1.5.3-.6 1-.8 1.5-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1766" d="M251.2 172c.7.1 1.1.8 1 1.5-.1.6-.6 1-1.2 1h-.2c-.7-.1-1.1-.8-1-1.5 0-.7.7-1.1 1.4-1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1768" d="M251.7 104.9c.5.3.6 1 .2 1.4-.2.3-.5.4-.8.4-.2 0-.4-.1-.6-.2-.5-.3-.6-1-.2-1.4.3-.4.9-.5 1.4-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1770" d="M249.3 65.1c.3-.3.8-.3 1.1 0 .3.3.3.8 0 1.1-.2.1-.4.2-.6.2-.2 0-.4-.1-.6-.2-.2-.3-.2-.8.1-1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1772" d="M249.3 184.1c.7.1 1.2.7 1.1 1.4-.1.6-.6 1.1-1.2 1.1h-.1c-.7-.1-1.2-.7-1.1-1.4 0-.6.7-1.1 1.3-1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1774" d="M245.3 89.4c.3-.4.9-.4 1.3-.1.4.3.4.9.1 1.3-.2.2-.4.3-.7.3-.2 0-.4-.1-.6-.2-.3-.3-.4-.9-.1-1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1776" d="M245.1 250.4c.6-.2 1.2 0 1.5.6.2.6 0 1.2-.6 1.5-.1.1-.3.1-.4.1-.4 0-.9-.3-1.1-.7-.3-.6 0-1.2.6-1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1778" d="M244.4 143.3c.2-.6.9-.9 1.5-.6.6.2.9.9.6 1.5-.2.4-.6.7-1.1.7-.1 0-.3 0-.4-.1-.5-.2-.8-.9-.6-1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1780" d="M240.8 73.9c.3-.3.8-.4 1.1-.1.3.3.4.8.1 1.1-.2.2-.4.3-.6.3-.2 0-.4-.1-.5-.2-.4-.3-.4-.8-.1-1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1782" d="M237.9 99.2c.3-.4.9-.5 1.3-.2s.5.9.2 1.3c-.2.3-.5.4-.7.4-.2 0-.4-.1-.5-.2-.5-.3-.6-.9-.3-1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1784" d="M238.5 229.2h-.2c-.5 0-1-.4-1.1-.9-.1-.6.3-1.2.9-1.4.6-.1 1.2.3 1.4.9 0 .6-.4 1.2-1 1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1786" d="M238.8 269.4c-.1.1-.3.1-.5.1-.4 0-.7-.2-.9-.6-.3-.5-.1-1.1.4-1.4.5-.3 1.1-.1 1.4.4.3.7.1 1.3-.4 1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1788" d="M238.5 166.1c.6.1 1 .7.9 1.4-.1.5-.6.9-1.1.9h-.2c-.6-.1-1-.7-.9-1.4 0-.7.7-1.1 1.3-.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1790" d="M237.4 126.2c.3-.5.9-.7 1.4-.4.5.3.7.9.4 1.4-.2.4-.5.6-.9.6-.2 0-.3 0-.5-.1-.5-.4-.7-1-.4-1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1792" d="M236.5 58.7c.3-.3.7-.3 1 0 .3.3.3.7 0 1-.1.1-.3.2-.5.2s-.3-.1-.5-.2c-.3-.3-.3-.7 0-1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1794" d="M236.3 178.1c.6.1 1.1.7 1 1.3-.1.6-.6 1-1.1 1h-.1c-.6-.1-1.1-.7-1-1.3 0-.6.6-1 1.2-1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1796" d="M235.2 190.4c.6 0 1.1.6 1.1 1.2s-.5 1.1-1.1 1.1c-.6 0-1.1-.6-1.1-1.2-.1-.7.4-1.2 1.1-1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1798" d="M232.8 256.5c.5-.2 1.1 0 1.3.6.2.5 0 1.1-.6 1.3-.1.1-.3.1-.4.1-.4 0-.8-.2-.9-.6-.2-.6.1-1.2.6-1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1800" d="M232.2 137.4c.2-.5.8-.8 1.3-.6.5.2.8.8.6 1.3-.2.4-.5.6-.9.6-.1 0-.3 0-.4-.1-.6 0-.8-.6-.6-1.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1802" d="M232.8 83.3c.3-.3.8-.4 1.1-.1.3.3.4.8.1 1.1-.2.2-.4.3-.6.3-.2 0-.3-.1-.5-.2-.3-.2-.3-.7-.1-1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1804" d="M228.1 67.7c.2-.3.7-.3 1-.1.3.2.3.7.1 1-.1.2-.3.2-.5.2s-.3-.1-.4-.2c-.4-.1-.4-.6-.2-.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1806" d="M225.5 93.2c.3-.4.7-.5 1.1-.2.4.2.5.7.2 1.1-.2.2-.4.3-.7.3-.2 0-.3 0-.4-.1-.4-.2-.4-.7-.2-1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1808" d="M225.1 120.3c.2-.4.8-.6 1.2-.4.4.2.6.8.4 1.2-.2.3-.5.5-.8.5-.1 0-.3 0-.4-.1-.4-.2-.6-.7-.4-1.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1810" d="M225.8 160.2c.6.1.9.7.8 1.2-.1.5-.5.8-1 .8h-.2c-.6-.1-.9-.7-.8-1.2.1-.6.7-1 1.2-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1812" d="M223.7 52.5c.2-.2.6-.2.8 0 .2.2.2.6 0 .8-.1.1-.3.2-.4.2-.1 0-.3-.1-.4-.2-.2-.2-.2-.6 0-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1814" d="M223.4 172.2c.6.1.9.6.9 1.2-.1.5-.5.9-1 .9h-.2c-.6-.1-.9-.6-.9-1.2.1-.6.6-1 1.2-.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1816" d="M222.8 185.4c0 .5-.5.9-1 .9h-.1c-.6 0-1-.5-.9-1.1 0-.6.5-1 1.1-.9.5.1 1 .6.9 1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1818" d="M222.3 197.6c0 .6-.5 1-1 1s-1-.5-1-1c0-.6.5-1 1-1s1 .5 1 1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1820" d="M221.6 132.2c-.1.3-.5.6-.8.6-.1 0-.2 0-.4-.1-.5-.2-.7-.7-.5-1.2.2-.5.7-.7 1.2-.5.5.3.7.8.5 1.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1822" d="M221.3 77.2c.3.2.4.7.1 1-.1.2-.3.3-.5.3-.1 0-.3 0-.4-.1-.3-.2-.4-.7-.1-1 .2-.4.6-.5.9-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1824" d="M219.5 37.4c.2-.2.5-.2.6 0 .2.2.2.5 0 .6-.1.1-.2.1-.3.1-.1 0-.2 0-.3-.1-.2-.2-.2-.5 0-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1826" d="M217.3 143.7c-.1.4-.5.6-.9.6h-.3c-.5-.2-.7-.7-.6-1.2.2-.5.7-.7 1.2-.6.5.2.8.7.6 1.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1828" d="M213.2 87.3c.2-.3.6-.4.9-.2.3.2.4.6.2.9-.1.2-.3.3-.6.3-.1 0-.3 0-.4-.1-.3-.2-.3-.6-.1-.9Z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1830" d="M212.9 114.4c.2-.4.7-.6 1.1-.3.4.2.5.7.3 1.1-.1.3-.4.4-.7.4-.1 0-.2 0-.4-.1-.3-.2-.5-.7-.3-1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1832" d="M210.5 166.2c.5.1.8.6.7 1.1-.1.4-.5.7-.9.7h-.2c-.5-.1-.8-.6-.7-1.1.2-.4.6-.8 1.1-.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1834" d="M209.5 179.4c0 .5-.4.8-.9.8h-.1c-.5-.1-.9-.5-.8-1 .1-.5.5-.9 1-.8.5 0 .9.5.8 1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1836" d="M207.7 202.9c.5 0 .9.4.9.9s-.4.9-.9.9-.9-.4-.9-.9.4-.9.9-.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1838" d="M206.8 191.5c0-.5.4-.9.9-.9s.9.4.9.9-.4.9-.9.9-.9-.4-.9-.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1840" d="M209.2 126.3c-.1.3-.4.5-.7.5-.1 0-.2 0-.3-.1-.4-.2-.6-.6-.4-1 .2-.4.6-.6 1-.4.4.1.6.6.4 1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1842" d="M207.9 71.3c.2-.3.5-.3.8-.1.3.2.3.5.1.8-.1.1-.3.2-.5.2-.1 0-.2 0-.3-.1-.3-.2-.3-.5-.1-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1844" d="M206.7 31c.1-.1.3-.1.5 0 .1.1.1.3 0 .5-.1.1-.2.1-.2.1-.1 0-.2 0-.2-.1-.2-.1-.2-.4-.1-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1846" d="M204.3 136.7c.4.1.6.6.5 1-.1.3-.4.5-.8.5h-.3c-.4-.1-.6-.6-.5-1 .3-.5.7-.7 1.1-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1848" d="M203 55.5c.2-.2.4-.2.6-.1.2.2.2.4.1.6-.1.1-.2.2-.4.2-.1 0-.2 0-.3-.1-.1-.1-.2-.4 0-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1850" d="M200.8 81.4c.2-.3.5-.3.8-.2.3.2.3.5.2.8-.1.2-.3.3-.5.3-.1 0-.2 0-.3-.1-.3-.2-.4-.5-.2-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1852" d="M200.7 108.5c.2-.3.6-.5.9-.3.3.2.5.6.3.9-.1.2-.4.4-.6.4-.1 0-.2 0-.3-.1-.3-.1-.4-.5-.3-.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1854" d="M197.8 160.3c.4.1.7.5.6.9-.1.4-.4.6-.8.6h-.2c-.4-.1-.7-.5-.6-.9.1-.4.5-.7 1-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1856" d="M196.8 120.3c-.1.3-.4.4-.6.4-.1 0-.2 0-.3-.1-.3-.1-.5-.6-.4-.9.1-.3.6-.5.9-.4.4.2.5.6.4 1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1858" d="M195.5 221.3c.4-.1.8.2.9.7.1.4-.3.8-.7.9h-.1c-.4 0-.7-.3-.8-.7 0-.4.3-.8.7-.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1860" d="M195.5 174c-.4-.1-.7-.5-.7-.9.1-.4.5-.7.9-.7.4.1.7.5.7.9-.1.4-.4.7-.8.7h-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1862" d="M195.5 65.3c.1-.2.4-.3.6-.1.2.1.2.4.1.6-.1.1-.2.2-.4.2-.1 0-.2 0-.3-.1-.1-.1-.2-.4 0-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1864" d="M195.1 91.7c.3.2.4.5.2.8-.1.2-.3.3-.5.3-.1 0-.2 0-.3-.1-.3-.2-.4-.5-.2-.8.2-.3.5-.4.8-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1866" d="M193.9 24.7c.1-.1.2-.1.3 0 .1.1.1.2 0 .3 0 0-.1.1-.2.1s-.1 0-.2-.1c.1 0 .1-.2.1-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1868" d="M194.4 184.6c.4 0 .8.4.7.8 0 .4-.4.7-.8.7h-.1c-.4 0-.8-.4-.7-.8.1-.4.5-.8.9-.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1870" d="M191.9 130.8c.4.1.5.5.4.9-.1.3-.4.5-.6.5h-.2c-.4-.1-.5-.5-.4-.9.1-.5.4-.6.8-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1872" d="M190.5 49.5c.1-.1.3-.2.5-.1.1.1.2.3.1.5-.1.1-.2.1-.3.1-.1 0-.1 0-.2-.1-.2-.1-.2-.3-.1-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1874" d="M188.5 75.4c.1-.2.4-.3.6-.1.2.1.3.4.1.6-.1.1-.2.2-.4.2-.1 0-.2 0-.2-.1-.2-.1-.2-.4-.1-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1876" d="M188.5 102.7c.1-.3.5-.4.8-.3.3.1.4.5.3.8-.1.2-.3.3-.5.3-.1 0-.2 0-.3-.1-.3-.1-.5-.4-.3-.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1878" d="M185.1 154.3c.4.1.6.4.5.8-.1.3-.4.5-.7.5h-.1c-.4-.1-.6-.4-.5-.8.1-.4.4-.5.8-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1880" d="M184 113.6c.3.1.4.5.3.7-.1.2-.3.3-.5.3h-.2c-.3-.1-.4-.5-.3-.7.1-.3.4-.5.7-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1882" d="M183.1 59.3c.1-.2.3-.2.5-.1s.2.3.1.5c-.1.1-.2.1-.3.1-.1 0-.1 0-.2-.1-.2 0-.2-.3-.1-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1884" d="M182.1 85.9c.1-.2.4-.3.6-.2.2.1.3.4.2.6-.1.1-.2.2-.4.2-.1 0-.2 0-.2-.1-.2 0-.3-.2-.2-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1886" d="M181.2 18.5h.2v.2h-.2v-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1888" d="M181.2 178.6c.4 0 .7.4.6.7 0 .4-.3.6-.7.6h-.1c-.4 0-.7-.4-.6-.7.1-.4.4-.7.8-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1890" d="M180.4 190.8c.4 0 .7.3.7.7 0 .4-.3.7-.7.7-.4 0-.7-.3-.7-.7 0-.4.4-.7.7-.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1892" d="M177.1 97.2c-.1.2-.2.3-.4.3h-.2c-.2-.1-.3-.4-.2-.6.1-.2.4-.3.6-.2.2 0 .3.3.2.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1894" d="M176.2 69.5c.1-.2.3-.2.5-.1s.2.3.1.5c-.1.1-.2.2-.3.2-.1 0-.1 0-.2-.1-.1-.2-.2-.4-.1-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1896" d="M172.4 148.5c.3.1.5.4.4.7-.1.3-.3.4-.6.4h-.1c-.3-.1-.5-.4-.4-.7.1-.3.4-.5.7-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1898" d="M171.7 107.7c.2.1.3.4.2.6-.1.2-.2.3-.4.3h-.2c-.2-.1-.3-.4-.2-.6.1-.3.3-.4.6-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1900" d="M170.7 53.3c.1-.1.2-.1.3-.1.1.1.1.2.1.3 0 .1-.1.1-.2.1h-.1c-.1 0-.2-.2-.1-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1902" d="M167.5 173.1c0-.3.3-.5.6-.5s.5.3.5.6-.3.5-.6.5h-.1c-.2 0-.5-.3-.4-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1904" d="M166.5 119.4c.1-.2.3-.4.6-.3.2.1.4.4.3.6-.1.2-.2.3-.4.3h-.2c-.3-.1-.4-.4-.3-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1906" d="M167 184.8c.3 0 .6.3.5.6 0 .3-.3.5-.6.5s-.6-.3-.5-.6c0-.2.3-.5.6-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1908" d="M164.5 90.7c.2.1.2.3.2.5-.1.1-.2.2-.3.2h-.1c-.2-.1-.2-.3-.2-.5.1-.2.3-.3.4-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1910" d="M163.9 63.6c.1-.1.2-.1.3-.1.1.1.1.2.1.3 0 .1-.1.1-.2.1h-.1c-.1-.1-.1-.2-.1-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1912" d="M159.8 142.5c.2.1.4.3.3.6-.1.2-.2.3-.4.3h-.1c-.2-.1-.4-.3-.3-.6 0-.2.2-.3.5-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1914" d="M158.9 102c.1-.2.3-.3.4-.2.2.1.3.3.2.4-.1.1-.2.2-.3.2h-.1c-.2 0-.3-.2-.2-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1916" d="M155.5 167.1c0 .2-.2.4-.4.4h-.1c-.2 0-.4-.3-.4-.5s.3-.4.5-.4.4.3.4.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1918" d="M153.7 178.8c.3 0 .4.2.4.5 0 .2-.2.4-.5.4s-.4-.2-.4-.5c.1-.2.3-.4.5-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1920" d="M147.4 137c0 .2-.2.3-.3.3h-.1c-.2 0-.3-.2-.2-.4 0-.2.2-.3.4-.2.2 0 .3.1.2.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1922" d="M142.3 107.4c.1 0 .2.2.1.3 0 .1-.1.1-.2.1h-.1c-.1 0-.2-.2-.1-.3 0-.1.1-.2.3-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1924" d="M139.7 79.1c0-.1.1-.1.2-.1s.1.1.1.2l-.1.1c-.3-.1-.3-.1-.2-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1926" d="M139.6 185c.2 0 .3.2.3.4s-.2.3-.3.3c-.2 0-.3-.2-.3-.4s.1-.3.3-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1928" d="M134.4 90.2c0-.1.1-.1.2-.1s.1.1.1.1l-.1.1c-.2.1-.2 0-.2-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1930" d="M125.7 113.1c.1 0 .1.1.1.1l-.1.1c-.1 0-.1-.1-.1-.1 0-.1 0-.1.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1932" d="M112 197.7c-.1 0-.1-.1-.1-.1 0-.1 0-.1.1-.1s.1 0 .1.1-.1.1-.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1934" d="M112.3 185.2c.1 0 .1.1.1.1 0 .1-.1.1-.1.1-.1 0-.1-.1-.1-.1-.1 0 0-.1.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1936" d="M112.3 210c-.1 0-.1 0-.1-.1s0-.1.1-.1.1 0 .1.1l-.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1938" d="M113.1 173c.1 0 .1.1.1.1 0 .1-.1.1-.1.1-.1 0-.1-.1-.1-.1 0-.1 0-.1.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1940" d="M113.2 222.1c0 .1 0 .1-.1.1s-.1 0-.1-.1 0-.1.1-.1l.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1942" d="M114.5 160.8c.1 0 .1.1.1.1 0 .1-.1.1-.1.1-.1 0-.1-.1-.1-.1 0-.1 0-.1.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1944" d="M114.5 234.4c-.1 0-.1 0-.1-.1s0-.1.1-.1.1 0 .1.1 0 .1-.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1946" d="M116.3 148.8c0-.1.1-.1.1-.1.1 0 .1.1.1.1 0 .1-.1.1-.1.1-.1 0-.1 0-.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1948" d="M116.5 246.5c-.1 0-.1 0-.1-.1s0-.1.1-.1.1 0 .1.1-.1.1-.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1950" d="M118.8 136.8c0-.1.1-.1.1-.1.1 0 .1.1.1.1 0 .1-.1.1-.1.1l-.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1952" d="M119 258.5c-.1 0-.1 0-.1-.1s0-.1.1-.1.1 0 .1.1 0 .1-.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1954" d="M121.9 124.9c0-.1.1-.1.1-.1.1 0 .1.1.1.1 0 .1-.1.1-.1.1-.1 0-.1 0-.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1956" d="M122.1 270.4c-.1 0-.1 0-.1-.1s0-.1.1-.1.1 0 .1.1c0 0 0 .1-.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1958" d="M125.7 282.1c-.1 0-.1 0-.1-.1s0-.1.1-.1.1 0 .1.1c0 0 0 .1-.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1960" d="M125.7 204c-.1 0-.2-.1-.2-.2s.1-.2.2-.2.2.1.2.2-.1.2-.2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1962" d="M125.7 191.7c-.1 0-.2-.1-.2-.2s.1-.2.2-.2.2.1.2.2-.1.2-.2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1964" d="M126.3 179c.1 0 .2.1.2.2s-.1.2-.2.2-.2-.1-.2-.2.1-.2.2-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1966" d="M126.3 216.2c-.1 0-.2-.1-.2-.2s.1-.2.2-.2.2.1.2.2-.1.2-.2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1968" d="M127.5 166.8c.1 0 .2.1.2.3 0 .1-.1.2-.2.2s-.2-.1-.2-.3c0-.1.1-.2.2-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1970" d="M127.5 228.4c-.1 0-.2-.1-.3-.2 0-.1.1-.2.2-.3.1 0 .2.1.3.2 0 .2-.1.3-.2.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1972" d="M129.3 154.7c.1 0 .2.1.2.3 0 .1-.1.2-.2.2s-.2-.1-.2-.3c-.1-.2.1-.3.2-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1974" d="M129.3 240.6c-.1 0-.2-.1-.3-.2 0-.1.1-.2.2-.3.1 0 .2.1.3.2 0 .1-.1.3-.2.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1976" d="M129.7 101.6c0-.1.1-.1.1-.1.1 0 .1.1.1.1l-.1.1c-.1 0-.1 0-.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1978" d="M130 293.5c0 .1 0 .1-.1.1 0 0-.1 0-.1-.1s0-.1.1-.1c0 0 0 .1.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1980" d="M131.6 142.6c.1 0 .2.1.2.3 0 .1-.1.2-.2.2s-.2-.1-.2-.3c0-.1.1-.2.2-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1982" d="M131.7 252.6s-.1 0 0 0c-.2 0-.2-.1-.3-.2 0-.1.1-.2.2-.3.1 0 .2 0 .3.2 0 .2-.1.3-.2.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1984" d="M134.3 130.9c0-.1.2-.2.3-.2.1 0 .2.2.2.3 0 .1-.1.2-.2.2h-.1c-.1-.1-.2-.2-.2-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1986" d="M134.6 305c-.1 0-.1 0-.1-.1s0-.1.1-.1.1 0 .1.1c0 0-.1 0-.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1988" d="M134.6 264.5c-.2 0-.3-.1-.3-.2s0-.2.2-.3c.1 0 .2 0 .3.2 0 .2 0 .3-.2.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1990" d="M137.9 119.1c0-.1.2-.2.3-.1.1 0 .2.2.1.3 0 .1-.1.2-.2.2h-.1c-.1-.1-.2-.3-.1-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1992" d="M138.3 276c0 .1 0 .2-.1.3h-.1c-.1 0-.2-.1-.2-.2s0-.2.1-.3c.2 0 .3.1.3.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1994" d="M139 197.6c0-.2.2-.3.3-.3.2 0 .3.1.3.3 0 .2-.2.3-.3.3-.2.1-.3-.1-.3-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1996" d="M139.8 316c0 .1 0 .1 0 0-.1 0-.1 0-.2-.1 0-.1 0-.1.1-.2.1 0 .1 0 .2.1 0 .2 0 .2-.1.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path1998" d="M139.6 210.2c-.2 0-.3-.1-.4-.3 0-.2.1-.4.3-.4.2 0 .3.1.4.3.1.3-.1.4-.3.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2000" d="M140.2 173.1c0-.2.2-.3.4-.3s.3.2.3.4-.2.3-.3.3c-.3-.1-.4-.2-.4-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2002" d="M140.6 222.5c-.2 0-.4-.1-.4-.3 0-.2.1-.4.3-.4.2 0 .4.1.4.3 0 .2-.1.3-.3.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2004" d="M142.2 160.6c.2 0 .3.2.3.4s-.2.3-.3.3h-.1c-.2 0-.3-.2-.3-.4s.2-.3.4-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2006" d="M142.3 287.8s-.1 0 0 0c-.2 0-.3 0-.3-.1s0-.2.1-.3c.1 0 .2 0 .3.1.1.1 0 .3-.1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2008" d="M142.2 234.6s-.1 0 0 0c-.2 0-.4-.1-.4-.3 0-.2.1-.4.3-.4.2 0 .4.1.4.3 0 .2-.1.4-.3.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2010" d="M144 148.8c0-.2.2-.3.4-.3s.3.2.3.4-.2.3-.3.3h-.1c-.3 0-.4-.2-.3-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2012" d="M144.4 246.7s-.1 0 0 0c-.2 0-.4-.1-.4-.3 0-.2.1-.4.3-.4.2 0 .4.1.4.3 0 .2-.1.3-.3.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2014" d="M145.5 326.9h-.2v-.2h.2c.1.1.1.1 0 .2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2016" d="M145.6 68.3c0 .1-.1.1 0 0-.1.1-.1.1-.2 0-.1 0-.1-.1 0-.2 0-.1.1-.1.2 0v.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2018" d="M146.6 96.1c.1-.1.2-.2.3-.1.1.1.2.2.1.3 0 .1-.1.1-.2.1h-.1c-.1 0-.1-.2-.1-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2020" d="M147 299.1c-.1 0-.1.1 0 0-.2 0-.3 0-.3-.1s0-.3.1-.3.2 0 .3.1c0 .2 0 .3-.1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2022" d="M147.2 258.6s-.1 0 0 0c-.2 0-.4-.1-.4-.2 0-.2.1-.4.2-.4.2 0 .4.1.4.2.1.2 0 .4-.2.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2024" d="M150.2 125c.1-.2.2-.3.4-.2.2.1.3.2.2.4 0 .1-.2.2-.3.2h-.1c-.2 0-.3-.2-.2-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2026" d="M150.7 270.4h-.1c-.1 0-.3-.1-.3-.2-.1-.2 0-.4.2-.4.2-.1.4 0 .4.2 0 .1-.1.3-.2.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2028" d="M151.8 337.4c-.1 0-.1.1 0 0h-.2v-.2h.2v.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2030" d="M151.8 57.8c-.1.1-.1 0-.2 0s-.1-.1 0-.2c0-.1.1-.1.2 0 0 0 .1.1 0 .2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2032" d="M151.9 85c.1-.1.2-.2.3-.1.1.1.2.2.1.3 0 .1-.1.1-.2.1h-.1c-.1-.1-.2-.2-.1-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2034" d="M152.2 310.2h-.1c-.1 0-.2 0-.2-.1-.1-.1 0-.3.1-.3.1-.1.2 0 .3.1 0 .1 0 .3-.1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2036" d="M152.6 191.5c0-.3.2-.4.5-.4s.4.2.4.5c0 .2-.2.4-.5.4-.2 0-.4-.3-.4-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2038" d="M153.1 204.2c-.2 0-.5-.2-.5-.4 0-.3.2-.5.4-.5.3 0 .5.2.5.4 0 .3-.2.5-.4.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2040" d="M153.8 216.5c-.1 0-.1 0 0 0-.3 0-.5-.2-.5-.4 0-.3.2-.5.4-.5.3 0 .5.2.5.4s-.2.5-.4.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2042" d="M154.2 113.4c.1-.2.3-.3.4-.2.2.1.3.3.2.4-.1.1-.2.2-.3.2h-.1c-.2 0-.2-.2-.2-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2044" d="M154.7 282h-.1c-.1 0-.3-.1-.3-.2-.1-.2 0-.4.2-.4.2-.1.4 0 .4.2.1.1 0 .3-.2.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2046" d="M155.1 228.7c-.3 0-.5-.2-.5-.4 0-.3.1-.5.4-.5.3 0 .5.1.5.4 0 .2-.1.4-.4.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2048" d="M157.1 154.5c.2 0 .4.3.4.5s-.2.4-.4.4h-.1c-.2 0-.4-.3-.4-.5 0-.3.2-.4.5-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2050" d="M157.5 240.2c0 .2-.1.5-.4.5h-.1c-.2 0-.4-.1-.4-.4 0-.2.1-.5.4-.5.2 0 .4.2.5.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2052" d="M157.9 321h-.1c-.1 0-.2 0-.2-.1-.1-.1 0-.2.1-.3.1-.1.3 0 .3.1.1.1 0 .3-.1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2054" d="M158 74.3c0 .1-.1.1-.2.1h-.1c-.1-.1-.1-.2-.1-.3.1-.1.2-.1.3-.1.2.1.2.2.1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2056" d="M158.5 347.7c-.1 0-.1 0 0 0h-.2v-.2h.2v.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2058" d="M158.5 47.5c-.1.1-.1 0-.2 0s-.1-.1 0-.2c0-.1.1-.1.2 0s.1.1 0 .2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2060" d="M159.3 293.3h-.1c-.1 0-.3-.1-.3-.2-.1-.2 0-.4.2-.4.2-.1.4 0 .4.2.1.1 0 .3-.2.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2062" d="M159.8 252.7h-.1c-.2 0-.4-.1-.4-.3-.1-.2.1-.5.3-.6.2-.1.5.1.6.3 0 .3-.1.6-.4.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2064" d="M162.5 131c.1-.2.3-.4.6-.3.2.1.4.3.3.6-.1.2-.2.3-.4.3h-.1c-.3-.1-.5-.3-.4-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2066" d="M163.1 264.5h-.1c-.2 0-.4-.1-.4-.3-.1-.2.1-.5.3-.6.2-.1.5.1.6.3 0 .3-.1.6-.4.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2068" d="M164.2 331.6h-.1c-.1 0-.1 0-.2-.1s0-.2.1-.3c.1-.1.2 0 .3.1 0 .1 0 .2-.1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2070" d="M164.5 304.4h-.2c-.1 0-.2-.1-.3-.2-.1-.2 0-.4.1-.5.2-.1.4 0 .5.1.2.3.1.5-.1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2072" d="M165.6 357.6c0 .1 0 .1 0 0h-.2v-.2h.2c.1.1.1.2 0 .2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2074" d="M165.7 37.5c0 .1 0 .1 0 0h-.2s-.1-.1 0-.2c0 0 .1-.1.2 0 0 .1.1.2 0 .2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2076" d="M166.1 197.7c0-.3.3-.6.6-.6s.6.3.6.6-.3.6-.6.6c-.4-.1-.6-.3-.6-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2078" d="M167.1 276.2h-.2c-.2 0-.4-.1-.4-.3-.1-.2 0-.5.3-.6.2-.1.5 0 .6.3.1.2 0 .5-.3.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2080" d="M167 210.5c-.3 0-.6-.2-.6-.5s.2-.6.5-.6.6.2.6.5c.1.3-.2.6-.5.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2082" d="M168.6 222.1c0 .3-.2.6-.5.6h-.1c-.3 0-.5-.2-.6-.5 0-.3.2-.6.5-.6.4-.1.7.2.7.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2084" d="M169.9 160.5c.3.1.5.3.5.7 0 .3-.3.5-.6.5h-.1c-.3-.1-.5-.3-.5-.7.1-.4.4-.6.7-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2086" d="M169.3 234.4c-.1-.3.2-.6.5-.7.3-.1.6.2.7.5.1.3-.1.6-.5.7h-.1c-.3 0-.6-.2-.6-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2088" d="M170.3 315.2h-.2c-.1 0-.2-.1-.3-.2-.1-.2 0-.4.1-.5.2-.1.4 0 .5.1.2.3.1.5-.1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2090" d="M170.5 80.4c-.1.1-.2.2-.3.2h-.2c-.2-.1-.2-.3-.1-.5s.3-.2.5-.1c.1 0 .2.2.1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2092" d="M171 341.8h-.1c-.1 0-.1 0-.2-.1s0-.2.1-.3c.1-.1.2 0 .3.1 0 .1 0 .2-.1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2094" d="M171.7 287.5h-.2c-.2 0-.3-.1-.4-.3-.1-.2 0-.5.2-.6.2-.1.5 0 .6.2.2.3 0 .6-.2.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2096" d="M172.4 246.9h-.1c-.3 0-.5-.2-.6-.4-.1-.3.1-.6.4-.7.3-.1.6.1.7.4.1.3-.1.6-.4.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2098" d="M173.2 367.2c0 .1 0 .1 0 0h-.2v-.2h.2c.1.1.1.2 0 .2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2100" d="M173.3 27.9s0 .1 0 0h-.2s-.1-.1 0-.2c0 0 .1-.1.2 0s.1.2 0 .2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2102" d="M174.9 137c.1-.3.4-.5.7-.4.3.1.5.4.4.7-.1.2-.3.4-.5.4h-.2c-.3-.1-.5-.4-.4-.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2104" d="M175.6 258.7h-.2c-.2 0-.5-.2-.5-.4-.1-.3.1-.6.4-.7.3-.1.6.1.7.4.1.3-.1.6-.4.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2106" d="M176.7 325.7c-.1 0-.1.1-.2.1s-.2-.1-.3-.2c-.1-.2-.1-.4.1-.5.2-.1.4-.1.5.1.1.2 0 .4-.1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2108" d="M176.9 298.6h-.2c-.2 0-.3-.1-.4-.3-.1-.2 0-.5.2-.6.2-.1.5 0 .6.2.1.3.1.6-.2.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2110" d="M178.2 351.6h-.1c-.1 0-.1 0-.2-.1s-.1-.2 0-.3c.1-.1.2-.1.3 0 .2.2.2.4 0 .4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2112" d="M178.4 43.7c0 .1-.1.1-.2.1h-.1c-.1-.1-.1-.2 0-.3.1-.1.2-.1.3 0v.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2114" d="M178.7 125.3c.1-.3.4-.4.7-.3.3.1.4.4.3.7-.1.2-.3.4-.5.4h-.2c-.2-.2-.4-.5-.3-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2116" d="M179.5 270.3h-.2c-.2 0-.4-.1-.5-.4-.1-.3 0-.6.3-.7.3-.1.6 0 .7.3.2.4 0 .7-.3.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2118" d="M180.4 204.4c-.4 0-.7-.3-.7-.7 0-.4.3-.7.7-.7.4 0 .7.3.7.7 0 .4-.3.7-.7.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2120" d="M181.3 376.5h-.2v-.2h.2c.1.1 0 .1 0 .2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2122" d="M181.2 216.7c-.4 0-.7-.3-.7-.6 0-.4.2-.7.6-.7.4 0 .7.2.7.6.1.3-.2.6-.6.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2124" d="M182.8 166.4c.4.1.6.4.6.8-.1.3-.3.6-.7.6h-.1c-.4-.1-.6-.4-.6-.8s.4-.7.8-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2126" d="M182.6 227.5c.4-.1.7.2.8.6.1.4-.2.7-.6.8h-.1c-.3 0-.6-.2-.7-.6-.1-.4.2-.8.6-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2128" d="M182.7 309.4c-.1 0-.1.1-.2.1-.2 0-.3-.1-.4-.2-.1-.2 0-.5.2-.6.2-.1.5 0 .6.2.1.1.1.3-.2.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2130" d="M183.5 335.8c-.1 0-.1.1-.2.1s-.2-.1-.3-.1c-.1-.2-.1-.4.1-.5.2-.1.4-.1.5.1.1.1.1.3-.1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2132" d="M184.1 281.7h-.2c-.2 0-.4-.1-.5-.3-.1-.3 0-.6.3-.7.3-.1.6 0 .7.3.1.3 0 .6-.3.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2134" d="M185.1 240.9h-.1c-.3 0-.6-.2-.7-.5-.1-.4.1-.7.5-.8.4-.1.7.1.8.5.1.3-.1.7-.5.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2136" d="m186 361.2-.1.1c-.1 0-.1 0-.2-.1s-.1-.2 0-.3c.1-.1.2-.1.3 0 .1 0 .1.2 0 .3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2138" d="M186.1 34.2c0 .1-.1.1-.2.1s-.1 0-.1-.1c-.1-.1-.1-.2 0-.3.1-.1.2-.1.3 0 .1 0 .1.2 0 .3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2140" d="M187.3 142.9c.1-.4.5-.6.8-.5.4.1.6.5.5.8-.1.3-.4.5-.7.5h-.2c-.3 0-.5-.4-.4-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2142" d="M188.1 252.8h-.2c-.3 0-.6-.2-.7-.5-.1-.4.1-.7.5-.8.4-.1.7.1.8.5.2.3 0 .7-.4.8z" class="epichains_svg__st0" style="display:inline"/>
+  <path id="epichains_svg__path2144" d="M189.1 319.8c-.1 0-.2.1-.2.1-.1 0-.3-.1-.4-.2-.1-.2-.1-.5.1-.6.2-.1.5-.1.6.1.2.2.2.5-.1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2146" d="M189.3 292.8c-.1 0-.2.1-.3.1-.2 0-.4-.1-.5-.3-.1-.3 0-.6.3-.8.3-.1.6 0 .8.3.1.2 0 .5-.3.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2148" d="M190.9 345.6c-.1.1-.1.1-.2.1s-.2 0-.3-.1c-.1-.1-.1-.4.1-.5.1-.1.4-.1.5.1.1.1.1.3-.1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2150" d="M191.9 264.4h-.2c-.3 0-.5-.2-.6-.5-.1-.4.1-.7.4-.9.4-.1.7.1.9.4.1.5-.1.9-.5 1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2152" d="M193.2 197.6c0-.4.4-.8.8-.8s.8.4.8.8-.4.8-.8.8c-.5 0-.8-.3-.8-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2154" d="M194.3 209.1c.4 0 .8.3.8.7 0 .4-.3.8-.7.8h-.1c-.4 0-.8-.3-.8-.7.1-.4.4-.8.8-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2156" d="M194.2 370.3s-.1.1-.2.1-.1 0-.2-.1-.1-.2 0-.3c.1-.1.2-.1.3 0 .2.1.2.2.1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2158" d="M195.2 303.5c-.1.1-.2.1-.3.1-.2 0-.4-.1-.5-.3-.2-.3-.1-.6.2-.8.3-.2.6-.1.8.2.1.3 0 .7-.2.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2160" d="M196.1 329.9c-.1.1-.2.1-.3.1-.1 0-.3-.1-.4-.2-.1-.2-.1-.5.1-.6.2-.1.5-.1.6.1.3.2.2.5 0 .6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2162" d="M196.5 275.8c-.1 0-.2.1-.3.1-.3 0-.5-.2-.6-.4-.1-.3 0-.7.4-.9.3-.1.7 0 .9.4.1.3-.1.7-.4.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2164" d="M197.8 235h-.2c-.4 0-.7-.3-.8-.6-.1-.4.2-.8.6-.9.4-.1.8.2.9.6.2.4-.1.8-.5.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2166" d="M198.8 355c-.1.1-.1.1-.2.1s-.2 0-.3-.1c-.1-.1-.1-.4 0-.5.1-.1.4-.1.5 0 .1.2.1.4 0 .5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2168" d="M198.9 40.5c-.1.1-.2.1-.3.1-.1 0-.2 0-.2-.1-.1-.1-.2-.3 0-.5.1-.1.3-.2.5 0 .1.1.1.3 0 .5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2170" d="M199.7 148.9c.1-.4.6-.7 1-.6.4.1.7.6.6 1-.1.4-.4.6-.8.6h-.2c-.5-.1-.7-.6-.6-1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2172" d="M199.7 246.3c-.1-.4.1-.9.6-1 .4-.1.9.1 1 .6.1.5-.1.9-.6 1h-.2c-.4 0-.7-.2-.8-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2174" d="M201.6 313.9c-.1.1-.2.1-.3.1-.2 0-.4-.1-.5-.3-.2-.3-.1-.6.2-.8.3-.2.6-.1.8.2.2.3.1.7-.2.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2176" d="M201.7 286.9c-.1 0-.2.1-.3.1-.3 0-.5-.1-.6-.4-.2-.3 0-.7.3-.9.3-.2.7 0 .9.3.1.3 0 .7-.3.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2178" d="M203.6 339.6c-.1.1-.2.1-.3.1-.1 0-.3-.1-.4-.2-.2-.2-.1-.5.1-.6.2-.2.5-.1.6.1.2.2.2.5 0 .6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2180" d="M204.4 258.6h-.3c-.3 0-.6-.2-.8-.5-.1-.4.1-.9.5-1 .4-.1.9.1 1 .5.2.4 0 .9-.4 1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2182" d="M206.6 97.7c.2-.3.6-.4.9-.2.3.2.4.6.2.9-.1.2-.4.3-.6.3-.1 0-.2 0-.3-.1-.3-.1-.4-.6-.2-.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2184" d="M207.1 364c-.1.1-.1.1-.2.1s-.2 0-.2-.1c-.1-.1-.1-.4 0-.5.1-.1.3-.1.5 0 .1.2.1.4-.1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2186" d="M207.6 297.6c-.1.1-.2.1-.3.1-.2 0-.5-.1-.6-.3-.2-.3-.1-.7.2-.9.3-.2.7-.1.9.2.2.3.1.8-.2.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2188" d="M208.7 324c-.1.1-.2.1-.3.1-.2 0-.3-.1-.5-.2-.2-.3-.1-.6.1-.8.3-.2.6-.1.8.1.2.2.2.6-.1.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2190" d="M208.8 270c-.1 0-.2.1-.3.1-.3 0-.6-.2-.7-.5-.2-.4 0-.9.4-1 .4-.2.9 0 1 .4.2.4 0 .8-.4 1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2192" d="M208.7 216.9h-.1c-.5 0-.9-.3-.9-.8-.1-.5.3-.9.8-1 .5-.1 1 .3 1 .8.1.5-.3 1-.8 1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2194" d="M210.4 229.1c-.4 0-.8-.3-.9-.7-.1-.5.2-1 .7-1.1.5-.1 1 .2 1.1.7.1.5-.2 1-.7 1.1h-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2196" d="M211.6 348.9c-.1.1-.2.1-.3.1-.1 0-.2-.1-.3-.1-.2-.2-.2-.5 0-.6.2-.2.5-.2.6 0 .2.1.2.4 0 .6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2198" d="M211.7 46.8c-.1.1-.2.1-.3.1-.1 0-.2 0-.3-.1-.2-.2-.2-.5 0-.6.2-.2.5-.2.6 0 .1.2.1.4 0 .6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2200" d="M212.1 154.9c.1-.5.6-.8 1.1-.7.5.1.8.6.7 1.1-.1.4-.5.7-.9.7h-.2c-.5-.1-.8-.6-.7-1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2202" d="M212.1 240.4c-.1-.5.2-1 .7-1.1.5-.1 1 .2 1.1.7.1.5-.2 1-.7 1.1h-.2c-.4-.1-.7-.3-.9-.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2204" d="M214.1 308c-.1.1-.3.1-.4.1-.2 0-.4-.1-.6-.3-.2-.3-.1-.7.2-1 .3-.2.7-.1 1 .2.2.4.1.8-.2 1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2206" d="M214 281.1c-.1.1-.2.1-.4.1-.3 0-.6-.2-.7-.4-.2-.4 0-.9.4-1.1.4-.2.9 0 1.1.3.2.4 0 .9-.4 1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2208" d="M215.5 61.7c.2-.2.6-.3.8-.1.2.2.3.6.1.8-.1.1-.3.2-.4.2-.1 0-.3 0-.4-.1-.3-.2-.3-.6-.1-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2210" d="M216.4 333.6c-.1.1-.2.1-.4.1s-.3-.1-.4-.2c-.2-.2-.2-.6.1-.8.2-.2.6-.2.8.1.1.2.1.6-.1.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2212" d="M216.8 252.8h-.3c-.4 0-.7-.2-.9-.6-.2-.5.1-1 .6-1.2.5-.2 1 .1 1.2.6.1.5-.1 1-.6 1.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2214" d="M218.9 103.6c.2-.4.7-.5 1.1-.3.4.2.5.7.3 1.1-.1.3-.4.4-.7.4-.1 0-.3 0-.4-.1-.4-.2-.6-.7-.3-1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2216" d="M219.2 290.4c.4-.2.9-.1 1.1.3.2.4.1.9-.3 1.1-.1.1-.3.1-.4.1-.3 0-.5-.1-.7-.4-.2-.3-.1-.8.3-1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2218" d="M220.1 357.7c-.1.1-.2.1-.3.1-.1 0-.2 0-.3-.1-.2-.2-.2-.5 0-.6.2-.2.5-.2.6 0 .2.2.2.4 0 .6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2220" d="M221.3 317.9c-.1.1-.3.1-.4.1-.2 0-.4-.1-.5-.3-.2-.3-.2-.7.1-1 .3-.2.7-.2 1 .1.2.5.1.9-.2 1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2222" d="M221.2 264.2c-.1 0-.2.1-.4.1-.4 0-.7-.2-.8-.6-.2-.5 0-1 .5-1.2.5-.2 1 0 1.2.5.2.5 0 1-.5 1.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2224" d="M221.8 210.9c-.5 0-1-.4-1-.9 0-.6.4-1.1.9-1.1.6 0 1.1.4 1.1.9.1.6-.4 1.1-1 1.1.1 0 0 0 0 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2226" d="M223.2 223.1c-.5 0-.9-.4-1-.9-.1-.6.3-1.1.8-1.2.6-.1 1.1.3 1.2.8.1.6-.3 1.1-.9 1.2 0 .1 0 .1-.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2228" d="M224.5 342.7c-.1.1-.3.2-.4.2-.2 0-.3-.1-.4-.2-.2-.2-.2-.6 0-.8.2-.2.6-.2.8 0 .3.2.3.6 0 .8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2230" d="M224.6 234.3c-.1-.6.2-1.1.8-1.2.6-.1 1.1.2 1.2.8.1.6-.2 1.1-.8 1.2h-.2c-.4 0-.9-.3-1-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2232" d="M225.2 274.9c-.2-.4 0-1 .4-1.2.4-.2 1-.1 1.2.4.2.4 0 1-.4 1.2-.1.1-.3.1-.4.1-.3 0-.6-.2-.8-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2234" d="M226.6 302.1c-.1.1-.3.1-.4.1-.3 0-.5-.1-.7-.3-.3-.4-.2-.9.2-1.1.4-.3.9-.2 1.1.2.3.4.2.9-.2 1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2236" d="M227.9 149c.2-.5.8-.8 1.3-.7.5.2.8.8.7 1.3-.1.4-.5.7-1 .7h-.3c-.6-.2-.9-.7-.7-1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2238" d="M229.1 327.4c-.1.1-.3.2-.4.2-.2 0-.4-.1-.5-.2-.2-.3-.2-.7.1-1 .3-.2.7-.2 1 .1.1.2.1.7-.2.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2240" d="M229.3 246.9h-.3c-.4 0-.8-.3-1-.7-.2-.5.1-1.1.7-1.3.5-.2 1.1.1 1.3.7.1.5-.2 1.1-.7 1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2242" d="M231.1 109.5c.3-.4.8-.6 1.3-.3.4.3.6.8.3 1.2-.2.3-.5.4-.8.4-.2 0-.3 0-.5-.1-.4-.2-.5-.7-.3-1.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2244" d="M231.2 285.7c-.3-.4-.1-1 .3-1.3.4-.3 1-.1 1.3.3.3.4.1 1-.3 1.3-.1.1-.3.1-.5.1-.3 0-.6-.2-.8-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2246" d="M233.2 351.4c-.1.1-.3.2-.4.2-.1 0-.3-.1-.4-.2-.2-.2-.2-.6 0-.8.2-.2.6-.2.8 0 .2.2.2.5 0 .8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2248" d="M234 311.9c-.1.1-.3.2-.5.2s-.5-.1-.6-.3c-.3-.3-.2-.8.1-1.1.3-.3.8-.2 1.1.1.3.4.2.9-.1 1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2250" d="M234 203.8c0-.6.5-1.2 1.1-1.2.6 0 1.2.5 1.2 1.1 0 .6-.5 1.2-1.1 1.2-.7 0-1.2-.5-1.2-1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2252" d="M235 216.1c-.1-.6.4-1.2 1-1.3.6-.1 1.2.4 1.3 1 .1.6-.4 1.2-1 1.3h-.1c-.6 0-1.1-.4-1.2-1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2254" d="M237.5 336.4c-.1.1-.3.2-.5.2s-.4-.1-.5-.2c-.3-.3-.3-.7 0-1 .3-.3.7-.3 1 0 .2.3.2.7 0 1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2256" d="M239.2 296.2c-.2.1-.3.2-.5.2-.3 0-.6-.1-.7-.4-.3-.4-.2-1 .2-1.3.4-.3 1-.2 1.3.2.2.4.1 1-.3 1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2258" d="M240.3 155c.2-.6.8-.9 1.4-.7.6.2.9.8.7 1.4-.2.5-.6.8-1.1.8-.1 0-.2 0-.3-.1-.6-.2-.9-.8-.7-1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2260" d="M241.9 321.3c-.2.1-.3.2-.5.2s-.4-.1-.6-.3c-.3-.3-.3-.8.1-1.1.3-.3.8-.3 1.1.1.3.3.3.8-.1 1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2262" d="M241.7 241c-.1 0-.2.1-.3.1-.5 0-.9-.3-1.1-.8-.2-.6.1-1.2.7-1.4.6-.2 1.2.1 1.4.7.3.6-.1 1.2-.7 1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2264" d="M243.4 115.4c.3-.5.9-.6 1.4-.3.5.3.6.9.3 1.4-.2.3-.5.5-.9.5-.2 0-.4 0-.5-.1-.4-.4-.6-1-.3-1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2266" d="M244.3 280.2c-.3 0-.7-.2-.9-.5-.3-.5-.1-1.1.3-1.4.5-.3 1.1-.1 1.4.3.3.5.1 1.1-.3 1.4-.1.2-.3.2-.5.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2268" d="M246.4 344.8c-.1.2-.3.2-.5.2s-.3-.1-.5-.2c-.3-.3-.3-.7-.1-1 .3-.3.7-.3 1-.1.3.4.3.9.1 1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2270" d="M246.7 305.9c-.2.1-.4.2-.6.2-.3 0-.5-.1-.7-.3-.3-.4-.3-1 .1-1.3.4-.3 1-.3 1.3.1.3.4.3 1-.1 1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2272" d="M247.4 197.6c0-.7.6-1.3 1.3-1.3.7 0 1.3.6 1.3 1.3 0 .7-.6 1.3-1.3 1.3-.7 0-1.3-.6-1.3-1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2274" d="M248 210c-.1-.7.4-1.3 1.1-1.4.7-.1 1.3.4 1.4 1.1.1.7-.4 1.3-1.1 1.4h-.1c-.7 0-1.2-.5-1.3-1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2276" d="M250.5 330.1c-.2.2-.4.2-.6.2-.2 0-.4-.1-.6-.2-.3-.3-.3-.8 0-1.1.3-.3.8-.3 1.1 0 .4.2.4.7.1 1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2278" d="M250.1 261.6c.6-.3 1.2-.1 1.5.5.3.6.1 1.2-.5 1.5-.2.1-.3.1-.5.1-.4 0-.8-.2-1-.6-.2-.5 0-1.2.5-1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2280" d="M251.8 290.2c-.2.1-.4.2-.6.2-.3 0-.6-.1-.8-.4-.3-.5-.2-1.1.2-1.4.5-.3 1.1-.2 1.4.2.3.5.2 1.1-.2 1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2282" d="M251.2 223.2h-.2c-.6 0-1.1-.4-1.2-1-.1-.7.3-1.3 1-1.5.7-.1 1.3.3 1.5 1 0 .8-.4 1.4-1.1 1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2284" d="M252.6 161c.2-.7.9-1 1.6-.8.7.2 1 .9.8 1.5-.2.6-.7.9-1.2.9h-.3c-.7-.3-1.1-1-.9-1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2286" d="M252.7 234.3c-.2-.7.2-1.4.8-1.6.7-.2 1.4.2 1.6.8.2.7-.2 1.4-.8 1.6-.1 0-.2.1-.4.1-.6 0-1.1-.4-1.2-.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2288" d="M254.8 315.1c-.2.2-.4.2-.6.2-.2 0-.5-.1-.7-.3-.3-.4-.3-.9 0-1.3.4-.3.9-.3 1.3 0 .4.4.4 1 0 1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2290" d="M255.7 121.3c.3-.5 1-.7 1.6-.4.5.3.7 1 .4 1.6-.2.3-.6.5-1 .5-.2 0-.4-.1-.6-.2-.6-.2-.7-.9-.4-1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2292" d="M257.3 274.2c-.2.1-.4.2-.6.2-.4 0-.7-.2-1-.5-.3-.5-.2-1.2.4-1.6.5-.3 1.2-.2 1.6.4.3.5.2 1.2-.4 1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2294" d="M258.1 298.3c.4-.4 1.1-.3 1.4.1.4.4.3 1.1-.1 1.4-.2.2-.4.2-.7.2-.3 0-.6-.1-.8-.4-.3-.3-.2-1 .2-1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2296" d="M320.3 177.8c-1-.3-2 .2-2.3 1.2-.3 1 .2 2 1.2 2.3.2.1.4.1.5.1.8 0 1.5-.5 1.7-1.3.4-1-.2-2-1.1-2.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2298" d="M326.2 226.1c-.5-.9-1.6-1.2-2.5-.7-.9.5-1.2 1.6-.7 2.5.3.6 1 .9 1.6.9.3 0 .6-.1.9-.2.9-.5 1.2-1.6.7-2.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2300" d="M330.2 157c-.7.8-.6 1.9.2 2.6.3.3.7.4 1.1.4v-3.6c-.5 0-1 .2-1.3.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2302" d="M323 167.3c-.5.9-.2 2 .7 2.5.3.2.6.2.9.2.6 0 1.3-.3 1.6-.9.5-.9.2-2-.7-2.5-.9-.5-2-.1-2.5.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2304" d="M317.1 201.9c-1 .1-1.7 1-1.6 2 .1.9.9 1.6 1.8 1.6h.2c1-.1 1.7-1 1.6-2-.1-.9-1-1.7-2-1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2306" d="M317.5 189.7c-1-.1-1.9.6-2 1.6-.1 1 .6 1.9 1.6 2h.2c.9 0 1.7-.7 1.8-1.6.1-1-.6-1.9-1.6-2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2308" d="M330.3 238.2c.3.4.8.6 1.2.6v-3.6c-.4 0-.7.2-1.1.4-.7.7-.8 1.9-.1 2.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2310" d="M319.2 214c-1 .3-1.5 1.3-1.2 2.3.2.8 1 1.3 1.7 1.3.2 0 .4 0 .5-.1 1-.3 1.5-1.3 1.2-2.3-.2-1-1.2-1.5-2.2-1.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2312" d="M303.3 199.3c.9 0 1.7-.8 1.7-1.7 0-.9-.8-1.7-1.7-1.7-.9 0-1.7.8-1.7 1.7 0 .9.8 1.7 1.7 1.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2314" d="M313.8 232.1c-.4-.8-1.5-1.1-2.3-.7-.8.4-1.1 1.5-.7 2.3.3.6.9.9 1.5.9.3 0 .6-.1.8-.2.8-.4 1.1-1.5.7-2.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2316" d="M305.7 173c-.3.9.2 1.9 1.1 2.2.2.1.4.1.6.1.7 0 1.4-.4 1.6-1.2.3-.9-.2-1.9-1.1-2.2-.9-.3-1.9.2-2.2 1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2318" d="M306 209.6c-.2-.9-1-1.6-2-1.4-.9.2-1.6 1-1.4 2 .1.8.9 1.4 1.7 1.4h.3c1-.2 1.6-1.1 1.4-2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2320" d="M309 221.2c-.3-.9-1.3-1.4-2.2-1.1-.9.3-1.4 1.3-1.1 2.2.2.7.9 1.2 1.6 1.2.2 0 .4 0 .6-.1.9-.4 1.4-1.3 1.1-2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2322" d="M317.9 153.4c.3.2.7.4 1 .4.5 0 1-.2 1.3-.7.6-.7.4-1.8-.3-2.4-.7-.6-1.8-.4-2.4.3-.5.7-.4 1.8.4 2.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2324" d="M302.6 185.1c-.2.9.5 1.8 1.4 2h.3c.8 0 1.5-.6 1.7-1.4.2-.9-.5-1.8-1.4-2-.9-.1-1.8.5-2 1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2326" d="M327.2 144.7c.4 0 .8-.1 1.2-.5.7-.6.7-1.7.1-2.4s-1.7-.7-2.4-.1-.7 1.7-.1 2.4c.3.4.8.6 1.2.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2328" d="M310.7 161.5c-.4.8-.1 1.9.7 2.3.3.1.5.2.8.2.6 0 1.2-.3 1.5-.9.4-.8.1-1.9-.7-2.3-.8-.5-1.8-.1-2.3.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2330" d="M326.1 251c-.6.7-.6 1.8.1 2.4.3.3.7.4 1.2.4s.9-.2 1.3-.6c.6-.7.6-1.8-.1-2.4-.8-.5-1.9-.5-2.5.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2332" d="M320.1 244.5c.7-.6.9-1.7.3-2.4-.6-.7-1.7-.9-2.4-.3-.7.6-.9 1.7-.3 2.4.3.4.8.7 1.3.7.4 0 .7-.1 1.1-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2334" d="M314.3 138.1c.4 0 .8-.2 1.1-.5.6-.6.6-1.6 0-2.3-.6-.6-1.6-.6-2.3 0-.6.6-.6 1.6 0 2.3.4.4.8.5 1.2.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2336" d="M293.1 215.5c-.2-.9-1-1.4-1.9-1.2-.9.2-1.4 1-1.2 1.9.2.8.8 1.3 1.6 1.3h.3c.9-.2 1.4-1.1 1.2-2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2338" d="M323.4 129.9c.3 0 .7-.1 1-.3.7-.5.8-1.5.3-2.2-.5-.7-1.5-.8-2.2-.3-.7.5-.8 1.5-.3 2.2.3.4.7.6 1.2.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2340" d="M289.8 202.2c-.9.1-1.5.8-1.5 1.7.1.8.8 1.5 1.6 1.5h.1c.9-.1 1.5-.8 1.5-1.7 0-.9-.9-1.6-1.7-1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2342" d="M290 189.9c-.9-.1-1.6.6-1.7 1.5-.1.9.6 1.6 1.5 1.7h.1c.8 0 1.5-.6 1.6-1.5 0-.9-.6-1.7-1.5-1.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2344" d="M305.5 248c-.7.5-.9 1.5-.3 2.2.3.4.8.7 1.3.7.3 0 .7-.1.9-.3.7-.5.9-1.5.3-2.2-.5-.8-1.5-1-2.2-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2346" d="M293.4 167c-.3.8.1 1.7 1 2 .2.1.4.1.5.1.7 0 1.3-.4 1.5-1 .3-.8-.1-1.7-1-2-.8-.4-1.7.1-2 .9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2348" d="M301.4 238.1c-.4-.8-1.4-1.1-2.2-.7-.8.4-1.1 1.4-.7 2.2.3.5.8.8 1.4.8.3 0 .5-.1.7-.2.9-.3 1.2-1.3.8-2.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2350" d="M291.2 180.9h.3c.7 0 1.4-.5 1.6-1.3.2-.9-.4-1.7-1.2-1.9-.9-.2-1.7.4-1.9 1.2-.2 1 .4 1.8 1.2 2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2352" d="M315.5 257.5c-.6-.6-1.6-.7-2.2 0-.6.6-.7 1.6 0 2.3.3.3.7.5 1.1.5.4 0 .8-.1 1.1-.4.6-.7.6-1.7 0-2.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2354" d="M299.2 157.7c.2.1.5.2.7.2.6 0 1.1-.3 1.4-.8.4-.8.1-1.7-.7-2.2-.8-.4-1.7-.1-2.2.7-.3.8 0 1.7.8 2.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2356" d="M296.4 227.1c-.3-.8-1.2-1.3-2-1-.8.3-1.3 1.2-1 2 .2.6.8 1 1.5 1 .2 0 .4 0 .5-.1.9-.1 1.4-1.1 1-1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2358" d="M324.4 265.6c-.7-.5-1.7-.4-2.2.3-.5.7-.4 1.7.3 2.2.3.2.6.3 1 .3.5 0 .9-.2 1.3-.6.5-.7.3-1.7-.4-2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2360" d="M305.5 147.2c.3.2.6.3.9.3.5 0 1-.2 1.3-.7.5-.7.4-1.7-.4-2.2-.7-.5-1.7-.4-2.2.4-.5.7-.3 1.7.4 2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2362" d="M292.7 139.1c-.5.7-.3 1.6.4 2.1.3.2.5.3.8.3.5 0 .9-.2 1.2-.6.5-.7.3-1.6-.4-2.1-.6-.6-1.5-.4-2 .3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2364" d="M310.2 123c.3 0 .7-.1 1-.4.6-.5.7-1.5.1-2.1-.5-.6-1.5-.7-2.1-.1-.6.5-.7 1.5-.1 2.1.2.4.7.5 1.1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2366" d="M319.8 115.5c.3 0 .6-.1.8-.3.7-.5.8-1.4.4-2.1-.5-.7-1.4-.8-2.1-.4-.7.5-.8 1.4-.4 2.1.4.5.8.7 1.3.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2368" d="M302.7 263.9c-.5-.6-1.5-.7-2.1-.1-.6.5-.7 1.5-.1 2.1.3.3.7.5 1.1.5.3 0 .7-.1 1-.4.5-.5.6-1.5.1-2.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2370" d="M276.6 208.4c-.8.1-1.4.8-1.3 1.6.1.8.7 1.3 1.5 1.3h.2c.8-.1 1.4-.8 1.3-1.6-.2-.8-.9-1.4-1.7-1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2372" d="M283.1 160.2c-.8-.3-1.6.1-1.9.9-.3.8.1 1.6.9 1.9.2.1.4.1.5.1.6 0 1.2-.4 1.4-1 .2-.8-.2-1.6-.9-1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2374" d="M276.9 183.9c-.8-.1-1.5.5-1.6 1.3-.1.8.5 1.5 1.3 1.6h.2c.7 0 1.4-.6 1.5-1.3 0-.8-.6-1.5-1.4-1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2376" d="M282.1 232.2c-.8.3-1.1 1.1-.9 1.9.2.6.8 1 1.4 1 .2 0 .4 0 .5-.1.8-.3 1.1-1.1.9-1.9-.3-.8-1.2-1.2-1.9-.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2378" d="M278.6 174.7h.4c.7 0 1.3-.5 1.4-1.1.2-.8-.3-1.6-1.1-1.8-.8-.2-1.6.3-1.8 1.1-.2.8.3 1.6 1.1 1.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2380" d="M293.1 254.1c-.7.5-.8 1.4-.4 2.1.3.4.7.6 1.2.6.3 0 .6-.1.8-.3.7-.5.8-1.4.4-2.1-.4-.6-1.3-.8-2-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2382" d="M329.6 106.3c-.7.4-1 1.3-.6 2 .3.5.8.8 1.3.8.2 0 .5-.1.7-.2.2-.1.4-.3.5-.5v-1.7c-.4-.5-1.2-.7-1.9-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2384" d="M286.9 151.7c.2.1.5.2.7.2.5 0 1-.3 1.3-.8.4-.7.1-1.6-.6-2-.7-.4-1.6-.1-2 .6-.4.7-.1 1.6.6 2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2386" d="M329.1 286.8c-.4.7-.1 1.6.6 2 .2.1.5.2.7.2.4 0 .8-.2 1.1-.5v-1.9l-.4-.4c-.8-.4-1.7-.1-2 .6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2388" d="M288.3 246.1c.7-.4 1-1.3.6-2s-1.3-1-2-.6-1 1.3-.6 2c.3.5.8.8 1.3.8.3 0 .5-.1.7-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2390" d="M280.4 221.6c-.2-.8-1-1.3-1.8-1.1-.8.2-1.3 1-1.1 1.8.2.7.8 1.1 1.4 1.1h.4c.8-.3 1.3-1.1 1.1-1.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2392" d="M320.7 279.9c-.7-.5-1.6-.3-2.1.4s-.3 1.6.4 2.1c.3.2.6.3.8.3.5 0 .9-.2 1.2-.6.6-.8.4-1.7-.3-2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2394" d="M309.1 272.6c-.5.6-.5 1.5.1 2.1.3.3.6.4 1 .4s.8-.2 1.1-.5c.5-.6.5-1.5-.1-2.1-.6-.6-1.5-.5-2.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2396" d="M301.5 131.7c.4 0 .8-.2 1.1-.5.5-.6.5-1.5-.1-2.1-.6-.5-1.5-.5-2.1.1-.5.6-.5 1.5.1 2.1.3.3.6.4 1 .4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2398" d="M274.5 197.6c0 .8.7 1.5 1.5 1.5s1.5-.7 1.5-1.5-.7-1.5-1.5-1.5-1.5.7-1.5 1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2400" d="M262.4 192.8s.1 0 0 0c.8 0 1.4-.6 1.4-1.3 0-.8-.5-1.4-1.3-1.4s-1.4.5-1.4 1.3.6 1.4 1.3 1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2402" d="M266 168.6h.4c.6 0 1.2-.4 1.3-1 .2-.7-.2-1.5-1-1.7-.7-.2-1.5.2-1.7 1-.2.8.3 1.5 1 1.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2404" d="M280.8 260.2c-.6.4-.8 1.3-.4 1.9.3.4.7.6 1.1.6.3 0 .5-.1.8-.2.6-.4.8-1.3.4-1.9-.4-.7-1.3-.9-1.9-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2406" d="M317.2 294.1c-.6-.4-1.5-.2-1.9.4-.4.6-.2 1.5.4 1.9.2.1.5.2.7.2.5 0 .9-.2 1.2-.6.4-.6.2-1.5-.4-1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2408" d="M289.7 272.2c.6-.5.7-1.3.2-1.9-.5-.6-1.3-.7-1.9-.2-.6.5-.7 1.3-.2 1.9.3.3.7.5 1 .5.4 0 .7-.1.9-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2410" d="M306.3 108.3c.3 0 .6-.1.8-.3.6-.5.7-1.3.2-1.9-.5-.6-1.3-.7-1.9-.2-.6.5-.7 1.3-.2 1.9.3.3.7.5 1.1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2412" d="M305.3 287.4c-.5.6-.4 1.5.2 1.9.3.2.5.3.8.3.4 0 .8-.2 1.1-.5.5-.6.4-1.5-.2-1.9-.5-.5-1.4-.4-1.9.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2414" d="M316.4 101.3c.3 0 .5-.1.7-.2.6-.4.8-1.2.4-1.9-.4-.6-1.2-.8-1.9-.4-.6.4-.8 1.2-.4 1.9.3.3.7.6 1.2.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2416" d="M297.1 116.4c.3 0 .7-.1 1-.4.5-.5.6-1.4 0-1.9-.5-.5-1.4-.6-1.9 0-.5.5-.6 1.4 0 1.9.2.3.6.4.9.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2418" d="M280.3 133.2c-.4.6-.2 1.5.4 1.9.2.2.5.2.8.2.4 0 .9-.2 1.1-.6.4-.6.2-1.5-.4-1.9-.6-.4-1.4-.3-1.9.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2420" d="M274.6 145.7c.2.1.4.2.6.2.5 0 1-.3 1.2-.7.3-.7.1-1.5-.6-1.8-.7-.3-1.5-.1-1.8.6-.3.5 0 1.3.6 1.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2422" d="M298.1 279.2c-.5-.5-1.4-.5-1.9 0s-.5 1.4 0 1.9c.3.3.6.4 1 .4s.7-.1 1-.4c.5-.6.5-1.4-.1-1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2424" d="M288.8 125.4c.4 0 .8-.2 1.1-.5.5-.6.4-1.4-.2-1.9-.6-.5-1.4-.4-1.9.2-.5.6-.4 1.4.2 1.9.2.2.5.3.8.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2426" d="M262.6 205.1c.8 0 1.3-.7 1.3-1.4 0-.8-.7-1.3-1.4-1.3-.8 0-1.3.7-1.3 1.4s.6 1.3 1.4 1.3c-.1 0-.1 0 0 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2428" d="M264 177.9c-.7-.1-1.4.4-1.6 1.1-.1.7.4 1.4 1.1 1.6h.2c.7 0 1.2-.5 1.3-1.1.2-.7-.3-1.4-1-1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2430" d="M263.6 214.6c-.7.1-1.3.8-1.1 1.6.1.7.7 1.1 1.3 1.1h.2c.7-.1 1.3-.8 1.1-1.6-.1-.7-.8-1.2-1.5-1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2432" d="M266.7 229.2c.7-.2 1.2-1 1-1.7-.2-.7-.9-1.2-1.7-1-.7.2-1.2.9-1 1.7.2.6.7 1 1.3 1 .2.1.3.1.4 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2434" d="M270.2 240.9c.2 0 .3 0 .5-.1.7-.3 1.1-1.1.8-1.8-.3-.7-1.1-1-1.8-.8-.7.3-1 1.1-.8 1.8.3.6.8.9 1.3.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2436" d="M275.9 251.9c.7-.3.9-1.2.6-1.8-.3-.7-1.2-.9-1.8-.6-.7.3-.9 1.2-.6 1.8.2.5.7.7 1.2.7.2.1.4 0 .6-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2438" d="M270.7 154.4c-.7-.3-1.5.1-1.8.8-.3.7.1 1.5.8 1.8.2.1.3.1.5.1.5 0 1.1-.3 1.3-.9.2-.8-.1-1.6-.8-1.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2440" d="M262.3 139.7c.2.1.4.1.6.1.5 0 .9-.2 1.1-.7.3-.6.1-1.4-.5-1.7-.6-.3-1.4-.1-1.7.5-.3.7-.1 1.5.5 1.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2442" d="M253.5 162.5h.3c.5 0 1-.4 1.2-.9.2-.7-.2-1.4-.8-1.5-.7-.2-1.4.2-1.6.8-.2.7.2 1.4.9 1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2444" d="M275.5 276.3c-.5.4-.6 1.2-.2 1.8.2.3.6.5 1 .5.3 0 .5-.1.8-.3.5-.4.6-1.2.2-1.8-.5-.5-1.3-.6-1.8-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2446" d="M302.7 93.9c.3 0 .5-.1.7-.2.6-.4.7-1.2.3-1.7-.4-.6-1.2-.7-1.7-.3-.6.4-.7 1.2-.3 1.7.2.3.6.5 1 .5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2448" d="M257.9 246.8c.2 0 .3 0 .5-.1.6-.3 1-1 .7-1.6-.3-.6-1-1-1.6-.7-.6.3-1 1-.7 1.6.1.5.6.8 1.1.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2450" d="M250.7 174.5h.2c.6 0 1.1-.4 1.2-1 .1-.7-.3-1.3-1-1.5-.7-.1-1.3.3-1.5 1 0 .7.4 1.3 1.1 1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2452" d="M257.8 151c.5 0 1-.3 1.2-.8.3-.6-.1-1.4-.7-1.6-.6-.3-1.4.1-1.6.7-.3.6.1 1.4.7 1.6.1 0 .3.1.4.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2454" d="M276.2 119.2c.4 0 .7-.2 1-.5.4-.5.3-1.3-.2-1.8-.5-.4-1.3-.3-1.8.2-.4.5-.3 1.3.2 1.8.2.2.5.3.8.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2456" d="M283.4 285.6c-.5.5-.5 1.3 0 1.8.2.3.6.4.9.4.3 0 .6-.1.9-.3.5-.5.5-1.3 0-1.8s-1.3-.5-1.8-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2458" d="M293.1 101.4c.3 0 .6-.1.8-.3.5-.5.6-1.2.1-1.8-.5-.5-1.2-.6-1.8-.1-.5.5-.6 1.2-.1 1.8.3.3.6.4 1 .4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2460" d="M268.4 129c.2.1.4.2.7.2.4 0 .8-.2 1.1-.6.4-.6.2-1.4-.4-1.7-.6-.4-1.4-.2-1.7.4-.5.5-.3 1.3.3 1.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2462" d="M284.2 109.9c.3 0 .7-.1.9-.4.5-.5.5-1.3 0-1.8s-1.3-.5-1.8 0-.5 1.3 0 1.8c.3.3.6.4.9.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2464" d="M250.7 220.8c-.7.1-1.1.8-1 1.5.1.6.6 1 1.2 1h.2c.7-.1 1.1-.8 1-1.5 0-.7-.7-1.1-1.4-1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2466" d="M249.4 211.1c.7-.1 1.2-.7 1.1-1.4-.1-.7-.7-1.2-1.4-1.1-.7.1-1.2.7-1.1 1.4.1.6.6 1.1 1.2 1.1h.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2468" d="M249.9 197.6c0-.7-.6-1.3-1.3-1.3-.7 0-1.3.6-1.3 1.3 0 .7.6 1.3 1.3 1.3.7 0 1.3-.6 1.3-1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2470" d="M254.2 235.1c.7-.2 1-.9.8-1.6-.2-.7-.9-1-1.6-.8-.7.2-1 .9-.8 1.6.2.5.7.9 1.2.9.2 0 .3 0 .4-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2472" d="M301.8 301.9c-.4.6-.3 1.3.3 1.7.2.2.5.2.7.2.4 0 .8-.2 1-.5.4-.6.3-1.3-.3-1.7-.5-.4-1.3-.3-1.7.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2474" d="M249.1 186.6h.1c.6 0 1.2-.5 1.2-1.1.1-.7-.4-1.3-1.1-1.4-.7-.1-1.3.4-1.4 1.1 0 .7.5 1.4 1.2 1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2476" d="M292.2 294.2c-.5.5-.4 1.3.1 1.8.2.2.5.3.8.3.3 0 .7-.1.9-.4.4-.5.4-1.3-.1-1.8-.4-.5-1.2-.4-1.7.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2478" d="M263 257.9c.2 0 .4 0 .6-.1.6-.3.9-1.1.5-1.7-.3-.6-1.1-.9-1.7-.5-.6.3-.9 1.1-.5 1.7.2.4.6.6 1.1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2480" d="M270.2 266.6c-.4-.6-1.1-.7-1.7-.4-.6.4-.7 1.1-.4 1.7.2.4.6.6 1.1.6.2 0 .5-.1.7-.2.5-.3.7-1.1.3-1.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2482" d="M241 156.4c.1 0 .2.1.3.1.5 0 .9-.3 1.1-.8.2-.6-.1-1.2-.7-1.4-.6-.2-1.2.1-1.4.7-.2.6.1 1.2.7 1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2484" d="M257.7 272.7c-.3-.5-1-.7-1.6-.4-.5.3-.7 1-.4 1.6.2.3.6.5 1 .5.2 0 .4-.1.6-.2.6-.3.7-1 .4-1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2486" d="M241.1 238.8c-.6.2-.9.8-.7 1.4.1.5.6.8 1.1.8.1 0 .2 0 .3-.1.6-.2.9-.8.7-1.4-.2-.5-.8-.9-1.4-.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2488" d="M250.7 263.7c.2 0 .4 0 .5-.1.6-.3.8-1 .5-1.5-.3-.6-1-.8-1.5-.5-.6.3-.8 1-.5 1.5.1.4.5.6 1 .6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2490" d="M256.1 122.9c.2.1.4.2.6.2.4 0 .7-.2 1-.5.3-.5.2-1.2-.4-1.6-.5-.3-1.2-.2-1.6.4-.3.5-.2 1.2.4 1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2492" d="M236 180.4h.1c.6 0 1.1-.4 1.1-1 .1-.6-.4-1.2-1-1.3-.6-.1-1.2.4-1.3 1 0 .6.5 1.2 1.1 1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2494" d="M245.5 144.9c.4 0 .9-.3 1.1-.7.2-.6 0-1.2-.6-1.5-.6-.2-1.2 0-1.5.6-.2.6 0 1.2.6 1.5.1.1.2.1.4.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2496" d="M238 168.3h.2c.5 0 1-.4 1.1-.9.1-.6-.3-1.2-.9-1.4-.6-.1-1.2.3-1.4.9 0 .7.4 1.3 1 1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2498" d="M237.3 215.8c-.1-.6-.6-1.1-1.3-1-.6.1-1.1.7-1 1.3.1.6.6 1 1.1 1h.1c.7-.1 1.2-.6 1.1-1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2500" d="M245.5 252.6c.1 0 .3 0 .4-.1.6-.2.9-.9.6-1.5-.2-.6-.9-.9-1.5-.6-.6.2-.9.9-.6 1.5.3.4.7.7 1.1.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2502" d="M235.2 204.9c.6 0 1.1-.6 1.1-1.2s-.6-1.1-1.2-1.1-1.1.6-1.1 1.2.5 1.1 1.2 1.1c-.1 0 0 0 0 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2504" d="M250.1 133.6c.2.1.3.1.5.1.4 0 .8-.2 1-.6.3-.6.1-1.2-.5-1.5-.6-.3-1.2-.1-1.5.5-.3.6-.1 1.2.5 1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2506" d="M271.4 103.5c.3 0 .6-.1.8-.4.4-.5.4-1.2-.1-1.6-.5-.4-1.2-.4-1.6.1-.4.5-.4 1.2.1 1.6.3.2.5.3.8.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2508" d="M280 94.7c.3 0 .6-.1.8-.3.5-.4.5-1.1 0-1.6-.4-.5-1.1-.5-1.6 0s-.5 1.1-.1 1.6c.3.2.6.3.9.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2510" d="M289.3 86.7c.2 0 .5-.1.7-.2.5-.4.6-1.1.2-1.6-.4-.5-1.1-.6-1.6-.2-.5.4-.6 1.1-.2 1.6.2.3.6.4.9.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2512" d="M272.3 292.1c-.4-.5-1.1-.5-1.6-.1s-.5 1.1-.1 1.6c.2.2.5.4.8.4.3 0 .5-.1.8-.3.5-.5.6-1.2.1-1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2514" d="M262.7 111.2c-.4.5-.3 1.2.2 1.6.2.2.4.2.7.2.3 0 .7-.2.9-.5.4-.5.3-1.2-.2-1.6-.5-.3-1.2-.3-1.6.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2516" d="M288.5 308.9c-.4.5-.3 1.2.2 1.6.2.2.5.2.7.2.3 0 .7-.1.9-.4.4-.5.3-1.2-.2-1.6-.5-.4-1.2-.3-1.6.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2518" d="M239.4 227.8c-.1-.6-.7-1-1.4-.9-.6.1-1 .7-.9 1.4.1.5.6.9 1.1.9h.2c.7-.2 1.1-.8 1-1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2520" d="M263 282.4c-.5.4-.6 1.1-.2 1.6.2.3.6.5.9.5.2 0 .5-.1.7-.2.5-.4.6-1.1.2-1.6-.4-.5-1.1-.6-1.6-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2522" d="M235.1 192.6c.7 0 1.2-.5 1.2-1.1 0-.6-.5-1.2-1.1-1.2-.6 0-1.2.5-1.2 1.1 0 .7.4 1.2 1.1 1.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2524" d="M279.2 300.8c-.4.5-.4 1.2.1 1.6.2.2.5.3.8.3.3 0 .6-.1.8-.4.4-.5.4-1.2-.1-1.6-.4-.4-1.1-.3-1.6.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2526" d="M276 79.8c.2 0 .5-.1.7-.2.4-.4.5-1 .1-1.4-.4-.4-1-.5-1.4-.1-.4.4-.5 1-.1 1.4.2.2.4.3.7.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2528" d="M275.3 315.7c-.4.4-.3 1.1.1 1.4.2.2.4.2.7.2.3 0 .6-.1.8-.4.4-.4.3-1.1-.1-1.4-.5-.3-1.1-.2-1.5.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2530" d="M225.9 235.1c.6-.1.9-.7.8-1.2-.1-.6-.7-.9-1.2-.8-.6.1-.9.7-.8 1.2.1.5.5.8 1 .8h.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2532" d="M228.6 244.9c-.5.2-.8.8-.7 1.3.1.4.5.7 1 .7h.3c.5-.2.8-.8.7-1.3-.2-.6-.7-.8-1.3-.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2534" d="M228.6 150.3h.3c.4 0 .8-.3 1-.7.2-.5-.1-1.1-.7-1.3-.5-.2-1.1.1-1.3.7-.2.6.1 1.1.7 1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2536" d="M224.2 221.9c-.1-.6-.6-.9-1.2-.8-.6.1-.9.6-.8 1.2.1.5.5.9 1 .9h.2c.5-.2.9-.7.8-1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2538" d="M243.7 116.8c.2.1.4.1.5.1.3 0 .7-.2.9-.5.3-.5.1-1.1-.3-1.4-.4-.3-1.1-.1-1.4.3-.3.6-.1 1.2.3 1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2540" d="M259.4 299.8c.4-.4.5-1 .1-1.4-.4-.4-1-.5-1.4-.1-.4.4-.5 1-.1 1.4.2.2.5.4.8.4.2 0 .4-.1.6-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2542" d="M245.2 278.7c-.3-.5-.9-.6-1.4-.3-.5.3-.6.9-.3 1.4.2.3.5.5.9.5.2 0 .4 0 .5-.1.5-.4.6-1 .3-1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2544" d="M266.4 307.3c-.4.4-.4 1 0 1.4.2.2.5.3.7.3.3 0 .5-.1.7-.3.4-.4.4-1 0-1.4-.4-.4-1-.4-1.4 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2546" d="M250.6 288.6c-.5.3-.6 1-.2 1.4.2.3.5.4.8.4.2 0 .4-.1.6-.2.5-.3.6-1 .2-1.4-.3-.5-1-.6-1.4-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2548" d="M222.8 209.8c0-.6-.5-1-1.1-.9-.6 0-1 .5-.9 1.1 0 .5.5.9 1 .9h.1c.5 0 1-.5.9-1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2550" d="M233.2 258.4c.1 0 .3 0 .4-.1.5-.2.8-.8.6-1.3-.2-.5-.8-.8-1.3-.6-.5.2-.8.8-.6 1.3.1.5.5.7.9.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2552" d="M237.9 267.6c-.5.3-.7.9-.4 1.4.2.4.5.6.9.6.2 0 .3 0 .5-.1.5-.3.7-.9.4-1.4-.3-.5-.9-.7-1.4-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2554" d="M220.8 185.3c0 .6.4 1.1.9 1.1h.1c.5 0 1-.4 1-.9 0-.6-.4-1.1-.9-1.1-.6-.1-1.1.3-1.1.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2556" d="M250.5 106.6c.2.1.4.2.6.2.3 0 .6-.1.8-.4.3-.5.2-1.1-.2-1.4-.5-.3-1.1-.2-1.4.2-.4.4-.3 1.1.2 1.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2558" d="M258.7 97.2c.3 0 .6-.1.8-.4.4-.4.3-1.1-.1-1.4-.4-.4-1.1-.3-1.4.1-.4.4-.3 1.1.1 1.4.1.2.4.3.6.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2560" d="M233.1 138.8c.4 0 .8-.2.9-.6.2-.5 0-1.1-.6-1.3-.5-.2-1.1 0-1.3.6-.2.5 0 1.1.6 1.3h.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2562" d="M223 174.2h.2c.5 0 .9-.4 1-.9.1-.6-.3-1.1-.9-1.2-.6-.1-1.1.3-1.2.9 0 .6.4 1.1.9 1.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2564" d="M237.8 127.6c.1.1.3.1.5.1.4 0 .7-.2.9-.6.3-.5.1-1.1-.4-1.4-.5-.3-1.1-.1-1.4.4-.3.6-.1 1.2.4 1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2566" d="M220.3 197.6c0 .6.5 1 1 1s1-.5 1-1c0-.6-.5-1-1-1s-1 .5-1 1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2568" d="M225.3 162.1h.2c.5 0 .9-.3 1-.8.1-.6-.2-1.1-.8-1.2-.6-.1-1.1.2-1.2.8 0 .6.3 1.1.8 1.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2570" d="M267 88.2c.3 0 .5-.1.7-.3.4-.4.4-1 0-1.4-.4-.4-1-.4-1.4 0-.4.4-.4 1 0 1.4.2.1.5.3.7.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2572" d="M253.6 313.7c-.4.3-.4.9 0 1.3.2.2.4.3.7.3.2 0 .4-.1.6-.2.4-.3.4-.9 0-1.3s-.9-.4-1.3-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2574" d="M254.2 81.7c.2 0 .5-.1.7-.3.3-.4.3-.9 0-1.3-.4-.3-.9-.3-1.3 0-.3.4-.3.9 0 1.3.1.2.3.3.6.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2576" d="M231.4 110.8c.1.1.3.1.5.1.3 0 .6-.2.8-.4.3-.4.1-1-.3-1.2-.4-.3-1-.1-1.3.3-.2.4-.1.9.3 1.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2578" d="M238.1 294.7c-.4.3-.5.9-.2 1.3.2.3.5.4.7.4.2 0 .4-.1.5-.2.4-.3.5-.9.2-1.3-.2-.4-.7-.5-1.2-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2580" d="M245.5 304.5c-.4.3-.4.9-.1 1.3.2.2.4.3.7.3.2 0 .4-.1.6-.2.4-.3.4-.9.1-1.3-.3-.4-.9-.4-1.3-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2582" d="M221.2 131.1c-.5-.2-1 0-1.2.5-.2.5 0 1 .5 1.2.1 0 .2.1.4.1.4 0 .7-.2.8-.6.1-.5-.1-1-.5-1.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2584" d="M215.6 143.1c-.2.5.1 1 .6 1.2h.3c.4 0 .7-.2.9-.6.2-.5-.1-1-.6-1.1-.5-.2-1 0-1.2.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2586" d="M238.6 100.6c.3 0 .6-.1.7-.4.3-.4.2-1-.2-1.3-.4-.3-1-.2-1.3.2-.3.4-.2 1 .2 1.3.2.2.4.2.6.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2588" d="M232.7 284.7c-.3-.4-.8-.6-1.3-.3-.4.3-.6.8-.3 1.3.2.3.5.4.8.4.2 0 .3 0 .5-.1.5-.3.6-.9.3-1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2590" d="M212.8 156h.2c.4 0 .8-.3.9-.7.1-.5-.2-1-.7-1.1-.5-.1-1 .2-1.1.7-.1.5.2 1 .7 1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2592" d="M246 90.9c.3 0 .5-.1.7-.3.3-.4.3-1-.1-1.3-.4-.3-1-.3-1.3.1-.3.4-.3 1 .1 1.3.2.1.4.2.6.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2594" d="M262.3 322.4c-.3.4-.3.9.1 1.3.2.2.4.2.6.2.2 0 .5-.1.7-.3.3-.4.3-.9-.1-1.3-.4-.3-1-.3-1.3.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2596" d="M211.3 228c-.1-.5-.6-.8-1.1-.7-.5.1-.8.6-.7 1.1.1.4.5.7.9.7h.2c.5-.1.8-.6.7-1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2598" d="M208.5 215.1c-.5.1-.9.5-.8 1 0 .5.4.8.9.8h.1c.5-.1.9-.5.8-1 0-.5-.5-.8-1-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2600" d="M216.2 251c-.5.2-.7.7-.6 1.2.1.4.5.6.9.6h.3c.5-.2.7-.7.6-1.2-.2-.5-.7-.7-1.2-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2602" d="M262.9 73.1c.2 0 .4-.1.6-.2.4-.3.4-.9.1-1.3-.3-.4-.9-.4-1.3-.1-.4.3-.4.9-.1 1.3.2.2.5.3.7.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2604" d="M226.4 275.3c.4-.2.6-.8.4-1.2-.2-.4-.8-.6-1.2-.4-.4.2-.6.8-.4 1.2.2.3.5.5.8.5.2 0 .3-.1.4-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2606" d="M271.9 330.1s-.1 0-.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2608" d="M225.5 121.6c.1.1.3.1.4.1.3 0 .7-.2.8-.5.2-.4 0-1-.4-1.2-.4-.2-1-.1-1.2.4-.2.4 0 .9.4 1.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2610" d="M220.5 262.5c-.5.2-.7.7-.5 1.2.1.3.5.6.8.6.1 0 .2 0 .4-.1.5-.2.7-.7.5-1.2-.2-.5-.7-.7-1.2-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2612" d="M213.3 241c.5-.1.8-.6.7-1.1-.1-.5-.6-.8-1.1-.7-.5.1-.8.6-.7 1.1.1.4.5.7.9.7h.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2614" d="M207.7 179.2c-.1.5.3.9.8 1h.1c.5 0 .9-.3.9-.8.1-.5-.3-.9-.8-1-.5-.1-.9.3-1 .8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2616" d="M207.8 204.7c.5 0 .9-.4.9-.9s-.4-.9-.9-.9-.9.4-.9.9.4.9.9.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2618" d="M207.8 192.4c.5 0 .9-.4.9-.9s-.4-.9-.9-.9-.9.4-.9.9c-.1.5.3.9.9.9-.1 0-.1 0 0 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2620" d="M210.2 168h.2c.4 0 .8-.3.9-.7.1-.5-.2-1-.7-1.1-.5-.1-1 .2-1.1.7-.1.6.2 1 .7 1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2622" d="M240.9 320.1c-.3.3-.4.8-.1 1.1.2.2.4.3.6.3.2 0 .4-.1.5-.2.3-.3.3-.8.1-1.1-.3-.4-.8-.4-1.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2624" d="M203.9 257.1c-.4.1-.6.6-.5 1 .1.3.4.5.8.5h.3c.4-.1.6-.6.5-1-.3-.4-.7-.7-1.1-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2626" d="M213.3 115.5c.1.1.2.1.4.1.3 0 .6-.2.7-.4.2-.4 0-.9-.3-1.1-.4-.2-.9 0-1.1.3-.3.4-.1.9.3 1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2628" d="M207.7 125.6c-.2.4 0 .9.4 1 .1 0 .2.1.3.1.3 0 .6-.2.7-.5.2-.4 0-.9-.4-1-.3-.2-.8 0-1 .4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2630" d="M213.3 279.7c-.4.2-.6.7-.4 1.1.1.3.4.4.7.4.1 0 .2 0 .4-.1.4-.2.5-.7.3-1.1-.1-.4-.6-.5-1-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2632" d="M194.8 197.6c0-.4-.4-.8-.8-.8s-.8.4-.8.8.4.8.8.8.8-.3.8-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2634" d="M203.3 137.2c-.1.4.1.9.5 1h.3c.3 0 .6-.2.8-.5.1-.4-.1-.9-.5-1-.5-.2-.9 0-1.1.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2636" d="M194.3 186.2s.1 0 0 0c.5 0 .8-.3.8-.7 0-.4-.3-.8-.7-.8-.4 0-.8.3-.8.7 0 .4.3.7.7.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2638" d="M249.9 66.4c.2 0 .4-.1.6-.2.3-.3.3-.8 0-1.1-.3-.3-.8-.3-1.1 0-.3.3-.3.8 0 1.1.1.1.3.2.5.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2640" d="M233.4 84.6c.2 0 .5-.1.6-.3.3-.3.2-.8-.1-1.1-.3-.3-.8-.2-1.1.1-.3.3-.2.8.1 1.1.2.2.4.2.5.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2642" d="M226.2 94.5c.3 0 .5-.1.7-.3.3-.4.2-.9-.2-1.1-.4-.3-.9-.2-1.1.2-.3.4-.2.9.2 1.1 0 .1.2.1.4.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2644" d="M197.5 161.8h.2c.4 0 .7-.3.8-.6.1-.4-.2-.8-.6-.9-.4-.1-.8.2-.9.6-.3.4 0 .9.5.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2646" d="M208.2 268.5c-.4.2-.6.6-.4 1 .1.3.4.5.7.5.1 0 .2 0 .3-.1.4-.2.6-.6.4-1-.1-.3-.6-.5-1-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2648" d="M233 310.7c-.3.3-.4.8-.1 1.1.2.2.4.3.6.3.2 0 .3-.1.5-.2.3-.3.4-.8.1-1.1-.3-.3-.8-.4-1.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2650" d="M201.2 245.9c-.1-.4-.6-.7-1-.6-.4.1-.7.6-.6 1 .1.4.4.6.8.6h.2c.5-.1.8-.6.6-1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2652" d="M225.7 300.8c-.4.3-.5.7-.2 1.1.2.2.4.3.7.3.2 0 .3 0 .4-.1.4-.3.5-.7.2-1.1-.2-.4-.7-.5-1.1-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2654" d="M249.4 328.9c-.3.3-.3.8 0 1.1.2.1.4.2.6.2.2 0 .4-.1.6-.2.3-.3.3-.8 0-1.1-.4-.3-.9-.3-1.2 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2656" d="M200.2 149.9h.2c.4 0 .7-.2.8-.6.1-.4-.1-.9-.6-1-.4-.1-.9.1-1 .6 0 .4.2.9.6 1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2658" d="M219.1 104.7c.1.1.3.1.4.1.3 0 .5-.1.7-.4.2-.4.1-.9-.3-1.1-.4-.2-.9-.1-1.1.3-.2.4 0 .9.3 1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2660" d="M194.4 210.7c.5 0 .8-.4.8-.8s-.4-.8-.8-.7c-.4 0-.8.4-.7.8-.1.4.3.7.7.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2662" d="M196.4 173.3c.1-.4-.3-.8-.7-.9-.4-.1-.8.3-.9.7-.1.4.3.8.7.9h.1c.4 0 .7-.3.8-.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2664" d="M194.8 222.2c.1.4.4.7.8.7h.1c.4-.1.7-.5.7-.9-.1-.4-.5-.7-.9-.7-.4.1-.7.5-.7.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2666" d="M220 291.8c.4-.2.5-.7.3-1.1-.2-.4-.7-.5-1.1-.3-.4.2-.5.7-.3 1.1.1.2.4.4.7.4.1 0 .3 0 .4-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2668" d="M197.5 233.4c-.4.1-.7.5-.6.9.1.4.4.6.8.6h.2c.4-.1.7-.5.6-.9-.2-.4-.6-.7-1-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2670" d="M258.4 337.2c-.2.2-.2.5-.1.8l1.3-.8s0-.1-.1-.1c-.3-.3-.8-.2-1.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2672" d="M241.4 75.3c.2 0 .4-.1.6-.3.3-.3.3-.8-.1-1.1-.3-.3-.8-.3-1.1.1-.3.3-.3.8.1 1.1.1.1.3.2.5.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2674" d="M194.8 173.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2676" d="M181.1 215.3c-.4 0-.7.4-.6.7 0 .4.3.6.7.6h.1c.4 0 .7-.4.6-.7-.1-.3-.4-.6-.8-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2678" d="M181.1 203.7c0-.4-.3-.7-.7-.7-.4 0-.7.3-.7.7 0 .4.3.7.7.7.4 0 .7-.3.7-.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2680" d="M182.7 228.8h.1c.4-.1.6-.4.6-.8-.1-.4-.4-.6-.8-.6-.4.1-.6.4-.6.8.1.4.3.6.7.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2682" d="M184.8 239.6c-.4.1-.6.4-.5.8.1.3.4.5.7.5h.1c.4-.1.6-.4.5-.8-.1-.4-.4-.6-.8-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2684" d="M228.7 68.9c.2 0 .4-.1.5-.2.2-.3.2-.7-.1-1-.3-.2-.7-.2-1 .1-.2.3-.2.7.1 1 .1 0 .3.1.5.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2686" d="M187.7 143.8h.2c.3 0 .6-.2.7-.5.1-.4-.1-.7-.5-.8-.4-.1-.7.1-.8.5-.1.3.1.7.4.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2688" d="M191.5 263.1c-.4.1-.5.5-.4.9.1.3.4.5.6.5h.2c.4-.1.5-.5.4-.9-.1-.4-.5-.6-.8-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2690" d="M237 59.9c.2 0 .4-.1.5-.2.3-.3.3-.7 0-1-.3-.3-.7-.3-1 0-.3.3-.3.7 0 1 .1.1.3.2.5.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2692" d="M188.6 251.9c-.1-.4-.5-.6-.8-.5-.4.1-.6.5-.5.8.1.3.4.5.7.5h.2c.3 0 .5-.4.4-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2694" d="M213.4 306.9c-.3.2-.4.6-.2 1 .1.2.3.3.6.3.1 0 .3 0 .4-.1.3-.2.4-.6.2-1-.3-.4-.7-.4-1-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2696" d="M213.7 88.3c.2 0 .4-.1.6-.3.2-.3.1-.7-.2-.9-.3-.2-.7-.1-.9.2-.2.3-.1.7.2 1h.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2698" d="M184.2 154.9c-.1.4.2.7.5.8h.1c.3 0 .6-.2.7-.5.1-.4-.2-.7-.5-.8-.3-.1-.7.1-.8.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2700" d="M236.5 335.4c-.3.3-.3.7 0 1 .1.1.3.2.5.2s.3-.1.5-.2c.3-.3.3-.7 0-1-.3-.2-.7-.2-1 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2702" d="M220.5 78.2c.1.1.3.1.4.1.2 0 .4-.1.5-.3.2-.3.2-.7-.1-1-.3-.2-.7-.2-1 .1-.2.5-.1.9.2 1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2704" d="M181.1 179.9c.4 0 .7-.3.7-.6 0-.4-.2-.7-.6-.7-.4 0-.7.2-.7.6-.1.3.2.7.6.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2706" d="M195.9 274.6c-.3.1-.5.6-.4.9.1.3.4.4.6.4.1 0 .2 0 .3-.1.3-.1.5-.6.4-.9-.1-.3-.5-.5-.9-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2708" d="M180.4 192.2c.4 0 .7-.3.7-.7 0-.4-.3-.7-.7-.7-.4 0-.7.3-.7.7 0 .3.3.7.7.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2710" d="M201 109.5c.1 0 .2.1.3.1.2 0 .5-.1.6-.4.2-.3 0-.7-.3-.9-.3-.2-.7 0-.9.3-.1.3 0 .7.3.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2712" d="M245.3 343.9c-.3.3-.2.7.1 1 .1.1.3.2.5.2s.4-.1.5-.2c.3-.3.2-.7 0-1-.4-.3-.8-.3-1.1 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2714" d="M195.5 119.7c-.1.3 0 .7.4.9.1 0 .2.1.3.1.3 0 .5-.2.6-.4.1-.3 0-.7-.4-.9-.3-.2-.7 0-.9.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2716" d="M207.8 296.7c-.2-.3-.6-.4-.9-.2-.3.2-.4.6-.2.9.1.2.4.3.6.3.1 0 .2 0 .3-.1.3-.1.4-.6.2-.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2718" d="M201 285.7c-.3.2-.5.6-.3.9.1.2.4.4.6.4.1 0 .2 0 .3-.1.3-.2.5-.6.3-.9-.1-.3-.5-.5-.9-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2720" d="M191.4 132.1h.2c.3 0 .5-.2.6-.5.1-.4-.1-.7-.4-.9-.4-.1-.7.1-.9.4 0 .5.2.8.5 1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2722" d="M206.9 98.7c.1.1.2.1.3.1.2 0 .5-.1.6-.3.2-.3.1-.7-.2-.9-.3-.2-.7-.1-.9.2-.3.2-.2.7.2.9z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2724" d="M228.2 326.4c-.3.2-.3.7-.1 1 .1.2.3.2.5.2s.3-.1.4-.2c.3-.2.3-.7.1-1-.2-.2-.6-.3-.9 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2726" d="M182.6 167.7h.1c.3 0 .6-.2.7-.6.1-.4-.2-.7-.6-.8-.4-.1-.7.2-.8.6-.1.4.2.8.6.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2728" d="M220.5 316.9c-.3.2-.4.7-.1 1 .1.2.3.3.5.3.1 0 .3 0 .4-.1.3-.2.4-.7.1-1-.2-.4-.6-.5-.9-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2730" d="M223.8 341.9c-.2.2-.2.6 0 .8.1.1.3.2.4.2.1 0 .3-.1.4-.2.2-.2.2-.6 0-.8-.3-.2-.6-.2-.8 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2732" d="M224.1 53.4c.2 0 .3-.1.4-.2.2-.2.2-.6 0-.8-.2-.2-.6-.2-.8 0-.2.2-.2.6 0 .8.1.2.3.2.4.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2734" d="M232.4 44.6c.1.1.3.2.4.2.1 0 .3 0 .4-.2l.1-.1-1-.6c-.1.2-.1.5.1.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2736" d="M208.3 72.2c.2 0 .3-.1.5-.2.2-.3.1-.6-.1-.8-.3-.2-.6-.1-.8.1-.2.3-.1.6.1.8.1.1.2.1.3.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2738" d="M179.1 269.3c-.3.1-.4.4-.3.7.1.2.3.4.5.4h.2c.3-.1.4-.4.3-.7 0-.4-.4-.5-.7-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2740" d="M215.9 62.6c.2 0 .3-.1.4-.2.2-.2.2-.6-.1-.8-.2-.2-.6-.2-.8.1-.2.2-.2.6.1.8.2.1.3.1.4.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2742" d="M215.6 332.7c-.2.2-.3.6-.1.8.1.1.3.2.4.2.1 0 .3 0 .4-.1.2-.2.3-.6.1-.8-.2-.3-.5-.3-.8-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2744" d="M208 323.1c-.3.2-.3.5-.1.8.1.1.3.2.5.2.1 0 .2 0 .3-.1.3-.2.3-.5.1-.8-.2-.3-.5-.3-.8-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2746" d="M201 82.2c.1.1.2.1.3.1.2 0 .4-.1.5-.3.2-.3.1-.6-.2-.8-.3-.2-.6-.1-.8.2-.2.3-.1.6.2.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2748" d="M195.4 302.8c-.2-.3-.5-.4-.8-.2-.3.2-.4.5-.2.8.1.2.3.3.5.3.1 0 .2 0 .3-.1.2-.2.3-.6.2-.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2750" d="M183.3 113.9c-.1.3 0 .6.3.7h.2c.2 0 .4-.1.5-.3.1-.3 0-.6-.3-.7-.3-.2-.6 0-.7.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2752" d="M175.3 137.7h.2c.2 0 .5-.2.5-.4.1-.3-.1-.6-.4-.7-.3-.1-.6.1-.7.4-.1.3.1.6.4.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2754" d="M172.2 245.7c-.3.1-.5.4-.4.7.1.3.3.4.6.4h.1c.3-.1.5-.4.4-.7-.1-.2-.4-.4-.7-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2756" d="M188.8 291.8c-.3.1-.4.5-.3.8.1.2.3.3.5.3.1 0 .2 0 .3-.1.3-.1.4-.5.3-.8-.2-.3-.5-.4-.8-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2758" d="M194.5 92.6c.1.1.2.1.3.1.2 0 .4-.1.5-.3.2-.3.1-.6-.2-.8-.3-.2-.6-.1-.8.2-.1.3 0 .7.2.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2760" d="M183.6 280.6c-.3.1-.4.5-.3.7.1.2.3.3.5.3h.2c.3-.1.4-.5.3-.7-.1-.2-.4-.4-.7-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2762" d="M188.7 103.5c.1 0 .2.1.3.1.2 0 .4-.1.5-.3.1-.3 0-.6-.3-.8-.3-.1-.6 0-.8.3-.1.2.1.5.3.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2764" d="M179.1 126.1h.2c.2 0 .5-.1.5-.4.1-.3 0-.6-.3-.7-.3-.1-.6 0-.7.3-.2.3 0 .6.3.8z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2766" d="M175.3 257.6c-.3.1-.5.4-.4.7.1.2.3.4.5.4h.2c.3-.1.5-.4.4-.7-.1-.3-.4-.5-.7-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2768" d="M171.7 148.9c-.1.3.1.6.4.7h.1c.3 0 .5-.2.6-.4.1-.3-.1-.6-.4-.7-.3-.1-.6.1-.7.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2770" d="M168 173.7c.4 0 .6-.2.6-.5s-.2-.6-.5-.6-.6.2-.6.5c-.1.3.2.6.5.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2772" d="M167.2 197.7c0-.3-.3-.6-.6-.6s-.6.3-.6.6.3.6.6.6c.4-.1.6-.3.6-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2774" d="M167 209.4c-.3 0-.6.3-.5.6 0 .3.3.5.6.5s.6-.3.5-.6c0-.3-.3-.6-.6-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2776" d="M167 186c.3 0 .6-.2.6-.5s-.2-.6-.5-.6-.6.2-.6.5c-.1.3.1.6.5.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2778" d="M201 313c-.3.2-.3.5-.2.8.1.2.3.3.5.3.1 0 .2 0 .3-.1.3-.2.3-.5.2-.8-.2-.3-.5-.4-.8-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2780" d="M168 221.6c-.3 0-.5.3-.5.6s.3.5.6.5h.1c.3 0 .5-.3.5-.6-.1-.3-.4-.6-.7-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2782" d="M170.4 234.2c-.1-.3-.3-.5-.7-.5-.3.1-.5.3-.5.7 0 .3.3.5.6.5h.1c.3-.1.5-.4.5-.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2784" d="M169.7 161.6h.1c.3 0 .5-.2.6-.5.1-.3-.2-.6-.5-.7-.3-.1-.6.2-.7.5 0 .4.2.6.5.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2786" d="M232.4 350.6c-.2.2-.2.6 0 .8.1.1.3.2.4.2.2 0 .3-.1.4-.2.2-.2.2-.6 0-.8-.2-.3-.6-.3-.8 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2790" d="M203 338.9c-.2.2-.2.4-.1.6.1.1.2.2.4.2.1 0 .2 0 .3-.1.2-.2.2-.4.1-.6-.2-.2-.5-.3-.7-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2792" d="M159.6 251.9c-.2.1-.4.3-.3.6.1.2.2.3.4.3h.1c.2-.1.4-.3.3-.6 0-.3-.3-.4-.5-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2794" d="M211 348.2c-.2.2-.2.5 0 .6.1.1.2.1.3.1.1 0 .2 0 .3-.1.2-.2.2-.5 0-.6-.1-.1-.4-.2-.6 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2796" d="M162.8 131.6h.1c.2 0 .4-.1.4-.3.1-.2-.1-.5-.3-.6-.2-.1-.5.1-.6.3 0 .3.2.5.4.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2798" d="M203.3 56.3c.1 0 .3-.1.4-.2.2-.2.1-.5-.1-.6-.2-.2-.5-.1-.6.1-.2.2-.1.5.1.6 0 0 .1.1.2.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2800" d="M176.3 96.8c-.1.2 0 .5.2.6h.2c.2 0 .3-.1.4-.3.1-.2 0-.5-.2-.6-.2 0-.5.1-.6.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2802" d="M171.4 286.7c-.2.1-.3.4-.2.6.1.2.2.3.4.3h.2c.2-.1.3-.4.2-.6-.1-.3-.4-.4-.6-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2804" d="M211 46.2c-.2.2-.2.5 0 .6.1.1.2.1.3.1.1 0 .2-.1.3-.1.2-.2.2-.5 0-.6-.1-.2-.4-.2-.6 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2806" d="M159.2 142.9c-.1.2.1.5.3.6h.1c.2 0 .4-.1.4-.3.1-.2-.1-.5-.3-.6-.2-.1-.4 0-.5.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2808" d="M219.8 38.1c.1 0 .2 0 .3-.1.2-.2.2-.5 0-.6-.2-.2-.5-.2-.6 0-.2.2-.2.5 0 .6.1.1.2.1.3.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2810" d="M195.6 329.2c-.2.1-.2.4-.1.6.1.1.2.2.4.2.1 0 .2 0 .3-.1.2-.1.3-.4.1-.6-.2-.2-.5-.3-.7-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2812" d="M166.7 119.9h.2c.2 0 .4-.1.4-.3.1-.2 0-.5-.3-.6-.2-.1-.5 0-.6.3 0 .3.1.6.3.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2814" d="M153.7 179.7c.3 0 .5-.2.5-.4s-.2-.5-.4-.5c-.3 0-.5.2-.5.4-.1.3.1.5.4.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2816" d="M171.3 108.5h.2c.2 0 .3-.1.4-.3.1-.2 0-.5-.2-.6-.2-.1-.5 0-.6.2-.1.4 0 .6.2.7z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2818" d="M163.4 264c-.1-.2-.3-.4-.6-.3-.2.1-.4.3-.3.6.1.2.2.3.4.3h.1c.4-.1.5-.4.4-.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2820" d="M154.6 167c0 .3.1.5.4.5h.1c.2 0 .4-.2.4-.4s-.1-.5-.4-.5c-.3 0-.5.2-.5.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2822" d="M182.3 308.6c-.2.1-.3.4-.2.6.1.1.2.2.4.2.1 0 .2 0 .2-.1.2-.1.3-.4.2-.6-.1-.2-.4-.2-.6-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2824" d="M219.5 357.1c-.2.2-.2.5 0 .6.1.1.2.1.3.1.1 0 .2 0 .3-.1.2-.2.2-.5 0-.6-.1-.2-.4-.2-.6 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2826" d="M153 192s.1 0 0 0c.3 0 .5-.2.5-.4 0-.3-.2-.5-.4-.5-.3 0-.5.2-.5.4s.2.5.4.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2828" d="M176.5 297.8c-.2.1-.3.4-.2.6.1.2.2.3.4.3h.2c.2-.1.3-.4.2-.6-.1-.4-.4-.4-.6-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2830" d="M153.5 203.8c0-.3-.2-.4-.5-.4s-.4.2-.4.5c0 .2.2.4.5.4.2-.1.4-.3.4-.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2832" d="M195.8 66c.1 0 .3-.1.4-.2.1-.2.1-.5-.1-.6-.2-.1-.5-.1-.6.1-.1.2-.1.5.1.6.1.1.2.1.2.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2834" d="M166.8 275.3c-.2.1-.4.4-.3.6.1.2.2.3.4.3h.2c.2-.1.4-.3.3-.6-.1-.3-.4-.4-.6-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2836" d="M188.6 76.1c.1.1.2.1.2.1.1 0 .3-.1.4-.2.1-.2.1-.5-.1-.6-.2-.1-.5-.1-.6.1-.1.1-.1.4.1.6z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2838" d="M188.7 319.1c-.2.1-.3.4-.1.6.1.1.2.2.4.2.1 0 .2 0 .2-.1.2-.1.3-.4.1-.6-.2-.2-.4-.3-.6-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2840" d="M155 227.8c-.2 0-.4.3-.4.5s.2.4.5.4h.1c.2 0 .4-.3.4-.5-.1-.3-.4-.5-.6-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2842" d="M182.3 86.6c.1 0 .1.1.2.1.2 0 .3-.1.4-.2.1-.2 0-.5-.2-.6-.2-.1-.5 0-.6.2-.1.1 0 .3.2.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2844" d="M156.9 155.4c.1 0 .1 0 0 0 .3 0 .5-.1.5-.4 0-.2-.1-.5-.4-.5-.2 0-.5.1-.5.4 0 .2.2.5.4.5z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2846" d="M157 239.9c-.2 0-.4.3-.4.5s.2.4.4.4h.1c.2 0 .4-.3.4-.5-.1-.3-.3-.5-.5-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2848" d="M153.7 215.6c-.3 0-.4.2-.4.5 0 .2.2.4.5.4.2 0 .4-.2.4-.5s-.3-.4-.5-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2850" d="M139.6 209.5c-.2 0-.3.2-.3.4s.2.3.3.3c.2 0 .3-.2.3-.4.1-.1-.1-.3-.3-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2852" d="M159.1 292.7c-.2.1-.3.3-.2.4.1.1.2.2.3.2h.1c.2-.1.3-.3.2-.4-.1-.2-.3-.3-.4-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2854" d="M150.4 269.7c-.2.1-.3.3-.2.4 0 .1.2.2.3.2h.1c.2-.1.3-.2.2-.4 0-.1-.2-.2-.4-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2856" d="M176.5 70c.1 0 .2-.1.3-.2.1-.2.1-.4-.1-.5-.2-.1-.4-.1-.5.1-.1.2-.1.4.1.5.1.1.1.1.2.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2858" d="M147 258c-.2 0-.3.2-.2.4 0 .2.2.3.3.3h.1c.2 0 .3-.2.2-.4 0-.3-.2-.4-.4-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2860" d="M183.4 59.8c.1 0 .2 0 .3-.1.1-.2.1-.4-.1-.5-.2-.1-.4-.1-.5.1-.1.2-.1.4.1.5h.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2862" d="M150.4 125.5h.1c.1 0 .3-.1.3-.2.1-.2 0-.4-.2-.4-.2-.1-.4 0-.4.2-.1.1 0 .3.2.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2864" d="M154.5 281.3c-.2.1-.3.3-.2.4.1.1.2.2.3.2h.1c.2-.1.3-.3.2-.4-.1-.1-.3-.2-.4-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2866" d="M207 31.6c.1 0 .2 0 .2-.1.1-.1.1-.4 0-.5-.1-.1-.4-.1-.5 0-.1.1-.1.4 0 .5.1.1.2.1.3.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2868" d="M190.7 50c.1 0 .2 0 .3-.1.1-.1.1-.4-.1-.5-.1-.1-.4-.1-.5.1-.1.1-.1.4.1.5h.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2870" d="M154.4 113.9h.1c.1 0 .3-.1.3-.2.1-.2 0-.4-.2-.4-.2-.1-.4 0-.4.2 0 .1 0 .3.2.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2872" d="M164.1 90.9c-.1.2 0 .4.2.5h.1c.1 0 .2-.1.3-.2.1-.2 0-.4-.2-.5-.1-.1-.3 0-.4.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2874" d="M159.2 102.5c.1 0 .3-.1.3-.2.1-.2 0-.4-.2-.4-.2-.1-.4 0-.4.2-.1.2 0 .4.2.4h.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2876" d="M139.6 197.6c0-.2-.2-.3-.3-.3-.2 0-.3.1-.3.3 0 .2.2.3.3.3.2.1.3-.1.3-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2878" d="M146.8 136.8c0 .2.1.4.2.4h.1c.2 0 .3-.1.3-.3 0-.2-.1-.4-.2-.4-.2.1-.4.2-.4.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2880" d="M139.6 185.7c.2 0 .3-.1.4-.3 0-.2-.1-.3-.3-.4-.2 0-.3.1-.4.3 0 .2.1.4.3.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2882" d="M206.6 363.5c-.1.1-.1.3 0 .5.1.1.2.1.2.1s.2 0 .2-.1c.1-.1.1-.3 0-.5 0-.1-.2-.1-.4 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2884" d="M140.5 221.8c-.2 0-.3.2-.3.4s.2.3.3.3c.2 0 .3-.2.3-.4.1-.2-.1-.3-.3-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2886" d="M142.1 233.9c-.2 0-.3.2-.3.4s.2.3.3.3h.1c.2 0 .3-.2.3-.4-.1-.2-.2-.3-.4-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2888" d="M140.5 173.5c.2 0 .4-.1.4-.3 0-.2-.1-.4-.3-.4-.2 0-.4.1-.4.3 0 .2.1.3.3.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2890" d="M144.2 149.2s.1 0 0 0c.2 0 .4-.1.4-.3 0-.2-.1-.4-.3-.4-.2 0-.4.1-.4.3 0 .2.1.4.3.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2892" d="M144.2 246c-.2 0-.3.2-.3.4s.2.3.3.3h.1c.2 0 .3-.2.3-.4s-.2-.3-.4-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2894" d="M142 161.3c.1 0 .1 0 0 0 .2 0 .4-.1.4-.3 0-.2-.1-.4-.3-.4-.2 0-.4.1-.4.3 0 .2.2.4.3.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2896" d="M198.3 354.5c-.1.1-.2.3 0 .5.1.1.2.1.3.1.1 0 .2 0 .2-.1.1-.1.2-.3 0-.5-.1-.1-.3-.1-.5 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2898" d="M170 314.6c-.2.1-.2.3-.1.5.1.1.2.2.3.2h.2c.2-.1.2-.3.1-.5s-.3-.3-.5-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2900" d="M169.9 80c-.1.2 0 .4.1.5h.2c.1 0 .2-.1.3-.2.1-.2 0-.4-.1-.5-.2 0-.4.1-.5.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2902" d="M183.1 335.3c-.2.1-.2.3-.1.5.1.1.2.1.3.1.1 0 .1 0 .2-.1.2-.1.2-.3.1-.5-.1-.1-.3-.1-.5 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2904" d="M176.3 325.1c-.2.1-.2.3-.1.5.1.1.2.2.3.2.1 0 .1 0 .2-.1.2-.1.2-.3.1-.5s-.3-.2-.5-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2906" d="M198.4 40c-.1.1-.1.4 0 .5.1.1.1.1.2.1s.2 0 .3-.1c.1-.1.1-.4 0-.5-.2-.1-.4-.1-.5 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2908" d="M164.2 303.8c-.2.1-.2.3-.1.5.1.1.2.2.3.2h.2c.2-.1.2-.3.2-.5-.2-.2-.4-.3-.6-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2910" d="M190.5 345.1c-.1.1-.2.3-.1.5.1.1.2.1.3.1.1 0 .1 0 .2-.1s.2-.3.1-.5c-.2-.1-.4-.1-.5 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2912" d="M126.3 215.8c-.1 0-.2.1-.2.2s.1.2.2.2.2-.1.2-.2-.1-.2-.2-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2914" d="M126.3 179.5c.1 0 .2-.1.2-.2s-.1-.2-.2-.2-.2.1-.2.2l.2.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2916" d="M127.4 167.2c.1 0 .1 0 0 0 .1 0 .2-.1.3-.2 0-.1-.1-.2-.2-.3-.1 0-.2.1-.3.2 0 .2.1.3.2.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2918" d="M125.7 191.3c-.1 0-.2.1-.2.2s.1.2.2.2.2-.1.2-.2-.1-.2-.2-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2920" d="M170.9 53.7c.1 0 .1 0 .2-.1s0-.2-.1-.3c-.1-.1-.2 0-.3.1-.1.1 0 .2.1.3h.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2922" d="M142.1 107.8s.1 0 0 0c.2 0 .3 0 .3-.1s0-.3-.1-.3-.2 0-.3.1c-.1.1 0 .3.1.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2924" d="M164.1 63.9c.1 0 .1 0 .2-.1s0-.2-.1-.3c-.1-.1-.2 0-.3.1-.1.1 0 .2.1.3h.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2926" d="M194.1 25.1c.1 0 .1 0 .2-.1s.1-.2 0-.3c-.1-.1-.2-.1-.3 0-.1.1-.1.2 0 .3 0 .1.1.1.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2928" d="M127.5 228c-.1 0-.2.1-.2.3 0 .1.1.2.2.2s.2-.1.2-.3c0-.1-.1-.2-.2-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2930" d="M125.7 203.5c-.1 0-.2.1-.2.2s.1.2.2.2.2-.1.2-.2-.1-.2-.2-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2932" d="M170.7 341.4c-.1.1-.1.2-.1.3 0 .1.1.1.2.1h.1c.1-.1.1-.2.1-.3 0-.1-.2-.2-.3-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2934" d="M138.1 119.4c.1 0 .2-.1.2-.2s0-.2-.1-.3c-.1 0-.2 0-.3.1-.1.2 0 .4.2.4-.1 0-.1 0 0 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2936" d="M129.2 155.1c.1 0 .2-.1.3-.2 0-.1-.1-.2-.2-.3-.1 0-.2.1-.3.2 0 .2.1.3.2.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2938" d="M178.3 351.3c-.1-.1-.2-.1-.3 0-.1.1-.1.2 0 .3 0 .1.1.1.2.1h.1c.1-.1.1-.3 0-.4z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2940" d="M185.7 360.8c-.1.1-.1.2 0 .3 0 .1.1.1.2.1s.1 0 .1-.1c.1-.1.1-.2 0-.3-.1-.1-.2-.1-.3 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2942" d="M193.9 370c-.1.1-.1.2 0 .3 0 0 .1.1.2.1s.1 0 .2-.1.1-.2 0-.3c-.2-.1-.3-.1-.4 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2944" d="M185.7 33.9c-.1.1-.1.2 0 .3l.1.1c.1 0 .1 0 .2-.1s.1-.2 0-.3c0-.1-.2-.1-.3 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2946" d="M178 43.4c-.1.1-.1.2 0 .3h.1c.1 0 .1 0 .2-.1s.1-.2 0-.3c-.1 0-.2 0-.3.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2948" d="M164.3 331.2c-.1-.1-.2-.1-.3-.1-.1.1-.1.2-.1.3 0 .1.1.1.2.1h.1c.1 0 .1-.1.1-.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2950" d="M146.8 298.7c-.1.1-.2.2-.1.3 0 .1.1.1.2.1h.1c.1 0 .2-.2.1-.3-.1-.1-.2-.1-.3-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2952" d="M152.1 85.3c.1 0 .2 0 .2-.1.1-.1 0-.2-.1-.3-.1-.1-.3 0-.3.1-.1.1 0 .3.1.3h.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2954" d="M152 309.8c-.1.1-.2.2-.1.3 0 .1.1.1.2.1h.1c.1-.1.2-.2.1-.3-.1-.1-.2-.1-.3-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2956" d="M134.5 264.1c-.1 0-.2.2-.2.3 0 .1.1.2.2.2h.1c.1 0 .2-.2.2-.3 0-.2-.2-.3-.3-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2958" d="M157.6 74.1c-.1.1 0 .2.1.3h.1c.1 0 .2 0 .2-.1.1-.1 0-.2-.1-.3-.1 0-.2 0-.3.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2960" d="M142.1 287.4c-.1 0-.2.2-.1.3 0 .1.1.1.2.1h.1c.1 0 .2-.2.1-.3 0-.1-.1-.1-.3-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2964" d="M138 275.8c-.1 0-.2.2-.1.3 0 .1.1.2.2.2h.1c.1 0 .2-.2.1-.3 0-.1-.1-.2-.3-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2966" d="M157.7 320.6c-.1.1-.1.2-.1.3 0 .1.1.1.2.1h.1c.1-.1.1-.2.1-.3 0-.1-.2-.1-.3-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2968" d="M134.5 131.2c.1 0 .2-.1.2-.2s0-.2-.2-.3c-.1 0-.2 0-.3.2l.3.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2970" d="M131.5 143.1c.1 0 .1 0 0 0 .2 0 .3-.1.3-.2s0-.2-.2-.3c-.1 0-.2.1-.3.2.1.1.1.2.2.3z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2972" d="M146.9 96.4c.1 0 .2-.1.2-.1.1-.1 0-.2-.1-.3-.1 0-.3 0-.3.1-.1.1-.1.3.2.3-.1 0-.1 0 0 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2974" d="M131.6 252.2c-.1 0-.2.1-.2.3 0 .1.1.2.2.2s.2-.1.2-.3c0-.2-.1-.3-.2-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2976" d="M129.2 240.1c-.1 0-.2.1-.2.3 0 .1.1.2.2.2s.2-.1.2-.3c.1-.1-.1-.2-.2-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2980" d="M114.5 161c.1 0 .1 0 .1-.1s0-.1-.1-.1-.1 0-.1.1 0 .1.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2982" d="M114.5 234.2c-.1 0-.1.1-.1.1 0 .1.1.1.1.1.1 0 .1-.1.1-.1 0-.1-.1-.1-.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2984" d="M113.1 173.2c.1 0 .1 0 .1-.1s0-.1-.1-.1-.1 0-.1.1 0 .1.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2986" d="M134.6 304.8c0-.1-.1-.1-.1-.1-.1 0-.1.1-.1.1l.1.1c.1 0 .2 0 .1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2988" d="M116.4 148.9c.1 0 .1 0 .1-.1s0-.1-.1-.1-.1 0-.1.1 0 .1.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2990" d="M113.1 222c-.1 0-.1.1-.1.1 0 .1.1.1.1.1.1 0 .1-.1.1-.1l-.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2992" d="M129.8 101.8s.1 0 .1-.1 0-.1-.1-.1-.1 0-.1.1l.1.1c0-.1 0 0 0 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2994" d="M122 125s.1 0 .1-.1 0-.1-.1-.1-.1 0-.1.1 0 .1.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2996" d="M122 270.2c-.1 0-.1.1-.1.1l.1.1c.1 0 .1-.1.1-.1.1-.1 0-.1-.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path2998" d="M125.6 281.9c-.1 0-.1.1-.1.1l.1.1c.1 0 .1-.1.1-.1.1-.1 0-.1-.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3000" d="M129.8 293.5c-.1 0-.1.1-.1.1l.1.1c.1 0 .1-.1.1-.1 0-.1 0-.2-.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3002" d="M116.4 246.3c-.1 0-.1.1-.1.1 0 .1.1.1.1.1.1 0 .1-.1.1-.1.1-.1 0-.1-.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3004" d="M119 258.3c-.1 0-.1.1-.1.1 0 .1.1.1.1.1.1 0 .1-.1.1-.1 0-.1-.1-.1-.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3006" d="M118.9 136.9c.1 0 .1 0 .1-.1s0-.1-.1-.1-.1 0-.1.1l.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3008" d="M139.7 315.8c-.1 0-.1.1-.1.2l.1.1c.1 0 .1-.1.1-.2s0-.1-.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3010" d="M158.3 347.5c-.1 0-.1.1 0 .2h.2c.1 0 .1-.1 0-.2h-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3012" d="M139.8 79.3s.1 0 .1-.1 0-.1-.1-.2c-.1 0-.1 0-.2.1 0 .1 0 .1.2.2-.1 0-.1 0 0 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3014" d="M145.4 68.2v.2h.1s.1 0 .1-.1v-.2c-.1.1-.2.1-.2.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3016" d="M158.4 47.4v.2h.1s.1 0 .1-.1v-.2c-.1 0-.2 0-.2.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3018" d="M145.4 326.7c-.1 0-.1.1 0 .2l.1.1h.1c.1 0 .1-.1 0-.2s-.1-.2-.2-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3020" d="M165.5 357.5s-.1.1 0 .2h.2c.1 0 .1-.1 0-.2s-.2-.1-.2 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3022" d="M151.6 337.2c-.1 0-.1.1 0 .2l.1.1h.1c.1 0 .1-.1 0-.2 0-.1-.1-.1-.2-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3024" d="M181.2 376.3s-.1.1 0 .2h.2s.1-.1 0-.2h-.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3026" d="M151.6 57.6v.2h.1s.1 0 .1-.1v-.2c-.1.1-.1.1-.2.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3028" d="M181.3 18.7s.1 0 0 0c.1-.1.1-.2.1-.2h-.2c0 .1 0 .1.1.2z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3030" d="M112.2 185.5c.1 0 .1 0 .1-.1s0-.1-.1-.1-.1 0-.1.1l.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3032" d="M173.2 27.8v.2h.2v-.2c-.1-.1-.2-.1-.2 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3034" d="M173.1 367.1s-.1.1 0 .2h.2s.1-.1 0-.2-.1-.1-.2 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3036" d="M112.2 209.8c-.1 0-.1.1-.1.1 0 .1.1.1.1.1.1 0 .1-.1.1-.1.1-.1 0-.1-.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3038" d="M165.5 37.4v.2h.2v-.2c-.1-.1-.1-.1-.2 0z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3040" d="M112 197.5c-.1 0-.1 0-.1.1s0 .1.1.1.1-.1.1-.1l-.1-.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3042" d="M134.5 90.4s.1 0 .1-.1 0-.1-.1-.1-.1 0-.2.1c.1 0 .1.1.2.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__path3044" d="M125.6 113.3c.1 0 .1 0 .1-.1s0-.1-.1-.1-.1 0-.1.1c0 0 0 .1.1.1z" class="epichains_svg__st0"/>
+  <path id="epichains_svg__rect3048" d="M26.2 211.3h284.4v66.5H26.2z" style="display:inline;fill:none"/>
+  <linearGradient id="epichains_svg__SVGID_1_" x1="133.059" x2="470.048" y1="392.074" y2="392.074" gradientTransform="translate(-133.1 -197.5)" gradientUnits="userSpaceOnUse">
+    <stop id="epichains_svg__stop3052" offset="0" style="stop-color:#0b1e2c"/>
+    <stop id="epichains_svg__stop3054" offset="1" style="stop-color:#0d6c9f"/>
+  </linearGradient>
+  <path id="epichains_svg__path3057" d="m168.5 5.8 163.4 94.4V289l-163.5 94.4L4.9 289V100.2L168.5 5.8m0-5.8L0 97.3v194.6l168.5 97.3 168.4-97.3V97.3Z" style="display:inline;fill:url(#epichains_svg__SVGID_1_)"/>
+  <g id="epichains_svg__g3123" style="display:inline;opacity:.86" transform="translate(-53.04 -53.014) scale(.73534)">
+    <g id="epichains_svg__g3097">
+      <path id="epichains_svg__path3081" d="M248.7 515.7v-17.1h14.4v3.4h-10.6v3.3h10.2v3.4h-10.2v3.4h10.6v3.4h-14.4z"/>
+      <path id="epichains_svg__path3083" d="M264.9 520.1V503h3.5v3.5h.3c.5-2.4 2.1-3.7 5.1-3.7 3.9 0 6.1 2.6 6.1 6.6 0 4-2.1 6.6-6 6.6-3 0-4.6-1.5-5-3.7h-.2v7.8zm7.5-7.6c2.4 0 3.6-.8 3.6-3.1s-1.1-3.1-3.6-3.1-3.7.8-3.7 3.1v.2c0 2.1 1.3 2.9 3.7 2.9z"/>
+      <path id="epichains_svg__path3085" d="M281.4 501.7v-3h3.8v3zm0 14V503h3.8v12.8h-3.8z"/>
+      <path id="epichains_svg__path3087" d="m291.6 515.7-5.8-12.8h4.3l3.8 9h.3l3.8-9h4.2l-5.8 12.8z"/>
+      <path id="epichains_svg__path3089" d="M301.7 509.4c0-4.1 2.9-6.6 7.1-6.6 4.2 0 7 2.2 7 6.2 0 .5-.1.8-.1 1.2h-10.4c.1 2 1 2.8 3.5 2.8 2.3 0 3.1-.6 3.1-1.7v-.3h3.8v.3c0 2.8-2.7 4.7-6.8 4.7-4.3 0-7.2-2.1-7.2-6.6zm3.6-1.3h6.8c-.1-1.7-1.1-2.4-3.4-2.4s-3.3.8-3.4 2.4z"/>
+      <path id="epichains_svg__path3091" d="M317.2 515.7V503h3.5v3.4h.2c.4-2.1 1.7-3.6 4.3-3.6 2.9 0 4 2 4 4.5v2.1h-3.8V508c0-1.4-.6-2-2.1-2-1.7 0-2.3.8-2.3 2.4v7.4h-3.8z"/>
+      <path id="epichains_svg__path3093" d="m330.1 511.4 3.8-.1v.2c0 1.1.7 1.5 2.9 1.5 2 0 2.5-.3 2.5-1.1 0-.7-.4-.9-1.9-1.1l-3.6-.4c-2.5-.3-4-1.4-4-3.6s1.9-4 6.3-4c4.2 0 6.5 1.6 6.5 4.7v.1h-3.8v-.2c0-1-.5-1.6-2.9-1.6-1.9 0-2.4.3-2.4 1.1 0 .7.4.9 2 1.1l2.7.3c3.4.4 4.7 1.5 4.7 3.6 0 2.4-2.4 4-6.3 4-4.1.1-6.5-1.6-6.5-4.5z"/>
+      <path id="epichains_svg__path3095" d="M343.9 509.4c0-4.1 2.9-6.6 7.1-6.6 4.2 0 7 2.2 7 6.2 0 .5-.1.8-.1 1.2h-10.4c.1 2 1 2.8 3.5 2.8 2.3 0 3.1-.6 3.1-1.7v-.3h3.8v.3c0 2.8-2.7 4.7-6.8 4.7-4.3 0-7.2-2.1-7.2-6.6zm3.6-1.3h6.8c-.1-1.7-1.1-2.4-3.4-2.4s-3.2.8-3.4 2.4z"/>
+    </g>
+    <g id="epichains_svg__g3121">
+      <path id="epichains_svg__path3099" d="M336 528.3c-1.1 0-2 .9-2 2s.8 2 2 2c1.1 0 2-.8 2-2-.1-1.1-.9-2-2-2m7.3-5.2c-3 0-5.2 1.9-5.2 4.6 0 2.7 2.1 4.6 5.2 4.6s5.2-1.9 5.2-4.6c0-2.8-2.1-4.6-5.2-4.6m0 8.9c-1 0-1.3-.9-1.3-4.3 0-3.6.3-4.3 1.3-4.3s1.3.8 1.3 4.3c0 3.4-.3 4.3-1.3 4.3m12-9.1c-1.1 0-2.3.9-2.5 2.2l-3.5-2.2v9.3h3.6v-6.6c0-1.4 1.6-1.4 2.4-1.4zm6.2 6.8h-5.3l1.5-1c.6.2.9.2 1.7.2 2.9 0 4.3-1 4.3-2.9 0-1.1-.6-2-1.7-2.6h2.2v-1.3l-2.6 1.2c-.6-.2-1.3-.3-2.1-.3-2.9 0-4.2 1.2-4.2 2.9 0 1.2.7 2.1 1.8 2.6l-1.9 1.2.9 3.1h8.4c0-.1 0-.1.1-.2v-1.1c-.4-1.1-1.3-1.8-3.1-1.8m-2.1-6.5c1 0 1.2 1.1 1.2 2.8 0 1.5-.2 2.7-1.2 2.7s-1.2-1.2-1.2-2.7c0-1.7.2-2.8 1.2-2.8m-45.2-.2c-.9 0-2.3.2-3.5.6v1.3c.8-.8 2-1.7 3-1.7s1.1.9 1.1 1.6v1.9c-3.6 0-5.2 1.2-5.2 3 0 1.6 1.1 2.3 2.6 2.3 1.6 0 2.6-1 2.7-2l3.4 2.1v-5.9c.2-2.5-1.2-3.2-4.1-3.2m.7 6.9c0 .4-.3.9-.9.9-.7 0-1-.6-1-1.6s.5-2 1.9-2.3zm10-7.2h-1.4v-2.4l-3.5 2.4h-1.3v.7h1.3v8.8h3.5v-8.8h1.4zm-19.3-3.3v5.3c-.3-1.1-1.3-1.8-2.6-1.8-2.3 0-3.8 1.9-3.8 4.7 0 2.7 1.3 4.5 3.7 4.5 1.6 0 2.5-.8 2.7-2l3.6 2.1v-10.6zm0 10.1c0 .8-.4 1.4-1.1 1.4-1.3 0-1.4-1.5-1.4-3.4 0-2.1.2-3.3 1.4-3.3.7 0 1.1.6 1.1 1.4zm23.2-6.5c-.9 0-2.3.2-3.5.6v1.3c.8-.8 2-1.7 3-1.7s1.1.9 1.1 1.6v1.9c-3.6 0-5.2 1.2-5.2 3 0 1.6 1.1 2.3 2.6 2.3 1.6 0 2.6-1 2.7-2l3.4 2.1v-5.9c.2-2.5-1.3-3.2-4.1-3.2m.7 6.9c0 .4-.3.9-.9.9-.7 0-1-.6-1-1.6s.5-2 1.9-2.3z" style="fill-rule:evenodd;clip-rule:evenodd"/>
+      <g id="epichains_svg__g3119">
+        <path id="epichains_svg__path3101" d="M240.4 532.8v-6.3h.4v1.3h.1c.2-.8 1-1.4 2.1-1.4 1.5 0 2.3 1 2.3 2.4s-.8 2.4-2.3 2.4c-1 0-1.8-.6-2-1.5v3h-.6zm2.4-1.9c1.2 0 2-.5 2-2s-.8-2-1.9-2c-1.2 0-2 .8-2 2v.1c-.1 1.2.7 1.9 1.9 1.9z"/>
+        <path id="epichains_svg__path3103" d="M245.8 528.9c0-1.4 1-2.4 2.5-2.4s2.5 1 2.5 2.4-1 2.4-2.5 2.4-2.5-1-2.5-2.4zm4.6 0c0-1.2-.7-2-2.1-2-1.4 0-2.1.8-2.1 2s.7 2 2.1 2c1.4 0 2.1-.8 2.1-2z"/>
+        <path id="epichains_svg__path3105" d="m252.9 531.2-1.6-4.6h.4l1.1 3.4.2.8.3-.9 1.4-3.4h.5l1.4 3.4.3.8.2-.8 1.1-3.4h.4l-1.6 4.6h-.5l-1.4-3.3-.4-.9-.4.9-1.4 3.3z"/>
+        <path id="epichains_svg__path3107" d="M259.4 528.9c0-1.4.9-2.4 2.4-2.4 1.3 0 2.3.8 2.3 2.1v.4h-4.2c0 1.2.6 1.9 2 1.9 1.2 0 1.8-.5 1.8-1.2v-.1h.4v.1c0 1-1 1.6-2.2 1.6-1.5 0-2.5-1-2.5-2.4zm.4-.2h3.9v-.2c0-1.1-.7-1.7-1.9-1.7-1.3 0-1.9.8-2 1.9z"/>
+        <path id="epichains_svg__path3109" d="M265.1 531.2v-4.6h.4v1.3c.1-.7.7-1.4 1.7-1.4 1.1 0 1.6.8 1.6 1.7v.4h-.4v-.3c0-.9-.4-1.3-1.3-1.3-1.1 0-1.5.7-1.5 1.9v2.5h-.5z"/>
+        <path id="epichains_svg__path3111" d="M269.3 528.9c0-1.4.9-2.4 2.4-2.4 1.3 0 2.3.8 2.3 2.1v.4h-4.2c0 1.2.6 1.9 2 1.9 1.2 0 1.8-.5 1.8-1.2v-.1h.4v.1c0 1-1 1.6-2.2 1.6-1.6 0-2.5-1-2.5-2.4zm.4-.2h3.9v-.2c0-1.1-.7-1.7-1.9-1.7-1.3 0-1.9.8-2 1.9z"/>
+        <path id="epichains_svg__path3113" d="M274.6 528.9c0-1.4.8-2.4 2.3-2.4 1.1 0 1.8.6 2 1.4v-3h.4v6.3h-.3v-1.4c-.2.9-1 1.5-2.1 1.5-1.5 0-2.3-1-2.3-2.4zm2.4 2c1.2 0 2-.7 2-1.9v-.1c0-1.3-.8-2-2-2-1.1 0-1.9.5-1.9 2-.1 1.5.7 2 1.9 2z"/>
+        <path id="epichains_svg__path3115" d="M284.3 531.2v-6.3h.4v3c.2-.8.9-1.4 2.1-1.4 1.5 0 2.3 1 2.3 2.4s-.8 2.4-2.3 2.4c-1 0-1.8-.6-2-1.5v1.4zm2.4-.3c1.2 0 2-.5 2-2s-.8-2-1.9-2c-1.3 0-2 .8-2 2v.1c0 1.2.7 1.9 1.9 1.9z"/>
+        <path id="epichains_svg__path3117" d="M290 532.8v-.4h.6c.5 0 .7-.2.9-.6l.3-.7-2.3-4.6h.5l1.5 3 .6 1.2.5-1.2 1.4-3h.4l-2.6 5.4c-.3.6-.7.9-1.3.9z"/>
+      </g>
+    </g>
+  </g>
+  <g id="epichains_svg__g2724" style="display:inline" transform="translate(-133.1 -197.5)">
+    <g id="epichains_svg__chain2" style="display:inline" transform="rotate(-14.331 1280.991 -163.257) scale(.42193)">
+      <g id="epichains_svg__g19771" style="display:inline" transform="translate(25.622 -82.848) scale(1.14391)">
+        <g id="epichains_svg__g19741" style="display:inline" transform="translate(.933 .133)">
+          <path id="epichains_svg__rect19735" d="M240.617 366.057h3.017v74.674h-3.017z" style="display:inline;mix-blend-mode:normal;fill:#e3202c;fill-opacity:1;stroke:none;stroke-opacity:1"/>
+          <circle id="epichains_svg__circle19737" cx="242.099" cy="443.747" r="4.893" style="mix-blend-mode:normal;fill:#e3202c;fill-opacity:1;stroke:none;stroke-width:1.62162;stroke-opacity:1"/>
+        </g>
+        <g id="epichains_svg__g19749" style="display:inline" transform="rotate(89.924 243.167 403.879)">
+          <path id="epichains_svg__rect19743" d="M240.617 366.057h3.017v74.674h-3.017z" style="mix-blend-mode:normal;fill:#e3202c;fill-opacity:1;stroke:none;stroke-opacity:1"/>
+          <circle id="epichains_svg__circle19745" cx="242.099" cy="443.747" r="4.893" style="mix-blend-mode:normal;fill:#e3202c;fill-opacity:1;stroke:none;stroke-width:1.62162;stroke-opacity:1"/>
+          <circle id="epichains_svg__circle19747" cx="241.966" cy="363.609" r="4.893" style="display:inline;mix-blend-mode:normal;fill:#e3202c;fill-opacity:1;stroke:none;stroke-width:1.62162;stroke-opacity:1"/>
+        </g>
+        <g id="epichains_svg__g19757" style="display:inline" transform="rotate(45.893 243.802 404.786)">
+          <path id="epichains_svg__rect19751" d="M240.617 366.057h3.017v74.674h-3.017z" style="display:inline;mix-blend-mode:normal;fill:#e3202c;fill-opacity:1;stroke:none;stroke-opacity:1"/>
+          <circle id="epichains_svg__circle19753" cx="242.099" cy="443.747" r="4.893" style="mix-blend-mode:normal;fill:#e3202c;fill-opacity:1;stroke:none;stroke-width:1.62162;stroke-opacity:1"/>
+          <circle id="epichains_svg__circle19755" cx="241.966" cy="363.609" r="4.893" style="mix-blend-mode:normal;fill:#e3202c;fill-opacity:1;stroke:none;stroke-width:1.62162;stroke-opacity:1"/>
+        </g>
+        <g id="epichains_svg__g19765" style="display:inline" transform="rotate(131.563 242.91 403.511)">
+          <path id="epichains_svg__rect19759" d="M240.617 366.057h3.017v74.674h-3.017z" style="mix-blend-mode:normal;fill:#e3202c;fill-opacity:1;stroke:none;stroke-opacity:1"/>
+          <circle id="epichains_svg__circle19761" cx="242.099" cy="443.747" r="4.893" style="mix-blend-mode:normal;fill:#e3202c;fill-opacity:1;stroke:none;stroke-width:1.62162;stroke-opacity:1"/>
+          <circle id="epichains_svg__circle19763" cx="241.966" cy="363.609" r="4.893" style="display:inline;mix-blend-mode:normal;fill:#e3202c;fill-opacity:1;stroke:none;stroke-width:1.62162;stroke-opacity:1"/>
+        </g>
+        <circle id="epichains_svg__circle19767" cx="305.486" cy="380.835" r="27.735" style="display:inline;fill:#e3202c;fill-opacity:1;fill-rule:evenodd;stroke:none;stroke-opacity:1" transform="translate(-64.137 21.068)"/>
+        <path id="epichains_svg__rect19769" d="M-390.746 300.645h107.741v7.467h-107.741z" style="display:inline;fill:#e3202c;fill-opacity:1;stroke:none;stroke-width:1.00002;stroke-opacity:1" transform="matrix(-.00685 -.99998 1 .00295 -64.137 21.068)"/>
+      </g>
+      <g id="epichains_svg__g19783" style="display:inline" transform="translate(-47.745 -58.748) scale(1.14391)">
+        <circle id="epichains_svg__circle19773" cx="207.667" cy="342.174" r="27.735" style="display:inline;fill:#0d6b9e;fill-opacity:1;stroke:none;stroke-opacity:1"/>
+        <path id="epichains_svg__rect19775" d="M-131.05-394.362h107.739v7.467H-131.05z" style="display:inline;fill:#0d6b9e;fill-opacity:1;stroke:none;stroke-opacity:1" transform="rotate(149.27)"/>
+        <circle id="epichains_svg__circle19777" cx="374.725" cy="-372.876" r="27.735" style="display:inline;fill:#0d6b9e;fill-opacity:1;stroke:none;stroke-opacity:1" transform="rotate(85.643)"/>
+        <circle id="epichains_svg__circle19779" cx="305.486" cy="277.496" r="27.735" style="display:inline;fill:#0d6b9e;fill-opacity:1;stroke:none;stroke-opacity:1"/>
+        <path id="epichains_svg__rect19781" d="M-524.27-72.282h107.739v7.467H-524.27z" style="display:inline;fill:#0d6b9e;fill-opacity:1;stroke:none;stroke-opacity:1" transform="rotate(-147.841)"/>
+      </g>
+    </g>
+    <g id="epichains_svg__chains_top_part" style="display:inline">
+      <path id="epichains_svg__rect19787" d="M14.747-441.863h52v3.604h-52z" style="display:inline;fill:#0d6b9e;fill-opacity:1;stroke:none;stroke-width:.482645;stroke-opacity:1" transform="rotate(139.384)"/>
+      <circle id="epichains_svg__path11860" cx="-487.392" cy="21.939" r="13.386" style="display:inline;fill:#0d6b9e;fill-opacity:1;stroke:none;stroke-width:.482645;stroke-opacity:1" transform="rotate(-129.866)"/>
+      <path id="epichains_svg__rect11918" d="M368.686 233.029h52v3.604h-52z" style="display:inline;fill:#0d6b9e;fill-opacity:1;stroke:none;stroke-width:.482645;stroke-opacity:1" transform="rotate(19.404)"/>
+      <circle id="epichains_svg__circle15037" cx="-6.579" cy="395.076" r="13.386" style="display:inline;fill:#0d6b9e;fill-opacity:1;stroke:none;stroke-width:.482645;stroke-opacity:1" transform="rotate(-44.223)"/>
+      <circle id="epichains_svg__circle11862" cx="-440.907" cy="56.314" r="13.386" style="display:inline;fill:#0d6b9e;fill-opacity:1;stroke:none;stroke-width:.482645;stroke-opacity:1" transform="rotate(-129.866)"/>
+      <path id="epichains_svg__rect5501" d="M286.13-274.344h52v3.604h-52z" style="display:inline;fill:#0d6b9e;fill-opacity:1;stroke:none;stroke-width:.482645;stroke-opacity:1" transform="rotate(90.267)"/>
+      <path id="epichains_svg__rect24989" d="M-77.014-437.908h52v3.604h-52z" style="display:inline;fill:#0d6b9e;fill-opacity:1;stroke:none;stroke-width:.482645;stroke-opacity:1" transform="rotate(143.35)"/>
+      <circle id="epichains_svg__circle27589" cx="324.163" cy="-186.517" r="13.386" style="display:inline;fill:#0d6b9e;fill-opacity:1;stroke:none;stroke-width:.482645;stroke-opacity:1" transform="rotate(82.685)"/>
+      <path id="epichains_svg__rect27591" d="M-432.716-48.932h52v3.604h-52z" style="display:inline;fill:#0d6b9e;fill-opacity:1;stroke:none;stroke-width:.482645;stroke-opacity:1" transform="rotate(-134.56)"/>
+      <circle id="epichains_svg__circle27595" cx="369.428" cy="-158.348" r="13.386" style="display:inline;fill:#0d6b9e;fill-opacity:1;stroke:none;stroke-width:.482645;stroke-opacity:1" transform="rotate(82.685)"/>
+      <path id="epichains_svg__rect27599" d="M198.366 349.625h52v3.604h-52z" style="display:inline;fill:#0d6b9e;fill-opacity:1;stroke:none;stroke-width:.482645;stroke-opacity:1" transform="rotate(-1.59)"/>
+      <circle id="epichains_svg__circle24987" cx="-436.78" cy="-10.995" r="13.386" style="display:inline;fill:#e3202c;fill-opacity:1;stroke:none;stroke-width:.482645;stroke-opacity:1" transform="rotate(-129.866)"/>
+    </g>
+  </g>
+  <g id="epichains_svg__text1912" aria-label="epichains" style="font-size:64px;font-family:&quot;Clash Display&quot;;-inkscape-font-specification:&quot;Clash Display&quot;;display:inline;fill:#e3202c" transform="translate(-133.1 -197.5)">
+    <path id="epichains_svg__path6722" d="M177.475 478.664c9.472 0 16.128-4.672 16.128-11.456v-.576h-6.912v.512c0 3.776-2.752 5.952-9.408 5.952-7.36 0-10.368-3.264-10.624-9.408h27.008c.192-.96.256-1.856.256-3.008 0-9.664-6.72-15.104-16.576-15.104-10.368 0-17.152 6.656-17.152 16.576 0 10.624 6.848 16.512 17.28 16.512zm-.256-27.648c6.784 0 10.176 2.816 10.176 8.704v.128h-20.672c.384-5.696 3.456-8.832 10.496-8.832z" style="font-weight:500;-inkscape-font-specification:&quot;Clash Display Medium&quot;"/>
+    <path id="epichains_svg__path6724" d="M206.019 488.904v-19.84h.256c1.28 5.824 5.888 9.6 13.056 9.6 9.92 0 15.36-6.656 15.36-16.512 0-9.856-5.568-16.576-15.424-16.576-7.488 0-11.968 3.584-13.248 9.472h-.576v-8.832h-6.336v42.688zm0-26.24v-.576c0-6.848 3.968-10.176 11.072-10.176 6.656 0 10.688 2.56 10.688 10.24 0 7.616-3.968 10.24-10.816 10.24-6.784 0-10.944-3.2-10.944-9.728z" style="font-weight:500;-inkscape-font-specification:&quot;Clash Display Medium&quot;"/>
+    <path id="epichains_svg__path6726" d="M246.403 442.504v-7.36h-6.912v7.36zm0 35.52v-31.808h-6.912v31.808z" style="font-weight:500;-inkscape-font-specification:&quot;Clash Display Medium&quot;"/>
+    <path id="epichains_svg__path6728" d="M268.867 478.664c9.536 0 16.512-5.376 16.512-13.312v-.704h-6.848v.448c0 4.928-3.584 7.36-9.792 7.36-7.168 0-10.368-3.456-10.368-10.304 0-6.976 3.2-10.368 10.368-10.368 6.208 0 9.792 2.432 9.792 7.36v.384h6.848v-.64c0-7.936-6.976-13.312-16.512-13.312-10.496 0-17.344 6.656-17.344 16.576 0 9.792 6.848 16.512 17.344 16.512z" style="fill:#241c1c"/>
+    <path id="epichains_svg__path6730" d="M297.41 478.024V462.6c0-6.72 2.689-10.816 10.24-10.816 6.593 0 9.473 2.56 9.473 8.96v17.28h6.848V459.08c0-7.744-4.288-13.504-13.056-13.504-8.128 0-12.032 4.992-13.12 10.368h-.384v-20.8h-6.912v42.88z" style="fill:#241c1c"/>
+    <path id="epichains_svg__path6732" d="M339.523 478.664c7.232 0 12.288-3.2 13.952-8.576h.448v7.936h6.336v-19.328c0-7.616-4.544-13.12-14.656-13.12-10.112 0-16.128 5.44-16.128 12.672v.256h6.848v-.256c0-4.672 2.88-6.592 8.768-6.592 6.08 0 8.384 1.856 8.384 7.104v1.856l-14.464 1.536c-6.272.704-10.24 3.264-10.24 8.192 0 5.248 4.16 8.32 10.752 8.32zm-3.84-8.768c0-2.496 1.728-3.328 5.312-3.776l12.48-1.408c0 6.272-4.864 8.96-12.096 8.96-3.968 0-5.696-1.216-5.696-3.776z" style="fill:#241c1c"/>
+    <path id="epichains_svg__path6734" d="M373.315 442.504v-7.36h-6.912v7.36zm0 35.52v-31.808h-6.912v31.808z" style="fill:#241c1c"/>
+    <path id="epichains_svg__path6736" d="M386.627 478.024V462.28c0-6.912 3.136-10.496 10.112-10.496 6.592 0 9.28 3.008 9.28 8.96v17.28h6.912v-19.136c0-7.36-4.352-13.312-13.184-13.312-8.064 0-12.288 5.12-13.248 10.496h-.448v-9.856h-6.336v31.808z" style="fill:#241c1c"/>
+    <path id="epichains_svg__path6738" d="M434.115 478.664c8.96 0 14.848-3.52 14.848-9.472 0-5.248-3.456-7.808-11.008-8.768l-7.36-1.024c-4.8-.64-6.08-1.536-6.08-4.032 0-3.008 2.048-4.288 7.872-4.288 7.04 0 8.96 1.984 8.96 5.76v.384h6.848v-.192c0-7.552-5.888-11.456-15.616-11.456-9.792 0-14.848 3.968-14.848 9.536 0 5.312 3.648 7.872 9.664 8.64l8.704 1.152c4.544.64 6.08 1.472 6.08 4.096 0 2.816-1.856 4.096-8.064 4.096-6.656 0-9.152-1.344-9.152-5.504v-.512h-6.912v.192c0 7.36 5.824 11.392 16.064 11.392z" style="fill:#241c1c"/>
+  </g>
+</svg>

From 8f54ef468ddcce0f0dfa94b6a29554e6462c167e Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 17:50:25 +0000
Subject: [PATCH 732/828] Reword some arguments to reflect input checks

---
 R/borel.r                | 6 +++---
 R/utils.r                | 4 ++--
 man/dborel.Rd            | 4 ++--
 man/rborel.Rd            | 2 +-
 man/rnbinom_mean_disp.Rd | 4 ++--
 5 files changed, 10 insertions(+), 10 deletions(-)

diff --git a/R/borel.r b/R/borel.r
index 1ec154d9..0b1056cb 100644
--- a/R/borel.r
+++ b/R/borel.r
@@ -1,7 +1,7 @@
 ##' Density of the Borel distribution
 ##'
-##' @param x Vector of integers.
-##' @param mu mu parameter.
+##' @param x Vector of quantiles; integer.
+##' @param mu mu parameter (the poisson mean); non-negative.
 ##' @param log Logical; if TRUE, probabilities p are given as log(p).
 ##' @return Probability mass.
 ##' @author Sebastian Funk
@@ -17,7 +17,7 @@ dborel <- function(x, mu, log = FALSE) {
 ##'
 ##' Random numbers are generated by simulating from a Poisson branching process
 ##' @param n Number of random variates to generate.
-##' @param mu mu parameter.
+##' @inheritParams dborel
 ##' @param infinite Any number to treat as infinite; simulations will be
 ##' stopped if this number is reached
 ##' @return Vector of random numbers
diff --git a/R/utils.r b/R/utils.r
index c4d135a5..338ecd3c 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -43,8 +43,8 @@ rgen_length <- function(n, x, prob) {
 #' Negative binomial random numbers parametrized
 #' in terms of mean and dispersion coefficient
 #' @param n number of samples to draw
-#' @param mn mean of distribution
-#' @param disp dispersion coefficient (var/mean)
+#' @param mn mean of distribution; Must be > 0.
+#' @param disp dispersion coefficient (var/mean); Must be > 1.
 #' @return vector containing the random numbers
 #' @author Flavio Finger
 #' @export
diff --git a/man/dborel.Rd b/man/dborel.Rd
index 52c4db77..51b944ab 100644
--- a/man/dborel.Rd
+++ b/man/dborel.Rd
@@ -7,9 +7,9 @@
 dborel(x, mu, log = FALSE)
 }
 \arguments{
-\item{x}{Vector of integers.}
+\item{x}{Vector of quantiles; integer.}
 
-\item{mu}{mu parameter.}
+\item{mu}{mu parameter (the poisson mean); non-negative.}
 
 \item{log}{Logical; if TRUE, probabilities p are given as log(p).}
 }
diff --git a/man/rborel.Rd b/man/rborel.Rd
index 70cd22fb..605c2ac6 100644
--- a/man/rborel.Rd
+++ b/man/rborel.Rd
@@ -9,7 +9,7 @@ rborel(n, mu, infinite = Inf)
 \arguments{
 \item{n}{Number of random variates to generate.}
 
-\item{mu}{mu parameter.}
+\item{mu}{mu parameter (the poisson mean); non-negative.}
 
 \item{infinite}{Any number to treat as infinite; simulations will be
 stopped if this number is reached}
diff --git a/man/rnbinom_mean_disp.Rd b/man/rnbinom_mean_disp.Rd
index 698836d6..fb9df0aa 100644
--- a/man/rnbinom_mean_disp.Rd
+++ b/man/rnbinom_mean_disp.Rd
@@ -10,9 +10,9 @@ rnbinom_mean_disp(n, mn, disp)
 \arguments{
 \item{n}{number of samples to draw}
 
-\item{mn}{mean of distribution}
+\item{mn}{mean of distribution; Must be > 0.}
 
-\item{disp}{dispersion coefficient (var/mean)}
+\item{disp}{dispersion coefficient (var/mean); Must be > 1.}
 }
 \value{
 vector containing the random numbers

From 95f6d1e95e3bfe28753b69d8fc89a34dfaaa0f6f Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 17:50:44 +0000
Subject: [PATCH 733/828] Add input checks to complementary_logprob()

---
 R/utils.r | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/R/utils.r b/R/utils.r
index 338ecd3c..09636291 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -6,6 +6,10 @@
 #' @author Sebastian Funk
 #' @keywords internal
 complementary_logprob <- function(x) {
+  checkmate::assert_numeric(
+    x, lower = -Inf, upper = 0
+  )
+
   tryCatch(log1p(-sum(exp(x))), error = function(e) -Inf)
 }
 

From 32a4cec185199d40388cea981407c67937535f6f Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 17:51:01 +0000
Subject: [PATCH 734/828] Add input checks to rnbinom_mean_disp()

---
 R/utils.r | 10 ++++++++++
 1 file changed, 10 insertions(+)

diff --git a/R/utils.r b/R/utils.r
index 09636291..6fd239cf 100644
--- a/R/utils.r
+++ b/R/utils.r
@@ -55,6 +55,16 @@ rgen_length <- function(n, x, prob) {
 #' @examples
 #' rnbinom_mean_disp(n = 5, mn = 4, disp = 2)
 rnbinom_mean_disp <- function(n, mn, disp) {
+  checkmate::assert_number(
+    n, lower = 1, finite = TRUE, na.ok = FALSE
+  )
+  checkmate::assert_number(
+    disp, lower = 1, finite = TRUE, na.ok = FALSE
+  )
+  checkmate::assert_number(
+    mn, lower = 1E-100, finite = TRUE, na.ok = FALSE
+  )
+
   size <- mn / (disp - 1)
   stats::rnbinom(n, size = size, mu = mn)
 }

From 55db2f3074c08e6ccb2f53d93f96dfe464622d2b Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 17:51:34 +0000
Subject: [PATCH 735/828] Allow only one argument in serials_dist arg

---
 R/checks.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/checks.R b/R/checks.R
index 957d2dab..8a85a346 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -38,7 +38,7 @@ check_offspring_func_valid <- function(roffspring_name) {
 #'
 #' @keywords internal
 check_serial_valid <- function(serials_dist) {
-  if (!checkmate::test_function(serials_dist)) {
+  if (!checkmate::test_function(serials_dist, nargs = 1)) {
     stop(sprintf(
       "%s %s",
       "The `serials_dist` argument must be a function",

From 2f2ebab0772bcd0b5eb81dd5d3cbe1225a3b1f8b Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 17:52:00 +0000
Subject: [PATCH 736/828] Add input checks to simulate_*() functions

---
 R/simulate.r | 22 ++++++++++++++++++++++
 1 file changed, 22 insertions(+)

diff --git a/R/simulate.r b/R/simulate.r
index c5b81d1b..e968a2eb 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -117,7 +117,9 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
                           tf = Inf, ...) {
   statistic <- match.arg(statistic)
 
+  # Input checking
   check_nchains_valid(nchains = nchains)
+  checkmate::assert_character(statistic)
 
   # check that offspring is properly specified
   check_offspring_valid(offspring_dist)
@@ -126,6 +128,11 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
   roffspring_name <- paste0("r", offspring_dist)
   check_offspring_func_valid(roffspring_name)
 
+  checkmate::assert_numeric(stat_max, lower = 0)
+  check_serial_valid(serials_dist)
+  checkmate::assert_number(t0, lower = 0, finite = TRUE)
+  checkmate::assert_number(tf, lower = 0, finite = TRUE)
+
   # Gather offspring distribution parameters
   pars <- list(...)
 
@@ -277,7 +284,9 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
                              stat_max = Inf, ...) {
   statistic <- match.arg(statistic)
 
+  # Input checking
   check_nchains_valid(nchains = nchains)
+  checkmate::assert_character(statistic)
 
   # check that offspring is properly specified
   check_offspring_valid(offspring_dist)
@@ -286,6 +295,11 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
   roffspring_name <- paste0("r", offspring_dist)
   check_offspring_func_valid(roffspring_name)
 
+  checkmate::assert_numeric(stat_max, lower = 0)
+  check_serial_valid(serials_dist)
+  checkmate::assert_number(t0, lower = 0, finite = TRUE)
+  checkmate::assert_number(tf, lower = 0, finite = TRUE)
+
   # Gather offspring distribution parameters
   pars <- list(...)
 
@@ -413,6 +427,14 @@ simulate_tree_from_pop <- function(pop,
                                    ...) {
   offspring_dist <- match.arg(offspring_dist)
 
+  # Input checking
+  checkmate::assert_number(pop, lower = 1, finite = TRUE)
+  checkmate::assert_string(offspring_dist)
+  checkmate::assert_function(serials_dist, nargs = 1)
+  checkmate::assert_number(initial_immune, lower = 0, upper = pop - 1)
+  checkmate::assert_number(t0, lower = 0, finite = TRUE)
+  checkmate::assert_number(tf, lower = 0, finite = TRUE)
+
   # Gather offspring distribution parameters
   pars <- list(...)
 

From eb2967e3924d6e38f3cba0233259494d5201e869 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 17:52:18 +0000
Subject: [PATCH 737/828] Add input checks to d and rborel()

---
 R/borel.r | 14 ++++++++++++++
 1 file changed, 14 insertions(+)

diff --git a/R/borel.r b/R/borel.r
index 0b1056cb..c1a44065 100644
--- a/R/borel.r
+++ b/R/borel.r
@@ -7,6 +7,13 @@
 ##' @author Sebastian Funk
 ##' @export
 dborel <- function(x, mu, log = FALSE) {
+  checkmate::assert_numeric(
+    x, lower = 1, upper = Inf
+  )
+  checkmate::assert_number(
+    mu, lower = 0, finite = TRUE, na.ok = FALSE
+  )
+
   if (x < 1) stop("'x' must be greater than 0")
   ld <- -mu * x + (x - 1) * log(mu * x) - lgamma(x + 1)
   if (!log) ld <- exp(ld)
@@ -24,6 +31,13 @@ dborel <- function(x, mu, log = FALSE) {
 ##' @author Sebastian Funk
 ##' @export
 rborel <- function(n, mu, infinite = Inf) {
+  checkmate::assert_number(
+    n, lower = 1, finite = TRUE, na.ok = FALSE
+  )
+  checkmate::assert_number(
+    mu, lower = 0, finite = TRUE, na.ok = FALSE
+  )
+  # Run simulations
   simulate_summary(
     nchains = n,
     offspring_dist = "pois",

From ba10c6b4a16c8fb59a793a9712d2b1609f82e073 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 17:52:31 +0000
Subject: [PATCH 738/828] Add input checks to likelihood()

---
 R/likelihood.R | 21 ++++++++++++++++++---
 1 file changed, 18 insertions(+), 3 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 68f0f24c..2cd94e12 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -47,17 +47,32 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
   statistic <- match.arg(statistic)
 
   ## checks
+  checkmate::assert_numeric(chains, lower = 0, upper = Inf)
+  checkmate::assert_character(statistic)
   check_offspring_valid(offspring_dist)
+  checkmate::assert_number(
+    nsim_obs, lower = 1, finite = TRUE, na.ok = FALSE
+  )
+  checkmate::assert_number(
+    obs_prob, lower = 0, upper = 1, finite = TRUE, na.ok = FALSE
+  )
+  checkmate::assert_number(
+    stat_max, lower = 0, finite = TRUE, na.ok = FALSE
+  )
   checkmate::assert_logical(
     log,
     any.missing = FALSE,
     all.missing = FALSE,
     len = 1
   )
+  checkmate::assert_logical(
+    individual,
+    any.missing = FALSE,
+    all.missing = FALSE,
+    len = 1
+  )
+  checkmate::assert_numeric(exclude, null.ok = TRUE)
 
-  if (obs_prob <= 0 || obs_prob > 1) {
-    stop("'obs_prob' is a probability and must be between 0 and 1 inclusive")
-  }
   if (obs_prob < 1) {
     if (missing(nsim_obs)) {
       stop("'nsim_obs' must be specified if 'obs_prob' is < 1")

From e08b09a5bcc886a7c687d1af12c21551c6bf20af Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 19:23:54 +0000
Subject: [PATCH 739/828] t0 can be a vector

---
 R/simulate.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index e968a2eb..fff92226 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -130,7 +130,7 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 
   checkmate::assert_numeric(stat_max, lower = 0)
   check_serial_valid(serials_dist)
-  checkmate::assert_number(t0, lower = 0, finite = TRUE)
+  checkmate::assert_numeric(t0, lower = 0, finite = TRUE)
   checkmate::assert_number(tf, lower = 0, finite = TRUE)
 
   # Gather offspring distribution parameters

From b5f0765e9157090100a0cd696f291d3765471606 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 19:24:16 +0000
Subject: [PATCH 740/828] Remove checks

---
 R/simulate.r | 3 ---
 1 file changed, 3 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index fff92226..ad924c35 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -296,9 +296,6 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
   check_offspring_func_valid(roffspring_name)
 
   checkmate::assert_numeric(stat_max, lower = 0)
-  check_serial_valid(serials_dist)
-  checkmate::assert_number(t0, lower = 0, finite = TRUE)
-  checkmate::assert_number(tf, lower = 0, finite = TRUE)
 
   # Gather offspring distribution parameters
   pars <- list(...)

From 71a1167d10680a2057bfe4ab55da52d0a8e795a2 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 20:32:42 +0000
Subject: [PATCH 741/828] Fix error message regrex

---
 tests/testthat/test-likelihood.R | 2 +-
 tests/testthat/test-utils.R      | 6 +++---
 tests/testthat/tests-borel.r     | 2 +-
 3 files changed, 5 insertions(+), 5 deletions(-)

diff --git a/tests/testthat/test-likelihood.R b/tests/testthat/test-likelihood.R
index cc41654b..96d86735 100644
--- a/tests/testthat/test-likelihood.R
+++ b/tests/testthat/test-likelihood.R
@@ -168,7 +168,7 @@ test_that("Errors are thrown", {
       lambda = 0.5,
       obs_prob = 3
     ),
-    "must be between 0 and 1"
+    "is not <= 1"
   )
   expect_error(
     likelihood(
diff --git a/tests/testthat/test-utils.R b/tests/testthat/test-utils.R
index 4118e7cb..bab9a467 100644
--- a/tests/testthat/test-utils.R
+++ b/tests/testthat/test-utils.R
@@ -104,17 +104,17 @@ test_that("Reparametrized distributions throw warnings", {
       mn = 4,
       disp = 0.9
     ),
-    "NAs produced"
+    "not >= 1"
   )
 })
 
 test_that("Log-probabilities throw warnings", {
   expect_warning(
     complementary_logprob(0.1),
-    "NaNs produced"
+    "is not <= 0"
   )
   expect_warning(
     complementary_logprob(Inf),
-    "NaNs produced"
+    "is not <= 0"
   )
 })
diff --git a/tests/testthat/tests-borel.r b/tests/testthat/tests-borel.r
index e17512e1..dedc6cd2 100644
--- a/tests/testthat/tests-borel.r
+++ b/tests/testthat/tests-borel.r
@@ -5,5 +5,5 @@ test_that("We can calculate probabilities and sample", {
 })
 
 test_that("Errors are thrown", {
-  expect_error(dborel(0, 0.5), "greater than 0")
+  expect_error(dborel(0, 0.5), "is not >= 1")
 })

From 2946d09164cf9f0e557d1c4a7addd137224f5a6f Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 20:33:21 +0000
Subject: [PATCH 742/828] Change warning expectation to error

---
 tests/testthat/test-utils.R | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/tests/testthat/test-utils.R b/tests/testthat/test-utils.R
index bab9a467..c3f97bbf 100644
--- a/tests/testthat/test-utils.R
+++ b/tests/testthat/test-utils.R
@@ -98,7 +98,7 @@ test_that("Chain sizes sampler is numerically correct", {
 })
 
 test_that("Reparametrized distributions throw warnings", {
-  expect_warning(
+  expect_error(
     rnbinom_mean_disp(
       n = 5,
       mn = 4,
@@ -109,11 +109,11 @@ test_that("Reparametrized distributions throw warnings", {
 })
 
 test_that("Log-probabilities throw warnings", {
-  expect_warning(
+  expect_error(
     complementary_logprob(0.1),
     "is not <= 0"
   )
-  expect_warning(
+  expect_error(
     complementary_logprob(Inf),
     "is not <= 0"
   )

From 6aa40829aee3b11c3bb2dee22956b3d35087a44a Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 20:35:25 +0000
Subject: [PATCH 743/828] Fix input checks in likelihood()

---
 R/likelihood.R | 32 +++++++++++++++++---------------
 1 file changed, 17 insertions(+), 15 deletions(-)

diff --git a/R/likelihood.R b/R/likelihood.R
index 2cd94e12..79b3ec75 100644
--- a/R/likelihood.R
+++ b/R/likelihood.R
@@ -46,32 +46,34 @@ likelihood <- function(chains, statistic = c("size", "length"), offspring_dist,
                        exclude = NULL, individual = FALSE, ...) {
   statistic <- match.arg(statistic)
 
-  ## checks
-  checkmate::assert_numeric(chains, lower = 0, upper = Inf)
+  ## Input checking
+  ## Check nsim_obs when specified
+  if (!missing(nsim_obs)) {
+    checkmate::assert_number(
+      nsim_obs, lower = 1, finite = TRUE, na.ok = FALSE
+    )
+  }
+
+  checkmate::assert_numeric(
+    chains, lower = 0, upper = Inf, any.missing = FALSE
+  )
   checkmate::assert_character(statistic)
   check_offspring_valid(offspring_dist)
-  checkmate::assert_number(
-    nsim_obs, lower = 1, finite = TRUE, na.ok = FALSE
-  )
   checkmate::assert_number(
     obs_prob, lower = 0, upper = 1, finite = TRUE, na.ok = FALSE
   )
   checkmate::assert_number(
-    stat_max, lower = 0, finite = TRUE, na.ok = FALSE
+    stat_max, lower = 0, na.ok = FALSE
   )
   checkmate::assert_logical(
-    log,
-    any.missing = FALSE,
-    all.missing = FALSE,
-    len = 1
+    log, any.missing = FALSE, all.missing = FALSE, len = 1
   )
   checkmate::assert_logical(
-    individual,
-    any.missing = FALSE,
-    all.missing = FALSE,
-    len = 1
+    individual, any.missing = FALSE, all.missing = FALSE, len = 1
+  )
+  checkmate::assert_numeric(
+    exclude, null.ok = TRUE
   )
-  checkmate::assert_numeric(exclude, null.ok = TRUE)
 
   if (obs_prob < 1) {
     if (missing(nsim_obs)) {

From 5c9d77a0127b1a0c891436363a8a3ffd304f8a96 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 20:35:49 +0000
Subject: [PATCH 744/828] Add input checks to pois_size_ll()

---
 R/stat_likelihoods.R | 7 +++++++
 1 file changed, 7 insertions(+)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index 148ffbf6..c3b9ccaf 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -6,6 +6,13 @@
 #' @author Sebastian Funk
 #' @keywords internal
 pois_size_ll <- function(x, lambda) {
+  checkmate::assert_numeric(
+    x, lower = 0, any.missing = FALSE
+  )
+  checkmate::assert_number(
+    lambda, finite = TRUE, lower = 0
+  )
+
   (x - 1) * log(lambda) - lambda * x + (x - 2) * log(x) - lgamma(x)
 }
 

From 2ac8f937930d54563b102e6a743bd405435d6fca Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 20:36:00 +0000
Subject: [PATCH 745/828] Add input checks to nbinom_size_ll()

---
 R/stat_likelihoods.R | 16 ++++++++++++++++
 1 file changed, 16 insertions(+)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index c3b9ccaf..5b80e8b4 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -29,6 +29,22 @@ pois_size_ll <- function(x, lambda) {
 #' @author Sebastian Funk
 #' @keywords internal
 nbinom_size_ll <- function(x, size, prob, mu) {
+  checkmate::assert_numeric(
+    x, lower = 0, any.missing = FALSE
+  )
+  checkmate::assert_number(
+    size, finite = TRUE, lower = 0
+  )
+  if (!missing(prob)) {
+    checkmate::assert_number(
+      prob, lower = 0, upper = 1
+    )
+  }
+  if (!missing(mu)) {
+    checkmate::assert_number(
+      mu, finite = TRUE, lower = 0
+    )
+  }
   if (!missing(prob)) {
     if (!missing(mu)) stop("'prob' and 'mu' both specified")
     mu <- size * (1 - prob) / prob

From 829e68da29ec76e73da6be0d7808514342d15929 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 20:36:29 +0000
Subject: [PATCH 746/828] Add input checks to gborel_size_ll()

---
 R/stat_likelihoods.R | 17 +++++++++++++++++
 1 file changed, 17 insertions(+)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index 5b80e8b4..1dbfb018 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -66,6 +66,23 @@ nbinom_size_ll <- function(x, size, prob, mu) {
 #' @author Sebastian Funk
 #' @keywords internal
 gborel_size_ll <- function(x, size, prob, mu) {
+  checkmate::assert_numeric(
+    x, lower = 0, any.missing = FALSE
+  )
+  checkmate::assert_number(
+    size, finite = TRUE, lower = 0
+  )
+  if (!missing(prob)) {
+    checkmate::assert_number(
+      prob, lower = 0, upper = 1
+    )
+  }
+  if (!missing(mu)) {
+    checkmate::assert_number(
+      mu, finite = TRUE, lower = 0
+    )
+  }
+
   if (!missing(prob)) {
     if (!missing(mu)) stop("'prob' and 'mu' both specified")
     mu <- size * (1 - prob) / prob

From 29f2c07f860009ea379a82949839284d028d255f Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 20:36:42 +0000
Subject: [PATCH 747/828] Add input checks to pois_length_ll()

---
 R/stat_likelihoods.R | 7 +++++++
 1 file changed, 7 insertions(+)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index 1dbfb018..be43b0fb 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -100,6 +100,13 @@ gborel_size_ll <- function(x, size, prob, mu) {
 #' @author Sebastian Funk
 #' @keywords internal
 pois_length_ll <- function(x, lambda) {
+  checkmate::assert_numeric(
+    x, lower = 0, any.missing = FALSE
+  )
+  checkmate::assert_number(
+    lambda, finite = TRUE, lower = 0
+  )
+
   ## iterated exponential function
   arg <- exp(lambda * exp(-lambda))
   itex <- 1

From eb8b85e1151d8b20623fe66e30e21eed45be95d8 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 20:36:58 +0000
Subject: [PATCH 748/828] Add input checks to geom_length_ll()

---
 R/stat_likelihoods.R | 7 +++++++
 1 file changed, 7 insertions(+)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index be43b0fb..3f214505 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -126,6 +126,13 @@ pois_length_ll <- function(x, lambda) {
 #' @author Sebastian Funk
 #' @keywords internal
 geom_length_ll <- function(x, prob) {
+  checkmate::assert_numeric(
+    x, lower = 0, any.missing = FALSE
+  )
+  checkmate::assert_number(
+    prob, lower = 0, upper = 1
+  )
+
   lambda <- 1 / prob
   GkmGkm1 <- (1 - lambda^(x)) / (1 - lambda^(x + 1)) -
     (1 - lambda^(x - 1)) / (1 - lambda^(x))

From 493680864cadd787f083435f3964338c4de6378d Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 20:37:16 +0000
Subject: [PATCH 749/828] Add input checks to offspring_ll()

---
 R/stat_likelihoods.R | 11 +++++++++++
 1 file changed, 11 insertions(+)

diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index 3f214505..176494e1 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -168,6 +168,17 @@ geom_length_ll <- function(x, prob) {
 #' )
 offspring_ll <- function(x, offspring_dist, statistic,
                          nsim_offspring = 100, ...) {
+  # Input checking
+  checkmate::assert_numeric(
+    x, lower = 0, any.missing = FALSE
+  )
+  # check that offspring is properly specified
+  check_offspring_valid(offspring_dist)
+  checkmate::assert_character(statistic)
+  checkmate::assert_numeric(
+    nsim_offspring, lower = 1
+  )
+
   # Simulate the chains
   dist <- simulate_summary(
     nchains = nsim_offspring,

From 0a28674a87b5125b7ee1ba504cad3075efbf3c19 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 10 Nov 2023 20:40:58 +0000
Subject: [PATCH 750/828] Style code

---
 R/simulate.r | 41 +++++++++++++++++++++++++++++++----------
 1 file changed, 31 insertions(+), 10 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index ad924c35..98374c71 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -128,10 +128,19 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
   roffspring_name <- paste0("r", offspring_dist)
   check_offspring_func_valid(roffspring_name)
 
-  checkmate::assert_numeric(stat_max, lower = 0)
-  check_serial_valid(serials_dist)
-  checkmate::assert_numeric(t0, lower = 0, finite = TRUE)
-  checkmate::assert_number(tf, lower = 0, finite = TRUE)
+  checkmate::assert_number(
+    stat_max, lower = 0
+  )
+
+  if (!missing(serials_dist)) {
+    check_serial_valid(serials_dist)
+  }
+  checkmate::assert_numeric(
+    t0, lower = 0, finite = TRUE
+  )
+  checkmate::assert_number(
+    tf, lower = 0
+  )
 
   # Gather offspring distribution parameters
   pars <- list(...)
@@ -295,7 +304,9 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
   roffspring_name <- paste0("r", offspring_dist)
   check_offspring_func_valid(roffspring_name)
 
-  checkmate::assert_numeric(stat_max, lower = 0)
+  checkmate::assert_number(
+    stat_max, lower = 0
+  )
 
   # Gather offspring distribution parameters
   pars <- list(...)
@@ -425,12 +436,22 @@ simulate_tree_from_pop <- function(pop,
   offspring_dist <- match.arg(offspring_dist)
 
   # Input checking
-  checkmate::assert_number(pop, lower = 1, finite = TRUE)
+  checkmate::assert_number(
+    pop, lower = 1, finite = TRUE
+  )
   checkmate::assert_string(offspring_dist)
-  checkmate::assert_function(serials_dist, nargs = 1)
-  checkmate::assert_number(initial_immune, lower = 0, upper = pop - 1)
-  checkmate::assert_number(t0, lower = 0, finite = TRUE)
-  checkmate::assert_number(tf, lower = 0, finite = TRUE)
+  if (!missing(serials_dist)) {
+    check_serial_valid(serials_dist)
+  }
+  checkmate::assert_number(
+    initial_immune, lower = 0, upper = pop - 1
+  )
+  checkmate::assert_number(
+    t0, lower = 0, finite = TRUE
+  )
+  checkmate::assert_number(
+    tf, lower = 0
+  )
 
   # Gather offspring distribution parameters
   pars <- list(...)

From 4a0e84aa6d51b6723dd3e26d5d2fea7cf470413a Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Tue, 28 Nov 2023 15:20:08 +0000
Subject: [PATCH 751/828] add length check in serial distribution

---
 R/checks.R | 7 +++++++
 1 file changed, 7 insertions(+)

diff --git a/R/checks.R b/R/checks.R
index 8a85a346..96a3edad 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -45,6 +45,13 @@ check_serial_valid <- function(serials_dist) {
       "(see details in ?sim_chain_tree)."
     ))
   }
+  x <- serials_dist(10)
+  if (!checkmate::test_numeric(x, len = 10)) {
+    stop(
+      "The return values of `serials_dist` must be a numeric vector of length ",
+      "`n`."
+    )
+  }
 }
 
 
From 117946956efd085c1e85f4f205f6c3ff01718724 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Tue, 28 Nov 2023 15:21:01 +0000
Subject: [PATCH 752/828] clarify serials_dist argument

---
 R/simulate.r                  | 7 ++++---
 man/simulate_tree.Rd          | 7 ++++---
 man/simulate_tree_from_pop.Rd | 7 ++++---
 3 files changed, 12 insertions(+), 9 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 98374c71..9699aa9a 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -15,9 +15,10 @@
 #' @param stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to this value.
 #' Defaults to `Inf`.
-#' @param serials_dist The serial interval distribution function; the name
-#' of a user-defined named or anonymous function with only one argument `n`,
-#' representing the number of serial intervals to generate. See details.
+#' @param serials_dist The serial interval distribution function; the name of a
+#' user-defined named or anonymous function with only one argument, usually
+#' called `n`, that returns a numeric vector of `n` randomly sampled serial
+#' intervals. See details.
 #' @param t0 Start time (if serial interval is given); either a single value
 #' or a vector of same length as `nchains` (number of simulations) with
 #' initial times. Defaults to 0.
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index 1d6722dc..b9e39a0e 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -35,9 +35,10 @@ numbers).}
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
 
-\item{serials_dist}{The serial interval distribution function; the name
-of a user-defined named or anonymous function with only one argument \code{n},
-representing the number of serial intervals to generate. See details.}
+\item{serials_dist}{The serial interval distribution function; the name of a
+user-defined named or anonymous function with only one argument, usually
+called \code{n}, that returns a numeric vector of \code{n} randomly sampled serial
+intervals. See details.}
 
 \item{t0}{Start time (if serial interval is given); either a single value
 or a vector of same length as \code{nchains} (number of simulations) with
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 4f42d429..c2a5760d 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -23,9 +23,10 @@ corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers). Only supports "pois" and "nbinom".}
 
-\item{serials_dist}{The serial interval distribution function; the name
-of a user-defined named or anonymous function with only one argument \code{n},
-representing the number of serial intervals to generate. See details.}
+\item{serials_dist}{The serial interval distribution function; the name of a
+user-defined named or anonymous function with only one argument, usually
+called \code{n}, that returns a numeric vector of \code{n} randomly sampled serial
+intervals. See details.}
 
 \item{initial_immune}{The number of initial immunes in the population.
 Must be less than \code{pop} - 1.}

From 0ca8ad4b5f4ebf0593e327df3ad6ece9b7f71c67 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Tue, 28 Nov 2023 15:21:19 +0000
Subject: [PATCH 753/828] add test of serial distribution function result

---
 tests/testthat/test-checks.R | 8 ++++++++
 1 file changed, 8 insertions(+)

diff --git a/tests/testthat/test-checks.R b/tests/testthat/test-checks.R
index fde23d57..d0b619f0 100644
--- a/tests/testthat/test-checks.R
+++ b/tests/testthat/test-checks.R
@@ -11,6 +11,14 @@ test_that("Checks work", {
     check_serial_valid("a"),
     "must be a function"
   )
+  expect_error(
+    check_serial_valid(function(x) rep("a", 10)),
+    "numeric"
+  )
+  expect_error(
+    check_serial_valid(function(x) 3),
+    "vector of length"
+  )
   expect_error(
     check_nchains_valid(1.1),
     "less than"

From 8309310f0545c276b09c6557ddaea95e8a35ee50 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Tue, 28 Nov 2023 15:22:08 +0000
Subject: [PATCH 754/828] update static serial interval distributions

---
 R/epichains.R                   |  2 +-
 R/simulate.r                    |  6 +++---
 man/aggregate.epichains.Rd      |  2 +-
 man/simulate_tree.Rd            |  2 +-
 man/simulate_tree_from_pop.Rd   |  4 ++--
 tests/testthat/test-epichains.R | 20 ++++++++++----------
 tests/testthat/test-simulate.R  |  4 ++--
 vignettes/epichains.Rmd         |  2 +-
 8 files changed, 21 insertions(+), 21 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 0dd0ab70..269f83f8 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -259,7 +259,7 @@ tail.epichains <- function(x, ...) {
 #'   statistic = "size",
 #'   offspring_dist = "pois",
 #'   stat_max = 10,
-#'   serials_dist = function(x) 3,
+#'   serials_dist = function(n) rep(3, n),
 #'   lambda = 2
 #' )
 #' chains
diff --git a/R/simulate.r b/R/simulate.r
index 9699aa9a..ad969219 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -95,7 +95,7 @@
 #'   statistic = "size",
 #'   offspring_dist = "pois",
 #'   stat_max = 10,
-#'   serials_dist = function(x) 3,
+#'   serials_dist = function(n) rep(3, n),
 #'   lambda = 2
 #' )
 #' @references
@@ -416,7 +416,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #'   pop = 100,
 #'   offspring_dist = "pois",
 #'   lambda = 0.5,
-#'   serials_dist = function(x) 3
+#'   serials_dist = function(n) rep(3, n)
 #' )
 #'
 #' # Simulate with negative binomial offspring
@@ -424,7 +424,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
 #' pop = 100, offspring_dist = "nbinom",
 #' mu = 0.5,
 #' size = 1.1,
-#' serials_dist = function(x) 3
+#' serials_dist = function(n) rep(3, n)
 #' )
 #' @export
 simulate_tree_from_pop <- function(pop,
diff --git a/man/aggregate.epichains.Rd b/man/aggregate.epichains.Rd
index 27edc079..b4797d85 100644
--- a/man/aggregate.epichains.Rd
+++ b/man/aggregate.epichains.Rd
@@ -29,7 +29,7 @@ chains <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  serials_dist = function(x) 3,
+  serials_dist = function(n) rep(3, n),
   lambda = 2
 )
 chains
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index b9e39a0e..568d86e3 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -119,7 +119,7 @@ chains <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  serials_dist = function(x) 3,
+  serials_dist = function(n) rep(3, n),
   lambda = 2
 )
 }
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index c2a5760d..4bb937f4 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -121,7 +121,7 @@ simulate_tree_from_pop(
   pop = 100,
   offspring_dist = "pois",
   lambda = 0.5,
-  serials_dist = function(x) 3
+  serials_dist = function(n) rep(3, n)
 )
 
 # Simulate with negative binomial offspring
@@ -129,7 +129,7 @@ simulate_tree_from_pop(
 pop = 100, offspring_dist = "nbinom",
 mu = 0.5,
 size = 1.1,
-serials_dist = function(x) 3
+serials_dist = function(n) rep(3, n)
 )
 }
 \seealso{
diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index 9b1d4f75..58daf454 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -33,7 +33,7 @@ test_that("Simulators return epichains objects", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(x) 3,
+    serials_dist = function(n) rep(3, n),
     lambda = 2
   )
   #' Simulate chain statistics
@@ -96,7 +96,7 @@ test_that("print.epichains works for simulation functions", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(x) 3,
+    serials_dist = function(n) rep(3, n),
     lambda = 2
   )
   #' Simulate chain statistics
@@ -144,7 +144,7 @@ test_that("summary.epichains works as expected", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(x) 3,
+    serials_dist = function(n) rep(3, n),
     lambda = 2
   )
   #' Simulate chain statistics
@@ -250,7 +250,7 @@ test_that("validate_epichains works", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(x) 3,
+    serials_dist = function(n) rep(3, n),
     lambda = 2
   )
   #' Simulate chain statistics
@@ -312,7 +312,7 @@ test_that("is_chains_tree works", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(x) 3,
+    serials_dist = function(n) rep(3, n),
     lambda = 2
   )
   #' Simulate chain statistics
@@ -370,7 +370,7 @@ test_that("is_chains_summary works", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(x) 3,
+    serials_dist = function(n) rep(3, n),
     lambda = 2
   )
   #' Simulate chain statistics
@@ -406,7 +406,7 @@ test_that("aggregate.epichains method returns correct objects", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(x) 3,
+    serials_dist = function(n) rep(3, n),
     lambda = 2
   )
   #' Create aggregates
@@ -467,7 +467,7 @@ test_that("aggregate.epichains method is numerically correct", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(x) 3,
+    serials_dist = function(n) rep(3, n),
     lambda = 2
   )
   #' Create aggregates
@@ -519,7 +519,7 @@ test_that("head and tail print output as expected", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(x) 3,
+    serials_dist = function(n) rep(3, n),
     lambda = 2
   )
   expect_snapshot(head(susc_outbreak_raw))
@@ -562,7 +562,7 @@ test_that("head and tail return data.frames", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(x) 3,
+    serials_dist = function(n) rep(3, n),
     lambda = 2
   )
   #' Expectations
diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index 4852fd9c..29354fda 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -33,7 +33,7 @@ test_that("Simulators work", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(x) 3,
+    serials_dist = function(n) rep(3, n),
     lambda = 2
   )
   #' Simulate chain statistics
@@ -71,7 +71,7 @@ test_that("Simulators work", {
         statistic = "size",
         offspring_dist = "pois",
         stat_max = 10,
-        serials_dist = function(x) 3,
+        serials_dist = function(n) rep(3, n),
         lambda = 2,
         tf = 5
       )$time < 5
diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 7d361795..07f97935 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -203,7 +203,7 @@ sim_tree_eg <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  serials_dist = function(x) 3,
+  serials_dist = function(n) rep(3, n),
   lambda = 0.9
 )
 

From 20b150c05c0959ccfbcb4cf14d2282e9a3abfcc9 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Tue, 28 Nov 2023 16:21:21 +0000
Subject: [PATCH 755/828] update more examples in vignette

---
 vignettes/epichains.Rmd | 16 ++++++++--------
 1 file changed, 8 insertions(+), 8 deletions(-)

diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 07f97935..1c6fbf90 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -241,8 +241,8 @@ Here is a quick example where we simulate an outbreak in a population of size $1
 ```{r}
 set.seed(7)
 # Define serial distribution
-serial_func <- function(x) {
-  return(3)
+serial_func <- function(n) {
+  return(rep(3, n))
 }
 
 sim_tree_from_pop_eg <- simulate_tree_from_pop(
@@ -264,8 +264,8 @@ You can run `summary()` on `<epichains>` objects to get useful summaries.
 # Example with simulate_tree()
 set.seed(123)
 # Define serial distribution
-serial_func <- function(x) {
-  return(3)
+serial_func <- function(n) {
+  return(rep(3, n))
 }
 
 sim_tree_eg <- simulate_tree(
@@ -304,8 +304,8 @@ To aggregate over "time", you must have specified a serial interval distribution
 set.seed(123)
 
 # Define serial distribution
-serial_func <- function(x) {
-  return(3)
+serial_func <- function(n) {
+  return(rep(3, n))
 }
 
 sim_tree_eg <- simulate_tree(
@@ -329,8 +329,8 @@ Here is an end-to-end example from simulation through aggregation to plotting.
 # Run simulation with simulate_tree()
 set.seed(123)
 # Define serial distribution
-serial_func <- function(x) {
-  return(3)
+serial_func <- function(n) {
+  return(rep(3, n))
 }
 
 sim_tree_eg <- simulate_tree(

From 27078d8461ff22aae5ccc3e935544aed9610ff2c Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 30 Nov 2023 12:19:13 +0000
Subject: [PATCH 756/828] update description of check_serial_valid

---
 R/checks.R                | 5 ++++-
 man/check_serial_valid.Rd | 5 +++--
 2 files changed, 7 insertions(+), 3 deletions(-)

diff --git a/R/checks.R b/R/checks.R
index 96a3edad..e43f3910 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -30,7 +30,10 @@ check_offspring_func_valid <- function(roffspring_name) {
 }
 
 
-#' Check if the serials_dist argument is specified as a function
+#' Check if the serials_dist argument is valid.
+#'
+#' Check if the serials_dist argument is a function with one argument `n`
+#' and returns a numerical vector of length `n`.
 #'
 #' @param serials_dist The serial interval distribution function; the name of a
 #' user-defined named or anonymous function with only one argument `n`,
diff --git a/man/check_serial_valid.Rd b/man/check_serial_valid.Rd
index 3aef91b5..aec80683 100644
--- a/man/check_serial_valid.Rd
+++ b/man/check_serial_valid.Rd
@@ -2,7 +2,7 @@
 % Please edit documentation in R/checks.R
 \name{check_serial_valid}
 \alias{check_serial_valid}
-\title{Check if the serials_dist argument is specified as a function}
+\title{Check if the serials_dist argument is valid.}
 \usage{
 check_serial_valid(serials_dist)
 }
@@ -12,6 +12,7 @@ user-defined named or anonymous function with only one argument \code{n},
 representing the number of serial intervals to generate.}
 }
 \description{
-Check if the serials_dist argument is specified as a function
+Check if the serials_dist argument is a function with one argument \code{n}
+and returns a numerical vector of length \code{n}.
 }
 \keyword{internal}

From 5633a044c9fcb4d2eda51df64436693ee15a77b1 Mon Sep 17 00:00:00 2001
From: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
Date: Thu, 30 Nov 2023 12:19:50 +0000
Subject: [PATCH 757/828] update test snapshot

---
 tests/testthat/_snaps/epichains.md | 146 +++++++++++++++++------------
 1 file changed, 84 insertions(+), 62 deletions(-)

diff --git a/tests/testthat/_snaps/epichains.md b/tests/testthat/_snaps/epichains.md
index bb23c3c1..41cc691b 100644
--- a/tests/testthat/_snaps/epichains.md
+++ b/tests/testthat/_snaps/epichains.md
@@ -27,20 +27,25 @@
       
       < tree head (from first known ancestor) >
       
-        sim_id ancestor generation       time
-      2      2        1          2 21.5834705
-      3      3        1          2  0.3939008
-      4      4        2          3 21.6595273
+        sim_id ancestor generation     time
+      2      2        1          2 42.57973
+      3      3        2          3 42.80500
+      4      4        2          3 42.70415
+      5      5        4          4 43.87477
+      6      6        4          4 44.00812
+      7      7        3          4 78.73481
       
       < tree tail >
       
-        sim_id ancestor generation       time
-      1      1       NA          1  0.0000000
-      2      2        1          2 21.5834705
-      3      3        1          2  0.3939008
-      4      4        2          3 21.6595273
-      Number of ancestors (known): 2
-      Number of generations: 3
+         sim_id ancestor generation     time
+      7       7        3          4 78.73481
+      8       8        5          5 47.03948
+      9       9        6          5 45.38534
+      10     10        9          6 46.14505
+      11     11        8          6 48.03103
+      12     12        7          5 81.49185
+      Number of ancestors (known): 9
+      Number of generations: 6
       Use `as.data.frame(<object_name>)` to view the full output in the console.
 
 ---
@@ -52,20 +57,26 @@
       
       < tree head (from first known ancestor) >
       
-        chain_id sim_id ancestor generation
-      3        1      2        1          2
-      4        1      3        1          2
+         chain_id sim_id ancestor generation
+      3         1      2        1          2
+      6         2      2        1          2
+      4         1      3        1          2
+      7         2      3        1          2
+      5         1      4        1          2
+      11        2      4        2          3
       
       < tree tail >
       
-        chain_id sim_id ancestor generation
-      1        1      1       NA          1
-      2        2      1       NA          1
-      3        1      2        1          2
-      4        1      3        1          2
+         chain_id sim_id ancestor generation
+      9         1      6        4          3
+      10        1      7        4          3
+      15        2      7        6          4
+      16        2      8        6          4
+      14        1      8        7          4
+      17        2      9        8          5
       Chains simulated: 2
-      Number of ancestors (known): 1
-      Number of generations: 2
+      Number of ancestors (known): 7
+      Number of generations: 5
       Use `as.data.frame(<object_name>)` to view the full output in the console.
 
 ---
@@ -79,21 +90,21 @@
       
          chain_id sim_id ancestor generation time
       11        1      2        1          2    3
-      13        2      2        1          2    3
-      15        3      2        1          2    3
-      17        4      2        1          2    3
-      19        6      2        1          2    3
-      20        7      2        1          2    3
+      12        2      2        1          2    3
+      13        3      2        1          2    3
+      15        4      2        1          2    3
+      17        5      2        1          2    3
+      20        6      2        1          2    3
       
       < tree tail >
       
           chain_id sim_id ancestor generation time
-      92         9     19        8          4    9
-      109        6     19        8          5   12
-      93         9     20        9          4    9
-      110        6     20        9          5   12
-      94         9     21        9          4    9
-      111        6     21        9          5   12
+      131       10     19        9          4    9
+      81         2     20        6          4    9
+      103        4     20        9          4    9
+      104        4     21        9          4    9
+      105        4     22        9          4    9
+      106        4     23        9          4    9
       Chains simulated: 10
       Number of ancestors (known): 9
       Number of generations: 5
@@ -106,11 +117,11 @@
     Output
       `epichains` object 
       
-      [1] 4 1
+      [1] 1 3
       
        Simulated chain lengths: 
       
-      Max: 4
+      Max: 3
       Min: 1
 
 # head and tail print output as expected
@@ -130,10 +141,13 @@
     Output
       < tree head (from first known ancestor) >
       
-        sim_id ancestor generation       time
-      2      2        1          2 21.5834705
-      3      3        1          2  0.3939008
-      4      4        2          3 21.6595273
+        sim_id ancestor generation     time
+      2      2        1          2 42.57973
+      3      3        2          3 42.80500
+      4      4        2          3 42.70415
+      5      5        4          4 43.87477
+      6      6        4          4 44.00812
+      7      7        3          4 78.73481
 
 ---
 
@@ -142,9 +156,13 @@
     Output
       < tree head (from first known ancestor) >
       
-        chain_id sim_id ancestor generation
-      3        1      2        1          2
-      4        1      3        1          2
+         chain_id sim_id ancestor generation
+      3         1      2        1          2
+      6         2      2        1          2
+      4         1      3        1          2
+      7         2      3        1          2
+      5         1      4        1          2
+      11        2      4        2          3
 
 ---
 
@@ -155,11 +173,11 @@
       
          chain_id sim_id ancestor generation time
       11        1      2        1          2    3
-      13        2      2        1          2    3
-      15        3      2        1          2    3
-      17        4      2        1          2    3
-      19        6      2        1          2    3
-      20        7      2        1          2    3
+      12        2      2        1          2    3
+      13        3      2        1          2    3
+      15        4      2        1          2    3
+      17        5      2        1          2    3
+      20        6      2        1          2    3
 
 ---
 
@@ -180,11 +198,13 @@
       
       < tree tail >
       
-        sim_id ancestor generation       time
-      1      1       NA          1  0.0000000
-      2      2        1          2 21.5834705
-      3      3        1          2  0.3939008
-      4      4        2          3 21.6595273
+         sim_id ancestor generation     time
+      7       7        3          4 78.73481
+      8       8        5          5 47.03948
+      9       9        6          5 45.38534
+      10     10        9          6 46.14505
+      11     11        8          6 48.03103
+      12     12        7          5 81.49185
 
 ---
 
@@ -194,11 +214,13 @@
       
       < tree tail >
       
-        chain_id sim_id ancestor generation
-      1        1      1       NA          1
-      2        2      1       NA          1
-      3        1      2        1          2
-      4        1      3        1          2
+         chain_id sim_id ancestor generation
+      9         1      6        4          3
+      10        1      7        4          3
+      15        2      7        6          4
+      16        2      8        6          4
+      14        1      8        7          4
+      17        2      9        8          5
 
 ---
 
@@ -209,10 +231,10 @@
       < tree tail >
       
           chain_id sim_id ancestor generation time
-      92         9     19        8          4    9
-      109        6     19        8          5   12
-      93         9     20        9          4    9
-      110        6     20        9          5   12
-      94         9     21        9          4    9
-      111        6     21        9          5   12
+      131       10     19        9          4    9
+      81         2     20        6          4    9
+      103        4     20        9          4    9
+      104        4     21        9          4    9
+      105        4     22        9          4    9
+      106        4     23        9          4    9
 

From 3fc776b96d79696fab91b6d5e9284cebc624461e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 28 Nov 2023 18:19:47 +0000
Subject: [PATCH 758/828] Rename function

---
 R/checks.R                 |  6 +++---
 man/check_nchains_valid.Rd | 15 ---------------
 man/check_ntrees_valid.Rd  | 15 +++++++++++++++
 3 files changed, 18 insertions(+), 18 deletions(-)
 delete mode 100644 man/check_nchains_valid.Rd
 create mode 100644 man/check_ntrees_valid.Rd

diff --git a/R/checks.R b/R/checks.R
index e43f3910..c8ece961 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -63,8 +63,8 @@ check_serial_valid <- function(serials_dist) {
 #' @param nchains Number of chains to simulate.
 #'
 #' @keywords internal
-check_nchains_valid <- function(nchains) {
-  if (!checkmate::test_count(nchains, positive = TRUE)) {
-    stop("`nchains` must be > 0 but less than `Inf`")
+check_ntrees_valid <- function(ntrees) {
+  if (!checkmate::test_count(ntrees, positive = TRUE)) {
+    stop("`ntrees` must be > 0 but less than `Inf`")
   }
 }
diff --git a/man/check_nchains_valid.Rd b/man/check_nchains_valid.Rd
deleted file mode 100644
index 1a20e8b5..00000000
--- a/man/check_nchains_valid.Rd
+++ /dev/null
@@ -1,15 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/checks.R
-\name{check_nchains_valid}
-\alias{check_nchains_valid}
-\title{Check that nchains is greater than 0 and not infinity}
-\usage{
-check_nchains_valid(nchains)
-}
-\arguments{
-\item{nchains}{Number of chains to simulate.}
-}
-\description{
-Check that nchains is greater than 0 and not infinity
-}
-\keyword{internal}
diff --git a/man/check_ntrees_valid.Rd b/man/check_ntrees_valid.Rd
new file mode 100644
index 00000000..67b8dac6
--- /dev/null
+++ b/man/check_ntrees_valid.Rd
@@ -0,0 +1,15 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/checks.R
+\name{check_ntrees_valid}
+\alias{check_ntrees_valid}
+\title{Check that \code{ntrees} is greater than 0 and not infinity}
+\usage{
+check_ntrees_valid(ntrees)
+}
+\arguments{
+\item{ntrees}{Number of trees to simulate.}
+}
+\description{
+Check that \code{ntrees} is greater than 0 and not infinity
+}
+\keyword{internal}

From 43fd1ebfdda71192dcc0d8cea44030e4780e46f8 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 28 Nov 2023 18:20:19 +0000
Subject: [PATCH 759/828] Rename nchains argument to ntrees

---
 R/borel.r                          |  2 +-
 R/checks.R                         |  4 +--
 R/epichains.R                      |  2 +-
 R/simulate.r                       | 42 +++++++++++------------
 R/stat_likelihoods.R               |  2 +-
 man/aggregate.epichains.Rd         |  2 +-
 man/simulate_summary.Rd            |  6 ++--
 man/simulate_tree.Rd               |  8 ++---
 tests/testthat/test-checks.R       | 10 +-----
 tests/testthat/test-epichains.R    | 54 +++++++++++++++---------------
 tests/testthat/test-simulate.R     | 32 +++++++++---------
 vignettes/epichains.Rmd            | 12 +++----
 vignettes/interventions.Rmd        |  8 ++---
 vignettes/projecting_incidence.Rmd |  2 +-
 14 files changed, 89 insertions(+), 97 deletions(-)

diff --git a/R/borel.r b/R/borel.r
index c1a44065..43125f07 100644
--- a/R/borel.r
+++ b/R/borel.r
@@ -39,7 +39,7 @@ rborel <- function(n, mu, infinite = Inf) {
   )
   # Run simulations
   simulate_summary(
-    nchains = n,
+    ntrees = n,
     offspring_dist = "pois",
     statistic = "size",
     stat_max = infinite,
diff --git a/R/checks.R b/R/checks.R
index c8ece961..5625922a 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -58,9 +58,9 @@ check_serial_valid <- function(serials_dist) {
 }
 
 
-#' Check that nchains is greater than 0 and not infinity
+#' Check that `ntrees` is greater than 0 and not infinity
 #'
-#' @param nchains Number of chains to simulate.
+#' @param ntrees Number of trees to simulate.
 #'
 #' @keywords internal
 check_ntrees_valid <- function(ntrees) {
diff --git a/R/epichains.R b/R/epichains.R
index 269f83f8..3919dcf2 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -255,7 +255,7 @@ tail.epichains <- function(x, ...) {
 #' @examples
 #' set.seed(123)
 #' chains <- simulate_tree(
-#'   nchains = 10,
+#'   ntrees = 10,
 #'   statistic = "size",
 #'   offspring_dist = "pois",
 #'   stat_max = 10,
diff --git a/R/simulate.r b/R/simulate.r
index ad969219..22af4b0f 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -1,6 +1,6 @@
 #' Simulate transmission trees from an initial number of infections
 #'
-#' @param nchains Number of chains to simulate.
+#' @param ntrees Number of trees to simulate.
 #' @param offspring_dist Offspring distribution: a character string
 #' corresponding to the R distribution function (e.g., "pois" for Poisson,
 #' where \code{\link{rpois}} is the R function to generate Poisson random
@@ -20,7 +20,7 @@
 #' called `n`, that returns a numeric vector of `n` randomly sampled serial
 #' intervals. See details.
 #' @param t0 Start time (if serial interval is given); either a single value
-#' or a vector of same length as `nchains` (number of simulations) with
+#' or a vector of same length as `ntrees` (number of simulations) with
 #' initial times. Defaults to 0.
 #' @param tf End time (if serial interval is given).
 #' @param ... Parameters of the offspring distribution as required by R.
@@ -91,7 +91,7 @@
 #' @examples
 #' set.seed(123)
 #' chains <- simulate_tree(
-#'   nchains = 10,
+#'   ntrees = 10,
 #'   statistic = "size",
 #'   offspring_dist = "pois",
 #'   stat_max = 10,
@@ -112,14 +112,14 @@
 #' Jacob C. (2010). Branching processes: their role in epidemiology.
 #' International journal of environmental research and public health, 7(3),
 #' 1186–1204. \doi{https://doi.org/10.3390/ijerph7031204}
-simulate_tree <- function(nchains, statistic = c("size", "length"),
+simulate_tree <- function(ntrees, statistic = c("size", "length"),
                           offspring_dist, stat_max = Inf,
                           serials_dist, t0 = 0,
                           tf = Inf, ...) {
   statistic <- match.arg(statistic)
 
   # Input checking
-  check_nchains_valid(nchains = nchains)
+  check_ntrees_valid(ntrees = ntrees)
   checkmate::assert_character(statistic)
 
   # check that offspring is properly specified
@@ -153,15 +153,15 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
   }
 
   # Initialisations
-  stat_track <- rep(1, nchains) # track length or size (depending on `statistic`) #nolint
-  n_offspring <- rep(1, nchains) # current number of offspring
-  sim <- seq_len(nchains) # track chains that are still being simulated
-  ancestor_ids <- rep(1, nchains) # all chains start in generation 1
+  stat_track <- rep(1, ntrees) # track length or size (depending on `statistic`) #nolint
+  n_offspring <- rep(1, ntrees) # current number of offspring
+  sim <- seq_len(ntrees) # track chains that are still being simulated
+  ancestor_ids <- rep(1, ntrees) # all chains start in generation 1
 
   # initialise data frame to hold the transmission trees
   generation <- 1L
   tree_df <- data.frame(
-    chain_id = seq_len(nchains),
+    chain_id = seq_len(ntrees),
     sim_id = 1L,
     ancestor = NA_integer_,
     generation = generation
@@ -190,7 +190,7 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
     indices <- rep(sim, n_offspring[sim])
 
     # initialise placeholder for the number of offspring
-    n_offspring <- rep(0, nchains)
+    n_offspring <- rep(0, ntrees)
     # assign offspring sum to indices still being simulated
     n_offspring[sim] <- tapply(next_gen, indices, sum)
 
@@ -257,7 +257,7 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 
   structure(
     tree_df,
-    chains = nchains,
+    chains = ntrees,
     chain_type = "chains_tree",
     rownames = NULL,
     track_pop = FALSE,
@@ -282,20 +282,20 @@ simulate_tree <- function(nchains, statistic = c("size", "length"),
 #'   susceptible or partially immune population.
 #' @examples
 #' simulate_summary(
-#'   nchains = 10,
+#'   ntrees = 10,
 #'   statistic = "size",
 #'   offspring_dist = "pois",
 #'   stat_max = 10,
 #'   lambda = 2
 #' )
 #' @export
-simulate_summary <- function(nchains, statistic = c("size", "length"),
+simulate_summary <- function(ntrees, statistic = c("size", "length"),
                              offspring_dist,
                              stat_max = Inf, ...) {
   statistic <- match.arg(statistic)
 
   # Input checking
-  check_nchains_valid(nchains = nchains)
+  check_ntrees_valid(ntrees = ntrees)
   checkmate::assert_character(statistic)
 
   # check that offspring is properly specified
@@ -313,11 +313,11 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
   pars <- list(...)
 
   # Initialisations
-  stat_track <- rep(1, nchains) ## track length or size (depending on `stat`)
-  n_offspring <- rep(1, nchains) ## current number of offspring
-  sim <- seq_len(nchains) ## track chains that are still being simulated
+  stat_track <- rep(1, ntrees) ## track length or size (depending on `stat`)
+  n_offspring <- rep(1, ntrees) ## current number of offspring
+  sim <- seq_len(ntrees) ## track chains that are still being simulated
 
-  ## next, simulate nchains chains
+  ## next, simulate ntrees chains
   while (length(sim) > 0) {
     ## simulate next generation
     next_gen <- do.call(
@@ -335,7 +335,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
     indices <- rep(sim, n_offspring[sim])
 
     ## initialise number of offspring
-    n_offspring <- rep(0, nchains)
+    n_offspring <- rep(0, ntrees)
     ## assign offspring sum to indices still being simulated
     n_offspring[sim] <- tapply(next_gen, indices, sum)
 
@@ -357,7 +357,7 @@ simulate_summary <- function(nchains, statistic = c("size", "length"),
     stat_track,
     chain_type = "chains_summary",
     statistic = statistic,
-    chains = nchains,
+    chains = ntrees,
     class = c("epichains", class(stat_track))
   )
 }
diff --git a/R/stat_likelihoods.R b/R/stat_likelihoods.R
index 176494e1..af5c18b8 100644
--- a/R/stat_likelihoods.R
+++ b/R/stat_likelihoods.R
@@ -181,7 +181,7 @@ offspring_ll <- function(x, offspring_dist, statistic,
 
   # Simulate the chains
   dist <- simulate_summary(
-    nchains = nsim_offspring,
+    ntrees = nsim_offspring,
     offspring_dist = offspring_dist,
     statistic = statistic,
     ...
diff --git a/man/aggregate.epichains.Rd b/man/aggregate.epichains.Rd
index b4797d85..1582c810 100644
--- a/man/aggregate.epichains.Rd
+++ b/man/aggregate.epichains.Rd
@@ -25,7 +25,7 @@ Aggregate cases in \verb{<epichains>} objects by "time" or "generation"
 \examples{
 set.seed(123)
 chains <- simulate_tree(
-  nchains = 10,
+  ntrees = 10,
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
diff --git a/man/simulate_summary.Rd b/man/simulate_summary.Rd
index a453d680..71c0af9f 100644
--- a/man/simulate_summary.Rd
+++ b/man/simulate_summary.Rd
@@ -5,7 +5,7 @@
 \title{Simulate transmission chains sizes/lengths}
 \usage{
 simulate_summary(
-  nchains,
+  ntrees,
   statistic = c("size", "length"),
   offspring_dist,
   stat_max = Inf,
@@ -13,7 +13,7 @@ simulate_summary(
 )
 }
 \arguments{
-\item{nchains}{Number of chains to simulate.}
+\item{ntrees}{Number of trees to simulate.}
 
 \item{statistic}{String; Statistic (size/length) to calculate. Used to
 determine stopping criteria for simulations when \code{stat_max} is finite.
@@ -92,7 +92,7 @@ where \code{...} are the other arguments to \verb{simulate_*()}.
 
 \examples{
 simulate_summary(
-  nchains = 10,
+  ntrees = 10,
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index 568d86e3..79954c2a 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -5,7 +5,7 @@
 \title{Simulate transmission trees from an initial number of infections}
 \usage{
 simulate_tree(
-  nchains,
+  ntrees,
   statistic = c("size", "length"),
   offspring_dist,
   stat_max = Inf,
@@ -16,7 +16,7 @@ simulate_tree(
 )
 }
 \arguments{
-\item{nchains}{Number of chains to simulate.}
+\item{ntrees}{Number of trees to simulate.}
 
 \item{statistic}{String; Statistic (size/length) to calculate. Used to
 determine stopping criteria for simulations when \code{stat_max} is finite.
@@ -41,7 +41,7 @@ called \code{n}, that returns a numeric vector of \code{n} randomly sampled seri
 intervals. See details.}
 
 \item{t0}{Start time (if serial interval is given); either a single value
-or a vector of same length as \code{nchains} (number of simulations) with
+or a vector of same length as \code{ntrees} (number of simulations) with
 initial times. Defaults to 0.}
 
 \item{tf}{End time (if serial interval is given).}
@@ -115,7 +115,7 @@ where \code{...} are the other arguments to \verb{simulate_*()}.
 \examples{
 set.seed(123)
 chains <- simulate_tree(
-  nchains = 10,
+  ntrees = 10,
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
diff --git a/tests/testthat/test-checks.R b/tests/testthat/test-checks.R
index d0b619f0..a9d4313b 100644
--- a/tests/testthat/test-checks.R
+++ b/tests/testthat/test-checks.R
@@ -12,15 +12,7 @@ test_that("Checks work", {
     "must be a function"
   )
   expect_error(
-    check_serial_valid(function(x) rep("a", 10)),
-    "numeric"
-  )
-  expect_error(
-    check_serial_valid(function(x) 3),
-    "vector of length"
-  )
-  expect_error(
-    check_nchains_valid(1.1),
+    check_ntrees_valid(1.1),
     "less than"
   )
 })
diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index 58daf454..1232fcb9 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -22,14 +22,14 @@ test_that("Simulators return epichains objects", {
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
   )
   #' Simulate a tree of infections with serials
   tree_sim_raw2 <- simulate_tree(
-    nchains = 10,
+    ntrees = 10,
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
@@ -38,7 +38,7 @@ test_that("Simulators return epichains objects", {
   )
   #' Simulate chain statistics
   chain_summary_raw <- simulate_summary(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
@@ -85,14 +85,14 @@ test_that("print.epichains works for simulation functions", {
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
   )
   #' Simulate a tree of infections with serials
   tree_sim_raw2 <- simulate_tree(
-    nchains = 10,
+    ntrees = 10,
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
@@ -101,7 +101,7 @@ test_that("print.epichains works for simulation functions", {
   )
   #' Simulate chain statistics
   chain_summary_raw <- simulate_summary(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
@@ -133,14 +133,14 @@ test_that("summary.epichains works as expected", {
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
   )
   #' Simulate a tree of infections with serials
   tree_sim_raw2 <- simulate_tree(
-    nchains = 10,
+    ntrees = 10,
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
@@ -149,7 +149,7 @@ test_that("summary.epichains works as expected", {
   )
   #' Simulate chain statistics
   chain_summary_raw <- simulate_summary(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
@@ -157,7 +157,7 @@ test_that("summary.epichains works as expected", {
   #' Simulate case where all the chain statistics are Inf
   set.seed(11223)
   epichains_summary_all_infs <- simulate_summary(
-    nchains = 10,
+    ntrees = 10,
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
@@ -239,14 +239,14 @@ test_that("validate_epichains works", {
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
   )
   #' Simulate a tree of infections with serials
   tree_sim_raw2 <- simulate_tree(
-    nchains = 10,
+    ntrees = 10,
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
@@ -255,7 +255,7 @@ test_that("validate_epichains works", {
   )
   #' Simulate chain statistics
   chain_summary_raw <- simulate_summary(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
@@ -301,14 +301,14 @@ test_that("is_chains_tree works", {
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
   )
   #' Simulate a tree of infections with serials
   tree_sim_raw2 <- simulate_tree(
-    nchains = 10,
+    ntrees = 10,
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
@@ -317,7 +317,7 @@ test_that("is_chains_tree works", {
   )
   #' Simulate chain statistics
   chain_summary_raw <- simulate_summary(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
@@ -359,14 +359,14 @@ test_that("is_chains_summary works", {
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
   )
   #' Simulate a tree of infections with serials
   tree_sim_raw2 <- simulate_tree(
-    nchains = 10,
+    ntrees = 10,
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
@@ -375,7 +375,7 @@ test_that("is_chains_summary works", {
   )
   #' Simulate chain statistics
   chain_summary_raw <- simulate_summary(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
@@ -402,7 +402,7 @@ test_that("aggregate.epichains method returns correct objects", {
   set.seed(12)
   #' Simulate a tree of infections with serials
   tree_sim_raw2 <- simulate_tree(
-    nchains = 10,
+    ntrees = 10,
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
@@ -439,7 +439,7 @@ test_that("aggregate.epichains method throws errors", {
   expect_error(
     aggregate(
       simulate_tree(
-        nchains = 10,
+        ntrees = 10,
         statistic = "size",
         offspring_dist = "pois",
         stat_max = 10,
@@ -455,7 +455,7 @@ test_that("aggregate.epichains method is numerically correct", {
   set.seed(12)
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
-    nchains = 10,
+    ntrees = 10,
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
@@ -463,7 +463,7 @@ test_that("aggregate.epichains method is numerically correct", {
   )
   #' Simulate a tree of infections with serials
   tree_sim_raw2 <- simulate_tree(
-    nchains = 10,
+    ntrees = 10,
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
@@ -508,14 +508,14 @@ test_that("head and tail print output as expected", {
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
   )
   #' Simulate a tree of infections with serials
   tree_sim_raw2 <- simulate_tree(
-    nchains = 10,
+    ntrees = 10,
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
@@ -551,14 +551,14 @@ test_that("head and tail return data.frames", {
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
   )
   #' Simulate a tree of infections with serials
   tree_sim_raw2 <- simulate_tree(
-    nchains = 10,
+    ntrees = 10,
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index 29354fda..8cc039ab 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -22,14 +22,14 @@ test_that("Simulators work", {
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
   )
   #' Simulate a tree of infections with serials
   tree_sim_raw2 <- simulate_tree(
-    nchains = 10,
+    ntrees = 10,
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
@@ -38,7 +38,7 @@ test_that("Simulators work", {
   )
   #' Simulate chain statistics
   chain_summary_raw <- simulate_summary(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
@@ -67,7 +67,7 @@ test_that("Simulators work", {
   expect_true(
     all(
       simulate_tree(
-        nchains = 10,
+        ntrees = 10,
         statistic = "size",
         offspring_dist = "pois",
         stat_max = 10,
@@ -82,7 +82,7 @@ test_that("Simulators work", {
 test_that("simulate_tree throws errors", {
   expect_error(
     simulate_tree(
-      nchains = 2,
+      ntrees = 2,
       offspring_dist = "s",
       statistic = "length",
       lambda = 0.9
@@ -91,7 +91,7 @@ test_that("simulate_tree throws errors", {
   )
   expect_error(
     simulate_tree(
-      nchains = 2,
+      ntrees = 2,
       offspring_dist = "lnorm",
       statistic = "length",
       meanlog = 0.9,
@@ -101,7 +101,7 @@ test_that("simulate_tree throws errors", {
   )
   expect_error(
     simulate_tree(
-      nchains = 2,
+      ntrees = 2,
       offspring_dist = s,
       statistic = "length",
       meanlog = 0.9,
@@ -111,7 +111,7 @@ test_that("simulate_tree throws errors", {
   )
   expect_error(
     simulate_tree(
-      nchains = 2,
+      ntrees = 2,
       offspring_dist = "pois",
       statistic = "size",
       lambda = 0.9,
@@ -121,7 +121,7 @@ test_that("simulate_tree throws errors", {
   )
   expect_error(
     simulate_tree(
-      nchains = 2,
+      ntrees = 2,
       offspring_dist = c(1, 2),
       statistic = "length",
       lambda = 0.9
@@ -130,7 +130,7 @@ test_that("simulate_tree throws errors", {
   )
   expect_error(
     simulate_tree(
-      nchains = 2,
+      ntrees = 2,
       offspring_dist = "pois",
       statistic = "size",
       lambda = 0.9,
@@ -143,7 +143,7 @@ test_that("simulate_tree throws errors", {
 test_that("simulate_summary throws errors", {
   expect_error(
     simulate_summary(
-      nchains = 2,
+      ntrees = 2,
       offspring_dist = "s",
       statistic = "length",
       lambda = 0.9
@@ -152,7 +152,7 @@ test_that("simulate_summary throws errors", {
   )
   expect_error(
     simulate_summary(
-      nchains = 2,
+      ntrees = 2,
       offspring_dist = "lnorm",
       statistic = "length",
       meanlog = 0.9,
@@ -162,7 +162,7 @@ test_that("simulate_summary throws errors", {
   )
   expect_error(
     simulate_summary(
-      nchains = 2,
+      ntrees = 2,
       offspring_dist = s,
       statistic = "length",
       meanlog = 0.9,
@@ -172,7 +172,7 @@ test_that("simulate_summary throws errors", {
   )
   expect_error(
     simulate_summary(
-      nchains = 2,
+      ntrees = 2,
       offspring_dist = c(1, 2),
       statistic = "length",
       lambda = 0.9
@@ -226,7 +226,7 @@ test_that("simulate_tree is numerically correct", {
   set.seed(12)
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
@@ -297,7 +297,7 @@ test_that("simulate_summary is numerically correct", {
   set.seed(12)
   #' Simulate chain statistics
   chain_summary_raw <- simulate_summary(
-    nchains = 2,
+    ntrees = 2,
     offspring_dist = "pois",
     statistic = "length",
     lambda = 0.9
diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 1c6fbf90..6d6c979f 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -199,7 +199,7 @@ serial_func <- function(x) {
 }
 
 sim_tree_eg <- simulate_tree(
-  nchains = 10,
+  ntrees = 10,
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
@@ -222,7 +222,7 @@ mean of $0.9$.
 set.seed(123)
 
 simulate_summary_eg <- simulate_summary(
-  nchains = 10,
+  ntrees = 10,
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
@@ -269,7 +269,7 @@ serial_func <- function(n) {
 }
 
 sim_tree_eg <- simulate_tree(
-  nchains = 10,
+  ntrees = 10,
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
@@ -283,7 +283,7 @@ summary(sim_tree_eg)
 set.seed(123)
 
 simulate_summary_eg <- simulate_summary(
-  nchains = 10,
+  ntrees = 10,
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
@@ -309,7 +309,7 @@ serial_func <- function(n) {
 }
 
 sim_tree_eg <- simulate_tree(
-  nchains = 10,
+  ntrees = 10,
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
@@ -334,7 +334,7 @@ serial_func <- function(n) {
 }
 
 sim_tree_eg <- simulate_tree(
-  nchains = 10,
+  ntrees = 10,
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
diff --git a/vignettes/interventions.Rmd b/vignettes/interventions.Rmd
index 2176283f..4b445c31 100644
--- a/vignettes/interventions.Rmd
+++ b/vignettes/interventions.Rmd
@@ -45,7 +45,7 @@ We simulate 200 chains tracking up to 99 infections:
 
 ```{r simulate_chains}
 sims <- simulate_summary(
-  nchains = 200, offspring_dist = "nbinom", stat_max = 99, mu = 1.2, size = 0.5
+  ntrees = 200, offspring_dist = "nbinom", stat_max = 99, mu = 1.2, size = 0.5
 )
 ```
 
@@ -70,7 +70,7 @@ For example, to reduce R by 25% at the population level we scale the `mu` parame
 
 ```{r simulate_chains_pop_control}
 sims <- simulate_summary(
-  nchains = 200, offspring_dist = "nbinom", stat_max = 99, mu = 0.9, size = 0.5
+  ntrees = 200, offspring_dist = "nbinom", stat_max = 99, mu = 0.9, size = 0.5
 )
 sims[is.infinite(sims)] <- 100 # Replace infections > 99 with 100 for plotting.
 ggplot(data.frame(x = sims), aes(x = x)) +
@@ -107,7 +107,7 @@ Having defined this, we can generate simulations as before:
 
 ```{r simulate_chains_ind_control}
 sims <- simulate_summary(
-  nchains = 200, offspring_dist = "nbinom_ind", stat_max = 99, mu = 1.2,
+  ntrees = 200, offspring_dist = "nbinom_ind", stat_max = 99, mu = 1.2,
   size = 0.5, control = 0.25
 )
 sims[is.infinite(sims)] <- 100 # Replace infections > 99 with 100 for plotting.
@@ -137,7 +137,7 @@ This can be likened to a disease control strategy where gatherings are limited t
 
 ```{r simulate_chains_truncated}
 sims <- simulate_summary(
-  nchains = 200, offspring_dist = "nbinom_truncated", stat_max = 99, mu = 1.2,
+  ntrees = 200, offspring_dist = "nbinom_truncated", stat_max = 99, mu = 1.2,
   size = 0.5, max =  10
 )
 sims[is.infinite(sims)] <- 100 # Replace infections > 99 with 100 for plotting.
diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 63f9eb53..b147ff72 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -227,7 +227,7 @@ sim_chain_sizes <- lapply(
   seq_len(sim_rep),
   function(sim) {
     simulate_tree(
-      nchains = length(t0),
+      ntrees = length(t0),
       offspring_dist = "nbinom",
       mu = mu,
       size = size,

From 9fe2d89ca4f02c13a9b9d210e263a4947f9be55f Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 29 Nov 2023 15:40:01 +0000
Subject: [PATCH 760/828] Rename ancestor and chain_id to infector_id and
 infectee_id

---
 R/epichains.R                   | 28 +++++++++++------------
 R/simulate.r                    | 39 +++++++++++++++------------------
 man/head.epichains.Rd           |  2 +-
 man/likelihood.Rd               |  2 +-
 man/offspring_ll.Rd             |  2 +-
 man/simulate_summary.Rd         |  2 +-
 man/simulate_tree.Rd            |  8 +++----
 man/simulate_tree_from_pop.Rd   |  5 ++---
 man/tail.epichains.Rd           |  4 ++--
 tests/testthat/test-epichains.R |  8 +++----
 tests/testthat/test-simulate.R  | 16 +++++++-------
 11 files changed, 55 insertions(+), 61 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 3919dcf2..9c08db50 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -35,8 +35,8 @@ format.epichains <- function(x, ...) {
       c(
         sprintf("Chains simulated: %s", chain_info[["chains_run"]]),
         sprintf(
-          "Number of ancestors (known): %s",
-          chain_info[["unique_ancestors"]]
+          "Number of infectors (known): %s",
+          chain_info[["unique_infectors"]]
         ),
         sprintf(
           "Number of generations: %s", chain_info[["max_generation"]]
@@ -89,8 +89,8 @@ summary.epichains <- function(object, ...) {
   if (is_chains_tree(object)) {
     max_time <- ifelse(("time" %in% names(object)), max(object$time), NA)
 
-    n_unique_ancestors <- length(
-      unique(object$ancestor[!is.na(object$ancestor)])
+    n_unique_infectors <- length(
+      unique(object$infector_id[!is.na(object$infector_id)])
     )
 
     max_generation <- max(object$generation)
@@ -99,7 +99,7 @@ summary.epichains <- function(object, ...) {
     res <- list(
       chains_run = chains_run,
       max_time = max_time,
-      unique_ancestors = n_unique_ancestors,
+      unique_infectors = n_unique_infectors,
       max_generation = max_generation
     )
   } else if (is_chains_summary(object)) {
@@ -160,12 +160,12 @@ validate_epichains <- function(x) {
   if (is_chains_tree(x)) {
     stopifnot(
       "object does not contain the correct columns" =
-        c("sim_id", "ancestor", "generation") %in%
+        c("sim_id", "infector_id", "generation") %in%
         colnames(x),
       "column `sim_id` must be a numeric" =
         is.numeric(x$sim_id),
-      "column `ancestor` must be a numeric" =
-        is.numeric(x$ancestor),
+      "column `infector_id` must be a numeric" =
+        is.numeric(x$infector_id),
       "column `generation` must be a numeric" =
         is.numeric(x$generation)
     )
@@ -212,14 +212,14 @@ is_chains_summary <- function(x) {
 #' @export
 #' @details
 #' This returns the top rows of an `epichains` object. Note that the object
-#' is originally sorted by `sim_id` and `ancestor` and the first
+#' is originally sorted by `sim_id` and `infector_id` and the first
 #' unknown ancestors (NA) have been dropped from
 #' printing method. To view the full output, use `as.data.frame(<object_name>)`.
 #'
 head.epichains <- function(x, ...) {
-  writeLines("< tree head (from first known ancestor) >\n")
-  # print head of the simulation output from the first known ancestor
-  x <- x[!is.na(x$ancestor), ]
+  writeLines("< tree head (from first known infector) >\n")
+  # print head of the simulation output from the first known infector
+  x <- x[!is.na(x$infector_id), ]
   utils::head(as.data.frame(x), ...)
 }
 
@@ -231,8 +231,8 @@ head.epichains <- function(x, ...) {
 #' @author James M. Azam
 #' @export
 #' @details
-#' This returns the top rows of an `epichains` object. Note that the object
-#' is originally sorted by `sim_id` and `ancestor` and the first
+#' This returns the bottom part of an `epichains` object. Note that the object
+#' is originally sorted by `sim_id` and `infector_id` and the first
 #' unknown ancestors (NA) have been dropped from
 #' printing method. To view the full output, use `as.data.frame(<object_name>)`.
 tail.epichains <- function(x, ...) {
diff --git a/R/simulate.r b/R/simulate.r
index 22af4b0f..a9d1c292 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -10,7 +10,7 @@
 #' Can be one of:
 #' \itemize{
 #'   \item "size": the total number of offspring.
-#'   \item "length": the total number of ancestors.
+#'   \item "length": the total number of infectors.
 #' }
 #' @param stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to this value.
@@ -25,10 +25,8 @@
 #' @param tf End time (if serial interval is given).
 #' @param ... Parameters of the offspring distribution as required by R.
 #' @return An `<epichains>` object, which is basically a `<data.frame>` with
-#' columns `chain_id` (chain ID), `sim_id` (a unique ID within each simulation
-#' for each individual), `ancestor`
-#' (the ID of the ancestor of each individual), `generation`, and
-#' `time` (of infection)
+#' columns `infectee_id`, `sim_id` (a unique ID within each simulation
+#' for each infectee), `infector_id`, `generation`, and `time` (of infection)
 #' @author James M. Azam, Sebastian Funk
 #' @export
 #nolint start
@@ -163,7 +161,7 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
   tree_df <- data.frame(
     chain_id = seq_len(ntrees),
     sim_id = 1L,
-    ancestor = NA_integer_,
+    infector_id = NA_integer_,
     generation = generation
   )
 
@@ -201,10 +199,10 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
       n_offspring = n_offspring
     )
 
-    # record times/ancestors
+    # record times/infectors
     if (sum(n_offspring[sim]) > 0) {
-      ancestors <- rep(ancestor_ids, next_gen)
-      current_max_id <- unname(tapply(ancestor_ids, indices, max))
+      infectors <- rep(infector_ids, next_gen)
+      current_max_id <- unname(tapply(infector_ids, indices, max))
       indices <- rep(sim, n_offspring[sim])
 
       # create new ids
@@ -217,9 +215,9 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
       # store new simulation results
       new_df <-
         data.frame(
-          chain_id = indices,
+          infectee_id = indices,
           sim_id = ids,
-          ancestor = ancestors,
+          infector_id = infectors,
           generation = generation
         )
 
@@ -244,7 +242,7 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
       if (!missing(serials_dist)) {
         times <- times[indices %in% sim]
       }
-      ancestor_ids <- ids[indices %in% sim]
+      infector_ids <- ids[indices %in% sim]
     }
   }
 
@@ -252,8 +250,8 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
     tree_df <- tree_df[tree_df$time < tf, ]
   }
 
-  # sort by sim_id and ancestor
-  tree_df <- tree_df[order(tree_df$sim_id, tree_df$ancestor), ]
+  # sort by sim_id and infector
+  tree_df <- tree_df[order(tree_df$sim_id, tree_df$infector_id), ]
 
   structure(
     tree_df,
@@ -376,9 +374,8 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
 #' @param t0 Start time; Defaults to 0.
 #' @param tf End time; Defaults to `Inf`.
 #' @return An `<epichains>` object, which is basically a `<data.frame>` with
-#' columns `sim_id` (a unique ID within each simulation for each individual
-#' of the chain), `ancestor` (the ID of the ancestor of each individual),
-#' `generation`, and `time` (of infection).
+#' columns `sim_id` (a unique ID within each simulation for each infectee
+#' in the chain), `infector_id`, `generation`, and `time` (of infection).
 #' @details
 #' # Offspring distributions
 #' Currently, `offspring_dist` only supports "pois" & "nbinom".
@@ -501,7 +498,7 @@ simulate_tree_from_pop <- function(pop,
   ## initializations
   tree_df <- data.frame(
     sim_id = 1L,
-    ancestor = NA_integer_,
+    infector_id = NA_integer_,
     generation = 1L,
     time = t0,
     offspring_generated = FALSE # tracks simulation and dropped afterwards
@@ -544,7 +541,7 @@ simulate_tree_from_pop <- function(pop,
 
       new_df <- data.frame(
         sim_id = current_max_id + seq_len(n_offspring),
-        ancestor = id_parent,
+        infector_id = id_parent,
         generation = gen_parent + 1L,
         time = new_times + t_parent,
         offspring_generated = FALSE
@@ -562,8 +559,8 @@ simulate_tree_from_pop <- function(pop,
   ## have been generated in the last generation
   tree_df <- tree_df[tree_df$time <= tf, ]
 
-  # sort by sim_id and ancestor
-  tree_df <- tree_df[order(tree_df$sim_id, tree_df$ancestor), ]
+  # sort by sim_id and infector
+  tree_df <- tree_df[order(tree_df$sim_id, tree_df$infector_id), ]
   tree_df$offspring_generated <- NULL
 
   structure(
diff --git a/man/head.epichains.Rd b/man/head.epichains.Rd
index 7b06d4b4..d9dd5a14 100644
--- a/man/head.epichains.Rd
+++ b/man/head.epichains.Rd
@@ -19,7 +19,7 @@ object of class \code{data.frame}
 }
 \details{
 This returns the top rows of an \code{epichains} object. Note that the object
-is originally sorted by \code{sim_id} and \code{ancestor} and the first
+is originally sorted by \code{sim_id} and \code{infector_id} and the first
 unknown ancestors (NA) have been dropped from
 printing method. To view the full output, use \verb{as.data.frame(<object_name>)}.
 }
diff --git a/man/likelihood.Rd b/man/likelihood.Rd
index cfc5cb1e..8486467d 100644
--- a/man/likelihood.Rd
+++ b/man/likelihood.Rd
@@ -25,7 +25,7 @@ determine stopping criteria for simulations when \code{stat_max} is finite.
 Can be one of:
 \itemize{
 \item "size": the total number of offspring.
-\item "length": the total number of ancestors.
+\item "length": the total number of infectors.
 }}
 
 \item{offspring_dist}{Offspring distribution: a character string
diff --git a/man/offspring_ll.Rd b/man/offspring_ll.Rd
index bd8caeb2..35d08663 100644
--- a/man/offspring_ll.Rd
+++ b/man/offspring_ll.Rd
@@ -20,7 +20,7 @@ determine stopping criteria for simulations when \code{stat_max} is finite.
 Can be one of:
 \itemize{
 \item "size": the total number of offspring.
-\item "length": the total number of ancestors.
+\item "length": the total number of infectors.
 }}
 
 \item{nsim_offspring}{Number of simulations of the offspring distribution
diff --git a/man/simulate_summary.Rd b/man/simulate_summary.Rd
index 71c0af9f..c708af9c 100644
--- a/man/simulate_summary.Rd
+++ b/man/simulate_summary.Rd
@@ -20,7 +20,7 @@ determine stopping criteria for simulations when \code{stat_max} is finite.
 Can be one of:
 \itemize{
 \item "size": the total number of offspring.
-\item "length": the total number of ancestors.
+\item "length": the total number of infectors.
 }}
 
 \item{offspring_dist}{Offspring distribution: a character string
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index 79954c2a..ec98c3e3 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -23,7 +23,7 @@ determine stopping criteria for simulations when \code{stat_max} is finite.
 Can be one of:
 \itemize{
 \item "size": the total number of offspring.
-\item "length": the total number of ancestors.
+\item "length": the total number of infectors.
 }}
 
 \item{offspring_dist}{Offspring distribution: a character string
@@ -50,10 +50,8 @@ initial times. Defaults to 0.}
 }
 \value{
 An \verb{<epichains>} object, which is basically a \verb{<data.frame>} with
-columns \code{chain_id} (chain ID), \code{sim_id} (a unique ID within each simulation
-for each individual), \code{ancestor}
-(the ID of the ancestor of each individual), \code{generation}, and
-\code{time} (of infection)
+columns \code{infectee_id}, \code{sim_id} (a unique ID within each simulation
+for each infectee), \code{infector_id}, \code{generation}, and \code{time} (of infection)
 }
 \description{
 Simulate transmission trees from an initial number of infections
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 4bb937f4..f0a061aa 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -39,9 +39,8 @@ Must be less than \code{pop} - 1.}
 }
 \value{
 An \verb{<epichains>} object, which is basically a \verb{<data.frame>} with
-columns \code{sim_id} (a unique ID within each simulation for each individual
-of the chain), \code{ancestor} (the ID of the ancestor of each individual),
-\code{generation}, and \code{time} (of infection).
+columns \code{sim_id} (a unique ID within each simulation for each infectee
+in the chain), \code{infector_id}, \code{generation}, and \code{time} (of infection).
 }
 \description{
 Simulate transmission trees from a susceptible or partially immune
diff --git a/man/tail.epichains.Rd b/man/tail.epichains.Rd
index 21502c04..75b3134d 100644
--- a/man/tail.epichains.Rd
+++ b/man/tail.epichains.Rd
@@ -15,8 +15,8 @@
 \code{tail} method for \code{\link{epichains}} class
 }
 \details{
-This returns the top rows of an \code{epichains} object. Note that the object
-is originally sorted by \code{sim_id} and \code{ancestor} and the first
+This returns the bottom part of an \code{epichains} object. Note that the object
+is originally sorted by \code{sim_id} and \code{infector_id} and the first
 unknown ancestors (NA) have been dropped from
 printing method. To view the full output, use \verb{as.data.frame(<object_name>)}.
 }
diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index 1232fcb9..cb168ba3 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -169,7 +169,7 @@ test_that("summary.epichains works as expected", {
     c(
       "chains_run",
       "max_time",
-      "unique_ancestors",
+      "unique_infectors",
       "max_generation"
     )
   )
@@ -178,7 +178,7 @@ test_that("summary.epichains works as expected", {
     c(
       "chains_run",
       "max_time",
-      "unique_ancestors",
+      "unique_infectors",
       "max_generation"
     )
   )
@@ -187,7 +187,7 @@ test_that("summary.epichains works as expected", {
     c(
       "chains_run",
       "max_time",
-      "unique_ancestors",
+      "unique_infectors",
       "max_generation"
     )
   )
@@ -196,7 +196,7 @@ test_that("summary.epichains works as expected", {
     c(
       "chains_run",
       "max_time",
-      "unique_ancestors",
+      "unique_infectors",
       "max_generation"
     )
   )
diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index 8cc039ab..9cad2165 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -239,7 +239,7 @@ test_that("simulate_tree is numerically correct", {
     2.00
   )
   expect_identical(
-    tree_sim_summary$unique_ancestors,
+    tree_sim_summary$unique_infectors,
     2L
   )
   expect_identical(
@@ -247,7 +247,7 @@ test_that("simulate_tree is numerically correct", {
     3L
   )
   expect_identical(
-    tree_sim_raw$chain_id,
+    tree_sim_raw$infectee_id,
     c(1L, 2L, 2L, 2L, 2L, 2L, 2L)
   )
   expect_identical(
@@ -255,7 +255,7 @@ test_that("simulate_tree is numerically correct", {
     c(1, 1, 2, 3, 4, 5, 6)
   )
   expect_identical(
-    tree_sim_raw$ancestor,
+    tree_sim_raw$infector_id,
     c(NA, NA, 1, 1, 2, 2, 2)
   )
   expect_identical(
@@ -268,7 +268,7 @@ test_that("simulate_tree is numerically correct", {
     2.0
   )
   expect_identical(
-    tree_sim_summary$unique_ancestors,
+    tree_sim_summary$unique_infectors,
     2L
   )
   expect_identical(
@@ -276,7 +276,7 @@ test_that("simulate_tree is numerically correct", {
     3L
   )
   expect_identical(
-    tree_sim_raw$chain_id,
+    tree_sim_raw$infectee_id,
     c(1L, 2L, 2L, 2L, 2L, 2L, 2L)
   )
   expect_identical(
@@ -284,7 +284,7 @@ test_that("simulate_tree is numerically correct", {
     c(1, 1, 2, 3, 4, 5, 6)
   )
   expect_identical(
-    tree_sim_raw$ancestor,
+    tree_sim_raw$infector_id,
     c(NA, NA, 1, 1, 2, 2, 2)
   )
   expect_identical(
@@ -336,7 +336,7 @@ test_that("simulate_tree_from_pop is numerically correct", {
   susc_outbreak_summary <- summary(susc_outbreak_raw)
   #' Expectations
   expect_identical(
-    susc_outbreak_summary$unique_ancestors,
+    susc_outbreak_summary$unique_infectors,
     0L
   )
   expect_identical(
@@ -353,7 +353,7 @@ test_that("simulate_tree_from_pop is numerically correct", {
     1L
   )
   expect_identical(
-    susc_outbreak_raw$ancestor,
+    susc_outbreak_raw$infector_id,
     NA_integer_
   )
   expect_identical(

From 4dc7fa4e6a35281fe6dfad267773eed93b3926f9 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 29 Nov 2023 15:40:23 +0000
Subject: [PATCH 761/828] Remove rownames

---
 R/simulate.r | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index a9d1c292..bcade79c 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -252,7 +252,7 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
 
   # sort by sim_id and infector
   tree_df <- tree_df[order(tree_df$sim_id, tree_df$infector_id), ]
-
+  row.names(tree_df) <- NULL
   structure(
     tree_df,
     chains = ntrees,
@@ -562,7 +562,7 @@ simulate_tree_from_pop <- function(pop,
   # sort by sim_id and infector
   tree_df <- tree_df[order(tree_df$sim_id, tree_df$infector_id), ]
   tree_df$offspring_generated <- NULL
-
+  row.names(tree_df) <- NULL
   structure(
     tree_df,
     chain_type = "chains_tree",

From fff9cf8bf86293e4f120852578498331ebe51601 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 29 Nov 2023 15:42:02 +0000
Subject: [PATCH 762/828] Update snapshots

---
 tests/testthat/_snaps/epichains.md | 32 +++++++++++++++---------------
 1 file changed, 16 insertions(+), 16 deletions(-)

diff --git a/tests/testthat/_snaps/epichains.md b/tests/testthat/_snaps/epichains.md
index 41cc691b..8871a0a8 100644
--- a/tests/testthat/_snaps/epichains.md
+++ b/tests/testthat/_snaps/epichains.md
@@ -5,16 +5,16 @@
     Output
       `epichains` object
       
-      < tree head (from first known ancestor) >
+      < tree head (from first known infector) >
       
-      [1] sim_id     ancestor   generation time      
+      [1] sim_id      infector_id generation  time       
       <0 rows> (or 0-length row.names)
       
       < tree tail >
       
-        sim_id ancestor generation time
-      1      1       NA          1    0
-      Number of ancestors (known): 0
+      [1] sim_id      infector_id generation  time       
+      <0 rows> (or 0-length row.names)
+      Number of infectors (known): 0
       Number of generations: 1
       Use `as.data.frame(<object_name>)` to view the full output in the console.
 
@@ -25,7 +25,7 @@
     Output
       `epichains` object
       
-      < tree head (from first known ancestor) >
+      < tree head (from first known infector) >
       
         sim_id ancestor generation     time
       2      2        1          2 42.57973
@@ -55,7 +55,7 @@
     Output
       `epichains` object
       
-      < tree head (from first known ancestor) >
+      < tree head (from first known infector) >
       
          chain_id sim_id ancestor generation
       3         1      2        1          2
@@ -86,7 +86,7 @@
     Output
       `epichains` object
       
-      < tree head (from first known ancestor) >
+      < tree head (from first known infector) >
       
          chain_id sim_id ancestor generation time
       11        1      2        1          2    3
@@ -106,7 +106,7 @@
       105        4     22        9          4    9
       106        4     23        9          4    9
       Chains simulated: 10
-      Number of ancestors (known): 9
+      Number of infectors (known): 9
       Number of generations: 5
       Use `as.data.frame(<object_name>)` to view the full output in the console.
 
@@ -129,9 +129,9 @@
     Code
       head(susc_outbreak_raw)
     Output
-      < tree head (from first known ancestor) >
+      < tree head (from first known infector) >
       
-      [1] sim_id     ancestor   generation time      
+      [1] sim_id      infector_id generation  time       
       <0 rows> (or 0-length row.names)
 
 ---
@@ -139,7 +139,7 @@
     Code
       head(susc_outbreak_raw2)
     Output
-      < tree head (from first known ancestor) >
+      < tree head (from first known infector) >
       
         sim_id ancestor generation     time
       2      2        1          2 42.57973
@@ -154,7 +154,7 @@
     Code
       head(tree_sim_raw)
     Output
-      < tree head (from first known ancestor) >
+      < tree head (from first known infector) >
       
          chain_id sim_id ancestor generation
       3         1      2        1          2
@@ -169,7 +169,7 @@
     Code
       head(tree_sim_raw2)
     Output
-      < tree head (from first known ancestor) >
+      < tree head (from first known infector) >
       
          chain_id sim_id ancestor generation time
       11        1      2        1          2    3
@@ -187,8 +187,8 @@
       
       < tree tail >
       
-        sim_id ancestor generation time
-      1      1       NA          1    0
+      [1] sim_id      infector_id generation  time       
+      <0 rows> (or 0-length row.names)
 
 ---
 

From 11bc5dd59e73ffd5139c57834acba3d13590161c Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 1 Dec 2023 15:17:25 +0000
Subject: [PATCH 763/828] Rename ancestor_id to infector_id

---
 R/simulate.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index bcade79c..2f1887f5 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -154,7 +154,7 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
   stat_track <- rep(1, ntrees) # track length or size (depending on `statistic`) #nolint
   n_offspring <- rep(1, ntrees) # current number of offspring
   sim <- seq_len(ntrees) # track chains that are still being simulated
-  ancestor_ids <- rep(1, ntrees) # all chains start in generation 1
+  infector_ids <- rep(1, ntrees) # all chains start in generation 1
 
   # initialise data frame to hold the transmission trees
   generation <- 1L

From adb123f0ee2a3f20cb59f371eb9d0d6a21102941 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 1 Dec 2023 15:17:47 +0000
Subject: [PATCH 764/828] Rename chain_id col name to infectee_id

---
 R/simulate.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index 2f1887f5..41ddb7a9 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -159,7 +159,7 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
   # initialise data frame to hold the transmission trees
   generation <- 1L
   tree_df <- data.frame(
-    chain_id = seq_len(ntrees),
+    infectee_id = seq_len(ntrees),
     sim_id = 1L,
     infector_id = NA_integer_,
     generation = generation

From 601476313989e6a587a9b37bda36762ae5bf2ec2 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 1 Dec 2023 15:17:57 +0000
Subject: [PATCH 765/828] Fix snapshots

---
 tests/testthat/_snaps/epichains.md | 180 ++++++++++++++---------------
 1 file changed, 90 insertions(+), 90 deletions(-)

diff --git a/tests/testthat/_snaps/epichains.md b/tests/testthat/_snaps/epichains.md
index 8871a0a8..9894418f 100644
--- a/tests/testthat/_snaps/epichains.md
+++ b/tests/testthat/_snaps/epichains.md
@@ -12,8 +12,8 @@
       
       < tree tail >
       
-      [1] sim_id      infector_id generation  time       
-      <0 rows> (or 0-length row.names)
+        sim_id infector_id generation time
+      1      1          NA          1    0
       Number of infectors (known): 0
       Number of generations: 1
       Use `as.data.frame(<object_name>)` to view the full output in the console.
@@ -27,24 +27,24 @@
       
       < tree head (from first known infector) >
       
-        sim_id ancestor generation     time
-      2      2        1          2 42.57973
-      3      3        2          3 42.80500
-      4      4        2          3 42.70415
-      5      5        4          4 43.87477
-      6      6        4          4 44.00812
-      7      7        3          4 78.73481
+        sim_id infector_id generation     time
+      2      2           1          2 42.57973
+      3      3           2          3 42.80500
+      4      4           2          3 42.70415
+      5      5           4          4 43.87477
+      6      6           4          4 44.00812
+      7      7           3          4 78.73481
       
       < tree tail >
       
-         sim_id ancestor generation     time
-      7       7        3          4 78.73481
-      8       8        5          5 47.03948
-      9       9        6          5 45.38534
-      10     10        9          6 46.14505
-      11     11        8          6 48.03103
-      12     12        7          5 81.49185
-      Number of ancestors (known): 9
+         sim_id infector_id generation     time
+      7       7           3          4 78.73481
+      8       8           5          5 47.03948
+      9       9           6          5 45.38534
+      10     10           9          6 46.14505
+      11     11           8          6 48.03103
+      12     12           7          5 81.49185
+      Number of infectors (known): 9
       Number of generations: 6
       Use `as.data.frame(<object_name>)` to view the full output in the console.
 
@@ -57,25 +57,25 @@
       
       < tree head (from first known infector) >
       
-         chain_id sim_id ancestor generation
-      3         1      2        1          2
-      6         2      2        1          2
-      4         1      3        1          2
-      7         2      3        1          2
-      5         1      4        1          2
-      11        2      4        2          3
+        infectee_id sim_id infector_id generation
+      3           1      2           1          2
+      4           2      2           1          2
+      5           1      3           1          2
+      6           2      3           1          2
+      7           1      4           1          2
+      8           2      4           2          3
       
       < tree tail >
       
-         chain_id sim_id ancestor generation
-      9         1      6        4          3
-      10        1      7        4          3
-      15        2      7        6          4
-      16        2      8        6          4
-      14        1      8        7          4
-      17        2      9        8          5
+         infectee_id sim_id infector_id generation
+      12           1      6           4          3
+      13           1      7           4          3
+      14           2      7           6          4
+      15           2      8           6          4
+      16           1      8           7          4
+      17           2      9           8          5
       Chains simulated: 2
-      Number of ancestors (known): 7
+      Number of infectors (known): 7
       Number of generations: 5
       Use `as.data.frame(<object_name>)` to view the full output in the console.
 
@@ -88,23 +88,23 @@
       
       < tree head (from first known infector) >
       
-         chain_id sim_id ancestor generation time
-      11        1      2        1          2    3
-      12        2      2        1          2    3
-      13        3      2        1          2    3
-      15        4      2        1          2    3
-      17        5      2        1          2    3
-      20        6      2        1          2    3
+         infectee_id sim_id infector_id generation time
+      11           1      2           1          2    3
+      12           2      2           1          2    3
+      13           3      2           1          2    3
+      14           4      2           1          2    3
+      15           5      2           1          2    3
+      16           6      2           1          2    3
       
       < tree tail >
       
-          chain_id sim_id ancestor generation time
-      131       10     19        9          4    9
-      81         2     20        6          4    9
-      103        4     20        9          4    9
-      104        4     21        9          4    9
-      105        4     22        9          4    9
-      106        4     23        9          4    9
+          infectee_id sim_id infector_id generation time
+      138          10     19           9          4    9
+      139           2     20           6          4    9
+      140           4     20           9          4    9
+      141           4     21           9          4    9
+      142           4     22           9          4    9
+      143           4     23           9          4    9
       Chains simulated: 10
       Number of infectors (known): 9
       Number of generations: 5
@@ -141,13 +141,13 @@
     Output
       < tree head (from first known infector) >
       
-        sim_id ancestor generation     time
-      2      2        1          2 42.57973
-      3      3        2          3 42.80500
-      4      4        2          3 42.70415
-      5      5        4          4 43.87477
-      6      6        4          4 44.00812
-      7      7        3          4 78.73481
+        sim_id infector_id generation     time
+      2      2           1          2 42.57973
+      3      3           2          3 42.80500
+      4      4           2          3 42.70415
+      5      5           4          4 43.87477
+      6      6           4          4 44.00812
+      7      7           3          4 78.73481
 
 ---
 
@@ -156,13 +156,13 @@
     Output
       < tree head (from first known infector) >
       
-         chain_id sim_id ancestor generation
-      3         1      2        1          2
-      6         2      2        1          2
-      4         1      3        1          2
-      7         2      3        1          2
-      5         1      4        1          2
-      11        2      4        2          3
+        infectee_id sim_id infector_id generation
+      3           1      2           1          2
+      4           2      2           1          2
+      5           1      3           1          2
+      6           2      3           1          2
+      7           1      4           1          2
+      8           2      4           2          3
 
 ---
 
@@ -171,13 +171,13 @@
     Output
       < tree head (from first known infector) >
       
-         chain_id sim_id ancestor generation time
-      11        1      2        1          2    3
-      12        2      2        1          2    3
-      13        3      2        1          2    3
-      15        4      2        1          2    3
-      17        5      2        1          2    3
-      20        6      2        1          2    3
+         infectee_id sim_id infector_id generation time
+      11           1      2           1          2    3
+      12           2      2           1          2    3
+      13           3      2           1          2    3
+      14           4      2           1          2    3
+      15           5      2           1          2    3
+      16           6      2           1          2    3
 
 ---
 
@@ -187,8 +187,8 @@
       
       < tree tail >
       
-      [1] sim_id      infector_id generation  time       
-      <0 rows> (or 0-length row.names)
+        sim_id infector_id generation time
+      1      1          NA          1    0
 
 ---
 
@@ -198,13 +198,13 @@
       
       < tree tail >
       
-         sim_id ancestor generation     time
-      7       7        3          4 78.73481
-      8       8        5          5 47.03948
-      9       9        6          5 45.38534
-      10     10        9          6 46.14505
-      11     11        8          6 48.03103
-      12     12        7          5 81.49185
+         sim_id infector_id generation     time
+      7       7           3          4 78.73481
+      8       8           5          5 47.03948
+      9       9           6          5 45.38534
+      10     10           9          6 46.14505
+      11     11           8          6 48.03103
+      12     12           7          5 81.49185
 
 ---
 
@@ -214,13 +214,13 @@
       
       < tree tail >
       
-         chain_id sim_id ancestor generation
-      9         1      6        4          3
-      10        1      7        4          3
-      15        2      7        6          4
-      16        2      8        6          4
-      14        1      8        7          4
-      17        2      9        8          5
+         infectee_id sim_id infector_id generation
+      12           1      6           4          3
+      13           1      7           4          3
+      14           2      7           6          4
+      15           2      8           6          4
+      16           1      8           7          4
+      17           2      9           8          5
 
 ---
 
@@ -230,11 +230,11 @@
       
       < tree tail >
       
-          chain_id sim_id ancestor generation time
-      131       10     19        9          4    9
-      81         2     20        6          4    9
-      103        4     20        9          4    9
-      104        4     21        9          4    9
-      105        4     22        9          4    9
-      106        4     23        9          4    9
+          infectee_id sim_id infector_id generation time
+      138          10     19           9          4    9
+      139           2     20           6          4    9
+      140           4     20           9          4    9
+      141           4     21           9          4    9
+      142           4     22           9          4    9
+      143           4     23           9          4    9
 

From bf7000d1395b520b542f8b514cd159333e74fa69 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 29 Nov 2023 17:04:01 +0000
Subject: [PATCH 766/828] Rename function to refer to gen_interval

---
 R/checks.R                      | 17 +++++++----------
 man/check_gen_interval_valid.Rd | 17 +++++++++++++++++
 2 files changed, 24 insertions(+), 10 deletions(-)
 create mode 100644 man/check_gen_interval_valid.Rd

diff --git a/R/checks.R b/R/checks.R
index 5625922a..9a329fa0 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -30,22 +30,19 @@ check_offspring_func_valid <- function(roffspring_name) {
 }
 
 
-#' Check if the serials_dist argument is valid.
+#' Check if the gen_interval argument is specified as a function
 #'
-#' Check if the serials_dist argument is a function with one argument `n`
-#' and returns a numerical vector of length `n`.
-#'
-#' @param serials_dist The serial interval distribution function; the name of a
+#' @param gen_interval The generation interval function; the name of a
 #' user-defined named or anonymous function with only one argument `n`,
-#' representing the number of serial intervals to generate.
+#' representing the number of generation intervals to sample.
 #'
 #' @keywords internal
-check_serial_valid <- function(serials_dist) {
-  if (!checkmate::test_function(serials_dist, nargs = 1)) {
+check_gen_interval_valid <- function(gen_interval) {
+  if (!checkmate::test_function(gen_interval, nargs = 1)) {
     stop(sprintf(
       "%s %s",
-      "The `serials_dist` argument must be a function",
-      "(see details in ?sim_chain_tree)."
+      "The `gen_interval` argument must be a function",
+      "(see details in ?simulate_tree)."
     ))
   }
   x <- serials_dist(10)
diff --git a/man/check_gen_interval_valid.Rd b/man/check_gen_interval_valid.Rd
new file mode 100644
index 00000000..c9d0b376
--- /dev/null
+++ b/man/check_gen_interval_valid.Rd
@@ -0,0 +1,17 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/checks.R
+\name{check_gen_interval_valid}
+\alias{check_gen_interval_valid}
+\title{Check if the gen_interval argument is specified as a function}
+\usage{
+check_gen_interval_valid(gen_interval)
+}
+\arguments{
+\item{gen_interval}{The generation interval function; the name of a
+user-defined named or anonymous function with only one argument \code{n},
+representing the number of generation intervals to sample.}
+}
+\description{
+Check if the gen_interval argument is specified as a function
+}
+\keyword{internal}

From 4481ebe6eed0e6366ed922c88367621957104649 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 29 Nov 2023 17:12:47 +0000
Subject: [PATCH 767/828] Replace all occurrences of serial interval with
 generation interval

---
 R/epichains.R                      |  2 +-
 R/simulate.r                       | 95 ++++++++++++------------------
 man/aggregate.epichains.Rd         |  2 +-
 man/simulate_summary.Rd            | 39 ++++--------
 man/simulate_tree.Rd               | 56 ++++++------------
 man/simulate_tree_from_pop.Rd      | 53 ++---------------
 tests/testthat/test-checks.R       |  2 +-
 tests/testthat/test-epichains.R    | 52 ++++++++--------
 tests/testthat/test-simulate.R     | 22 +++----
 vignettes/epichains.Rmd            | 46 +++++++--------
 vignettes/projecting_incidence.Rmd |  7 ++-
 11 files changed, 142 insertions(+), 234 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 9c08db50..93bbde3e 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -259,7 +259,7 @@ tail.epichains <- function(x, ...) {
 #'   statistic = "size",
 #'   offspring_dist = "pois",
 #'   stat_max = 10,
-#'   serials_dist = function(n) rep(3, n),
+#'   gen_interval = function(x) 3,
 #'   lambda = 2
 #' )
 #' chains
diff --git a/R/simulate.r b/R/simulate.r
index 41ddb7a9..ec096a70 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -15,14 +15,13 @@
 #' @param stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to this value.
 #' Defaults to `Inf`.
-#' @param serials_dist The serial interval distribution function; the name of a
-#' user-defined named or anonymous function with only one argument, usually
-#' called `n`, that returns a numeric vector of `n` randomly sampled serial
-#' intervals. See details.
-#' @param t0 Start time (if serial interval is given); either a single value
-#' or a vector of same length as `ntrees` (number of simulations) with
+#' @param gen_interval The generation interval function; the name
+#' of a user-defined named or anonymous function with only one argument `n`,
+#' representing the number of generation intervals to generate. See details.
+#' @param t0 Start time (if generation interval is given); either a single value
+#' or a vector of same length as `nchains` (number of simulations) with
 #' initial times. Defaults to 0.
-#' @param tf End time (if serial interval is given).
+#' @param tf End time (if generation interval is given).
 #' @param ... Parameters of the offspring distribution as required by R.
 #' @return An `<epichains>` object, which is basically a `<data.frame>` with
 #' columns `infectee_id`, `sim_id` (a unique ID within each simulation
@@ -44,41 +43,27 @@
 #' The distribution of secondary cases, \eqn{X_{t, i}} is modelled by the
 #' offspring distribution (`offspring_dist`).
 #'
-#' # The serial interval (`serials_dist`)
-#' ## Assumptions/disambiguation
+#' ## Specifying `gen_interval`
 #'
-#' In epidemiology, the generation interval is the duration between successive
-#' infectious events in a chain of transmission. Similarly, the serial
-#' interval is the duration between observed symptom onset times between
-#' successive cases in a transmission chain. The generation interval is
-#' often hard to observe because exact times of infection are hard to
-#' measure hence, the serial interval is often used instead . Here, we
-#' use the serial interval to represent what would normally be called the
-#' generation interval, that is, the time between successive cases.
-#'
-#' See References below for some literature on the subject.
-#'
-#' ## Specifying `serials_dist`
-#'
-#' `serials_dist` must be specified as a named or
+#' `gen_interval` must be specified as a named or
 #' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R)
 #' with one argument.
 #'
-#' For example, assuming we want to specify the serial interval
-#' distribution as a random log-normally distributed variable with
+#' For example, assuming we want to specify the generation interval
+#' as a random log-normally distributed variable with
 #' `meanlog = 0.58` and `sdlog = 1.58`, we could define a named function,
-#' let's call it "serial_interval", with only one argument representing the
-#' number of serial intervals to sample:
-#' \code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
-#' and assign the name of the function to `serials_dist` in
+#' let's call it "gen_interval", with only one argument representing the
+#' number of generation intervals to sample:
+#' \code{gen_interval_func <- function(n){rlnorm(n, 0.58, 1.38)}},
+#' and assign the name of the function to `gen_interval` in
 #' the simulation function, i.e.
-#' \code{`simulate_*`(..., serials_dist = serial_interval)},
+#' \code{`simulate_*`(..., gen_interval = gen_interval_func)},
 #' where `...` are the other arguments to `simulate_*()` and * is a placeholder
 #' for the rest of simulation function's name.
 #'
-#' Alternatively, we could assign an anonymous function to `serials_dist`
+#' Alternatively, we could assign an anonymous function to `gen_interval`
 #' in the `simulate_*()` call, i.e.
-#' \code{simulate_*(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})},
+#' \code{simulate_*(..., gen_interval = function(n){rlnorm(n, 0.58, 1.38)})},
 #' where `...` are the other arguments to `simulate_*()`.
 #nolint end
 #' @seealso
@@ -93,7 +78,7 @@
 #'   statistic = "size",
 #'   offspring_dist = "pois",
 #'   stat_max = 10,
-#'   serials_dist = function(n) rep(3, n),
+#'   gen_interval = function(x) 3,
 #'   lambda = 2
 #' )
 #' @references
@@ -112,7 +97,7 @@
 #' 1186–1204. \doi{https://doi.org/10.3390/ijerph7031204}
 simulate_tree <- function(ntrees, statistic = c("size", "length"),
                           offspring_dist, stat_max = Inf,
-                          serials_dist, t0 = 0,
+                          gen_interval, t0 = 0,
                           tf = Inf, ...) {
   statistic <- match.arg(statistic)
 
@@ -131,8 +116,8 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
     stat_max, lower = 0
   )
 
-  if (!missing(serials_dist)) {
-    check_serial_valid(serials_dist)
+  if (!missing(gen_interval)) {
+    check_gen_interval_valid(gen_interval)
   }
   checkmate::assert_numeric(
     t0, lower = 0, finite = TRUE
@@ -144,10 +129,10 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
   # Gather offspring distribution parameters
   pars <- list(...)
 
-  if (!missing(serials_dist)) {
-    check_serial_valid(serials_dist)
+  if (!missing(gen_interval)) {
+    check_gen_interval_valid(gen_interval)
   } else if (!missing(tf)) {
-    stop("If `tf` is specified, `serials_dist` must be specified too.")
+    stop("If `tf` is specified, `gen_interval` must be specified too.")
   }
 
   # Initialisations
@@ -165,7 +150,7 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
     generation = generation
   )
 
-  if (!missing(serials_dist)) {
+  if (!missing(gen_interval)) {
     tree_df$time <- t0
     times <- tree_df$time
   }
@@ -221,10 +206,10 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
           generation = generation
         )
 
-      # if a serial interval model/function was specified, use it
-      # to generate serial intervals for the cases
-      if (!missing(serials_dist)) {
-        times <- rep(times, next_gen) + serials_dist(sum(n_offspring))
+      # if a generation interval model/function was specified, use it
+      # to generate generation intervals for the cases
+      if (!missing(gen_interval)) {
+        times <- rep(times, next_gen) + gen_interval(sum(n_offspring))
         current_min_time <- unname(tapply(times, indices, min))
         new_df$time <- times
       }
@@ -235,11 +220,11 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
     ## the specified maximum size/length
     sim <- which(n_offspring > 0 & stat_track < stat_max)
     if (length(sim) > 0) {
-      if (!missing(serials_dist)) {
+      if (!missing(gen_interval)) {
         ## only continue to simulate chains that don't go beyond tf
         sim <- intersect(sim, unique(indices)[current_min_time < tf])
       }
-      if (!missing(serials_dist)) {
+      if (!missing(gen_interval)) {
         times <- times[indices %in% sim]
       }
       infector_ids <- ids[indices %in% sim]
@@ -271,7 +256,6 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
 #' @param stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to `Inf`.
 #' @inheritSection simulate_tree Calculating chain sizes and lengths
-#' @inheritSection simulate_tree The serial interval (`serials_dist`)
 #' @author James M. Azam, Sebastian Funk
 #' @seealso
 #' * [simulate_tree()] for simulating transmission trees from an
@@ -400,7 +384,6 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
 #'  * the maximal chain statistic is limited by `pop` instead of
 #'  `stat_max` (in `simulate_tree()`),
 #'  * `offspring_dist` can only handle "pois" and "nbinom".
-#' @inheritSection simulate_tree The serial interval (`serials_dist`)
 #' @author Flavio Finger, James M. Azam, Sebastian Funk
 #' @seealso
 #' * [simulate_tree()] for simulating transmission trees from an
@@ -413,7 +396,7 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
 #'   pop = 100,
 #'   offspring_dist = "pois",
 #'   lambda = 0.5,
-#'   serials_dist = function(n) rep(3, n)
+#'   gen_interval = function(x) 3
 #' )
 #'
 #' # Simulate with negative binomial offspring
@@ -421,12 +404,12 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
 #' pop = 100, offspring_dist = "nbinom",
 #' mu = 0.5,
 #' size = 1.1,
-#' serials_dist = function(n) rep(3, n)
+#' gen_interval = function(x) 3
 #' )
 #' @export
 simulate_tree_from_pop <- function(pop,
                                    offspring_dist = c("pois", "nbinom"),
-                                   serials_dist,
+                                   gen_interval,
                                    initial_immune = 0,
                                    t0 = 0,
                                    tf = Inf,
@@ -438,8 +421,8 @@ simulate_tree_from_pop <- function(pop,
     pop, lower = 1, finite = TRUE
   )
   checkmate::assert_string(offspring_dist)
-  if (!missing(serials_dist)) {
-    check_serial_valid(serials_dist)
+  if (!missing(gen_interval)) {
+    check_gen_interval_valid(gen_interval)
   }
   checkmate::assert_number(
     initial_immune, lower = 0, upper = pop - 1
@@ -532,11 +515,11 @@ simulate_tree_from_pop <- function(pop,
 
     ## add to df
     if (n_offspring > 0) {
-      ## draw serial times
-      new_times <- serials_dist(n_offspring)
+      ## draw generation times
+      new_times <- gen_interval(n_offspring)
 
       if (any(new_times < 0)) {
-        stop("Serial interval must be >= 0.")
+        stop("Generation interval must be >= 0.")
       }
 
       new_df <- data.frame(
diff --git a/man/aggregate.epichains.Rd b/man/aggregate.epichains.Rd
index 1582c810..734ada1a 100644
--- a/man/aggregate.epichains.Rd
+++ b/man/aggregate.epichains.Rd
@@ -29,7 +29,7 @@ chains <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  serials_dist = function(n) rep(3, n),
+  gen_interval = function(x) 3,
   lambda = 2
 )
 chains
diff --git a/man/simulate_summary.Rd b/man/simulate_summary.Rd
index c708af9c..0f868a53 100644
--- a/man/simulate_summary.Rd
+++ b/man/simulate_summary.Rd
@@ -48,44 +48,27 @@ at time \eqn{t}, and \eqn{I_{0, i} = L_{0, i} = 1}.
 
 The distribution of secondary cases, \eqn{X_{t, i}} is modelled by the
 offspring distribution (\code{offspring_dist}).
-}
-
-\section{The serial interval (\code{serials_dist})}{
-\subsection{Assumptions/disambiguation}{
-
-In epidemiology, the generation interval is the duration between successive
-infectious events in a chain of transmission. Similarly, the serial
-interval is the duration between observed symptom onset times between
-successive cases in a transmission chain. The generation interval is
-often hard to observe because exact times of infection are hard to
-measure hence, the serial interval is often used instead . Here, we
-use the serial interval to represent what would normally be called the
-generation interval, that is, the time between successive cases.
-
-See References below for some literature on the subject.
-}
-
-\subsection{Specifying \code{serials_dist}}{
+\subsection{Specifying \code{gen_interval}}{
 
-\code{serials_dist} must be specified as a named or
+\code{gen_interval} must be specified as a named or
 \href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function}
 with one argument.
 
-For example, assuming we want to specify the serial interval
-distribution as a random log-normally distributed variable with
+For example, assuming we want to specify the generation interval
+as a random log-normally distributed variable with
 \code{meanlog = 0.58} and \code{sdlog = 1.58}, we could define a named function,
-let's call it "serial_interval", with only one argument representing the
-number of serial intervals to sample:
-\code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
-and assign the name of the function to \code{serials_dist} in
+let's call it "gen_interval", with only one argument representing the
+number of generation intervals to sample:
+\code{gen_interval_func <- function(n){rlnorm(n, 0.58, 1.38)}},
+and assign the name of the function to \code{gen_interval} in
 the simulation function, i.e.
-\code{`simulate_*`(..., serials_dist = serial_interval)},
+\code{`simulate_*`(..., gen_interval = gen_interval_func)},
 where \code{...} are the other arguments to \verb{simulate_*()} and * is a placeholder
 for the rest of simulation function's name.
 
-Alternatively, we could assign an anonymous function to \code{serials_dist}
+Alternatively, we could assign an anonymous function to \code{gen_interval}
 in the \verb{simulate_*()} call, i.e.
-\code{simulate_*(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})},
+\code{simulate_*(..., gen_interval = function(n){rlnorm(n, 0.58, 1.38)})},
 where \code{...} are the other arguments to \verb{simulate_*()}.
 }
 }
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index ec98c3e3..a1d4d933 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -9,7 +9,7 @@ simulate_tree(
   statistic = c("size", "length"),
   offspring_dist,
   stat_max = Inf,
-  serials_dist,
+  gen_interval,
   t0 = 0,
   tf = Inf,
   ...
@@ -35,16 +35,15 @@ numbers).}
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
 
-\item{serials_dist}{The serial interval distribution function; the name of a
-user-defined named or anonymous function with only one argument, usually
-called \code{n}, that returns a numeric vector of \code{n} randomly sampled serial
-intervals. See details.}
+\item{gen_interval}{The generation interval function; the name
+of a user-defined named or anonymous function with only one argument \code{n},
+representing the number of generation intervals to generate. See details.}
 
-\item{t0}{Start time (if serial interval is given); either a single value
-or a vector of same length as \code{ntrees} (number of simulations) with
+\item{t0}{Start time (if generation interval is given); either a single value
+or a vector of same length as \code{nchains} (number of simulations) with
 initial times. Defaults to 0.}
 
-\item{tf}{End time (if serial interval is given).}
+\item{tf}{End time (if generation interval is given).}
 
 \item{...}{Parameters of the offspring distribution as required by R.}
 }
@@ -68,44 +67,27 @@ at time \eqn{t}, and \eqn{I_{0, i} = L_{0, i} = 1}.
 
 The distribution of secondary cases, \eqn{X_{t, i}} is modelled by the
 offspring distribution (\code{offspring_dist}).
-}
-
-\section{The serial interval (\code{serials_dist})}{
-\subsection{Assumptions/disambiguation}{
-
-In epidemiology, the generation interval is the duration between successive
-infectious events in a chain of transmission. Similarly, the serial
-interval is the duration between observed symptom onset times between
-successive cases in a transmission chain. The generation interval is
-often hard to observe because exact times of infection are hard to
-measure hence, the serial interval is often used instead . Here, we
-use the serial interval to represent what would normally be called the
-generation interval, that is, the time between successive cases.
-
-See References below for some literature on the subject.
-}
-
-\subsection{Specifying \code{serials_dist}}{
+\subsection{Specifying \code{gen_interval}}{
 
-\code{serials_dist} must be specified as a named or
+\code{gen_interval} must be specified as a named or
 \href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function}
 with one argument.
 
-For example, assuming we want to specify the serial interval
-distribution as a random log-normally distributed variable with
+For example, assuming we want to specify the generation interval
+as a random log-normally distributed variable with
 \code{meanlog = 0.58} and \code{sdlog = 1.58}, we could define a named function,
-let's call it "serial_interval", with only one argument representing the
-number of serial intervals to sample:
-\code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
-and assign the name of the function to \code{serials_dist} in
+let's call it "gen_interval", with only one argument representing the
+number of generation intervals to sample:
+\code{gen_interval_func <- function(n){rlnorm(n, 0.58, 1.38)}},
+and assign the name of the function to \code{gen_interval} in
 the simulation function, i.e.
-\code{`simulate_*`(..., serials_dist = serial_interval)},
+\code{`simulate_*`(..., gen_interval = gen_interval_func)},
 where \code{...} are the other arguments to \verb{simulate_*()} and * is a placeholder
 for the rest of simulation function's name.
 
-Alternatively, we could assign an anonymous function to \code{serials_dist}
+Alternatively, we could assign an anonymous function to \code{gen_interval}
 in the \verb{simulate_*()} call, i.e.
-\code{simulate_*(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})},
+\code{simulate_*(..., gen_interval = function(n){rlnorm(n, 0.58, 1.38)})},
 where \code{...} are the other arguments to \verb{simulate_*()}.
 }
 }
@@ -117,7 +99,7 @@ chains <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  serials_dist = function(n) rep(3, n),
+  gen_interval = function(x) 3,
   lambda = 2
 )
 }
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index f0a061aa..73eaf939 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -8,7 +8,7 @@ population}
 simulate_tree_from_pop(
   pop,
   offspring_dist = c("pois", "nbinom"),
-  serials_dist,
+  gen_interval,
   initial_immune = 0,
   t0 = 0,
   tf = Inf,
@@ -23,10 +23,9 @@ corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers). Only supports "pois" and "nbinom".}
 
-\item{serials_dist}{The serial interval distribution function; the name of a
-user-defined named or anonymous function with only one argument, usually
-called \code{n}, that returns a numeric vector of \code{n} randomly sampled serial
-intervals. See details.}
+\item{gen_interval}{The generation interval function; the name
+of a user-defined named or anonymous function with only one argument \code{n},
+representing the number of generation intervals to generate. See details.}
 
 \item{initial_immune}{The number of initial immunes in the population.
 Must be less than \code{pop} - 1.}
@@ -74,53 +73,13 @@ This is why \code{size} must be greater than 1.
 }
 }
 
-\section{The serial interval (\code{serials_dist})}{
-\subsection{Assumptions/disambiguation}{
-
-In epidemiology, the generation interval is the duration between successive
-infectious events in a chain of transmission. Similarly, the serial
-interval is the duration between observed symptom onset times between
-successive cases in a transmission chain. The generation interval is
-often hard to observe because exact times of infection are hard to
-measure hence, the serial interval is often used instead . Here, we
-use the serial interval to represent what would normally be called the
-generation interval, that is, the time between successive cases.
-
-See References below for some literature on the subject.
-}
-
-\subsection{Specifying \code{serials_dist}}{
-
-\code{serials_dist} must be specified as a named or
-\href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function}
-with one argument.
-
-For example, assuming we want to specify the serial interval
-distribution as a random log-normally distributed variable with
-\code{meanlog = 0.58} and \code{sdlog = 1.58}, we could define a named function,
-let's call it "serial_interval", with only one argument representing the
-number of serial intervals to sample:
-\code{serial_interval <- function(n){rlnorm(n, 0.58, 1.38)}},
-and assign the name of the function to \code{serials_dist} in
-the simulation function, i.e.
-\code{`simulate_*`(..., serials_dist = serial_interval)},
-where \code{...} are the other arguments to \verb{simulate_*()} and * is a placeholder
-for the rest of simulation function's name.
-
-Alternatively, we could assign an anonymous function to \code{serials_dist}
-in the \verb{simulate_*()} call, i.e.
-\code{simulate_*(..., serials_dist = function(n){rlnorm(n, 0.58, 1.38)})},
-where \code{...} are the other arguments to \verb{simulate_*()}.
-}
-}
-
 \examples{
 # Simulate with poisson offspring
 simulate_tree_from_pop(
   pop = 100,
   offspring_dist = "pois",
   lambda = 0.5,
-  serials_dist = function(n) rep(3, n)
+  gen_interval = function(x) 3
 )
 
 # Simulate with negative binomial offspring
@@ -128,7 +87,7 @@ simulate_tree_from_pop(
 pop = 100, offspring_dist = "nbinom",
 mu = 0.5,
 size = 1.1,
-serials_dist = function(n) rep(3, n)
+gen_interval = function(x) 3
 )
 }
 \seealso{
diff --git a/tests/testthat/test-checks.R b/tests/testthat/test-checks.R
index a9d4313b..7fd70bbb 100644
--- a/tests/testthat/test-checks.R
+++ b/tests/testthat/test-checks.R
@@ -8,7 +8,7 @@ test_that("Checks work", {
     "does not exist"
   )
   expect_error(
-    check_serial_valid("a"),
+    check_gen_interval_valid("a"),
     "must be a function"
   )
   expect_error(
diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index cb168ba3..a2517616 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -10,7 +10,7 @@ test_that("Simulators return epichains objects", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    serials_dist = serial_func
+    gen_interval = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -18,7 +18,7 @@ test_that("Simulators return epichains objects", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    serials_dist = serial_func
+    gen_interval = serial_func
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -33,7 +33,7 @@ test_that("Simulators return epichains objects", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(n) rep(3, n),
+    gen_interval = function(x) 3,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -73,7 +73,7 @@ test_that("print.epichains works for simulation functions", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    serials_dist = serial_func
+    gen_interval = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -81,7 +81,7 @@ test_that("print.epichains works for simulation functions", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    serials_dist = serial_func
+    gen_interval = serial_func
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -96,7 +96,7 @@ test_that("print.epichains works for simulation functions", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(n) rep(3, n),
+    gen_interval = function(x) 3,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -121,7 +121,7 @@ test_that("summary.epichains works as expected", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    serials_dist = serial_func
+    gen_interval = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -129,7 +129,7 @@ test_that("summary.epichains works as expected", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    serials_dist = serial_func
+    gen_interval = serial_func
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -144,7 +144,7 @@ test_that("summary.epichains works as expected", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(n) rep(3, n),
+    gen_interval = function(x) 3,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -227,7 +227,7 @@ test_that("validate_epichains works", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    serials_dist = serial_func
+    gen_interval = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -235,7 +235,7 @@ test_that("validate_epichains works", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    serials_dist = serial_func
+    gen_interval = serial_func
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -250,7 +250,7 @@ test_that("validate_epichains works", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(n) rep(3, n),
+    gen_interval = function(x) 3,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -289,7 +289,7 @@ test_that("is_chains_tree works", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    serials_dist = serial_func
+    gen_interval = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -297,7 +297,7 @@ test_that("is_chains_tree works", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    serials_dist = serial_func
+    gen_interval = serial_func
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -312,7 +312,7 @@ test_that("is_chains_tree works", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(n) rep(3, n),
+    gen_interval = function(x) 3,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -347,7 +347,7 @@ test_that("is_chains_summary works", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    serials_dist = serial_func
+    gen_interval = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -355,7 +355,7 @@ test_that("is_chains_summary works", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    serials_dist = serial_func
+    gen_interval = serial_func
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -370,7 +370,7 @@ test_that("is_chains_summary works", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(n) rep(3, n),
+    gen_interval = function(x) 3,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -406,7 +406,7 @@ test_that("aggregate.epichains method returns correct objects", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(n) rep(3, n),
+    gen_interval = function(x) 3,
     lambda = 2
   )
   #' Create aggregates
@@ -467,7 +467,7 @@ test_that("aggregate.epichains method is numerically correct", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(n) rep(3, n),
+    gen_interval = function(x) 3,
     lambda = 2
   )
   #' Create aggregates
@@ -496,7 +496,7 @@ test_that("head and tail print output as expected", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    serials_dist = serial_func
+    gen_interval = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -504,7 +504,7 @@ test_that("head and tail print output as expected", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    serials_dist = serial_func
+    gen_interval = serial_func
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -519,7 +519,7 @@ test_that("head and tail print output as expected", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(n) rep(3, n),
+    gen_interval = function(x) 3,
     lambda = 2
   )
   expect_snapshot(head(susc_outbreak_raw))
@@ -539,7 +539,7 @@ test_that("head and tail return data.frames", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    serials_dist = serial_func
+    gen_interval = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -547,7 +547,7 @@ test_that("head and tail return data.frames", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    serials_dist = serial_func
+    gen_interval = serial_func
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -562,7 +562,7 @@ test_that("head and tail return data.frames", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(n) rep(3, n),
+    gen_interval = function(x) 3,
     lambda = 2
   )
   #' Expectations
diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index 9cad2165..36d3f192 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -1,5 +1,5 @@
 #' Define global variables and options for simulations
-serial_func <- function(n) {
+gen_interval <- function(n) {
   rlnorm(n, meanlog = 0.58, sdlog = 1.58)
 }
 
@@ -10,7 +10,7 @@ test_that("Simulators work", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    serials_dist = serial_func
+    gen_interval = gen_interval
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -18,7 +18,7 @@ test_that("Simulators work", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    serials_dist = serial_func
+    gen_interval = gen_interval
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -33,7 +33,7 @@ test_that("Simulators work", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    serials_dist = function(n) rep(3, n),
+    gen_interval = function(x) 3,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -71,7 +71,7 @@ test_that("Simulators work", {
         statistic = "size",
         offspring_dist = "pois",
         stat_max = 10,
-        serials_dist = function(n) rep(3, n),
+        gen_interval = function(x) 3,
         lambda = 2,
         tf = 5
       )$time < 5
@@ -115,7 +115,7 @@ test_that("simulate_tree throws errors", {
       offspring_dist = "pois",
       statistic = "size",
       lambda = 0.9,
-      serials_dist = c(1, 2)
+      gen_interval = c(1, 2)
     ),
     "must be a function"
   )
@@ -187,7 +187,7 @@ test_that("simulate_tree_from_pop throws errors", {
       pop = 100,
       offspring_dist = "binom",
       offspring_mean = 0.5,
-      serials_dist = serial_func
+      gen_interval = gen_interval
     ),
     "should be one of"
   )
@@ -197,7 +197,7 @@ test_that("simulate_tree_from_pop throws errors", {
       offspring_dist = "nbinom",
       mu = 0.5,
       size = 0.9,
-      serials_dist = serial_func
+      gen_interval = gen_interval
     ),
     "> 1"
   )
@@ -207,7 +207,7 @@ test_that("simulate_tree_from_pop throws errors", {
       offspring_dist = p,
       offspring_mean = 0.5,
       offspring_disp = 0.9,
-      serials_dist = serial_func
+      gen_interval = gen_interval
     ),
     "not found"
   )
@@ -216,7 +216,7 @@ test_that("simulate_tree_from_pop throws errors", {
       pop = 100,
       offspring_dist = "nbinom",
       offspring_mean = 0.5,
-      serials_dist = serial_func
+      gen_interval = gen_interval
     ),
     "must be specified"
   )
@@ -330,7 +330,7 @@ test_that("simulate_tree_from_pop is numerically correct", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    serials_dist = serial_func
+    gen_interval = gen_interval
   )
   #' Summarise the results
   susc_outbreak_summary <- summary(susc_outbreak_raw)
diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 6d6c979f..25497c64 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -187,14 +187,14 @@ There are three simulation functions, herein referred to collectively as the `si
 ### [`simulate_tree()`](https://epiverse-trace.github.io/epichains/reference/simulate_tree.html) 
 
 `simulate_tree()` simulates an outbreak from a given number of infections.
-It retains and returns information on infectors (ancestors), infectees, the generation of infection, and the time, if a serial distribution is specified.
+It retains and returns information on infectors (ancestors), infectees, the generation of infection, and the time, if a generation interval distribution is specified.
 
 Let's look at an example where we simulate the transmission trees of $10$ initial infections/chains. We 
-assume a poisson offspring distribution with mean, $\text{lambda} = 0.9$, and a serial interval of $3$ days:
+assume a poisson offspring distribution with mean, $\text{lambda} = 0.9$, and a generation interval of $3$ days:
 ```{r}
 set.seed(123)
-# Define serial distribution
-serial_func <- function(x) {
+# Define generation interval
+gen_interval <- function(x) {
   return(3)
 }
 
@@ -203,7 +203,7 @@ sim_tree_eg <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  serials_dist = function(n) rep(3, n),
+  gen_interval = function(x) 3,
   lambda = 0.9
 )
 
@@ -237,19 +237,19 @@ simulate_summary_eg
 
 `simulate_tree_from_pop()` simulates outbreaks based on a specified population size and pre-existing immunity until the susceptible pool runs out.
   
-Here is a quick example where we simulate an outbreak in a population of size $1000$. We assume individuals have a poisson offspring distribution with mean, $\text{lambda} = 1$, and serial interval of $3$:
+Here is a quick example where we simulate an outbreak in a population of size $1000$. We assume individuals have a poisson offspring distribution with mean, $\text{lambda} = 1$, and generation interval of $3$:
 ```{r}
 set.seed(7)
-# Define serial distribution
-serial_func <- function(n) {
-  return(rep(3, n))
+# Define generation interval
+gen_interval <- function(x) {
+  return(3)
 }
 
 sim_tree_from_pop_eg <- simulate_tree_from_pop(
   pop = 1000,
   offspring_dist = "pois",
   lambda = 1,
-  serials_dist = serial_func
+  gen_interval = gen_interval
 )
 
 head(sim_tree_from_pop_eg)
@@ -263,9 +263,9 @@ You can run `summary()` on `<epichains>` objects to get useful summaries.
 ```{r include=TRUE,echo=TRUE}
 # Example with simulate_tree()
 set.seed(123)
-# Define serial distribution
-serial_func <- function(n) {
-  return(rep(3, n))
+# Define generation interval
+gen_interval <- function(x) {
+  return(3)
 }
 
 sim_tree_eg <- simulate_tree(
@@ -273,7 +273,7 @@ sim_tree_eg <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  serials_dist = serial_func,
+  gen_interval = gen_interval,
   lambda = 0.9
 )
 
@@ -298,14 +298,14 @@ summary(simulate_summary_eg)
 
 You can aggregate `<epichains>` objects returned by the `simulate_*()` functions into a time series, which is a `<data.frame>` with columns "cases"  and either "generation" or "time", depending on the value of `grouping_var`.
 
-To aggregate over "time", you must have specified a serial interval distribution in the simulation step.
+To aggregate over "time", you must have specified a generation interval distribution in the simulation step.
 ```{r include=TRUE,echo=TRUE}
 # Example with simulate_tree()
 set.seed(123)
 
-# Define serial distribution
-serial_func <- function(n) {
-  return(rep(3, n))
+# Define generation interval
+gen_interval <- function(x) {
+  return(3)
 }
 
 sim_tree_eg <- simulate_tree(
@@ -313,7 +313,7 @@ sim_tree_eg <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  serials_dist = serial_func,
+  gen_interval = gen_interval,
   lambda = 0.9
 )
 
@@ -328,9 +328,9 @@ Here is an end-to-end example from simulation through aggregation to plotting.
 ```{r}
 # Run simulation with simulate_tree()
 set.seed(123)
-# Define serial distribution
-serial_func <- function(n) {
-  return(rep(3, n))
+# Define generation interval
+gen_interval <- function(x) {
+  return(3)
 }
 
 sim_tree_eg <- simulate_tree(
@@ -338,7 +338,7 @@ sim_tree_eg <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  serials_dist = serial_func,
+  gen_interval = gen_interval,
   lambda = 0.9
 )
 
diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index b147ff72..282a0975 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -137,7 +137,7 @@ log_mean <- log((mu^2) / (sqrt(sgma^2 + mu^2)))  # log mean
 log_sd <- sqrt(log(1 + (sgma / mu)^2)) # log sd
 
 #' serial interval function
-serials_dist <- function(sample_size) {
+gen_interval <- function(sample_size) {
   si <- rlnorm(sample_size, meanlog = log_mean, sdlog = log_sd)
   return(si)
 }
@@ -204,9 +204,10 @@ stat_max <- 1000
 
 ## Modelling assumptions
 
-`simulate_tree()` makes the following simplifying assumptions:
+This exercise makes the following simplifying assumptions:
 
 1. All cases are observed.
+1. Cases are observed exactly at the time of infection.
 1. There is no reporting delay.
 1. Reporting rate is constant through the course of the epidemic.
 1. No interventions have been implemented.
@@ -233,7 +234,7 @@ sim_chain_sizes <- lapply(
       size = size,
       statistic = "size",
       stat_max = stat_max,
-      serials_dist = serials_dist,
+      gen_interval = gen_interval,
       t0 = t0,
       tf = tf
     ) %>%

From f47dbb5a7103fa2265e17c19995b06d1146cf661 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 30 Nov 2023 21:37:19 +0000
Subject: [PATCH 768/828] Rename gen_interval to generation_time

---
 R/checks.R                         | 10 +++---
 R/epichains.R                      |  2 +-
 R/simulate.r                       | 56 +++++++++++++++---------------
 man/aggregate.epichains.Rd         |  2 +-
 man/check_gen_interval_valid.Rd    | 17 ---------
 man/check_generation_time_valid.Rd | 17 +++++++++
 man/simulate_summary.Rd            | 18 +++++-----
 man/simulate_tree.Rd               | 24 ++++++-------
 man/simulate_tree_from_pop.Rd      |  8 ++---
 tests/testthat/test-checks.R       |  2 +-
 tests/testthat/test-epichains.R    | 52 +++++++++++++--------------
 tests/testthat/test-simulate.R     | 22 ++++++------
 vignettes/epichains.Rmd            | 20 +++++------
 vignettes/projecting_incidence.Rmd |  4 +--
 14 files changed, 127 insertions(+), 127 deletions(-)
 delete mode 100644 man/check_gen_interval_valid.Rd
 create mode 100644 man/check_generation_time_valid.Rd

diff --git a/R/checks.R b/R/checks.R
index 9a329fa0..b351c742 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -30,18 +30,18 @@ check_offspring_func_valid <- function(roffspring_name) {
 }
 
 
-#' Check if the gen_interval argument is specified as a function
+#' Check if the generation_time argument is specified as a function
 #'
-#' @param gen_interval The generation interval function; the name of a
+#' @param generation_time The generation interval function; the name of a
 #' user-defined named or anonymous function with only one argument `n`,
 #' representing the number of generation intervals to sample.
 #'
 #' @keywords internal
-check_gen_interval_valid <- function(gen_interval) {
-  if (!checkmate::test_function(gen_interval, nargs = 1)) {
+check_generation_time_valid <- function(generation_time) {
+  if (!checkmate::test_function(generation_time, nargs = 1)) {
     stop(sprintf(
       "%s %s",
-      "The `gen_interval` argument must be a function",
+      "The `generation_time` argument must be a function",
       "(see details in ?simulate_tree)."
     ))
   }
diff --git a/R/epichains.R b/R/epichains.R
index 93bbde3e..ad0de528 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -259,7 +259,7 @@ tail.epichains <- function(x, ...) {
 #'   statistic = "size",
 #'   offspring_dist = "pois",
 #'   stat_max = 10,
-#'   gen_interval = function(x) 3,
+#'   generation_time = function(x) 3,
 #'   lambda = 2
 #' )
 #' chains
diff --git a/R/simulate.r b/R/simulate.r
index ec096a70..f66437ef 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -15,7 +15,7 @@
 #' @param stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to this value.
 #' Defaults to `Inf`.
-#' @param gen_interval The generation interval function; the name
+#' @param generation_time The generation interval function; the name
 #' of a user-defined named or anonymous function with only one argument `n`,
 #' representing the number of generation intervals to generate. See details.
 #' @param t0 Start time (if generation interval is given); either a single value
@@ -43,27 +43,27 @@
 #' The distribution of secondary cases, \eqn{X_{t, i}} is modelled by the
 #' offspring distribution (`offspring_dist`).
 #'
-#' ## Specifying `gen_interval`
+#' ## Specifying `generation_time`
 #'
-#' `gen_interval` must be specified as a named or
+#' `generation_time` must be specified as a named or
 #' [anonymous/inline/unnamed function](https://en.wikipedia.org/wiki/Anonymous_function#R)
 #' with one argument.
 #'
-#' For example, assuming we want to specify the generation interval
+#' For example, assuming we want to specify the generation time
 #' as a random log-normally distributed variable with
 #' `meanlog = 0.58` and `sdlog = 1.58`, we could define a named function,
-#' let's call it "gen_interval", with only one argument representing the
+#' let's call it "generation_time_fn", with only one argument representing the
 #' number of generation intervals to sample:
-#' \code{gen_interval_func <- function(n){rlnorm(n, 0.58, 1.38)}},
-#' and assign the name of the function to `gen_interval` in
+#' \code{generation_time_fn <- function(n){rlnorm(n, 0.58, 1.38)}},
+#' and assign the name of the function to `generation_time` in
 #' the simulation function, i.e.
-#' \code{`simulate_*`(..., gen_interval = gen_interval_func)},
+#' \code{`simulate_*`(..., generation_time = generation_time_fn)},
 #' where `...` are the other arguments to `simulate_*()` and * is a placeholder
 #' for the rest of simulation function's name.
 #'
-#' Alternatively, we could assign an anonymous function to `gen_interval`
+#' Alternatively, we could assign an anonymous function to `generation_time`
 #' in the `simulate_*()` call, i.e.
-#' \code{simulate_*(..., gen_interval = function(n){rlnorm(n, 0.58, 1.38)})},
+#' \code{simulate_*(..., generation_time = function(n){rlnorm(n, 0.58, 1.38)})},
 #' where `...` are the other arguments to `simulate_*()`.
 #nolint end
 #' @seealso
@@ -78,7 +78,7 @@
 #'   statistic = "size",
 #'   offspring_dist = "pois",
 #'   stat_max = 10,
-#'   gen_interval = function(x) 3,
+#'   generation_time = function(x) 3,
 #'   lambda = 2
 #' )
 #' @references
@@ -97,7 +97,7 @@
 #' 1186–1204. \doi{https://doi.org/10.3390/ijerph7031204}
 simulate_tree <- function(ntrees, statistic = c("size", "length"),
                           offspring_dist, stat_max = Inf,
-                          gen_interval, t0 = 0,
+                          generation_time, t0 = 0,
                           tf = Inf, ...) {
   statistic <- match.arg(statistic)
 
@@ -116,8 +116,8 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
     stat_max, lower = 0
   )
 
-  if (!missing(gen_interval)) {
-    check_gen_interval_valid(gen_interval)
+  if (!missing(generation_time)) {
+    check_generation_time_valid(generation_time)
   }
   checkmate::assert_numeric(
     t0, lower = 0, finite = TRUE
@@ -129,10 +129,10 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
   # Gather offspring distribution parameters
   pars <- list(...)
 
-  if (!missing(gen_interval)) {
-    check_gen_interval_valid(gen_interval)
+  if (!missing(generation_time)) {
+    check_generation_time_valid(generation_time)
   } else if (!missing(tf)) {
-    stop("If `tf` is specified, `gen_interval` must be specified too.")
+    stop("If `tf` is specified, `generation_time` must be specified too.")
   }
 
   # Initialisations
@@ -150,7 +150,7 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
     generation = generation
   )
 
-  if (!missing(gen_interval)) {
+  if (!missing(generation_time)) {
     tree_df$time <- t0
     times <- tree_df$time
   }
@@ -208,8 +208,8 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
 
       # if a generation interval model/function was specified, use it
       # to generate generation intervals for the cases
-      if (!missing(gen_interval)) {
-        times <- rep(times, next_gen) + gen_interval(sum(n_offspring))
+      if (!missing(generation_time)) {
+        times <- rep(times, next_gen) + generation_time(sum(n_offspring))
         current_min_time <- unname(tapply(times, indices, min))
         new_df$time <- times
       }
@@ -220,11 +220,11 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
     ## the specified maximum size/length
     sim <- which(n_offspring > 0 & stat_track < stat_max)
     if (length(sim) > 0) {
-      if (!missing(gen_interval)) {
+      if (!missing(generation_time)) {
         ## only continue to simulate chains that don't go beyond tf
         sim <- intersect(sim, unique(indices)[current_min_time < tf])
       }
-      if (!missing(gen_interval)) {
+      if (!missing(generation_time)) {
         times <- times[indices %in% sim]
       }
       infector_ids <- ids[indices %in% sim]
@@ -396,7 +396,7 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
 #'   pop = 100,
 #'   offspring_dist = "pois",
 #'   lambda = 0.5,
-#'   gen_interval = function(x) 3
+#'   generation_time = function(x) 3
 #' )
 #'
 #' # Simulate with negative binomial offspring
@@ -404,12 +404,12 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
 #' pop = 100, offspring_dist = "nbinom",
 #' mu = 0.5,
 #' size = 1.1,
-#' gen_interval = function(x) 3
+#' generation_time = function(x) 3
 #' )
 #' @export
 simulate_tree_from_pop <- function(pop,
                                    offspring_dist = c("pois", "nbinom"),
-                                   gen_interval,
+                                   generation_time,
                                    initial_immune = 0,
                                    t0 = 0,
                                    tf = Inf,
@@ -421,8 +421,8 @@ simulate_tree_from_pop <- function(pop,
     pop, lower = 1, finite = TRUE
   )
   checkmate::assert_string(offspring_dist)
-  if (!missing(gen_interval)) {
-    check_gen_interval_valid(gen_interval)
+  if (!missing(generation_time)) {
+    check_generation_time_valid(generation_time)
   }
   checkmate::assert_number(
     initial_immune, lower = 0, upper = pop - 1
@@ -516,7 +516,7 @@ simulate_tree_from_pop <- function(pop,
     ## add to df
     if (n_offspring > 0) {
       ## draw generation times
-      new_times <- gen_interval(n_offspring)
+      new_times <- generation_time(n_offspring)
 
       if (any(new_times < 0)) {
         stop("Generation interval must be >= 0.")
diff --git a/man/aggregate.epichains.Rd b/man/aggregate.epichains.Rd
index 734ada1a..c6460ccc 100644
--- a/man/aggregate.epichains.Rd
+++ b/man/aggregate.epichains.Rd
@@ -29,7 +29,7 @@ chains <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  gen_interval = function(x) 3,
+  generation_time = function(x) 3,
   lambda = 2
 )
 chains
diff --git a/man/check_gen_interval_valid.Rd b/man/check_gen_interval_valid.Rd
deleted file mode 100644
index c9d0b376..00000000
--- a/man/check_gen_interval_valid.Rd
+++ /dev/null
@@ -1,17 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/checks.R
-\name{check_gen_interval_valid}
-\alias{check_gen_interval_valid}
-\title{Check if the gen_interval argument is specified as a function}
-\usage{
-check_gen_interval_valid(gen_interval)
-}
-\arguments{
-\item{gen_interval}{The generation interval function; the name of a
-user-defined named or anonymous function with only one argument \code{n},
-representing the number of generation intervals to sample.}
-}
-\description{
-Check if the gen_interval argument is specified as a function
-}
-\keyword{internal}
diff --git a/man/check_generation_time_valid.Rd b/man/check_generation_time_valid.Rd
new file mode 100644
index 00000000..4bc950e5
--- /dev/null
+++ b/man/check_generation_time_valid.Rd
@@ -0,0 +1,17 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/checks.R
+\name{check_generation_time_valid}
+\alias{check_generation_time_valid}
+\title{Check if the generation_time argument is specified as a function}
+\usage{
+check_generation_time_valid(generation_time)
+}
+\arguments{
+\item{generation_time}{The generation interval function; the name of a
+user-defined named or anonymous function with only one argument \code{n},
+representing the number of generation intervals to sample.}
+}
+\description{
+Check if the generation_time argument is specified as a function
+}
+\keyword{internal}
diff --git a/man/simulate_summary.Rd b/man/simulate_summary.Rd
index 0f868a53..6989ee22 100644
--- a/man/simulate_summary.Rd
+++ b/man/simulate_summary.Rd
@@ -48,27 +48,27 @@ at time \eqn{t}, and \eqn{I_{0, i} = L_{0, i} = 1}.
 
 The distribution of secondary cases, \eqn{X_{t, i}} is modelled by the
 offspring distribution (\code{offspring_dist}).
-\subsection{Specifying \code{gen_interval}}{
+\subsection{Specifying \code{generation_time}}{
 
-\code{gen_interval} must be specified as a named or
+\code{generation_time} must be specified as a named or
 \href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function}
 with one argument.
 
-For example, assuming we want to specify the generation interval
+For example, assuming we want to specify the generation time
 as a random log-normally distributed variable with
 \code{meanlog = 0.58} and \code{sdlog = 1.58}, we could define a named function,
-let's call it "gen_interval", with only one argument representing the
+let's call it "generation_time_fn", with only one argument representing the
 number of generation intervals to sample:
-\code{gen_interval_func <- function(n){rlnorm(n, 0.58, 1.38)}},
-and assign the name of the function to \code{gen_interval} in
+\code{generation_time_fn <- function(n){rlnorm(n, 0.58, 1.38)}},
+and assign the name of the function to \code{generation_time} in
 the simulation function, i.e.
-\code{`simulate_*`(..., gen_interval = gen_interval_func)},
+\code{`simulate_*`(..., generation_time = generation_time_fn)},
 where \code{...} are the other arguments to \verb{simulate_*()} and * is a placeholder
 for the rest of simulation function's name.
 
-Alternatively, we could assign an anonymous function to \code{gen_interval}
+Alternatively, we could assign an anonymous function to \code{generation_time}
 in the \verb{simulate_*()} call, i.e.
-\code{simulate_*(..., gen_interval = function(n){rlnorm(n, 0.58, 1.38)})},
+\code{simulate_*(..., generation_time = function(n){rlnorm(n, 0.58, 1.38)})},
 where \code{...} are the other arguments to \verb{simulate_*()}.
 }
 }
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index a1d4d933..3d973ef9 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -9,7 +9,7 @@ simulate_tree(
   statistic = c("size", "length"),
   offspring_dist,
   stat_max = Inf,
-  gen_interval,
+  generation_time,
   t0 = 0,
   tf = Inf,
   ...
@@ -35,7 +35,7 @@ numbers).}
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
 
-\item{gen_interval}{The generation interval function; the name
+\item{generation_time}{The generation interval function; the name
 of a user-defined named or anonymous function with only one argument \code{n},
 representing the number of generation intervals to generate. See details.}
 
@@ -67,27 +67,27 @@ at time \eqn{t}, and \eqn{I_{0, i} = L_{0, i} = 1}.
 
 The distribution of secondary cases, \eqn{X_{t, i}} is modelled by the
 offspring distribution (\code{offspring_dist}).
-\subsection{Specifying \code{gen_interval}}{
+\subsection{Specifying \code{generation_time}}{
 
-\code{gen_interval} must be specified as a named or
+\code{generation_time} must be specified as a named or
 \href{https://en.wikipedia.org/wiki/Anonymous_function#R}{anonymous/inline/unnamed function}
 with one argument.
 
-For example, assuming we want to specify the generation interval
+For example, assuming we want to specify the generation time
 as a random log-normally distributed variable with
 \code{meanlog = 0.58} and \code{sdlog = 1.58}, we could define a named function,
-let's call it "gen_interval", with only one argument representing the
+let's call it "generation_time_fn", with only one argument representing the
 number of generation intervals to sample:
-\code{gen_interval_func <- function(n){rlnorm(n, 0.58, 1.38)}},
-and assign the name of the function to \code{gen_interval} in
+\code{generation_time_fn <- function(n){rlnorm(n, 0.58, 1.38)}},
+and assign the name of the function to \code{generation_time} in
 the simulation function, i.e.
-\code{`simulate_*`(..., gen_interval = gen_interval_func)},
+\code{`simulate_*`(..., generation_time = generation_time_fn)},
 where \code{...} are the other arguments to \verb{simulate_*()} and * is a placeholder
 for the rest of simulation function's name.
 
-Alternatively, we could assign an anonymous function to \code{gen_interval}
+Alternatively, we could assign an anonymous function to \code{generation_time}
 in the \verb{simulate_*()} call, i.e.
-\code{simulate_*(..., gen_interval = function(n){rlnorm(n, 0.58, 1.38)})},
+\code{simulate_*(..., generation_time = function(n){rlnorm(n, 0.58, 1.38)})},
 where \code{...} are the other arguments to \verb{simulate_*()}.
 }
 }
@@ -99,7 +99,7 @@ chains <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  gen_interval = function(x) 3,
+  generation_time = function(x) 3,
   lambda = 2
 )
 }
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 73eaf939..54533057 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -8,7 +8,7 @@ population}
 simulate_tree_from_pop(
   pop,
   offspring_dist = c("pois", "nbinom"),
-  gen_interval,
+  generation_time,
   initial_immune = 0,
   t0 = 0,
   tf = Inf,
@@ -23,7 +23,7 @@ corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers). Only supports "pois" and "nbinom".}
 
-\item{gen_interval}{The generation interval function; the name
+\item{generation_time}{The generation interval function; the name
 of a user-defined named or anonymous function with only one argument \code{n},
 representing the number of generation intervals to generate. See details.}
 
@@ -79,7 +79,7 @@ simulate_tree_from_pop(
   pop = 100,
   offspring_dist = "pois",
   lambda = 0.5,
-  gen_interval = function(x) 3
+  generation_time = function(x) 3
 )
 
 # Simulate with negative binomial offspring
@@ -87,7 +87,7 @@ simulate_tree_from_pop(
 pop = 100, offspring_dist = "nbinom",
 mu = 0.5,
 size = 1.1,
-gen_interval = function(x) 3
+generation_time = function(x) 3
 )
 }
 \seealso{
diff --git a/tests/testthat/test-checks.R b/tests/testthat/test-checks.R
index 7fd70bbb..2409dd98 100644
--- a/tests/testthat/test-checks.R
+++ b/tests/testthat/test-checks.R
@@ -8,7 +8,7 @@ test_that("Checks work", {
     "does not exist"
   )
   expect_error(
-    check_gen_interval_valid("a"),
+    check_generation_time_valid("a"),
     "must be a function"
   )
   expect_error(
diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index a2517616..101876fe 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -10,7 +10,7 @@ test_that("Simulators return epichains objects", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    gen_interval = serial_func
+    generation_time = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -18,7 +18,7 @@ test_that("Simulators return epichains objects", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    gen_interval = serial_func
+    generation_time = serial_func
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -33,7 +33,7 @@ test_that("Simulators return epichains objects", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    gen_interval = function(x) 3,
+    generation_time = function(x) 3,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -73,7 +73,7 @@ test_that("print.epichains works for simulation functions", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    gen_interval = serial_func
+    generation_time = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -81,7 +81,7 @@ test_that("print.epichains works for simulation functions", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    gen_interval = serial_func
+    generation_time = serial_func
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -96,7 +96,7 @@ test_that("print.epichains works for simulation functions", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    gen_interval = function(x) 3,
+    generation_time = function(x) 3,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -121,7 +121,7 @@ test_that("summary.epichains works as expected", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    gen_interval = serial_func
+    generation_time = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -129,7 +129,7 @@ test_that("summary.epichains works as expected", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    gen_interval = serial_func
+    generation_time = serial_func
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -144,7 +144,7 @@ test_that("summary.epichains works as expected", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    gen_interval = function(x) 3,
+    generation_time = function(x) 3,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -227,7 +227,7 @@ test_that("validate_epichains works", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    gen_interval = serial_func
+    generation_time = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -235,7 +235,7 @@ test_that("validate_epichains works", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    gen_interval = serial_func
+    generation_time = serial_func
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -250,7 +250,7 @@ test_that("validate_epichains works", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    gen_interval = function(x) 3,
+    generation_time = function(x) 3,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -289,7 +289,7 @@ test_that("is_chains_tree works", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    gen_interval = serial_func
+    generation_time = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -297,7 +297,7 @@ test_that("is_chains_tree works", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    gen_interval = serial_func
+    generation_time = serial_func
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -312,7 +312,7 @@ test_that("is_chains_tree works", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    gen_interval = function(x) 3,
+    generation_time = function(x) 3,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -347,7 +347,7 @@ test_that("is_chains_summary works", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    gen_interval = serial_func
+    generation_time = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -355,7 +355,7 @@ test_that("is_chains_summary works", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    gen_interval = serial_func
+    generation_time = serial_func
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -370,7 +370,7 @@ test_that("is_chains_summary works", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    gen_interval = function(x) 3,
+    generation_time = function(x) 3,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -406,7 +406,7 @@ test_that("aggregate.epichains method returns correct objects", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    gen_interval = function(x) 3,
+    generation_time = function(x) 3,
     lambda = 2
   )
   #' Create aggregates
@@ -467,7 +467,7 @@ test_that("aggregate.epichains method is numerically correct", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    gen_interval = function(x) 3,
+    generation_time = function(x) 3,
     lambda = 2
   )
   #' Create aggregates
@@ -496,7 +496,7 @@ test_that("head and tail print output as expected", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    gen_interval = serial_func
+    generation_time = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -504,7 +504,7 @@ test_that("head and tail print output as expected", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    gen_interval = serial_func
+    generation_time = serial_func
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -519,7 +519,7 @@ test_that("head and tail print output as expected", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    gen_interval = function(x) 3,
+    generation_time = function(x) 3,
     lambda = 2
   )
   expect_snapshot(head(susc_outbreak_raw))
@@ -539,7 +539,7 @@ test_that("head and tail return data.frames", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    gen_interval = serial_func
+    generation_time = serial_func
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -547,7 +547,7 @@ test_that("head and tail return data.frames", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    gen_interval = serial_func
+    generation_time = serial_func
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -562,7 +562,7 @@ test_that("head and tail return data.frames", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    gen_interval = function(x) 3,
+    generation_time = function(x) 3,
     lambda = 2
   )
   #' Expectations
diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index 36d3f192..e542ce65 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -1,5 +1,5 @@
 #' Define global variables and options for simulations
-gen_interval <- function(n) {
+generation_time <- function(n) {
   rlnorm(n, meanlog = 0.58, sdlog = 1.58)
 }
 
@@ -10,7 +10,7 @@ test_that("Simulators work", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    gen_interval = gen_interval
+    generation_time = generation_time
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -18,7 +18,7 @@ test_that("Simulators work", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    gen_interval = gen_interval
+    generation_time = generation_time
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -33,7 +33,7 @@ test_that("Simulators work", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    gen_interval = function(x) 3,
+    generation_time = function(x) 3,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -71,7 +71,7 @@ test_that("Simulators work", {
         statistic = "size",
         offspring_dist = "pois",
         stat_max = 10,
-        gen_interval = function(x) 3,
+        generation_time = function(x) 3,
         lambda = 2,
         tf = 5
       )$time < 5
@@ -115,7 +115,7 @@ test_that("simulate_tree throws errors", {
       offspring_dist = "pois",
       statistic = "size",
       lambda = 0.9,
-      gen_interval = c(1, 2)
+      generation_time = c(1, 2)
     ),
     "must be a function"
   )
@@ -187,7 +187,7 @@ test_that("simulate_tree_from_pop throws errors", {
       pop = 100,
       offspring_dist = "binom",
       offspring_mean = 0.5,
-      gen_interval = gen_interval
+      generation_time = generation_time
     ),
     "should be one of"
   )
@@ -197,7 +197,7 @@ test_that("simulate_tree_from_pop throws errors", {
       offspring_dist = "nbinom",
       mu = 0.5,
       size = 0.9,
-      gen_interval = gen_interval
+      generation_time = generation_time
     ),
     "> 1"
   )
@@ -207,7 +207,7 @@ test_that("simulate_tree_from_pop throws errors", {
       offspring_dist = p,
       offspring_mean = 0.5,
       offspring_disp = 0.9,
-      gen_interval = gen_interval
+      generation_time = generation_time
     ),
     "not found"
   )
@@ -216,7 +216,7 @@ test_that("simulate_tree_from_pop throws errors", {
       pop = 100,
       offspring_dist = "nbinom",
       offspring_mean = 0.5,
-      gen_interval = gen_interval
+      generation_time = generation_time
     ),
     "must be specified"
   )
@@ -330,7 +330,7 @@ test_that("simulate_tree_from_pop is numerically correct", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    gen_interval = gen_interval
+    generation_time = generation_time
   )
   #' Summarise the results
   susc_outbreak_summary <- summary(susc_outbreak_raw)
diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 25497c64..a2f28d61 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -194,7 +194,7 @@ assume a poisson offspring distribution with mean, $\text{lambda} = 0.9$, and a
 ```{r}
 set.seed(123)
 # Define generation interval
-gen_interval <- function(x) {
+generation_time <- function(x) {
   return(3)
 }
 
@@ -203,7 +203,7 @@ sim_tree_eg <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  gen_interval = function(x) 3,
+  generation_time = function(x) 3,
   lambda = 0.9
 )
 
@@ -241,7 +241,7 @@ Here is a quick example where we simulate an outbreak in a population of size $1
 ```{r}
 set.seed(7)
 # Define generation interval
-gen_interval <- function(x) {
+generation_time <- function(x) {
   return(3)
 }
 
@@ -249,7 +249,7 @@ sim_tree_from_pop_eg <- simulate_tree_from_pop(
   pop = 1000,
   offspring_dist = "pois",
   lambda = 1,
-  gen_interval = gen_interval
+  generation_time = generation_time
 )
 
 head(sim_tree_from_pop_eg)
@@ -264,7 +264,7 @@ You can run `summary()` on `<epichains>` objects to get useful summaries.
 # Example with simulate_tree()
 set.seed(123)
 # Define generation interval
-gen_interval <- function(x) {
+generation_time <- function(x) {
   return(3)
 }
 
@@ -273,7 +273,7 @@ sim_tree_eg <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  gen_interval = gen_interval,
+  generation_time = generation_time,
   lambda = 0.9
 )
 
@@ -304,7 +304,7 @@ To aggregate over "time", you must have specified a generation interval distribu
 set.seed(123)
 
 # Define generation interval
-gen_interval <- function(x) {
+generation_time <- function(x) {
   return(3)
 }
 
@@ -313,7 +313,7 @@ sim_tree_eg <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  gen_interval = gen_interval,
+  generation_time = generation_time,
   lambda = 0.9
 )
 
@@ -329,7 +329,7 @@ Here is an end-to-end example from simulation through aggregation to plotting.
 # Run simulation with simulate_tree()
 set.seed(123)
 # Define generation interval
-gen_interval <- function(x) {
+generation_time <- function(x) {
   return(3)
 }
 
@@ -338,7 +338,7 @@ sim_tree_eg <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  gen_interval = gen_interval,
+  generation_time = generation_time,
   lambda = 0.9
 )
 
diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 282a0975..84babd11 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -137,7 +137,7 @@ log_mean <- log((mu^2) / (sqrt(sgma^2 + mu^2)))  # log mean
 log_sd <- sqrt(log(1 + (sgma / mu)^2)) # log sd
 
 #' serial interval function
-gen_interval <- function(sample_size) {
+generation_time <- function(sample_size) {
   si <- rlnorm(sample_size, meanlog = log_mean, sdlog = log_sd)
   return(si)
 }
@@ -234,7 +234,7 @@ sim_chain_sizes <- lapply(
       size = size,
       statistic = "size",
       stat_max = stat_max,
-      gen_interval = gen_interval,
+      generation_time = generation_time,
       t0 = t0,
       tf = tf
     ) %>%

From 0d6bd69e86ddabbaeba0afcac886a5aa703565a0 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 30 Nov 2023 22:02:45 +0000
Subject: [PATCH 769/828] Change serial interval to generation time in README

---
 README.Rmd | 6 ++++--
 1 file changed, 4 insertions(+), 2 deletions(-)

diff --git a/README.Rmd b/README.Rmd
index c98b8d6b..1328b9fd 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -70,7 +70,8 @@ _{{ packagename }}_ provides four main functions:
 * `simulate_tree()`: simulates transmission chains using an initial number of
 cases and information on the offspring distribution. This function returns
 an object with columns that track information on who infected whom, the
-generation of infection and, if a serial interval is given, the time of infection.
+generation of infection and, if a generation time function is specified, the
+time of infection.
 
 * `simulate_summary()`: simulates a vector of transmission chain sizes or
 lengths using an initial number of cases and information on the offspring
@@ -81,7 +82,8 @@ length.
 population size and information on the offspring distribution. You can also
 specify a given level of pre-existing immunity. This function returns
 an object with columns that track information on who infected whom, the
-generation of infection and, if a serial interval is given, the time of infection.
+generation of infection and, if a generation time function is given, the
+time of infection.
 
 * `likelihood()`: calculates the loglikelihood (or likelihood, depending
 on the value of `log`) of observing a vector of transmission chain sizes or

From 36f0ceef68bf2b0dccfc74ed120bb1c70fb28f2b Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 30 Nov 2023 22:03:54 +0000
Subject: [PATCH 770/828] Change generation interval to generation time in
 vignette for consistency

---
 vignettes/epichains.Rmd | 7 +++----
 1 file changed, 3 insertions(+), 4 deletions(-)

diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index a2f28d61..2fcfbf2f 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -187,10 +187,9 @@ There are three simulation functions, herein referred to collectively as the `si
 ### [`simulate_tree()`](https://epiverse-trace.github.io/epichains/reference/simulate_tree.html) 
 
 `simulate_tree()` simulates an outbreak from a given number of infections.
-It retains and returns information on infectors (ancestors), infectees, the generation of infection, and the time, if a generation interval distribution is specified.
+It retains and returns information on infectors (ancestors), infectees, the generation of infection, and the time, if a generation time function is specified.
 
-Let's look at an example where we simulate the transmission trees of $10$ initial infections/chains. We 
-assume a poisson offspring distribution with mean, $\text{lambda} = 0.9$, and a generation interval of $3$ days:
+Let's look at an example where we simulate the transmission trees of $10$ initial infections/chains. We assume a poisson offspring distribution with mean, $\text{lambda} = 0.9$, and a generation time of $3$ days:
 ```{r}
 set.seed(123)
 # Define generation interval
@@ -237,7 +236,7 @@ simulate_summary_eg
 
 `simulate_tree_from_pop()` simulates outbreaks based on a specified population size and pre-existing immunity until the susceptible pool runs out.
   
-Here is a quick example where we simulate an outbreak in a population of size $1000$. We assume individuals have a poisson offspring distribution with mean, $\text{lambda} = 1$, and generation interval of $3$:
+Here is a quick example where we simulate an outbreak in a population of size $1000$. We assume individuals have a poisson offspring distribution with mean, $\text{lambda} = 1$, and generation time of $3$:
 ```{r}
 set.seed(7)
 # Define generation interval

From b74cfb2490304b3f1622d6968c0324d72edd72b4 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 30 Nov 2023 22:04:32 +0000
Subject: [PATCH 771/828] Reword serial interval section to use generation time

---
 vignettes/projecting_incidence.Rmd | 46 +++++++++++++++++-------------
 1 file changed, 26 insertions(+), 20 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 84babd11..ae6c827e 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -92,23 +92,29 @@ t0 <- rep(days_since_index, seed_cases$cases)
 t0
 ```
 
-### Serial interval
+### Generation time
 
-Next, we will set up the serial interval distribution, that is, the time
-between successive onsets of cases in a transmission chain.
-
-The log-normal distribution is commonly used in epidemiology to characterise 
-quantities such as the serial interval because it has a large variance 
-and can only be positive-valued [@nishiura2007; @limpert2001]. 
+In epidemiology, the generation interval is the duration between successive
+infectious events in a chain of transmission. Similarly, the serial
+interval is the duration between observed symptom onset times between
+successive cases in a transmission chain. The generation interval is
+often hard to observe because exact times of infection are hard to
+measure hence, the serial interval is often used instead. Here, we
+use the serial interval to represent what would normally be called the
+generation interval, that is, the time between successive cases.
 
 In this example, we will assume based on COVID-19 literature that the 
 serial interval, S, is log-normal distributed with parameters, 
-$\mu = 4.7$ and $\sigma = 2.9$ [@pearson2020]. Note that when the distribution
-is described this way, it means $\mu$ and $\sigma$ are the expected value 
-and standard deviation of the natural logarithm of the serial interval. Hence, 
-in order to sample the "back-transformed" measured serial interval with 
-expectation/mean, $E[S]$ and standard deviation, $SD [S]$, 
-we can use the following parametrisation:
+$\mu = 4.7$ and $\sigma = 2.9$ [@pearson2020]. The log-normal distribution is
+commonly used in epidemiology to characterise quantities such as the serial
+interval because it has a large variance and can only be positive-valued
+[@nishiura2007; @limpert2001].
+
+Note that when the distribution is described this way, it means $\mu$ and
+$\sigma$ are the expected value and standard deviation of the natural
+logarithm of the serial interval. Hence, in order to sample the
+"back-transformed" measured serial interval with expectation/mean, $E[S]$
+and standard deviation, $SD [S]$, we can use the following parametrisation:
 
 \begin{align}
 E[S] &= \ln \left( \dfrac{\mu^2}{(\sqrt{\mu^2 + \sigma^2}} \right) \\
@@ -120,13 +126,13 @@ SD [S] &= \sqrt {\ln \left(1 + \dfrac{\sigma^2}{\mu^2} \right)}
 See ["log-normal_distribution" on Wikipedia](https://en.wikipedia.org/wiki/Log-normal_distribution) for a
 detailed explanation of this parametrisation.
 
-We will now set up the serial interval function with the appropriate inputs.
+We will now set up the generation time function with the appropriate inputs.
 We adopt R's random lognormal distribution generator (`rlnorm()`) that
 takes `meanlog` and `sdlog` as arguments, which we define with the
 parametrisation above as `log_mean()` and `log_sd()` respectively and wrap it in 
-the `serial_interval()` function. Moreover, `serial_interval()` takes one
-argument `sample_size` as is required by _epichains_ 
-(See `?epichains::simulate_tree`), which is further passed to `rlnorm()` as the 
+the `generation_time_fn()` function. Moreover, `generation_time_fn()` takes one
+argument `n` as is required by _epichains_ (See `?epichains::simulate_tree`),
+which is further passed to `rlnorm()` as the 
 first argument to determine the number of observations to sample
 (See `?rlnorm`).
 ```{r input_prep3, message=FALSE}
@@ -137,9 +143,9 @@ log_mean <- log((mu^2) / (sqrt(sgma^2 + mu^2)))  # log mean
 log_sd <- sqrt(log(1 + (sgma / mu)^2)) # log sd
 
 #' serial interval function
-generation_time <- function(sample_size) {
-  si <- rlnorm(sample_size, meanlog = log_mean, sdlog = log_sd)
-  return(si)
+generation_time <- function(n) {
+  gt <- rlnorm(n, meanlog = log_mean, sdlog = log_sd)
+  return(gt)
 }
 ```
 

From 78352460522a7094b5c4026153e39c89edc977f4 Mon Sep 17 00:00:00 2001
From: GitHub Action <action@github.com>
Date: Thu, 30 Nov 2023 22:09:34 +0000
Subject: [PATCH 772/828] Automatic readme update

---
 README.md | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/README.md b/README.md
index bb9a3d71..5dcd207d 100644
--- a/README.md
+++ b/README.md
@@ -69,8 +69,8 @@ library("epichains")
 - `simulate_tree()`: simulates transmission chains using an initial
   number of cases and information on the offspring distribution. This
   function returns an object with columns that track information on who
-  infected whom, the generation of infection and, if a serial interval
-  is given, the time of infection.
+  infected whom, the generation of infection and, if a generation time
+  function is specified, the time of infection.
 
 - `simulate_summary()`: simulates a vector of transmission chain sizes
   or lengths using an initial number of cases and information on the
@@ -81,8 +81,8 @@ library("epichains")
   initial population size and information on the offspring distribution.
   You can also specify a given level of pre-existing immunity. This
   function returns an object with columns that track information on who
-  infected whom, the generation of infection and, if a serial interval
-  is given, the time of infection.
+  infected whom, the generation of infection and, if a generation time
+  function is given, the time of infection.
 
 - `likelihood()`: calculates the loglikelihood (or likelihood, depending
   on the value of `log`) of observing a vector of transmission chain

From cdffac0d97a7d11892d8757c23f9ed8831d2c64f Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 30 Nov 2023 23:35:00 +0000
Subject: [PATCH 773/828] Rename generation time function

---
 tests/testthat/test-epichains.R | 56 ++++++++++++++++-----------------
 tests/testthat/test-simulate.R  | 20 ++++++------
 2 files changed, 38 insertions(+), 38 deletions(-)

diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index 101876fe..e331ea11 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -1,5 +1,5 @@
 #' Define global variables and options for simulations
-serial_func <- function(n) {
+generation_time_fn <- function(n) {
   rlnorm(n, meanlog = 0.58, sdlog = 1.58)
 }
 
@@ -10,7 +10,7 @@ test_that("Simulators return epichains objects", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    generation_time = serial_func
+    generation_time = generation_time_fn
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -18,7 +18,7 @@ test_that("Simulators return epichains objects", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    generation_time = serial_func
+    generation_time = generation_time_fn
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -33,7 +33,7 @@ test_that("Simulators return epichains objects", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    generation_time = function(x) 3,
+    generation_time = generation_time_fn,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -73,7 +73,7 @@ test_that("print.epichains works for simulation functions", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    generation_time = serial_func
+    generation_time = generation_time_fn
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -81,7 +81,7 @@ test_that("print.epichains works for simulation functions", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    generation_time = serial_func
+    generation_time = generation_time_fn
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -96,7 +96,7 @@ test_that("print.epichains works for simulation functions", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    generation_time = function(x) 3,
+    generation_time = generation_time_fn,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -121,7 +121,7 @@ test_that("summary.epichains works as expected", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    generation_time = serial_func
+    generation_time = generation_time_fn
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -129,7 +129,7 @@ test_that("summary.epichains works as expected", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    generation_time = serial_func
+    generation_time = generation_time_fn
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -144,7 +144,7 @@ test_that("summary.epichains works as expected", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    generation_time = function(x) 3,
+    generation_time = generation_time_fn,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -227,7 +227,7 @@ test_that("validate_epichains works", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    generation_time = serial_func
+    generation_time = generation_time_fn
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -235,7 +235,7 @@ test_that("validate_epichains works", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    generation_time = serial_func
+    generation_time = generation_time_fn
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -250,7 +250,7 @@ test_that("validate_epichains works", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    generation_time = function(x) 3,
+    generation_time = generation_time_fn,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -289,7 +289,7 @@ test_that("is_chains_tree works", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    generation_time = serial_func
+    generation_time = generation_time_fn
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -297,7 +297,7 @@ test_that("is_chains_tree works", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    generation_time = serial_func
+    generation_time = generation_time_fn
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -312,7 +312,7 @@ test_that("is_chains_tree works", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    generation_time = function(x) 3,
+    generation_time = generation_time_fn,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -347,7 +347,7 @@ test_that("is_chains_summary works", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    generation_time = serial_func
+    generation_time = generation_time_fn
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -355,7 +355,7 @@ test_that("is_chains_summary works", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    generation_time = serial_func
+    generation_time = generation_time_fn
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -370,7 +370,7 @@ test_that("is_chains_summary works", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    generation_time = function(x) 3,
+    generation_time = generation_time_fn,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -406,7 +406,7 @@ test_that("aggregate.epichains method returns correct objects", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    generation_time = function(x) 3,
+    generation_time = generation_time_fn,
     lambda = 2
   )
   #' Create aggregates
@@ -467,7 +467,7 @@ test_that("aggregate.epichains method is numerically correct", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    generation_time = function(x) 3,
+    generation_time = generation_time_fn,
     lambda = 2
   )
   #' Create aggregates
@@ -485,7 +485,7 @@ test_that("aggregate.epichains method is numerically correct", {
   )
   expect_identical(
     aggreg_by_time$cases,
-    c(10L, 17L, 38L, 38L, 12L)
+    as.integer(c(10, rep(1, 82)))
   )
 })
 
@@ -496,7 +496,7 @@ test_that("head and tail print output as expected", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    generation_time = serial_func
+    generation_time = generation_time_fn
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -504,7 +504,7 @@ test_that("head and tail print output as expected", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    generation_time = serial_func
+    generation_time = generation_time_fn
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -519,7 +519,7 @@ test_that("head and tail print output as expected", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    generation_time = function(x) 3,
+    generation_time = generation_time_fn,
     lambda = 2
   )
   expect_snapshot(head(susc_outbreak_raw))
@@ -539,7 +539,7 @@ test_that("head and tail return data.frames", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    generation_time = serial_func
+    generation_time = generation_time_fn
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -547,7 +547,7 @@ test_that("head and tail return data.frames", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    generation_time = serial_func
+    generation_time = generation_time_fn
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -562,7 +562,7 @@ test_that("head and tail return data.frames", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    generation_time = function(x) 3,
+    generation_time = generation_time_fn,
     lambda = 2
   )
   #' Expectations
diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index e542ce65..452bac43 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -1,5 +1,5 @@
 #' Define global variables and options for simulations
-generation_time <- function(n) {
+generation_time_fn <- function(n) {
   rlnorm(n, meanlog = 0.58, sdlog = 1.58)
 }
 
@@ -10,7 +10,7 @@ test_that("Simulators work", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    generation_time = generation_time
+    generation_time = generation_time_fn
   )
   #' Simulate an outbreak from a susceptible population (nbinom)
   susc_outbreak_raw2 <- simulate_tree_from_pop(
@@ -18,7 +18,7 @@ test_that("Simulators work", {
     offspring_dist = "nbinom",
     mu = 1,
     size = 1.1,
-    generation_time = generation_time
+    generation_time = generation_time_fn
   )
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(
@@ -33,7 +33,7 @@ test_that("Simulators work", {
     statistic = "size",
     offspring_dist = "pois",
     stat_max = 10,
-    generation_time = function(x) 3,
+    generation_time = generation_time_fn,
     lambda = 2
   )
   #' Simulate chain statistics
@@ -71,7 +71,7 @@ test_that("Simulators work", {
         statistic = "size",
         offspring_dist = "pois",
         stat_max = 10,
-        generation_time = function(x) 3,
+        generation_time = generation_time_fn,
         lambda = 2,
         tf = 5
       )$time < 5
@@ -187,7 +187,7 @@ test_that("simulate_tree_from_pop throws errors", {
       pop = 100,
       offspring_dist = "binom",
       offspring_mean = 0.5,
-      generation_time = generation_time
+      generation_time = generation_time_fn
     ),
     "should be one of"
   )
@@ -197,7 +197,7 @@ test_that("simulate_tree_from_pop throws errors", {
       offspring_dist = "nbinom",
       mu = 0.5,
       size = 0.9,
-      generation_time = generation_time
+      generation_time = generation_time_fn
     ),
     "> 1"
   )
@@ -207,7 +207,7 @@ test_that("simulate_tree_from_pop throws errors", {
       offspring_dist = p,
       offspring_mean = 0.5,
       offspring_disp = 0.9,
-      generation_time = generation_time
+      generation_time = generation_time_fn
     ),
     "not found"
   )
@@ -216,7 +216,7 @@ test_that("simulate_tree_from_pop throws errors", {
       pop = 100,
       offspring_dist = "nbinom",
       offspring_mean = 0.5,
-      generation_time = generation_time
+      generation_time = generation_time_fn
     ),
     "must be specified"
   )
@@ -330,7 +330,7 @@ test_that("simulate_tree_from_pop is numerically correct", {
     pop = 100,
     offspring_dist = "pois",
     lambda = 0.9,
-    generation_time = generation_time
+    generation_time = generation_time_fn
   )
   #' Summarise the results
   susc_outbreak_summary <- summary(susc_outbreak_raw)

From 102a3cdf73374453b6cd1debe7c887a9a60f12fc Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 30 Nov 2023 23:36:28 +0000
Subject: [PATCH 774/828] Define generation time function appropriately

---
 R/epichains.R                 |  2 +-
 R/simulate.r                  |  6 ++---
 man/aggregate.epichains.Rd    |  2 +-
 man/simulate_tree.Rd          |  2 +-
 man/simulate_tree_from_pop.Rd |  4 ++--
 vignettes/epichains.Rmd       | 45 +++++++++++++++++++----------------
 6 files changed, 33 insertions(+), 28 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index ad0de528..c98b5407 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -259,7 +259,7 @@ tail.epichains <- function(x, ...) {
 #'   statistic = "size",
 #'   offspring_dist = "pois",
 #'   stat_max = 10,
-#'   generation_time = function(x) 3,
+#'   generation_time = function(n) rep(3, n),
 #'   lambda = 2
 #' )
 #' chains
diff --git a/R/simulate.r b/R/simulate.r
index f66437ef..0bae243e 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -78,7 +78,7 @@
 #'   statistic = "size",
 #'   offspring_dist = "pois",
 #'   stat_max = 10,
-#'   generation_time = function(x) 3,
+#'   generation_time = function(n) rep(3, n),
 #'   lambda = 2
 #' )
 #' @references
@@ -396,7 +396,7 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
 #'   pop = 100,
 #'   offspring_dist = "pois",
 #'   lambda = 0.5,
-#'   generation_time = function(x) 3
+#'   generation_time = function(n) rep(3, n)
 #' )
 #'
 #' # Simulate with negative binomial offspring
@@ -404,7 +404,7 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
 #' pop = 100, offspring_dist = "nbinom",
 #' mu = 0.5,
 #' size = 1.1,
-#' generation_time = function(x) 3
+#' generation_time = function(n) rep(3, n)
 #' )
 #' @export
 simulate_tree_from_pop <- function(pop,
diff --git a/man/aggregate.epichains.Rd b/man/aggregate.epichains.Rd
index c6460ccc..14ea9593 100644
--- a/man/aggregate.epichains.Rd
+++ b/man/aggregate.epichains.Rd
@@ -29,7 +29,7 @@ chains <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  generation_time = function(x) 3,
+  generation_time = function(n) rep(3, n),
   lambda = 2
 )
 chains
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index 3d973ef9..54d21dc5 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -99,7 +99,7 @@ chains <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  generation_time = function(x) 3,
+  generation_time = function(n) rep(3, n),
   lambda = 2
 )
 }
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index 54533057..f592ac1a 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -79,7 +79,7 @@ simulate_tree_from_pop(
   pop = 100,
   offspring_dist = "pois",
   lambda = 0.5,
-  generation_time = function(x) 3
+  generation_time = function(n) rep(3, n)
 )
 
 # Simulate with negative binomial offspring
@@ -87,7 +87,7 @@ simulate_tree_from_pop(
 pop = 100, offspring_dist = "nbinom",
 mu = 0.5,
 size = 1.1,
-generation_time = function(x) 3
+generation_time = function(n) rep(3, n)
 )
 }
 \seealso{
diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 2fcfbf2f..cba7b612 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -192,9 +192,10 @@ It retains and returns information on infectors (ancestors), infectees, the gene
 Let's look at an example where we simulate the transmission trees of $10$ initial infections/chains. We assume a poisson offspring distribution with mean, $\text{lambda} = 0.9$, and a generation time of $3$ days:
 ```{r}
 set.seed(123)
-# Define generation interval
-generation_time <- function(x) {
-  return(3)
+# Define generation time
+generation_time_fn <- function(n) {
+  gt <- rep(3, n)
+  return(gt)
 }
 
 sim_tree_eg <- simulate_tree(
@@ -202,7 +203,7 @@ sim_tree_eg <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  generation_time = function(x) 3,
+  generation_time = generation_time_fn,
   lambda = 0.9
 )
 
@@ -239,16 +240,17 @@ simulate_summary_eg
 Here is a quick example where we simulate an outbreak in a population of size $1000$. We assume individuals have a poisson offspring distribution with mean, $\text{lambda} = 1$, and generation time of $3$:
 ```{r}
 set.seed(7)
-# Define generation interval
-generation_time <- function(x) {
-  return(3)
+# Define generation time
+generation_time_fn <- function(n) {
+  gt <- rep(3, n)
+  return(gt)
 }
 
 sim_tree_from_pop_eg <- simulate_tree_from_pop(
   pop = 1000,
   offspring_dist = "pois",
   lambda = 1,
-  generation_time = generation_time
+  generation_time = generation_time_fn
 )
 
 head(sim_tree_from_pop_eg)
@@ -262,9 +264,10 @@ You can run `summary()` on `<epichains>` objects to get useful summaries.
 ```{r include=TRUE,echo=TRUE}
 # Example with simulate_tree()
 set.seed(123)
-# Define generation interval
-generation_time <- function(x) {
-  return(3)
+# Define generation time
+generation_time_fn <- function(n) {
+  gt <- rep(3, n)
+  return(gt)
 }
 
 sim_tree_eg <- simulate_tree(
@@ -272,7 +275,7 @@ sim_tree_eg <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  generation_time = generation_time,
+  generation_time = generation_time_fn,
   lambda = 0.9
 )
 
@@ -302,9 +305,10 @@ To aggregate over "time", you must have specified a generation interval distribu
 # Example with simulate_tree()
 set.seed(123)
 
-# Define generation interval
-generation_time <- function(x) {
-  return(3)
+# Define generation time
+generation_time_fn <- function(n) {
+  gt <- rep(3, n)
+  return(gt)
 }
 
 sim_tree_eg <- simulate_tree(
@@ -312,7 +316,7 @@ sim_tree_eg <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  generation_time = generation_time,
+  generation_time = generation_time_fn,
   lambda = 0.9
 )
 
@@ -327,9 +331,10 @@ Here is an end-to-end example from simulation through aggregation to plotting.
 ```{r}
 # Run simulation with simulate_tree()
 set.seed(123)
-# Define generation interval
-generation_time <- function(x) {
-  return(3)
+# Define generation time
+generation_time_fn <- function(n) {
+  gt <- rep(3, n)
+  return(gt)
 }
 
 sim_tree_eg <- simulate_tree(
@@ -337,7 +342,7 @@ sim_tree_eg <- simulate_tree(
   statistic = "size",
   offspring_dist = "pois",
   stat_max = 10,
-  generation_time = generation_time,
+  generation_time = generation_time_fn,
   lambda = 0.9
 )
 

From 98efe73c4b01b4e38f41019b38d38cf123b4dd0d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 30 Nov 2023 23:37:02 +0000
Subject: [PATCH 775/828] Rename serial_dist function to generation_time

---
 R/checks.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/checks.R b/R/checks.R
index b351c742..dfedf17f 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -45,7 +45,7 @@ check_generation_time_valid <- function(generation_time) {
       "(see details in ?simulate_tree)."
     ))
   }
-  x <- serials_dist(10)
+  x <- generation_time(10)
   if (!checkmate::test_numeric(x, len = 10)) {
     stop(
       "The return values of `serials_dist` must be a numeric vector of length ",

From e2f38f71e49569d47ba0a5ab736c4713fc912fdd Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 30 Nov 2023 23:37:28 +0000
Subject: [PATCH 776/828] Use renamed generation_time check function

---
 tests/testthat/test-checks.R | 8 ++++++++
 1 file changed, 8 insertions(+)

diff --git a/tests/testthat/test-checks.R b/tests/testthat/test-checks.R
index 2409dd98..e3bc64b5 100644
--- a/tests/testthat/test-checks.R
+++ b/tests/testthat/test-checks.R
@@ -11,6 +11,14 @@ test_that("Checks work", {
     check_generation_time_valid("a"),
     "must be a function"
   )
+  expect_error(
+    check_generation_time_valid(function(x) rep("a", 10)),
+    "numeric"
+  )
+  expect_error(
+    check_generation_time_valid(function(x) 3),
+    "vector of length"
+  )
   expect_error(
     check_ntrees_valid(1.1),
     "less than"

From 91e341a11ace40d6acff2e97641ab8a98de3b76b Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 30 Nov 2023 23:37:50 +0000
Subject: [PATCH 777/828] Delete old serial_dist check function

---
 man/check_serial_valid.Rd | 18 ------------------
 1 file changed, 18 deletions(-)
 delete mode 100644 man/check_serial_valid.Rd

diff --git a/man/check_serial_valid.Rd b/man/check_serial_valid.Rd
deleted file mode 100644
index aec80683..00000000
--- a/man/check_serial_valid.Rd
+++ /dev/null
@@ -1,18 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/checks.R
-\name{check_serial_valid}
-\alias{check_serial_valid}
-\title{Check if the serials_dist argument is valid.}
-\usage{
-check_serial_valid(serials_dist)
-}
-\arguments{
-\item{serials_dist}{The serial interval distribution function; the name of a
-user-defined named or anonymous function with only one argument \code{n},
-representing the number of serial intervals to generate.}
-}
-\description{
-Check if the serials_dist argument is a function with one argument \code{n}
-and returns a numerical vector of length \code{n}.
-}
-\keyword{internal}

From 3229c0f1b82fc50b77a7afb6123d31922dd670a8 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 30 Nov 2023 23:38:14 +0000
Subject: [PATCH 778/828] Revise stop() message to use generation_time

---
 R/checks.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/checks.R b/R/checks.R
index dfedf17f..2c5c7a53 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -48,8 +48,8 @@ check_generation_time_valid <- function(generation_time) {
   x <- generation_time(10)
   if (!checkmate::test_numeric(x, len = 10)) {
     stop(
-      "The return values of `serials_dist` must be a numeric vector of length ",
-      "`n`."
+      "The return values of `generation_time`",
+      "must be a numeric vector of length `n`."
     )
   }
 }

From 2dff3e5ac8c5096454ca99fca4e03ed2d7fe5c3d Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Thu, 30 Nov 2023 23:38:22 +0000
Subject: [PATCH 779/828] Update snapshots

---
 tests/testthat/_snaps/epichains.md | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/tests/testthat/_snaps/epichains.md b/tests/testthat/_snaps/epichains.md
index 9894418f..f3e50c3d 100644
--- a/tests/testthat/_snaps/epichains.md
+++ b/tests/testthat/_snaps/epichains.md
@@ -117,12 +117,12 @@
     Output
       `epichains` object 
       
-      [1] 1 3
+      [1] 9 6
       
        Simulated chain lengths: 
       
-      Max: 3
-      Min: 1
+      Max: 9
+      Min: 6
 
 # head and tail print output as expected
 

From 00b0bd27087804daeeeab25f28f5ca9300474133 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 1 Dec 2023 15:38:14 +0000
Subject: [PATCH 780/828] Fix a test

---
 tests/testthat/test-simulate.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index 452bac43..f46c4119 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -209,7 +209,7 @@ test_that("simulate_tree_from_pop throws errors", {
       offspring_disp = 0.9,
       generation_time = generation_time_fn
     ),
-    "not found"
+    "'arg' must be NULL or a character vector"
   )
   expect_error(
     simulate_tree_from_pop(

From 3763e9b1fd8cba343bd43d90928ac288f7d450da Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 1 Dec 2023 15:38:29 +0000
Subject: [PATCH 781/828] Update snapshots

---
 tests/testthat/_snaps/epichains.md | 56 +++++++++++++++---------------
 1 file changed, 28 insertions(+), 28 deletions(-)

diff --git a/tests/testthat/_snaps/epichains.md b/tests/testthat/_snaps/epichains.md
index f3e50c3d..71ea0092 100644
--- a/tests/testthat/_snaps/epichains.md
+++ b/tests/testthat/_snaps/epichains.md
@@ -88,23 +88,23 @@
       
       < tree head (from first known infector) >
       
-         infectee_id sim_id infector_id generation time
-      11           1      2           1          2    3
-      12           2      2           1          2    3
-      13           3      2           1          2    3
-      14           4      2           1          2    3
-      15           5      2           1          2    3
-      16           6      2           1          2    3
+         infectee_id sim_id infector_id generation      time
+      11           1      2           1          2 2.6525084
+      12           2      2           1          2 0.2397245
+      13           4      2           1          2 0.9737101
+      14           5      2           1          2 0.2385887
+      15           6      2           1          2 1.7212668
+      16           7      2           1          2 1.3509058
       
       < tree tail >
       
-          infectee_id sim_id infector_id generation time
-      138          10     19           9          4    9
-      139           2     20           6          4    9
-      140           4     20           9          4    9
-      141           4     21           9          4    9
-      142           4     22           9          4    9
-      143           4     23           9          4    9
+          infectee_id sim_id infector_id generation      time
+      119           9     15           8          5 19.146936
+      120           2     16           8          4  2.941326
+      121           9     16           8          5 17.447014
+      122          10     16           9          4 17.017684
+      123           2     17           9          4  7.368167
+      124           2     18           9          4  7.931447
       Chains simulated: 10
       Number of infectors (known): 9
       Number of generations: 5
@@ -171,13 +171,13 @@
     Output
       < tree head (from first known infector) >
       
-         infectee_id sim_id infector_id generation time
-      11           1      2           1          2    3
-      12           2      2           1          2    3
-      13           3      2           1          2    3
-      14           4      2           1          2    3
-      15           5      2           1          2    3
-      16           6      2           1          2    3
+         infectee_id sim_id infector_id generation      time
+      11           1      2           1          2 2.6525084
+      12           2      2           1          2 0.2397245
+      13           4      2           1          2 0.9737101
+      14           5      2           1          2 0.2385887
+      15           6      2           1          2 1.7212668
+      16           7      2           1          2 1.3509058
 
 ---
 
@@ -230,11 +230,11 @@
       
       < tree tail >
       
-          infectee_id sim_id infector_id generation time
-      138          10     19           9          4    9
-      139           2     20           6          4    9
-      140           4     20           9          4    9
-      141           4     21           9          4    9
-      142           4     22           9          4    9
-      143           4     23           9          4    9
+          infectee_id sim_id infector_id generation      time
+      119           9     15           8          5 19.146936
+      120           2     16           8          4  2.941326
+      121           9     16           8          5 17.447014
+      122          10     16           9          4 17.017684
+      123           2     17           9          4  7.368167
+      124           2     18           9          4  7.931447
 

From bc6372232622bef9aa2854a6a35e573c978fc4fc Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Fri, 1 Dec 2023 16:06:38 +0000
Subject: [PATCH 782/828] Fix test

---
 tests/testthat/test-simulate.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index f46c4119..2decf4b0 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -204,12 +204,12 @@ test_that("simulate_tree_from_pop throws errors", {
   expect_error(
     simulate_tree_from_pop(
       pop = 100,
-      offspring_dist = p,
+      offspring_dist = "pp",
       offspring_mean = 0.5,
       offspring_disp = 0.9,
       generation_time = generation_time_fn
     ),
-    "'arg' must be NULL or a character vector"
+    "should be one of"
   )
   expect_error(
     simulate_tree_from_pop(

From 07fd2a5a018e7694df841b115be0f14a8eeafd2d Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 1 Dec 2023 22:18:39 +0000
Subject: [PATCH 783/828] Apply suggested revisions from code review

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 vignettes/epichains.Rmd            | 2 +-
 vignettes/projecting_incidence.Rmd | 6 +++---
 2 files changed, 4 insertions(+), 4 deletions(-)

diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index cba7b612..64882ff3 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -237,7 +237,7 @@ simulate_summary_eg
 
 `simulate_tree_from_pop()` simulates outbreaks based on a specified population size and pre-existing immunity until the susceptible pool runs out.
   
-Here is a quick example where we simulate an outbreak in a population of size $1000$. We assume individuals have a poisson offspring distribution with mean, $\text{lambda} = 1$, and generation time of $3$:
+Here is a quick example where we simulate an outbreak in a population of size $1000$. We assume individuals have a poisson offspring distribution with mean, $\text{lambda} = 1$, and fixed generation time of $3$:
 ```{r}
 set.seed(7)
 # Define generation time
diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index ae6c827e..65ce0bd2 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -94,14 +94,14 @@ t0
 
 ### Generation time
 
-In epidemiology, the generation interval is the duration between successive
+In epidemiology, the generation time (also called the generation interval) is the duration between successive
 infectious events in a chain of transmission. Similarly, the serial
 interval is the duration between observed symptom onset times between
 successive cases in a transmission chain. The generation interval is
 often hard to observe because exact times of infection are hard to
 measure hence, the serial interval is often used instead. Here, we
-use the serial interval to represent what would normally be called the
-generation interval, that is, the time between successive cases.
+use the serial interval and interpret the simulated case data to represent
+symptom onset.
 
 In this example, we will assume based on COVID-19 literature that the 
 serial interval, S, is log-normal distributed with parameters, 

From adb847a2ee29bb779b29baabce05afb393a0f029 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 1 Dec 2023 22:38:27 +0000
Subject: [PATCH 784/828] Rename generation interval to generation time for
 consistency

---
 R/simulate.r                       | 16 ++++++++--------
 man/check_generation_time_valid.Rd |  6 +++---
 man/simulate_summary.Rd            |  2 +-
 man/simulate_tree.Rd               | 10 +++++-----
 man/simulate_tree_from_pop.Rd      |  4 ++--
 vignettes/epichains.Rmd            |  2 +-
 6 files changed, 20 insertions(+), 20 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 0bae243e..d725dc3f 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -15,13 +15,13 @@
 #' @param stat_max A cut off for the chain statistic (size/length) being
 #' computed. Results above the specified value, are set to this value.
 #' Defaults to `Inf`.
-#' @param generation_time The generation interval function; the name
+#' @param generation_time The generation time function; the name
 #' of a user-defined named or anonymous function with only one argument `n`,
-#' representing the number of generation intervals to generate. See details.
-#' @param t0 Start time (if generation interval is given); either a single value
+#' representing the number of generation times to sample.
+#' @param t0 Start time (if generation time is given); either a single value
 #' or a vector of same length as `nchains` (number of simulations) with
 #' initial times. Defaults to 0.
-#' @param tf End time (if generation interval is given).
+#' @param tf End time (if generation time is given).
 #' @param ... Parameters of the offspring distribution as required by R.
 #' @return An `<epichains>` object, which is basically a `<data.frame>` with
 #' columns `infectee_id`, `sim_id` (a unique ID within each simulation
@@ -53,7 +53,7 @@
 #' as a random log-normally distributed variable with
 #' `meanlog = 0.58` and `sdlog = 1.58`, we could define a named function,
 #' let's call it "generation_time_fn", with only one argument representing the
-#' number of generation intervals to sample:
+#' number of generation times to sample:
 #' \code{generation_time_fn <- function(n){rlnorm(n, 0.58, 1.38)}},
 #' and assign the name of the function to `generation_time` in
 #' the simulation function, i.e.
@@ -206,8 +206,8 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
           generation = generation
         )
 
-      # if a generation interval model/function was specified, use it
-      # to generate generation intervals for the cases
+      # if a generation time model/function was specified, use it
+      # to generate generation times for the cases
       if (!missing(generation_time)) {
         times <- rep(times, next_gen) + generation_time(sum(n_offspring))
         current_min_time <- unname(tapply(times, indices, min))
@@ -519,7 +519,7 @@ simulate_tree_from_pop <- function(pop,
       new_times <- generation_time(n_offspring)
 
       if (any(new_times < 0)) {
-        stop("Generation interval must be >= 0.")
+        stop("Generation time must be >= 0.")
       }
 
       new_df <- data.frame(
diff --git a/man/check_generation_time_valid.Rd b/man/check_generation_time_valid.Rd
index 4bc950e5..4022766e 100644
--- a/man/check_generation_time_valid.Rd
+++ b/man/check_generation_time_valid.Rd
@@ -7,9 +7,9 @@
 check_generation_time_valid(generation_time)
 }
 \arguments{
-\item{generation_time}{The generation interval function; the name of a
-user-defined named or anonymous function with only one argument \code{n},
-representing the number of generation intervals to sample.}
+\item{generation_time}{The generation time function; the name
+of a user-defined named or anonymous function with only one argument \code{n},
+representing the number of generation times to sample.}
 }
 \description{
 Check if the generation_time argument is specified as a function
diff --git a/man/simulate_summary.Rd b/man/simulate_summary.Rd
index 6989ee22..2660a448 100644
--- a/man/simulate_summary.Rd
+++ b/man/simulate_summary.Rd
@@ -58,7 +58,7 @@ For example, assuming we want to specify the generation time
 as a random log-normally distributed variable with
 \code{meanlog = 0.58} and \code{sdlog = 1.58}, we could define a named function,
 let's call it "generation_time_fn", with only one argument representing the
-number of generation intervals to sample:
+number of generation times to sample:
 \code{generation_time_fn <- function(n){rlnorm(n, 0.58, 1.38)}},
 and assign the name of the function to \code{generation_time} in
 the simulation function, i.e.
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index 54d21dc5..2a8f4d1c 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -35,15 +35,15 @@ numbers).}
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
 
-\item{generation_time}{The generation interval function; the name
+\item{generation_time}{The generation time function; the name
 of a user-defined named or anonymous function with only one argument \code{n},
-representing the number of generation intervals to generate. See details.}
+representing the number of generation times to sample.}
 
-\item{t0}{Start time (if generation interval is given); either a single value
+\item{t0}{Start time (if generation time is given); either a single value
 or a vector of same length as \code{nchains} (number of simulations) with
 initial times. Defaults to 0.}
 
-\item{tf}{End time (if generation interval is given).}
+\item{tf}{End time (if generation time is given).}
 
 \item{...}{Parameters of the offspring distribution as required by R.}
 }
@@ -77,7 +77,7 @@ For example, assuming we want to specify the generation time
 as a random log-normally distributed variable with
 \code{meanlog = 0.58} and \code{sdlog = 1.58}, we could define a named function,
 let's call it "generation_time_fn", with only one argument representing the
-number of generation intervals to sample:
+number of generation times to sample:
 \code{generation_time_fn <- function(n){rlnorm(n, 0.58, 1.38)}},
 and assign the name of the function to \code{generation_time} in
 the simulation function, i.e.
diff --git a/man/simulate_tree_from_pop.Rd b/man/simulate_tree_from_pop.Rd
index f592ac1a..39393623 100644
--- a/man/simulate_tree_from_pop.Rd
+++ b/man/simulate_tree_from_pop.Rd
@@ -23,9 +23,9 @@ corresponding to the R distribution function (e.g., "pois" for Poisson,
 where \code{\link{rpois}} is the R function to generate Poisson random
 numbers). Only supports "pois" and "nbinom".}
 
-\item{generation_time}{The generation interval function; the name
+\item{generation_time}{The generation time function; the name
 of a user-defined named or anonymous function with only one argument \code{n},
-representing the number of generation intervals to generate. See details.}
+representing the number of generation times to sample.}
 
 \item{initial_immune}{The number of initial immunes in the population.
 Must be less than \code{pop} - 1.}
diff --git a/vignettes/epichains.Rmd b/vignettes/epichains.Rmd
index 64882ff3..fc9341d5 100644
--- a/vignettes/epichains.Rmd
+++ b/vignettes/epichains.Rmd
@@ -300,7 +300,7 @@ summary(simulate_summary_eg)
 
 You can aggregate `<epichains>` objects returned by the `simulate_*()` functions into a time series, which is a `<data.frame>` with columns "cases"  and either "generation" or "time", depending on the value of `grouping_var`.
 
-To aggregate over "time", you must have specified a generation interval distribution in the simulation step.
+To aggregate over "time", you must have specified a generation time distribution in the simulation step.
 ```{r include=TRUE,echo=TRUE}
 # Example with simulate_tree()
 set.seed(123)

From efcdc25e91e548731116e9edcd75726f1ab2df64 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 1 Dec 2023 22:38:39 +0000
Subject: [PATCH 785/828] Inherit the param

---
 R/checks.R | 4 +---
 1 file changed, 1 insertion(+), 3 deletions(-)

diff --git a/R/checks.R b/R/checks.R
index 2c5c7a53..8606bc3e 100644
--- a/R/checks.R
+++ b/R/checks.R
@@ -32,9 +32,7 @@ check_offspring_func_valid <- function(roffspring_name) {
 
 #' Check if the generation_time argument is specified as a function
 #'
-#' @param generation_time The generation interval function; the name of a
-#' user-defined named or anonymous function with only one argument `n`,
-#' representing the number of generation intervals to sample.
+#' @inheritParams simulate_tree
 #'
 #' @keywords internal
 check_generation_time_valid <- function(generation_time) {

From 30b68a02a9131273b624cafb80760f522faa79e1 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Fri, 1 Dec 2023 22:38:52 +0000
Subject: [PATCH 786/828] Break up long lines

---
 vignettes/projecting_incidence.Rmd | 10 +++++-----
 1 file changed, 5 insertions(+), 5 deletions(-)

diff --git a/vignettes/projecting_incidence.Rmd b/vignettes/projecting_incidence.Rmd
index 65ce0bd2..9058cccd 100644
--- a/vignettes/projecting_incidence.Rmd
+++ b/vignettes/projecting_incidence.Rmd
@@ -94,11 +94,11 @@ t0
 
 ### Generation time
 
-In epidemiology, the generation time (also called the generation interval) is the duration between successive
-infectious events in a chain of transmission. Similarly, the serial
-interval is the duration between observed symptom onset times between
-successive cases in a transmission chain. The generation interval is
-often hard to observe because exact times of infection are hard to
+In epidemiology, the generation time (also called the generation interval) is
+the duration between successive infectious events in a chain of transmission.
+Similarly, the serial interval is the duration between observed symptom onset
+times between successive cases in a transmission chain. The generation
+interval is often hard to observe because exact times of infection are hard to
 measure hence, the serial interval is often used instead. Here, we
 use the serial interval and interpret the simulated case data to represent
 symptom onset.

From bc83b67bdc1e2377cc057995e57a1f47be3ece67 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 10 Oct 2023 17:49:15 +0100
Subject: [PATCH 787/828] Use helper functions to create objects

---
 R/simulate.r | 51 ++++++++++++++++++++++++++++-----------------------
 1 file changed, 28 insertions(+), 23 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index d725dc3f..fb96c085 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -235,17 +235,18 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
     tree_df <- tree_df[tree_df$time < tf, ]
   }
 
-  # sort by sim_id and infector
-  tree_df <- tree_df[order(tree_df$sim_id, tree_df$infector_id), ]
-  row.names(tree_df) <- NULL
-  structure(
-    tree_df,
-    chains = ntrees,
-    chain_type = "chains_tree",
-    rownames = NULL,
-    track_pop = FALSE,
-    class = c("epichains", "data.frame")
+  # sort by sim_id and ancestor
+  tree_df <- tree_df[order(tree_df$sim_id, tree_df$ancestor), ]
+
+  out <- epichains_tree(
+    tree_df = tree_df,
+    chains_run = nchains,
+    statistic = statistic,
+    stat_max = stat_max,
+    intvn_mean_reduction = intvn_mean_reduction,
+    track_pop = FALSE
   )
+  return(out)
 }
 
 
@@ -335,13 +336,15 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
 
   stat_track[stat_track >= stat_max] <- Inf
 
-  structure(
-    stat_track,
-    chain_type = "chains_summary",
+  out <- epichains_summary(
+    chains_summary = stat_track,
+    chains_run = nchains,
     statistic = statistic,
-    chains = ntrees,
-    class = c("epichains", class(stat_track))
-  )
+    stat_max = stat_max,
+    )
+
+  return(out)
+    intvn_mean_reduction = intvn_mean_reduction
 }
 
 #' Simulate transmission trees from a susceptible or partially immune
@@ -545,12 +548,14 @@ simulate_tree_from_pop <- function(pop,
   # sort by sim_id and infector
   tree_df <- tree_df[order(tree_df$sim_id, tree_df$infector_id), ]
   tree_df$offspring_generated <- NULL
-  row.names(tree_df) <- NULL
-  structure(
+
+  out <- epichains_tree(
     tree_df,
-    chain_type = "chains_tree",
-    rownames = NULL,
-    track_pop = TRUE,
-    class = c("epichains", "data.frame")
-  )
+    chains_run = NULL,
+    statistic = NULL,
+    stat_max = NULL,
+    intvn_mean_reduction = intvn_mean_reduction,
+    track_pop = TRUE
+    )
+  return(out)
 }

From e28db9234535f5c866bcc917cf258d21586c643c Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 10 Oct 2023 17:50:22 +0100
Subject: [PATCH 788/828] Remove epichains_aggregate_df class

---
 R/epichains.R | 8 +-------
 1 file changed, 1 insertion(+), 7 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index c98b5407..ee74bed6 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -312,11 +312,5 @@ aggregate.epichains <- function(x,
     )
   }
 
-  structure(
-    out,
-    class = c("epichains_aggregate_df", "data.frame"),
-    chain_type = attributes(x)$chain_type,
-    rownames = NULL,
-    aggregated_over = grouping_var
-  )
+  return(out)
 }

From feccb25ac0045226487e2db7cf463484ab24fb38 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 10 Oct 2023 17:51:21 +0100
Subject: [PATCH 789/828] Use new validation function

---
 R/epichains.R | 9 +--------
 1 file changed, 1 insertion(+), 8 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index ee74bed6..743fc9f6 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -277,14 +277,7 @@ aggregate.epichains <- function(x,
                                   "generation"
                                 ),
                                 ...) {
-  validate_epichains(x)
-  # Check that the object is of type "chains_tree"
-  if (!is_chains_tree(x)) {
-    stop(
-      "object must be an epichains object with 'chains_tree' attribute, ",
-      "which can be generated using the `simulate_tree()` function."
-    )
-  }
+  validate_epichains_tree(x)
 
   # Get grouping variable
   grouping_var <- match.arg(grouping_var)

From a28621cf19a97dfdbd5abe100e52a633f40e8296 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 10 Oct 2023 17:52:30 +0100
Subject: [PATCH 790/828] Clean up documentation of aggregate method

---
 R/epichains.R              | 16 +++++++++-------
 man/aggregate.epichains.Rd | 13 ++++++-------
 2 files changed, 15 insertions(+), 14 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 743fc9f6..be0c3818 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -240,18 +240,20 @@ tail.epichains <- function(x, ...) {
   utils::tail(as.data.frame(x), ...)
 }
 
-#' Aggregate cases in `<epichains>` objects by "time" or "generation"
+#' Aggregate cases in `<epichains_tree>` objects by "time" or "generation"
 #'
-#' @param x An `<epichains>` object.
-#' @param grouping_var The variable to group and count over. Options include
+#' @description
+#' This function provides a quick way to create a time series of cases over
+#' time or generation from simulated `<epichains_tree>` objects.
+#'
+#' @param x An `<epichains_tree>` object.
+#' @param grouping_var The variable to aggregate by. Options include
 #' "time" and "generation".
 #' @param ... Other arguments passed to aggregate.
 #' @importFrom stats aggregate
-#' @return An `<epichains_aggregate_df>` object, which is basically a
-#' `<data.frame>`. The object stores the `chain_type = chains_tree` and
-#' `grouping_var` attributes.
-#' @export
+#' @return A `<data.frame>` object of cases by `grouping_var`.
 #' @author James M. Azam
+#' @export
 #' @examples
 #' set.seed(123)
 #' chains <- simulate_tree(
diff --git a/man/aggregate.epichains.Rd b/man/aggregate.epichains.Rd
index 14ea9593..77f9cd89 100644
--- a/man/aggregate.epichains.Rd
+++ b/man/aggregate.epichains.Rd
@@ -2,25 +2,24 @@
 % Please edit documentation in R/epichains.R
 \name{aggregate.epichains}
 \alias{aggregate.epichains}
-\title{Aggregate cases in \verb{<epichains>} objects by "time" or "generation"}
+\title{Aggregate cases in \verb{<epichains_tree>} objects by "time" or "generation"}
 \usage{
 \method{aggregate}{epichains}(x, grouping_var = c("time", "generation"), ...)
 }
 \arguments{
-\item{x}{An \verb{<epichains>} object.}
+\item{x}{An \verb{<epichains_tree>} object.}
 
-\item{grouping_var}{The variable to group and count over. Options include
+\item{grouping_var}{The variable to aggregate by. Options include
 "time" and "generation".}
 
 \item{...}{Other arguments passed to aggregate.}
 }
 \value{
-An \verb{<epichains_aggregate_df>} object, which is basically a
-\verb{<data.frame>}. The object stores the \code{chain_type = chains_tree} and
-\code{grouping_var} attributes.
+A \verb{<data.frame>} object of cases by \code{grouping_var}.
 }
 \description{
-Aggregate cases in \verb{<epichains>} objects by "time" or "generation"
+This function provides a quick way to create a time series of cases over
+time or generation from simulated \verb{<epichains_tree>} objects.
 }
 \examples{
 set.seed(123)

From 0ac382de58645e0bc764be4d9063deef25561a81 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 11 Oct 2023 12:49:36 +0100
Subject: [PATCH 791/828] Condense head and tail methods for new class

---
 R/epichains.R              | 37 +++++++++++++------------------------
 man/head.epichains.Rd      | 28 ----------------------------
 man/head.epichains_tree.Rd | 33 +++++++++++++++++++++++++++++++++
 3 files changed, 46 insertions(+), 52 deletions(-)
 delete mode 100644 man/head.epichains.Rd
 create mode 100644 man/head.epichains_tree.Rd

diff --git a/R/epichains.R b/R/epichains.R
index be0c3818..e6495890 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -201,42 +201,31 @@ is_chains_summary <- function(x) {
     attributes(x)$chain_type == "chains_summary"
 }
 
-
-#' `head` method for [`epichains`] class
+#' `head` and `tail` method for `<epichains_tree>` class
 #'
-#' @param x An [`epichains`] object
+#' @param x An `<epichains_tree>` object
 #' @param ... further arguments passed to or from other methods
 #' @importFrom utils head
-#' @return object of class `data.frame`
+#' @importFrom utils tail
+#' @return Object of class `data.frame`
 #' @author James M. Azam
 #' @export
 #' @details
-#' This returns the top rows of an `epichains` object. Note that the object
-#' is originally sorted by `sim_id` and `infector_id` and the first
+#' This returns the top rows of an `<epichains_tree>` object. Note that
+#' the object is originally sorted by `sim_id` and `ancestor` and the first
 #' unknown ancestors (NA) have been dropped from
-#' printing method. To view the full output, use `as.data.frame(<object_name>)`.
+#' printing method.
 #'
-head.epichains <- function(x, ...) {
-  writeLines("< tree head (from first known infector) >\n")
-  # print head of the simulation output from the first known infector
-  x <- x[!is.na(x$infector_id), ]
+#' To view the full output, use `as.data.frame(<object_name>)`.
+head.epichains_tree <- function(x, ...) {
+  # print head of the simulation output from the first known ancestor
+  x <- x[!is.na(x$ancestor), ]
   utils::head(as.data.frame(x), ...)
 }
 
-#' `tail` method for [`epichains`] class
-#'
-#' @param x An [`epichains`] object
-#' @param ... further arguments passed to or from other methods
-#' @importFrom utils tail
-#' @author James M. Azam
+#' @rdname head.epichains_tree
 #' @export
-#' @details
-#' This returns the bottom part of an `epichains` object. Note that the object
-#' is originally sorted by `sim_id` and `infector_id` and the first
-#' unknown ancestors (NA) have been dropped from
-#' printing method. To view the full output, use `as.data.frame(<object_name>)`.
-tail.epichains <- function(x, ...) {
-  writeLines("\n< tree tail >\n")
+tail.epichains_tree <- function(x, ...) {
   utils::tail(as.data.frame(x), ...)
 }
 
diff --git a/man/head.epichains.Rd b/man/head.epichains.Rd
deleted file mode 100644
index d9dd5a14..00000000
--- a/man/head.epichains.Rd
+++ /dev/null
@@ -1,28 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/epichains.R
-\name{head.epichains}
-\alias{head.epichains}
-\title{\code{head} method for \code{\link{epichains}} class}
-\usage{
-\method{head}{epichains}(x, ...)
-}
-\arguments{
-\item{x}{An \code{\link{epichains}} object}
-
-\item{...}{further arguments passed to or from other methods}
-}
-\value{
-object of class \code{data.frame}
-}
-\description{
-\code{head} method for \code{\link{epichains}} class
-}
-\details{
-This returns the top rows of an \code{epichains} object. Note that the object
-is originally sorted by \code{sim_id} and \code{infector_id} and the first
-unknown ancestors (NA) have been dropped from
-printing method. To view the full output, use \verb{as.data.frame(<object_name>)}.
-}
-\author{
-James M. Azam
-}
diff --git a/man/head.epichains_tree.Rd b/man/head.epichains_tree.Rd
new file mode 100644
index 00000000..b758772f
--- /dev/null
+++ b/man/head.epichains_tree.Rd
@@ -0,0 +1,33 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{head.epichains_tree}
+\alias{head.epichains_tree}
+\alias{tail.epichains_tree}
+\title{\code{head} and \code{tail} method for \verb{<epichains_tree>} class}
+\usage{
+\method{head}{epichains_tree}(x, ...)
+
+\method{tail}{epichains_tree}(x, ...)
+}
+\arguments{
+\item{x}{An \verb{<epichains_tree>} object}
+
+\item{...}{further arguments passed to or from other methods}
+}
+\value{
+Object of class \code{data.frame}
+}
+\description{
+\code{head} and \code{tail} method for \verb{<epichains_tree>} class
+}
+\details{
+This returns the top rows of an \verb{<epichains_tree>} object. Note that
+the object is originally sorted by \code{sim_id} and \code{ancestor} and the first
+unknown ancestors (NA) have been dropped from
+printing method.
+
+To view the full output, use \verb{as.data.frame(<object_name>)}.
+}
+\author{
+James M. Azam
+}

From 9cdb8536fe52080e21cacfa616c0ca15112f267b Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 11 Oct 2023 12:51:55 +0100
Subject: [PATCH 792/828] Add constructor and helper for epichains_tree class

---
 R/epichains.R             | 92 ++++++++++++++++++++++++++++++++++++---
 man/epichains_tree.Rd     | 62 ++++++++++++++++++++++++++
 man/new_epichains_tree.Rd | 56 ++++++++++++++++++++++++
 3 files changed, 205 insertions(+), 5 deletions(-)
 create mode 100644 man/epichains_tree.Rd
 create mode 100644 man/new_epichains_tree.Rd

diff --git a/R/epichains.R b/R/epichains.R
index e6495890..3b2a3987 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -1,11 +1,93 @@
-#' Print an [`epichains`] object
+#' Construct a `<epichains_tree>` object
 #'
-#' @param x An [`epichains`] object.
-#' @param ... Other parameters passed to [print()].
-#' @return Invisibly returns an [`epichains`]. Called for side-effects.
+#' @description
+#' `new_epichains_tree()` constructs an `<epichains_tree>` object from a
+#' supplied `<data.frame>` and extra attributes passed as individual arguments.
+#' It is meant to be lazy and performant, by creating the object without
+#' checking the arguments for correctness. It is not safe to call
+#' `new_epichains_tree()` on its own as is called within `epichains_tree()`
+#' after the arguments have been checked. To create an `<epichains_tree>`
+#' object, use `epichains_tree()`.
+#' @param tree_df a `<data.frame>` containing at least columns for "chain_id",
+#' "ancestor", and "generation". Also has optional columns for "time", and
+#' "chain_id".
+#' @param chains_run Number of chains/cases used to generate the outbreak;
+#' Integer
+#' @param track_pop Was the susceptible population tracked; Logical
+#' @inheritParams epichains_tree
+#' @author James M. Azam
+#' @keywords internal
+new_epichains_tree <- function(tree_df = data.frame(),
+                               chains_run = integer(),
+                               statistic = character(),
+                               stat_max = double(),
+                               intvn_mean_reduction = double(),
+                               track_pop = logical()
+                               ) {
+  # Assemble the elements of the object
+  obj <- structure(
+    tree_df,
+    chains_run = chains_run,
+    statistic = statistic,
+    stat_max = stat_max,
+    intvn_mean_reduction = intvn_mean_reduction,
+    track_pop = track_pop,
+    class = c("epichains_tree", "data.frame")
+  )
+  return(obj)
+}
+
+#' Create an `<epichains_tree>` object
+#'
+#' @description
+#' `epichains_tree()` constructs an `<epichains_tree>` object, which is
+#' inherently an `<data.frame>` object that stores some of the inputs
+#' passed to the `simulate_tree()` and `simulate_tree_from_pop()` and the
+#' simulated output. The stored attributes are useful for scenario
+#' analyses where the inputs are required for downstream analyses.
+#'
+#' An `<epichains_tree>` object contains a `<data.frame>` of the simulated
+#' outbreak with ids for each case/chain and the chain the produced, the
+#' number of cases/chains used for the simulation, the statistic that was
+#' tracked, the intervention level, and whether the susceptible population was
+#' tracked.
+#'
+#' @inheritParams simulate_tree
+#' @inheritParams new_epichains_tree
+#'
+#' @return An `<epichains_tree>` object
 #' @author James M. Azam
 #' @export
-print.epichains <- function(x, ...) {
+epichains_tree <- function(tree_df = data.frame(),
+                           chains_run = integer(),
+                           statistic = character(),
+                           stat_max = double(),
+                           intvn_mean_reduction = double(),
+                           track_pop = logical()
+                           ) {
+  # Check that inputs are well specified
+  checkmate::assert_data_frame(tree_df)
+  checkmate::assert_integerish(chains_run, null.ok = TRUE)
+  checkmate::assert_character(statistic, null.ok = TRUE)
+  checkmate::assert_integerish(stat_max, null.ok = TRUE)
+  checkmate::assert_double(intvn_mean_reduction)
+  checkmate::assert_logical(track_pop)
+
+  # Create <epichains_tree> object
+  epichains_tree <- new_epichains_tree(
+    tree_df = tree_df,
+    chains_run = chains_run,
+    statistic = statistic,
+    stat_max = stat_max,
+    intvn_mean_reduction = intvn_mean_reduction,
+    track_pop = track_pop
+    )
+
+  # Validate the created object
+  validate_epichains_tree(epichains_tree)
+
+  return(epichains_tree)
+}
   format(x, ...)
 }
 
diff --git a/man/epichains_tree.Rd b/man/epichains_tree.Rd
new file mode 100644
index 00000000..68a04db6
--- /dev/null
+++ b/man/epichains_tree.Rd
@@ -0,0 +1,62 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{epichains_tree}
+\alias{epichains_tree}
+\title{Create an \verb{<epichains_tree>} object}
+\usage{
+epichains_tree(
+  tree_df = data.frame(),
+  chains_run = integer(),
+  statistic = character(),
+  stat_max = double(),
+  intvn_mean_reduction = double(),
+  track_pop = logical()
+)
+}
+\arguments{
+\item{tree_df}{a \verb{<data.frame>} containing at least columns for "chain_id",
+"ancestor", and "generation". Also has optional columns for "time", and
+"chain_id".}
+
+\item{chains_run}{Number of chains/cases used to generate the outbreak;
+Integer}
+
+\item{statistic}{String; Statistic (size/length) to calculate. Used to
+determine stopping criteria for simulations when \code{stat_max} is finite.
+Can be one of:
+\itemize{
+\item "size": the total number of offspring.
+\item "length": the total number of ancestors.
+}}
+
+\item{stat_max}{A cut off for the chain statistic (size/length) being
+computed. Results above the specified value, are set to this value.
+Defaults to \code{Inf}.}
+
+\item{intvn_mean_reduction}{A number between 0
+and 1 for scaling/reducing the mean of \code{offspring_dist}. Serves as
+population-level intervention. \code{intvn_mean_reduction} = 0
+implies no intervention impact and \code{intvn_mean_reduction} = 1 implies full
+impact.}
+
+\item{track_pop}{Was the susceptible population tracked; Logical}
+}
+\value{
+An \verb{<epichains_tree>} object
+}
+\description{
+\code{epichains_tree()} constructs an \verb{<epichains_tree>} object, which is
+inherently an \verb{<data.frame>} object that stores some of the inputs
+passed to the \code{simulate_tree()} and \code{simulate_tree_from_pop()} and the
+simulated output. The stored attributes are useful for scenario
+analyses where the inputs are required for downstream analyses.
+
+An \verb{<epichains_tree>} object contains a \verb{<data.frame>} of the simulated
+outbreak with ids for each case/chain and the chain the produced, the
+number of cases/chains used for the simulation, the statistic that was
+tracked, the intervention level, and whether the susceptible population was
+tracked.
+}
+\author{
+James M. Azam
+}
diff --git a/man/new_epichains_tree.Rd b/man/new_epichains_tree.Rd
new file mode 100644
index 00000000..eba1069b
--- /dev/null
+++ b/man/new_epichains_tree.Rd
@@ -0,0 +1,56 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{new_epichains_tree}
+\alias{new_epichains_tree}
+\title{Construct a \verb{<epichains_tree>} object}
+\usage{
+new_epichains_tree(
+  tree_df = data.frame(),
+  chains_run = integer(),
+  statistic = character(),
+  stat_max = double(),
+  intvn_mean_reduction = double(),
+  track_pop = logical()
+)
+}
+\arguments{
+\item{tree_df}{a \verb{<data.frame>} containing at least columns for "chain_id",
+"ancestor", and "generation". Also has optional columns for "time", and
+"chain_id".}
+
+\item{chains_run}{Number of chains/cases used to generate the outbreak;
+Integer}
+
+\item{statistic}{String; Statistic (size/length) to calculate. Used to
+determine stopping criteria for simulations when \code{stat_max} is finite.
+Can be one of:
+\itemize{
+\item "size": the total number of offspring.
+\item "length": the total number of ancestors.
+}}
+
+\item{stat_max}{A cut off for the chain statistic (size/length) being
+computed. Results above the specified value, are set to this value.
+Defaults to \code{Inf}.}
+
+\item{intvn_mean_reduction}{A number between 0
+and 1 for scaling/reducing the mean of \code{offspring_dist}. Serves as
+population-level intervention. \code{intvn_mean_reduction} = 0
+implies no intervention impact and \code{intvn_mean_reduction} = 1 implies full
+impact.}
+
+\item{track_pop}{Was the susceptible population tracked; Logical}
+}
+\description{
+\code{new_epichains_tree()} constructs an \verb{<epichains_tree>} object from a
+supplied \verb{<data.frame>} and extra attributes passed as individual arguments.
+It is meant to be lazy and performant, by creating the object without
+checking the arguments for correctness. It is not safe to call
+\code{new_epichains_tree()} on its own as is called within \code{epichains_tree()}
+after the arguments have been checked. To create an \verb{<epichains_tree>}
+object, use \code{epichains_tree()}.
+}
+\author{
+James M. Azam
+}
+\keyword{internal}

From 9fd0c34b718b07d71b6e22a64d83d3fc360b16ad Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 11 Oct 2023 12:53:21 +0100
Subject: [PATCH 793/828] Add constructor for epichains_summary class

---
 R/epichains.R                | 79 ++++++++++++++++++++++++++++++++++++
 man/epichains_summary.Rd     | 52 ++++++++++++++++++++++++
 man/new_epichains_summary.Rd | 53 ++++++++++++++++++++++++
 3 files changed, 184 insertions(+)
 create mode 100644 man/epichains_summary.Rd
 create mode 100644 man/new_epichains_summary.Rd

diff --git a/R/epichains.R b/R/epichains.R
index 3b2a3987..741a4538 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -88,6 +88,85 @@ epichains_tree <- function(tree_df = data.frame(),
 
   return(epichains_tree)
 }
+
+#' Construct a `<epichains_summary>` object
+#'
+#' @description
+#' `new_epichains_summary()` constructs an `<epichains_summary>` object from a
+#' supplied `<vector>` of chain sizes or lengths. It also stores extra
+#' attributes passed as individual arguments.
+#'
+#' `new_epichains_summary()` is meant to be lazy and performant, by creating
+#' the object without checking the arguments for correctness. It is not safe
+#' to call `new_epichains_summary()` on its own as is called within
+#' `epichains_summary()` after the arguments have been checked. To create a
+#' new `<epichains_summary>` object safely, use `epichains_summary()`.
+#'
+#' @param chains_summary a `<vector>` of chain sizes and lengths.
+#' @inheritParams new_epichains_tree
+#' @inheritParams simulate_tree
+#' @author James M. Azam
+#' @keywords internal
+new_epichains_summary <- function(chains_summary = vector(),
+                                  chains_run = integer(),
+                                  statistic = character(),
+                                  stat_max = double(),
+                                  intvn_mean_reduction = double()
+                                  ) {
+  # Assemble the elements of the object
+  obj <- structure(
+    chains_summary,
+    chains_run = chains_run,
+    statistic = statistic,
+    stat_max = stat_max,
+    intvn_mean_reduction = intvn_mean_reduction,
+    class = c("epichains_summary", "vector")
+  )
+  return(obj)
+}
+
+#' Create an `<epichains_summary>` object
+#'
+#' @description
+#' `epichains_summary()` constructs an `<epichains_summary>` object.
+#'
+#' An `<epichains_summary>` object is a `<vector>` of the simulated
+#' chain sizes or lengths. It also stores information on the
+#' number of cases/chains used for the simulation, and the statistic that was
+#' tracked, the intervention level.
+#'
+#' @inheritParams new_epichains_summary
+#'
+#' @return An `<epichains_summary>` object
+#' @author James M. Azam
+#' @export
+epichains_summary <- function(chains_summary = vector(),
+                              chains_run = integer(),
+                              statistic = character(),
+                              stat_max = double(),
+                              intvn_mean_reduction = double()
+                              ) {
+  # Check that inputs are well specified
+  checkmate::assert_vector(chains_summary)
+  checkmate::assert_integerish(chains_run, null.ok = TRUE)
+  checkmate::assert_character(statistic)
+  checkmate::assert_integerish(stat_max, null.ok = TRUE)
+  checkmate::assert_double(intvn_mean_reduction)
+
+  # Create <epichains_summary> object
+  epichains_summary <- new_epichains_summary(
+    chains_summary,
+    chains_run = chains_run,
+    statistic = statistic,
+    stat_max = stat_max,
+    intvn_mean_reduction = intvn_mean_reduction
+  )
+
+  # Validate the created object
+  validate_epichains_summary(epichains_summary)
+
+  return(epichains_summary)
+}
   format(x, ...)
 }
 
diff --git a/man/epichains_summary.Rd b/man/epichains_summary.Rd
new file mode 100644
index 00000000..97a6931a
--- /dev/null
+++ b/man/epichains_summary.Rd
@@ -0,0 +1,52 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{epichains_summary}
+\alias{epichains_summary}
+\title{Create an \verb{<epichains_summary>} object}
+\usage{
+epichains_summary(
+  chains_summary = vector(),
+  chains_run = integer(),
+  statistic = character(),
+  stat_max = double(),
+  intvn_mean_reduction = double()
+)
+}
+\arguments{
+\item{chains_summary}{a \verb{<vector>} of chain sizes and lengths.}
+
+\item{chains_run}{Number of chains/cases used to generate the outbreak;
+Integer}
+
+\item{statistic}{String; Statistic (size/length) to calculate. Used to
+determine stopping criteria for simulations when \code{stat_max} is finite.
+Can be one of:
+\itemize{
+\item "size": the total number of offspring.
+\item "length": the total number of ancestors.
+}}
+
+\item{stat_max}{A cut off for the chain statistic (size/length) being
+computed. Results above the specified value, are set to this value.
+Defaults to \code{Inf}.}
+
+\item{intvn_mean_reduction}{A number between 0
+and 1 for scaling/reducing the mean of \code{offspring_dist}. Serves as
+population-level intervention. \code{intvn_mean_reduction} = 0
+implies no intervention impact and \code{intvn_mean_reduction} = 1 implies full
+impact.}
+}
+\value{
+An \verb{<epichains_summary>} object
+}
+\description{
+\code{epichains_summary()} constructs an \verb{<epichains_summary>} object.
+
+An \verb{<epichains_summary>} object is a \verb{<vector>} of the simulated
+chain sizes or lengths. It also stores information on the
+number of cases/chains used for the simulation, and the statistic that was
+tracked, the intervention level.
+}
+\author{
+James M. Azam
+}
diff --git a/man/new_epichains_summary.Rd b/man/new_epichains_summary.Rd
new file mode 100644
index 00000000..bf7ffef2
--- /dev/null
+++ b/man/new_epichains_summary.Rd
@@ -0,0 +1,53 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{new_epichains_summary}
+\alias{new_epichains_summary}
+\title{Construct a \verb{<epichains_summary>} object}
+\usage{
+new_epichains_summary(
+  chains_summary = vector(),
+  chains_run = integer(),
+  statistic = character(),
+  stat_max = double(),
+  intvn_mean_reduction = double()
+)
+}
+\arguments{
+\item{chains_summary}{a \verb{<vector>} of chain sizes and lengths.}
+
+\item{chains_run}{Number of chains/cases used to generate the outbreak;
+Integer}
+
+\item{statistic}{String; Statistic (size/length) to calculate. Used to
+determine stopping criteria for simulations when \code{stat_max} is finite.
+Can be one of:
+\itemize{
+\item "size": the total number of offspring.
+\item "length": the total number of ancestors.
+}}
+
+\item{stat_max}{A cut off for the chain statistic (size/length) being
+computed. Results above the specified value, are set to this value.
+Defaults to \code{Inf}.}
+
+\item{intvn_mean_reduction}{A number between 0
+and 1 for scaling/reducing the mean of \code{offspring_dist}. Serves as
+population-level intervention. \code{intvn_mean_reduction} = 0
+implies no intervention impact and \code{intvn_mean_reduction} = 1 implies full
+impact.}
+}
+\description{
+\code{new_epichains_summary()} constructs an \verb{<epichains_summary>} object from a
+supplied \verb{<vector>} of chain sizes or lengths. It also stores extra
+attributes passed as individual arguments.
+
+\code{new_epichains_summary()} is meant to be lazy and performant, by creating
+the object without checking the arguments for correctness. It is not safe
+to call \code{new_epichains_summary()} on its own as is called within
+\code{epichains_summary()} after the arguments have been checked. To create a
+new \verb{<epichains_summary>} object safely, use \code{epichains_summary()}.
+}
+\author{
+James M. Azam
+}
+\keyword{internal}

From d61f0fa35276222021f283ef031acb1ed7f1fbaa Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 11 Oct 2023 12:55:43 +0100
Subject: [PATCH 794/828] Add print and format methods for epichains_tree

---
 R/epichains.R                | 70 +++++++++++++++++++++++-------------
 man/format.epichains_tree.Rd | 23 ++++++++++++
 man/is_chains_summary.Rd     | 17 ---------
 man/print.epichains_tree.Rd  | 23 ++++++++++++
 4 files changed, 92 insertions(+), 41 deletions(-)
 create mode 100644 man/format.epichains_tree.Rd
 delete mode 100644 man/is_chains_summary.Rd
 create mode 100644 man/print.epichains_tree.Rd

diff --git a/R/epichains.R b/R/epichains.R
index 741a4538..1a7f705e 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -167,52 +167,74 @@ epichains_summary <- function(chains_summary = vector(),
 
   return(epichains_summary)
 }
+
+#' Print an `<epichains_tree>` object
+#'
+#' @param x An `<epichains_tree>` object.
+#' @param ... Other parameters passed to `print()`.
+#' @return Invisibly returns an `<epichains_tree>`. Called for
+#' side-effects.
+#' @author James M. Azam
+#' @export
+print.epichains_tree <- function(x, ...) {
   format(x, ...)
 }
 
 #' Format method for epichains class
 #'
 #' @param x epichains object
+#' Format method for `<epichains_tree>` class
+#'
+#' @param x An `<epichains_tree>` object
 #' @param ... further arguments passed to or from other methods
-#' @return Invisibly returns an [`epichains`]. Called for printing side-effects.
+#' @return Invisibly returns an `<epichains_tree>`.
+#' Called for printing side-effects.
 #' @author James M. Azam
 #' @export
-format.epichains <- function(x, ...) {
-  # check that x is an epichains object
-  validate_epichains(x)
+format.epichains_tree <- function(x, ...) {
+  # check that x is an <epichains_tree> object
+  validate_epichains_tree(x)
 
   # summarise the information stored in x
   chain_info <- summary(x)
 
-  if (is_chains_tree(x)) {
-    writeLines(sprintf("`epichains` object\n"))
-    # print head of the object
-    print(head(x))
-    # print tail of object
-    print(tail(x))
+  writeLines(sprintf("`<epichains_tree>` object\n"))
 
-    # print summary information
-    writeLines(
-      c(
-        sprintf("Chains simulated: %s", chain_info[["chains_run"]]),
-        sprintf(
-          "Number of infectors (known): %s",
-          chain_info[["unique_infectors"]]
+  # print head of the object
+  writeLines("< tree head (from first known ancestor) >\n")
+  print(head(x))
+
+  # print summary information
+  writeLines(
+    c(
+      sprintf(
+        "%s",
+        "\n"
+        ),
+      sprintf(
+        "Chains simulated: %s",
+        chain_info[["chains_run"]]
+        ),
+      sprintf(
+          "Number of ancestors (known): %s",
+          chain_info[["unique_ancestors"]]
         ),
         sprintf(
-          "Number of generations: %s", chain_info[["max_generation"]]
+          "Number of generations: %s",
+          chain_info[["max_generation"]]
         )
       )
     )
 
-    # Offer more information to view the full dataset
-    writeLines(sprintf(
+  # Offer more information to view the full dataset
+  writeLines(
+    sprintf(
       "%s %s", "Use `as.data.frame(<object_name>)`",
       "to view the full output in the console."
-    ))
-  } else if (is_chains_summary(x)) {
-    writeLines(sprintf("`epichains` object \n"))
-    print(as.vector(x))
+      )
+    )
+  invisible(x)
+}
     writeLines(sprintf(
       "\n Number of chains simulated: %s",
       chain_info[["unique_chains"]]
diff --git a/man/format.epichains_tree.Rd b/man/format.epichains_tree.Rd
new file mode 100644
index 00000000..efd19817
--- /dev/null
+++ b/man/format.epichains_tree.Rd
@@ -0,0 +1,23 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{format.epichains_tree}
+\alias{format.epichains_tree}
+\title{Format method for \verb{<epichains_tree>} class}
+\usage{
+\method{format}{epichains_tree}(x, ...)
+}
+\arguments{
+\item{x}{An \verb{<epichains_tree>} object}
+
+\item{...}{further arguments passed to or from other methods}
+}
+\value{
+Invisibly returns an \verb{<epichains_tree>}.
+Called for printing side-effects.
+}
+\description{
+Format method for \verb{<epichains_tree>} class
+}
+\author{
+James M. Azam
+}
diff --git a/man/is_chains_summary.Rd b/man/is_chains_summary.Rd
deleted file mode 100644
index 6a7e0adb..00000000
--- a/man/is_chains_summary.Rd
+++ /dev/null
@@ -1,17 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/epichains.R
-\name{is_chains_summary}
-\alias{is_chains_summary}
-\title{Check if an epichains object has the \code{chains_summary} attribute}
-\usage{
-is_chains_summary(x)
-}
-\arguments{
-\item{x}{An \code{\link{epichains}} object}
-}
-\description{
-Check if an epichains object has the \code{chains_summary} attribute
-}
-\author{
-James M. Azam
-}
diff --git a/man/print.epichains_tree.Rd b/man/print.epichains_tree.Rd
new file mode 100644
index 00000000..bd518dc3
--- /dev/null
+++ b/man/print.epichains_tree.Rd
@@ -0,0 +1,23 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{print.epichains_tree}
+\alias{print.epichains_tree}
+\title{Print an \verb{<epichains_tree>} object}
+\usage{
+\method{print}{epichains_tree}(x, ...)
+}
+\arguments{
+\item{x}{An \verb{<epichains_tree>} object.}
+
+\item{...}{Other parameters passed to \code{print()}.}
+}
+\value{
+Invisibly returns an \verb{<epichains_tree>}. Called for
+side-effects.
+}
+\description{
+Print an \verb{<epichains_tree>} object
+}
+\author{
+James M. Azam
+}

From 1d232b1c8b956a71aa540559e0ae4ef22e2597b6 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 11 Oct 2023 12:56:35 +0100
Subject: [PATCH 795/828] Add print and format methods for epichains_summary
 class

---
 R/epichains.R                   | 58 ++++++++++++++++++++++++++-------
 man/format.epichains_summary.Rd | 23 +++++++++++++
 man/print.epichains.Rd          | 22 -------------
 man/print.epichains_summary.Rd  | 23 +++++++++++++
 man/tail.epichains.Rd           | 25 --------------
 5 files changed, 92 insertions(+), 59 deletions(-)
 create mode 100644 man/format.epichains_summary.Rd
 delete mode 100644 man/print.epichains.Rd
 create mode 100644 man/print.epichains_summary.Rd
 delete mode 100644 man/tail.epichains.Rd

diff --git a/R/epichains.R b/R/epichains.R
index 1a7f705e..105972d3 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -180,9 +180,18 @@ print.epichains_tree <- function(x, ...) {
   format(x, ...)
 }
 
-#' Format method for epichains class
+#' Print an `<epichains_summary>` object
 #'
-#' @param x epichains object
+#' @param x An `<epichains_summary>` object.
+#' @param ... Other parameters passed to `print()`.
+#' @return Invisibly returns an `<epichains_summary>`. Called for
+#' side-effects.
+#' @author James M. Azam
+#' @export
+print.epichains_summary <- function(x, ...) {
+  format(x, ...)
+}
+
 #' Format method for `<epichains_tree>` class
 #'
 #' @param x An `<epichains_tree>` object
@@ -235,21 +244,46 @@ format.epichains_tree <- function(x, ...) {
     )
   invisible(x)
 }
-    writeLines(sprintf(
+
+#' Format method for `<epichains_summary>` class
+#'
+#' @param x An `<epichains_summary>` object
+#' @param ... further arguments passed to or from other methods
+#' @return Invisibly returns an `<epichains_summary>`. Called for printing
+#' side-effects.
+#' @author James M. Azam
+#' @export
+format.epichains_summary <- function(x, ...) {
+  # check that x is an <epichains_summary> object
+  validate_epichains_summary(x)
+
+  # summarise the information stored in x
+  chain_info <- summary(x)
+
+  writeLines(sprintf("`epichains_summary` object \n"))
+  print(as.vector(x))
+  writeLines(
+    sprintf(
       "\n Number of chains simulated: %s",
       chain_info[["unique_chains"]]
-    ))
-    writeLines(
-      c(
-        sprintf(
-          "\n Simulated chain %ss: \n",
-          attr(x, "statistic", exact = TRUE)
+      )
+    )
+  writeLines(
+    c(
+      sprintf(
+        "\n Simulated chain %ss: \n",
+        attr(x, "statistic", exact = TRUE)
+        ),
+      sprintf(
+        "Max: %s",
+        chain_info[["max_chain_stat"]]
         ),
-        sprintf("Max: %s", chain_info[["max_chain_stat"]]),
-        sprintf("Min: %s", chain_info[["min_chain_stat"]])
+      sprintf(
+        "Min: %s",
+        chain_info[["min_chain_stat"]]
+        )
       )
     )
-  }
 
   invisible(x)
 }
diff --git a/man/format.epichains_summary.Rd b/man/format.epichains_summary.Rd
new file mode 100644
index 00000000..853c6e11
--- /dev/null
+++ b/man/format.epichains_summary.Rd
@@ -0,0 +1,23 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{format.epichains_summary}
+\alias{format.epichains_summary}
+\title{Format method for \verb{<epichains_summary>} class}
+\usage{
+\method{format}{epichains_summary}(x, ...)
+}
+\arguments{
+\item{x}{An \verb{<epichains_summary>} object}
+
+\item{...}{further arguments passed to or from other methods}
+}
+\value{
+Invisibly returns an \verb{<epichains_summary>}. Called for printing
+side-effects.
+}
+\description{
+Format method for \verb{<epichains_summary>} class
+}
+\author{
+James M. Azam
+}
diff --git a/man/print.epichains.Rd b/man/print.epichains.Rd
deleted file mode 100644
index ad9c2347..00000000
--- a/man/print.epichains.Rd
+++ /dev/null
@@ -1,22 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/epichains.R
-\name{print.epichains}
-\alias{print.epichains}
-\title{Print an \code{\link{epichains}} object}
-\usage{
-\method{print}{epichains}(x, ...)
-}
-\arguments{
-\item{x}{An \code{\link{epichains}} object.}
-
-\item{...}{Other parameters passed to \code{\link[=print]{print()}}.}
-}
-\value{
-Invisibly returns an \code{\link{epichains}}. Called for side-effects.
-}
-\description{
-Print an \code{\link{epichains}} object
-}
-\author{
-James M. Azam
-}
diff --git a/man/print.epichains_summary.Rd b/man/print.epichains_summary.Rd
new file mode 100644
index 00000000..4c67b8bb
--- /dev/null
+++ b/man/print.epichains_summary.Rd
@@ -0,0 +1,23 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{print.epichains_summary}
+\alias{print.epichains_summary}
+\title{Print an \verb{<epichains_summary>} object}
+\usage{
+\method{print}{epichains_summary}(x, ...)
+}
+\arguments{
+\item{x}{An \verb{<epichains_summary>} object.}
+
+\item{...}{Other parameters passed to \code{print()}.}
+}
+\value{
+Invisibly returns an \verb{<epichains_summary>}. Called for
+side-effects.
+}
+\description{
+Print an \verb{<epichains_summary>} object
+}
+\author{
+James M. Azam
+}
diff --git a/man/tail.epichains.Rd b/man/tail.epichains.Rd
deleted file mode 100644
index 75b3134d..00000000
--- a/man/tail.epichains.Rd
+++ /dev/null
@@ -1,25 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/epichains.R
-\name{tail.epichains}
-\alias{tail.epichains}
-\title{\code{tail} method for \code{\link{epichains}} class}
-\usage{
-\method{tail}{epichains}(x, ...)
-}
-\arguments{
-\item{x}{An \code{\link{epichains}} object}
-
-\item{...}{further arguments passed to or from other methods}
-}
-\description{
-\code{tail} method for \code{\link{epichains}} class
-}
-\details{
-This returns the bottom part of an \code{epichains} object. Note that the object
-is originally sorted by \code{sim_id} and \code{infector_id} and the first
-unknown ancestors (NA) have been dropped from
-printing method. To view the full output, use \verb{as.data.frame(<object_name>)}.
-}
-\author{
-James M. Azam
-}

From 33b24a6aa98e87094e3638fda14919846962198b Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 11 Oct 2023 12:57:30 +0100
Subject: [PATCH 796/828] Add summary method for epichains_tree and
 epichains_summary

---
 R/epichains.R                    | 63 +++++++++++++++++++-------------
 man/format.epichains.Rd          | 22 -----------
 man/summary.epichains.Rd         | 22 -----------
 man/summary.epichains_summary.Rd | 22 +++++++++++
 man/summary.epichains_tree.Rd    | 22 +++++++++++
 5 files changed, 81 insertions(+), 70 deletions(-)
 delete mode 100644 man/format.epichains.Rd
 delete mode 100644 man/summary.epichains.Rd
 create mode 100644 man/summary.epichains_summary.Rd
 create mode 100644 man/summary.epichains_tree.Rd

diff --git a/R/epichains.R b/R/epichains.R
index 105972d3..dd98fb38 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -288,53 +288,64 @@ format.epichains_summary <- function(x, ...) {
   invisible(x)
 }
 
-
-
-#' Summary method for epichains class
+#' Summary method for `epichains_tree` class
 #'
-#' @param object An [`epichains`] object
+#' @param object An `epichains_tree` object
 #' @param ... further arguments passed to or from other methods
 #'
-#' @return data frame of information
+#' @return List of summaries
 #' @author James M. Azam
 #' @export
-summary.epichains <- function(object, ...) {
-  validate_epichains(object)
+summary.epichains_tree <- function(object, ...) {
+  validate_epichains_tree(object)
 
-  chains_run <- attr(object, "chains", exact = TRUE)
+  chains_run <- attr(object, "chains_run", exact = TRUE)
 
-  if (is_chains_tree(object)) {
-    max_time <- ifelse(("time" %in% names(object)), max(object$time), NA)
+  max_time <- ifelse(("time" %in% names(object)), max(object$time), NA)
 
-    n_unique_infectors <- length(
-      unique(object$infector_id[!is.na(object$infector_id)])
-    )
+  n_unique_ancestors <- length(unique(object$ancestor[!is.na(object$ancestor)]))
 
-    max_generation <- max(object$generation)
+  max_generation <- max(object$generation)
 
-    # out of summary
-    res <- list(
-      chains_run = chains_run,
-      max_time = max_time,
-      unique_infectors = n_unique_infectors,
-      max_generation = max_generation
+  # List of summaries
+  out <- list(
+    chains_run = chains_run,
+    max_time = max_time,
+    unique_ancestors = n_unique_ancestors,
+    max_generation = max_generation
     )
-  } else if (is_chains_summary(object)) {
-    if (all(is.infinite(object))) {
-      max_chain_stat <- min_chain_stat <- Inf
+
+  return(out)
+}
+
+#' Summary method for `<epichains_summary>` class
+#'
+#' @param object An `<epichains_summary>` object
+#' @param ... further arguments passed to or from other methods
+#'
+#' @return List of summaries
+#' @author James M. Azam
+#' @export
+summary.epichains_summary <- function(object, ...) {
+  validate_epichains_summary(object)
+
+  chains_run <- attr(object, "chains_run", exact = TRUE)
+
+
+  if (all(is.infinite(object))) {
+    max_chain_stat <- min_chain_stat <- Inf
     } else {
       max_chain_stat <- max(object[!is.infinite(object)])
       min_chain_stat <- min(object[!is.infinite(object)])
     }
 
-    res <- list(
+    out <- list(
       chains_run = chains_run,
       max_chain_stat = max_chain_stat,
       min_chain_stat = min_chain_stat
     )
-  }
 
-  return(res)
+  return(out)
 }
 
 #' Reports whether x is an `epichains` object
diff --git a/man/format.epichains.Rd b/man/format.epichains.Rd
deleted file mode 100644
index 6b46c5ca..00000000
--- a/man/format.epichains.Rd
+++ /dev/null
@@ -1,22 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/epichains.R
-\name{format.epichains}
-\alias{format.epichains}
-\title{Format method for epichains class}
-\usage{
-\method{format}{epichains}(x, ...)
-}
-\arguments{
-\item{x}{epichains object}
-
-\item{...}{further arguments passed to or from other methods}
-}
-\value{
-Invisibly returns an \code{\link{epichains}}. Called for printing side-effects.
-}
-\description{
-Format method for epichains class
-}
-\author{
-James M. Azam
-}
diff --git a/man/summary.epichains.Rd b/man/summary.epichains.Rd
deleted file mode 100644
index 83e28801..00000000
--- a/man/summary.epichains.Rd
+++ /dev/null
@@ -1,22 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/epichains.R
-\name{summary.epichains}
-\alias{summary.epichains}
-\title{Summary method for epichains class}
-\usage{
-\method{summary}{epichains}(object, ...)
-}
-\arguments{
-\item{object}{An \code{\link{epichains}} object}
-
-\item{...}{further arguments passed to or from other methods}
-}
-\value{
-data frame of information
-}
-\description{
-Summary method for epichains class
-}
-\author{
-James M. Azam
-}
diff --git a/man/summary.epichains_summary.Rd b/man/summary.epichains_summary.Rd
new file mode 100644
index 00000000..f66bb51f
--- /dev/null
+++ b/man/summary.epichains_summary.Rd
@@ -0,0 +1,22 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{summary.epichains_summary}
+\alias{summary.epichains_summary}
+\title{Summary method for \verb{<epichains_summary>} class}
+\usage{
+\method{summary}{epichains_summary}(object, ...)
+}
+\arguments{
+\item{object}{An \verb{<epichains_summary>} object}
+
+\item{...}{further arguments passed to or from other methods}
+}
+\value{
+List of summaries
+}
+\description{
+Summary method for \verb{<epichains_summary>} class
+}
+\author{
+James M. Azam
+}
diff --git a/man/summary.epichains_tree.Rd b/man/summary.epichains_tree.Rd
new file mode 100644
index 00000000..fe8a290d
--- /dev/null
+++ b/man/summary.epichains_tree.Rd
@@ -0,0 +1,22 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{summary.epichains_tree}
+\alias{summary.epichains_tree}
+\title{Summary method for \code{epichains_tree} class}
+\usage{
+\method{summary}{epichains_tree}(object, ...)
+}
+\arguments{
+\item{object}{An \code{epichains_tree} object}
+
+\item{...}{further arguments passed to or from other methods}
+}
+\value{
+List of summaries
+}
+\description{
+Summary method for \code{epichains_tree} class
+}
+\author{
+James M. Azam
+}

From c047cc7690dfb4a86682cf734998a5145e172015 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 11 Oct 2023 12:58:22 +0100
Subject: [PATCH 797/828] Add validation checkers for the two classes

---
 R/epichains.R                     | 70 +++++++++++++------------------
 man/is_chains_tree.Rd             | 17 --------
 man/is_epichains.Rd               | 21 ----------
 man/is_epichains_aggregate_df.Rd  | 21 ----------
 man/is_epichains_summary.Rd       | 21 ++++++++++
 man/is_epichains_tree.Rd          | 21 ++++++++++
 man/validate_epichains.Rd         | 20 ---------
 man/validate_epichains_summary.Rd | 20 +++++++++
 man/validate_epichains_tree.Rd    | 20 +++++++++
 9 files changed, 111 insertions(+), 120 deletions(-)
 delete mode 100644 man/is_chains_tree.Rd
 delete mode 100644 man/is_epichains.Rd
 delete mode 100644 man/is_epichains_aggregate_df.Rd
 create mode 100644 man/is_epichains_summary.Rd
 create mode 100644 man/is_epichains_tree.Rd
 delete mode 100644 man/validate_epichains.Rd
 create mode 100644 man/validate_epichains_summary.Rd
 create mode 100644 man/validate_epichains_tree.Rd

diff --git a/R/epichains.R b/R/epichains.R
index dd98fb38..4d1887ab 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -348,39 +348,40 @@ summary.epichains_summary <- function(object, ...) {
   return(out)
 }
 
-#' Reports whether x is an `epichains` object
+#' Test if x is an `epichains_tree` object
 #'
 #' @param x An R object
 #'
-#' @return logical, `TRUE` if the object is an `epichains` and `FALSE`
+#' @return logical, `TRUE` if the object is an `epichains_tree` and `FALSE`
 #' otherwise
-#' @export
 #' @author James M. Azam
-is_epichains <- function(x) {
-  inherits(x, "epichains")
+#' @export
+is_epichains_tree <- function(x) {
+  inherits(x, "epichains_tree")
 }
 
-#' Reports whether x is an "epichains_aggregate_df" object
+#' Test if x is an `epichains_summary` object
 #'
-#' @param x An [`epichains`] object
-#' @return logical, `TRUE` if the object is an `epichains_aggregate_df` and
-#' `FALSE` otherwise
-#' @export
+#' @param x An R object
+#'
+#' @return logical, `TRUE` if the object is an `epichains_summary` and `FALSE`
+#' otherwise
 #' @author James M. Azam
-is_epichains_aggregate_df <- function(x) {
-  inherits(x, "epichains_aggregate_df")
+#' @export
+is_epichains_summary <- function(x) {
+  inherits(x, "epichains_summary")
 }
 
-#' `epichains` class validator
+#' Validate an `<epichains_tree>` object
 #'
-#' @param x An `epichains` object
+#' @param x An `<epichains_tree>` object
 #'
 #' @return No return.
-#' @export
 #' @author James M. Azam
-validate_epichains <- function(x) {
-  if (!is_epichains(x)) {
-    stop("Object must have an epichains class")
+#' @export
+validate_epichains_tree <- function(x) {
+  if (!is_epichains_tree(x)) {
+    stop("Object must have an `<epichains_tree>` class")
   }
 
   # check for class invariants
@@ -390,43 +391,30 @@ validate_epichains <- function(x) {
       "object does not contain the correct columns" =
         c("sim_id", "infector_id", "generation") %in%
         colnames(x),
-      "column `sim_id` must be a numeric" =
+    "column `sim_id` must be a numeric" =
         is.numeric(x$sim_id),
       "column `infector_id` must be a numeric" =
         is.numeric(x$infector_id),
       "column `generation` must be a numeric" =
         is.numeric(x$generation)
     )
-  } else {
-    stopifnot(
-      "object must be a numeric vector" =
-        is.numeric(x)
-    )
-  }
 
   invisible(x)
 }
 
-#' Check if an epichains object has the `chains_tree` attribute
+#' Validate an `<epichains_summary>` object
 #'
-#' @param x An [`epichains`] object
+#' @param x An `<epichains_summary>` object
 #'
-#' @export
+#' @return No return.
 #' @author James M. Azam
-is_chains_tree <- function(x) {
-  !is.null(attributes(x)$chain_type) &&
-    attributes(x)$chain_type == "chains_tree"
-}
-
-#' Check if an epichains object has the `chains_summary` attribute
-#'
-#' @param x An [`epichains`] object
-#'
 #' @export
-#' @author James M. Azam
-is_chains_summary <- function(x) {
-  !is.null(attributes(x)$chain_type) &&
-    attributes(x)$chain_type == "chains_summary"
+validate_epichains_summary <- function(x) {
+  if (!is_epichains_summary(x)) {
+    stop("Object must have an `<epichains_summary>` class")
+  }
+
+  invisible(x)
 }
 
 #' `head` and `tail` method for `<epichains_tree>` class
diff --git a/man/is_chains_tree.Rd b/man/is_chains_tree.Rd
deleted file mode 100644
index 951a2bcd..00000000
--- a/man/is_chains_tree.Rd
+++ /dev/null
@@ -1,17 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/epichains.R
-\name{is_chains_tree}
-\alias{is_chains_tree}
-\title{Check if an epichains object has the \code{chains_tree} attribute}
-\usage{
-is_chains_tree(x)
-}
-\arguments{
-\item{x}{An \code{\link{epichains}} object}
-}
-\description{
-Check if an epichains object has the \code{chains_tree} attribute
-}
-\author{
-James M. Azam
-}
diff --git a/man/is_epichains.Rd b/man/is_epichains.Rd
deleted file mode 100644
index 5b327eb7..00000000
--- a/man/is_epichains.Rd
+++ /dev/null
@@ -1,21 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/epichains.R
-\name{is_epichains}
-\alias{is_epichains}
-\title{Reports whether x is an \code{epichains} object}
-\usage{
-is_epichains(x)
-}
-\arguments{
-\item{x}{An R object}
-}
-\value{
-logical, \code{TRUE} if the object is an \code{epichains} and \code{FALSE}
-otherwise
-}
-\description{
-Reports whether x is an \code{epichains} object
-}
-\author{
-James M. Azam
-}
diff --git a/man/is_epichains_aggregate_df.Rd b/man/is_epichains_aggregate_df.Rd
deleted file mode 100644
index 98d779c3..00000000
--- a/man/is_epichains_aggregate_df.Rd
+++ /dev/null
@@ -1,21 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/epichains.R
-\name{is_epichains_aggregate_df}
-\alias{is_epichains_aggregate_df}
-\title{Reports whether x is an "epichains_aggregate_df" object}
-\usage{
-is_epichains_aggregate_df(x)
-}
-\arguments{
-\item{x}{An \code{\link{epichains}} object}
-}
-\value{
-logical, \code{TRUE} if the object is an \code{epichains_aggregate_df} and
-\code{FALSE} otherwise
-}
-\description{
-Reports whether x is an "epichains_aggregate_df" object
-}
-\author{
-James M. Azam
-}
diff --git a/man/is_epichains_summary.Rd b/man/is_epichains_summary.Rd
new file mode 100644
index 00000000..4504e9f5
--- /dev/null
+++ b/man/is_epichains_summary.Rd
@@ -0,0 +1,21 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{is_epichains_summary}
+\alias{is_epichains_summary}
+\title{Test if x is an \code{epichains_summary} object}
+\usage{
+is_epichains_summary(x)
+}
+\arguments{
+\item{x}{An R object}
+}
+\value{
+logical, \code{TRUE} if the object is an \code{epichains_summary} and \code{FALSE}
+otherwise
+}
+\description{
+Test if x is an \code{epichains_summary} object
+}
+\author{
+James M. Azam
+}
diff --git a/man/is_epichains_tree.Rd b/man/is_epichains_tree.Rd
new file mode 100644
index 00000000..6b9d5045
--- /dev/null
+++ b/man/is_epichains_tree.Rd
@@ -0,0 +1,21 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{is_epichains_tree}
+\alias{is_epichains_tree}
+\title{Test if x is an \code{epichains_tree} object}
+\usage{
+is_epichains_tree(x)
+}
+\arguments{
+\item{x}{An R object}
+}
+\value{
+logical, \code{TRUE} if the object is an \code{epichains_tree} and \code{FALSE}
+otherwise
+}
+\description{
+Test if x is an \code{epichains_tree} object
+}
+\author{
+James M. Azam
+}
diff --git a/man/validate_epichains.Rd b/man/validate_epichains.Rd
deleted file mode 100644
index 8cddc077..00000000
--- a/man/validate_epichains.Rd
+++ /dev/null
@@ -1,20 +0,0 @@
-% Generated by roxygen2: do not edit by hand
-% Please edit documentation in R/epichains.R
-\name{validate_epichains}
-\alias{validate_epichains}
-\title{\code{epichains} class validator}
-\usage{
-validate_epichains(x)
-}
-\arguments{
-\item{x}{An \code{epichains} object}
-}
-\value{
-No return.
-}
-\description{
-\code{epichains} class validator
-}
-\author{
-James M. Azam
-}
diff --git a/man/validate_epichains_summary.Rd b/man/validate_epichains_summary.Rd
new file mode 100644
index 00000000..8b393e3a
--- /dev/null
+++ b/man/validate_epichains_summary.Rd
@@ -0,0 +1,20 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{validate_epichains_summary}
+\alias{validate_epichains_summary}
+\title{Validate an \verb{<epichains_summary>} object}
+\usage{
+validate_epichains_summary(x)
+}
+\arguments{
+\item{x}{An \verb{<epichains_summary>} object}
+}
+\value{
+No return.
+}
+\description{
+Validate an \verb{<epichains_summary>} object
+}
+\author{
+James M. Azam
+}
diff --git a/man/validate_epichains_tree.Rd b/man/validate_epichains_tree.Rd
new file mode 100644
index 00000000..f06b332b
--- /dev/null
+++ b/man/validate_epichains_tree.Rd
@@ -0,0 +1,20 @@
+% Generated by roxygen2: do not edit by hand
+% Please edit documentation in R/epichains.R
+\name{validate_epichains_tree}
+\alias{validate_epichains_tree}
+\title{Validate an \verb{<epichains_tree>} object}
+\usage{
+validate_epichains_tree(x)
+}
+\arguments{
+\item{x}{An \verb{<epichains_tree>} object}
+}
+\value{
+No return.
+}
+\description{
+Validate an \verb{<epichains_tree>} object
+}
+\author{
+James M. Azam
+}

From 9194d67f12b6a04208dd2aa123a04d6aead0dc24 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 11 Oct 2023 12:58:45 +0100
Subject: [PATCH 798/828] Generate new NAMESPACE

---
 NAMESPACE | 24 ++++++++++++++----------
 1 file changed, 14 insertions(+), 10 deletions(-)

diff --git a/NAMESPACE b/NAMESPACE
index ab92359f..4739a5ed 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -1,16 +1,19 @@
 # Generated by roxygen2: do not edit by hand
 
 S3method(aggregate,epichains)
-S3method(format,epichains)
-S3method(head,epichains)
-S3method(print,epichains)
-S3method(summary,epichains)
-S3method(tail,epichains)
+S3method(format,epichains_summary)
+S3method(format,epichains_tree)
+S3method(head,epichains_tree)
+S3method(print,epichains_summary)
+S3method(print,epichains_tree)
+S3method(summary,epichains_summary)
+S3method(summary,epichains_tree)
+S3method(tail,epichains_tree)
 export(dborel)
-export(is_chains_summary)
-export(is_chains_tree)
-export(is_epichains)
-export(is_epichains_aggregate_df)
+export(epichains_summary)
+export(epichains_tree)
+export(is_epichains_summary)
+export(is_epichains_tree)
 export(likelihood)
 export(offspring_ll)
 export(rborel)
@@ -18,7 +21,8 @@ export(rnbinom_mean_disp)
 export(simulate_summary)
 export(simulate_tree)
 export(simulate_tree_from_pop)
-export(validate_epichains)
+export(validate_epichains_summary)
+export(validate_epichains_tree)
 importFrom(stats,aggregate)
 importFrom(utils,head)
 importFrom(utils,tail)

From 43ae074d8c7e16dc61ab4825d3c3f8ca2ad9c7c7 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 11 Oct 2023 14:56:18 +0100
Subject: [PATCH 799/828] Add comments

---
 R/epichains.R | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/R/epichains.R b/R/epichains.R
index 4d1887ab..12abfe73 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -297,8 +297,10 @@ format.epichains_summary <- function(x, ...) {
 #' @author James M. Azam
 #' @export
 summary.epichains_tree <- function(object, ...) {
+  # Check that object has <epichains_tree> class
   validate_epichains_tree(object)
 
+  # Get the summaries
   chains_run <- attr(object, "chains_run", exact = TRUE)
 
   max_time <- ifelse(("time" %in% names(object)), max(object$time), NA)
@@ -327,8 +329,10 @@ summary.epichains_tree <- function(object, ...) {
 #' @author James M. Azam
 #' @export
 summary.epichains_summary <- function(object, ...) {
+  # Check that object has <epichains_summary> class
   validate_epichains_summary(object)
 
+  # Get the summaries
   chains_run <- attr(object, "chains_run", exact = TRUE)
 
 
From c50eec5f2b0261d2fdf25b9a299da7a5ed4c5d9f Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 11 Oct 2023 20:39:04 +0100
Subject: [PATCH 800/828] Bind right class to aggregate method

---
 NAMESPACE                                                   | 2 +-
 man/{aggregate.epichains.Rd => aggregate.epichains_tree.Rd} | 6 +++---
 2 files changed, 4 insertions(+), 4 deletions(-)
 rename man/{aggregate.epichains.Rd => aggregate.epichains_tree.Rd} (87%)

diff --git a/NAMESPACE b/NAMESPACE
index 4739a5ed..05dd64c8 100644
--- a/NAMESPACE
+++ b/NAMESPACE
@@ -1,6 +1,6 @@
 # Generated by roxygen2: do not edit by hand
 
-S3method(aggregate,epichains)
+S3method(aggregate,epichains_tree)
 S3method(format,epichains_summary)
 S3method(format,epichains_tree)
 S3method(head,epichains_tree)
diff --git a/man/aggregate.epichains.Rd b/man/aggregate.epichains_tree.Rd
similarity index 87%
rename from man/aggregate.epichains.Rd
rename to man/aggregate.epichains_tree.Rd
index 77f9cd89..b80d110a 100644
--- a/man/aggregate.epichains.Rd
+++ b/man/aggregate.epichains_tree.Rd
@@ -1,10 +1,10 @@
 % Generated by roxygen2: do not edit by hand
 % Please edit documentation in R/epichains.R
-\name{aggregate.epichains}
-\alias{aggregate.epichains}
+\name{aggregate.epichains_tree}
+\alias{aggregate.epichains_tree}
 \title{Aggregate cases in \verb{<epichains_tree>} objects by "time" or "generation"}
 \usage{
-\method{aggregate}{epichains}(x, grouping_var = c("time", "generation"), ...)
+\method{aggregate}{epichains_tree}(x, grouping_var = c("time", "generation"), ...)
 }
 \arguments{
 \item{x}{An \verb{<epichains_tree>} object.}

From 5c2c0972778f44e5b6ebac48fb85ead47d3a788d Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 11 Oct 2023 20:40:14 +0100
Subject: [PATCH 801/828] Replace old epichains class with the right classes

---
 tests/testthat/test-epichains.R | 64 ++++++++++++++-------------------
 1 file changed, 27 insertions(+), 37 deletions(-)

diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index e331ea11..231f107f 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -46,27 +46,27 @@ test_that("Simulators return epichains objects", {
   #' Expectations
   expect_s3_class(
     tree_sim_raw,
-    "epichains"
+    "epichains_tree"
   )
   expect_s3_class(
     tree_sim_raw2,
-    "epichains"
+    "epichains_tree"
   )
   expect_s3_class(
     susc_outbreak_raw,
-    "epichains"
+    "epichains_tree"
   )
   expect_s3_class(
     susc_outbreak_raw2,
-    "epichains"
+    "epichains_tree"
   )
   expect_s3_class(
     chain_summary_raw,
-    "epichains"
+    "epichains_summary"
   )
 })
 
-test_that("print.epichains works for simulation functions", {
+test_that("print.epichains_tree works for simulation functions", {
   set.seed(12)
   #' Simulate an outbreak from a susceptible population (pois)
   susc_outbreak_raw <- simulate_tree_from_pop(
@@ -114,7 +114,7 @@ test_that("print.epichains works for simulation functions", {
   expect_snapshot(chain_summary_raw)
 })
 
-test_that("summary.epichains works as expected", {
+test_that("summary.epichains_tree works as expected", {
   set.seed(12)
   #' Simulate an outbreak from a susceptible population (pois)
   susc_outbreak_raw <- simulate_tree_from_pop(
@@ -220,7 +220,7 @@ test_that("summary.epichains works as expected", {
   )
 })
 
-test_that("validate_epichains works", {
+test_that("validate_epichains_tree works", {
   set.seed(12)
   #' Simulate an outbreak from a susceptible population (pois)
   susc_outbreak_raw <- simulate_tree_from_pop(
@@ -262,23 +262,19 @@ test_that("validate_epichains works", {
   )
   #' Expectations
   expect_invisible(
-    validate_epichains(susc_outbreak_raw)
+    validate_epichains_tree(susc_outbreak_raw)
   )
   expect_invisible(
-    validate_epichains(susc_outbreak_raw2)
+    validate_epichains_tree(susc_outbreak_raw2)
   )
   expect_invisible(
-    validate_epichains(tree_sim_raw)
+    validate_epichains_tree(tree_sim_raw)
   )
   expect_invisible(
-    validate_epichains(tree_sim_raw2)
+    validate_epichains_tree(tree_sim_raw2)
   )
   expect_invisible(
-    validate_epichains(chain_summary_raw)
-  )
-  expect_error(
-    validate_epichains(mtcars),
-    "must have an epichains class"
+    validate_epichains_summary(chain_summary_raw)
   )
 })
 
@@ -324,19 +320,19 @@ test_that("is_chains_tree works", {
   )
   #' Expectations
   expect_true(
-    is_chains_tree(susc_outbreak_raw)
+    is_epichains_tree(susc_outbreak_raw)
   )
   expect_true(
-    is_chains_tree(susc_outbreak_raw2)
+    is_epichains_tree(susc_outbreak_raw2)
   )
   expect_true(
-    is_chains_tree(tree_sim_raw)
+    is_epichains_tree(tree_sim_raw)
   )
   expect_true(
-    is_chains_tree(tree_sim_raw2)
+    is_epichains_tree(tree_sim_raw2)
   )
   expect_false(
-    is_chains_tree(chain_summary_raw)
+    is_epichains_tree(chain_summary_raw)
   )
 })
 
@@ -382,23 +378,23 @@ test_that("is_chains_summary works", {
   )
   #' Expectations
   expect_true(
-    is_chains_summary(chain_summary_raw)
+    is_epichains_summary(chain_summary_raw)
   )
   expect_false(
-    is_chains_summary(susc_outbreak_raw)
+    is_epichains_summary(susc_outbreak_raw)
   )
   expect_false(
-    is_chains_summary(susc_outbreak_raw2)
+    is_epichains_summary(susc_outbreak_raw2)
   )
   expect_false(
-    is_chains_summary(tree_sim_raw)
+    is_epichains_summary(tree_sim_raw)
   )
   expect_false(
-    is_chains_summary(tree_sim_raw2)
+    is_epichains_summary(tree_sim_raw2)
   )
 })
 
-test_that("aggregate.epichains method returns correct objects", {
+test_that("aggregate.epichains_tree method returns correct objects", {
   set.seed(12)
   #' Simulate a tree of infections with serials
   tree_sim_raw2 <- simulate_tree(
@@ -418,13 +414,7 @@ test_that("aggregate.epichains method returns correct objects", {
     tree_sim_raw2,
     grouping_var = "time"
   )
-  #' Expectations for <epichains_aggregate_df> class inheritance
-  expect_true(
-    is_epichains_aggregate_df(aggreg_by_gen)
-  )
-  expect_true(
-    is_epichains_aggregate_df(aggreg_by_time)
-  )
+  #' Expectations for aggregated <epichains_tree>
   expect_named(
     aggreg_by_gen,
     c("generation", "cases")
@@ -435,7 +425,7 @@ test_that("aggregate.epichains method returns correct objects", {
   )
 })
 
-test_that("aggregate.epichains method throws errors", {
+test_that("aggregate.epichains_tree method throws errors", {
   expect_error(
     aggregate(
       simulate_tree(
@@ -451,7 +441,7 @@ test_that("aggregate.epichains method throws errors", {
   )
 })
 
-test_that("aggregate.epichains method is numerically correct", {
+test_that("aggregate.epichains_tree method is numerically correct", {
   set.seed(12)
   #' Simulate a tree of infections without serials
   tree_sim_raw <- simulate_tree(

From dd3de5369f1c05ddda0a378400c82872be2e9bf4 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 11 Oct 2023 20:41:15 +0100
Subject: [PATCH 802/828] Remove default types

---
 R/epichains.R                | 18 +++++++++---------
 man/epichains_summary.Rd     |  4 ++--
 man/epichains_tree.Rd        |  4 ++--
 man/new_epichains_summary.Rd |  4 ++--
 man/new_epichains_tree.Rd    |  4 ++--
 5 files changed, 17 insertions(+), 17 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 12abfe73..8ac97d5f 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -17,10 +17,10 @@
 #' @inheritParams epichains_tree
 #' @author James M. Azam
 #' @keywords internal
-new_epichains_tree <- function(tree_df = data.frame(),
+new_epichains_tree <- function(tree_df,
                                chains_run = integer(),
                                statistic = character(),
-                               stat_max = double(),
+                               stat_max = integer(),
                                intvn_mean_reduction = double(),
                                track_pop = logical()
                                ) {
@@ -58,10 +58,10 @@ new_epichains_tree <- function(tree_df = data.frame(),
 #' @return An `<epichains_tree>` object
 #' @author James M. Azam
 #' @export
-epichains_tree <- function(tree_df = data.frame(),
+epichains_tree <- function(tree_df,
                            chains_run = integer(),
                            statistic = character(),
-                           stat_max = double(),
+                           stat_max = integer(),
                            intvn_mean_reduction = double(),
                            track_pop = logical()
                            ) {
@@ -69,9 +69,9 @@ epichains_tree <- function(tree_df = data.frame(),
   checkmate::assert_data_frame(tree_df)
   checkmate::assert_integerish(chains_run, null.ok = TRUE)
   checkmate::assert_character(statistic, null.ok = TRUE)
-  checkmate::assert_integerish(stat_max, null.ok = TRUE)
   checkmate::assert_double(intvn_mean_reduction)
   checkmate::assert_logical(track_pop)
+  checkmate::assert_number(stat_max, null.ok = TRUE)
 
   # Create <epichains_tree> object
   epichains_tree <- new_epichains_tree(
@@ -107,10 +107,10 @@ epichains_tree <- function(tree_df = data.frame(),
 #' @inheritParams simulate_tree
 #' @author James M. Azam
 #' @keywords internal
-new_epichains_summary <- function(chains_summary = vector(),
+new_epichains_summary <- function(chains_summary,
                                   chains_run = integer(),
                                   statistic = character(),
-                                  stat_max = double(),
+                                  stat_max = integer(),
                                   intvn_mean_reduction = double()
                                   ) {
   # Assemble the elements of the object
@@ -140,10 +140,10 @@ new_epichains_summary <- function(chains_summary = vector(),
 #' @return An `<epichains_summary>` object
 #' @author James M. Azam
 #' @export
-epichains_summary <- function(chains_summary = vector(),
+epichains_summary <- function(chains_summary,
                               chains_run = integer(),
                               statistic = character(),
-                              stat_max = double(),
+                              stat_max = integer(),
                               intvn_mean_reduction = double()
                               ) {
   # Check that inputs are well specified
diff --git a/man/epichains_summary.Rd b/man/epichains_summary.Rd
index 97a6931a..8350cd73 100644
--- a/man/epichains_summary.Rd
+++ b/man/epichains_summary.Rd
@@ -5,10 +5,10 @@
 \title{Create an \verb{<epichains_summary>} object}
 \usage{
 epichains_summary(
-  chains_summary = vector(),
+  chains_summary,
   chains_run = integer(),
   statistic = character(),
-  stat_max = double(),
+  stat_max = integer(),
   intvn_mean_reduction = double()
 )
 }
diff --git a/man/epichains_tree.Rd b/man/epichains_tree.Rd
index 68a04db6..a83f2c68 100644
--- a/man/epichains_tree.Rd
+++ b/man/epichains_tree.Rd
@@ -5,10 +5,10 @@
 \title{Create an \verb{<epichains_tree>} object}
 \usage{
 epichains_tree(
-  tree_df = data.frame(),
+  tree_df,
   chains_run = integer(),
   statistic = character(),
-  stat_max = double(),
+  stat_max = integer(),
   intvn_mean_reduction = double(),
   track_pop = logical()
 )
diff --git a/man/new_epichains_summary.Rd b/man/new_epichains_summary.Rd
index bf7ffef2..23556cd0 100644
--- a/man/new_epichains_summary.Rd
+++ b/man/new_epichains_summary.Rd
@@ -5,10 +5,10 @@
 \title{Construct a \verb{<epichains_summary>} object}
 \usage{
 new_epichains_summary(
-  chains_summary = vector(),
+  chains_summary,
   chains_run = integer(),
   statistic = character(),
-  stat_max = double(),
+  stat_max = integer(),
   intvn_mean_reduction = double()
 )
 }
diff --git a/man/new_epichains_tree.Rd b/man/new_epichains_tree.Rd
index eba1069b..647185a6 100644
--- a/man/new_epichains_tree.Rd
+++ b/man/new_epichains_tree.Rd
@@ -5,10 +5,10 @@
 \title{Construct a \verb{<epichains_tree>} object}
 \usage{
 new_epichains_tree(
-  tree_df = data.frame(),
+  tree_df,
   chains_run = integer(),
   statistic = character(),
-  stat_max = double(),
+  stat_max = integer(),
   intvn_mean_reduction = double(),
   track_pop = logical()
 )

From 0a397ac3adcb3bc0189b62c437a10ec978d072a3 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 11 Oct 2023 20:42:03 +0100
Subject: [PATCH 803/828] Loosen assertion for stat_max

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 8ac97d5f..6ae75bf3 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -150,8 +150,8 @@ epichains_summary <- function(chains_summary,
   checkmate::assert_vector(chains_summary)
   checkmate::assert_integerish(chains_run, null.ok = TRUE)
   checkmate::assert_character(statistic)
-  checkmate::assert_integerish(stat_max, null.ok = TRUE)
   checkmate::assert_double(intvn_mean_reduction)
+  checkmate::assert_number(stat_max, null.ok = TRUE)
 
   # Create <epichains_summary> object
   epichains_summary <- new_epichains_summary(

From eeb507a17d491217f61c372525f0db12d9ed424b Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 11 Oct 2023 20:42:19 +0100
Subject: [PATCH 804/828] Update snapshot tests

---
 tests/testthat/_snaps/epichains.md | 16 ++++++----------
 1 file changed, 6 insertions(+), 10 deletions(-)

diff --git a/tests/testthat/_snaps/epichains.md b/tests/testthat/_snaps/epichains.md
index 71ea0092..f115828c 100644
--- a/tests/testthat/_snaps/epichains.md
+++ b/tests/testthat/_snaps/epichains.md
@@ -1,16 +1,15 @@
-# print.epichains works for simulation functions
+# print.epichains_tree works for simulation functions
 
     Code
       susc_outbreak_raw
     Output
-      `epichains` object
+      `<epichains_tree>` object
       
       < tree head (from first known infector) >
       
       [1] sim_id      infector_id generation  time       
       <0 rows> (or 0-length row.names)
       
-      < tree tail >
       
         sim_id infector_id generation time
       1      1          NA          1    0
@@ -23,7 +22,7 @@
     Code
       susc_outbreak_raw2
     Output
-      `epichains` object
+      `<epichains_tree>` object
       
       < tree head (from first known infector) >
       
@@ -35,7 +34,6 @@
       6      6           4          4 44.00812
       7      7           3          4 78.73481
       
-      < tree tail >
       
          sim_id infector_id generation     time
       7       7           3          4 78.73481
@@ -53,7 +51,7 @@
     Code
       tree_sim_raw
     Output
-      `epichains` object
+      `<epichains_tree>` object
       
       < tree head (from first known infector) >
       
@@ -65,7 +63,6 @@
       7           1      4           1          2
       8           2      4           2          3
       
-      < tree tail >
       
          infectee_id sim_id infector_id generation
       12           1      6           4          3
@@ -84,7 +81,7 @@
     Code
       tree_sim_raw2
     Output
-      `epichains` object
+      `<epichains_tree>` object
       
       < tree head (from first known infector) >
       
@@ -96,7 +93,6 @@
       15           6      2           1          2 1.7212668
       16           7      2           1          2 1.3509058
       
-      < tree tail >
       
           infectee_id sim_id infector_id generation      time
       119           9     15           8          5 19.146936
@@ -115,7 +111,7 @@
     Code
       chain_summary_raw
     Output
-      `epichains` object 
+      `epichains_summary` object 
       
       [1] 9 6
       

From fade3ca018c162867e5574844343cd30e936a1ca Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 11 Oct 2023 20:42:36 +0100
Subject: [PATCH 805/828] Bind epichains_tree class to aggregate method

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 6ae75bf3..1b5455a0 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -482,7 +482,7 @@ tail.epichains_tree <- function(x, ...) {
 #' # Aggregate cases per generation
 #' cases_per_gen <- aggregate(chains, grouping_var = "generation")
 #' head(cases_per_gen)
-aggregate.epichains <- function(x,
+aggregate.epichains_tree <- function(x,
                                 grouping_var = c(
                                   "time",
                                   "generation"

From 8bd13ce22a0c8620a42109fe830f248fe22cf642 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Wed, 11 Oct 2023 20:49:00 +0100
Subject: [PATCH 806/828] Styling to fix lintr issues

---
 R/epichains.R | 72 ++++++++++++++++++++++++---------------------------
 1 file changed, 34 insertions(+), 38 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 1b5455a0..cf609995 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -22,8 +22,7 @@ new_epichains_tree <- function(tree_df,
                                statistic = character(),
                                stat_max = integer(),
                                intvn_mean_reduction = double(),
-                               track_pop = logical()
-                               ) {
+                               track_pop = logical()) {
   # Assemble the elements of the object
   obj <- structure(
     tree_df,
@@ -63,8 +62,7 @@ epichains_tree <- function(tree_df,
                            statistic = character(),
                            stat_max = integer(),
                            intvn_mean_reduction = double(),
-                           track_pop = logical()
-                           ) {
+                           track_pop = logical()) {
   # Check that inputs are well specified
   checkmate::assert_data_frame(tree_df)
   checkmate::assert_integerish(chains_run, null.ok = TRUE)
@@ -81,7 +79,7 @@ epichains_tree <- function(tree_df,
     stat_max = stat_max,
     intvn_mean_reduction = intvn_mean_reduction,
     track_pop = track_pop
-    )
+  )
 
   # Validate the created object
   validate_epichains_tree(epichains_tree)
@@ -111,8 +109,7 @@ new_epichains_summary <- function(chains_summary,
                                   chains_run = integer(),
                                   statistic = character(),
                                   stat_max = integer(),
-                                  intvn_mean_reduction = double()
-                                  ) {
+                                  intvn_mean_reduction = double()) {
   # Assemble the elements of the object
   obj <- structure(
     chains_summary,
@@ -144,8 +141,7 @@ epichains_summary <- function(chains_summary,
                               chains_run = integer(),
                               statistic = character(),
                               stat_max = integer(),
-                              intvn_mean_reduction = double()
-                              ) {
+                              intvn_mean_reduction = double()) {
   # Check that inputs are well specified
   checkmate::assert_vector(chains_summary)
   checkmate::assert_integerish(chains_run, null.ok = TRUE)
@@ -219,29 +215,29 @@ format.epichains_tree <- function(x, ...) {
       sprintf(
         "%s",
         "\n"
-        ),
+      ),
       sprintf(
         "Chains simulated: %s",
         chain_info[["chains_run"]]
-        ),
+      ),
+      sprintf(
+        "Number of ancestors (known): %s",
+        chain_info[["unique_ancestors"]]
+      ),
       sprintf(
-          "Number of ancestors (known): %s",
-          chain_info[["unique_ancestors"]]
-        ),
-        sprintf(
-          "Number of generations: %s",
-          chain_info[["max_generation"]]
-        )
+        "Number of generations: %s",
+        chain_info[["max_generation"]]
       )
     )
+  )
 
   # Offer more information to view the full dataset
   writeLines(
     sprintf(
       "%s %s", "Use `as.data.frame(<object_name>)`",
       "to view the full output in the console."
-      )
     )
+  )
   invisible(x)
 }
 
@@ -266,24 +262,24 @@ format.epichains_summary <- function(x, ...) {
     sprintf(
       "\n Number of chains simulated: %s",
       chain_info[["unique_chains"]]
-      )
     )
+  )
   writeLines(
     c(
       sprintf(
         "\n Simulated chain %ss: \n",
         attr(x, "statistic", exact = TRUE)
-        ),
+      ),
       sprintf(
         "Max: %s",
         chain_info[["max_chain_stat"]]
-        ),
+      ),
       sprintf(
         "Min: %s",
         chain_info[["min_chain_stat"]]
-        )
       )
     )
+  )
 
   invisible(x)
 }
@@ -315,7 +311,7 @@ summary.epichains_tree <- function(object, ...) {
     max_time = max_time,
     unique_ancestors = n_unique_ancestors,
     max_generation = max_generation
-    )
+  )
 
   return(out)
 }
@@ -338,16 +334,16 @@ summary.epichains_summary <- function(object, ...) {
 
   if (all(is.infinite(object))) {
     max_chain_stat <- min_chain_stat <- Inf
-    } else {
-      max_chain_stat <- max(object[!is.infinite(object)])
-      min_chain_stat <- min(object[!is.infinite(object)])
-    }
+  } else {
+    max_chain_stat <- max(object[!is.infinite(object)])
+    min_chain_stat <- min(object[!is.infinite(object)])
+  }
 
-    out <- list(
-      chains_run = chains_run,
-      max_chain_stat = max_chain_stat,
-      min_chain_stat = min_chain_stat
-    )
+  out <- list(
+    chains_run = chains_run,
+    max_chain_stat = max_chain_stat,
+    min_chain_stat = min_chain_stat
+  )
 
   return(out)
 }
@@ -483,11 +479,11 @@ tail.epichains_tree <- function(x, ...) {
 #' cases_per_gen <- aggregate(chains, grouping_var = "generation")
 #' head(cases_per_gen)
 aggregate.epichains_tree <- function(x,
-                                grouping_var = c(
-                                  "time",
-                                  "generation"
-                                ),
-                                ...) {
+                                     grouping_var = c(
+                                       "time",
+                                       "generation"
+                                     ),
+                                     ...) {
   validate_epichains_tree(x)
 
   # Get grouping variable

From 56dc28268f7fd4ebb040aa021f93455a163eef1e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Wed, 8 Nov 2023 17:43:33 +0000
Subject: [PATCH 807/828] Linting: Fix indentation

---
 R/simulate.r | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index fb96c085..6d87c220 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -341,7 +341,8 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
     chains_run = nchains,
     statistic = statistic,
     stat_max = stat_max,
-    )
+    intvn_mean_reduction = intvn_mean_reduction
+  )
 
   return(out)
     intvn_mean_reduction = intvn_mean_reduction
@@ -556,6 +557,6 @@ simulate_tree_from_pop <- function(pop,
     stat_max = NULL,
     intvn_mean_reduction = intvn_mean_reduction,
     track_pop = TRUE
-    )
+  )
   return(out)
 }

From 940e05d0409fb4878beec6062353815c84d68947 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Tue, 28 Nov 2023 12:23:01 +0000
Subject: [PATCH 808/828] Remove intvn_mean_reduction argument

---
 R/epichains.R                | 16 +++-------------
 man/epichains_summary.Rd     |  9 +--------
 man/epichains_tree.Rd        |  7 -------
 man/new_epichains_summary.Rd |  9 +--------
 man/new_epichains_tree.Rd    |  7 -------
 5 files changed, 5 insertions(+), 43 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index cf609995..80416290 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -21,7 +21,6 @@ new_epichains_tree <- function(tree_df,
                                chains_run = integer(),
                                statistic = character(),
                                stat_max = integer(),
-                               intvn_mean_reduction = double(),
                                track_pop = logical()) {
   # Assemble the elements of the object
   obj <- structure(
@@ -29,7 +28,6 @@ new_epichains_tree <- function(tree_df,
     chains_run = chains_run,
     statistic = statistic,
     stat_max = stat_max,
-    intvn_mean_reduction = intvn_mean_reduction,
     track_pop = track_pop,
     class = c("epichains_tree", "data.frame")
   )
@@ -61,13 +59,11 @@ epichains_tree <- function(tree_df,
                            chains_run = integer(),
                            statistic = character(),
                            stat_max = integer(),
-                           intvn_mean_reduction = double(),
                            track_pop = logical()) {
   # Check that inputs are well specified
   checkmate::assert_data_frame(tree_df)
   checkmate::assert_integerish(chains_run, null.ok = TRUE)
   checkmate::assert_character(statistic, null.ok = TRUE)
-  checkmate::assert_double(intvn_mean_reduction)
   checkmate::assert_logical(track_pop)
   checkmate::assert_number(stat_max, null.ok = TRUE)
 
@@ -77,7 +73,6 @@ epichains_tree <- function(tree_df,
     chains_run = chains_run,
     statistic = statistic,
     stat_max = stat_max,
-    intvn_mean_reduction = intvn_mean_reduction,
     track_pop = track_pop
   )
 
@@ -108,15 +103,13 @@ epichains_tree <- function(tree_df,
 new_epichains_summary <- function(chains_summary,
                                   chains_run = integer(),
                                   statistic = character(),
-                                  stat_max = integer(),
-                                  intvn_mean_reduction = double()) {
+                                  stat_max = integer()) {
   # Assemble the elements of the object
   obj <- structure(
     chains_summary,
     chains_run = chains_run,
     statistic = statistic,
     stat_max = stat_max,
-    intvn_mean_reduction = intvn_mean_reduction,
     class = c("epichains_summary", "vector")
   )
   return(obj)
@@ -140,13 +133,11 @@ new_epichains_summary <- function(chains_summary,
 epichains_summary <- function(chains_summary,
                               chains_run = integer(),
                               statistic = character(),
-                              stat_max = integer(),
-                              intvn_mean_reduction = double()) {
+                              stat_max = integer()) {
   # Check that inputs are well specified
   checkmate::assert_vector(chains_summary)
   checkmate::assert_integerish(chains_run, null.ok = TRUE)
   checkmate::assert_character(statistic)
-  checkmate::assert_double(intvn_mean_reduction)
   checkmate::assert_number(stat_max, null.ok = TRUE)
 
   # Create <epichains_summary> object
@@ -154,8 +145,7 @@ epichains_summary <- function(chains_summary,
     chains_summary,
     chains_run = chains_run,
     statistic = statistic,
-    stat_max = stat_max,
-    intvn_mean_reduction = intvn_mean_reduction
+    stat_max = stat_max
   )
 
   # Validate the created object
diff --git a/man/epichains_summary.Rd b/man/epichains_summary.Rd
index 8350cd73..a4601fe0 100644
--- a/man/epichains_summary.Rd
+++ b/man/epichains_summary.Rd
@@ -8,8 +8,7 @@ epichains_summary(
   chains_summary,
   chains_run = integer(),
   statistic = character(),
-  stat_max = integer(),
-  intvn_mean_reduction = double()
+  stat_max = integer()
 )
 }
 \arguments{
@@ -29,12 +28,6 @@ Can be one of:
 \item{stat_max}{A cut off for the chain statistic (size/length) being
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
-
-\item{intvn_mean_reduction}{A number between 0
-and 1 for scaling/reducing the mean of \code{offspring_dist}. Serves as
-population-level intervention. \code{intvn_mean_reduction} = 0
-implies no intervention impact and \code{intvn_mean_reduction} = 1 implies full
-impact.}
 }
 \value{
 An \verb{<epichains_summary>} object
diff --git a/man/epichains_tree.Rd b/man/epichains_tree.Rd
index a83f2c68..39e483ea 100644
--- a/man/epichains_tree.Rd
+++ b/man/epichains_tree.Rd
@@ -9,7 +9,6 @@ epichains_tree(
   chains_run = integer(),
   statistic = character(),
   stat_max = integer(),
-  intvn_mean_reduction = double(),
   track_pop = logical()
 )
 }
@@ -33,12 +32,6 @@ Can be one of:
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
 
-\item{intvn_mean_reduction}{A number between 0
-and 1 for scaling/reducing the mean of \code{offspring_dist}. Serves as
-population-level intervention. \code{intvn_mean_reduction} = 0
-implies no intervention impact and \code{intvn_mean_reduction} = 1 implies full
-impact.}
-
 \item{track_pop}{Was the susceptible population tracked; Logical}
 }
 \value{
diff --git a/man/new_epichains_summary.Rd b/man/new_epichains_summary.Rd
index 23556cd0..972afd70 100644
--- a/man/new_epichains_summary.Rd
+++ b/man/new_epichains_summary.Rd
@@ -8,8 +8,7 @@ new_epichains_summary(
   chains_summary,
   chains_run = integer(),
   statistic = character(),
-  stat_max = integer(),
-  intvn_mean_reduction = double()
+  stat_max = integer()
 )
 }
 \arguments{
@@ -29,12 +28,6 @@ Can be one of:
 \item{stat_max}{A cut off for the chain statistic (size/length) being
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
-
-\item{intvn_mean_reduction}{A number between 0
-and 1 for scaling/reducing the mean of \code{offspring_dist}. Serves as
-population-level intervention. \code{intvn_mean_reduction} = 0
-implies no intervention impact and \code{intvn_mean_reduction} = 1 implies full
-impact.}
 }
 \description{
 \code{new_epichains_summary()} constructs an \verb{<epichains_summary>} object from a
diff --git a/man/new_epichains_tree.Rd b/man/new_epichains_tree.Rd
index 647185a6..8b5fee28 100644
--- a/man/new_epichains_tree.Rd
+++ b/man/new_epichains_tree.Rd
@@ -9,7 +9,6 @@ new_epichains_tree(
   chains_run = integer(),
   statistic = character(),
   stat_max = integer(),
-  intvn_mean_reduction = double(),
   track_pop = logical()
 )
 }
@@ -33,12 +32,6 @@ Can be one of:
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
 
-\item{intvn_mean_reduction}{A number between 0
-and 1 for scaling/reducing the mean of \code{offspring_dist}. Serves as
-population-level intervention. \code{intvn_mean_reduction} = 0
-implies no intervention impact and \code{intvn_mean_reduction} = 1 implies full
-impact.}
-
 \item{track_pop}{Was the susceptible population tracked; Logical}
 }
 \description{

From ae2978714c97904620ee96cbb2a1bfd14a6ed90d Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Tue, 28 Nov 2023 17:44:33 +0000
Subject: [PATCH 809/828] Improve documentation

---
 R/epichains.R                   | 6 ++++--
 man/aggregate.epichains_tree.Rd | 6 ++++--
 2 files changed, 8 insertions(+), 4 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 80416290..8af29d09 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -435,11 +435,13 @@ tail.epichains_tree <- function(x, ...) {
   utils::tail(as.data.frame(x), ...)
 }
 
-#' Aggregate cases in `<epichains_tree>` objects by "time" or "generation"
+#' Aggregate cases in `<epichains_tree>` objects by "generation" or "time", if
+#' present
 #'
 #' @description
 #' This function provides a quick way to create a time series of cases over
-#' time or generation from simulated `<epichains_tree>` objects.
+#' generation or time (if serials_dist was specified) from simulated
+#' `<epichains_tree>` objects.
 #'
 #' @param x An `<epichains_tree>` object.
 #' @param grouping_var The variable to aggregate by. Options include
diff --git a/man/aggregate.epichains_tree.Rd b/man/aggregate.epichains_tree.Rd
index b80d110a..07793b2c 100644
--- a/man/aggregate.epichains_tree.Rd
+++ b/man/aggregate.epichains_tree.Rd
@@ -2,7 +2,8 @@
 % Please edit documentation in R/epichains.R
 \name{aggregate.epichains_tree}
 \alias{aggregate.epichains_tree}
-\title{Aggregate cases in \verb{<epichains_tree>} objects by "time" or "generation"}
+\title{Aggregate cases in \verb{<epichains_tree>} objects by "generation" or "time", if
+present}
 \usage{
 \method{aggregate}{epichains_tree}(x, grouping_var = c("time", "generation"), ...)
 }
@@ -19,7 +20,8 @@ A \verb{<data.frame>} object of cases by \code{grouping_var}.
 }
 \description{
 This function provides a quick way to create a time series of cases over
-time or generation from simulated \verb{<epichains_tree>} objects.
+generation or time (if serials_dist was specified) from simulated
+\verb{<epichains_tree>} objects.
 }
 \examples{
 set.seed(123)

From ac86f872772a2670ba1ce321bd5a0fd0298f8e98 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Tue, 28 Nov 2023 17:53:03 +0000
Subject: [PATCH 810/828] Remove intvn_mean_reduction

---
 R/simulate.r | 3 ---
 1 file changed, 3 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 6d87c220..c129325a 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -243,7 +243,6 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
     chains_run = nchains,
     statistic = statistic,
     stat_max = stat_max,
-    intvn_mean_reduction = intvn_mean_reduction,
     track_pop = FALSE
   )
   return(out)
@@ -341,7 +340,6 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
     chains_run = nchains,
     statistic = statistic,
     stat_max = stat_max,
-    intvn_mean_reduction = intvn_mean_reduction
   )
 
   return(out)
@@ -555,7 +553,6 @@ simulate_tree_from_pop <- function(pop,
     chains_run = NULL,
     statistic = NULL,
     stat_max = NULL,
-    intvn_mean_reduction = intvn_mean_reduction,
     track_pop = TRUE
   )
   return(out)

From b17d3e2da9bfe657f28a6aaf8b39dacb98ac24ac Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Tue, 28 Nov 2023 17:57:47 +0000
Subject: [PATCH 811/828] Remove trailing comma

---
 R/simulate.r | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index c129325a..8023f599 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -339,7 +339,7 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
     chains_summary = stat_track,
     chains_run = nchains,
     statistic = statistic,
-    stat_max = stat_max,
+    stat_max = stat_max
   )
 
   return(out)

From 19bc296d665db9718e87c61747a9b21e5610aa8b Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Thu, 30 Nov 2023 14:33:31 +0000
Subject: [PATCH 812/828] Apply suggestions from code review

Co-authored-by: Sebastian Funk <sebastian.funk@lshtm.ac.uk>
---
 R/epichains.R | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 8af29d09..bfc85f6f 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -1,4 +1,4 @@
-#' Construct a `<epichains_tree>` object
+#' Construct an `<epichains_tree>` object
 #'
 #' @description
 #' `new_epichains_tree()` constructs an `<epichains_tree>` object from a
@@ -276,7 +276,7 @@ format.epichains_summary <- function(x, ...) {
 
 #' Summary method for `epichains_tree` class
 #'
-#' @param object An `epichains_tree` object
+#' @param object An `<epichains_tree>` object
 #' @param ... further arguments passed to or from other methods
 #'
 #' @return List of summaries
@@ -342,7 +342,7 @@ summary.epichains_summary <- function(object, ...) {
 #'
 #' @param x An R object
 #'
-#' @return logical, `TRUE` if the object is an `epichains_tree` and `FALSE`
+#' @return logical, `TRUE` if the object is an `<epichains_tree>` and `FALSE`
 #' otherwise
 #' @author James M. Azam
 #' @export

From e4679a06754ab2658c8d1dd619b41c09d166a436 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Thu, 30 Nov 2023 14:26:11 +0000
Subject: [PATCH 813/828] Rename chains_run to nchains

---
 R/epichains.R                   | 32 ++++++++++++++++----------------
 R/simulate.r                    |  6 +++---
 man/epichains_summary.Rd        |  4 ++--
 man/epichains_tree.Rd           |  5 ++---
 man/new_epichains_summary.Rd    |  4 ++--
 man/new_epichains_tree.Rd       |  4 ++--
 tests/testthat/test-epichains.R | 10 +++++-----
 tests/testthat/test-simulate.R  |  8 ++++----
 8 files changed, 36 insertions(+), 37 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index bfc85f6f..49676b65 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -11,21 +11,21 @@
 #' @param tree_df a `<data.frame>` containing at least columns for "chain_id",
 #' "ancestor", and "generation". Also has optional columns for "time", and
 #' "chain_id".
-#' @param chains_run Number of chains/cases used to generate the outbreak;
+#' @param nchains Number of chains/cases used to generate the outbreak;
 #' Integer
 #' @param track_pop Was the susceptible population tracked; Logical
 #' @inheritParams epichains_tree
 #' @author James M. Azam
 #' @keywords internal
 new_epichains_tree <- function(tree_df,
-                               chains_run = integer(),
+                               nchains = integer(),
                                statistic = character(),
                                stat_max = integer(),
                                track_pop = logical()) {
   # Assemble the elements of the object
   obj <- structure(
     tree_df,
-    chains_run = chains_run,
+    nchains = nchains,
     statistic = statistic,
     stat_max = stat_max,
     track_pop = track_pop,
@@ -56,13 +56,13 @@ new_epichains_tree <- function(tree_df,
 #' @author James M. Azam
 #' @export
 epichains_tree <- function(tree_df,
-                           chains_run = integer(),
+                           nchains = integer(),
                            statistic = character(),
                            stat_max = integer(),
                            track_pop = logical()) {
   # Check that inputs are well specified
   checkmate::assert_data_frame(tree_df)
-  checkmate::assert_integerish(chains_run, null.ok = TRUE)
+  checkmate::assert_integerish(nchains, null.ok = TRUE)
   checkmate::assert_character(statistic, null.ok = TRUE)
   checkmate::assert_logical(track_pop)
   checkmate::assert_number(stat_max, null.ok = TRUE)
@@ -70,7 +70,7 @@ epichains_tree <- function(tree_df,
   # Create <epichains_tree> object
   epichains_tree <- new_epichains_tree(
     tree_df = tree_df,
-    chains_run = chains_run,
+    nchains = nchains,
     statistic = statistic,
     stat_max = stat_max,
     track_pop = track_pop
@@ -101,13 +101,13 @@ epichains_tree <- function(tree_df,
 #' @author James M. Azam
 #' @keywords internal
 new_epichains_summary <- function(chains_summary,
-                                  chains_run = integer(),
+                                  nchains = integer(),
                                   statistic = character(),
                                   stat_max = integer()) {
   # Assemble the elements of the object
   obj <- structure(
     chains_summary,
-    chains_run = chains_run,
+    nchains = nchains,
     statistic = statistic,
     stat_max = stat_max,
     class = c("epichains_summary", "vector")
@@ -131,19 +131,19 @@ new_epichains_summary <- function(chains_summary,
 #' @author James M. Azam
 #' @export
 epichains_summary <- function(chains_summary,
-                              chains_run = integer(),
+                              nchains = integer(),
                               statistic = character(),
                               stat_max = integer()) {
   # Check that inputs are well specified
   checkmate::assert_vector(chains_summary)
-  checkmate::assert_integerish(chains_run, null.ok = TRUE)
+  checkmate::assert_integerish(nchains, null.ok = TRUE)
   checkmate::assert_character(statistic)
   checkmate::assert_number(stat_max, null.ok = TRUE)
 
   # Create <epichains_summary> object
   epichains_summary <- new_epichains_summary(
     chains_summary,
-    chains_run = chains_run,
+    nchains = nchains,
     statistic = statistic,
     stat_max = stat_max
   )
@@ -208,7 +208,7 @@ format.epichains_tree <- function(x, ...) {
       ),
       sprintf(
         "Chains simulated: %s",
-        chain_info[["chains_run"]]
+        chain_info[["nchains"]]
       ),
       sprintf(
         "Number of ancestors (known): %s",
@@ -287,7 +287,7 @@ summary.epichains_tree <- function(object, ...) {
   validate_epichains_tree(object)
 
   # Get the summaries
-  chains_run <- attr(object, "chains_run", exact = TRUE)
+  nchains <- attr(object, "nchains", exact = TRUE)
 
   max_time <- ifelse(("time" %in% names(object)), max(object$time), NA)
 
@@ -297,7 +297,7 @@ summary.epichains_tree <- function(object, ...) {
 
   # List of summaries
   out <- list(
-    chains_run = chains_run,
+    nchains = nchains,
     max_time = max_time,
     unique_ancestors = n_unique_ancestors,
     max_generation = max_generation
@@ -319,7 +319,7 @@ summary.epichains_summary <- function(object, ...) {
   validate_epichains_summary(object)
 
   # Get the summaries
-  chains_run <- attr(object, "chains_run", exact = TRUE)
+  nchains <- attr(object, "nchains", exact = TRUE)
 
 
   if (all(is.infinite(object))) {
@@ -330,7 +330,7 @@ summary.epichains_summary <- function(object, ...) {
   }
 
   out <- list(
-    chains_run = chains_run,
+    nchains = nchains,
     max_chain_stat = max_chain_stat,
     min_chain_stat = min_chain_stat
   )
diff --git a/R/simulate.r b/R/simulate.r
index 8023f599..53636b78 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -240,7 +240,7 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
 
   out <- epichains_tree(
     tree_df = tree_df,
-    chains_run = nchains,
+    nchains = nchains,
     statistic = statistic,
     stat_max = stat_max,
     track_pop = FALSE
@@ -337,7 +337,7 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
 
   out <- epichains_summary(
     chains_summary = stat_track,
-    chains_run = nchains,
+    nchains = nchains,
     statistic = statistic,
     stat_max = stat_max
   )
@@ -550,7 +550,7 @@ simulate_tree_from_pop <- function(pop,
 
   out <- epichains_tree(
     tree_df,
-    chains_run = NULL,
+    nchains = NULL,
     statistic = NULL,
     stat_max = NULL,
     track_pop = TRUE
diff --git a/man/epichains_summary.Rd b/man/epichains_summary.Rd
index a4601fe0..52788303 100644
--- a/man/epichains_summary.Rd
+++ b/man/epichains_summary.Rd
@@ -6,7 +6,7 @@
 \usage{
 epichains_summary(
   chains_summary,
-  chains_run = integer(),
+  nchains = integer(),
   statistic = character(),
   stat_max = integer()
 )
@@ -14,7 +14,7 @@ epichains_summary(
 \arguments{
 \item{chains_summary}{a \verb{<vector>} of chain sizes and lengths.}
 
-\item{chains_run}{Number of chains/cases used to generate the outbreak;
+\item{nchains}{Number of chains/cases used to generate the outbreak;
 Integer}
 
 \item{statistic}{String; Statistic (size/length) to calculate. Used to
diff --git a/man/epichains_tree.Rd b/man/epichains_tree.Rd
index 39e483ea..49a5de42 100644
--- a/man/epichains_tree.Rd
+++ b/man/epichains_tree.Rd
@@ -6,7 +6,7 @@
 \usage{
 epichains_tree(
   tree_df,
-  chains_run = integer(),
+  nchains = integer(),
   statistic = character(),
   stat_max = integer(),
   track_pop = logical()
@@ -17,8 +17,7 @@ epichains_tree(
 "ancestor", and "generation". Also has optional columns for "time", and
 "chain_id".}
 
-\item{chains_run}{Number of chains/cases used to generate the outbreak;
-Integer}
+\item{nchains}{Number of chains to simulate.}
 
 \item{statistic}{String; Statistic (size/length) to calculate. Used to
 determine stopping criteria for simulations when \code{stat_max} is finite.
diff --git a/man/new_epichains_summary.Rd b/man/new_epichains_summary.Rd
index 972afd70..e7293f58 100644
--- a/man/new_epichains_summary.Rd
+++ b/man/new_epichains_summary.Rd
@@ -6,7 +6,7 @@
 \usage{
 new_epichains_summary(
   chains_summary,
-  chains_run = integer(),
+  nchains = integer(),
   statistic = character(),
   stat_max = integer()
 )
@@ -14,7 +14,7 @@ new_epichains_summary(
 \arguments{
 \item{chains_summary}{a \verb{<vector>} of chain sizes and lengths.}
 
-\item{chains_run}{Number of chains/cases used to generate the outbreak;
+\item{nchains}{Number of chains/cases used to generate the outbreak;
 Integer}
 
 \item{statistic}{String; Statistic (size/length) to calculate. Used to
diff --git a/man/new_epichains_tree.Rd b/man/new_epichains_tree.Rd
index 8b5fee28..47b6eef4 100644
--- a/man/new_epichains_tree.Rd
+++ b/man/new_epichains_tree.Rd
@@ -6,7 +6,7 @@
 \usage{
 new_epichains_tree(
   tree_df,
-  chains_run = integer(),
+  nchains = integer(),
   statistic = character(),
   stat_max = integer(),
   track_pop = logical()
@@ -17,7 +17,7 @@ new_epichains_tree(
 "ancestor", and "generation". Also has optional columns for "time", and
 "chain_id".}
 
-\item{chains_run}{Number of chains/cases used to generate the outbreak;
+\item{nchains}{Number of chains/cases used to generate the outbreak;
 Integer}
 
 \item{statistic}{String; Statistic (size/length) to calculate. Used to
diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index 231f107f..794acc41 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -167,7 +167,7 @@ test_that("summary.epichains_tree works as expected", {
   expect_named(
     summary(tree_sim_raw),
     c(
-      "chains_run",
+      "nchains",
       "max_time",
       "unique_infectors",
       "max_generation"
@@ -176,7 +176,7 @@ test_that("summary.epichains_tree works as expected", {
   expect_named(
     summary(tree_sim_raw2),
     c(
-      "chains_run",
+      "nchains",
       "max_time",
       "unique_infectors",
       "max_generation"
@@ -185,7 +185,7 @@ test_that("summary.epichains_tree works as expected", {
   expect_named(
     summary(susc_outbreak_raw),
     c(
-      "chains_run",
+      "nchains",
       "max_time",
       "unique_infectors",
       "max_generation"
@@ -194,7 +194,7 @@ test_that("summary.epichains_tree works as expected", {
   expect_named(
     summary(susc_outbreak_raw2),
     c(
-      "chains_run",
+      "nchains",
       "max_time",
       "unique_infectors",
       "max_generation"
@@ -203,7 +203,7 @@ test_that("summary.epichains_tree works as expected", {
   expect_named(
     summary(chain_summary_raw),
     c(
-      "chains_run",
+      "nchains",
       "max_chain_stat",
       "min_chain_stat"
     )
diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index 2decf4b0..ddf23acd 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -235,7 +235,7 @@ test_that("simulate_tree is numerically correct", {
   tree_sim_summary <- summary(tree_sim_raw)
   #' Expectations
   expect_identical(
-    tree_sim_summary$chains_run,
+    tree_sim_summary$nchains,
     2.00
   )
   expect_identical(
@@ -264,7 +264,7 @@ test_that("simulate_tree is numerically correct", {
   )
   #' Expectations for intervention simulation
   expect_identical(
-    tree_sim_summary$chains_run,
+    tree_sim_summary$nchains,
     2.0
   )
   expect_identical(
@@ -306,7 +306,7 @@ test_that("simulate_summary is numerically correct", {
   chain_summary_summaries <- summary(chain_summary_raw)
   #' Expectations
   expect_identical(
-    chain_summary_summaries$chains_run,
+    chain_summary_summaries$nchains,
     2.00
   )
   expect_identical(
@@ -347,7 +347,7 @@ test_that("simulate_tree_from_pop is numerically correct", {
     susc_outbreak_summary$max_generation,
     1L
   )
-  expect_null(susc_outbreak_summary$chains_run)
+  expect_null(susc_outbreak_summary$nchains)
   expect_identical(
     susc_outbreak_raw$sim_id,
     1L

From f2d0330d7718522a7a047bd2b587769adeca4fe4 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Thu, 30 Nov 2023 14:27:50 +0000
Subject: [PATCH 814/828] Remove hardcoded superclass

---
 R/epichains.R | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 49676b65..75d86bc6 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -29,7 +29,7 @@ new_epichains_tree <- function(tree_df,
     statistic = statistic,
     stat_max = stat_max,
     track_pop = track_pop,
-    class = c("epichains_tree", "data.frame")
+    class = c("epichains_tree", class(tree_df))
   )
   return(obj)
 }
@@ -110,7 +110,7 @@ new_epichains_summary <- function(chains_summary,
     nchains = nchains,
     statistic = statistic,
     stat_max = stat_max,
-    class = c("epichains_summary", "vector")
+    class = c("epichains_summary", class(chains_summary))
   )
   return(obj)
 }

From 7ad28b6454041a72526f2f50445298133f3b2af8 Mon Sep 17 00:00:00 2001
From: James Azam <james.m.azam@gmail.com>
Date: Thu, 30 Nov 2023 14:31:42 +0000
Subject: [PATCH 815/828] Rename grouping_var to by

---
 R/epichains.R                   | 16 ++++++++--------
 man/aggregate.epichains_tree.Rd | 10 +++++-----
 tests/testthat/test-epichains.R | 10 +++++-----
 3 files changed, 18 insertions(+), 18 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index 75d86bc6..f7ffaa50 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -444,11 +444,11 @@ tail.epichains_tree <- function(x, ...) {
 #' `<epichains_tree>` objects.
 #'
 #' @param x An `<epichains_tree>` object.
-#' @param grouping_var The variable to aggregate by. Options include
+#' @param by The variable to aggregate by. Options include
 #' "time" and "generation".
 #' @param ... Other arguments passed to aggregate.
 #' @importFrom stats aggregate
-#' @return A `<data.frame>` object of cases by `grouping_var`.
+#' @return A `<data.frame>` object of cases by `by`.
 #' @author James M. Azam
 #' @export
 #' @examples
@@ -464,14 +464,14 @@ tail.epichains_tree <- function(x, ...) {
 #' chains
 #'
 #' # Aggregate cases per time
-#' cases_per_time <- aggregate(chains, grouping_var = "time")
+#' cases_per_time <- aggregate(chains, by = "time")
 #' head(cases_per_time)
 #'
 #' # Aggregate cases per generation
-#' cases_per_gen <- aggregate(chains, grouping_var = "generation")
+#' cases_per_gen <- aggregate(chains, by = "generation")
 #' head(cases_per_gen)
 aggregate.epichains_tree <- function(x,
-                                     grouping_var = c(
+                                     by = c(
                                        "time",
                                        "generation"
                                      ),
@@ -479,9 +479,9 @@ aggregate.epichains_tree <- function(x,
   validate_epichains_tree(x)
 
   # Get grouping variable
-  grouping_var <- match.arg(grouping_var)
+  by <- match.arg(by)
 
-  out <- if (grouping_var == "time") {
+  out <- if (by == "time") {
     if (is.null(x$time)) {
       stop(
         "Object must have a time column. ",
@@ -495,7 +495,7 @@ aggregate.epichains_tree <- function(x,
       list(time = x$time),
       FUN = NROW
     )
-  } else if (grouping_var == "generation") {
+  } else if (by == "generation") {
     # Count the number of cases per time
     stats::aggregate(
       list(cases = x$sim_id),
diff --git a/man/aggregate.epichains_tree.Rd b/man/aggregate.epichains_tree.Rd
index 07793b2c..dc1fff1f 100644
--- a/man/aggregate.epichains_tree.Rd
+++ b/man/aggregate.epichains_tree.Rd
@@ -5,18 +5,18 @@
 \title{Aggregate cases in \verb{<epichains_tree>} objects by "generation" or "time", if
 present}
 \usage{
-\method{aggregate}{epichains_tree}(x, grouping_var = c("time", "generation"), ...)
+\method{aggregate}{epichains_tree}(x, by = c("time", "generation"), ...)
 }
 \arguments{
 \item{x}{An \verb{<epichains_tree>} object.}
 
-\item{grouping_var}{The variable to aggregate by. Options include
+\item{by}{The variable to aggregate by. Options include
 "time" and "generation".}
 
 \item{...}{Other arguments passed to aggregate.}
 }
 \value{
-A \verb{<data.frame>} object of cases by \code{grouping_var}.
+A \verb{<data.frame>} object of cases by \code{by}.
 }
 \description{
 This function provides a quick way to create a time series of cases over
@@ -36,11 +36,11 @@ chains <- simulate_tree(
 chains
 
 # Aggregate cases per time
-cases_per_time <- aggregate(chains, grouping_var = "time")
+cases_per_time <- aggregate(chains, by = "time")
 head(cases_per_time)
 
 # Aggregate cases per generation
-cases_per_gen <- aggregate(chains, grouping_var = "generation")
+cases_per_gen <- aggregate(chains, by = "generation")
 head(cases_per_gen)
 }
 \author{
diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index 794acc41..45767eb8 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -408,11 +408,11 @@ test_that("aggregate.epichains_tree method returns correct objects", {
   #' Create aggregates
   aggreg_by_gen <- aggregate(
     tree_sim_raw2,
-    grouping_var = "generation"
+    by = "generation"
   )
   aggreg_by_time <- aggregate(
     tree_sim_raw2,
-    grouping_var = "time"
+    by = "time"
   )
   #' Expectations for aggregated <epichains_tree>
   expect_named(
@@ -435,7 +435,7 @@ test_that("aggregate.epichains_tree method throws errors", {
         stat_max = 10,
         lambda = 2
       ),
-      grouping_var = "time"
+      by = "time"
     ),
     "Object must have a time column"
   )
@@ -463,11 +463,11 @@ test_that("aggregate.epichains_tree method is numerically correct", {
   #' Create aggregates
   aggreg_by_gen <- aggregate(
     tree_sim_raw,
-    grouping_var = "generation"
+    by = "generation"
   )
   aggreg_by_time <- aggregate(
     tree_sim_raw2,
-    grouping_var = "time"
+    by = "time"
   )
   expect_identical(
     aggreg_by_gen$cases,

From 75c51c442085007792131cc7ab03376c5b115635 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Dec 2023 13:30:29 +0000
Subject: [PATCH 816/828] Rename nchains to ntrees

---
 R/epichains.R                   | 34 ++++++++++++++++-----------------
 R/simulate.r                    |  6 +++---
 man/epichains_summary.Rd        |  7 +++----
 man/epichains_tree.Rd           |  4 ++--
 man/new_epichains_summary.Rd    |  5 ++---
 man/new_epichains_tree.Rd       |  2 +-
 man/simulate_tree.Rd            |  2 +-
 tests/testthat/test-epichains.R | 10 +++++-----
 tests/testthat/test-simulate.R  | 12 ++++++------
 9 files changed, 40 insertions(+), 42 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index f7ffaa50..c4d30195 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -18,14 +18,14 @@
 #' @author James M. Azam
 #' @keywords internal
 new_epichains_tree <- function(tree_df,
-                               nchains = integer(),
+                               ntrees = integer(),
                                statistic = character(),
                                stat_max = integer(),
                                track_pop = logical()) {
   # Assemble the elements of the object
   obj <- structure(
     tree_df,
-    nchains = nchains,
+    ntrees = ntrees,
     statistic = statistic,
     stat_max = stat_max,
     track_pop = track_pop,
@@ -56,13 +56,13 @@ new_epichains_tree <- function(tree_df,
 #' @author James M. Azam
 #' @export
 epichains_tree <- function(tree_df,
-                           nchains = integer(),
+                           ntrees = integer(),
                            statistic = character(),
                            stat_max = integer(),
                            track_pop = logical()) {
   # Check that inputs are well specified
   checkmate::assert_data_frame(tree_df)
-  checkmate::assert_integerish(nchains, null.ok = TRUE)
+  checkmate::assert_integerish(ntrees, null.ok = TRUE)
   checkmate::assert_character(statistic, null.ok = TRUE)
   checkmate::assert_logical(track_pop)
   checkmate::assert_number(stat_max, null.ok = TRUE)
@@ -70,7 +70,7 @@ epichains_tree <- function(tree_df,
   # Create <epichains_tree> object
   epichains_tree <- new_epichains_tree(
     tree_df = tree_df,
-    nchains = nchains,
+    ntrees = ntrees,
     statistic = statistic,
     stat_max = stat_max,
     track_pop = track_pop
@@ -101,13 +101,13 @@ epichains_tree <- function(tree_df,
 #' @author James M. Azam
 #' @keywords internal
 new_epichains_summary <- function(chains_summary,
-                                  nchains = integer(),
+                                  ntrees = integer(),
                                   statistic = character(),
                                   stat_max = integer()) {
   # Assemble the elements of the object
   obj <- structure(
     chains_summary,
-    nchains = nchains,
+    ntrees = ntrees,
     statistic = statistic,
     stat_max = stat_max,
     class = c("epichains_summary", class(chains_summary))
@@ -131,19 +131,19 @@ new_epichains_summary <- function(chains_summary,
 #' @author James M. Azam
 #' @export
 epichains_summary <- function(chains_summary,
-                              nchains = integer(),
+                              ntrees = integer(),
                               statistic = character(),
                               stat_max = integer()) {
   # Check that inputs are well specified
   checkmate::assert_vector(chains_summary)
-  checkmate::assert_integerish(nchains, null.ok = TRUE)
+  checkmate::assert_integerish(ntrees, null.ok = TRUE)
   checkmate::assert_character(statistic)
   checkmate::assert_number(stat_max, null.ok = TRUE)
 
   # Create <epichains_summary> object
   epichains_summary <- new_epichains_summary(
     chains_summary,
-    nchains = nchains,
+    ntrees = ntrees,
     statistic = statistic,
     stat_max = stat_max
   )
@@ -207,8 +207,8 @@ format.epichains_tree <- function(x, ...) {
         "\n"
       ),
       sprintf(
-        "Chains simulated: %s",
-        chain_info[["nchains"]]
+        "Trees simulated: %s",
+        tree_info[["ntrees"]]
       ),
       sprintf(
         "Number of ancestors (known): %s",
@@ -257,7 +257,7 @@ format.epichains_summary <- function(x, ...) {
   writeLines(
     c(
       sprintf(
-        "\n Simulated chain %ss: \n",
+        "\n Simulated tree %ss: \n",
         attr(x, "statistic", exact = TRUE)
       ),
       sprintf(
@@ -287,7 +287,7 @@ summary.epichains_tree <- function(object, ...) {
   validate_epichains_tree(object)
 
   # Get the summaries
-  nchains <- attr(object, "nchains", exact = TRUE)
+  ntrees <- attr(object, "ntrees", exact = TRUE)
 
   max_time <- ifelse(("time" %in% names(object)), max(object$time), NA)
 
@@ -297,7 +297,7 @@ summary.epichains_tree <- function(object, ...) {
 
   # List of summaries
   out <- list(
-    nchains = nchains,
+    ntrees = ntrees,
     max_time = max_time,
     unique_ancestors = n_unique_ancestors,
     max_generation = max_generation
@@ -319,7 +319,7 @@ summary.epichains_summary <- function(object, ...) {
   validate_epichains_summary(object)
 
   # Get the summaries
-  nchains <- attr(object, "nchains", exact = TRUE)
+  ntrees <- attr(object, "ntrees", exact = TRUE)
 
 
   if (all(is.infinite(object))) {
@@ -330,7 +330,7 @@ summary.epichains_summary <- function(object, ...) {
   }
 
   out <- list(
-    nchains = nchains,
+    ntrees = ntrees,
     max_chain_stat = max_chain_stat,
     min_chain_stat = min_chain_stat
   )
diff --git a/R/simulate.r b/R/simulate.r
index 53636b78..19b1aa37 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -19,7 +19,7 @@
 #' of a user-defined named or anonymous function with only one argument `n`,
 #' representing the number of generation times to sample.
 #' @param t0 Start time (if generation time is given); either a single value
-#' or a vector of same length as `nchains` (number of simulations) with
+#' or a vector of same length as `ntrees` (number of simulations) with
 #' initial times. Defaults to 0.
 #' @param tf End time (if generation time is given).
 #' @param ... Parameters of the offspring distribution as required by R.
@@ -337,7 +337,7 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
 
   out <- epichains_summary(
     chains_summary = stat_track,
-    nchains = nchains,
+    ntrees = ntrees,
     statistic = statistic,
     stat_max = stat_max
   )
@@ -550,7 +550,7 @@ simulate_tree_from_pop <- function(pop,
 
   out <- epichains_tree(
     tree_df,
-    nchains = NULL,
+    ntrees = NULL,
     statistic = NULL,
     stat_max = NULL,
     track_pop = TRUE
diff --git a/man/epichains_summary.Rd b/man/epichains_summary.Rd
index 52788303..830a1dbe 100644
--- a/man/epichains_summary.Rd
+++ b/man/epichains_summary.Rd
@@ -6,7 +6,7 @@
 \usage{
 epichains_summary(
   chains_summary,
-  nchains = integer(),
+  ntrees = integer(),
   statistic = character(),
   stat_max = integer()
 )
@@ -14,15 +14,14 @@ epichains_summary(
 \arguments{
 \item{chains_summary}{a \verb{<vector>} of chain sizes and lengths.}
 
-\item{nchains}{Number of chains/cases used to generate the outbreak;
-Integer}
+\item{ntrees}{Number of initial cases used to generate the outbreak; Integer}
 
 \item{statistic}{String; Statistic (size/length) to calculate. Used to
 determine stopping criteria for simulations when \code{stat_max} is finite.
 Can be one of:
 \itemize{
 \item "size": the total number of offspring.
-\item "length": the total number of ancestors.
+\item "length": the total number of infectors.
 }}
 
 \item{stat_max}{A cut off for the chain statistic (size/length) being
diff --git a/man/epichains_tree.Rd b/man/epichains_tree.Rd
index 49a5de42..9aef38a0 100644
--- a/man/epichains_tree.Rd
+++ b/man/epichains_tree.Rd
@@ -6,7 +6,7 @@
 \usage{
 epichains_tree(
   tree_df,
-  nchains = integer(),
+  ntrees = integer(),
   statistic = character(),
   stat_max = integer(),
   track_pop = logical()
@@ -17,7 +17,7 @@ epichains_tree(
 "ancestor", and "generation". Also has optional columns for "time", and
 "chain_id".}
 
-\item{nchains}{Number of chains to simulate.}
+\item{ntrees}{Number of trees to simulate.}
 
 \item{statistic}{String; Statistic (size/length) to calculate. Used to
 determine stopping criteria for simulations when \code{stat_max} is finite.
diff --git a/man/new_epichains_summary.Rd b/man/new_epichains_summary.Rd
index e7293f58..c9b62589 100644
--- a/man/new_epichains_summary.Rd
+++ b/man/new_epichains_summary.Rd
@@ -6,7 +6,7 @@
 \usage{
 new_epichains_summary(
   chains_summary,
-  nchains = integer(),
+  ntrees = integer(),
   statistic = character(),
   stat_max = integer()
 )
@@ -14,8 +14,7 @@ new_epichains_summary(
 \arguments{
 \item{chains_summary}{a \verb{<vector>} of chain sizes and lengths.}
 
-\item{nchains}{Number of chains/cases used to generate the outbreak;
-Integer}
+\item{ntrees}{Number of initial cases used to generate the outbreak; Integer}
 
 \item{statistic}{String; Statistic (size/length) to calculate. Used to
 determine stopping criteria for simulations when \code{stat_max} is finite.
diff --git a/man/new_epichains_tree.Rd b/man/new_epichains_tree.Rd
index 47b6eef4..85ec5129 100644
--- a/man/new_epichains_tree.Rd
+++ b/man/new_epichains_tree.Rd
@@ -6,7 +6,7 @@
 \usage{
 new_epichains_tree(
   tree_df,
-  nchains = integer(),
+  ntrees = integer(),
   statistic = character(),
   stat_max = integer(),
   track_pop = logical()
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index 2a8f4d1c..ad66dc88 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -40,7 +40,7 @@ of a user-defined named or anonymous function with only one argument \code{n},
 representing the number of generation times to sample.}
 
 \item{t0}{Start time (if generation time is given); either a single value
-or a vector of same length as \code{nchains} (number of simulations) with
+or a vector of same length as \code{ntrees} (number of simulations) with
 initial times. Defaults to 0.}
 
 \item{tf}{End time (if generation time is given).}
diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index 45767eb8..abee20b5 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -167,7 +167,7 @@ test_that("summary.epichains_tree works as expected", {
   expect_named(
     summary(tree_sim_raw),
     c(
-      "nchains",
+      "ntrees",
       "max_time",
       "unique_infectors",
       "max_generation"
@@ -176,7 +176,7 @@ test_that("summary.epichains_tree works as expected", {
   expect_named(
     summary(tree_sim_raw2),
     c(
-      "nchains",
+      "ntrees",
       "max_time",
       "unique_infectors",
       "max_generation"
@@ -185,7 +185,7 @@ test_that("summary.epichains_tree works as expected", {
   expect_named(
     summary(susc_outbreak_raw),
     c(
-      "nchains",
+      "ntrees",
       "max_time",
       "unique_infectors",
       "max_generation"
@@ -194,7 +194,7 @@ test_that("summary.epichains_tree works as expected", {
   expect_named(
     summary(susc_outbreak_raw2),
     c(
-      "nchains",
+      "ntrees",
       "max_time",
       "unique_infectors",
       "max_generation"
@@ -203,7 +203,7 @@ test_that("summary.epichains_tree works as expected", {
   expect_named(
     summary(chain_summary_raw),
     c(
-      "nchains",
+      "ntrees",
       "max_chain_stat",
       "min_chain_stat"
     )
diff --git a/tests/testthat/test-simulate.R b/tests/testthat/test-simulate.R
index ddf23acd..c73b3828 100644
--- a/tests/testthat/test-simulate.R
+++ b/tests/testthat/test-simulate.R
@@ -235,7 +235,7 @@ test_that("simulate_tree is numerically correct", {
   tree_sim_summary <- summary(tree_sim_raw)
   #' Expectations
   expect_identical(
-    tree_sim_summary$nchains,
+    tree_sim_summary$ntrees,
     2.00
   )
   expect_identical(
@@ -264,7 +264,7 @@ test_that("simulate_tree is numerically correct", {
   )
   #' Expectations for intervention simulation
   expect_identical(
-    tree_sim_summary$nchains,
+    tree_sim_summary$ntrees,
     2.0
   )
   expect_identical(
@@ -306,15 +306,15 @@ test_that("simulate_summary is numerically correct", {
   chain_summary_summaries <- summary(chain_summary_raw)
   #' Expectations
   expect_identical(
-    chain_summary_summaries$nchains,
+    chain_summary_summaries$ntrees,
     2.00
   )
   expect_identical(
-    chain_summary_summaries$max_chain_stat,
+    chain_summary_summaries$max_stat,
     3.00
   )
   expect_identical(
-    chain_summary_summaries$min_chain_stat,
+    chain_summary_summaries$min_stat,
     1.00
   )
   expect_identical(
@@ -347,7 +347,7 @@ test_that("simulate_tree_from_pop is numerically correct", {
     susc_outbreak_summary$max_generation,
     1L
   )
-  expect_null(susc_outbreak_summary$nchains)
+  expect_null(susc_outbreak_summary$ntrees)
   expect_identical(
     susc_outbreak_raw$sim_id,
     1L

From 3634b10fc0d2f91ceaf47970a76d41ea04e51be5 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Dec 2023 13:32:43 +0000
Subject: [PATCH 817/828] Revise function docs

---
 R/simulate.r                  |  2 +-
 man/epichains_summary.Rd      |  6 +++---
 man/epichains_tree.Rd         | 13 +++++++------
 man/is_epichains_tree.Rd      |  2 +-
 man/new_epichains_tree.Rd     |  4 ++--
 man/simulate_tree.Rd          |  2 +-
 man/summary.epichains_tree.Rd |  6 +++---
 7 files changed, 18 insertions(+), 17 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 19b1aa37..47be9d24 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -23,7 +23,7 @@
 #' initial times. Defaults to 0.
 #' @param tf End time (if generation time is given).
 #' @param ... Parameters of the offspring distribution as required by R.
-#' @return An `<epichains>` object, which is basically a `<data.frame>` with
+#' @return An `<epichains_tree>` object, which is basically a `<data.frame>`
 #' columns `infectee_id`, `sim_id` (a unique ID within each simulation
 #' for each infectee), `infector_id`, `generation`, and `time` (of infection)
 #' @author James M. Azam, Sebastian Funk
diff --git a/man/epichains_summary.Rd b/man/epichains_summary.Rd
index 830a1dbe..cfe5f290 100644
--- a/man/epichains_summary.Rd
+++ b/man/epichains_summary.Rd
@@ -35,9 +35,9 @@ An \verb{<epichains_summary>} object
 \code{epichains_summary()} constructs an \verb{<epichains_summary>} object.
 
 An \verb{<epichains_summary>} object is a \verb{<vector>} of the simulated
-chain sizes or lengths. It also stores information on the
-number of cases/chains used for the simulation, and the statistic that was
-tracked, the intervention level.
+tree sizes or lengths. It also stores information on the number of initial
+cases used for the simulation, and the statistic that was tracked,
+the intervention level.
 }
 \author{
 James M. Azam
diff --git a/man/epichains_tree.Rd b/man/epichains_tree.Rd
index 9aef38a0..5f44afd7 100644
--- a/man/epichains_tree.Rd
+++ b/man/epichains_tree.Rd
@@ -31,7 +31,7 @@ Can be one of:
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
 
-\item{track_pop}{Was the susceptible population tracked; Logical}
+\item{track_pop}{Was the susceptible population tracked? Logical}
 }
 \value{
 An \verb{<epichains_tree>} object
@@ -40,13 +40,14 @@ An \verb{<epichains_tree>} object
 \code{epichains_tree()} constructs an \verb{<epichains_tree>} object, which is
 inherently an \verb{<data.frame>} object that stores some of the inputs
 passed to the \code{simulate_tree()} and \code{simulate_tree_from_pop()} and the
-simulated output. The stored attributes are useful for scenario
-analyses where the inputs are required for downstream analyses.
+simulated output. The stored attributes are useful for downstream
+analyses and reproducibility. This function checks the validity of the
+object created to ensure it has the right columns and column types.
 
 An \verb{<epichains_tree>} object contains a \verb{<data.frame>} of the simulated
-outbreak with ids for each case/chain and the chain the produced, the
-number of cases/chains used for the simulation, the statistic that was
-tracked, the intervention level, and whether the susceptible population was
+outbreak tree with ids for each infector and infectee, generation, and
+optionally, time, the number of initial cases used for the simulation,
+the statistic that was tracked, and whether the susceptible population was
 tracked.
 }
 \author{
diff --git a/man/is_epichains_tree.Rd b/man/is_epichains_tree.Rd
index 6b9d5045..e0d520a6 100644
--- a/man/is_epichains_tree.Rd
+++ b/man/is_epichains_tree.Rd
@@ -10,7 +10,7 @@ is_epichains_tree(x)
 \item{x}{An R object}
 }
 \value{
-logical, \code{TRUE} if the object is an \code{epichains_tree} and \code{FALSE}
+logical, \code{TRUE} if the object is an \verb{<epichains_tree>} and \code{FALSE}
 otherwise
 }
 \description{
diff --git a/man/new_epichains_tree.Rd b/man/new_epichains_tree.Rd
index 85ec5129..1092a470 100644
--- a/man/new_epichains_tree.Rd
+++ b/man/new_epichains_tree.Rd
@@ -2,7 +2,7 @@
 % Please edit documentation in R/epichains.R
 \name{new_epichains_tree}
 \alias{new_epichains_tree}
-\title{Construct a \verb{<epichains_tree>} object}
+\title{Construct an \verb{<epichains_tree>} object}
 \usage{
 new_epichains_tree(
   tree_df,
@@ -32,7 +32,7 @@ Can be one of:
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
 
-\item{track_pop}{Was the susceptible population tracked; Logical}
+\item{track_pop}{Was the susceptible population tracked? Logical}
 }
 \description{
 \code{new_epichains_tree()} constructs an \verb{<epichains_tree>} object from a
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index ad66dc88..c38b042a 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -48,7 +48,7 @@ initial times. Defaults to 0.}
 \item{...}{Parameters of the offspring distribution as required by R.}
 }
 \value{
-An \verb{<epichains>} object, which is basically a \verb{<data.frame>} with
+An \verb{<epichains_tree>} object, which is basically a \verb{<data.frame>}
 columns \code{infectee_id}, \code{sim_id} (a unique ID within each simulation
 for each infectee), \code{infector_id}, \code{generation}, and \code{time} (of infection)
 }
diff --git a/man/summary.epichains_tree.Rd b/man/summary.epichains_tree.Rd
index fe8a290d..a0758417 100644
--- a/man/summary.epichains_tree.Rd
+++ b/man/summary.epichains_tree.Rd
@@ -2,12 +2,12 @@
 % Please edit documentation in R/epichains.R
 \name{summary.epichains_tree}
 \alias{summary.epichains_tree}
-\title{Summary method for \code{epichains_tree} class}
+\title{Summary method for \verb{<epichains_tree>} class}
 \usage{
 \method{summary}{epichains_tree}(object, ...)
 }
 \arguments{
-\item{object}{An \code{epichains_tree} object}
+\item{object}{An \verb{<epichains_tree>} object}
 
 \item{...}{further arguments passed to or from other methods}
 }
@@ -15,7 +15,7 @@
 List of summaries
 }
 \description{
-Summary method for \code{epichains_tree} class
+Summary method for \verb{<epichains_tree>} class
 }
 \author{
 James M. Azam

From 428673fb7dcff8d34fda09f6025b74969b49d58e Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Dec 2023 13:34:35 +0000
Subject: [PATCH 818/828] Use new column names

---
 R/epichains.R                | 53 ++++++++++++++++++------------------
 man/epichains_tree.Rd        |  8 +++---
 man/head.epichains_tree.Rd   |  4 +--
 man/new_epichains_summary.Rd |  2 +-
 man/new_epichains_tree.Rd    | 11 ++++----
 man/simulate_tree.Rd         |  2 +-
 6 files changed, 39 insertions(+), 41 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index c4d30195..ee1d9eab 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -8,12 +8,11 @@
 #' `new_epichains_tree()` on its own as is called within `epichains_tree()`
 #' after the arguments have been checked. To create an `<epichains_tree>`
 #' object, use `epichains_tree()`.
-#' @param tree_df a `<data.frame>` containing at least columns for "chain_id",
-#' "ancestor", and "generation". Also has optional columns for "time", and
-#' "chain_id".
-#' @param nchains Number of chains/cases used to generate the outbreak;
-#' Integer
-#' @param track_pop Was the susceptible population tracked; Logical
+#' @param tree_df a `<data.frame>` containing at least columns for
+#' "infectee_id", "infector_id", and "generation". Also has optional columns
+#' for "time", and "chain_id".
+#' @param ntrees Number of initial cases used to generate the outbreak; Integer
+#' @param track_pop Was the susceptible population tracked? Logical
 #' @inheritParams epichains_tree
 #' @author James M. Azam
 #' @keywords internal
@@ -196,7 +195,7 @@ format.epichains_tree <- function(x, ...) {
   writeLines(sprintf("`<epichains_tree>` object\n"))
 
   # print head of the object
-  writeLines("< tree head (from first known ancestor) >\n")
+  writeLines("< tree head (from first known infector_id) >\n")
   print(head(x))
 
   # print summary information
@@ -211,8 +210,8 @@ format.epichains_tree <- function(x, ...) {
         tree_info[["ntrees"]]
       ),
       sprintf(
-        "Number of ancestors (known): %s",
-        chain_info[["unique_ancestors"]]
+        "Number of infectors (known): %s",
+        tree_info[["unique_infectors"]]
       ),
       sprintf(
         "Number of generations: %s",
@@ -291,7 +290,9 @@ summary.epichains_tree <- function(object, ...) {
 
   max_time <- ifelse(("time" %in% names(object)), max(object$time), NA)
 
-  n_unique_ancestors <- length(unique(object$ancestor[!is.na(object$ancestor)]))
+  n_unique_infectors <- length(
+    unique(object$infector_id[!is.na(object$infector_id)])
+  )
 
   max_generation <- max(object$generation)
 
@@ -299,7 +300,7 @@ summary.epichains_tree <- function(object, ...) {
   out <- list(
     ntrees = ntrees,
     max_time = max_time,
-    unique_ancestors = n_unique_ancestors,
+    unique_infectors = n_unique_infectors,
     max_generation = max_generation
   )
 
@@ -375,19 +376,17 @@ validate_epichains_tree <- function(x) {
   }
 
   # check for class invariants
-
-  if (is_chains_tree(x)) {
-    stopifnot(
-      "object does not contain the correct columns" =
-        c("sim_id", "infector_id", "generation") %in%
-        colnames(x),
+  stopifnot(
+    "object does not contain the correct columns" =
+      c("sim_id", "infector_id", "generation") %in%
+      colnames(x),
     "column `sim_id` must be a numeric" =
-        is.numeric(x$sim_id),
-      "column `infector_id` must be a numeric" =
-        is.numeric(x$infector_id),
-      "column `generation` must be a numeric" =
-        is.numeric(x$generation)
-    )
+      is.numeric(x$sim_id),
+    "column `infector_id` must be a numeric" =
+      is.numeric(x$infector_id),
+    "column `generation` must be a numeric" =
+      is.numeric(x$generation)
+  )
 
   invisible(x)
 }
@@ -418,14 +417,14 @@ validate_epichains_summary <- function(x) {
 #' @export
 #' @details
 #' This returns the top rows of an `<epichains_tree>` object. Note that
-#' the object is originally sorted by `sim_id` and `ancestor` and the first
-#' unknown ancestors (NA) have been dropped from
+#' the object is originally sorted by `sim_id` and `infector_id` and the first
+#' unknown infectors (NA) have been dropped from
 #' printing method.
 #'
 #' To view the full output, use `as.data.frame(<object_name>)`.
 head.epichains_tree <- function(x, ...) {
-  # print head of the simulation output from the first known ancestor
-  x <- x[!is.na(x$ancestor), ]
+  # print head of the simulation output from the first known infector_id
+  x <- x[!is.na(x$infector_id), ]
   utils::head(as.data.frame(x), ...)
 }
 
diff --git a/man/epichains_tree.Rd b/man/epichains_tree.Rd
index 5f44afd7..553efa36 100644
--- a/man/epichains_tree.Rd
+++ b/man/epichains_tree.Rd
@@ -13,9 +13,9 @@ epichains_tree(
 )
 }
 \arguments{
-\item{tree_df}{a \verb{<data.frame>} containing at least columns for "chain_id",
-"ancestor", and "generation". Also has optional columns for "time", and
-"chain_id".}
+\item{tree_df}{a \verb{<data.frame>} containing at least columns for
+"infectee_id", "infector_id", and "generation". Also has optional columns
+for "time", and "chain_id".}
 
 \item{ntrees}{Number of trees to simulate.}
 
@@ -24,7 +24,7 @@ determine stopping criteria for simulations when \code{stat_max} is finite.
 Can be one of:
 \itemize{
 \item "size": the total number of offspring.
-\item "length": the total number of ancestors.
+\item "length": the total number of infectors.
 }}
 
 \item{stat_max}{A cut off for the chain statistic (size/length) being
diff --git a/man/head.epichains_tree.Rd b/man/head.epichains_tree.Rd
index b758772f..b0f6675c 100644
--- a/man/head.epichains_tree.Rd
+++ b/man/head.epichains_tree.Rd
@@ -22,8 +22,8 @@ Object of class \code{data.frame}
 }
 \details{
 This returns the top rows of an \verb{<epichains_tree>} object. Note that
-the object is originally sorted by \code{sim_id} and \code{ancestor} and the first
-unknown ancestors (NA) have been dropped from
+the object is originally sorted by \code{sim_id} and \code{infector_id} and the first
+unknown infectors (NA) have been dropped from
 printing method.
 
 To view the full output, use \verb{as.data.frame(<object_name>)}.
diff --git a/man/new_epichains_summary.Rd b/man/new_epichains_summary.Rd
index c9b62589..3b55586c 100644
--- a/man/new_epichains_summary.Rd
+++ b/man/new_epichains_summary.Rd
@@ -21,7 +21,7 @@ determine stopping criteria for simulations when \code{stat_max} is finite.
 Can be one of:
 \itemize{
 \item "size": the total number of offspring.
-\item "length": the total number of ancestors.
+\item "length": the total number of infectors.
 }}
 
 \item{stat_max}{A cut off for the chain statistic (size/length) being
diff --git a/man/new_epichains_tree.Rd b/man/new_epichains_tree.Rd
index 1092a470..d321d100 100644
--- a/man/new_epichains_tree.Rd
+++ b/man/new_epichains_tree.Rd
@@ -13,19 +13,18 @@ new_epichains_tree(
 )
 }
 \arguments{
-\item{tree_df}{a \verb{<data.frame>} containing at least columns for "chain_id",
-"ancestor", and "generation". Also has optional columns for "time", and
-"chain_id".}
+\item{tree_df}{a \verb{<data.frame>} containing at least columns for
+"infectee_id", "infector_id", and "generation". Also has optional columns
+for "time", and "chain_id".}
 
-\item{nchains}{Number of chains/cases used to generate the outbreak;
-Integer}
+\item{ntrees}{Number of initial cases used to generate the outbreak; Integer}
 
 \item{statistic}{String; Statistic (size/length) to calculate. Used to
 determine stopping criteria for simulations when \code{stat_max} is finite.
 Can be one of:
 \itemize{
 \item "size": the total number of offspring.
-\item "length": the total number of ancestors.
+\item "length": the total number of infectors.
 }}
 
 \item{stat_max}{A cut off for the chain statistic (size/length) being
diff --git a/man/simulate_tree.Rd b/man/simulate_tree.Rd
index c38b042a..ecae3e01 100644
--- a/man/simulate_tree.Rd
+++ b/man/simulate_tree.Rd
@@ -49,7 +49,7 @@ initial times. Defaults to 0.}
 }
 \value{
 An \verb{<epichains_tree>} object, which is basically a \verb{<data.frame>}
-columns \code{infectee_id}, \code{sim_id} (a unique ID within each simulation
+with columns \code{infectee_id}, \code{sim_id} (a unique ID within each simulation
 for each infectee), \code{infector_id}, \code{generation}, and \code{time} (of infection)
 }
 \description{

From cdf93aef304abb71b852ade13fb56a475e6679ca Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Dec 2023 13:35:15 +0000
Subject: [PATCH 819/828] Reword function documentation

---
 R/epichains.R | 17 +++++++++--------
 R/simulate.r  |  2 +-
 2 files changed, 10 insertions(+), 9 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index ee1d9eab..e9978293 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -39,13 +39,14 @@ new_epichains_tree <- function(tree_df,
 #' `epichains_tree()` constructs an `<epichains_tree>` object, which is
 #' inherently an `<data.frame>` object that stores some of the inputs
 #' passed to the `simulate_tree()` and `simulate_tree_from_pop()` and the
-#' simulated output. The stored attributes are useful for scenario
-#' analyses where the inputs are required for downstream analyses.
+#' simulated output. The stored attributes are useful for downstream
+#' analyses and reproducibility. This function checks the validity of the
+#' object created to ensure it has the right columns and column types.
 #'
 #' An `<epichains_tree>` object contains a `<data.frame>` of the simulated
-#' outbreak with ids for each case/chain and the chain the produced, the
-#' number of cases/chains used for the simulation, the statistic that was
-#' tracked, the intervention level, and whether the susceptible population was
+#' outbreak tree with ids for each infector and infectee, generation, and
+#' optionally, time, the number of initial cases used for the simulation,
+#' the statistic that was tracked, and whether the susceptible population was
 #' tracked.
 #'
 #' @inheritParams simulate_tree
@@ -120,9 +121,9 @@ new_epichains_summary <- function(chains_summary,
 #' `epichains_summary()` constructs an `<epichains_summary>` object.
 #'
 #' An `<epichains_summary>` object is a `<vector>` of the simulated
-#' chain sizes or lengths. It also stores information on the
-#' number of cases/chains used for the simulation, and the statistic that was
-#' tracked, the intervention level.
+#' tree sizes or lengths. It also stores information on the number of initial
+#' cases used for the simulation, and the statistic that was tracked,
+#' the intervention level.
 #'
 #' @inheritParams new_epichains_summary
 #'
diff --git a/R/simulate.r b/R/simulate.r
index 47be9d24..fc36e8ef 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -24,7 +24,7 @@
 #' @param tf End time (if generation time is given).
 #' @param ... Parameters of the offspring distribution as required by R.
 #' @return An `<epichains_tree>` object, which is basically a `<data.frame>`
-#' columns `infectee_id`, `sim_id` (a unique ID within each simulation
+#' with columns `infectee_id`, `sim_id` (a unique ID within each simulation
 #' for each infectee), `infector_id`, `generation`, and `time` (of infection)
 #' @author James M. Azam, Sebastian Funk
 #' @export

From ab08a9ced300d9a49fcac052b783c7a9580eb983 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Dec 2023 13:36:20 +0000
Subject: [PATCH 820/828] Rename variables for clarity

---
 R/epichains.R                   | 24 ++++++++++++------------
 tests/testthat/test-epichains.R |  8 ++++----
 2 files changed, 16 insertions(+), 16 deletions(-)

diff --git a/R/epichains.R b/R/epichains.R
index e9978293..856692bd 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -191,7 +191,7 @@ format.epichains_tree <- function(x, ...) {
   validate_epichains_tree(x)
 
   # summarise the information stored in x
-  chain_info <- summary(x)
+  tree_info <- summary(x)
 
   writeLines(sprintf("`<epichains_tree>` object\n"))
 
@@ -216,7 +216,7 @@ format.epichains_tree <- function(x, ...) {
       ),
       sprintf(
         "Number of generations: %s",
-        chain_info[["max_generation"]]
+        tree_info[["max_generation"]]
       )
     )
   )
@@ -244,14 +244,14 @@ format.epichains_summary <- function(x, ...) {
   validate_epichains_summary(x)
 
   # summarise the information stored in x
-  chain_info <- summary(x)
+  statistics <- summary(x)
 
   writeLines(sprintf("`epichains_summary` object \n"))
   print(as.vector(x))
   writeLines(
     sprintf(
-      "\n Number of chains simulated: %s",
-      chain_info[["unique_chains"]]
+      "\n Number of trees simulated: %s",
+      statistics[["unique_trees"]]
     )
   )
   writeLines(
@@ -262,11 +262,11 @@ format.epichains_summary <- function(x, ...) {
       ),
       sprintf(
         "Max: %s",
-        chain_info[["max_chain_stat"]]
+        statistics[["max_tree_stat"]]
       ),
       sprintf(
         "Min: %s",
-        chain_info[["min_chain_stat"]]
+        statistics[["min_stat"]]
       )
     )
   )
@@ -325,16 +325,16 @@ summary.epichains_summary <- function(object, ...) {
 
 
   if (all(is.infinite(object))) {
-    max_chain_stat <- min_chain_stat <- Inf
+    max_stat <- min_stat <- Inf
   } else {
-    max_chain_stat <- max(object[!is.infinite(object)])
-    min_chain_stat <- min(object[!is.infinite(object)])
+    max_stat <- max(object[!is.infinite(object)])
+    min_stat <- min(object[!is.infinite(object)])
   }
 
   out <- list(
     ntrees = ntrees,
-    max_chain_stat = max_chain_stat,
-    min_chain_stat = min_chain_stat
+    max_stat = max_stat,
+    min_stat = min_stat
   )
 
   return(out)
diff --git a/tests/testthat/test-epichains.R b/tests/testthat/test-epichains.R
index abee20b5..0789f37a 100644
--- a/tests/testthat/test-epichains.R
+++ b/tests/testthat/test-epichains.R
@@ -204,18 +204,18 @@ test_that("summary.epichains_tree works as expected", {
     summary(chain_summary_raw),
     c(
       "ntrees",
-      "max_chain_stat",
-      "min_chain_stat"
+      "max_stat",
+      "min_stat"
     )
   )
   expect_true(
     is.infinite(
-      summary(epichains_summary_all_infs)$min_chain_stat
+      summary(epichains_summary_all_infs)$min_stat
     )
   )
   expect_true(
     is.infinite(
-      summary(epichains_summary_all_infs)$max_chain_stat
+      summary(epichains_summary_all_infs)$max_stat
     )
   )
 })

From 629d1d46d20928eb562332b3e11712194dc7e38a Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Dec 2023 13:37:07 +0000
Subject: [PATCH 821/828] Remove rownames (got lost in merge conflicts)

---
 R/simulate.r | 7 ++++---
 1 file changed, 4 insertions(+), 3 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index fc36e8ef..9cd1384e 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -235,9 +235,9 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
     tree_df <- tree_df[tree_df$time < tf, ]
   }
 
-  # sort by sim_id and ancestor
-  tree_df <- tree_df[order(tree_df$sim_id, tree_df$ancestor), ]
-
+  # sort by sim_id and infector_id
+  tree_df <- tree_df[order(tree_df$sim_id, tree_df$infector_id), ]
+  rownames(tree_df) <- NULL
   out <- epichains_tree(
     tree_df = tree_df,
     nchains = nchains,
@@ -547,6 +547,7 @@ simulate_tree_from_pop <- function(pop,
   # sort by sim_id and infector
   tree_df <- tree_df[order(tree_df$sim_id, tree_df$infector_id), ]
   tree_df$offspring_generated <- NULL
+  rownames(tree_df) <- NULL
 
   out <- epichains_tree(
     tree_df,

From 5f4e014204d341cae6aa242a367582bc08e685e7 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Dec 2023 13:37:20 +0000
Subject: [PATCH 822/828] Fix a doc

---
 R/epichains.R | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/R/epichains.R b/R/epichains.R
index 856692bd..1098eb7f 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -274,7 +274,7 @@ format.epichains_summary <- function(x, ...) {
   invisible(x)
 }
 
-#' Summary method for `epichains_tree` class
+#' Summary method for `<epichains_tree>` class
 #'
 #' @param object An `<epichains_tree>` object
 #' @param ... further arguments passed to or from other methods

From bea366f2f9666e318cc68149e842da7e699c566b Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Dec 2023 13:37:51 +0000
Subject: [PATCH 823/828] Remove unwanted variable

---
 R/simulate.r | 1 -
 1 file changed, 1 deletion(-)

diff --git a/R/simulate.r b/R/simulate.r
index 9cd1384e..9e772bae 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -343,7 +343,6 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
   )
 
   return(out)
-    intvn_mean_reduction = intvn_mean_reduction
 }
 
 #' Simulate transmission trees from a susceptible or partially immune

From 9dec1926034e0a1ffb8574df20eb4bc6213a33a1 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Dec 2023 13:38:24 +0000
Subject: [PATCH 824/828] Replace chain with trees in comments to remove
 confusion

---
 R/simulate.r | 22 +++++++++++-----------
 1 file changed, 11 insertions(+), 11 deletions(-)

diff --git a/R/simulate.r b/R/simulate.r
index 9e772bae..06e4bf0a 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -136,10 +136,10 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
   }
 
   # Initialisations
-  stat_track <- rep(1, ntrees) # track length or size (depending on `statistic`) #nolint
+  stat_track <- rep(1, ntrees) # track length or size (depending on `statistic`)
   n_offspring <- rep(1, ntrees) # current number of offspring
-  sim <- seq_len(ntrees) # track chains that are still being simulated
-  infector_ids <- rep(1, ntrees) # all chains start in generation 1
+  sim <- seq_len(ntrees) # track trees that are still being simulated
+  infector_ids <- rep(1, ntrees)
 
   # initialise data frame to hold the transmission trees
   generation <- 1L
@@ -155,7 +155,7 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
     times <- tree_df$time
   }
 
-  # next, simulate n chains
+  # next, simulate n trees
   while (length(sim) > 0) {
     # simulate next generation
     next_gen <- do.call(
@@ -216,12 +216,12 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
       tree_df <- rbind(tree_df, new_df)
     }
 
-    ## only continue to simulate chains that have offspring and aren't of
+    ## only continue to simulate trees that have offspring and aren't of
     ## the specified maximum size/length
     sim <- which(n_offspring > 0 & stat_track < stat_max)
     if (length(sim) > 0) {
       if (!missing(generation_time)) {
-        ## only continue to simulate chains that don't go beyond tf
+        ## only continue to simulate trees that don't go beyond tf
         sim <- intersect(sim, unique(indices)[current_min_time < tf])
       }
       if (!missing(generation_time)) {
@@ -240,7 +240,7 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
   rownames(tree_df) <- NULL
   out <- epichains_tree(
     tree_df = tree_df,
-    nchains = nchains,
+    ntrees = ntrees,
     statistic = statistic,
     stat_max = stat_max,
     track_pop = FALSE
@@ -297,9 +297,9 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
   # Initialisations
   stat_track <- rep(1, ntrees) ## track length or size (depending on `stat`)
   n_offspring <- rep(1, ntrees) ## current number of offspring
-  sim <- seq_len(ntrees) ## track chains that are still being simulated
+  sim <- seq_len(ntrees) ## track trees that are still being simulated
 
-  ## next, simulate ntrees chains
+  ## next, simulate ntrees trees
   while (length(sim) > 0) {
     ## simulate next generation
     next_gen <- do.call(
@@ -328,7 +328,7 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
       n_offspring = n_offspring
     )
 
-    ## only continue to simulate chains that offspring and aren't of
+    ## only continue to simulate trees that have offspring and aren't of
     ## stat_max size/length
     sim <- which(n_offspring > 0 & stat_track < stat_max)
   }
@@ -491,7 +491,7 @@ simulate_tree_from_pop <- function(pop,
   susc <- pop - initial_immune - 1L
   t <- t0
 
-  ## continue if any unsimulated chains have t <= tf
+  ## continue if any unsimulated trees have t <= tf
   ## AND there is still susceptibles left
   while (any(tree_df$time[!tree_df$offspring_generated] <= tf) && susc > 0) {
     ## select from which case to generate offspring

From 1bdfc7314023be18abc99f2da119ba2c7cec1cb4 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Dec 2023 13:38:32 +0000
Subject: [PATCH 825/828] Update snaps

---
 tests/testthat/_snaps/epichains.md | 58 ++++--------------------------
 1 file changed, 7 insertions(+), 51 deletions(-)

diff --git a/tests/testthat/_snaps/epichains.md b/tests/testthat/_snaps/epichains.md
index f115828c..92533fdc 100644
--- a/tests/testthat/_snaps/epichains.md
+++ b/tests/testthat/_snaps/epichains.md
@@ -5,14 +5,12 @@
     Output
       `<epichains_tree>` object
       
-      < tree head (from first known infector) >
+      < tree head (from first known infector_id) >
       
       [1] sim_id      infector_id generation  time       
       <0 rows> (or 0-length row.names)
       
       
-        sim_id infector_id generation time
-      1      1          NA          1    0
       Number of infectors (known): 0
       Number of generations: 1
       Use `as.data.frame(<object_name>)` to view the full output in the console.
@@ -24,7 +22,7 @@
     Output
       `<epichains_tree>` object
       
-      < tree head (from first known infector) >
+      < tree head (from first known infector_id) >
       
         sim_id infector_id generation     time
       2      2           1          2 42.57973
@@ -35,13 +33,6 @@
       7      7           3          4 78.73481
       
       
-         sim_id infector_id generation     time
-      7       7           3          4 78.73481
-      8       8           5          5 47.03948
-      9       9           6          5 45.38534
-      10     10           9          6 46.14505
-      11     11           8          6 48.03103
-      12     12           7          5 81.49185
       Number of infectors (known): 9
       Number of generations: 6
       Use `as.data.frame(<object_name>)` to view the full output in the console.
@@ -53,7 +44,7 @@
     Output
       `<epichains_tree>` object
       
-      < tree head (from first known infector) >
+      < tree head (from first known infector_id) >
       
         infectee_id sim_id infector_id generation
       3           1      2           1          2
@@ -64,14 +55,7 @@
       8           2      4           2          3
       
       
-         infectee_id sim_id infector_id generation
-      12           1      6           4          3
-      13           1      7           4          3
-      14           2      7           6          4
-      15           2      8           6          4
-      16           1      8           7          4
-      17           2      9           8          5
-      Chains simulated: 2
+      Trees simulated: 2
       Number of infectors (known): 7
       Number of generations: 5
       Use `as.data.frame(<object_name>)` to view the full output in the console.
@@ -83,7 +67,7 @@
     Output
       `<epichains_tree>` object
       
-      < tree head (from first known infector) >
+      < tree head (from first known infector_id) >
       
          infectee_id sim_id infector_id generation      time
       11           1      2           1          2 2.6525084
@@ -94,14 +78,7 @@
       16           7      2           1          2 1.3509058
       
       
-          infectee_id sim_id infector_id generation      time
-      119           9     15           8          5 19.146936
-      120           2     16           8          4  2.941326
-      121           9     16           8          5 17.447014
-      122          10     16           9          4 17.017684
-      123           2     17           9          4  7.368167
-      124           2     18           9          4  7.931447
-      Chains simulated: 10
+      Trees simulated: 10
       Number of infectors (known): 9
       Number of generations: 5
       Use `as.data.frame(<object_name>)` to view the full output in the console.
@@ -115,9 +92,8 @@
       
       [1] 9 6
       
-       Simulated chain lengths: 
+       Simulated tree lengths: 
       
-      Max: 9
       Min: 6
 
 # head and tail print output as expected
@@ -125,8 +101,6 @@
     Code
       head(susc_outbreak_raw)
     Output
-      < tree head (from first known infector) >
-      
       [1] sim_id      infector_id generation  time       
       <0 rows> (or 0-length row.names)
 
@@ -135,8 +109,6 @@
     Code
       head(susc_outbreak_raw2)
     Output
-      < tree head (from first known infector) >
-      
         sim_id infector_id generation     time
       2      2           1          2 42.57973
       3      3           2          3 42.80500
@@ -150,8 +122,6 @@
     Code
       head(tree_sim_raw)
     Output
-      < tree head (from first known infector) >
-      
         infectee_id sim_id infector_id generation
       3           1      2           1          2
       4           2      2           1          2
@@ -165,8 +135,6 @@
     Code
       head(tree_sim_raw2)
     Output
-      < tree head (from first known infector) >
-      
          infectee_id sim_id infector_id generation      time
       11           1      2           1          2 2.6525084
       12           2      2           1          2 0.2397245
@@ -180,9 +148,6 @@
     Code
       tail(susc_outbreak_raw)
     Output
-      
-      < tree tail >
-      
         sim_id infector_id generation time
       1      1          NA          1    0
 
@@ -191,9 +156,6 @@
     Code
       tail(susc_outbreak_raw2)
     Output
-      
-      < tree tail >
-      
          sim_id infector_id generation     time
       7       7           3          4 78.73481
       8       8           5          5 47.03948
@@ -207,9 +169,6 @@
     Code
       tail(tree_sim_raw)
     Output
-      
-      < tree tail >
-      
          infectee_id sim_id infector_id generation
       12           1      6           4          3
       13           1      7           4          3
@@ -223,9 +182,6 @@
     Code
       tail(tree_sim_raw2)
     Output
-      
-      < tree tail >
-      
           infectee_id sim_id infector_id generation      time
       119           9     15           8          5 19.146936
       120           2     16           8          4  2.941326

From e2c27a27df5f0132f321f852944633cba34a88ab Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Dec 2023 15:27:38 +0000
Subject: [PATCH 826/828] Return offspring as part of object

---
 R/epichains.R                | 12 ++++++++++++
 R/simulate.r                 |  3 +++
 man/epichains_summary.Rd     |  6 ++++++
 man/epichains_tree.Rd        |  6 ++++++
 man/new_epichains_summary.Rd |  6 ++++++
 man/new_epichains_tree.Rd    |  6 ++++++
 6 files changed, 39 insertions(+)

diff --git a/R/epichains.R b/R/epichains.R
index 1098eb7f..a4955e64 100644
--- a/R/epichains.R
+++ b/R/epichains.R
@@ -19,6 +19,7 @@
 new_epichains_tree <- function(tree_df,
                                ntrees = integer(),
                                statistic = character(),
+                               offspring_dist = character(),
                                stat_max = integer(),
                                track_pop = logical()) {
   # Assemble the elements of the object
@@ -26,6 +27,7 @@ new_epichains_tree <- function(tree_df,
     tree_df,
     ntrees = ntrees,
     statistic = statistic,
+    offspring_dist = offspring_dist,
     stat_max = stat_max,
     track_pop = track_pop,
     class = c("epichains_tree", class(tree_df))
@@ -58,12 +60,15 @@ new_epichains_tree <- function(tree_df,
 epichains_tree <- function(tree_df,
                            ntrees = integer(),
                            statistic = character(),
+                           offspring_dist = character(),
                            stat_max = integer(),
                            track_pop = logical()) {
   # Check that inputs are well specified
   checkmate::assert_data_frame(tree_df)
   checkmate::assert_integerish(ntrees, null.ok = TRUE)
   checkmate::assert_character(statistic, null.ok = TRUE)
+  check_offspring_valid(offspring_dist)
+  check_offspring_func_valid(paste0("r", offspring_dist))
   checkmate::assert_logical(track_pop)
   checkmate::assert_number(stat_max, null.ok = TRUE)
 
@@ -72,6 +77,7 @@ epichains_tree <- function(tree_df,
     tree_df = tree_df,
     ntrees = ntrees,
     statistic = statistic,
+    offspring_dist = offspring_dist,
     stat_max = stat_max,
     track_pop = track_pop
   )
@@ -103,12 +109,14 @@ epichains_tree <- function(tree_df,
 new_epichains_summary <- function(chains_summary,
                                   ntrees = integer(),
                                   statistic = character(),
+                                  offspring_dist = character(),
                                   stat_max = integer()) {
   # Assemble the elements of the object
   obj <- structure(
     chains_summary,
     ntrees = ntrees,
     statistic = statistic,
+    offspring_dist = offspring_dist,
     stat_max = stat_max,
     class = c("epichains_summary", class(chains_summary))
   )
@@ -133,11 +141,14 @@ new_epichains_summary <- function(chains_summary,
 epichains_summary <- function(chains_summary,
                               ntrees = integer(),
                               statistic = character(),
+                              offspring_dist = character(),
                               stat_max = integer()) {
   # Check that inputs are well specified
   checkmate::assert_vector(chains_summary)
   checkmate::assert_integerish(ntrees, null.ok = TRUE)
   checkmate::assert_character(statistic)
+  check_offspring_valid(offspring_dist)
+  check_offspring_func_valid(paste0("r", offspring_dist))
   checkmate::assert_number(stat_max, null.ok = TRUE)
 
   # Create <epichains_summary> object
@@ -145,6 +156,7 @@ epichains_summary <- function(chains_summary,
     chains_summary,
     ntrees = ntrees,
     statistic = statistic,
+    offspring_dist = offspring_dist,
     stat_max = stat_max
   )
 
diff --git a/R/simulate.r b/R/simulate.r
index 06e4bf0a..b899f3f6 100644
--- a/R/simulate.r
+++ b/R/simulate.r
@@ -242,6 +242,7 @@ simulate_tree <- function(ntrees, statistic = c("size", "length"),
     tree_df = tree_df,
     ntrees = ntrees,
     statistic = statistic,
+    offspring_dist = offspring_dist,
     stat_max = stat_max,
     track_pop = FALSE
   )
@@ -339,6 +340,7 @@ simulate_summary <- function(ntrees, statistic = c("size", "length"),
     chains_summary = stat_track,
     ntrees = ntrees,
     statistic = statistic,
+    offspring_dist = offspring_dist,
     stat_max = stat_max
   )
 
@@ -552,6 +554,7 @@ simulate_tree_from_pop <- function(pop,
     tree_df,
     ntrees = NULL,
     statistic = NULL,
+    offspring_dist = offspring_dist,
     stat_max = NULL,
     track_pop = TRUE
   )
diff --git a/man/epichains_summary.Rd b/man/epichains_summary.Rd
index cfe5f290..adb2e31c 100644
--- a/man/epichains_summary.Rd
+++ b/man/epichains_summary.Rd
@@ -8,6 +8,7 @@ epichains_summary(
   chains_summary,
   ntrees = integer(),
   statistic = character(),
+  offspring_dist = character(),
   stat_max = integer()
 )
 }
@@ -24,6 +25,11 @@ Can be one of:
 \item "length": the total number of infectors.
 }}
 
+\item{offspring_dist}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers).}
+
 \item{stat_max}{A cut off for the chain statistic (size/length) being
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
diff --git a/man/epichains_tree.Rd b/man/epichains_tree.Rd
index 553efa36..e1389f60 100644
--- a/man/epichains_tree.Rd
+++ b/man/epichains_tree.Rd
@@ -8,6 +8,7 @@ epichains_tree(
   tree_df,
   ntrees = integer(),
   statistic = character(),
+  offspring_dist = character(),
   stat_max = integer(),
   track_pop = logical()
 )
@@ -27,6 +28,11 @@ Can be one of:
 \item "length": the total number of infectors.
 }}
 
+\item{offspring_dist}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers).}
+
 \item{stat_max}{A cut off for the chain statistic (size/length) being
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
diff --git a/man/new_epichains_summary.Rd b/man/new_epichains_summary.Rd
index 3b55586c..2a2301d0 100644
--- a/man/new_epichains_summary.Rd
+++ b/man/new_epichains_summary.Rd
@@ -8,6 +8,7 @@ new_epichains_summary(
   chains_summary,
   ntrees = integer(),
   statistic = character(),
+  offspring_dist = character(),
   stat_max = integer()
 )
 }
@@ -24,6 +25,11 @@ Can be one of:
 \item "length": the total number of infectors.
 }}
 
+\item{offspring_dist}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers).}
+
 \item{stat_max}{A cut off for the chain statistic (size/length) being
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}
diff --git a/man/new_epichains_tree.Rd b/man/new_epichains_tree.Rd
index d321d100..0646ca72 100644
--- a/man/new_epichains_tree.Rd
+++ b/man/new_epichains_tree.Rd
@@ -8,6 +8,7 @@ new_epichains_tree(
   tree_df,
   ntrees = integer(),
   statistic = character(),
+  offspring_dist = character(),
   stat_max = integer(),
   track_pop = logical()
 )
@@ -27,6 +28,11 @@ Can be one of:
 \item "length": the total number of infectors.
 }}
 
+\item{offspring_dist}{Offspring distribution: a character string
+corresponding to the R distribution function (e.g., "pois" for Poisson,
+where \code{\link{rpois}} is the R function to generate Poisson random
+numbers).}
+
 \item{stat_max}{A cut off for the chain statistic (size/length) being
 computed. Results above the specified value, are set to this value.
 Defaults to \code{Inf}.}

From ba1dd227193f2b6ba6c96a6bfed16699944d04f7 Mon Sep 17 00:00:00 2001
From: jamesaazam <james.azam@lshtm.ac.uk>
Date: Mon, 4 Dec 2023 18:43:17 +0000
Subject: [PATCH 827/828] Fix logo

---
 README.Rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.Rmd b/README.Rmd
index 1328b9fd..72fb5b96 100644
--- a/README.Rmd
+++ b/README.Rmd
@@ -19,7 +19,7 @@ knitr::opts_chunk$set(
 )
 ```
 
-# _{{ packagename }}_: Methods for simulating and analysing the size and length of transmission chains from branching process models <img src="man/figures/epichains_logo.png" align="right" height="130" />
+# _{{ packagename }}_: Methods for simulating and analysing the size and length of transmission chains from branching process models <img src="man/figures/logo.svg" align="right" height="130" />
 
 <!-- badges: start -->
 ![GitHub R package version](https://img.shields.io/github/r-package/v/epiverse-trace/epichains)

From d0af0193745934c7e75a7e43112c72d20a1701bc Mon Sep 17 00:00:00 2001
From: GitHub Action <action@github.com>
Date: Mon, 4 Dec 2023 18:45:09 +0000
Subject: [PATCH 828/828] Automatic readme update

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 5dcd207d..822afb24 100644
--- a/README.md
+++ b/README.md
@@ -5,7 +5,7 @@
 <!-- `packagename` is extracted from the DESCRIPTION file -->
 <!-- `gh_repo` is extracted via a special environment variable in GitHub Actions -->
 
-# *epichains*: Methods for simulating and analysing the size and length of transmission chains from branching process models <img src="man/figures/epichains_logo.png" align="right" height="130" />
+# *epichains*: Methods for simulating and analysing the size and length of transmission chains from branching process models <img src="man/figures/logo.svg" align="right" height="130" />
 
 <!-- badges: start -->