Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

284 MBox Refresher #295

Open
wants to merge 65 commits into
base: master
Choose a base branch
from
Open

284 MBox Refresher #295

wants to merge 65 commits into from

Commits on Apr 19, 2024

  1. Created parse_mbox_latest_date and refresh_mbox functions and updated…

    … helix config in accordance to new save file structure
    
    I have created the parse_mbox_latest_date and refresh_mbox functions. The latter function deletes the latest year and month mbox file that is currently downloaded (identified by parse_mbox_latest_date), and redownloads that along with any file beyond up until the current year. The naming convention of the downloaded files are also changed to what we have agreed on. Just to note, download_mod_mbox REMAINS UNCHANGED since I'm only using download_mod_mbox_per_month.
    ian-lastname committed Apr 19, 2024
    Configuration menu
    Copy the full SHA
    8c1021d View commit details
    Browse the repository at this point in the history

Commits on Apr 24, 2024

  1. Edited download_pipermail to save pipermail files as mbox files, crea…

    …ted refresh_pipermail, updated news
    
    Found out that the pipermail downloader function already downloads the files by month and year, so all I really needed to do was change it so that it downloads the files as mbox files (change the extension from .txt to .mbox). Created the refresher for pipermail. I had no need to create a parse latest pipermail since they were mbox files anyway.
    ian-lastname committed Apr 24, 2024
    Configuration menu
    Copy the full SHA
    72238a7 View commit details
    Browse the repository at this point in the history

Commits on Apr 25, 2024

  1. Configuration menu
    Copy the full SHA
    99fb7e3 View commit details
    Browse the repository at this point in the history
  2. Added checks in refresh functions and in download_mod_mbox_per_month …

    …to ensure it does not download files past current year and month
    
    Added checks in the aforementioned functions so that the refreshers won't download "mail from the future"
    ian-lastname committed Apr 25, 2024
    Configuration menu
    Copy the full SHA
    618f2d0 View commit details
    Browse the repository at this point in the history

Commits on Apr 28, 2024

  1. fix github checks

    carlosparadis committed Apr 28, 2024
    Configuration menu
    Copy the full SHA
    0751218 View commit details
    Browse the repository at this point in the history

Commits on Apr 29, 2024

  1. Re-added error message in refresh_pipermail when an http error is enc…

    …ountered
    
    Done as requested by Carlos
    ian-lastname authored Apr 29, 2024
    Configuration menu
    Copy the full SHA
    be4ff32 View commit details
    Browse the repository at this point in the history

Commits on Apr 30, 2024

  1. Configuration menu
    Copy the full SHA
    b5be04e View commit details
    Browse the repository at this point in the history

Commits on Sep 10, 2024

  1. Configuration menu
    Copy the full SHA
    d2ce222 View commit details
    Browse the repository at this point in the history

Commits on Sep 15, 2024

  1. i #284 Refactor download_pipermail function

    - Remove archive_url and archive_type parameters from download_pipermail().
    - Add start_year_month and end_year_month parameters for date filtering.
    - Remove convert_pipermail_to_mbox() function, as download_pipermail() now handles file conversion automatically.
    - Change file naming convention to 'kaiaulu_'YYYYMM.mbox'.
    - Attempt to download and decompress files directly without saving .gz to disk, but could not establish a valid connection.
    
    Signed-off-by: Dao McGill <[email protected]>
    daomcgill committed Sep 15, 2024
    Configuration menu
    Copy the full SHA
    7c585ae View commit details
    Browse the repository at this point in the history

Commits on Sep 17, 2024

  1. i #284 Updated documentation and modified function for download_piper…

    …mail()
    
    - Modified helix.yml to use [[“mailing_list”]][[“pipermail”]][[“project_key_1”]]
    - Added project_key_2 to helix.yml
    - Created /vignettes/download_mail.Rmd to document information about pipermail downloader
    - Made function calls explicit for external libraries
    - ISSUE: Build -> Check is not passing. Seems to be having issues with utags_path, even though I changed the path to the one for universal-ctags in tools.yml
    daomcgill committed Sep 17, 2024
    1 Configuration menu
    Copy the full SHA
    69ca163 View commit details
    Browse the repository at this point in the history
  2. i #284 Edited download_pipermail() and Added refresh_pipermail() and …

    …process_gz_to_mbox_in_folder()
    
    - download_pipermail: Attempts to download .txt file first. If unavailable fallback to .gz. If using .gz file, unzips and writes output in .mbox
    - Added log messages
    - download_pipermail: Added timeout parameter to deal with case that server takes too long to respond
    - Added refresh_pipermail function
    - Updated vignettes/download_mail.Rmd to include refresh_pipermail
    - Added process_gz_to_mbox_in_folder function
    daomcgill committed Sep 17, 2024
    Configuration menu
    Copy the full SHA
    b9a886b View commit details
    Browse the repository at this point in the history

Commits on Sep 19, 2024

  1. i #284 Added more descriptive comments. Made minor changes to piperma…

    …il refresher.
    
    - Replaced paste0 with stringi::stri_c
    - Removed create directory if does not exist
    - Added more verbose descriptions/comments
    - Added dividers within functions
    - Added verbose parameter
    - Added else block for refresher
    - Added call to process_gz_to_mbox_in_folder at end of refresher
    - parse_mbox: stri_replace_last was not working, changed it to stringi::stri_replace_last_regex
    - Tested parse_mbox. Perceval was not returning any output. I will look further into why this is happening.
    daomcgill committed Sep 19, 2024
    Configuration menu
    Copy the full SHA
    3c88140 View commit details
    Browse the repository at this point in the history
  2. i #284 Added more descriptive comments. Made minor changes to piperma…

    …il refresher.
    
    - Replaced paste0 with stringi::stri_c
    - Removed create directory if does not exist
    - Added more verbose descriptions/comments
    - Added dividers within functions
    - Added verbose parameter
    - Added else block for refresher
    - Added call to process_gz_to_mbox_in_folder at end of refresher
    - parse_mbox: stri_replace_last was not working, changed it to stringi::stri_replace_last_regex
    - Tested parse_mbox. Perceval was not returning any output. I will look further into why this is happening.
    
    Signed-off-by: Dao McGill <[email protected]>
    daomcgill committed Sep 19, 2024
    Configuration menu
    Copy the full SHA
    5de3aa2 View commit details
    Browse the repository at this point in the history
  3. Merge branch '284-mbox-download-refresher' of https://github.com/sail…

    …uh/kaiaulu into 284-mbox-download-refresher
    daomcgill committed Sep 19, 2024
    Configuration menu
    Copy the full SHA
    8a373d6 View commit details
    Browse the repository at this point in the history

Commits on Sep 21, 2024

  1. i #284 Added download_mod_mbox function and edited notebook

    Updated parameters for download_mod_mbox to use Apache Pony Mail links as Apache lists now redirect there
    - Modified downloads to use YYYYMM  instead of YYYY
    - Removed the option for downloading by year for clearer functionality.
    - Updated vignette/download_mail.Rmd
    
    Signed-off-by: Dao McGill <[email protected]>
    daomcgill committed Sep 21, 2024
    Configuration menu
    Copy the full SHA
    b91389b View commit details
    Browse the repository at this point in the history

Commits on Sep 22, 2024

  1. i #284 Added refresh_mod_mbox function for refreshing Mod Mbox archives

    - Created `refresh_mod_mbox` function to automatically refresh mailing list archives downloaded using Mod Mbox.
    - The function checks for the latest downloaded file, deletes it, and redownloads the archive from that month to the current date.
    - Added documentation for `refresh_mod_mbox` to the notebook.
    
    Signed-off-by: Dao McGill <[email protected]>
    daomcgill committed Sep 22, 2024
    Configuration menu
    Copy the full SHA
    0cc4123 View commit details
    Browse the repository at this point in the history

Commits on Oct 2, 2024

  1. i #284 Updated Notebook

    - Updated vignettes/download_mail.Rmd to working version
    - Fixed errors in helix.yml
    - Minor edits in mail.R
    
    Signed-off-by: Dao McGill <[email protected]>
    daomcgill committed Oct 2, 2024
    Configuration menu
    Copy the full SHA
    0dc6001 View commit details
    Browse the repository at this point in the history
  2. i #284 Testing Github Actions

    - Check works locally
    - Commit all changed files
    daomcgill committed Oct 2, 2024
    Configuration menu
    Copy the full SHA
    f0027dc View commit details
    Browse the repository at this point in the history
  3. i #284 Renamed save_folder_mail parameter to mbox

    - Renamed for match with convention set by issue #230
    
    Signed-off-by: Dao McGill <[email protected]>
    daomcgill committed Oct 2, 2024
    Configuration menu
    Copy the full SHA
    9b9c896 View commit details
    Browse the repository at this point in the history

Commits on Oct 3, 2024

  1. i #284 Updated Notebook download_mail.Rmd

    - Reverted name change of save_folder_mail
    - Removed previous documentation file for mail (download_mod_mbox.Rmd)
    - Updates to dowmload_mail.Rmd
    daomcgill committed Oct 3, 2024
    Configuration menu
    Copy the full SHA
    7249c9b View commit details
    Browse the repository at this point in the history
  2. Revert "i #284 Testing Github Actions"

    This reverts commit f0027dc.
    daomcgill committed Oct 3, 2024
    Configuration menu
    Copy the full SHA
    2a1ba98 View commit details
    Browse the repository at this point in the history
  3. i #284 Refactored parse_mbox_latest_date and Fixed Roxygen Errors

    - parse_mbox_lateset_date() now uses new naming convention for files
    - Added to download_mail.Rmd
    - Fixed documentation for download_pipermail()
    
    Signed-off-by: Dao McGill <[email protected]>
    daomcgill committed Oct 3, 2024
    Configuration menu
    Copy the full SHA
    7bf8ba6 View commit details
    Browse the repository at this point in the history
  4. i #284 Update NEWS.md

    - added parse_mbox_latest_date
    daomcgill committed Oct 3, 2024
    Configuration menu
    Copy the full SHA
    aa60648 View commit details
    Browse the repository at this point in the history

Commits on Oct 6, 2024

  1. i #284 Updated Notebook, exec/mailinglist.R and R/mail.R

    - Update pkgdown.yml
    - Set eval to False for notebook
    - Added warning for failed downloads
    - Added check for missing months in the date range within save_folder_path
    - Changed mbox_path in parsers to mbox_file_path
    - Use gt package to view tables
    - Made changes so Knit works for download_mail.Rmd
    - Updated exec/mailinglist.R to use new functions
    - To do: Use getter functions once they are merged
    
    Signed-off-by: Dao McGill <[email protected]>
    daomcgill committed Oct 6, 2024
    Configuration menu
    Copy the full SHA
    64e0646 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    2b6a963 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    dc40dba View commit details
    Browse the repository at this point in the history

Commits on Oct 9, 2024

  1. i #284 fixes incorrect call

    R/example.R contained an unused parameter,
    triggering warnings on build.
    
    Signed-off-by: Carlos Paradis <[email protected]>
    carlosparadis committed Oct 9, 2024
    Configuration menu
    Copy the full SHA
    d6f3b41 View commit details
    Browse the repository at this point in the history
  2. i #284 attempt fix on Actions

    Actions is failing due to being
    unable to install XML. Some new error
    yet again on Actions. Trying to make
    the version requirement less strict
    to see if it is able to install.
    
    Signed-off-by: Carlos Paradis <[email protected]>
    carlosparadis committed Oct 9, 2024
    Configuration menu
    Copy the full SHA
    f02ecb1 View commit details
    Browse the repository at this point in the history
  3. i #284 incomplete storytelling review

    The story is a bit too dry and assumes much
    of the user. The file format stored is not
    brief. Modified it a bit to add an example
    on how it can be revised.
    
    Signed-off-by: Carlos Paradis <[email protected]>
    carlosparadis committed Oct 9, 2024
    Configuration menu
    Copy the full SHA
    7f38d1c View commit details
    Browse the repository at this point in the history
  4. i #284 downgrade version of R for XML

    In case the error of XML compile is tied to
    this issue: r-lib/actions#559
    revert to 4.1 to see if it solves the problem.
    
    Signed-off-by: Carlos Paradis <[email protected]>
    carlosparadis committed Oct 9, 2024
    Configuration menu
    Copy the full SHA
    309fa34 View commit details
    Browse the repository at this point in the history
  5. i #284 gcc not found on Actions

    Issue seems to be tied to gcc compiler
    not working. Attempt to bump OS X version
    up rather than downgrade R.
    
    See GitHub Action for CHECK on the line:
    
    "checking whether the C compiler works... no"
    
    right before: "ERROR: configuration failed for package ‘XML’"
    
    Signed-off-by: Carlos Paradis <[email protected]>
    carlosparadis committed Oct 9, 2024
    Configuration menu
    Copy the full SHA
    e04bd31 View commit details
    Browse the repository at this point in the history

Commits on Oct 10, 2024

  1. i #284 Refactored download_mail.Rmd

    - Refactored the download_mail.Rmd notebook to expect the use of the getters from R/config.R (i #230 contains the getter functions in R/config.R).
    beydlern committed Oct 10, 2024
    Configuration menu
    Copy the full SHA
    dbd7092 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    ea109bd View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    c4b9d16 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    90b05ed View commit details
    Browse the repository at this point in the history
  5. i #284 Change Roxygen version

    daomcgill committed Oct 10, 2024
    Configuration menu
    Copy the full SHA
    3e5f8f7 View commit details
    Browse the repository at this point in the history

Commits on Oct 11, 2024

  1. i #284 Update Notebook and config file

    - This should fail until the getters are merged.
    
    Signed-off-by: Dao McGill <[email protected]>
    daomcgill committed Oct 11, 2024
    Configuration menu
    Copy the full SHA
    4af2c21 View commit details
    Browse the repository at this point in the history

Commits on Oct 15, 2024

  1. Configuration menu
    Copy the full SHA
    8094402 View commit details
    Browse the repository at this point in the history

Commits on Oct 18, 2024

  1. i #284 Fixed Relative Paths in a Notebook

    - The project configuration section of a notebook was incorrectly using the project directory (kaiaulu/) as its working directory rather than the directory that it resides in (/vignettes/) as its working directory.
    beydlern committed Oct 18, 2024
    Configuration menu
    Copy the full SHA
    5fb3af7 View commit details
    Browse the repository at this point in the history

Commits on Nov 12, 2024

  1. i #230 create config file interface

    This commits perform a major refactoring of how Kaiaulu interface with config files, and the suggested folder organization to store rawdata and analysis. 
    
    The configuration files are generalized to account for anomaly cases when performing project analysis. For instance, long-lived projects may contain multiple repositories, issue trackers, mailing list, etc. The new template of the configuration file allows to account for this information. 
    
    Moreover, changes to the config template cascaded in changes to all notebooks, as the access to the config was hardcoded to the file organization. A new set of get_ functions should make this the last commit that change in template cascades into notebooks. All actively maintained notebooks  (not prefixed by underline under vignettes/) have been updated to use the get functions. Future changes, therefore, will only affect the get() functions in R/config.R.
    
    The folder organization of the filepaths has also been modified. Previously, filepaths assumed as default in the versioned config files suggested organizing code as rawdata/git_repo/projectX ; rawdata/jira/projectY. This organization was not practical for sharing data manually, as the user would need to zip several folders individually. The new organization is now rawdata/projectX/git_repo ; rawdata/projectX/jira. This means users only need to zip projectX and that will contain all the data wanted to be shared.
    
    A minor typo on graph.R was also fixed for merge function calls from `sorted=` to `sort=`.
    anthonyjlau authored Nov 12, 2024
    Configuration menu
    Copy the full SHA
    e56848a View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    b462ddb View commit details
    Browse the repository at this point in the history
  3. i #284 minor fixes and XML dependency

    Moves some of the narrative to functions. Function
    documentation can be accessed on demand instead of
    placed on Notebook. Some of the docs was moved to
    inline on the function of pipermail.
    
    Signed-off-by: Carlos Paradis <[email protected]>
    carlosparadis committed Nov 12, 2024
    Configuration menu
    Copy the full SHA
    def1660 View commit details
    Browse the repository at this point in the history
  4. revert utags

    revert utags to match master
    
    Signed-off-by: Carlos Paradis <[email protected]>
    carlosparadis committed Nov 12, 2024
    Configuration menu
    Copy the full SHA
    bfc75cb View commit details
    Browse the repository at this point in the history
  5. i #284 More narrative and config fixes

    Signed-off-by: Carlos Paradis <[email protected]>
    carlosparadis committed Nov 12, 2024
    Configuration menu
    Copy the full SHA
    c1830f6 View commit details
    Browse the repository at this point in the history
  6. i #284 Remove description tags

    The rest of kaiaulu does not use this.
    
    Signed-off-by: Carlos Paradis <[email protected]>
    carlosparadis committed Nov 12, 2024
    Configuration menu
    Copy the full SHA
    4842100 View commit details
    Browse the repository at this point in the history
  7. i #284 more minor doc formatting fixes

    added url tag, etc.
    
    Signed-off-by: Carlos Paradis <[email protected]>
    carlosparadis committed Nov 12, 2024
    Configuration menu
    Copy the full SHA
    0f9769e View commit details
    Browse the repository at this point in the history
  8. i #284 Updates to exec/mailinglist.R and Minor Fixes for Mail Configu…

    …ration
    
    - Use refresh in exec
    - Use getters in exec
    - Change kaiaulu version in exec
    - Fix paths in helix.yml
    - Remove unused parameters
    - Change cat to message
    
    Signed-off-by: Dao McGill <[email protected]>
    daomcgill committed Nov 12, 2024
    Configuration menu
    Copy the full SHA
    6f6a59b View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    775b5a6 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    93f214e View commit details
    Browse the repository at this point in the history
  11. i #295 Change argument for exec from 'tabulate' to 'parse'

    - Change exec from 'tabulate' to 'parse'
    - Will update issue 310 to use this exec instead of its own
    
    Signed-off-by: Dao McGill <[email protected]>
    daomcgill committed Nov 12, 2024
    Configuration menu
    Copy the full SHA
    e27a604 View commit details
    Browse the repository at this point in the history

Commits on Nov 13, 2024

  1. i #284 Testing Fix for Actions

    - Fixed uri parameter in parse_mbox for perceval
    - Small change in example
    
    Signed-off-by: Dao McGill <[email protected]>
    daomcgill committed Nov 13, 2024
    Configuration menu
    Copy the full SHA
    6a5fed6 View commit details
    Browse the repository at this point in the history
  2. i #284 Try Adding Debugging

    Signed-off-by: Dao McGill <[email protected]>
    daomcgill committed Nov 13, 2024
    Configuration menu
    Copy the full SHA
    ffb5c9c View commit details
    Browse the repository at this point in the history
  3. Revert "i #284 Try Adding Debugging"

    This reverts commit ffb5c9c.
    daomcgill committed Nov 13, 2024
    Configuration menu
    Copy the full SHA
    e55b6e2 View commit details
    Browse the repository at this point in the history
  4. i #284 Revert ctags version

    daomcgill committed Nov 13, 2024
    Configuration menu
    Copy the full SHA
    c797219 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    092e2ab View commit details
    Browse the repository at this point in the history
  6. i #284 Please work

    daomcgill committed Nov 13, 2024
    Configuration menu
    Copy the full SHA
    56dff9c View commit details
    Browse the repository at this point in the history
  7. i #295 Last try

    daomcgill committed Nov 13, 2024
    Configuration menu
    Copy the full SHA
    fd97af0 View commit details
    Browse the repository at this point in the history
  8. Revert "i #295 Last try"

    This reverts commit fd97af0.
    daomcgill committed Nov 13, 2024
    Configuration menu
    Copy the full SHA
    8709b95 View commit details
    Browse the repository at this point in the history
  9. Revert "i #284 Please work"

    This reverts commit 56dff9c.
    daomcgill committed Nov 13, 2024
    Configuration menu
    Copy the full SHA
    71054f9 View commit details
    Browse the repository at this point in the history
  10. Revert "Update commit_message_id_coverage.Rd"

    This reverts commit 092e2ab.
    daomcgill committed Nov 13, 2024
    Configuration menu
    Copy the full SHA
    382383d View commit details
    Browse the repository at this point in the history
  11. Revert "i #284 Revert ctags version"

    This reverts commit c797219.
    daomcgill committed Nov 13, 2024
    Configuration menu
    Copy the full SHA
    f11e452 View commit details
    Browse the repository at this point in the history
  12. Reapply "i #284 Try Adding Debugging"

    This reverts commit e55b6e2.
    daomcgill committed Nov 13, 2024
    Configuration menu
    Copy the full SHA
    09d00c3 View commit details
    Browse the repository at this point in the history
  13. i #284 R version

    daomcgill committed Nov 13, 2024
    Configuration menu
    Copy the full SHA
    216fe07 View commit details
    Browse the repository at this point in the history
  14. Configuration menu
    Copy the full SHA
    99823d7 View commit details
    Browse the repository at this point in the history
  15. Configuration menu
    Copy the full SHA
    6cd5e11 View commit details
    Browse the repository at this point in the history