The strayr
package provides tools to make working with Australian data
easier. This includes:
-
tidy versions of common structures used by the Australian Bureau of Statistics (ABS), like ANZSIC and ANZSCO:
-
a function to tidy up state names (
clean_states()
); -
a function that knows whether particular dates are public holidays (
is_holiday()
); and -
a table containing the start and end dates of school terms in each state and territory, back to 1978 (
school_terms
).
This package is currently in development and subject to change. The
lifecycle badge will be changed to stable
when it is stable (should be
relatively soon).
Contribute to this package: people are actively encouraged to contribute to this package.
You can install the current version of strayr
with:
remotes::install_github("runapp-aus/strayr")
Current structures stored in strayr
are:
- Australian and New Zealand Standard Classification of Occupations
(ANZSCO), Cat. 1220.0:
anzsco2022
: occupation levels of ANZSCO, 2022.anzsco2021
: occupation levels of ANZSCO, 2021.anzsco2019
: occupation levels of ANZSCO, 2013, Version 1.3.anzsco2013
: occupation levels of ANZSCO, 2013, Version 1.2.anzsco2009
: occupation levels ANZSCO, First Edition, Revision 1, 2009.
- Australian and New Zealand Standard Industrial Classification
(ANZSIC), Cat. 1292.0:
anzsic2006
: industry levels of ANZSIC, 2006 (Revision 2.0).
- Australian Standard Classification of Education (ASCED), Cat. 1272.0:
The clean_state()
function makes it easy to wrangle vectors of State
names and abbreviations - which might be in different forms and possibly
misspelled.
This package includes the auholidays
dataset from the Australian
Public Holidays Dates Machine Readable
Dataset
as well as a helper function is_holiday
.
This package includes a dataset with the start and end dates of school terms in each Australian state and territory from 1978 to 2024:
school_terms
#> # A tibble: 1,504 × 5
#> state year term start end
#> <chr> <int> <int> <date> <date>
#> 1 NSW 1978 1 1978-02-01 1978-05-05
#> 2 Vic 1978 1 1978-02-06 1978-05-12
#> 3 Qld 1978 1 1978-01-23 1978-04-28
#> 4 SA 1978 1 1978-02-06 1978-05-12
#> 5 WA 1978 1 1978-02-06 1978-05-12
#> 6 Tas 1978 1 1978-02-21 1978-05-26
#> 7 NT 1978 1 1978-02-06 1978-05-12
#> 8 ACT 1978 1 1978-02-01 1978-05-05
#> 9 NSW 1978 2 1978-05-22 1978-08-25
#> 10 Vic 1978 2 1978-05-29 1978-08-25
#> # ℹ 1,494 more rows
The parse_income_range
function provides some tools for extracting
numbers from income ranges commonly used in Australian data. For
example:
parse_income_range("$1-$199 ($1-$10,399)", limit = "lower")
#> [1] 1
The strayr
package also provides tools to access sf
objects
contained in absmapsdata
.
See ?strayr::read_absmap
for more information.
read_absmap("sa42021")