-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Phase 1 data examination results #17
Add Phase 1 data examination results #17
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
results.md:
- Would it be possible to add the output-file variable names for the two current-law runs:
- CY2023 Payroll Tax Liability ($ billion) (federal employee plus employer share) -- this looks like payroll tax
- CY2023 Individual Income Tax Liability ($ billion) (federal individual income tax) -- it looks to me like this is iitax
- Those are some big differences vs. CBO in current law liability (especially IIT), for both taxdata and pe! We will need to dig into that.
- As a result of this, I have added the following variables to the ad hoc analysis: c05800, taxbc, othertaxes, iitax, payrolltax. (See this, at bottom of table: https://boyd-psl-adhoc.netlify.app/analysis.html#comparison-of-weighted-sums-for-selected-variables)
- You can see that my ad hoc run shows, at 2023 levels, for baseline 2023 law (I think), the following.
- For iitax, I get $2,154.3 for taxdata, which about matches your $2,154.4 for taxdata. However, you get $2,012.9 for the pe phase 1 dataset but I get $1,540.4.
- For payrolltax, I match your taxdata number and am somewhat closer for pe than I am with iitax: you have 1696.7 but I have 1630.1 --far too different to be rounding differences.
- This suggests for PE that we're doing something different or using different data or comparing different results. I am using the Feb 20 version of the PE file (see top line here: https://boyd-psl-adhoc.netlify.app/prelims.html).
- Maybe we can discuss when we talk tomorrow.
@martinholmer @nikhilwoodruff @MaxGhenis
I am guessing that as a result I have some important input variables in my stacked file that I set to zero, and they are affecting tax-calculator results, giving results that are not what I intended. It seems like the fix is for me to do tax calculations in two steps:
Then, get the two resulting files and stack them. It will have missing values for the variables that are in td but not in pe, for the pe records and for the td-same-variables-as-pe records. Leave them missing. Calculate comparisons on this file. I think this should fix it. I had not thought through the implications of setting missing to zero in the stacked file. I'll try to do this now and will report back. |
That fixes it. The 2023 baseline-law results in the ad hoc analysis now match the results from @martinholmer. The updated ad hoc analysis is here. The revised R code is here. The results generally are much closer now between pe and taxdata. @nikhilwoodruff and @MaxGhenis I'm sorry for any grief or head-scratching this caused you. Still plenty of questions to investigate, but not the massive differences my erroneous earlier ad hoc analysis gave. |
@donboyd5 asked in the discussion of PR #17:
The "output-file variable names" are at the top of the |
This PR adds the
examination/results.md
document and provides a link to it in the high-levelREADME.md
document.