Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

v3.7.1.111 and v3.7.2.112 and v3.7.3.113 to prod #554

Merged
merged 239 commits into from
Dec 6, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
239 commits
Select commit Hold shift + click to select a range
fd4e974
- Add log file rollover
elipe17 Sep 16, 2024
feec150
- Add initial configs for cloud deployments
elipe17 Sep 16, 2024
8805b94
- initial config for grafana deploy
elipe17 Sep 17, 2024
cd4b739
- Remove empty file
elipe17 Sep 17, 2024
c935ce0
- move data sources to template file
elipe17 Sep 18, 2024
b7bd9e4
- general deploy routine for pg and grafana
elipe17 Sep 18, 2024
092df96
- added deploy routine for prometheus
elipe17 Sep 18, 2024
1440b7f
- Added deploy routine for loki
elipe17 Sep 18, 2024
c6edd4a
- Initial update for promtail sidecars
elipe17 Sep 19, 2024
19b48a1
- allow deploy no matter test state
elipe17 Sep 19, 2024
0d2518a
- Update deploy scripts to prepare promtail config
elipe17 Sep 19, 2024
e495aac
- add quotes
elipe17 Sep 19, 2024
8388f40
- Update frontend to write error log to file
elipe17 Sep 19, 2024
57de9d4
-- for faster turnaround
elipe17 Sep 19, 2024
3da3526
- add ignore for file generation
elipe17 Sep 19, 2024
4797ced
- Move limits to per process
elipe17 Sep 19, 2024
6b1eedd
- update disk quota to match backend
elipe17 Sep 19, 2024
30ca7fc
- Uping promtail memory
elipe17 Sep 19, 2024
9926c4d
- Explicitely execute nginx
elipe17 Sep 19, 2024
79d578d
- Testing less memory
elipe17 Sep 19, 2024
5bb32d2
- Tell nginx to reload
elipe17 Sep 19, 2024
2316312
- try removing nginx command
elipe17 Sep 19, 2024
bd60f78
- remove stderr log
elipe17 Sep 19, 2024
7767c17
- try removing extra buildpaack
elipe17 Sep 19, 2024
60827e0
- re-add errorlog pipe
elipe17 Sep 19, 2024
96abb6b
- remove blank line
elipe17 Sep 19, 2024
c44692e
- remove error log for test
elipe17 Sep 19, 2024
e1e272b
- remove resolver directive as test
elipe17 Sep 19, 2024
d8b27d3
- test hard coded vals
elipe17 Sep 19, 2024
17b22f1
- revert conf changes
elipe17 Sep 19, 2024
8c121d9
Merge branch 'develop' into 3046-plg-cloud
elipe17 Sep 19, 2024
dfa6de8
- Testing with latest nginx buildpack
elipe17 Sep 20, 2024
0078a86
- revert to original manifest
elipe17 Sep 20, 2024
4f9d4d5
- revert buildpack and nginx.conf
elipe17 Sep 20, 2024
9f82e14
- test promtail as a sidecar
elipe17 Sep 20, 2024
4527ed6
- Update loki to store logs in s3
elipe17 Sep 20, 2024
764e041
- add bucket name
elipe17 Sep 20, 2024
4a35d01
- add path for local loki directories
elipe17 Sep 20, 2024
d6acc58
- Update path prefix
elipe17 Sep 20, 2024
7d70d19
- Add networking commands for PLG
elipe17 Sep 20, 2024
47fafaf
- alleviate secrets check
elipe17 Sep 20, 2024
7ceb3d9
- UPdated deploy script
elipe17 Sep 20, 2024
cc73759
- update comment in route
elipe17 Sep 23, 2024
9e00a31
- add internal apps to allowed hosts
elipe17 Sep 23, 2024
5e43a1f
- Updated local proxy config to correctly proxy grafana
elipe17 Sep 23, 2024
d08a4c9
- Explicitely mark netpols to route to dev env
elipe17 Sep 23, 2024
bc960b5
- intermediate commit
elipe17 Sep 23, 2024
0a4bbe8
- Updates to deploy script
elipe17 Sep 23, 2024
f3cc7c7
- Update prometheus scrape configs to have all envs
elipe17 Sep 23, 2024
5e64f68
Merge branch 'develop' of https://github.com/raft-tech/TANF-app into …
elipe17 Sep 23, 2024
fb7f807
- Remove promtail sidecar from frontend
elipe17 Sep 23, 2024
8cc8430
- remove manifest tremplate usage
elipe17 Sep 23, 2024
98e1cb2
- remove env expansion from loki
elipe17 Sep 23, 2024
6b3ab12
- Give loki a local config for comparison
elipe17 Sep 23, 2024
e24b3cd
- add db size visualizaiton
elipe17 Sep 24, 2024
654ccc4
- Update loki local to use local stack storage
elipe17 Sep 24, 2024
a43da06
- log level info
elipe17 Sep 24, 2024
25487cc
- get promtail logs to file
elipe17 Sep 24, 2024
2428d41
- Move promtail process into gunicorn script
elipe17 Sep 25, 2024
e949849
- Update job label to be templated
elipe17 Sep 25, 2024
f38cf87
- Add space switching to allow for correct networking
elipe17 Sep 25, 2024
88c6d4e
- Update dashboards
elipe17 Sep 25, 2024
69ee469
- export missing DB metrics
elipe17 Sep 25, 2024
3a05246
- fix dashboard for local use
elipe17 Sep 25, 2024
0dfb56c
- correct name
elipe17 Sep 25, 2024
3fe36bc
- Update to use datasource uid
elipe17 Sep 25, 2024
d519ca2
- fix name
elipe17 Sep 25, 2024
d40b2c4
- Move log file to /tmp
elipe17 Sep 25, 2024
8b310a9
- make deployments rolling
elipe17 Sep 26, 2024
0578058
- update terraform
elipe17 Sep 26, 2024
9488843
- re-enable testing
elipe17 Sep 26, 2024
9eeee32
Merge branch 'develop' into 3046-plg-cloud
elipe17 Sep 26, 2024
0d242d7
- Remove debug stuff
elipe17 Sep 26, 2024
9d1604d
Change scrape to happen every 15s
elipe17 Sep 26, 2024
1afb129
Merge branch 'develop' into 3046-plg-cloud
elipe17 Sep 30, 2024
e1696da
Merge branch 'develop' into 3046-plg-cloud
elipe17 Sep 30, 2024
84999f1
- extra tests. mroe to be added
elipe17 Oct 2, 2024
31bdfe2
Merge branch '3046-plg-cloud' of https://github.com/raft-tech/TANF-ap…
elipe17 Oct 2, 2024
2895f9c
Merge branch 'develop' into 3046-plg-cloud
elipe17 Oct 2, 2024
34798ae
Merge branch 'develop' into 3046-plg-cloud
elipe17 Oct 2, 2024
9d70a1a
- remove file
elipe17 Oct 3, 2024
ba32d76
Merge branch 'develop' of https://github.com/raft-tech/TANF-app into …
elipe17 Oct 3, 2024
89d5206
- add new tests for coverage
elipe17 Oct 3, 2024
2a35494
- linting
elipe17 Oct 3, 2024
8c53767
- add test and ignore filters
elipe17 Oct 3, 2024
783b83a
- Reset setting
elipe17 Oct 3, 2024
a920580
Update personas document for issue #3100
victoriaatraft Oct 7, 2024
1cd19cb
- add pg db for grafana locally
elipe17 Oct 9, 2024
0b98845
- Update deployment to hook grafana into rds
elipe17 Oct 9, 2024
a8cb835
- Updated deploy script to reqruie db service
elipe17 Oct 9, 2024
a7450d6
Merge branch 'develop' of https://github.com/raft-tech/TANF-app into …
elipe17 Oct 9, 2024
09e1359
Merge branch 'develop' into victoriaatraft-patch-1
victoriaatraft Oct 9, 2024
75ac67d
Update 2020, Summer - Understanding Stakeholders and creating persona…
victoriaatraft Oct 9, 2024
9c52c54
Merge branch 'develop' into victoriaatraft-patch-1
ADPennington Oct 10, 2024
bb44fb9
Merge branch 'develop' into 3046-plg-cloud
ADPennington Oct 10, 2024
e0f1ebe
Update docs/User-Experience/Research-Syntheses/2020, Summer - Underst…
victoriaatraft Oct 11, 2024
3357e84
show banner and hide errors if file submitted before 5/31/2024
jtimpe Oct 11, 2024
14cc136
add tests
jtimpe Oct 11, 2024
0dbb377
Update docs/User-Experience/Research-Syntheses/2020, Summer - Underst…
victoriaatraft Oct 11, 2024
8ea93fd
Update docs/User-Experience/Research-Syntheses/2020, Summer - Underst…
victoriaatraft Oct 11, 2024
10b2b9c
Merge branch 'develop' into 3046-plg-cloud
elipe17 Oct 15, 2024
7ee53a6
more descriptive param names
jtimpe Oct 15, 2024
b9a9724
Update docs/User-Experience/Research-Syntheses/2020, Summer - Underst…
victoriaatraft Oct 15, 2024
aa093a9
Update docs/User-Experience/Research-Syntheses/2020, Summer - Underst…
victoriaatraft Oct 15, 2024
0e65fbc
Update docs/User-Experience/Research-Syntheses/2020, Summer - Underst…
victoriaatraft Oct 15, 2024
7eb9e06
Update docs/User-Experience/Research-Syntheses/2020, Summer - Underst…
victoriaatraft Oct 15, 2024
e95876b
Update 2020, Summer - Understanding Stakeholders and creating persona…
victoriaatraft Oct 15, 2024
0f20e88
Update 2020, Summer - Understanding Stakeholders and creating persona…
victoriaatraft Oct 16, 2024
c01dd44
Update 2020, Summer - Understanding Stakeholders and creating persona…
victoriaatraft Oct 16, 2024
4e0b1bb
Merge branch 'develop' into 3014-outdated-submissions-banner
jtimpe Oct 17, 2024
b1a89bd
Merge branch 'develop' into 3014-outdated-submissions-banner
ADPennington Oct 17, 2024
f305d2a
Merge branch 'develop' into 3014-outdated-submissions-banner
ADPennington Oct 18, 2024
37a56ea
Merge branch 'develop' of https://github.com/raft-tech/TANF-app into …
elipe17 Oct 18, 2024
32c07df
Merge branch '3046-plg-cloud' of https://github.com/raft-tech/TANF-ap…
elipe17 Oct 18, 2024
c2d358c
Merge in test env
reitermb Oct 21, 2024
4fa79a1
Update index.html
reitermb Oct 22, 2024
d492d1d
Update docs/User-Experience/Research-Syntheses/2020, Summer - Underst…
reitermb Oct 22, 2024
6e6898a
Merge branch 'develop' into victoriaatraft-patch-1
reitermb Oct 22, 2024
b098d05
include reparse in data files api response
jtimpe Oct 24, 2024
a1ffb0d
check reparse finished_at if submission outdated
jtimpe Oct 24, 2024
82d0bb6
lint, clean up comments
jtimpe Oct 24, 2024
3eb696f
Merge branch 'develop' into 3014-outdated-submissions-banner
jtimpe Oct 24, 2024
ce65bec
Merge branch 'develop' into 3046-plg-cloud
elipe17 Oct 24, 2024
a10638d
Merge pull request #3239 from raft-tech/release-notes-3.6.5
reitermb Oct 24, 2024
d7291fb
Merge branch 'develop' into 3014-outdated-submissions-banner
ADPennington Oct 24, 2024
25b762b
Merge branch 'develop' into victoriaatraft-patch-1
reitermb Oct 25, 2024
f8cf20a
Merge branch 'develop' into 3046-plg-cloud
elipe17 Oct 25, 2024
78ac4eb
Merge pull request #3214 from raft-tech/victoriaatraft-patch-1
reitermb Oct 25, 2024
da67b03
Feat/3171 nexus integration (#3200)
andrew-jameson Oct 25, 2024
4fec7f2
Merge branch 'develop' into 3046-plg-cloud
elipe17 Oct 25, 2024
9ff75dc
Merge branch 'develop' into 3014-outdated-submissions-banner
ADPennington Oct 25, 2024
d54b99c
- Update script to use env specific vars for staging
elipe17 Oct 28, 2024
d9cb995
Doc/3199 monitoring adr (#3210)
andrew-jameson Oct 28, 2024
9bd3deb
Fixes formatting issue
reitermb Oct 28, 2024
cb51a68
Merge pull request #3247 from raft-tech/persona-format-fix
reitermb Oct 28, 2024
1414394
Merge branch 'develop' into 3046-plg-cloud
elipe17 Oct 28, 2024
24ad902
3224 removed PII from logs
raftmsohani Oct 29, 2024
f54a31c
Merge branch 'develop' into 3224-audit-logger
raftmsohani Oct 29, 2024
ec396bd
Removed PII
raftmsohani Oct 29, 2024
9e8904c
Merge branch '3224-audit-logger' of github.com:raft-tech/TANF-app int…
raftmsohani Oct 29, 2024
d69b552
linting
raftmsohani Oct 29, 2024
eb8f661
move outdated submission processing to backend, make configurable
jtimpe Oct 29, 2024
cc63781
revert reparse_file_metas change
jtimpe Oct 29, 2024
11fc447
add reprocessed indicators to submission history tables
jtimpe Oct 29, 2024
15dafb6
Merge branch 'develop' into 3014-outdated-submissions-banner
jtimpe Oct 29, 2024
d9d2680
- remove var
elipe17 Oct 30, 2024
e076f07
- new branch to remove cred leak
elipe17 Oct 30, 2024
ce8df47
Merge branch 'develop' into 3046-plg-cloud
ADPennington Oct 31, 2024
6650108
Merge branch '3046-plg-cloud' of https://github.com/raft-tech/TANF-ap…
elipe17 Oct 31, 2024
512eff3
clear cookies before visit
jtimpe Nov 1, 2024
f2f91ea
Merge pull request #3192 from raft-tech/3046-plg-cloud
elipe17 Nov 1, 2024
a400aab
Merge branch 'develop' into 3242-local-alert-manager-new
elipe17 Nov 1, 2024
d823eff
Merge branch 'develop' into 2435-hhs-env-vars
elipe17 Nov 1, 2024
adf48aa
Merge branch 'develop' into 3224-audit-logger
raftmsohani Nov 5, 2024
caf1c8a
update outdated error report language
jtimpe Nov 5, 2024
607b51e
Merge branch 'develop' into 3014-outdated-submissions-banner
jtimpe Nov 5, 2024
b0d3232
update language
jtimpe Nov 5, 2024
bf0bcec
Update cf-check.sh
andrew-jameson Nov 6, 2024
f8e618a
Merge pull request #3272 from raft-tech/hotfix/3171-cloudfoundry-binary
jtimpe Nov 6, 2024
825f5b3
Merge branch 'develop' into 3014-outdated-submissions-banner
jtimpe Nov 6, 2024
18d7363
fix date compare
jtimpe Nov 7, 2024
46e504c
Merge branch 'develop' into 2435-hhs-env-vars
andrew-jameson Nov 7, 2024
b019471
3224 added an extra security layer to transform function
raftmsohani Nov 7, 2024
6e22311
Merge branch 'develop' into 3141-fix-cypress
jtimpe Nov 8, 2024
d988e2d
remove old submissions banner, column text, serializer data
jtimpe Nov 8, 2024
a098543
rm tests
jtimpe Nov 8, 2024
5ed2551
Update scripts/deploy-backend.sh
andrew-jameson Nov 8, 2024
28d4b78
rm unused
jtimpe Nov 8, 2024
45b5a2e
Merge branch 'develop' into 3242-local-alert-manager-new
elipe17 Nov 12, 2024
24f3884
- Update networking in deploy.sh
elipe17 Nov 12, 2024
269b59e
- add missing arg for grafana deploy
elipe17 Nov 12, 2024
b40c243
- Update deploy script networking to test in dev initially
elipe17 Nov 12, 2024
94c6e0d
first draft memo
jtimpe Nov 13, 2024
6fd3a9b
- remove prod tunnel
elipe17 Nov 13, 2024
f9e3b7a
- fix domain
elipe17 Nov 13, 2024
ce4cb8e
Merge branch 'develop' into 3224-audit-logger
raftmsohani Nov 13, 2024
4b278d1
rm unused properties, rename used property
jtimpe Nov 13, 2024
4e50d9d
- Remove prod networking from deploy-backend.sh
elipe17 Nov 13, 2024
69f0bb4
Merge pull request #3260 from raft-tech/3141-fix-cypress
jtimpe Nov 13, 2024
6e5632e
- Updated syntax errors in script
elipe17 Nov 14, 2024
57ad0cc
- update comment
elipe17 Nov 14, 2024
e36ddf9
- Update grafana session settings
elipe17 Nov 14, 2024
08a8604
- remove very annoying log messages
elipe17 Nov 14, 2024
b3f57ad
- Move all PLG networking to plg deploy script
elipe17 Nov 14, 2024
bd0c36e
Merge branch 'develop' into 3222-plg-prod
elipe17 Nov 14, 2024
44d0923
- re add tests
elipe17 Nov 14, 2024
e69df5c
Merge branch '3222-plg-prod' of https://github.com/raft-tech/TANF-app…
elipe17 Nov 14, 2024
7b82bbe
Merge branch 'develop' of https://github.com/raft-tech/TANF-app into …
elipe17 Nov 14, 2024
d51a85d
- remove dev test code
elipe17 Nov 14, 2024
3b9e2fe
Merge branch '2435-hhs-env-vars' of https://github.com/raft-tech/TANF…
elipe17 Nov 14, 2024
6ff30f6
- Revert postgres repo back to postgres apt repo instead of nexus
elipe17 Nov 14, 2024
06f8013
Merge pull request #3283 from raft-tech/pg-repo-hotfix
elipe17 Nov 15, 2024
c585b75
Merge branch 'develop' into 3222-plg-prod
elipe17 Nov 15, 2024
3402824
Merge branch 'develop' into 3242-local-alert-manager-new
elipe17 Nov 15, 2024
d6c1cfc
Merge branch 'develop' into 2435-hhs-env-vars
elipe17 Nov 15, 2024
53d3641
Merge pull request #3246 from raft-tech/2435-hhs-env-vars
elipe17 Nov 15, 2024
7b03c7f
Merge branch 'develop' into 3222-plg-prod
elipe17 Nov 15, 2024
11abfb5
Merge branch 'develop' into 3014-outdated-submissions-banner
jtimpe Nov 15, 2024
fa5f15c
Merge branch 'develop' into 3242-local-alert-manager-new
ADPennington Nov 15, 2024
a6f7352
wip singleton class, migrations
jtimpe Nov 15, 2024
5500be8
Merge pull request #3231 from raft-tech/3014-outdated-submissions-banner
jtimpe Nov 15, 2024
743d580
finalize memo
jtimpe Nov 18, 2024
181e671
cleanup
jtimpe Nov 18, 2024
e5d20da
Merge branch 'develop' into 2562-spike-parsing-log
jtimpe Nov 18, 2024
6bf0105
- templated emails
elipe17 Nov 18, 2024
75a43cd
Merge branch 'develop' into 3242-local-alert-manager-new
elipe17 Nov 18, 2024
dd5ea65
Merge pull request #3252 from raft-tech/3242-local-alert-manager-new
elipe17 Nov 19, 2024
d86c181
Merge branch 'develop' of https://github.com/raft-tech/TANF-app into …
elipe17 Nov 19, 2024
2b515d9
Update docs/Technical-Documentation/tech-memos/parsing-log-per-file/p…
jtimpe Nov 19, 2024
750ff44
Merge branch 'develop' into 2562-spike-parsing-log
jtimpe Nov 19, 2024
85a1710
Update docs/Technical-Documentation/tech-memos/parsing-log-per-file/p…
jtimpe Nov 19, 2024
96f45d4
Update docs/Technical-Documentation/tech-memos/parsing-log-per-file/p…
jtimpe Nov 19, 2024
2b07c68
add s3 explanation
jtimpe Nov 19, 2024
455ca37
3269 [BUGFIX] refactor reparse js file (#3280)
raftmsohani Nov 20, 2024
8d01df3
Merge branch 'develop' into 2562-spike-parsing-log
jtimpe Nov 20, 2024
2b3b674
Merge pull request #3275 from raft-tech/2562-spike-parsing-log
jtimpe Nov 20, 2024
c230985
transparent background (#3312)
andrew-jameson Nov 22, 2024
5c612f6
Merge branch 'develop' into 3224-audit-logger
raftmsohani Nov 22, 2024
47fb1c3
- dummy change
elipe17 Nov 22, 2024
c7ee54a
Merge branch 'develop' into 3222-plg-prod
elipe17 Nov 22, 2024
a24e826
- revert
elipe17 Nov 22, 2024
ce3c45c
Merge branch '3222-plg-prod' of https://github.com/raft-tech/TANF-app…
elipe17 Nov 22, 2024
503c644
- Updated nginx conf to use correct auth check endpoint
elipe17 Nov 25, 2024
357f9d4
Merge pull request #3250 from raft-tech/3224-audit-logger
elipe17 Nov 25, 2024
a5fce6b
- Updated dashboard configs
elipe17 Nov 26, 2024
c051bec
- Made simple README outlining grafana's rbac/auth
elipe17 Nov 27, 2024
91a4904
Merge branch 'develop' into 3222-plg-prod
ADPennington Nov 27, 2024
2647144
- Use correct team names
elipe17 Nov 27, 2024
58c69c8
Merge branch '3222-plg-prod' of https://github.com/raft-tech/TANF-app…
elipe17 Nov 27, 2024
97bf95e
Merge pull request #3276 from raft-tech/3222-plg-prod
elipe17 Nov 27, 2024
c98eb57
Merge pull request #551 from raft-tech/release/v3.7.1-sprint-111
ADPennington Dec 2, 2024
e9bdc33
Merge pull request #552 from raft-tech/release/v3.7.2-sprint-112
ADPennington Dec 2, 2024
d728223
Merge branch 'master' into main
ADPennington Dec 4, 2024
59373e0
Update research-synthesis-issue-template.md (#3286)
victoriaatraft Dec 4, 2024
1bba246
Update design-deliverable-issue-template.md (#3287)
victoriaatraft Dec 4, 2024
d4b7f76
Hotfix/cf check (#3337)
andrew-jameson Dec 4, 2024
1bd1a43
Hotfix/cf check (#3339)
andrew-jameson Dec 4, 2024
52653f2
Hotfix/cf check (#3340)
andrew-jameson Dec 5, 2024
b8fb9cd
Merge pull request #557 from raft-tech/release/v3.7.4-cfcli-hotfix
ADPennington Dec 5, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion .circleci/build-and-test/commands.yml
Original file line number Diff line number Diff line change
Expand Up @@ -25,7 +25,7 @@
fi
echo "export CURRENT_FLAG=$CURRENT_FLAG" >> $BASH_ENV
- run:
name: Upload code coverage report if target branch
name: Upload code coverage report of target branch
command: codecov -t "$CODECOV_TOKEN" -f <<parameters.coverage-report>> -F "$CURRENT_FLAG"

install-nodejs-machine:
Expand Down
22 changes: 3 additions & 19 deletions .circleci/deployment/commands.yml
Original file line number Diff line number Diff line change
Expand Up @@ -226,15 +226,8 @@
default: CF_APP
steps:
- checkout
- run:
name: Install dependencies
command: |
apk update
apk add jq
apk add curl
# TODO: Add Signature check
curl -L "https://packages.cloudfoundry.org/stable?release=linux64-binary&version=v7&source=github" | tar -zx
mv cf7 /usr/local/bin/cf
- sudo-check
- cf-check
- login-cloud-dot-gov:
cf-password: <<parameters.cf-password>>
cf-username: <<parameters.cf-username>>
Expand Down Expand Up @@ -285,16 +278,7 @@
type: string
steps:
- checkout
- run:
name: Install dependencies
command: |
sudo apt update
sudo apt install jq
sudo apt install curl
# TODO: Add Signature check
curl -L "https://packages.cloudfoundry.org/stable?release=linux64-binary&version=v7&source=github" | tar -zx
sudo mv cf7 /usr/local/bin/cf
sudo chmod +x /usr/local/bin/cf
- cf-check
- login-cloud-dot-gov:
cf-password: <<parameters.cf-password>>
cf-username: <<parameters.cf-username>>
Expand Down
1 change: 1 addition & 0 deletions .gitconfig
Original file line number Diff line number Diff line change
Expand Up @@ -14,3 +14,4 @@
allowed = .git/config:.*
allowed = .gitconfig:.*
allowed = .*DJANGO_SECRET_KEY=.*
allowed = ./tdrs-backend/plg/loki/manifest.yml:*
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,7 @@ assignees: ''
- [ ] Documentation work for the following has occurred:
- [ ] Relevant User stories.
- [ ] Recommended pa11y checks.
- [ ] Updating living UX documents, e.g. User Flows or Personas(if relevant).
- [ ] Updating living UX documents, e.g. User Flows, Personas, [Service Blueprint](https://www.figma.com/design/irgQPLTrajxCXNiYBTEnMV/TDP-Mockups-For-Feedback?node-id=9080-4762) (if relevant).
- [ ] Internal Raft Review has occurred to ensure DoD standards and QA
- [ ] Dev/Design sync has occurred; resulting tickets created
- [ ] The design is usable and accessible, meaning it adheres to definition of done standards for design work.
Expand Down
5 changes: 3 additions & 2 deletions .github/ISSUE_TEMPLATE/research-synthesis-issue-template.md
Original file line number Diff line number Diff line change
Expand Up @@ -13,7 +13,8 @@ assignees: ''

**AC:**

- [ ] A hack.md with the drafted synthesis has been reviewed.
- [ ] A Gitbook with the drafted synthesis has been reviewed.
- [ ] [TDP Service Blueprint](https://www.figma.com/design/irgQPLTrajxCXNiYBTEnMV/TDP-Mockups-For-Feedback?node-id=9080-4762) has been updated, as appplicable
- [ ] PR has been opened containing the final draft of the synthesis.
- [ ] Internal Raft Review has occurred to ensure DoD standards and QA
- [ ] The content is usable and accessible, meaning it adheres to definition of done standards for design work.
Expand All @@ -35,4 +36,4 @@ assignees: ''

**Supporting Documentation:**

- --Link to hack.md--
- --Link to the gitbook page--
5 changes: 5 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -109,4 +109,9 @@ cypress.env.json

# Patches
*.patch

# Logs
*.log

# DB seeds
tdrs-backend/*.pg
9 changes: 2 additions & 7 deletions Taskfile.yml
Original file line number Diff line number Diff line change
Expand Up @@ -2,11 +2,6 @@ version: '3'

tasks:

upload-kibana-objs:
desc: Upload dashboards to Kibana server
cmds:
- 'curl -X POST localhost:5601/api/saved_objects/_import -H "kbn-xsrf: true" --form file=@tdrs-backend/tdpservice/search_indexes/kibana_saved_objs.ndjson'

create-network:
desc: Create the external network
cmds:
Expand Down Expand Up @@ -251,7 +246,7 @@ tasks:
desc: Open a shell in the frontend container
dir: tdrs-frontend
cmds:
- docker-compose -f docker-compose.yml exec tdp-frontend sh
- docker-compose -f docker-compose.yml exec tdp-frontend bash

up:
desc: Start both frontend and backend web servers
Expand All @@ -268,4 +263,4 @@ tasks:
help:
desc: Show this help message
cmds:
- task --list
- task --list
5 changes: 4 additions & 1 deletion codecov.yml
Original file line number Diff line number Diff line change
Expand Up @@ -42,4 +42,7 @@ flags:
carryforward: true

ignore:
- "tdrs-backend/tdpservice/scheduling/db_backup.py"
- "tdrs-backend/tdpservice/scheduling/db_backup.py"
- "tdrs-backend/tdpservice/search_indexes/admin/mulitselect_filter.py"
- "tdrs-backend/tdpservice/email/helpers/account_access_requests.py"
- "tdrs-backend/tdpservice/search_indexes/admin/filters.py"
302 changes: 301 additions & 1 deletion docs/Security-Compliance/diagram.drawio

Large diffs are not rendered by default.

Binary file modified docs/Security-Compliance/diagram.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Original file line number Diff line number Diff line change
@@ -0,0 +1,68 @@
# 22. Monitoring Application Health and Performance

Date: 2024-09-30

## Status

Pending

## Context
Historic feedback highlighted an ongoing desire for improved alerting and monitoring mechanisms, particularly originating in issue [#831](https://github.com/raft-tech/TANF-app/issues/831) circa 2021. Currently, our cloud platform has limited logging features and user interface issues leading to a "blindness" to errors and stack traces that have occurred, ultimately impairing our ability to maintain system stability; additionally, the existing dashboards only offer live performance data lacking data over time or any archives. Without context for either performance or system logging, determination of anomalous or erroneous system behavior is not possible.

Additionally, we have experienced critical blocking issues related to our updates to both Elasticsearch (ES) and PostgreSQL, which have compounded the need for more proactive alerting and load-testing in lower environments. Without timely notifications, we risk delays in addressing failures that could escalate into more significant problems.


## Decision
We will build out a suite of tools in accordance with industry best practices to monitor our applications. Implementing a comprehensive monitoring and alerting ecosystem will not only help in identifying errors in real-time but also enable us to establish benchmarks based on historical data. This approach will foster a more proactive response strategy, ensuring that potential issues are mitigated before they impact our users or that system owners and system admins are aware of issues that have impacted users.

<p style="text-align:center; margin:0; padding:0;">Cloud Environments Workflow</p>

![Environments](../diagrams/TDP_Environments.png)

### Why Sentry
Sentry captures unhandled exceptions and incorporates detail context about exceptions including error messages, stack traces, affected URLs and user data information. Such information is essential in demystifying the cause of error.

Additionally, as can be seen in the image below, the following information is available:

- Frequency: shows the frequency detail of error
- Timeline: when has the error happened in a period
- Can create a ticket and assign automatically
- Variables at each step of stack trace. This is very important for debugging

<p style="text-align:center; margin:0; padding:0;">Issues with filter enabled</p>

![Issues with filter enabled](../images/sentry/1.%20Issues%20with%20filter%20enabled.png)

<p style="text-align:center; margin:0;padding:0;">Detail exceptions</p>

![Detail exceptions](../images/sentry/3.%20detail%20about%20exception.png)

<p style="text-align:center; margin:0; padding:0;">Full stack trace of the exceptions</p>

![Full stack trace of the exceptions](../images/sentry/4.%20full%20stack%20trace%20of%20the%20exceptions.png)


Performance monitoring in Sentry can greatly enhance the backend application by providing real-time insights into how the TANF app is performing. Sentry tracks various metrics such as response time, database queries, and external API calls. These metrics will help identify performance bottlenecks associated to the backend app.

A unique ability of Sentry is that it links performance issues and groups them together. This gives us the ability to visualize areas that consistently have poor performance. Allowing us to swarm and resolve the most frequent offenders that have the highest impact. Sentry also detects issues with web transactions, database queries, and function regressions (if the duration of function has increased).

### Why Prometheus-Loki-Grafana

Grafana shall provide a visualization dashboard for these various tools which will collect and aggregate performance metrics, system logs, and allow deeper analysis for all aspects of our systems: frontend, proxies, backend, databases, and even networking. Additionally, the development team will seek to hone a proactive alerting system for out-of-threshold issues and errors for improved visibility of system issues.

The storing of system logs will allow more expedient troubleshooting and debugging that is currently out of reach with Cloud.gov's existing Kibana interface for logging. The ability to find and correlate log events is critical to technical analysis of faults, performance degradation, and system's overall health.

By having our monitoring ecosystem take in performance metrics, we will garner performance metrics over time as opposed to simply a live snapshot as is currently provided. This will allow spotting of anomolous or out-of-bounds behaviors such as out of memory, high memory, cpu spikes, and disk thrashing.

Finally, having all of this data in one place will allow technical staff to easily cross-reference given time periods with problematic performance, ongoing issues, or error stacktraces leading to a holistic view of all of our applications both in lower tier development sites and in critical production.

## Consequences

* Increased platform costs for running these tools
* Time and effort maintaining and configuring these new systems
* "Noisy" notifications from from out-of-tune alerting
* Efforts made towards security compliance as these systems have intimate access to our systems and data
* Learning curve for technical staff

## Notes
Given the prohibitive costs of self-hosting Sentry in Cloud.gov, we propose using Sentry's Cloud SaaS offering which will alter the [boundary diagram](../../Security-Compliance/diagram.png). The other tools in use (PLG stack and associated), will be self-hosted and maintained by the technical staff both at Raft and OFA.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Loading