Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Upgrade next to 14.2.16 #5756

Merged
merged 1 commit into from
Nov 8, 2024
Merged

Upgrade next to 14.2.16 #5756

merged 1 commit into from
Nov 8, 2024

Conversation

clee2000
Copy link
Contributor

@clee2000 clee2000 commented Oct 11, 2024

Upgrading cuz I think it's a good idea to be on a more recent version. idk what the consequences will be

Theres also next 15.something but that came out really recently and next-auth doesn't list it as a compatible version

octokit types changed which is why webhookToDyanamo looks different. Functionality should be the same, mostly just type changes + casting, and we have tests after #5869

Cousin to #5862 but does both slightly less and slightly more

Copy link

vercel bot commented Oct 11, 2024

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Comments Updated (UTC)
torchci ✅ Ready (Inspect) Visit Preview 💬 Add feedback Nov 8, 2024 5:44pm

@huydhn
Copy link
Contributor

huydhn commented Nov 7, 2024

Are there any setup I need to do after this land? brew install node?

@clee2000
Copy link
Contributor Author

clee2000 commented Nov 8, 2024

Are there any setup I need to do after this land? brew install node?

I do rm -rf node_modules && yarn install. I can't remember if I need to brew install node

@clee2000 clee2000 changed the title Upgrade node to 14.2.16 Upgrade next to 14.2.16 Nov 8, 2024
Copy link
Contributor

@huydhn huydhn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code-wise this LGTM! Clicking around the preview seems fine, maybe we should keep an eye on the number of 500 from vercel after landing this, to make sure that nothing breaks

@clee2000 clee2000 merged commit ce0896e into main Nov 8, 2024
7 checks passed
@clee2000 clee2000 deleted the csl/node_upgrade branch November 8, 2024 22:11
@clee2000 clee2000 restored the csl/node_upgrade branch November 8, 2024 22:11
clee2000 added a commit that referenced this pull request Nov 8, 2024
huydhn added a commit that referenced this pull request Nov 22, 2024
This change fixes a couple of issues with the workflow that refreshes
Dr.CI results for all open PRs. The key take away is that this API call
scale on the number of open pull requests on a repo. And on PyTorch, it
now takes longer than 120 seconds to finish. When that limit is reached,
the Vercel function (AWS lambda) will terminate the execution and all
PRs that are still in queue will be dropped. Their Dr.CI comments will
surely become stale.

Here is an example of the failure
https://github.com/pytorch/test-infra/actions/runs/11943802339/job/33293533522.
The error is FUNCTION_INVOCATION_TIMEOUT
(https://github.com/pytorch/test-infra/actions/runs/11964503897/job/33356932041#step:3:136),
and it stops at 2 minutes sharp. It's defined at
https://vercel.com/fbopensource/torchci/settings/functions.

* Follow
https://vercel.com/docs/functions/configuring-functions/duration to
increase the max duration to 900 seconds, the max value for enterprise
account defined at
https://vercel.com/docs/functions/runtimes#max-duration.
* Also fix a bug in `isTime0` where the value is now NaN instead of 0,
maybe this is related to our recent next.js upgrade
#5756
* Re-factor the workflow to get rid of lots of duplicated code
* Also surface the failure better via curl as the current command
returns successfully and masks the failure, i.e.
https://github.com/pytorch/test-infra/actions/runs/11964503897/job/33356932041#step:3:136

A final note, during my debug, I see this new failure shows up flakily
from time to time. I'll take a look at it in another PR as it doesn't
happen frequently (although it also causes the Dr.CI comment on the PR
in question to go stale temporarily)

```
Failed to update PR 139760 Error: Client network socket disconnected before secure TLS connection was established
    at TLSSocket.onConnectEnd (node:_tls_wrap:1732:19)
    at TLSSocket.emit (node:events:525:35)
    at endReadableNT (node:internal/streams/readable:1696:12)
    at process.processTicksAndRejections (node:internal/process/task_queues:90:21) {
  code: 'ECONNRESET',
  path: null,
  host: 'hyt81izu0c.us-east-1.aws.clickhouse.cloud',
  port: 8443,
  localAddress: undefined
}
```

### Testing

```
time curl --request POST \
  --url 'https://torchci-git-address-drci-refresh-issue-fbopensource.vercel.app/api/drci/drci' \
  --header 'Authorization: REDACT' \
  --data 'repo=pytorch' \
  --silent --output /dev/null --show-error --fail
```

return 200 OK now even when the runtime is 3+ minutes (3:12.56 total),
it was 504 before
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants