Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add initial C10S variant #1498

Open
wants to merge 17 commits into
base: master
Choose a base branch
from
Open

Add initial C10S variant #1498

wants to merge 17 commits into from

Conversation

travier
Copy link
Member

@travier travier commented Apr 30, 2024

c9s: Point image-c9s to image-rhel-9.4


c9s: Use RPMs from 9.4 RHAOS repo


Revert "c9s.repo: temporarily use mirrored repos"

This reverts commit 88e41a0.


DoNotMerge: CI changes to test C10S builds


manifests: Add initial c10s based variant

See: #1466


kola-denylist.yaml: Update for c10s variant


c10s.repo: Skip GPG checks for now


Corresponding Fedora CoreOS changes: coreos/fedora-coreos-config#3015

@travier
Copy link
Member Author

travier commented Apr 30, 2024

Do not merge as it includes changes that we do not want but that are here to test in CI (see commits).
We'll have to update the Prow CI config once ready.
/hold

@openshift-ci openshift-ci bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Apr 30, 2024
@openshift-ci openshift-ci bot requested review from cverna and jlebon April 30, 2024 16:11
Copy link
Contributor

openshift-ci bot commented Apr 30, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: travier

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Apr 30, 2024
@travier
Copy link
Member Author

travier commented May 2, 2024

/retest

2 similar comments
@travier
Copy link
Member Author

travier commented May 3, 2024

/retest

@travier
Copy link
Member Author

travier commented May 6, 2024

/retest

@jlebon
Copy link
Member

jlebon commented May 6, 2024

Hmm, almost seems like the logs are truncated... This is probably fallout from coreos/coreos-assembler#3785.

jlebon added a commit to jlebon/coreos-assembler that referenced this pull request May 6, 2024
I can't reproduce this locally, but I have a suspicion that `tail` can
exit too quickly in some circumstances, causing truncated output:

openshift/os#1498 (comment)
coreos#3785 (comment)

Rather than having an unconditional `sleep`, let's make it easier to
test that theory by having an env var we can use to make it optional.
Then we'll test that in CI.

Mid-term, I'd like to revert 79b15c8 soon so we can go back to
virtio-serial which is just so much cleaner.
@jlebon
Copy link
Member

jlebon commented May 6, 2024

Let's see if coreos/coreos-assembler#3792 helps (then here, we'd export TAIL_SLEEP=3 or something).

jlebon added a commit to coreos/coreos-assembler that referenced this pull request May 6, 2024
I can't reproduce this locally, but I have a suspicion that `tail` can
exit too quickly in some circumstances, causing truncated output:

openshift/os#1498 (comment)
#3785 (comment)

Rather than having an unconditional `sleep`, let's make it easier to
test that theory by having an env var we can use to make it optional.
Then we'll test that in CI.

Mid-term, I'd like to revert 79b15c8 soon so we can go back to
virtio-serial which is just so much cleaner.
@jlebon
Copy link
Member

jlebon commented May 7, 2024

/retest

@jlebon
Copy link
Member

jlebon commented May 7, 2024

Ahh and indeed now we clearly see the error for all of them which means we were missing output. Yuck OK, so we need to fix the tail handling. Though I'll see if I can bang on the virtio-serial bug again.

@travier
Copy link
Member Author

travier commented May 7, 2024

Ahh and indeed now we clearly see the error for all of them which means we were missing output. Yuck OK, so we need to fix the tail handling. Though I'll see if I can bang on the virtio-serial bug again.

Thanks! I indeed had a missing change in those commits.

@travier
Copy link
Member Author

travier commented May 7, 2024

Hum, the workaround is not that ugly and only impacts CI here so maybe we should merge it for now until we've fixed this COSA.

@travier
Copy link
Member Author

travier commented May 7, 2024

CI fixes in openshift/release#51750

@jlebon
Copy link
Member

jlebon commented May 7, 2024

and only impacts CI here

So far. :) It's racy so I don't see why it couldn't happen in the prod pipeline. I'd hate for someone to be debugging a failure there and working with incomplete output thinking the error is happening somewhere other than it really is.

I guess though a major hack is to just add a sleep 1 at least on the cosa side.

@openshift-merge-robot openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label May 10, 2024
@openshift-merge-robot openshift-merge-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label May 13, 2024
@travier
Copy link
Member Author

travier commented May 13, 2024

Failing on missing the teamd package.


# [c10s-sig-cloud-okd]
# name=CentOS Stream 10 - SIG Cloud OKD 4.15
# baseurl=https://mirror.stream.centos.org/SIGs/10-stream/cloud/$basearch/okd-4.15/
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
# baseurl=https://mirror.stream.centos.org/SIGs/10-stream/cloud/$basearch/okd-4.15/
# baseurl=https://mirror.stream.centos.org/SIGs/10-stream/cloud/$basearch/okd-4.17/

This repo doesn't exist yet, but let's point it at 4.17 already. I'll set up the repo in the coming days.

@travier
Copy link
Member Author

travier commented Aug 28, 2024

We'll need the prep work for coreos/fedora-coreos-tracker#1787 as well

Copy link
Contributor

openshift-ci bot commented Sep 18, 2024

@travier: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
ci/prow/scos-9-build-test-qemu 9c0b25a link true /test scos-9-build-test-qemu
ci/prow/scos-9-build-test-metal 9c0b25a link true /test scos-9-build-test-metal

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants