-
Notifications
You must be signed in to change notification settings - Fork 3.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cilium-cli/connectivity: additionally check for container restarts #36299
Conversation
5e37784
to
b905019
Compare
6617765
to
92f072d
Compare
/test |
This comment was marked as outdated.
This comment was marked as outdated.
This comment was marked as outdated.
This comment was marked as outdated.
Extend the `no-errors-in-logs` check to additionally fail in case any of the Cilium-related pods have a container restart count greater than zero. This allows catching possible issues (e.g., panics) that subsequently resolve automatically, hence not preventing the successful completion of all the other tests. Signed-off-by: Marco Iorio <marco.iorio@isovalent.com>
92f072d
to
932912a
Compare
/test |
2 similar comments
/test |
/test |
Run the tests three times, and didn't observe any failure related to the new test. Marking ready for review. The current implementation enables the check for stable versions as well. That looks reasonable to me considering that container restarts are typically fairly serious, but happy to put it behind a version check if reviewers feel more inclined so. |
@thorn3r Gentle ping 🙏 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nice work 👍
(from past experience - I think it would have been safer to only roll this out for |
I was honestly a bit on the fence as well on whether to enable it on all versions or main only:
|
Looks like we caught some on v1.15 and v1.16 in |
🤦, sorry, I should have tested this against all branches in advance 😭 Looking at a couple of failures, the agent seems to panic due to the following, which, well, seems fairly suspicious to me.
|
I've opened #37823 to skip the check on v1.16 and earlier. |
Extend the
no-errors-in-logs
check to additionally fail in case any of the Cilium-related pods have a container restart count greater than zero. This allows catching possible issues (e.g., panics) that subsequently resolve automatically, hence not preventing the successful completion of all the other tests.