Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Keepalived not issuing keepalive failures in all cases #2190

Closed
echlebek opened this issue Oct 19, 2018 · 2 comments · Fixed by #2245
Closed

Keepalived not issuing keepalive failures in all cases #2190

echlebek opened this issue Oct 19, 2018 · 2 comments · Fixed by #2245
Assignees
Milestone

Comments

@echlebek
Copy link
Contributor

echlebek commented Oct 19, 2018

While doing QA on beta 6, keepalive failures were not always observed as expected. The behaviour appears to be stochastic.

Expected Behavior

When an agent with deregister: false is stopped, keepalive failures should occur after the keepalive timeout.

Current Behavior

Sometimes keepalive failures do occur, and sometimes they do not.

Steps to Reproduce (for bugs)

  1. Stop an agent
  2. Observe keepalive failure does not occur

Context

QA on sensu beta 6

Your Environment

Ubuntu

@palourde
Copy link
Contributor

palourde commented Oct 19, 2018

Might be related to #2077.

Some relevant information from the staging environment.

On backend-2:

Oct 19 15:10:20 ip-10-0-1-116 sensu-backend[3858]: {"component":"monitor","level":"debug","msg":"starting a monitor for the key /sensu.io/monitors/staging-agent-0","time":"2018-10-19T15:10:20Z"}
Oct 19 15:10:20 ip-10-0-1-116 sensu-backend[3858]: {"component":"monitor","level":"info","msg":"shutting down monitor for /sensu.io/monitors/staging-agent-0","time":"2018-10-19T15:10:20Z"}
Oct 19 15:10:28 ip-10-0-1-116 sensu-backend[3858]: {"component":"monitor","level":"debug","msg":"starting a monitor for the key /sensu.io/monitors/staging-agent-0","time":"2018-10-19T15:10:28Z"}
Oct 19 15:10:28 ip-10-0-1-116 sensu-backend[3858]: {"component":"monitor","level":"info","msg":"shutting down monitor for /sensu.io/monitors/staging-agent-0","time":"2018-10-19T15:10:28Z"}
Oct 19 15:10:48 ip-10-0-1-116 sensu-backend[3858]: {"component":"monitor","level":"debug","msg":"a lease for the key /sensu.io/monitors/staging-agent-0 already exist, extending it","time":"2018-10-19T15:10:48Z"}
Oct 19 15:11:08 ip-10-0-1-116 sensu-backend[3858]: {"component":"monitor","level":"debug","msg":"a lease for the key /sensu.io/monitors/staging-agent-0 already exist, extending it","time":"2018-10-19T15:11:08Z"}

The lease is only revoked in

if _, err := cli.Lease.Revoke(ctx, mon.leaseID); err != nil {

The fact that the monitor starts and stops immediately might provoke some weird corner case.

@annaplotkin annaplotkin added this to the Beta7 milestone Oct 20, 2018
@barryorourke
Copy link

This could be related to #2135

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

5 participants