consider changing the implementation of the Envoy readiness probe #4540

skriss · 2022-05-19T17:33:38Z

Currently, the Envoy readiness probe hits the /ready endpoint on the Envoy admin interface. (https://github.com/projectcontour/contour/blob/main/examples/contour/03-envoy.yaml#L82-L87)

envoyproxy/envoy#16425 documents potential issues with using this endpoint for health-checking Envoy, due to its handler running on the main thread.

We should consider changing the implementation of the readiness probe. One option would be to set up a static HTTP listener that serves a direct response specifically for health-checking.

The text was updated successfully, but these errors were encountered:

github-actions · 2022-09-29T00:33:29Z

The Contour project currently lacks enough contributors to adequately respond to all Issues.

This bot triages Issues according to the following rules:

After 60d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, the Issue is closed

You can:

Mark this Issue as fresh by commenting
Close this Issue
Offer to help out with triage

Please send feedback to the #contour channel in the Kubernetes Slack

sunjayBhatia · 2022-09-29T00:48:42Z

unstaling, this one seems worthwhile to keep around to consider properly

github-actions · 2022-11-29T00:22:41Z

The Contour project currently lacks enough contributors to adequately respond to all Issues.

This bot triages Issues according to the following rules:

After 60d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, the Issue is closed

You can:

Mark this Issue as fresh by commenting
Close this Issue
Offer to help out with triage

Please send feedback to the #contour channel in the Kubernetes Slack

github-actions · 2023-01-29T00:23:10Z

The Contour project currently lacks enough contributors to adequately respond to all Issues.

This bot triages Issues according to the following rules:

After 60d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, the Issue is closed

You can:

Mark this Issue as fresh by commenting
Close this Issue
Offer to help out with triage

Please send feedback to the #contour channel in the Kubernetes Slack

github-actions · 2023-03-01T00:25:48Z

The Contour project currently lacks enough contributors to adequately respond to all Issues.

This bot triages Issues according to the following rules:

After 60d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, the Issue is closed

You can:

Mark this Issue as fresh by commenting
Close this Issue
Offer to help out with triage

Please send feedback to the #contour channel in the Kubernetes Slack

skriss · 2023-12-12T15:22:51Z

xref #5771

tsaarni · 2025-04-07T10:40:34Z

The HealthCheck filter (proto) can send response that mimics the behavior of /ready endpoint in admin interface, when used with "no passthrough" option

No pass through: In this mode, the health check request is never passed to the local service. Envoy will respond with a 200 or a 503 depending on the current draining state of the server.

It is not exactly the same though, since the success response from admin interface is

HTTP/1.1 200 OK
cache-control: no-cache, max-age=0
content-type: text/plain; charset=UTF-8
date: Mon, 07 Apr 2025 10:20:03 GMT
server: envoy
transfer-encoding: chunked
x-content-type-options: nosniff
x-envoy-upstream-service-time: 0

LIVE

while from HealthCheck filter it is

HTTP/1.1 200 OK
content-length: 0
date: Mon, 07 Apr 2025 10:24:15 GMT
server: envoy
x-envoy-upstream-healthchecked-cluster: projectcontour

During draining the admin interface responds:

HTTP/1.1 503 Service Unavailable
cache-control: no-cache, max-age=0
connection: close
content-type: text/plain; charset=UTF-8
date: Mon, 07 Apr 2025 10:36:57 GMT
server: envoy
transfer-encoding: chunked
x-content-type-options: nosniff
x-envoy-upstream-service-time: 0

DRAINING

and HealthCheck responds

HTTP/1.1 503 Service Unavailable
connection: close
content-length: 0
date: Mon, 07 Apr 2025 10:30:58 GMT
server: envoy
x-envoy-immediate-health-check-fail: true
x-envoy-upstream-healthchecked-cluster: projectcontour

The behavior and status codes are the same but response body and headers are not.

skriss added kind/bug Categorizes issue or PR as related to a bug. lifecycle/needs-triage Indicates that an issue needs to be triaged by a project contributor. labels May 19, 2022

github-actions bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Sep 29, 2022

sunjayBhatia removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Sep 29, 2022

github-actions bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 29, 2022

sunjayBhatia removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 29, 2022

github-actions bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 29, 2023

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Mar 1, 2023

sunjayBhatia removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Mar 1, 2023

sunjayBhatia reopened this Mar 1, 2023

sunjayBhatia added the priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. label Mar 1, 2023

sunjayBhatia added this to Contour Mar 1, 2023

skriss mentioned this issue Dec 12, 2023

Readiness probe failed #5771

Closed

tsaarni linked a pull request Apr 7, 2025 that will close this issue

Use HealthCheck filter for readiness probe #6986

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

consider changing the implementation of the Envoy readiness probe #4540

consider changing the implementation of the Envoy readiness probe #4540

skriss commented May 19, 2022 •

edited

Loading

github-actions bot commented Sep 29, 2022

sunjayBhatia commented Sep 29, 2022

github-actions bot commented Nov 29, 2022

github-actions bot commented Jan 29, 2023

github-actions bot commented Mar 1, 2023

skriss commented Dec 12, 2023

tsaarni commented Apr 7, 2025

consider changing the implementation of the Envoy readiness probe #4540

consider changing the implementation of the Envoy readiness probe #4540

Comments

skriss commented May 19, 2022 • edited Loading

github-actions bot commented Sep 29, 2022

sunjayBhatia commented Sep 29, 2022

github-actions bot commented Nov 29, 2022

github-actions bot commented Jan 29, 2023

github-actions bot commented Mar 1, 2023

skriss commented Dec 12, 2023

tsaarni commented Apr 7, 2025

skriss commented May 19, 2022 •

edited

Loading