Ensure NGINX reload actually happens #664

kate-osborn · 2023-05-23T14:40:14Z

When we reload NGINX, we do not check whether the reload succeeds. We just sleep for a second to wait for the new workers to spin up. We need to verify that a reload was successful and return an error if it fails. Additionally, we should retry on failure.

Acceptance:

When an NGINX reload occurs, NKG will wait for the reload to finish, and if an error occurs, the error is reported to the user.
When NGINX fails to reload, NKG will retry reloading NGINX.

This work should resolve the following FIXME: https://github.com/nginxinc/nginx-kubernetes-gateway/blob/b00216eeac0f8edfb054d45ad073c9c004788762/internal/nginx/runtime/manager.go#L52

mpstefan · 2023-08-21T15:55:26Z

Logs could be the solution to this problem.

brianehlert · 2023-08-24T16:01:27Z

Logs should not be written to disk: log-rotation cron jobs do not run in containers, so tailing a log file should be avoided.
Watching the log output stream is possible but could be problematic.

sjberman · 2023-09-01T18:27:46Z

Our k8s Golang client has the ability to get a Pod's logs at any point (we would need to instantiate a client-go client vs the current controller-runtime client, though, since the latter doesn't support this yet). We'd likely want to look for the string "signal 1 (SIGHUP) received", which tells us a reload happened. We have to figure out how to tie it to the last reload call though, because there are going to be many instances of that string. A timestamp could probably be used for that.

brianehlert · 2023-09-01T18:43:26Z

What if, instead of looking to logs, you noticed that the NGINX workers changed?
That indicates a successful reload (the config was determined "good" by NGINX).

It does not solve the problem of applying configurations through the N+ API, which would not produce either of these signals.

sjberman · 2023-09-01T19:04:54Z

If the PIDs change, then maybe. I'm not sure if they do though. (I guess if the reload creates new workers, then they probably do)

EDIT: verified that at least the single worker process PID in my deployment did change on a reload.

kate-osborn self-assigned this May 23, 2023

kate-osborn changed the title ~~Placeholder FIXME: "ensure the reload actually happens"~~ Ensure NGINX reload actually happens May 23, 2023

kate-osborn added the bug Something isn't working label May 23, 2023

mpstefan added this to the v1.0.0 milestone Jun 2, 2023

pleshakov mentioned this issue Jun 22, 2023

Fix/increase nginx timeout #777

Merged

6 tasks

mpstefan mentioned this issue Jul 20, 2023

Metrics: Total and failed NGINX reloads #887

Closed

mpstefan modified the milestones: v1.0.0, v1.0.1 Aug 11, 2023

mpstefan mentioned this issue Aug 14, 2023

Ecosystem: Support liveness and readiness probes #542

Closed

mpstefan modified the milestones: v1.0.1, v1.0.0 Aug 14, 2023

mpstefan added refined Requirements are refined and the issue is ready to be implemented. size/large Estimated to be completed within two weeks labels Aug 21, 2023

mpstefan unassigned kate-osborn Aug 21, 2023

ciarams87 self-assigned this Sep 4, 2023

ciarams87 mentioned this issue Sep 5, 2023

Ensure NGINX reload occurs #1033

Merged

6 tasks

mpstefan mentioned this issue Sep 6, 2023

Make status updater production ready #691

Closed

ciarams87 closed this as completed in #1033 Sep 11, 2023
