Brief post-deployment application downtime (build memory)

ChrisBolman1 · June 25, 2024, 4:51pm

We have a fairly large application (15-16GB image) and we’ve noticed recently that when we deploy, our application has about 5 minutes of downtime where it seems the new application image is “building” before it can handle requests and serve traffic.

The application deployment succeeds
The router deployment succeeds
Pods show Service status: Running

https://console.qovery.com/organization/828d9c2c-fd72-4b4e-ab58-71de40ecfdd2/project/0bf26679-c2d6-48fd-a485-aed61158fb1c/environment/e089ff92-2749-4530-9a31-12c12aa39499/application/025f4bbb-abbd-4f68-bfbf-91ffa81475fa/

But we’ve noticed immediately post-deploy that if we look in the pods console, each pod has to incrementally build up (as in, go from 1% to 100% of the full image memory commitment), and pod CPU is fully maxed while this is happening. This takes a few minutes, then once it completes and we reach the full memory size of the image, CPU normalizes and the application is good to go.

Did anything change in terms of how Qovery handles image caching, build rate or anything like that in the past month or two? We used to deploy the same size image and application and the new application build was online almost instantly (maybe the router took a few seconds to resolve). Not aware of anything with our config, cluster, DNS or anything else that would be impacting this. As a result we’ve had to be more judicious about timing deploys for late nights or weekends when we can tolerate being down a few minutes.

Wondering if there’s an underlying reason and solution for this? Thanks

rophilogene · June 26, 2024, 3:05pm

Hi @ChrisBolman1,

I’ll let our product team (cc @a_carrano and @Julien_Dan) respond to your question about whether anything has changed in how Qovery handles image caching or anything build-related in the past month or two. To confirm, when did you first notice the change?

–

On a side note: I noticed that your prod app does not have Health Check probes configured, and I wonder if that is the cause of your issue. Is there a specific reason why they are not configured yet? (They are highly recommended for production apps, precisely because of what you described.)

a_carrano · July 1, 2024, 11:50am

Hi @ChrisBolman1,

Nothing has changed in the way we manage image caching or builds. As Romaric said, this seems to be mainly an issue with the health check configuration: without probes, traffic is routed to the new pods during the roll-out before they are ready to receive it, which makes the app unstable.
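To illustrate the health check point, here is a minimal sketch of what the application side could look like. It assumes a Python service. The `/healthz` and `/ready` paths, port 8080, and the `warm_up` function are hypothetical names chosen for this example and are not taken from the application in this thread. The idea is that the readiness endpoint returns 503 until start-up work has finished, so a readiness probe pointed at it would keep traffic on the old pods until the new ones can serve.

```python
import threading
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer

# Set once the (hypothetical) warm-up work has finished.
ready = threading.Event()


class ProbeHandler(BaseHTTPRequestHandler):
    """Answers liveness and readiness checks on assumed paths."""

    def do_GET(self):
        if self.path == "/healthz":
            # Liveness: the process is up and able to answer.
            self._reply(200, b"alive")
        elif self.path == "/ready":
            # Readiness: report success only after warm-up is complete.
            if ready.is_set():
                self._reply(200, b"ready")
            else:
                self._reply(503, b"warming up")
        else:
            self._reply(404, b"not found")

    def _reply(self, status, body):
        self.send_response(status)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        # Keep frequent probe requests out of the application logs.
        pass


def warm_up():
    # Placeholder for the real start-up work (loading data, caches, etc.).
    # Assumption: the app can tell when this work is done.
    ready.set()


def start_probe_server(port=8080):
    # Serve probes from a background thread so warm-up can run in parallel.
    server = ThreadingHTTPServer(("0.0.0.0", port), ProbeHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server
```

In this sketch the service would call `start_probe_server()` first and `warm_up()` afterwards, so the pod answers `/healthz` right away but only passes `/ready` once it can actually handle requests. The readiness probe in the application's health check settings would then target `/ready`, with an initial delay and failure threshold generous enough to cover the few minutes of start-up described above.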

system · July 8, 2024, 11:51am

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.
