Site with a status code 503 during deployments

FlorianSuchan · May 11, 2022, 8:53am

During deployments we sometimes run into “503 Service Temporarily Unavailable” errors, which isn’t nice for the user.

Any idea why this happens? We would have expected that there is a rolling update happening, so new pods being created and the ones with older version then being shut down. But no service interruption. What can we do to mitigate this?

rophilogene · May 12, 2022, 7:53am

Hi @FlorianSuchan ,

You should not get a 503. The 503 is coming from your application or something else (E.g NGINX)? Can you post a screenshot or at least give the steps to reproduce it? cc @Pierre_Mavro

The rolling update is the strategy we use for deploying the new version of your app. You must not get any downtime. However, it can happen since Kubernetes relies on a probe to check that your app is up and running before routing the incoming traffic to the new version.

It’s possible that your app is seen as being ready while it’s not the case. If you give me the steps to reproduce the issue, I will give a shot.

FlorianSuchan · May 12, 2022, 3:11pm

Hi @rophilogene thanks for helping us, please see the screenshot attached.

Pierre_Mavro · May 20, 2022, 5:20pm

Hi @FlorianSuchan ,

Sorry for the late answer. This happens because your application takes time to start and your port is open before the application is able to serve traffic.

In a near future, we’ll add an option to define a check mechanism (defined by the user) to ensure the application is ready to handle the incoming traffic.

In the meantime, I advise your to update your code and only open the port of your application when it’s ready to serve traffic. This way you will never encounter this kind of issue anymore.

Pierre

FlorianSuchan · May 23, 2022, 10:01am

Hi @Pierre_Mavro ,

thanks for reaching out. I know of readiness/liveness probes on Kubernetes but where would I do that on a Rails app directly?

Can you share an ETA for these checks?

Best, Florian

Pierre_Mavro · July 19, 2022, 6:58am

Hi @FlorianSuchan ,

I’ve added similar issues to the Troubleshot documentation Troubleshoot | Docs | Qovery

Thanks

FlorianSuchan · July 19, 2022, 4:13pm

@Pierre_Mavro Since adding health checks on staging/production cluster you shipped some time ago, this is not an issue anymore

Topic		Replies	Views
Service is down - Error CrashLoopBackOff Deployment qovery	4	2134	March 25, 2024
Deployment Deployment	2	588	March 25, 2024
Pod stuck in pending status Questions and Answers	7	136	October 2, 2024
Issue deploying existing services Deployment	3	26	October 31, 2024
Brief post-deployment application downtime (build memory) Deployment qovery	3	42	July 1, 2024

Site with a status code 503 during deployments

Related topics