Auto-scaling with multiple apps

I have an issue with my Qovery configuration.

On my project, for the prod environment, I have several apps that I need to deploy.

The problem I have is that the deployment of those apps is distributed over multiple different instances, which means that I have many more running EC2 instances than I need.

At the time I’m writing this, we have 14 EC2 instances running, all of them using less than 15% CPU. But because the apps are spread over multiple instances, no scale-down can happen, which means we are currently running at least 3 or 4 times more EC2 instances than we need. And that costs us a lot of money.

So, is there some way/configuration to make sure that all our apps are deployed on the same instances, so that we can have just 4 or 5 running instead of 14?
I haven’t found any setting for that so far.


Hello @Talal

This kind of configuration is not available, because we don’t recommend this setup: a node can crash, and if all the pods of an application run on that node, the application becomes unavailable.

The only configuration available is the deployment.antiaffinity.pod.
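For context, in raw Kubernetes terms this kind of setting maps to pod anti-affinity, which spreads an app’s replicas across nodes. A minimal sketch of what a "preferred" anti-affinity rule looks like in a Deployment spec (names, labels, and image are illustrative, not your actual configuration):

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: my-app                # illustrative name
spec:
  replicas: 3
  selector:
    matchLabels:
      app: my-app
  template:
    metadata:
      labels:
        app: my-app
    spec:
      affinity:
        podAntiAffinity:
          # "preferred" spreads replicas across nodes when possible,
          # without blocking scheduling when it is not
          preferredDuringSchedulingIgnoredDuringExecution:
            - weight: 100
              podAffinityTerm:
                labelSelector:
                  matchLabels:
                    app: my-app
                topologyKey: kubernetes.io/hostname
      containers:
        - name: my-app
          image: my-app:latest   # illustrative image
```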

Could you please provide the URL of your cluster, so that we can check whether the recently implemented auto-scaler optimization has been deployed successfully? (see the changelog)


Thanks for the answer @Pierre_Gerbelot.

I’m not sure I understand the problem with this configuration. There are always multiple nodes running (at least 3 in the setup I have in mind), so if one crashes with all its pods, the others will keep working until a new node is created, no?

The identifier of my cluster is: aa618a4b-f934-425a-99df-aac82c8cac32 (organization is 3842bf65-225d-43f5-8cb2-6807a0f1262f)…


When checking your nodes’ resource consumption, I can see that you have requested much more CPU than you actually use. None of your nodes has less than 60% of its CPU requested, so the cluster autoscaler can’t drop any node.
But the actual utilization is indeed very low. I encourage you to review the resources assigned to your services in the resource settings, to make sure they are aligned with actual usage.
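To make the scale-down logic concrete: the cluster autoscaler looks at *requested* CPU, not actual usage. A node only becomes a removal candidate when the sum of its pods’ requests falls below a utilization threshold (the autoscaler’s `--scale-down-utilization-threshold` flag, 0.5 by default). A rough sketch of that check, with made-up numbers:

```python
def scale_down_candidate(requested_mcpu: int, allocatable_mcpu: int,
                         threshold: float = 0.5) -> bool:
    """A node can only be removed when requested / allocatable < threshold.
    Actual CPU *usage* plays no role in this check."""
    return requested_mcpu / allocatable_mcpu < threshold

# Illustrative numbers for a node with 2000m allocatable CPU:
print(scale_down_candidate(1400, 2000))  # 70% requested -> False, node stays
print(scale_down_candidate(800, 2000))   # 40% requested -> True, removal candidate
```

This is why lowering the requested resources per service (not the actual usage) is what unblocks the scale-down.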

In this post, you can find a complete explanation of how Kubernetes resource allocation works.

I hope it helps.

Thanks @Pierre_Gerbelot.

I have some questions though.
Just to be sure, can you confirm that:
- Allocatable is the maximum amount of CPU that can be used on the node
- Requested is the sum of the CPU requested by all the apps installed on the node
- Utilization is the actual usage (at the time of your screenshot)

And are you saying that scaling down is based on the requested CPU, not the CPU actually used?

Also, how are cron jobs handled? Let’s say a cron job runs every day at 22:00 and needs 0.5 CPU. Will that 0.5 CPU be included in the “Requested” total all day, or only when the cron is about to start?

Sorry to meddle in the conversation, @Pierre_Gerbelot, but that resource utilization/requested graph is very useful. What command did you use to get that info for our nodes?

The data was dumped using the following tool; you can find a description of all the columns in the README.

When Kubernetes starts a pod on a node, it allocates the requested resources for that pod. Regarding cron jobs, pods are started only for the duration of the run, so their requested resources count against the node only during that period.
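To make the cron-job point concrete: the requests declared on a CronJob’s container only count toward a node’s “Requested” total while a job pod is actually running. A minimal sketch (the name, schedule, and image are illustrative):

```yaml
apiVersion: batch/v1
kind: CronJob
metadata:
  name: nightly-job            # illustrative name
spec:
  schedule: "0 22 * * *"       # every day at 22:00
  jobTemplate:
    spec:
      template:
        spec:
          restartPolicy: Never
          containers:
            - name: job
              image: my-job:latest   # illustrative image
              resources:
                requests:
                  cpu: 500m   # counted against the node only while the pod runs
```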


Thanks @Pierre_Gerbelot.

Would you know if there is a tool that tracks the CPU usage of a container over time?
My goal is to see and understand the real CPU consumption of a container, so that I can set its resources to a better value (rather than just doing it by trial and error).


@Pierre_Gerbelot I reviewed the settings for some of the apps, so it’s a bit better, but still far from what I would expect.

For example, right now, this is what I have:

There are 4 nodes with requested CPU below 60% (and very low usage), but no scale-down is happening. Would you know why?

Also, I have another question. When I allocate some resources to an app, let’s say 500 mCPU, does that mean that the app will use at most 500 mCPU, or can it use more if there is some available CPU in the node? If it’s the latter, I could set lower resources on each app.


Hi @Talal ,

I think you might find this thread useful

Thanks @rophilogene.

The thread is useful indeed. But it doesn’t answer this question:
“When I allocate some resources to an app, let’s say 500 mCPU, does that mean that the app will use at most 500 mCPU, or can it use more if there is some available CPU in the node?”

It will use at most 500 mCPU.

OK, thank you @rophilogene!

Hi @rophilogene, it’s me again!

On the same question of auto-scaling and usage: is it planned for Qovery to allow setting different request and limit values for resources (i.e., spec.containers[].resources.requests.cpu and spec.containers[].resources.limits.cpu, for example)?

That would allow us more flexibility and handle our load in a better way.
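For reference, this is what the distinction looks like in a raw pod spec: with settings like these, a container is guaranteed its 250m request, but may burst up to the 500m limit when the node has spare CPU (the name, image, and values are illustrative):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: bursty-app             # illustrative name
spec:
  containers:
    - name: app
      image: my-app:latest     # illustrative image
      resources:
        requests:
          cpu: 250m   # guaranteed share, used for scheduling and autoscaling
        limits:
          cpu: 500m   # hard cap; the container is throttled above this
```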

This is something we considered, but I don’t remember what the status was. Do you remember, @a_carrano or @Pierre_Mavro?


It’s not planned, since it’s not a good practice. Why? Because you can easily overcommit and impact other applications running on the same nodes as yours.

It can be too problematic (you can find several articles on the topic), so we don’t allow it for applications/containers deployed with Qovery. However, you can do what you want with Helm deployments, as we assume that if you can deploy with charts, you know what you’re doing.

We plan in the future to restrict Helm access to power users only, and let Qovery admins decide who can deploy with Helm.


This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.