Helm chart stuck in starting state

Orkin · November 27, 2024, 10:47am

Hello,

I have a helm chart that is stuck in starting state that I can’t stop . Can you have a look please ?

Cluster ID: a5181456-e3ff-46e0-b01d-088c51523656
Organization ID: a2ed680d-ffdf-44a2-905f-24fc2a58a97d
Project ID: 2257e891-1c08-4440-bd76-807e5d1b2d90
Environment ID: 6138a1e6-8090-4c93-9205-004e22473779
Service ID: ee805995-7cb7-4f50-a5fd-069b8ace9220

Thanks a lot

ce_gagnaire · November 27, 2024, 2:41pm

Hello @Orkin ,

I just checked and everything is back to normal.

When a service is in a waiting state, we must wait for a timeout before doing more action.

We can’t cancel it either.

Regards,
Charles-Edouard

Orkin · November 27, 2024, 4:05pm

It was not a timeout problem. The application was deployed properly but the “job” loop failed. I had to stop the cluster for some other reason and it solved the problem .

ce_gagnaire · November 28, 2024, 10:00am

Ok,

I just checked but I don’t have any logs on this.

Please let me know if this happens ago and we’ll try to debug it live.

Regards,
Charles-Edouard

a_carrano · November 29, 2024, 4:09pm

Hi @Orkin,

actually, we might have found the issue.

Your dev cluster is running via Karpenter and there is a known issue with the way it manages Daemonsets (Fluentbit is deployed as a daemonset).

To ensure it is deployed properly on every node of your cluster, you should:

deploy a custom priority class (see this guide)
define the fluentbit chart should use

Orkin · December 2, 2024, 1:31pm

Thanks @a_carrano for your response. Do I need to allow cluster wide ressources for fluent-bit daemonset too ?

a_carrano · December 2, 2024, 1:35pm

Hi @Orkin,

the “Allow wider resource” check is run before deploying the resources to ensure the user has the right to deploy resources outside his own namespace (the namespace of the environment where it is deployed).

Fluentbit probably deploys CRDs and pods outside the “environment namespace” so it is necessary to enable the option.

system · December 9, 2024, 1:35pm

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Can't stop helm charts Deployment	3	205	April 2, 2024
Datadog helm service not running (follow up on prev topic) Questions and Answers	4	33	November 20, 2024
Unable to stop a helm deployed service Questions and Answers	4	72	June 13, 2024
Helm upgrade failed : "Helm timed out" on cluster Questions and Answers	4	608	April 5, 2024
Deploy Helm Chart and documentation clarity Deployment	9	531	March 25, 2024

Helm chart stuck in starting state

Related topics