I was installing Datadog following the Qovery guide here. Helm install was successful but it seems Datadog agent pods are failing to start. I see READY state 2/3 and if I look at one of the pods I see readiness probe is failing:
Readiness probe failed: HTTP probe failed with statuscode: 500
Are the guides up to date? Is the latest Datadog agent and Qovery still compatible?
Nevermind. I found the issue: I used config with âdatadoghq.euâ domain from Qovery example but our account was supposed to use âdatadoghq.comâ.
We tried re-installing with no luck. I contacted DD support, and they tracked down that Qovery installs its own cluster-agent in âqoveryâ namespace, and our Datadog agents from âdatadogâ namespace are trying to connect to the wrong cluster-agent with an incompatible configuration. Iâm waiting for their response for suggestions on how to force our Datadog agents to use the correct cluster-agent.
We got to the bottom of this with Datadog support: our Datadog installation discovers Datadog cluster-agent in qovery namespace and tries to connect to it. However, Qovery installed Datadog has a different configuration and doesnât work here. The solution provided by Datadog support was to disable automatic discovery and add annotations to set the correct configuration explicitly. It feels like this should be supported by Qovery, and helm install should be enough for the correct Datadog installation.
@bchastanier Did you try to reproduce this? It seems this should affect all Qovery clusters.