We recently installed Grafana on our Qovery cluster so we can access our logs from Loki. However, when we query larger amounts of logs we occasionally get “too many outstanding requests” errors.
A quick Google search suggested this GitHub issue, which points to a solution involving custom configuration. The only configuration value we found in Qovery was “loki.log_retention_in_week”. Is it possible to customize other Loki settings?
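For context, the workaround discussed in that issue boils down to splitting large range queries and raising the per-tenant queue limits. A rough sketch of the kind of overrides involved (the exact key names depend on the Loki version, so this is an assumption on our side, not what Qovery actually ships):

```yaml
# Sketch of Loki overrides of the kind the linked issue suggests.
# Key names vary between Loki versions; verify against the deployed version.
limits_config:
  split_queries_by_interval: 24h   # break large range queries into smaller sub-queries
  max_query_parallelism: 32        # let more sub-queries run concurrently
query_scheduler:
  max_outstanding_requests_per_tenant: 4096  # raise the per-tenant queue that causes
                                             # "too many outstanding requests"
```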
Also, most of our applications log in JSON format, so it would be great to configure JSON parsing and label extraction for each application. It would also be nice to specify static labels per application to improve on the Qovery-generated labels, which contain unique IDs and therefore don’t let us aggregate logs or reuse dashboards for the same application across environments.
We would also like to customize the Loki pipeline to enable JSON parsing and extract more labels for faster queries. Ideally this could be configured per Qovery application, but more raw access to the Loki configuration would be enough for us to find less elegant solutions.
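To make it concrete, if the log agent is Promtail, what we would like to configure per application is roughly a pipeline like the following (the field names and label values are just examples from our side, not something Qovery exposes today):

```yaml
# Illustrative Promtail pipeline_stages for one application (example values only).
pipeline_stages:
  - json:
      expressions:
        level: level            # pull "level" out of the JSON log line
        request_id: request_id  # keep request_id available for later stages
  - labels:
      level:                    # promote "level" to an indexed label for faster queries
  - static_labels:
      app: backend              # stable label instead of a Qovery unique ID
      env: staging              # lets us reuse dashboards across environments
```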
Unfortunately, for the time being, this Loki instance is for our internal usage (application logs, etc.) and is not really meant to be used otherwise. We have to keep this service up and running.
Also, even if we add more and more parameters, it will eventually not be enough for everyone’s use case. So the best option for you would be to deploy your own Loki instance using a Helm deployment. This way you will be able to pimp your Loki instance (and a dedicated S3 bucket) as you please, and even if that instance eventually dies, it won’t impact your Qovery integration.
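To give an idea of what that looks like, here is a minimal values.yaml sketch for the grafana/loki Helm chart with a dedicated S3 bucket (the keys depend on the chart version and the bucket/region names are placeholders, so check the chart documentation for your setup):

```yaml
# Sketch of a values.yaml for the grafana/loki chart (chart-version dependent).
# helm repo add grafana https://grafana.github.io/helm-charts
# helm install loki grafana/loki -f values.yaml
loki:
  auth_enabled: false
  storage:
    type: s3
    bucketNames:
      chunks: my-company-loki-chunks   # dedicated S3 bucket for log chunks (placeholder)
    s3:
      region: eu-west-3                # placeholder region
```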
I’d rather avoid deploying another Loki instance so we don’t waste additional resources (we already have one running and are paying for it). I also don’t want us to start maintaining a bunch of DevOps tools; we chose Qovery for exactly this reason. Ideally, Qovery would provide a good out-of-the-box configuration, make it accessible to us, and still keep it running for us.
Loki is used for a single reason at Qovery: providing log history, since Kubernetes does not provide it.
By default we do not allow using Loki for search, because it can easily become unresponsive with the setup we’ve made. It has been tuned for this single purpose to limit resource consumption and reduce the cloud provider bill.
I get your point about wasting resources, but the current Loki setup is not sized to handle all of those workloads at the same time. And if Loki becomes unavailable, logs will go missing as well, which is not acceptable.
If you want to run log history searches, you can set up your own Loki, or use Elasticsearch, Datadog, or AWS CloudWatch.
Thanks for your response. I get your point, but I think you are missing out on providing a lot of value to your customers.
Logging and monitoring are crucial pieces that every Kubernetes cluster needs, just like the ingress and cert-manager utilities you already manage for us. Technically you already provide access to logs in Qovery because it’s such an important part of application deployment, but your logging implementation is very basic, and rather than building on top of it, I think you should strengthen support for people using the underlying Loki instance directly.
This topic has been discussed internally for a long time now. We totally agree with you regarding the value it would bring to our customers.
And this is why we do not restrict anything. We give our customers the option to install other solutions (like the ones described above) and encourage them to do so with documentation.
Qovery is not a monitoring platform and will never try to compete with the companies doing it. Instead, we build partnerships with them, like Datadog, because we know they will answer 100% of customers’ needs.
It looks like we will want to move toward the BYOK solution in the near future. However, at that point, the value Qovery brings goes down significantly.