[Question] Self hosted setup for monitoring Self-hosted services?

dipak@lemmy.ml · 7 months ago

[Question] Self hosted setup for monitoring Self-hosted services?

hperrin@lemmy.world · 7 months ago

https://www.netdata.cloud/

7fb2adfb45bafcc01c80@lemmy.world · 7 months ago

I’m a big fan of netdata; it’s part of my standard deployment. I put in some custom configs depending on what services are running on what servers. If there’s an issue it sends me an email and posts into a slack channel.

Next step is an influxdb backend to keep more history.

I also use monit to restart certain services in certain situations.

chirospasm@lemmy.ml · 7 months ago

+1 for Netdata

VelociCatTurd@lemmy.world · 7 months ago

-1 for Netdata. I used it for a bit, but the configuration is not very intuitive and the docs for alerts were basically “rest of the fucking owl”, at least for the non-cloud version. I ended up just switching to Glances which is pretty boneless but it’s easy.

Though for OP I’d probably recommend Prometheus.

Maximilious@kbin.social · 7 months ago

Also -1 for netdata. I loved the analytics but it brought all of my VMs to a screeching halt. It did not seem very will optimized for the amount of data it was polling.

rentar42@kbin.social · edit-2 7 months ago

This isn’t specific to just netdata, but I frequently find projects that have some feature provided via their cloud offering and then say “but you can also do it locally” and gesture vaguely at some half-written docs that don’t really help.

It makes sense for them, since one of those is how they make money and the other is how they loose cloud customers, but it’s still annoying.

Shoutout to healthcheck.io who seem to provide both nice cloud offerings and a fully-fledged server with good documentation.

Jeena@jemmy.jeena.net · 7 months ago

I like Uptime Kuma, but it only monitors if a service is online or not. I’m up to 21 services now so I’m not interested in all their details, just if I need to fix something urgently.

farcaller@fstab.sh · 7 months ago

I’d swap Prometheus for VoctoriaMetrics. It’s a drop-in replacement with a much better resource consumption story and a few extra goodies.

JASN_DE@lemmy.world · 7 months ago

Grafana + Prometheus + data gathering will at least give you the resource and usage stats.

rentar42@kbin.social · edit-2 7 months ago

I’ve not found a good solution for actual constant monitoring and I’ll be following this thread, but I have a similar/related item: I use healthcheck.io (specifically a self-hosted instance) to verify all my cron jobs (backups, syncs, …) are working correctly. Often even more involved monitoring solutions do not cover that area (and it can be quite terrible if it goes wrong), so I think it’ll be a good addition to most of these.

andy47@lemmy.world · 7 months ago

A similar solution but I use https://ntfy.sh/ I have the app on my phone and have it set to alert when jobs ping the service. Mine ping on success but it is possible to ping when the job fails as well.