Conversation

I've been playing around with setting up OpenSearch because I'm not really happy with how annoying it is to actually *search* my logs with Loki (fundamentally, I just want a Datadog- or OS-style dashboard), and… it's rough.

Setting up OpenSearch is difficult as the docs are definitely aimed at large clusters with minimal support for standing up a simple instance. But then we get to Data Prepper and… yeesh. It works, but it feels very, very rough.

1
0
0

Docs *look* comprehensive but aren't, like the date module - some docs on the docs site, but much better docs can only be found deep in GitHub; Config is very repetitive; Pipelines are *pull*-based and only poll every 3s by default, so a complex pipeline setup will *massively* delay your logs.

I'm not sure whether I want to persevere with OS, see if ElasticSearch (/LogStash/etc) goes better, or abandon the whole thing and go back to Loki, which is also painful, but maybe less so.

2
0
0

@ipsi I’m pretty close to throwing in the towel on Loki in Grafana - I used ELK a decade ago and it was significantly nicer than this experience.

1
0
1

@alpha The whole “logging” arena feels incredibly over-complicated, with the choices basically being “tail a log file” or “enterprise multi-node cluster designed for *search* (and metrics and and and…) that just happens to be good at log dashboards”. I feel like there's something missing in the middle for small-medium sized deployments.

Part of the problem is self-hosting a bunch of services, none of which have the same log-file format, none of which can be easily changed.

3
0
0

@ipsi I looked briefly at Seq and Parseable as potential candidates, but I’m more likely going to set up an ELK stack instead since I’ve used it in the past and liked that UI more than what Seq and Parseable seem to offer.

0
0
1

@ipsi @alpha We keep complaining about DataDog at work. The CTO loves it (he makes dashboards!) but it’s way to complicated to answer the question “why isn’t this fucking thing working” which is the question us devs mostly have. And somehow AWS CloudWatch is worse?

2
0
0

@ratkins @ipsi Somehow, our setup at $JOB-2 was surprisingly usable, despite it being a decade ago and being comprised mostly of statsd+graphite+elk.

Maybe something about it being data that we mostly pushed, rather than automatically pulled, so we had much better context and awareness about what was in there.

0
0
2

@ratkins @alpha I've quite liked Datadog, at least so far. It does require putting in a fair bit of effort to learn, though, plus configuring the agent everywhere to get full value out of it.

1
0
0

@ipsi @alpha Right. I’m not saying it’s not a good tool, but it’s too complex for my needs and I’m busy enough hating Python with every fibre of my being that I don’t have time to learn it.

0
0
0

@ipsi we (elastic) should have some things there to make your life easier. from agent / beats modules to UI things :)

0
0
0

@ipsi A follow-up: I’ve now spun up Vector + Parseable to see how it feels. Haven’t started any ingestion of real logs into it yet, though.

0
0
1