whack: do you have some examples are large scale deployments?
of
I recall Mozilla had a pretty big install, pretty significant events/sec
whack
once I finish deploying here I'll be doing >100000 events/sec
for mozilla I'm not sure what their event rates were estimated at
but I'm pretty sure their cluster could do at least 500k events/sec
estimated with math, that is, not actually observed
phrawzty is heading that up
thansen joined the channel
plinde_
hi all, does anyone know the state of logstash+rabbitmq/es/river? Namely, is it considered production-ready and the preferred method for doing logstash <> ES flows?
stonith
thanks, I just found your scalex deck
I'll quote that
whack
plinde_: it should work, but I don't know who uses it.
it's been around for years
jfluhmann joined the channel
plinde_
whack, running into few issues using it where upwards of 75% of our logs are not making it from RabbitMQ into ES
just wanted to see if its advised to go that route or if elasticsearch output is 'advised'
i realize there are a lot of variables in the equation (ES version, RMQ version, ES River plugin version, etc).
whack
plinde_: I try to know as little as possible about rabbitmq, so I may not be able to help
are you certain it's makign it into rabbitmq?
OG_ joined the channel
and can you pastebin your config?
plinde_
yes, we tried using amqp/rabbitmq output and all the logs end up in the queue (counts match). starting a river from ES to that queue ends up with all the messages not being ack'd by rabbitmq.
whack
no idea :\
plinde_
no worries, we are fairly pleased with performance of using elasticsearch output
we've been pondering if river might be a preferred alternative
given the issues, we'll probably stick with what we've got
whack
preference doesn't matter.
does it work for you, does it support your requirements, etc.
plinde_
agreed there, maybe preference is the wrong word. recommended
whack
do what works, if you find performance problems or want to experiment, then try something else :)
but if it ain't broken, don't fix it ;)
plinde_
whack, no argument there, appreciate the input
whack
np :)
in general, elasticsearch and elasticsearch_http have the best performance
rabbitmq performs probably the worst of all the transports logstash supports due to the library we use
so I'd expect the river output to perform worse than the two alternatives (es or es_http) right now
plinde_ is now known as HPL
ohlol has quit
woo 30% perf improvement in date filter
Stoosh
cha ching
io_syl has quit
whack: was going to ask you yesterday, what are your thoughts using nodes for logstash configs, instead of defining types, similar to puppet
whack
not sure what you mean?
you mean using specific servers to process certain kinds of logs?
Stoosh
i mean instead of defining type for every input, filter, output (when necessary)
encapsualting them in a node (which would be equivalent to a type)
whack
can you explain further? not grasping it yet ;)
new event v1 perf improvements are looking pretty good
2x faster throughput doing generator->null output
Stoosh
so you have node 'logstash:type' { inputs {} filters {} outputs {}}