berglh: oh. so it was fixed or should I put the fix? What version was the fix? how do I check my version?
gwLock has left the channel
skynat2 has quit
I am on 2.2.2
pawnbox joined the channel
berglh
it was fixed a while ago as far as i can see
what's the warning you're getting in your debug log?
pawnbox has quit
fatdragon
debug log is going, I don't see any warnning right now. but after few days.. filebeat can't send anything to LS and LS just starts to stall and refuses any new connections..and I have to restart LS routinely every few days.. I have tried..increasing memory, Xmx, heapsize, entire system memory, turned off tcp6 calls to ES.. now waiting for another crash tomorrow or next day to get a java dump..
some sort of memory leak or bottleneck..somewhere.. I do see lots of TIME_WAITS from LS to ES, not sure if that's any problem..I have 150 timewait connections then it goes down to 120 and back to 140 so it is flushing I suppose
berglh
it soudns like mayeb your logstash can't write out to elasticsearch as fast as you're receiving messages
i that case i'd expect it to use a lot more head
heap
are you using a messaging queue?
like filebeat > logstash > redis/rabbitmq/kafak < logstash > elasticseach?
in this case if the logstash can't write to elasticsearch fast enough, the message queue will take the load
fatdragon
I am new to ELK , just running FB > LS > ES > Kibanan, I suppose
berglh
yep, that sounds like your problem then
maybe
what filters are you using in your logstash config?
is your ES isntance running out of heap?
maybe that would explain slowness in ES
skynat2 joined the channel
fatdragon
all three LS, ES, Kibana is on one server with 8GB memory
I have 3 GB allocated for ES and 3GB for LS, not sure if they are being set correctly ..using LS_HEAP stuff in init scripts
berglh
its just a standalone ES config
ES continues to run when LS bums out?
fatdragon
I have beat input, syslog filter, elastic search output config for LS
berglh
ok, doesnt' sound too complex
fatdragon
yea.. ES is fine
very simple setup, 7 nodes..low traffic with auth and syslog and stuff
just trying to get off the ground and stablize it so I can add more nodes
do I need to install redis?
or tune ES or LS?
what I can do is get a java dump when it crashes, hope I am awake and am able to catch it, before LS java process dies..