11:58 AM
zwi joined the channel
11:59 AM
untergeek has quit
12:00 PM
DeepyBee
does anyone here have experience of deduplication with the document_id attribute of the elasticsearch output? I'm using lumberjack to transport my logs and have document_id => '%{offset}' in my ES output, but I'm still seeing dupes :(
12:00 PM
Anyone have any ideas?
12:01 PM
kangguru is now known as kangguru_away
12:05 PM
mdelnegro joined the channel
12:05 PM
PeterWelter has quit
12:05 PM
untergeek joined the channel
12:06 PM
mdelnegro has quit
12:06 PM
mdelnegro joined the channel
12:07 PM
gurra has quit
12:11 PM
sysmonk
offset doesn't sound like the best way to dedup
12:11 PM
i tried deduping before, but hit too many bugs, and stopped :)
12:11 PM
DeepyBee
hmm, bugger
12:12 PM
sysmonk
maybe stuff improved over time
12:12 PM
but one of the main things i remember about deduping was performance impact
12:12 PM
DeepyBee
there is very little documnetation on the document_id attribute
12:12 PM
"The document ID for the index. Useful for overwriting existing entries in Elasticsearch with the same ID."
12:12 PM
is basically it :/
12:12 PM
sysmonk
the suggested way to dedup is making a checksum of the data, and that costs lots of cpu
12:14 PM
DeepyBee
do you have a link to docs for checksumming events? I'd like to at least give it a go to assess the perf hit
12:15 PM
gurra joined the channel
12:15 PM
I'm happy having more tin to handle the checksumming and getting cleaner data in ES tbh
12:15 PM
sysmonk
lookg at fingerprint checksum/fingerprint filters
12:16 PM
DeepyBee
great thanks :)
12:17 PM
jopz
so what to do... i'm not using ipv6
12:18 PM
mskaesz joined the channel
12:19 PM
MartinCleaver has quit
12:20 PM
cakirke has quit
12:21 PM
zwi has quit
12:22 PM
mkaesz has quit
12:23 PM
is-mw joined the channel
12:26 PM
rarruda has quit
12:27 PM
DeepyBee has quit
12:28 PM
losh
jopz: are you using elasticsearch embedded or a seperate JVM instance for Elasticsearch? Your configuration suggests it's the latter.
12:32 PM
chthon joined the channel
12:34 PM
fullerja joined the channel
12:34 PM
fullerja has quit
12:34 PM
12:34 PM
logstashbot
12:34 PM
professoruss has quit
12:34 PM
ollybee joined the channel
12:35 PM
professoruss joined the channel
12:35 PM
Fuwan joined the channel
12:36 PM
Fuwan
Hellow, is there any way to see which logs are being sent by logstash-forwarders?
12:37 PM
jopz
let me check this
12:40 PM
fullerja joined the channel
12:42 PM
hulta has quit
12:42 PM
losh: not worked
12:42 PM
again getting same error
12:42 PM
losh
jopz: change 127.0.0.1 to ::1
12:42 PM
jopz
k
12:42 PM
aruntomar has quit
12:43 PM
Fuwan
oh, thought you were checking for me, jopz
12:43 PM
:p
12:43 PM
habanero joined the channel
12:43 PM
jopz
fuwan: no
12:44 PM
losh
Fuwan: I'd use tcpdump or tcpflow to see the logs that are being sent to the forwarder (assuming they're are being sent over a inet socket)
12:44 PM
Fuwan
hmm
12:45 PM
brahama has quit
12:45 PM
jopz
losh: no
12:45 PM
same error
12:45 PM
:)
12:45 PM
brahama joined the channel
12:46 PM
losh
jopz: Are you running Elasticsearch as a separate JVM instance?
12:46 PM
jopz: Or was it running in embedded?
12:46 PM
pkdubey4u has quit
12:46 PM
jopz
only one jvm instance
12:46 PM
gurra has quit
12:46 PM
for all
12:47 PM
losh
jopz: Ahh, OK. In that case, the configuration I supplied disabled to ES. Set the option embedded => true in the configuration
12:48 PM
goschtl1 has quit
12:48 PM
alcy has quit
12:49 PM
jopz: In fact, the configuration is wrong for using the embedded version, I'll re-write it and send you the link
12:49 PM
habanero has quit
12:49 PM
jopz
k
12:50 PM
brahama has quit
12:51 PM
SkyRocknRoll has quit
12:51 PM
samdoran joined the channel
12:52 PM
gentunian joined the channel
12:55 PM
jack_ruby joined the channel
12:55 PM
12:55 PM
stasher joined the channel
12:56 PM
losh
12:56 PM
jopz
i'm leaving the forum no
12:56 PM
logstashbot
12:58 PM
jopz
same error
12:58 PM
bad luck losh
12:58 PM
muranese
is it possible with logstash to convert unix time to readable format?
12:58 PM
losh
jopz: So it would seem. I'm out of ideas, sorry.
12:59 PM
jopz
k
12:59 PM
thanks
12:59 PM
i will find some others
12:59 PM
jopz has quit
12:59 PM
losh
muranese: yes, use the date plugin.
13:00 PM
zwi joined the channel
13:01 PM
mskaesz has quit
13:02 PM
aendrew has quit
13:03 PM
muranese
losh: you mean filter? don't get how it works, in grok i add 'match => { "message" => "%{NUMBER:epoch}" }' can't find option for date filter
13:05 PM
and to date fillter add this "match => [ "epoch", "UNIX" ]" right?
13:05 PM
losh
muranese: Yes, the date filter is a logstash plugin. grok is a separate plugin to date.
13:05 PM
13:05 PM
logstashbot
Title: logstash - open source log management (at
logstash.net )
13:05 PM
|splat| joined the channel
13:06 PM
losh
muranese: That looks correct to me, assuming your field name which contains the unix timestamp is called epoch.
13:08 PM
ph joined the channel
13:08 PM
wrath0r joined the channel
13:09 PM
LiamM has quit
13:09 PM
toordog has quit
13:09 PM
TomasNunez has quit
13:11 PM
jerius joined the channel
13:11 PM
jujugrrr has quit
13:11 PM
eelseth-stdout has quit
13:12 PM
gauravarora has quit
13:12 PM
LiamM joined the channel
13:13 PM
intransit joined the channel
13:14 PM
dendazen joined the channel
13:14 PM
ph has quit
13:17 PM
eper has quit
13:21 PM
TomasNunez joined the channel
13:22 PM
stasher
hey, does logstash use the same version of ruby as my system configuration or is it it's own ruby version?
13:25 PM
bfraser joined the channel
13:25 PM
mkaesz joined the channel
13:25 PM
jujugrrr joined the channel
13:25 PM
habanero joined the channel
13:29 PM
phtwo joined the channel
13:29 PM
phtwo has quit
13:29 PM
socket-
13:29 PM
qybl
stasher: it's jruby, so it uses whatever version they bundled when they packaged the jar-file I'd think
13:29 PM
phtwo joined the channel
13:30 PM
dyer is now known as dyer|away
13:30 PM
dyer|away is now known as dyer
13:30 PM
dyer is now known as dyer|away
13:30 PM
dyer|away is now known as dyer
13:30 PM
dyer is now known as dyer|away
13:30 PM
dyer|away is now known as dyer
13:31 PM
zathras joined the channel
13:32 PM
nick_schuch joined the channel
13:32 PM
aendrew joined the channel
13:33 PM
mkaesz has quit
13:34 PM
shubhang has quit
13:36 PM
yfried joined the channel
13:36 PM
mkaesz joined the channel
13:38 PM
mskaesz joined the channel
13:38 PM
samdoran1 joined the channel
13:39 PM
samdoran has quit
13:41 PM
mkaesz has quit
13:42 PM
MartinCleaver joined the channel
13:47 PM
habanero has quit
13:48 PM
habanero joined the channel