4 Replies Latest reply on Jul 10, 2015 2:52 AM by alexander_h

    ESM Parsing Regular expression




      Can anybody help me understand parsing regular expressions? I'm working on creating a custom parser for a device, and I got stuck on the regular expressions and want to understand them.



      Thank you


      Soji Thomas

        • 1. Re: ESM Parsing Regular expression

          This page might help you understand regex --> RegExr: Learn, Build, & Test RegEx

          • 2. Re: ESM Parsing Regular expression

            The goal of the regular expression is to match the fields in the log that need to be captured and put into the relevant database fields so they are available for search, correlation and reporting on the ESM/ACE. Parentheses within the regex are used to create each match group. For instance, if I have the following log line:


            <22>Jul  9 12:55:49 tecate dovecot: imap(bob): Disconnected: Logged out bytes=671/3325


            I can parse it with the following regex:


            (\w+)\s(dovecot)\x3a\simap\x28(\w+)\x29\x3a\sDisconnected\x3a\sLogged\sout\sbytes\x3d(\d+)\x2f(\d+)


            Every time you see a pair of parentheses I'm capturing a field. So the fields here would be:




            source user=bob
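
As a rough way to check the capture groups outside the ESM rule editor, the sketch below runs the same regex against the sample log line with Python's `re` module. Python is only a stand-in here (an assumption; the ESM editor has its own nuances), and the group numbers printed are positional, not ESM field names.

```python
import re

# Sample log line from the post above.
LOG_LINE = (
    "<22>Jul  9 12:55:49 tecate dovecot: imap(bob): "
    "Disconnected: Logged out bytes=671/3325"
)

# Same regex as in the post. The hex escapes stand for literal characters:
# \x3a is ':', \x28 and \x29 are '(' and ')', \x3d is '=', \x2f is '/'.
PATTERN = re.compile(
    r"(\w+)\s(dovecot)\x3a\simap\x28(\w+)\x29\x3a\sDisconnected\x3a"
    r"\sLogged\sout\sbytes\x3d(\d+)\x2f(\d+)"
)

match = PATTERN.search(LOG_LINE)
groups = match.groups() if match else ()

# Each pair of parentheses produces one numbered group.
for number, value in enumerate(groups, start=1):
    print(f"group {number}: {value}")
```

The third pair of parentheses is the one that captures bob, which is the source user shown above.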




            This information alone won't directly help you write the parsing rules, since you do need to learn a bit of regex and there are some nuances with the editor, but it should give you an idea of the process.


            That being said, it looks like your log sample in your screenshot is of McAfee NSP logs. It's better to grab those via a SQL pull of the NSM. If you need to use syslog, there are existing rules for the data source.

            • 3. Re: ESM Parsing Regular expression

              This is the cheat sheet from RegExr, which XDED mentioned.

              They now also have a desktop app.










              Character classes

              .             any character except newline
              \w \d \s      word, digit, whitespace
              \W \D \S      not word, digit, whitespace
              [abc]         any of a, b, or c
              [^abc]        not a, b, or c
              [a-g]         character between a & g
              ^abc$         start / end of the string
              \b            word boundary

              Escaped characters

              \. \* \\      escaped special characters
              \t \n \r      tab, linefeed, carriage return
              \u00A9        unicode escaped ©

              Groups & Lookaround

              (abc)         capture group
              \1            backreference to group #1
              (?:abc)       non-capturing group
              (?=abc)       positive lookahead
              (?!abc)       negative lookahead

              Quantifiers & Alternation

              a* a+ a?      0 or more, 1 or more, 0 or 1
              a{5} a{2,}    exactly five, two or more
              a{1,3}        between one & three
              a+? a{2,}?    match as few as possible
              ab|cd         match ab or cd
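
A few of the cheat sheet entries are easier to see in action. The snippet below is a small sketch using Python's `re` module (an assumption made for illustration; regex flavours differ slightly, and the ESM editor may not behave identically). The input strings are made up for the example.

```python
import re

# Greedy vs. lazy quantifier: a+ takes as many as it can, a+? as few as it can.
greedy = re.findall(r"a+", "aaa")
lazy = re.findall(r"a+?", "aaa")

# Backreference: \1 must repeat whatever group #1 captured.
repeated = re.search(r"(\w+)\s\1", "hello hello world").group(0)

# Non-capturing group: (?:...) groups the alternation without creating a field.
user = re.search(r"(?:imap|pop3)\((\w+)\)", "imap(bob)").groups()

# Positive lookahead: match digits only when a '/' follows, without consuming it.
before_slash = re.findall(r"\d+(?=/)", "bytes=671/3325")

print(greedy, lazy, repeated, user, before_slash)
```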
              • 4. Re: ESM Parsing Regular expression

                You might also find it really useful to create multiple small parsers, so that each one matches a fragment of the message instead of a single rule parsing the entire message.
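
The idea of several small parsers can be sketched as follows, again with Python's `re` module as a stand-in for the ESM rule editor. The rule names (`user`, `bytes`) and the `parse_fragments` helper are hypothetical and exist only for this illustration; the log line is the dovecot sample from reply 2.

```python
import re

LOG_LINE = (
    "<22>Jul  9 12:55:49 tecate dovecot: imap(bob): "
    "Disconnected: Logged out bytes=671/3325"
)

# Hypothetical fragment rules: each one matches a small piece of the message
# rather than the whole line, so a change elsewhere in the log does not break it.
FRAGMENT_RULES = {
    "user": re.compile(r"imap\((\w+)\)"),
    "bytes": re.compile(r"bytes=(\d+)/(\d+)"),
}


def parse_fragments(line):
    """Apply every small rule and collect the groups of those that match."""
    fields = {}
    for name, rule in FRAGMENT_RULES.items():
        found = rule.search(line)
        if found:
            fields[name] = found.groups()
    return fields


fields = parse_fragments(LOG_LINE)
print(fields)
```

A rule that does not match simply contributes nothing, so each fragment can be written and tested on its own.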


                Give it a try and let us know how it goes, so we can give you some hints.