REGEX Flashcards

Question

What control characters do you know? What do they do?

Answer 1

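^ - start of a line
$ - end of a line
\b - word boundary (wrap a word in \b...\b to match whole words only)
\B - not a word boundary (a position inside a word, or between two non-word characters)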
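
A minimal Python sketch of these anchors, using the standard re module. The sample text is made up for illustration; note that in Python ^ and $ only match at line boundaries when re.MULTILINE is set.

```python
import re

text = "cat concat\ncat"

# With re.MULTILINE, ^ matches at the start of every line,
# not only at the start of the whole string.
line_starts = re.findall(r"^cat", text, flags=re.MULTILINE)

# \b on both sides requires a word boundary before and after,
# so the "cat" inside "concat" is skipped.
whole_words = re.findall(r"\bcat\b", text)

# \B requires that there is NO boundary before "cat",
# so only the "cat" inside "concat" matches.
inside_word = [m.start() for m in re.finditer(r"\Bcat", text)]

print(line_starts)   # ['cat', 'cat']
print(whole_words)   # ['cat', 'cat']
print(inside_word)   # [7]
```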

Answer 2

Character classes match kinds of characters rather than specific characters, for example letters as opposed to digits.

Answer 3

\s - whitespace
\S - non-whitespace
\d - digit
\D - non-digit
\w - word character (letters, digits and underscore)
\W - non-word character
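
The shorthand classes can be tried out with a short Python sketch. The sample string is an assumption chosen to contain one of each kind of character.

```python
import re

sample = "id_7 = 42\t!"

digits = re.findall(r"\d", sample)      # every single digit
words = re.findall(r"\w+", sample)      # runs of letters, digits, underscore
spaces = re.findall(r"\s", sample)      # spaces and the tab
non_words = re.findall(r"\W", sample)   # everything that is not a word character

print(digits)     # ['7', '4', '2']
print(words)      # ['id_7', '42']
print(spaces)     # [' ', ' ', '\t']
print(non_words)  # [' ', '=', ' ', '\t', '!']
```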

Answer 4

The backslash character (\). It escapes special characters in a regex so that they are treated literally.
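
A small Python sketch of the difference escaping makes. The sample text is invented; re.escape is the standard-library helper that adds the backslashes for you.

```python
import re

text = "Total: 3.50 (approx) and 3x50"

# Unescaped, "." matches any character, so "3x50" matches too.
loose = re.findall(r"3.50", text)

# Escaped, "\." matches only a literal dot.
strict = re.findall(r"3\.50", text)

# re.escape() backslash-escapes the special characters in a literal string.
literal = re.escape("(approx)")
found = re.search(literal, text) is not None

print(loose)    # ['3.50', '3x50']
print(strict)   # ['3.50']
print(literal)  # \(approx\)
print(found)    # True
```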

Answer 5

* - zero or more
+ - one or more
? - zero or one
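
To see the three quantifiers side by side, here is a short Python sketch. The candidate strings and the matching helper are hypothetical names used only for this illustration; note that ? makes the preceding item optional (zero or one occurrence).

```python
import re

candidates = ["ct", "cat", "caat"]

def matching(pattern):
    """Return the candidates that the pattern matches in full."""
    return [s for s in candidates if re.fullmatch(pattern, s)]

star = matching(r"ca*t")      # zero or more "a"
plus = matching(r"ca+t")      # one or more "a"
optional = matching(r"ca?t")  # zero or one "a"

print(star)      # ['ct', 'cat', 'caat']
print(plus)      # ['cat', 'caat']
print(optional)  # ['ct', 'cat']
```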

Answer 6

For example: * + ? $ / [ ] { } ( )

Answer 7

A special character that can be used to prevent Splunk from parsing certain characters in a field. Protection characters can be used in a variety of ways in Splunk, including: to protect sensitive data in indexes and search results; to prevent Splunk from parsing certain delimiters in input data; to prevent Splunk from parsing certain characters in field names. Examples: \ , @ , " , [ ]

Answer 8

Enclose it in the @ character: @password@. Splunk will then ignore all characters inside the protection character.

Answer 9

Use protection characters to prevent Splunk from parsing certain delimiters. For example, to prevent Splunk from parsing the comma delimiter in a CSV file, enclose each field in the " character: "value1","value2","value3". Splunk will then treat the comma as a literal character and will not parse it as a delimiter.

Answer 10

Protect the password field in an index:
[sourcetype=linux_secure]
INDEX = my_index
TRANSFORMS-protect_password = protect_password
[protect_password]
REGEX = @password@
DEST_KEY = password
FORMAT = $1
Prevent Splunk from parsing the comma delimiter in a CSV file:
[sourcetype=csv]
DELIMS = ,
TRANSFORMS-protect_comma = protect_comma
[protect_comma]
REGEX = "
DEST_KEY = quote
FORMAT = $1
Prevent Splunk from parsing the colon character in a field name:
[sourcetype=linux_secure]
FIELDS = host:ip source:message
Search for all events where the password field contains the word "password":
index=my_index sourcetype=linux_secure @password@="password"

Answer 11

Used to specify which fields should be included in a search result. Examples: ( ) parentheses, { } curly braces, [ ] square brackets. For example, the following search query groups two conditions with parentheses and returns events whose host and message fields both match: (host="example.com" AND message="error")

Answer 12

Used to specify what should be excluded from a search result. Examples: the != operator and the NOT keyword. For example, the following search query excludes all events where the host field is equal to example.com: NOT host="example.com"

Answer 13

A way to match multiple occurrences of a character or pattern. There are two main types of repetition in regex: greedy and non-greedy (lazy).

Answer 14

Greedy repetition matches as many occurrences of the character or pattern as possible. For example, the regular expression a* matches zero or more occurrences of the letter a and takes the longest run it can. Non-greedy (lazy) repetition matches as few occurrences as possible. For example, the regular expression a+? requires at least one a but stops after the first one, unless the rest of the pattern forces it to take more.
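
The difference is easiest to see on input where both variants can match. This is a minimal Python sketch; the HTML-like sample string is made up for illustration.

```python
import re

html = "<b>one</b><b>two</b>"

# Greedy: .* runs to the LAST "</b>", giving one long match.
greedy = re.findall(r"<b>.*</b>", html)

# Lazy: .*? stops at the FIRST "</b>", giving two short matches.
lazy = re.findall(r"<b>.*?</b>", html)

# a* may match zero characters; a+ takes the whole run; a+? takes only one.
zero_ok = re.match(r"a*", "bbb").group()
longest = re.match(r"a+", "aaab").group()
shortest = re.match(r"a+?", "aaab").group()

print(greedy)    # ['<b>one</b><b>two</b>']
print(lazy)      # ['<b>one</b>', '<b>two</b>']
print(repr(zero_ok), longest, shortest)  # '' aaa a
```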

Answer 15

a{3}: exactly three consecutive occurrences of the letter a
[0-9]+: one or more consecutive digits
(abc){2,3}: two or three consecutive repetitions of the pattern abc
[a-z]{3,}: three or more consecutive lowercase letters
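
These counted quantifiers can be checked with a short Python sketch. The helper name full is hypothetical; it uses re.fullmatch so that the count applies to the whole string rather than to a substring.

```python
import re

def full(pattern, s):
    """True if the pattern matches the entire string."""
    return re.fullmatch(pattern, s) is not None

exactly_three = [full(r"a{3}", s) for s in ["aa", "aaa", "aaaa"]]
two_or_three = [full(r"(abc){2,3}", s) for s in ["abc", "abcabc", "abcabcabcabc"]]
three_or_more = [full(r"[a-z]{3,}", s) for s in ["ab", "abcdef"]]

# With search/findall the pattern may match anywhere inside a longer string.
digit_runs = re.findall(r"[0-9]+", "x12y345")

print(exactly_three)  # [False, True, False]
print(two_or_three)   # [False, True, False]
print(three_or_more)  # [False, True]
print(digit_runs)     # ['12', '345']
```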

Answer 16

Matching a specific number of occurrences of a character or pattern. For example, the regular expression [a-z]{3} matches three consecutive lowercase letters. Matching a minimum or maximum number of occurrences of a character or pattern. For example, the regular expression [0-9]{3,5} matches a run of three to five digits. Matching a pattern that occurs one or more times. For example, the regular expression (abc)+ matches one or more consecutive occurrences of the pattern abc.

Answer 17

A way to group patterns together so that they can be treated as a single unit. There are two main types of logical grouping in regular expressions. Capturing groups, written (...), group parts of a regular expression together and capture the matched text. Non-capturing groups, written (?:...), group parts of a regular expression together but do not capture the matched text.
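
A minimal Python sketch contrasting the two kinds of group. Capturing groups are written (...) and non-capturing groups (?:...); the sample sentence is invented.

```python
import re

text = "Dear Ms. Lovelace,"

# Both groups capture: the title and the name are returned.
capturing = re.search(r"(Mr|Ms)\. (\w+)", text).groups()

# (?:...) still groups the alternation but captures nothing,
# so only the name is returned.
non_capturing = re.search(r"(?:Mr|Ms)\. (\w+)", text).groups()

print(capturing)      # ('Ms', 'Lovelace')
print(non_capturing)  # ('Lovelace',)
```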

Answer 18

Parentheses and alternation. Parentheses can be used to group patterns together so that they can be treated as a single unit. For example, the regular expression (abc) matches the pattern abc treated as a single unit. Alternation, written with the | character, matches one of several alternatives. For example, the regular expression (abc|def) matches either the pattern abc or the pattern def.
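
The two patterns from this card can be exercised in Python as follows. This is only a sketch; the test strings are assumptions.

```python
import re

either = re.compile(r"(abc|def)")

# Alternation: "abc" or "def" match in full, "abd" does not.
results = {s: bool(either.fullmatch(s)) for s in ["abc", "def", "abd"]}

# Grouping: the + applies to the whole unit "abc", not just to "c".
repeated = [m.group(0) for m in re.finditer(r"(abc)+", "abcabc x abc")]

print(results)   # {'abc': True, 'def': True, 'abd': False}
print(repeated)  # ['abcabc', 'abc']
```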

Answer 19

Combining character classes and quantifiers makes it possible to build complex regular expressions that match a wide variety of patterns. For example, the following regular expression matches most common email addresses: [a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,} The regular expression breaks down into the following parts: [a-zA-Z0-9._%+-]+ matches the local part of the email address. [a-zA-Z0-9.-]+\.[a-zA-Z]{2,} matches the domain part of the email address. The literal @ character separates the two parts.
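
A quick Python check of the email pattern above. The addresses are made-up examples, and the pattern is a pragmatic approximation rather than a full validator.

```python
import re

EMAIL = re.compile(r"[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,}")

text = "Contact alice@example.com or bob.smith+news@mail.example.org; not user@localhost."

# "user@localhost." is skipped: the domain has no dot followed by
# at least two letters.
found = EMAIL.findall(text)

print(found)  # ['alice@example.com', 'bob.smith+news@mail.example.org']
```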

Answer 20

Logical groupings can also be used to exclude certain patterns from a match. For example, the following regular expression matches a 10-digit number that is not immediately followed by the digit "1": \d{10}(?!1) The regular expression breaks down as follows: \d{10} matches any 10 digits. (?!1) is a negative lookahead assertion that fails if the next character is the digit "1". Because the lookahead is placed at the end, it only checks the character right after the ten digits; it does not exclude numbers that contain "1" elsewhere. To reject any 10-digit number that contains "1" anywhere, put the lookahead at the start: ^(?!\d*1)\d{10}$
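
The position of the lookahead matters, as this Python sketch shows. The phone numbers are invented, and the second pattern, ^(?!\d*1)\d{10}$, is one possible way to reject any 10-digit number containing a 1.

```python
import re

# Lookahead at the end: only the character AFTER the ten digits is checked.
not_followed_by_1 = re.compile(r"\d{10}(?!1)")

# Lookahead at the start: rejects the number if a 1 appears anywhere in it.
contains_no_1 = re.compile(r"^(?!\d*1)\d{10}$")

# Ten digits followed by "1": rejected when matching from the start.
a = not_followed_by_1.match("55512345671") is None
# Contains a 1 but is not followed by one: still accepted.
b = not_followed_by_1.match("5551234567") is not None
# The stricter pattern rejects it because of the embedded 1.
c = contains_no_1.match("5551234567") is None
# No 1 anywhere: accepted.
d = contains_no_1.match("5552345678") is not None

print(a, b, c, d)  # True True True True
```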

Answer 21

A way to extract specific parts of a string using regular expressions. Named capture groups give each group a name. For example, the following regular expression creates a named capture group called username: (?<username>\w+)
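
A short sketch of a named capture group in Python. Python's re module spells the syntax (?P<name>...), whereas PCRE-style engines such as the one Splunk uses also accept (?<name>...). The log line is a made-up example.

```python
import re

line = "login ok user=jdoe src=10.0.0.5"

# (?P<username>...) is Python's spelling of a named capture group.
m = re.search(r"user=(?P<username>\w+)", line)

username = m.group("username")
as_dict = m.groupdict()

print(username)  # jdoe
print(as_dict)   # {'username': 'jdoe'}
```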

Answer 22

A way to match a pattern only when it is not followed by another pattern (negative lookahead). It is denoted by the syntax (?!pattern). For example, the following regular expression matches the word "error" only where it is not immediately followed by the word "warning": error(?!warning)
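
A minimal Python sketch of the negative lookahead above, run on an invented log string.

```python
import re

log = "error: disk full; errorwarning; error"

# Only occurrences of "error" NOT directly followed by "warning" match.
starts = [m.start() for m in re.finditer(r"error(?!warning)", log)]

print(starts)  # [0, 32]
```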

Answer 23

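A way to match a pattern only when it is followed by another pattern (positive lookahead). It is denoted by the syntax (?=pattern). The lookahead checks the following text without making it part of the match.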

Answer 24

For example, the following regular expression matches the word "error" only where it is immediately followed by the word "warning"; "warning" itself is not part of the match: error(?=warning)
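
The same idea in Python: a positive lookahead checks what follows without including it in the match. The sample string is an assumption.

```python
import re

log = "error: disk full; errorwarning; error"

# Only the "error" directly followed by "warning" matches.
m = re.search(r"error(?=warning)", log)

matched_text = m.group()
span = m.span()

print(matched_text)  # error
print(span)          # (18, 23): "warning" is not part of the match
```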

Answer 25

REX (the rex command) is a Splunk Search Processing Language (SPL) command that extracts fields from event data based on a regular expression containing named capture groups. It can be used for tasks such as extracting fields from log files, parsing data from structured files, and transforming data into a different format. Example: extract the IP address from a web server log line into a field named ip: sourcetype=web_server | rex field=_raw "(?<ip>\d+\.\d+\.\d+\.\d+)"
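
Outside Splunk, the same named-group extraction can be sketched in Python. This only illustrates the regular expression, not the rex command itself; the log line is invented and the field name ip is an assumption.

```python
import re

line = '192.168.1.10 - - [10/Oct/2023:13:55:36] "GET / HTTP/1.1" 200'

# Same pattern, written with Python's (?P<name>...) group syntax.
m = re.search(r"(?P<ip>\d+\.\d+\.\d+\.\d+)", line)

ip = m.group("ip") if m else None

print(ip)  # 192.168.1.10
```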

(51 cards)