Section 5.1 Flashcards

Question 1

Q

What does the Indexing Layer do?

Answer

A

Allows you to clean up data.

Allows you to refine data.

Allows you to store data.

Question 2

Q

What is Index clustering?

Answer

A

When multiple indexers are connected in order to replicate copies of the indexers buckets (data).

Question 3

Q

Where is data stored?

Answer

A

In indexes on the indexer that have buckets.

Question 4

Q

What is automatic failover?

Answer

A

Basically backing up data. If one indexer fails, the others will pickup the slack and maintain continuity.

Question 5

Q

High availability means…

Answer

A

Data is highly available for searching.

Question 6

Q

Index Clustering in summary means

Answer

A

Data is protected from sudden loss

More copies are available for users who are actively searching

Indexer activities will continue in the event an indexer goes down

Question 7

Q

Replication Factor determines

Answer

A

How many copies are maintained within an indexer cluster.

Deafult RF is 3

Maximum RF is determined by the number of indexers you have or nodes.

Question 8

Q

Search Factor determines

Answer

A

How many of these copies are immediately searchable.

Default SF is 2

Question 9

Q

In a clustering environment you need a minimum of ____ Indexers

Question 10

Q

Most important fact about a Search Factor (SF)

Answer

A

The Search Factor can never be more than the Replication Factor.

Question 11

Q

Explain RF & SF

Answer

A

RF factor tells us how many times we want the data to be copied over. Two of those copies are highly available and just incase something happens to the first copy. If both copies go down, the third copy is usually stored at an offsite location.

Question 12

Q

When does the Cluster Master come in?

Answer

A

The Cluster Master comes into play when we start copying our data (when the environment becomes clustered).

Question 13

Q

Cluster Master Manages what layer?

Answer

A

It manages the indexing layer.

Question 14

Q

What is the Cluster Master?

Answer

A

A centralized configuration Manager who’s job is to manage the indexer cluster.

Question 15

Q

Once the environment becomes clustered, the Deployment Server….

Answer

A

Only manages the forwarders.

Question 16

Q

What does a Cluster Master do?

Answer

A

Manages cluster activities (adding peers, distributing configurations, determines the number of copies to maintain).

Maintains memory of peers, their buckets, and configs

Tells search head where to request data.

Question 17

Q

What are Peers (Cluster Peer)?

Answer

A

Peers are Indexers

Question 18

Q

What do Peer Nodes do?

Answer

A

Peers receive and index incoming data typically from forwarders)

Replicate data to other peers

Respond to incoming searches by supplying search results

Question 19

Q

A clustered architecture is called ..

Answer

A

A distributed search

Question 20

Q

Clustering is Smart because it provides….

Answer

A

Data Availability
Data Fidelity
Data Resiliency
Disaster Recovery
Search Affinity

Question 21

Q

Multi-site clustering =

Answer

A

Storing copies of your data at a different site

Question 22

Q

Data fidelity =

Answer

A

The act of not losing data; reliability

Question 23

Q

Benefits of Clustering =

Answer

A

1.Data Availability & fast recovery
2.Easier overall administration
3.Scalability of indexing
4.No additional cost for data replication

Question 24

Q

Cons of clustering =

Answer

A

1.Increased storage requirements
2.Increased processing load
3.Requires additional Splunk instances
4.Indexers require the same OS and versions

Question 25

Q

When you enable a search head in cluster environment you must specify what?

Answer

A

Cluster settings (i.e. Master Node) and the port on which it receives data.

Question 26

Q

Transforms.conf=

Answer

A

specify transformations and lookups that can then be applied to any event

Question 27

Q

What is the filepath of the CM that sends apps to its peers ?

Answer

A

splunkhome/etc/master-apps

Question 28

Q

Where do bundles reside for cluster peer?

Answer

A

splunkhome/etc/slave-apps

Question 29

Q

Splunkhome etc slave apps =

Answer

A

where you will always find pushed configuration files (sent from CM to indexer)

Question 30

Q

Config changes that require restart?

Answer

A

A.Changes to indexes.conf,inputs.conf
B.Home path changes to Indexes.conf
C.Deleting an existing app

Question 31

Q

Configuration changes that do not need a restart ?

Answer

A

Adding a new index or new app with reloadable configs

Changes or additions to transforms.conf or props.conf

Question 32

Q

Tell me about your environment

Answer

A

In my environment we have a current quota of about 50TB, and we are currently ingesting about 49TB per day with 600 users. We have about 290 indexers, with close to 32,000 forwarders and about 12 search heads.

Question 33

Q

Environment with too many forwarders for you to manage one at a time-what Splunk instance would you install and how would you configure it to manage all the forwarders?

Answer

A

Use Deployment Server and put the forwarder in serverclass and create deployment apps to configure all of them.

Question 34

Q

In your deployment app you are Configuring inputs.conf to bring in new data-you then search with search head and cannot find the data. What happened?

Answer

A

-didn’t send deployment apps to correct serverclass
-mistake in monitoring stanza
-did not put right index
-severclass has not phoned home
-turn monitoring on(BEST ANSWER)
-Splunk does not have permissions to read source file

Question 35

Q

what directory must you place your inputs.conf file in the deployment app

Answer

A

local directory

Question 36

Q

indexer uses what port

Question 37

Q

fishbucket index importance

Answer

A

allows you to see how far into a file indexing has occurred-helps to avoid duplicates and comes in handy after server shutdown or connection errors.

Question 38

Q

advantages of indexer clustering

Answer

A

1.Data Availability & fast recovery
2.Easier overall administration
3.Scalability of indexing
4.No additional cost for data replication

A. Data Availability = how often your data is available to be utilized.

B. Data Fidelity = the act of not losing data.

C. Data Reliability = refers to the accuracy, consistency, and dependability of the data being ingested, indexed, and queried within the platform.

D. Data Resiliency = platform’s ability to maintain data availability, integrity, and accessibility even in the face of unexpected failures.

E. Disaster Recovery = set of processes and strategies put in place to ensure availability and continuity of Splunk services and data.

F. Search Affinity = search local sites; mechanism for intelligently routing and distributing search jobs across a distributed Splunk environment.

Question 39

Q

explain data availability

Answer

A

how often your data is available to be utilized

Question 40

Q

who manages all indexes in cluster environment? Explain

Answer

A

Cluster Master/Master Node

Question 41

Q

how would you configure hot bucket to roll over by time

Answer

A

Maxhotspansecs

Question 42

Q

default port used for replication

Answer

A

8080 is replication, 8089 is the management port(goes between config manager and clients-ds vs clients and then CM vs indexers-to ANY client it is managing), and 9997 is the data (receiving port)

Question 43

Q

what is metadata and what does it contain?

Answer

A

Meta data=bar code=tells you where a product is coming from (ip address, log path, and format of data)

Question 44

Q

What is source

Answer

A

name of the event or other input from which the event originates

Question 45

Q

give examples of sourcetypes you worked with

Answer

A

json and syslog or CSV

Question 46

Q

what is the largest sourcetype you have worked with?

Answer

A

syslog is network data and large

Question 47

Q

high availability

Answer

A

-High availability=when we are replicating data within our indexers
-Multiple copies available for searching
-Data gets into our indexers in round robin fashion

Question 48

Q

distributed search?

Answer

A

key feature that allows you to search and analyze data across multiple Splunk instances or indexers in a distributed Splunk deployment. This is especially useful in large-scale environments where the volume of data to be searched and analyzed exceeds the capacity of a single Splunk instance.

Question 49

Q

how replicated buckets are stored in indexers

Answer

A

1.once the data comes to the indexers the method of distributing data will be round-robin 2.once the data is written on the indexers 3.then the process of replicating data will move from indexer to indexer trying to find a healthy one to store that specific data.

Question 50

Q

how does forwarder distribute data among indexers without replication (regular data)

Answer

A

round robin fashion

Question 51

Q

reloading vs restarting DS

Answer

A

When updating clients of the DS-reload deployment server

when you make updates for DS itself you restart DS.

Question 52

Q

when increasing ingestion in cluster environment

Answer

A

add more indexers to the cluster

Question 53

Q

some considerations to consider when going into clustered environment

Answer

A

cost of more splunk instances

ingestion of data

storage requirements

processing requirements

Question 54

Q

You notice that your newly monitored data is not in the index that you have configured it to be in. Where is data possibly being stored and how would you troubleshoot it?

Answer

A

Go to the inputs.conf and validate that the ‘index’ is correct.

If index is wrong it will be in the main index

Question 55

Q

Recently got fresh new data in the splunk

Answer

A

Hot bucket

Question 56

Q

Under what circumstances would the data in the hotbucket stop writing?

Answer

A

If the hot bucket is too full or if their is restart.

Question 57

Q

In order to have have splunk search head what would you need to download?

Answer

A

Splunk Enterprise

Question 58

Q

Maximum number of concurrent users per search head

Question 59

Q

What is Maxhotbucket?

Answer

A

Maximum hot bucket that can be in an index

Question 60

Q

Which default port is for replication?

Answer

A

8080 port

Question 61

Q

What is the thawing process

Answer

A

Frozen data has to be unthawed and sent back to cold

Move that file into thaw directory and rename it to a name that splunk recognizes

Question 62

Q

What must happen before indexer can be part of a cluster?

Answer

A

Indexer must become cluster member

Question 63

Q

Cluster Master/Master Node

Answer

A

You only need ONE

Question 64

Q

Internal Index?

Answer

A

Used for troubleshooting; stores all Splunk components’ internal logs and processing metrics.

Searches for logs that say ERROR or WARN

Answer 62

A

Windows = [monitor://C:\app\log\data\catalina.out]

Linux = [monitor:///another/random/path]

Answer 63

A

raw data (full log files) and indexed files (tsidx)

Answer 64

A

Go to monitoring and change disable to true or 1

Answer 65

A

Summary indexing allows you to run fast searches over a large data set by scheduling Splunk to summarize data then import data into the summary index over time

Answer 66

A

Adding indexers to the cluster to accommodate growth.

Answer 67

A

slave-apps filepath

Answer 68

A

when CM is communicating with with its clients or slaves

Brainscape's Knowledge GenomeTM

Section 5.1 Flashcards

Brainscape's Knowledge Genome^TM