Behind the Scenes Flashcards Preview

Question 1

Q

All data in a cluster is stored in:

A

shards

Question 2

Q

Every shard is replicated by

A

A replica shard: a copy of the primary shard that is stored on a different node, so each node holds its own primaries plus replicas of other nodes' primaries.

Question 3

Q

What happens when a document is added to a cluster?

A

The request can be sent to any node. That node hashes the document ID to determine which primary shard the document belongs to and forwards the request to the node holding that primary shard. The primary shard indexes the document and then forwards it to its replica shards on other nodes.
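
The routing step can be sketched in a few lines. This is a minimal illustration, not Elasticsearch's actual code: `zlib.crc32` stands in for the real hash (Elasticsearch uses a Murmur3 hash of the routing value, which is the document ID by default), and the node names and shard layout are hypothetical.

```python
import zlib


def pick_primary_shard(doc_id, number_of_primary_shards):
    """Map a document ID to a primary shard number.

    Assumption: zlib.crc32 is a stand-in hash; Elasticsearch itself
    uses Murmur3 on the routing value.
    """
    return zlib.crc32(doc_id.encode("utf-8")) % number_of_primary_shards


# Hypothetical cluster layout: which node holds each primary shard.
PRIMARY_LOCATION = {0: "node-a", 1: "node-b", 2: "node-c"}


def route_write(doc_id):
    """Return the node that the receiving node would forward the write to."""
    shard = pick_primary_shard(doc_id, len(PRIMARY_LOCATION))
    return PRIMARY_LOCATION[shard]
```

Because the shard number depends on the number of primary shards, the same ID always lands on the same shard only as long as that number stays fixed. A lookup by ID can reuse the same calculation to find the shard.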

Question 4

Q

What happens when querying for a document ID?

A

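The request can be sent to any node. That node hashes the ID to find the shard the document lives on, forwards the request to a node holding a copy of that shard (primary or replica), and returns the data to the client.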

Question 5

Q

What is round robin?

A

Spreading client requests across the nodes in turn instead of hammering a single node
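
A minimal sketch of round robin on the client side, assuming a hypothetical list of node names; each call hands back the next node in the rotation.

```python
from itertools import cycle

NODES = ["node-a", "node-b", "node-c"]  # hypothetical node names


def make_round_robin(nodes):
    """Return a function that yields the next node on every call."""
    rotation = cycle(nodes)
    return lambda: next(rotation)


next_node = make_round_robin(NODES)
targets = [next_node() for _ in range(5)]
# targets == ["node-a", "node-b", "node-c", "node-a", "node-b"]
```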

Question 6

Q

What is a shard known as?

A

It's basically a Lucene index

Question 7

Q

What is Elasticsearch in relation to Lucene?

A

Elasticsearch is a set of Lucene indices (shards) distributed across nodes.

Question 8

Q

Tell me about shards

A

Each shard is a container of segments, and each segment holds an inverted index
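
To make "inverted index" concrete, here is a toy version: a mapping from each term to the IDs of the documents that contain it. The sample documents are made up, and the whitespace split is a simplification of real analysis.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each lowercased term to the sorted list of document IDs containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return {term: sorted(ids) for term, ids in index.items()}


docs = {1: "the quick fox", 2: "the lazy dog", 3: "quick dog"}
index = build_inverted_index(docs)
# index["quick"] == [1, 3] and index["dog"] == [2, 3]
```

A search for a term then becomes a dictionary lookup instead of a scan over every document.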

Question 9

Q

Why does adding data take so long in Elasticsearch?

A

Creating the inverted index takes a long time

Question 10

Q

What is analysis?

A

The process of converting text into tokens that go into an inverted index. The result is then added to the buffer.

Question 11

Q

What is a buffer?

A

Temporary storage for newly indexed data; its contents are eventually written out as an immutable index segment.
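
A toy model of the buffer-to-segment step. The class and its methods are hypothetical; `refresh` is named after the Elasticsearch operation that turns buffered documents into a new segment, and tuples stand in for immutability.

```python
class ShardSketch:
    """Toy model: documents wait in a buffer, then become an immutable segment."""

    def __init__(self):
        self.buffer = []    # mutable, temporary
        self.segments = []  # each segment is a tuple and is never modified

    def index(self, doc):
        """Add a document to the temporary buffer."""
        self.buffer.append(doc)

    def refresh(self):
        """Write the buffered documents out as a new segment and clear the buffer."""
        if self.buffer:
            self.segments.append(tuple(self.buffer))
            self.buffer = []


shard = ShardSketch()
shard.index({"id": 1})
shard.index({"id": 2})
shard.refresh()
shard.index({"id": 3})
# One segment holds documents 1 and 2; document 3 is still in the buffer.
```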

Question 12

Q

What happens during the text analysis stage?

A

Tokenization (breaking the sentences into words while keeping track of each word's original location)
Filters
Then the resulting tokens get indexed.
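
A rough sketch of tokenization that keeps the original location of each word. The regular expression is a simplification of what a real tokenizer does; the field names mirror those in the `_analyze` API response.

```python
import re


def tokenize(text):
    """Split text into word tokens, keeping position and character offsets."""
    tokens = []
    for position, match in enumerate(re.finditer(r"\w+", text)):
        tokens.append({
            "token": match.group(),
            "position": position,
            "start_offset": match.start(),
            "end_offset": match.end(),
        })
    return tokens


tokens = tokenize("Quick brown fox")
# tokens[1] == {"token": "brown", "position": 1, "start_offset": 6, "end_offset": 11}
```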

Question 13

Q

Text analysis filters

A

Remove stop words
Lowercasing
Stemming
Synonyms (skinny vs thin)
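
The four filters above can be sketched as plain functions applied in sequence. The stop-word list, the synonym map, and the stemming rule are illustrative assumptions, far simpler than the filters Elasticsearch ships with.

```python
STOP_WORDS = {"the", "a", "is"}  # illustrative, not Elasticsearch's list
SYNONYMS = {"skinny": "thin"}    # map each synonym to one canonical term


def lowercase(tokens):
    return [t.lower() for t in tokens]


def remove_stop_words(tokens):
    return [t for t in tokens if t not in STOP_WORDS]


def stem(tokens):
    # Naive suffix stripping; real stemmers (e.g. Porter) are far more careful.
    return [t[:-1] if t.endswith("s") and len(t) > 3 else t for t in tokens]


def apply_synonyms(tokens):
    return [SYNONYMS.get(t, t) for t in tokens]


def run_filters(tokens):
    """Apply each filter in order, feeding the output of one into the next."""
    for token_filter in (lowercase, remove_stop_words, stem, apply_synonyms):
        tokens = token_filter(tokens)
    return tokens


result = run_filters(["The", "Skinny", "Cats"])
# result == ["thin", "cat"]
```

Order matters: lowercasing runs first so that "The" matches the lowercase stop-word list.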

Question 14

Q

Settings for an index you can change

A

number_of_shards (chosen when the index is created; fixed afterwards)

number_of_replicas (can be changed at any time)
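
As a sketch, these are the JSON bodies such requests would carry, built here as Python dictionaries. The index name `my-index` and the numbers are hypothetical. Note that `number_of_replicas` can be updated on a live index, while `number_of_shards` is set when the index is created.

```python
import json

# Body for creating an index, e.g. PUT /my-index (index name is hypothetical).
create_index_body = {
    "settings": {
        "number_of_shards": 3,
        "number_of_replicas": 1,
    }
}

# Body for a later change, e.g. PUT /my-index/_settings.
update_replicas_body = {"index": {"number_of_replicas": 2}}

print(json.dumps(create_index_body, indent=2))
print(json.dumps(update_replicas_body, indent=2))
```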

Question 15

Q

How can you get an idea of how your text will be tokenized?

A

POST _analyze
{
    "analyzer": "standard",
    "text": "Your text here"
}
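
The same request can be built from a script. This sketch uses only the Python standard library and assumes a cluster at `http://localhost:9200`, which is a guess about your setup; the helper name `build_analyze_request` is made up for illustration. It builds the request without sending it.

```python
import json
import urllib.request


def build_analyze_request(text, analyzer="standard", base_url="http://localhost:9200"):
    """Build (but do not send) a POST _analyze request."""
    body = json.dumps({"analyzer": analyzer, "text": text}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/_analyze",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_analyze_request("Your text here")

# To send it against a running cluster:
# with urllib.request.urlopen(request) as response:
#     tokens = json.load(response)["tokens"]
```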

Question 16

Q

The 3 building blocks of an analyzer

A

character filters
tokenizers
token filters
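
A sketch of how the three building blocks appear together in index settings for a custom analyzer, written as a Python dictionary that serializes to the request body. The analyzer name `my_analyzer` is hypothetical; `html_strip`, `standard`, `lowercase`, and `stop` are built-in components.

```python
import json

custom_analyzer_settings = {
    "settings": {
        "analysis": {
            "analyzer": {
                "my_analyzer": {                     # hypothetical analyzer name
                    "type": "custom",
                    "char_filter": ["html_strip"],   # character filters run first
                    "tokenizer": "standard",         # exactly one tokenizer
                    "filter": ["lowercase", "stop"], # token filters run last, in order
                }
            }
        }
    }
}

print(json.dumps(custom_analyzer_settings, indent=2))
```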

Question 17

Q

Character filters

A

Receive the original text and add, remove, or convert characters before it reaches the tokenizer (e.g. stripping HTML tags)

Question 18

Q

Tokenizer

A

Breaks the string up into individual tokens

Question 19

Q

Token filters

A

Filters applied to the tokens, e.g. lowercase, stop, synonym

Udemy: Elastisearch Masterclass > Behind the Scenes > Flashcards