Einstein Discovery Terms Flashcards

Question

Number of records that can be scored per org per day

Answer 1

CRM Analytics Plus License: Unlimited | Einstein Prediction License: 10

Answer 2

Depends on how the model associated with the request was built: Einstein Discovery-built models: 25 Externally-built models uploaded to Salesforce: 1

Answer 3

Not supported in ED

Answer 4

Number of predictions run today Number of story versions created today Number of concurrent stories that can be analyzed Number of prediction API calls run today Number of story versions created this month

Answer 5

Explanatory variable people can control. If variable is designated as actionable, model uses prescriptive analytics to suggest actions user can take to improve predicted outcome

Answer 6

An actual outcome is the real-world value of an observation's outcome variable after the outcome has occurred. Einstein Discovery calculates model performance by comparing how closely predicted outcomes come to actual outcomes. An actual outcome is sometimes called an observed outcome.

Answer 7

Variables are being treated unequally in your model

Answer 8

Number of distinct values in a category. ED supports 100 categories per variable. Null values are put into category called unspecified. Can consolidate remaining categories (categories with < 25 obs) into 'other' category

Answer 9

Represents Qualitative values. Story with binary or multi-class is categorical.

Answer 10

Statistical associated between variables.

Answer 11

Insights derived from a model. Show 'why' it happened. Drill into correlated variables.

Answer 12

Data reflects discriminatory practices towards a patricular demographic

Answer 13

Data is unbalanced. Most values are in same category.

Answer 14

Drift can occur due to changing factors in the data or in your business environment. Drift also results from now-obsolete assumptions built into the story on which the model is based. To remedy a model that has drifted, you can refresh it by adjusting story settings, retraining it on newer data, and redeploying it.

Answer 15

Two or more explanetory variables are highly correlated (ex: Zipcode and city) ED recommends choosing just one variable to improve results.

Answer 16

Variable you explore to determine whether and to what degree if can influence the outcome variable. Also called input variable, feature, predictor, or independent variable

Answer 17

Picking the best explanatory variables in a story.Too few features could result in underfitting, too many could result in overfitting. Select the most influential explanatory variables with no significant llurking variables

Answer 18

how one explanetory variable explains variation in the outcome variable. Also called bivariate analysis

Answer 19

Regression absed model

Answer 20

Specifies the desired outcome for the story. Includes the story's outcome variable plus your preferred direction for the outcome.

Answer 21

Decision-Tree based algorithm.

Answer 22

All values for a variable belong to the same category

Answer 23

Suggested action based on prescriptive analytics that use can take to improve likelihood of desired outcome. Associated with actionable variables.

Answer 24

stat technique for replacing numeric values with valies derived from subset of data.

Answer 25

Starting point for you to investiate the relationsihps among story's explanatory variables and its goal.

Answer 26

Model validation process in which Einstein Discovery randomly divides all the observations in the Analytics dataset into four separate partitions of equal size. Next, it completes four test passes (folds) in which three of the partitions serve as the training set and one partition serves as the test set. For each fold, Einstein Discovery compiles model metrics, then averages the metrics for all four folds.

Answer 27

Leakage occurs when the data used to train your model includes one or more variables that contain the information that you are trying to predict. This can result in models that are extremely accurate when, in actuality, they are problematic. To remedy data leakage, remove any variables from your model that are causing the leakage.

Answer 28

A lurking variable is an explanatory variable that is missing from your story but which significantly explains variations in the outcome variable.

Answer 29

A modeling algorithm is what Einstein Discovery uses to create a model for a story. Einstein Discovery uses one of several algorithms: generalized linear model (GLM) is a regression-based algorithm, while gradient boosting machine (GBM) and XGBoost are decision tree-based machine learning algorithms.

Answer 30

The Model Manager is the Einstein Discovery tool used to manage predictions and models you have deployed.

Answer 31

Model metrics describe the performance of the predictive model associated with your story. It provides metrics (quality indicators, which are sometimes called fit statistics) to show how well the model's predictions fit the training data in the dataset. For definitions of quality indicators shown in the Model Metrics tabs, see Evaluate Model Quality.

Answer 32

The multiclass classification use case addresses business outcome that have between 3 and 10 outcome values, such as five possible service plans or eight possible insurance policies. Multiclass classification is one of the main use cases that Einstein Discovery supports. Compare with Binary Classification.

Answer 33

any data that does not meaningfully explain variations in your outcome variable

Answer 34

In predictive analytics, overfitting occurs when a model performs well in predicting outcomes on the training data in the dataset, but less well when predicting outcomes for other data, such as production data. Using too many explanatory variables can result in an overly complex predictive model that captures the noise in your data. To mitigate overfitting, Einstein Discovery uses ridge regression and regularization

Answer 35

In an insight, a second-order analysis examines how the combination of two explanatory variables explains variation in the outcome variable. In second-order analysis, the combined impact of both variables together on the outcome is sometimes called the interaction effect. Second-order analysis is sometimes called multivariate analysis.

Einstein Discovery Terms Flashcards

(63 cards)