Domain 1 Flashcards
(98 cards)
Explain the AI relationship ven diagram
Artificial Intelligence, Machine Learning, Deep Learning
Predictions that AI makes based on historical data
Inference
When AI recognizes a change in what has happened in the past
Anomaly detection
What are some AWS services that could provide structured input data for training ML models?
RDS, Redshift
What are some AWS services that could provide semi-structured input data for training ML models?
DynamoDB, MongoDB
For semi-structured, structured data, unstructured data, and time-series, where should you export data for training models?
S3
In machine learning, what describes the relationship between inputs and outputs?
An algorithm
Describe the machine learning training process
Known data -> features -> algorithm -> output
Describe the machine learning inference process, which comes after training
new data -> features -> model -> output
What are the two artifacts produced that create a model?
Inference code + model artifacts
What type of inferencing provides low-latency, high throughput, and a persistent endpoint (also usually more expensive)?
Real-time
What type of inferencing is performed offline, uses large datasets, and either happens on an infrequent schedule?
Batch transform
Training your model with data that is pre-labeled (pictures with fish/not fish)
Supervised Learning
What is the challenge with supervised learning?
You need a lot of data, people to label…takes time and money
What is Amazon Ground Truth?
A service that helps you provided labeling
What process uses data that has features but is not labeled and is good for pattern recognition, anomaly detection, and grouping data into categories?
Unsupervised learning
What process uses both supervised and unsupervised learning, provides rewards to an agent when criteria are ment, uses trial and error, and allows the agent to make mistakes to learn, and has and end goal?
Reinforcement learning
What sub service of Ground Truth uses crowdsourcing to label
via affordable labor
AWS Mechanical Turk
A model telling you a fish is not a fish because it is out of water, a result of training being to specific and not having enough varied examples, is called what?
Overfitting
What is called when a model cannot determine a meaningful relationship between the input and output data, happens when you haven’t trained the model long enough or with a large enough set?
Underfitting
What is bias?
When a model discriminates against a specific group because of a lack of fair representation in the data used to train the model
Also, if a model is showing bias, what can be done with features?
the weight of features that are introducing noise can be directly adjusted by the data scientists. For example, it could completely remove gender consideration
Items such as age and sex discrimination, should be identified at the beginning before creating a model.
Fairness constraints
A type of machine learning that uses algorithmic structures called neural networks.
Deep learning