Final Prep COPY Flashcards

2nd Half Semester Information

1
Q

Unsupervised Learning

A

Make sense of unlabeled data.
Data description: find a more compact way to describe the data.

Versus the function approximation of supervised learning, which uses labeled data.

2
Q

Basic Clustering Problem

A

Given:
a set of objects X
inter-object distances D(x, y)
with D(x, y) = D(y, x)

Output:
a partition function P_D, with P_D(x) = P_D(y) when x and y belong to the same cluster
the partition function outputs a compact descriptor of all equivalent data

3
Q

Single Linkage Clustering (SLC)

A

Start with a space of n objects and all pairwise distances.

Connect the two closest points.
Define the distance between two clusters as the distance between their closest points (this "closest pair" rule is what makes it single linkage).
Repeatedly merge the closest clusters.

Stop when only k clusters remain.

Only distances between un-linked items matter. (See the sketch below.)
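
A minimal Python sketch of this merge loop, assuming 2-D points and Euclidean distance via math.dist; the function name and toy data are illustrative:

```python
# A naive O(n^3) single-linkage clustering sketch.
from itertools import combinations
import math

def slc(points, k):
    """Merge the two closest clusters until only k clusters remain."""
    clusters = [[p] for p in points]          # start: every point is its own cluster
    while len(clusters) > k:
        best = None
        for (i, a), (j, b) in combinations(enumerate(clusters), 2):
            # single linkage: cluster distance = closest pair across the clusters
            d = min(math.dist(p, q) for p in a for q in b)
            if best is None or d < best[0]:
                best = (d, i, j)
        _, i, j = best
        clusters[i] += clusters.pop(j)        # merge the closest pair of clusters
    return clusters

print(slc([(0, 0), (0, 1), (5, 5), (5, 6), (9, 9)], k=2))
```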

4
Q

Hierarchical Agglomerative Cluster Structure

A

The tree structure used to represent the linkage clustering derived from SLC.

5
Q

Running Time of SLC

A

For n points and k clusters: O(n^3).
Finding the closest pair to link costs O(n^2) in the distances, and the search repeats on the order of n/2 times (once per merge), giving O(n^3).
Similar to the minimum spanning tree problem.

6
Q

k-means clustering algorithm

A

- Pick k "centers" (at random)
- Each center claims its closest points
- Recompute the centers by averaging the clustered points
- Repeat until converged
- A center can be a point in the collection or any point in the space. Claiming the closest points is almost like kNN. (See the sketch below.)
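
A minimal NumPy sketch of the loop above; the initialization, the convergence test, and the lack of empty-cluster handling are simplifications:

```python
# A minimal k-means sketch; assumes no cluster ever goes empty.
import numpy as np

def kmeans(X, k, rng=np.random.default_rng(0)):
    centers = X[rng.choice(len(X), k, replace=False)]  # pick k centers at random
    while True:
        # each center claims its closest points
        labels = np.argmin(((X[:, None] - centers) ** 2).sum(-1), axis=1)
        # recompute the centers by averaging each cluster
        new = np.array([X[labels == i].mean(axis=0) for i in range(k)])
        if np.allclose(new, centers):                  # repeat until converged
            return labels, centers
        centers = new

X = np.array([[0., 0.], [0., 1.], [5., 5.], [5., 6.]])
print(kmeans(X, k=2))
```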

7
Q

Euclidean space

A

In geometry, a two- or three-dimensional space in which the axioms and postulates of Euclidean geometry apply; also, a space in any finite number of dimensions, in which points are designated by coordinates (one for each dimension) and the distance between two points is given by a distance formula.

8
Q

k-means Proof

A

P(x): partition/cluster of object x
C_i: the set of all points in cluster i
center_i = sum of y over all y in C_i, divided by |C_i|

P(x) -> centers -> P(x): the partition determines the centers, and the centers determine the partition.

9
Q

K-Means as Optimization

A

Configurations (inputs): the centers and the partition P

Scores: the further an object is from its center, the more error it contributes: E(P, center) = sum over x of ||center_P(x) - x||^2

Neighborhood: configurations where P is fixed and the centers change, or where the centers are fixed and P changes
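
As a sketch in common notation (writing mu_i for the center of cluster i, an assumption beyond the card's wording), the score and the two neighborhood moves are:

```latex
E(P, \mu) = \sum_{x} \lVert \mu_{P(x)} - x \rVert^{2},
\qquad
P(x) \leftarrow \arg\min_{i} \lVert \mu_{i} - x \rVert^{2},
\qquad
\mu_{i} \leftarrow \frac{1}{|C_{i}|} \sum_{x \in C_{i}} x
```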

10
Q

Randomized Optimization most like K-Means

A

Hill Climbing.
You take steps toward a configuration better than the one before: each move finds a neighbor slightly better than the current configuration.

11
Q

Properties of K-means clustering

A

- Each iteration is polynomial: O(kn)
- Finite (but exponential) number of iterations: there are O(k^n) different ways of assigning partitions; in practice convergence is quick because only a limited number of assignments are visited
- Error decreases (if ties are broken consistently)
- Can get stuck

12
Q

Mitigations for K-means getting stuck

A

- Random restarts
- An initial review of the data to pick starting centers that are far apart

13
Q

Equal Distance Points between Clusters

A

A point that is equally distant from two clusters may be assigned to either cluster, depending on how ties are broken.

14
Q

Soft Clustering

A

Assume the data were generated by:
1) Selecting one of k Gaussians (with known variance)
2) Sampling x_i from that Gaussian
3) Repeating n times

Task: find a hypothesis h = <mu_1, ..., mu_k> that maximizes the probability of the data (maximum likelihood)

15
Q

Maximum Likelihood Gaussian

A

- The ML mean of the Gaussian is the mean of the data
- This does not apply directly for k different means, just for one
- With k different means, each point is assigned hidden variables that determine which Gaussian it came from
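
For a single Gaussian, the maximum-likelihood mean is just the sample mean:

```latex
\hat{\mu}_{ML} = \frac{1}{n} \sum_{i=1}^{n} x_{i}
```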

16
Q

Expectation Maximization

A

Tick-tock between expectation (define z from mu: compute how likely each point is to belong to each Gaussian) and maximization (define mu from z: recompute the means from those soft assignments).
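
A minimal 1-D sketch, assuming two Gaussians with known equal variance so that only the means are learned; the data and variable names are illustrative:

```python
# Minimal EM for a 1-D mixture of two equal-variance Gaussians.
import numpy as np

def em(x, iters=50, var=1.0, rng=np.random.default_rng(0)):
    mu = rng.choice(x, 2, replace=False)            # initial guesses for the means
    for _ in range(iters):
        # E-step: define z from mu (soft assignment of each point to each Gaussian)
        lik = np.exp(-(x[:, None] - mu) ** 2 / (2 * var))
        z = lik / lik.sum(axis=1, keepdims=True)
        # M-step: define mu from z (weighted average of the points)
        mu = (z * x[:, None]).sum(axis=0) / z.sum(axis=0)
    return mu

x = np.concatenate([np.random.default_rng(1).normal(0, 1, 200),
                    np.random.default_rng(2).normal(5, 1, 200)])
print(em(x))   # should land near the true means 0 and 5
```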

17
Q

Properties of EM

A

- Monotonically non-decreasing likelihood
- It always moves to something at least as good
- Has a chance of not converging (in practice it does)
- Will not diverge; it may only approach the optimum asymptotically, whereas k-means is guaranteed to converge in finitely many steps
- Can get stuck in a local optimum that looks like a good assignment; fix with random restarts
- Works with any distribution; domain knowledge determines the E and M steps based on the data

18
Q

Clustering Properties

A

- Richness
for any desired clustering of the inputs, there is some distance matrix that produces it
- Scale invariance
doubling or halving all distances does not change how the clusters should be determined
- Consistency
shrinking point-to-point distances within clusters and expanding cluster-to-cluster distances does not change the clustering; this is an application of domain knowledge

19
Q

Impossibility Theorem

A

Richness, scale invariance, and consistency cannot all be true at the same time.

20
Q

Feature Selection

A

Two reasons:
1) Knowledge discovery
makes the features interpretable and provides insight
2) Curse of dimensionality
the amount of data you need grows exponentially in the number of features

21
Q

How hard is it to reduce a problem from n features to m features?

A

This is an exponentially hard problem: there are 2^n possible subsets, and choosing m of n features means C(n, m) ("n choose m") possibilities. It is NP-hard, matching known NP-hard problems such as 3-SAT.

22
Q

Filtering - Features

A

The features pass through some type of search algorithm and then on to the learning algorithm. The flow is forward only, with no feedback. The search can look at the labels, so you could use something like information gain; you could even use a decision tree as the filter.
Pro:
Speed
Cons:
Looks at features in isolation, though a feature may only matter when combined with another
Ignores the learner

23
Q

Wrapping - Features

A

Takes in the features, searches over a subset, passes it to the learning algorithm, and then updates the search with the learner's scores.
Pro:
Takes into account the learner and its model bias
Con:
Very slow, since the learner's running time is included in every evaluation

24
Q

Filtering Implementation

A

- Information gain
- Variance, entropy
- Gini index
- A neural net where we remove low weights
- "Useful" features
- Independent / non-redundant features

25
Q

Wrapping Implementation

A

Hill climbing, but taking the returned scores from the learner.
Randomized optimization in general can be used.
Forward search.

26
Q

Forward Search

A

Search through the entire set and find the best single feature, then try it in combination with each of the other features, and continue until the increase in score plateaus. (See the sketch below.)
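
A minimal forward-search sketch; score() is a hypothetical stand-in for "train the learner on this feature subset and return its score":

```python
# Minimal forward selection: greedily grow the subset while the score improves.
def forward_search(features, score, tol=1e-3):
    selected, best = [], float("-inf")
    while True:
        # try each remaining feature in combination with what we already have
        candidates = [(score(selected + [f]), f) for f in features
                      if f not in selected]
        if not candidates:
            return selected
        new_best, f = max(candidates)
        if new_best - best < tol:        # stop once the gain plateaus
            return selected
        selected.append(f)
        best = new_best

# toy usage: pretend only features "a" and "b" matter
print(forward_search(["a", "b", "c"], score=lambda s: len(set(s) & {"a", "b"})))
```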

27
Q

Backward Search

A

Try all combinations of n-1 features, then n-2, and so on. Stop at the point where the subset still does pretty well and beyond which the error dramatically increases.

28
Q

Relevance

A

x_i is strongly relevant if removing it degrades the Bayes Optimal Classifier.
x_i is weakly relevant if it is not strongly relevant and there is some subset of features S such that adding x_i to S improves the BOC.
Otherwise x_i is irrelevant.

29
Q

Bayes Optimal Classifier

A

Takes the weighted average of all hypotheses. The BOC is the best you can do on average, if you can find it.

30
Q

Relevance vs Usefulness

A

Relevance measures the effect on the BOC; usefulness measures the effect on a particular predictor. Relevance is about information; usefulness is about error given a model/learner.

31
Q

Feature Transformation

A

Pre-process a set of features to create a new set of features. The new set should be smaller or more compact, while retaining the relevant and useful information.

32
Q

Polysemy

A

One word can mean many things; this causes false positives.

33
Q

Synonymy

A

The same thing can be represented by different words; this causes false negatives.

34
Q

Principal Components Analysis

A

Transforms the data into a new space that can be used for feature selection by examining the eigenvalues; for instance, any component with an eigenvalue of 0 would be removed. It finds correlation, maximizes variance, and allows reconstruction.
Mutually orthogonal
Maximal variance
Ordered features
Bag of features (ordered features are a type of bag of features)

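A minimal sketch of PCA via the eigendecomposition of the covariance matrix (function name and toy data are illustrative, not a library API):

```python
# PCA sketch: keep the top-m eigenvectors of the covariance matrix.
# Components with (near-)zero eigenvalues carry no variance and can be dropped.
import numpy as np

def pca(X, m):
    Xc = X - X.mean(axis=0)                      # center the data
    cov = np.cov(Xc, rowvar=False)               # feature covariance matrix
    vals, vecs = np.linalg.eigh(cov)             # eigenvalues in ascending order
    order = np.argsort(vals)[::-1]               # reorder by maximal variance
    W = vecs[:, order[:m]]                       # top-m mutually orthogonal axes
    return Xc @ W, vals[order]                   # projected data + eigenvalues

X = np.array([[1., 1.], [2., 2.1], [3., 2.9], [4., 4.2]])
Z, eigenvalues = pca(X, m=1)
print(eigenvalues)   # second eigenvalue near 0 -> that direction can be removed
```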

35
Q

Independent Components Analysis

A

Aims for independence of features: creates new features from the base features through a linear transformation such that all the new features are statistically independent, with mutual information of zero between them.
Mutually independent
Maximal mutual information (between the new features and the original features)
Bag of features

36
Q

Random Components Analysis

A

Generates random directions and projects the data onto them. It works well if the next step is some type of classification. The projected dimension is lower than n, but not as low as PCA achieves. The big advantage is speed.

37
Q

Linear Discriminant Analysis

A

Finds a projection that discriminates based on the label, projecting the data into clumps or clusters by class.

38
Q

Three Types of Learning

A

Supervised learning: y = f(x). Like function approximation.
Unsupervised learning: f(x). Clustering/description.
Reinforcement learning: y = f(x) with an added z. A lot like function approximation with the added reinforcement signal z.

39
Q

Markov Decision Process

A

State: S
Model: T(s, a, s') ~ Pr(s' | s, a)
Actions: A(s), A
Reward: R(s), R(s, a), R(s, a, s')
-----------------------------------------
Policy: pi(s) -> a
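
As a concrete sketch of the definition above, a tiny two-state MDP written as plain Python dictionaries (the states, actions, and probabilities are made up for illustration):

```python
# A made-up two-state MDP: T[s][a] maps next-state s' -> Pr(s'|s,a),
# and R gives R(s), the reward for being in a state.
T = {
    "s0": {"stay": {"s0": 1.0},
           "go":   {"s1": 0.9, "s0": 0.1}},   # "go" usually reaches s1
    "s1": {"stay": {"s1": 1.0},
           "go":   {"s0": 1.0}},
}
R = {"s0": 0.0, "s1": 1.0}
policy = {"s0": "go", "s1": "stay"}           # pi(s) -> a

# Model check: Pr(s' = s1 | s = s0, a = go)
print(T["s0"]["go"]["s1"])   # 0.9
```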

40
Q

Markov State

A

The possible representations of where you can be inside the world.

41
Q

Markov Action

A

The things you can do in a state; the moves available to you when you are in that state.

42
Q

Markov Model

A

Given a state, an action, and a next state (state prime), it gives the probability of that next state given the state and action: T(s, a, s') = Pr(s' | s, a).

43
Q

Markov Property

A

Only the present matters: Pr(s' | s, a) depends only on the most recent state. You can work around this by having the current state remember everything it needs to know.

44
Q

Markov Reward

A

R(s), R(s, a), R(s, a, s'). There are several ways to define rewards, and one form may be more convenient than another. We will focus on R(s): a reward for being in a certain state.

45
Q

Markov Policy

A

pi(s) -> a: a function that takes in a state and gives you the action to take, a "command". pi* is the optimal policy, the one that maximizes reward.

46
Q

Temporal Credit Assignment Problem

A

The problem of assigning blame/credit to each move in a sequence when only the final state matters.

47
Q

Utility of Sequences

A

If one sequence has higher utility than another, the same ordering must hold for the sequences with their shared first state removed (stationarity of preferences).
Utility can be thought of as the sum of the rewards of each state; see the formula below.
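
In the standard discounted form (the discount factor gamma is the usual addition; the card itself only says "sum of rewards"):

```latex
U(s_0, s_1, s_2, \ldots) = \sum_{t=0}^{\infty} \gamma^{t} R(s_t), \qquad 0 \le \gamma < 1
```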

48
Q

Utility of State

A

The reward for that state, plus all the reward we can expect to receive for the rest of the sequence by following the policy from there (see the equation below).
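
Written as an equation (standard form; the discount factor gamma is an assumption beyond the card's wording):

```latex
U^{\pi}(s) = R(s) + \gamma \sum_{s'} T\big(s, \pi(s), s'\big)\, U^{\pi}(s')
```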

49
Q

Finding the Policy (Policy Iteration)

A

1) Start with a guess at the policy
2) Evaluate the policy by calculating the utility under that policy; because there is no max here, this is a linear equation solve
3) Improve the policy based on the utility function
(See the sketch below.)
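
A minimal NumPy sketch of these three steps on a tiny made-up two-state MDP, doing the evaluation with the exact linear solve the card mentions:

```python
# Policy iteration sketch. Evaluation solves U = R + gamma * T_pi U,
# i.e. (I - gamma * T_pi) U = R, exactly as a linear system.
import numpy as np

states, gamma = [0, 1], 0.9
R = np.array([0.0, 1.0])                       # R(s)
# T[s, a, s'] = Pr(s'|s,a): action 0 = "stay", action 1 = "go" (made up)
T = np.zeros((2, 2, 2))
T[0, 0, 0] = 1.0
T[0, 1] = [0.1, 0.9]
T[1, 0, 1] = 1.0
T[1, 1, 0] = 1.0

pi = np.array([0, 0])                          # 1) start with a guess at the policy
while True:
    T_pi = T[states, pi]                       # transitions under the current policy
    U = np.linalg.solve(np.eye(2) - gamma * T_pi, R)   # 2) evaluate: linear solve
    new_pi = np.argmax(T @ U, axis=1)          # 3) improve: greedy one-step lookahead
    if np.array_equal(new_pi, pi):
        break
    pi = new_pi

print(pi, U)   # expect pi = [go, stay] with higher utility in state 1
```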

50
Q

Modeler

A

Takes in transitions and puts out a model.

51
Q

Simulator

A

Takes in a model and produces transitions.

52
Q

Learner

A

Takes in transitions and puts out a policy.

53
Q

Planner

A

Takes in a model and puts out a policy.

54
Q

Model-Based Reinforcement Learning

A

Transitions -> Modeler -> Model -> Planner -> Policy

55
Q

RL-Based Planner

A

Model -> Simulator -> Transitions -> Learner -> Policy

56
Q

Three Main Approaches to RL

A

Policy search: states are given and the policy is derived directly.
Value function based: takes states and determines their utility; states are mapped to values.
Model based: (fairly direct learning) takes in states and actions and learns the model.