Midterm Flashcards

Question

What is roll forward cross validation?

Answer 1

This is similar to cross validation, except you make sure your test data is always ahead of your training data. If you have 5 chunks, you may do something like the following: Train 1:, Test 2 Train 2: Test 3 Train 3: Test 4 Train 4: Test 5

Answer 2

You've got XTest, YTest data. Then you make predictions on this test data. Now you calculate the correlation between the YTest and YPredict to see how well correlated they are. Values are -1 (inversely correlated) -> +1 where -1 or +1 is strong positive or negative correlation. Closer to 0 is light the shotgun blast. (this is NOT the slope of a line of the data in a scatter plot with YTest vs YPred. More like a tight oval is high correlation and big circle is poor correlation) Can use np.corrcoef or df.corr(method='pearson')

Answer 3

When in sample error is decreasing and you start to see out of sample error increasing.

Answer 4

Train multiple learners on the same data set (may be different types of learners). Feed each learner the X value you want to predict, then average the output (for regression) or do a majority vote for classification.

Answer 5

What an algorithm is biased toward. For instance, a linear regression learner is biased toward being linear. Creating an ensemble learner tends to smooth out the individual biases.

Answer 6

AKA Bag learning. With your training data, you create m new models, each with a subset of the original data (sample with replacement). So if you've got 100 data points in your training set, new training set also has 100 data points, but there could be repeats from the training data since you sample with replacement. Then you query each model, collect the outputs and the mean() is the prediction.

Answer 7

Adaptive Boost Starts like bagging. Sample with replacement to create a new data set and train your first learner. Test your model against your training set and calculate error. Create next new data set, sampling with replacement, but the probability of choose a sample is now weighted by the error of the previous models results. This means you're more likely to choose data points that have high error. Now you test the current ensemble against your training set and calculate the error. Then repeat for the next bag. So it's like bagging except that for each new bag, you're testing on more of the data that you don't perform well on.

Answer 8

Close is literally just the close, adjusted close is adjusted for things like splits and dividend payments, etc.

Answer 9

df = pd.DataFrame(index=pd.date_range(start, end)) where start, end are date strings (YYYY-mm-dd)

Answer 10

pd. rolling_mean(df['col'], window=N) | pd. rolling_std(df['col'], window=N)

Answer 11

dr = ( (df / df.shift(1) ) - 1) [1:] You return from the first value onward since your first value will be NaN. You can also do: dr[1:] = (df[1:] / df[:-1].values) - 1 df.ix[0, :] = 0 Must use values to make sure it does element wise division and doesn't try to use index!

Answer 12

df. fillna(method='ffill') df. fillna(method='bfill') df. fillna(0)

Answer 13

plt. axvline(value, color, linestyle) | plt. axhline(value, color, linestyle)

Answer 14

df['col'].hist(bins=20)

Answer 15

Normalize each stock, then plot one stock on each axis. Then you can look at the correlation between the stocks.

Answer 16

df.plot(kind='scatter', x='col1', y='col2')

Answer 17

np.polyfit(x_s, y_s, degree=1)

Answer 18

spo.minimze(func_to_min, guess, args=(data,)) args is what gets passed to func_to_minimize as the constants first arg to your func should be the things to find to maximize something simply negate the func_to_minimize return value.

Answer 19

ETF: Exchange Traded Funds - buy/sell like stocks - represents baskets of stocks - transparent (publish their investments) - very liquid Mutual Fund: - buy/sell at end of day only - quarterly disclosure of investments - so less transparent - (they have stated goals, eg, track some fund, etc) Hedge Fund - buy/sell by an agreement only - no disclosure of investments - no transparency

Answer 20

How easy it is to buy/sell a given stock. How easy it is to get into and out of it.

Answer 21

of outstanding shares * price/stock Large Capitalization

Answer 22

ETF: 3-4 letters Mutual Funds: 5 letters Hedge Funds: Some fund name

Answer 23

Assets Under Management - total value of the portfolio being managed.

Answer 24

ETFs: Expense Ratio of 0.01 - 1.0 % Mutual Funds: 0.5 - 3.0% Hedge Funds: "Two and Twenty" 2% of AUM + 20% of profits

Answer 25

some % * AUM. How mangers may be paid.

Answer 26

- Cumulative Returns - Volatility (std(daily_returns) - Risk Adjusted Reward (Sharpe Ratio)

Answer 27

Buy (bid)/Sell (ask), Symbol, # Shares, Limit/Market, Price

Answer 28

Market: Willing to accept the current market price (you don't specify price for market order) Limit: You don't want to do any worse than the specified price. - buy: no more than X price - sell: no less than X price

Answer 29

Buyer -> broker -> exchange broker -> broker (doesn't hit exchange) broker -> dark pool -> broker (doesn't hit exchange)

Answer 30

1) Order Book Exploit: colocated hedge fund systems observe order book with faster response time. You, a long ways away see a delayed version of the book. They can buy and sell stocks really fast. (look at the High requency training vid) 2) Geographic Arbitrage Exploit: colocated server at different geographical exchanges. Look at diff in stock value at two locations. If diff exists, sell in the high place and buy in the low.

Answer 31

Exchanges: Buy Sell Market Limit Broker: Broker holds order until conditions are met and then fulfills it Stop Loss: when stock drops to certain price, sell it Stop Gain: When stock reaches certain price, sell it Trailing Stop: combination of stop loss w/ dynamic value. 10cents behind price, so sell if it ever drops by > 10 cents, otherwise let it go up. Selling Short: negative position. Borrow from someone, wait for stock to change, sell it, then re-buy the stock and give it back to the borrowee.

Answer 32

True Value: - Intrinsic Value - Book Value - Market Cap

Answer 33

As estimated by future dividends. If we own 1 share of stock, what is the value of all the dividends we will ever get into the future for that share of stock.

Answer 34

Value based on a companies assets and debts. Total assets - intangible assets - liabilities. You don't count intangible assets b/c they are both an assets and intangible asset.

Answer 35

Value of a company based on values of stock on market and how many shares are outstanding. Eg, prices * # of outstanding shares (# owned by people).

Answer 36

Future Value / (1 + interest_rate) ^ i where i is the number of compounding periods into the future (years, for instance). Example: If interest rate = 1%, and you want to know what is the value of $1 that will be paid to you in 1 year, what is the present value of that dollar? 1 / (1 + 0.01)^1 = 0.99. So today that $1 is only worth $.99 That interest_rate (really a discount rate when calculating PV), could be the rate of your next best alternative. That would pay for the opportunity cost.

Answer 37

It's the interest rate in the PV = (1 + ir)^i. With respect to bonds and intrinsic value, the ir is really the discount rate. It's higher if you trust the company less, or the company is more risky. - Interest Rate is used when givena PV and want to figure out FV. - Discount rate if use when we have a known or desired FV and want to compute the corresponding PV.

Answer 38

PV = FV / (n-1), where n = (1 + ir), so the PV = FV / discount_rate. So it's dividend / discount rate.

Answer 39

- Company specific news - Sector News: affects a specific sector - Market wide news This kind of news frequently reduces the view of potential dividends and thus the intrinsic value.

Answer 40

- Portfolio weights must sum to one, even if you short a stock - return = sum(w_i * r_i(t))

Answer 41

The weight of an individual stock within a market is: market_cap_i / sum(market_caps)

Answer 42

r_i(t) = beta_i * r_m(t) + alpha_i(t) return of stock i at time t = beta of i * return of market at time t + alpha of i at t beta_i *r_m(t) = market influence alpha_i (t) = residual When multiplied out, you get: sum(w_i*beta_i)*r_m + sum(w_i * alpha_i)

Answer 43

Expectation of alpha => 0

Answer 44

- passive: buy an index portfolio and hold | - active: pick stocks

Answer 45

You multiple each CAPM stock equation by it's weight and add them all up. Alternatively you can calculate a porfolio beta: B_p = sum(w_i * beta_i) Then use it as: r_p(t) = B_p * r_m(t) + alpha_p(t), but CAPM says alpha => 0

Answer 46

- Passive agrees with CAPM that alpha => 0 (expectation) | - active says alpha_p = sum(w_i*alpha_i(t))

Answer 47

You want beta to follow the market. So in upward markets you want large beta and in downward markets you want smaller beta.

Answer 48

Split up the CAPM beta into the various sectors instead of just having a 'market' beta. So each can be different betas for each sector. beta_tech * return_tech + beta_energy * return_energy, etc. (Not going to use this in this class)

Answer 49

Remove the market impact by making your sum(w_i * beta_i) = 0 so r_m doesn't matter. May not always work b/c beta is always changing and may not be a perfect estimate. But in theory, it wouldn't matter which way the market goes. You'd still make yours.

Answer 50

Looks only at price and volume ONLY!! You can compute indicators from these which are heuristics.

Answer 51

Looks at earnings, dividends, book value, cash flow, etc.

Answer 52

- Individual indicators are weak - Combination of many indicators are stronger - You look for contrasts (stocks that have different indicators) - Generally work over shorter time periods than longer time periods

Answer 53

- Momentum - over some number of days, how has the price changed? (up or down) - Simple Moving Average (SMA) - just what it sounds like - Bollinger bands - explained elsewhere

Answer 54

(price[t] / price[t-n:t].mean()) - 1 price of t divided by mean of last n days.

Answer 55

( price[t] - SMA[t] ) / 2 * std(t)

Answer 56

- SMA: -0.5 : 0.5 - Momentum: -0.5 : 0.5 - BB: -1 : 1 You would normalize them because you don't want any one field to 'outweigh' the others.

Answer 57

- DT is just a single tree - DF is many trees. You query all of them to get an average return. TODO: Lookup in code about this as well

Answer 58

Roll back time and then compare to known data. - Train on a set of data - Enter positions based on trained data - train on next set of data - Enter positions based on trained data - etc...

Answer 59

- noisy and uncertain - challenging to estimate confidence - holding time and allocation (how long should you hold, and how should you allocate)

Answer 60

Weight the contributions of each datapoint based on their distance from the point of interest.

Answer 61

- In Sample - testing on your training data | - Out of sample - testing on your non-training data

Answer 62

sqrt( sum( y_test - y_predict)^2 / N) - Tends to emphasize larger error a bit more

Answer 63

You slice it up into rolling chunks. Train on 80%, test on 20%. Then shift by a chunk, train on 80% and then test on 20%.

Answer 64

You can't peak into the future for time series, so you would only ever train on a single chunk then test on the next adjacent chunk. Then you shift until you're last test chunk is at the end.

Answer 65

When the in sample error is decreasing while the out of sample error is increasing.

Answer 66

You use many different learners, have each one do a prediction, then average the total prediction.

Answer 67

Lower error and less overfitting. Tends to reduce the overall biases.

Answer 68

(AKA bootstrap aggregating) - You have many of the same learner type. You create a new data set from the training data for each learner with replacement and train that learner on the dataset. Then when you predict you have each one predict the value using their model and then average the output.

Answer 69

- Like bagging, but each successive learner you train focuses on the data the previous model performed poorly on. - Next bag is random like bagging, but higher weights are placed on samples that you performed poorly on. - After each bag is created you have to test it on the training data to see which samples it performed poorly on. - Ada boost may be more prone to overfitting because it focuses on datapoints that aren't predicted well in the training data.

Answer 70

Look for when you've got strong momentum and the SMA crosses over the price. This is especially true over a longer SMA window.

Answer 71

( price[t] / price[t-n] ) - 1 Where n is the number of days to look over. Typical values are within the range: -0.5 to +0.5

Answer 72

You can compute the number of years it takes to double something via: N years = 72 / interest rate

Answer 73

The difference between the buy and sell prices. High trading volumes tend to indicate there is a very small spread and the market is very liquid. Large spreads may be an indicator that the market is freezing up (low liquidity). Low volumes in general tend to produce high spreads.

Answer 74

When a broker issues trades in advance of those of a client, knowing the price movements that will probably occur when the clients orders are executed. It's not legal to do to your own clients, but you can do it in advance of another brokers client.

Answer 75

Market Cap / Book value also: share price / book value per share

Answer 76

The sum of the discounted value of your money over a period of time. Example: dividend = $1 interest rate you could get on your money: 0.08 % number of years: 3 1 / (1 + 0.08) + 1 / (1 + 0.08)^2 + 1 / (1 + 0.08)^3 = $2.58 Each successive year's dividend is worth less than the previous.

Answer 77

Asset based is book value. This is typically a lower bound, but during 'bad' times (liquidation, bad market), it could actually be lower. It ignores future earnings. Cash flow based relate to dividends paid into the future. Adding the two together can show the most complete view.

Answer 78

book value / market cap

Midterm Flashcards

(102 cards)