Volume 1 - Quantitative Methods Flashcards

Question

What are the uses of arithmetic mean and what to do if there's a problem ?

Answer 1

The arithmetic mean can be usefull to explain the return for 1 year of an Index. Cross-sectional mean : Average sales of 50 companies Time-series mean : Average sales for the last 10 yrs for GM This mean is susceptible to outliers: We can do nothing if they are legitimate and contain meaningful information Or: Delete the outliers by doing a trimmed mean. Excluding a small % of the lowest and highest values (Ex: 5% --> 2.5% highest and 2.5% lowest) Or: Replace the 2.5% by the value at which all others lie above --> the 96th observation 88 so 2.5% also become 88.

Answer 2

unimodal: only 1 value that is most frequent bi-modial: two values have the highest frequency .... Or no mode --> Uniform distribution

Answer 3

It is used to interpret the growth rate. Ex: The rate that makes your investement grow form initial enter into now. Also referred to as compounded returns

Answer 4

It is appropriate for averaging ratios when the ratios are repeatedly applied to a fixed quantity to yield a variable number of units. Ex: Dollar cost averaging

Answer 5

The variability around the central tendency

Answer 6

Because TWR is unaffected by the timing and amount of cash flows, it is appropriate for assessing managers who do not control external cash flows, such as a mutual fund that is regularly receiving new contributions and making payouts to meet redemptions.

Answer 7

FVn = PV(1+rs/m)** mn PV = FVn (1+rs/m)** -mn

Answer 8

This example illustrates one of the limitations of annualizing returns, which is that the calculations are based on the assumption that short-term performance could be repeated over a longer period.

Answer 9

calculate and interpret the present value (PV) of fixed-income and equity instruments based on expected future cash flows calculate and interpret the implied return of fixed-income instruments and required return and implied growth of equity instruments given the present value (PV) and cash flows explain the cash flow additivity principle, its importance for the no-arbitrage condition, and its use in calculating implied forward interest rates, forward exchange rates, and option values

Answer 10

1- Discount instruments (zero-coupons) have a very simple structure. One amount PV is borrowed today and a larger amount FV is repaid when the loan matures. 2- With coupon instruments, a principal amount PV is borrowed today and the same amount FV is repaid at maturity, but the borrower compensates that lender with periodic interest payments PMT at regular intervals during the term of the loan. 3- An annuity instrument is structured as a specified number of level cash flows. A common example of an annuity is a fixed-rate mortgage. Like a coupon instrument, the borrower makes payments at regular intervals to retire the debt.

Answer 11

calculate, interpret, and evaluate measures of central tendency and location to address an investment problem calculate, interpret, and evaluate measures of dispersion to address an investment problem interpret and evaluate measures of skewness and kurtosis to address an investment problem interpret correlation between two variables to address an investment problem

Answer 12

Data grouped in intervals have modal intervals. This is the highest bar in a histogram.

Answer 13

There are many common quantiles used in practice. Distributions are often divided into four quartiles, five quintiles, ten deciles, or one hundred percentiles. For example, the 90th percentile score (P90) on an exam is the number that separates the top 10% scores from the bottom 90%.

Answer 14

The range is the difference between the maximum and minimum values

Answer 15

Target semideviation, or target downside deviation, captures dispersion of observations below a specified target value (e.g., 10%).

Answer 16

Mean has to be positive. No units of measurement

Answer 17

A negatively skewed distribution (long left tail) has frequent small gains and a few extreme losses. The mean is less than the median, which is less than the mode. !! Investors should be concerned if returns have this distribution.

Answer 18

(L) : Leptokurtic ( greater than 3, Fat Tails) --> Meaning that extreme returns are more common (M) : Mesokurtic (normal distribution) (P) : Platykurtic ( less than 3, Thin Tails)

Answer 19

A correlation coefficient can only be between –1 and +1, but covariance is not subject to the same constraint.

Answer 20

calculate expected values, variances, and standard deviations and demonstrate their application to investment problems formulate an investment problem as a probability tree and explain the use of conditional expectations in investment application calculate and interpret an updated probability in an investment setting using Bayes’ formula

Answer 21

A random variable's variance must be greater than zero because, if there is no dispersion of outcomes, the expected value is known with certainty and the variable is not random.

Answer 22

The conditional expected value of X given that scenario S occurs is expressed as E( X|S ).

Answer 23

Prior probabilities represent the probabilities before the arrival of any new information. The posterior probability reflects the new information.

Answer 24

calculate and interpret the expected value, variance, standard deviation, covariances, and correlations of portfolio returns calculate and interpret the covariance and correlation of portfolio returns using a joint probability function for returns define shortfall risk, calculate the safety-first ratio, and identify an optimal portfolio using Roy’s safety-first criterion

Answer 25

Covariance is positive if, when one asset is generating above-average returns, the other asset is as well. Both assets will also tend to generate returns below their respective averages in the same periods. Covariance is negative if one asset is generating above-average returns while the other's returns are below its average (or vice versa). The covariance of an asset's returns with itself (own covariance) is equal to its variance.

Answer 26

The safety-first ratio can be used in a portfolio context to account for correlations between returns on individual assets.

Answer 27

explain the relationship between normal and lognormal distributions and why the lognormal distribution is used to model asset prices when using continuously compounded asset returns describe Monte Carlo simulation and explain how it can be used in investment applications describe the use of bootstrap resampling in conducting a simulation based on observed data in investment applications

Answer 28

By definition, lognormal random variables cannot have negative values.

Answer 29

Difference between inital price and end of investment horizon

Answer 30

For example, there is no option pricing model for Asian-style options, which make payoffs based on the difference between the strike price and the average price of the underlying asset over a specific period.

Answer 31

The main drawback of this technique is that it produces only statistical estimates, while analytical methods provide exact results and insights into cause-and-effect relationships. In practice, bootstrap resampling is often used as a complement to analytical methods.

Answer 32

Analysts performing bootstrap: seek to create statistical inferences of population parameters from a single sample.

Answer 33

compare and contrast simple random, stratified random, cluster, convenience, and judgmental sampling and their implications for sampling error in an investment problem explain the central limit theorem and its importance for the distribution and standard error of the sample mean describe the use of resampling (bootstrap, jackknife) to estimate the sampling distribution of a statistic

Answer 34

The two types of sampling methods : 1- Probability sampling: Every member of the population has the same chance of being selected. The sample created is usually representative of the population. 2- Non-probability sampling: Non-probability considerations (such as the convenience to access data or the sampler's judgment) are used in sample selection. The sample created may not be representative of the population.

Answer 35

According to the central limit theorem, when the sample size increases, the distribution of the sample mean will converge to a normal distribution. This is true regardless of the actual distribution of the population.

Answer 36

B) Given a population described by any probability distribution (normal or non-normal) with finite variance, the central limit theorem states that the sampling distribution of the sample mean will be approximately normal, with the mean approximately equal to the population mean, when the sample size is large.

Answer 37

The greater the number of resamples, the smaller the estimated standard error of the sample mean. Boostrap is able to determine the standard error and confidence intervals for statistics such as the median. In addition, it produces accurate estimates without relying on any analytical formula.

Answer 38

explain hypothesis testing and its components, including statistical significance, Type I and Type II errors, and the power of a test. construct hypothesis tests and determine their statistical significance, the associated Type I and Type II errors, and power of the test given a significance level compare and contrast parametric and nonparametric tests, and describe situations where each is the more appropriate type of test

Answer 39

Provide an insight to this question by examining how a sample statistic describes a population parameter.

Answer 40

Together, the null and alternative hypotheses must be collectively exhaustive, which means that they account for every possible outcome. They must also be mutually exclusive, meaning that any outcome must either confirm the null hypothesis or provide sufficient evidence to indicate that the null hypothesis should not be accepted.

Answer 41

The probability of a Type I error is the level of significance of the test, which is denoted as 'a'. The complement of this probability, 1- 'a', is the confidence level. For example, a level of significance of 5% corresponds to a confidence level of 95%. There is a 5% chance of incorrectly rejecting a true null hypothesis.

Answer 42

The complement of the probability of a Type II error is the power of a test. This is the probability of correctly rejecting the false null hypothesis. The power equals 1 -'B' .

Answer 43

The null hypothesis is rejected when the test statistic is calculated to be more extreme than the critical value(s). In this case, the result is known to be statistically significant.

Answer 44

Can be used for a population with unknown variance and large sample (>= 30)

Answer 45

A paired comparison test is a statistical test for differences in dependent items.

Answer 46

The assumption that the variances are equal allows for the combining of both samples to obtain a pooled estimate of the common variance.

Answer 47

Test Concerning a Single Variance

Answer 48

The F-test is used based on the ratio of the sample variances. The F-distribution is bounded below by 0 and defined by two values of degrees of freedom – one for the numerator, and one for the denominator.

Answer 49

On the other hand, a nonparametric test is not concerned with parameters or makes minimal assumptions on the underlying population.

Answer 50

explain parametric and nonparametric tests of the hypothesis that the population correlation coefficient equals zero, and determine whether the hypothesis is rejected at a given level of significance explain tests of independence based on contingency table data

Answer 51

If the hypothesis being tested is that the two variables have a statistically significant relationship in the population, a two-sided test is used and the null hypothesis is p = 0 . The alternative hypothesis, Ha, that p has a non-zero value is only accepted if there is sufficient evidence to reject the null hypothesis.

Answer 52

One region of rejection, on the right side of the distribution. All else equal, the critical chi-square value will be higher as the number of degrees of freedom increases and the region of rejection narrows.

Answer 53

describe a simple linear regression model, how the least squares criterion is used to estimate regression coefficients, and the interpretation of these coefficients explain the assumptions underlying the simple linear regression model, and describe how residuals and residual plots indicate if these assumptions may have been violated calculate and interpret measures of fit and formulate and evaluate tests of fit and of regression coefficients in a simple linear regression describe the use of analysis of variance (ANOVA) in regression analysis, interpret ANOVA results, and calculate and interpret the standard error of estimate in a simple linear regression calculate and interpret the predicted value for the dependent variable, and a prediction interval for it, given an estimated linear regression model and a value for the independent variable describe different functional forms of simple linear regressions

Answer 54

The variation of Y is also referred to as the sum of squares total (SST), or the total sum of squares.

Answer 55

Typical time-series data has many observations from different time periods for the same company or asset class. For example, you could collect monthly data on inflation rates to determine if they impact short-term interest rates.

Answer 56

Another implication of the linearity assumption is that the independent variable must not be random (i.e., it must be non-stochastic). This is because the linear relationship between the dependent variable and the independent variable would not exist if the independent variable is random. Consequently, the residuals are random.

Answer 57

A violation of this assumption indicates that the data series may come from two different regimes.

Answer 58

Correlation of residual errors indicates that this assumption of independence has been violated.

Answer 59

For large sample sizes, the central limit theorem applies, and consequently we may drop the normality assumption. The test statistics of the regression coefficients are still valid even if residuals are not normally distributed.

Answer 60

If there is only one independent variable in the regression, then the coefficient of determination is equal to the square of the correlation between the dependent variable and the independent variable : R^2 = r^2

Answer 61

This regression allows testing whether there are different returns for months with an earnings announcement compared to months without an earnings announcement.

Answer 62

The smaller the P-value, the smaller the probability of Type I error, the more likely the regression is valid.

Answer 63

The P-value corresponding to the slope is less than 0.01, so we reject the null hypothesis of a zero slope

Answer 64

3 forms : 1- Log-lin : The slope coefficient is the relative change in the dependent variable for an absolute change in the independent variable. 2- Lin-log : The slope coefficient is the absolute change in the dependent variable for a relative change in the independent variable. Useful for significant difference in variables scale's. 3- Log-log : The slope coefficient is the relative change in the dependent variable for a relative change in the independent variable. Useful when calculating elasticities (relative change).

Answer 65

describe aspects of “fintech” that are directly relevant for the gathering and analyzing of financial data describe Big Data, artificial intelligence, and machine learning describe applications of Big Data and Data Science to investment management

Answer 66

- "Veracity" : Analysts must be able to trust the reliability and credibility of data sources.

Answer 67

3 main sources of alternative data are generated by individuals, business processes, and sensors. Alternative data are used to identify factors that affect security prices, which can then be used to improve asset selection and trading. But investment professionals should be cautious about collecting personal information that is protected by regulations.

Answer 68

It involves formats with diverse structures.

Answer 69

An algorithm initially identifies relationships in a training dataset before further testing is performed using an evaluation or validation dataset.

Answer 70

monitor communications among employees to ensure compliance with policies.

Answer 71

A is correct. Through the text analytics application of NLP, models using NLP analysis might incorporate non-traditional information to evaluate what people are saying—via their preferences, opinions, likes, or dislikes—in the attempt to identify trends and short-term indicators about a company, a stock, or an economic event that might have a bearing on future performance.

Answer 72

- The absence of an explicit economic rationale for a variable of trading strategy is the "no story" warning sign of a data-mining porblem. - The testing of many variables by the researcher is the "too much digging" warning sign of a data-mining porblem.

Volume 1 - Quantitative Methods Flashcards

(170 cards)