Week 1 - Reading Flashcards
Decisions are…
- Often based on limited information.
- An investor does not know with certainty whether financial markets will be buoyant, steady or depressed
What is population?
- A population is a complete set of all items that interest an investigator.
- Population size, N, can be very large or even infinite
What is a sample?
- A sample is an observed subset of a population with sample size given by ‘n’.
Example of a population?
- All potential buyers of a new product
- All stocks traded on the LSE
- All registered voters in a country or region
What is random sampling?
- Simple random sampling is a procedure use to select a sample of ‘n’ objects from a population in such a way that each member of the population is chosen strictly by chance
- This method is so common that the adjective ‘simple’ is usually dropped
What is systematic sampling?
- Involves the selection of every ‘j’th item in the population, where ‘j’ is the ratio of the population size ‘N’ to the desired sample size, ‘n’; that is j=N/n
Give me an example of systematic sampling?
- Suppose that a sample size of 100 is desired and the population consists of 5,000 names in alphabetical order, then j=50. Randomly select a number from 1-50, your number is 20, giving the systematic sample of elements numbered 20, 70, 120, 170, 220, and so forth, until all 100 items are sampled
- Systematic sampling reduces bias and they provide a good representation of the population if there is no cynical variation in the population
What is a parameter?
A numerical measure that describes a specific characteristic of a population
What is a statistic?
A statistic is a numerical measure that describes a specific characteristic of a sample
What is the key thing you always have to remember when it comes to parameters and statistics?
We must realise that some element of uncertainty will always remain, as we do not know the exact value of the parameter.
That is, when a sample is taken from a population, the value of the population parameter will not be able to be known precisely
What is sampling error?
Results from the fact that information is available on only a small subset of all the population members
What are nonsampling errors?
In practical analyses, there is the possibility of an error unconnected with the kind of sampling procedure used
e.g:
- The population actually sampled is not the relevant one
- Survey subjects may give inaccurate answers
- There may be no response to survey questions
What are descriptive statistics?
Focus on graphical and numerical procedures that are used to summarise and process data
What are inferential statistics?
Focus on using the data to make predictions, forecasts, and estimates to make better decisions
What is a variable?
A specific characteristic of an individual or object
What are categorical variables?
- Produce responses that belong to groups or categories
- For example, responses to yes/no questions are categorical - ‘are you a business major - yes’
What are numerical variables?
variables that have values that are numbers and represent something that can be measured or counted
What are discrete numerical variables?
- May, but do not necessarily have a finite number of samples
- The most common type of discrete numerical variable produces a response that comes from a counting process. e.g the number of students enrolled in a class
What is a continuous numerical variable?
- These may take on any value within a given range of real numbers and usually arises from a measurement (not a counting) process
- Examples of continuous numerical variables include the weight of a cereal box or the time to run a race. In each case, the value could deviate within a certain amount
What is qualitative data?
- There is no measurable meaning to the ‘difference’ in numbers
- For example, one football player is assigned the number 7 and another player is number 10 - we cannot conclude that number 10 will play better than number 7
What is quantitative data?
- There is a measurable meaning to the difference in numbers
- When one student scores 90 on an exam and the other scores 45, the difference is measurable and meaningful
What can we use to describe categorical data?
- We can use frequency distribution tables and graphs such as bar charts, pie charts and pareto diagrams
What is a frequency distribution?
- A frequency distribution is a table used to organise data
- The left column includes all possible responses on a variable being studied
- The right column is a list of the frequencies, or number of observations for each class
When is a bar chart most likely used?
If our intent is to draw attention to the frequency of each category, then a bar chart is likely to be used