Introduction to statistics Flashcards

(45 cards)

1
Q

[used with a plural verb] Facts or data, either numerical or
non numerical, organized and summarized so as to
provide useful and accessible information about a
particular subject.

A

Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

[used with a singular verb] the science of organizing and
summarizing numerical or non numerical information

A

Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The collection of numerical information often leads to
large masses of data which, if they are to be understood,
or presented effectively, must be summarised and
analysed in some way.

A

Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The art of learning from data. It is concerned with the
collection of data, their subsequent description, and their
analysis, which often leads to the drawing of conclusions.

A

Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

The collection of all individuals or items
under consideration in a statistical study.

A

Population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

That part of the population from which
information is obtained

A

Sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Two types of statistics

A

descriptive and inferential

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Concerns the summarization of data. This entails
calculating numbers from the data, called
descriptive measures, such as percentages, sums,
averages, and so forth.

A

Descriptive Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

A conclusion drawn about the population
from which the data originated. In which data collected from a “sample”
population is used to infer properties of a larger
population. Consists of methods for drawing and measuring
the reliability of conclusions about a population

A

Inferential Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Descriptive statistics and Inferential
statistics are

A

Interrelated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Aims to describe a chunk of raw data using
summary statistics, graphs, and tables

A

Descriptive statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

It uses samples to draw
inferences about larger populations

A

Inferential statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

The tools used in inferential statistics:

A

•hypothesis testing
•regression analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

used to test whether a hypothesis about a
population is true or not

A

hypothesis testing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

the purpose of the study is to examine and explore
information for its own intrinsic interest only

A

descriptive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

the information is obtained from a sample of a
population and the purpose of the study is to use that
information to draw conclusions about the population

A

inferential

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

types of data

A

•quantitative data
•qualitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

___ is any piece of collected
information

A

datum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

is a collection of
data related to each other in some way

20
Q

Are any data that measure or are associated with a measurement
of the quantity of something. They invariably assume numerical values

A

Quantitative data

21
Q

2 categories of quantitative data:

A

•discrete data
•continuous data

22
Q

It take values in a finite or countably infinite set of
numbers, that is, all possible values could (at least in principle)
be written down in an ordered list.

A

Discrete data

23
Q

It take values in an interval of numbers. These
are also known as scale data, interval data, or measurement
data

Examples: height, weight, length, time, etc.

A

Continuous data

24
Q

It take values in an interval of numbers. These
are also known as scale data, interval data, or measurement
data

Examples: height, weight, length, time, etc.
They are often characterized by fractions or decimals: 3.82,
7.0001

A

Continuous data

25
Are any type of data that are not a numerical, or do not represent numerical quantities. Examples: subject’s name, gender, race/ethnicity, political party, socioeconomic status, class rank, driver’s license number, and social security number.
Qualitative data
26
2 types of factors of qualitative data
•nominal •ordinal
27
What do you call the possible values of a factor?
Levels
28
Factors that have levels that correspond to names of the categories with no implied ordering. (hair, gender, race or political party)
Nominal
29
Factors that have some sort of ordered structure to the underlying factor levels. (socioeconomic status; levels: income, education and occupation, class rank, cancer stage)
Ordinal
30
Factors that have some sort of ordered structure to the underlying factor levels. (socioeconomic status; levels: income, education and occupation, class rank, cancer stage)
Ordinal
31
It normally has columns which show the class intervals, class mid-points, class frequencies, and cumulative frequencies, the last of these being a running total of the frequencies themselves.
Grouped frequency distribution table
32
What table is being constructed from the raw data without having first arranged the values in rank order?
Tallied frequencies
33
___ shows, at a glance, how many items in the data are less than a specified value
cumulative frequency
34
It is sometimes more useful to use the ratio of the cumulative frequency to the total number of observations. This ratio is called the ___
relative cumulative frequency
35
It presents each distinct value along with its frequency of occurrence.
frequency table
36
A diagram which is directly related to a grouped frequency distribution table and consists of a collection of rectangles whose height represents the class frequency (to some suitable scale) and whose breadth represents the class width.
Histogram
37
A diagram which is directly related to a grouped frequency distribution table and consists of a collection of rectangles whose height represents the class frequency (to some suitable scale) and whose breadth represents the class width.
Histogram
38
Data from a frequency table can be graphically pictured by a line graph, which plots the successive values on the horizontal axis and indicates the corresponding frequency by the height of a vertical line.
Line graph
39
Sometimes the frequencies are represented not by lines but rather by bars having some thickness. These graphs, called bar graphs, are often utilized.
Bar graph
40
Another type of graph used to represent a frequency table is the frequency polygon, which plots the frequencies of the different data values and then connects the plotted points with straight lines.
Frequency polygon
41
Another type of graph used to represent a frequency table is the ___, which plots the frequencies of the different data values and then connects the plotted points with straight lines.
Frequency polygon
42
A plot of the relative frequencies looks exactly like a plot of the absolute frequencies, except that the labels on the vertical axis are the old labels divided by the total number of observations in the data set.
Relative frequency graphs
43
A plot of the relative frequencies looks exactly like a plot of the absolute frequencies, except that the labels on the vertical axis are the old labels divided by the total number of observations in the data set.
Relative Frequency Graphs
44
It is often used to plot relative frequencies when the data are nonnumeric.
Pie charts
45
It is used to show the relationship between two variables. It is the most accurate way to display correlations, as illustrated in the example below. However, some analysts prefer to use bar charts, as scatter plots can be difficult to interpret.
Scatter plot