descriptive statistics lec Flashcards

(45 cards)

1
Q

Observationofonevariablemay be shown visually by putting the variable’s on one axis and putting the frequency on the other.

A

Visual Presentation of Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

A bar graph wherein the number of units observed is on the y-axis (_____) while the measurement levels are on the _____

A

frequency; x-axis; histogram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

in histogram the bars are..

A

The bars are visually proportional to each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

A figure that is shorthanded
presents a histogram.

A

Frequency Polygon

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

A ___ is placed at the center of the top of the bars and connected to form a polygon. This better ennuncuates the data shape.

A

dot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Basic graphs that can illustrate one or more data sets in one graph.

A

Line Graph

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

two types of line graph and its difference

A

-Arithmeticlinegraphs
○ Have both x and y-axes on
an arithmetic scale.
○ Both values are numerical.

● Semi-logarithmic line graph ○ Has the y-axis as a
logarithmic axis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Parameters of a Frequency Distribution

A

central tendency and dispersion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Frequencydistributionsfrom continuous data are defined by types of descriptors, known as _____.

A

parameters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

● Defined as the value used to represent the center or the middle (average) of a set of data values.
● Locates observations on a measurement scale.

A

Central Tendency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

● Describes the spread of values in a given data set.
● Suggests how widely spread out the observations are.

A

Dispersion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

dispersion prefers…

A

Prefers low values, low variance, low standard deviation = not spread out data, results are not far from each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Measures of Central Tendency

A

mean, median, mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Average value or the sum (Σ) of all
the observed values (𝑥𝑖) divided

A

mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

has the most mathematical
properties and most representative of the dataset if not for our outliers.

A

mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

The middle observation data when
data has been arranged from ______. When the dataset is an even number (hence no natural middle point), the two middling variables are averaged to find a median.

A

highest to lowest, median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Rarely used to make inferential conclusions from, but is used frequently in-healthcare and economics.

A

median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Most commonly observed value
(the value most frequently
observed).

A

mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

The downside to using the mode

A

a set of data may have no mode, or it may have more than one mode.

20
Q

Measures of Dispersion

A

Variance and sd, mean deviation

21
Q

A statistical measurement of the
spread between numbers in a data
set.
● It measures how far each number
in the set is from the mean (average), and thus from every other number in the set.

22
Q

formula for variance

23
Q

sample, degree
offreedom

24
Q

Average amount of variability in
your dataset.
● It tells you, on average, how far
each value lies from the mean.

A

Standard Deviation

25
A high standard deviation means
that values are generally far from the mean,
26
low standard deviation means
values are clustered close to the mean.
27
dierence between the observed value of a data point and the expected value is known as deviation in statistics.
Mean Deviation
28
the average deviation of a data point from the mean, median, or mode of the data set.
mean deviation or mean absolute deviation
29
Values that split sorted data or a probability distribution into equal parts.
Quantiles
30
A statistical term that describes a division of observations into four defined intervals based on the values of the data and how they compare to the entire set of observations.
Quartiles
31
A type of quantiles, obtained by adopting a subdivision into 100 groups.
Percentiles
32
Calculated by dividing an ordered set of data into 100 equal parts.
percentiles
33
Dierence between the highest and lowest values. ○ Size of the narrowest interval which contains all the data.
range
34
○ Dierence between the third and the first quartile. ○ Size of the narrowest interval which contains all the data.
InterquartileRange
35
A measure of the asymmetry of a distribution.
skewness
36
A distribution is asymmetrical when
its left and right side are not mirror images.
37
T/F: A distribution can have right (or positive), left (or negative), or zero skewness.
true
38
other term for skewness
horizontal imbalance
39
A descriptive statistic used to help measure how data disperse between a distribution’s center and tails, with larger values indicating a data distribution may have “heavy” tails that are thickly concentrated with observations or that are long with extreme observations.
Kurtosis
40
other term for Kurtosis
vertical imbalance
41
Xi means
FOr each individual observation
42
Xi means
Or each individual observation
43
The dierence between the observed value of a data point and the expected value is known as
Deviation
44
Uses boxes and lines to depict the distributions of one or more groups of numeric data.
Box plot
45
indicate the range of the central 50% of the data, with a central line marking the median value.
Box plot