descriptive statistics lec Flashcards by Roshan Mojica

Observationofonevariablemay be shown visually by putting the variable’s on one axis and putting the frequency on the other.

Visual Presentation of Data

How well did you know this?

Not at all

Perfectly

A bar graph wherein the number of units observed is on the y-axis (_____) while the measurement levels are on the _____

frequency; x-axis; histogram

How well did you know this?

Not at all

Perfectly

in histogram the bars are..

The bars are visually proportional to each other.

How well did you know this?

Not at all

Perfectly

A figure that is shorthanded
presents a histogram.

Frequency Polygon

How well did you know this?

Not at all

Perfectly

A ___ is placed at the center of the top of the bars and connected to form a polygon. This better ennuncuates the data shape.

dot

How well did you know this?

Not at all

Perfectly

Basic graphs that can illustrate one or more data sets in one graph.

Line Graph

How well did you know this?

Not at all

Perfectly

two types of line graph and its difference

-Arithmeticlinegraphs
○ Have both x and y-axes on
an arithmetic scale.
○ Both values are numerical.

● Semi-logarithmic line graph ○ Has the y-axis as a
logarithmic axis

How well did you know this?

Not at all

Perfectly

Parameters of a Frequency Distribution

central tendency and dispersion

How well did you know this?

Not at all

Perfectly

Frequencydistributionsfrom continuous data are defined by types of descriptors, known as _____.

parameters

How well did you know this?

Not at all

Perfectly

● Defined as the value used to represent the center or the middle (average) of a set of data values.
● Locates observations on a measurement scale.

Central Tendency

How well did you know this?

Not at all

Perfectly

● Describes the spread of values in a given data set.
● Suggests how widely spread out the observations are.

Dispersion

How well did you know this?

Not at all

Perfectly

dispersion prefers…

Prefers low values, low variance, low standard deviation = not spread out data, results are not far from each other.

How well did you know this?

Not at all

Perfectly

Measures of Central Tendency

mean, median, mode

How well did you know this?

Not at all

Perfectly

Average value or the sum (Σ) of all
the observed values (𝑥𝑖) divided

mean

How well did you know this?

Not at all

Perfectly

has the most mathematical
properties and most representative of the dataset if not for our outliers.

mean

How well did you know this?

Not at all

Perfectly

The middle observation data when
data has been arranged from ______. When the dataset is an even number (hence no natural middle point), the two middling variables are averaged to find a median.

highest to lowest, median

How well did you know this?

Not at all

Perfectly

Rarely used to make inferential conclusions from, but is used frequently in-healthcare and economics.

median

How well did you know this?

Not at all

Perfectly

Most commonly observed value
(the value most frequently
observed).

mode

How well did you know this?

Not at all

Perfectly

The downside to using the mode

Study These Flashcards

a set of data may have no mode, or it may have more than one mode.

Measures of Dispersion

Study These Flashcards

Variance and sd, mean deviation

A statistical measurement of the
spread between numbers in a data
set.
● It measures how far each number
in the set is from the mean (average), and thus from every other number in the set.

Study These Flashcards

Variance

formula for variance

Study These Flashcards

lamo ne yen

sample, degree
offreedom

Study These Flashcards

N-1

Average amount of variability in
your dataset.
● It tells you, on average, how far
each value lies from the mean.

Study These Flashcards

Standard Deviation

A high standard deviation means

that values are generally far from the mean,

low standard deviation means

values are clustered close to the mean.

dierence between the observed value of a data point and the expected value is known as deviation in statistics.

Mean Deviation

the average deviation of a data point from the mean, median, or mode of the data set.

mean deviation or mean absolute deviation

Values that split sorted data or a probability distribution into equal parts.

Quantiles

A statistical term that describes a division of observations into four defined intervals based on the values of the data and how they compare to the entire set of observations.

Quartiles

A type of quantiles, obtained by adopting a subdivision into 100 groups.

Percentiles

Calculated by dividing an ordered set of data into 100 equal parts.

percentiles

Dierence between the highest and lowest values. ○ Size of the narrowest interval which contains all the data.

range

○ Dierence between the third and the first quartile. ○ Size of the narrowest interval which contains all the data.

InterquartileRange

A measure of the asymmetry of a distribution.

skewness

A distribution is asymmetrical when

its left and right side are not mirror images.

T/F: A distribution can have right (or positive), left (or negative), or zero skewness.

true

other term for skewness

horizontal imbalance

A descriptive statistic used to help measure how data disperse between a distribution’s center and tails, with larger values indicating a data distribution may have “heavy” tails that are thickly concentrated with observations or that are long with extreme observations.

Kurtosis

other term for Kurtosis

vertical imbalance

Xi means

FOr each individual observation

Xi means

Or each individual observation

The dierence between the observed value of a data point and the expected value is known as

Deviation

Uses boxes and lines to depict the distributions of one or more groups of numeric data.

Box plot

indicate the range of the central 50% of the data, with a central line marking the median value.

Box plot

descriptive statistics lec Flashcards

(45 cards)