Chapter 3: Displaying Data Flashcards
(26 cards)
Frequency (f) Distribution
- Value of dependent variables are tabled/ plotted against their frequency of occurrence
- -Follows a logical order
- Summary of a set of data showing the frequency of each value (or range of values) of a variable
Real Lower Limit
Point halfway b/w bottom of one interval & top of the one below it
Real Upper Limit
Point halfway b/w top of one interval & bottom of the one above it
Midpoint
Center of the interval; average of upper & lower limits
Histogram
Graph where a rectangle is used to rep the freq of observations w each interval; data is collapsed into intervals
Stem & Leaf Plot
- Graphical data presenting original quantitative data arranged in a histogram
- -Retains both individual values & the frequency of those values
- -Helps us visualize the shape of a distribution
- Items are “binned”
Exploratory Data Analysis (EDA)
Set of techniques developed to present data in visually meaningful ways
-Ie. stem & leaf plot
Leading Digits
Leftmost digits of a #
-Most significant
Stem
Vertical axis of display containing the leading digits
Trailing Digits
Digits to the right of the leading digits
-Less significant
Leaves
Horizontal axis of display containing all trailing digits
Drawbacks of Stem & Leaf Plot
For some data sets, it’ll lead to course grouping, therefore including too many leaves for each stem
Bar Graph
Graph in which the freq of occurrence of different values of X is represented by the height of a bar
- Hard to read graphs b/w the X & Y axis: don’t consistently or strictly represent the ie. independent variable
- -Lots of graphs & variation
Line Graph
Y values corresponding to different X values are connected by a line
Symmetric/ Normal Distribution
- Same shape on both sides & center
- Distribution can be bisected at the mean & looks like mirror image
Bimodal Distribution
Two distinct peaks
Modal Distribution (Modality)
of meaningful peaks in a frequency of the data
Negatively Skewed
Distribution which trails off to the left
Positively Skewed
Distribution which trails off to the right
Skewness
- Measure of the degree to which a distribution is assymetrical
- Extent to which a distribution of scores deviates from symmetry
- More scores are clustered in one end (tail) than the other
Why Plot Data?
- Means of communicating/ simplifying otherwise ambiguous data
- -Despite popular belief, data can’t speak for itself
- We can look more easily at the global picture of data
Histogram: Frquency
How many events do we see in each category (X)?
Histogram: Relative Frequency (Rel f)
Of all the events, what proportion of the events happened in the category (X)?
Histogram: Cumulative Frequency (Cum f)
- How many events occurred over a # of categories (X)?
- Start by adding up all categories above the one you’re working with (this makes it cumulative)