Chpt 2 - Organizing and Graphing Data Flashcards

1
Q

Why do we need to learn how to organize data?

A

Without organizing the raw data, we cannot obtain useful information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is data that is non-numerical, such as gender called?

A

Categorical

Also called qualitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is data that is numerical, such as the ages of the people involved in the study?

A

Numerical

Also called quantitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a variable?

A

A variable is a characteristic that varies from one person or thing to another.

Examples include gender, age, marriage status, number of email accounts, monthly salary, height, weight, etc.

Basically all of the headings of the columns is considered a variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the formal definition of data?

A

The values of a variable in a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How can we categorize variables? How can these be further broken down?

A

Categorical (qualitative) or Numerical (quantitative)

Numerical variables can be further broken down to:
Discrete or continuous

Categorical variables can be further broken down to:
Nominal or ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the types of numerical variables?

A

Discrete or continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a discrete variable? Give some examples

A

A numerical variable whose values can be listed

An example might be how many email addresses do you have? If you say 4, the person would be able to list out their 4 email addresses.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a continuous variable? Give some examples

A

A numerical variable whose possible values from some interval of numbers

Height is a continuous variable because its possible value is any number from 40cm to 300cm. While the height may not change once we are adults, we often don’t measure it in just feet….we use feet and inches, making this continuous as a value

Other examples are weight, temperature, and length.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Is age a discrete or continuous variable?

A

Age is also considered discrete because while you are constantly aging, we only list out age as whole numbers.

Anytime we are counting such as 1, 2, 3, it is considered discrete

That being said, with babies, we do measure the partial intervals, such as 1.5 years, or kids are only 5 days away from their next birthday…this context with looking forward and using increments, its considered a continuous variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a non-numerical variable that has no specified order called?

Give an example

A

Nominal

Marital status…there is no actual order in life that is set, you may get married, be common law, get separated, be single etc. without a defined order that everyone follows

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a non-numerical variable that has an order called?

Give an example

A

Ordinal

In many studies, there may be very like, like, neutral, dislike, or very dislike

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

If in a study, they coded females as 1 and males as 2, does this make it a numerical variable?

How can you test if the value is numerical or non-numerical?

A

No, it’s still a categorial variable because the numbers are meaningless and have no mathematical meaning

To check, do some basic math operations, if it makes sense it’s a numerical variable, if not, it is a non-numerical value. For example, if females are 1 and males are 2, you cannot add two females together to make 1 male. haha

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the purpose of organizing data?

A

To analyze the distribution of a variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What methods can we use to organize categorical data?

A

Numerical methods (frequency and relative frequency tables)

Graphical methods (pie charts and bar charts that help visualize the frequency/relative frequency tables)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the distribution of a variable?

A

A table or formula (he also said graph) that tells us:

1.What values this variable takes

  1. How often it takes these values in a population
17
Q

If we are interested in the distribution of gender, what would we need to know?

A
  1. What values this variable takes (so male/female)
  2. How often it takes these values in a population (so number of males and number of females)

We might also include transmen, transwomen, and nonbinary as other values, this would mean that we would need to also know the number of transmen, transwomen, and nonbinary people exist in the population

18
Q

What is frequency?

What is the frequency of genders if the sample of 10 students has 6 girls and 4 boys in it?

A

A numerical method or organizing categorical data

It’s counting :) It’s the number of times a particular distinct value of a variable occurs in a sample

Frequency of males is 4
Frequency of females is 6

19
Q

What is relative frequency?

What is the relative frequency of genders if the sample of 10 students has 6 girls and 4 boys in it?

A

A numerical method of organizing categorical data

It’s the ratio of the frequency to the sample size. The relative frequency of a particular distinct value of a variable shows the percentage of this value occurs in a sample

Relative frequency = frequency/sample size

Females: 6/10 or 60%
Males: 4/10 or 40%

20
Q

Once we have determined the frequencies or relative frequencies, we can make a frequency/relative frequency table to show the distribution of the data. What are the steps to this?

Give an example using the possible genders of a group of 10 students with 6 women and 4 men

A
  1. List out all the possible distinct values (male, female)
  2. Find the frequencies (f) and relative frequencies (f/n) of the distinct values
    Females - f=6, f/n=6/10
    Males - f=4, f/n=4/10
    (this would be listed in a simple table…but I can’t put one in here)
21
Q

What are the graphical methods used to organize categorical data?

A

Pie charts

Bar charts

22
Q

What are the numerical methods to organize categorical data

A

Frequency/relative frequency tables

23
Q

What is pie chart?

A

A disk divided into wedge-shaped pieces proportional to the relative frequencies of the categorical data

Once slice is allotted for each category and the angle of the slice = relative frequency x 360 degrees

24
Q

A study of 10 students showed that 6 liked rom-com movies. How would this be expressed on a pie chart?

A

f = 6
f/n = 6/10 or 60%

Equation for pie chart:
angle = f/n x 360
angle = 0.60 x 360
angle = 216 degrees

25
Q

What is the notation for frequency?

A

f

26
Q

What is the notation for relative frequency?

A

f/n

27
Q

What is a bar chart?

A

A display of the distinct values of categorical data on a horizontal axis and the relative frequencies or frequencies (depending on what we want) of those values on a vertical axis

One bar is for one category

The height of the bar = relative frequency, or frequency, of a value (so we might have a frequency of 6, or a relative frequency of 0.6, depending on what we want to present)

Bars should be positioned so that they DO NOT TOUCH each other

28
Q

What is the difference between frequency/relative frequency tables and contingency tables?

A

Frequency and relative frequency tables summarize information of a SINGLE (univariate) categorical data

Contingency tables summarize information of TWO SETS (bivariate) of categorical data

29
Q

What method will help us answer the question of are there more divorced female residents or more divorced male residents of a sample?

What would this look like?

A

Contingency table

There are TWO SETS of data to be examined, gender and marital status

Gender and Marital status would both be titles; under gender we would split males to females, and then under marital status, we would use each type of status as a subheading (married, divorced etc.). We would count each group of participants to summarize the totals. So how many females are married, how many males etc. to fill in the tables. The totals would be counted both horizontally and vertically

30
Q

What are contingency tables?

A

Tables that allow us to summarize and analyze the relationship between TWO variables

31
Q

What is the relationship between 2 variables?

A

Association

32
Q

What is an assocaiton?

A

The relationship between 2 variables

33
Q

How do we know 2 variables are associated?

A

If knowing the values of one of the variables tells us something about the values of the other variables

34
Q

What are possible values of seasons? What about weather?

Are these 2 variables associated? explain.

A

Season - spring, summer, fall, winter

Weather - sunny, cloudy, snowy, windy, thunder, raining etc.

Yes they are associated. Given the information about the season, we know more about the weather. Like knowing it is cold and snowy in the winter

35
Q

What are possible values of clothing size? What about GPA?

Are these 2 variables associated? explain

A

Clothing size - S, M, L, XL

GPA - 0.00-4.00

No they are not associated because knowing your clothing size tells us nothing about GPA

36
Q

What are are some of the ways we can further examine the relationship between 2 variables if it is difficult to observe in a contingency table?

A

Apply other statistical methods such as side-by-side bar or pie charts, stacked bar charts, etc.

37
Q
A