Ch1 Exploring Data Flashcards
(15 cards)
Define “Statistics”
Statistics is the study of collecting, organizing, analyzing, and interpreting data to help us learn about things in the world.
Define the word ‘element’
An element is just a single thing that we want to study or learn about. It could be a person, a number, a measurement, or any other thing we’re interested in.
For example, if we wanted to study how tall people are in a class, each person’s height would be an element. Or if we wanted to count how many apples are in a basket, each apple would be an element. Elements are just the individual pieces of information that we collect and use to help us learn more about a topic.
Define “Inference Statistics”
Inference statistics is a way to make predictions or draw conclusions based on the data that we have collected.
Define “Descriptive Statistics”
Descriptive statistics is a way of summarizing and describing information that we have collected.
It’s like telling a story about the data we have. For example, if we collected data on how many students in a class like math, we could use descriptive statistics to summarize the data by saying things like “12 out of 20 students like math.” We might also use descriptive statistics to show the data in a graph or chart so that it’s easier to understand. Descriptive statistics helps us see patterns and understand the data we have collected by summarizing and presenting it in a clear and easy-to-understand way.
What are the two types of data in statisics and give examples.
What type of graphs can be used to display categorical data?
Bar graphs and Pie charts. We can also make frequency tables to help use make these graphs.
what is the difference between a frequency table, a relative frequency table, and a cumulative frequency table?
A frequency table shows how many times each value or category appears
A relative frequency table shows the proportion or percentage of times it appears out of the total.
A cumulative frequency table shows how many times a certain value or category and all the values or categories before it appear.
What are contingency tables?
A contingency table is a way to organize and display data for two categorical variables.
Define the following from a contingency table:
1.) Joint Frequencies
2.) Marginal Frequency
3.) Marginal Distribution
4.) Conditional Distinition
Joint frequencies in a contingency table refer to the number of times a particular combination of categories occurs for two or more variables.
Marginal frequencies in a contingency table refer to the total number of occurrences of each category for one variable.
Marginal distributions in a contingency table refer to the proportion of occurrences of each category for one variable, compared to the total number of occurrences of all categories for that variable.
Conditional distribution in a contingency table refers to the proportion of occurrences of one category for a variable, given the value of another variable.
Describe Univariate, bivariate, and multivariate data
Univariate Data (one- variable) - describes a single characteristic
of a dataset or poquation
Bivariate Data (two variables) - describes the characteristics of each Subject
Multivariate Data (Many variables) - describes many characteristics for each subject
How do we display quantitative data?
1.) Dotplots (lineplots)
2.) stem-and-leaf plots
3.) histograms
4.) Box-and- wiskers plots(box plots)
What is a dotplot and how to draw one
A dotplot is a type of graph that is used in statistics to show the distribution of a set of data.
To create a dotplot, you start by drawing a number line that includes all of the values in the data set. Then, for each value in the data set, you draw a dot above its corresponding location on the number line. If there are multiple values that are the same, you can stack the dots on top of each other.
What is a stem-and-leaf (stemleaf) plot and how to make one?
A stemplot is a type of graph used in statistics to show the distribution of a set of data. It is also known as a stem-and-leaf plot.
To create a stemplot, you start by writing each data point in the dataset vertically in a column. Then, you take the first digit of each data point and write it horizontally on the left side of the graph. This is called the “stem” of the plot. Next, you write the second digit of each data point next to the corresponding stem, like the “leaves” of a tree.
What is a histogram, and how do you make one?
A histogram is a graph that shows how often different numbers or data values occur in a set of data. It’s a way to help you see patterns and trends in the data.
To make a histogram, you first need to decide what you want to measure and collect data for that measurement. For example, you might want to measure the heights of 10 year old kids in a class. You would need to measure the height of each child in the class and write down the measurements.
Next, you would divide the range of the measurements into equal-sized groups, called “bins.” For example, you might have bins that are one inch tall, and the range of heights might be from 48 inches to 60 inches. So you would have 12 bins.
Then, you would count how many measurements fall into each bin. For example, if there are two kids who are 50 inches tall, and three kids who are 52 inches tall, you would put those measurements in the appropriate bins.
Finally, you would draw a bar above each bin that represents the number of measurements in that bin. The bars are all the same width, but the height of the bar represents the frequency of the data in that bin. This gives you a visual representation of how the data is distributed.
What is the acronym to describe a graph for a graphical display of distribution?
CUSS
Center
Unusal
Shape
Spread