Introduction to Statistics Flashcards

1
Q

Statistics is derived from the Latin word

A

status

2
Q

status means

A

state

3
Q

its early uses involved compilation of data and graphs describing various aspects of the state or country

A

statistics

4
Q

actual numbers derived from data

A

statistics

5
Q

method of analyzing and
interpreting data.

A

statistics

6
Q

is a science which deals with the collection, presentation, analysis,
and interpretation of quantitative data

A

statistics

7
Q

a collection of quantitative
data, such as statistics of crimes, statistics on enrollment, statistics on unemployment, and the like.

A

statistics

8
Q

example of use of statistics

A

surveys
consumer preference
experiments
sampling
economics

9
Q

It deals with the methods of organizing, summarizing and presenting a mass of data so as
to yield meaningful information.

A

descriptive statistics

10
Q

It deals with making generalizations about a body of data where only a part of it is
examined. This comprises methods concerned with the analysis of a subset of data leading
to predictions or inferences about the entire set of data

A

inferential statistics

11
Q

is the set of all individuals or entities under consideration or study. It may be
a finite or infinite collection of objects, events, or individuals, with specified class or
characteristics under consideration.

A

population

12
Q

is a characteristic of interest measurable on each and every individual in the
population, denoted by any capital letter in the English alphabet.

A

variable

13
Q

types of variable

A

qualitative
quantitative

14
Q

consists of categories or attributes, which have non-numerical
characteristics.

A

qualitative variable

15
Q

consists of numbers representing counts or measurements

A

quantitative variable

16
Q

classification of quantitative variable

A

discrete
continuous

17
Q

results from either a finite number of possible values or a
countable number of possible values

A

discrete

18
Q

results from infinitely many possible values that can
be associated with points on a continuous scale in such a way that there are no gaps or
interruptions

A

continuous quantitative variable

19
Q

is part of the population or a sub-collection of elements drawn from a population

A

sample

20
Q

is a numerical measurement describing some characteristic of a population

A

parameter

21
Q

is a numerical measurement describing some characteristic of a sample

A

statistic
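
The difference between a parameter and a statistic can be sketched in Python. This is only an illustration under stated assumptions: the population below is made-up data, and the mean is just one possible numerical measurement.

```python
import random
import statistics

# Hypothetical population: ages of all 1,000 members of some group (made-up data).
random.seed(0)
population = [random.randint(18, 60) for _ in range(1000)]

# Parameter: a numerical measurement computed from the entire population.
population_mean = statistics.mean(population)

# Statistic: the same measurement computed from a sample (a part of the population).
sample = random.sample(population, 50)
sample_mean = statistics.mean(sample)

print("parameter (population mean):", population_mean)
print("statistic (sample mean):", sample_mean)
```

The two values are usually close but not equal, which is why inferential statistics treats a statistic as an estimate of a parameter.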

22
Q

is often conducted to gather opinions or feedback about a variety of topics

A

survey

23
Q

most often simply referred to as census, is conducted by gathering
information from the entire population

A

census survey

24
Q

most often simply referred to as survey, is conducted by gathering
information only from part of the population

A

sampling survey

25
classify the following quantitative variables: number of students, number of books, number of patients, age, monthly income, money
discrete quantitative
26
classify the following quantitative variables: height, weight, GWA, time, length
continuous quantitative variable
27
four levels of measurement
nominal, ordinal, interval, ratio
28
is characterized by data that consist of names, labels, or categories only
nominal
29
The data cannot be arranged in an ordering scheme. It is considered the weakest level, as it uses numbers and symbols only to classify objects, persons, or characteristics. This level or scale is classificatory in nature.
nominal
30
level of measurement ex: name, religion, civil status, address, sex, degree program
nominal
31
involves data that may be arranged in some order, but differences between data values either cannot be determined or are meaningless.
ordinal
32
This scale, although categorical in nature, shows a difference or some kind of relation between categories (greater than or less than).
ordinal
33
level of measurement ex: military rank, job position, year level
ordinal
34
is like the ordinal level, with the additional property that meaningful amounts of differences between data can be determined
interval
35
there is no inherent (natural) zero starting point.
interval
36
level of measurement ex: IQ score, temperature (°C and °F), dates
interval
37
is the interval level modified to include the inherent zero starting point. For values at this level, differences and ratios are meaningful.
Ratio
38
level of measurement ex: height, area, width, weekly allowance, temperature in kelvin (absolute zero)
ratio
39
what level of measurement is applied for qualitative variables
nominal, ordinal
40
level of measurement for quantitative variables
interval, ratio
41
Excel, JMP, SPSS, and Minitab are examples of what?
Statistical Software
42
classifying variables by type (2)
numerical, categorical
43
data that is derived from counting process (how many?)
discrete
44
data that is derived from a measuring process (how much?)
continuous
45
numbers from a population
parameter
46
all of the data that is collected in a particular study
data set
47
the entities on which the data are collected
elements
48
the characteristic from each element that we are studying
variable
49
each individual measurement
observation
50
data collected at one point in time
cross sectional data
51
data collected over several time periods
time series data
52
data that already exist in some form
secondary data
53
data that you collect for your use
primary data
54
data that exist within your corporation or organization
internal secondary data
55
data that exists outside your corporation or organization
external secondary data
56
process of capturing, storing, and maintaining data
data warehousing
57
system to extract information and uncover patterns
data mining
58
the process of using statistics to draw conclusions about population parameters
statistical inference
59
Determine whether the following statements use the area of descriptive statistics or statistical inference. A bowler wants to find his bowling average for the past 12 games.
descriptive
60
Determine whether the following statements use the area of descriptive statistics or statistical inference. A manager would like to predict based on previous years’ sales, the sales performance of a company for the next five years.
inferential
61
Determine whether the following statements use the area of descriptive statistics or statistical inference. A politician would like to estimate, based on an opinion poll, his chance for winning in the upcoming senatorial election.
inferential
62
Determine whether the following statements use the area of descriptive statistics or statistical inference. A teacher wishes to determine the percentage of students who passed the examination.
descriptive
63
Determine whether the following statements use the area of descriptive statistics or statistical inference. A student wishes to determine the average monthly expenditures on school supplies for the past five months.
descriptive
64
Determine whether the following statements use the area of descriptive statistics or statistical inference. A basketball player wants to estimate his chance of winning the most valuable player (MVP) award based on his current season averages and the averages of his opponents.
inferential
65
Classify the following statements as belonging to the area of descriptive statistics or statistical inference. Write DS for descriptive statistics and write IS for inferential statistics on the space provided. 1. Recording the number of persons infected with COVID 19 in a certain barangay. 2. If the present trend continues, the number of people infected with COVID 19 will reduce dramatically within a month. 3. In a certain city, arsonists deliberately set 3% of all fires reported last year. 4. Records show that cases of dengue have decreased in the last 5 years. 5. As a result of a recent poll, most Filipinos are in favor of electronic voting.
DS IS DS DS IS
66
Classify the following statements as belonging to the area of descriptive statistics or statistical inference. Write DS for descriptive statistics and write IS for inferential statistics on the space provided. 6. Philippines’ Gross Domestic Product (GDP) grew by 5.9% in 2019, 0.3 percentage points lower than its 6.2% performance in 2018. 7. The average grade of Annalyn, a BS Biology student, in her 8 subjects is 3.57. 8. In the United States, it was predicted from the current trend that the deaths due to COVID 19 will reach 15000 by the end of June 2020. 9. All four provinces of ARMM are among the 10 poorest provinces in the Philippines for 2002. 10. Data show the number of enrollees in private institutions will decrease by 15% next School Year.
DS DS IS DS IS
67
Identify the population, variable of interest, and type of variable in the following: 1. The dean of CSCS would like to determine the average weekly allowance of BS Computer Science students. 2. The registrar of DLSU-D would like to conduct a survey on the preferred courses of grade 12 students in Cavite. 3. The dean of a certain college would like to know the number of students who are smoking. 4. A survey by a group of students entitled “Dress Code” will be conducted among first year students to determine the fashion preferences of these students. 5. A group of researchers would like to know the number of deaths due to COVID 19 in all countries in Southeast Asia.
DQV Quali DQV Quali DQV
68
Identify the population, variable of interest, and type of variable in the following: 6. From all students registered this semester, the Mathematics and Statistics Department would like to know how many students like Statistics. 7. A study to be conducted by an NGO would determine the Filipinos’ awareness about the spread of COVID in the Philippines. 8. A group of students taking Statistics conducted a study on the effect of distance learning on the academic performance of the students. 9. Some parents would like to determine whether Mobile Legend is good or bad for the behavior of their children. 10. The head librarian would like to identify the e-book/s commonly read by DLSU-D students.
DQV Quali Quali Quali Quali Quali
69
Identify each statement as having discrete or continuous data. Write D for discrete and C for continuous. 1. Among the 855 deaths due to COVID19, 349 are female. 2. A student spent on average 3.54 hours per day studying his/her lesson. 3. Yesterday’s records show that there is an increase of 955 new cases of COVID19 infection. 4. A COVID 19 patient can recover from sickness in about 12-16 days. 5. Upon completion of a diet and exercise program, Elmer weighed 12.37 lbs. less than when he started the program.
D C D C C
70
Identify which of the following quantitative data would be represented by a discrete variable or a continuous variable. Write DV for discrete variable and write CV for continuous variable. 1. Number of pairs of pants 2. Time (in minutes) to finish a 4-km marathon 3. Circumference (in inches) of coconut trunk 4. Length (in cm) of harvested bamboo 5. Dimension (L*W*T) of the newest brand of cellular phone
DV CV CV CV CV
71
Identify which of the following quantitative data would be represented by a discrete variable or a continuous variable. Write DV for discrete variable and write CV for continuous variable. 6. Percentage increase in enrolment this year 7. Number of COVID 19 infections of 50 countries 8. Monthly income of 100 randomly selected persons at KADIWA Market 9. Number of heads when a coin is tossed 25 times 10. Age in years of COVID 19 patients
CV DV DV DV DV
72
At what level are the following variables measured? Write nominal, ordinal, interval or ratio on the space provided. 1. Student number 2. Emotional quotient of teachers 3. Telephone number 4. Species of orchid plants 5. Final course grades of 0.0, 1.0, 1.25, 1.50, …
N I N N I
73
At what level are the following variables measured? Write nominal, ordinal, interval or ratio on the space provided. 6. LEVEL OF COMPLIANCE, such as always, usually, frequently, sometimes, never 7. Intelligence quotients of 50 selected students in CSU. 8. Lengths of TV commercials ( in seconds) 9. The years 1896, 2000, 1776,1995 10. Attitude toward gun laws such as favorable, somewhat favorable, somewhat unfavorable
O I R N O
74
At what level are the following variables measured? Write nominal, ordinal, interval or ratio on the space provided. 11. Zip codes 12. Board exam rating 13. Harvest of rice in kilograms per hectare 14. Candidate voted for in 2019 senatorial elections 15. Tax Identification Number
N R R N N
75
16. Altitude of mountains liters of gasoline consumed day 17. Rate of success in the entrance exam 18. Systolic Blood pressure 19. Height of students 20. Number of COVID 19 infection per day
R R I R R
76
21. Number of won cases in court 22. Academic rank in High School 23. Savings Account Number 24. Are you a Pag-Ibig Member? (Yes/No) 25. Number of books sold per day
R O N N R
77
26. Weekly expenses in internet subscription of CvSU students 27. Main source of income 28. Birth order in the family 29. Number of organizations involved in 30. Car plate number
R N O R N
78
a narrative description of the data gathered.
textual method
79
a systematic arrangement of information into columns and rows
tabular method
80
an illustrative description of the data
graphical method
81
is a statistical table showing the frequency or number of observations contained in each of the defined classes or categories.
frequency distribution table
82
Parts of a Statistical Table (4)
table heading, body, stubs or classes, caption
83
includes the table number and the title of the table.
table heading
84
main part of the table that contains the information or figures.
body
85
classification or categories describing the data and usually found at the left most side of the table.
stubs or classes
86
designations or identifications of the information contained in a column, usually found at the topmost of the column.
caption
87
is a frequency distribution table where the data are grouped according to some qualitative characteristics; data are grouped into non numerical categories.
qualitative or categorical FDT
88
frequency distribution table where the data are grouped according to some numerical or quantitative characteristics.
quantitative FDT
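
A qualitative (categorical) FDT like the one described above can be sketched in Python with `collections.Counter`. The civil-status responses are made-up data used only for illustration.

```python
from collections import Counter

# Hypothetical raw data: civil status of 10 respondents (made-up values).
responses = ["single", "married", "single", "widowed", "single",
             "married", "single", "married", "single", "widowed"]

# Count how many observations fall in each category (the classes of the table).
frequency = Counter(responses)

# Print the table: classes in the left column (stubs), frequencies in the body.
print(f"{'Civil status':<15}{'Frequency':>10}")
for category, count in frequency.most_common():
    print(f"{category:<15}{count:>10}")
print(f"{'Total':<15}{sum(frequency.values()):>10}")
```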
89
range formula
R = highest value - lowest value
90
number of classes (K) formula
K = √N, where N is the total number of observations in the data set.
91
class size is determined by first computing what
preliminary class size
92
preliminary class size formula
c' = R/K
93
conditions of the actual class sizes are (2)
a. It should have the same number of decimal places as in the raw data. b. It should be odd in the last digit.
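
The range, number of classes, and preliminary class size formulas from the cards above can be sketched in Python. The scores are made-up data, and the sketch stops at c'. Choosing the actual class size under the two conditions is left out, since the cards do not say how c' is rounded.

```python
import math

# Hypothetical raw data: 25 exam scores (made-up values).
scores = [52, 67, 71, 45, 88, 93, 60, 75, 79, 81,
          58, 64, 70, 85, 90, 49, 73, 77, 66, 69,
          82, 95, 55, 62, 87]

N = len(scores)                  # total number of observations
R = max(scores) - min(scores)    # range = highest value - lowest value
K = math.sqrt(N)                 # number of classes, K = sqrt(N)
c_prime = R / K                  # preliminary class size, c' = R / K

print("N =", N, "| R =", R, "| K =", K, "| c' =", c_prime)
```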
94
other columns in FDT (5)
true class boundaries, class mark, relative frequency, cumulative frequency, relative cumulative frequency
95
TCB (true class boundaries) include (2)
Lower True Class Boundaries (LTCB), Upper True Class Boundaries (UTCB)
96
LTCB formula
LTCB = LL - (1/2)(unit of measure)
97
UTCB formula
UTCB = UL + (1/2)(unit of measure)
98
midpoint of the class interval where the observations tend to cluster about.
class mark
99
class mark formula
CM = (1/2)(LL + UL) or CM = (1/2)(LTCB + UTCB)
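
The true class boundary and class mark formulas above can be sketched in Python. The helper name `boundaries_and_mark` and the class limits are hypothetical. The data are assumed to be whole numbers, so the unit of measure is 1.

```python
def boundaries_and_mark(LL, UL, unit=1):
    """Return (LTCB, UTCB, CM) for a class with lower limit LL and upper limit UL."""
    LTCB = LL - unit / 2    # lower true class boundary
    UTCB = UL + unit / 2    # upper true class boundary
    CM = (LL + UL) / 2      # class mark computed from the class limits
    return LTCB, UTCB, CM

# Hypothetical class limits (LL, UL) for integer data.
classes = [(45, 55), (56, 66), (67, 77), (78, 88), (89, 99)]

for LL, UL in classes:
    LTCB, UTCB, CM = boundaries_and_mark(LL, UL)
    print(f"{LL}-{UL}: LTCB={LTCB}, UTCB={UTCB}, CM={CM}")
```

Both class mark formulas give the same value, because the half unit subtracted from LL is added back to UL.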
100
the proportion of observations falling in a class, expressed as a percentage.
relative frequency
101
RF formula
RF = frequency / N
102
%RF formula
%RF = (frequency / N) * 100
103
accumulated frequency of the classes.
cumulative frequency
104
total number of observations whose values do not exceed the upper limit of the class.
less than CF (<CF)
105
total number of observations whose values are not less than the lower limit of the class.
greater than CF (>CF)
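
The relative and cumulative frequency columns described above can be sketched in Python. The class frequencies are made-up values, listed from the lowest class to the highest.

```python
from itertools import accumulate

# Hypothetical class frequencies, ordered from the lowest class to the highest.
frequencies = [4, 5, 7, 6, 3]
N = sum(frequencies)    # total number of observations

# Relative frequency (RF) and percentage relative frequency (%RF) per class.
rf = [f / N for f in frequencies]
pct_rf = [100 * f / N for f in frequencies]

# Less than CF: running total from the lowest class upward.
less_than_cf = list(accumulate(frequencies))

# Greater than CF: running total from the highest class downward.
greater_than_cf = list(accumulate(reversed(frequencies)))[::-1]

for row in zip(frequencies, rf, pct_rf, less_than_cf, greater_than_cf):
    print(row)
```

The less than CF of the highest class and the greater than CF of the lowest class both equal N, which is a quick check on the table.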
106
is a device for showing numerical values or relationships in pictorial form
graph/chart
107
advantages of a graph/chart
main features and implications of a body of data can be seen at once; can attract attention and hold the reader’s interest; simplifies concepts that would otherwise have been expressed in so many words; can readily clarify data and frequently brings out hidden facts and relationships.
108
Qualities of a Good Graph:
It is accurate. It is clear. It is simple. It has a good appearance.
109
Common Types of Graph
line chart, scatter graph, pie chart, column and bar graph
110
graphical presentation of data especially useful for showing trends over a period of time.
line chart
111
Is a graph used to present measurements or values that are thought to be related.
scatter graph
112
a circular graph that is useful in showing how a total quantity is distributed among a group of categories. The “pieces of pie” represent the proportions of the total that fall into each category.
pie chart
113
like pie charts, column charts and bar charts are applicable only to grouped data. They should be used for discrete, grouped data of ordinal or nominal scale
column and bar graph
114
a bar graph that displays the classes on the horizontal axis and the frequencies of the classes on the vertical axis.
frequency histogram
115
a line chart that is constructed by plotting the frequencies at the class marks and connecting the plotted points by means of straight lines.
frequency polygon
116
graphs of the cumulative frequency distribution
ogives
117
the
118
the >CF is plotted against the LTCB
>ogive
119
study how to make FDT
+1