Descriptive Statistics Flashcards

1
Q

Categorical data?

A

Data that can be grouped by specific categories

2
Q

Tabular display methods for summarizing categorical data

A

Frequency distribution
Relative frequency distribution
Percent frequency distribution

3
Q

Graphical displays for a categorical variable

A

Bar chart
Pie chart

4
Q

What is a frequency distribution?

A

A frequency distribution is a table that shows the number (frequency) of observations in each of several non-overlapping categories or classes.
Its purpose is to provide information that cannot be obtained quickly by looking directly at the original data.

5
Q

Quantitative data?

A

Data that use numeric values to indicate how much or how many

6
Q

How do you find a relative frequency distribution?

A

The relative frequency of a class is the fraction or proportion of the total number of observations that belong to that class.
Relative frequency = frequency / n
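
As a minimal sketch of the rule relative frequency = frequency / n, the snippet below builds frequency, relative frequency, and percent frequency columns for one categorical variable. The function name `frequency_table` and the `answers` data are hypothetical, chosen only for illustration.

```python
from collections import Counter

def frequency_table(values):
    """Return {category: (frequency, relative frequency, percent frequency)}."""
    n = len(values)
    counts = Counter(values)
    return {
        category: (freq, freq / n, 100 * freq / n)
        for category, freq in counts.items()
    }

# Hypothetical sample of 10 categorical observations (illustrative only).
answers = ["A", "B", "A", "C", "A", "B", "A", "C", "B", "A"]

table = frequency_table(answers)
for category, (freq, rel, pct) in sorted(table.items()):
    print(f"{category:<4}{freq:>4}{rel:>8.2f}{pct:>8.1f}%")
```

The relative frequencies always sum to 1 and the percent frequencies to 100, which is a quick check on any hand-built table.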

7
Q

What are the 3 steps for building a frequency distribution?

A

For a frequency distribution of quantitative data, 3 steps are needed to define the classes:
Determine the number of non-overlapping classes
Determine the width of each class
Determine the class limits

8
Q

How do you determine the class width?

A

Use classes of equal width
Approximate class width = (largest value - smallest value) / number of classes
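
The approximate class width, (largest value - smallest value) / number of classes, can be sketched in code as follows. This assumes whole-number data and starts the first class at the smallest value; both choices, the function names, and the `days` data are illustrative assumptions rather than part of the card.

```python
def class_limits(values, num_classes):
    """Return (width, [(lower, upper), ...]) for equal-width, non-overlapping classes."""
    smallest, largest = min(values), max(values)
    # (largest - smallest) / num_classes is the approximate width; take the next
    # whole number above it so that num_classes * width spans every value.
    width = (largest - smallest) // num_classes + 1
    limits = [(smallest + i * width, smallest + (i + 1) * width - 1)
              for i in range(num_classes)]
    return width, limits

def frequency_by_class(values, limits):
    """Count how many values fall inside each (lower, upper) class."""
    return [sum(lower <= v <= upper for v in values) for lower, upper in limits]

# Hypothetical whole-number data (illustrative only).
days = [12, 14, 19, 18, 15, 15, 18, 17, 20, 27,
        22, 23, 22, 21, 33, 28, 14, 18, 16, 13]

width, limits = class_limits(days, num_classes=5)
counts = frequency_by_class(days, limits)
for (lower, upper), count in zip(limits, counts):
    print(f"{lower}-{upper}: {count}")
```

Here the approximate width is (33 - 12) / 5 = 4.2, so a width of 5 is used. In practice the limits are often shifted to rounder numbers.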

9
Q

What kind of chart do we draw to compare 2 or 3 sets of data?

A

To make comparisons:
Side-by-side bar chart
Stacked bar chart

10
Q

Which diagrams show the relationship between two variables?

A

- Scatter diagram
- Trendline

11
Q

What kinds of relationship can a scatter diagram show?
Draw the graphs.

A

Positive relationship (lec2b p.11)
Negative relationship
No apparent relationship

12
Q

How do you find the mean?

A

P.32

13
Q

Sample statistics?

A

If the measures are computed from sample data, they are called sample statistics.
Sample mean: $\bar{x} = \frac{\sum x_i}{n}$

14
Q

Population parameters?

A

If the measures are computed from population data, they are called population parameters.
Population mean: $\mu = \frac{\sum x_i}{N}$

15
Q

Weighted mean

A

$\bar{x} = \frac{\sum w_i x_i}{\sum w_i}$
$x_i$ = value of observation i
$w_i$ = weight of observation i
Numerator: sum of the weighted data values
Denominator: sum of the weights
If the data are a population, write $\mu$ instead of $\bar{x}$
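
A minimal sketch of the weighted mean, with the numerator as the sum of the weighted values and the denominator as the sum of the weights. The function name and the exam-score data are hypothetical.

```python
def weighted_mean(values, weights):
    """Weighted mean: sum(w_i * x_i) / sum(w_i)."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    numerator = sum(w * x for x, w in zip(values, weights))  # weighted data values
    denominator = sum(weights)                               # sum of the weights
    return numerator / denominator

# Hypothetical exam scores x_i with credit-hour weights w_i (illustrative only).
scores = [90, 80, 70]
credits = [1, 2, 1]

print(weighted_mean(scores, credits))
```

With equal weights the result reduces to the ordinary mean, which is a useful sanity check.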

16
Q

Trimmed mean?

A

The mean of the data after deleting a given percentage of the smallest and largest values
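
One way to sketch a trimmed mean in code is shown below. It assumes the percentage is removed from each end of the sorted data (so a 10% trimmed mean drops the smallest 10% and the largest 10%); the card does not spell this out, and the function name and data are hypothetical.

```python
def trimmed_mean(values, percent):
    """Mean after dropping `percent`% of the values from each end (assumed convention)."""
    ordered = sorted(values)
    k = int(len(ordered) * percent / 100)  # number of values dropped per end
    kept = ordered[k:len(ordered) - k]
    if not kept:
        raise ValueError("trimming removed every value")
    return sum(kept) / len(kept)

# Hypothetical data with one very small and one very large value.
data = [1, 10, 11, 12, 13, 14, 15, 16, 17, 100]

plain = sum(data) / len(data)    # ordinary mean, pulled up by the extreme value 100
trimmed = trimmed_mean(data, 10) # drops 1 and 100, then averages the rest
print(plain, trimmed)
```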

17
Q

Percentiles?

A

The pth percentile is a value such that approximately p% of the observations are less than it.
Location of the pth percentile: $L_p = \frac{p}{100}(n + 1)$
pth percentile: see p. 33
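
The location formula L_p = (p / 100)(n + 1) can be sketched as below. Linear interpolation between neighbouring sorted values when L_p is not a whole number is an assumption here (a common convention), since the card only points to a page for that step. The function name and data are hypothetical.

```python
def percentile(values, p):
    """pth percentile using the location L_p = (p / 100) * (n + 1)."""
    ordered = sorted(values)
    n = len(ordered)
    location = p / 100 * (n + 1)         # 1-based position in the sorted data
    location = min(max(location, 1), n)  # clamp to the available positions
    lower = int(location)                # whole part of the position
    fraction = location - lower          # fractional part of the position
    if lower == n:
        return ordered[-1]
    # Assumed rule: interpolate linearly between the two neighbouring values.
    return ordered[lower - 1] + fraction * (ordered[lower] - ordered[lower - 1])

# Hypothetical data (illustrative only).
data = [10, 20, 30, 40, 50, 60, 70, 80, 90]

print(percentile(data, 25))  # L_25 = 2.5, halfway between the 2nd and 3rd values
print(percentile(data, 50))  # L_50 = 5, the 5th value (the median)
```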

18
Q

What are quartiles?

A

Lower (first) quartile = Q1 (25th percentile)
Second quartile = Q2 (50th percentile, the median)
Upper (third) quartile = Q3 (75th percentile)

19
Q

What are measures of variability?

A

Measures of variability need to be considered together with measures of central tendency.

20
Q

What do measures of variability include?

A

Range = largest value - smallest value
Interquartile range: IQR = Q3 - Q1 (the range between the quartiles)
Sample variance: $s^2 = \frac{\sum (x_i - \bar{x})^2}{n - 1}$
Population variance: $\sigma^2 = \frac{\sum (x_i - \mu)^2}{N}$
Standard deviation: $s = \sqrt{s^2}$
Coefficient of variation: $CV = \left(\frac{s}{\bar{x}} \times 100\right)\%$
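
The measures above can be checked with Python's standard `statistics` module, as in this sketch. The data are hypothetical, and using `statistics.quantiles` with its default "exclusive" method (which places quartiles at position p(n + 1) / 100) is an assumption about which quartile convention the course uses.

```python
import statistics

data = [46, 54, 42, 46, 32]  # hypothetical sample (illustrative only)

data_range = max(data) - min(data)              # largest value - smallest value
q1, q2, q3 = statistics.quantiles(data, n=4)    # quartiles, default method="exclusive"
iqr = q3 - q1                                   # interquartile range
mean = statistics.mean(data)
sample_var = statistics.variance(data)          # divides by n - 1
population_var = statistics.pvariance(data)     # divides by N
sample_sd = statistics.stdev(data)              # square root of the sample variance
cv = sample_sd / mean * 100                     # coefficient of variation, in percent

print(data_range, iqr, sample_var, population_var, sample_sd, round(cv, 2))
```

Note that the sample variance (divide by n - 1) is always a little larger than the population variance (divide by N) computed from the same numbers.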

21
Q

Measures of distribution shape

A

Skewness = $\frac{n}{(n - 1)(n - 2)} \sum \left(\frac{x_i - \bar{x}}{s}\right)^3$
Skewness = 0: the mean and the median are equal (symmetric distribution)
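
A minimal sketch of the sample skewness formula, n / ((n - 1)(n - 2)) times the sum of cubed standardized deviations, using the sample standard deviation s. The function name and the three small data sets are hypothetical.

```python
import statistics

def skewness(values):
    """Sample skewness: n / ((n - 1)(n - 2)) * sum(((x_i - mean) / s) ** 3)."""
    n = len(values)
    if n < 3:
        raise ValueError("skewness needs at least 3 observations")
    mean = statistics.mean(values)
    s = statistics.stdev(values)  # sample standard deviation (n - 1 divisor)
    return n / ((n - 1) * (n - 2)) * sum(((x - mean) / s) ** 3 for x in values)

symmetric = [1, 2, 3, 4, 5]        # mean equals median
right_skewed = [1, 2, 2, 3, 12]    # long tail to the right
left_skewed = [-12, -3, -2, -2, -1]  # mirror image, long tail to the left

print(skewness(symmetric), skewness(right_skewed), skewness(left_skewed))
```

The sign of the result gives the direction of the tail: positive for a right tail, negative for a left tail.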

22
Q

How is skewness interpreted?

A

Moderately skewed left: skewness is negative.
The mean is usually less than the median.
Skewed right: skewness is positive.
The mean is usually greater than the median.
Highly skewed right: skewness is positive (often greater than 1).
The mean is usually greater than the median.

23
Q

Measures of relative location

A

Z-score (standardized value):
$z_i = \frac{x_i - \bar{x}}{s}$
Chebyshev's Theorem: At least $(1 - 1/z^2)$ of the items in any data set will be within z standard deviations of the mean, where z is any value greater than 1.
Equivalently, for any data set and any z > 1, at least $(1 - 1/z^2)$ of the values lie in the interval $\bar{x} \pm z \cdot s$.
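
The sketch below computes z-scores and compares Chebyshev's lower bound, 1 - 1/z^2, with the share of values actually within z standard deviations of the mean. The function names and the data are hypothetical.

```python
import statistics

def z_scores(values):
    """Standardize each value: z_i = (x_i - mean) / s."""
    mean = statistics.mean(values)
    s = statistics.stdev(values)  # sample standard deviation
    return [(x - mean) / s for x in values]

def chebyshev_bound(z):
    """Minimum proportion of values within z standard deviations (z > 1)."""
    if z <= 1:
        raise ValueError("Chebyshev's theorem requires z > 1")
    return 1 - 1 / z ** 2

data = [46, 54, 42, 46, 32]  # hypothetical sample (illustrative only)
scores = z_scores(data)

z = 2
within = sum(abs(score) <= z for score in scores) / len(scores)
print(scores)
print(f"at least {chebyshev_bound(z):.2f} expected, {within:.2f} observed")
```

Chebyshev's bound holds for any data set, so the observed share can never be smaller than the bound; it is usually much larger.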

24
Q

Measures of relative location: 2

A

Three-sigma Rule of Thumb (68–95–99.7 rule): For data with a bell-shaped distribution,
Approximately 68% of the data values will be within 1 standard deviation of the mean (𝜇 ± 𝜎).
Approximately 95% of the data values will be within 2 standard deviations of the mean (𝜇 ± 2𝜎).
Almost all (99.7%) of the data values will be within 3 standard deviations of the mean (𝜇 ± 3𝜎).

25
Q

Detecting outliers

A

Outlier: a data value with a z-score less than −3 or greater than +3. It might be:
an incorrectly recorded data value
a data value that was incorrectly included in the data set
a correctly recorded data value that belongs in the data set
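
A minimal sketch of the z-score rule for flagging outliers (|z| greater than 3). The function name, the threshold parameter, and the data are hypothetical; a flagged value still has to be reviewed by hand, since it may be a correct observation.

```python
import statistics

def z_score_outliers(values, threshold=3.0):
    """Return the values whose z-score is below -threshold or above +threshold."""
    mean = statistics.mean(values)
    s = statistics.stdev(values)  # sample standard deviation
    return [x for x in values if abs((x - mean) / s) > threshold]

# Hypothetical data: 13 ordinary values and one suspicious value of 50.
data = [10, 11, 9, 10, 12, 10, 9, 11, 10, 10, 11, 9, 10, 50]

print(z_score_outliers(data))
```

With very small samples no value can reach a z-score of 3, so this rule only becomes useful once there are a dozen or so observations.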

26
Q

What is the five-number summary?

A

Five-number summary: smallest value, first quartile (Q1), median (Q2), third quartile (Q3), largest value

27
Q

What is a box plot?

A

A box plot is a graphical display built from the five-number summary.

28
Q

What does the symbol * mean in a box plot?

A

Symbol *: Data values outside the lower and upper limits are considered outliers and are usually shown with the symbol * in the plot.
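
The sketch below computes a five-number summary and flags the values a box plot would mark with *. The card does not say how the lower and upper limits are set; the code assumes the common convention Q1 - 1.5 × IQR and Q3 + 1.5 × IQR, and the default "exclusive" quartile method of `statistics.quantiles`. Function names and data are hypothetical.

```python
import statistics

def five_number_summary(values):
    """Return (smallest, Q1, median, Q3, largest)."""
    q1, median, q3 = statistics.quantiles(values, n=4)  # default method="exclusive"
    return min(values), q1, median, q3, max(values)

def box_plot_outliers(values, k=1.5):
    """Values outside [Q1 - k*IQR, Q3 + k*IQR]; k = 1.5 is an assumed convention."""
    _, q1, _, q3, _ = five_number_summary(values)
    iqr = q3 - q1
    lower_limit = q1 - k * iqr
    upper_limit = q3 + k * iqr
    return [x for x in values if x < lower_limit or x > upper_limit]

# Hypothetical data (illustrative only).
data = [2, 4, 4, 5, 6, 7, 8, 9, 10, 11, 30]

print(five_number_summary(data))
print(box_plot_outliers(data))  # values a box plot would draw with *
```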

29
Q

What is Covariance?

A

Covariance is a measure of the linear relationship between two variables.

Sample: $s_{xy} = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}{n - 1}$
Population: $\sigma_{xy} = \frac{\sum (x_i - \mu_x)(y_i - \mu_y)}{N}$
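
A minimal sketch of the two covariance formulas, which differ only in the divisor (n - 1 for a sample, N for a population). The function names and data are hypothetical.

```python
def _sum_of_cross_products(xs, ys):
    """Sum of (x_i - mean_x) * (y_i - mean_y) over paired observations."""
    if len(xs) != len(ys):
        raise ValueError("xs and ys must have the same length")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))

def sample_covariance(xs, ys):
    """s_xy: divide by n - 1."""
    return _sum_of_cross_products(xs, ys) / (len(xs) - 1)

def population_covariance(xs, ys):
    """sigma_xy: divide by N."""
    return _sum_of_cross_products(xs, ys) / len(xs)

# Hypothetical paired data where y rises with x (illustrative only).
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 6, 8, 10]

print(sample_covariance(xs, ys), population_covariance(xs, ys))
```

A positive result indicates a positive linear relationship and a negative result a negative one. The size of the covariance depends on the units of x and y, so it does not measure strength on its own.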

30
Q

What is Correlation Coefficient?

A

Correlation is a measure of linear association, not of causation.
• A strong correlation between two variables does not mean that one variable causes the other.

31
Q

What is the formula for the correlation coefficient?

A

p. 40 ppt1

32
Q

How is the correlation coefficient interpreted?

A

The correlation coefficient takes values between -1 and +1.
• Values near -1 indicate a strong negative linear relationship.
• Values near +1 indicate a strong positive linear relationship.
• Values near 0 indicate a weak linear relationship.
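
To see the -1 to +1 range in practice, the sketch below uses the standard Pearson sample correlation, r = s_xy / (s_x s_y). This is assumed to be the formula the deck refers to; the function name and data are hypothetical.

```python
import math

def correlation(xs, ys):
    """Pearson sample correlation r = s_xy / (s_x * s_y)."""
    if len(xs) != len(ys):
        raise ValueError("xs and ys must have the same length")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    # The (n - 1) divisors of s_xy, s_x and s_y cancel, so they are omitted.
    return sxy / math.sqrt(sxx * syy)

xs = [1, 2, 3, 4, 5]
print(correlation(xs, [2, 4, 6, 8, 10]))   # near +1: strong positive
print(correlation(xs, [10, 8, 6, 4, 2]))   # near -1: strong negative
print(correlation(xs, [1, 2, 3, 2, 1]))    # near 0: no linear relationship
```

The last data set rises and then falls, so y clearly depends on x, yet r is near 0. This is why the coefficient is read as a measure of linear association only.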