Data Analytics Flashcards by jack milligan

iloc example

df.iloc[1,2] - single cell (200)
df.iloc[2] - Entire row (1000, 2000, 3000, 4000)

How well did you know this?

Not at all

Perfectly

loc example

Same as iloc but with string headings
e.g.
df.loc[2,’a’]

How well did you know this?

Not at all

Perfectly

describe

Summary of a single column

df.[‘a’].describe()

How well did you know this?

Not at all

Perfectly

Mean

The total of the figures, divided by the number of individual figures

1,2,2,3,2,4
Mean: 13/6 = 2.16666

How well did you know this?

Not at all

Perfectly

Median

The middle point

1,2,2,3,2,4 -> 1,2,2,2,3,4
Median: 2

How well did you know this?

Not at all

Perfectly

Mode

The most common Figure

1,2,2,3,2,4
Mode : 2

How well did you know this?

Not at all

Perfectly

Inter Qaurtile range

The Difference between the First and Third Qaurtile Values
Q1: 10
Q3: 50

IQR: 40

How well did you know this?

Not at all

Perfectly

Nominal

Categorisation without order e.g. the books are in: English, French, German etc.

Distinctiveness ( = and != )

How well did you know this?

Not at all

Perfectly

Ordinal

Categorisation with order e.g. the coffee was: Good, Medium, Bad

Distinctiveness ( = and != )
Order ( <,<=,>,>= )

How well did you know this?

Not at all

Perfectly

Interval

Scale with an arbitrary zero value e.g. temperature, shoe size, dates

Distinctiveness ( = and != )
Order ( <,<=,>,>= )
Addition ( + and - )

How well did you know this?

Not at all

Perfectly

Ratio

Scale with a non-arbitrary zero value e.g. distance, age, speed etc.

Distinctiveness ( = and != )
Order ( <,<=,>,>= )
Addition ( + and - )
Multiplication ( * and / )

How well did you know this?

Not at all

Perfectly

NOIR

Qualitative:
Nominal
Ordinal

Quantatitive:
Interval
Ratio

How well did you know this?

Not at all

Perfectly

DOAM

Distinctiveness (=, !=)
Ordering (<, <=, >, >=)
Addition (+, -)
Multiplication (*, /)

How well did you know this?

Not at all

Perfectly

Nominal : Binary

1/0, On/Off, Yes/No, True/False

How well did you know this?

Not at all

Perfectly

Normal Distribution

Standard Bell Curve

Mode, mean and Median are in the centre

How well did you know this?

Not at all

Perfectly

Left skewed

Study These Flashcards

Tail is on the left, Hump on the right

Left: Mean
Middle: Median
Right: Mode

“You’re mean when you walk away”

Right skewed

Study These Flashcards

Tail on the right, hump on the left

Left: Mode
Middle: Median
Right: Mean

“You’re mean when you walk away”

Tuple

Study These Flashcards

stores data but cant be changed

myTuple = (1,2,3)

List in relation to tuple

Study These Flashcards

Like a tuple but can be changed

myList = [1,2,3]

List

Study These Flashcards

ordered collection of elements supporting mixed data types

Array

Study These Flashcards

similar to a list but all must be of the same type

2D array or matrix

Study These Flashcards

a grid of elements with uniform data types

DataFrame

Study These Flashcards

two dimensional, potentially tabular data structure with labelled axes, allowing different data types for each column

e.g. SQL, or CSV

Measures of Dispersion

Study These Flashcards

Standard Deviation, and Variance

Variance

The averages of the squared differences form the mean

Standard Deviation in relation to variance

The square root of the variance

Standard Deviation (small and large)

Smaller: data points tend closer to the mean Larger: data points have greater variability

Correlation

Measures the strength and direction of the linear relationship between two variables

Correlation: -1, 1

1 = Perfect positive correlation -1 = Perfect negative correlation

Covariance

The degree to which two variables change together in a dataset

Strong and Weak Correlation

Strong Correlation: High degree of association between the two variables. Weak Correlation: Low degree of association between the two variables.

Data Analytics Flashcards

(31 cards)