Lecture 1 Flashcards

1
Q

Big Data:

A

Big data refers to data that would typically be too expensive to store, manage, and analyze using
traditional (relational and/or monolithic) database systems.

2
Q

Data analysis

A

Examining, transforming, and arranging a given data set in specific ways in order to study its individual
parts and extract useful information.

3
Q

Data analytics

A

The science or discipline that encompasses the complete management of data. It includes analysis as well as data collection, organisation, storage, and all the tools and techniques used.

4
Q

Machine Learning

A

A set of computational methods that use experience to improve performance or make accurate predictions without being explicitly programmed.

5
Q

Machine learning is a sub-field of what?

A

Artificial Intelligence (AI)

6
Q

AI?

A

A man-made system that perceives, reacts, and analyzes.

7
Q

ML?

A

A system that learns from experience by itself.

8
Q

Supervised ML

A

We have input variables (x) and an output variable (y), i.e. labelled data.
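
A minimal sketch of the idea, not taken from the lecture: a straight line is fitted to a few labelled (x, y) pairs by ordinary least squares and then used to predict y for an unseen x. The data values and the name fit_line are made up for illustration.

```python
# Labelled data: every input x comes with a known output y.
# These toy values are an assumption for illustration (they follow y = 2x + 1).
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov_xy / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept


slope, intercept = fit_line(xs, ys)

# Use the learned relationship to predict the output for a new input.
prediction = slope * 5.0 + intercept
```

The labels (ys) are what make this supervised: the fit is judged against known outputs.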

9
Q

Unsupervised ML

A

We only have input variables (x), with no corresponding output (y), i.e. unlabelled data.
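
A minimal sketch, assuming clustering as the example task (the card itself does not name a technique): a one-dimensional k-means groups unlabelled points around two centers. The data, the starting centers, and the name kmeans_1d are hypothetical.

```python
def kmeans_1d(points, centers, iterations=10):
    """Tiny 1-D k-means: assign each point to its nearest center, then re-average."""
    groups = [[] for _ in centers]
    for _ in range(iterations):
        groups = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            groups[nearest].append(p)
        # Move each center to the mean of its group (keep it if the group is empty).
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return centers, groups


# Unlabelled data: inputs only, no y values (toy numbers, assumed for illustration).
points = [1.0, 1.2, 0.8, 9.0, 9.5, 10.0]
centers, groups = kmeans_1d(points, centers=[0.0, 5.0])
```

No output labels are supplied anywhere; the structure (two groups) is found from the inputs alone.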

10
Q

Steps of Analysis?

A
  1. Acquire
  2. Prepare
  3. Analyze
  4. Report
  5. Act
11
Q

Acquiring Data?

A

Finding the right data sources.

12
Q

Prepare Data includes?

A
  • Preliminary investigation
    • Correlations
    • General trends
    • Outliers
  • Summary statistics
    • Mean
    • Median
    • Range
    • Standard deviation
  • Visualization techniques
    • Histogram
    • Scatter plots
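
The summary statistics listed above can be sketched with Python's standard statistics module. The sample values are made up, and the "more than two standard deviations from the mean" rule for flagging outliers is an assumption chosen for illustration, not a rule from the lecture.

```python
import statistics

# Toy sample (assumed for illustration); 40 is deliberately far from the rest.
values = [2, 4, 4, 5, 7, 9, 40]

mean = statistics.mean(values)
median = statistics.median(values)
value_range = max(values) - min(values)
std_dev = statistics.stdev(values)  # sample standard deviation

# Assumed rule of thumb: flag values more than 2 standard deviations from the mean.
outliers = [v for v in values if abs(v - mean) > 2 * std_dev]
```

Comparing the mean with the median is a quick first check: a large gap between them often hints at outliers or a skewed distribution.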
13
Q

What do we need in order to transform raw data so that it is suitable for analysis?

A
  • Scaling
  • Transformation
  • Feature selection
  • Dimensionality reduction
  • Data manipulation
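
As one possible illustration of the scaling item above, min-max scaling maps values onto the range 0 to 1. The card does not say which scaling method the lecture uses, so this is only a sketch; min_max_scale and the sample values are hypothetical.

```python
def min_max_scale(values):
    """Rescale values linearly so the smallest becomes 0.0 and the largest 1.0."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so there is no spread to scale.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


scaled = min_max_scale([10, 20, 30, 50])
```

Scaling like this keeps features measured in large units from dominating features measured in small ones.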
14
Q

Analysing Data

A

Building a model from your data

15
Q

Different types of analysis techniques

A
  • Classification
  • Regression
  • Clustering
  • Association analysis
  • Graph analysis
16
Q

Report: tools for communicating results

A

Visualization is an important tool in presenting your results.
  • Python
  • R
  • D3
  • Leaflet
  • Tableau

17
Q

Act?

A

Determining what action or actions should be taken, based on the insights gained.