Lesson 01 Flashcards

1
Q

Objectives

A
  • define the nature of data, data types, and unstructured data;
  • define the data science process as iterative steps;
  • identify the data science steps repeated to understand a problem; and
  • begin basic programming in Python including: loading data, creating and executing functions, creating charts.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is data?

A

Data are observations that are put into context
Given that every observation has a context, an
observation is a datum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a data type? Give examples.

A

A data type is called a unit. Examples: Meters per second, kilograms, joules, seconds

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Data Science Summary?

A
  • Science is the set of methods to extract
    meaning from data.
  • Data is observations in context.
  • Data Science involves predictive,
    prescriptive, and machine-learning analytics
    efforts.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the scientific method?

A

The scientific method is a body of techniques for investigating phenomena, acquiring new knowledge, or correcting and integrating previous knowledge.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Describe the steps in the traditional scientific method.

A
  1. Observe and collect data
  2. Formulate hypothesis based on the data
  3. Test hypothesis against new observations
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a model?

A

A model is a hypothesis based on data and a method (alogrithm)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a hypothesis?

A

A explanation of a dataset that allows a prediction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is falsification?

A

Falsification is the process that attempts to disprove a hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a theory?

A

A fact-based explanation for an observation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Describe the data science cycle.

A
  • Specify the needed data
  • ETL
  • Prepare Data for the Model
  • Build Model
  • Apply Model and Derive Insight
  • Present and Use Insight
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Where is most of the time in the data science cycle spent?

A

ETL & Preparing the data for the model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Which task in the data science life cycle are specific to the data scientist?

A
  • data preparation
  • building the model
  • build model and derive an insight
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What skills does building models require?

A
  • Stats/Math
  • Software Engineering
  • Domain Understanding / Communication Skills
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Which step of the data science lifecycle experience the most failures?

A

Present and Using the Insight

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Name the types of analytics that are used.

A

Descriptive, Diagnostic, Predictive, Prescriptive

17
Q

Name the analytics that answer questions about the past.

A

Descriptive, Diagnostic

18
Q

Name the analysis techniques that answer questions about the future.

A

Predictive, Prescriptive

19
Q

Define descriptive analytics and give examples.

A

Descriptive analytics answer the question “What happened?”

year-over-year pricing changes, month-over-month sales growth, the number of users, or the total revenue per subscriber, averages.

20
Q

Define diagnostic analytics and give examples.

A

Diagnostic analytics is a form of advanced analytics that examines data or content to answer the question, “Why did it happen?” It is characterized by techniques such as drill-down, data discovery, data mining and correlations.

21
Q

Define predictive analytics and give examples.

A

Predictive analytics examines the question “What will happen next?” Examples, models like classifications and regressions.

22
Q

Define prescriptive analytics and give examples.

A

Prescriptive analytics examines the question “What should we do?”Examples, Logistic Regression Coefficients,
recommender engines, partial
dependency functions

23
Q

Why is Python so popular in Data Science?

A
  1. Great libraries.
  2. Open Source and Free
  3. Scripting lang that does’t require compilation.