Lesson 01 Flashcards
Objectives
- define the nature of data, data types, and unstructured data;
- define the data science process as iterative steps;
- identify the data science steps repeated to understand a problem; and
- begin basic programming in Python including: loading data, creating and executing functions, creating charts.
What is data?
Data are observations that are put into context
Given that every observation has a context, an
observation is a datum
What is a data type? Give examples.
A data type is called a unit. Examples: Meters per second, kilograms, joules, seconds
What is Data Science Summary?
- Science is the set of methods to extract
meaning from data. - Data is observations in context.
- Data Science involves predictive,
prescriptive, and machine-learning analytics
efforts.
What is the scientific method?
The scientific method is a body of techniques for investigating phenomena, acquiring new knowledge, or correcting and integrating previous knowledge.
Describe the steps in the traditional scientific method.
- Observe and collect data
- Formulate hypothesis based on the data
- Test hypothesis against new observations
What is a model?
A model is a hypothesis based on data and a method (alogrithm)
What is a hypothesis?
A explanation of a dataset that allows a prediction.
What is falsification?
Falsification is the process that attempts to disprove a hypothesis
What is a theory?
A fact-based explanation for an observation
Describe the data science cycle.
- Specify the needed data
- ETL
- Prepare Data for the Model
- Build Model
- Apply Model and Derive Insight
- Present and Use Insight
Where is most of the time in the data science cycle spent?
ETL & Preparing the data for the model
Which task in the data science life cycle are specific to the data scientist?
- data preparation
- building the model
- build model and derive an insight
What skills does building models require?
- Stats/Math
- Software Engineering
- Domain Understanding / Communication Skills
Which step of the data science lifecycle experience the most failures?
Present and Using the Insight