Midterm Flashcards
Lock in (17 cards)
Computer Scientists
Computer scientists use programming to design new software and websites, protect computer systems from hackers, implement algorithms, and store data.
Statisticians
Statisticians design experiments and derive/apply models to discover trends and patterns in a dataset.
Data Scientists
Data scientists use programming to transform data into meaningful information using graphs, algorithms, and models.
Creating an interactive graph of product sales in the past 18 months
Data Scientist (graph)
Building a user interface for a data storage system containing data on product sales
Computer scientist (building interface)
Designing an experiment to compare three marketing strategies in different markets
Statistician (experimentation)
When was Python released?
1991
When was R released?
1993
When was the phrase “data science” coined?
Peter Naur introduced the term in 1974.
When did businesses first begin collecting and analyzing customer data?
In the 1990s.
What are the biggest data science packages? When were they created? Which language were they made for?
matplotlib (2003, Python)
ggplot2 (2005, R)
scikit-learn (2007, Python)
pandas (2009, Python)
dplyr (2014, R)
When did data science start gaining widespread attention?
Around 2012.
Which company is considered an early adopter of data science?
Google (search engines, ads)
Data Science Methods - Statistics
Hypothesis Testing
Confidence Intervals
Linear Regression
Analysis of Variance
Logistic Regression
Data Science Methods - Machine Learning
Decision Trees
Clustering
Dimension Reduction
Support Vector Machines
Logistic Regression
Data Science Methods - Artificial Intelligence
Machine Learning
Neural Networks
Deep Learning
Image Recognition
Text Processing