Course-5 Analyze data to Answer questions Flashcards

1
Q

Analysis

A

The process used to make sense of the data collected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Goal of analysis

A

The goal of analysis is to identify trends and relationships within data so you can accurately answer the question you are asking.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The 4 phases of analysis

A

-Organize data
- Format and adjust data
-Get input from others
-Transform data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Sorting

A

when you arrange data in a meaningful order to make it easier to understand, analyze and visualize.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Filtering

A

is used when you are only interested in seeing data that meets a specific criteria, and hiding the rest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Sort Sheet

A

All of the data in a spreadsheet is sorted by the ranking of a specific sorted column- data across rows is kept together.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Sort range

A

Nothing else on the spreadsheet is rearranged besides the specified cells in a column.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Customized sort order

A

When you sort data in a spreadsheet using multiple conditions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Sort function syntax

A

=SORT(Range,sort_column,true or false) True ascending and false is descending

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Incorrectly formatted data can:

A

1) Lead to mistakes
2) Take time to fix
3) Affect stakeholder’s decision-making

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How does data validation work?

A

It allows you to control what can and can’t be entered in your worksheet.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Data validation

A

1) Allows dropdown lists with predetermined options
2) Create custom checkboxes
3) Protect structured data and formulas

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Conditional formatting

A

A spreadsheet tool that changes how cells appear when values meet specific conditions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Concatenate

A

A function that joins together two or more text strings

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Text string

A

A group of characters within a cell, most often composed of letters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Openness ( Or open data)

A

Free access, usage, and sharing of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q
A

if(end>start,end-start,1-start+end)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Best Practices for searching online

A
  • Thinking skills
  • Data analytics terms
  • Basic knowledge of tools
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Mental model

A

Your thought process and the way you approach a problem.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

R

A

A programming language frequently used for statistical analysis, visualization, and other data analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Aggregation

A

Collecting or gathering many separate pieces into a whole.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Data aggregation

A

The process of gathering data from multiple sources in order to combine it into a single summarized collection.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Data Aggregation example

A

-Puzzle pieces= Data
-Organization = Aggregation
-Pile of pieces= Summary
-Putting the pieces together= gaining insights.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Data Aggregation Benefits

A
  • Identify trends
  • Make comparisons
  • Gain Insights
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Data can also be aggregated over a given time period to provide statistics such as:
- Average - Minimums - Maximums - Sums
26
Subquery
A query within another query
27
VLOOKUP( Vertical Lookup)
A function that searches for a certain value in a corresponding piece of information.
28
Value
A function that converts a text string that represents a number to a numerical value.
29
Remove duplicates
A tool that automatically searches for and eliminates duplicate entries from a spreadsheet.
30
=VLOOKUP Syntax
=VLOOKUP(103,A2:B26,2,False)
31
Recognize the limitations of VLOOKUP and fix some of the most common problems.
32
Troubleshooting Questions
-How should I prioritize these issues? - In a single sentence, what's the issue I am facing? - What Resources can help me solve the problem? -How can I stop this problem from happening in the future?
33
Absolute reference
A reference that is locked so that rows and columns won't change when copied.
34
MATCH
A function used to locate the position of a specific lookup value.
34
MATCH
A function used to locate the position of a specific lookup value.
35
MATCH
A function used to locate the position of a specific lookup value.
36
Join
A SQL clause that is used to combine rows from two or more tables based on a related column.
37
Common Joins
- Inner -Left -Right -Outer
38
Inner Join
A function that returns records with matching values in both tables.
38
Inner Join
A function that returns records with matching values in both tables.
39
Left Join
A function that will return all records from the left table and only the matching records from the right table.
40
Right Join
A function that will return all records from the right table and only the matching records from the left.
41
OUTER JOIN
A function that combines RIGHT and LEFT JOIN to return all matching records in both tables.
42
Count in spreadsheets
Can be used to count the total number of numerical values within a specific range in spreadsheets.
43
Count in SQL
A query that returns the number of rows in a specified range.
44
Count Distinct
A query that only returns the distinct values in a specified range.
45
Aliasing
When you temporarily name a table or column in your query to make it easier to read and write.
46
Subquery
A SQL query that is nested inside a larger query.
47
Having
Allows you to add a filter to your query instead of the underlying table that can only be used with aggregated functions.
48
CASE
Returns records with your conditions by allowing you to include if/ then statements in your query.
49
Data Calculations
-Formulas for basic calculations - Conditional formulas that use the IF Function -The SUMPRODUCT Function - Pivot tables to organize calculations - Queries and Calculations in SQL - Temporary tables in SQL
50
COUNTIF Syntax
COUNTIF(range, "value")
51
Summary Table
A table used to summarize statistical information about data.
52
SUMIF
A function that adds numeric data based on one condition
53
=SUMIF
(Range,criteria/condtion,[sum_range])
54
SUMPRODUCT
A function that multiplies arrays and returns the sum of those products.
55
SUMPRODUCT Syntax
=sumproduct(array1,[array2])
56
Array
A collection of values in cells
57
Profit margin
A percentage that indicate how many cents of profit has been generated for each dollar of sale
58
Pivot tables
Let you view data in multiple ways to find insights and trends.
59
Analysis steps
- Find out how much revenue was generated each year. - Build a pivot table to show the revenue per year. - Find the average revenue per movie. - Check our findings for some possible trends.
60
Calculated field
A new field within a pivot table that carries out a certain calculations based on the values of other fields.
61
Operator
A symbol that names the type of operation or calculation to be performed in a formula.
62
The Four Basic arithmetic operators in Spreadsheet formulas
+ Addition - Subtraction * multiplication / division
63
Modulo
An Operator (%) that returns the remainder when one number is divided by another.
64
Underscores
Lines used to underline words and connect text characters.
65
GROUP BY
A command that groups rows that have the same values from a table into summary rows.
66
Extract Command
Lets us pull one part of a given date to use
67
Data Validation process
Checking and rechecking the quality of your data so that it is incomplete, accurate, secure and consistent.
68
Temporary Table
A database table that is created and exists temporarily on a database server.