Basic Python Flashcards by Skye Long

Write a lambda function that adds a column to the dataframe ‘shoes’ that returns Not Vegan if the shoe is made out of leather and Vegan if it is not.

shoes['vegan'] = shoes.shoe_type.apply(lambda x: 'Not Vegan' if x == 'Leather' else 'Vegan')

Use the columns ‘gender’ and ‘last_name’ from ‘df’ to make a new column called ‘salutation’ that returns ‘Dear Mr.’ and the last name if gender is male or ‘Dear Ms.’ and last name if female.

df['salutation'] = df.apply(lambda row: 'Dear Mr. ' + row.last_name if row.gender == 'male' else 'Dear Ms. ' + row.last_name, axis=1)

Get the last name only from a column that has the format “Skye Long” called “name”

get_last_name = lambda x: x.split()[-1]

df['last_name'] = df.name.apply(get_last_name)

Rename individual columns in a df (as opposed to all of them)

df.rename(columns={'old': 'new', 'old2': 'new2'}, inplace=True)

Rename the following columns in order:

name, last name, user

df.columns = ['first_name', 'last_name', 'id']

This only works if the list has exactly as many names as the DataFrame has columns. Because the names are assigned by position, it is also easy to overwrite the wrong column by accident. It is usually safer to use df.rename(columns={'old': 'new'}, inplace=True).
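A minimal sketch of the difference between the two approaches, assuming pandas is installed. The one-row frame is made up for illustration and only borrows the column names from the card above.

```python
import pandas as pd

# Hypothetical frame using the three original column names from the card.
df = pd.DataFrame({"name": ["Skye"], "last name": ["Long"], "user": [1]})

# Positional rename: the list must have exactly one entry per column.
positional = df.copy()
positional.columns = ["first_name", "last_name", "id"]

# Targeted rename: only the listed column changes, the rest are untouched.
targeted = df.rename(columns={"name": "first_name"})

# A list of the wrong length raises ValueError instead of renaming.
too_short = df.copy()
try:
    too_short.columns = ["first_name", "last_name"]
    length_mismatch_raised = False
except ValueError:
    length_mismatch_raised = True
```

The targeted form names each column explicitly, so a change in column order does not silently relabel the wrong data.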

What would this produce:

[1,2] + [3,4]

[1,2,3,4]

What is the simple syntax to apply lambda to a row in a dataframe?

df.apply(lambda row: <expression using row['column_name']>, axis=1)

Example:

total_earned = lambda row: (row.hourly_wage * 40) + ((row.hourly_wage * 1.5) * (row.hours_worked - 40)) if row.hours_worked > 40 else row.hourly_wage * row.hours_worked

df['total_earned'] = df.apply(total_earned, axis=1)

Here the lambda is written separately, but it could also be passed inline to df.apply.
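As a sketch of the combined form, the same overtime rule can be passed inline to df.apply. The two-row frame below is invented for illustration and assumes pandas is installed.

```python
import pandas as pd

# Illustrative data; the column names follow the card above.
df = pd.DataFrame({"hourly_wage": [10.0, 20.0], "hours_worked": [38, 45]})

# Same overtime rule, with the lambda written directly inside apply.
df["total_earned"] = df.apply(
    lambda row: (row.hourly_wage * 40)
    + (row.hourly_wage * 1.5) * (row.hours_worked - 40)
    if row.hours_worked > 40
    else row.hourly_wage * row.hours_worked,
    axis=1,
)
```

Naming the lambda first tends to read better once the expression gets this long, which is probably why the card keeps it separate.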

What is the syntax for the .apply method?

For a single column: df.col.apply(func)

Example:

get_last_name = lambda x: x.split()[-1]

df['last_name'] = df.name.apply(get_last_name)

For whole rows: df.apply(row_func, axis=1)

List comprehension with a condition on a second list

list1 = [0, 1, 2, 3, 4, 5]

list2 = ['a', 'b', 'c', 'd', 'e', 'f']

I want the value of list2 at every position where the value of list1 is less than 3

new_list = [list2[i] for i in range(len(list2)) if list1[i] < 3]

Create a new list named double_nums by multiplying each number in nums by two

nums = [4, 8, 15, 16, 23, 42]

double_nums = [i * 2 for i in nums]

Create a list of the numbers from 0 to 5

new_list = list(range(6))
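In Python 3, range(6) on its own is a lazy range object rather than a list, so it needs to be wrapped in list() when an actual list is wanted. A small sketch (the variable name lazy is just for illustration):

```python
# range(6) describes the numbers 0..5 but does not store them as a list.
lazy = range(6)

# list() materializes the values.
new_list = list(lazy)

is_list_before = isinstance(lazy, list)
is_list_after = isinstance(new_list, list)
```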

Write this for loop as a list comp:

for x in l:
    if x >= 45: x + 1
    else: x + 5

[x+1 if x >=45 else x + 5 for x in l]

Write this as a for loop with an if statement

[x+1 for x in l if x >= 45]

for x in l:
    if x >= 45:
        x + 1

What will this code produce?

nums = [4, 8, 15, 16, 23, 42]

parity = [0 if i%2 == 0 else 1 for i in nums]

print(parity)

[0, 0, 1, 0, 1, 0]

Write this as a list comp:

nums2 = [4, 8, 15, 16, 23, 42]

parity2 = []

for i in nums2:
    if i % 2 == 0:
        parity2.append(0)
    else:
        parity2.append(1)

parity2 = [0 if i % 2 == 0 else 1 for i in nums2]

If a number is above 45, add 1; if it is below 10, subtract 1; otherwise add 5

l = [22, 13, 45, 50, 98, 69, 43, 44, 1]

l_2 = []
for i in l:
    if i > 45:
        l_2.append(i + 1)
    elif i < 10:
        l_2.append(i - 1)
    else:
        l_2.append(i + 5)

print(l_2)

l_3 = [i+1 if i > 45 else (i-1 if i<10 else i+5) for i in l]

Create a new list named first_character that contains the first character from every name in the list names

names = ["Elaine", "George", "Jerry", "Cosmo"]

first_character = [i[0] for i in names]

print(first_character)

Create a new list called greater_than_two, in which an entry at position i is True if the entry in nums at position i is greater than 2.

nums = [5, -10, 40, 20, 0]

greater_than_two = [True if i >2 else False for i in nums]

print(greater_than_two)

Create a new list named product that contains the product of each sub-list of nested_lists

product = [x1 * x2 for (x1, x2) in nested_lists]

Create a new list named greater_than that contains True if the first number in the sub-list is greater than the second number in the sub-list, and False otherwise.

nested_lists = [[4, 8], [16, 15], [23, 42]]

greater_than = [True if x1 > x2 else False for (x1, x2) in nested_lists]

https://colab.research.google.com/github/skyelong/Code-Academy/blob/master/Python/List_Comprehension_Code_Challenge.ipynb#scrollTo=f9aK6u58ROeq

Create a new list named first_only that contains the first element in each sub-list of nested_lists.

nested_lists = [[4, 8], [16, 15], [23, 42]]

first_only = [x1 for (x1, x2) in nested_lists]

https://colab.research.google.com/github/skyelong/Code-Academy/blob/master/Python/List_Comprehension_Code_Challenge.ipynb#scrollTo=f2y5829cSLPQ

Use list comprehension and the zip function to create a new list named sums that sums corresponding items in lists a and b. For example, the first item in the new list should be 5 from adding 1 and 4 together.

a = [1.0, 2.0, 3.0]

b = [4.0, 5.0, 6.0]

sums = [x1 + x2 for (x1, x2) in zip(a,b)]

https://colab.research.google.com/github/skyelong/Code-Academy/blob/master/Python/List_Comprehension_Code_Challenge.ipynb#scrollTo=zzd_TyqLS24I&line=4&uniqifier=1

You’ve been given two lists: a list of capitals and a list of countries. Create a new list named locations that contains the string “capital, country” for each item in the original lists. For example, if the 5th item in the capitals list is “Lima” and the 5th item in the countries list is “Peru”, then the 5th item in the new list should be “Lima, Peru”

capitals = ["Santiago", "Paris", "Copenhagen"]

countries = ["Chile", "France", "Denmark"]

locations = [x1 + ", " + x2 for (x1, x2) in zip(capitals, countries)]

https://colab.research.google.com/github/skyelong/Code-Academy/blob/master/Python/List_Comprehension_Code_Challenge.ipynb#scrollTo=jOeCN_NHUIy7

You’ve been given two lists: a list of names and a list of ages. Create a new list named users that contains the string "Name: name, Age: age" for each pair of elements in the original lists. For example, if the 5th item in the names list is "John" and the 5th item in ages is 42, then the 5th item in the new list should be "Name: John, Age: 42".

As you did in the previous exercise, concatenate your strings together using +. Make sure to add proper capitalization and spaces.

names = ["Jon", "Arya", "Ned"]

ages = [14, 9, 35]

users = ["Name: " + x1 + ", Age: " + str(x2) for (x1, x2) in zip(names, ages)]

print(users)

https://colab.research.google.com/github/skyelong/Code-Academy/blob/master/Python/List_Comprehension_Code_Challenge.ipynb#scrollTo=6Ol4V5VjUva5

Create a new list named greater_than that contains True or False depending on whether the corresponding item in list a is greater than the one in list b. For example, if the 2nd item in list a is 3, and the 2nd item in list b is 5, the 2nd item in the new list should be False.

a = [30, 42, 10]

b = [15, 16, 17]

greater_than2 = [True if x1 > x2 else False for (x1, x2) in zip(a, b)]

print(greater_than2)

https://colab.research.google.com/github/skyelong/Code-Academy/blob/master/Python/List_Comprehension_Code_Challenge.ipynb#scrollTo=qefmBpfVVkUh

Create a lambda function named contains_a that takes an input word and returns True if the input contains the letter 'a'. Otherwise, return False.

contains_a = lambda n: "a" in n

https://colab.research.google.com/github/skyelong/Code-Academy/blob/master/Python/Lambda.ipynb#scrollTo=bnAxko3shzmc

Create a lambda function named long_string that takes an input str and returns True if the string has over 12 characters in it. Otherwise, return False.

long_string = lambda x: True if len(x) > 12 else False

Create a lambda function named ends_in_a that takes an input str and returns True if the last character in the string is an a. Otherwise, return False.

ends_in_a = lambda x: True if x[-1] == "a" else False

Create a lambda function named add_random that takes an input named num. The function should return num plus a random integer number between 1 and 10 (inclusive).

import random

add_random = lambda num: num + random.randint(1, 10)

You run an online clothing store called Panda’s Wardrobe. You need a DataFrame containing information about your products. Create a DataFrame with the following data that your inventory manager sent you:

Product ID  Product Name  Color
1           t-shirt       blue
2           t-shirt       green
3           skirt         red
4           skirt         black

df1 = pd.DataFrame({
    'Product ID': [1, 2, 3, 4],
    'Product Name': ['t-shirt', 't-shirt', 'skirt', 'skirt'],
    'Color': ['blue', 'green', 'red', 'black']
})

From this dataframe, select the columns clinic_north and clinic_south

clinic_north_south = df[['clinic_north', 'clinic_south']]

Use iloc to return the third row from df

march = df.iloc[2]

Select rows 1-5 with .iloc from 'df'

df_1 = df.iloc[1:6]

Select rows from 'df' where the month is equal to 'January' and store them in a new DataFrame

january = df[df.month == 'January']

Select all rows for both 'March' and 'April' from df and store them in a new df called march_april

march_april = df[(df.month == 'March') | (df.month == 'April')]

Use .isin to find rows containing 'January', 'February', or 'March' in the column month

january_february_march = df[df.month.isin(['January', 'February', 'March'])]

Reset the index for a dataframe that you have subsetted. Remove the old index

df2.reset_index(drop=True, inplace=True)

Create a new column that changes the names to lower case using str.lower and .apply

df = pd.DataFrame([
    ['JOHN SMITH', 'john.smith@gmail.com'],
    ['Jane Doe', 'jdoe@yahoo.com'],
    ['joe schmo', 'joeschmo@hotmail.com']
], columns=['Name', 'Email'])

df['Lowercase Name'] = df.Name.apply(str.lower)

subject = ["physics", "calculus", "poetry", "history"]

Append 'computer science' to this list

subject.append("computer science")

subject = ["physics", "calculus", "poetry", "history"]
grades = [98, 97, 85, 88]

Zip these two together and add 'visual arts' and the grade 93

gradebook = list(zip(subject, grades))
gradebook.append(("visual arts", 93))

inventory = ['twin bed', 'twin bed', 'headboard', 'queen bed', 'king bed', 'dresser', 'dresser', 'table', 'table', 'nightstand', 'nightstand', 'king bed', 'king bed', 'twin bed', 'twin bed', 'sheets', 'sheets', 'pillow', 'pillow']

Find the length of this inventory

inventory_len = len(inventory)

third = inventory[2]
last = inventory[-1]
inventory1_4 = inventory[1:5]

twin_beds = inventory.count('twin bed')

Write a function named append_sum that has one parameter, a list named lst. The function should add the last two elements of lst together and append the result to lst. It should do this process three times and then return lst. For example, if lst started as [1, 1, 2], the final result should be [1, 1, 2, 3, 5, 8].

def append_sum(lst):
    for x in range(3):
        lst.append(lst[-1] + lst[-2])
    return lst

Write a function named larger_list that has two parameters named lst1 and lst2. The function should return the last element of the list that contains more elements. If both lists are the same size, then return the last element of lst1.

def larger_list(lst1, lst2):
    list1_len = len(lst1)
    list2_len = len(lst2)
    if list1_len > list2_len:
        return lst1[-1]
    elif list1_len < list2_len:
        return lst2[-1]
    elif list1_len == list2_len:
        return lst1[-1]

Create a function named more_than_n that has three parameters named lst, item, and n. The function should return True if item appears in the list more than n times. The function should return False otherwise.

def more_than_n(lst, item, n):
    if lst.count(item) > n:
        return True
    else:
        return False

Create a function called append_size that has one parameter named lst. The function should append the size of lst (inclusive) to the end of lst. The function should then return this new list. For example, if lst was [23, 42, 108], the function should return [23, 42, 108, 3] because the size of lst was originally 3.

def append_size(lst):
    x = len(lst)
    lst.append(x)
    return lst

lst = [23, 42, 108]
append_size(lst)

Create a function called every_three_nums that has one parameter named start. The function should return a list of every third number between start and 100 (inclusive). For example, every_three_nums(91) should return the list [91, 94, 97, 100]. If start is greater than 100, the function should return an empty list.

def every_three_nums(start):
    if start <= 100:
        lst = list(range(start, 101, 3))
        return lst
    else:
        lst = []
        return lst

Create a function named remove_middle which has three parameters named lst, start, and end. The function should return a list where all elements in lst with an index between start and end (inclusive) have been removed. For example, the following code should return [4, 23, 42] because elements at indices 1, 2, and 3 have been removed: remove_middle([4, 8, 15, 16, 23, 42], 1, 3)

def remove_middle(lst, start, end):
    new_list = lst[0:start] + lst[end+1:]
    return new_list

Create a function named more_frequent_item that has three parameters named lst, item1, and item2. Return either item1 or item2 depending on which item appears more often in lst. If the two items appear the same number of times, return item1.

def more_frequent_item(lst, item1, item2):
    cnt_item1 = lst.count(item1)
    cnt_item2 = lst.count(item2)
    if cnt_item1 > cnt_item2:
        return item1
    elif cnt_item1 < cnt_item2:
        return item2
    elif cnt_item1 == cnt_item2:
        return item1

print(more_frequent_item([2, 3], 2, 3))

Create a function named double_index that has two parameters: a list named lst and a single number named index. The function should return a new list where all elements are the same as in lst except for the element at index. The element at index should be double the value of the element at index of the original lst. If index is not a valid index, the function should return the original list. For example, the following code should return [1, 2, 6, 4] because the element at index 2 has been doubled: double_index([1, 2, 3, 4], 2)

def double_index(lst, index):
    # An out-of-range index leaves the list unchanged
    if index < 0 or index >= len(lst):
        return lst
    before = lst[:index]
    after = lst[index+1:]
    new = [lst[index] * 2]
    new_list = before + new + after
    return new_list

Given a dataframe df, add a new column square which contains the square of each value in the points column for each row.

df['square'] = df.points.apply(lambda x: x**2)

Select the rows from the dataframe inventory where the column location is Staten Island and save them to staten_island.

staten_island = inventory[inventory.location == 'Staten Island']

A customer just emailed you asking what products are sold at your Staten Island location. Select the column product_description from staten_island and save it to the variable product_request.

product_request = staten_island.product_description

Another customer emails to ask what types of seeds are sold at the Brooklyn location. Select all rows where location is equal to Brooklyn and product_type is equal to seeds and save them to the variable seed_request.

seed_request = inventory[(inventory.location == 'Brooklyn') & (inventory.product_type == 'seeds')]

Add a column to inventory called in_stock which is True if quantity is greater than 0 and False if quantity equals 0.

inventory['in_stock'] = inventory.quantity.apply(lambda x: True if x > 0 else False)

Petal Power wants to know how valuable their current inventory is. Create a column called total_value that is equal to price multiplied by quantity.

total = lambda row: row.price * row.quantity
inventory['total_value'] = inventory.apply(total, axis=1)

The DataFrame customers contains the names and ages of all of your customers. You want to find the median age (the column name age is assumed here):

median_age = customers.age.median()
print(median_age)

How many unique types of shoes from the df orders were bought? The name of the column is shoe_type

unique_type = orders.shoe_type.nunique()
print(unique_type)

Print out all the unique types of shoes that are in 'shoe_type' in the dataframe orders

print(orders.shoe_type.unique())

Our finance department wants to know the price of the most expensive pair of shoes purchased. Save your answer to the variable most_expensive.

most_expensive = orders.price.max()

Our fashion department wants to know how many different colors of shoes we are selling. Save your answer to the variable num_colors.

num_colors = orders.shoe_color.nunique()
num_colors

Suppose we have a grade book with columns student, assignment_name, and grade. We want to get an average grade for each student across all assignments. We could do some sort of loop, but Pandas gives us a much easier option: the method .groupby. Use .groupby to get the average grade

grades = df.groupby('student').grade.mean()

This is the general syntax of .groupby

df.groupby('column1').column2.measurement()

In the previous exercise, our finance department wanted to know the most expensive shoe that we sold. Now, they want to know the most expensive shoe for each shoe_type (i.e., the most expensive boot, the most expensive ballet flat, etc.). Save your answer to the variable pricey_shoes.

pricey_shoes = orders.groupby('shoe_type').price.max()

Usually, we’d prefer that those indices were actually a column. In order to get that, we can use reset_index(). This will transform our Series into a DataFrame and move the indices into their own column. You’ll usually see a groupby statement followed by reset_index:

df.groupby('column1').column2.measurement().reset_index()
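To make the effect of reset_index() concrete, here is a small sketch on a made-up gradebook (the names and grades are invented, and pandas is assumed to be installed). It compares the result with and without the trailing call.

```python
import pandas as pd

# Invented gradebook; column names follow the earlier groupby card.
df = pd.DataFrame({
    "student": ["Amy", "Amy", "Bob", "Bob"],
    "grade": [90, 80, 70, 90],
})

# Without reset_index: a Series whose index holds the student names.
as_series = df.groupby("student").grade.mean()

# With reset_index: a DataFrame where student is a regular column.
as_frame = df.groupby("student").grade.mean().reset_index()
```

Having student as an ordinary column makes the result easier to merge, rename, or pivot afterwards.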

For example, suppose we have a DataFrame teas containing data on types of tea:

id  tea                category  caffeine  price
0   earl grey          black     38        3
1   english breakfast  black     41        3
2   irish breakfast    black     37        2.5
3   jasmine            green     23        4.5
4   matcha             green     48        5
5   camomile           herbal    0         3
…

We want to find the number of each category of tea we sell.

teas_counts = teas.groupby('category').id.count().reset_index()

Use rename to rename the column 'id' to 'counts'

df = df.rename(columns = {'id' : 'counts'})

Modify the code that finds the most expensive shoe from each shoe type so that it ends with reset_index, which will change pricey_shoes into a DataFrame.

pricey_shoes = orders.groupby('shoe_type').price.max().reset_index()
pricey_shoes

We have a DataFrame of employee information called df that has the following columns:

id: the employee’s id number
name: the employee’s name
wage: the employee’s hourly wage
category: the type of work that the employee does

Our data might look something like this:

id     name           wage  category
10131  Sarah Carney   39    product
14189  Heather Carey  17    design
15004  Gary Mercado   33    marketing
11204  Cora Copaz     27    design
…

If we want to calculate the 75th percentile (i.e., the point at which 75% of employees have a lower wage and 25% have a higher wage) for each category, we can use the following combination of apply and a lambda function

high_earners = df.groupby('category').wage.apply(lambda x: np.percentile(x, 75)).reset_index()

Let’s calculate the 25th percentile for shoe price for each shoe_color to help Marketing decide if we have enough cheap shoes on sale. Save the data to the variable cheap_shoes.

cheap_shoes = orders.groupby('shoe_color').price.apply(lambda x: np.percentile(x, 25)).reset_index()

Pivot Tables general syntax

df.pivot(columns='ColumnToPivot', index='ColumnToBeRows', values='ColumnToBeValues')
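As a rough illustration of what the three arguments do, the sketch below pivots a tiny long-format table into a wide one. The counts are made up and pandas is assumed to be installed.

```python
import pandas as pd

# Illustrative long-format counts (the values are invented).
shoe_counts = pd.DataFrame({
    "shoe_type": ["boots", "boots", "clogs", "clogs"],
    "shoe_color": ["black", "brown", "black", "brown"],
    "id": [4, 1, 2, 3],
})

# One row per shoe_type, one column per shoe_color, cells filled from id.
wide = shoe_counts.pivot(columns="shoe_color", index="shoe_type", values="id")
```

Each distinct value of the columns argument becomes its own column, so the four long rows collapse into a 2 x 2 grid.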

Pivot this table to make it easier to read, and save it as shoe_counts_pivot. We want to know the number of orders for each color for each shoe type.

```
   shoe_type     shoe_color  id
0  ballet flats  black       2
1  ballet flats  brown       5
2  ballet flats  red         3
```

shoe_counts_pivot = shoe_counts.pivot(columns='shoe_color', index='shoe_type', values='id').reset_index()

The column utm_source contains information about how users got to ShoeFly’s homepage. For instance, if utm_source = Facebook, then the user came to ShoeFly by clicking on an ad on Facebook.com. Use a groupby statement to calculate how many visits came from each of the different sources. Save your answer to the variable click_source. Remember to use reset_index()!

click_source = user_visits.groupby('utm_source').id.count().reset_index()

Our Marketing department thinks that the traffic to our site has been changing over the past few months. Use groupby to calculate the number of visits to our site from each utm_source for each month. Save your answer to the variable click_source_by_month.

click_source_by_month = user_visits.groupby(['utm_source', 'month']).id.count().reset_index()

The head of Marketing is complaining that this table is hard to read. Use pivot to create a pivot table where the rows are utm_source and the columns are month. Save your results to the variable click_source_by_month_pivot.

click_source_by_month_pivot = click_source_by_month.pivot(index='utm_source', columns='month', values='id').reset_index()

A movie review website employs several different critics. They store these critics’ movie ratings in a DataFrame called movie_ratings, which has three columns: critic, movie, and rating. Write a command to find the average rating for each movie

movie_ratings.groupby('movie').rating.mean().reset_index()

The City Library has several branches throughout the area. They collect all of their book checkout data in a DataFrame called checkouts. The DataFrame contains the columns ‘location’, ‘date’, and ‘book_title’. If we want to compare the total number of books checked out at each branch, what code could we use?

checkouts.groupby('location').book_title.count().reset_index()

ad_clicks['is_click'] = ~ad_clicks.ad_click_timestamp.isnull()

What does ~ do in this operation?

The ~ is a NOT operator: it flips each True/False value element by element. isnull() tests whether each value of ad_click_timestamp is null, so the combined expression is True wherever a timestamp is present.
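A short sketch of the flip on a made-up click log, assuming pandas is installed. The user_id column and the timestamp strings are invented for illustration.

```python
import pandas as pd

# Hypothetical click log; None marks a user who never clicked.
ad_clicks = pd.DataFrame({
    "user_id": [1, 2, 3],
    "ad_click_timestamp": ["7:18", None, "8:02"],
})

# True where the timestamp is missing.
is_null = ad_clicks.ad_click_timestamp.isnull()

# ~ inverts every element, giving True where a timestamp exists.
ad_clicks["is_click"] = ~is_null

# notnull() expresses the same test without the explicit inversion.
same = ad_clicks.ad_click_timestamp.notnull()
```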

Create a new column called is_click, which is True if ad_click_timestamp is not null and False otherwise.

ad_clicks['is_click'] = ~ad_clicks.ad_click_timestamp.isnull()

What is the difference in outcome between these two lines of code?

crushing_it = sales_vs_targets[sales_vs_targets.revenue > sales_vs_targets.target]

crushing_it = sales_vs_targets.revenue > sales_vs_targets.target

The first gives you a DataFrame that contains only the rows where revenue is greater than target. The second gives you a Series of True/False values for the condition revenue > target.
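The two results can be compared side by side in a small sketch. The three rows of sales figures are invented and pandas is assumed to be installed.

```python
import pandas as pd

# Made-up monthly figures for illustration.
sales_vs_targets = pd.DataFrame({
    "month": ["January", "February", "March"],
    "revenue": [120, 90, 150],
    "target": [100, 100, 140],
})

# The bare comparison: one True/False per row.
mask = sales_vs_targets.revenue > sales_vs_targets.target

# Indexing with that mask: only the rows where it is True.
crushing_it = sales_vs_targets[mask]
```

Keeping the mask in its own variable can be handy when the same condition is reused, for example to count matching rows with mask.sum().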

What is a left merge?

A left merge includes all rows from the first (left) table, but only rows from the second (right) table that match the first table.

What is an inner merge?

This results in a table that has only the rows with matching values. Non-matching rows are dropped.

What is an outer merge?

This results in a table that includes all rows from both tables. Where a row has no match in the other table, the missing columns are filled with NaN.

What is a right merge?

Here, the merged table will include all rows from the second (right) table, but only rows from the first (left) table that match the second table.
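The four merge types described in the cards above can be compared on two tiny tables that share only some keys. This is a sketch with invented data, assuming pandas is installed; the names left, right, and key are placeholders.

```python
import pandas as pd

# Two small tables that overlap on keys "b" and "c" only.
left = pd.DataFrame({"key": ["a", "b", "c"], "left_val": [1, 2, 3]})
right = pd.DataFrame({"key": ["b", "c", "d"], "right_val": [20, 30, 40]})

# Run the same merge with each value of `how`.
merged = {
    how: pd.merge(left, right, how=how)
    for how in ["inner", "left", "right", "outer"]
}

# Which keys survive each kind of merge.
keys = {how: sorted(frame["key"]) for how, frame in merged.items()}

# Unmatched rows in the outer merge carry NaN in the other table's column.
outer_has_nan = bool(merged["outer"].isnull().any().any())
```

With no on argument, pd.merge joins on the columns the two tables have in common, which here is just key.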

What does pd.concat([df1,df2]) do?

It stacks the two dfs on top of each other. This is most useful when the two dataframes are chunks of the same original df and therefore have the same columns.
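A small sketch of the stacking behaviour, using two invented halves of one table and assuming pandas is installed. It also shows the ignore_index option, since each half otherwise keeps its own row labels.

```python
import pandas as pd

# Two made-up chunks with identical columns.
first_half = pd.DataFrame({"id": [1, 2], "name": ["a", "b"]})
second_half = pd.DataFrame({"id": [3, 4], "name": ["c", "d"]})

# Plain concat keeps the original row labels, so 0 and 1 appear twice.
stacked = pd.concat([first_half, second_half])

# ignore_index=True renumbers the rows from 0.
renumbered = pd.concat([first_half, second_half], ignore_index=True)
```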

A veterinarian’s office stores all of their data on pets and their owners in two dataframes: pets and owners. The owners dataframe has the columns ‘id’, ‘first_name’, ‘last_name’ and ‘address’. The ‘pets’ dataframe has the columns id, name, owner_id, and type. If the office wanted to combine the two dataframes into one dataframe called pets_owners, what code could work?

pets_owners = pd.merge(pets, owners.rename(columns={'id': 'owner_id'}))

A veterinarian’s office is run by two vets, Greg and Susan, and stores each of their appointment data in separate DataFrames, called greg_appointments and susan_appointments respectively. These DataFrames have the same columns. If the vet office wanted to combine the two DataFrames into a single DataFrame called appointments_all, which of the following commands would they use?

appointments_all = pd.concat([greg_appointments, susan_appointments])

What is the correct syntax for performing an outer merge on two DataFrames: df_one and df_two?

merged_df = pd.merge(df_one, df_two, how='outer')

How would I select all the rows of a dataframe where a given column is null?

null_df = df[df.column1.isnull()]

Basic syntax of matplotlib

from matplotlib import pyplot as plt

x_values = [0, 1, 2, 3, 4]
y_values = [0, 1, 4, 9, 16]
plt.plot(x_values, y_values)
plt.show()

Specify a different color for a line in matplotlib

plt.plot(days, money_spent, color='green')
plt.plot(days, money_spent_2, color='#AAAAAA')

Which line of code will get the axes object of a plot and store it in a variable ax?

ax = plt.subplot()

Which line of code will create a figure with a height of 7 inches and a width of 6 inches?

plt.figure(figsize=(6,7))

What is the command to set a plot to display from x=-5 to x=5 and from y=0 to y=10?

plt.axis([-5,5,0,10])

What is the command to label the x-axis with the label 'Time'?

plt.xlabel('Time')

Which line of code will set the y-axis ticks to be at 0, 1, 2, 4, and 9?

ax.set_yticks([0, 1, 2, 4, 9])

Which line of code will set the x-axis labels to be ["Monday", "Tuesday", "Wednesday"]?

ax.set_xticklabels(['Monday', 'Tuesday', 'Wednesday'])

What is the command to set the color of a line to be 'green'?

plt.plot(x, y, color='green')

What is the command to set the linestyle of a line to be dashed?

plt.plot(x, y, linestyle='--')

What is the command to add a legend to a plot with the labels ['Cats', 'Dogs']?

plt.legend(['Cats', 'Dogs'])

What is the command to create a figure with 3 rows and 2 columns, and a subplot in the second row and the first column?

plt.subplot(3,2,3)

What is the command to set the horizontal spacing of subplots within a figure to 0.35?

plt.subplots_adjust(wspace=0.35)

What is the result of adding autopct='%d%%' to a plt.pie function call?

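The pie chart will label each slice with its percentage as a whole number. The %d format drops the decimal part rather than rounding, and %% prints a literal percent sign.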
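The format string can be tried out with plain Python string formatting, which is how a string autopct is applied to each slice's percentage as far as I know. The value 33.68 below is an arbitrary example.

```python
# %d keeps only the integer part; %% produces a literal percent sign.
label = "%d%%" % 33.68

# For comparison, a one-decimal format rounds instead of truncating.
one_decimal = "%0.1f%%" % 33.68
```

If rounded labels are wanted, a format such as '%0.1f%%' or '%0.0f%%' may be a better fit than '%d%%'.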

What does it mean to normalize a histogram?

Dividing the height of each bar by a constant so that the total area of the bars sums to 1. This preserves the shape of the distribution but lets you compare datasets of different sizes.
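The arithmetic behind normalization can be sketched in plain Python with equal-width bins. The data and bin edges are made up; with equal widths, the constant is the number of data points times the bin width.

```python
data = [1, 2, 2, 3, 3, 3, 4, 4, 5, 9]
bin_edges = [0, 2, 4, 6, 8, 10]  # five bins, each 2 wide
bin_width = 2

# Count how many values fall in each half-open bin [low, high).
counts = [0] * (len(bin_edges) - 1)
for value in data:
    for b in range(len(counts)):
        if bin_edges[b] <= value < bin_edges[b + 1]:
            counts[b] += 1
            break

# Divide every bar by the same constant: number of points * bin width.
constant = len(data) * bin_width
heights = [c / constant for c in counts]

# Total area of the normalized bars.
area = sum(h * bin_width for h in heights)
```

The ratios between bars are unchanged (the second bar is still five times the first), only the vertical scale differs.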

What is the command to stack a set of bars representing y2 on top of the set of bars representing y1?

plt.bar(range(len(y2)), y2, bottom=y1)

In the following function call, what does the list [0, 2, 4, 6, 8] represent?

plt.fill_between(range(5), [0, 2, 4, 6, 8], [4, 6, 8, 10, 12], alpha=0.2)

The y values of the lower bound of the shaded region

What is the command to set x-axis ticks to be "Carbohydrates", "Lipids", "Protein"?

ax.set_xticklabels(['Carbohydrates', 'Lipids', 'Protein'])

What is a KDE plot

Extract a column from a pandas df as a list

newlist = df.col.tolist()

counts = cuisine_counts.name.tolist()

Note that .values on its own returns a NumPy array rather than a plain Python list.

Basic Python Flashcards

(111 cards)