numpy Primer Flashcards

Question

How do you explicitly specify a datatype in numpy?

Answer 1

z = np.array([1, 2], dtype=np.int64)

Answer 2

Use astype. E.g.: print(x.astype(np.float64))

Answer 3

the two input arrays must have the same shape (I think after broadcasting, if applicable)

Answer 4

use np.dot to * compute inner products of vectors * multiply a vector by a matrix * multiply matrices dot is available both as a function in the numpy module and as an instance method of array objects. dot product of v and w print(v.dot(w)) equivalent function to compute dot product print(np.dot(v, w))

Answer 5

Use these equivalents in these situations: * if both a and b are 2-dimensional arrays, dot is equivalent to matmul or @ * if either a or b is scalar, dot is equivalent to \* (elementwise multiplication),

Answer 6

Numpy vectors are always treated as column vectors.

Answer 7

Numpy vectors are always treated as column vectors. Therefore, to perform operations that involve both row and column vectors, we cannot use the typical matrix multiplication operators, but instead need to call the appropriate Numpy function. For example, to compute the outer product 𝑤×𝑤𝑇 , which we expect to be a 2×2 matrix, we can use np.outer

Answer 8

x.sum(axis = 1)

Answer 9

x.sum(axis = 0)

Answer 10

A process where numpy scales up a smaller array to make operations with an array of a different shape possible. The simplest cases are making operations between a small array and large array possible.

Answer 11

* If a and b have different ranks, add one-element dimensions to a or b until they have the same ranks. For example, if a = [[1,2],[3,4]] (2-dimensional) and b = 10 (0-dimensional), we would turn b to 2-dimensional, i.e., [[10]]. * Now that a and b have the same ranks, iterate through each dimension i of a and b: * If the shapes of a and b in dimension i are the same, move on. * Else if the shape of b is 1 in dimension i, we'll copy index 0 of dimension i of b until its shape is the same as that of a. * Else if the shape of a is 1 in dimension i, we'll copy index 0 of dimension i of a until its shape is the same as that of b. * Else, raise "ValueError: operands could not be broadcast together"

Answer 12

Same as np.newaxis. This increases the rank of the array by 1

Answer 13

For outer products on vectors, an alternative to np.outer is broadcasting the 1D vectors to 2D matrices and multiplying them as usual. This has the advantage of working with not just outer products, but also any other binary operations.

Answer 14

numpy. tile(arr, reps) reps: The number of repetitions of A along each axis.

Answer 15

construct a new array by repeating an input array the number of times given by the reps arg

Answer 16

View: Shallow copy (i.e. shares same address in memory with the input) Copy: Deep copy (i.e. doesn't share memory with the input)

Answer 17

Use it wherever we can. It should be preferred whenever possible

Answer 18

functions that return a view typically are very fast because they do not need to allocate new memory.

Answer 19

**Knowing when a copy or a view is returned** is essential in understanding the behavior of your code. Otherwise, you may run into a situation where an array value changes even though you never touch it (but you modified a view of it), which can be difficult to debug.

Answer 20

contiguous one-dimensional segment of computer memory Similar to a C array

Answer 21

* Variables all have to be the same type (i.e. it's a homogeneous data structure)

Answer 22

Numpy arrays inherit many attributes of C arrays

Answer 23

* no error is thrown * You are not able to do anything significant with it beyond the functionalities of a standard Python list

Answer 24

* Will return a new array, instead of modifying the input in-place * Creating a new array in memory is time-consuming, so these operations **should not be used inside a loop.**

Answer 25

Matrix which contain mostly zero entries and only a few non-zero entries

Answer 26

An important advantage of Scipy's sparse matrix: It consumes a lot less memory while being functionally similar to standard Numpy matrices.

Answer 27

Use the coo\_matrix method. 1. Construct 3 lists: value, row index and column index 2. Pass the lists and the shape to the method

Answer 28

Convert a dense matrix to sparse

Answer 29

* Should be converted to csr\_matrix (compressed sparse row) or csc\_matrix (compressed sparse column) * Because coo\_matrix is slow in row and column access * Returns: 2D sparse matrices, not 1D vectors like what Numpy would return

Answer 30

* Standard mathematical transformations (e.g., power, sqrt, sum), as well as matrix operations (dot, multiply, transpose) are available. * They're usually faster

Answer 31

* always use the scipy.sparse version * Why: Sometimes the numpy version will convert the sparse matrix input to dense matrix

Answer 32

Minimize the amount of times you need to multiple two matrices together. Instead, do as many matrix\*vector operations as possible. Example: * two matrices A,B and a vector x. * (AB)x=A(Bx), but which is faster? * right side is faster because it's 2 matrix\*vector operations instead of 1 matrix\*matrix and 1 matrix\*vector operation

Answer 33

X @ X.T or X.T @ X

Answer 34

np.outer(u,v)

Answer 35

X + v[:,None]

Answer 36

* When either: * The underlying data is not sparse * There are operations that break sparsity (e.g. result of operation is not sparse) * Why: * Takes up more space when data is not sparse

Answer 37

matrix multiplication

Answer 38

rank = number of opening brackets at the beginning

Answer 39

np. linspace numpy. linspace(start, stop, num\_samples\_to\_generate)

Answer 40

np.arange(4)

Answer 41

numpy.arange(start, stop, step)

Answer 42

* empty\_like * ones\_like * zeros\_like * full\_like

Answer 43

np.full\_like(arr, 4)

Answer 44

np.full((3,4), 6)

Answer 45

arr[[indices of dimension 1], [indices of dimension 1]] E.g. this returns a 1-dimensional array of shape (4,): arr[[0, 1, 2, 3], [3, 2, 1, 0]]

Answer 46

matmul or @

Answer 47

* No error will be thrown * The default data type will be object

Answer 48

Data access is much slower in sparse matrix than in Numpy matrix:

Answer 49

df[index\_some\_row\_or\_col] = some\_vectorizable\_function(df[some\_row\_or\_col], df[another\_r\_or\_c])

numpy Primer Flashcards

(85 cards)