Python- Libraries-Streamlit Flashcards

(26 cards)

1
Q

Define Python.

A

A high-level, interpreted programming language known for its readability and versatility.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Pandas used for?

A

Pandas is a library for data manipulation and analysis, providing data structures like DataFrames.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

True or false: NumPy is primarily used for numerical computing.

A

TRUE

NumPy provides support for large, multi-dimensional arrays and matrices.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Fill in the blank: Matplotlib is used for _______ in Python.

A

data visualization

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does SciPy extend?

A

SciPy extends NumPy by adding a collection of mathematical algorithms and convenience functions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define DataFrame.

A

A two-dimensional, size-mutable, potentially heterogeneous tabular data structure in Pandas.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the purpose of Jupyter Notebooks?

A

Jupyter Notebooks allow interactive computing and data visualization in a web-based format.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

True or false: Seaborn is built on top of Matplotlib.

A

TRUE

Seaborn provides a high-level interface for drawing attractive statistical graphics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is data wrangling?

A

Data wrangling is the process of cleaning and transforming raw data into a usable format.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Fill in the blank: Scikit-learn is a library for _______.

A

machine learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define data visualization.

A

The graphical representation of information and data to communicate insights clearly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the main use of statsmodels?

A

Statsmodels is used for estimating and testing statistical models in Python.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

True or false: PySpark is used for big data processing.

A

TRUE

PySpark is the Python API for Apache Spark, enabling large-scale data processing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does data cleaning involve?

A

Data cleaning involves correcting or removing inaccurate records from a dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Fill in the blank: Plotly is used for _______ in Python.

A

interactive plotting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Define data analysis.

A

Data analysis is the process of inspecting, cleansing, and modeling data to discover useful information.

17
Q

What is the purpose of data exploration?

A

Data exploration is the initial step in data analysis to summarize main characteristics.

18
Q

True or false: TensorFlow is primarily used for data visualization.

A

FALSE

TensorFlow is primarily used for machine learning and deep learning applications.

19
Q

What is the role of data types in Pandas?

A

Data types define the kind of data stored in a DataFrame’s columns, affecting operations.

20
Q

Fill in the blank: DataFrames can be created from _______.

A

lists, dictionaries, or external files

21
Q

Define time series analysis.

A

Time series analysis involves statistical techniques to analyze time-ordered data points.

22
Q

What is the function of groupby in Pandas?

A

The groupby function is used to split the data into groups based on some criteria.

23
Q

True or false: DataFrames can only hold numeric data.

A

FALSE

DataFrames can hold various data types, including numeric, string, and datetime.

24
Q

What does data aggregation mean?

A

Data aggregation is the process of combining data from multiple sources to summarize it.

25
Fill in the blank: **Feature engineering** involves _______ from raw data.
creating new features
26
Define **machine learning**.
Machine learning is a subset of AI that enables systems to learn from data and improve.