Miscellaneous Flashcards Preview

Flashcards in Miscellaneous Deck (15):
1

What is the c( ) function short for?

Combine

2

What does the c( ) (combine) function do?

It creates a new vector by combining the values passed to it
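
A minimal sketch of c( ) in use. The variable names (scores, more_scores) and the values are made up for illustration.

```r
# Combine individual values into one numeric vector
scores <- c(90, 85, 77)

# c() also joins an existing vector with further values, end to end
more_scores <- c(scores, 60, 100)

# Number of elements in the combined vector
length(more_scores)
```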

3

What can a vector's values be?

Numbers, strings, or any other type, as long as they are all the same type

4

What does R do if you mix different types?

R will convert (coerce) all values to one common type, the most flexible type present; if any value is a string, everything becomes characters
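
The conversion can be checked directly with class( ). This is a small sketch with made-up values; the variable names are illustrative only.

```r
# Mixing numbers and a string: every element becomes a character string
nums_and_text <- c(1, "two", 3)
class(nums_and_text)    # "character"

# Mixing a logical and a number: TRUE is converted to the number 1
logical_and_num <- c(TRUE, 2.5)
class(logical_and_num)  # "numeric"
```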

5

What does the plot(X, Y) function produce?

A scatterplot of Y vs X
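
A short sketch of a scatterplot call, assuming two made-up numeric vectors of equal length. The axis labels and title are optional arguments added here for readability.

```r
# Made-up data: x on the horizontal axis, y on the vertical axis
x <- 1:10
y <- x^2

# Draw one point per (x, y) pair
plot(x, y, xlab = "x", ylab = "y", main = "y vs x")
```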

6

What does the hist(X) function produce?

A histogram of X
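
A minimal sketch of hist( ) on a made-up numeric vector. hist( ) bins a single vector; its second positional argument is breaks (how to bin), not a second variable.

```r
# Made-up values to bin
values <- c(2, 3, 3, 4, 4, 4, 5, 5, 6)

# Draw the histogram and keep the returned object for inspection
h <- hist(values, main = "Histogram of values")

# Every value falls into exactly one bin, so the counts add up to the vector length
sum(h$counts) == length(values)
```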

7

What are the three steps to creating a word cloud?

-Create a term-document matrix
-Obtain the term frequencies from the matrix
-Pass the frequencies to the wordcloud() function
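
The three steps can be sketched as follows. This is an illustration under stated assumptions, not the course's own code: the documents are made up, the term-document matrix is built with base R rather than a text-mining package, and the final call assumes the wordcloud package is installed (it is skipped otherwise).

```r
# Made-up documents for illustration
docs <- c("big data needs big storage",
          "data analysis uses data")

# Step 1: term-document matrix (rows = terms, columns = documents)
tokens <- strsplit(tolower(docs), "\\s+")
terms  <- sort(unique(unlist(tokens)))
tdm <- vapply(tokens,
              function(tok) as.integer(table(factor(tok, levels = terms))),
              integer(length(terms)))
rownames(tdm) <- terms

# Step 2: term frequencies = row totals, most frequent first
freq <- sort(rowSums(tdm), decreasing = TRUE)

# Step 3: pass the frequencies to wordcloud(), only if the package is available
if (requireNamespace("wordcloud", quietly = TRUE)) {
  wordcloud::wordcloud(words = names(freq), freq = freq, min.freq = 1)
}
```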

8

What are collections of documents with similar terms or values?

Text clusters

9

What are latent (=hidden) dimensions that describe the conceptual content in a collection of documents?

Text topics

10

What is a collection of documents?

Corpus

11

True or False: A document could address multiple topics, or no specific topic (from a certain topic collection)

True

12

True or False: Each document is assigned to exactly one text cluster.

True

13

What typically determines document assignment to exactly one cluster?

Text clustering
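
A small sketch of the "exactly one cluster" idea, assuming a made-up document-term count matrix. Hierarchical clustering (hclust with cutree) is used here only as one possible clustering method; the course may use a different one.

```r
# Made-up document-term counts: rows = documents, columns = terms
dtm <- matrix(c(3, 2, 0, 0,
                2, 3, 0, 1,
                0, 0, 4, 2,
                0, 1, 3, 3),
              nrow = 4, byrow = TRUE,
              dimnames = list(paste0("doc", 1:4),
                              c("data", "model", "goal", "team")))

# Cluster documents by the Euclidean distance between their term counts
tree <- hclust(dist(dtm))

# Cut the tree into two clusters: each document gets exactly one label
clusters <- cutree(tree, k = 2)
clusters
```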

14

What identifies topics addressed by documents?

Text topic

15

What does a chi-square test examine?

Whether the distribution of counts across the columns is the same for every row (that is, whether the row and column variables are independent)
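
A minimal sketch using chisq.test( ) on a made-up 2 x 2 table of counts. The group and answer labels are hypothetical. For a 2 x 2 table, chisq.test( ) applies a continuity correction by default.

```r
# Made-up counts: rows = groups, columns = answers
counts <- matrix(c(30, 10,
                   20, 20),
                 nrow = 2, byrow = TRUE,
                 dimnames = list(group = c("A", "B"),
                                 answer = c("yes", "no")))

# Test whether the yes/no distribution is the same in both groups
result <- chisq.test(counts)

result$expected  # counts expected if rows and columns were independent
result$p.value   # a small p-value suggests the row distributions differ
```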