CDE Flashcards
(22 cards)
– A technology that can collect data from all devices and systems
COLLECTION
A technology that can store and process collected largescale data using a distributed processing system.
STORAGE/PROCESSING
A method of analysis that can assist companies and the public with using
big data in business and daily life.
ANALYSIS
A technology that can visualize analyzed results effectively
VISUALIZATION
Collects data using the SQL function of the DBMS
Collection using the DBMS
– Collects data when a certain condition is met
Collection using sensors
– Collects data using port that can transfer files.
FTP Collection
– Collects data by reading HTML tags.
HTTP Collection
A file system that allows access to files on multiple
host computers which are shared over a computer network
Distributed File System (DFS)
A new type of data storage/retrieval system that uses a
less restrictive consistency model (BASE characteristics) than the traditional
relational database.
NoSQL (Not Only SQL)
A technology that processes a large amount of
data in a distributed parallel computing environment.
Distributed parallel processing
a programming model designed for the parallel distributed
processing of big data using inexpensive machines. It performs batch-based
processing and can handle large-scale data conveniently.
Map Reduce
provides insights by effectively transferring numbers, statistics, and valuable meanings,
by classifying data for the user’s easy understanding, and by analyzing large-scale data.
VISUALIZATION TECHNOLOGY
shows the passage of time,
continuous and segmented.
Time Visualization (How does it change over time?) –
show the relationship between the
whole and part
Distribution Visualization (How is it spread?)
shows the relationship
between two or more variables.
Relationship Visualization (Are they connected?)
– shows spaces and shadows
intuitively (heatmap, stars)
Comparison Visualization (Which is better/larger?)
shows information by mapping it
on the map including POI data.
Spatial Visualization (Where is it happening?)
– primary purpose is to find patterns that describe the given
data.
Descriptive Modeling
model is created based on the given data, and is used to
predict new input data
Predictive Modeling
When the target is determined
Supervised Data –
When there is no target. The correlation or similarity
between data is analyzed with the focus on input variables.
Unsupervised Data