FG Flashcards
(17 cards)
Predicts binary outcomes (Yes/No)
ex. Will a customer buy or not?
Logistic Regression
Makes decisions using if-then logic
ex. Loan approval or medical diagnosis
Decision Tree analysis
– Uses brain-inspired networks to detect complex patterns
ex. Face recognition or self-driving cars
. Neural Analysis Network
Extracts meaning from large text data. Core technologies incude:
document summarization, document classification, document clustering, and feature
extraction.
Text Mining
Analyzes relationships within a network.
ex. Finding influencers, mapping disease spread
SNA (Social Network Analysis)
Analyzes text for sentiment or opinion
Opinion Mining
– Allows machines to understand and process
human language
Natural Language Processing (NLP)
an expert who can collect, organize, investigate, analyze, and visualize data.
DATA SCIENTIST
Understanding the business of the company concerned and expressing
it as a business model.
Business
Exploring and integrating the internal/external data of a
corporation, and manipulating structured/unstructured data.
Data Management
Predictive analysis based on data mining/statistics, and analysis
based on cognitive psychology, R, and visualization techniques
Data Analysis
Establishment of the data strategy, communication skills
Change Management
Experience and training are needed to
understand statistical analysis tools.
Understanding statistical analysis tools
Overall knowledge and experience in programming are
needed. Understanding of various languages such as C, Java, Ruby and Perl.
Programming Language –
Ability to design keys, indexes, queries, normalization, and
constraints based on SQL
RDBMS Technology
Hadoop (MapReduce, HDFS), NoSQL (Cassandra,
BigTable, MongoDB)
Distributed Computing
Understanding of matrix operation and numerical
analysis.
Mathematical Knowledge