Proctor Exam 2- Secondary Data Syndicated and Big Data Flashcards

Question

Data Governance | Data is secure & accurate

Answer 1

Data Governance | Data is secure & accurate

Answer 2

Data Provenance | Data from reputable sources & tracked through all potential uses

Answer 3

Machine Learning is also being used to understand the return on investment of marketing itself ( MROI ), that is, measuring how much money is generated by investing in marketing

Answer 4

The datasets that track marketing spending are large, have more variables than cases, and contain relationships that are often non-linear mixed with responses that are well behaved and straightforward.

Answer 5

Prediction and understanding are related but independent goals of market research; It is simply a decision of the business regarding the problem at hand whether one goal might be favored over the other.

Answer 6

striking the right balance between prediction and understanding is still required

Answer 7

The complexity of big data and the algorithms that read it have increased the frequency with which predictive models take precedence as more and more marketing occurs in digital environments where automation is possible and desirable.

Answer 8

data that is collected for any purpose subsequently used in research other than to meet the needs of your particular study is called “secondary” data.

Answer 9

secondary data in some detail: any purpose other than to meet the needs of your particular study. non-specific research purposes, called “syndicated” or multi-client data. another purpose and subsequently used in research.

Answer 10

The second most important question in your research design, just behind the original purpose of the research, is “What is already known about your research goal?”

Answer 11

To reiterate and re-emphasize: All market research designs for every research project should begin with an assessment of what is already known about your research problem. The answer to that question almost always involves the search for and use of secondary data in its many forms, whether you are merely searching the Internet or buying needed data from a broker.

Answer 12

the advantage of each imagined example of secondary research is that it costs less time, money and effort.

Answer 13

No single organization outside of governments could fund entire population studies such as censuses or large -scale public health studies, for example

Answer 14

Secondary data may be older than you like or might be an annual estimate rather than the monthly trend information that would be best for your project; or the best information that you can nd may be at a regional level and you might need data about a city in the region

Answer 15

Two key elements to review to assess the potential for bias are the original purpose of the research or data and the methodology of the original research.

Answer 16

If possible, it is always a good idea to evaluate the original sources rather than a summary or an article describing the research

Answer 17

The best way to evaluate accuracy is to examine multiple sources of the same estimates

Answer 18

Finally, you should evaluate the credibility and reputation of the source of your data

Answer 19

One part of the syndicated data industry builds these pro les and makes them available to buyers.

Answer 20

The bene t from knowing more about your customer is pretty obvious, and there are many ways to know them. One part of the syndicated data industry builds these pro les and makes them available to buyers.

Answer 21

Psychographics pertain to people’s values, attitudes, lifestyles , and personalities. These deeper, more personal and less obviously observed criteria are helpful in understanding consumer motivation.

Answer 22

Demographics: Grouping people by age, gender, ethnicity, income, education, geographic or other physical and structural characteristics.

Answer 23

Behavioral: Grouping people by common behaviors, e.g., products purchased, websites or stores visited, shopping behaviors.

Answer 24

Psychographics: Grouping people by shared values, | principles, personality, interests, and lifestyle.

Answer 25

Market Share is one of the oldest and simplest forms of measuring performance and seems so basic that many do not realize that Arthur Nielsen, Sr. actually invented it as a tracking measure in his first Nielsen syndicated database in the 1930s

Answer 26

Simply tracking the sales of a product relative | to total category sales is still a simple and powerful measure of performance

Answer 27

A Development Index compares the sales of a product to either population (as sales per capita) or to total possible ACV in a market.

Answer 28

The Developmental Indices allow you to quickly spot performance gaps and detect any regional issues that might result from retail performance, sales personnel performance or even product preference

Answer 29

High Category Indices show reasonable demand for products similar to ours, but for some reason, we are not appealing to the market or there is a barrier to our sales that bears further investigation.

Answer 30

The BDI/CDI chart can help visualize where our new product might have high potential (High CDI) but face strong competition (High Competitive BDI). When over 50 markets are commonly available, this matrix helps organize and visualize the information.

Answer 31

Advanced analytic techniques can precisely estimate and predict the effects of the change for every item, product, brand, or category, but often simple observational techniques and arithmetic can provide much real information to inform price decisions

Answer 32

Every company generates and retains data as part of | conducting their business and much of it is useful in answering market research questions.

Answer 33

Most modern businesses compile information in customer databases, data warehouses, or enterprise decision support systems. Increasingly this information is augmented by external information obtained from sources we will discuss later so that massive amounts of information are available internally.

Answer 34

Modern computing power is enabling businesses to track and manage millions of customers, whether consumers or businesses, in a practice called Customer Relationship Management (CRM

Answer 35

Data mining is the application of usually automated analytic techniques to nd patterns in data that can be used to grow a business

Answer 36

two types of | external data: Syndicated Services and Big Data.

Answer 37

Technologies that promise to automate and discover new insights about customers and consumers are transforming the market research industry as well. Technology and the Internet have now automated many market research processes that replace what was considerable human effort. Data coding, text analysis, sample selection, and questionnaire management and creation are all examples Automated facial recognition algorithms can now detect emotional response to advertising real time. Advanced data fusion techniques interconnect and link attitudes and opinions from thousands of surveys that may share only the minimum of actual questionnaire content

Answer 38

So it makes sense that primary and secondary research often work together in the same project

Answer 39

The use of multiple sources validates your data and investigation by cross verifying the same ideas and information. This process is sometimes known as data triangulation , taken from the navigation process of locating an unknown point in space through geometric relationships among other known points. You can validate by triangulating data sources, research methods, or even theories

Answer 40

Data triangulation is simply using evidence from many di erent types of data sources, such as interviews, documents, public records, social media conversations, or observations.

Answer 41

``` Theory triangulation is a bit more complicated because the theories are helping you understand your data better rather than the sorts of integration required in data and methodology triangulation. Here you are applying different theories to the data to help make sense of it. One theory might support your data and another might undermine it. ```

Answer 42

It is important to remember when working with secondary data that it often contains personal data; that is, data that can be used either directly or indirectly (e.g., by combining it with other data) about a specific individual.

Answer 43

Two key responsibilities that apply to secondary data: 1. Researchers must ensure that personal data used in research is thoroughly protected from unauthorized access and never disclosed without the consent of the data subject. 2. Researchers must always behave ethically and not do anything that might cause harm to a data subject or damage the reputation of market, opinion, and social research.

Answer 44

This data is collected or purchased by research companies and then the curated data is sold to multiple buyers to help track or interpret their businesses. The companies collect the data once, but sell it many times to multiple buyers as a subscription. The research companies also leverage the data for research products and consulting services that may be customized for their customers.

Answer 45

Many research companies syndicate what are called “ omnibus surveys ”, where questions on many topics are conducted during the same interview

Answer 46

By keeping track of unique but anonymous identi ers and using advanced data fusion techniques, CivicScience builds a massive database of opinions, preferences, attitudes, and demographic information that it provides to its customers by subscription.

Answer 47

Household Panel Sales Data The types of syndicated sales tracking data discussed so far are very accurate estimates of sales and share in a market, but they hide some very important dynamics of purchasing necessary to understand trends, diagnose performance, and grow products with the right marketing plans

Answer 48

Market Measurement What the business does not know, however, is how much its competitors are selling, and the need for that knowledge drives the largest scale type of syndicated data: sales tracking for consumer goods.

Answer 49

The markets for consumer package goods (CPG) known as Fast Moving Consumer Goods (FMCG) outside the United States, is particularly well suited for this kind of tracking. The distribution channels are well known, products are identi ed easily, and there are lots of CPG businesses that can pay for the information and support the costs of collecting the data.

Answer 50

The Universal Product Code or UPC, developed in the early 1970s to help automate inventory tracking, proved also to be an essential factor in automating collection of market information and almost instantly ampli ed our ability to understand and optimize price, promotion, and distribution.

Answer 51

The UPC scanner that allowed quick check-out and payment at the supermarket also automatically entered each transaction into an electronic database that grew with each transaction

Answer 52

By the late 1970s, a research company named Information Resources Incorporated (IRI) realized the potential for using UPC’s as the basis for automated and powerful data collection for market research.

Answer 53

The basic structure of all syndicated scanner databases is similar no matter who builds or sells it. Databases consist of five essential elements: Product, Markets, class of Trade, Time Periods and measures

Answer 54

The basic measure of audience size is the audience rating .

Answer 55

rating is de ned as the number of households with their radio/TV sets tuned to a particular station/channel or program for a speci ed length of time divided by the total number of households that have radio/TV.

Answer 56

The other fundamental measure is audience share , which is the number of TV sets in use tuned to a program or commercial.

Answer 57

Nielsen has had to expand its service to cover this now very fragmented audience and other research companies have stepped up to o er di erent versions of so-called “ cross-platform ” audience measurement.

Answer 58

atistically, when a sample cannot or does not measure something that is real, it is called sampling error . Sampling error happens when a sample is not representative of a population or is not large enough to accurately measure a population condition.

Answer 59

``` Although the datasets are large, they may not be representative, and therefore not projectable to wider situations The datasets build so fast they may accumulate incorrect information. Sometimes the underlying process behind the accumulation of data makes the data in the past less predictive of the future. A predictive process that works well enough once may not work in the future Sometimes the variables of big data are not the things that are closest to what we want to measure directly ```

Answer 60

We are going to focus on one type of this research Sentiment Analysis , as an example, because of its popularity and promise. We will also see how big data can supplement and amplify primary research, and at least speculate on the value of the sensor data from connected devices as a secondary research application.

Answer 61

Sentiment Analysis tries to analytically determine the attitude of a speaker or writer with respect to some topic. The attitude may be an opinion, or an intended or actual emotional state.

Answer 62

Computers read words through a process called Natural Language Processing (NLP) and they infer sentiment by a combination of NLP and machine learning. Both NLP and sentiment analysis are very complex computer processes to accomplish a task that seems obvious to humans, but the sheer scale of the data requires a machine. New algorithms are continuously improving our ability to accurately read this data in a field of computer science that is relatively young.

Answer 63

Three Sentiment Analysis Approaches Knowledge Based Statistically Based Hybrid

Answer 64

Knowledge Based Pre-classify text by categories based on the presence of unambiguous words such as happy, sad, angry, or bored. Assign scores to more ambiguous terms based on previous search Compare analysis text to pre-scored reference database

Answer 65

Statistically Based Use machine learning to train algorithms against known outcomes. For example, a machine can sift through reviews and learn which words in the reviews are associated with accompanying star ratings Compare analysis text to a reference database built from the statistical analysis

Answer 66

Hybrid Both methods combined, for example pre- coding unambiguous words and modeling more ambiguous terms against star ratings Compare analysis text to a reference database

Answer 67

Given a target subject, then, sentiment analysis can analyze an enormous amount of text and classify whether the text is generally positive, negative, or neutral about the subject. The best techniques are pretty good at this task relative to human classi cation. Many studies have shown that there is generally about 80% agreement between human readers in classifying the same text. The best machines analysis achieves about 70%. That level is pretty good compared to 80%.

Answer 68

Machines currently are not as accurate when assessing finer degrees of positivity or negativity, as in the difference between a 4- and a 5-star rating in a review.

Answer 69

Probably the biggest limitation of sentiment analysis is that all social listening data, no matter how much of it exists, is essentially qualitative. Most comments are by de nition non-representative. People passionate enough to comment generate these comments, and they are people with access to social media. The large number of comments does not change this inherent fact.

Answer 70

Internet of Things analysis is not yet a common form of data | for marketing organizations. It is being tested and we are all learning.

Proctor Exam 2- Secondary Data Syndicated and Big Data Flashcards

(94 cards)