Module 4 (Lecture 4, Articles) Flashcards
(37 cards)
What are some limitations of scanner data?
- Small frame, small shops may not be considered.
- Cannot make causal statements right away.
- Don’t know behaviours and psychographics.
- Don’t know exact set of choices, prices, and the time of decision.
What can brands perception data tell us and what are some limitations?
Several market research companies track the perception of firms and brands. This includes variables such as attitudes towards towards the firm, customer satisfaction, reputations. This indicates how strong a brand is in the hearts and minds of consumers.
Limitations are that it is not reflecting actual purchase behaviour. Also response bias, sampling bias.
What is response bias?
When participants give inaccurate or dishonest answers to self-report questions.
What is sampling bias?
When the selected sample is not representative of the target population.
What are three essential steps in statistical data editing?
- Error localisation: determine which values are wrong.
- Correction: correct missing and wrong data in best possible way.
- Consistency: make sure everything is consistent without conflicts.
Interviewer error (why data editing is important)
Interviewers may not be giving the respondents the correct instructions.
Omissions (why data editing is important)
Respondents often fail to answer a single question or a section of the questionnaire.
Ambiguity (why data editing is important)
A response might not be readable or it might be unclear.
Lack of cooperation (why data editing is important)
In a long questionnaire, a respondent might rebel and check the same response.
Ineligible respondent (why data editing is important)
An inappropriate respondent may be included in sample, e.g. underage.
What is data coding?
You specify how the information should be categorised to facilitate the analysis.
What is data matching?
Combining data from multiple sources that refer to the same entities. So you identify, match, and merge records.
What is data imputation?
The process of estimating missing data and filling in these values.
What is data adjusting?
Process to enhance the quality of the data for the data analysis (e.g. weighting, variable respectification, scale transformation).
What is weighting? (In data adjusting)
Adjusting the influence of certain observations so that the sample better reflects the population.
What is variable respectification? (In data adjusting)
The process of restructuring existing variables. Sometimes variables are too detailed, or you have too many similar categories.
What is scale transformation? (In data adjusting)
Process of adjusting the scale to make sure it is comparable with other scales. E.g. Some people consistently use the low end, and others the high-end, even if they feel the same.
Entity extraction (a task to prepare text data)
Which words people write about?
Topic modelling (a task to prepare text data)
What topics people write about?
Sentiment analysis (a task to prepare text data)
How positive/negative is the text?
What are the two different ways we can use text data?
- Language reflects = language is used to understand people, it reflects what they think, feel, and do.
- Language affects = language can influence outcomes, it changes how people think, feel, act.
Difference between mode, median, and mean?
Mode = most frequent value.
Median = value that lies in the middle of a frequency distribution.
Mean = average value (sum/number of numbers).
Which descriptive statistics can be used with nominal, ordinal, and interval/ratio data?
Nominal = mode.
Ordinal = median, mode.
Interval/ ratio = mean, median, mode.
Imagine a correlation of 0.55 between ad expenditures and sales.
Does this mean more ad expenditures result in higher sales?
No, correlation only shows if the variables move together, not why.
There may be:
1. Spurious correlation = a third factor/confounding variable causes both.
2. Reverse causality = ad expenditures are determined as a % of sales, so sales determine ad expenditures, not the other way!