targeting for computational market research

4
Proposal for Industry Presentation with Demo Targeting for Computational Market Research Frank Smadja Toluna Inc. MATAM, Haifa 31905, ISRAEL [email protected] ABSTRACT We introduce here the concept of “computational market research” by drawing on the analogy with computational advertising. We first explain how market research traditionally works and we describe the current state of the art in the field through examples. We then explain how, by introducing appropriate targeting and user modeling techniques, market research can be conducted at a larger scale, in a more efficient (for companies) and more enjoyable (for users) manner. We focus on the technical challenges of user modeling for the specific goals of computational market research, illustrating our purpose with actual examples from toluna.com. Toluna.com is one of the leading Web2.0 sites for polls, surveys and opinions, where users can voluntarily join market research panels. We show real life examples on a live demo during the workshop, and demonstrate novel features for polling and user qualification on the toluna.com site. We conclude by discussing future research directions for this emerging field. Categories and Subject Descriptors H.1.1 [Models and Principles]: User/Machine Systems General Terms Measurement, Economics, Experimentation, Human Factors. Keywords Computational Market Research, User Models, Market Research, Placement Ads, Computational Advertising. 1. INTRODUCTION 1.1 Market Research: Common Practices and Challenges Market research is a marketing tool that allows companies to receive feedback from their customers or consumers on a variety of topics; it is usually intended to gather market intelligence, to identify and analyze market needs for specific products in order to launch a new product or change an existing one. Traditionally, this feedback is obtained via surveys that target a specific segment of the population. For example, a dairy product manufacturer that wants to better understand people’s opinions on fruit yogurt would like answers to questions such as: “do people prefer yogurt with fruit at the bottom, at the top, or blended with the yogurt itself?, is there a significant difference in terms of demographic characteristics between people who prefer the fruit at the bottom, at the top or blended?” To obtain the answers to such questions the manufacturer will need to survey a number of consumers and ask them a series of questions on their yogurt eating habits and preferences. A critical part of a market research study is the selection of the specific users who will give this type of feedback. This selection process is usually conducted through a combination of specific demographic criteria (age, gender, place of residence, income level, etc.) and domain specific questions such as: do you eat yogurt regularly? do you ride a bike regularly? do you use aspirin on a daily basis?” etc. This process consists of a targeting step and a screening step. The targeting step is typically driven by basic demographic attributes, for example a cosmetic brand might decide to mostly address young mothers. The screening step is driven by specific domain questions, using the same cosmetic example, whether the young mother has a dry skin, or whether she has long hair, etc. The users being rejected at the screening process are referred to as screened out users or as screenouts for short. A user not being targeted will simply not get invited to participate in the survey. A user who successfully went through the selection process and completed the survey is referred to as a complete. Typically, the customer (the buyer) pays only for each complete and not for screen-outs, thus the burden of selecting a good target user group falls on the market research company (the seller) inviting the users to answer specific surveys. The cost per complete usually reflects the targeting requirements and the specificity of the screening process; however the pricing is usually done manually. This can be compared to a “guaranteed delivery” system for display ads as described in [1]. This user qualification process is similar to targeting in computational advertising and is critical to a successful market research study. Standard market research practice consists of simply including the screening process as part of the survey as a set of preliminary questions. It is relatively easy to store for each user the answers to demographic questions and reuse them in the future for other surveys. However, it is very hard to know or predict the answers to various “domain” questions (e.g., owners of two dogs, bike riders, Alpha Romeo owners, Xbox players, etc.) These change all the time and cover a huge range of domains. In addition, some completes are more valuable than others, for example, there is more demand for “IT managers” than for “middle age housewives”. Similarly, the screening process has an influence on Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. UMWA’2011, February 9, 2011, Hong-Kong, China. Copyright 2010 ACM 1-58113-000-0/00/0010…$10.00.

Upload: frank-smadja

Post on 25-Jun-2015

448 views

Category:

Documents


1 download

DESCRIPTION

Proposal for user modeling workshop: http://research.yahoo.com/workshops/umwa2011/We introduce here the concept of “computational market research” by drawing on the analogy with computational advertising. We first explain how market research traditionally works and we describe the current state of the art in the field through examples. We then explain how, by introducing appropriate targeting and user modeling techniques, market research can be conducted at a larger scale, in a more efficient (for companies) and more enjoyable (for users) manner. We focus on the technical challenges of user modeling for the specific goals of computational market research, illustrating our purpose with actual examples from toluna.com. Toluna.com is one of the leading Web2.0 sites for polls, surveys and opinions, where users can voluntarily join market research panels. We show real life examples on a live demo during the workshop, and demonstrate novel features for polling and user qualification on the toluna.com site. We conclude by discussing future research directions for this emerging field.

TRANSCRIPT

Page 1: Targeting for Computational Market Research

Proposal for Industry Presentation with Demo Targeting for Computational Market Research

Frank Smadja

Toluna Inc.

MATAM, Haifa 31905, ISRAEL

[email protected]

ABSTRACT We introduce here the concept of “computational market research” by drawing on the analogy with computational advertising. We first explain how market research traditionally works and we describe the current state of the art in the field through examples. We then explain how, by introducing appropriate targeting and user modeling techniques, market research can be conducted at a larger scale, in a more efficient (for companies) and more enjoyable (for users) manner. We focus on the technical challenges of user modeling for the specific goals of computational market research, illustrating our purpose with actual examples from toluna.com. Toluna.com is one of the leading Web2.0 sites for polls, surveys and opinions, where users can voluntarily join market research panels. We show real life examples on a live demo during the workshop, and demonstrate novel features for polling and user qualification on the toluna.com site. We conclude by discussing future research directions for this emerging field.

Categories and Subject Descriptors H.1.1 [Models and Principles]: User/Machine Systems

General Terms Measurement, Economics, Experimentation, Human Factors.

Keywords Computational Market Research, User Models, Market Research, Placement Ads, Computational Advertising.

1. INTRODUCTION 1.1 Market Research: Common Practices and Challenges Market research is a marketing tool that allows companies to receive feedback from their customers or consumers on a variety of topics; it is usually intended to gather market intelligence, to identify and analyze market needs for specific products in order to launch a new product or change an existing one. Traditionally, this feedback is obtained via surveys that target a specific segment of the population. For example, a dairy product manufacturer that

wants to better understand people’s opinions on fruit yogurt would like answers to questions such as: “do people prefer yogurt with fruit at the bottom, at the top, or blended with the yogurt itself?, is there a significant difference in terms of demographic characteristics between people who prefer the fruit at the bottom, at the top or blended?” To obtain the answers to such questions the manufacturer will need to survey a number of consumers and ask them a series of questions on their yogurt eating habits and preferences.

A critical part of a market research study is the selection of the specific users who will give this type of feedback. This selection process is usually conducted through a combination of specific demographic criteria (age, gender, place of residence, income level, etc.) and domain specific questions such as: “do you eat yogurt regularly? do you ride a bike regularly? do you use aspirin on a daily basis?” etc. This process consists of a targeting step and a screening step. The targeting step is typically driven by basic demographic attributes, for example a cosmetic brand might decide to mostly address young mothers. The screening step is driven by specific domain questions, using the same cosmetic example, whether the young mother has a dry skin, or whether she has long hair, etc. The users being rejected at the screening process are referred to as screened out users or as screenouts for short. A user not being targeted will simply not get invited to participate in the survey. A user who successfully went through the selection process and completed the survey is referred to as a complete.

Typically, the customer (the buyer) pays only for each complete and not for screen-outs, thus the burden of selecting a good target user group falls on the market research company (the seller) inviting the users to answer specific surveys. The cost per complete usually reflects the targeting requirements and the specificity of the screening process; however the pricing is usually done manually. This can be compared to a “guaranteed delivery” system for display ads as described in [1]. This user qualification process is similar to targeting in computational advertising and is critical to a successful market research study.

Standard market research practice consists of simply including the screening process as part of the survey as a set of preliminary questions. It is relatively easy to store for each user the answers to demographic questions and reuse them in the future for other surveys. However, it is very hard to know or predict the answers to various “domain” questions (e.g., owners of two dogs, bike riders, Alpha Romeo owners, Xbox players, etc.) These change all the time and cover a huge range of domains. In addition, some completes are more valuable than others, for example, there is more demand for “IT managers” than for “middle age housewives”. Similarly, the screening process has an influence on

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee.

UMWA’2011, February 9, 2011, Hong-Kong, China. Copyright 2010 ACM 1-58113-000-0/00/0010…$10.00.

Page 2: Targeting for Computational Market Research

pricing as it is easier for instance to identify 2,000 people with a driver’s license than 2,000 people who own two dogs. The typical overall funnel-like process is depicted in Figure 1.

Figure 1: The Typical Market Research Funnel

1.2 Computational Market Research: Getting to the Next Level In a similar way to computational advertising [1], computational market research can be defined as a set of techniques for finding the best match between a user in a given context and a suitable survey. There are several players with somehow conflicting interests involved in the system: The customer who wants people to answer a specific survey or questionnaire (the buyer), the user answering the survey and the market research company in charge of the execution and delivery of the survey (the seller). These players have different interests: The user wants to have a fun experience and is highly motivated by social and financial incentives, the customer would like a high ROI, and the seller is interested in revenues. As illustrated in Figure 2, the role of the survey router is to maximize the interests of the various players and find the right balance.

Figure 2: Computational Market Research Players

Maximizing revenue only would result in user fatigue and low customer satisfaction; similarly, maximizing only the user interest

and fun factor might not lead to timely completion of a survey and thus not serve the interest of the seller.

Computational market research offers a new way to address this matching problem by taking advantage of the three following observations:

• Economy of Scale: With the penetration of the Internet, it is possible now to reach millions of users and thus conduct more and more surveys on larger and larger populations for more and more precise results. Traditional market research with a few customers and dozens of surveys suddenly become obsolete.

• Probe users in context: As users are more connected than ever, it becomes easier to reach them at the right time, in the right context rather than interrupting them at home by annoying phone calls. A survey offers much resemblance with an online display ad and as such has more chance to lead to conversion when presented in the right context. A survey is answered by redirecting traffic to it and not by inviting users to answer it in an offline manner.

• Make market research accessible to all: Market research should be accessible to smaller companies and individuals, and thus more affordable. This means that the market should move towards a do-it-yourself approach, sell less service and increase automation. This again reminds the ads market

The basic requirement is thus to think of the problem algorithmically. For example, pricing, targeting and routing should be done automatically on live traffic in a very similar way that is done with display ads in the industry. These are no small challenges, as, for example, it is easy to price a specific demographic target (say young mothers) but hard for domain attributes (people with two dogs, people owning an Xbox, people who commute more than 2 hours a day, etc.). Is it more expensive to reach “people who ride a bike to work” or “people who own two dogs”? See [2] for pricing models for placement ads.

In Table 1 below we show the analogy between computational market research and computational advertisement on a number of parameters.

Table 1: Computational Advertisement vs Computational Market Research

Concept Advertisement Market Research

Tool Display ad Poll, Survey

Conversion Click through Complete

Pricing Guaranteed delivery

Fixed price

Irrelevant user Worthless to advertiser

Screen-out

Targeting Targeting Targeting

Live targeting Behavioral Screening

Page 3: Targeting for Computational Market Research

targeting

Rejection Users electing not to receive Ads1

Screen-out

User Context Browsing history, behavior, demographics

Demographics, Previous answers, surveys

Perception of Irrelevance

Spam, Noise Waste, frustration

Incentive for user Buy, find target Fun, social rewards & financial incentives

User experience duration

Few clicks Several minutes

2. USER MODELING FOR COMPUTATIONAL MARKET RESEARCH As discussed above, in order to apply a scalable approach to market research and truly turn the field into computational market research, we need an automated mechanism to gather demographic and domain attributes on a large population of users.

Assume for example, that we need to find 5,000 people who ride bikes and live in the area of Central London. If we already have 2,000 available users who already answered a biking poll in the past and told us that they regularly ride bikes, the task at hand is to identify the additional 3,000 users. Knowing that on average only 5% of the London population is actually riding bikes, sending traffic or email invites indiscriminately would lead to sending invitations to 60,000 users with a screen-out rate of 95%. The result is easy to imagine, the survey would cost a lot and the customer would not be happy. In addition, the users being screened out would be annoyed and rapidly get tired of answering even relevant surveys in the future. This is where user modeling comes into play. In order to automate market research, we need a user model that consists of a set of demographic and domain attributes. Such a user model is central to the automation of the targeting and screening stages. It would allow the market research company to price and route surveys properly and in a more efficient way than is currently done in traditional market research. Like with ads, a relevant survey can be appreciated by users, while an irrelevant one is seen as spam.

3. AN EXAMPLE: TOLUNA.COM Toluna.com is one of the most active social sites for online voting and opinions. It is a Web2.0 site completely geared towards polls, surveys and opinions of users. Toluna members can voice their opinions on any topic but they can also poll the community and get other users’ opinions. Toluna currently counts more than 4 million active users worldwide. In November 2010 alone, users voted 30 million times (e.g., a rate of 1 million votes a day), created 90,000 polls and topics and expressed about 700,000 full text opinions on a huge range of topics. Traffic is constantly growing, during that same month 180,000 new users registered to

1 Some search engines allow their users not to be exposed to Ads

relating to given market domains such as gaming, electronic etc.

Toluna.com. On average, the site sees over 50,000 paid-for surveys completes every day.

There are two main things that distinguishes Toluna.com to other social sites, first and foremost, Toluna.com is a social site geared towards polls, opinions and surveys. In addition, users can easily participate in professional surveys and polls that are present on the site. The professional polls are referred to as “sponsored polls” and the user usually receive points when answering them. With their points they can buy purchase vouchers (for example Amazon purchase cards) or even get cash. The financial incentive is essential to compensate users for their time, some surveys can take more than 15mn to answer and not all of them are interesting. As demonstrated by Raban in [3] in the community Answers domain, financial incentive is critical to attract initial users even if long-term engagement relies on social rewards. We therefore use a mix of financial and social incentives for both sponsored and organic polls as discussed in [4].

Figure 3 below illustrates a sponsored poll generated by our panel team in order to get answers on a generic topic. In this case, the parameter was the activity level of the users on social sites. The goal of the sponsored polls is mostly to enhance targeting capabilities.

Figure 3: A "sponsored" poll on toluna.com

Figure 4 below shows an organic poll generated by a user for no other purpose than social engagement.

Figure 4: An organic poll on Toluna.com

Figure 6 below shows an organic topic launched by a user with no incentive other than getting other people’s opinions. The topics are answered as open-end text answers.

This combination of organic and sponsored polls as well as social and financial incentives is what makes toluna.com unique. We advocate a computational market research approach by applying the following principles:

1. Gather users’ demographic and domain attributes about users through organic polls and thus build an ever growing user model

2. Leverage users’ model for automatic targeting and screening of sponsored polls.

Page 4: Targeting for Computational Market Research

Note that organic polls are in vast majority initiated by users in a natural manner (over 95% of all polls are organic) and are critical to successful users’ engagement on the site. Toluna editors can also initiate polls which are not paid for by any customer but can either increase engagement on hot topics or gather new attributes that are expected to be relevant to paying customers in the future. Such polls often trigger more polls, user-initiated this time and thus continue enriching the user model at low cost.

Figure 3: An organic opinion topic on toluna.com

Our primary effort at Toluna, consists of replacing the email invitation process by a “live” selection of traffic; as users on toluna.com answer polls and give their opinions, some of their responses automatically qualifies them and seamlessly transfer them to a sponsored survey. We are thus merging the targeting and screening processes into a single “qualification” process as shown in Figure 5 below.

Figure 5: The Computerized Market Research Funnel

Merging the screening and targeting steps makes a huge difference both for the user and for the customer order the survey. The first advantage is that we eliminate the static selection of users and email invitations and instead, we send qualified traffic

to a survey. As a direct consequence, we significantly reduce for users the frustration of being screened out and bring down the price per complete to an affordable level.

4. CONCLUSION AND FUTURE DIRECTIONS We have described here how market research can truly become “computational” by merging the screening and targeting stages and have explained how at toluna.com we used a mix of organic and sponsored polls, as well as social and financial incentives to build a scalable users’ base that supports this approach.

We believe that the qualification process however can still be improved, so as to reduce the need for editors to generate organic polls preemptively for expected domain of interests. Indeed, one of the key challenges, of computational market research, which also exists in display ads, is that we cannot predict ahead of time which types of domains and associated features, our customers will be interested in. For recurring features, we could consider training for instance a “biker classifier” or a “two-dog owner” classifier, but in the long run we need to be able to assemble atomic features on the fly so as to generate “on demand” the appropriate user models for a given survey.

Another venue of research is to use “users similarity” models, where we seed our system with qualified users obtained possibly via traditional “manual” methods and then identify similar users based on similar behavior towards polls, surveys and opinions on the site. We are considering traditional recommender systems technologies for this purpose.

We believe that computational market research is still in its infancy and has much to learn from the progress of computational advertising in the last few years.

5. REFERENCES [1] Andrei Broder and Vanja Josifovski. Introduction to

Computational Advertising, Yahoo! Research and Stanford University. http://www.stanford.edu/class/msande239/lectures-2010/lecture-07.pdf

[2] Arpita Ghosh, Preston McAfee, Kishore Papineni, and Sergei Vassilvitskii. Bidding for representative allocations for display advertising. CoRR, abs/0910-0880, 2009. http://arxiv.org/abs/0910.0880

[3] Daphne Raban. The Incentive Structure in an Online Information Market; Journal of the American Society for Information Science and Technology, 2008. http://gsb.haifa.ac.il/~draban/home/Raban_JASIST2008.pdf

[4] Frank Smadja, “Mixing Financial, Social and Fun Incentives for Social Voting”, Webcentives, 1st International workshop on Motivation and Incentives on the Web. Collocated with WWW09, Madrid, Spain. http://webcentives09.sti-innsbruck.at/proceedings-webcentives.pdf