Free Essay

Life

In:

Submitted By fifty
Words 6122
Pages 25
Opinion Mining Using Econometrics: A Case Study on Reputation Systems
Anindya Ghose Panagiotis G. Ipeirotis Arun Sundararajan

Department of Information, Operations, and Management Sciences Leonard N. Stern School of Business, New York University {aghose,panos,arun}@stern.nyu.edu

Abstract
Deriving the polarity and strength of opinions is an important research topic, attracting significant attention over the last few years. In this work, to measure the strength and polarity of an opinion, we consider the economic context in which the opinion is evaluated, instead of using human annotators or linguistic resources. We rely on the fact that text in on-line systems influences the behavior of humans and this effect can be observed using some easy-to-measure economic variables, such as revenues or product prices. By reversing the logic, we infer the semantic orientation and strength of an opinion by tracing the changes in the associated economic variable. In effect, we use econometrics to identify the “economic value of text” and assign a “dollar value” to each opinion phrase, measuring sentiment effectively and without the need for manual labeling. We argue that by interpreting opinions using econometrics, we have the first objective, quantifiable, and contextsensitive evaluation of opinions. We make the discussion concrete by presenting results on the reputation system of Amazon.com. We show that user feedback affects the pricing power of merchants and by measuring their pricing power we can infer the polarity and strength of the underlying feedback postings.

on Amazom.com post reviews about products they bought and users on eBay.com post feedback describing their experiences with sellers. The goal of opinion mining systems is to identify such pieces of the text that express opinions (Breck et al., 2007; K¨ nig and o Brill, 2006) and then measure the polarity and strength of the expressed opinions. While intuitively the task seems straightforward, there are multiple challenges involved. • What makes an opinion positive or negative? Is there an objective measure for this task? • How can we rank opinions according to their strength? Can we define an objective measure for ranking opinions? • How does the context change the polarity and strength of an opinion and how can we take the context into consideration? To evaluate the polarity and strength of opinions, most of the existing approaches rely either on training from human-annotated data (Hatzivassiloglou and McKeown, 1997), or use linguistic resources (Hu and Liu, 2004; Kim and Hovy, 2004) like WordNet, or rely on co-occurrence statistics (Turney, 2002) between words that are unambiguously positive (e.g., “excellent”) and unambiguously negative (e.g., “horrible”). Finally, other approaches rely on reviews with numeric ratings from websites (Pang and Lee, 2002; Dave et al., 2003; Pang and Lee, 2004; Cui et al., 2006) and train (semi-)supervised learning algorithms to classify reviews as positive or negative, or in more fine-grained scales (Pang and Lee, 2005; Wilson et al., 2006). Implicitly, the supervised learning techniques assume that numeric ratings fully encapsulate the sentiment of the review.

1 Introduction
A significant number of websites today allow users to post articles where they express opinions about products, firms, people, and so on. For example, users 416

Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 416–423, Prague, Czech Republic, June 2007. c 2007 Association for Computational Linguistics

In this paper, we take a different approach and instead consider the economic context in which an opinion is evaluated. We observe that the text in on-line systems influence the behavior of the readers. This effect can be measured by observing some easy-tomeasure economic variable, such as product prices. For instance, online merchants on eBay with “positive” feedback can sell products for higher prices than competitors with “negative” evaluations. Therefore, each of these (positive or negative) evaluations has a (positive or negative) effect on the prices that the merchant can charge. For example, everything else being equal, a seller with “speedy” delivery may be able to charge $10 more than a seller with “slow” delivery. Using this information, we can conclude that “speedy” is better than “slow” when applied to “delivery” and their difference is $10. Thus, we can infer the semantic orientation and the strength of an evaluation from the changes in the observed economic variable. Following this idea, we use techniques from econometrics to identify the “economic value of text” and assign a “dollar value” to each text snippet, measuring sentiment strength and polarity effectively and without the need for labeling or any other resource. We argue that by interpreting opinions within an econometric framework, we have the first objective and context-sensitive evaluation of opinions. For example, consider the comment “good packaging,” posted by a buyer to evaluate a merchant. This comment would have been considered unambiguously positive by the existing opinion mining systems. We observed, though, that within electronic markets, such as eBay, a posting that contains the words “good packaging” has actually negative effect on the power of a merchant to charge higher prices. This surprising effect reflects the nature of the comments in online marketplaces: buyers tend to use superlatives and highly enthusiastic language to praise a good merchant, and a lukewarm “good packaging” is interpreted as negative. By introducing the econometric interpretation of opinions we can effortlessly capture such challenging scenarios, something that is impossible to achieve with the existing approaches.

over 180 days. We show that textual feedback affects the power of merchants to charge higher prices than the competition, for the same product, and still make a sale. We then reverse the logic and determine the contribution of each comment in the pricing power of a merchant. Thus, we discover the polarity and strength of each evaluation without the need for human annotation or any other form of linguistic resource. The structure of the rest of the paper is as follows. Section 2 gives the basic background on reputation systems. Section 3 describes our methodology for constructing the data set that we use in our experiments. Section 4 shows how we combine established techniques from econometrics with text mining techniques to identify the strength and polarity of the posted feedback evaluations. Section 5 presents the experimental evaluations of our techniques. Finally, Section 6 discusses related work and Section 7 discusses further applications and concludes the paper.

2

Reputation Systems and Price Premiums

We focus our paper on reputation systems in electronic markets and we examine the effect of opinions on the pricing power of merchants in the marketplace of Amazon.com. (We discuss more applications in Section 7.) We demonstrate the value of our technique Definition 2.1 Consider a set of merchants s1 , . . . , sn using a dataset with 9,500 transactions that took place selling a product for prices p1 , . . . , pn . If si makes 417

When buyers purchase products in an electronic market, they assess and pay not only for the product they wish to purchase but for a set of fulfillment characteristics as well, e.g., packaging, delivery, and the extent to which the product description matches the actual product. Electronic markets rely on reputation systems to ensure the quality of these characteristics for each merchant, and the importance of such systems is widely recognized in the literature (Resnick et al., 2000; Dellarocas, 2003). Typically, merchants’ reputation in electronic markets is encoded by a “reputation profile” that includes: (a) the number of past transactions for the merchant, (b) a summary of numeric ratings from buyers who have completed transactions with the seller, and (c) a chronological list of textual feedback provided by these buyers. Studies of online reputation, thus far, base a merchant’s reputation on the numeric rating that characterizes the seller (e.g., average number of stars and number of completed transactions) (Melnik and Alm, 2002). The general conclusion of these studies show that merchants with higher (numeric) reputation can charge higher prices than the competition, for the same products, and still manage to make a sale. This price premium that the merchants can command over the competition is a measure of their reputation.

3

Data

Figure 1: A set of merchants on Amazon.com selling an identical product for different prices the sale for price pi , then si commands a price premium equal to pi − pj over sj and a relative price p −p premium equal to i pi j . Hence, a transaction that involves n competing merchants generates n − 1 price premiums.1 The average price premium for the transaction is premium is j=i (pi −pj )

and the n−1 j=i (pi −pj ) pi (n−1) . 2

average relative price

Example 2.1 Consider the case in Figure 1 where three merchants sell the same product for $631.95, $632.26, and $637.05, respectively. If GameHog sells the product, then the price premium against XP Passport is $4.79 (= $637.05 − $632.26) and against the merchant BuyPCsoft is $5.10. The relative price premium is 0.75% and 0.8%, respectively. Similarly, the average price premium for this transaction is $4.95 and the average relative price premium 0.78%. 2 Different sellers in these markets derive their reputation from different characteristics: some sellers have a reputation for fast delivery, while some others have a reputation of having the lowest price among their peers. Similarly, while some sellers are praised for their packaging in the feedback, others get good comments for selling high-quality goods but are criticized for being rather slow with shipping. Even though previous studies have established the positive correlation between higher (numeric) reputation and higher price premiums, they ignored completely the role of the textual feedback and, in turn, the multi-dimensional nature of reputation in electronic markets. We show that the textual feedback adds significant additional value to the numerical scores, and affects the pricing power of the merchants.
1 As an alternative definition we can ignore the negative price premiums. The experimental results are similar for both versions.

We compiled a data set using software resellers from publicly available information on software product listings at Amazon.com. Our data set includes 280 individual software titles. The sellers’ reputation matters when selling identical goods, and the price variation observed can be attributed primarily to variation in the merchant’s reputation. We collected the data using Amazon Web Services over a period of 180 days, between October 2004 and March 2005. We describe below the two categories of data that we collected. Transaction Data: The first part of our data set contains details of the transactions that took place on the marketplace of Amazon.com for each of the software titles. The Amazon Web Services associates a unique transaction ID for each unique product listed by a seller. This transaction ID enables us to distinguish between multiple or successive listings of identical products sold by the same merchant. Keeping with the methodology in prior research (Ghose et al., 2006), we crawl the Amazon’s XML listings every 8 hours and when a transaction ID associated with a particular listing is removed, we infer that the listed product was successfully sold in the prior 8 hour window.2 For each transaction that takes place, we keep the price at which the product was sold and the merchant’s reputation at the time of the transaction (more on this later). Additionally, for each of the competing listings for identical products, we keep the listed price along with the competitors reputation. Using the collected data, we compute the price premium variables for each transaction3 using Definition 2.1. Overall, our data set contains 1,078 merchants, 9,484 unique transactions and 107,922 price premiums (recall that each transaction generates multiple price premiums). Reputation Data: The second part of our data set contains the reputation history of each merchant that had a (monitored) product for sale during our 180-day window. Each of these merchants has a feedback profile, which consists of numerical scores and text-based feedback, posted by buyers. We had an average of 4,932 postings per merchant. The numerical ratings
Amazon indicates that their seller listings remain on the site indefinitely until they are sold and sellers can change the price of the product without altering the transaction ID. 3 Ideally, we would also include the tax and shipping cost charged by each merchant in the computation of the price premiums. Unfortunately, we could not capture these costs using our methodology. Assuming that the fees for shipping and tax are independent of the merchants’ reputation, our analysis is not affected.
2

418

are provided on a scale of one to five stars. These ratings are averaged to provide an overall score to the seller. Note that we collect all feedback (both numerical and textual) associated with a seller over the entire lifetime of the seller and we reconstruct each seller’s exact feedback profile at the time of each transaction.

merchant’s feedback as an n × p matrix M(si ) whose rows are the p encoded vectors of modifiers associated with the seller. We construct M(si ) as follows: 1. Retrieve the postings associated with a merchant. 2. Parse the postings to identify the dimensions across which the buyer evaluates a seller, keeping4 the nouns, noun phrases, verbs, and verbal phrases as reputation characteristics.5 . 3. Retrieve adjectives and adverbs that refer to6 dimensions (Step 2) and construct the φ vectors. We have implemented this algorithm on the feedback postings of each of our sellers. Our analysis yields 151 unique dimensions, and a total of 142 modifiers (note that the same modifier can be used to evaluate multiple dimensions). 4.2 Scoring the Dimensions of Reputation As discussed above, the textual feedback profile of merchant si is encoded as a n × p matrix M(si ); the elements of this matrix belong to the set of modifiers M. In our case, we are interested in computing the “score” a(µ, d, j) that a modifier µ ∈ M assigns to the dimension d, when it appears in the j-th posting. Since buyers tend to read only the first few pages of text-based feedback, we weight higher the influence of recent text postings. We model this by assuming that K is the number of postings that appear on each page (K = 25 on Amazon.com), and that c is the probability of clicking on the “Next” link and moving the next page of evaluations.7 This assigns a q j posting-specific weight rj = c K / p c K for q=1 the j th posting, where j is the rank of the posting, K is the number of postings per page, and p is the total number of postings for the given seller. Then, we set a(µ, d, j) = rj · a(µ, d) where a(µ, d) is the “global” score that modifier µ assigns to dimension d. Finally, since each reputation dimension has potentially a different weight, we use a weight vector w to
4 We eliminate all dimensions appearing in the profiles of less than 50 (out of 1078) merchants, since we cannot extract statistically meaningful results for such sparse dimensions 5 The technique as described in this paper, considers words like “shipping” and “ delivery” as separate dimensions, although they refer to the same “real-life” dimension. We can use Latent Dirichlet Allocation (Blei et al., 2003) to reduce the number of dimensions, but this is outside the scope of this paper. 6 To associate the adjectives and adverbs with the correct dimensions, we use the Collins HeadFinder capability of the Stanford NLP Parser. 7 We report only results for c = 0.5. We conducted experiments other values of c as well and the results are similar.

4 Econometrics-based Opinion Mining
In this section, we describe how we combine econometric techniques with NLP techniques to derive the semantic orientation and strength of the feedback evaluations. Section 4.1 describes how we structure the textual feedback and Section 4.2 shows how we use econometrics to estimate the polarity and strength of the evaluations. 4.1 Retrieving the Dimensions of Reputation We characterize a merchant using a vector of reputation dimensions X = (X1 , X2 , ..., Xn ), representing its ability on each of n dimensions. We assume that each of these n dimensions is expressed by a noun, noun phrase, verb, or a verb phrase chosen from the set of all feedback postings, and that a merchant is evaluated on these n dimensions. For example, dimension 1 might be “shipping”, dimension 2 might be “packaging” and so on. In our model, each of these dimensions is assigned a numerical score. Of course, when posting textual feedback, buyers do not assign explicit numeric scores to any dimension. Rather, they use modifiers (typically adjectives or adverbs) to evaluate the seller along each of these dimensions (we describe how we assign numeric scores to each modifier in Section 4.2). Once we have identified the set of all dimensions, we can then parse each of the feedback postings, associate a modifier with each dimension, and represent a feedback posting as an n-dimensional vector φ of modifiers. Example 4.1 Suppose dimension 1 is “delivery,” dimension 2 is “packaging,” and dimension 3 is “service.” The feedback posting “I was impressed by the speedy delivery! Great service!” is then encoded as φ1 = [speedy, NULL, great], while the posting “The item arrived in awful packaging, and the delivery was slow” is encoded as φ2 = [slow , awful , NULL]. 2 Let M = {N U LL, µ1 , ..., µM } be the set of modifiers and consider a seller si with p postings in its reputation profile. We denote with µi ∈ M the modifier jk that appears in the j-th posting and is used to assess the k-th reputation dimension. We then structure the 419

weight the contribution of each reputation dimension to the overall “reputation score” Π(si ) of seller si : Π(si ) = rT · A(M(si )) · w (1) where rT = [r1 , r2 , ...rp ] is the vector of the postingspecific weights and A(M(i)) is a matrix that contains as element the score a(µj , dk ) where M(si ) contains the modifier µj in the column of the dimension dk . If we model the buyers’ preferences as independently distributed along each dimension and each modifier score a(µ, dk ) also as an independent random variable, then the random variable Π(si ) is a sum of random variables. Specifically, we have:
M n

Interestingly, if we expand the Π(·) variables according to Equation 2, we can run the regression using the modifier-dimension pairs as independent variables, whose values are equal to the R(µj , dk ) values. After running the regression, the coefficients assigned to each modifier-dimension pair correspond to the value wk · a(µj , dk ) for each modifier-dimension pair. Therefore, we can easily estimate in economic terms the “value” of a particular modifier when used to evaluate a particular dimension.

5

Experimental Evaluation

Π(si ) = j=1 k=1

(wk · a(µj , dk )) R(µj , dk )

(2)

where R(µj , dk ) is equal to the sum of the ri weights across all postings in which the modifier µj modifies dimension dk . We can easily compute the R(µj , dk ) values by simply counting appearances and weighting each appearance using the definition of ri . The question is, of course, how to estimate the values of wk · a(µj , dk ), which determine the polarity and intensity of the modifier µj modifying the dimension dk . For this, we observe that the appearance of such modifier-dimension opinion phrases has an effect on the price premiums that a merchant can charge. Hence, there is a correlation between the reputation scores Π(·) of the merchants and the price premiums observed for each transaction. To discover the level of association, we use regression. Since we are dealing with panel data, we estimate ordinary-leastsquares (OLS) regression with fixed effects (Greene, 2002), where the dependent variable is the price premium variable, and the independent variables are the reputation scores Π(·) of the merchants, together with a few other control variables. Generally, we estimate models of the form: PricePremium ij = βc · Xcij + fij + ij +

βt1 · Π(merchant)ij + βt2 · Π(competitor )ij

(3)

where PricePremiumij is one of the variations of price premium as given in Definition 2.1 for a seller si and product j, βc , βt1 , and βt2 are the regressor coefficients, Xc are the control variables, Π(·) are the text reputation scores (see Equation 1), fij denotes the fixed effects and is the error term. In Section 5, we give the details about the control variables and the regression settings. 420

In this section, we first present the experimental settings (Section 5.1), and then we describe the results of our experimental evaluation (Section 5.2). 5.1 Regression Settings In Equation 3 we presented the general form of the regression for estimating the scores a(µj , dk ). Since we want to eliminate the effect of any other factors that may influence the price premiums, we also use a set of control variables. After all the control factors are taken into consideration, the modifier scores reflect the additional value of the text opinions. Specifically, we used as control variables the product’s price on Amazon, the average star rating of the merchant, the number of merchant’s past transactions, and the number of sellers for the product. First, we ran OLS regressions with product-seller fixed effects controlling for unobserved heterogeneity across sellers and products. These fixed effects control for average product quality and differences in seller characteristics. We run multiple variations of our model, using different versions of the “price premium” variable as listed in Definition 2.1. We also tested variations where we include as independent variable not the individual reputation scores but the difference Π(merchant)−Π(competitor ). All regressions yielded qualitatively similar results, so due to space restrictions we only report results for the regressions that include all the control variables and all the text variables; we report results using the price premium as the dependent variable. Our regressions in this setting contain 107,922 observations, and a total of 547 independent variables. 5.2 Experimental Results Recall of Extraction: The first step of our experimental evaluation is to examine whether the opinion extraction technique of Section 4.1 indeed captures all the reputation characteristics expressed in the feed-

Dimension Product Condition Price Package Overall Experience Delivery Speed Item Description Product Satisfaction Problem Response Customer Service Average

Human Recall 0.76 0.91 0.96 0.65 0.96 0.22 0.68 0.30 0.57 0.66

Computer Recall 0.76 0.61 0.66 0.55 0.92 0.43 0.58 0.37 0.50 0.60

Table 1: The recall of our technique compared to the recall of the human annotators back (recall) and whether the dimensions that we capture are accurate (precision). To examine the recall question, we used two human annotators. The annotators read a random sample of 1,000 feedback postings, and identified the reputation dimensions mentioned in the text. Then, they examined the extracted modifierdimension pairs for each posting and marked whether the modifier-dimension pairs captured the identified real reputation dimensions mentioned in the posting and which pairs were spurious, non-opinion phrases. Both annotators identified nine reputation dimensions (see Table 1). Since the annotators did not agree in all annotations, we computed the average human recall hRec d = agreed d for each dimension d, where all d agreed d is the number of postings for which both annotators identified the reputation dimension d, and all d is the number of postings in which at least one annotator identified the dimension d. Based on the annotations, we computed the recall of our algorithm against each annotator. We report the average recall for each dimension, together with the human recall in Table 1. The recall of our technique is only slightly inferior to the performance of humans, indicating that the technique of Section 4.1 extracts the majority of the posted evaluations.8 Interestingly, precision is not an issue in our setting. In our framework, if an particular modifier-dimension pair is just noise, then it is almost impossible to have a statistically significant correlation with the price premiums. The noisy opinion phrases are statistically guaranteed to be filtered out by the regression. Estimating Polarity and Strength: In Table 2,
In the case of “Item Description,” where the computer recall was higher than the human recall, our technique identified almost all the phrases of one annotator, but the other annotator had a more liberal interpretation of “Item Description” dimension and annotated significantly more postings with the dimension “Item Description” than the other annotator, thus decreasing the human recall.
8

we present the modifier-dimension pairs (positive and negative) that had the strongest “dollar value” and were statistically significant across all regressions. (Due to space issues, we cannot list the values for all pairs.) These values reflect changes in the merchants’s pricing power after taking their average numerical score and level of experience into account, and also highlight the additional the value contained in textbased reputation. The examples that we list here illustrate that our technique generates a natural ranking of the opinion phrases, inferring the strength of each modifier within the context in which this opinion is evaluated. This holds true even for misspelled evaluations that would break existing techniques based on annotation or on resources like WordNet. Furthermore, these values reflect the context in which the opinion is evaluated. For example, the pair good packaging has a dollar value of -$0.58. Even though this seems counterintuitive, it actually reflects the nature of an online marketplace where most of the positive evaluations contain superlatives, and a mere “good” is actually interpreted by the buyers as a lukewarm, slightly negative evaluation. Existing techniques cannot capture such phenomena. Price Premiums vs. Ratings: One of the natural comparisons is to examine whether we could reach similar results by just using the average star rating associated with each feedback posting to infer the score of each opinion phrase. The underlying assumption behind using the ratings is that the review is perfectly summarized by the star rating, and hence the text plays mainly an explanatory role and carries no extra information, given the star rating. For this, we examined the R2 fit of the regression, with and without the use of the text variables. Without the use of text variables, the R2 was 0.35, while when using only the text-based regressors, the R2 fit increased to 0.63. This result clearly indicates that the actual text contains significantly more information than the ratings. We also experimented with predicting which merchant will make a sale, if they simultaneously sell the same product, based on their listed prices and on their numeric and text reputation. Our C4.5 classifier (Quinlan, 1992) takes a pair of merchants and decides which of the two will make a sale. We used as training set the transactions that took place in the first four months and as test set the transactions in the last two months of our data set. Table 3 summarizes the results for different sets of features used. The 55%

421

Modifier Dimension [wonderful experience] [outstanding seller] [excellant service] [lightning delivery] [highly recommended] [best seller] [perfectly packaged] [excellent condition] [excellent purchase] [excellent seller] [excellent communication] [perfect item] [terrific condition] [top quality] [awesome service] [A+++ seller] [great merchant] [friendly service] [easy service] [never received] [defective product] [horible experience] [never sent] [never recieved] [bad experience] [cancelled order] [never responded] [wrong product] [not as advertised] [poor packaging] [late shipping] [wrong item] [not yet received] [still waiting] [wrong address] [never buy]

Dollar Value $5.86 $5.76 $5.27 $4.84 $4.15 $3.80 $3.74 $3.53 $3.22 $2.70 $2.38 $1.92 $1.87 $1.67 $1.05 $1.03 $0.93 $0.81 $0.78 -$7.56 -$6.82 -$6.79 -$6.69 -$5.29 -$5.26 -$5.01 -$4.87 -$4.39 -$3.93 -$2.92 -$2.89 -$2.50 -$2.35 -$2.25 -$1.54 -$1.48

Features Price Price + Numeric Reputation Price + Numeric Reputation + Text Reputation Price + Text Reputation

Accuracy on Test Set 55% 74% 89% 87%

Table 3: Predicting the merchant who makes the sale. any NLP techniques. The technique of Section 4.1 is based on existing research in sentiment analysis. For instance, (Hatzivassiloglou and McKeown, 1997; Nigam and Hurst, 2004) use annotated data to create a supervised learning technique to identify the semantic orientation of adjectives. We follow the approach by Turney (2002), who note that the semantic orientation of an adjective depends on the noun that it modifies and suggest using adjective-noun or adverb-verb pairs to extract semantic orientation. However, we do not rely on linguistic resources (Kamps and Marx, 2002) or on search engines (Turney and Littman, 2003) to determine the semantic orientation, but rather rely on econometrics for this task. Hu and Liu (2004), whose study is the closest to our work, use WordNet to compute the semantic orientation of product evaluations and try to summarize user reviews by extracting the positive and negative evaluations of the different product features. Similarly, Snyder and Barzilay (2007) decompose an opinion across several dimensions and capture the sentiment across each dimension. Other work in this area includes (Lee, 2004; Popescu and Etzioni, 2005) which uses text mining in the context product reviews, but none uses the economic context to evaluate the opinions.

Table 2: The highest scoring opinion phrases, as determined by the product wk · a(µj , dk ). accuracy when using only prices as features indicates that customers rarely choose a product based solely on price. Rather, as indicated by the 74% accuracy, they also consider the reputation of the merchants. However, the real value of the postings relies on the text and not on the numeric ratings: the accuracy is 87%89% when using the textual reputation variables. In fact, text subsumes the numeric variables but not vice versa, as indicated by the results in Table 3.

7

Conclusion and Further Applications

6 Related Work
To the best of our knowledge, our work is the first to use economics for measuring the effect of opinions and deriving their polarity and strength in an econometric manner. A few papers in the past tried to combine text analysis with economics (Das and Chen, 2006; Lewitt and Syverson, 2005), but the text analysis was limited to token counting and did not use 422

We demonstrated the value of using econometrics for extracting a quantitative interpretation of opinions. Our technique, additionally, takes into consideration the context within which these opinions are evaluated. Our experimental results show that our techniques can capture the pragmatic meaning of the expressed opinions using simple economic variables as a form of training data. The source code with our implementation together with the data set used in this paper are available from http://economining.stern.nyu.edu. There are many other applications beyond reputation systems. For example, using sales rank data from Amazon.com, we can examine the effect of product reviews on product sales and detect the weight that

customers put on different product features; furthermore, we can discover how customer evaluations on individual product features affect product sales and extract the pragmatic meaning of these evaluations. Another application is the analysis of the effect of news stories on stock prices: we can examine what news topics are important for the stock market and see how the views of different opinion holders and the wording that they use can cause the market to move up or down. In a slightly different twist, we can analyze news stories and blogs in conjunction with results from prediction markets and extract the pragmatic effect of news and blogs on elections or other political events. Another research direction is to examine the effect of summarizing product descriptions on product sales: short descriptions reduce the cognitive load of consumers but increase their uncertainty about the underlying product characteristics; a longer description has the opposite effect. The optimum description length is the one that balances both effects and maximizes product sales. Similar approaches can improve the state of art in both economics and computational linguistics. In economics and in social sciences in general, most researchers handle textual data manually or with simplistic token counting techniques; in the worst case they ignore text data altogether. In computational linguistics, researchers often rely on human annotators to generate training data, a laborious and errorprone task. We believe that cross-fertilization of ideas between the fields of computational linguistics and econometrics can be beneficial for both fields.

References

Acknowledgments
The authors would like to thank Elena Filatova for the useful discussions and the pointers to related literature. We also thank Sanjeev Dewan, Alok Gupta, Bin Gu, and seminar participants at Carnegie Mellon University, Columbia University, Microsoft Research, New York University, Polytechnic University, and University of Florida for their comments and feedback. We thank Rhong Zheng for assistance in data collection. This work was partially supported by a Microsoft Live Labs Search Award, a Microsoft Virtual Earth Award, and by NSF grants IIS-0643847 and IIS-0643846. Any opinions, findings, and conclusions expressed in this material are those of the authors and do not necessarily reflect the views of the Microsoft Corporation or of the National Science Foundation.
423

D.M. Blei, A.Y. Ng, and M.I. Jordan. 2003. Latent Dirichlet allocation. JMLR, 3:993–1022. E. Breck, Y. Choi, and C. Cardie. 2007. Identifying expressions of opinion in context. In IJCAI-07, pages 2683–2688. H. Cui, V. Mittal, and M. Datar. 2006. Comparative experiments on sentiment classification for online product reviews. In AAAI-2006. S. Ranjan Das and M. Chen. 2006. Yahoo! for Amazon: Sentiment extraction from small talk on the web. Working Paper, Santa Clara University. K. Dave, S. Lawrence, and D.M. Pennock. 2003. Mining the peanut gallery: Opinion extraction and semantic classification of product reviews. In WWW12, pages 519–528. C. Dellarocas. 2003. The digitization of word-of-mouth: Promise and challenges of online reputation mechanisms. Management Science, 49(10):1407–1424. A. Ghose, M.D. Smith, and R. Telang. 2006. Internet exchanges for used books: An empirical analysis for product cannibalization and social welfare. Information Systems Research, 17(1):3–19. W.H. Greene. 2002. Econometric Analysis. 5th edition. V. Hatzivassiloglou and K.R. McKeown. 1997. Predicting the semantic orientation of adjectives. In ACL’97, pages 174–181. M. Hu and B. Liu. 2004. Mining and summarizing customer reviews. In KDD-2004, pages 168–177. J. Kamps and M. Marx. 2002. Words with attitude. In Proceedings of the First International Conference on Global WordNet. S.-M. Kim and E. Hovy. 2004. Determining the sentiment of opinions. In COLING 2004, pages 1367–1373. A.C. K¨ nig and E. Brill. 2006. Reducing the human overhead in o text categorization. In KDD-2006, pages 598–603. T. Lee. 2004. Use-centric mining of customer reviews. In WITS. S. Lewitt and C. Syverson. 2005. Market distortions when agents are better informed: The value of information in real estate transactions. Working Paper, University of Chicago. M.I. Melnik and J. Alm. 2002. Does a seller’s reputation matter? Evidence from eBay auctions. Journal of Industrial Economics, 50(3):337–350, September. K. Nigam and M. Hurst. 2004. Towards a robust metric of opinion. In AAAI Spring Symposium on Exploring Attitude and Affect in Text, pages 598–603. B. Pang and L. Lee. 2002. Thumbs up? Sentiment classification using machine learning techniques. In EMNLP 2002. B. Pang and L. Lee. 2004. A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In ACL 2004, pages 271–278. B. Pang and L. Lee. 2005. Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In ACL 2005. A.-M. Popescu and O. Etzioni. 2005. Extracting product features and opinions from reviews. In HLT/EMNLP 2005. B. Snyder and R. Barzilay. 2007. Multiple aspect ranking using the good grief algorithm. In HLT-NAACL 2007. J.R. Quinlan. 1992. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, Inc. P. Resnick, K. Kuwabara, R. Zeckhauser, and E. Friedman. 2000. Reputation systems. CACM, 43(12):45–48, December. P.D. Turney and M.L. Littman. 2003. Measuring praise and criticism: Inference of semantic orientation from association. ACM Transactions on Information Systems, 21(4):315–346. P.D. Turney. 2002. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In ACL 2002, pages 417–424. T. Wilson, J. Wiebe, and R. Hwa. 2006. Recognizing strong and weak opinion clauses. Computational Intell., 22(2):73–99.

Similar Documents

Free Essay

Life

...FROM SATAN I saw you yesterday as you began your daily chores. You awoke without kneeling to pray. As a matter of fact, you didn't even bless your meals, or pray before going to bed last night. You are so unthankful, I like that about you. I cannot tell you how glad I am that you have not changed your way of living, Fool, you are mine. Remember, you and I have been going steady for years, and I still don't love you yet. As a matter of fact, I hate you, because I hate God. He kicked me out of heaven, and I'm going to use you as long as possible to pay him back. You see, Fool, GOD LOVES YOU and HE has great plans in store for you. But you have yielded your life to me, and I'm going to make your life a living hell. That way, we'll be together twice. This will really hurt God. Thanks to you, I'm really showing Him who's boss in your life with all of the good times we've had. We have been... watching dirty movies, cursing people out, loving worldly things, having bad influences, stealing, lying, being hypocritical, fornicating, overeating, telling dirty jokes, gossiping, being judgmental, back stabbing people, disrespecting adults, and those in leadership positions, no respect for the Church, bad attitudes. SURELY you don't want to give all this up. Come on, Fool, let's burn together forever. I've got some hot plans for us. This is just a letter of appreciation from me to you. I'd like to say 'THANKS' for letting me use you for most of your...

Words: 556 - Pages: 3

Free Essay

Life

...Life in People’s Eyesight Life is what we can define. There are lots of facts we can associate and describe about it, based on different aspects and point of views of different kind of people. Well, Life is beautiful but not always easy, it has trials too and the challenges lies in dealing it with an exert effort and courage. There’s no human in this world that’s enjoying success without even conquering any hardships and failures. No wonder the saying “No Pain, No Gain”. Happiness and Victory, Sorrow and Defeat, are the two sides of life and what matter is choosing what side you prefer. There are people who see life with full of pleasures and adventures, a part of discoveries and innovations, a prospect for success and prosperity. Most of those people enjoy and live their lives to the edge. They take obstacles in life as their path through what they want. There are also people who see life with full of failures, burdens and bad omens. They think that their life is horrendous and miserable. They take it so bad, as if there’s nothing good about it. Some of them delight doing crimes as far as committing suicides, a permanent solution to a problem that’s merely temporary. There are times that other people lead them to do it, maybe because they suffered too much bullies and sarcasms that they can’t handle the pain so they decides to do unlikely things. Life shouldn’t control us, instead we should control it. We should choose the right to path, not the wrong one. But there...

Words: 423 - Pages: 2

Premium Essay

Life

...LIFE As a parent, I believed that my son would outlive me and have sons and daughters of his own, so we could grow old watching them have families of their own, because that was the way of life. I had hopes and dreams for my son. When he died, it completely destroyed my world and left so many unanswered questions. I was completely lost and bereaved. Rebuilding my life and becoming one of the living again has been a long, hard struggle. The death of my son changed my life and me as a person forever. My son chose to end his life on October 3rd 2008. He was only 29 years old. Nothing in my life prepared me for his death. My world ended and everything that I have known in life was shattered. Life as I knew it had changed forever. I couldn’t think or feel anything other than for the pain that was in my heart. My family and my friends worried for me and wanted me to come and stay with them. How could I go and stay with anyone, when I was like a zombie or the living dead. The days did not mean anything to me anymore. Everything I did was like a robot on autopilot. I ate because someone put food in front me and slept when my mind was so exhausted I had to sleep. I knew I had to try to get back into life and live again and rediscover myself as a person. I knew that was what my son would have wanted me to do. I was a completely different person when my son was alive. I loved life, laughed a lot and was happy. I greeted everyday as an adventure and with joy in my...

Words: 895 - Pages: 4

Premium Essay

Life

...to give a presentation on megaliving- how to lead a perfect life. Let’s make this an interactive session. So, hers a question I invite you to consider on. How many of you out here think that you are leading a perfect life? Or before this can any one of you describe what a perfect life is? Before answering this question, let’s do an exercise, Everyone shut your eyes, take two deep breaths and picture this scene taking place many years into the future: you are in an elegant dining hall, surrounded by those closest to you (who are dressed beautifully, looking their best). The candles shimmer on every table and the importance of the evening wafts through the air like the aromas from the kitchen. This is your testimonial dinner, an opportunity for the people who know you best to speak about you as a person, your achievements and your contributions to those you love and to society in general. Just for a moment, reflect on what you would like them to say. This reflection my friends; is the perfect life you want for yourself. And how many of us are actually living our perfect life..? There are two competing schools of thought. One says go out there live your dream, be the best, play at your highest potential. And the other says that the purpose of life is to simply enjoy the journey, simply be happy, and live in the moment. When you think about these two schools, you realize that both are equally important, life is all about the balance. It’s important to reach for the mountain...

Words: 2106 - Pages: 9

Free Essay

Life

...LIFE. Life may have many definitions. Through out the centuries, people have sought to understand life in different ways. People through their cultural practices, tries to explain and give meaning to life. Others through religion; their way of worship, explain what life is. Philosophers are in no doubt, a people who through the ages have embarked on this long search. However, with observation of nature, times and seasons, one can make an analogy to understand a section of life. The morning stage is the early part of the day or life. This can signify the period of conception in ones mothers worm till birth. This is also a stage where one begins a social orientation of knowing the parents and the family, learning to speak, walk and embracing the environment around him. We can also classify the time when one begins elementary school to graduating from the university and securing your first job as the morning stage of life. The age range is 1-25 years. The mid-day of life is when one has become a full adult. The person could at this stage be married and have children or his or her own nuclear family. One at this stage is independent and makes his own decisions; self actualization can also be found at this stage. Many times, the focus of people at this stage is their family and work. Acquiring houses, cars and all the luxury of life is very predominant at the mid-day of life. The age range is 35 to 60 years The evening stage of life is when people are taking rest from their labour...

Words: 437 - Pages: 2

Free Essay

Life

...thinking of what is life, why we have been here on this earth and what is the purpose of life? As most of you I believe, during my whole life I have been thinking of these questions and have never been able to come across the right answer for this. A poor man is always worried about feeding himself and his family. The middle class people are always worried about arranging basic comforts of life that covers house, car, clothing etc. The upper class of the society is worried about the amount of wealth they are accumulating and the power they have in their hands. You take the case of any one of these and wonder why are we doing this. I partially support the poor class because food is something we cannot avoid in our lives. But how about the others? What are they running after? Atleast I have never been able to support the answer on this. Some may argue and I do partially agree that we do need to something to live this life. But the fact that upsets me the most is what we are doing is really that which we are supposed to do on this earth. Every religion on this earth in one form or the other guides us to do our work and perform duties and leave rest to the God. But, what kind of duties and why do those? Human thinking has been made narrow to the extent that we always think about future and desperately work, always think and devote our life to future. But what future is? It is nothing but death. Then why are we running after future. Why cannot we live in present? Life is surely very...

Words: 421 - Pages: 2

Premium Essay

Life

...I can remember as a child always asking myself the "why" questions of life. Why are we here? What is the purpose of life? Why do certain things happen? And is there really a God? I had always kept these questions to myself and eventually pushed them out of my mind altogether. I was raised in a Christian household and you just were not allowed to ask questions of that nature and doubt the faith. The world is the way it is because God made it that way and that is all there is to it. I was really excited to take this class because it would finally give me the opportunity to exercise my personal thoughts and beliefs. I have come to agree with Socrates that "the unexamined life is not worth living." In my opinion life is a combination of philosophical ideas such as morality, respect…………. The study of philosophy is a very complex and complicated task. There are so many different questions on many different topics and philosophy tries to explain them all. It tries to provide answers to the many questions that science and religion cannot explain. And from this it urges you to think about issues that may otherwise be ignored. I agree with the goal of autonomy, that philosophy is having the freedom to make your own decisions and beliefs by using your own reasoning capabilities. I believe that we all have the God-like quality of reasoning and therefore can make our own educated decisions. And because we can make our own decisions, we are fully responsible for our actions. This responsibility...

Words: 356 - Pages: 2

Premium Essay

Life

...for superior and passionate patient service, clinical excellence, as the health care employer of choice, and the preferred partner of physicians. Our Values - Our values, which flow from our mission and Catholic tradition, ,ust have meaning for every one of us. Through them we put the healing minsitry of Jesus into practice throughout out organizational. The following are behaviors that are expected of all associates, physicians, volunteers, and anyone else acting on behalf of the organization. We hope these behaviors will influence for the better every person whose life we touch. Respect - we value each person as sacred, created, in the image and likeness of God, which gives worht and meaning to each person's life. Integrity - we value honesty and words and actions that build trust. Development - we value personal and professional rowth that bines the physical, emotional, spritual, and relational aspects of life and work. Excellence - we value superior performance in our work and service. Stewardship - we value our responsibility to use human, financial, and natural resources entrusted to us for the mon good, with special concern for those...

Words: 299 - Pages: 2

Premium Essay

Life

...Date: 09-12-2015 LIFE SEASON OF LIFE WHAT IS LIFE? Life can be defined as a characteristic of living. It is the state of being alive or active in lively activities. The opposite of life is death, once one has stopped to exhibit or show the characteristics of life, he or she is said to be dead. Life is also made of many activities, but we will see how to perform this activities. WHAT DO WE REQUIRE IN LIFE TO SUCCEED? For us to be able to succeed in life, we must know the times and seasons which are required of us to do everything we are to achieve in life. This leads us to “the seasons of life”. SEASONS OF LIFE: Just like the day which is made up of 24 hours is classified into morning, afternoon and night; for example, we take breakfast in the morning, lunch in the afternoon and dinner at night, we can’t say that we take breakfast at night and dinner in the morning. So also, life is classified into morning, afternoon and night, and these times have specific activities it wants us to perform. Let us consider each of them below. MORNING: The morning at every person’s life is the time frame from when he/she was born to 20 years of age. At this age, one is required to learn how to walk, read, write, show respect, learn the basics of education, respect for parents, respect for teachers and other elders in the society. One should also learn how to behave himself or herself in the public, how to speak with good manners in the public. At the latter part...

Words: 576 - Pages: 3

Premium Essay

Life

...THE ABSURDITY OF LIFE Steven Luper, Trinity University In "The Absurd"[i] Nagel claims that self-conscious human beings are necessarily absurd, so that to escape absurdity while remaining human we would have to cease being self-conscious. Fifteen years later, in The View From Nowhere,[ii] he defends the same thesis, supplementing some of his old arguments with a battery of new ones. I want to suggest that Nagel has misdiagnosed, and exaggerated the inescapability of, our absurdity. He does so partly because the grounds on which he bases his conclusion are spurious, and partly because he does not acknowledge the extent to which we can eliminate absurdity by suitably redesigning our plans and modes of justification. Nonetheless, I do not mean to imply that we can easily eliminate absurdity from our lives. Life is not necessarily absurd, but unfortunately, in a world like ours, there are limits to what we can and should do to reduce the absurd elements of our affairs. The View of the Nowhere Man "In ordinary life a situation is absurd," Nagel says, "when it includes a conspicuous discrepancy between pretension or aspiration and reality: someone gives a complicated speech in support of a motion that has already been passed. . . ; as you are being knighted, your pants fall down."[iii] In this passage from "The Absurd" Nagel claims that absurdity is a particularly striking sort of incongruity, and the conception of absurdity he discusses in his book is the same. What...

Words: 8565 - Pages: 35

Premium Essay

Student Life

...Student life is the best part of life. A student learns many things from books. But he has to enter the real world after his student life is over. So, he needs practical knowledge of things. Student life prepares a man to lead a successful life. In fact, student life is a life of learning. A student learns morality and good manners. He understand the value of discipline in life. So student life is a period of making and preparation. Student life is a care-free life. It is a life of joy. A student is free from almost all cares of the world. He gets a different atmosphere at college from that of home. He takes part in games and sports. He goes sometimes on a picnic and sometimes on educational trips. A student lives in a small world of his school or college. The life of a student, therefore, is a golden period of life. Student life is a life of discipline. At home a student may behave well or badly. But the atmosphere of a school college is completely different. Good boys are loved and praised. Wicked boys are punished. All students have to follow some rules and obey their teachers. Thus, they learn discipline. Discipline makes them self less and teachers them ‘live and let live’. Student life is a life of character building. Character has its importance in life. A student must be bold and fearless. He must think society. He should try to solve the problems of the nation. He must be loving and truthful. Student life is a life of character formation. Students must have a sense...

Words: 316 - Pages: 2

Free Essay

Beginning of Life

...AST 309 part 2: Extraterrestrial Life The Origin and Evolution of Life on Earth Overview • The formation of Earth • Pre-biotic chemistry (Miller-Urey exp.) • First evidence for early life • The evolution of life • Extreme life on Earth: lessons for astrobiology A timeline for the very early history of the Earth The formation of Earth: The Earth formed over ~50 Myr via planetesimal accretion Earth differentiation: The iron "drops" follow gravity and accumulate towards the core. Lighter materials, such as silicate minerals, migrate upwards in exchange. These silicate-rich materials may well have risen to the surface in molten form, giving rise to an initial magma ocean Early Earth heats up due to radioactive decay, compression, and impacts. Over time the temperature of the planet interior rises towards the Femelting line. After the initial segregation into a central iron (+nickel) core and an outer silicate shell, further differentiation occurred into an inner (solid) and outer (liquid) core (a pressure effect: solid iron is more densely packed than liquid iron), the mantel (Fe+Mg silicates) and the crust (K+Na silicates). Initially large portions of the crust might have been molten - the so called magma ocean. The latter would have cooled to form a layer of basaltic crust (such as is present beneath the oceans today). Continental crust would have formed later. It is probable that the Earth’s initial crust was remelted several times due to impacts with large asteroids...

Words: 2150 - Pages: 9

Free Essay

End of Life

...End of Life Essay It is very important for people to discuss and manage end of life issues. There are many issues that may need to have resolutions such as life support, curative vs symptom management, courses of treatment. Other issues such as burial plots and funeral arrangements. I have found that many people feel they have more time to discuss end of life issues. They feel that they have plenty of time to discuss their wants and wishes for the death and dying process with their loved ones. It is very important to discuss end of life issues with your significant others, spouses, children because one never knows when the end of life issues may come up. Maybe it is a young mother diagnoses with stage 4 ovarian cancer. Maybe it’s a 76 year old male patient that feels that while he has lived a good life, it is never enough time. Or it could be a 16 year old teenage girl that had a soft tissue sarcoma that wasn’t diagnosed until it had metastases all over her body. I have taken care of all of these patients. It is never too early to discuss end of life issues, but sometimes it can be too late. Death is very personalized such as life. It is important that the individual is allowed to pass in a manner that is important to them and to feel that they have taken care of issues that they may feel need to be resolved. We might think that we know what our loved ones want, but it could be very different from what they actually want. For example we may think that our...

Words: 1707 - Pages: 7

Free Essay

Journey of Life

...Running header: Journey of Life 1 Journey of Life Victoria Schoepf ENG125 October 21, 2013 Jennifer Thompson Journey of Life 2 Life starts out as an undetermined journey until the elements of choices are laid in our way. Everyone in life or spirit, will often reflect back on the path or road one has choose to contemplate, with never truly knowing if it we did choose the right or wrong direction. I will compare and contrast the literary works of “The Road Not Taken”, by Robert Frost and “I Used to Live Here Once”, by Jean Rhys. The two literary works attention is on the journey that an individual has decided to take through life. One of the narratives focuses on symbolizing choice, while the other is death, but in the same way are similar, because life itself is a journey that can lead into a life or death situation. “A symbol is an object, person, or action that conveys two meaning: its own literal meaning and something it stands for as well,” (Clugston, 2010). “The Road Not Taken” uses symbolism by starting with “Two roads diverged in a yellow wood,” (Frost, 1916). Two roads diverged is symbolizing the looking back, (a reflection point one has on life altering choices) on life’s lessons, (events) that have impacted one’s journey...

Words: 2272 - Pages: 10

Free Essay

Extraterrestrial Life

...Astronomy Research and the Search for Extraterrestrial Life The objective of this paper is to discuss life here on Earth and the possibilities of extraterrestrial life in one’s solar system. The team will describe the properties of life on Earth and explain the theories for the genesis of life, including the theory of natural selection. Members will provide a brief description of the evolution of life and include a geological timescale that describes specific eras throughout the previous three billion plus years. The team will assess the possibilities of extraterrestrial life, and results will be presented. Describe the properties of life on Earth Properties of life on Earth are what define the word life. The properties of life consist of cells and reproduction, responses to the environment, growth and development, and evolution. Every living thing is collected through cells which make tissues and organs that make a living organism. All living organisms can produce such as humans, animals, bacteria, and plants called asexual reproduction. The responses of the living in the environment are through any changes occurring in “light, sound, heat, and chemical contact” through “eyes, ears, and taste buds” (Cliff’s notes, 2011, para. 3). Behavior is one way that all living organisms changes through the environment such as the food chain for one’s survival. Growth and development of an organism takes in all substantial amount of information through energy of building...

Words: 1783 - Pages: 8