My colleague, Shannon Appelcline, has been working on a game rating system for RPGnet. This has resulted in real-world application of the principles for designing rating systems that we've previously discussed in our Collective Choice articles. Shannon's newest article, _Ratings, Who Do You Trust?_, offers a look at weighting ratings based on reliability.

On the RPGnet Gaming Index we’ve put this all together to form a tree of weighted ratings that answer the question, _who do you trust?_

Here’s how we measured each type of trust, and what we did about it:

  • Volume of Ratings for an Item. Introduce a Bayesian weight to offset the variability of items with low-volume ratings (see the sketch after this list).

  • Volume of Ratings by a User. Give each user a weight based on his volume of contribution, which is applied to his ratings.

  • Depth of Content by a User. Give each rating a weight based on the depth of thought implicit in it, which is applied to that rating.
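
The article doesn't give the exact damping formula, but a common way to implement the first item is a Bayesian average, which pulls an item's mean toward a site-wide prior until enough real ratings accumulate. A minimal sketch, assuming the prior is the site-wide mean and `prior_weight` is a tunable constant (both assumptions, not details from the Index):

```python
def bayesian_average(ratings, site_mean, prior_weight=10):
    """Damp an item's average toward the site-wide mean. The prior
    acts like `prior_weight` phantom ratings at `site_mean`, so its
    pull fades as real ratings accumulate."""
    return (prior_weight * site_mean + sum(ratings)) / (prior_weight + len(ratings))
```

With `site_mean=7.0`, a single 10 rating yields `bayesian_average([10], 7.0)` of about 7.3 rather than a chart-topping 10, which is exactly the low-volume variability the weight is meant to offset.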

These all get put together to create our final ratings for the Gaming Index: each user's individual rating for an item is multiplied by its user weight and its content weight, and the results are averaged together with the Bayesian weight folded in. The result is in no way intuitive, but users don't really need to understand the back end of a rating system. More importantly, we hope it's accurate, or at least more accurate than a raw average would be given the relatively low volume of ratings we've collected thus far.
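
As a sketch of how these pieces might compose, assuming (as above) that the Bayesian weight behaves like phantom ratings at the site mean and that the user and content weights simply multiply; neither detail is confirmed by the article:

```python
def item_rating(ratings, site_mean, prior_weight=10):
    """ratings: (score, user_weight, content_weight) triples for one item.
    Each score is scaled by its user and content weights, and the
    weighted average is damped toward the site-wide mean."""
    weighted_sum = sum(s * uw * cw for s, uw, cw in ratings)
    total_weight = sum(uw * cw for _, uw, cw in ratings)
    return (prior_weight * site_mean + weighted_sum) / (prior_weight + total_weight)
```

A heavily weighted rating from a prolific reviewer moves the result far more than a drive-by rating with near-zero weights, which is the point of the whole tree.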


Comments

Hmm, interesting on the weighting system. Can you explain in more detail how the rating by depth of content is generated? I’m looking through the other articles, so hopefully I’ll find the info there. Frank

magicback (Frank) 2006-10-12T01:30:32-07:00

My first pass system just used a different multiplier for weight based on the type of content:

  • 1x for just a raw rating
  • 2x for a rating with a non-blank comment
  • 5x for a review (which goes through a different, approved content system)

My second pass system also included “volume of ratings by user” and thus applied a variable multiplier depending on how many ratings the user has made:

  • (0-2x) for a raw rating
  • 2x(0-2x) for a rating with comment
  • 5x for a review

The 0-2x is calculated as (# of ratings by user)/50, to a max of 2. I’m pretty sure that the core weighting system by depth is producing better results, though I haven’t done any studies of that yet. The additional weighting for volume by users has definitely prevented bias by hit-and-run raters who just stop by to rate one game that they’ve been asked to.

Shannon Appelcline 2006-10-16T15:39:53-07:00
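
In code, the second-pass scheme Shannon describes comes out to roughly the following; it's a sketch, and the function and argument names are mine, not the Index's:

```python
def rating_weight(content_type, num_ratings_by_user):
    """Weight for one rating under the second-pass scheme above."""
    volume = min(num_ratings_by_user / 50, 2)  # the "0-2x" volume factor
    if content_type == "raw":
        return volume        # (0-2x) for a bare rating
    if content_type == "comment":
        return 2 * volume    # 2x(0-2x) for a rating with a comment
    if content_type == "review":
        return 5             # flat 5x for approved reviews
    raise ValueError(f"unknown content type: {content_type}")
```

So a hit-and-run rater with a single rating contributes a weight of only 1/50, while a regular with 100+ ratings gets the full 2x (or 4x with a comment).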
