<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Collaborative Filtering: Why working on static data sets is not enough</title>
	<atom:link href="http://www.daniel-lemire.com/blog/archives/2007/12/22/collaborative-filtering-why-working-on-static-data-sets-is-not-enough/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.daniel-lemire.com/blog/archives/2007/12/22/collaborative-filtering-why-working-on-static-data-sets-is-not-enough/</link>
	<description>Daniel Lemire's blog is about life in academia, research in Computer Science, wondering how we can reconcile fast databases and algorithms with the informal and asemantic nature of the world around us. It is broadcasted from Montreal (Canada).</description>
	<pubDate>Fri, 21 Nov 2008 20:40:07 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6.3</generator>
		<item>
		<title>By: Daniel Lemire</title>
		<link>http://www.daniel-lemire.com/blog/archives/2007/12/22/collaborative-filtering-why-working-on-static-data-sets-is-not-enough/#comment-49789</link>
		<dc:creator>Daniel Lemire</dc:creator>
		<pubDate>Wed, 19 Mar 2008 15:36:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.daniel-lemire.com/blog/archives/2007/12/22/collaborative-filtering-why-working-on-static-data-sets-is-not-enough/#comment-49789</guid>
		<description>Thanks for the comment. Yes, it would be interesting. We need people to do this. (I can't -- at least not alone.)</description>
		<content:encoded><![CDATA[<p>Thanks for the comment. Yes, it would be interesting. We need people to do this. (I can&#8217;t &#8212; at least not alone.)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Christian Campbell</title>
		<link>http://www.daniel-lemire.com/blog/archives/2007/12/22/collaborative-filtering-why-working-on-static-data-sets-is-not-enough/#comment-49788</link>
		<dc:creator>Christian Campbell</dc:creator>
		<pubDate>Wed, 19 Mar 2008 15:27:31 +0000</pubDate>
		<guid isPermaLink="false">http://www.daniel-lemire.com/blog/archives/2007/12/22/collaborative-filtering-why-working-on-static-data-sets-is-not-enough/#comment-49788</guid>
		<description>I've always figured sites powering recommendation systems would need to perform some sort of experimentation on their users to control for the effect of recommendations. This could include selectively omitting recommendations (perhaps altogether for certain items and/or users) to establish control groups.

Regarding Note 1, I think a simulation of human behaviour adequate to explore the consequences of ratings on human behaviour would require already knowing the answer, so that's a circular and prohibitive way of going about things.</description>
		<content:encoded><![CDATA[<p>I&#8217;ve always figured sites powering recommendation systems would need to perform some sort of experimentation on their users to control for the effect of recommendations. This could include selectively omitting recommendations (perhaps altogether for certain items and/or users) to establish control groups.</p>
<p>Regarding Note 1, I think a simulation of human behaviour adequate to explore the consequences of ratings on human behaviour would require already knowing the answer, so that&#8217;s a circular and prohibitive way of going about things.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mike</title>
		<link>http://www.daniel-lemire.com/blog/archives/2007/12/22/collaborative-filtering-why-working-on-static-data-sets-is-not-enough/#comment-49657</link>
		<dc:creator>Mike</dc:creator>
		<pubDate>Sun, 23 Dec 2007 06:31:48 +0000</pubDate>
		<guid isPermaLink="false">http://www.daniel-lemire.com/blog/archives/2007/12/22/collaborative-filtering-why-working-on-static-data-sets-is-not-enough/#comment-49657</guid>
		<description>[Caveat: I'm a programmer, not a researcher]

The production recommendation systems that I've had experience with attempt to avoid self-reinforcing behavior by introducing a degree of randomness. In other words, you determine recommendations based on the user's rating profile, but then you augment that with some percentage of more remotely related items and possibly even a small percentage of unrelated items. I wish I could provide evidence that this helps, but it's mostly a hack.

There are a couple of other biases in rating data though, at least in the area that I'm familiar with (music). One is the "selection bias", or the fact that people don't rate everything that's presented to them but rather only things they love or hate. The other is that people's rating behavior can differ substantially from their actual listening behavior (probably more so when their rating profile is public).

It might be possible to model users in the sense of reproducing the distribution of ratings in a dataset like Netflix's. But I think the bigger challenge for recommendation technology right now is to capture the things we aren't getting from users, like how to correlate mood to preferences, or how to distinguish true favorites from temporary enthusiasms.</description>
		<content:encoded><![CDATA[<p>[Caveat: I'm a programmer, not a researcher]</p>
<p>The production recommendation systems that I&#8217;ve had experience with attempt to avoid self-reinforcing behavior by introducing a degree of randomness. In other words, you determine recommendations based on the user&#8217;s rating profile, but then you augment that with some percentage of more remotely related items and possibly even a small percentage of unrelated items. I wish I could provide evidence that this helps, but it&#8217;s mostly a hack.</p>
<p>There are a couple of other biases in rating data though, at least in the area that I&#8217;m familiar with (music). One is the &#8220;selection bias&#8221;, or the fact that people don&#8217;t rate everything that&#8217;s presented to them but rather only things they love or hate. The other is that people&#8217;s rating behavior can differ substantially from their actual listening behavior (probably more so when their rating profile is public).</p>
<p>It might be possible to model users in the sense of reproducing the distribution of ratings in a dataset like Netflix&#8217;s. But I think the bigger challenge for recommendation technology right now is to capture the things we aren&#8217;t getting from users, like how to correlate mood to preferences, or how to distinguish true favorites from temporary enthusiasms.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Daniel Lemire</title>
		<link>http://www.daniel-lemire.com/blog/archives/2007/12/22/collaborative-filtering-why-working-on-static-data-sets-is-not-enough/#comment-49655</link>
		<dc:creator>Daniel Lemire</dc:creator>
		<pubDate>Sun, 23 Dec 2007 04:13:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.daniel-lemire.com/blog/archives/2007/12/22/collaborative-filtering-why-working-on-static-data-sets-is-not-enough/#comment-49655</guid>
		<description>The key challenge seems to be: how do we study (with rigor) these problems?</description>
		<content:encoded><![CDATA[<p>The key challenge seems to be: how do we study (with rigor) these problems?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Yehuda Koren</title>
		<link>http://www.daniel-lemire.com/blog/archives/2007/12/22/collaborative-filtering-why-working-on-static-data-sets-is-not-enough/#comment-49654</link>
		<dc:creator>Yehuda Koren</dc:creator>
		<pubDate>Sun, 23 Dec 2007 04:00:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.daniel-lemire.com/blog/archives/2007/12/22/collaborative-filtering-why-working-on-static-data-sets-is-not-enough/#comment-49654</guid>
		<description>I agree that this is a very important complication in the evaluation of collaborative filtering. 
To sharpen the point, I think that there are two separate issues:
(1) 
The fact that the interactive recommender system influences the users' behaviors, which, in turn, feedback into the CF system, and so in a loop. In other words, the CF mechanism is a active part of the system that it is supposed to learn and judge.

(2)
All the feedback to the collaborative filtering is conditioned on the fact that the users actually performed an action. All our observations on a product are based on the very narrow and unrepresentative sub-population that chose to reflect their opinion (implicitly or explicitly) on that product. Naturally, such a population is highly biased to like the product. For example, when we say that "the average rating for The Six Sense movie is 4.5 stars" we really mean to say: "the average rating for The Six Sense movie AMONG PEOPLE THAT CHOSE TO RATE THAT MOVIE is 4.5 stars". Now what is really the average rating for The Six Sense across all population? Well, that's hard to know. But the whole population is the one that really counts...

I used to be much more concerned about the second issue...

Yehuda</description>
		<content:encoded><![CDATA[<p>I agree that this is a very important complication in the evaluation of collaborative filtering.<br />
To sharpen the point, I think that there are two separate issues:<br />
(1)<br />
The fact that the interactive recommender system influences the users&#8217; behaviors, which, in turn, feed back into the CF system, and so on in a loop. In other words, the CF mechanism is an active part of the system that it is supposed to learn and judge.</p>
<p>(2)<br />
All the feedback to the collaborative filtering is conditioned on the fact that the users actually performed an action. All our observations on a product are based on the very narrow and unrepresentative sub-population that chose to express their opinion (implicitly or explicitly) on that product. Naturally, such a population is highly biased toward liking the product. For example, when we say that &#8220;the average rating for The Sixth Sense movie is 4.5 stars&#8221; we really mean to say: &#8220;the average rating for The Sixth Sense movie AMONG PEOPLE THAT CHOSE TO RATE THAT MOVIE is 4.5 stars&#8221;. Now what is really the average rating for The Sixth Sense across the whole population? Well, that&#8217;s hard to know. But the whole population is the one that really counts&#8230;</p>
<p>I used to be much more concerned about the second issue&#8230;</p>
<p>Yehuda</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Peter Turney</title>
		<link>http://www.daniel-lemire.com/blog/archives/2007/12/22/collaborative-filtering-why-working-on-static-data-sets-is-not-enough/#comment-49652</link>
		<dc:creator>Peter Turney</dc:creator>
		<pubDate>Sat, 22 Dec 2007 15:30:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.daniel-lemire.com/blog/archives/2007/12/22/collaborative-filtering-why-working-on-static-data-sets-is-not-enough/#comment-49652</guid>
		<description>Hi Daniel,

This related work comes to mind:

http://tinyurl.com/2uolhv

Intelligent Information Access Publications

- A Learning Agent that Assists the Browsing of Software Libraries
- A Learning Apprentice For Browsing
- Accelerating Browsing by Automatically Inferring a User's Search Goal</description>
		<content:encoded><![CDATA[<p>Hi Daniel,</p>
<p>This related work comes to mind:</p>
<p><a href="http://tinyurl.com/2uolhv" rel="nofollow">http://tinyurl.com/2uolhv</a></p>
<p>Intelligent Information Access Publications</p>
<p>- A Learning Agent that Assists the Browsing of Software Libraries<br />
- A Learning Apprentice For Browsing<br />
- Accelerating Browsing by Automatically Inferring a User&#8217;s Search Goal</p>
]]></content:encoded>
	</item>
</channel>
</rss>
