<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: The &quot;predict flu using search&quot; study you didn&#039;t hear about</title>
	<atom:link href="http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/</link>
	<description>Musings of a computer scientist and Yahoo on prediction markets, gambling, and estimating the odds of everything</description>
	<lastBuildDate>Thu, 11 Mar 2010 12:32:57 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: David Pennock</title>
		<link>http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/comment-page-1/#comment-369</link>
		<dc:creator>David Pennock</dc:creator>
		<pubDate>Thu, 30 Apr 2009 13:28:13 +0000</pubDate>
		<guid isPermaLink="false">http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/#comment-369</guid>
		<description>This paper claims search queries led news articles in the listeria outbreak in Canada. [Hat tip: Sharad Goel.]

&lt;a href=&quot;http://www.cmaj.ca/cgi/rapidpdf/cmaj.1090215&quot; rel=&quot;nofollow&quot;&gt;Early detection of disease outbreaks using the Internet&lt;/a&gt;
Kumanan Wilson MD MSc, John S. Brownstein PhD
Canadian Medical Association Journal</description>
		<content:encoded><![CDATA[<p>This paper claims search queries led news articles in the listeria outbreak in Canada. [Hat tip: Sharad Goel.]</p>
<p><a href="http://www.cmaj.ca/cgi/rapidpdf/cmaj.1090215" rel="nofollow">Early detection of disease outbreaks using the Internet</a><br />
Kumanan Wilson MD MSc, John S. Brownstein PhD<br />
Canadian Medical Association Journal</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Pennock</title>
		<link>http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/comment-page-1/#comment-368</link>
		<dc:creator>David Pennock</dc:creator>
		<pubDate>Thu, 15 Jan 2009 16:42:42 +0000</pubDate>
		<guid isPermaLink="false">http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/#comment-368</guid>
		<description>Kostya,

Actually our method is even more naive: we identify influenza-related queries manually. The Google study is more sophisticated: they identify queries automatically using feature selection.

Thanks for the note about SKY-MAP: I will circulate your proposal.</description>
		<content:encoded><![CDATA[<p>Kostya,</p>
<p>Actually our method is even more naive: we identify influenza-related queries manually. The Google study is more sophisticated: they identify queries automatically using feature selection.</p>
<p>Thanks for the note about SKY-MAP: I will circulate your proposal.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kostya</title>
		<link>http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/comment-page-1/#comment-367</link>
		<dc:creator>Kostya</dc:creator>
		<pubDate>Mon, 24 Nov 2008 21:47:26 +0000</pubDate>
		<guid isPermaLink="false">http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/#comment-367</guid>
		<description>David,
That is certainly a great and very practical idea – analysis of the correlation between the density of subject-related queries and (the same subject)-related social events (epidemics in your case). But I think that it may be even more interesting to analyze it from more common perspective. I couldn’t find the text of your article (and would be grateful if you will find it possible to provide it for me) but from http://www.journals.uchicago.edu/doi/abs/10.1086/593098 it seems like you used some function F(q)-&gt;Q (q-query, Q belongs to R) for query’s classification (to determine if the particular query q is “influenza-related”). In this case the choice of F is critical and always questionable. Did you think about the opposite approach – to determine “influenza-sensitive” classes of queries first? In my opinion that may allow more accurate predictions because people on very early stages of the disease may subconsciously react, generating well-classified queries, not related directly to influenza at all. The class of functions F may be chosen in many different ways. For example it may be a simple neural network.

In general I think that the analysis of search-engine queries may be extremely effective tool very much applicable for the society-control.

Thanks.

Another thing-
Thank you David for your compliments – SKY-MAP is our lovely child and we will continue working on it as long as we stay ourselves. But this baby is growing rapidly and it is becoming harder for us to wear it. We are trying to find people and/or organizations that would be interested in cooperation. What we actually think – if Google has something and MS has it but Yahoo doesn’t – may be Yahoo would be interested in having something similar? Do you know any people in Yahoo who might be interested in talking to us?

Any case – thanks a lot,
K. Lysenko.</description>
		<content:encoded><![CDATA[<p>David,<br />
That is certainly a great and very practical idea – analysis of the correlation between the density of subject-related queries and (the same subject)-related social events (epidemics in your case). But I think that it may be even more interesting to analyze it from more common perspective. I couldn’t find the text of your article (and would be grateful if you will find it possible to provide it for me) but from <a href="http://www.journals.uchicago.edu/doi/abs/10.1086/593098" rel="nofollow">http://www.journals.uchicago.edu/doi/abs/10.1086/593098</a> it seems like you used some function F(q)-&gt;Q (q-query, Q belongs to R) for query’s classification (to determine if the particular query q is “influenza-related”). In this case the choice of F is critical and always questionable. Did you think about the opposite approach – to determine “influenza-sensitive” classes of queries first? In my opinion that may allow more accurate predictions because people on very early stages of the disease may subconsciously react, generating well-classified queries, not related directly to influenza at all. The class of functions F may be chosen in many different ways. For example it may be a simple neural network.</p>
<p>In general I think that the analysis of search-engine queries may be extremely effective tool very much applicable for the society-control.</p>
<p>Thanks.</p>
<p>Another thing-<br />
Thank you David for your compliments – SKY-MAP is our lovely child and we will continue working on it as long as we stay ourselves. But this baby is growing rapidly and it is becoming harder for us to wear it. We are trying to find people and/or organizations that would be interested in cooperation. What we actually think – if Google has something and MS has it but Yahoo doesn’t – may be Yahoo would be interested in having something similar? Do you know any people in Yahoo who might be interested in talking to us?</p>
<p>Any case – thanks a lot,<br />
K. Lysenko.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Pennock</title>
		<link>http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/comment-page-1/#comment-366</link>
		<dc:creator>David Pennock</dc:creator>
		<pubDate>Mon, 24 Nov 2008 14:14:11 +0000</pubDate>
		<guid isPermaLink="false">http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/#comment-366</guid>
		<description>Postscript:

We spoke with the Google authors and the Nature editors and our paper will be cited in the Google Nature paper. Many thanks to the Google authors and the Nature editors, and thanks everyone for your comments and encouragement.</description>
		<content:encoded><![CDATA[<p>Postscript:</p>
<p>We spoke with the Google authors and the Nature editors and our paper will be cited in the Google Nature paper. Many thanks to the Google authors and the Nature editors, and thanks everyone for your comments and encouragement.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Pennock</title>
		<link>http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/comment-page-1/#comment-365</link>
		<dc:creator>David Pennock</dc:creator>
		<pubDate>Mon, 24 Nov 2008 14:09:23 +0000</pubDate>
		<guid isPermaLink="false">http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/#comment-365</guid>
		<description>Thanks Kostya. Sky-Map.org (alias WikiSky.org) is fantastic. Here is my &lt;a href=&quot;http://delicious.com/url/f43860831ac00e78dba8888629ef441c&quot; rel=&quot;nofollow&quot;&gt;delicious.com quote&lt;/a&gt;:

&quot;Crazy cool and comprehensive map of the universe: virtual view of the night sky: zoom into to incredible depth anywhere and look at high resolution photos from the Hubble telescope, etc., of significant objects.&quot;

Keep up the great work. Thanks for your note: it helps keep things in perspective: your situation is even more clear and more deserving of credit with less ability to voice.</description>
		<content:encoded><![CDATA[<p>Thanks Kostya. Sky-Map.org (alias WikiSky.org) is fantastic. Here is my <a href="http://delicious.com/url/f43860831ac00e78dba8888629ef441c" rel="nofollow">delicious.com quote</a>:</p>
<p>&#8220;Crazy cool and comprehensive map of the universe: virtual view of the night sky: zoom into to incredible depth anywhere and look at high resolution photos from the Hubble telescope, etc., of significant objects.&#8221;</p>
<p>Keep up the great work. Thanks for your note: it helps keep things in perspective: your situation is even more clear and more deserving of credit with less ability to voice.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kostya Lysenko</title>
		<link>http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/comment-page-1/#comment-364</link>
		<dc:creator>Kostya Lysenko</dc:creator>
		<pubDate>Thu, 20 Nov 2008 22:27:00 +0000</pubDate>
		<guid isPermaLink="false">http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/#comment-364</guid>
		<description>Now you understand exactly how my partner and me were feeling when GOOGLE announced GOOGLE SKY as something absolutely new and the best when SKY-MAP.ORG was already online for more then a year! But you are in YAHOO and you at least can make your voice louder.</description>
		<content:encoded><![CDATA[<p>Now you understand exactly how my partner and me were feeling when GOOGLE announced GOOGLE SKY as something absolutely new and the best when SKY-MAP.ORG was already online for more then a year! But you are in YAHOO and you at least can make your voice louder.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Pennock</title>
		<link>http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/comment-page-1/#comment-363</link>
		<dc:creator>David Pennock</dc:creator>
		<pubDate>Mon, 17 Nov 2008 13:49:29 +0000</pubDate>
		<guid isPermaLink="false">http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/#comment-363</guid>
		<description>Greg: thanks: yes, credit assignment is hard, and even harder to correct after the fact. Probably in the grand scheme I should worry less about credit than progress. Usually I find myself in the opposite situation where an idea I have turns out to have been published previously. Almost every good idea has likely been thought of by someone somewhere.

Edward: thanks: I agree journal lag times are a problem, in fact the whole journal system with copyrights, fees, and embargoes, seems to hinder more than help, especially now that high quality self publishing is near costless. Likely the Google folks are blameless: I agree. We did present the work to a scientific conference in March 2008 (here is an &lt;a href=&quot;http://www.shea-online.org/Assets/files/Content_Descriptions.pdf&quot; rel=&quot;nofollow&quot;&gt;obscure online reference&lt;/a&gt;: &quot;The 2008 late-breaker session will cover topics that include tracking influenza through internet searches&quot;), but I can&#039;t fault anyone for not being aware of every related presentation or even publication out there.

Here another tidbit for the conspiracy theory: The same abstract was rejected from a CDC-sponsored meeting (ICEID) in November 2007. It was the only abstract that was rejected out of five abstracts that Phillip submitted, and CDC is a partner of the Google effort. Though it&#039;s fun to contemplate some grand conspiracy, I&#039;m sure it was coincidence.</description>
		<content:encoded><![CDATA[<p>Greg: thanks: yes, credit assignment is hard, and even harder to correct after the fact. Probably in the grand scheme I should worry less about credit than progress. Usually I find myself in the opposite situation where an idea I have turns out to have been published previously. Almost every good idea has likely been thought of by someone somewhere.</p>
<p>Edward: thanks: I agree journal lag times are a problem, in fact the whole journal system with copyrights, fees, and embargoes, seems to hinder more than help, especially now that high quality self publishing is near costless. Likely the Google folks are blameless: I agree. We did present the work to a scientific conference in March 2008 (here is an <a href="http://www.shea-online.org/Assets/files/Content_Descriptions.pdf" rel="nofollow">obscure online reference</a>: &#8220;The 2008 late-breaker session will cover topics that include tracking influenza through internet searches&#8221;), but I can&#8217;t fault anyone for not being aware of every related presentation or even publication out there.</p>
<p>Here another tidbit for the conspiracy theory: The same abstract was rejected from a CDC-sponsored meeting (ICEID) in November 2007. It was the only abstract that was rejected out of five abstracts that Phillip submitted, and CDC is a partner of the Google effort. Though it&#8217;s fun to contemplate some grand conspiracy, I&#8217;m sure it was coincidence.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Edward Ratzer</title>
		<link>http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/comment-page-1/#comment-362</link>
		<dc:creator>Edward Ratzer</dc:creator>
		<pubDate>Sat, 15 Nov 2008 10:20:16 +0000</pubDate>
		<guid isPermaLink="false">http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/#comment-362</guid>
		<description>I think this is another example of the problems behind the long lag times in journal publication.

The timescale quoted on the Clinical Infectious Diseases website for your paper is, &quot;Received 8 May 2008; accepted 11 August 2008; electronically published 27 October 2008.&quot;  You may have done this research a while back but the first Google could have heard about this would have been just over 2 weeks ago.  We can&#039;t see the timescales for the Google Nature submission yet but my guess is that it could well have been with Nature before your publication.

It&#039;s a pity that Google got all the press coverage but they&#039;re probably blameless and a victim of the secrecy in journals&#039; pre-press periods.</description>
		<content:encoded><![CDATA[<p>I think this is another example of the problems behind the long lag times in journal publication.</p>
<p>The timescale quoted on the Clinical Infectious Diseases website for your paper is, &#8220;Received 8 May 2008; accepted 11 August 2008; electronically published 27 October 2008.&#8221;  You may have done this research a while back but the first Google could have heard about this would have been just over 2 weeks ago.  We can&#8217;t see the timescales for the Google Nature submission yet but my guess is that it could well have been with Nature before your publication.</p>
<p>It&#8217;s a pity that Google got all the press coverage but they&#8217;re probably blameless and a victim of the secrecy in journals&#8217; pre-press periods.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Greg Linden</title>
		<link>http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/comment-page-1/#comment-361</link>
		<dc:creator>Greg Linden</dc:creator>
		<pubDate>Fri, 14 Nov 2008 23:51:16 +0000</pubDate>
		<guid isPermaLink="false">http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/#comment-361</guid>
		<description>That&#039;s a good point, David, that the press is praising Google for the novelty of the idea.  That the praise is more appropriate for the novelty of the implemented system, not the idea, I think we agree.

Ironically, the reverse situation happened to me in the past.  The algorithm for Amazon&#039;s recommendations was implemented, used by many, and disclosed publicly (in a patent). But, a research paper that came well after anyone could read about the technique we used is often cited in the research community as the first instance of the idea.

We attempted to correct this by later publishing an academic article that contained much of the same material in our earlier disclosure, but this time in a forum that was directed toward the research community, but still the other paper is often cited by academic researchers as the first instance.

In any case, thanks for setting the record straight with your reference to your prior work on the topic.  It is a fun and remarkably powerful idea, detecting flu outbreaks using search activity, and the general category of using online behavior trends to predict offline trends is fascinating and shows much promise.</description>
		<content:encoded><![CDATA[<p>That&#8217;s a good point, David, that the press is praising Google for the novelty of the idea.  That the praise is more appropriate for the novelty of the implemented system, not the idea, I think we agree.</p>
<p>Ironically, the reverse situation happened to me in the past.  The algorithm for Amazon&#8217;s recommendations was implemented, used by many, and disclosed publicly (in a patent). But, a research paper that came well after anyone could read about the technique we used is often cited in the research community as the first instance of the idea.</p>
<p>We attempted to correct this by later publishing an academic article that contained much of the same material in our earlier disclosure, but this time in a forum that was directed toward the research community, but still the other paper is often cited by academic researchers as the first instance.</p>
<p>In any case, thanks for setting the record straight with your reference to your prior work on the topic.  It is a fun and remarkably powerful idea, detecting flu outbreaks using search activity, and the general category of using online behavior trends to predict offline trends is fascinating and shows much promise.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Pennock</title>
		<link>http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/comment-page-1/#comment-360</link>
		<dc:creator>David Pennock</dc:creator>
		<pubDate>Fri, 14 Nov 2008 21:54:04 +0000</pubDate>
		<guid isPermaLink="false">http://blog.oddhead.com/2008/11/14/the-predict-flu-using-search-study-you-didnt-hear-about/#comment-360</guid>
		<description>Greg: Thanks for the comment. I agree with you on almost everything: the live site represents a great deal of effort (orders of magnitude more than the publication), provides significant social value, and it likely the main source of enthusiasm. Still, much of the praise has to do with the novelty of the idea, and that is what I take issue with. I believe a peer-reviewed publication represents something more tangible than &quot;discussing an idea in the research community&quot;, though less tangible than an implemented system. I&#039;m mostly referring to the Nature paper, not the Flu Trends site which I agree is great and worthy of praise. Finally, I may be jaded but I believe if Yahoo!, Microsoft, or AOL launched Flu Trends, the reception would have been less enthusiastic.

Gordon, thanks very much for the comment: agreed.</description>
		<content:encoded><![CDATA[<p>Greg: Thanks for the comment. I agree with you on almost everything: the live site represents a great deal of effort (orders of magnitude more than the publication), provides significant social value, and it likely the main source of enthusiasm. Still, much of the praise has to do with the novelty of the idea, and that is what I take issue with. I believe a peer-reviewed publication represents something more tangible than &#8220;discussing an idea in the research community&#8221;, though less tangible than an implemented system. I&#8217;m mostly referring to the Nature paper, not the Flu Trends site which I agree is great and worthy of praise. Finally, I may be jaded but I believe if Yahoo!, Microsoft, or AOL launched Flu Trends, the reception would have been less enthusiastic.</p>
<p>Gordon, thanks very much for the comment: agreed.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
