<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Questions for Tim Converse about Content Classification?</title>
	<atom:link href="http://www.ysearchblog.com/2004/11/02/questions-for-tim-converse-about-content-classification/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.ysearchblog.com/2004/11/02/questions-for-tim-converse-about-content-classification/</link>
	<description></description>
	<lastBuildDate>Fri, 20 Nov 2009 18:49:51 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Ian Saunders</title>
		<link>http://www.ysearchblog.com/2004/11/02/questions-for-tim-converse-about-content-classification/comment-page-1/#comment-391</link>
		<dc:creator>Ian Saunders</dc:creator>
		<pubDate>Sun, 14 Nov 2004 20:51:17 +0000</pubDate>
		<guid isPermaLink="false">http://ysearchblog.com/blog/2004/11/02/questions-for-tim-converse-about-content-classification/#comment-391</guid>
		<description>Hi Tim
Are your algorithms able to automatically recognise the context of a page?
Are the keywords that you retain representative of the context, or simply representative of being present on the page in question?
</description>
		<content:encoded><![CDATA[<p>Hi Tim<br />
Are your algorithms able to automatically recognise the context of a page?<br />
Are the keywords that you retain representative of the context, or simply representative of being present on the page in question?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Paulo</title>
		<link>http://www.ysearchblog.com/2004/11/02/questions-for-tim-converse-about-content-classification/comment-page-1/#comment-390</link>
		<dc:creator>Paulo</dc:creator>
		<pubDate>Thu, 04 Nov 2004 22:01:32 +0000</pubDate>
		<guid isPermaLink="false">http://ysearchblog.com/blog/2004/11/02/questions-for-tim-converse-about-content-classification/#comment-390</guid>
		<description>What about prank sites with wildcard subdomains, like IsGay and WasArrested, which repeatedly mangle the results for my name -- &lt;a href=&quot;http://search.yahoo.com/search?p=ordoveza&quot; rel=&quot;nofollow&quot;&gt;http://search.yahoo.com/search?p=ordoveza&lt;/a&gt; (which is an issue I&#039;ve tried to raise with Yahoo before)? Aren&#039;t these artificially inflating their own rankings by making themselves appear to be more sites than they are? How do you plan to filter these?
</description>
		<content:encoded><![CDATA[<p>What about prank sites with wildcard subdomains, like IsGay and WasArrested, which repeatedly mangle the results for my name &#8212; <a href="http://search.yahoo.com/search?p=ordoveza" rel="nofollow">http://search.yahoo.com/search?p=ordoveza</a> (which is an issue I&#8217;ve tried to raise with Yahoo before)? Aren&#8217;t these artificially inflating their own rankings by making themselves appear to be more sites than they are? How do you plan to filter these?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: eileen kowalski</title>
		<link>http://www.ysearchblog.com/2004/11/02/questions-for-tim-converse-about-content-classification/comment-page-1/#comment-389</link>
		<dc:creator>eileen kowalski</dc:creator>
		<pubDate>Thu, 04 Nov 2004 01:05:59 +0000</pubDate>
		<guid isPermaLink="false">http://ysearchblog.com/blog/2004/11/02/questions-for-tim-converse-about-content-classification/#comment-389</guid>
		<description>Hi Tim and Jeremy,

Thanks for hosting an open Q&amp;A session. I just have a few questions:

1. I noticed that you said &quot;algorithmically classify web pages&quot; above. What do you think about the new MSN Search&#039;s Block Level Analysis? Do you believe that one page can contain content that is relevant to disparate categories or does there always need to be an overarching theme?

2. Is your categorization system based more on creating an ontology that lasts or being able to create an ontology that morphs along with user behavior?

3. In mid-October, I posted about the 301 redirect issue on Jeremy&#039;s blog since I was seeing websites with 301 redirects being penalized for having &quot;duplicate&quot; content. Now, I&#039;ve seen some websites with redirects regain their rankings, but the pages are ranking under the old URLs and the new URLs are not being indexed. Is there still work being done on the 301 issue? And do you have any idea on when it will be resolved?

Thanks again for your time and consideration. I look forward to reading the upcoming discussion.
</description>
		<content:encoded><![CDATA[<p>Hi Tim and Jeremy,</p>
<p>Thanks for hosting an open Q&#038;A session. I just have a few questions:</p>
<p>1. I noticed that you said &#8220;algorithmically classify web pages&#8221; above. What do you think about the new MSN Search&#8217;s Block Level Analysis? Do you believe that one page can contain content that is relevant to disparate categories or does there always need to be an overarching theme?</p>
<p>2. Is your categorization system based more on creating an ontology that lasts or being able to create an ontology that morphs along with user behavior?</p>
<p>3. In mid-October, I posted about the 301 redirect issue on Jeremy&#8217;s blog since I was seeing websites with 301 redirects being penalized for having &#8220;duplicate&#8221; content. Now, I&#8217;ve seen some websites with redirects regain their rankings, but the pages are ranking under the old URLs and the new URLs are not being indexed. Is there still work being done on the 301 issue? And do you have any idea on when it will be resolved?</p>
<p>Thanks again for your time and consideration. I look forward to reading the upcoming discussion.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Karl</title>
		<link>http://www.ysearchblog.com/2004/11/02/questions-for-tim-converse-about-content-classification/comment-page-1/#comment-388</link>
		<dc:creator>Karl</dc:creator>
		<pubDate>Wed, 03 Nov 2004 19:07:01 +0000</pubDate>
		<guid isPermaLink="false">http://ysearchblog.com/blog/2004/11/02/questions-for-tim-converse-about-content-classification/#comment-388</guid>
		<description>Hello Tim,
A competitor of mine has 4 websites with identical content. All 4 sites have a page ranked quite high on Yahoo! SERPs for &quot;luray va cabin rentals&quot;. The SERP pages are different from each other but again all 4 pages are on each site.

I have reported this to you as Index Spamming a couple of times but the 4 websites are still in business. IS HAVING MIRROR WEBSITES NOT CONSIDERED INDEX SPAMMING ANYMORE??

Thanks for your response,
Karl Baldwin
</description>
		<content:encoded><![CDATA[<p>Hello Tim,<br />
A competitor of mine has 4 websites with identical content. All 4 sites have a page ranked quite high on Yahoo! SERPs for &#8220;luray va cabin rentals&#8221;. The SERP pages are different from each other but again all 4 pages are on each site.</p>
<p>I have reported this to you as Index Spamming a couple of times but the 4 websites are still in business. IS HAVING MIRROR WEBSITES NOT CONSIDERED INDEX SPAMMING ANYMORE??</p>
<p>Thanks for your response,<br />
Karl Baldwin</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Will Fitzgerald</title>
		<link>http://www.ysearchblog.com/2004/11/02/questions-for-tim-converse-about-content-classification/comment-page-1/#comment-387</link>
		<dc:creator>Will Fitzgerald</dc:creator>
		<pubDate>Wed, 03 Nov 2004 01:54:54 +0000</pubDate>
		<guid isPermaLink="false">http://ysearchblog.com/blog/2004/11/02/questions-for-tim-converse-about-content-classification/#comment-387</guid>
		<description>Hey Tim,

Does/will the content group make any use of the Yahoo! Directory? Or other human-created directories and/or ontologies?
</description>
		<content:encoded><![CDATA[<p>Hey Tim,</p>
<p>Does/will the content group make any use of the Yahoo! Directory? Or other human-created directories and/or ontologies?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: david</title>
		<link>http://www.ysearchblog.com/2004/11/02/questions-for-tim-converse-about-content-classification/comment-page-1/#comment-386</link>
		<dc:creator>david</dc:creator>
		<pubDate>Wed, 03 Nov 2004 00:55:40 +0000</pubDate>
		<guid isPermaLink="false">http://ysearchblog.com/blog/2004/11/02/questions-for-tim-converse-about-content-classification/#comment-386</guid>
		<description>Tim: Can you talk about how Y! sees spam versus ham? (The &quot;I&#039;d tell you, but I&#039;d have to kill you&quot; response works.)

Do you use any open source solutions in the process, perhaps SpamAssassin&#039;s engine?
</description>
		<content:encoded><![CDATA[<p>Tim: Can you talk about how Y! sees spam versus ham? (The &#8220;I&#8217;d tell you, but I&#8217;d have to kill you&#8221; response works.)</p>
<p>Do you use any open source solutions in the process, perhaps SpamAssassin&#8217;s engine?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nacho Hernandez</title>
		<link>http://www.ysearchblog.com/2004/11/02/questions-for-tim-converse-about-content-classification/comment-page-1/#comment-385</link>
		<dc:creator>Nacho Hernandez</dc:creator>
		<pubDate>Tue, 02 Nov 2004 23:50:17 +0000</pubDate>
		<guid isPermaLink="false">http://ysearchblog.com/blog/2004/11/02/questions-for-tim-converse-about-content-classification/#comment-385</guid>
		<description>Great blog, Jeremy!

Hello Tim: Perhaps you can tell us a little bit more about how you guys are capable of spotting correlations between categories and how that can affect the classification of web pages within the software&#039;s algorithms?

Thanks!

Nacho Hernandez
</description>
		<content:encoded><![CDATA[<p>Great blog, Jeremy!</p>
<p>Hello Tim: Perhaps you can tell us a little bit more about how you guys are capable of spotting correlations between categories and how that can affect the classification of web pages within the software&#8217;s algorithms?</p>
<p>Thanks!</p>
<p>Nacho Hernandez</p>
]]></content:encoded>
	</item>
</channel>
</rss>
