<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Latent Semantic Indexing can improve your WordPress search results</title>
	<atom:link href="http://elliottback.com/wp/latent-semantic-indexing-can-improve-your-wordpress-search-results/feed/" rel="self" type="application/rss+xml" />
	<link>http://elliottback.com/wp/latent-semantic-indexing-can-improve-your-wordpress-search-results/</link>
	<description>Internet &#38; Technology</description>
	<lastBuildDate>Wed, 08 Feb 2012 10:01:08 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Elliott Back</title>
		<link>http://elliottback.com/wp/latent-semantic-indexing-can-improve-your-wordpress-search-results/#comment-1521253</link>
		<dc:creator>Elliott Back</dc:creator>
		<pubDate>Sun, 27 Apr 2008 10:59:47 +0000</pubDate>
		<guid isPermaLink="false">/?p=766#comment-1521253</guid>
		<description>There are actually continuous models.  You can update an approximate SVD of the term-document matrix as new documents come in without too much work; it just won&#039;t be as accurate as a full recomputation...</description>
		<content:encoded><![CDATA[<p>There are actually continuous models.  You can update an approximate SVD of the term-document matrix as new documents come in without too much work; it just won&#8217;t be as accurate as a full recomputation&#8230;</p>
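<p>A minimal sketch of one such approximate update, commonly called folding-in: the existing rank-k factors are kept fixed and a new document is projected into the same latent space instead of recomputing the SVD. Everything here is an assumption for illustration: the toy matrix, the function names truncated_svd and fold_in, and the use of plain power iteration in place of a real linear algebra library.</p>
<pre>
```python
import math

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def matvec(M, x):
    return [dot(row, x) for row in M]

def truncated_svd(A, k, iters=300):
    """Top-k singular triplets of A (terms x docs) via power iteration on A^T A."""
    At = [list(col) for col in zip(*A)]
    U, S, V = [], [], []
    for _component in range(k):
        v = [1.0 / (i + 1) for i in range(len(At))]  # fixed, non-random start
        for _step in range(iters):
            w = matvec(At, matvec(A, v))
            for prev in V:  # deflation: stay orthogonal to earlier vectors
                c = dot(w, prev)
                w = [wi - c * pi for wi, pi in zip(w, prev)]
            norm = math.sqrt(dot(w, w))
            v = [wi / norm for wi in w]
        Av = matvec(A, v)
        sigma = math.sqrt(dot(Av, Av))
        U.append([x / sigma for x in Av])  # left singular vector (term space)
        S.append(sigma)                    # singular value
        V.append(v)                        # right singular vector (doc space)
    return U, S, V

def fold_in(doc, U, S):
    """Project a new term-count vector into the existing latent space."""
    return [dot(u, doc) / s for u, s in zip(U, S)]

# Toy term-document matrix (rows: terms, columns: documents); made-up data.
A = [
    [1, 1, 0, 1],  # search
    [1, 1, 0, 0],  # index
    [0, 0, 1, 1],  # wordpress
    [0, 0, 1, 1],  # plugin
    [0, 1, 0, 0],  # svd
]
U, S, V = truncated_svd(A, k=2)

# A new post mentioning "search" and "wordpress", indexed without a rebuild.
new_doc = [1, 0, 1, 0, 0]
new_coords = fold_in(new_doc, U, S)
```
</pre>
<p>Folding in a document that is already in the matrix reproduces its existing coordinates, which is a handy sanity check. The accuracy cost is that the latent axes never move: terms and co-occurrences introduced by new documents do not reshape the space until the next full rebuild.</p>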
]]></content:encoded>
	</item>
	<item>
		<title>By: Of Wordpress Search on iface thoughts</title>
		<link>http://elliottback.com/wp/latent-semantic-indexing-can-improve-your-wordpress-search-results/#comment-399955</link>
		<dc:creator>Of Wordpress Search on iface thoughts</dc:creator>
		<pubDate>Wed, 31 Jan 2007 09:26:22 +0000</pubDate>
		<guid isPermaLink="false">/?p=766#comment-399955</guid>
		<description>[...] Elliott points towards Latent Semantic Indexing (LSI) for improving the search. It might not be a viable option today, but it might be the future of the search industry. Natural Language Processing is already being used in certain search functionalities. [...]</description>
		<content:encoded><![CDATA[<p>[...] Elliott points towards Latent Semantic Indexing (LSI) for improving the search. It might not be a viable option today, but it might be the future of the search industry. Natural Language Processing is already being used in certain search functionalities. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David W. Lee</title>
		<link>http://elliottback.com/wp/latent-semantic-indexing-can-improve-your-wordpress-search-results/#comment-51583</link>
		<dc:creator>David W. Lee</dc:creator>
		<pubDate>Fri, 12 May 2006 14:55:32 +0000</pubDate>
		<guid isPermaLink="false">/?p=766#comment-51583</guid>
		<description>Typo in my previous message: I mean you &quot;cannot&quot; index one entry by itself.. =)</description>
		<content:encoded><![CDATA[<p>Typo in my previous message: I mean you &#8220;cannot&#8221; index one entry by itself.. =)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David W. Lee</title>
		<link>http://elliottback.com/wp/latent-semantic-indexing-can-improve-your-wordpress-search-results/#comment-51582</link>
		<dc:creator>David W. Lee</dc:creator>
		<pubDate>Fri, 12 May 2006 14:54:13 +0000</pubDate>
		<guid isPermaLink="false">/?p=766#comment-51582</guid>
		<description>I studied LSI during my Master&#039;s and I, too, am looking for a commercial implementation of it.

One problem with LSI (based on the proposed implementation in the textbook) is that the indexing is not continuous. You can add an entry in your database and have it indexed just by itself. You have to re-index the entire database, which is a pain in the butt. And it might also cause the search results to change significantly from one build to another. So, Edward, if you can find a solution to this (making the indexing continuous), then you will be the next Bill Gates.

For some applications this may not be a problem. But in general, this is bad.

Another problem with LSI is the algorithm that finds the &quot;nearest neighbors&quot;. I don&#039;t think people have a good solution to that yet. But in terms of searching, the time and space complexity is not so much of a concern.

For Google, however, the problem is the immense number of pages and keywords they have to index. The complexity goes up at least at a second-order polynomial rate. We may not have enough atoms in the universe to store all that information.</description>
		<content:encoded><![CDATA[<p>I studied LSI during my Master&#8217;s and I, too, am looking for a commercial implementation of it.</p>
<p>One problem with LSI (based on the proposed implementation in the textbook) is that the indexing is not continuous. You can add an entry in your database and have it indexed just by itself. You have to re-index the entire database, which is a pain in the butt. And it might also cause the search results to change significantly from one build to another. So, Edward, if you can find a solution to this (making the indexing continuous), then you will be the next Bill Gates.</p>
<p>For some applications this may not be a problem. But in general, this is bad.</p>
<p>Another problem with LSI is the algorithm that finds the &#8220;nearest neighbors&#8221;. I don&#8217;t think people have a good solution to that yet. But in terms of searching, the time and space complexity is not so much of a concern.</p>
<p>For Google, however, the problem is the immense number of pages and keywords they have to index. The complexity goes up at least at a second-order polynomial rate. We may not have enough atoms in the universe to store all that information.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Elliott Back</title>
		<link>http://elliottback.com/wp/latent-semantic-indexing-can-improve-your-wordpress-search-results/#comment-1726</link>
		<dc:creator>Elliott Back</dc:creator>
		<pubDate>Fri, 10 Jun 2005 00:43:45 +0000</pubDate>
		<guid isPermaLink="false">/?p=766#comment-1726</guid>
		<description>LSI is a patented technique, and is difficult to implement properly.  Don&#039;t even try it unless you read the appropriate academic papers first.  I suggest http://scholar.google.com as a first resource.  Even Google doesn&#039;t use LSI, at least not yet, because of its computational complexity.</description>
		<content:encoded><![CDATA[<p>LSI is a patented technique, and is difficult to implement properly.  Don&#8217;t even try it unless you read the appropriate academic papers first.  I suggest <a href="http://scholar.google.com" rel="nofollow">http://scholar.google.com</a> as a first resource.  Even Google doesn&#8217;t use LSI, at least not yet, because of its computational complexity.</p>
]]></content:encoded>
	</item>
</channel>
</rss>


