<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Principal Component Analysis: The Lovecraft Experiment (part 1)</title>
	<atom:link href="http://mentatjack.com/2009/07/18/principal-component-analysis-the-lovecraft-experiment-part-1/feed/" rel="self" type="application/rss+xml" />
	<link>http://mentatjack.com/2009/07/18/principal-component-analysis-the-lovecraft-experiment-part-1/</link>
	<description>sff book reviews and subversive ontology</description>
	<lastBuildDate>Sun, 05 Feb 2012 20:36:10 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: mentatjack</title>
		<link>http://mentatjack.com/2009/07/18/principal-component-analysis-the-lovecraft-experiment-part-1/#comment-905</link>
		<dc:creator><![CDATA[mentatjack]]></dc:creator>
		<pubDate>Tue, 06 Oct 2009 02:16:00 +0000</pubDate>
		<guid isPermaLink="false">http://mentatjack.com/?p=973#comment-905</guid>
		<description><![CDATA[The matrix that I start from has one row per book and one column per tag.  From that compute the covariance matrix and eventually the principal components.  The plot is the first and second principal components mapped against each other.  The third-fifth principal components contribute respectively to the RGB values of the point.

I divided these into &quot;plots&quot; based on a &quot;pivot tag&quot; as displaying 65,000 books proved a bit unmanageable.  That&#039;s where the &quot;Master Cloud&quot; came from.

I&#039;ve made &lt;a href=&quot;http://mentatjack.com/tag/tagshadow/&quot; rel=&quot;nofollow&quot;&gt;numerous other posts&lt;/a&gt; on this blog related to the TagShadow project and I&#039;ve also started collecting my thoughts on the &lt;a href=&quot;http://tagshadow.com/forum&quot; rel=&quot;nofollow&quot;&gt;TagShadow forum&lt;/a&gt;.

It&#039;s a pleasure to converse with someone who understands this (probably more than me).  I&#039;m by no means an expert on any sort matrix based analysis, be it PCA or MDS, but it&#039;s been much fun plugging away at it and getting a chance to stretch my Java development muscles.]]></description>
		<content:encoded><![CDATA[<p>The matrix that I start from has one row per book and one column per tag.  From that compute the covariance matrix and eventually the principal components.  The plot is the first and second principal components mapped against each other.  The third-fifth principal components contribute respectively to the RGB values of the point.</p>
<p>I divided these into &#8220;plots&#8221; based on a &#8220;pivot tag&#8221; as displaying 65,000 books proved a bit unmanageable.  That&#8217;s where the &#8220;Master Cloud&#8221; came from.</p>
<p>I&#8217;ve made <a href="http://mentatjack.com/tag/tagshadow/" rel="nofollow">numerous other posts</a> on this blog related to the TagShadow project and I&#8217;ve also started collecting my thoughts on the <a href="http://tagshadow.com/forum" rel="nofollow">TagShadow forum</a>.</p>
<p>It&#8217;s a pleasure to converse with someone who understands this (probably more than me).  I&#8217;m by no means an expert on any sort matrix based analysis, be it PCA or MDS, but it&#8217;s been much fun plugging away at it and getting a chance to stretch my Java development muscles.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: James</title>
		<link>http://mentatjack.com/2009/07/18/principal-component-analysis-the-lovecraft-experiment-part-1/#comment-904</link>
		<dc:creator><![CDATA[James]]></dc:creator>
		<pubDate>Tue, 06 Oct 2009 02:03:15 +0000</pubDate>
		<guid isPermaLink="false">http://mentatjack.com/?p=973#comment-904</guid>
		<description><![CDATA[The TagShadow mapes look very interesting. Are the positions calculated with PCA? It would also be interesting create a 
2D map for these 1500 tags to see clusters among the tags. To create such map you just need to transpose tag table.]]></description>
		<content:encoded><![CDATA[<p>The TagShadow mapes look very interesting. Are the positions calculated with PCA? It would also be interesting create a<br />
2D map for these 1500 tags to see clusters among the tags. To create such map you just need to transpose tag table.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: James</title>
		<link>http://mentatjack.com/2009/07/18/principal-component-analysis-the-lovecraft-experiment-part-1/#comment-903</link>
		<dc:creator><![CDATA[James]]></dc:creator>
		<pubDate>Tue, 06 Oct 2009 01:46:12 +0000</pubDate>
		<guid isPermaLink="false">http://mentatjack.com/?p=973#comment-903</guid>
		<description><![CDATA[Most MDS methods operate on the distance matrix, similar
as PCA operates on covariance matrix. There should be no
problem with high dimensionality, at least not more so than PCA.
The number of data points is rather a limiting factor, at it 
decides the dimension of the distance matrix.
Some more recent developements in MDS run under term &quot;manifold learning&quot;, you can find more interesting stuff through google.
Anywhere, I am lookking forward to your part2 and part3 
blogs about PCA.]]></description>
		<content:encoded><![CDATA[<p>Most MDS methods operate on the distance matrix, similar<br />
as PCA operates on covariance matrix. There should be no<br />
problem with high dimensionality, at least not more so than PCA.<br />
The number of data points is rather a limiting factor, at it<br />
decides the dimension of the distance matrix.<br />
Some more recent developements in MDS run under term &#8220;manifold learning&#8221;, you can find more interesting stuff through google.<br />
Anywhere, I am lookking forward to your part2 and part3<br />
blogs about PCA.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mentatjack</title>
		<link>http://mentatjack.com/2009/07/18/principal-component-analysis-the-lovecraft-experiment-part-1/#comment-902</link>
		<dc:creator><![CDATA[mentatjack]]></dc:creator>
		<pubDate>Mon, 05 Oct 2009 23:10:03 +0000</pubDate>
		<guid isPermaLink="false">http://mentatjack.com/?p=973#comment-902</guid>
		<description><![CDATA[Thanks for the heads up!

Just a quick glance at the &lt;a href=&quot;http://en.wikipedia.org/wiki/Multidimensional_scaling&quot; rel=&quot;nofollow&quot;&gt;MDS Wikipedia Entry&lt;/a&gt; states that this is rather cumbersome for more than 20 variables.

My current &lt;a href=&quot;http://TagShadow.com/amazon/MasterCloud.html&quot; rel=&quot;nofollow&quot;&gt;TagShadow Prototype&lt;/a&gt; uses 1500 variables and PCA seems to handle it fairly well.  I&#039;ll probably explore the results of this particular analysis for awhile before starting from scratch with a new one.

I&#039;d love your input on the project.]]></description>
		<content:encoded><![CDATA[<p>Thanks for the heads up!</p>
<p>Just a quick glance at the <a href="http://en.wikipedia.org/wiki/Multidimensional_scaling" rel="nofollow">MDS Wikipedia Entry</a> states that this is rather cumbersome for more than 20 variables.</p>
<p>My current <a href="http://TagShadow.com/amazon/MasterCloud.html" rel="nofollow">TagShadow Prototype</a> uses 1500 variables and PCA seems to handle it fairly well.  I&#8217;ll probably explore the results of this particular analysis for awhile before starting from scratch with a new one.</p>
<p>I&#8217;d love your input on the project.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: James</title>
		<link>http://mentatjack.com/2009/07/18/principal-component-analysis-the-lovecraft-experiment-part-1/#comment-901</link>
		<dc:creator><![CDATA[James]]></dc:creator>
		<pubDate>Mon, 05 Oct 2009 21:34:23 +0000</pubDate>
		<guid isPermaLink="false">http://mentatjack.com/?p=973#comment-901</guid>
		<description><![CDATA[You might be interested in looking into methods termed as 
&quot;multidimensional scaling(MDS)&quot; which seem to address 
the same problem as yours. PCA is just one of linear MDS methods.]]></description>
		<content:encoded><![CDATA[<p>You might be interested in looking into methods termed as<br />
&#8220;multidimensional scaling(MDS)&#8221; which seem to address<br />
the same problem as yours. PCA is just one of linear MDS methods.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pricinipal Component Analysis: The Java Iteration &#171; MentatJack</title>
		<link>http://mentatjack.com/2009/07/18/principal-component-analysis-the-lovecraft-experiment-part-1/#comment-771</link>
		<dc:creator><![CDATA[Pricinipal Component Analysis: The Java Iteration &#171; MentatJack]]></dc:creator>
		<pubDate>Sat, 12 Sep 2009 19:32:00 +0000</pubDate>
		<guid isPermaLink="false">http://mentatjack.com/?p=973#comment-771</guid>
		<description><![CDATA[[...] Component Analysis: The Java&#160;Iteration  Much more satisfying than my earlier attempt to do this by hand, I&#8217;m actually building the tools that will eventually power TagShadow. [...]]]></description>
		<content:encoded><![CDATA[<p>[...] Component Analysis: The Java&nbsp;Iteration  Much more satisfying than my earlier attempt to do this by hand, I&#8217;m actually building the tools that will eventually power TagShadow. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tagshadow: Jama, Weka and Scatter Plots &#171; MentatJack</title>
		<link>http://mentatjack.com/2009/07/18/principal-component-analysis-the-lovecraft-experiment-part-1/#comment-769</link>
		<dc:creator><![CDATA[Tagshadow: Jama, Weka and Scatter Plots &#171; MentatJack]]></dc:creator>
		<pubDate>Fri, 11 Sep 2009 16:53:15 +0000</pubDate>
		<guid isPermaLink="false">http://mentatjack.com/?p=973#comment-769</guid>
		<description><![CDATA[[...] some progress with Jama, which allows me to represent and manipulate matrices in Java. I ran my lovecraft example data through the paces of SVD, QR, LU, and such, but the real test will be when I try and manipulate the [...]]]></description>
		<content:encoded><![CDATA[<p>[...] some progress with Jama, which allows me to represent and manipulate matrices in Java. I ran my lovecraft example data through the paces of SVD, QR, LU, and such, but the real test will be when I try and manipulate the [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>

