<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>aleatory &#187; machine learning</title>
	<atom:link href="http://aleatory.clientsideweb.net/category/machine-learning/feed/" rel="self" type="application/rss+xml" />
	<link>http://aleatory.clientsideweb.net</link>
	<description></description>
	<lastBuildDate>Thu, 19 Jan 2012 18:53:37 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Shifting Crates</title>
		<link>http://aleatory.clientsideweb.net/2009/02/02/shifting-crates/</link>
		<comments>http://aleatory.clientsideweb.net/2009/02/02/shifting-crates/#comments</comments>
		<pubDate>Mon, 02 Feb 2009 02:21:22 +0000</pubDate>
		<dc:creator>rutherford</dc:creator>
				<category><![CDATA[computer science]]></category>
		<category><![CDATA[cool]]></category>
		<category><![CDATA[machine learning]]></category>

		<guid isPermaLink="false">http://aleatory.clientsideweb.net/?p=75</guid>
		<description><![CDATA[Stuff like this makes me want to get back into Comp Sci]]></description>
			<content:encoded><![CDATA[<p>Stuff like <a href="http://blog.wired.com/wiredscience/2009/01/retailrobots.html">this</a> makes me want to get back into Comp Sci</p>
<p><object width="480" height="295"><param name="movie" value="http://www.youtube.com/v/Fdd6sQ8Cbe0&#038;hl=en&#038;fs=1"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/Fdd6sQ8Cbe0&#038;hl=en&#038;fs=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="480" height="295"></embed></object></p>
]]></content:encoded>
			<wfw:commentRss>http://aleatory.clientsideweb.net/2009/02/02/shifting-crates/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Further on the trend following thing</title>
		<link>http://aleatory.clientsideweb.net/2008/07/09/further-on-the-trend-following-thing/</link>
		<comments>http://aleatory.clientsideweb.net/2008/07/09/further-on-the-trend-following-thing/#comments</comments>
		<pubDate>Wed, 09 Jul 2008 17:57:23 +0000</pubDate>
		<dc:creator>rutherford</dc:creator>
				<category><![CDATA[information media]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[newsflow]]></category>

		<guid isPermaLink="false">http://aleatory.clientsideweb.net/?p=22</guid>
		<description><![CDATA[Yesterday I blogged about the use of online news aggregators such as Google News to map trends. Further surfing on the topic yielded the current state of the art in such aggregators &#8211; Silobreaker is at it&#8217;s most basic another news aggregator. But it&#8217;s added value lies in it&#8217;s ability to analyse, group &#38; visualise [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday I <a href="http://aleatory.clientsideweb.net/?p=21">blogged</a> about the use of online news aggregators such as Google News to map trends.  Further surfing on the topic yielded the current state of the art in such aggregators &#8211; <a href="http://www.silobreaker.com/">Silobreaker</a> is at it&#8217;s most basic another news aggregator.  But it&#8217;s added value lies in it&#8217;s ability to analyse, group &amp; visualise related news stories in a way much more intelligent than the likes of Google&#8217;s related story results.</p>
<p>For an example of how this fits in with my idea of a trend alerting system, see one of it&#8217;s bespoke topics &#8211; <a href="http://www.silobreaker.com/View360.aspx?Item=7_845">profit warnings</a>.  Historic data seems to back anything up to a year, and what Silobreaker calls a &#8217;360 degree&#8217; view, you get quotes, blogs, media trends as well as the news.  The trends graph can be used to compare the relative media interest in news subjects, or &#8216;entities&#8217; &#8211; a welcome nod in the direction of the Semantic Web there, with users having the ability to add entities not already recognised.</p>
<p>Further on the Semantic theme there is a network function that produces graphs of related entities.  The obligatory news to map function is also available, and extremely useful it is too &#8211; I just learned of the US airstrikes against Al Qaeda on Somali soil because of it.  As with the trend function, a time range can be specified.</p>
<p>There are always room for improvements on machine learning, and Silobreaker is no different &#8211; the above US airstrike news item was listed as of Baghdad origin, which in itself seemed fine as source of the article was a UPI journalist in Baghdad, but there was  no mention of the Somali bombing run over Somalia itself on the world map.</p>
<p>Registering gives you the ability to personalise among other things your list of news sources.  I guess what I&#8217;m saying from all this is that Silobreaker does much of the heavy lifting I&#8217;d envisage a financial trend forecasting tool to do.  What now is needed is an API to access this analysis &#8211; something I&#8217;d be hopeful for, considering their <a href="http://silobreaker.blogspot.com/2007/09/new-open-source-intelligence-and.html">background</a> in &#8216;open source&#8217; intelligence.</p>
]]></content:encoded>
			<wfw:commentRss>http://aleatory.clientsideweb.net/2008/07/09/further-on-the-trend-following-thing/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Machine Readable Google News</title>
		<link>http://aleatory.clientsideweb.net/2008/07/08/machine-readable-google-news/</link>
		<comments>http://aleatory.clientsideweb.net/2008/07/08/machine-readable-google-news/#comments</comments>
		<pubDate>Tue, 08 Jul 2008 13:07:31 +0000</pubDate>
		<dc:creator>rutherford</dc:creator>
				<category><![CDATA[computer science]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[newsflow]]></category>

		<guid isPermaLink="false">http://aleatory.clientsideweb.net/?p=21</guid>
		<description><![CDATA[Wired yesterday reports on a health site service that tracks disease outbreaks using news feeds such as Google news. A nifty bayesian-based machine learning algorithm is used, filtering out noise with some kind of intelligent phrase indexing - For instance, key words like &#8220;mysterious&#8221; tend to pop up in outbreak stories, but not, say, in [...]]]></description>
			<content:encoded><![CDATA[<p>Wired yesterday <a href="http://blog.wired.com/wiredscience/2008/07/researchers-tra.html">reports</a> on a health site service that tracks disease outbreaks using news feeds such as Google news.  A nifty bayesian-based machine learning algorithm is used, filtering out noise with some kind of intelligent phrase indexing -</p>
<blockquote><p>For instance, key words like &#8220;mysterious&#8221; tend to pop up in outbreak stories, but not, say, in coverage of vaccine programs. Another common feature of outbreak stories is a small number in the headline, usually to denote a number of people infected or killed.</p></blockquote>
<p>The site has actually been up &amp; running since 2006 as this gmaps mashup blog <a href="http://googlemapsmania.blogspot.com/2006/10/health-map-global-disease-alert.html">records</a>.</p>
<p>More detail on how it works can be found <a href="http://medicine.plosjournals.org/perlserv/?request=get-document&amp;doi=10.1371/journal.pmed.0050151&amp;ct=1">here</a>.</p>
<p>I would like to create something along these lines for financial data, with buy/sell signals replacing the gmaps visualisation.  Google News, owing to it&#8217;s concentration on news aggregation, does not currently capture stories quick enough for it to be used as part of a beat-the-market type event trading system.  It&#8217;s aggregation nature would however lend itself perfectly to a more long term trend alerting mechanism.  The smarts to be built on top of it would I imagine be pretty similar to what goes on in HealthMap above.</p>
<p>Industry talk on news flow algorithms seems to have disappeared after a bit of <a href="http://www.wallstreetandtech.com/technology-risk-management/showArticle.jhtml;jsessionid=UMJJ3ZUHPH3CEQSNDLPSKHSCJUNN2JVN?articleID=185302817&amp;_requestid=979199">buzz</a> a few years back.  It may have went the way of the Neural Networks of the 80s.</p>
]]></content:encoded>
			<wfw:commentRss>http://aleatory.clientsideweb.net/2008/07/08/machine-readable-google-news/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

