<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>bj&#039;s Blog</title>
	<atom:link href="http://bj2bj.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://bj2bj.wordpress.com</link>
	<description></description>
	<lastBuildDate>Fri, 30 Oct 2009 21:18:51 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='bj2bj.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>bj&#039;s Blog</title>
		<link>http://bj2bj.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://bj2bj.wordpress.com/osd.xml" title="bj&#039;s Blog" />
	<atom:link rel='hub' href='http://bj2bj.wordpress.com/?pushpress=hub'/>
		<item>
		<title>System for data scraping, analysis and prediction</title>
		<link>http://bj2bj.wordpress.com/2009/10/30/system-pro-sber-analyzu-a-predikci-dat/</link>
		<comments>http://bj2bj.wordpress.com/2009/10/30/system-pro-sber-analyzu-a-predikci-dat/#comments</comments>
		<pubDate>Fri, 30 Oct 2009 20:33:30 +0000</pubDate>
		<dc:creator>bjardnin</dc:creator>
				<category><![CDATA[statistics]]></category>
		<category><![CDATA[forecasting]]></category>
		<category><![CDATA[general systems theory]]></category>
		<category><![CDATA[GTS]]></category>
		<category><![CDATA[prediction]]></category>
		<category><![CDATA[regression]]></category>
		<category><![CDATA[support vector machine]]></category>
		<category><![CDATA[SVM]]></category>

		<guid isPermaLink="false">http://bj2bj.wordpress.com/?p=3</guid>
		<description><![CDATA[In this article a system is designed, that collects data from publicly available databases from the internet, performs data analysis using General Systems Theory and applying suitable methods for time series prediction based on the obtained data.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bj2bj.wordpress.com&amp;blog=10192311&amp;post=3&amp;subd=bj2bj&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In this article a system is designed, that collects data from publicly available databases from the internet, performs data analysis using General Systems Theory and applying suitable methods for time series prediction based on the obtained data.</p>
<p>Data collection is made by parsing HTML pages that are transformed to XML using software <a href="http://web-harvest.sourceforge.net">Web-harvest</a>. It uses language XSLT for parsing HTML and extraction in XML. The files in the XML format posses uniform structure. After the files are saved, they are stored in database technology <a href="http://jibx.sourceforge.net">JiBX</a> that allows to manipulate the data in XML as objects.</p>
<p>The systems of variables are decomponed and then connected in order to figure out, how the decomposition approximates the original system. System analysis is performed using standard statistic test, the <img src='http://s0.wp.com/latex.php?latex=%5Cchi%5E2&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='&#92;chi^2' title='&#92;chi^2' class='latex' /> test.</p>
<p>Prediction is performed using the methodology of General Systems Theory, another possibility is the method Support Vector Machines (SVM) for time series forecasting, see <a href="#SVM1">[1]</a> ,<a href="#SVR1">[2]</a>. Finally the predicted values are tested using t-test and F-test.</p>
<p><DL><DD><P></P><DT><A NAME="SVM1">[1]</A><DD> Lijuan Cao: <I CLASS="slanted">Support vector machines experts for time series forecasting</I>, Neurocomputing 51: 321-339, 2003.<br />
<P></P><DT><A NAME="SVR1">[2]</A><DD> Alex J. Smola and Bernhard Schölkopf: <I CLASS="slanted">A Tutorial on Support Vector Regression</I>, Statistics and Computing, 1998.</DL></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bj2bj.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bj2bj.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bj2bj.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bj2bj.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/bj2bj.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/bj2bj.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/bj2bj.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/bj2bj.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bj2bj.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bj2bj.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bj2bj.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bj2bj.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bj2bj.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bj2bj.wordpress.com/3/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bj2bj.wordpress.com&amp;blog=10192311&amp;post=3&amp;subd=bj2bj&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://bj2bj.wordpress.com/2009/10/30/system-pro-sber-analyzu-a-predikci-dat/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/775dc788018c29bfd910fd1b6e3bc63e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">bjardnin</media:title>
		</media:content>
	</item>
	</channel>
</rss>
