<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Rapidminer 5.0 Video Tutorial #5 &#8211; Genetic Algorithmic Data Preprocessing Part 2</title>
	<atom:link href="http://www.neuralmarkettrends.com/2010/03/09/rapidminer-5-0-video-tutorial-5/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.neuralmarkettrends.com/2010/03/09/rapidminer-5-0-video-tutorial-5/</link>
	<description>Rapidminer Evangelism &#38; Consulting</description>
	<lastBuildDate>Fri, 10 Feb 2012 14:59:20 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Tom</title>
		<link>http://www.neuralmarkettrends.com/2010/03/09/rapidminer-5-0-video-tutorial-5/comment-page-1/#comment-3204</link>
		<dc:creator>Tom</dc:creator>
		<pubDate>Mon, 06 Sep 2010 12:40:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.neuralmarkettrends.com/?p=2191#comment-3204</guid>
		<description>@Seyhan: Perhaps you can ask this question in the &lt;a href=&quot;http://forums.neuralmarkettrends.com&quot; rel=&quot;nofollow&quot;&gt;forums&lt;/a&gt;?</description>
		<content:encoded><![CDATA[<p>@Seyhan: Perhaps you can ask this question in the <a href="http://forums.neuralmarkettrends.com" rel="nofollow">forums</a>?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Seyhan</title>
		<link>http://www.neuralmarkettrends.com/2010/03/09/rapidminer-5-0-video-tutorial-5/comment-page-1/#comment-3203</link>
		<dc:creator>Seyhan</dc:creator>
		<pubDate>Mon, 06 Sep 2010 02:13:46 +0000</pubDate>
		<guid isPermaLink="false">http://www.neuralmarkettrends.com/?p=2191#comment-3203</guid>
		<description>Hi,

I really like the blog. Thanks for sharing your knowledge of Rapidminer.

I have a huge problem scoring unlabeled data with Rapidminer using 10-fold cross-validation for two-class classification.

I use 10-fold cross-validation for model training &amp; testing using LibSVM in Rapidminer. It gives me 86% classification accuracy on testing. Everything is fine up to this point.

But when I apply the model to the score dataset with unlabeled data to predict its classification, the model assigns every observation to a single class, the one with the highest frequency in the training dataset.

I checked the confidence probabilities of each observation in the score dataset; they are all the same (0.36 for No and 0.64 for Yes).

Could you please advise me where the problem is, or share a sample process if you have one?

Is there any option where I can manipulate the confidences?

I use Rapidminer 5 and have also looked at the Rapidminer scoring video tutorial, but it only shows training and scoring. It does not show training, testing, and scoring.

Thanks in advance.

Seyhan</description>
		<content:encoded><![CDATA[<p>Hi,</p>
<p>I really like the blog. Thanks for sharing your knowledge of Rapidminer.</p>
<p>I have a huge problem scoring unlabeled data with Rapidminer using 10-fold cross-validation for two-class classification.</p>
<p>I use 10-fold cross-validation for model training &amp; testing using LibSVM in Rapidminer. It gives me 86% classification accuracy on testing. Everything is fine up to this point.</p>
<p>But when I apply the model to the score dataset with unlabeled data to predict its classification, the model assigns every observation to a single class, the one with the highest frequency in the training dataset.</p>
<p>I checked the confidence probabilities of each observation in the score dataset; they are all the same (0.36 for No and 0.64 for Yes).</p>
<p>Could you please advise me where the problem is, or share a sample process if you have one?</p>
<p>Is there any option where I can manipulate the confidences?</p>
<p>I use Rapidminer 5 and have also looked at the Rapidminer scoring video tutorial, but it only shows training and scoring. It does not show training, testing, and scoring.</p>
<p>Thanks in advance.</p>
<p>Seyhan</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tom</title>
		<link>http://www.neuralmarkettrends.com/2010/03/09/rapidminer-5-0-video-tutorial-5/comment-page-1/#comment-3184</link>
		<dc:creator>Tom</dc:creator>
		<pubDate>Fri, 27 Aug 2010 15:35:02 +0000</pubDate>
		<guid isPermaLink="false">http://www.neuralmarkettrends.com/?p=2191#comment-3184</guid>
		<description>Hmm, without seeing the whole experiment process and data, I can only offer this suggestion. Once the optimization has finished, write the selected variables to a file and then create a new input file with those variables. Then train and save your model on the &quot;short list&quot; of variables and go through the prediction process as you described. Let me know how it turns out.</description>
		<content:encoded><![CDATA[<p>Hmm, without seeing the whole experiment process and data, I can only offer this suggestion. Once the optimization has finished, write the selected variables to a file and then create a new input file with those variables. Then train and save your model on the &#8220;short list&#8221; of variables and go through the prediction process as you described. Let me know how it turns out.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pathros</title>
		<link>http://www.neuralmarkettrends.com/2010/03/09/rapidminer-5-0-video-tutorial-5/comment-page-1/#comment-3174</link>
		<dc:creator>Pathros</dc:creator>
		<pubDate>Tue, 17 Aug 2010 22:45:52 +0000</pubDate>
		<guid isPermaLink="false">http://www.neuralmarkettrends.com/?p=2191#comment-3174</guid>
		<description>Hello, Tom!
When executing my process using Optimize Selection (Evolutionary), I save the model using &quot;write model&quot;, and when the process finishes, the last saved model is supposed to be the best one.
When I want to apply the model again, I have problems trying to apply it to new data. Rapidminer says that the mapping of certain variables is wrong. It confuses me even more when it says that certain variables are not included (but those were discarded by the optimizer!).

From 60 variables, the process chose 30 as the best ones that help explain the model. So I read the new data with &quot;read csv&quot;; it contains the 30 variables plus the ID variable, but not the label variable (which is the one that I want to predict).

I have problems trying to apply this model. Do you have any ideas that could help me achieve this?

Thanks.</description>
		<content:encoded><![CDATA[<p>Hello, Tom!<br />
When executing my process using Optimize Selection (Evolutionary), I save the model using &#8220;write model&#8221;, and when the process finishes, the last saved model is supposed to be the best one.<br />
When I want to apply the model again, I have problems trying to apply it to new data. Rapidminer says that the mapping of certain variables is wrong. It confuses me even more when it says that certain variables are not included (but those were discarded by the optimizer!).</p>
<p>From 60 variables, the process chose 30 as the best ones that help explain the model. So I read the new data with &#8220;read csv&#8221;; it contains the 30 variables plus the ID variable, but not the label variable (which is the one that I want to predict).</p>
<p>I have problems trying to apply this model. Do you have any ideas that could help me achieve this?</p>
<p>Thanks.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Calastro</title>
		<link>http://www.neuralmarkettrends.com/2010/03/09/rapidminer-5-0-video-tutorial-5/comment-page-1/#comment-2949</link>
		<dc:creator>Calastro</dc:creator>
		<pubDate>Thu, 18 Mar 2010 01:42:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.neuralmarkettrends.com/?p=2191#comment-2949</guid>
		<description>Thanks, Tom!
I&#039;ll keep visiting your blog and learning more about RM!</description>
		<content:encoded><![CDATA[<p>Thanks, Tom!<br />
I&#39;ll keep visiting your blog and learning more about RM!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tom</title>
		<link>http://www.neuralmarkettrends.com/2010/03/09/rapidminer-5-0-video-tutorial-5/comment-page-1/#comment-2938</link>
		<dc:creator>Tom</dc:creator>
		<pubDate>Tue, 16 Mar 2010 10:49:56 +0000</pubDate>
		<guid isPermaLink="false">http://www.neuralmarkettrends.com/?p=2191#comment-2938</guid>
		<description>c1borg: You could place it there and it should work too, but I rarely use the model writer now. I just create a prediction experiment at the same time and connect the &quot;mod&quot; node to it, so the model learns and predicts when it&#039;s done.</description>
		<content:encoded><![CDATA[<p>c1borg: You could place it there and it should work too, but I rarely use the model writer now. I just create a prediction experiment at the same time and connect the &quot;mod&quot; node to it, so the model learns and predicts when it&#39;s done.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: c1borg</title>
		<link>http://www.neuralmarkettrends.com/2010/03/09/rapidminer-5-0-video-tutorial-5/comment-page-1/#comment-2937</link>
		<dc:creator>c1borg</dc:creator>
		<pubDate>Tue, 16 Mar 2010 05:50:11 +0000</pubDate>
		<guid isPermaLink="false">http://www.neuralmarkettrends.com/?p=2191#comment-2937</guid>
		<description>OK, thanks for that. This is what I tried before and discarded, as I thought the model file is overwritten many times and that this cannot be correct. However, I guess you&#039;re saying the last write to the file will be the best result. Why would it not be correct to attach it to the mod output of the validator?</description>
		<content:encoded><![CDATA[<p>OK, thanks for that. This is what I tried before and discarded, as I thought the model file is overwritten many times and that this cannot be correct. However, I guess you&#39;re saying the last write to the file will be the best result. Why would it not be correct to attach it to the mod output of the validator?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tom</title>
		<link>http://www.neuralmarkettrends.com/2010/03/09/rapidminer-5-0-video-tutorial-5/comment-page-1/#comment-2936</link>
		<dc:creator>Tom</dc:creator>
		<pubDate>Tue, 16 Mar 2010 01:38:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.neuralmarkettrends.com/?p=2191#comment-2936</guid>
		<description>@c1borg: Attach the model writer operator to the &quot;mod&quot; node on the Apply Model operator in the testing section of the Split Validation operator. Make sure you give your mod a name or else it will give you errors. See if that helps.</description>
		<content:encoded><![CDATA[<p>@c1borg: Attach the model writer operator to the &quot;mod&quot; node on the Apply Model operator in the testing section of the Split Validation operator. Make sure you give your mod a name or else it will give you errors. See if that helps.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: c1borg</title>
		<link>http://www.neuralmarkettrends.com/2010/03/09/rapidminer-5-0-video-tutorial-5/comment-page-1/#comment-2935</link>
		<dc:creator>c1borg</dc:creator>
		<pubDate>Mon, 15 Mar 2010 21:10:56 +0000</pubDate>
		<guid isPermaLink="false">http://www.neuralmarkettrends.com/?p=2191#comment-2935</guid>
		<description>Many thanks for the videos so far; I have an 80% prediction using genetic optimisation. If you don&#039;t mind, I have a question: where do I put the model writer in the experiment? I would assume it goes after the evaluator, since if it goes in the testing section the file is constantly overwritten as each generation of results is tried. However, I get errors if I try to put the model writer in this position in the experiment.
Many thanks in advance, and I can&#039;t wait for the remaining 4 videos.</description>
		<content:encoded><![CDATA[<p>Many thanks for the videos so far; I have an 80% prediction using genetic optimisation. If you don&#39;t mind, I have a question: where do I put the model writer in the experiment? I would assume it goes after the evaluator, since if it goes in the testing section the file is constantly overwritten as each generation of results is tried. However, I get errors if I try to put the model writer in this position in the experiment.<br />
Many thanks in advance, and I can&#39;t wait for the remaining 4 videos.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tom</title>
		<link>http://www.neuralmarkettrends.com/2010/03/09/rapidminer-5-0-video-tutorial-5/comment-page-1/#comment-2933</link>
		<dc:creator>Tom</dc:creator>
		<pubDate>Mon, 15 Mar 2010 09:28:19 +0000</pubDate>
		<guid isPermaLink="false">http://www.neuralmarkettrends.com/?p=2191#comment-2933</guid>
		<description>Calastro: It sounds like you want to create a value like a &quot;credit score.&quot; That&#039;s most likely a formula that you&#039;ll have to create yourself or use the formula results writer in RM. You could use a Bayesian learner to find out how often a particular variable shows up in your data space (assuming each entry is independent).</description>
		<content:encoded><![CDATA[<p>Calastro: It sounds like you want to create a value like a &quot;credit score.&quot; That&#39;s most likely a formula that you&#39;ll have to create yourself or use the formula results writer in RM. You could use a Bayesian learner to find out how often a particular variable shows up in your data space (assuming each entry is independent).</p>
]]></content:encoded>
	</item>
</channel>
</rss>

