RapidMiner released its Web Mining Extension on the Marketplace. It’s super easy to install with RapidMiner Studio. Just go to Extensions > Marketplace (Updates/Extensions) and search for Web Mining.
Select the Extension and then accept the Terms and Conditions. RapidMiner will then have to restart and you should see the latest set of operators in the Extension folder of your Operators.
Web Mining Extension Operators
Here’s what you get with the extension: a web crawler, single and multiple page extraction, scraping text out of HTML tags, and much more. My favorite operator is the Enrich by WebService operator, which I use quite a bit for mashing up geolocation data (see my Tutorials on this).
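If you’re wondering what “scraping text out of HTML tags” boils down to, here’s a rough sketch in plain Python. To be clear, this is not how the extension works under the hood; it’s just my own illustration using the standard library’s html.parser. The class name TagTextExtractor, the helper extract_text, and the sample HTML are all made up for this example.

```python
from html.parser import HTMLParser


class TagTextExtractor(HTMLParser):
    """Collect the text found inside every occurrence of one tag (hypothetical helper)."""

    def __init__(self, tag):
        super().__init__()
        self.tag = tag
        self.depth = 0      # greater than 0 while we are inside the target tag
        self.chunks = []    # text pieces of the element currently being read
        self.results = []   # one entry per completed element

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == self.tag and self.depth > 0:
            self.depth -= 1
            if self.depth == 0:
                # Element finished: join its text pieces and store them.
                self.results.append("".join(self.chunks).strip())
                self.chunks = []

    def handle_data(self, data):
        # Keep text only while inside the target tag (nested tags included).
        if self.depth > 0:
            self.chunks.append(data)


def extract_text(html, tag):
    """Return the text of every <tag> element in the given HTML string."""
    parser = TagTextExtractor(tag)
    parser.feed(html)
    parser.close()
    return parser.results


# Made-up sample page, just for illustration.
sample = (
    "<html><body><h1>Title</h1>"
    "<p>First <b>bold</b> note.</p><p>Second.</p>"
    "</body></html>"
)
print(extract_text(sample, "p"))
```

Text inside nested tags (like the bold word above) is kept as part of the surrounding paragraph, which is usually what you want when turning pages into documents for text mining.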
It’s ALMOST here: the R extension in RapidMiner is just one more week away!!!
If you want a sneak peek of it, check out this intro video by Ralf Klinkenberg on the R extension.
With the new GUI in 5.X and now this extension, RapidMiner will blow the doors off any other data modeling suite in 2011!
RapidMiner and R Update
This post is incredibly old and the R extension in RapidMiner has been greatly overhauled. I would suggest checking out the updated video below on how to use the new Execute R script operator with RapidMiner.
After a very long hiatus, I present to you my introduction to RapidMiner tutorial. This video is for RapidMiner version 5.0, and it’s just a quick 10-minute introduction to the GUI and data import functions. You’re gonna like the way it looks!