Text Mining Annual Reports
I’m playing around with Rapidminer’s powerful text mining tools to dig through annual reports this evening and I’m making progress. Â Rapidminer can text mine all sorts of formats but the operators are still a bit tough to use if you don’t know what you’re doing, like me! Â Still, I did pick up a thing or two at RCOMM and I’m putting that to good use.
For tonight I decided to mine through the annual reports of $CSCO, $XOM, $INTC, $AMD, and $BP. Â Granted, these stocks are in threeÂ differentÂ industry groups but I’m just poking around to see how they use buzz words like “sustainability” and “greenhouse.” It’s all rather fun and silly, but wait till I post about my Twitter mining experiment. Â LOL.
(Note: AMD never used it but BP did the most)
(Note: AMD never used it but BP did the most)Don't forget to sign up for our monthly newsletter on Data Science and RapidMiner here!