Radoop Training in Germany

I spent last week in Dortmund at the Rapidminer office - the place where it all started - to take part in Radoop training. It was a mind blowing experience and simply put, Radoop makes Big Data easy. Why? Radoop adheres to Rapidminer’s promise to Accelerate, Connect, and Simplfy data and analytics.

Radoop is an abstraction layer on top of Hadoop that takes all the heavy coding and configuration out of it. Sure you can write Pig Script if you want to in Radoop, but do you really want to spend time writing Hive SQL? The Radoop operators translate the operations in Hive SQL and then sends it down to the cluster, all within the friendly Rapidminer GUI environment. It just sits on top of Pig, Hive, Map Reduce, YARN, and Impala, and once Spark is integrated by the end of the year, it will sit on top of that too.

We also had some intense discussions on the integration of Kerberos and other security layers, which is a topic that keeps coming up more and more in my discussions with companies. Fortunately for us, Kerberos integration is almost done! On the last day of training we spun up a Hadoop cluster on Amazon Web Services (AWS) and hooked Radoop into it. This step was a bit more involved, but within an hour, we were up and connected to AWS! For lack of better words, it was damn cool, and I can’t want to spin up my own Hadoop cluster on AWS.

So I leave with this quick posts with some photos from the Rapidminer Dortmund office and Radoop training.

Here’s Frank and Marius hard at work. They were kind enough to let me sit in their office while I was there.

Frank and Marius working away
Here’s Peter telling us about MapReduce, YARN, and Spark!
Finally, Peter is guiding us through the final configuration of integrating a Hadoop cluster with Radoop on Amazon (AWS)!
Up next Twython and the Rise of Something For the longest time I had a goal to automate Neural Market Trends with financial innovation(s) and broadcast the results to my readers. I wanted Photos from Germany
Latest posts The Ye Old Blog List Motorola: Then and Now EWM Redux Testing for mean reversion with Python & developing simple VIX system - Talaikis unsorted - Tadas Talaikis Blog Steps to calculate centroids in cluster using K-means clustering algorithm - Data Science Central Basics of Statistical Mean Reversion Testing - QuantStart Algorithmic trading in less than 100 lines of Python code - O’Reilly Media Interpreting Machine Learning Models Microsoft the AI Powerhouse Investing in the S&P500 still beats AI Trading Microsoft makes a push to simplify machine learning | TechCrunch 10 Great Articles On Python Development — Hacker Noon Introduction to Keras Democratising Machine learning with H2O — Towards Data Science Getting started with Python datatable | Kaggle Phone Addiction Version 12 Launches Today! Machine Learning Making Pesto Tastier 5 Dangerous Things You Should Let Your Kids Do The Pyschology of Writing Investing in 2019 and beyond TensorFlow and High Level APIs Driving Marketing Performance with H2O Driverless AI Machine Learning and Data Munging in H2O Driverless AI with datatable Making AI Happen Without Getting Fired Latest Musings from a Traveling Sales Engineer The Night before H2O World 2019 Why Forex Trading is Frustrating Functional Programming in Python Automatic Feature Engineering with Driverless AI Ray Dalio's Pure Alpha Fund