Data Science

TensorFlow and High Level APIs
Mar 16, 2019 | TensorFlow & Data Science & machine learning
TensorFlow: I got a chance to watch this great presentation on the upcoming release of TensorFlow v2 by Martin Wicke. He goes over the big ...

Getting Started in Data Science Part 2
Dec 5, 2018 | Data Science & machine learning
I’m finally getting around to writing Part 2 of Getting Started in Data Science. The first part can be found here. I made suggestions for ...

H2O AI World 2018 in London
Nov 21, 2018 | London & H2O.ai & Data Science & social
It’s been nearly a whole month since I’ve been back from H2O AI World 2018 in London. First off, London is always a great city. I love it. Add in ...

Isolation Forests in H2O.ai
Nov 17, 2018 | Data Science & H2O.ai & machine learning
A new feature has been added to H2O-3 open source: isolation forests. I’ve always been a fan of understanding outliers and love using One Class ...

What is Reusable Holdout?
Nov 2, 2018 | Data Science & Model Training & machine learning
Overfitting and introducing bias during model training is always a big topic in data science. Typically you train a model using Cross Validation by ...

Probability Distributions Cheat Sheet
Aug 13, 2018 | Data Science & machine learning
This probability distribution cheat sheet has been making the rounds on Twitter lately and guess what? It’s pretty darn valuable if you ask me.

Using Python for Descriptive Statistics
Aug 6, 2018 | Data Science & Python & machine learning
KD Nuggets has another great article on using Python for descriptive statistics. Descriptive Statistics: Descriptive Statistics are ...

Productionalizing RapidMiner and Python on RapidMiner Server
Jul 24, 2018 | Data Science & RapidMiner Server & Python & machine learning
Continuing my RapidMiner Server series. In this video I show you how to save a RapidMiner Studio process to RapidMiner Server.
Then configure ...

Introduction to RapidMiner Server
Jul 24, 2018 | Data Science & RapidMiner & Tutorials & machine learning
I made a new video on RapidMiner Server! This is just a high-level overview of the Web GUI and how to navigate through it. In future videos I’ll be ...

WTF is a Tensor?
May 8, 2018 | Tensor & Data Science & Math & machine learning
If you ever wanted to know what a ‘Tensor’ was, this article is for you. Basically: A tensor is a container which can house data in N dimensions, ...

Word2Vec Example Process in RapidMiner
May 3, 2018 | RapidMiner & Data Science & Word2Vec & Twitter & machine learning
This is an example process of how to use Word2Vec in RapidMiner with the Search Twitter operator. For more information check out this post on the ...

Learn RapidMiner Livestream Volume 1
May 2, 2018 | RapidMiner & Data Science & machine learning
I had my first YouTube LiveStream on how to use RapidMiner. It’s about 48 minutes long and I do a GUI overview and do some text mining. I even ...

Learn Javascript! Really?
May 1, 2018 | Javascript & Data Science & social
I found Quincy Larson’s article on learning Javascript as your first computer language both interesting and funny. He makes his point for learning ...

Getting Started in Data Science - Part 1
Apr 25, 2018 | Data Science & RapidMiner & Python & machine learning
This is the foreword to an introduction on getting started in data science. I wanted to write a set of ‘getting started’ posts to share with readers ...

Data Science and Machine Learning Education
Apr 16, 2018 | Data Science & Google & TensorFlow & machine learning
When I first taught myself ‘data science,’ there wasn’t a lot on the Internet to help me.
I spent years cobbling information together reading ...

Orange 3 is impressive
Oct 7, 2017 | Machine Learning & Data Science & machine learning
I’ve been keeping a lazy eye on Orange over the years and its (fairly) recent update has made it quite an impressive contender in the Data Science ...

Python overtakes R for Data Science
Sep 7, 2017 | Python & R & Data Science & social
This is really big news from KD Nuggets.

Product Qualified Lead Model - The Review
Aug 25, 2017 | PQL & Data Science & thoughts
One of the big corporate strategy things I worked on was developing and putting into production a PQL model. It was essentially a propensity to buy ...

Keras and NLTK
Aug 15, 2017 | Data Science & Deep Learning & Keras & NLTK & Python & Text Analytics & Text Mining & Thoughts & machine learning
I’ve been doing a lot more Python hacking, especially around text mining and using the deep learning library Keras and NLTK. Normally I’d do most of ...

Is it Possible to Automate Data Science?
Jul 10, 2017 | Automation & Data Science & RapidMiner & Thoughts & machine learning
A few months ago I read about a programmer who automated his job down to the point where the coffee machine would make him lattes! Despite the ...

RapidMiner Training 2017
Jun 18, 2017 | Training & Data Science & RapidMiner & social
I just capped off a 4-day RapidMiner training class in NYC last week. It was a ton of fun and I got to meet lots of super smart and cool people.

Chat with Traders - Episode 121
May 31, 2017 | Data Science & social
Maoxian has been posting his notes on some trader interviews he’s been listening to. The topic is about the quality of decision making in the ...

Labeling Training Data Correctly
May 5, 2017 | Machine Learning & Data Science & AI & machine learning
I recently listened to a great O’Reilly podcast on this subject. They interviewed Lukas Biewald, Chief Data Scientist and Founder of CrowdFlower.
The Sudden Interest in Data Science Platforms
Apr 20, 2017 | Data Science & machine learning
Gartner Hype Cycle: For years we’ve been hearing how Big Data will unlock all kinds of insights in a corporation’s data. Everyone raced to stand up ...

Latest Writing Elsewhere December 2016
Jan 3, 2017 | Blogging & Data Science & RapidMiner & Writing & social
It’s hard to believe but 2016 is over. Here’s a list of my writings elsewhere in December 2016. I’m also including some RapidMiner community ...

Rebuilding a Blog - Part 4
Oct 4, 2016 | Content Marketing & Internet & Data Science & RapidMiner & social
This post on rebuilding a blog is a continuation of my previous post. In this post I wanted to review some goals I created in Google Analytics. These ...

Latest Writings Elsewhere for September 2016
Sep 29, 2016 | Content Marketing & Internet & Data Science & RapidMiner & Tutorials & social
Just a quick list of the content I’ve created some place other than this blog. This current list is 100% RapidMiner related but I’d like to branch ...

Rebuilding a Blog - Part 3
Sep 6, 2016 | Content Marketing & Internet & Data Science & RapidMiner & social
This post on rebuilding a blog is a continuation of my previous post. My new SEO strategy is starting to pay off and I owe it all to tweaking the ...

Rebuilding a Blog - Part 2
Aug 5, 2016 | Content Marketing & Internet & Data Science & RapidMiner & social
What’s not surprising is that my readers come from all over the world. The map below shows where they came from in the first half of 2016. The ...