TensorFlow and High Level APIs

https://youtu.be/k5c-vg4rjBw {<} I got a chance to watch this great presentation on the upcoming release of TensorFlow v2 by Martin Wicke. He goes over the big changes - and there's a lot - plus how you can upgrade your earlier versions of...

comments

Getting Started in Data Science Part 2

I'm finally getting around to writing Part 2 of Getting Started in Data Science. The first part can be found here. I made suggestions for university students interested in the field of Data Science. I even made a video about it too.  Pick Two,...

comments

H2O AI World 2018 in London

It's been nearly a whole month since I've been back from H2O AI World 2018 in London. First off, London is always a great city. I love it. Add in H2O World and it was like a machine learning fairy tale. There were Kaggle Grandmasters, new H2O-3...

comments

Isolation Forests in H2O.ai

A new feature has been added to H2O-3 open source, isolation forests. I've always been a fan of understanding outliers and love using One Class SVM's as a method, but the isolation forests appear to be better in finding outliers, in most cases....

comments

What is Reusable Holdout?

Overfitting and introducing bias during model training is always a big topic in data science. Typically you train a model using Cross Validation by creating a model on k-1 folds and test it on the remaining one fold. This one fold is the holdout...

comments

Probability Distributions Cheat Sheet

This probability distribution cheat sheet has been making the rounds on Twitter lately and guess what? It's pretty darn valuable if you ask me. The write up explaining each different distribution is pretty good too.

comments

Using Python for Descriptive Statistics

KD Nuggets has another great article on using Python for descriptive statistics.     Descriptive Statistics are important to data science and data 'sleuthing' in general. Learn the different metrics and you're well on your way to understanding...

comments