Kaggle is a community of almost 450K data scientists who have built nearly 2 million machine learning models to participate in its competitions. Data scientists come to Kaggle to learn, collaborate and develop the state of the art in machine learning. This talk will cover some of the lessons on winning techniques we have learned from the Kaggle community. Watch Video
VIDEO PODCAST | In Episode 5 of this podcast series by Renee Teate of “Becoming a Data Scientist”, she interviews Clare Corthell, founding partner of summer.ai and creator of the Open Source Data Science Masters curriculum, about becoming a data scientist. Read More
Video (33:41) with Databricks co-founder and CTO Matei Zaharia presenting the changes in Apache Spark 2.0 and the general availability (GA) of Databricks Community Edition at Spark Summit 2016. Afterwards, Michael Armbrust demos some new features found in Spark 2.0 on Databricks Community EditionGo To Video
(Re-post) Got a need for speed processing Big Data? In this video talk given at the Apache Flink Meetup in NYC, Slim Baltagi goes over everything you need to know right now about Flink. If you utilize big data analytics, then this is a must watch video!! Enjoy Learn More
Apache Spark’s popularity as part of big data analytics solutions is exploding. Spark is an open-source data analytics cluster computing framework originally developed in the AMPLab at UC Berkeley. Spark fits into the Hadoop open-source community, building on top of the Hadoop Distributed File System (HDFS). However, Spark promises performance up to 100 times faster than Hadoop MapReduce for certain applications…and that’s why you should care!
Spark’s in-memory cluster computing is very well suited to machine learning algorithms. These Videos will give you a nice introduction to Spark, how it’s being used in business and why you should care…Watch Spark Videos…