VIDEO | Dr. Andy Feng, VP Architecture at Yahoo! | “Large-Scale Machine Learning” | 2016 CSL Student Conference

The University of Illinois’ Coordinated Science Laboratory held its student-run conference last month. This video features Dr. Andy Feng, VP Architecture at Yahoo!, who leads the architecture and design of big data and machine learning initiatives. “In this talk, we illustrate Yahoo use cases and datasets, and explain the evolution of big-data technology stack.” Watch Video

50+ Open Source Tools for Big Data

Open source software tools have become all the rage, especially around big data, and that is a GOOD thing. Open source lets many players work off the same code base to build add-on tools, and it’s cheap and easy for the masses to get set up and use them. Hadoop, R, Cassandra, MongoDB, Neo4j and HBase are among the most popular, but there are many more.

I have gathered three lists that are very popular. Please let me know if you see anything missing, and I’ll attempt to create one large master list and post it on the site. Read More…

Edu-Videos | Learn All About Apache Spark (100x Faster than Hadoop MapReduce)

Apache Spark’s popularity as part of big data analytics solutions is exploding. Spark is an open-source data analytics cluster computing framework originally developed in the AMPLab at UC Berkeley. Spark fits into the Hadoop open-source community, building on top of the Hadoop Distributed File System (HDFS). However, Spark promises performance up to 100 times faster than Hadoop MapReduce for certain applications…and that’s why you should care!

Spark’s in-memory cluster computing is very well suited to machine learning algorithms. These videos will give you a nice introduction to Spark and how it’s being used in business. Watch Spark Videos…

Video | Spark Summit East 2016 | Day 2 – Keynotes

DAY 2 | Spark Summit East 2016 took place last month in NYC. Here is the Day 2 Keynotes video. It begins with Reynold Xin, Chief Architect at Databricks, presenting Real-Time and Spark.
It is followed by two other very good presentations, titled “Leveraging Spark, AWS, And Graph Analytics to Better Protect Customers” and “Data Profiling and Pipeline Processing with Spark – A Journey” (58 min). Enjoy!

Edu-Video | Spark Summit East 2016 | Keynote Speakers (Day 1)

Spark Summit East 2016 took place last week in NYC. Here is the Day 1 Keynotes video. It begins with Matei Zaharia, MIT professor, Databricks co-founder and creator of Spark, discussing the upcoming release of Spark 2.0.

It is followed by four other very knowledgeable speakers discussing subjects like ‘Democratizing Spark,’ ‘Enterprise Spark’ and ‘Spark as an Analytics OS’ (1 hr 12 min). Enjoy! Watch Video

Blog Publisher / Head of Data Science Search

Founder & Head of Data Science Search at Starbridge Partners, LLC.