VIDEO | Apache Flink: A Quick Guide to the Next Generation Stream Processing Platform with Dr. Vladimir Bacvanski’s, founder of SciSpike, a company doing custom development, consulting and training. He is the author of the O’Reilly course “Introduction to Big Data”. Watch Video
“TRANSFER LEARNING” From The Harvard Business Review – using the recent Presidential Election & lack of data to illustrate Transfer Learning: “a field that helps to solve these problems by offering a set of algorithms that identify the areas of knowledge which are “transferable” to the target domain. This broader set of data can […]
Data Science Central is best online publication/blog for Data Science. Here are most popular articles and resources from 2016. Enjoy! Read More
IN THIS FOURTH EDITION of the O’Reilly Data Science Salary Survey. They analyzed input from 983 respondents working in the data space, across a variety of industries— representing 45 countries and 45 US states.
Through the results of their 64-question survey, They’ve explored which tools data scientists, analysts, and engineers use, which tasks they engage in, and of course—how much they make. READ MORE
Machine Learning is at the core of data science and we see it’s applications all over now (i.e. recommender engines, etc.). As Pedro Domingos’s Professor of Computer Science U. Washington writes in the piece, “In reality, the main purpose of machine learning is to predict the future.” It’s important to be aware of the MYTHs associated with Machine Learning. Read More
In 2013, Airbnb had a small, centralized team of five data scientists serving the data needs of the company. Since then, they have grown to become one of the largest, most innovative startup teams with over 70 data scientists now serving separate business units. In addition to setting a consistently high bar on new hires and focusing on technical mentorship from peers, the structure of the organization has been key to successful growth. Read More
Open source software tools have become all the rage, especially around big data and that is a GOOD thing. It allows for many players to work off of the same code base to build more add-on tools and it’s cheap and easy for the masses to get set up and use them. Hadoop, R, Cassandra, Mongo DB, Neo4i and HBase are among the most popular, but there are many more.
I have accumulated 3 lists that are very popular. Please let me know if you see things missing and I’ll attempt to create one large master list and post it on the site. Read More…
Next Tuesday (April 19th), Metis’s 6 Week program “Intro to Data Science” begins. It runs Tuesday and Thursday evenings from 6:30pm – 9:30pm in midtown NYC. I was at the Open House last week and will be attending.There are a couple of seats left. Love the lead instructor Sergey Fogelson. Read More