When
2:45 PM Sunday
Where
VPA-115
Silicon Valley Code Camp : October 3rd and 4th 2015session

Using Apache Spark on Azure in kaggle.com Machnine Learning Challenges

Spark on Azure is a good choice for large-scale data analytics in the cloud due to ease of deployment, default feature set and integration between Azure and Spark. In this session, we will tackle a challenge from kaggle.com to demo Spark on Azure.

About This Session

Today, Microsoft Azure allows us to deploy a functional Apache Spark cluster in minutes. That means we can focus on analyzing vast amounts of information and applying machine learning algorithms on that data to develop valuable predictions. In this session, we will pick a fairly sized machine learning competition from kaggle.com, create a Spark machine learning pipeline to preprocess data and make predictions, as well as evaluate different algorithms against each other. We will also touch on several several integration points between Spark and Azure that make Spark on Azure a good choice for data analytics in the cloud.

Time: 2:45 PM Sunday    Room: VPA-115 

The Speaker(s)

undefined undefined

Eugene Chuvyrov

Sr. Cloud Architect , Microsoft

Cloud Architect at Microsoft focused on accelerating modern DevOps, Machine Learning and Blockchain.