When
9:45 AM Saturday
Where
5015
Silicon Valley Code Camp : October 5th and 6th 2013session

Big Data is a Big Deal - Running Hadoop in the cloud

unassigned

About This Session

This session is designed to give you a solid understanding of underpinnings and principles of Hadoop, perhaps the most sought after high-paying skill for a developer today. The in-depth session will begin by illustrating on how you build your own single node cluster of Hadoop on the Linux virtual machine (CentOS) for free so you can start learning immediately. The session will begin with a raw Linux VM and all the needed components will be downloaded and installed. You can be up and running quickly in the cloud within 30 minutes. This session will contain code and will show no more than a few slides. We will learn about writing the low-level map/reduce code in Java, which is really the low-level assembly language of Hadoop code. From there we will introduce more efficient approaches to analyzing big data with Hadoop, running high level queries and analyzing crime information from San Francisco as the example. We will create tables, import data, and group crime types all with a simple SQL like interface that is Hive. Finally, we will include a brief talk on PIG as well to round out the high level programming models and additional follow up materials so you can be up to speed on one of the most promising and financially rewarding skills today.

Time: 9:45 AM Saturday    Room: 5015 

The Speaker(s)

undefined undefined

Bruno Terkaly

Principal Software Engineer , Microsoft

Bruno is a Principal Software Engineer at Microsoft on the Global Technical Evangelism.