[Cisl-comunidade] SouJava April Technical Meetup [Cloudera Hadoop]
Otávio Gonçalves de Santana
otaviopolianasantana at gmail.com
Thu Mar 27 22:41:46 BRT 2014
Hadoop is a distributed platform written in Java for processing large
volumes of data on clusters; it was inspired by Google's MapReduce. On
Saturday morning, April 12, Cloudera <http://www.cloudera.com/> will hold a
technical meetup together with SouJava to present this tool, which is
widely used in the Big Data era, and to answer questions about it.
- *Venue:* Globalcode São Paulo <http://www.globalcode.com.br/> / Online
- *Address:* Avenida Bernardino de Campos, 327, Paraíso - São Paulo -
SP, 04004-050
- *Date:* April 12, 2014 (Saturday)
- *Time:* 9:00
- *Registration:* http://goo.gl/Bm4BW8
Topics:
1. Hadoop <http://hadoop.apache.org/>
2. Crunch <http://crunch.apache.org/>
3. Spark <http://spark.apache.org/>
- *Talk:* Introduction to Apache Hadoop - HDFS and Map/Reduce
Fundamentals
- *Description:* This talk will introduce the concept of Map/Reduce, a
programming paradigm that enables the parallel processing of extremely
large data sets. We'll also introduce Hadoop's implementation of
Map/Reduce, and HDFS, the distributed file system that's built into Hadoop
to enable Map/Reduce. Nearly all of Hadoop is implemented in Java, and this
talk will cover some of the details of writing a Map/Reduce job in Java.
- *Speaker:* Aaron Myers
- *Mini-Bio:* Aaron T. Myers (aka ATM) is a Platform Software Engineer
at Cloudera and an Apache Hadoop Committer/PMC Member at Apache. Aaron's
work is primarily focused on HDFS, High Availability, and Hadoop Security.
Prior to joining Cloudera, Aaron was a Software Engineer and VP of
Engineering at Amie Street, where he worked on all components of the
software stack, including operations, infrastructure, and customer-facing
feature development. Aaron holds both an Sc.B. and Sc.M. in Computer
Science from Brown University.
- *Talk:* Beyond Map/Reduce: Introduction to Apache Crunch and
Apache Spark
- *Description:* Following Aaron's talk, Todd will introduce Apache Crunch
and Apache Spark. These two projects are higher-level frameworks which
allow the programmer to express complex distributed data processing tasks
on Hadoop in a more concise and simple manner than writing raw MapReduce
jobs. Additionally, Todd will introduce Spark Streaming, a processing
system which can run data flows on real-time data as it arrives. He will
cover some example use cases that show how Hadoop can be used in such
applications as real-time streaming data processing, machine learning, and
model building.
- *Speaker:* Todd Lipcon
- *Mini-Bio:* Todd Lipcon is an engineer at Cloudera who works on Core
Hadoop as well as the Cloudera Distribution for Hadoop. Todd is also active
in other Apache projects and is always excited to hear about the
interesting ways in which people are using Hadoop for large scale data
analysis. Previously, Todd came to Cloudera from Amie Street, where he
worked on infrastructure, operations, data mining, and product development.
Prior to that, he interned at Google developing machine learning methods to
detect credit-card fraud on AdWords and Google Checkout. Todd holds a BSc
in Computer Science from Brown University, where he completed an honors
thesis developing a new collaborative filtering algorithm for the Netflix
Prize Competition.
*Source:*
http://soujava.org.br/2014/03/26/encontro-tecnico-de-abril-cloudera/
--
Best regards,
Otávio Gonçalves de Santana
blog: http://otaviosantana.blogspot.com.br/
twitter: http://twitter.com/otaviojava
site: http://www.otaviojava.com.br
(11) 98255-3513