Over the last several years, businesses have seen an explosion in the volume, variety, and velocity of the data they must deal with every day. This has been both a blessing and a curse. At the same time as the explosion in data has enabled new types of highly intelligent applications and insights, developers have found that the previous generation of data management tools and frameworks struggle to keep up with terabytes or petabytes of often ill-structured data.
In this talk, Todd will cover Apache Hadoop, an open source framework for storing and analyzing vast quantities of diverse data. In addition to giving a panorama of the core Hadoop components, including HDFS and MapReduce, Todd will also introduce other projects in the Hadoop ecosystem, including Apache HBase, Cloudera Impala, and Apache Spark. Also highlighted, are some of the most interesting applications of these new technologies from several of today's data-enabled companies.