– Joe Intrakamhang
It been a long time coming, but we’re getting back on track. The Fuse data nerds have been locked away in the dev cave, but we’re ready to come out and talk about all things Big Data. Hope you can make it.
Topics
Big Data and Machine Learning – Joe Intrakamhang from Google
Learn about a fully managed, petabyte scale, and serverless database called Big Query. Also, we will have a live machine learning demo using Tensorflow and then productionizing it with Google’s Cloud ML service.
Joe works at Google as a Solutions Engineer on the Google Cloud team. In this role, he focuses on architecting and designing solutions for companies migrating to Google Cloud. He is a passionate developer who loves technology and he continuously works on fine tuning his software craftsmanship.
Stream Processing for Analytic Workloads – Ron Buckley from Hortonworks
Stream processing has become the defacto standard for building real-time ETL and Stream Analytics applications. We see batch workloads move into Stream processing to act on the data and derive insights faster. With the explosion of data with “Perishable Insights” such IoT and machine-generated data, Stream Processing + Predictive Analytics is driving tremendous business value. This is evidenced by the explosion of Stream Processing frameworks like proven and evolving Apache Storm and newer frameworks such as Apache Flink, Apache Apex, and Spark Streaming.
Today, users have to choose and try to understand the benefits of each of these frameworks and not only that they have to learn the new APIs and also operationalize their applications. To create value faster, we are introducing new open source tool – Streamline. It is a self-service tool that will ease building streaming application and deploy the streaming application across multiple frameworks/engines that users prefer in a snap. It simplifies integration with Machine Learning models for scoring and classification of data for Predictive Analytics. It provides an elegant way to build Analytics dashboards to derive business insights out of the streaming data and to allow the business users to consume it easily.
In this talk, we will outline the fundamentals of real-time stream processing and demonstrate Streamline capabilities to show how it simplifies building real-time streaming analytics applications.
Ron Buckley is a Solutions Engineer at Hortonworks. Previous to Hortonworks, Ron worked on teams at Nationwide Children’s Research Institute and OCLC implementing Hadoop for HealthCare and Library centric systems. Ron has presented multiple times on HBase at HBaseCon and various other events.
Google Cloud Spanner: Worldwide consistent database at scale – Joe Intrakamhang from Google
A worldwide consistent database that is fully managed. What is this database you speak of? It is a product from Google called Cloud Spanner. In this talk, we will share an overview, how Cloud Spanner works, and an awesome demo.