Data By the Bay

We’re helping to promote Data By the Bay, a large-scale, data-focused gathering with 150 talks, comprising seven conferences over the course of five days.

From the organizers:

50+ founders/CEOs/CTOs, many VPs of Engineering/Directors of Research are presenting the whole spectrum of data flows: end-to-end pipelines encompassing web-scale APIs, microservices, Apache Kafka, Apache Spark, Deep Learning, and everything in between.

The matrix consists of application verticals and architectural horizontals.

The horizontals include platforms, pipelines, and algorithms. Learn them all, build your next company from lego pieces, and hear a year worth of the world’s best tech talks in the same place. Hot breakfast, infinite Ritual coffee, and 2-hour happy hour with full bar and the company of the speakers every day, 5 days a week.

In-depth talks from Google (BigQuery and Translate), Baidu Research, MetaMind, StitchFix (Deep Learning), Microsoft, Bloomberg, Quora, Kaggle, Dato (Machine Learning), Netflix (Recommender Systems), IBM (Watson), Facebook, ClearStory (DataViz), LinkedIn, Yahoo, H2O, Confluent, Mesosphere (Data Pipelines), Samsung, Automatic (IoT), AMPLab, Databricks, Salesforce, Workday, Cloudera (Spark), Pivotal (OSS), Zillow, Pandora, Nitro, Lucidworks, Mattermark, Credit Karma, Alpine Labs, , University of California-Berkeley, Stanford University, City of San Francisco, and many others.

3 full-day training courses are deep dives into Microservices, Agile Data Science with Spark, and Natural Language Processing.

Only 300 tickets for each day will be available to have a truly intimate technical community atmosphere. Register today! Use the code IBMDATA10 for 10% off.


You Might Also Enjoy

James Spyker
James Spyker
2 months ago

Streaming Transformations as Alternatives to ETL

The strategy of extracting, transforming and then loading data (ETL) to create a version of your data optimized for analytics has been around since the 1970s and its challenges are well understood. The time it takes to run an ETL job is dependent on the total data volume so that the time and resource costs rise as an enterprise’s data volume grows. The requirement for analytics databases to be mo... Read More

Seth Dobrin
Seth Dobrin
2 months ago

Non-Obvious Application of Spark™ as a Cloud-Sync Tool

When most people think about Apache Spark™, they think about analytics and machine learning. In my upcoming talk at Spark Summit East, I'll talk about leveraging Spark in conjunction with Kafka, in a hybrid cloud environment, to apply the batch and micro-batch analytic capabilities to transactional data in place of performing traditional ETL. This application of these two open source tools is a no... Read More