BigQuery is Google's fully-managed, petabyte-scale data warehouse for analytics - ideal for the serverless ecosystem. In this talk Felipe will provide an overview of BigQuery and share with us some of the insights he has derived from analyzing various datasets on BigQuery.
Â
Speaker: Felipe Hoffa (@felipehoffa)
Felipe is a Developer Advocate on the Cloud Team @Google. He also moderates the BigQuery thread on Reddit and speaks regularly on BigQuery & data flow at tech events,
Stream processing is increasingly critical for real-time solutions that involve large scale continuous data inputs - but implementing it can be complex without the right tools or understanding of concepts.
Â
In his talk titled "Foundations of streaming SQL or: how I learned to love stream & table theory", Tyler will walk through key ideas and tools in this space.
Speaker: Tyler Akidau (@takidau)
Tyler is a senior staff software engineer at Google Seattle, where he leads technical infrastructure internal data processing teams for MillWheel and Flume. He is a founding member of the Apache Beam PMC and is the co-author of the soon to be published O'Reilly book on Streaming Systems
Fastly is a global Content Delivery Network with the unique ability to stream logs from the edge in real time to the endpoint of your choice -- including directly into BigQuery.
Companies like The New York Times rely on Fastly’s BigQuery integration for real-time insights into website pageviews and usage by demographic, geo location, and other dimensions. Fastly log data can even be combined with additional data sources such as Google Analytics, Google Ads data, security and firewall logs. With a growing customer base including Vimeo, Spotify, and Gannett, Fastly is already streaming tens of millions of data points an hour from the edge into BigQuery for real-time analysis.
Speaker: Chris Jackel.Â
Chris has been working to make the Web faster for more than 10 years. He is currently a Solutions Architect at Fastly and is based in New York. He doesn't have a fear of clowns, but he doesn't much like them either.
We will use the lunch interval to reorganize the room into tables that are more suitable for codelabs, and potentially setup any accounts or software required if needed.
We hope to send a reminder with the requirements to all registered attendees so as to minimize the time spent in setup and maximize time spent in learning.
Pick your own adventure. Start by doing the beginner codelab to ensure you have your environment setup and understand the basic operations. Then continue on with other codelabs (structured) OR pick a dataset from the resources below and explore them on your own (free form). At the end of this segment, we invite attendees to share any interesting insights they may have gained either from datasets or from their experience using BigQuery.
CODELABS
 - Load and analyze Data with BigQuery (20 mins, beginner)
 - Query Wikipedia Dataset with BigQuery (20 mins, beginner)
 - Explore Campaign Finance with BigQuery (45 mins, intermediate)
 - Query & Visualize Location Data: BigQuery + Google Maps (2hrs, advanced)
RESOURCES
 - BigQuery Docs: Tutorials for Getting Started
- Felipe Hoffa moderates a Reddit that lists publicly available datasets for BigQuery
 - Google Cloud has a set of hosted public datasets for convenience
 - Sara Robinson has a great post on using Cloud ML with BigQuery that you can extend
 - Reto Meier's TILWBQ (Today I learnt with BigQuery) has tons more examples for ideas
GOOGLE CLOUD TOOLS
 - Google Cloud Data Studio - data visualization
 - Google Cloud DataPrep - clean & prepare data
We hope to have talks on real-world usage or applications for big data & BigQuery.
.We also encourage our codelab attendees to sign up to do a 2-5 minute "lightning talks" to share your insights from explorations you may have done in the prior session.
We will update this section with the details on these talks as they occur.
LIGHTNING TALKS & DEMOS
 - "BigQuery for Government Data" - Ralph Yozzo (GDG NYC)
- Geoffrey Moore, The Business Book, 2014Â
(Photo by Jason Corey on Unsplash)
 The Google Developer Group (NYC) is a for-community, by-community organization focused on supporting awareness and actionable learning for technologists.
Â
GDG NYC is organized by Nitya (@nitya), Ralph (@fedex1), Donna (@donnacheng1) and Lakshma. Learn more about us on Twitter (@gdg_nyc) or join our Meetup.
Â
If you have specific questions about Study Camps or want to work with us as a partner or a sponsor, please email nitya@gdgny.org
