Explore the insights gained from analyzing 1.1 billion GitHub events and 42 TB of code in this 42-minute conference talk. Discover how to leverage Google BigQuery to examine five years of GitHub metadata and open source code. Learn to understand community dynamics and code patterns for any programming language or project. Gain valuable knowledge for open source creators, users, and decision-makers to make informed choices. Delve into topics such as top contributing companies, star metrics, project health indicators, geographical contributions, and the impact of platforms like Hacker News. Investigate code import patterns, Stack Overflow's influence, and user-defined functions. Uncover answers to important questions about successful projects and programming language preferences. Stay curious and learn how to harness this vast dataset to enhance your understanding of the open source ecosystem.
What Can We Learn From 1.1 Billion GitHub Events and 42 TB of Code