Explore the vast world of GitHub data in this 42-minute Devoxx conference talk. Dive into an analysis of 750 billion GitHub events and 42 TB of code to gain valuable insights into software development trends, open-source community dynamics, and effective project management strategies. Learn how to leverage this massive dataset to guide project design decisions, measure community health, and understand coding patterns over time. Discover techniques for running static code analysis at scale, evaluating the impact of social media on project popularity, and identifying the most effective ways to request changes. Gain a deeper understanding of your project's audience by examining who starred it and their other interests. Through live on-stage analysis, uncover fascinating insights about coding preferences, engagement patterns, and geographical distribution of contributions. Whether you're a developer, project manager, or data enthusiast, this talk offers a unique perspective on the collaborative nature of software development and the power of big data analysis in the open-source ecosystem.
Read more
What Can We Learn from 750 Billion GitHub Events and 42 TB of Code