Главная
Study mode:
on
1
Introduction
2
What is Twilio
3
Agenda
4
What is Spark
5
We dont need Spark
6
Data Science Data Engineering
7
RDD
8
GroupByKey
9
DataSets
10
State of Password
11
Have I Been Owned
12
The Data
13
Schema Check
14
Most Popular Passwords
15
Length Column
16
Run Raw Sequel
17
Filtering Passwords
18
Password Data
19
Schema Inference
20
UserDefined Functions
21
Results
22
Dog Rights
23
Benefits of Spark
24
Challenges
25
Nested Error Messages
26
Apache Spark Documentation
27
Security Implications
28
Data Privacy
29
Security
30
What can you do
31
Thank you
32
Conclusion
33
Closing
34
Audience Questions
Description:
Explore Apache Spark's capabilities for analyzing large-scale distributed data in this GOTO Chicago 2018 conference talk. Dive into the world of password security as Kelley Robinson, Developer Evangelist at Twilio, demonstrates how to process and analyze over 500 million leaked passwords using Spark. Learn about Spark's API advancements for Scala, Python, and SQL, and discover techniques for efficient data processing. Gain insights into password trends, popular choices, and security implications. Understand the challenges and benefits of working with Spark, including nested error messages and documentation. Discuss data privacy concerns and practical steps for improving password security. Conclude with audience questions and valuable takeaways for implementing Spark in your own projects.

Analyzing Pwned Passwords with Apache Spark

GOTO Conferences
Add to list