1. Introduction
2. Scraping
3. Firehose
4. Architecture diagram
5. Goblin
6. PHP
7. ETL
8. URL
9. Language Detection
10. Architecture
11. Supervisor
12. Manager
13. Pipelines
14. Delivery
15. Kafka
16. Push Scheduler
17. Load ETL
18. More than one firehose
19. Scale up
20. Where PHP fits
21. Summary
22. History of PHP
23. Why PHP
24. PHP was no risk
25. How well PHP works
26. Other languages
27. NoJS
28. JSONDecode
29. Behavior
30. Scale
31. String handling
32. Encoding
33. JVM
34. Connectivity
35. Bundled Extensions
36. Liability in Production
37. Quality Threshold
38. PHP Philosophy
39. Slide Summary
40. Recap
41. QA
Description:
Discover how PHP plays a crucial role in handling Twitter's massive data stream at DataSift in this PHP UK Conference talk. Learn about the architecture and processes involved in managing the 'firehose' of 500 million daily tweets, including data scraping, language detection, and delivery. Explore the reasons behind choosing PHP for this high-scale operation, its performance advantages, and how it compares to other languages. Gain insights into PHP's string handling capabilities, JSON decoding behavior, and bundled extensions that make it suitable for processing large volumes of data. Understand the philosophy behind PHP and its reliability in production environments. The presentation concludes with a Q&A session, offering a comprehensive look at PHP's capabilities in handling big data at scale.

PHP at the Firehose Scale

PHP UK Conference