1. Intro
2. Tika in the news
3. Tika's History in brief
4. Detection
5. Supported Formats
6. OCR
7. Databases
8. Tika Config XML example
9. Tika Batch
10. Geo Entity Lookup
11. Image Object Recognition
12. Text Searchable Video
13. Tika 1.14
14. Metadata Storage
15. Metadata for Video etc
16. Logging, Config, Defaults
17. Content Handler Reset Add
18. Content Enhancement
19. Metadata Standards
Description:
Explore the latest features and improvements in Apache Tika 2.0 in this conference talk. Tika detects and extracts metadata and text from a wide range of file formats, which makes it useful in applications from search engines to big data processing. Learn how Tika has evolved over the past decade, including expanded format support, new usage methods, and refined philosophies for handling various file types. Gain insights into Tika's support for multiple programming languages and its capabilities for big-data scale operations. Whether you're an experienced Tika user or new to the technology, the talk covers topics such as detection, OCR, databases, Tika Config XML, batch processing, geo entity lookup, image object recognition, and text-searchable video. It also explains the changes in metadata storage, logging, configuration, and content enhancement introduced in Tika 2.0. The talk is presented by Nick Burch, a long-time Apache contributor and CTO at Quanticate.

Apache Tika 2.0 - New Features and Improvements

Linux Foundation