Intro

CONTENTS

THEN, YOU NEED TO UNDERSTAND LANGUAGE

WHAT MAKES LANGUAGE HARD

INTRODUCING SPARK NLP

THE PERFORMANCE BOTTLENECK

SPARK NLP 2017: NATIVE SPARK EXTENSION

BENCHMARK: TRAINING

BENCHMARK: SCALING

SPARK NLP 2018 IMPROVEMENTS

FRICTIONLESS REUSE & OPTIMIZATION

WHAT EXACTLY IS "STATE OF THE ART"?

NAMED ENTITY RECOGNITION

NER WITH DEEP LEARNING

WORD EMBEDDINGS

EMBEDDINGS: THE NEXT GENERATION

SPARK NLP 2018: EMBEDDED TENSORFLOW

PERFORMANCE: THE NEXT LEVEL

SENTIMENT ANALYSIS

SO, TRAIN YOUR OWN DOMAIN-SPECIFIC MODELS

E-DISCOVERY

WHAT'S A GOOD FIRST NLP PROJECT?

WHAT EXPECTATIONS SHOULD I SET?

Description:

Explore state-of-the-art natural language understanding at scale in this 50-minute conference talk from ODSC West 2018. Dive into the challenges of processing language and learn about the NLP library for Apache Spark, which extends Spark ML pipeline APIs for distributed, optimized NLP and ML pipelines. Discover core NLP algorithms including lemmatization, part-of-speech tagging, dependency parsing, named entity recognition, spell checking, and sentiment detection. Follow along with demonstrations of building common pipelines using PySpark on notebooks. Gain insights into benchmarks, design best practices, and performance optimizations for NLP, ML, and deep learning pipelines on Spark. Understand the latest improvements in Spark NLP, including native Spark extensions, embedded TensorFlow, and advanced word embeddings. Learn about practical applications like e-discovery and domain-specific sentiment analysis models. Get guidance on starting your first NLP project and setting realistic expectations for working with natural language processing at scale.

State of the Art Natural Language Understanding at Scale - David Talby

Open Data Science

#Data Science #Big Data #Apache Spark #Computer Science #Machine Learning #Deep Learning #Programming #Programming Languages #Python #PySpark #Artificial Intelligence #Computational Linguistics #Lemmatization