Главная
Study mode:
on
1
Introduction
2
Overview
3
Word Segmentation
4
The apostrophe
5
What is a word
6
Tokenization
7
European Languages
8
Slides
9
Problem with tokenization
10
Rulebased tokenization
11
Sentence boundary
12
Subword analysis
13
What is morphology
14
Rulebased systems
15
Language typology
16
Isolated languages
17
Gluteative languages
18
Turkish
19
English
20
Other European Languages
21
IndoEuropean Languages
22
Germanic Languages
23
Chinese
24
Historical Linguistics
25
Patterns of Languages
26
Reduplication
27
Type token curves
28
Recognizing words of a language
29
Spelling rules
30
Finite State Automata
31
Adjectives
32
Morphology in English
33
Finite State Transducer
34
Einsertion
35
FST
Description:
Explore word segmentation and morphology in this advanced Natural Language Processing lecture from Carnegie Mellon University. Delve into the complexities of defining a "word," learn about tokenization techniques, and understand morphological analysis across various languages. Discover unsupervised subword segmentation methods and examine language typology, including isolated and agglutinative languages. Investigate historical linguistics, patterns of languages, and the application of finite state automata in morphological analysis. Gain insights into spelling rules, type-token curves, and the intricacies of morphology in English and other European languages.

CMU Advanced NLP: Word Segmentation and Morphology

Graham Neubig
Add to list