Explore word segmentation and morphology in this advanced Natural Language Processing lecture from Carnegie Mellon University. Delve into the complexities of defining a "word," learn about tokenization techniques, and examine morphological analysis across various language types. Discover unsupervised subword segmentation methods and gain insights into finite state automata and transducers. Investigate linguistic and meaning analysis, two-level morphology, and spelling rules. Enhance your understanding of advanced NLP concepts through this comprehensive exploration of word-level language processing.
CMU Advanced NLP: Word Segmentation and Morphology