Lecture Statistical NLP – Institut für Künstliche Intelligenz

Overview

Semester	Winter 2024/25
ECTS	5
Level	Master
Language	English

General

Lectures

Instructor. Henning Wachsmuth
Location. Appelstr. 11, A145
Time. Thursday, 11:00–12:30
First date. October 17, 2024
Last date. January 30, 2025

Tutorials

Instructor. Yamen Ajjour
Location. Appelstr. 9A, MZ2
Time. Wednesday, 13:15–14:45
First date. October 23, 2024
Last date. January 29, 2025

Description

This course teaches students the major skills needed to tackle typical natural language processing (NLP) tasks with statistical methods. Starting from basics of NLP and machine learning, the course introduces the main learning-based NLP techniques, from clustering and classification to sequence labeling and neural language models. The application of these techniques is exemplified for various NLP tasks, such as topic modeling, sentiment analysis, and coreference resolution. Students learn to design, implement, and evaluate respective NLP methods, both theoretically and in practical assignments.

Topics

Recap of basics of data science and natural language processing
Unsupervised NLP techniques, such as representation learning and clustering
Supervised NLP techniques, such classification, regression, and sequence labeling
Neural NLP techniques, such as feedforward networks, recurrent networks, and transformers
Practical issues when applying NLP to real-world tasks

Recommended pre-requisites

Basics of statistics
Knowledge of programming, ideally Python
Bachelor's course: Introduction to Natural Language Processing
Alternatively: Any course on machine learning or artificial intelligence

Recommended literature

Daniel Jurafsky and James H. Martin. 2009. Speech and Language Processing: An Introduction to Natural Language Processing, Speech Recognition, and Computational Linguistics. Prentice-Hall, 2nd edition. Free draft of third edition: Speech and Language Processing

Recommended other courses

The lab Human Language Technologies (HLT) is aligned with this course
This course builds on the course Introduction to Natural Language Processing (INLP)

Lecture slides

Orga 01 – Organizational information [slide s]
Orga 02 – Tentative exam dates
Part 1 – Overview [slide s]
Part 2 – Basics of Natural Language Processing [slide s]
Part 3 – Basics of Statistical Natural Language Processing [slide s]
Part 4 – Representation Learning [slide s]
Part 5 – NLP using Clustering [slide s]
Part 6 – NLP using Classification and Regression [slide s]
Part 7 – NLP using Sequence Labeling [slide s]
Part 8 – NLP using Neural Networks [slide s]
Part 9 – NLP using Transformers [slide s]
Part 10 – Practical Issues [slide s]

Statistical Natural Language Processing