Natural Language Processing and Information Extraction

Submitted by webmaster on Fri, 10/22/2021 - 14:48
Course No: 
194093
Course Type: 
VU
Term: 
2021W
Weekly Hours: 
2.0
Lecturer: 
Adam Kovacs
Gábor Recski
Allan Hanbury
Kinga Andrea Gemes
Language: 
English
Objective: 
Content: 

- Basics of text processing: segmentation, tokenization, decompounding, stemming, lemmatization; regular expressions
- N-gram language modeling, simple classification tasks in NLP
- Part-of-speech tagging, named entity recognition, and shallow parsing with Hidden Markov Models
- Syntactic representations and syntactic parsing
- Basics of natural language semantics
- Neural network basics. Feed forward networks and recurrent neural networks
- Sequence modeling and sequence-to-sequence models. 
- Neural language modeling. Word vectors and contextualized language models. 
- Information extraction tasks: entity recognition, relation extraction, knowledge base population
- Information extraction applications: summarization, question answering, chatbots

Information: 

The link to the online lectures is in TUWEL.

Workload for Students (in hours):

  • Lectures: 24
  • Milestone 1: 8
  • Milestone 2: 8
  • Final Project: 35

Summe: 75

Notes: 
Examination: 

15% for Milestone 1 15% for Milestone 2 50% for the final solution 10% for the presentation 10% for the management summary

Recommendation: