Focus Group: Natural Language Processing and Information Retrieval (Prof. Ponzetto)

The NLP and IR group at DWS conducts research on integrating knowledge from heterogeneous Web sources – ranging from large raw text collections through collaboratively constructed resources (e.g., Wikipedia) to knowledge bases (DBpedia, Freebase, etc.) – and its application to Natural Language Processing (NLP), Information Analysis and Retrieval tasks. Areas of interest include “deep” NLP techniques for text understanding, ranging from lexical semantics (Word Sense Disambiguation, ontology-based and distributional meaning representations) to document understanding and structuring (entity linking, ranking and search, automatic summarization). The group applies NLP methods to support empirical research in the Social Sciences and Humanities.

People

Head: Prof. Dr. Simone Paolo Ponzetto

Members:

Post-Docs

PhD students

Alumni

* joint project with the AI group

** joint work with the Web-based Information Systems and Services @ HDM Stuttgart

*** joint work with the Web Data Mining group

Projects

  • SFB 884: Political Economy of Reforms
  • DFG Project JOIN-T: Joining Ontologies and semantics INduced from Text
  • Juniorprofessorenprogramm MWK Baden-Württemberg: Deep semantic models for high-end NLP applications
  • Elite Post-docs program of the Baden-Württemberg Stiftung: Knowledge consolidation and organization for query-specific Wikipedia construction
  • RiSC Programm MWK Baden-Württemberg: Vision and language understanding beyond literal meaning
  • MWFK BaWü Project: Part-Time Master Program: Data Science
  • Research and Science Center: Trust in Web Reviews
  • Research Data Service Center

Master and Bachelor Theses

This thesis should provide an in-depth overview of the various recurrent neural network models (fully recurrent networks, recursive networks, long...

This thesis should provide an in-depth overview of the state-of-the-art methods for representing knowledge graphs and knowledge bases in the (i.e.,...

Social networks are of high interest for many applications, ranging from simple user profiling to user-customized advertisement. In this thesis, we...

Continuous emotion detection is a core aspect of many real-world applications. In this work we will experiment with an existing interactive installation...

The goal of this thesis is to organize news from German news outlets in such a way as to detect events and salient topics in the news. The...

Convolutional neural networks have been shown to be very successful in various text classification tasks. The main shortcoming of CNNs used for text...

Recently the DWS group released a huge repository of hypernymy relations from the Web, the WebIsADb (http://webdatacommons.org/isadb/), containing a large...

In this thesis we will build upon and extend an annotation tool to conduct a user study and better understand the requirements towards image...

Object detection in images from news articles is a very challenging task. On the one hand, available training data for object detectors is only...

Introduction/problem: Speculation/hedging/vagueness identification plays a significant role in many applications, e.g. information extraction, machine...

Publications

Conference Item