Focus Group: Web Data Mining (Prof. Paulheim)

The Web Data Mining group focuses on two main topics. First, we look at using structured and semi-structured web data as background knowledge in data mining problems. We develop methods for efficiently accessing such web data in data mining, as well as mining algorithms tailored to the particularities of such data. Second, we use data mining methods to create and improve large-scale web corpora. Here, we look into machine learning methods for completing missing knowledge, as well as methods for identifying wrong pieces of information.

Master and Bachelor Theses

eCommerce is on the rise. Logistics companies like Deutsche Post DHL are expanding and building up new logistics networks in emerging markets around...

Since mid-September 2015, the threat from ransomware has grown considerably [1]. Against this background, comprehensive geographical and temporal...

Target: Master

Type: Survey

Short abstract: This thesis should provide an in-depth overview of the adoption of natural language processing and...

Target: Master

Type: Experiments

Introduction/problem: Party manifestos (https://manifestoproject.wzb.eu/) present the vision of a specific party...

Title: Enhancing Domain Specific Entity Linking

Target: Master

Type: Experiments

Introduction/problem: Entity linking could improve text...

In recent years, thanks to the success of social networks (e.g., Facebook, Twitter, ...) and the availability of collaborative customer-made company...

The DWS group is happy to announce a new release of the WebDataCommons Microdata, Embedded JSON-LD, RDFa and Microformat data corpus.

The data has...

A large number of e-shops have started to mark up structured data about products, offers and reviews in their HTML pages using the markup standard Micr...

Data integration problems arise whenever data from separate sources needs to be combined as the basis for new applications. Within the context of the...

Adtelligence is an international software and technology company and was named a Technology Pioneer by the World Economic Forum in 2014 and by...

Publications

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007