Web Mining (FSS2016)

The textual content as well as the structured data which is accessible on the Web has an enormous potential for being mined to derive knowledge about nearly any aspect of human life.  The course covers advanced data mining techniques for extracting knowledge from Web content, the Web link structure, as well as usage data gathered by Web applications.  The course will cover the following topics: 

  • Web Usage Mining
  • Recommender Systems
  • Web Structure Mining
  • Social Network Analysis
  • Web Content Mining
  • Information Extraction
  • Sentiment Analysis

The course consists of a lecture together with accompanying practical exercises as well as student team projects.

In the exercises the participants will gather initial expertise in applying state of the art web mining tools/libraries on realistic data sets.

The team projects take place in the last third of the term. Within the projects, students realize more sophisticated web mining projects of personal choice and report about the results of their projects in the form of a written report as well as an oral presentation.

Exam Review

  • The exam review (Klasureinsicht) for both exam will take place on Friday, September 23rd at 14:00 in room B6, B1.21.

Time and Location

  • Thursday, 15:30 to 17:00, Room: B 6, A104, Start: 18.2.2016
  • Friday, 12:00 to 13:30, Room: B 6, A104, Start: 19.2.2016

Instructor

Final mark

  • 50 % written exam
  • 50 % project work

Slides and Excercises

The slides will be published on this webpage before each lecture.

  1. Slideset: Introduction and Course Outline
  2. Slideset: Web Usage Mining
  3. Exercise 1: Recommender Systems (task | dataset)
  4. Slideset: Web Structure Mining
  5. Exercise 2: Pajek (datasets | part 1 | part 2 | part 3)
  6. Slideset: Web Content Mining (part 1 | part 2)
  7. Exercise 3: Sentiment Analysis (slides | task_1 | task_2
  8. Slideset: Introduction to Student Projects
  9. Exercise 4: Aspect-based Sentiment Analysis (task_1 |  task_2

Lecture Videos

Participation 

  • The course is open to students of the Master Business Informatics. 
  • The course is restricted to 30 participants.
  • The registration for the course is opend on 8. February 2017 at 10:00.
  • Places are assigned on first come/first serve basis.
  • Students register for the course by joining the ILIAS group.

Requirements

  • Basic programming skills in Java are required for the exercise.
  • It is beneficial to addend the course IE 500 Data Mining before this course.

 Course Evaluations

 Outline

Week

Topic Thursday

Topic Friday

18.02.2016

Lecture: Introduction to Web Mining

Lecture: Web Usage Mining

26.02.2016

Lecture: Recommender Systems

Exercise: Recommender Systems

03.03.2016

Exercise: Recommender Systems

Lecture: Web Structure Mining

10.03.2016

Lecture: Social Network Analysis

Exercise: Introduction to Pajek

17.03.2016

Exercise: Social Network Analysis

Exercise: Social Network Analysis

 

- Easter break -

07.04.2016

Lecture: Web Content Mining: Sentiment analysis

Exercise:  Sentiment Analysis

14.04.2016 Lecture:  Web Content Mining: Information Extraction Exercise: Information Extraction
21.04.2016 Introduction to Student Projects Prepararation of Project Outlines
28.04.2016 Feedback about Project Oulines Project work

05.05.2016

Project work

Coaching

12.05.2016

Project work

Coaching

19.05.2016

Project work

Coaching

26.05.2016

Project work

Coaching

02.06.2016 Presentation of project results Presentation of project results

 Literature 

  1. Bing Liu: Web Data Mining, 2nd Edition, Springer.
  2. Wouter de Nooy, Andrej Mrvar, Vladimir Batagelj: Exploratory Social Network Analysis with Pajek, Cambridge University Press.
  3. Dietmar Jannach: Recommender Systems: An Introduction, Cambridge University Press.
  4. Pang-Ning Tan, Michael Steinback, Vipin Kumar: Introduction to Data Mining, Pearson.
  5. Ian H. Witten, Eibe Frank, Mark A. Hall: Data Mining: Practical Machine Learning Tools and Techniques, 3rd Edition, Morgan Kaufmann.

Software