The Fourth International Conference on Information Integration and Web-based Applications and Services (IIWAS2002)

Invited Speaker

SELF LEARNING METHODS TO FIND NOVEL INFORMATION ON THE WWW

Marco Wiering
Inst of Information & Computing Sciences
University Utrecht
Netherlands

Abstract
This talk will discuss self-learning search methods for finding novel interesting information on the world wide web (WWW). The methods employ reinforcement learning (RL) agents which can optimize their behavior by interacting with an environment and learn from the obtained feedback (reward signals). The reward for finding a novel WWW-page is given by a supervisedly trained naive Bayesian classifier. The goal of the agent is to maximize the cumulative reward obtained in its limited life time, and thus it will learn to find as much novel interesting WWW-pages as possible. The agents learn value functions for evaluating many possible WWW-pages as starting address for its search. It this way it can also learn to use particular WWW-pages as starting
place which lead to finding many novel interesting pages. As soon as the agent has exhaustively explored a WWW-page and its links, it does not get any more reward, and it will learn to search for different WWW-pages. The idea has been proposed by Andrew McCallum, and we provide extensions to this idea.

Biography
Dr. Marco Wiering (marco@cs.uu.nl) finished his Computer Science degree "Cum Laude" from the university of Amsterdam in 1995. Then he did his PhD research at IDSIA (Lugano, CH). He graduated on the PhD-thesis "Explorations in Efficient Reinforcement Learning" in 1999. Since then we was a postdoc at the UvA from May until October, 1999. Since January 2000, he works as a university researcher/teacher at the University Utrecht. He published about 20 papers on the topics of reinforcement learning, machine learning, and multi-agent systems.