Informatics 141 / Computer Science 121: Information Retrieval

Assignment 02

Winter 2008

Department of Informatics

Donald Bren School of Information and Computer Sciences

University of California, Irvine


Due 2/13/2008

  1. Java Program (100%)
    1. Set up a programming environment for developing your program from Assignment 03 in the 3rd floor ICS1 public computing lab.
    2. Attend the discussion section on 2/13/2008.
    3. Using your system:
      1. Crawl the web pages that we give you in class
        1. Starting from the seed set that we give you
        2. Using the regular expression filter that we give you
    4. Build a webgraph from the crawled pages.
    5. Calculate the shortest path between two pages.
      1. We will give you the start and end page in class
    6. You must complete this without help from anyone else on your team (if you have one).
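
The filter, webgraph, and shortest-path steps above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the required design: the class name `WebGraphSketch`, its methods, and the `example.org` URLs and regular expression are all hypothetical, and the real seed set, filter, and start and end pages are the ones given in class. The sketch assumes links have already been extracted from the crawled pages (it does no fetching), treats every link as an unweighted directed edge, and uses breadth-first search, which finds a path with the fewest links in that setting. It uses `computeIfAbsent` and lambdas, so it needs Java 8 or later.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Queue;
import java.util.regex.Pattern;

public class WebGraphSketch {
    // Adjacency list: page URL -> URLs that the page links to.
    private final Map<String, List<String>> outLinks = new LinkedHashMap<>();
    // Stand-in for the regular expression filter handed out in class.
    private final Pattern filter;

    public WebGraphSketch(String allowedUrlRegex) {
        this.filter = Pattern.compile(allowedUrlRegex);
    }

    // Records a directed edge only if both URLs pass the filter.
    public boolean addLink(String from, String to) {
        if (!filter.matcher(from).matches() || !filter.matcher(to).matches()) {
            return false;
        }
        outLinks.computeIfAbsent(from, k -> new ArrayList<>()).add(to);
        outLinks.computeIfAbsent(to, k -> new ArrayList<>());
        return true;
    }

    // Breadth-first search; returns the pages on a shortest path from
    // start to end, or an empty list if end is unreachable or unknown.
    public List<String> shortestPath(String start, String end) {
        if (!outLinks.containsKey(start) || !outLinks.containsKey(end)) {
            return Collections.emptyList();
        }
        Map<String, String> parent = new HashMap<>();
        parent.put(start, null); // the start page has no predecessor
        Queue<String> queue = new ArrayDeque<>();
        queue.add(start);
        while (!queue.isEmpty()) {
            String page = queue.remove();
            if (page.equals(end)) {
                // Walk the parent links back to the start, then reverse.
                List<String> path = new ArrayList<>();
                for (String p = page; p != null; p = parent.get(p)) {
                    path.add(p);
                }
                Collections.reverse(path);
                return path;
            }
            for (String next : outLinks.get(page)) {
                if (!parent.containsKey(next)) { // not visited yet
                    parent.put(next, page);
                    queue.add(next);
                }
            }
        }
        return Collections.emptyList();
    }

    public static void main(String[] args) {
        // Hypothetical filter and pages, for illustration only.
        WebGraphSketch graph = new WebGraphSketch("https?://example\\.org/.*");
        graph.addLink("http://example.org/a", "http://example.org/b");
        graph.addLink("http://example.org/a", "http://example.org/c");
        graph.addLink("http://example.org/b", "http://example.org/d");
        graph.addLink("http://example.org/c", "http://example.org/d");
        graph.addLink("http://example.org/d", "http://example.org/e");
        graph.addLink("http://example.org/a", "http://other.test/x"); // rejected by the filter
        System.out.println(
            graph.shortestPath("http://example.org/a", "http://example.org/e"));
    }
}
```

Keeping a parent map alongside the visited check lets the search reconstruct the path itself rather than only its length. If links had costs, breadth-first search would no longer be appropriate and an algorithm such as Dijkstra's would be needed instead.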