Taggers versus Linkers: Comparing Tags and Anchor Text of Web Pages

Skills Inventory

  • Information Retrieval
  • Natural Language Processing
  • Information Organization
  • Python, MySQL

This study compares the properties of tags and anchor text metadata, with the motivation of auto suggesting tags for web pages.

Dataset

Findings

The ground work for this project was done during the research seminar class, and a paper was subsequently published in the ISD Symposium at UC Berkeley in Feb 2009.

Tag cloud for tokenized bag-of-words representation
Tag distribution for a sample web page ‘Using the