Feb 23, 2007, 01:30PM - 03:00PM, TLC 405B

Feb 23, 2007, 01:30PM - 03:00PM, TLC 405B


Synonym Resolution on the Web


Alexander Yates, University of Washington, http://www.cs.washington.edu/homes/ayates/

The Web is a vast resource of information on practically anything one can think of. Unfortunately, the information is mostly in unstructured text, making it difficult for machines to process. This talk presents new methods for identifying synonymous objects and relations on the Web, on top of an information extraction system. New techniques developed for this problem include a novel probabilistic model for synonym extraction, and a highly scalable clustering algorithm. The results have been integrated into an application that allows searching over a large set of relations extracted from the Web, and they hold promise for improved search technology.

© 2001-2013 Center for Data Analytics and Biomedical Informatics, Temple University