Word Sense Disambiguation for Web 2.0 Data

Project ID
FG-4
Project Categories
Computer Science
Completed
Abstract
In this work we plan to create an architecture that will allow for a variety of parallel similarity and parallel clustering algorithms to be tested and developed to be run against Web 2.0 data. These algorithms will be used to analyze emerging semantics and word senses within the data.
Use of FutureSystems
Investigate emerging semantics in natural language web 2.0 data.
Scale of Use
Around ten VMs to run experiments. We will use these VMs many times over the course of a couple of months to test a variety of algorithms.