Word Sense Disambiguation for Web 2.0 Data
In this work we plan to create an architecture that will allow for a variety of parallel similarity and parallel clustering algorithms to be tested and developed to be run against Web 2.0 data. These algorithms will be used to analyze emerging semantics and word senses within the data.
Use of FutureSystems
Investigate emerging semantics in natural language web 2.0 data.
Scale of Use
Around ten VMs to run experiments. We will use these VMs many times over the course of a couple of months to test a variety of algorithms.