DBPedia Project in Summer

Project Details

Project Lead
Naveen Madhire 
Project Manager
Naveen Madhire 
Institution
Indiana Universiry Bloomington, IU  
Discipline
Computer Science (401) 

Abstract

Google Summer of Code 2015 is a global program where students can participate for writing code for open source community. I've submitted a proposal this year to contribute to DBPedia open source. My proposal has been accepted and I would be working on the project from May 2015 to August 2015. The project is remodel the existing Pig framework to a more robust and scalable framework using Apache spark. I would need to test the project on large scalable Cloudera Hadoop Cluster.
PFB the link to accepted proposal,
http://www.google-melange.com/gsoc/project/details/google/gsoc2015/naveenmadhire/5766466041282560
DBpedia PigNlproc Code base which I would be working on,
https://github.com/dbpedia-spotlight/pignlproc

Intellectual Merit

DBPedia pignlproc project would help the broader open source community.

Broader Impacts

N/A

Scale of Use

Approx 6 VMs each with 8 CPUs