Big Data Research for Open Source Applications
Big data is a collection of data sets so large and complex that it becomes difficult to process using onhand database management tools or traditional data processing applications. The challenges include capture, curation, storage, search, sharing, transfer, analysis, and visualization. In this internship, we analyze a real-world big data set(s) to make sensible inferences by taking into account a selected range of criteria. A number of methods and algorithms are investigated, evaluated and evolved to advance the development of specialized tools and processes.