Data acquisition in hadoop system
Master thesis
View/ Open
Date
2010Metadata
Show full item recordCollections
- Studentoppgaver (TN-IDE) [823]
Abstract
Data has become more and more important these years, especially for
big companies, and it is of great bene t to dig out useful information inside.
In Oil & Gas industry, there are a lot of data available, both in real-time
and historical format. As the amount of data is huge, it is usually infeasible
or very time consuming to process the data. Hadoop is introduced to solve
this problem.
In order to perform Hadoop jobs, data must exist on the Hadoop lesys-
tem, which brings the problem of data acquisition. In this thesis, two so-
lutions are given out for data acquisition. The performance comparison is
introduced afterwards, and solution based on Chukwa is proved to be better
than the other solution.
Description
Master's thesis in Computer science