Data acquisition in hadoop system
Master thesis
Permanent link
http://hdl.handle.net/11250/181723
Date issued
2010
Collections
- Studentoppgaver (TN-IDE)
Abstract
Data has become increasingly important in recent years, especially for
large companies, and extracting useful information from it is of great benefit.
In the oil and gas industry, large amounts of data are available, both real-time
and historical. Because the volume of data is so large, processing it is usually
infeasible or very time-consuming. Hadoop is introduced to solve
this problem.
To run Hadoop jobs, the data must first exist on the Hadoop filesystem,
which raises the problem of data acquisition. This thesis presents two
solutions for data acquisition. Their performance is then compared,
and the Chukwa-based solution is shown to perform better
than the other.
Description
Master's thesis in Computer science