dc.contributor.author | Kaveh, Maziar | |
dc.date.accessioned | 2015-09-11T11:22:56Z | |
dc.date.available | 2015-09-11T11:22:56Z | |
dc.date.issued | 2015-06 | |
dc.identifier.uri | http://hdl.handle.net/11250/299607 | |
dc.description | Master's thesis in Computer science | nb_NO |
dc.description.abstract | The Internet of Things (IoT) is becoming increasingly prevalent in today's society. Innovations in storage and processing methodologies enable the processing of large amounts of data in a scalable manner, and generation of insights in near real-time. Data from IoT are typically time-series data but they may also have a strong spatial correlation. In addition, many time-series data are deployed in industries that still place the data in inappropriate relational databases.
Many open-source time-series databases exist today with inspiring features in terms of storage, analytic representation, and visualization. Finding an efficient method to migrate data into a time-series database is the first objective of the thesis.
In recent decades, machine learning has become one of the backbones of data innovation. With the constantly expanding amounts of information available, there is good reason to expect that smart data analysis will become more pervasive as an essential element for innovative progress. Methods for modeling time-series data in machine learning and migrating time-series data from a database to a big data machine learning framework, such as Apache Spark, is explored in this thesis. | nb_NO |
dc.language.iso | eng | nb_NO |
dc.publisher | University of Stavanger, Norway | nb_NO |
dc.relation.ispartofseries | Masteroppgave/UIS-TN-IDE/2015; | |
dc.rights | CC0 1.0 Universal | * |
dc.rights.uri | http://creativecommons.org/publicdomain/zero/1.0/ | * |
dc.subject | informasjonsteknologi | nb_NO |
dc.subject | IoT | nb_NO |
dc.subject | HBase | nb_NO |
dc.subject | Apache Kafka | nb_NO |
dc.subject | OpenTSDB | nb_NO |
dc.subject | Apache Spark | nb_NO |
dc.subject | datateknikk | nb_NO |
dc.subject | machine learning | nb_NO |
dc.subject | time-series database | nb_NO |
dc.title | ETL and analysis of IoT data using OpenTSDB, Kafka, and Spark | nb_NO |
dc.type | Master thesis | nb_NO |
dc.subject.nsi | VDP::Technology: 500::Information and communication technology: 550::Computer technology: 551 | nb_NO |
dc.source.pagenumber | 63 | nb_NO |