Vis enkel innførsel

dc.contributor.advisorSetty, Vinay
dc.contributor.authorFossåen, Nils Magne
dc.date.accessioned2020-09-27T18:21:50Z
dc.date.available2020-09-27T18:21:50Z
dc.date.issued2020-07-15
dc.identifier.urihttps://hdl.handle.net/11250/2679785
dc.descriptionMaster's thesis in Computer scienceen_US
dc.description.abstractSemi-supervised learning defines the techniques that fall in between supervised and unsupervised learning. It is commonly used in classification settings where one has a lesser amount of labeled data compared to unlabeled. The goal is to extract extra learning from the unlabeled data to improve on the supervised classification. We will explore some of the approaches to semi-supervised learning to improve on the classification of Nordic news articles in the corpus provided. We will be exploring the methods of self-training in several different configurations and methods of feature extraction and engineering. We will also provide some background and baseline using common supervised methods for improving results as well as different document representations like word-embedding so that we will be able to compare and put our semi-supervised results in relation to these methods. We will see that while some of the methods explored did not succeed, others did and in relation to some of the supervised methods their performance is comparable. We will also see some promising approaches for countering the imbalance problem when considering confident pseudo-labels.en_US
dc.language.isoengen_US
dc.publisherUniversity of Stavanger, Norwayen_US
dc.relation.ispartofseriesMasteroppgave/UIS-TN-IDE/2020;
dc.rightsNavngivelse-Ikkekommersiell-DelPåSammeVilkår 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/deed.no*
dc.subjectinformasjonsteknologien_US
dc.subjectsemi-superviseden_US
dc.subjectclassificationen_US
dc.titleSemi-supervised learning for classification of Nordic news articlesen_US
dc.typeMaster thesisen_US
dc.subject.nsiVDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550en_US


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Navngivelse-Ikkekommersiell-DelPåSammeVilkår 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse-Ikkekommersiell-DelPåSammeVilkår 4.0 Internasjonal