Vis enkel innførsel

dc.contributor.advisorRong, Chunming
dc.contributor.advisorCristina, Viorica Heghedus
dc.contributor.authorShenavari Shirazi, Anousheh
dc.date.accessioned2018-09-25T11:22:02Z
dc.date.available2018-09-25T11:22:02Z
dc.date.issued2018-06-15
dc.identifier.urihttp://hdl.handle.net/11250/2564360
dc.descriptionMaster's thesis in Computer sciencenb_NO
dc.description.abstractThe focus of this study is on the relation between papers and their citations using Machine Learning algorithms to detect improper and irrelevant citations. The model takes the paper’s citations and classifies them into two classes, ”Related” and ”Barely related” citations. Here we considered two Machine Learning algorithms, ”Decision tree algorithm” and ”Naive Bayes algorithm” along with introducing the statistical algorithm called ”Prior statistical algorithm” to classify the relation. During the design process of the classification models, the required data for implementing have been collected from a large-scale and reliable data source. Converting techniques have been used to transform data to the structured format. The evaluation results show that the Prior statistical model has limitation since it applied on dataset considering only one feature, however from the two machine learning algorithms that we employed, Naive Bayes outperform decision tree since it was extremely fast and did not require a very large training set to obtain a good learning model, however, Decision Tree was easier to implement and understand.nb_NO
dc.language.isoengnb_NO
dc.publisherUniversity of Stavanger, Norwaynb_NO
dc.relation.ispartofseriesMasteroppgave/UIS-TN-IDE/2018;
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.subjectinformasjonsteknologinb_NO
dc.subjectdatateknikknb_NO
dc.subjectmachine learning algorithmnb_NO
dc.subjectcitations relationnb_NO
dc.subjectdata miningnb_NO
dc.subjectclassificationnb_NO
dc.subjectdecision treenb_NO
dc.subjectnaive Bayes algorithmnb_NO
dc.titleMachine Learning methods to detect improper and irrelevant citationsnb_NO
dc.typeMaster thesisnb_NO
dc.description.versionpublishedVersionnb_NO
dc.subject.nsiVDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550::Datateknologi: 551nb_NO
dc.source.pagenumber53nb_NO


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Navngivelse 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal