Adapt and Generalize Deep Learning Methods for Activity Recognition on Newborn Resuscitation Videos

Lapin, Håvard

dc.contributor.author	Lapin, Håvard
dc.date.accessioned	2020-09-28T18:31:59Z
dc.date.available	2020-09-28T18:31:59Z
dc.date.issued	2020-06-15
dc.identifier.uri	https://hdl.handle.net/11250/2680064
dc.description	Master's thesis in Information technology	en_US
dc.description.abstract	Low- and middle-income countries have nearly 99 % of deaths of children under 28 days. Complications during childbirths, such as birth asphyxia, account for most of these deaths. To prevent this, strengthening the quality of resuscitation training and providing better quality health service are a major focus. Major Projects like Safer Births has had a focus to establish new knowledge and develop new technology to reduce the number of deaths. They have collected videos of newborn resuscitation at Haydom Lutheran hospital in Tanzania. This data is useful for evaluating the official resuscitation guidelines by investigating the treatment process and compliance with guidelines. With this data a timeline over the activities like suction, ventilation and stimulation can be extracted. However, issues like low brightness, non-standard camera placement and low frame rate videos persist in these data. The aims of this thesis is to improve upon previous work involving the Safer Births data and put all the previous work into a single pipeline. Earlier work has used object detection networks to locate and classify objects in the videos to post-process the videos to a format that the activity recognition network could predict on. The Convolutional Neural Network (CNN) from earlier work like RetinaNet and Inception I3D was tested and had promising results, but still had some issues with detecting objects like the suction penguin and a large amount of False Positives. To improve object detection different kinds of pre-processing methods were applied. The methods to correct the lighting resulted in poorer results, but training a new model with gradient based input together with the original model resulted in a loss of False Positives among most classes. Combining previous binary activity recognition models resulted in a promising gain in suction, but a loss in the ventilation prediction. Adding a more efficient optical flow method for usage with activity recognition encountered issues with training data generation and resulted in worse performance than previous work. Creating new methods in python from previous work was completed with a new pipeline that could predict and generate a timeline, but has worse performance and needs more work.	en_US
dc.language.iso	eng	en_US
dc.publisher	University of Stavanger, Norway	en_US
dc.relation.ispartofseries	Masteroppgave/UIS-TN-IDE/2020;
dc.subject	informasjonsteknologi	en_US
dc.subject	Convolutional Neural Network	en_US
dc.subject	kybernetikk	en_US
dc.subject	signalbehandling	en_US
dc.subject	deep learning	en_US
dc.subject	signalbehandling	en_US
dc.subject	object detection	en_US
dc.subject	image processing	en_US
dc.subject	activity recognition	en_US
dc.subject	newborn resuscitation	en_US
dc.subject	robot technology	en_US
dc.subject	fødselskomplikasjoner	en_US
dc.title	Adapt and Generalize Deep Learning Methods for Activity Recognition on Newborn Resuscitation Videos	en_US
dc.type	Master thesis	en_US
dc.subject.nsi	VDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550	en_US
dc.subject.nsi	VDP::Teknologi: 500::Medisinsk teknologi: 620	en_US

Tilhørende fil(er)

Filnavn:: Lapin_Haavard.pdf
Størrelse:: 22.31Mb
Format:: PDF

Åpne

Denne innførselen finnes i følgende samling(er)

Studentoppgaver (TN-IDE) [866]
Studentoppgaver i informasjonsteknologi, datateknikk / kybernetikk, signalbehandling

Vis enkel innførsel