Deep Learning for text data mining: Solving spreadsheet data classification.
Master thesis
Permanent lenke
http://hdl.handle.net/11250/2455462Utgivelsesdato
2017-06-09Metadata
Vis full innførselSamlinger
- Studentoppgaver (TN-IDE) [823]
Sammendrag
This project developed for the Avito LOOPS company. Research goals was to investigate existing algorithms and implementations of Deep Learning, to understand their applicability to text mining, to design a solution that incorporates theoretical and practical aspects, to run classification experiments on different data sets so that the pros and cons of different techniques can be understood. Classification of the text was necessary for the spreadsheet columns classification.
The work used convolutional and recurrent neural networks, trained on samples from five classes. Also, was made an attempt to classify unknowns for a neural network of classes, with an ensemble of four networks.
Beskrivelse
Master's thesis in Computer science