TakeAPeek Learning: Chaining classifiers for multiple regression

Rodrigo Gondim Carvalho da Fonte

doi:10.34626/b99s-0415

Please use this identifier to cite or link to this item: https://hdl.handle.net/10216/90140

Author(s):	Rodrigo Gondim Carvalho da Fonte
Title:	TakeAPeek Learning: Chaining classifiers for multiple regression
Issue Date:	2016-02-12
Abstract:	In multiple target regression problems, the objective is to predict the numeric value of multiple dependent Y variables from the numeric values of independent X variables using linear regression. One of the issues with this type of problem is the fact that it does not account for the codependency between the target variables Y themselves. This objective of this thesis is to adapt the existent multiple regression algorithm to use chained classifiers. By using chained classifiers the prediction of each target variable Yn is not done independently of the other Y variables. Each prediction model will include the values of previously predicted Y variables. By doing so the model will attempt to combat the co dependence between the target variables thus leading to more accurate results. This thesis works upon an existing article about multi-label classification methods for multi-target regression. I will demonstrate the effectiveness of the base algorithm using 12 datasets and prove the influence that the order in which the target variables are predicted have upon the final mean error of all target variables.
Description:	Em problemas de regressão multi-alvo o objetivo é prever o valor numérico de múltiplas variáveis dependentes Y a partir dos valores numéricos das variáveis independentes X utilizando o algoritmo de regressão linear. Um dos problemas com este tipo de algoritmo é o fato que não se tem em conta a codependência entre as variáveis alvo Y. Esta tese tem como objetivo adaptar o algoritmo de regressão múltipla existente para usar classificadores em cadeia. Ao utilizar classificadores encadeados a previsão de cada variável Yn não é feita independentemente das restantes variáveis Y. Cada modelo de previsão incluirá os valores das variáveis Y previamente calculadas. Com isto, o modelo irá ter em conta a codependência entre as variáveis alvo levando a resultados mais precisos. O trabalho feito nesta tese e baseado num artigo existente sobre métodos de classificação multi-label para regressão multi-alvo. Irei demonstrar a eficácia do algoritmo base usando 12 datasets e provar a influência da ordem do qual as variáveis alvo que são previstas têm sobre o erro médio final de todas as variáveis alvo.
Subject:	Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering
Scientific areas:	Ciências da engenharia e tecnologias::Engenharia electrotécnica, electrónica e informática Engineering and technology::Electrical engineering, Electronic engineering, Information engineering
DOI:	10.34626/b99s-0415
TID identifier:	201318881
URI:	https://hdl.handle.net/10216/90140
Document Type:	Dissertação
Rights:	openAccess
Appears in Collections:	FEUP - Dissertação

Files in This Item:

File	Description	Size	Format
119682.1.pdf	TakeAPeek Learning: Chaining classifiers for multiple regression	338.97 kB	Adobe PDF	View/Open
119682.pdf	TakeAPeek Learning: Chaining classifiers for multiple regression	338.97 kB	Adobe PDF	View/Open

Show full item record Recommend this item Display Statistics