A pipelined data-parallel algorithm for ILP

Nuno A. Fonseca; Fernando Silva; Vitor Santos Costa; Rui Camacho

doi:10.1109/clustr.2005.347059

Please use this identifier to cite or link to this item: https://hdl.handle.net/10216/74815

Author(s):	Nuno A. Fonseca Fernando Silva Vitor Santos Costa Rui Camacho
Title:	A pipelined data-parallel algorithm for ILP
Issue Date:	2006
Abstract:	The amount of data collected and stored in databases is growing considerably for almost all areas of human activity. Processing this amount of data is very expensive, both humanly and computationally. This justifies the increased interest both on the automatic discovery of useful knowledge from databases, and on using parallel processing for this task. Multi Relational Data Mining (MRDM) techniques, such as Inductive Logic Programming (ILP), can learn rules from relational databases consisting of multiple tables. However current ILP systems are designed to run in main memory and can have long running times. We propose a pipelined data-parallel algorithm for ILP. The algorithm was implemented and evaluated on a commodity PC cluster with 8 processors. The results show that our algorithm yields excellent speedups, while preserving the quality of learning.
Subject:	Ciências da computação e da informação Computer and information sciences
Scientific areas:	Ciências exactas e naturais::Ciências da computação e da informação Natural sciences::Computer and information sciences
URI:	https://hdl.handle.net/10216/74815
Source:	2005 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER)
Document Type:	Artigo em Livro de Atas de Conferência Internacional
Rights:	openAccess
License:	https://creativecommons.org/licenses/by-nc/4.0/
Appears in Collections:	FCUP - Artigo em Livro de Atas de Conferência Internacional FEUP - Artigo em Livro de Atas de Conferência Internacional

Files in This Item:

File	Description	Size	Format
70778.pdf	A pipelined data-parallel algorithm for ILP	207.89 kB	Adobe PDF	View/Open

Show full item record Recommend this item Display Statistics

This item is licensed under a Creative Commons License