Please use this identifier to cite or link to this item:
https://hdl.handle.net/10216/615
Author(s): | Filipe Silva Gabriel David |
Title: | Searching a database based web site |
Issue Date: | 2003 |
Abstract: | Currently, information systems are usually supported by databases (DB) and accessed through a Web interface. Pages in such Web sites are not drawn from HTML files but are generated on the fly upon request. Indexing and searching such dynamic pages raises several extra difficulties not solved by most search engines, which were designed for static contents. In this paper we describe the development of a search engine that overcomes most of the problems for a specific Web site, how the limitations put to indexing dynamic Web pages were circumvented, and an evaluation of the results obtained. The solution involves using a locally developed crawler, the Oracle Text full text indexer, and meta-information automatically drawn from the DB or manually added to improve the relevance factor calculation. It has the advantage of uniformly covering the dynamic pages and the static Web pages of the site. |
Description: | Currently, information systems are usually supported by databases (DB) and accessed through a Web interface. Pages in such Web sites are not drawn from HTML files but are generated on the fly upon request. Indexing and searching such dynamic pages raises several extra difficulties not solved by most search engines, which were designed for static contents. In this paper we describe the development of a search engine that overcomes most of the problems for a specific Web site, how the limitations put to indexing dynamic Web pages were circumvented, and an evaluation of the results obtained. The solution involves using a locally developed crawler, the Oracle Text full text indexer, and meta-information automatically drawn from the DB or manually added to improve the relevance factor calculation. It has the advantage of uniformly covering the dynamic pages and the static Web pages of the site. |
URI: | https://hdl.handle.net/10216/615 |
Source: | VII Jornadas de Ingeniería del Software y Bases de Datos (JISBD'03) |
Document Type: | Artigo em Livro de Atas de Conferência Internacional |
Rights: | openAccess |
License: | https://creativecommons.org/licenses/by-nc/4.0/ |
Appears in Collections: | FEUP - Artigo em Livro de Atas de Conferência Internacional |
This item is licensed under a Creative Commons License