Please use this identifier to cite or link to this item: https://hdl.handle.net/10216/15161
Full metadata record
DC FieldValueLanguage
dc.creatorLuís Sarmento
dc.creatorAlexander Kehlenbeck
dc.creatorEugénio Oliveira
dc.creatorLyle Ungar
dc.date.accessioned2022-09-10T07:09:40Z-
dc.date.available2022-09-10T07:09:40Z-
dc.date.issued2009
dc.identifier.othersigarra:60958
dc.identifier.urihttps://hdl.handle.net/10216/15161-
dc.description.abstractWe present a multi-pass clustering approach to large scale. wide-scope named-entity disambiguation (NED) oil collections of web pages. Our approach Uses name co-occurrence information to cluster and hence disambiguate entities. and is designed to handle NED on the entire web. We show that on web collections, NED becomes increasing), difficult as the corpus size increases, not only because of the challenge of scaling the NED algorithm, but also because new and surprising facets of entities become visible in the data. This effect limits the potential benefits for data-driven approaches of processing larger data-sets, and suggests that efficient clustering-based disambiguation methods for the web will require extracting more specialized information front documents.
dc.language.isoeng
dc.relation.ispartofMachine Learning and Data Mining in Pattern Recognition
dc.rightsopenAccess
dc.rights.urihttps://creativecommons.org/licenses/by-nc/4.0/
dc.subjectInformática, Ciências da computação e da informação
dc.subjectInformatics, Computer and information sciences
dc.titleAn Approach to Web-Scale Named-Entity Disambiguation
dc.typeArtigo em Livro de Atas de Conferência Internacional
dc.contributor.uportoFaculdade de Engenharia
dc.identifier.doi10.1007/978-3-642-03070-3_52
dc.identifier.authenticusP-003-R7R
dc.subject.fosCiências exactas e naturais::Ciências da computação e da informação
dc.subject.fosNatural sciences::Computer and information sciences
Appears in Collections:FEUP - Artigo em Livro de Atas de Conferência Internacional

Files in This Item:
File Description SizeFormat 
60958.pdf192.43 kBAdobe PDFThumbnail
View/Open


This item is licensed under a Creative Commons License Creative Commons