Please use this identifier to cite or link to this item: https://hdl.handle.net/10216/171756
Author(s): Silvano, Maria da Purificação
Apostol, Elena-Simona
Truica, Ciprian-Octavian
Damova, Mariana
Oleškeviciene, Giedre Valunaite
Liebeskind, Chaya
Trajanov, Dimitar
Title: Multiword discourse markers across languages: a linguistic and computational perspective
Issue Date: 2025
Abstract: Discourse markers (DMs) are linguistic expressions that convey different semantic and pragmatic values, managing and organizing the structure of spoken and written discourses. They can be either single-word or multiword expressions (MWE), made up of conjunctions, adverbs, and prepositional phrases. Although DMs are the focus of many studies, some questions regarding the interoperability of taxonomies and automatic identification and classification require further research. We aim to tackle these issues by offering a critical analysis and discussing the constitution of a multilingual corpus in 10 languages, i.e., English, Lithuanian, Bulgarian, German, Macedonian, Romanian, Hebrew, Polish, European Portuguese, and Italian. The novel two-level annotation approach is based on (i) signaling the existence or non-existence of DMs in a given text, and (ii) applying the ISO- 24617 standard to annotate the DMs' discourse relation and communicative function in the corpora. Additionally, we introduce prediction models for detecting the presence of DMs within a text.
DOI: 10.1111/ijal.12755
URI: https://hdl.handle.net/10216/171756
Document Type: Artigo em Revista Científica Internacional
Rights: openAccess
Appears in Collections:FLUP - Artigo em Revista Científica Internacional

Files in This Item:
File Description SizeFormat 
752881.pdf1.02 MBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.