Please use this identifier to cite or link to this item:
https://hdl.handle.net/10216/171756| Author(s): | Silvano, Maria da Purificação Apostol, Elena-Simona Truica, Ciprian-Octavian Damova, Mariana Oleškeviciene, Giedre Valunaite Liebeskind, Chaya Trajanov, Dimitar |
| Title: | Multiword discourse markers across languages: a linguistic and computational perspective |
| Issue Date: | 2025 |
| Abstract: | Discourse markers (DMs) are linguistic expressions that convey different semantic and pragmatic values, managing and organizing the structure of spoken and written discourses. They can be either single-word or multiword expressions (MWE), made up of conjunctions, adverbs, and prepositional phrases. Although DMs are the focus of many studies, some questions regarding the interoperability of taxonomies and automatic identification and classification require further research. We aim to tackle these issues by offering a critical analysis and discussing the constitution of a multilingual corpus in 10 languages, i.e., English, Lithuanian, Bulgarian, German, Macedonian, Romanian, Hebrew, Polish, European Portuguese, and Italian. The novel two-level annotation approach is based on (i) signaling the existence or non-existence of DMs in a given text, and (ii) applying the ISO- 24617 standard to annotate the DMs' discourse relation and communicative function in the corpora. Additionally, we introduce prediction models for detecting the presence of DMs within a text. |
| DOI: | 10.1111/ijal.12755 |
| URI: | https://hdl.handle.net/10216/171756 |
| Document Type: | Artigo em Revista Científica Internacional |
| Rights: | openAccess |
| Appears in Collections: | FLUP - Artigo em Revista Científica Internacional |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| 752881.pdf | 1.02 MB | Adobe PDF | ![]() View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
