|
|
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10174/37887
|
Full metadata record
| DC Field | Value | Language |
| dc.contributor.author | Nunes, Rafael Oleques | - |
| dc.contributor.author | Santos, Joaquim | - |
| dc.contributor.author | Spritzer, André | - |
| dc.contributor.author | Balreira, Dennis G. | - |
| dc.contributor.author | Freitas, Carla M. Dal Sasso | - |
| dc.contributor.author | Olival, Fernanda | - |
| dc.contributor.author | Cameron, Helena Freire | - |
| dc.contributor.author | Vieira, Renata | - |
| dc.contributor.editor | Paes, Aline | - |
| dc.contributor.editor | Verri, Filipe A. N. | - |
| dc.date.accessioned | 2025-02-12T11:36:14Z | - |
| dc.date.available | 2025-02-12T11:36:14Z | - |
| dc.date.issued | 2025 | - |
| dc.identifier.citation | Nunes, Rafael Oleques; Santos, Joaquim; Spritzer, Andre; Balreira, Dennis G.; Freitas, Carla M. Dal Sasso; Olival, Fernanda; Cameron, Helena Freire; Vieira, Renata (2025). «Assessing European and Brazilian Portuguese LLMs for NER in Specialised Domains». In: Paes, A., Verri, F.A.N. (eds) Intelligent Systems. BRACIS 2024. Lecture Notes in Computer Science, vol 15412.. s.l., Springer, Cham, 2025, pp 215–230. ISBN: 978-3-031-79029-4. https://doi.org/10.1007/978-3-031-79029-4_15 | por |
| dc.identifier.isbn | 978-3-031-79029-4 | - |
| dc.identifier.uri | http://hdl.handle.net/10174/37887 | - |
| dc.description.abstract | This paper discusses the impact of Portuguese variants in
Large Language Models for the task of named entity recognition (NER)
in specialised domains. The tests were made on a Brazilian Portuguese le
gal and a European Portuguese historical corpora. The models taken into
account are BERTimbau (PT-BR), Albertina (PT-PT and PT-BR), and
XML-R (multilingual). The impact was more evident in the Portuguese
historical corpus, which resulted in higher F1 measures compared to
previous works that did not consider the same language variant. Ad
ditionally, the study underscores the impact of model architecture on
performance, highlighting the critical role of both linguistic alignment
and model size in enhancing NER in specialised domains. | por |
| dc.description.sponsorship | This work has received funds from the Coordenação de
Aperfeiçoamento de Pessoal de Nível Superior- Brasil (CAPES)- Finance Code 001, the Brazilian funding agency CNPq, and the Portuguese Science Foundation
FCT,inthecontext of the projects CEECIND/01997/2017 and UIDB/00057/2020
https://doi.org/10.54499/UIDB/00057/2020 | por |
| dc.language.iso | eng | por |
| dc.publisher | Springer, Cham | por |
| dc.rights | embargoedAccess | por |
| dc.subject | Humanidades Digitais | por |
| dc.subject | Processamento de Língua Natural | por |
| dc.subject | Named Entity Recognition | por |
| dc.subject | Variantes do Português | por |
| dc.subject | Large Language Models | por |
| dc.title | Assessing European and Brazilian Portuguese LLMs for NER in Specialised Domains | por |
| dc.type | bookPart | por |
| dc.identifier.sharewith | Departamento de História | por |
| dc.identifier.authoremail | nd | - |
| dc.identifier.authoremail | nd | - |
| dc.identifier.authoremail | nd | - |
| dc.identifier.authoremail | nd | - |
| dc.identifier.authoremail | nd | - |
| dc.identifier.authoremail | mfo@uevora.pt | - |
| dc.identifier.authoremail | helenac@ipportalegre.pt | - |
| dc.identifier.authoremail | renatav@uevora.pt | - |
| dc.identifier.scientificarea | 619 | por |
| dc.date.embargo | 2026-02-15 | - |
| dc.identifier.doi | https://doi.org/10.1007/978-3-031-79029-4_15 | por |
| Appears in Collections: | CIDEHUS - Publicações - Capítulos de Livros
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|