Int J Performability Eng ›› 2025, Vol. 21 ›› Issue (8): 411-421.doi: 10.23940/ijpe.25.08.p1.411421

    Next Articles

Tackling Arabic NLP Challenges: POS-Tagging with Transformer-Based Models and Nuanced Evaluation

Roussafi Mahdjoubia,* and Mohamed Tayeb Laskrib   

  1. aLIAM Laboratory, Department of Computer Science, University of M’sila, M'Sila, Algeria;
    bLRIA Laboratory, Department of Computer Science, Kofi Annan University of Guinea, Conakry, Guinea
  • Submitted on ; Revised on ; Accepted on
  • Contact: *E-mail address: roussafi.mahdjoubi@univ-msila.dz

Abstract: Modern Standard Arabic's (MSA) rich and agglutinative morphology, complexity of clitics, lack of diacritical marks, and resulting morphosyntactic ambiguities make POS-tagging difficult. In order to test for ambiguous forms and stylistic variants, this study assesses the robustness of AraBERT, MARBERT, and CAMeL Tools on an enriched corpus of 8,147 sentences (293,199 tokens), which includes 400 synthetic sentences and 200 literary sentences. The findings demonstrate the limitations of the models when exposed to a range of text types with strong performances on journalistic texts (F1 Macro ~83.5%) and a decline on synthetic (~66%) and literary (~55%) data. To enhance the investigation of ambiguity, we also suggest the PL-Score, a supplementary measure that assesses errors based on their linguistic plausibility (e.g., NOUN→ADJ). These findings underscore the necessity for varied corpora and robust hybrid methodologies combining deep learning and human linguistic knowledge to significantly enhance POS-tagging with implications for machine translation and literary text analysis.

Key words: POS tagging, MSA, morphosyntactic ambiguity, transformers, diacritization, augmented corpus, PL Score