Fairness Auditing and Bias Mitigation in Aspect-Based Sentiment Models for Indonesian Public Services

Muhammad Shihab Fathurrahman Jondien, Taqwa Hariguna, Dhanar Intan Surya Saputra

Abstract


This study presents a comprehensive fairness audit and bias mitigation framework for Indonesian sentiment analysis using the SmSA IndoNLU dataset and the IndoBERT language model. It investigates demographic and linguistic fairness by evaluating model performance across gender and regional groups, and it introduces an aspect-based extension that assesses semantic fairness using an ABSA-style input formulation. Fairness metrics such as ΔF1, Demographic Parity Difference (DPD), and Equality of Opportunity were used to quantify disparities in model behavior. The baseline IndoBERT model achieved strong overall accuracy (0.942) and macro-F1 (0.927) but exhibited significant regional bias, particularly toward Eastern and Sumatran dialects. A re-weighting strategy reduced the regional F1 disparity by 59 percent with minimal accuracy loss, demonstrating the viability of loss-based fairness mitigation. The ABSA-style IndoBERT further improved fairness consistency across dialectal and aspect categories, achieving a macro-F1 of 0.930. Despite these improvements, aspect-level imbalances persisted, indicating that fairness challenges extend beyond demographic representation to semantic coverage. This work contributes an empirical and methodological foundation for ethical NLP evaluation in Bahasa Indonesia, emphasizing fairness auditing, bias mitigation, and responsible deployment of language models in low-resource and linguistically diverse settings.
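The group-level metrics named in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formulas used in the study, so this sketch assumes one common convention: each metric is the largest gap between any two groups, where ΔF1 compares per-group macro-F1, DPD compares per-group rates of predicting a chosen label, and the Equality of Opportunity gap compares per-group true positive rates for that label. The function names, the two-class toy labels, and the group names "A" and "B" are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def macro_f1(y_true, y_pred, labels):
    """Unweighted mean of per-class F1 over the given labels."""
    scores = []
    for c in labels:
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # A class with no true or predicted instances scores 0.0 here (assumption).
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


def _gap(values):
    """Largest minus smallest value; 0.0 when no group has a defined value."""
    values = list(values)
    return max(values) - min(values) if values else 0.0


def fairness_gaps(y_true, y_pred, groups, labels, positive):
    """Return max-min gaps across groups for macro-F1, selection rate, and TPR."""
    buckets = defaultdict(lambda: ([], []))
    for t, p, g in zip(y_true, y_pred, groups):
        buckets[g][0].append(t)
        buckets[g][1].append(p)

    f1, selection_rate, tpr = {}, {}, {}
    for g, (t, p) in buckets.items():
        f1[g] = macro_f1(t, p, labels)
        # Share of the group's examples predicted as `positive`.
        selection_rate[g] = sum(x == positive for x in p) / len(p)
        # Predictions for the group's examples whose true label is `positive`.
        preds_on_pos = [pi for ti, pi in zip(t, p) if ti == positive]
        if preds_on_pos:
            tpr[g] = sum(x == positive for x in preds_on_pos) / len(preds_on_pos)

    return {
        "delta_f1": _gap(f1.values()),
        "dpd": _gap(selection_rate.values()),
        "eod": _gap(tpr.values()),
    }


# Toy data (hypothetical): group A is classified perfectly, group B is not.
y_true = ["pos", "pos", "neg", "neg", "pos", "pos", "neg", "neg"]
y_pred = ["pos", "pos", "neg", "neg", "pos", "neg", "neg", "neg"]
groups = ["A"] * 4 + ["B"] * 4

gaps = fairness_gaps(y_true, y_pred, groups, labels=["neg", "pos"], positive="pos")
print(gaps)
```

For a multi-class sentiment task, the selection-rate and true-positive-rate gaps would need to be computed per label or for one label of interest; the sketch leaves that choice to the `positive` argument.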

Keywords


Fairness Auditing; Bias Mitigation; Aspect-Based Sentiment Analysis; IndoBERT; Indonesian NLP; Ethical AI; Low-Resource Languages






DOI: http://dx.doi.org/10.35671/telematika.v19i1.3269



 




Telematika
ISSN: 2442-4528 (online) | ISSN: 1979-925X (print)
Published by: Universitas Amikom Purwokerto
Jl. Let. Jend. POL SUMARTO Watumas, Purwonegoro - Purwokerto, Indonesia


This work is licensed under a Creative Commons Attribution 4.0 International License.