Learning Analytics to Uncover Ethnic Bias in Educational Texts: An Ensemble Learning Approach

Josmario Albuquerque; Bart Rienties; Martin Hlosta; Wayne Holmes

doi:10.18608/jla.2026.8905

Authors

Josmario Albuquerque The Open University https://orcid.org/0000-0002-7437-0747
Bart Rienties The Open University https://orcid.org/0000-0003-3749-9629
Martin Hlosta Swiss Distance University of Applied Sciences and The Open University https://orcid.org/0000-0002-7053-7052
Wayne Holmes University College London https://orcid.org/0000-0002-8352-1594

DOI:

https://doi.org/10.18608/jla.2026.8905

Keywords:

learning analytics, ethnic bias, machine learning, online learning, open educational resources, research paper

Abstract

Online learning platforms have expanded access to education but also raise concerns about biased content, particularly in text-based learning materials such as textbooks, lesson plans, and course excerpts. Such biases can perpetuate discrimination, can harm student outcomes, and can often be difficult to detect, as identification typically relies on time-consuming human review. Learning analytics (LA) can enhance this process by supporting human reviewers through automated detection, offering a scalable solution while retaining human judgment for nuanced evaluations. Accordingly, this LA study explores two research questions: RQ1: Which features might support the identification of ethnic bias in text-based online learning materials? and RQ2: Which classification approaches might be suitable for identifying ethnic bias in text-based online learning materials? First, we identified features signalling potential ethnic bias (presence or absence) in textual content using a dataset (N = 345) labelled by 193 students from diverse ethnic backgrounds. Then, we evaluated multiple machine learning (ML) models for their effectiveness in bias classification. The results suggest significant correlations between perceived bias and content from social sciences. Additionally, through bootstrap analysis, support vector machines and random forest classifiers showed consistent performance in bias identification (with F1-scores of 0.71 and 0.70 on the test set, respectively). In contrast, the naive Bayes (NB) model demonstrated the highest precision (0.75 on the test set). We discuss these findings and their implications for LA, emphasizing the importance of quality and inclusive educational tools. As an initial step toward automated bias classification in education, this study provides a foundation for spotting ethnic bias in learning content, supporting fairer technologies for more inclusive learning environments.

References

Albuquerque, J. (2023). Towards an automatic approach for uncovering ethnic bias in online learning texts [Doctoral dissertation, The Open University]. https://doi.org/10.21954/ou.ro.000170d9

Albuquerque, J., Bittencourt, I. I., Coelho, J. A. P. M., & Silva, A. P. (2017). Does gender stereotype threat in gamified educational environments cause anxiety? An experimental study. Computers & Education, 115, 161–170. https://doi.org/10.1016/j.compedu.2017.08.005

Albuquerque, J., Rienties, B., Holmes, W., & Hlosta, M. (2025). From hype to evidence: Exploring large language models for inter-group bias classification in higher education. Interactive Learning Environments, 33(3), 2332–2354. https://doi.org/10.1080/10494820.2024.2408554

Allen, I. E., & Seaman, J. (2007). Online nation: Five years of growth in online learning. ERIC. https://files.eric.ed.gov/fulltext/ED529699.pdf

Al-Zawqari, A., & Vandersteen, G. (2023). Fairness in predictive learning analytics: A case study in online STEM education. In Proceedings of the 2023 IEEE Frontiers in Education Conference (FIE 2023), 18–21 October 2023, College Station, Texas, USA (pp. 1–5). IEEE. https://doi.org/10.1109/FIE58773.2023.10343059

Ashong, C. Y., & Commander, N. E. (2012). Ethnicity, gender, and perceptions of online learning in higher education. MERLOT Journal of Online Learning and Teaching, 8(2), 98–110. https://jolt.merlot.org/vol8no2/ashong_0612.pdf

Baker, R. S., & Hawn, A. (2022). Algorithmic bias in education. International Journal of Artificial Intelligence in Education, 32(4), 1052–1092. https://doi.org/10.1007/s40593-021-00285-9

Balica, R. (2018). Big data learning analytics and algorithmic decision-making in digital education governance. Analysis and Metaphysics, 17, 128–133. https://doi.org/10.22381/AM1720187

Bansak, C., & Starr, M. (2021). Covid-19 shocks to education supply: How 200,000 US households dealt with the sudden shift to distance learning. Review of Economics of the Household, 19(1), 63–90. https://doi.org/10.1007/s11150-020-09540-9

Baumeister, R., & Vohs, K. (2007). Ingroup–outgroup bias. In Encyclopedia of social psychology (pp. 484–485). SAGE Publications, Inc. https://doi.org/10.4135/9781412956253.n286

Beaunoyer, E., Dupere, S., & Guitton, M. J. (2020). COVID-19 and digital inequalities: Reciprocal impacts and mitigation strategies. Computers in Human Behavior, 111, 106424. https://doi.org/10.1016/j.chb.2020.106424

Benjamini, Y., & Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society: Series B (Methodological), 57(1), 289–300. https://doi.org/10.1111/j.2517-6161.1995.tb02031.x

Bergstra, J., & Bengio, Y. (2012). Random search for hyper-parameter optimization. Journal of Machine Learning Research, 13(2), 281–305. https://www.jmlr.org/papers/volume13/bergstra12a/bergstra12a.pdf

Berrar, D. (2019). Cross-validation. In S. Ranganathan, M. Gribskov, K. Nakai, & C. Schonbach (Eds.), Encyclopedia of bioinformatics and computational biology (pp. 542–545, Vol. 1). Academic Press. https://doi.org/10.1016/B978-0-12-809633-8.20349-X

Bommasani, R., Hudson, D. A., Adeli, E., Altman, R., Arora, S., von Arx, S., Bernstein, M. S., Bohg, J., Bosselut, A., Brunskill, E., Brynjolfsson, E., Buch, S., Card, D., Castellon, R., Chatterji, N., Chen, A., Creel, K., Davis, J. Q., Demszky, D., . . . Liang, P. (2021). On the opportunities and risks of foundation models. arXiv preprint arXiv:2108.07258. https://doi.org/10.48550/arXiv.2108.07258

Borchers, C., & Baker, R. S. (2025). ABROCA distributions for algorithmic bias assessment: Considerations around interpretation. In Proceedings of the 15th International Conference on Learning Analytics and Knowledge (LAK 2025), 3–7 March 2025, Dublin, Ireland (pp. 837–843). ACM. https://doi.org/10.1145/3706468.3706498

Brewer, M., & Yuki, M. (2007). Culture and social identity [PsycINFO ID: 2007-12976-012]. In S. Kitayama & D. Cohen (Eds.), Handbook of cultural psychology (1st ed., pp. 307–322). The Guilford Press.

Chandrashekar, G., & Sahin, F. (2014). A survey on feature selection methods. Computers & Electrical Engineering, 40(1), 16–28. https://doi.org/10.1016/j.compeleceng.2013.11.024

Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. (2002). SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16, 321–357. https://doi.org/10.1613/jair.953

Chen, F., & Cui, Y. (2020). Utilizing student time series behaviour in learning management systems for early prediction of course performance. Journal of Learning Analytics, 7(2), 1–17. https://doi.org/10.18608/jla.2020.72.1

Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2016), 13–17 August 2016, San Francisco, California, USA (pp. 785–794). ACM. https://doi.org/10.1145/2939672.2939785

Chiu, C. - y., Hong, Y.- y., & Dweck, C. S. (1997). Lay dispositionism and implicit theories of personality. Journal of Personality and Social Psychology, 73(1), 19–30. https://doi.org/10.1037//0022-3514.73.1.19

Choi, J., Karumbaiah, S., & Matayoshi, J. (2025). Bias or insufficient sample size? Improving reliable estimation of algorithmic bias for minority groups. In Proceedings of the 15th International Conference on Learning Analytics and Knowledge (LAK 2025), 3–7 March 2025, Dublin, Ireland (pp. 547–557). ACM. https://doi.org/10.1145/3706468.3706540

Copur-Gencturk, Y., Thacker, I., & Cimpian, J. R. (2022). Teacher bias in the virtual classroom. Computers & Education, 191, 104627. https://doi.org/10.1016/j.compedu.2022.104627

Dai, S., Xu, C., Xu, S., Pang, L., Dong, Z., & Xu, J. (2024). Bias and unfairness in information retrieval systems: New challenges in the LLM era. In Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2024), 25–29 August 2024, Barcelona, Spain (pp. 6437–6447). ACM. https://doi.org/10.1145/3637528.3671458

Daumeyer, N. M., Onyeador, I. N., Brown, X., & Richeson, J. A. (2019). Consequences of attributing discrimination to implicit vs. explicit bias. Journal of Experimental Social Psychology, 84, 103812. https://doi.org/10.1016/j.jesp.2019.04.010

Dawkins, H., Hedgeland, H., & Jordan, S. (2017). Impact of scaffolding and question structure on the gender gap. Physical Review Physics Education Research, 13(2), 020117. https://doi.org/10.1103/PhysRevPhysEducRes.13.020117

Doube, W., & Lang, C. (2012). Gender and stereotypes in motivation to study computer programming for careers in multimedia. Computer Science Education, 22(1), 63–78. https://doi.org/10.1080/08993408.2012.666038

Dovidio, J. F., & Gaertner, S. L. (2010). Intergroup bias. In Handbook of social psychology (pp. 1084–1121). John Wiley & Sons, Inc. https://doi.org/10.1002/9780470561119.socpsy002029

Dutt, A., Ismail, M. A., & Herawan, T. (2017). A systematic review on educational data mining. IEEE Access, 5, 15991–16005. https://doi.org/10.1109/ACCESS.2017.2654247

Ferguson, R. (2012). Learning analytics: Drivers, developments and challenges. International Journal of Technology Enhanced Learning, 4(5/6), 304–317. https://doi.org/10.1504/IJTEL.2012.051816

Fernandez, A., Garcia, S., Herrera, F., & Chawla, N. V. (2018). SMOTE for learning from imbalanced data: Progress and challenges, marking the 15-year anniversary. Journal of Artificial Intelligence Research, 61, 863–905. https://doi.org/10.1613/jair.1.11192

Greenwald, A. G., & Krieger, L. H. (2006). Implicit bias: Scientific foundations. California Law Review, 94(4), 945–967. https://doi.org/10.2307/20439056

Guyon, I., & Elisseeff, A. (2003). An introduction to variable and feature selection. Journal of Machine Learning Research, 3(Mar), 1157–1182. https://dl.acm.org/doi/10.5555/944919.944968

Hollister, B., Nair, P., Hill-Lindsay, S., & Chukoskie, L. (2022). Engagement in online learning: Student attitudes and behavior during COVID-19. Frontiers in Education, 7, 851019. https://doi.org/10.3389/feduc.2022.851019

Holstein, K., & Doroudi, S. (2019). Fairness and equity in learning analytics systems (FairLAK). In Workshop at the Ninth International Conference on Learning Analytics and Knowledge (LAK 2019), 4–8 March 2019, Tempe, Arizona, USA. https://sites.google.com/view/fairlak

Holstein, K., & Doroudi, S. (2022). Equity and artificial intelligence in education. In W. Holmes & K. Porayska-Pomsta (Eds.), The ethics of artificial intelligence in education (pp. 151–173). Routledge. https://www.taylorfrancis.com/chapters/edit/10.4324/9780429329067-9/equity-artificial-intelligence-education-kenneth-holstein-shayan-doroudi

Hutt, S., Baker, R. S., Ashenafi, M. M., Andres-Bray, J. M., & Brooks, C. (2022). Controlled outputs, full data: A privacy-protecting infrastructure for mooc data. British Journal of Educational Technology, 53(4), 756–775. https://doi.org/10.1111/bjet.13231

Jordano, M. L., & Touron, D. R. (2017). Stereotype threat as a trigger of mind-wandering in older adults. Psychology and Aging, 32(3), 307. https://doi.org/10.1037/pag0000167

Karumbaiah, S., & Brooks, J. (2021). How colonial continuities underlie algorithmic injustices in education. In C. Gardner-McCune, S. Grady, Y. Jimenez, J. Ryoo, R. Santo, & J. Payton (Eds.), Proceedings of the 2021 Conference on Research in Equitable and Sustained Participation in Engineering, Computing, and Technology (RESPECT 2021), 23–27 May 2021, online (pp. 1–6). IEEE. https://doi.org/10.1109/RESPECT51740.2021

Kizilcec, R. F., & Lee, H. (2022). Algorithmic fairness in education. In W. Holmes & K. Porayska-Pomsta (Eds.), The ethics of artificial intelligence in education (pp. 174–202). Routledge. https://www.taylorfrancis.com/chapters/edit/10.4324/9780429329067-10/algorithmic-fairness-education-ren

Kumar, D., Jain, U., Agarwal, S., & Harshangi, P. (2024). Investigating implicit bias in large language models: A large-scale study of over 50 LLMs. arXiv preprint arXiv:2410.12864. https://doi.org/10.48550/arXiv.2410.12864

Liao, Q. V., & Varshney, K. R. (2021). Human-centered explainable AI (XAI): From algorithms to user experiences. arXiv preprint arXiv:2110.10790. https://doi.org/10.48550/arXiv.2110.10790

Long, P., Siemens, G., Grainne, C., & Gasevic, D. (2011). 1st International Conference on Learning Analytics and Knowledge. In Proceedings of the First International Conference on Learning Analytics and Knowledge (LAK 2011), 27 February–1 March 2011, Banff, Alberta, Canada (pp. 3–4). ACM. https://dl.acm.org/doi/proceedings/10.1145/2090116

Maass, A. (1999). Linguistic intergroup bias: Stereotype perpetuation through language. In M. P. Zanna (Ed.), Advances in experimental social psychology (pp. 79–121, Vol. 31). Elsevier. https://doi.org/10.1016/S0065-2601(08)60272-5

Mayer, R. E. (2019). How multimedia can improve learning and instruction. In J. Dunlosky & K. A. Rawson (Eds.), The Cambridge handbook of cognition and education (pp. 460–479). Cambridge University Press. https://doi.org/10.1017/9781108235631.019

Mehrabi, N., Morstatter, F., Saxena, N., Lerman, K., & Galstyan, A. (2021). A survey on bias and fairness in machine learning. ACM Computing Surveys (CSUR), 54(6), 1–35. https://doi.org/10.1145/3457607

Mendelsohn, J., Tsvetkov, Y., & Jurafsky, D. (2020). A framework for the computational linguistic analysis of dehumanization. Frontiers in Artificial Intelligence, 3, 55. https://doi.org/10.3389/frai.2020.00055

Mohamed, S., Png, M.- T., & Isaac, W. (2020). Decolonial AI: Decolonial theory as sociotechnical foresight in artificial intelligence. Philosophy & Technology, 33, 659–684. https://doi.org/10.1007/s13347-020-00405-8

Namoun, A., & Alshanqiti, A. (2020). Predicting student performance using data mining and learning analytics techniques: A systematic literature review. Applied Sciences, 11(1), 237. https://doi.org/10.3390/app11010237

Nguyen, A., Ngo, H. N., Hong, Y., Dang, B., & Nguyen, B.- P. T. (2023). Ethical principles for artificial intelligence in education. Education and Information Technologies, 28(4), 4221–4241. https://doi.org/10.1007/s10639-022-11316-w

Nguyen, Q., Rienties, B., & Richardson, J. T. E. (2020). Learning analytics to uncover inequality in behavioural engagement and academic attainment in a distance learning setting. Assessment & Evaluation in Higher Education, 45(4), 594–606. https://doi.org/10.1080/02602938.2019.1679088

Osgood, C. E., Suci, G. J., & Tannenbaum, P. H. (1957). The measurement of meaning. University of Illinois Press. Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., & Duchesnay, E. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825–2830. https://jmlr.org/papers/volume12/pedregosa11a/pedregosa11a.pdf

Pennington, C. R., Heim, D., Levy, A. R., & Larkin, D. T. (2016). Twenty years of stereotype threat research: A review of psychological mediators. PLoS ONE, 11(1), 1–25. https://doi.org/10.1371/journal.pone.0146487

Pryzant, R., Martinez, R. D., Dass, N., Kurohashi, S., Jurafsky, D., & Yang, D. (2020). Automatically neutralizing subjective bias in text. Proceedings of the AAAI Conference on Artificial Intelligence, 34(01), 480–489. https://doi.org/10.1609/aaai.v34i01.5385

Richardson, J. T. E., Mittelmeier, J., & Rienties, B. (2020). The role of gender, social class and ethnicity in participation and academic attainment in UK higher education: An update. Oxford Review of Education, 46(3), 346–362. https://doi.org/10.1080/03054985.2019.1702012

Sabnis, S., Yu, R., & Kizilcec, R. F. (2022). Large-scale student data reveal sociodemographic gaps in procrastination behavior. In Proceedings of the Ninth ACM Conference on Learning at Scale (L@S 2022), 1–3 June 2022, New York, New York, USA (pp. 133–141). ACM. https://doi.org/10.1145/3491140.3528285

Sagi, O., & Rokach, L. (2018). Ensemble learning: A survey. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 8(4), e1249. https://doi.org/10.1002/widm.1249

Sboev, A., Gudovskikh, D., Rybka, R., & Moloshnikov, I. (2015). A quantitative method of text emotiveness evaluation on base of the psycholinguistic markers founded on morphological features. Procedia Computer Science, 66, 307–316. https://doi.org/10.1016/j.procs.2015.11.036

Sedrakyan, G., Malmberg, J., Verbert, K., Järvelä, S., & Kirschner, P. A. (2020). Linking learning behavior analytics and learning science concepts: Designing a learning analytics dashboard for feedback to support learning regulation. Computers in Human Behavior, 107, 105512. https://doi.org/10.1016/j.chb.2018.05.004

Shahjahan, R. A., Estera, A. L., Surla, K. L., & Edwards, K. T. (2022). “Decolonizing” curriculum and pedagogy: A comparative review across disciplines and global higher education contexts. Review of Educational Research, 92(1), 73–113. https://doi.org/10.3102/00346543211042423

Singhal, Y., Jain, A., Batra, S., Varshney, Y., & Rathi, M. (2018). Review of bagging and boosting classification performance on unbalanced binary classification. In A. Goswami (Ed.), Proceedings of the 2018 IEEE Eighth International Advance Computing Conference (IACC 2018), 14–15 December 2018, Delhi, India (pp. 338–343). IEEE. https://doi.org/10.1109/IADCC.2018.8692138

Skopec, M., Fyfe, M., Issa, H., Ippolito, K., Anderson, M., & Harris, M. (2021). Decolonization in a higher education STEM institution—is “epistemic fragility” a barrier? London Review of Education, 19(1), 1–21. https://doi.org/10.14324/LRE.19.1.18

Sloan-Lynch, J., & Morse, R. (2024). Equity-forward learning analytics: Designing a dashboard to support marginalized student success. In Proceedings of the 14th International Conference on Learning Analytics and Knowledge (LAK 2024), 18–22 March 2024, Tokyo, Japan (pp. 1–11). ACM. https://doi.org/10.1145/3636555.3636844

Tajfel, H. (1970). Experiments in intergroup discrimination. Scientific American, 223(5), 96–102. https://doi.org/10.1038/scientificamerican1170-96

Tajfel, H., Billig, M. G., Bundy, R. P., & Flament, C. (1971). Social categorization and intergroup behaviour. European Journal of Social Psychology, 1(2), 149–178. https://doi.org/10.1002/ejsp.2420010202

Tajfel, H., Turner, J. C., Austin, W. G., & Worchel, S. (2000). An integrative theory of intergroup conflict. In M. J. Hatch & M. Schultz (Eds.), Organizational identity: A reader (pp. 56–65). Oxford University Press. https://doi.org/10.1093/oso/9780199269464.003.0005

Tate, T., & Warschauer, M. (2022). Equity in online learning. Educational Psychologist, 57(3), 192–206. https://doi.org/10.1080/00461520.2022.2062597

The Pandas development team. (2020). Pandas-dev/pandas: Pandas. https://github.com/pandas-dev/pandas

Tincher, M. M., Lebois, L. A. M., & Barsalou, L. W. (2016). Mindful attention reduces linguistic intergroup bias. Mindfulness, 7(2), 349–360. https://doi.org/10.1007/s12671-015-0450-3

Yu, R., Lee, H., & Kizilcec, R. F. (2021). Should college dropout prediction models include protected attributes? In Proceedings of the Eighth ACM Conference on Learning at Scale (L@S 2021), 22–25 June 2021, online (pp. 91–100). ACM. https://doi.org/10.1145/3430895.3460139

Learning Analytics to Uncover Ethnic Bias in Educational Texts

An Ensemble Learning Approach

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Most read articles by the same author(s)