Multi-Objective Optimization in Machine Learning:Balancing Accuracy, Fairness, and Interpretability

Authors

https://doi.org/10.48313/maa.v2i3.55

Abstract

Machine Learning (ML) systems are frequently evaluated and optimized for predictive performance, yet real-world deployment increasingly requires simultaneous attention to fairness and interpretability. These requirements introduce objectives that can conflict in both theory and practice: Improving accuracy can exacerbate disparities across protected groups; enforcing fairness constraints can reduce utility or shift error burdens; and enhancing interpretability can restrict hypothesis classes or encourage post-hoc explanations whose fidelity is uncertain. This article treats this as a Multi-Objective Optimization (MOO) problem, emphasizing Pareto optimality, the structure of trade-offs, and decision-making on the resulting Pareto set. We review core fairness definitions and metrics, interpretability concepts and pitfalls, and MOO methods used to manage competing objectives, such as including scalarization, constrained learning, and evolutionary approaches. We then propose an evaluation and reporting framework centered on transparent visualization of trade-offs (pareto fronts and fairness-utility curves), careful metric selection (including dominance-based indicators), and documentation practices inspired by model reporting standards.

Keywords:

Multi-objective optimization, Pareto optimality, Fairness, Demographic parity, Equalized odds

References

  1. [1] Hardt, M., Price, E., & Srebro, N. (2016). Equality of opportunity in supervised learning. 30th conference on neural information processing systems (NIPS 2016) (PP. 1-9). Neural Information Processing Systems Foundation. https://proceedings.neurips.cc/paper_files/paper/2016/file/6a9659feb1216f14f7384ba499518b38-Paper.pdf

  2. [2] Kleinberg, J. M., Mullainathan, S., & Raghavan, M. (2016). Inherent trade-offs in the fair determination of risk scores. Information technology convergence and services (pp. 43:1-43:23). Schloss Dagstuhl – Leibniz-Zentrum für Informatik. https://doi.org/10.4230/LIPIcs.ITCS.2017.43

  3. [3] Rudin, C. (2019). Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature machine intelligence, 1(5), 206–215. https://doi.org/10.1038/s42256-019-0048-x

  4. [4] Marler, R. T., & Arora, J. S. (2004). Survey of multi-objective optimization methods for engineering. Structural and multidisciplinary optimization, 26(6), 369–395. https://doi.org/10.1007/s00158-003-0368-6

  5. [5] Chouldechova, A. (2017). Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big data, 5(2), 153–163. https://doi.org/10.1089/big.2016.0047

  6. [6] Pleiss, G., Raghavan, M., Wu, F., Kleinberg, J., & Weinberger, K. Q. (2017). On fairness and calibration. Advances in neural information processing systems (pp. 5680–5689). Curran Associates, Inc. https://proceedings.neurips.cc/paper/2017/hash/b8b9c74ac526fffbeb2d39ab038d1cd7-Abstract.html

  7. [7] EU AI Act. (2024). Regulation (EU) 2024/2847 of the european parliament and of the council. http://hctinsight.com/webzine/webzine/202501/file/ce/ce5.pdf

  8. [8] Barocas, S., Hardt, M., & Narayanan, A. (2023). Fairness and machine learning: Limitations and opportunities. MIT Press. https://books.google.com/books?id=HuGwEAAAQBAJ

  9. [9] Dwork, C., Hardt, M., Pitassi, T., Reingold, O., & Zemel, R. (2012). Fairness through awareness. Proceedings of the 3rd innovations in theoretical computer science conference (pp. 214–226). Association for Computing Machinery. https://doi.org/10.1145/2090236.2090255

  10. [10] Agarwal, A., Beygelzimer, A., Dudik, M., Langford, J., & Wallach, H. (2018). A reductions approach to fair classification. Proceedings of the 35th international conference on machine learning (Vol. 80, pp. 60–69). PMLR. https://proceedings.mlr.press/v80/agarwal18a.html

  11. [11] Lipton, Z. C. (2018). The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue, 16(3), 31-57. https://doi.org/10.1145/3233231

  12. [12] Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). Why should i trust you? Explaining the predictions of any classifier. Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining (pp. 1135–1144). Association for Computing Machinery. https://doi.org/10.1145/2939672.2939778

  13. [13] Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. Proceedings of the 31st international conference on neural information processing systems (pp. 4768–4777). Curran Associates Inc. https://proceedings.neurips.cc/paper_files/paper/2017/file/8a20a8621978632d76c43dfd28b67767-Paper.pdf

  14. [14] Slack, D., Hilgard, S., Jia, E., Singh, S., & Lakkaraju, H. (2020). Fooling lime and shap: Adversarial attacks on post HOC explanation methods. Proceedings of the AAAI/ACM conference on AI, ethics, and society (pp. 180–186). Association for Computing Machinery. https://doi.org/10.1145/3375627.3375830

  15. [15] Jain, S., & Wallace, B. C. (2019). Attention is not explanation. Proceedings of the 2019 conference of the north American chapter of the association for computational linguistics: Human language technologies (long and short papers). Association for Computational Linguistics. (pp. 3543-3556). https://doi.org/10.18653/v1/N19-1357

  16. [16] Deb, K., Pratap, A., Agarwal, S., & Meyarivan, T. (2002). A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE transactions on evolutionary computation, 6(2), 182–197. https://doi.org/10.1109/4235.996017

  17. [17] Zhang, Q., & Li, H. (2008). MOEA/D: A multiobjective evolutionary algorithm based on decomposition. Evolutionary computation, IEEE transactions on, 11, 712–731. https://doi.org/10.1109/TEVC.2007.892759

  18. [18] Mahapatra, D., & Rajan, V. (2020). Multi-task learning with user preferences: Gradient descent with controlled ascent in pareto optimization. Proceedings of the 37th international conference on machine learning (pp. 6597–6607). Journal of Machine Learning Research (JMLR). https://doi.org/10.5555/3524938.3525550

  19. [19] Xi Lin, Hui-Ling Zhen, Zhenhua Li, Qing-Fu Zhang, S. K. (2019). Pareto multi-task learning. Advances in neural information processing systems (pp. 12037–12047). Neural Information Processing Systems Foundation (NeurIPS Foundation). https://proceedings.neurips.cc/paper_files/paper/2019/hash/685bfde03eb646c27ed565881917c71c-Abstract.html

  20. [20] Mitchell, M., Wu, S., Zaldivar, A., Barnes, P., Vasserman, L., Hutchinson, B., … & Gebru, T. (2019). Model cards for model reporting. Proceedings of the conference on fairness, accountability, and transparency (pp. 220–229). Association for Computing Machinery. https://doi.org/10.1145/3287560.3287596

  21. [21] Gebru, T., Morgenstern, J., Vecchione, B., Vaughan, J. W., Wallach, H., Iii, H. D., & Crawford, K. (2021). Datasheets for datasets. Communications of the ACM, 64(12), 86–92. https://dx.doi.org/10.1145/3458723

Published

2025-06-19

How to Cite

Khanjani, A. (2025). Multi-Objective Optimization in Machine Learning:Balancing Accuracy, Fairness, and Interpretability. Metaheuristic Algorithms With Applications, 2(3), 344–352. https://doi.org/10.48313/maa.v2i3.55