Multi-Objective Optimization in Machine Learning:Balancing Accuracy, Fairness, and Interpretability
Abstract
Machine Learning (ML) systems are frequently evaluated and optimized for predictive performance, yet real-world deployment increasingly requires simultaneous attention to fairness and interpretability. These requirements introduce objectives that can conflict in both theory and practice: Improving accuracy can exacerbate disparities across protected groups; enforcing fairness constraints can reduce utility or shift error burdens; and enhancing interpretability can restrict hypothesis classes or encourage post-hoc explanations whose fidelity is uncertain. This article treats this as a Multi-Objective Optimization (MOO) problem, emphasizing Pareto optimality, the structure of trade-offs, and decision-making on the resulting Pareto set. We review core fairness definitions and metrics, interpretability concepts and pitfalls, and MOO methods used to manage competing objectives, such as including scalarization, constrained learning, and evolutionary approaches. We then propose an evaluation and reporting framework centered on transparent visualization of trade-offs (pareto fronts and fairness-utility curves), careful metric selection (including dominance-based indicators), and documentation practices inspired by model reporting standards.
Keywords:
Multi-objective optimization, Pareto optimality, Fairness, Demographic parity, Equalized oddsReferences
- [1] Hardt, M., Price, E., & Srebro, N. (2016). Equality of opportunity in supervised learning. 30th conference on neural information processing systems (NIPS 2016) (PP. 1-9). Neural Information Processing Systems Foundation. https://proceedings.neurips.cc/paper_files/paper/2016/file/6a9659feb1216f14f7384ba499518b38-Paper.pdf
- [2] Kleinberg, J. M., Mullainathan, S., & Raghavan, M. (2016). Inherent trade-offs in the fair determination of risk scores. Information technology convergence and services (pp. 43:1-43:23). Schloss Dagstuhl – Leibniz-Zentrum für Informatik. https://doi.org/10.4230/LIPIcs.ITCS.2017.43
- [3] Rudin, C. (2019). Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature machine intelligence, 1(5), 206–215. https://doi.org/10.1038/s42256-019-0048-x
- [4] Marler, R. T., & Arora, J. S. (2004). Survey of multi-objective optimization methods for engineering. Structural and multidisciplinary optimization, 26(6), 369–395. https://doi.org/10.1007/s00158-003-0368-6
- [5] Chouldechova, A. (2017). Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big data, 5(2), 153–163. https://doi.org/10.1089/big.2016.0047
- [6] Pleiss, G., Raghavan, M., Wu, F., Kleinberg, J., & Weinberger, K. Q. (2017). On fairness and calibration. Advances in neural information processing systems (pp. 5680–5689). Curran Associates, Inc. https://proceedings.neurips.cc/paper/2017/hash/b8b9c74ac526fffbeb2d39ab038d1cd7-Abstract.html
- [7] EU AI Act. (2024). Regulation (EU) 2024/2847 of the european parliament and of the council. http://hctinsight.com/webzine/webzine/202501/file/ce/ce5.pdf
- [8] Barocas, S., Hardt, M., & Narayanan, A. (2023). Fairness and machine learning: Limitations and opportunities. MIT Press. https://books.google.com/books?id=HuGwEAAAQBAJ
- [9] Dwork, C., Hardt, M., Pitassi, T., Reingold, O., & Zemel, R. (2012). Fairness through awareness. Proceedings of the 3rd innovations in theoretical computer science conference (pp. 214–226). Association for Computing Machinery. https://doi.org/10.1145/2090236.2090255
- [10] Agarwal, A., Beygelzimer, A., Dudik, M., Langford, J., & Wallach, H. (2018). A reductions approach to fair classification. Proceedings of the 35th international conference on machine learning (Vol. 80, pp. 60–69). PMLR. https://proceedings.mlr.press/v80/agarwal18a.html
- [11] Lipton, Z. C. (2018). The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue, 16(3), 31-57. https://doi.org/10.1145/3233231
- [12] Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). Why should i trust you? Explaining the predictions of any classifier. Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining (pp. 1135–1144). Association for Computing Machinery. https://doi.org/10.1145/2939672.2939778
- [13] Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. Proceedings of the 31st international conference on neural information processing systems (pp. 4768–4777). Curran Associates Inc. https://proceedings.neurips.cc/paper_files/paper/2017/file/8a20a8621978632d76c43dfd28b67767-Paper.pdf
- [14] Slack, D., Hilgard, S., Jia, E., Singh, S., & Lakkaraju, H. (2020). Fooling lime and shap: Adversarial attacks on post HOC explanation methods. Proceedings of the AAAI/ACM conference on AI, ethics, and society (pp. 180–186). Association for Computing Machinery. https://doi.org/10.1145/3375627.3375830
- [15] Jain, S., & Wallace, B. C. (2019). Attention is not explanation. Proceedings of the 2019 conference of the north American chapter of the association for computational linguistics: Human language technologies (long and short papers). Association for Computational Linguistics. (pp. 3543-3556). https://doi.org/10.18653/v1/N19-1357
- [16] Deb, K., Pratap, A., Agarwal, S., & Meyarivan, T. (2002). A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE transactions on evolutionary computation, 6(2), 182–197. https://doi.org/10.1109/4235.996017
- [17] Zhang, Q., & Li, H. (2008). MOEA/D: A multiobjective evolutionary algorithm based on decomposition. Evolutionary computation, IEEE transactions on, 11, 712–731. https://doi.org/10.1109/TEVC.2007.892759
- [18] Mahapatra, D., & Rajan, V. (2020). Multi-task learning with user preferences: Gradient descent with controlled ascent in pareto optimization. Proceedings of the 37th international conference on machine learning (pp. 6597–6607). Journal of Machine Learning Research (JMLR). https://doi.org/10.5555/3524938.3525550
- [19] Xi Lin, Hui-Ling Zhen, Zhenhua Li, Qing-Fu Zhang, S. K. (2019). Pareto multi-task learning. Advances in neural information processing systems (pp. 12037–12047). Neural Information Processing Systems Foundation (NeurIPS Foundation). https://proceedings.neurips.cc/paper_files/paper/2019/hash/685bfde03eb646c27ed565881917c71c-Abstract.html
- [20] Mitchell, M., Wu, S., Zaldivar, A., Barnes, P., Vasserman, L., Hutchinson, B., … & Gebru, T. (2019). Model cards for model reporting. Proceedings of the conference on fairness, accountability, and transparency (pp. 220–229). Association for Computing Machinery. https://doi.org/10.1145/3287560.3287596
- [21] Gebru, T., Morgenstern, J., Vecchione, B., Vaughan, J. W., Wallach, H., Iii, H. D., & Crawford, K. (2021). Datasheets for datasets. Communications of the ACM, 64(12), 86–92. https://dx.doi.org/10.1145/3458723