Amit Dhurandhar, Karthikeyan Shanmugam, et al.
ICML 2020
The deployment of machine learning (ML) systems in applications with societal impact has motivated the study of fairness for marginalized groups. Often, the protected attribute is absent from the training dataset for legal reasons. However, datasets still contain proxy attributes that capture protected information and can inject unfairness in the ML model. Some deployed systems allow auditors, decision makers, or affected users to report issues or seek recourse by flagging individual samples. In this work, we examine such systems and consider a feedback-based framework where the protected attribute is unavailable and the flagged samples are indirect knowledge. The reported samples are used as guidance to identify the proxy attributes that are causally dependent on the (unknown) protected attribute. We work under the causal interventional fairness paradigm. Without requiring the underlying structural causal model a priori, we propose an approach that performs conditional independence tests on observed data to identify such proxy attributes. We theoretically prove the optimality of our algorithm, bound its complexity, and complement it with an empirical evaluation demonstrating its efficacy on various real-world and synthetic datasets.
Amit Dhurandhar, Karthikeyan Shanmugam, et al.
ICML 2020
Kanthi Sarpatwar, Venkata Sitaramagiridharganesh Ganapavarapu, et al.
CVPRW 2019
Hamid Dadkhahi, Jesus Rios, et al.
AAAI 2022
Swapnaja Achintalwar, Ioana Baldini, et al.
IEEE Internet Computing