Fairness in
Machine Learning

by Novi Quadrianto

with thanks to Oliver Thomas and Thomas Kehrenberg

 

 

Algorithmic fairness definitions

Confusion Tables

blue Applicants

Accepted Not
Actually Graduate
Don't Graduate

green Applicants

Accepted Not
Actually Graduate
Don't Graduate

Solving this problem with statistical parity fairness metric?

???
Select 50% of applicants of both blue and green applicants

blue Applicants

Accepted Not
Actually Graduate 4000 (80%) 1200
Don't Graduate 1000 (20%) 3800
5000

green Applicants

Accepted Not
Actually Graduate 3300 500 (10%)
Don't Graduate 1700 4500 (90%)
5000

10% of qualified blue applicants are being rejected whilst an additional 10% of unqualified green are being accepted

Solving this problem with equality of opportunity fairness metric?

???
Select 55.5% of blue applicants and 44.5% of green applicants, giving a TPR of 85.4% for both groups.

blue Applicants

Accepted Not
Actually Graduate 4440 760
Don't Graduate 1110 3690
5550

green Applicants

Accepted Not
Actually Graduate 3245 555
Don't Graduate 1205 4995
4450

4.5% of qualified blue applicants are being rejected whilst an additional 4.5% of unqualified green are being accepted

Solving this problem with predictive parity fairness metric?

???
Select only the applicants who pass the test

blue Applicants

Accepted Not
Actually Graduate 4800 400
Don't Graduate 1200 3600
6000

green Applicants

Accepted Not
Actually Graduate 3200 600
Don't Graduate 800 5400
4000

Could lead to systemic reinforcement of bias

 

 

Algorithmic fairness methods

Problems with doing this?

Any Ideas?

 

 

Interpretability in fairness

Fair and interpretable representations

- Analysis on the relationship feature on Adult Income dataset

- Feature values of the minority group are transformed to match the majority group

- Here, the wife value is translated to husband

Fair and interpretable representations

Interpretable can be fair!

original $X$ fair & interpretable $X$ latent embedding $Z$
Accuracy $\uparrow$ Eq. Opp $\downarrow$ Accuracy $\uparrow$ Eq. Opp $\downarrow$ Accuracy $\uparrow$ Eq. Opp $\downarrow$
LR $85.1\pm0.2$ $\mathbf{9.2\pm2.3}$ $84.2\pm0.3$ $\mathbf{5.6\pm2.5}$ $81.8\pm2.1$ $\mathbf{5.9\pm4.6}$
SVM $85.1\pm0.2$ $\mathbf{8.2\pm2.3}$ $84.2\pm0.3$ $\mathbf{4.9\pm2.8}$ $81.9\pm2.0$ $\mathbf{6.7\pm4.7}$
Fair Reduction LR $85.1\pm0.2$ $\mathbf{14.9\pm1.3}$ $84.1\pm0.3$ $\mathbf{6.5\pm3.2}$ $81.8\pm2.1$ $\mathbf{5.6\pm4.8}$
Fair Reduction SVM $85.1\pm0.2$ $\mathbf{8.2\pm2.3}$ $84.2\pm0.3$ $\mathbf{4.9\pm2.8}$ $81.9\pm2.0$ $\mathbf{6.7\pm4.7}$
Kamiran & Calders LR $84.4\pm0.2$ $\mathbf{14.9\pm1.3}$ $84.1\pm0.3$ $\mathbf{1.7\pm1.3}$ $81.8\pm2.1$ $\mathbf{4.9\pm3.3}$
Kamiran & Calders SVM $85.1\pm0.2$ $\mathbf{8.2\pm2.3}$ $84.2\pm0.3$ $\mathbf{4.9\pm2.8}$ $81.9\pm2.0$ $\mathbf{6.7\pm4.7}$
Zafar et al. $85.0\pm0.3$ $\mathbf{1.8\pm0.9}$ --- --- --- ---

Homework

Practical Session

https://tinyurl.com/ethicml

Further Resources

Google Crash Course: Fairness in ML

https://developers.google.com/machine-learning/crash-course/fairness

Fast.ai lecture with Fairness discussion

http://course18.fast.ai/lessons/lesson13.html