Novi Quadrianto
Reader in Machine Learning
Department of Informatics, University of Sussex
Machine learning systems are being implemented in all walks of life
Picture credit: Kevin Hong |
![]() Picture credit: AlgorithmWatch |
![]() Picture credit: Centre for Data Ethics and Innovation, UK |
| Social credit system, China | Personal budget calculation, UK | Financial services, Crime and justice, |
| Loan decision, Finland | Recruitment, | |
| etc. | Local government |
Experiments on CelebA dataset: $202,599$ celebrity face images with $40$ attributes, gender as the binary protected attribute, the attribute smiling as the classification task
| Acc. | TPR Diff. | TPR male | TPR female | |
| original image representation $\mathbf{x}\in\mathbb{R}^{2048}$ | 89.70 | 7.54 | 92.03 | 84.50 |
| fair image representation in the data domain $T_{\omega}(\mathbf{x})\in\mathbb{R}^{2048}$ | 91.31 | 4.76 | 91.85 | 87.09 |
| Non-spurious | ![]() |
Spurious | ![]() |
||
![]() |
Disentangling the latent space (c.f. on the reconstruction space) into two components:
|
Original data |
![]() Non-Spurious |
![]() Spurious |
• Gender as the protected attribute
• Unfortunately, the model lightens the skintone when gender-neutralising male faces
• We use gender and age as the two protected attributes.
• We use smiling as the classification task.
| method | Acc. | TPR Diff. | FPR Diff. |
| logistic regression (original) | 89.71 | 6.69 | 6.40 |
| logistic regression (original and GAN contrastive) | 88.94 | 3.50 | 2.79 |
| logistic regression (original and NN contrastive) | 88.78 | 3.32 | 3.53 |
| $\ddagger$logistic regression (original and GAN contrastive with output consistency) | 94.15 | 3.51 | 2.18 |
$\ddagger$: Rejection learning -- classifier only makes a prediction if there is an agreement between original and contrastive examples (occurs in $17,237$ out of $20,000$ test examples, i.e. $86.185\%$).