# Dealing with Gender Bias Issues in Data-Algorithmic Processes: A Social-Statistical Perspective

## 1. Introduction

## 2. The Algorithm Concept

#### 2.1. Algorithm Concept in Science and Engineering

#### 2.2. Algorithm Concept in Social Sciences

## 3. Data-Algorithmic Bias: Definitions and Classifications

## 4. Examples of Gender Bias

#### 4.1. Natural Language Processing and Generation

#### 4.2. Speech Recognition

#### 4.3. Decision Management

#### 4.4. Face Recognition

## 5. Datasets with Gender Bias

## 6. Initiatives to Address Gender Bias

#### 6.1. Private Initiatives

#### 6.2. International Organizations

## 7. An Illustrative Numerical Example

## 8. Recommendations to Prevent, Identify, and Mitigate Gender Bias

- Preventing gender bias: (i) ensure a reasonable representation of both genders among each category of experts working in the design, implementation, validation, and documentation of algorithms; (ii) set a reasonable gender distribution among each category of experts working in the extraction/collection, pre-processing, and analysis of data; (iii) incorporate at least one expert in data-algorithmic bias into the group; and (iv) train all staff (male/female/non-binary) in gender bias (and in approaches to prevent, avoid, detect, and correct it).
- Identifying gender bias: (i) be transparent regarding the composition of the working group (gender distribution and expertise in ethics and data-algorithmic bias), the strategies implemented to mitigate bias, and the results of the tests implemented to detect potential bias; (ii) assess and publish the limitations regarding gender bias; (iii) improve the interpretability of ‘black-box’ models; and (iv) periodically analyze the use and results of the algorithms employed.
- Mitigating gender bias: (i) avoid reusing data and pre-trained models with gender bias that cannot be corrected; (ii) apply methods to obtain a balanced dataset if needed [49], and measure accuracy levels separately for each gender; (iii) assess different fairness-based measures to choose which ones are most suitable in a particular case; (iv) test different algorithms (and parameter configurations) to find which one outperforms the others (benchmark instances or datasets with biases are available in the literature to assess new algorithms); (v) modify the dataset to mitigate gender bias, relying on domain-specific experts; (vi) document and store previous experiences in which bias has been detected in a dataset, together with how it was mitigated (as noted earlier, gender bias tends to be recurrent in some specific fields); and (vii) implement approaches to remove unwanted gender-related features from intermediate representations in deep learning models.
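As an illustration of mitigation item (ii), the following minimal sketch measures accuracy separately for each gender and balances a dataset by random oversampling. It is only one possible approach, not necessarily the method of [49]. The function names `accuracy_by_group` and `oversample_to_balance`, the `"gender"` key, and the toy labels are all hypothetical and chosen for illustration.

```python
import random
from collections import Counter, defaultdict


def accuracy_by_group(groups, y_true, y_pred):
    """Return {group: accuracy}, computed separately for each group label."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for g, t, p in zip(groups, y_true, y_pred):
        total[g] += 1
        correct[g] += int(t == p)
    return {g: correct[g] / total[g] for g in total}


def oversample_to_balance(records, key, seed=0):
    """Randomly duplicate records of the smaller groups until all groups
    have as many records as the largest one (random oversampling)."""
    rng = random.Random(seed)
    by_group = defaultdict(list)
    for r in records:
        by_group[r[key]].append(r)
    target = max(len(rows) for rows in by_group.values())
    balanced = []
    for rows in by_group.values():
        balanced.extend(rows)
        # Draw the missing records with replacement from the same group.
        balanced.extend(rng.choices(rows, k=target - len(rows)))
    return balanced


# Toy data (hypothetical): group label, true outcome, predicted outcome.
groups = ["F", "F", "M", "M", "M", "M"]
y_true = [1, 0, 1, 1, 0, 1]
y_pred = [1, 1, 1, 1, 0, 0]

acc = accuracy_by_group(groups, y_true, y_pred)
print(acc)  # {'F': 0.5, 'M': 0.75}

records = [{"gender": g, "label": t} for g, t in zip(groups, y_true)]
balanced = oversample_to_balance(records, "gender")
counts = Counter(r["gender"] for r in balanced)
print(counts["F"], counts["M"])  # 4 4
```

A gap between the per-group accuracies, as in the toy data above, is the kind of signal that an aggregate accuracy figure would hide. Oversampling is the simplest balancing strategy; undersampling or reweighting may be preferable depending on the dataset size.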

## 9. Conclusions

**Figure 2.** Decision-making process and decomposition of algorithms into their characteristics and components.

| # | G | R | S | A? | # | G | R | S | A? |
|---|---|---|---|---|---|---|---|---|---|
| 1 | M | O | 75 | Y | 47 | M | W | 65 | Y |
| 2 | M | O | 70 | Y | 48 | F | O | 35 | N |
| 3 | F | O | 55 | Y | 49 | M | O | 55 | Y |
| 4 | F | O | 25 | Y | 50 | M | O | 80 | Y |
| 5 | M | O | 60 | Y | 51 | M | O | 55 | Y |
| 6 | M | O | 50 | Y | 52 | F | W | 85 | Y |
| 7 | M | O | 65 | N | 53 | F | W | 60 | Y |
| 8 | M | W | 25 | Y | 54 | F | O | 65 | Y |
| 9 | M | W | 20 | Y | 55 | M | W | 67 | Y |
| 10 | M | W | 77 | Y | 56 | M | O | 60 | N |
| 11 | F | W | 55 | N | 57 | M | W | 65 | Y |
| 12 | M | W | 60 | Y | 58 | F | O | 75 | N |
| 13 | F | O | 62 | N | 59 | M | W | 35 | Y |
| 14 | M | W | 70 | Y | 60 | F | O | 25 | Y |
| 15 | M | W | 45 | Y | 61 | M | O | 70 | N |
| 16 | M | W | 40 | Y | 62 | F | O | 65 | N |
| 17 | F | O | 40 | Y | 63 | F | O | 51 | Y |
| 18 | F | O | 45 | Y | 64 | M | W | 75 | Y |
| 19 | F | W | 35 | Y | 65 | M | W | 73 | Y |
| 20 | M | W | 80 | Y | 66 | M | O | 79 | N |
| 21 | M | O | 45 | Y | 67 | M | O | 92 | Y |
| 22 | M | O | 58 | Y | 68 | M | O | 60 | Y |
| 23 | M | O | 85 | Y | 69 | M | W | 85 | N |
| 24 | F | W | 30 | Y | 70 | M | O | 95 | Y |
| 25 | M | O | 75 | N | 71 | M | W | 85 | Y |
| 26 | M | W | 95 | Y | 72 | F | W | 84 | N |
| 27 | F | O | 85 | Y | 73 | M | W | 95 | Y |
| 28 | M | O | 77 | N | 74 | M | O | 97 | Y |
| 29 | F | O | 94 | N | 75 | M | O | 90 | Y |
| 30 | M | O | 90 | Y | 76 | F | O | 80 | N |
| 31 | M | O | 99 | N | 77 | M | W | 90 | Y |
| 32 | M | W | 70 | Y | 78 | M | O | 97 | N |
| 33 | F | O | 65 | N | 79 | M | W | 93 | Y |
| 34 | F | W | 103 | Y | 80 | M | O | 100 | Y |
| 35 | M | O | 90 | Y | 81 | M | W | 113 | Y |
| 36 | M | W | 25 | Y | 82 | M | W | 100 | Y |
| 37 | M | W | 60 | Y | 83 | M | W | 65 | Y |
| 38 | F | O | 45 | Y | 84 | M | O | 105 | Y |
| 39 | M | W | 60 | Y | 85 | M | O | 99 | N |
| 40 | M | W | 0 | Y | 86 | F | W | 107 | Y |
| 41 | F | W | 65 | Y | 87 | M | O | 120 | N |
| 42 | F | W | 70 | Y | 88 | F | W | 90 | N |
| 43 | M | W | 60 | Y | 89 | M | W | 82 | Y |
| 44 | M | W | 60 | Y | 90 | M | O | 105 | Y |
| 45 | M | W | 65 | Y | 91 | M | O | 65 | N |
| 46 | F | W | 60 | Y | 92 | M | W | 107 | Y |
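
A table of this shape lends itself to a simple group-wise check. The sketch below is only an illustration under stated assumptions: it assumes that column G encodes gender (M/F) and that column A? is a yes/no outcome, neither of which is defined in the table itself. It uses only rows 1 to 13 as a sample, and the names `rows` and `positive_rate_by_group` are hypothetical.

```python
from collections import defaultdict

# (G, A?) pairs copied from rows 1-13 of the table above.
# Assumption: G is gender and A? is a yes/no outcome.
rows = [
    ("M", "Y"), ("M", "Y"), ("F", "Y"), ("F", "Y"), ("M", "Y"),
    ("M", "Y"), ("M", "N"), ("M", "Y"), ("M", "Y"), ("M", "Y"),
    ("F", "N"), ("M", "Y"), ("F", "N"),
]


def positive_rate_by_group(pairs, positive="Y"):
    """Return {group: share of records whose outcome equals `positive`}."""
    hits = defaultdict(int)
    total = defaultdict(int)
    for group, outcome in pairs:
        total[group] += 1
        hits[group] += int(outcome == positive)
    return {g: hits[g] / total[g] for g in total}


rates = positive_rate_by_group(rows)
# Ratio of the two group rates; values far below 1 suggest a disparity
# worth investigating (often called a disparate impact ratio).
ratio = rates["F"] / rates["M"]
print(rates, round(ratio, 4))
```

On this 13-row sample, the "Y" rate is 2/4 for F and 8/9 for M. The sample is too small to support any conclusion about the full table; the point is only to show how such a rate comparison can be set up.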

