Machine Learning Applied to Improve Prevention of, Response to, and Understanding of Violence Against Women
Abstract
:1. Introduction
2. Literature Review
2.1. Integration and Practical Application of Predictive Models
2.2. Identification of Risk Factors and Prediction of Incidents
2.3. Understanding Victims’ Experiences
2.4. Development of Intervention Strategies and Public Policies
3. Opportunity Niche
4. Methodology
4.1. Description of the Safe Woman Platform
4.2. Description of the Data Set
4.3. Feature Analysis
- The Chi-square test assessed the independence between categorical variables and the target variable. The test statistic was computed as [32]
- Analysis of Variance (ANOVA) evaluated whether the mean of numerical features differed significantly between classes [33]. The F-statistic was computed as follows:Features with a p-value greater than 0.05 were deemed non-discriminative.
4.4. Models Description
5. Results and Analysis
Performance Evaluation
- Number of trees (n_estimators): 50, 100, 200
- Maximum tree depth (max_depth): None, 10, 20, 30
- Minimum samples to split a node (min_samples_split): 2, 5, 10
- Minimum samples per leaf (min_samples_leaf): 1, 2, 4
- Bootstrap sampling (bootstrap): True, False
- n_estimators: 100
- max_depth: None
- min_samples_split: 5
- min_samples_leaf: 2
- bootstrap: True
- Number of boosting stages (n_estimators): 50, 100, 200
- Learning rate (learning_rate): 0.01, 0.1, 0.2
- Maximum depth of trees (max_depth): 3, 5, 7
- Minimum samples to split a node (min_samples_split): 2, 5, 10
- Minimum samples per leaf (min_samples_leaf): 1, 2, 4
- n_estimators: 100
- learning_rate: 0.1
- max_depth: 5
- min_samples_split: 5
- min_samples_leaf: 2
6. Discussion
7. Conclusions and Future Work
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Pokhriyal, D.; Bahuguna, R.; Memoria, M.; Kumar, R. Artificial Intelligence Based Legal Application For Combating Domestic Violence. In Proceedings of the 2023 International Conference on Advancement in Computation & Computer Technologies (InCACCT), Gharuan, India, 5–6 May 2023; pp. 81–86. [Google Scholar] [CrossRef]
- Rubio Rodríguez, G.A.; Alejandra, T.; Ospina, M.; Guillermo, J.; Cardozo, A.; Wilfredo Méndez González, D.; Milena, E.; Soza, V.; Rodríguez, R.; González, M. The Facts Associated with Violence Against Women by a Spouse; Technical Report; Revista Conrado: Cienfuegos, Cuba, 2021. [Google Scholar]
- Women, U.N. Facts and Figures: Ending Violence Against Women and Girls; UN Women: New York, NY, USA, 2023. [Google Scholar]
- Chen, Y.; Mendes, K.; Gosse, C.; Hodson, J.; Veletsianos, G. Canadian Gender-Based Violence Prevention Programs: Gaps and Opportunities. Violence Against Women 2024, 10778012241259727. [Google Scholar] [CrossRef] [PubMed]
- William, J.; Loong, B.; Hanna, D.; Parkinson, B.; Loxton, D. Lifetime health costs of intimate partner violence: A prospective longitudinal cohort study with linked data for out-of-hospital and pharmaceutical costs. Econ. Model. 2022, 116, 106013. [Google Scholar] [CrossRef]
- United Nations Economic Commission for Latin America and the Caribbean. Violence Against Indigenous Women in Latin America; United Nations Economic Commission for Latin America and the Caribbean: Santiago, Chile, 2024. [Google Scholar]
- Wands, Z.E.; Mirzoev, T. Intimate Partner Violence Against Indigenous Women in Sololá, Guatemala: Qualitative Insights Into Perspectives of Service Providers. Violence Against Women 2022, 28, 150–168. [Google Scholar] [CrossRef] [PubMed]
- Matias, A.; Gonçalves, M.; Soeiro, C.; Matos, M. Intimate Partner Homicide in Portugal: What Are the (As)Symmetries Between Men and Women? Eur. J. Crim. Policy Res. 2021, 27, 471–494. [Google Scholar] [CrossRef]
- López Ricoy, A.; Andrews, A.; Medina, A. Exit as Care: How Motherhood Mediates Women’s Exodus From Violence in Mexico and Central America. Violence Against Women 2022, 28, 211–231. [Google Scholar] [CrossRef] [PubMed]
- Harikumar, S.; Mannam, V.; Mahanta, C.; Smitha, M.; Zaman, S. Interactive Map Using Data Visualization and Machine Learning. In Proceedings of the 2020 6th IEEE Congress on Information Science and Technology (CiSt), Essaouira, Morocco, 5–12 June 2021; pp. 104–109. [Google Scholar] [CrossRef]
- Talwalkar, S.; Crooks, C.L. Designing User-Centered Artificial Intelligence to Assist in Recovery from Domestic Abuse. In Proceedings of the 2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE), Las Vegas, NV, USA, 24–27 July 2023; pp. 375–377. [Google Scholar] [CrossRef]
- Rahman, R.; Khan, M.N.A.; Sara, S.S.; Rahman, M.A.; Khan, Z.I. A comparative study of machine learning algorithms for predicting domestic violence vulnerability in Liberian women. BMC Women’s Health 2023, 23, 542. [Google Scholar] [CrossRef] [PubMed]
- Stephanie, E.M.A.; Ruiz, L.G.B.; Vila, M.A.; Pegalajar, M.C. Study of violence against women and its characteristics through the application of text mining techniques. Int. J. Data Sci. Anal. 2024, 18, 35–48. [Google Scholar] [CrossRef]
- González, G.A.R.; Cantu-Ortiz, F.J. A Sentiment Analysis and Unsupervised Learning Approach to Digital Violence Against Women: Monterrey Case. In Proceedings of the 2021 4th International Conference on Information and Computer Technologies (ICICT), Kahului, HI, USA, 11–14 March 2021; pp. 18–26. [Google Scholar] [CrossRef]
- González-Prieto, A.; Brú, A.; Nuño, J.C.; González-Álvarez, J.L. Hybrid machine learning methods for risk assessment in gender-based crime. Knowl.-Based Syst. 2023, 260, 110130. [Google Scholar] [CrossRef]
- Campedelli, G.M. Explainable machine learning for predicting homicide clearance in the United States. J. Crim. Justice 2022, 79, 101898. [Google Scholar] [CrossRef]
- González, G.A.R.; Gabarrot, M.; Cantu-Ortiz, F.J. Understanding Violence Against Women in Digital Space from a Data Science Perspective: Full/Regular Research Papers-CSCI-ISNA. In Proceedings of the 2020 International Conference on Computational Science and Computational Intelligence (CSCI), Las Vegas, NV, USA, 16–18 December 2020; pp. 263–269. [Google Scholar] [CrossRef]
- Xu, X.; Ong, H.L.; Lai, P.; Ting, M.H.; Wong, W.M.; Chu, C.M. Understanding the Risk Factors of Spousal Violence Victimization Using Machine Learning and Network Approaches. J. Fam. Violence 2024, 39, 1581–1592. [Google Scholar] [CrossRef]
- Shifidi, P.P.; Stanley, C.; Azeta, A.A. Machine Learning-Based Analytical Process for Predicting the Occurrence of Gender-Based Violence. In Proceedings of the 2023 International Conference on Emerging Trends in Networks and Computer Communications (ETNCC), Windhoek, Namibia, 16–18 August 2023; pp. 1–8. [Google Scholar] [CrossRef]
- Yu, Y.; Liu, W. Ethnical Issues ofWomen’s Protection: Ending the Violent Crime Against Women in Indonesia. In Proceedings of the 2020 International Conference on Public Health and Data Science (ICPHDS), Guangzhou, China, 20–22 November 2020; pp. 60–63. [Google Scholar] [CrossRef]
- Lima, V.; de Oliveira, J.A. Identifying Risk Patterns in Brazilian Police Reports Preceding Femicides: A Long Short Term Memory (LSTM) Based Analysis. In Proceedings of the 2023 IEEE Global Humanitarian Technology Conference (GHTC), Radnor, PA, USA, 12–15 October 2023; pp. 144–150. [Google Scholar] [CrossRef]
- Forciniti, A.; Zavarrone, E. Data Quality and Violence Against Women: The Causes and Actors of Femicide. Soc. Indic. Res. 2024, 175, 1073–1097. [Google Scholar] [CrossRef]
- Rubén, D.; Uribe, R.; Sofía, M.; Huerta, M.; Patrocinio, I.E.; Burgos, C. 20 Informe Trimestral de Seguimiento a las Medidas de la AVG en Veracruz, Noviembre 2019–Junio 2020. Available online: https://www.gob.mx/cms/uploads/attachment/file/747590/20._Informe_AVGM_V.F._Nov_2019-Junio2020.pdf (accessed on 15 March 2024).
- Gong, Y.; Liu, G.; Xue, Y.; Li, R.; Meng, L. A survey on dataset quality in machine learning. Inf. Softw. Technol. 2023, 162, 107268. [Google Scholar] [CrossRef]
- Srivastava, S.; Shah, R.N.; Teodoriu, C.; Sharma, A. Impact of data quality on supervised machine learning: Case study on drilling vibrations. J. Pet. Sci. Eng. 2022, 219, 111058. [Google Scholar] [CrossRef]
- Schröer, C.; Kruse, F.; Gómez, J.M. A Systematic Literature Review on Applying CRISP-DM Process Model. Procedia Comput. Sci. 2021, 181, 526–534. [Google Scholar] [CrossRef]
- Lundén, N.; Bekar, E.T.; Skoogh, A.; Bokrantz, J. Domain Knowledge in CRISP-DM: An Application Case in Manufacturing. IFAC-PapersOnLine 2023, 56, 7603–7608. [Google Scholar] [CrossRef]
- Zhang, Z.; Chen, G.; Xu, Y.; Huang, L.; Zhang, C.; Xiao, S. FedDQA: A novel regularization-based deep learning method for data quality assessment in federated learning. Decis. Support Syst. 2024, 180, 114183. [Google Scholar] [CrossRef]
- INEGI. Subsistema de Información Demográfica y Social; INEGI: Aguascalientes, Mexico, 2024.
- Comisión Nacional para Prevenir y Erradicar la Violencia Contra las Mujeres. Centros de Justicia para las Mujeres|Comisión Nacional para Prevenir y Erradicar la Violencia Contra las Mujeres|Gobierno|gob.mx, 2021. Available online: https://www.gob.mx/conavim/acciones-y-programas/centros-de-justicia-para-las-mujeres (accessed on 13 May 2023).
- Choi, H.; Lee, S. Forensic analysis of SQL server transaction log in unallocated area of file system. Forensic Sci. Int. Digit. Investig. 2023, 46, 301605. [Google Scholar] [CrossRef]
- Dey, S.K.; Uddin, K.M.M.; Babu, H.M.H.; Rahman, M.M.; Howlader, A.; Uddin, K.A. Chi2-MI: A hybrid feature selection based machine learning approach in diagnosis of chronic kidney disease. Intell. Syst. Appl. 2022, 16, 200144. [Google Scholar] [CrossRef]
- Pinder, J.P. Chapter 9—Simulation fit and significance: Chi-square and ANOVA. In Introduction to Business Analytics Using Simulation, 2nd ed.; Pinder, J.P., Ed.; Academic Press: Cambridge, MA, USA, 2022; pp. 273–325. [Google Scholar] [CrossRef]
- Tugnait, J.K. Sparse-Group Lasso for Graph Learning From Multi-Attribute Data. IEEE Trans. Signal Process. 2021, 69, 1771–1786. [Google Scholar] [CrossRef]
- Chen, Z.; Ma, W.; Li, Y.; Guo, W.; Wang, S.; Zhang, W.; Chen, Y. Using machine learning to estimate the incidence rate of intimate partner violence. Sci. Rep. 2023, 13, 5533. [Google Scholar] [CrossRef] [PubMed]
- Wang, C.; Chang, L.; Liu, T. Predicting Student Performance in Online Learning Using a Highly Efficient Gradient Boosting Decision Tree. In Proceedings of the 12th IFIP TC 12 International Conference, IIP 2022, Qingdao, China, 27–30 May 2022; Shi, Z., Zucker, J.D., An, B., Eds.; Springer: Cham, Switzerland, 2022; pp. 508–521. [Google Scholar]
Indicator | Value |
---|---|
Number of women victimized by violence | 1,245,000 |
Average age | 32.7 years |
Standard deviation of age | 9.6 years |
Percentage that has suffered emotional violence | 45% |
Percentage that has suffered physical violence | 20% |
Percentage that has suffered sexual violence | 15% |
Percentage that has suffered economic violence | 30% |
Percentage that does not report the violence | 70% |
Field | Description | Data Type |
---|---|---|
Season | Date of the record | Categorical |
Marital Status | Marital situation of the victim | Categorical |
code_100 | Emergency code indicator | Binary |
Type_of_violence | Type of violence experienced | Categorical |
Modalities | Additional details about the violence | Categorical |
Zone | Area of the locality where the victim lives | Categorical |
Children | If they have children | Binary |
Ethnic_Group | Ethnic group affiliation | Binary |
Type_of_report | Classification of the injuries | Categorical |
R_offender | If they have a sentimental relationship with the victim | Binary |
Severity | Severity rating; this variable is mandatory for all registered cases | Binary |
tipo_v | edo_civil | Zona | … | t_violencia | Fecha |
---|---|---|---|---|---|
Violencia Familiar | 1 | Colonia Sanchez | … | Despojo | 30 January 2022 |
Violencia Familiar | 1 | Colonia Sanchez | … | Despojo | 30 January 2022 |
Violencia Familiar | 1 | colonia centro | … | Acoso y hostigamiento | 1 February 2022 |
Violación | 1 | cerro colorado | … | Violacion | 10 March 2022 |
Violación | 1 | cerro colorado | … | Violacion | 10 March 2022 |
Violencia Familiar | 0 | cerro colorado | … | Violacion | 10 March 2022 |
Violencia Familiar | 0 | cuadrilla | … | Pension alimenticia | 13 October 2023 |
Violencia Familiar | 0 | Colonia centro | … | Violencia familiar | 17 October 2023 |
… | |||||
Abuso sexual | 0 | Santa Maria Pipioltepec | … | Violencia familiar | 17 October 2023 |
Feature | Importance (%) |
---|---|
children_marital_status | 29.78 |
children | 22.24 |
month | 20.44 |
day_of_week | 13.15 |
emergency_code | 5.10 |
marital_status | 3.12 |
follow_up | 2.70 |
ethnic_group | 1.88 |
type_of_violence | 1.60 |
Metric | Class 0 (RF) | Class 1 (RF) | Class 0 (GBC) | Class 1 (GBC) |
---|---|---|---|---|
Precision | 1.00 | 0.96 | 1.0 | 0.98 |
Recall | 0.93 | 1.00 | 0.95 | 1.00 |
F1-Score | 0.96 | 0.98 | 0.97 | 0.98 |
Overall Accuracy | 0.97 | 0.89 | ||
ROC AUC | 0.9534 | 0.9891 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cruz-Mendoza, M.C.; Melendez-Armenta, R.A.; Canul-Reich, J.; Muñoz-Benítez, J. Machine Learning Applied to Improve Prevention of, Response to, and Understanding of Violence Against Women. Informatics 2025, 12, 40. https://doi.org/10.3390/informatics12020040
Cruz-Mendoza MC, Melendez-Armenta RA, Canul-Reich J, Muñoz-Benítez J. Machine Learning Applied to Improve Prevention of, Response to, and Understanding of Violence Against Women. Informatics. 2025; 12(2):40. https://doi.org/10.3390/informatics12020040
Chicago/Turabian StyleCruz-Mendoza, Mariana Carolyn, Roberto Angel Melendez-Armenta, Juana Canul-Reich, and Julio Muñoz-Benítez. 2025. "Machine Learning Applied to Improve Prevention of, Response to, and Understanding of Violence Against Women" Informatics 12, no. 2: 40. https://doi.org/10.3390/informatics12020040
APA StyleCruz-Mendoza, M. C., Melendez-Armenta, R. A., Canul-Reich, J., & Muñoz-Benítez, J. (2025). Machine Learning Applied to Improve Prevention of, Response to, and Understanding of Violence Against Women. Informatics, 12(2), 40. https://doi.org/10.3390/informatics12020040