A Personality-Informed Candidate Recommendation Framework for Recruitment Using MBTI Typology
Abstract
1. Introduction
- To make HR processes more efficient by introducing an automated system that handles personality assessments and evaluates how well candidates fit specific job roles.
- To save time in the recruitment process by replacing traditional, paper-based personality tests with a faster, fully digital solution that is easier to manage and scale.
- To build a two-part recruitment system that includes the following:
- ○
- A user-friendly platform for applicants, where they can take the MBTI personality test in a simple and accessible way.
- ○
- A web-based dashboard for HR professionals, where we introduced the ML (machine learning) model to analyze personality–job compatibility and candidate selection.
2. Literature Review
3. Methodology
3.1. System Objective
3.2. Data Collection and Description
- Nature of the Dataset:Each record includes the following:
- ○
- Personal identifiers (name, email, contact);
- ○
- Job position applied for;
- ○
- Skill set (free-text list);
- ○
- Years of professional experience;
- ○
- Academic qualification;
- ○
- MBTI personality type;
- ○
- Soft skills;
- ○
- CV file path.
- Extent and Characteristics:Qualification Distribution:
- ○
- Bachelor’s: 208;
- ○
- High School: 207;
- ○
- PhD: 202;
- ○
- Master’s: 183.
Experience Range:- ○
- Minimum: 0 years;
- ○
- Maximum: 20 years;
- ○
- Mean: 10.33 years;
- ○
- Standard Deviation: 6.01 years.
MBTI Type Distribution:- ○
- Most common types: ENTP (72), ESTJ (65), ISTP (59);
- ○
- All 16 MBTI types are represented, ensuring diversity in personality profiles.
Job Title Distribution:- ○
- The most common roles were Project Manager (86), Software Developer (83), and Data Scientist (80);
- ○
- Other roles include Software Engineer, Social Media Manager, Marketing Specialist, and HR Manager.
3.2.1. Expert-Informed Personality–Job Role Mapping
3.2.2. Subject Matter Expert Consultation
3.2.3. Mapping Process
- Introduction of Job Roles: The HR professionals were presented with a comprehensive list of specific job designations relevant to modern industries, including Data Analyst, Digital Marketing Specialist, Social Media Marketing Specialist, Software Developer, HR Manager, and Business Developer.
- MBTI Framework Discussion: A brief overview of the 16 MBTI personality types was provided to ensure shared understanding of the framework’s dimensions (Extraversion/Introversion, Sensing/Intuition, Thinking/Feeling, and Judging/Perceiving) and their behavioral indicators.
- These professionals used the principles of psychological profiling and applied them together with their hands-on experience with recruitment to match MBTI types to employment opportunities. The criteria used included the following:
- ○
- Behavioral Traits and Role Demands: Experts took the behavioral requirements of each job position, including those of a leader, an analyst, a group member, or a flexible person, and compared them with MBTI personality traits. Experts matched MBTI traits with the cognitive demands and interpersonal dynamics of each role. For example, ISTJ types known for their reliability and attention to detail were mapped to roles like Software Engineer and Data Analyst, which require precision and structure.
- ○
- Performance Benchmarking: The personality traits of employees who performed highly in certain positions were analyzed by professionals and compared to those who performed at an average level. The given comparative analysis aided in determining the MBTI types that were always associated with success in the given roles.
- ○
- Cognitive and Interpersonal Fit: Cognitive and interpersonal preferences (e.g., Thinking vs. Feeling) were also considered in the mapping to guarantee compatibility with the team dynamics and organizational culture.
- ○
- Industry-Specific Situation: Since this study concerned a South Asian context, the focus of the experts was on the compatibility of the cultures, styles of communication, and hierarchical workplace settings. In the case of the ISFJ and ESFJ types, these types were preferred for HR activities because of their empathetic and organized characteristics, which fits the collectivist cultural requirements.
3.2.4. MBTI Determination Process
- ○
- Method Used: A standard MBTI questionnaire was administered digitally to each applicant. This was based on widely accepted MBTI inventories aligned with the original framework by Isabel Briggs Myers and Katharine Cook Briggs.
- ○
- Human vs. Automated: The MBTI classification was entirely automated through the digital questionnaire. No manual review or inference from CVs was performed to determine personality types.
- ○
- CV Role: CVs were used solely for evaluating technical qualifications, experience, and skills. They were not involved in personality profiling.
3.2.5. Error Margins and Reliability
- ○
- Instrument Validity: The MBTI questionnaire used has been validated in prior studies and is widely adopted in organizational psychology. While MBTI has known limitations in psychometric robustness, its structured typology and widespread acceptance in business contexts made it suitable for our recruitment-focused application.
- ○
- Error Considerations: Since MBTI types were self-reported, the primary source of error would stem from response bias or misinterpretation of questions. To mitigate this, no inference from CVs was made, avoiding the risk of misclassification due to engineered or biased resume content.
3.3. Machine Learning Model Selection
3.4. Data Preprocessing
- Column Standardization: Column names in Applicant_data.csv were standardized to Job_Title (from position) and MBTI_Type (from mbti_type) to ensure consistency across datasets.
- Job–MBTI Suitability Map Creation: The enhanced_job_mbti_dataset.csv was processed to create a job_mbti_map. This map is a dictionary (specifically, a defaultdict), where keys are Job_Titles and values are lists of MBTI_Types deemed suitable for that job. This mapping is crucial for generating the target variable for the suitability prediction model.
- Feature and Target Variable Generation: For each applicant in the Applicant_data.csv, a set of features (X) and a binary suitability label (y) were generated for every Job_Title present in the job_mbti_map.
- ○
- Features (X): The input features (X) utilized for training the suitability prediction model were systematically derived from the applicants’ profiles to ensure consistency and relevance. These features include experience, represented as a numerical value indicating the number of years of professional experience; in cases where non-numeric values were encountered, a default value of zero was assigned. The skill count feature captures the total number of distinct skills listed by the applicant, serving as a proxy for technical breadth. Two binary indicators—isphd and isms—were introduced to reflect the presence of advanced academic qualifications, with values set to 1 if the applicant’s qualification field contained the terms “PHD” or “MS,” and 0 otherwise. Lastly, the applicantmbti feature, denoting the applicant’s MBTI personality type, was encoded into a numerical format to facilitate its integration into the machine learning model. Collectively, these features form a structured representation of each applicant’s profile, enabling the model to make informed predictions regarding job suitability; details are mentioned in Figure 2.
- ○
- Target Variable (y): A binary label indicating suitability was assigned to the applicant’s type—1 if the applicant’s MBTI_Type was present in the suitable_mbti_list for a given Job_Title (from job_mbti_map), and 0 otherwise. This transformation resulted in a significantly expanded dataset for model training, with each row representing an applicant–job suitability instance.
- MBTI Encoding: The categorical MBTI_Type feature (both for applicants and for the target variable generation) was converted into numerical labels using sklearn.preprocessing.LabelEncoder.
- Data Balancing (SMOTE): Given that the “suitable” (1) class is typically a minority class compared to “not suitable” (0) combinations, the Synthetic Minority Over-sampling Technique (SMOTE) from the imblearn library was applied to the training data. This technique oversamples the minority class by creating synthetic samples, thereby addressing potential class imbalance and preventing the model from being biased towards the majority class.
- Train–Test Split: To ensure a robust and unbiased evaluation of the model’s performance, K-fold cross-validation was used. The preprocessed and balanced dataset was divided into k distinct, non-overlapping folds. The model was then trained and validated k times. In each iteration, 1 fold was designated as the testing set, while the remaining k − 1 folds were combined to form the training set. To mitigate class imbalance and ensure each fold was representative of the full dataset, stratified sampling was applied during the fold creation process. The final performance metrics were calculated by averaging the results across all k iterations, providing a more reliable measure of the model’s generalization capability.
3.5. Model Selection and Training
3.6. Model Evaluation
- Overall Accuracy: measures the proportion of correctly classified instances out of the total.
- Confusion Matrix: a 2 × 2 matrix visualizing the performance of the binary classification model, showing True Positives (TPs), True Negatives (TNs), False Positives (FPs), and False Negatives (FNs). A heatmap visualization was used for clarity.
- Classification Report: The classification report serves as a comprehensive evaluation of the model’s performance by presenting detailed metrics for each class, namely, “Not Suitable” (0) and “Suitable” (1). It includes precision, which quantifies the proportion of correctly identified positive instances among all instances predicted as positive, thereby reflecting the model’s accuracy in positive classification. Recall measures the proportion of actual positive instances that were correctly identified by the model, indicating its effectiveness in capturing relevant cases. The F1-score, calculated as the harmonic mean of precision and recall, offers a balanced metric that accounts for both false positives and false negatives. Lastly, support denotes the number of actual occurrences of each class within the test dataset, providing context for the reliability of the metrics. Collectively, these measures enable a nuanced understanding of the model’s predictive capabilities across different outcome categories.
- Receiver Operating Characteristic (ROC) Curve and Area Under the Curve (AUC): The ROC curve plots the True Positive Rate (TPR) against the False Positive Rate (FPR) at various threshold settings. The AUC provides an aggregate measure of performance across all possible classification thresholds, indicating the model’s ability to discriminate between suitable and not suitable classes.
- Precision–Recall Curve and AUC: This curve plots precision against recall at various threshold settings. It is particularly informative for imbalanced datasets, offering a more nuanced view of the trade-off between identifying positive instances and avoiding false positives. Details are depicted in Figure 3.
3.7. Applicant Recommendation Logic
3.8. Tools and Technologies
4. Analysis and Results
4.1. Confusion Matrix Analysis
4.2. Classification Report
4.3. ROC Curve and AUC
4.4. Precision–Recall Curve and AUC
4.5. Overall Feature Importance Using SHAP
- ○
- Based on the plot, the mbti_encoded feature has the highest overall importance. This is a critical finding, as it provides quantitative evidence that the core premise of research—that MBTI personality types are a powerful predictor of job fit—is supported by the model’s behavior.
- ○
- The second most important feature is skill, which aligns with conventional wisdom in recruitment. This confirms that the model also leverages traditional metrics in its decision-making process.
- ○
- Other features, such as experience, is_phd, and is_ms, also contribute to the prediction but to a lesser degree.
- ○
- MBTI Type (mbti_encoded): shows the highest variability in SHAP values, indicating that personality type significantly influences suitability predictions.
- ○
- Skill Count: higher values (red) tend to push predictions positively, suggesting that applicants with more skills are more likely to be deemed suitable.
- ○
- PhD Qualification (is_phd): generally contributes negatively to predictions, possibly due to overqualification or mismatch with job expectations.
- ○
- Experience: has mixed effects, with both high and low values influencing predictions in different directions.
- ○
- Master’s Qualification (is_ms): shows a relatively neutral to slightly positive impact.
4.6. Use Case: Software Developer Recommendation
5. Discussion
6. Conclusions
7. Future Work and Ethical Considerations
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
MBTI | Myers–Briggs Type Indicator |
ROC | Receiver Operating Characteristic |
NLP | Natural Language Processing |
RF | Random Forest |
AI | Artificial Intelligence |
TPs | True Positives |
TNs | True Negatives |
FPs | False Positives |
FNs | False Negatives |
References
- Shahzad, M.A.; Jianguo, D.; Junaid, M. Impact of green HRM practices on sustainable performance: Mediating role of green innovation, green culture, green employees’ behavior. Environ. Sci. Pollut. Res. 2023, 30, 88524–88547. [Google Scholar] [CrossRef]
- Shah, A.; Pati, R.; Pimplikar, A.; Puthran, S.; Singh, A. Review of Approaches Towards Building AI Based Career Recommender & Guidance Systems. ScienceOpen 2024. preprints. [Google Scholar] [CrossRef]
- Choubey, A.P.; Agrawal, A.K. Recruit to rise and retain: The CORE–2π framework for successful and sustainable talent acquisition. Organ. Dyn. 2025, 54, 101160. [Google Scholar] [CrossRef]
- Ling, F.Y.Y.; Sim, W.X.L. Mediating Role of Vocational Interest in the Relationship between Personality and Job Satisfaction: Insights from Cost Engineers. J. Constr. Eng. Manag. 2025, 151, 04025057. [Google Scholar] [CrossRef]
- Mohammad, A.J. Chrono-Behavioral Fingerprinting for Workforce Optimization. Int. J. AI BigData Comput. Manag. Stud. 2024, 5, 91–101. [Google Scholar] [CrossRef]
- Nirmala, M.; Rajalakshmi, B.; Sarvepally, S.R.M.; Mandava, S.; Megana, M. Personality Detection for Recruitment Using Machine Learning. In Proceedings of the 6th International Conference on Communications and Cyber Physical Engineering, Hyderabad, India, 28–29 April 2023; Kumar, A., Mozar, S., Eds.; Lecture Notes in Electrical Engineering Series; Springer: Singapore, 2024; Volume 1096, pp. 399–406. [Google Scholar] [CrossRef]
- Naz, A.; Khan, H.U.; Bukhari, A.; Alshemaimri, B.; Daud, A.; Ramzan, M. Machine and deep learning for personality traits detection: A comprehensive survey and open research challenges. Artif. Intell. Rev. 2025, 58, 239. [Google Scholar] [CrossRef]
- Anusha, M.; Yoshitha, C.R.; Namratha, D.S.; Manoj, B.S.; Kausalya Nandan, T.P.; Raju, P.S. Text-Based Personality Assessment Using Deep Learning Techniques. In Proceedings of the International Conference on Emerging Technologies in Computer Science for Interdisciplinary Applications (ICETCS 2024), Bengaluru, India, 22–23 April 2024. [Google Scholar] [CrossRef]
- Sergi, M.R.; Picconi, L.; Aleffi, C.; Tommasi, M.; Lee, H.; Shin, Y. A Study on MBTI Perceptions in South Korea: Big Data Analysis from the Perspective of Applying MBTI to Contribute to the Sustainable Growth of Communities. Sustainability 2024, 16, 4152. [Google Scholar] [CrossRef]
- Park, J.S.Y. Personality as technology of self: MBTI and English language learning in South Korea. Int. J. Soc. Lang. 2024, 285, 35–54. [Google Scholar] [CrossRef]
- Agarwal, N.; Justina; Akshaya, M.J.; Shetty, N.; Kumar, S. Machine Learning Driven Personality Prediction System using the concept of CV Analysis. In Proceedings of the 2024 International Conference on Electrical Electronics and Computing Technologies (ICEECT), Greater Noida, India, 29–31 August 2024; pp. 1–6. [Google Scholar] [CrossRef]
- Tursunbayeva, A. Augmenting Human Resource Management with Artificial Intelligence: Towards an Inclusive, Sustainable, Responsible Future; Contributions to Management Science Series; Part F3983; Springer: Cham, Switzerland, 2024; pp. 1–148. [Google Scholar] [CrossRef]
- Pramanik, N.; Das, P.; Roy, R.; Bal, S. Job Recommendation System Based on Candidate Skills. In Proceedings of the 2024 3rd International Conference on Smart Technologies and Systems for Next Generation Computing (ICSTSN), Villupuram, India, 18–19 July 2024. [Google Scholar] [CrossRef]
- Yuan, C.; Hong, Y.; Wu, J. Who Are You Meant to Be? Predicting Psychological Indicators and Occupations based on Personality Traits. J. Syst. Sci. Syst. Eng. 2023, 32, 571–602. [Google Scholar] [CrossRef]
- Sirasapalli, J.J.; Malla, R.M. A deep learning approach to text-based personality prediction using multiple data sources mapping. Neural Comput. Appl. 2023, 35, 20619–20630. [Google Scholar] [CrossRef]
- Shaheen, N.; Hussain, A.; Ahmad, N.; Jan, A.S.; Khan, N.U. The Entrepreneurial Mindset: Leveraging MBTI to Identify and Nurture Entrepreneurial Potential. Dialogue Social Sci. Rev. 2025, 3, 433–443. Available online: https://dialoguessr.com/index.php/2/article/view/625 (accessed on 26 July 2025).
- Cambria, E. Pragmatics Processing. In Understanding Natural Language Understanding; Springer: Cham, Switzerland, 2025; pp. 229–338. [Google Scholar] [CrossRef]
- Perera, H.; Costa, L. MBTI Personality Classification through Analyzing CVs/Personal Statements for e-Recruitment. Int. J. Mach. Learn. 2024, 14, 1153. [Google Scholar] [CrossRef]
- Bousalem, S.; Benchikha, F.; Marir, N. Personalized Learning Through MBTI Prediction: A Deep Learning Approach Integrated with Learner Profile Ontology. IEEE Access 2025, 13, 30861–30873. [Google Scholar] [CrossRef]
- An, B.; Park, J.; Shin, D.; Kim, H.L.; Lee, H. Measuring Organizational Character: Leveraging Sentence-BERT and MBTI. SSRN 2024. [Google Scholar] [CrossRef]
- Troyer, K.; Jackson, R.; Wilson, R.; Besson, L. Introducing Personality Psychology in Business: Insights & Implications from a Textual Analysis. Open J. Bus. Manag. 2024, 12, 1845–1871. [Google Scholar] [CrossRef]
- Erford, B.T.; Zhang, X.; Sweeting, E.L.; Russo, M.; Rashid, A.; Sherman, M.F.; Bradford, E.L.; Wang, X.; Gao, A.; Huang, X.; et al. A 25-Year Review and Psychometric Synthesis of the Myers–Briggs Type Indicator (MBTI)—Form M. J. Couns. Dev. 2025, 103, 403–417. [Google Scholar] [CrossRef]
- Elmahalawy, A.R.; Li, L.; Wu, X.; Tao, X.; Yong, J. Computational Personality Analysis with Interpretability Empowered Prediction. In Proceedings of the 2024 27th International Conference on Computer Supported Cooperative Work in Design (CSCWD), Tianjin, China, 8–10 May 2024. [Google Scholar]
- Braz, L.F.; Sichman, J.S. Using the Myers-Briggs Type Indicator (MBTI) for Modeling Multiagent Systems. Rev. Inform. Teórica Apl. 2022, 29, 42–53. [Google Scholar] [CrossRef]
- Albassam, W.A.A. The Power of Artificial Intelligence in Recruitment: An Analytical Review of Current AI-Based Recruitment Strategies. Int. J. Prof. Bus. Rev. 2023, 8, e02089. [Google Scholar] [CrossRef]
- Pradhan, A.; Year, M.; Moser, R. Employee–Job Profile Matching Challenges: AI/ML-Based Recommender Systems Solutions for Knowledge Workers. Master’s Thesis, Macquarie University, Macquarie Park, NSW, Australia, March 2024. [Google Scholar] [CrossRef]
- Ajjam, M.-H.; Al-Raweshidy, H. AI-Driven Semantic Similarity-Based Job Matching Framework for Recruitment Systems. Inf. Sci. 2025, 724, 122728. [Google Scholar] [CrossRef]
- Goo, C.L.; Leow, M.C.; Ong, L.Y. Analyzing Course Selection by MBTI Personality Types. JOIV Int. J. Inform. Vis. 2025, 9, 29–37. [Google Scholar] [CrossRef]
- Kamble, P.; Kulkarni, U. An Innovative Approach of Personality Recognition for E-Recruitment. In Proceedings of the 2022 5th International Conference on Advances in Science and Technology (ICAST), Mumbai, India, 2–3 December 2022; pp. 324–328. [Google Scholar] [CrossRef]
- Haldar, S. The future of work: A symbiotic balance of technology and interpersonal skills. Strateg. HR Rev. 2024, 23, 217–220. [Google Scholar] [CrossRef]
- Thapa, L.; Pandey, A.; Gupta, D.; Deep, A.; Garg, R. A Framework for Personality Prediction for E-Recruitment Using Machine Learning Algorithms. In Proceedings of the 14th International Conference on Cloud Computing, Data Science and Engineering, Confluence 2024, Noida, India, 18–19 January 2024; pp. 1–5. [Google Scholar] [CrossRef]
- Akshaya, J.; Sharma, A.; Rathod, Y.; Sridhar, S. Real time personality assessment via Big Five Traits. Res. Sq. 2025. [Google Scholar] [CrossRef]
- Sudha, G.; Sasipriya, K.K.; Nivethitha, D.; Saranya, S. Personality Prediction Through CV Analysis using Machine Learning Algorithms for Automated E-Recruitment Process. In Proceedings of the 2021 4th International Conference on Computing and Communications Technologies (ICCCT), Chennai, India, 16–17 December 2021; pp. 617–622. [Google Scholar] [CrossRef]
Job_Title | MBTI_Types |
---|---|
Data Scientist | INTJ, INTP, ISTJ |
Software Engineer | ISTJ, INTJ, ESTJ, ISTP |
Marketing Specialist | ENFP, ENTP, ESFP |
HR Manager | INFJ, ENFJ, ISFJ |
Project Manager | ENTJ, ESTJ, INTJ |
Finance Manager | ESTJ, ISTJ, ENTJ |
Business Developer | ENTP, ENFP, ESTP |
Data Analyst | INTP, ISTJ, INTJ |
Digital Marketing Specialist | ENFJ, ENFP, ESFJ |
Social Media Manager | ESFP, ENFP, ISFP |
Software Developer | ISTP, ISTJ, INTP |
Precision | Recall | F1-Score | |
---|---|---|---|
Not Suitable (0) | 0.746762 | 0.762950 | 0.754770 |
Suitable (1) | 0.757698 | 0.741273 | 0.749395 |
Accuracy | 0.752111 | 0.752111 | 0.752111 |
Macro avg | 0.752230 | 0.752111 | 0.752082 |
Weight avg | 0.752230 | 0.752111 | 0.752082 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Khan, H.W.; Sattar, M.U.; Noor, S.; Alyousef, M.I. A Personality-Informed Candidate Recommendation Framework for Recruitment Using MBTI Typology. Information 2025, 16, 863. https://doi.org/10.3390/info16100863
Khan HW, Sattar MU, Noor S, Alyousef MI. A Personality-Informed Candidate Recommendation Framework for Recruitment Using MBTI Typology. Information. 2025; 16(10):863. https://doi.org/10.3390/info16100863
Chicago/Turabian StyleKhan, Hamza Wazir, Mian Usman Sattar, Samreen Noor, and Muna I. Alyousef. 2025. "A Personality-Informed Candidate Recommendation Framework for Recruitment Using MBTI Typology" Information 16, no. 10: 863. https://doi.org/10.3390/info16100863
APA StyleKhan, H. W., Sattar, M. U., Noor, S., & Alyousef, M. I. (2025). A Personality-Informed Candidate Recommendation Framework for Recruitment Using MBTI Typology. Information, 16(10), 863. https://doi.org/10.3390/info16100863