You are currently viewing a new version of our website. To view the old version click .
Applied Sciences
  • Article
  • Open Access

14 November 2023

A Study of Reciprocal Job Recommendation for College Graduates Integrating Semantic Keyword Matching and Social Networking

,
and
1
Faculty of Management and Economics, Kunming University of Science and Technology, Kunming 650093, China
2
Business Administration, Southwest JiaoTong University, Chengdu 611700, China
*
Author to whom correspondence should be addressed.
This article belongs to the Special Issue Methods and Applications of Data Management and Analytics

Abstract

With the surge in college graduate numbers, a disparity has emerged where the supply of jobs falls short of demand, intensifying employment pressures annually. College graduates, due to their lack of historical employment data compared with job seekers in the broader society, encounter a ‘cold start’ issue in the job recommendation process. Additionally, the nature of job recommendations, which differs fundamentally from unilateral recommendations, requires consideration of reciprocity between both parties involved. This article introduces a new approach to job recommendations using college graduates as the object of study. In the screening stage, a semantic keyword iterative algorithm is applied to compute the similarity between the resume and recruitment texts. This algorithm enhances the intersectionality of keywords in the calculation process, maximizing the utilization of resume information to enhance the accuracy of text similarity calculations. The ranking phase utilizes in-school data to build a social network between college graduates and graduated students and solves the system’s cold-start problem using the social network to recommend jobs for college graduates where graduated students are employed. We introduce a dual-dimensional matching approach that incorporates both specialty and salary, building upon the amalgamated semantic keyword iterative algorithm and the social network job recommendation method, to enhance the reciprocity of job recommendations. The job recommendation method introduced herein outperforms other methods in terms of the average satisfaction rate (AR) and normalized discounted cumulative gain (NDCG), thereby confirming its superior ability to meet the job-seeking preferences of graduates and the recruitment criteria of employers. This job recommendation method offers effective assistance to graduates lacking employment experience and historical employment data, facilitating their search for more suitable job opportunities.

1. Introduction

The advancement of higher education has facilitated an expansion in student enrollment within colleges and universities, consequently leading to a yearly increase in the number of graduates. According to statistics, the number of college graduates in China for the year 2023 reached 11.58 million, which represents a year-on-year growth of 820,000. This substantial influx of new graduates has entered the job market, intensifying the competition for employment [,]. Concurrently, some graduates encounter challenges due to undefined employment goals, decision-making difficulties, and information overload [,]. These factors contribute to a decline in the overall employment rate. To assist graduates in securing employment, colleges offer career guidance; however, tailored recommendations for individual cases are often lacking []. Current job search platforms predominantly cater to social job seekers, while the exploration and implementation of personalized recommendations for graduates remain underdeveloped. Additionally, given the multitude of majors and diverse employment fields, providing tailored job recommendations for graduates becomes a more intricate challenge []. The following issues afflict the current job recommendation methodology: (a) The content-based approach heavily depends on semantic matching technology, while the current technology still faces limitations in understanding and extracting semantics. This results in poor accuracy when calculating the relevance of resumes to jobs. (b) Collaborative filtering-based methods rely on user behavioral data to model interest preferences, but the sparsity of user data for first-time graduates makes it challenging to accurately capture their interests. (c) The specific group of graduates differs from the richness of historical behavioral data that social users have, and the system faces the cold-start problem, thus limiting the effectiveness of the recommendation model.
In addressing the aforementioned issues, this study proposes an enhanced semantic matching algorithm. This algorithm uses keyword expansion and iterative computation to prevent the loss of semantic information when calculating the similarity between a resume and job text. Building upon this foundation and leveraging the social network relationships among graduates, personalized suggestions are deduced by relying on the recommendation results of similar users. This approach aims to address the cold-start problem in graduate recommendations and provide accurate and reliable recommendation results. The approach integrates information from both semantic and social network dimensions, with the goal of enhancing the performance of graduate job recommendation methods. Moreover, this article emphasizes professional matching and salary matching, aspects of considerable concern for both parties in job search and recruitment. It incorporates the calculation of dimensions related to “professional matching and salary matching,” thereby further fortifying the reciprocity of job recommendations.

3. Proposed Job Recommendation Method

The job recommendation model comprises three distinct phases, as illustrated in Figure 1. In the screening stage, a semantic keyword iterative algorithm calculates the text similarity between resumes and job postings, which is then used to filter the pool of potential jobs. The second stage is the ranking stage, in which entropy aggregation is conducted using three dimensions: professional similarity, salary matching, and social network. Based on the calculated values, the set of job recommendations is determined, and subsequently, the jobs are returned to college graduates for scoring. The final evaluation stage measures the satisfaction and reciprocity of the job recommendation results using metrics such as average satisfaction (AR) and normalized discounted cumulative gain (NDCG).
Figure 1. Job recommendation model framework.

3.1. Screening Phase

3.1.1. Text Information Extraction and Processing Filtering Stage

Examples of resume texts of college graduates and job recruitment texts are presented in Table 1 and Table 2, respectively. The resume text includes essential information, such as gender, hometown, graduation college, major, research direction, and job preferences. The job recruitment text encompasses details on job positions, workplaces, salary levels, and company nature (state-owned or private enterprises), as well as academic, skill, and professional requirements.
Table 1. Resume information of a logistics graduate.
Table 2. Job recruitment Information.
In this study, the Jieba precise mode was used to perform word segmentation on both the resume and job recruitment texts. To enhance the precision of word segmentation, specialized vocabulary related to a profession was incorporated. We removed punctuation marks and stop words from the texts. Then, Term Frequency-Inverse Document Frequency (TFIDF) values were calculated, and keywords were extracted based on these values.

3.1.2. The Semantic Keyword Iterative Algorithm

Quattrone et al. [] pointed out that traditional similarity calculation methods, such as cosine similarity, suffer from low utility when dealing with sparse data. Traditional methods rely solely on word frequency to calculate text similarity, making them susceptible to discarding keywords with low frequency, thus resulting in reduced calculation accuracy [,]. In light of the sparse data and lack of historical information for college graduates as new users of job recommendation systems, this study uses a semantic keyword iterative algorithm. This algorithm leverages a keyword correlation matrix to calculate text similarity, thereby countering the accuracy issues associated with low-frequency keywords. This approach enhances the accuracy of the job recommendation system, enabling it to provide more reliable recommendations to college graduates, despite their limited historical data.
The keyword correlation matrix is calculated using Equations (1)–(5). First, the TFIDF algorithm was used to calculate the ‘keyword-text’ matrix, referred to as the K D matrix. KD is an n k × n d matrix, where n k represents the number of keywords, n d represents the number of texts, and K D represents the number of times the keyword appears in the text; for example, K D i m represents the number of times the keyword k i appears in the text D m .
The keyword correlation matrix is calculated based on the KD matrix. The specific calculation process is as follows:
Step 1:
s k 0 k m ,     k n = θ m n ,     s d 0 d m ,     d n = δ m n
The first step is to define the initial similarity value of the keyword s k 0 k m , k n and the text s d 0 d m ,   d n . The principle of definition is that each keyword or text is the same as itself and has certain differences from other keywords and texts. s k 0 k m ,   k n represents the initial similarity value between keywords k m and k n , and s d 0 d m ,   d n represents the initial similarity value between text d m and d n .
Step n:
s k n k m , k n = S K K n ( k m ,   k n ) S K K n ( k m , k m ) · S K K n ( k n , k n )
s d n d m , d n = S D D n ( d m , d n ) S D D n ( d m , d m ) · S D D n ( d n , d n )
where s k n k m , k n represents the similarity value between keywords k m and k n in the nth iteration, and s d n d m , d n represents the similarity value between text d m and d n in the nth iteration. The definitions of S K K n k m , k n and S D D n d m , d n are given in Equations (4) and (5):
Where
S K K n k m , k n = i , j = 1 n d K D m i · φ i j · s d n 1 ( d i , d j ) · K D n j
S D D n d m , d n = i , j = 1 n k K D i m · φ i j · s k n 1 ( k i , k j ) · K D j n
The calculated value of S K K n k m , k n in step n must be incorporated into the text similarity calculation of s d n 1 ( d i , d j ) in step ( n 1 ) . The calculated value of S D D n d m , d n in step n must be incorporated into the keyword similarity calculation of s k n 1 ( k i , k j ) in step ( n 1 ) , and the iterative calculation is carried out in turn. K D m i , K D n j , K D i m , and K D j n in Equations (4) and (5) are elements of the K D matrix. φ i j is the mutual reinforcement factor. When keywords k i and k j represent the same document, the mutual reinforcement factor φ i j = 1 . In the other cases, the value range of φ i j is (0,1). The value of φ i j was experimentally obtained. In this study, the best result was obtained when φ i j = 0.4 .
To further address the issue of data sparsity, the resume text of graduates was expanded with additional keywords. Specifically, the two most similar keywords, excluding those already present in the resume text, were added to enhance their content. This expanded resume text was then used to match the job recruitment text. A keyword relevance matrix was obtained using the iterative process described above. Finally, the match degree (MD) between the graduates’ resume text and the job recruitment text was calculated using Equation (6).
M D s n ,     w m = i = 1 n k T F s n k i · T F w m k i · s i n ( k i )
where n k indicates the number of extended-resume document keywords. T F k s n ( i ) represents the frequency score of keyword i in the nth resume document, and T F k w m ( i ) represents the frequency score of keyword i in the m t h job document. sin ( i ) is a parameter used to determine whether the keyword in the resume document is the original keyword. When sin ( i ) = 1 , i is the original keyword in the document. When i is a rich keyword, the value of sin ( i ) is equal to the calculated value of the keyword correlation matrix. The candidate working set was obtained according to the similarity value calculated using the keyword correlation matrix.

3.2. Ranking Stage

In employment, the degree of professional alignment between job seekers and job positions is a crucial factor influencing job search outcomes. With the growing gap between job availability and the demand for skilled workers, employers are emphasizing professional compatibility between job roles and candidates []. Rational choice theory posits that individuals are rational decision-makers who weigh the costs and potential benefits when arriving at decisions []. Researchers, such as Jung et al. [], have used the potential Dirichlet distribution method online to extract themes from a substantial volume of employee comments, and their analysis has highlighted salary as a pivotal factor influencing job satisfaction. Hence, salary alignment emerges as a significant determinant affecting the satisfaction of job recommendations. Regular equivalence theory, a fundamental doctrine within communication theory, posits that an individual’s behavior is influenced by the information, attitudes, and actions of others within a network []. When two individuals share more common social circles, their behavioral preferences tend to exhibit greater similarity, fostering a propensity to establish connections []. Thus, this study applies social networks in recommending jobs to recent college graduates, drawing on the employment decisions of alumni to effectively enhance satisfaction and reciprocity in job recommendations.
In the ranking stage, this study used three critical dimensions: professional matching, salary matching, and social network, to perform entropy-based aggregation. The recommended job set is generated by calculating the entropy aggregation. Professional matching assesses the compatibility between college graduates’ skills and qualifications and the requirements specified in job postings. Salary matching evaluates the level of correspondence between the expected salary of college graduates and the actual salary offered by the employing companies. Social networks analyze the social connections and relationships between college graduates and other graduate students, taking into consideration potential networking opportunities or affiliations that may influence job recommendations.

3.2.1. Professional Matching

First, random walk probability was used to calculate the connection between graduated students and work, and then the professional matching score between the target and graduated students was calculated. We organized all the professional attributes of the job, job, and graduated students into a graphic G ( V , E ) . Professional job attributes include professional skills. In the figure, V p t represents the set of nodes of the graduated students, V w represents the set of working nodes, and V w a t t represents the set of professional attribute nodes of the work, that is, V = V p t + V w + V w a t t , as shown in Figure 2 and Figure 3.
Figure 2. Initial correlation model.
Figure 3. Correlation model.
Figure 2 depicts the initial correlation, which represents the graduated student node and its associated job node without any inserted edges. Starting with wandering, the edges connecting the node s p of the graduated student and the node w q of the signed job of this graduate are inserted, as illustrated in Figure 3. The next step is to randomly travel from the graduated student node s p and, according to the probability ∂, decide whether to continue to travel or stop this travel to return to the starting node s p . Based on the above principle, if every job and job specialty attribute node is visited, the probability will converge to a number, and then the edge of the graduated student node to the job node will be removed.
The algorithm terminates once every starting node s p has been visited; if not, the random walk proceeds from the initial graduated student node. In this study, the value of ∂ is set to 0.7 based on experimental findings, and the calculation formula is defined as follows:
P R r =     r , i n r P R r , o u t ( r , )                                       ( r s p ) 1 + r , i n r P R r , o u t ( r , )       ( r = s p )
where P R r denotes the probability of visiting node r , PR r ,   denotes the probability of r , being visited, and ∂ denotes the probability of random wandering. o u t ( r , ) denotes the set of nodes pointing from r , , and i n r   denotes the set of nodes pointing to node r . PR ( s p , w q ) denotes the probability that the start node s p visits job node w q . If w q is a job signed by the graduated student s p , then PR s p , w q = 1 .
The professional matching value between the graduate student s p and the graduate s p is calculated using Equation (8).
s i m s p ,   s p = t r p T R p t r p + T R p  
where s i m s p , s p represents the professional matching value between college graduates and graduate students. t r p denotes the set of specialized attributes of college graduate p , and T R p denotes the set of specialized attributes of graduated student s p . The wandering probability, starting with the graduate most similar to the college graduate, is considered the specialty match value. For example, if the match value between s 3 and s 2 is the highest, then the wandering probability with graduated students s 3 as the starting node is used as the professional match value for college graduates s 2 .

3.2.2. Salary Matching

In this study, salary matching is conducted using a two-dimensional Euclidean metric that considers the job’s average and maximum salary against the graduate’s desired entry and incentive salary. The average salary is the monthly pay received by an employee, while the maximum salary is the highest amount offered by the company. Entry-level salary refers to the minimum salary required for college graduates, and incentive salary refers to the maximum salary desired by college graduates. The job’s average and maximum salaries are viewed as a two-dimensional point α = ( x w , y w ) , where x w represents the average salary for the job, and y w represents the maximum salary for the job. Similarly, the graduate’s entry and incentive salaries are viewed as two-dimensional points β = ( x s , y s ) , where x s denotes the entry salary and y s denotes the incentive salary.
d i s α , β = ( x w x s ) 2 + ( y w y s ) 2 2
if : x w x s < 0 , d e f a u l t x w x s 2 = 0
if : y w y s < 0 , d e f a u l t y w y s 2 = 0
The value of x w x s defaults to zero when the average salary of the job is lower than the entry salary expected by college graduates. Similarly, the value of y w y s defaults to zero when the maximum salary of the job is lower than the incentive salary desired by college graduates. When choosing a job, college graduates look at the average and highest salary of the job with a bias, so when calculating the weight, δ is added to improve the accuracy of the calculation, and the value of δ ranges from 0 to 1. In this study, δ is experimentally set to 0.6, and is calculated as follows:
d i s α , β = ( x w x s ) 2 · δ + ( y w y s ) 2 · ( 1 δ ) 2

3.2.3. Social Relationship

Interactive behaviors such as contacting and interacting between college graduates and those who have graduated can constitute a social network structure. In this paper, social networks between college graduates and graduated students are quantified using the “common friend set” and “intersection group” metrics. The common friends set pertains to the friends shared by college graduates and graduated students from the same hometown and college. The common intersection group refers to individuals with the same major and tutor. To account for differences in economic and consumption levels across cities, grouping based on hometown helps minimize variations in the salary requirements of graduates. Moreover, if a college graduate and a graduated student share the same major and mentor, it indicates a high degree of similarity in research interests and, consequently, job search preferences. This study also accounts for the potential impact of the annual economic climate on employment by incorporating the timing of graduation into the job recommendation process.
< s p : f 1 , f 2 , f 3 > , < s p : f 1 , f 2 , f 3 , >
< s p : g 1 , g 2 , g 3 > , < s p : g 1 , g 2 , g 3 , >
c s p , s p = ( p = 1 n f x p y p + q = 1 n g x q y q ) · 1 T T
Equation (13) represents the social network common friend set of college graduate s p and graduated student s p . Equation (14) represents the common intersection group of college graduates s p and graduated students s p . p = 1 n f x p y p represents the summation of the number of common friends between college graduates and graduated students, and q = 1 n g x q y q represents the summation of the common intersection group of college graduates and graduated students. n f represents the total number of friends of the college graduates and graduated students. If f p is a friend of college graduate s p , x p equals one; otherwise, it equals zero. If f p is also a friend of a graduated student s p , y p equals 1; otherwise, it equals 0. T T denotes the time difference between graduation time T of college graduates and graduation time T   of graduated students.

3.2.4. Entropy Aggregation

Entropy aggregation is used to convert professional match values, salary match values, and social networks into a decision matrix, which is computed to produce a recommended job set. The calculation process is illustrated in Figure 4.
A = a 11 a 12 a 13 a 21 a 22 a 23 a n 1 a n 2 a n 3
where n in the above matrix represents a total of n jobs, and a i j represents the score value of the ith job in the jth column. There are three columns in the matrix: professional, salary, and social relationships. Decision matrix A is obtained using entropy aggregation. The decision matrix is normalized to obtain matrix B. Finally, the entropy values are obtained using Equations (17)–(19).
B = b 11 b 12 b 13 b 21 b 22 b 23 b n 1 b n 2 b n 3
E j = ( i = 1 n b i j ln b i j ) ln n
Figure 4. Ranking calculation process.
Let d j = 1 E j ( 1 j m ) , where m is the number of column dimensions of the matrix (there are three dimensions in this study: professional matching, salary matching, and social relationship), and the entropy weight of the jth column is obtained using the calculation of Equation (19).
w j = d j j = 1 m d j
Finally, the entropy weights of each column of the work attributes are summarized to obtain the final ranking score F R S i .
F R S i = j = 1 m w j b i j

3.3. Experimental Evaluation

3.3.1. Recommendation Method

To rigorously assess the efficacy of the job recommendation approach proposed in this study, it underwent evaluation and comparison with existing recommendation methods. The description of the recommended method is provided as follows:
TF-C []: This method employs cosine similarity to compute the resemblance between the text within a college graduate’s resume and the content of the job posting. The system then produces a list of job recommendations based on similarity scores for college graduates to evaluate. The rating scale ranges from 1 (unsatisfactory) to 5 (satisfactory), with a total score of 5.
MS []: In the screening stage, a semantic keyword-matching approach is used to evaluate the likeness between the content of the college graduate’s resume and the text of the job posting. Subsequently, a list of job recommendations is generated based on similarity and submitted to college graduates for scoring.
GERS []: Employment characteristics of graduates are analyzed using a two-dimensional matrix of graduate preferences. Subsequently, the system calculates the similarity between graduates to identify jobs chosen by neighboring graduates, generating a recommendation list.
RSCF []: The similarity between graduates’ resumes and job descriptions is integrated with a collaborative filtering algorithm based on graduates’ similarity to calculate a list of job recommendations. The recommended jobs are then provided to college graduates for scoring.
JRSR: Utilizing the job recommendation method introduced in this paper, the screening stage uses a semantic keyword iterative algorithm to compute the similarity between resume text and recruitment text. In the ranking stage, three dimensions—”professional matching, salary matching, and social network”—are used to generate a set of job recommendations using entropy aggregation. This collection is then returned to college graduates for scoring.
To assess the impact of each dimension of the job recommendation method (JRSR) on the computational method’s performance, this study disaggregates the system into its constituent components—semantic keyword matching, professional matching, salary matching, and social networking—and conducts an experimental evaluation of each. RSR omits the semantic keyword-matching method, relying solely on professional alignment, salary comparison, and social network inputs for its calculations. JSR excludes professional alignment, utilizing only semantic keyword matching, salary comparison, and social networks. JRR discards salary comparison, operating with semantic keyword matching, professional alignment, and social networks. In contrast, JRS disregards social networking elements, conducting analyses strictly with semantic keyword matching, professional alignment, and salary matching components. The aforementioned methods were individually applied to the dataset, which comprised 110 target graduates, 320 total graduates, and 280 work samples, to perform calculations, with the recommendation list set at a length of five.
To evaluate the impact of graduates’ historical employment information on recommendation performance, this study categorizes the employment data into two sets: one for logistics engineering and another for management and science work samples. Subsequently, the job recommendation method (JRSR) proposed in this study was applied separately to each work sample set, generating two sets of recommendation results. Finally, the performance of the job recommendation method is assessed and compared based on the evaluation results from both sets of work samples.

3.3.2. Evaluation of Indicators

In this study, the recommendation outcomes were evaluated using the average satisfaction rate (AR) and normalized discounted cumulative gain (NDCG). AR was used to assess the average satisfaction of college graduates with recommended jobs. NDCG was used to assess the reciprocity of job referral methods.
The average satisfaction rate (AR) indicator was calculated using the ratings of college graduates for recommended jobs. The calculation formula is as follows:
A R @ n = 1 U j = 1 U 1 n i = 1 n s j i
where s j ( i ) represents the rating (on a scale of 1–5) of graduate s j for job ranked i in the recommendation set. i denotes the ranking of the job in the recommendation set (the rankings are 1, 2, 3, 4, and 5; the value of i is an integer between 1 and 5). N denotes the number of jobs in the recommendation set (the number of recommended jobs in this study was five). U is the number of graduates.
The normalized discounted cumulative gain (NDCG) calculates the reciprocity of the recommendation results based on the ranking and rating of the job in the recommendation list. The calculation formula is as follows:
N D C G @ n = 1 U j = 1 U D C G j @ n m a x D C G j @ n
where
  D C G j @ n = i = 1 n [ ( 2 s j ( i ) 1 ) / l o g ( 1 + i )
m a x D C G j @ n = i = 1 n [ ( 2 s j ( i ) 1 ) / l o g ( 1 + e )
The i in the equation D C G j @ n represents the rank of the recommended job in the recommendation set; for example, job w 1 is ranked 3 in the recommendation set, and the value of i equals 3. The e in the equation m a x D C G j @ n is the ranking from highest to lowest based on the ratings of the college graduates for the recommended jobs. For example, job w 5 , where the target graduates rated the lowest and ranked 5; therefore, the value of e is 5. There are a total of five candidate jobs in the recommended list, so the rankings are a total of five places.

4. Analysis and Discussion of Experimental Results

4.1. Experimental Settings

Logistics engineering, management, and science represent a multidisciplinary domain that bridges the realms of management and technology, integrating engineering and scientific principles. It maintains strong connections with fields such as transportation engineering, industrial engineering, computer technology, mechanical engineering, environmental engineering, architecture, and civil engineering. Consequently, this study focuses on graduates in logistics engineering, management, and science, given their versatile qualifications and adaptability. The data on graduates’ resumes come from the personal contents of university graduates registered in the university employment program, and the data on job samples come from the information on jobs signed by graduated students at several universities. For the experimental data in this paper, the total number of target graduates is 110, the total number of graduated graduates is 320, and the total number of remaining work samples after screening out the same jobs is 280. All the above data are from the two majors of logistics engineering and management and science. To assess the generalizability of the recommended methods proposed in this paper, additional experimental data is incorporated for comparative evaluation. This paper includes employment data for students who graduated from five colleges in the last five years, encompassing a total of 1200 graduates and 1027 jobs across various majors under the management disciplines of business management, accounting, financial management, marketing, and tourism management.
During the screening stage, a semantic keyword iterative algorithm is used to assess the similarity between the resume text of college graduates and the job recruitment text. In the ranking stage, the top 110 jobs with the highest degree of similarity undergo entropy aggregation, using the three dimensions of “professional matching, salary matching, and social network” to generate a list of job recommendations. Finally, the jobs in the job recommendation list are returned to the college graduates for scoring from 1 (indicating dissatisfaction) to 5 (indicating satisfaction). This rating process serves as the means to evaluate the efficacy of the job recommendation method proposed in this paper.

4.2. Social Network Visualization of Experimental Data

The primary focus of this paper is to investigate the employment situation of students majoring in logistics engineering, management, and science. Therefore, the employment data related to these two majors, including 110 targeted graduates, 320 graduated students, and a total of 280 work samples, were analyzed for data visualization. Figure 5, Figure 6, Figure 7 and Figure 8 illustrate the percentage distribution of college graduates and graduated hometowns, secondary colleges, majors, and mentors, respectively. The horizontal axis represents the percentage of college graduates in the dataset, while the vertical axis represents the percentage of graduated students in the dataset.
Figure 5. Comparison of high school graduates and graduated students’ hometown datasets.
Figure 6. Comparison of high school graduates’ and graduated students’ secondary college datasets.
Figure 7. Comparison of high school graduates and graduated students’ major datasets.
Figure 8. Comparison of high school graduates and graduated students’ mentor datasets.
Figure 5, Figure 6, Figure 7 and Figure 8 illustrate the distribution of hometown, secondary college, major, and advisor proportions for college graduates and graduated students. The horizontal axis represents the percentage of college graduates’ hometowns within the dataset of college graduates, while the vertical axis represents the percentage of graduated students’ hometowns within the dataset of graduated students. The highest proportion of hometowns in the college graduate dataset is 3.6%, compared with 7% in the graduated student dataset. For the secondary college with the highest proportion in the college graduate dataset, it accounts for 20%, while in the graduated student dataset, it is 24%. The major with the highest proportion in the college graduate dataset is 17.5%, in contrast with 11% in the graduated student dataset. Additionally, the top advisor proportion in the college graduate dataset is 28%, while in the graduated student dataset, it is merely 1.18%. With meticulous data analysis, a notable and substantial social interconnection emerges between college graduates and graduated students, encompassing facets such as hometowns, secondary colleges, majors, and advisors. This interconnectedness serves as the bedrock for cultivating a closely woven social network, which in turn provides the fundamental groundwork for calculating probabilities in the context of random walks.
To analyze the social connections between college graduates and those who already graduated more clearly, this paper subdivides into four dimensions—hometown, secondary college, major, and mentor—to generate a social relationship network, depicted in Figure 9. The analysis demonstrates a strong social connection across the four dimensions of hometowns, secondary colleges, majors, and mentors between college graduates and graduated students. This emphasizes the viability of utilizing social networks to facilitate the recommendation of job opportunities endorsed by graduated students to college graduates.
Figure 9. Social network for college graduates and graduated students.

4.3. Result Analysis

This article uses five job recommendation methods (TF-C, MS, GERS, RSCF, JRSR) to compute the recommended work set and subsequently conducts an analysis of the outcomes utilizing evaluation metrics.

4.3.1. Evaluation Analysis of the Average Satisfaction Rate

Figure 10 presents a comparative chart illustrating the average satisfaction rates among the five job recommendation methods. The graph clearly indicates that JRSR attains a notably higher average satisfaction rate when contrasted with the remaining four job recommendation methods. Among these five approaches, TF-C demonstrates the lowest average satisfaction rate, registering at a mere 2.631. Meanwhile, the average satisfaction rates for the other three job recommendation methods, namely, MS, GERS, and RSCF, exhibit relatively close values. The job recommendation method introduced in this paper, JRSR, records an average satisfaction rate of 3.743. In comparison to the four benchmark methods, it showcases a maximum improvement of 1.112 and a minimum improvement of 0.502. This compellingly suggests that the job recommendation method presented in this study, JRSR, excels in discerning the job preferences of college graduates, ultimately resulting in more gratifying job recommendations.
Figure 10. Average satisfaction rate.
The experimental data in this paper were non-normally distributed. To test for a significant difference in the AR@5 assessment values of the JRSR, TF-C, MS, GERS, and RSCF methods, the Wilcoxon signed-rank test was used. As indicated in Table 3 (p < 0.05), there is a significant difference between the assessed values of JRSR and the other four methods.
Table 3. Wilcoxon signed rank test for AR@5.

4.3.2. Evaluation Analysis of Normalized Discounted Cumulative Gain

Figure 11 illustrates the results of normalized discounted cumulative gain (NDCG@5) for the five job recommendation methods. The graph clearly demonstrates that among these methods, JRSR attains the highest NDCG@5 score, reaching 0.938. Among the four benchmark methods, TF-C achieves the lowest NDCG@5 score at 0.675, while GERS reaches the highest score at 0.859. When compared with TF-C and GERS, the proposed JRSR method exhibits a remarkable improvement in NDCG@5, with enhancements of 38.96% and 9.19%, respectively. This emphasizes the effectiveness of the JRSR job recommendation method presented in this paper in aligning the preferences and requirements of college graduates with job opportunities.
Figure 11. Normalized discounted cumulative gain.
The experimental data were non-normally distributed. In this paper, the Wilcoxon signed-rank test was used to assess whether there was a significant difference in the NDCG@5 assessment values of the JRSR, TF-C, MS, GERS, and RSCF methods. As evident in Table 4, with p < 0.05, there is a significant difference between the assessment values of JRSR and the other four methods.
Table 4. Wilcoxon signed rank test for NDCG@5.

4.3.3. Comparative Analysis of the Recommendation Results under Different Job Sample Sets

To assess the impact of historical employment information on recommendation outcomes, this study categorizes the job placements of graduates into two groups: the “Logistics Engineering Job Sample Set” and the “Management and Science Job Sample Set”, as illustrated in Figure 12. The AR@5 evaluation score for the Logistics Engineering sample set is 3.728, surpassing that of the Management and Science sample set by 0.467. This indicates a stronger preference among college graduates for jobs closely aligned with their majors. In terms of NDCG@5, the Logistics Engineering sample set achieves a score of 0.971, while the Management and Science sample set scores 0.866, resulting in a 0.105 higher NDCG@5 score for the Logistics Engineering sample set. This demonstrates the potential for major alignment to enhance the mutual effectiveness of job recommendations.
Figure 12. Evaluation results of separate use of historical employment information for the logistics and management section.

4.3.4. Comparative Analysis of Evaluation Values in Different Dimensions

This article dissects each dimension of the job recommendation method (JRSR) to analyze its impact on the system’s recommendation performance. The dimensions are individually assessed to calculate the recommendation outcomes, which are then evaluated using the average satisfaction rate and the normalized discounted cumulative gain. Figure 13 reveals that RSR has the lowest average satisfaction rate and normalized discounted cumulative gain, suggesting that dimensionality reduction in the semantic keyword iteration algorithm adversely affects satisfaction and validity. This indicates a superior filtering capability of the semantic keyword iteration algorithm when fully utilized. The data indicate that JRR’s average satisfaction rate and normalized discounted cumulative gain are second to last and third to last, respectively, which suggests that salary is a critical concern for graduates during their job search. Conversely, JSR registers the highest values in both average satisfaction rate and normalized discounted cumulative gain, signifying that the alignment of graduates’ majors is not a predominant consideration in their employment pursuits.
Figure 13. Evaluated values of AR@5 and NDCG@5 for different dimensional approaches.

4.3.5. Comparative Analysis of Evaluated Values for Different Recommendation List Lengths

To assess the performance of job recommendation methods across different sample sizes and recommendation list lengths, this paper expands the experimental sample size and generates recommendation lists of different lengths. The experimental dataset used for this comparative analysis comprises 110 targeted graduates, 1200 graduated students, and 1027 job positions. It encompasses various specializations within the management disciplines, including business management, accounting, financial management, marketing, and tourism management.
By examining Figure 10, Figure 14, and Table 5, it is evident that the assessed values of AR@5 for the five recommended methods exhibited a decline with the augmentation of data. The most substantial decrease occurred in the case of JRSR, dropping from 3.743 to 3.462, which represents a 7.5% reduction. As the length of the recommendation lists increases for the five methods, their AR evaluations correspondingly decrease. In Figure 14, it is evident that the assessed value of AR@20 significantly exceeds that of AR@5, with the MS value-added reaching its peak at 0.301 and the TF-C value-added registering its lowest at 0.114. These findings indicate that both the size of the experimental data sample and the length of the recommendation list directly impact the recommendation satisfaction rate.
Figure 14. Analysis of AR evaluation values for different recommendation list lengths.
Table 5. AR assessment values for different recommended list lengths.
By examining Figure 11, Figure 15, and Table 6, it is apparent that as the sample size of the experimental data expands, the assessed values of NDCG@5 for the five recommended methods decrease. The most substantial decrease is observed in MS, where it is 0.089, while the smallest decrease is in RSCF, where it is 0.03. Moreover, with the increase in the length of the recommended list, there is a decrease in its NDCG assessment. The most significant decrease is observed for GERS at 12.81%, while the smallest decrease is for MS at 4.9%. These findings indicate that both the size of the experimental data sample and the length of the recommendation list directly impact the normalized discounted cumulative gain assessment value.
Figure 15. Analysis of NDCG evaluation values for different recommended list lengths.
Table 6. NDCG assessment values for different recommended list lengths.

4.4. Discussion

In Figure 10 and Figure 11, both the average satisfaction rate and the normalized discounted cumulative gain (NDCG) of the JRSR job recommendation method, as proposed in this paper, outperform the other four recommendation methods. This robustly underscores the effectiveness and superiority of the JRSR approach in job recommendation. In contrast with the TF-C and MS methods, JRSR uses a keyword iterative calculation approach to determine the similarity between the resume text of college graduates and the job recruitment text. This methodology not only mitigates the problem of neglecting low-frequency keywords but also enhances inter-keyword cross-referencing, resulting in a more robust semantic matching of text similarity. The job recommendation method GERS recommends job selections for graduates by calculating the similarity of employment characteristics among graduates and suggesting the choices of neighboring graduates as pre-recommended items []. This method is a collaborative filtering recommendation approach, which narrows down the recommendation scope. However, it simplifies the calculation of similarity in employment characteristics among graduates. In contrast, the JRSR recommendation method, when calculating the similarity in employment characteristics among graduates, establishes a social network based on academic data from graduates. This approach not only effectively computes the similarity in employment characteristics among graduates but also addresses the system cold-start problem caused by a lack of employment data for college graduates. Compared with the recommendation method RSCF, which combines content-based and collaborative filtering approaches [], the JRSR recommendation method adds dimensions of major matching and salary matching. By using multiple methods and dimensions, JRSR matches the preferences and job requirements of college graduates, thereby enhancing the reciprocity of job recommendations.
The empirical evaluation demonstrates that the job recommendation method introduced in this study exhibits superior performance relative to the benchmark method. Nonetheless, job recommendation for college graduates is a multifaceted and applied discipline, and this study acknowledges certain limitations. (a) While the semantic keyword iterative algorithm enhances the accuracy of text similarity calculations, its iterative nature results in time-consuming processing with large datasets. Future research should, therefore, focus more on improving the scalability of this computational method. (b) The dataset used in this study is not diverse, which limits the consideration of data variability. Consequently, future work will aim to increase the sample size to broaden the assessment indicators. (c) This paper focuses on the two most prevalent employment characteristics, major and salary; future studies should encompass a broader range of employment attributes. (d) This study does not utilize a range of career assessment tools to aid graduates in gaining a more objective and comprehensive understanding of their employment aspirations. Future research could incorporate ideological education to enhance graduates’ awareness of their career intentions, thereby improving the overall quality of job recommendations.

5. Conclusions

(a)
In response to the issue of low accuracy resulting from the omission of low-frequency keywords during the keyword matching calculation, this study used a text similarity calculation method based on a semantic keyword iteration algorithm during the filtering phase. This method effectively mitigated the problem of discarding low-frequency keywords, thereby enhancing calculation accuracy. This study addressed the issues of major structure matching and salary matching between college graduates and jobs, leading to a significant improvement in job recommendation satisfaction.
(b)
For the first time, this study introduces graduate social networks and historical employment information of past graduates. Jobs held by previous graduates are recommended to recent graduates, serving as a strategy to address the cold-start problem of the system. With the augmentation of both the sample size in the experimental data and the length of the job recommendation list, there is a concurrent decrease in the average satisfaction rate (AR) and the normalized discounted cumulative gain (NDCG) of the job recommendation method.
(c)
Evaluation based on the average satisfaction rate (AR) and normalized discounted cumulative gain (NDCG) metrics demonstrated that the job recommendation method for college graduates proposed in this study outperforms baseline recommendation methods in terms of recommendation performance.

Author Contributions

Conceptualization, J.Y., Y.X. and J.G.; methodology, J.Y. and Y.X.; software, J.Y.; validation, J.Y., J.G. and Y.X.; formal analysis, J.Y.; investigation, J.Y. and Y.X.; resources, J.Y. and J.G.; data curation, J.Y.; writing—original draft preparation, J.Y.; writing—review and editing, J.Y., J.G. and Y.X.; visualization, J.Y.; supervision, J.Y.; project administration, J.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This work received funding from the National Natural Science Foundation of China (Grant 71861019). This paper solely reflects the views of the authors. The Commission is not responsible for the contents of this paper or any use made thereof.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The datasets generated and analyzed during the current study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Mok, K.H.; Montgomery, C. Remaking higher education for the post-COVID-19 era: Critical reflections on marketization, internationalization and graduate employment. High. Educ. Q. 2021, 75, 373–380. [Google Scholar] [CrossRef]
  2. Zhao, Y.; He, F.; Feng, Y. Research on the Industrial Structure Upgrading Effect of the Employment Mobility of Graduates from China’s “Double First-Class” Colleges and Universities. Sustainability 2022, 14, 2353. [Google Scholar] [CrossRef]
  3. Liu, H.; Wang, Z. Research on Causes and Countermeasures for the Difference between Employment Expectation and Actual Employment of College Graduates. In Proceedings of the 2016 International Conference on Education, Bangkok, Thailand, 21–22 April 2016. [Google Scholar]
  4. Ntale, P.D.; Ssempebwa, J. Designing Organizations for Collaborative Relationships: The Amenability of Social Capital to Inter-Agency Collaboration in the Graduate Employment Context in Uganda. Empl. Responsib. Rights J. 2022, 34, 291–318. [Google Scholar] [CrossRef]
  5. Evans, S.; Huxley, P. Factors associated with the recruitment and retention of social workers in Wales: Employer and employee perspectives. Health Soc. Care Community 2019, 17, 254–266. [Google Scholar] [CrossRef] [PubMed]
  6. Rafter, R.; Bradley, K.; Smyth, B. Personalised Retrieval for Online Recruitment Services. In Proceedings of the BCS/IRSG 22nd Annual Colloquium on Information Retrieval (IRSG 2000), Cambridge, UK, 5–7 April 2000. [Google Scholar]
  7. Bansal, S.; Srivastava, A.; Arora, A. Topic Modeling Driven Content Based Jobs Recommendation Engine for Recruitment Industry. Procedia Comput. Sci. 2017, 122, 865–872. [Google Scholar] [CrossRef]
  8. Lacic, E.; Reiter-Haas, M.; Kowald, D.; Dareddy, M.R.; Lex, E. Using autoencoders for session-based job recommendations. User Model. User-Adapt. Interact. 2020, 30, 617–658. [Google Scholar] [CrossRef]
  9. Mostafa, L.; Beshir, S. Job Candidate Rank Approach Using Machine Learning Techniques. In Advanced Machine Learning Technologies and Applications: Proceedings of AMLTA 2021; Springer International Publishing: Cham, Switzerland, 2021; pp. 225–233. [Google Scholar]
  10. Reusens, M.; Lemahieu, W.; Baesens, B.; Sels, L. A note on explicit versus implicit information for job recommendation. Decis. Support Syst. 2017, 98, 26–35. [Google Scholar] [CrossRef]
  11. Rafter, R.; Bradley, K.; Smyth, B. Automated collaborative filtering applications for online recruitment services. In Proceedings of the Adaptive Hypermedia and Adaptive Web-Based Systems: International Conference, AH 2000, Trento, Italy, 28–30 August 2000; Proceedings 1. pp. 363–368. [Google Scholar]
  12. Reusens, M.; Lemahieu, W.; Baesens, B.; Sels, L. Evaluating recommendation and search in the labor market. Knowl.-Based Syst. 2018, 152, 62–69. [Google Scholar] [CrossRef]
  13. Zhang, Z.J.; Xu, G.W.; Zhang, P.F.; Wang, Y.K. Personalized recommendation algorithm for social networks based on comprehensive trust. Appl. Intell. 2017, 47, 659–669. [Google Scholar] [CrossRef]
  14. Liu, J.Q.; Fu, L.Y.; Wang, X.B.; Tang, F.L.; Chen, G.H. Joint Recommendations in Multilayer Mobile Social Networks. IEEE Trans. Mob. Comput. 2020, 19, 2358–2373. [Google Scholar] [CrossRef]
  15. Malherbe, E.; Diaby, M.; Cataldi, M.; Viennet, E.; Aufaure, M.A. Field Selection for Job Categorization and Recommendation to Social Network Users. In Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), Beijing, China, 17–20 August 2014; pp. 588–595. [Google Scholar]
  16. Chala, S.; Fathi, M. Job Seeker to Vacancy Matching using Social Network Analysis. In Proceedings of the IEEE International Conference on Industrial Technology (ICIT), Toronto, Canada, 22–25 March 2017; pp. 1250–1255. [Google Scholar]
  17. Urdaneta-Ponte, M.C.; Oleagordia-Ruiz, I.; Mendez-Zorrilla, A. Using LinkedIn Endorsements to Reinforce an Ontology and Machine Learning-Based Recommender System to Improve Professional Skills. Electronics 2022, 11, 1190. [Google Scholar] [CrossRef]
  18. Corbellini, A.; Mateos, C.; Godoy, D.; Zunino, A.; Schiaffino, S. An architecture and platform for developing distributed recommendation algorithms on large-scale social networks. J. Inf. Sci. Princ. Pract. 2015, 41, 686–704. [Google Scholar] [CrossRef]
  19. Zhao, R.; Shao, Z.; Zhang, W.; Zhang, J.; Wu, C. A multi-channel multi-tower GNN model for job transfer prediction based on academic social network. Appl. Soft Comput. 2023, 142, 110300. [Google Scholar] [CrossRef]
  20. Rivas, A.; Channoso, P.; Gonzalez-Briones, A.; Casado-Vara, R.; Manuel Corchado, J. Hybrid job offer recommender system in a social network. Expert Syst. 2019, 36, e12416. [Google Scholar] [CrossRef]
  21. Van Hoye, G.; Van Hooft, E.A.; Lievens, F. Networking as a job search behaviour: A social network perspective. J. Occup. Organ. Psychol. 2009, 82, 661–682. [Google Scholar] [CrossRef]
  22. Skeels, M.M.; Grudin, J. When social networks cross boundaries: A case study of workplace use of facebook and linkedin. In Proceedings of the ACM International Conference on Supporting Group Work, Sanibel, FL, USA, 10–13 May 2009; pp. 95–104. [Google Scholar]
  23. Zhitomirsky-Geffet, M.; Bratspiess, Y. Perceived Effectiveness of Social Networks for Job Search. Libri 2015, 65, 105–118. [Google Scholar] [CrossRef]
  24. Palank, J. Face it:’Book’no secret to employers. The Washington Times, 17 July 2006; pp. 1–2. [Google Scholar]
  25. Xia, P.; Liu, B.; Sun, Y.; Chen, C. Reciprocal Recommendation System for Online Dating. In Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, Paris, France, 25–28 August 2015; pp. 234–241. [Google Scholar]
  26. Liu, J.; Li, C.; Huang, Y.; Han, J. An intelligent medical guidance and recommendation model driven by patient-physician communication data. Front. Public Health 2023, 11, 1098206. [Google Scholar] [CrossRef]
  27. Ullah, Z.; Jamjoom, M. A smart secured framework for detecting and averting online recruitment fraud using ensemble machine learning techniques. PeerJ Comput. Sci. 2023, 9, e1234. [Google Scholar] [CrossRef]
  28. Jochen, M. Matching people and jobs: A bilateral recommendation approach. In Proceedings of the 39th Annual Hawaii International Conference on System Sciences (HICSS’06), IEEE Computer Society, Kauai, HI, USA, 4–7 January 2006. [Google Scholar]
  29. Huang, L. The Establishment of College Student Employment Guidance System Integrating Artificial Intelligence and Civic Education. Math. Probl. Eng. 2022, 2022, 3934381. [Google Scholar] [CrossRef]
  30. Jochen, M.; Weitzel, T.; Keim, T. Decision support for team staffing: An automated relational recommendation approach. Decis. Support Syst. 2008, 45, 429–447. [Google Scholar]
  31. Hong, W.; Zheng, S.; Wang, H.; Shi, J. A job recommender system based on user clustering. J. Comput. 2013, 8, 1960–1967. [Google Scholar] [CrossRef]
  32. Özcan, G.; Ögüdücü, S.G. Applying different classification techniques in reciprocal job recommender system for considering job candidate preferences. In Proceedings of the 2016 11th International Conference for Internet Technology and Secured Transactions (ICITST), Barcelona, Spain, 5–7 December 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 235–240. [Google Scholar]
  33. Zhou, Q.; Liao, F.L.; Chen, C.; Ge, L. Job recommendation algorithm for graduates based on personalized preference. CCF Trans. Pervasive Comput. Interact. 2019, 1, 260–274. [Google Scholar] [CrossRef]
  34. Li, S.; Chuancheng, Y.; Hongguo, W.; Yanhui, D. An Employment Recommendation Algorithm Based on Historical Information of College Graduates. In Proceedings of the 2018 9th International Conference on Information Technology in Medicine and Education (ITME), Hangzhou, China, 19–21 October 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 708–711. [Google Scholar]
  35. Shi, S.; Lv, H. A Framework of Graduate Employment Recommendation System and Key Technologies. In Proceedings of the 6th International Conference on Information Engineering for Mechanics & Materials, Huhhot, China, 30–31 July 2016; Atlantis Press: Amsterdam, Netherlands, 2016; pp. 169–174. [Google Scholar]
  36. Li, W. Research on personalised recommendation algorithm for college students’ employment. Appl. Math. Nonlinear Sci. 2022. [Google Scholar] [CrossRef]
  37. Assudani, P.J.; Kadu, R.K.; Sheikh, R.; Khanna, T. Smart College Campus Recruitment System. Int. J. Next-Gener. Comput. 2022, 13, 1280–1285. [Google Scholar]
  38. Quattrone, G.; Capra, L.; De Meo, P.; Ferrara, E.; Ursino, D. Effective retrieval of resources in folksonomies using a new tag similarity measure. In Proceedings of the 20th ACM International Conference on Information and Knowledge Management, Glasgow, UK, 24–28 October 2011; pp. 545–550. [Google Scholar]
  39. Zhang, M.; Ma, J.; Liu, Z.; Sun, J.; Silva, T. A research analytics framework-supported recommendation approach for supervisor selection. Br. J. Educ. Technol. 2016, 47, 403–420. [Google Scholar] [CrossRef]
  40. Scott, A.J. The Cultural Economy of Cities. Int. J. Urban Reg. Res. 2010, 21, 323–339. [Google Scholar] [CrossRef]
  41. Jung, Y.; Suh, Y. Mining the voice of employees: A text mining approach to identifying and analyzing job satisfaction factors from online employee reviews. Decis. Support Syst. 2019, 123, 113074. [Google Scholar] [CrossRef]
  42. Lou, T.; Tang, J.; Hopcroft, J.; Fang, Z.; Ding, X. Learning to predict reciprocity and triadic closure in social networks. ACM Trans. Knowl. Discov. Data (TKDD) 2013, 7, 1–25. [Google Scholar] [CrossRef]
  43. Ishitani, Y. Model-based information extraction method tolerant of OCR errors for document images. Int. J. Comput. Process. Orient. Lang. 2012, 15, 165–186. [Google Scholar] [CrossRef]
  44. Zhang, Y.; Yang, C.; Niu, Z. A Research of job recommendation system based on collaborative filtering. In Proceedings of the 2014 Seventh International Symposium on Computational Intelligence and Design, Rome, Italy, 16–18 October 2014; Volume 1, pp. 533–538. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.