Next Article in Journal
Forecasting Half-Hourly Electricity Prices Using a Mixed-Frequency Structural VAR Framework
Previous Article in Journal
Dynamic Factor Models and Fractional Integration—With an Application to US Real Economic Activity
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Relationship Between Coefficients in Parametric Survival Models for Exponentially Distributed Survival Time—Registered Unemployment in Poland

by
Beata Bieszk-Stolorz
Institute of Economics and Finance, University of Szczecin, 71-101 Szczecin, Poland
Econometrics 2025, 13(1), 1; https://doi.org/10.3390/econometrics13010001
Submission received: 24 September 2024 / Revised: 18 December 2024 / Accepted: 24 December 2024 / Published: 2 January 2025

Abstract

:
Survival analysis is a popular research tool in medicine and demography. It has been used for many years to study the duration of socio-economic phenomena. The aim of this article is to evaluate the relationship between the coefficients of the proportional hazards model (PH) and the accelerated failure time model (AFT), assuming an exponential distribution of survival time. The coefficients of the PH and AFT exponential models have the same magnitude but have opposite signs. It follows that there is a symmetric relation between the coefficients. In the case of exponential PH and AFT models, there is a relation of equality between the parameters describing the quality and fit of the model, as well as between the standard errors of the parameters of both models. In this case also, we can talk about a symmetric relation. The exponential PH model is valid if the exponential AFT model is valid. The study showed that the intensity of starting work was higher in the case of men, people with work experience, people with higher education and young people. The job search time was longer for women, people with no work experience, and people aged 60+, but shorter for people with higher education.

1. Introduction

The genesis of survival analysis methods is related to studies of equipment reliability (the duration of operation of machinery and equipment) and demographic studies (the duration of human life). For many years, survival analysis has also been used to study the duration of socio-economic phenomena (Wycinka, 2019; Urbańczyk, 2020; Bieszk-Stolorz & Dmytrów, 2021; Putek-Szeląg & Gdakowicz, 2021; Calabuig et al., 2021), including the study of the duration of unemployment (Nickell, 1979; Moffitt, 1985; Addison & Portugal, 1998; Lalive, 2008; Bieszk-Stolorz, 2017; Doğan, 2019; Bieszk-Stolorz & Markowicz, 2022; Grzenda, 2023). One of the most frequently used models for duration analysis is the proportional hazards (PH) model. However, this model may be inadequate in many situations. An accelerated failure time model (AFT) may provide a better description of the phenomenon. The PH model and its various generalisations are mainly used in medical and biostatistical fields, while the alternative class of regression models, the AFT model, is mainly used in industrial experiments and reliability theory (Orbe et al., 2002). Some authors emphasise that in many situations AFT models better describe and predict the analysed phenomenon (Basha & Gjika, 2022). When proportional hazard assumptions are violated, the AFT model is the appropriate method to apply (Doğan, 2020).
There are many articles in the literature on the comparison of the two models. Often these comparisons involve the Cox proportional hazards model and the AFT model with an arbitrary duration distribution (Orbe et al., 2002; Patel et al., 2006; Faruk, 2018; Khanal et al., 2019; Khamis et al., 2020; Pang et al., 2021; Mishra & Misra, 2022). The comparison of such models makes it impossible to determine the relationship between the coefficients. This is due to the fact that both these models belong to different classes. The Cox proportional hazards (Cox PH) model is a semiparametric one. In this case, no baseline hazard function is specified. This model is particularly attractive when the researcher has only a weak theory supporting a specific parametric model (Landmesser, 2010). In contrast, the AFT model is a parametric one. In such case, an assumption about the duration distribution is made. It is impossible to find a mathematical relationship between the parameters of the semiparametric PH and parametric AFT models. In the literature, there are also issues related to the comparison of parametric PH and parametric AFT models. However, these are applications of these models to the analyses of economic and social phenomena assuming different distributions of duration. There is also no relationship between the PH and AFT models. Many studies confirm that models based on the assumption of an exponential distribution of survival time give a good description. The added value of this study is the determination of the relationship between the parameters of the PH and AFT models assuming an exponential distribution of duration time and the interpretation of this relationship based on the analysis of data on registered unemployment.
The use of both models in the study of socio-economic phenomena confirms that in many cases a good description is provided by models based on the assumption of an exponential distribution of survival time. The two research questions arise:
Q1: What is the relationship between the coefficients of the PH and AFT models in the case of an exponential survival time distribution?
Q2: How does the relationship between the coefficients of the PH and AFT models in the case of an exponential survival time distribution affect the interpretation of these coefficients?
The aim of this article is to evaluate the relationship between the coefficients of the proportional hazards model and the accelerated failure time model, assuming an exponential distribution of survival time.
An example of the application and interpretation of the coefficients of both models is the assessment of duration in registered unemployment. The empirical study was conducted based on individual data from the Poviat Labor Office in Szczecin (Poland) in 2021.

2. Methods of Survival Analysis

Survival (duration) analysis methods are used when the duration of a given phenomenon is studied and is described by a random variable T. These methods allow censored data to be used in the study. It contains observations that do not end with an event before the end of the observation period, or end with an event other than the required one. This approach allows all observed units to be included in the study. The basic function in the survival analysis is the survival (duration) function S t , described by the formula:
S t = 1 F t = P T > t = t f u d u
where:
  • t—time to the event,
  • F t —cumulative distribution function of random variable T,
  • f t —probability density function,
  • P t —probability.
The survival function specifies the unconditional probability that the event of interest has not occurred by time t (Aalen et al., 2008).
The second function is the hazard function h t , describing the intensity of the event at time t:
h t = S t S t
Figure 1 shows a graph of the survival function S t and a geometric interpretation of the hazard function h t for time t > 0 (Bieszk-Stolorz, 2015). Let x denote the distance between t 1 and t . The non-increasing function S t guarantees that t 1 t > 0 . We obtain:
x = 1 h t
The hazard function h(t) allows calculation of the value of the hazard function at time t. The cumulative hazard function H(t) is the cumulative amount of hazard up to time t. It is defined by the formula (Klein & Moeschberger, 2003):
H t = 0 t h u d u
While the instantaneous hazard rate h(t) can increase or decrease with time, the cumulative hazard rate H(t) can only increase or remain the same.
The most common methods used in the survival analysis are the proportional hazards (PH) and accelerated failure time (AFT) models.

2.1. Proportional Hazards (PH) Models

The assumption for the proportional hazards (PH) models is that the hazard ratio (HR) is constant at time t. The hazard function at time t in the PH models is given as follows (Qi, 2009):
h t x = h 0 t e x p i = 1 n β i x i
where:
  • h 0 t —baseline hazard,
  • x = x 1 , x 2 , , x n —vector of covariates,
  • β = β 1 , β 2 , , β n —vector of coefficients,
  • n—number of covariates in the model.
The characteristics of hazard function change proportionally to the influence of explanatory variables (Landmesser, 2009). The baseline hazard function is unspecified, so the time-to-event random variable is not assumed to follow any particular distribution. This is one of the attractive features of the PH model.
Hazard ratio is defined as the hazard for one individual divided by the hazard for a different individual (Kleinbaum & Klein, 2005) and is defined by the formula:
H R = H R t , x i , x j = h t x i h t x j
where:
  • h t x i —hazard function,
  • t —time to the event
  • x i = x i 1 , x i 2 , , x i n —vector of covariates for the i-th unit,
  • x j = x j 1 , x j 2 , , x j n —vector of covariates for the j-th unit,
  • n—number of covariates in the model.
In the case of the hazard function of the form (5), HR is defined by the formula:
H R t , x i , x j = exp k = 1 n β k ( x i x j )
Formula (7) is simplified in the case when the i-th and j-th units differ only in the values of the xk variable. If there is also x i k x j k = 1 , then the formula for the hazard ratio is as follows:
H R t , x i , x j = exp β k
The term “proportional hazards model” refers to the fact that for two different individuals, the ratio of their hazard functions is constant, assuming that xi and xj do not change over time.

2.2. Accelerated Failure Time (AFT) Models

The accelerated failure time (AFT) model is an alternative to the PH model (Pang et al., 2021). Currently, it is not widely used to analyse clinical trial data, although it is quite common in the manufacturing field. Like the PH model, the AFT model describes the relationship between survival probabilities and a set of variables. In the simple case of two groups, it can be written as (Kleinbaum & Klein, 2005):
S 1 t = S 2 γ t
where:
  • γ —the acceleration factor,
  • S 1 , S 2 —survival functions for group 1 and group 2, respectively.
The assumptions of the AFT model for the random variable T can be written as follows (Kleinbaum & Klein, 2005):
T 2 = γ T 1
that is:
γ = T 2 T 1
where:
  • T 1 —survival time of group 1,
  • T 2 —survival time of group 2.
The acceleration factor γ is the key measure of association obtained in the AFT model. After reparameterisation we obtain:
γ = exp i = 1 n α i x i
where:
  • x = x 1 , x 2 , , x n —vector of covariates,
  • a = α 1 , α 2 , , α n —vector of coefficients,
  • n—number of covariates in the model.
Two important conclusions follow from Formulas (9) and (10) (Saikia & Barman, 2017):
  • If γ > 1 , the effect of covariate is decelerated.
  • If γ < 1 , the effect of covariate is accelerated.
Differences between groups are expressed in terms of direct impact on time. For the AFT models, it is assumed that independent variables act additively on the logarithm of time and therefore multiplicatively on time (Patel et al., 2006). The AFT regression model is specified as follows (Basha & Gjika, 2022):
l n T = α 0 + i = 1 n α i x i + σ ε
where:
  • x = x 1 , x 2 , , x n —vector of covariates,
  • a = α 1 , α 2 , , α n —vector of coefficients,
  • α 0 —the intercept,
  • ε —random errors that does not depend on x i ,
  • σ —an unknown scale parameter,
  • n—number of covariates in the model.
This means that the relationship between logarithm of survival time variable and independent variables is linear according to the accelerated failure time model. It can be assumed that the random error follows one of many distributions.
The AFT model assumes that factors interrelate with time directly in the baseline hazard function ( h 0 t ). In this case, the hazard function at time t can be given as (Qi, 2009):
h t x = t γ h 0 t γ
After applying Formula (12), we obtain:
h t x = h 0 t · e x p i = 1 n α i x i e x p i = 1 n α i x i
For the AFT model, the ratio of durations (time ratio) for the i-th and j-th units, characterised by different values of the explanatory variables ( x i and x j ), can be written as follows:
t i t j = γ = exp i = 1 n α i x i
If the i-th and j-th units differ from each other in the values of only one variable x k , and x i k x j k = 1 , then:
t i t j = γ = exp α k

2.3. Main Differences Between PH and AFT Models

Not for every duration distribution can we estimate both PH and AFT models. The exponential and Weibull models are the only parametric models that can be written with either a PH or AFT specification. For the Gompertz distribution we can only apply the PH model. For the lognormal, loglogistic, and generalised gamma distributions only the AFT specification exists. The main differences between the PH and AFT models can be divided into two groups.
1.
Basic assumptions of the models:
In the PH models, the effect of the influence of variables is multiplicative (proportional) to the hazard. In the AFT models, the effect of variable influence is multiplicative (proportional) to the duration.
2.
Different interpretation of coefficients:
The PH models are used to compare hazards (the intensity of an event). AFT models are used to compare survival times. Differences in the interpretation of the coefficients of the PH and AFT models are presented in Figure 2 and Figure 3. Each of them shows survival curves for two groups. On Figure 2 the moment t is fixed and for this moment the hazards h1(t) and h2(t) are determined. On the basis of Figure 1 and Figure 2 we obtain:
H R = h 2 ( t ) h 1 ( t ) = t t 1 t t 2
where t t i is the length of the line segment between points t and ti for i = 1, 2.
Figure 3 presents the fixed probability of survival S(t), for which the corresponding survival times t1 and t2 for groups are determined. On the basis of Figure 1 and Figure 3 we obtain:
γ = 0 t 2 0 t 1
where 0 t i is the length of the line segment between points 0 and ti for i = 1, 2.

2.4. Application of Survival Analysis Methods in the Study of Socio-Economic Phenomena

Survival analysis in labour market research began to be used in the 1980s and 1990s. The aim of these analyses was to properly address the activities counteracting long-term unemployment.
Meyer (1990) analysed the impact of the level and duration of unemployment benefits on the duration of unemployment. Using the Kaplan-Meier estimator and other survival analysis models, he showed that the probability of remaining unemployed increased at the end of the period of receiving benefits. Han and Hausman (1990) used the parametric model of proportional hazards and the competing risk model to examine the duration of unemployment until the moment of leaving for a “new job” and the moment of leaving for a “previous job” (recall). They showed that the fact of receiving the benefit had a significant impact on the duration of unemployment, with its impact on hazard in the case of starting a new job was stronger than in the case of returning to the previous job. Research on the duration of unemployment for women and men, with particular emphasis on their marital status, was conducted by Marcassa (2012). Individual data on the unemployed in France in the years 1991–2002 allowed for the construction and analysis of Kaplan–Meier estimators. Married men left unemployment more quickly than unmarried men, and the opposite was true for women. Married women were less likely to quickly start employment than single women. Labour market research using the Kaplan–Meier estimator and the Cox proportional hazards model was conducted in Romania (Babucea & Danacica, 2007; Ciucă & Matei, 2011; Kavkler et al., 2009). Among other factors, the impact of gender, age, education, marital status, the number of children and the place of residence during the time of looking for a job were analysed. The results of the study indicated a large regional variation in the time of leaving unemployment. Women started employment faster than men, but after taking into account marital status, married men were the fastest at finding work, while married women had the least chance. The chance of leaving unemployment increased with education and decreased with age. Not only people with higher education, but also those with vocational education were in a good situation on the labour market in Romania. Tansel and Taşçı (2010) studied the duration of unemployment in Turkey in the years 2000 and 2001. To assess the differences in the time of leaving unemployment, a non-parametric Turnbull estimator was used. Landmesser (2007) applied competing risk models to describe the conditional probability of transition between employment and inactivity and determined the impact of age, education level, place of residence, and type of company ownership on the individual duration of employment.
Other socio-economic phenomena can also be analysed using survival analysis methods: economic activity of the population (Landmesser, 2013), poverty dynamics in urban and rural households (Sączewska-Piotrowska, 2015), credit risk (Matuszyk, 2017; Wycinka, 2019), and probability and intensity of decline and increase in stock prices on the capital market (Bieszk-Stolorz & Dmytrów, 2021). The range of applications of survival analysis methods is very wide and includes political science. These methods have been used in the study of international relations and are used to analyse the duration of post-war peace, civil wars and alliances (Box-Steffensmeier & Zorn, 2001; Box-Steffensmeier et al., 2003; Box-Steffensmeier & Jones, 2004).

3. Symmetry of Coefficients in PH and AFT Models for Exponential Distribution of Survival Time T

In the case of parametric survival models, the survival time T follows a specific distribution. For example, there are the following distributions: exponential, Weibull, loglogistic, log-normal, and Gompertz. If we assume that the time variable T is subject to an exponential distribution with the parameter λ, the basic survival functions take the form:
S t = e x p λ t
h t = λ
h 0 t = 1
where λ > 0 and t 0 .
In order to determine the duration of the phenomenon, Formula (20) should be transformed to determine the time t. After logarithmisation, we obtain:
l n S t = l n e x p λ t
Using the properties of logarithms, we have:
l n S t = λ t
that is:
t = l n S t 1 λ
In the PH model, the λ coefficient is reparametrised and then defined by the formula:
λ = e x p β 0 + i = 1 n β i x i
In the AFT model, the coefficient 1/λ is parameterised. Let it have the form:
1 λ = e x p α 0 + i = 1 n α i x i
The exponential model satisfies both the PH and AFT assumptions. Formula (27) can be written after transformation as:
λ = e x p α 0 i = 1 n α i x i
From Formulas (26) and (28), we can read the relationship between the coefficients β i in the exponential PH model and the coefficients α i in the exponential AFT model for i = 1, 2, …, n. This relationship is as follows:
β i = α i
A thorough justification and derivation of relation (29) was carried out by Machin et al. (2006, pp. 102–103) and Collett (2023, pp. 203, 242–243). This derivation concerns the relationship between the parameters of the PH and AFT models for the Weibull distribution. This distribution for the shape parameter equal to 1 is an exponential distribution.
A binary relation R over a set X is symmetric (Biggs, 2002) if:
a , b X a R b b R a
where the notation aRb means that (a, b) ∈ R. This means that R is symmetric, by which we mean that whenever aRb, then also bRa (Enderton, 1977).
The relation between the coefficients defined by Formula (29) satisfies the condition (30), i.e., it is symmetric. The consequence of this relationship is the interpretation of the coefficients. If β i > 0 , then α i < 0 . Conversely, if β i < 0 , then α i > 0 . The coefficients of the PH and AFT exponential models have the same magnitude, but have the opposite signs. The change in sign makes sense because the PH format uses covariates to model the hazard rate whereas the AFT format uses covariates to model the survival times. If the risk or hazard increases then the average survival time decreases (Machin et al., 2006). The relationship given by the Formula (29) also implies a relationship between HR and γ coefficients. Using Formulas (6) and (16) for the exponential distribution, we obtain:
H R = 1 γ
Formula (31) implicates the following differences in the interpretation of the coefficients HR and γ:
  • If γ > 1, then the time to the event is higher than in the reference group. In this case, HR < 1, i.e., the intensity of the event, is lower compared to the reference group.
  • If γ < 1, then the time to the event is lower than in the reference group. In this case, HR > 1, i.e., the intensity of the event is higher than in the reference group.
  • In the case, when γ = HR = 1, then the time to the incident and the intensity of the incident are the same as in the reference group.
In the case of exponential PH and AFT models, there is an equality between the parameters describing the quality and fit of the model (Likelihood-ratio test, Akaike’s Information Criterion, Log-Likelihood Function, Bayesian Information Criterion) and between the standard errors of the coefficients’ estimators of both models. The equality is also a symmetric relation, so one can speak of symmetry between these parameters. The exponential PH model is valid if the exponential AFT model is valid (Olive, 2003).

4. Analysis of Duration in Registered Unemployment—An Empirical Example

The empirical example has been based on the data from the Poviat Labour Office in Szczecin (Poland), in order to illustrate the theoretical considerations presented earlier. The duration of registered unemployment was analysed. The data includes the date of registration, date of deregistration, gender, level of education, age and seniority of the deregistered person, as well as information about the reason for deregistration. These were individual data on 8878 people deregistered from the office in 2021. Deregistration due to starting work was considered to be an event ending full observation. Deregistration due to other reasons (retirement or disability pension, going abroad, starting studies, resigning from the office’s mediation, etc.) was considered as the right-censored observation. There were 36.89% of such observations. In this case, the random variable T describes the time from the moment of registration with the office to the moment of starting work (the event ending the observation), i.e., the duration of being in registered unemployment.
Table 1 presents the analysed characteristics of the unemployed with the reference category marked by asterisks. The reference category does not appear in the model, while all other categories are replaced with dichotomous dummy variables. The third column of Table 1 lists the names of individual variables appearing in the PH and AFT models. Table 2 presents the structure of analysed unemployed persons.
Table 3 and Table 4 present the results of coefficient estimation for the exponential PH and AFT models. The calculations were performed in Stata 12.0 software in the Survival analysis module. According to the theoretical considerations presented earlier, the estimators of the coefficients differ from each other only in signs, and the standard deviations are equal. The fit factors of the models are also equal. All parameters are statistically significant, and therefore the analysed characteristics affect the intensity of starting work (PH model) and the time of registration at the labour office (AFT model).
We can evaluate the fit of the model by using the Cox-Snell residuals (Cox & Snell, 1968). If the model fits the data well, plotting the Cox-Snell residuals (values of the cumulative hazard estimated by the model) against the Nelson-Aalen estimator of cumulative hazard (a non-parametric estimate) would result in a straight line (Collett, 2023). The Nelson-Aalen estimator of the cumulative hazard function is a non-parametric estimate of the cumulative hazard rate and is defined by the formula:
H t = t i t d i n i
where:
  • d i —number of events at the time ti,
  • n i —number of units at the time ti.
Because the line obtained (coloured blue) follows the 45 degree line (coloured red) then we know that the model fits the data well (Figure 4).
Table 5 presents the coefficients’ estimators of the PH and AFT models. In addition, on the basis of Formulas (7) and (17), HR and γ were determined. The results in Table 4 show the relationship (29) between the coefficients β i and α i . The plus sign next to the coefficients indicates that the intensity of exit from unemployment or the duration of unemployment is greater in a given group of people compared to the reference group. Similarly, the minus sign next to the coefficients indicates that the intensity of leaving unemployment or the duration of unemployment are lower in a given group of people compared to the reference group. More detailed information is provided by the HR and γ coefficients. They are inverse to each other, as indicated by the relationship (31). The empirical study shows that the intensity of starting work was 17.7% lower for women than for men. Unemployed people with work experience were 7.8% more likely to find work than those without work experience. The higher the level of education, the greater the intensity of leaving registered unemployment. This intensity was more than twice as high in the case of people with higher education compared to those with, at most, lower secondary education. The intensity of starting work decreased with the age of the unemployed person and for people aged 60+ it was almost 84% lower than for young people aged 18 to 24. Changes in the transition time from unemployment to work were in the opposite direction. For women this time was 20.7% longer than for men. People with work experience had a 7.3% shorter time to exit unemployment compared to people without work experience. The higher the level of education, the shorter the time of being registered at the office. People with higher education searched for a job for about 52% less time than people with, at most, lower secondary education. The time of transition from unemployment to work increased with the age of the unemployed person. The oldest people (i.e., aged 60+) had more than twice as long this time as the youngest people (i.e., aged 18–24). The results indicate that registered young people started work more often and faster than older people. In the light of the study, this was the age group with the best indicators. Individual data show that middle-aged people in 2021 were more likely to give up co-operation with labour offices and less likely to accept jobs offered.

5. Discussion and Conclusions

This paper investigates the relationship between the coefficients of the PH and AFT models in the context of an exponential duration distribution. Theoretical considerations contributed to answers to both previously asked research questions:
  • Answer to Q1: In the case of an exponential distribution of survival time, there is a symmetric relation between the coefficients of the PH and AFT models.
  • Answer to Q2: The symmetric relation between the coefficients of the PH and AFT models (for exponential survival) indicates the opposite direction of the effects of the variables on hazard and the duration of the phenomenon.
The signs of the coefficients in the exponential AFT model are opposite to the signs for the exponential PH model (Patel et al., 2006; Qi, 2009). This relationship is illustrated in the article with an example from the labour market. To interpret the coefficients, their exponential form, i.e., the hazard ratios (HR) and the acceleration factors ( γ ), were used. In the case of an exponential distribution of duration, HR and γ are inverse to each other. A negative effect on unemployment duration implies a positive effect on the hazard rate from unemployment and vice versa. The obtained results of the empirical study confirm the research conducted by Dendir (2006). A greater survival time indicates a longer time before an event occurs, a greater hazard rate indicates a shorter time before an event would occur (Gelfand et al., 2016).
The AFT model is rarely used to analyse survivorship data, but offers a potentially useful statistical approach. This approach is based on the survival curve and not on the hazard function. The AFT model’s deceleration rates provide a more intuitive measure of treatment effect than the hazard ratio and are robust to departures from modelling assumptions (Swindell, 2009). The results of the AFT models may be easier to interpret as the covariate effects are directly expressed in terms of time ratio (Khanal et al., 2014). There is also a dilemma as to which of the PH models is better: the Cox regression model (semiparametric) or parametric model? Research indicates that the parametric model is more informative than the Cox model (Bradburn et al., 2003). Research conducted by Pourhoseingholi et al. (2007) indicated that the Cox PH model and exponential PH model were similarly good models in multivariate analysis and yielded the same conclusions in univariate analysis. Exponential distribution is simpler for use and interpretation because it is independent of the length of time (Rashidi & Mohammadian, 2011). However, it seems that there is no single model that is much better than the others.
The considerations presented in the article are an introduction to broader research on the relationships between the coefficients of the PH and AFT models in the case of durations other than exponential. A certain limitation here is that not both models exist for every distribution. The condition of having opposite signs for the coefficients of these models is certainly satisfied. However, the absolute values of the coefficients are different, which implies a relation other than a symmetric relation.
Additionally, the analysis of the duration of unemployment has certain social implications. The study of the duration of unemployment has made it possible to identify groups of people who are most at risk of unemployment. These are people who have had a long time searching for a job and leave the employment office register less intensively. The study presented in the article might indicate that these are women, people with no professional experience, older people and those with a lower level of education. State social policy should be targeted at such groups of people. It is important that studies of this type cover the smallest possible territorial units. This is the only way to identify areas at risk of high unemployment. Empirical analysis was carried out for data from year 2021. Formally, it was still a period of ongoing pandemic. However, the indicators pointed to an improvement in the situation and the return of positive trends in the labour market in Poland. According to the Statistics Poland, from February 2021 to December 2021, the registered unemployment rate steadily decreased and reached 5.8% (Statistics Poland, n.d.). Thus, a level similar to the pre-pandemic level had been reached. Recognising the problems of young people, public employment services in Poland are taking action to activate them and support them in entering the labour market. Young people are among the main addressees of activation programmes carried out by labour offices. Data for the entire EU show that in 2021 the youth unemployment rate in the 15–29 age group in Poland was lower than the European Union average (7.2% vs. 13%). This is certainly the result of the activation measures implemented. They should be continued on a regular basis. In Poland, the Youth Guarantee programme has been implemented since 2014. Activities activating young people carried out by labour offices are one of those undertaken in this programme. The research confirms the need to continue support programs for young job seekers.
One of the main research limitations is access to individual data. Statistical offices only have aggregated data, which prevents the use of most methods from the area of survival analysis. In Poland, in order to obtain individual data, research cooperation with Poviat labour offices is necessary. Only these offices have collected individual data. Another limitation of this study is the fact that it concerns registered unemployment.
The theoretical goal of future research will be to try to find the relationship between the coefficients of the PH and AFT models for other duration distributions. The empirical objective of future research will be to assess the intensity of (PH models) and time to exit unemployment (AFT models) using semiparametric and parametric models.

Funding

This research received no external funding.

Data Availability Statement

The datasets presented in this article are not readily available because the data are part of an ongoing study or due. Requests to access the datasets should be directed to the Poviat Labour Office in Szczecin (Poland).

Conflicts of Interest

The author declares no conflicts of interest.

References

  1. Aalen, O. O., Borgan, Ø., & Gjessing, H. K. (2008). Survival and event history analysis: A process point of view. Springer. [Google Scholar]
  2. Addison, J. T., & Portugal, P. (1998). Some specification issues in unemployment duration analysis. Labour Economics, 5(1), 53–66. [Google Scholar] [CrossRef]
  3. Babucea, A. G., & Danacica, D. E. (2007). Using kaplan—Meier curves for preliminary evaluation the duration of unemployment spells, MPRA paper 7854. University Library of Munich. Available online: https://core.ac.uk/download/pdf/7296501.pdf (accessed on 14 November 2024).
  4. Basha, L., & Gjika, E. D. (2022). Accelerated failure time models in analyzing duration of employment. Journal of Physics: Conference Series, 2287, 012014. [Google Scholar] [CrossRef]
  5. Bieszk-Stolorz, B. (2015). Geometric method of determining hazard for the continuous survival function. Folia Oeconomica Stetinensia, 15, 22–33. [Google Scholar] [CrossRef]
  6. Bieszk-Stolorz, B. (2017). Cumulative incidence function in studies on the duration of the unemployment exit process. Folia Oeconomica Stetinensia, 17, 138–150. [Google Scholar] [CrossRef]
  7. Bieszk-Stolorz, B., & Dmytrów, K. (2021). Evaluation of changes on world stock exchanges in connection with the SARS-CoV-2 pandemic. survival analysis methods. Risks, 9(7), 121. [Google Scholar] [CrossRef]
  8. Bieszk-Stolorz, B., & Markowicz, I. (2022). The impact of the COVID-19 pandemic on the situation of the unemployed in Poland. A study using survival analysis methods. Sustainability, 14(19), 12677. [Google Scholar] [CrossRef]
  9. Biggs, N. L. (2002). Discrete mathematics (2nd ed.). Oxford University Press. [Google Scholar]
  10. Box-Steffensmeier, J. M., & Jones, B. S. (2004). Event history modeling a guide for social scientists. Cambridge University Press. [Google Scholar]
  11. Box-Steffensmeier, J. M., & Zorn, C. J. W. (2001). Duration models and proportional hazards in political science. American Journal of Political Science, 45, 972–988. [Google Scholar] [CrossRef]
  12. Box-Steffensmeier, J. M., Reiter, D., & Zorn, C. J. W. (2003). Nonproportional hazards and event history analysis in international relations. The Journal of Conflict Resolution, 47, 33–53. [Google Scholar] [CrossRef]
  13. Bradburn, M., Clark, T. G., Love, S., & Altman, D. (2003). Survival analysis part II: Multivariate data analysis—An introduction to concepts and methods. British Journal of Cancer, 89, 431–436. [Google Scholar] [CrossRef]
  14. Calabuig, J. M., Jiménez-Fernández, E., Sánchez-Pérez, E. A., & Manzanares, S. (2021). Modeling hospital resource management during the COVID-19 pandemic: An experimental validation. Econometrics, 9(4), 38. [Google Scholar] [CrossRef]
  15. Ciucă, V., & Matei, M. (2011). Survival rates in unemployment. International Journal of Mathematical Models and Methods in Applied Sciences, 5, 362–370. [Google Scholar]
  16. Collett, D. (2023). Modelling survival data in medical research (4th ed.). CRC Press. [Google Scholar]
  17. Cox, D. R., & Snell, E. J. (1968). A general definition of residuals. Journal of the Royal Statistical Society. Series B (Methodological), 30, 248–265. [Google Scholar] [CrossRef]
  18. Dendir, S. (2006). Unemployment duration in poor developing economies: Evidence from urban Ethiopia. The Journal of Developing Areas, 40, 181–201. [Google Scholar] [CrossRef]
  19. Doğan, F. İ. (2019). Unemployment benefits and unemployment duration in France and Poland. Marmara Journal of European Studies, 27, 191–216. [Google Scholar] [CrossRef]
  20. Doğan, F. İ. (2020). Social transfers and unemployment duration: An empirical evidence of the EU and Turkey. Marmara Üniversitesi İktisadi ve İdari Bilimler Dergisi, 42, 247–264. [Google Scholar] [CrossRef]
  21. Enderton, H. B. (1977). Elements of set theory. Academic Press. [Google Scholar]
  22. Faruk, A. (2018). The comparison of proportional hazards and accelerated failure time models in analyzing the first birth interval survival data. Journal of Physics: Conference Series, 974, 012008. [Google Scholar] [CrossRef]
  23. Gelfand, L. A., MacKinnon, D. P., DeRubeis, R. J., & Baraldi, A. N. (2016). Mediation analysis with survival outcomes: Accelerated failure time vs. proportional hazards models. Frontiers in Psychology, 7, 423. [Google Scholar] [CrossRef]
  24. Grzenda, W. (2023). Estimating the probability of leaving unemployment for older people in Poland using survival models with censored data. Statistics in Transition. New Series, 24, 241–256. [Google Scholar] [CrossRef]
  25. Han, A., & Hausman, J. A. (1990). Flexible parametric estimation of duration and competing risk models. Journal of Applied Econometrics, 5, 1–28. [Google Scholar] [CrossRef]
  26. Kavkler, A., Danacica, D. E., Babucea, A. G., Bicanic, I., Bohm, B., Tevdovski, D., Tosevska, K., & Borsic, D. (2009). Cox regression models for unemployment duration in Romania, Austria, Slovenia, Croatia, and Macedonia. Romanian Journal of Economic Forecasting, 10, 81–104. [Google Scholar]
  27. Khamis, A., Hamat, C. A. C., & Abdullah, M. A. A. (2020). Modeling students’ performance using Cox and parametric accelerated failure time models. Scientific Research Journal, 8, 44–49. [Google Scholar] [CrossRef]
  28. Khanal, S. P., Sreenivas, V., & Acharya, S. K. (2014). Accelerated failure time models: An application in the survival of acute liver failure patients in India. International Journal of Science and Research, 6, 161–166. [Google Scholar]
  29. Khanal, S. P., Sreenivas, V., & Acharya, S. K. (2019). Comparison of Cox proportional hazards model and lognormal accelerated failure time model: Application in time to event analysis of acute liver failure patients in India. Nepalese Journal of Statistics, 3, 21–40. [Google Scholar] [CrossRef]
  30. Klein, J. P., & Moeschberger, M. L. (2003). Survival analysis techniques for censored and truncated data (2nd ed.). Springer. [Google Scholar]
  31. Kleinbaum, D. G., & Klein, M. (2005). Survival analysis a self-learning text (2nd ed.). Springer. [Google Scholar]
  32. Lalive, R. (2008). How do extended benefits affect unemployment duration? A regression discontinuity approach. Journal of Econometrics, 142, 785–806. [Google Scholar] [CrossRef]
  33. Landmesser, J. M. (2007). Wykorzystanie modeli ryzyka konkurencyjnego w celu szacowania czasu trwania aktywności zawodowej. Prace Naukowe Akademii Ekonomicznej im. Oskara Langego we Wrocławiu. Taksonomia, 14, 542–550. [Google Scholar]
  34. Landmesser, J. M. (2009). Econometric analysis of unemployment duration using hazard models. Studia Ekonomiczne, 1–2, 79–92. [Google Scholar]
  35. Landmesser, J. M. (2010). A dynamic approach to the study of unemployment duration. Metody Ilościowe w Badaniach Ekonomicznych, 11, 212–222. [Google Scholar]
  36. Landmesser, J. M. (2013). Wykorzystanie metod analizy czasu trwania do badania aktywności ekonomicznej ludności w Polsce. Wydawnictwo SGGW. [Google Scholar]
  37. Machin, D., Cheung, Y. B., & Parmar, M. (2006). Survival analysis: A practical approach (2nd ed.). John Wiley & Sons. [Google Scholar]
  38. Marcassa, S. (2012). Unemployment duration of spouses: Evidence from France. THEMA working papers 2012-31. Université de Cergy-Pontoise. [Google Scholar]
  39. Matuszyk, A. (2017). Credit scoring. CeDeWu. [Google Scholar]
  40. Meyer, B. D. (1990). Unemployment insurance and unemployment spells. Econometrica, 58, 757–782. [Google Scholar] [CrossRef]
  41. Mishra, N., & Misra, S. (2022). Comparison of Cox proportional hazard model and accelerated failure time (AFT) models: An application to under-five mortality in Uttar Pradesh. Indian Journal of Applied Research, 12, 83–87. [Google Scholar] [CrossRef]
  42. Moffitt, R. (1985). Unemployment insurance and the distribution of unemployment spells. Journal of Econometrics, 28, 85–101. [Google Scholar] [CrossRef]
  43. Nickell, S. (1979). Estimating the probability of leaving unemployment. Econometrica, 47, 1249–1266. [Google Scholar] [CrossRef]
  44. Olive, D. J. (2003). Available online: http://parker.ad.siu.edu/Olive/survbook.pdf (accessed on 12 July 2024).
  45. Orbe, J., Ferreira, E., & Núñez-Antón, V. (2002). Comparing proportional hazards and accelerated failure time models for survival analysis. Statistics in Medicine, 21, 3493–3510. [Google Scholar] [CrossRef]
  46. Pang, M., Platt, R. W., Schuster, T., & Abrahamowicz, M. (2021). Spline-based accelerated failure time model. Statistics in Medicine, 40, 481–497. [Google Scholar] [CrossRef] [PubMed]
  47. Patel, K., Kay, R., & Rowell, L. (2006). Comparing proportional hazards and accelerated failure time models: An application in influenza. Pharmaceutical Statistics: The Journal of Applied Statistics in the Pharmaceutical Industry, 5, 213–224. [Google Scholar] [CrossRef] [PubMed]
  48. Pourhoseingholi, M. A., Hajizadeh, E., Dehkordi, B. D. M., Safaee, A., Abadi, A., & Zali, M. R. (2007). Comparing Cox regression and parametric models for survival of patients with gastric carcinoma. Asian Pacific Journal of Cancer Prevention, 8, 412–416. [Google Scholar] [PubMed]
  49. Putek-Szeląg, E., & Gdakowicz, A. (2021). Application of duration analysis methods in the study of the exit of a real estate sale offer from the offer database system. In K. Jajuga, K. Najman, & M. Walesiak (Eds.), Data analysis and classification. Methods and applications (pp. 153–169). Springer. [Google Scholar] [CrossRef]
  50. Qi, J. (2009). Comparison of proportional hazards and accelerated failure time models [Ph.D. Thesis, University of Saskatchewan]. [Google Scholar]
  51. Rashidi, T. H., & Mohammadian, A. (Kouros). (2011). Parametric hazard functions: Overview. Transportation Research Record, 2230, 48–57. [Google Scholar] [CrossRef]
  52. Sączewska-Piotrowska, A. (2015). Poverty duration of households of the self-employed. Econometrics. Ekonometria, 1, 44–55. [Google Scholar] [CrossRef]
  53. Saikia, R., & Barman, M. P. (2017). A review on accelerated failure time models. International Journal of Statistics and Systems, 12, 311–322. [Google Scholar]
  54. Statistics Poland. (n.d.). Available online: https://stat.gov.pl/en/poland-macroeconomic-indicators/ (accessed on 16 November 2024).
  55. Swindell, W. R. (2009). Accelerated failure time models provide a useful statistical framework for aging research. Experimental gerontology, 44, 190–200. [Google Scholar] [CrossRef]
  56. Tansel, A., & Taşçı, H. M. (2010). Hazard analysis of unemployment duration by gender in developing county: The case of Turkey. Labour, 24, 501–530. [Google Scholar] [CrossRef]
  57. Urbańczyk, D. M. (2020). Competing risks models for an enterprises duration on the market. Folia Oeconomica Stetinensia, 20, 456–473. [Google Scholar] [CrossRef]
  58. Wycinka, E. (2019). Competing risk models of default in the presence of early repayments. Econometrics. Ekonometria. Advances in Applied Data Analytics, 23, 99–120. [Google Scholar] [CrossRef]
Figure 1. Plot of survival functions S t and geometric interpretation of hazards h t for t > 0 .
Figure 1. Plot of survival functions S t and geometric interpretation of hazards h t for t > 0 .
Econometrics 13 00001 g001
Figure 2. Plots of survival functions for two groups and HR interpretation.
Figure 2. Plots of survival functions for two groups and HR interpretation.
Econometrics 13 00001 g002
Figure 3. Plots of survival functions for two groups and the interpretation of the γ coefficient.
Figure 3. Plots of survival functions for two groups and the interpretation of the γ coefficient.
Econometrics 13 00001 g003
Figure 4. Fitting the model to empirical data using the Cox–Snell residuals.
Figure 4. Fitting the model to empirical data using the Cox–Snell residuals.
Econometrics 13 00001 g004
Table 1. Characteristics of the unemployed and their categories.
Table 1. Characteristics of the unemployed and their categories.
CharacteristicCategoryVariable Name
Gendermales *
femalesgender
Senioritywithout seniority *
with seniorityseniority
Level of educationat most lower secondary *
basic vocationaleducation2
general secondaryeducation3
vocational secondaryeducation4
highereducation5
Age18–24 *
25–34age2
35–44age3
45–54age4
55–59age5
60–64age6
* Reference category.
Table 2. Structure of the unemployed persons.
Table 2. Structure of the unemployed persons.
CharacteristicCategoryTotal SizeStarting WorkRight-Censored Observations
Gendermales *476627652001
females411228561256
Senioritywithout seniority *345718921565
with seniority542137291692
Level of educationat most lower secondary *19298501079
basic vocational1648943705
general secondary1264823441
vocational secondary16471139508
higher23901866524
Age18–24 *948586362
25–3425981762836
35–4424421631811
45–5416201053567
55–59587360227
60–64683229454
Total 887856213257
* Reference category.
Table 3. Results of the estimation of the PH model coefficients for the exponential distribution of the duration of registered unemployment.
Table 3. Results of the estimation of the PH model coefficients for the exponential distribution of the duration of registered unemployment.
VariableCoef.Std. Err.zp > |z|95% Conf. Interval
gender−0.18840.0276−6.830.000[−0.2424, −0.1344]
seniority0.07550.03052.480.013[0.0158, 0.1352]
education20.34900.04847.220.000[0.2543, 0.4438]
education30.35060.04997.030.000[0.2529, 0.4483]
education40.52340.046011.370.000[0.4332, 0.6136]
education50.72680.043516.720.000[0.6416, 0.8121]
age2−0.27930.0493−5.660.000[−0.3760, −0.1825]
age3−0.47540.0509−9.340.000[−0.5751, −0.3757]
age4−0.58870.0543−10.840.000[−0.6951, −0.4822]
age5−0.66320.0695−9.540.000[−0.7994, −0.5269]
age6−1.81660.0807−22.510.000[−1.9748, −1.6585]
Const.−2.34600.0515−45.540.000[−2.4470, −2.2450]
LR chi2(11)1188.07
Log likelihood−12,364.40
Prob > chi20.0000
AIC24,752.8
BIC24,837.9
Table 4. Results of the estimation of the AFT model coefficients for the exponential distribution of the duration of registered unemployment.
Table 4. Results of the estimation of the AFT model coefficients for the exponential distribution of the duration of registered unemployment.
VariableCoef.Std. Err.zp > |z|95% Conf. Interval
gender0.18840.02766.830.000[0.1344, 0.2424]
seniority−0.07550.0305−2.480.013[−0.1352, −0.0158]
education2−0.34900.0484−7.220.000[−0.4438, −0.2543]
education3−0.35060.0499−7.030.000[−0.4483, −0.2529]
education4−0.52340.0460−11.370.000[−0.6136, −0.4332]
education5−0.72680.0435−16.720.000[−0.8121, −0.6416]
age20.27930.04935.660.000[0.1825, 0.3760]
age30.47540.05099.340.000[0.3757, 0.5751]
age40.58870.054310.840.000[0.4822, 0.6951]
age50.66320.06959.540.000[0.5270, 0.7994]
age61.81660.080722.510.000[1.6585, 1.9748]
Const.2.34600.051545.540.000[2.2450, 2.4470]
LR chi2(11)1188.07
Log likelihood−12,364.40
Prob > chi20.0000
AIC24,752.8
BIC24,837.9
Table 5. Results of estimation of PH, AFT model coefficients and HR, γ parameters for the exponential distribution of the duration of registered unemployment.
Table 5. Results of estimation of PH, AFT model coefficients and HR, γ parameters for the exponential distribution of the duration of registered unemployment.
VariablePH Model
Coefficients   ( β i )
AFT Model
Coefficients   ( α i )
PH Model
HR   ( e x p ( β i ) )
AFT Model
γ   ( e x p ( α i ) )
gender−0.18840.18840.82831.2073
seniority0.0755−0.07551.07840.9273
education20.3490−0.34901.41770.7054
education30.3506−0.35061.41990.7043
education40.5234−0.52341.68770.5925
education50.7268−0.72682.06850.4834
age2−0.27930.27930.75631.3221
age3−0.47540.47540.62161.6087
age4−0.58870.58870.55511.8016
age5−0.66320.66320.51521.9409
age6−1.81661.81660.16266.1512
Const.−2.34602.34600.095810.444
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bieszk-Stolorz, B. Relationship Between Coefficients in Parametric Survival Models for Exponentially Distributed Survival Time—Registered Unemployment in Poland. Econometrics 2025, 13, 1. https://doi.org/10.3390/econometrics13010001

AMA Style

Bieszk-Stolorz B. Relationship Between Coefficients in Parametric Survival Models for Exponentially Distributed Survival Time—Registered Unemployment in Poland. Econometrics. 2025; 13(1):1. https://doi.org/10.3390/econometrics13010001

Chicago/Turabian Style

Bieszk-Stolorz, Beata. 2025. "Relationship Between Coefficients in Parametric Survival Models for Exponentially Distributed Survival Time—Registered Unemployment in Poland" Econometrics 13, no. 1: 1. https://doi.org/10.3390/econometrics13010001

APA Style

Bieszk-Stolorz, B. (2025). Relationship Between Coefficients in Parametric Survival Models for Exponentially Distributed Survival Time—Registered Unemployment in Poland. Econometrics, 13(1), 1. https://doi.org/10.3390/econometrics13010001

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop