1. Introduction
For the last three decades, disorders of the mother–infant relationship, including emotional rejection, have increasingly attracted the attention of clinicians and researchers in perinatal mental health [
1,
2,
3]. A variety of terms have been used to signify these conditions: bonding disorder, mother–infant relationship disorder, and maternal (parental) rejection. The essence of this syndrome is aversion to the infant with marked impairment of interaction [
2]. Bonding disorder in mothers after childbirth has been reported as linked to depression [
4,
5,
6,
7,
8,
9,
10,
11]. However, a recent longitudinal study indicated that bonding disorder and depression shared substantial covariance at one time but are not causal with each other [
8,
12].
Several self-report instruments have been developed to measure bonding disorder. They include the Maternal Postpartum Attachment Scale [
13], the Mother-to-Infant Bonding Scale (MIBS) [
14], and the Postpartum Bonding Questionnaire (PBQ) [
15]. Scores of these three instruments are moderately correlated [
16]. Among them, the PBQ is one of the most extensively studied with regard to its psychometric properties [
17,
18,
19]. Because the PBQ is a self-report questionnaire and is easy to administer, it may be used by obstetric and primary care health professionals as a screening instrument to detect mothers at risk of bonding disorders and child abuse [
16].
The validity and reliability of the original English version of the PBQ have been reported by the original author [
18]. Brockington and colleagues administered the instrument to 104 mothers and performed a principal component analysis (PCA) of the 84 PBQ items using varimax rotation, this was an orthogonal solution [
15]. Through this procedure, they selected 25 items with four main factors (components), including impaired bonding, rejection and anger, anxiety about care, and risk of abuse. When the German version of the PBQ was subjected to a PCA, the original four-factor (component) structure was not confirmed [
20]. Unlike exploratory factor analysis (EFA), PCA extracts the first component that explains as much variance of the scale as possible. Hence, the first component is likely to cover many more items than the subsequent components. In order to identify discrete facets of the items of an instrument, EFA is preferable to PCA. To the best of our knowledge, only two studies performed an EFA on the PBQ items. Kaneko and Honjo, using the Kaiser criterion as a measure to define the number of factors, identified five factors with eigenvalues greater than unity [
21]. Because the first factor showed a markedly high eigenvalue, they constructed a shorter version with 16 PBQ items that had a single-factor structure. In the other study using the Japanese version of the PBQ [
22], an EFA yielded four factors with a shorter version consisting of 14 items. Because these studies concluded different solutions, questions remained as to the factor structure of the Japanese version of the PBQ.
The purpose of this paper is to describe psychometric properties of the Japanese version of the PBQ in terms of its factor structure, construct validity, and test–retest reliability in a community (psychiatrically non-referred) population of Japanese mothers of 1 month-old infants.
2. Methods
2.1. Participants and Procedure
Of the 55 obstetric clinics in Kumamoto Prefecture, 18 (33%) responded to our request to cooperate with this questionnaire survey. These clinics included one university hospital, twelve public and private hospitals, and five private clinics. Hence, this was a mixture of different types of antenatal institutions in this area. We then solicited the participation of all pregnant women of at least 28 weeks’ gestation who visited one of these obstetrical clinics during the whole month of November 2011. A set of questionnaires was distributed to these women during late pregnancy and again at 5 days (while in the hospital) and 1 month (while attending the one-month health check-up) postnatally. Participating women were asked to complete the questionnaire at home and to return it to the researcher using a stamp-added envelope.
We were interested in the factor structure of the PBQ at 1 month rather than at day 5 after childbirth. This is because the mothers at day 5 are usually still in a secure hospital environment that we hypothesized would mask or eliminate any negative attitudes and emotions towards the baby. Of 1442 eligible women, 392 (27%) returned the questionnaire 1 month after childbirth. These women were the target population of the main analysis in this study. There were 173 first-time mothers, and 180 multipara. The remaining 39 women’s parity was unknown. The mean (SD) number of children they already had had was 0.7 (0.9). Their mean (SD) age was 30.3 (4.9) years. Of them, 98.7% were married. Their partner’s mean age (SD) was 32.2 (6.0) years. In summary, these are not different from mothers in general in Japan [
23]. Of these 392 women, 254 (65%) returned the questionnaire 5 days after childbirth, and these were used for test–retest reliability.
2.2. Measures
2.2.1. Bonding Disorder
The PBQ was translated into Japanese by Kaneko with the permission of the original author [
24]. We used this Japanese version of the PBQ. The PBQ is a self-report instrument that assesses parents’ attitudes and emotions towards their newborn infant. It consists of 25 items rated on a six-point scale (0 to 5). Eight items are positively worded, and these are reverse-scored. Higher scores indicate that the parent has negative affection towards the baby and feels a greater psychological burden with regard to parenting. In this study, the PBQ was distributed to participants at day 5 and at 1 month after childbirth.
2.2.2. Depression
As a measure of depression, we used the Edinburgh Postnatal Depression Scale (EPDS) [
25]. The EPDS is a ten item questionnaire rated on a four-point scale (0 to 3), used to assess postnatal depression. Psychometric properties of the original English version of the EPDS have proved to be excellent [
25]. Higher scores indicate greater severity of depressive symptoms. The Japanese version of the EPDS is available, and the reliability and validity have been verified [
26]. This version has been used in many previous studies by researchers, as well as by maternal health service providers and clinical professionals in community settings in Japan.
2.2.3. Neonatal Abuse
As a measure of neonatal abuse, we used the Conflict Tactics Scale (CTS) [
27]. The CTS is a self-report questionnaire that measures the frequency of various abusive parenting behaviors that have occurred since the most recent childbirth. The CTS Child Form R (Parent-child CTS: PCCTS) focuses specifically on parental psychological and physical aggression towards the child. It consists of 19 items rated on a seven-point scale (0: “never”, to 6: “more than 20 times”). The first three items (e.g., “discussed an issue calmly”) comprise the negotiation scale. The others include seven psychological abuse items and nine physical abuse items. In this study, the time frame of the PCCTS was changed from the original of “last year” to “the time period since childbirth”. The PCCTS was translated by one of us (T.K.) after obtaining permission from the original author.
2.3. Statistical Analyses
Out of 392 women, 364 (93%) had no missing data in the PBQ and were therefore selected for the present analyses. After randomly dividing these participants into two groups, we examined the means and SDs of all the PBQ items in the first group of mothers (
n = 172). Because all the PBQ item scores were positively skewed, we conducted a log transformation to achieve an approximate normality assumption. We then performed a series of EFAs on the PBQ items. Because all extracted factors were considered to be interdependent, the factor solution was sought after PROMAX rotation, which is a diagonal rotation method. The number of factors was determined by the Scree test as well as interpretability of the factor structure [
28,
29].
In order to confirm the stability of the factor structures obtained from the above EFAs, we performed a series of confirmatory factor analyses (CFAs) using another randomly generated subset of participants (n = 192). This allowed for cross-validation of the factor structure extracted in the EFA.
We then compared the goodness-of-fit of four models: (a) the model derived from our EFA using the first halved sample; (b) the four-factor model proposed by Brockington et al. [
17,
18]; (c) the single-factor model proposed by Kaneko and Honjo [
21]; and (d) the four-factor model proposed by Suetsugu et al. [
22]. We examined the fit of each model with the data in terms of chi-squared (CMIN), root mean square error of approximation (RMSEA), and comparative fit index (CFI). In accordance with conventional criteria, a good fit would be indicated by CMIN/
df < 2, CFI > 0.97, and RMSEA < 0.05, and an acceptable fit by CMIN/
df < 3, CFI > 0.95, and RMSEA < 0.08 [
30]. We used the Akaike Information Criterion (AIC) [
31] as a means of comparing models in terms of goodness of fit. A model is considered superior if its AIC score is lower than that of another model [
32].
Internal consistency of the subscales of the PBQ based on the final factor structure was calculated by Cronbach’s alphas.
Construct validity was examined by correlating the scores of the PBQ subscales with the scores of the EPDS and the CTS, measured at 1 month postnatally. This is because we posited that mothers high in bonding difficulty would be more likely to develop depression and more prone to commit neonatal abuse.
Test–retest reliability of the instrument was examined by intraclass correlation coefficient (ICC) between the PBQ subscale scores at 1 month and 5 days after childbirth.
All statistical analyses were conducted using SPSS version 20.0 (IBM Japan, Tokyo, Japan) and Amos 20.0 (IBM Japan).
2.4. Ethical Considerations
The study was approved by the Ethical Committee of Kumamoto University Graduate School of Life Sciences, as well as the Institutional Review Board of each institution participating in the study.
4. Discussion
The present study demonstrated that there were three domains of PBQ items, each corresponding to different aspects of mothers’ attitudes and emotions towards their babies: Anger and Restrictedness, Lack of Affection, and Rejection and Fear. When compared with the four models presented previously, we believe that our model showed reasonable fit with the data.
There may be several possible reasons why the factor structure we identified differed from those in prior studies. First, Brockington et al. used a patient population, while our study used a community population [
15]. This may be associated with the fact that the PBQ items were positively skewed in our study. Second, as noted earlier, Brockington et al. used a PCA as a means to categorize the PBQ items. This is a technique that defines the first component in such a way that it has the largest possible variance. Thus, the first component captures much of the information originally contained in the items of the instrument [
33]. A PCA is more likely to produce a general rather than specific factor. Thus, four PBQ items that loaded most highly on the first factor (“impaired bonding”) in Brockington et al.’s report were scattered into three different factors in our study. Thus the item “I feel happy when my baby smiles or laughs” loaded on the second factor, the items “the baby does not seem to be mine” and “I wish my baby would somehow go away” loaded on the third factor, and the item “my baby winds me up” loaded on the first factor.
Kaneko and Honjo [
21], Suetsugu et al. [
22], and our group all used a non-clinical Japanese mother population. Kaneko and Honjo suggested a single-factor structure because the first factor showed a markedly high eigenvalue. They then produced a shorter version of the PBQ by selecting only items with high factor loadings on the first factor in their study. The strength of their study is the large sample size (
n = 1786). In contrast, the study of Suetsugu et al., based on a relatively small sample (
n = 244), suggested a four-factor structure. This was derived through deletion of some items that were considered to be non-adaptable because of low factor loading values (≤0.35). Although deletion of items may, by definition, result in better internal consistency, such procedures are not necessarily free from flaws. Psychological phenomena such as bonding disorder are likely to have different facets that may yield different factors obtained through EFAs. Reducing the number of items in an instrument may sacrifice such diversity. Hence, the full version of the PBQ should be retained in order to identify multiple aspects of bonding disorder.
Another important research issue is cross-validation of the factor structure. The factor structure derived from an EFA should be validated by means of a CFA of a sample different from the one that yielded the EFA. Neither Kaneko and Honjo [
21] nor Suetsugu et al. [
22] performed CFAs. A strength of our study is a CFA suggesting a reasonably robust factor structure of the PBQ. Because our Three-factor model contained the full 25 items and utilized factor analysis rather than a PCA together with cross-validation via a CFA, it may be superior to the original model of Brockington et al. [
15,
18] in our population of Japanese mothers of neonates.
While test–retest reliability of the three PBQ subscales was substantial, the scores of Anger and Restrictedness increased from day 5 to 1 month. In Japan, most mothers are still in a hospital at 5 days after childbirth. Hence, they receive ample support and do not face the burdens of household chores. This is in contrast to the situation 1 month after childbirth, when restricted and angry feelings towards the baby are more likely to surface. On the other hand, Lack of Affection and Rejection and Fear may reflect a more psychological refusal unrelated to the living environment. Hence clinicians should pay more attention to Lack of Affection and Rejection and Fear as a means of identifying mothers who need psychological support in the early stage after delivery.
As expected from previous investigations [
34,
35,
36], all three PBQ subscale scores were associated with EPDS and CTS psychological abuse scores. Among the PBQ subscales, Anger and Restrictedness was markedly related to the EPDS total scores and the psychological abuse scores of the CTS. This result supports a previous report that EPDS was strongly associated with anger and rejection in the Mother-to-Infant Bonding Scale (MIBS) at 1 month after childbirth [
35]. Lack of association between the physical abuse subscale of the CTS and the PBQ may be due to a narrower variation (SD = 0.05) of the physical abuse scores in our sample. We think that our three-factor PBQ model has good construct validity.
The three subscales of the PBQ were clinically interpretable. Anger and Restrictedness may be linked to maternal burdens from and restriction in child rearing, resulting in various stress-related psychopathologies. For example, maternal parenting stress may be related to trait anger and to maternal emotional stress and negative affection towards the baby. The second subscale, Lack of Affection, consists of reverse items referring to affection for the baby.
Rejection and Fear of mothers may reflect the “ghost in the nursery” [
37]. This term refers to the clinical observation that the presence of a baby may sometimes arouse unconscious negative feelings in the mother, such as fear, agitation, and anxiety. This stems from their past painful experience and repressed affections that are easily projected to their infants [
37,
38,
39].
Despite the difference of the three subscales, these were moderately correlated. This suggests a generic concept of bonding failure itself.
A drawback of our study is that a CFA showed that our model fit only moderately well with the current data, which limits generalization. This may be due to our relatively small sample size and the use of a community population. Further, our study had a low response rate, which might have impeded our identification of the target population of women with bonding disorder. Our data may therefore be subjected to selection bias. Brockington [
2] claimed that a population of more than 1000 would be needed for this type of study, due to the infrequency of bonding disorders. Because of our use of a nonclinical population, the data were very positively skewed. The log transformation of the data improved normality, but positive skew remained for some items. Mothers whose child is under temporary custody at Child Protection Centre may provide more normal distribution of the PBQ scores. Such a population is to be studied in future studies.