Next Article in Journal
A Controlled Experiment to Test the Efficacy of Ground-Penetrating Radar in the Search for Clandestine Burials in Poland
Next Article in Special Issue
Challenges and (Un)Certainties for DNAm Age Estimation in Future
Previous Article in Journal
Dental Age Estimation Standards for Hispanic Children and Adolescents in California
Previous Article in Special Issue
Tooth Cementum Annulation: A Literature Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Evaluating Morphological Methods for Sex Estimation on Isolated Human Skeletal Materials: Comparisons of Accuracies between German and South African Skeletal Collections

1
Department of Historical Anthropology and Human Ecology, University of Göttingen, 37073 Göttingen, Germany
2
Faculty of Health Sciences, School of Anatomical Sciences, University of the Witwatersrand, Parktown, Johannesburg 2193, South Africa
*
Author to whom correspondence should be addressed.
Forensic Sci. 2022, 2(3), 574-584; https://doi.org/10.3390/forensicsci2030042
Submission received: 29 July 2022 / Revised: 19 August 2022 / Accepted: 24 August 2022 / Published: 30 August 2022
(This article belongs to the Special Issue Feature Papers in Forensic Sciences in 2022)

Abstract

:
Objectives: The focus of this research is to evaluate the sex estimation methods on isolated human materials by applying morphological methods published in various forensic and anthropological literature on different skeletal series. Materials and Methods: 165 individuals from the 19th to 20th century Inden skeletal series, 252 individuals from the 13th to 14th century Lübeck skeletal series of German ancestry housed at the Department of Historical Anthropology and Human Ecology, the University of Göttingen, Germany, and 161 individuals from the 19th and 20th century of South African African ancestry housed within the Raymond A. Dart collection of modern human skeletons at the University of Witwatersrand, South Africa, with crania, mandibles, and pelves, were assessed. The evaluation criteria are burial information on the Inden series, genetic sex on both the Inden and the Lübeck series, and previous demography on cadavers from the South African African series. Results and Discussion: The sex estimation with cranial traits perform better in Inden and South Africa samples and worse in Lübeck sample. The mandible accuracies for pooled sexes are not exemplary, but the individual traits perform better for males in the Inden, Lübeck, and South Africa samples, except for gonion and angle, which performs better in females. The pelvic traits perform better in the Inden and South Africa samples compared to the Lübeck sample. The statistical tests show that there is a huge difference in the accuracy rates and the performance between both population groups from Germany itself, considering that Inden and Lübeck samples share the same ancestry. The accuracy rates improve with the exclusion of ambiguous individuals.

1. Introduction

Biological profile measures such as sex, age, stature, and ancestry are important for forensic identification. Sex estimation plays a significant role in relation to the other measures, i.e., age, stature, ancestry, and pathology. By estimating the sex of the individual, the number of possible matches is halved [1]. In principle, for sex estimation three approaches are available—morphological (qualitative and quantitative), metrical (quantitative), and molecular [2]. Heavy reliance of the scientific community on the molecular approach is viewed to be the key component for reducing errors and producing a conclusive result [3]. Even if the skeleton is well preserved, the material might have too highly-degraded DNA [4] for it to be applied. Moreover, DNA analysis is an expensive resource for some countries, which might not have the infrastructure or skills. An Interpol review shows that DNA is used in police investigations in only 84 countries [5]. In such cases, they have to utilize the classical methods, which are morphological and metrical. However, the morphological methods or sexually dimorphic nonmetric traits appear to be more accurate in one population group and not in the others. Although these nonmetric traits are accurate, with 70% to 100% accuracy rates, based on literature published over more than a century [6,7,8], it could be argued that some methods achieve a high level (100%) of accuracy due to the fact that the populations being tested are similar to those collections used to create the methods or are even the same anatomical collections [9], such as the Robert J. Terry or Raymond A. Dart collections, which are often studied and cited in the literature.
On one hand, the community of forensic anthropology is trying to devise novel metrical methods that are population specific and derived from newer collections [10,11,12,13,14,15], or from cadaveric collections that are constantly being updated, such as the Dart collection. On the other hand, for morphological methods the sex estimation standards are heavily based on either the 20th century Terry, Herman–Todd, and William Bass collections or war-dead samples such as those from the Korean War and Balkan conflict, with a bias towards one sex [9]. Finding collections with documented accounts of biological sex is difficult, and to evaluate the traits to see if they can be transferred onto other known-sex skeletal series is even more complicated. Sexual dimorphism and the methods of morphological assessment on the skull and pelvis are often based on the differences caused by shape and size; they can either be differences between biological sexes based on the fact that females have different pelvic morphology to accommodate parturition, or other differences of robustness that are prominent in males, for example, the skull [16]. The fact that population variation plays an important role in skeletal sexual dimorphism [17], the groups vary in terms of their activity patterns [2], and there is an implication of secular trends on sex estimation [18], this demonstrates that the use of sex estimation accuracies are affected by the superimposition of the use of one sample onto another sample derived from different collections representing two separate population groups or from the same collection with specimens from a different temporal period.
The idea of the use of population-specific formulae concerning robusticity, degree of sexual dimorphism, and body size [19] leaves plenty of room to check the reliability of universal sets of traits by assessing the populations that were not considered in their study. It may act as a strong argument in the debate on whether it is necessary to develop population-specific data or not. Because the validation studies on sex estimation are based on well-documented and sometimes demographically biased collections [8], it is not clear how reliable the morphological traits are if they are applied to medieval samples. Ancient DNA (aDNA) testing is a widely known tool for profiling archaeological material and estimating sex [20]. Short Tandem Repeats (STR) typing of samples includes PCR approaches that target amelogenin genes with X–Y homology [21]. The success of STR typing depends on the preservation of aDNA. It is important to point out that the accuracy of aDNA methods is high, but there are limitations. These include Y-allele deletion [22,23], leading to false negative results for males [24]; contamination in sample preparation and extraction is difficult to control [25], and most importantly it is an expensive method for a routine analysis by archaeologists or forensic anthropologists. The heavy reliance on DNA and the idea that it will eventually resolve the case is the reason unrecognizable remains are often not fully assessed with traditional anthropological methods, whether metrical or morphological [3]. These limitations justify the need to evaluate the empirical accuracy rates produced by morphological estimation techniques as published in the forensic anthropological literature.
The aim of this article is to evaluate morphological traits for their accuracy and reliability when applied on samples from European and African population groups of the modern and medieval periods. To achieve this, morphological traits will be used to estimate sex for Inden (modern), Lübeck (medieval), and South African (modern) population groups. Altogether, this large-scale analysis with three different population groups will test if standard morphological methods produce a reliable accuracy on modern and medieval population groups from Germany and only modern in South Africa. By evaluating the current morphological approach, this study can highlight which traits should be given more weight and which should be excluded, while establishing universal traits that can be applied irrespective of the population specificity.

2. Materials: From Inden, Lübeck, South Africa

2.1. Inden Collection

Between 1999 and 2004, excavations carried out under the supervision of Dr. Birgit Grosskopf [26] at a cemetery of an old church, St. Clemens, in an Alt-Inden village in Düren district, recovered 236 individuals. The skeletons of the Inden collection come from the period 1877 to 1924 and is housed at the Department of Historical Anthropology and Human Ecology, the University of Göttingen, Germany. The sample for this research includes 165 individuals with compromised preservation of some bones due to taphonomic processes. Out of 109 previously known-sex individuals, 32 individuals had sex determination carried out by genetic analysis.

2.2. Lübeck Collection

The mass graves excavated between 1989 and 1992 in the northern German city of Lübeck include the data of 1255 individuals. Out of 1044 complete and partially complete individuals housed at the Department of Historical Anthropology and Human Ecology, only 252 individuals have been considered. The collection belongs to the Black Death period of 1260 to 1390 AD [27]. The burial information register, or church book, reveals the sex of individuals up to a certain limit. STR typing has been performed for a selection of samples from the Inden and Lübeck series. Together, the church book and genetic analysis provide sex for only 109 individuals for evaluation and accuracy calculation in the Inden series. The Lübeck collection is not demographically known, and genetic estimation performed at the University of Göttingen for 76 individuals are used for accuracy calculation.

2.3. South African Collection

Lastly, the third sample involves individuals from the Raymond A. Dart collection, with four major self-identified groups according to the census data, which include South African (SA) African, SA White, SA Colored, and SA Indian/Asian. SA African (SAA) or Black South African (as per South Africa’s official classification for population groups) has multiple tribal affinities, namely nine [28]. The collection housed at the School of Anatomical Sciences, the University of the Witwatersrand, South Africa, is demographically known and belongs to the period of birth years 1827–1980. The total sample size from the Dart collection for this research consists of 161 cadaver-origin skeletons of SA African ancestry with mixed tribal affinities [29]. Sex estimation is carried out based on the evaluation of individual morphological traits, as well as a combination of traits. Because the accuracy of aDNA is much higher than for morphological traits, this baseline will be used to compare results and calculate the accuracy rates, at least in the Inden and Lübeck series, because the demography of the South African series is already known.

2.4. Criteria for Selection of Bones

The focus of this study was on only the following bones: crania, mandible, and pelvis. The pelvis is generally considered to be the most sexually dimorphic due to parturition [19,30], which makes it an integral skeletal region to be considered in any morphological sex estimation study. When the pelvis is fragmented or absent, the skull follows the standard framework of analysis among other bones [30]. Long bones offer a better prediction based on dimensions [31], which can be obtained using metrical methods, hence they are excluded from this study.

3. Methods

3.1. Morphological Sex Estimation Methodology

This was a blind study, which means that the sex estimation of the individuals was first completely performed in a morphological series, without any prior knowledge regarding the sex of the individuals. It requires a significant amount of experience to correctly evaluate skeletal material with morphological traits. Thus, the experiment was conducted under the supervision of a more experienced forensic anthropologist. The four chosen traits of the innominate, as shown in Table 1, reflect the morphology of pubic, ischial, and iliac bones. The nine cranial and five mandibular traits mentioned in Table 2 and Table 3 have been observed, evaluated, and estimated for a particular sex, as proposed by the authors [32,33,34,35,36,37,38]. For crania, mandible, and pelvis, the traits were observed independent from each other, either as a single trait or as a group of two traits together.

3.2. Morphological Series

The comparison of the shapes of traits forms the basis of a morphological series, where pronounced or robust male and female characteristics form the reference. This reference was used to observe and evaluate the remaining individuals. Morphological series is advantageous for individuals with underdeveloped or less-pronounced traits. They can be placed in a range, easily comparing the features with other individuals. The five-point scale from the Workshop of European Anthropologists [34] includes scoring of −2 (hyperfeminine) to 0 (neutral) to +2 (hypermasculine). This scoring was modified for this research. The observation was based on a particular range, which is Male, Tendency Male, Indifferent, Tendency Female, Female. When there was a strong fragmentation in an individual for a particular trait, it was recorded as indeterminate. Indeterminate is not a part of the five-point range. Classifying an individual as indeterminate would exclude the individual, but the same individual was included in the study for other observable traits or intact bony landmarks. When features extended towards both male and female, the sex was estimated as indifferent.

3.3. Known Sex and Morphological Sex Estimation Comparison: Calculation of Accuracy Rate

Accuracy rate is the percentage of correctly classified individuals in the corresponding groups of male and female. To calculate this, the sex estimated by the morphological assessment is compared with the documented biological sex. In order to perform the comparison, the morphological estimate Tendency Male was categorized under Male and Tendency Female under Female. For Inden, Lübeck, and the South African collections, the following information was available on their demography—Church book records and molecular sex estimation for Inden; cadaver-origin skeletons from SA had a database maintained locally where most of the individuals had correct sex defined from the death certificate. In some instances, the sex was incorrect, which could be a limitation to the study. The incorrectly-sexed individuals are few. The ancestry estimation for those individuals might be difficult, but sex is more easily identified from a cadaver that is from an unclaimed provenance, which is why those individuals have still been included in the study based on their identified sex. The Lübeck series had data from the molecular sex estimation performed at the department of Historical Anthropology, the University of Göttingen. To find the accuracy rate, the number of correctly classified individuals is divided by the total number of individuals, which were morphologically assessed.
Rate   of   accuracy   ( % ) = T h e   n u m b e r   o f   c o r r e c t l y   c l a s s i f i e d   i n d i v i d u a l s T o t a l   n u m b e r   o f   i n d i v i d u a l s   ×   100
For the calculation of accuracy rate, ‘indeterminate’ (broken or unobservable or not available) individuals have not been included in the study but ‘indifferent’ (or ambiguous) ones are already included. Inskip et al. (2019) and McFadden and Oxenham (2016) [9,40] calculated accuracy rates by including ambiguous sex estimates. This raises an interesting argument that indifferent individuals should be left out of accuracy calculations because ambiguity cannot be taken as incorrect estimation; the counter argument justifies the inclusion of indifferent individuals because they help quantify the information on the high number of indifferent sex estimates. This sheds light on the trait sexual dimorphism power [9]. In the results section, two accuracy rates are mentioned, one termed as raw accuracy with ‘indifferent’ individuals and the second without them. This aids in refraining from reporting the overestimated accuracy, contrary to cases where a researcher might include only correctly-sexed individuals, showing that a particular method for sex estimation has a very high accuracy rate.
The significance of the results was assessed by two tests, namely, the McNemar test and Cohen’s Kappa test. The McNemar test assesses the systematic differences between the conclusion made by previously known sex and the morphological sex estimation, whereas Cohen’s Kappa shows the agreement between the known sex and morphological estimation. The level of disagreement would be higher if the McNemar values would be lower than 1. The level of agreement for Cohen’s Kappa is assessed, as shown in the following range, taken from Watson and Petrie (2010)—“Poor if k < 0.00, slight if 0.00 ≤ k ≤ 0.20, fair if 0.21 ≤ k ≤ 0.40, moderate if 0.41 ≤ k ≤ 0.60, substantial if 0.61 ≤ k ≤ 0.80, and almost perfect if k > 0.80” [41].

4. Results

The number of individuals in all three skeleton collections identified by morphological methods and previously known sex in the form of genetic analyses, burial information, and cadaver database, is outlined in Table 4. The numbers in the table do not exclude the skeletons with unobservable features for individual traits. This gives an overview of the sample size for three population groups.
The accuracy rates (with and without ambiguous individuals) and Cohen’s Kappa results for individual traits for the Inden series is shown in Table 5, for the Lübeck series in Table 6, and for the South African series in Table 7. For crania in the Inden and SA groups, glabella-supra orbital ridges present the highest accuracy. Although in the Lübeck series the combination of traits as overall assessment performs better than other traits, the small sample size impacts greatly in terms of poor accuracy rates. Additionally, Cohen’s Kappa tests demonstrate a similar pattern that there is a higher level of agreement for traits in Inden and SA but not for the Lübeck series. None of the series have any traits that show a perfect agreement (k = 1.000). Glabella-supra orbital ridges and cranium overall show a substantial agreement in Inden and South Africa with previously known sex, whereas the Lübeck series has the poorest agreement between morphological sex estimates and previously known sex estimates for all the cranial traits. McNemar tests identify the systematic difference between morphological and previously known sex estimates, which shows a trend in the Lübeck cranial traits.
In the Inden and South African groups, mandible performs the best overall, considering they have different sample sizes. For pelvic traits, sub-pubic angle is the best indicator of sex in Inden and SA, and is comparably better in Lübeck. The best traits for crania have lower accuracy than the accuracy achieved by the best pelvic trait in all three series. Highlighting the indifferent or ambiguous individuals, the highest variation could be seen in cranial traits by looking at the difference between the raw and the general accuracy rates. The McNemar test shows that the most systematic difference between all traits in Inden is gonion and angle (McN = 0.000); this highlights the difference between correct and incorrect classifications. This systematic difference is quite high for the cranial traits (0.000 to 0.002) in Lübeck. Cohen’s Kappa results suggest that the pelvic trait sub-pubic angle has an almost perfect agreement between morphological and previously known-sex estimates in Inden (k = 0.797) and SA (k = 0.879).
Table 8 highlights the distribution of accuracy rates amongst males and females. It shows a comparison of trait classification in all the population groups. For each trait in every population group, there are values in Male that match and in Female that match. The value inside the brackets is the total number of individuals for a particular sex, and the value outside the brackets shows the correctly identified individuals. The pubic traits were the best indicators for male and female identification in all three series. The probability of males being scored or categorized as male, and females being scored or categorized as female is high for subpubic angle. For South African females, it even reaches 95%, which is far above the pooled accuracy rates in every series. For skull traits in the Inden series, the males were identified poorly with the mastoid process, with the lowest accuracy rate, and in the Inden females the zygomaticum or zygomatic arch was the worst sex indicator. In the Lübeck series, although the sample size is low, a very stark situation can be observed for glabella-supra orbital ridges in females, securing only 28.6% as opposed to the male–female pooled accuracy rate in the Inden series (84.3%), and in the Inden female it can be seen as the best performing sex indicator with 85.2% accuracy.

5. Discussion

The aim of this study was to evaluate the morphological traits for their accuracy and reliability on modern and medieval samples from European ancestry and on modern sample from African ancestry. The sex estimation for an individual can be performed either based on individual trait assessment or on multiple trait assessment, methods, and skeletal regions combined. Looking at the accuracy rates of the assessed traits in Table 8 on all three population groups, it can be seen that the sex can be accurately estimated if all the pelvic traits are observable and can perform even better if correctly sexed in conjunction with skull traits. This finding is congruent with the study performed by Inskip et al. (2019) [9]; the accuracy rates of the morphological traits in an English skeletal collection are higher for pooled sexes (95.6%) if skull and os coxae are assessed together (with 95.6% raw accuracy) and it outperforms the assessment of os coxae alone, which achieved only 91.8% accuracy. When a forensic anthropologist performs analysis on skeletal elements, “ideally the pelvis should be assessed first, because its evolutionary predisposition to parturition and absence of ancestral traits lead to greater reliability than is possible from assessing the skull, which should be assessed second” [42,43]. This statement appears to be more valid for subpubic angle than the other pelvic traits, as seen for the Inden and South Africa series. The difference between males and females’ accuracies is quite moderate for arc compose, greater sciatic notch, and iliac crest in all three groups. The higher accuracy rates in Inden and South African males is comparable to the study carried out by Đurić et al. [6]. They found over 90% accuracy for arc compose when they assessed only males of the Balkan group.
If pelvic bones are fragmented or broken, the misclassifications based only on the skull would be even greater. As shown in Table 5, there is a slight difference in male and female accuracies for sub pubic angle in the Inden series (male = 93.9%, female = 85%), but if skull traits are assessed independent of pelvic traits, the number of misclassified individuals appears to be high among Lübeck and South African females for most of the skull traits, which can be clearly seen in Table 8. In the pelvis, the variation between males and females is related to reproduction, and in crania the sexual dimorphism is more related to body size and shape, which highlights musculature and robusticity [44]. This means it has more potential to vary among different population groups [9]. Inden and Lübeck are both German skeletal collections, but from different periods of modern and medieval. The stark difference between the accuracy rates for cranial traits in both series, as seen in Table 5 and Table 7, is supported by Inskip et al. (2019) [9]. The authors sourced traits from papers that used modern collections, to which they compared the results obtained from medieval English samples. It showed a higher degree of differences for sexual dimorphism. This research shows that independent traits might not give an appropriate sex estimation when the standards from modern collections are applied on archaeological samples.
The skull sex estimates, and previously determined sex, have a substantial agreement and an improved accuracy rate when ambiguous individuals are excluded. This stark change can be observed in the Inden series, particularly for cranium overall, glabella-supraorbital ridges, and margo orbitalis. It was noted that the skull is highly variable from the data, but it also scored high accuracies for overall traits than for individual traits. The difference between raw and correctly-sexed accuracies highlights that there are more ambiguous individuals for skull traits than for pelvic traits and more overlapping in Inden than in Lübeck and South Africa. This is due to the fact that both males and females of the Inden skeletal collection present with more robust and prominent features in general, and thus there is more overlap. An additional reason could be that development of features in males and females take place at different ages. The masculine features develop at an extended period of growth; male individuals who die at a young age might appear feminine. Females might show more robust features as they age [33]. This could become a factor for ambiguity and misclassifications in Inden, Lübeck, and SA females. The females might be of older age groups and hence are often classified as males. In light of this, it is important to include a larger sample size with more females of different age groups.

6. Conclusions

The modern collections of European and African ancestries score better accuracies when using morphological methods based on modern collections when compared to archaeological collections. The cranial traits, combined and assessed overall, perform as one of the best in all three series, namely, Inden, Lübeck, and SA groups, despite being of different ancestries and temporal periods. The small sample size in every series has been a big limitation of this study, and this may create a bias. For every trait tested, the sex difference on accuracy rate could stem from the unequal numbers of male and females in each skeletal series. This greatly impacts the accuracy rates of all the traits in the Lübeck series, which do not improve much, even after taking the ambiguous individuals out. If researchers are excluding ambiguous or indifferent features to ascertain higher accuracies in sex estimation, this provides a significant research theme for the future. The combination of traits for mandible region follows the same pattern, achieving the highest accuracy in all three series. The strongest individual trait in the pelvis turns out to be the subpubic angle, which scores the highest accuracy in males and females of modern series Inden and SA, and also in the males of archaeological samples of Lübeck, and this trait could be considered as a universal trait that can be applied, regardless of the population specificity. Interestingly enough, the females in Lübeck perform better with arc compose. The performances of different individual traits proves that a partially complete or fragmented pelvis would provide a clearer picture on the biological sex of an individual than the fragmented skull, which would be more useful if all the traits are assessed together. The Lübeck series does not deliver the same accuracy rate as the other two modern samples from the Inden and SA series when using the cranial and mandibular features, which may be related to the skull variation in different population groups. Krüger et al. (2015) address the accuracy of morphological traits on Black and White South African crania. According to them, the expression of morphological cranial traits vary across populations where the classification rate or accuracy is highly dependent on the reference collection that is used to compare unidentified skulls [45]. The inability to easily identify the morphology on Lübeck archaeological sample emphasizes that modern samples have better preserved bones and associated morphological features.

Author Contributions

Conceptualization, methodology, original draft preparation, A.G., B.G., S.H. and B.K.B.; formal Analysis, investigation, resources, data curation, project administration, A.G.; review and editing, B.G., S.H. and B.K.B.; visualisation and supervision, B.G. and S.H.; funding acquisition, A.G., B.G. and S.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the German Academic Exchange Service (Deutscher Akademischer Austauschdienst) research scholarship for doctoral students with funding program number 57299294.

Institutional Review Board Statement

Ethical review and approval were waived for this study as it includes human skeletal material that have been retrieved from burials and mass graves in Germany. The request of the ethics waiver for accessing the Raymond Dart cadaver collection has been approved by the School of Anatomical Sciences, University of Witwatersrand.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data supporting the study’s conclusions are available on reasonable request from the corresponding author.

Acknowledgments

The authors appreciate the reviewers’ insightful comments, which significantly enhanced the paper. Special thanks to the Department of Historical Anthropology in Germany and the Raymond Dart collection in South Africa for providing the skeletal material for this research.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Mokoena, P.; Billings, B.K.; Gibbon, V.; Bidmos, M.A.; Mazengenya, P. Development of Discriminant Functions to Estimate Sex in Upper Limb Bones for Mixed Ancestry South Africans. Sci. Justice 2019, 59, 660–666. [Google Scholar] [CrossRef] [PubMed]
  2. Krishan, K.; Kanchan, T.; Kharoshah, M.A. “Advances in Forensic Anthropology”–Creation of Skeletal Databases for Forensic Anthropology Research and Casework. Egypt. J. Forensic Sci. 2016, 6, 29–30. [Google Scholar] [CrossRef]
  3. de Boer, H.H.; Obertová, Z.; Cunha, E.; Adalian, P.; Baccino, E.; Fracasso, T.; Kranioti, E.; Lefévre, P.; Lynnerup, N.; Petaros, A. Strengthening the Role of Forensic Anthropology in Personal Identification: Position Statement by the Board of the Forensic Anthropology Society of Europe (FASE). Forensic Sci. Int. 2020, 315, 110456. [Google Scholar] [CrossRef] [PubMed]
  4. Hummel, S. Ancient DNA Typing; Springer: Berlin, Germany, 2003. [Google Scholar]
  5. Butler, J.M.; Willis, S. Interpol Review of Forensic Biology and Forensic DNA Typing 2016-2019. Forensic Sci. Int. Synerg. 2020, 2, 352–367. [Google Scholar] [CrossRef]
  6. Đurić, M.; Rakočević, Z.; Đonić, D. The Reliability of Sex Determination of Skeletons from Forensic Context in the Balkans. Forensic Sci. Int. 2005, 147, 159–164. [Google Scholar] [CrossRef]
  7. Lewis, C.J.; Garvin, H.M. Reliability of the Walker Cranial Nonmetric Method and Implications for Sex Estimation. J. Forensic Sci. 2016, 61, 743–751. [Google Scholar] [CrossRef]
  8. Thomas, R.M.; Parks, C.L.; Richard, A.H. Accuracy Rates of Sex Estimation by Forensic Anthropologists through Comparison with DNA Typing Results in Forensic Casework. J. Forensic Sci. 2016, 61, 1307–1310. [Google Scholar] [CrossRef]
  9. Inskip, S.; Scheib, C.L.; Wohns, A.W.; Ge, X.; Kivisild, T.; Robb, J. Evaluating Macroscopic Sex Estimation Methods Using Genetically Sexed Archaeological Material: The Medieval Skeletal Collection from St John’s Divinity School, Cambridge. Am. J. Phys. Anthropol. 2019, 168, 340–351. [Google Scholar] [CrossRef]
  10. Gualdi-Russo, E. Sex Determination from the Talus and Calcaneus Measurements. Forensic Sci. Int. 2007, 171, 151–156. [Google Scholar] [CrossRef]
  11. Guyomarc’h, P.; Bruzek, J. Accuracy and Reliability in Sex Determination from Skulls: A Comparison of Fordisc® 3.0 and the Discriminant Function Analysis. Forensic Sci. Int. 2011, 208, 180.e1–180.e6. [Google Scholar] [CrossRef]
  12. Trautmann, M.; Trautmann, I.; Hotz, G. A Simple Metric Sexing Method for Unknown Skeletal Remains: The Sacro-Clavicular Index (SCI). Anthropol. Anz. 2014, 71, 57–64. [Google Scholar] [CrossRef]
  13. Selliah, P.; Martino, F.; Cummaudo, M.; Indra, L.; Biehler-Gomez, L.; Campobasso, C.P.; Cattaneo, C. Sex Estimation of Skeletons in Middle and Late Adulthood: Reliability of Pelvic Morphological Traits and Long Bone Metrics on an Italian Skeletal Collection. Int. J. Leg. Med. 2020, 134, 1683–1690. [Google Scholar] [CrossRef]
  14. Introna, F.; Di Vella, G.; Campobasso, C.P.; Dragone, M. Sex Determination by Discriminant Analysis of Calcanei Measurements. J. Forensic Sci. 1997, 42, 725–728. [Google Scholar]
  15. Introna, F.; Di Vella, G.; Campobasso, C.P. Sex Determination by Discriminant Analysis of Patella Measurements. Forensic Sci. Int. 1998, 95, 39–45. [Google Scholar] [CrossRef]
  16. Glucksmann, A. Sexual Dimorphism in Human and Mammalian Biology and Pathology; Academic Press: Cambridge, MA, USA, 1981. [Google Scholar]
  17. Ubelaker, D.H.; DeGaglia, C.M. Population Variation in Skeletal Sexual Dimorphism. Forensic Sci. Int. 2017, 278, 407.e1–407.e7. [Google Scholar]
  18. Ross, A.H.; Ubelaker, D.H.; Kimmerle, E.H. Implications of Dimorphism, Population Variation, and Secular Change in Estimating Population Affinity in the Iberian Peninsula. Forensic Sci. Int. 2011, 206, 214.e1–214.e5. [Google Scholar] [CrossRef]
  19. Steyn, M.; Patriquin, M.L. Osteometric Sex Determination from the Pelvis—Does Population Specificity Matter? Forensic Sci. Int. 2009, 191, 113.e1–113.e5. [Google Scholar] [CrossRef]
  20. Seidenberg, V.; Schilz, F.; Pfister, D.; Georges, L.; Fehren-Schmitz, L.; Hummel, S. A New MiniSTR Heptaplex System for Genetic Fingerprinting of Ancient DNA from Archaeological Human Bone. J. Archaeol. Sci. 2012, 39, 3224–3229. [Google Scholar]
  21. Haas-Rochholz, H.; Weiler, G. Additional Primer Sets for an Amelogenin Gene PCR-Based DNA-Sex Test. Int. J. Leg. Med. 1997, 110, 312–315. [Google Scholar]
  22. Schmidt, D.; Hummel, S.; Herrmann, B. Brief Communication: Multiplex X/Y-PCR Improves Sex Identification in ADNA Analysis. Am. J. Phys. Anthropol. 2003, 121, 337–341. [Google Scholar] [CrossRef]
  23. Tozzo, P.; Giuliodori, A.; Corato, S.; Ponzano, E.; Rodriguez, D.; Caenazzo, L. Deletion of Amelogenin Y-Locus in Forensics: Literature Revision and Description of a Novel Method for Sex Confirmation. J. Forensic Leg. Med. 2013, 20, 387–391. [Google Scholar] [PubMed]
  24. Kashyap, V.K.; Sahoo, S.; Sitalaximi, T.; Trivedi, R. Deletions in the Y-Derived Amelogenin Gene Fragment in the Indian Population. BMC Med. Genet. 2006, 7, 37. [Google Scholar]
  25. Schmidt, N.; Schücker, K.; Krause, I.; Dörk, T.; Klintschar, M.; Hummel, S. Genome-Wide SNP Typing of Ancient DNA: Determination of Hair and Eye Color of Bronze Age Humans from Their Skeletal Remains. Am. J. Phys. Anthropol. 2020, 172, 99–109. [Google Scholar] [CrossRef]
  26. Salega, S.; Grosskopf, B. Evaluation of Entheseal Changes in a Modern Identified Skeletal Collection from Inden (Germany). Int. J. Osteoarchaeol. 2022, 32, 86–99. [Google Scholar] [CrossRef]
  27. Feicke, M. Inventory and Morphological Sexing in a Medieval Skeletal Series from Lübeck, Department of Historical Anthropology and Human Ecology, Johann-Friedrich-Blumenbach Institute of Zoology and Anthropology; Georg-August-Universität Göttingen: Göttingen, Germany, 2013. [Google Scholar]
  28. Fasemore, M.D.; Bidmos, M.A.; Mokoena, P.; Imam, A.; Billings, B.K.; Mazengenya, P. Dimensions around the Nutrient Foramina of the Tibia and Fibula in the Estimation of Sex. Forensic Sci. Int. 2018, 287, 222.e1–222.e7. [Google Scholar] [CrossRef]
  29. Dayal, M.R.; Kegley, A.D.; Štrkalj, G.; Bidmos, M.A.; Kuykendall, K.L. The History and Composition of the Raymond, A. Dart Collection of Human Skeletons at the University of the Witwatersrand, Johannesburg, South Africa. Am. J. Phys. Anthropol. Off. Publ. Am. Assoc. Phys. Anthropol. 2009, 140, 324–335. [Google Scholar]
  30. Curate, F. The Estimation of Sex of Human Skeletal Remains in the Portuguese Identified Collections: History and Prospects. Forensic Sci. 2022, 2, 272–286. [Google Scholar] [CrossRef]
  31. Spradley, M.K.; Jantz, R.L. Sex Estimation in Forensic Anthropology: Skull Versus Postcranial Elements. J. Forensic Sci. 2011, 56, 289–296. [Google Scholar] [CrossRef]
  32. Bruzek, J. A Method for Visual Determination of Sex, Using the Human Hip Bone. Am. J. Phys. Anthropol. 2002, 117, 157–168. [Google Scholar] [CrossRef]
  33. Buikstra, J.E.; Ubelaker, D.H. Standards for Data Collection from Human Skeletal Remains. Ark. Archaeol. Surv. Res. Ser. 1994, 44, 10004710139. [Google Scholar]
  34. Ferembach, D. Recomendations for Age and Sex Diagnosis of Skeletons. J. Hum. Evol. 1980, 9, 517–549. [Google Scholar]
  35. Grupe, G.; Harbeck, M.; McGlynn, G.C. Prehistoric Anthropology, Original Text; Springer Spektrum: Berlin, Germany, 2015. [Google Scholar] [CrossRef]
  36. Herrmann, B.; Grupe, G.; Hummel, S.; Piepenbrink, H.; Schutkowski, H. Prehistoric Anthropology, Original Text: Guide to Field and Laboratory Methods; Springer: Berlin, Germany, 1990. [Google Scholar]
  37. Klepinger, L.L. Fundamentals of Forensic Anthropology; John Wiley & Sons: Hoboken, NJ, USA, 2006; Volume 1. [Google Scholar]
  38. Novotný, V. Pohlavnı Rozdıly a Identifikace Pohlavá Pınevnı Kosti [Sex Differences and Identification of Sex in Pelvic Bone]. Ph.D. Thesis, Purkyne University, Ústí nad Labem, Czech Republic, 1981. [Google Scholar]
  39. Loth, S.R.; Henneberg, M. Mandibular Ramus Flexure: A New Morphologic Indicator of Sexual Dimorphism in the Human Skeleton. Am. J. Phys. Anthropol. Off. Publ. Am. Assoc. Phys. Anthropol. 1996, 99, 473–485. [Google Scholar] [CrossRef]
  40. McFadden, C.; Oxenham, M.F. Revisiting the P Henice Technique Sex Classification Results Reported by M Ac L Aughlin and B Ruce (1990). Am. J. Phys. Anthropol. 2016, 159, 182–183. [Google Scholar] [CrossRef]
  41. Watson, P.F.; Petrie, A. Method Agreement Analysis: A Review of Correct Methodology. Theriogenology 2010, 73, 1167–1179. [Google Scholar] [CrossRef]
  42. Handbook of Forensic Anthropology and Archaeology, 2nd ed.; Blau, S.; Ubelaker, D.H. (Eds.) Routledge: New York, NY, USA, 2016. [Google Scholar] [CrossRef]
  43. Rowbotham, S.K. Anthropological Estimation of Sex. In Handbook of Forensic Anthropology and Archaeology; Routledge: New York, NY, USA, 2016; pp. 303–314. [Google Scholar]
  44. Dunsworth, H.M. Expanding the Evolutionary Explanations for Sex Differences in the Human Skeleton. Evol. Anthropol. Issues News Rev. 2020, 29, 108–116. [Google Scholar] [CrossRef]
  45. Krüger, G.C.; L’Abbé, E.N.; Stull, K.E.; Kenyhercz, M.W. Sexual Dimorphism in Cranial Morphology among Modern South Africans. Int. J. Leg. Med. 2015, 129, 869–875. [Google Scholar] [CrossRef] [Green Version]
Table 1. Morphological pelvic traits used for sex estimation.
Table 1. Morphological pelvic traits used for sex estimation.
TraitsCommon Sources
Greater sciatic notch
Arc composé (Composite arch)
Sub-pubic angle
Iliac crest
(Bruzek, 2002; Buikstra and Ubelaker, 1994;
Ferembach, 1980; Grupe et al., 2015;
Klepinger, 2006; Novotný, 1981;
Hermann et al., 1990) [32,33,34,35,36,37,38]
Table 2. Morphological mandibular traits used for sex estimation.
Table 2. Morphological mandibular traits used for sex estimation.
TraitsCommon Sources
Mandible overall
Condylar process
Mentum (mental eminence)
Gonion and angle
Corpus height
(Buikstra and Ubelaker, 1994;
Ferembach, 1980; Grupe et al., 2015;
Herrmann et al., 1990; Klepinger, 2006;
Loth & Henneberg, 1996) [33,34,35,36,37,39]
Table 3. Morphological cranial traits used for sex estimation.
Table 3. Morphological cranial traits used for sex estimation.
TraitsCommon Sources
Cranium overall
Frontal tuberosity (eminence) and forehead steepness
Glabella and supraorbital ridges (superciliary arch)
Eye orbitals
Zygomaticum (zygomatic arch)
Margo orbitalis (supraorbital margin)
Mastoid process
(Buikstra and Ubelaker, 1994;
Ferembach, 1980; Grupe et al.,
2015; Herrmann et al., 1990;
Klepinger, 2006) [33,34,35,36,37]
Table 4. Number of individuals by morphologically estimated and previously known sex.
Table 4. Number of individuals by morphologically estimated and previously known sex.
Population Groups
Source of Sex EstimateIndenLübeckSouth Africa
Morphological164236161
Previously known10976161
Note. The previously known data include the sources aDNA analysis, church book, and the cadaver database.
Table 5. Total number of individuals for skull and os coxae with traits observed: the correctly classified, indifferent (ambiguous), and incorrect results and the corresponding accuracy rates in the Inden series.
Table 5. Total number of individuals for skull and os coxae with traits observed: the correctly classified, indifferent (ambiguous), and incorrect results and the corresponding accuracy rates in the Inden series.
TraitIndividuals Considered with Known SexMatchNo MatchRaw Accuracy (%)IndifferentWithout IndifferentMatchNo MatchAccuracy (%)Cohen’s Kappap-Value
Inden Calvarium—Morphological
Cranium overall74512368.921361511083.610.6650.000
Frontal tuberosity and Steepness75591678.71362521083.90.6650.000
Glabella-supra orbital ridges75532270.71263531084.10.6790.000
Margo orbitalis75433257.31758431574.10.4500.000
Eye orbitals62431969.4458431574.10.4640.000
Zygomaticum69442563.8960441673.30.4440.000
Mastoid process75433257.31362422067.70.3510.000
Inden Mandible—Morphological
Mandible overall65501576.9065501576.90.4910.000
Condylar process64432167.2064432167.20.2940.019
Corpus height59392066.1059392066.10.2640.040
Mentum57401770.2057401770.20.3480.006
Gonion and angle54351964.8153351866.00.1830.052
Inden Pelvis—Morphological
Iliac crest69472268.1069472268.10.3610.003
Arc compose90672374.4387672077.00.5300.000
Greater schiatic notch90652572.2783651878.30.5560.000
Sub pubic angle5448688.915348590.60.7970.000
Note: The numbers in bold highlight the highest accuracy among all the traits for a particular skeletal element.
Table 6. Total number of skull and os coxae traits observed: the correctly classified, indifferent (ambiguous), and incorrect results and the corresponding accuracy rates in the South African series.
Table 6. Total number of skull and os coxae traits observed: the correctly classified, indifferent (ambiguous), and incorrect results and the corresponding accuracy rates in the South African series.
TraitIndividuals Considered with Known SexMatchNo MatchRaw Accuracy (%)IndifferentWithout IndifferentMatchNo MatchAccuracy (%)Cohen’s Kappap-Value
South Africa Calvarium—Morphological
Cranium overall1601303081.301601303081.30.5950.000
Frontal tuberosity and Steepness1601253578.121581253379.10.5730.000
Glabella-supra orbital ridges1601303081.311591302981.80.6130.000
Margo orbitalis1601085267.511591085167.90.2990.000
Eye orbitals1601243677.521581243478.50.5330.000
Zygomaticum1601243677.541561243279.50.5810.000
Mastoid process1601134770.611591134671.10.4020.000
South Africa Mandible—Morphological
Mandible overall1601303081.341561302683.30.6340.000
Condylar process1601144671.311591144571.70.4070.000
Corpus height1601174373.141561173975.00.4640.000
Gonion and angle1601273379.421581273180.40.6040.000
Mentum1601025863.851551025365.80.2900.000
South Africa Pelvis—Morphological
Iliac crest1561094769.911551094670.30.3820.000
Greater schiatic notch1561223478.221541223279.20.5520.000
Arc compose1561272981.411551272881.90.6090.000
Sub pubic angle1561461093.61155146994.20.8790.000
Note: The numbers in bold highlight the highest accuracy among all the traits for a particular skeletal element.
Table 7. Total number of skull and os coxae traits observed: the correctly classified, indifferent (ambiguous), and incorrect results and the corresponding accuracy rates in the Lübeck series.
Table 7. Total number of skull and os coxae traits observed: the correctly classified, indifferent (ambiguous), and incorrect results and the corresponding accuracy rates in the Lübeck series.
TraitIndividuals Considered with Known MatchNo MatchRaw Accuracy (%)IndifferentWithout IndifferentMatchNo MatchAccuracy (%)Cohen’ s Kappap-Value
Lübeck Calvarium—Morphological
Cranium overall43232053.5439231659.00.2730.027
Frontal tubercle and Steepness40211952.5337211656.80.2040.121
Glabella-supra orbital ridges43202346.5835201557.10.2740.018
Margo orbitalis SH42182442.9735181751.40.1770.128
Eye orbitals43232053.5142231954.80.2220.045
Zygomaticum43202346.5637201754.10.2090.076
Mastoid process43202346.5637201754.10.2110.074
Lübeck Mandible—Morphological
Mandible overall31181358.1031181358.10.0740.675
Condylar process26121446.2125121348.00.1100.405
Corpus height31171454.8031171454.80.0840.609
Mentum31131841.9130131743.3−0.0490.745
Gonion and angle3122971.003122971.00.3180.076
Lübeck Pelvis—Morphological
Iliac crest51312060.8744311370.50.4410.001
Arc compose51401178.434840883.30.6520.000
Greater schiatic notch50361472.074336783.70.6700.000
Sub pubic angle51321962.7133832684.20.6870.000
Note: The numbers in bold highlight the highest accuracy among all the traits for a particular skeletal element.
Table 8. Accuracy rates for individual traits in males and females separately.
Table 8. Accuracy rates for individual traits in males and females separately.
TraitIndenLübeckSouth Africa
Male MatchMale Accuracy (%)Female MatchFemale Accuracy (%)McNemar TestMale MatchMale Accuracy (%)Female MatchFemale Accuracy (%)McNemar TestMale MatchMale Accuracy (%)Female MatchFemale Accuracy (%)McNemar Test
Cranium—Morphological
Cranium overall30 (36)8421/2583.300.75411 (12)91.712 (27)44.400.00187 (98)88.843 (62)69.40.200
Frontal tubercle and Steepness32 (37)86.520 (25)801.0008 (10)8013 (27)48.10.00475 (96)78.150 (62)80.600.163
Glabella-supra orbital ridges30 (36)83.323 (27)85.20.75411 (11)1009 (24)37.50.00084 (97)86.646 (62)74.20.711
Margo orbitalis29 (34)85.314 (24)58.30.30210 (11)90.98 (24)33.30.00078 (97)80.430 (62)48.40.092
Eye orbitals16 (24)66.727 (34)79.41.00011 (12)91.712 (30)400.00085 (96)88.539 (62)62.90.058
Zygomaticum28 (36)77.816 (24)66.71.00011 (12)91.79 (25)360.00074 (94)78.750 (62)80.60.215
Mastoid process24 (36)66.718 (26)69.20.50310 (11)90.910 (26)38.50.00071 (97)73.242 (62)67.70.461
Mandible—Morphological
Mandible overall35 (41)85.415 (24)62.50.6074 (9)4414 (22)63.60.58189 (96)92.741 (60)68.30.029
Condylar process30 (40)7513 (24)54.21.0004 (5)808 (20)400.00374 (97)76.340 (62)64.51.000
Corpus height28 (36)77.811 (23)47.80.5035 (9)55.612 (22)54.50.18079 (95)83.238 (61)62.30.337
Mentum29 (34)85.311 (23)47.80.1435 (9)55.68 (21)38.10.04966 (95)69.536 (60)600.011
Gonion and angle31 (32)96.94 (21)190.0005 (9)55.617 (22)77.31.00074 (97)90.253 (61)86.90.583
Pelvis—Morphological
Iliac crest26 (39)66.721 (30)700.52314 (15)93.317 (29)58.60.00370 (95)73.739 (60)650.659
Arc compose40 (49)81.627 (38)71.10.82415 (18)83.325 (30)83.30.72785 (96)88.542 (59)71.20.215
Greater schiatic notch39 (47)8326 (36)72.20.81515 (17)88.221 (26)80.80.45382 (94)87.240 (60)66.70.345
Sub pubic angle31 (33)93.917 (20)851.00015 (16)93.817 (22)77.30.21989 (95)93.757 (60)950.508
Note: The numbers in bold highlight the highest accuracy among all the traits for a particular skeletal element.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Gupta, A.; Billings, B.K.; Hummel, S.; Grosskopf, B. Evaluating Morphological Methods for Sex Estimation on Isolated Human Skeletal Materials: Comparisons of Accuracies between German and South African Skeletal Collections. Forensic Sci. 2022, 2, 574-584. https://doi.org/10.3390/forensicsci2030042

AMA Style

Gupta A, Billings BK, Hummel S, Grosskopf B. Evaluating Morphological Methods for Sex Estimation on Isolated Human Skeletal Materials: Comparisons of Accuracies between German and South African Skeletal Collections. Forensic Sciences. 2022; 2(3):574-584. https://doi.org/10.3390/forensicsci2030042

Chicago/Turabian Style

Gupta, Avinash, Brendon K. Billings, Susanne Hummel, and Birgit Grosskopf. 2022. "Evaluating Morphological Methods for Sex Estimation on Isolated Human Skeletal Materials: Comparisons of Accuracies between German and South African Skeletal Collections" Forensic Sciences 2, no. 3: 574-584. https://doi.org/10.3390/forensicsci2030042

APA Style

Gupta, A., Billings, B. K., Hummel, S., & Grosskopf, B. (2022). Evaluating Morphological Methods for Sex Estimation on Isolated Human Skeletal Materials: Comparisons of Accuracies between German and South African Skeletal Collections. Forensic Sciences, 2(3), 574-584. https://doi.org/10.3390/forensicsci2030042

Article Metrics

Back to TopTop