Next Article in Journal
Clinical Pharmacists’ Involvement in Pharmacogenomics Testing and Related Services in China
Previous Article in Journal
Pharmacogenomics of Monoclonal Antibodies for the Treatment of Rheumatoid Arthritis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Commentary

The False Dawn of Polygenic Risk Scores for Human Disease Prediction

by
Anthony F. Herzig
1,
Françoise Clerget-Darpoux
2,* and
Emmanuelle Génin
1,*
1
Inserm, Université de Brest, EFS, CHU Brest, UMR 1078, GGB, F-29200 Brest, France
2
Université Paris Cité, Inserm, Institut Imagine, Laboratoire Embryologie et Génétique des Malformations, F-75015 Paris, France
*
Authors to whom correspondence should be addressed.
J. Pers. Med. 2022, 12(8), 1266; https://doi.org/10.3390/jpm12081266
Submission received: 20 June 2022 / Revised: 24 July 2022 / Accepted: 28 July 2022 / Published: 31 July 2022
(This article belongs to the Special Issue The Genomics in Human Disease)

Abstract

:
Polygenic risk scores (PRSs) are being constructed for many diseases and are presented today as a promising avenue in the field of human genetics. These scores aim at predicting the risk of developing a disease by leveraging the many genome-wide association studies (GWAS) conducted during the two last decades. Important investments are being made to improve score estimates by increasing GWAS sample sizes, by developing more sophisticated methods, and by proposing different corrections for potential biases. PRSs have entered the market with direct-to-consumer companies proposing to compute them from saliva samples and even recently to help parents select the healthiest embryos. In this paper, we recall how PRSs arose and question the credit they are given by revisiting underlying assumptions in light of the history of human genetics and by comparing them with estimated breeding values (EBVs) used for selection in livestock.

1. From the Early Days of Human Genetics to the Birth of Polygenic Risk Scores (PRS)

PRSs were born from the idea that genetic marker information can be useful to predict and prevent complex diseases. This idea is not new. Fifty years ago, the discovery of the human leucocyte antigens (HLAs) and their associations with many diseases led Nobel Prize winner Jean Dausset to declare the following at the Fifth International HLA Workshop:
«A relatively important positive or negative association of the HL-A determinant with some well-characterized disease would be of the utmost predictive value, allowing the introduction of preventive treatment, and possibly to the eradication of the disease». [1]
Numerous HLA workshops were subsequently organized, with worldwide collaborations, in order to refine observed associations and to search for their functional basis. These studies were not very successful at predicting disease risk, but they highlighted some important aspects of the genetic architecture of common diseases. First, they showed that there existed some important genetic variability between populations with differences of allele frequency and linkage disequilibrium observed at all geographic scales, between continents but also between countries within a continent, and even within a country [2,3,4]. The important role of our immune system in our biological functioning and the selective response to environmental attacks and large epidemics probably explain this great variability [5]. Second, it was found that the biological mechanisms underlying observed associations are usually very complex. Heterozygous carriers of a given HLA allele can, for example, be more at risk than homozygotes for this allele; e.g., in type 1 diabetes, DR3/DR4 heterozygotes have a higher risk (OR = 16.59; 95% CI, 13.7–20.1) than DR3/DR3 (OR = 6.32; 95% CI, 5.12–7.80) or DR4/DR4 homozygotes (OR = 5.68; 95% CI, 3.91–8.23) (see the meta-analysis in [6]). Interactions were found between different alleles located at different HLA loci. This is, for example, the case in celiac disease, where a large majority of patients carry the two alleles DQA1*0501 and DQB1*0201 that form a heterodimer [7] and the risk is further increased for individuals homozygous for the DQB1*0201 allele [8]. It was also discovered that an allele that confers risk for a disease could be protective against another disease such as where, for example, the DR15 antigen increases the risk of multiple sclerosis (MS), but decreases the risk of type 1 diabetes or rheumatoid arthritis.
Third, for many diseases, it was found that, even if relative risks for carriers compared with non-carriers of HLA antigens are high, these common HLA antigens confer weak absolute risks to their carriers. Consequently, HLA genotyping is not useful to predict those diseases in the general population. This is well illustrated by the association of DR15 and MS. In the French population, DR15 carriers have a relative risk of 4 compared with non-carriers, but the absolute risk for a DR15 carrier of developing MS is only one per thousand. Similarly, for celiac disease, the relative risk of carriers of the DQA1*0501–DQB1*0201 heterodimer compared with non-carriers is very high, around 24 in Europe, but the positive predictive risk conferred by the heterodimer is less than 1%. Consequently, HLA typing is not useful for prediction at the population level. However, it provides information to the clinician to evaluate the risk for the relative of an affected individual [9].
Predominantly, the risk would be calculated according to the mode of transmission of the disease or, when this mode of transmission was not known, according to previously estimated data. For example, the Gail model [10] was proposed to clinicians to compute the risk for a woman to develop breast cancer based on the following factors: age at menarche, age at first live birth, number of previous biopsies, and number of first-degree relatives with breast cancer. However, a large uncertainty was attached to these risk estimates and there was hope that improvements would come from the mapping of the genes involved. An important step in the history of disease risk prediction was indeed the launching of the Human Genome project in the 1980s. Linkage analyses were carried out in large pedigrees with many affected people. More than 1200 genes were identified, often providing knowledge on the protein involved and the deleterious variants [11]. Monogenic sub-entities were also identified in common and complex diseases; for example, some early-onset cases of Alzheimer disease were found to be due to rare deleterious variants in the APP, PSN1, or PSN2 genes [12]. Other examples include the Maturity-onset diabetes of the young (MODY) forms of diabetes [13] or breast cancer due to deleterious variants in the BRCA1 [14,15] and BRCA2 [16,17] genes. In families where a deleterious variant segregated, it became possible to determine with certainty family members who had inherited the deleterious variant and family members who had not, and thus to determine who was at high risk. However, even for disease due to deleterious variants, the reliability of prediction depends on the penetrance; i.e., the probability of the disease developing in a variant carrier. Indeed, this penetrance could be much smaller than 100%, as other factors may play a role in disease expression by modulating it and leading to different severity. These factors can be environmental factors or genetic factors that are then often referred to as modifier genes (see [18] for a discussion on modifier genes). In addition, for monogenic sub-entities of common diseases, the interpretation of family history and thus genetic counselling can be complicated by the fact that, in the same family, there may exist some genetic heterogeneity. This difficulty is well illustrated on breast cancer. Given the high risk of breast cancer for women in the general population, in large families where a pathogenic variant of BRCA1 is segregating, some women may be affected without carrying the variant. Consequently, familial analysis is very sensitive to the phenotype–genotype correspondence [19]. Of special importance are the degrees of relatedness of the family members with cancer, the ages of disease onset, and the ages of family members who do not have the disease [20,21]. Besides, rare variants of BRCA1 have different likelihoods of pathogenicity [22] and many of them are even of unknown clinical significance [23]. Adding to these difficulties, there is also the possibility that de novo variants or mosaic variants could cause disease, making diagnosis and disease prediction even more difficult (see, for example, [24,25]).
When the HapMap project was launched in 2002, enthusiasm and hope of geneticists were again at a peak. Sets of tag-SNPs that capture most of the common variation on the human genome were defined and used to construct SNP-arrays to perform genome wide association studies (GWASs). Francis Collins, Director of the NIH, predicted that genetic diagnosis would be available within 10 years, and U.S. President Clinton declared on 26 June 2000: “it would revolutionize the diagnosis, prevention and treatment of most, if not all, human diseases”. The International Hapmap Consortium shared this optimistic view: “The HapMap will allow the discovery of sequence variants that affect common disease, will facilitate development of diagnostic tools, and will enhance our ability to choose targets for therapeutic intervention”. The results of GWAS were, however, not as expected. Associated loci were found to only confer very small increments in risk. However, even if biological interpretations were difficult, there was hope that GWAS results could be used to predict disease risk. Wray et al. [26] proposed an approach to “predict the genetic risk faced by each individual” based on their genotypes at the SNPs found to be associated with the disease in GWAS. This was the birth of polygenic risk scores (PRSs).

2. From GWAS to PRS

The principle of combining information at different genetic markers to construct a genetic score was first introduced by animal geneticists to perform marker-assisted selection [27]. This score was later called estimated breeding value (EBV) [28] and is used to estimate offspring performances for quantitative traits of agricultural interest. The principals of EBV are built themselves on the assumption that it is possible to separate the genetic and environmental variance components of a trait. Furthermore, that there are a (typically large) number of genetic variants that each capture a small part of the genetic variance, and the heritability of the trait can thus be recovered by summing across their contributions; this being the polygenic model of inheritance proposed by R.A. Fisher [29]. PRSs were introduced into human genetics by Wray et al. 2007 [26] according to the same principle of summing across the contribution of many different SNPs, but serve a different purpose; that is, predicting the individual risk to develop a disease. Given the relationship between PRS and disease heritability under the model of polygenic architecture, it was shown by Wray et al. 2007 [26] that the predictive power of a PRS is bounded by the heritability, and that is thus important to achieve the maximum disease heritability. Following these results, many geneticists embarked in the quest for the so-called missing heritability [30,31].
PRSs are constructed as a weighted sum of risk allele counts using effect sizes estimated from GWAS as the weights. The value P R S i for individual i based on their genotypes at N associated SNPs is computed as follows:
P R S i = k = 1 N β k G i , k
where k is the effect size for SNP k measured in an independent GWAS (or summary statistics obtained from different GWAS for the same disease) and G i , k is the observed genotype of individual i at SNP k coded as 0, 1, or 2 to represent the number of minor alleles carried by the individual.
In the literature of PRS construction, importance is placed on the choice of the SNPs to include in the model and their weights. In early PRS manifestations, only the SNPs that reached genome-wide significance in the GWAS of interest were included in the model [26,32]. After the results of Yang et al. [33] showed an increase in height heritability estimates when all SNPs rather than only the significant ones were included, it was suggested that PRSs could be improved by keeping all SNPs and applying some shrinkage methods using, for example, LASSO [34] or ridge regression [35,36]. It is also important to account for linkage disequilibrium (LD) between SNPs in order to correctly measure the contribution of the different genomic regions. This could be done either by modelling LD between SNPs to estimate independent genetic effects or by prioritizing, at each locus, SNPs with the smallest GWAS p-value (this is referred to as clumping) (for a discussion of these issues, see Choi et al. 2020 [37]).

3. The Different Uses of PRS: From Research to Clinics

Once a PRS is constructed with a set of selected SNPs for a given disease, it can be calculated for any individual by obtaining a DNA sample and genotyping the individual for this set of SNPs or using already available SNP-chip genotyping data for the individual. These PRS estimates have been proposed to be used for different purposes (for a review, see [38]). They could be used as a tool to identify individuals that are at high risk of developing a disease in order to offer them preventive treatments or to encourage them to change their behavior. In this case, individuals are ranked based on their PRS for the studied disease and the focus is on individuals in the top percentiles of the PRS distribution (see, for example, [39]).
This approach is also applied to specific sub-populations, for example, to individuals carrying a rare pathogenic variant. PRSs constructed for such a group of carriers have been proposed as a method to explain the observed incomplete penetrance of such mutations (see, for example, [40,41]).
Leveraging the numerous GWAS results available, PRSs have been built for different diseases in an attempt to “bridge the gap between initial discovery efforts and clinical applications” [42]. Standards have been proposed to report PRSs, as it was suggested that the main limitation to the translation of PRSs into clinical care was the heterogeneity in the way these scores were constructed. It was also suggested that PRSs would be improved by collecting association statistics from larger case-control datasets. Indeed, commenting on the progresses made in agricultural yield by the use of EBV, Wray et al. [43] pointed out the differences in family sizes and show how the GWAS sample size is the key factor for maximizing the accuracy of prediction. Besides family sizes, there are also many other important differences between PRS and EBV that can explain why PRSs will not match clinical expectation.

4. Erroneous Assumptions of PRS

In this chapter, we explain under which assumptions PRSs are estimated and why, contrarily to the EBV estimates used by animal geneticists, PRSs cannot be used for human disease prediction.

4.1. The Polygenic Additive Liability Model of Common Disease Genetic Architecture

A first difference between PRS and EBV is the studied phenotype. PRS are computed for binary phenotypes—affected or non-affected by a given disease—while EBVs usually consider quantitative phenotypes such as milk production. To extend the polygenic model of binary phenotypes, it is assumed that, underlying this binary disease phenotype, there exists an unobserved quantitative trait, called liability, transmitted under Fisher’s polygenic additive model [29] and a threshold over which individuals contract the disease. This polygenic additive liability (PAL) model was proposed 60 years ago to explain the familial segregation of diseases not compatible with monogenic transmission [44].
When using this model, one assumes that the disease has a homogeneous genetic determinism to which small and random effects of environmental factors are added. These are two strong hypotheses that were already discussed in 1965 by Falconer [45], when he showed that the heritability of liability to diabetes strongly decreased with the age of patients. At that time, the different forms of diabetes were not known and diabetes was considered as a single entity. The only information available to support or reject a disease model was the prevalence of the disease in the general population and among relatives of randomly sampled affected individuals from the studied population. Following Falconer’s observations, there were long debates in the scientific community regarding the PAL model and recognition that many alternative models could explain these observations (for a review, see [46]). In the following years, with the development of molecular biology, it was discovered that diabetes is a complex and highly heterogeneous disease with autoimmune forms (with generally early onset and for which the HLA genes play an important and complex role, denoted type 1 diabetes), forms associated with obesity (more often late onset with an important role of diet, denoted type 2 diabetes), as well as monogenic forms that are themselves heterogeneous [47]. It is clear now that the PAL model does not apply to diabetes as a whole and its relevance to type 1 and type 2 diabetes is also very questionable [48]. Diabetes is not an exception. The hypothesis of genetic homogeneity is also not true for most diseases. In breast cancer, for example, there exist different forms of disease and a single genetic model cannot explain them all. Some monogenic forms are due to rare pathogenic variants in different genes. Other forms not explained by high-risk variants are likely to involve different genes and pathways. Indeed, women affected by these forms show different histologies, different gene-expression profiles, and mutational patterns that translate into variable clinical courses and major differences in response to systemic treatments [49]. When pooling all of these different forms together, it is not possible to obtain reliable estimates of genetic effects. Instead, we expect exactly what is observed; that is, very small odds-ratios over many different SNPs. The PAL model is not rejected, whereas we know that it is not a correct model.
Underlying the PAL model is also the assumption that each environmental factor has a small effect that is independent from the effects of the genetic factors. For most traits however, this is clearly not true. The effect of the environment can be important and a given genotype can react differently across different environments [50]. This is well-known from breeders. Changes in the environment can have a dramatic impact on the efficiency of cattle breeding programs and the accuracy of selection [51,52,53]. This occurs, for example, in the case for milk production, with a significant impact of environmental factors such as temperature and humidity and gene–environment interactions [54,55]. The environment cannot be ignored and, as explained by Feldman and Lewontin [56], “Partitioning of the causes of variation is really illusory. The genetic variance depends on the distribution of environments and the environmental variance depends on the distribution of genotypes”. In their footsteps, Burt [57] claims that “the conceptual (biological) model on which heritability studies [and thus also PRS] depend—that of identifiable separate effects of genes vs. the environment on phenotype variance—is unsound”. Gottlieb [58] considers that genes are part of a “developmental system” and Moore and Shenk [59] explain that “contemporary biology has demonstrated that traits are the product of interactions between genetic and non-genetic factors at every point of the development”. This is well illustrated, for example, in cerebrovascular diseases, with both lifestyle and intrinsic factors playing a role in the development and the severity of neurodegenerative diseases [60]. In the field of livestock selection, the environment effect can be studied and taken into account (see, for example, [55]). In human populations, environments change between geographical location, between cultural groups, and between generations; they cannot be precisely controlled for.

4.2. GWAS and Causal Inference

Underneath PRS is the idea that there are individuals with a genetic makeup that predispose them to developing disease and that it is possible to identify such individuals based on their PRS values. It is thus assumed that the different SNPs used to construct the PRS have a direct or indirect (through linkage disequilibrium) effect on the disease itself. The SNPs used in PRS are identified by association tests on the basis of different genotype distributions in cases and controls. It is thus assumed that these observed differences in genotype distributions reflect a true genetic effect. However, this is not true. The most discussed reason is the well-recognized problem of population stratification, but this is not the sole source of associations that do not reflect direct genetic effects. Stratification can be corrected for or averted by matching cases and controls on ancestry, but association may well still reflect the effect of genetic or environmental factors involved not only on the studied trait, but also on other traits associated with the studied trait [61]. This is, for example, the case of many diseases associated with BMI. Among the SNPs associated with these diseases, there are many SNPs that are in fact associated with BMI and not directly with the disease. Such indirect associations are even used by mendelian randomization methods to study the causal impact of BMI on diseases [62], illustrating the implicit acceptance that GWAS associations reflect true genetic effects. In breast cancer GWAS, for example, associations are found with SNPs located in different genes involved in adipocyte differentiation. This does not mean that these genes are directly responsible for breast cancer. It is more likely that this reflects an indirect association owing to fat cells increasing the production of estrogen and, therefore, promoting the growth of cancer cells. Moreover, some associations of SNPs with breast cancer may in fact reflect a difference between cases and controls for alcohol consumption, as alcohol has also been shown to affect estrogen receptors [63]. Matching cases and controls on all environmental risk is, however, not possible, and controls used in GWAS are usually less exposed to environmental risk factors than the general population. Even in a population considered as genetically homogenous, environmental stratification may well exist and PRSs have been shown to be sensitive to changes in variables such as age, sex, and socio-economic status, as shown in recent studies of the U.K. biobank [64]. Therefore, PRSs not only contain information about genetic factors inherited from parents, but also information on clinical and environmental factors. As Janssens [65] rightly pointed out, PRSs do not measure what they are supposed to measure. This fact is totally ignored by those who integrate PRS values in risk models as an observation, which is independent from clinical and environmental factors.
The same problems do not hold when using EBVs to select the best reproducers. Indeed, the genotypes on which the selection is performed are those of the parents, but the trait is measured in the offspring. Thus, contrarily to PRS, we know that the SNPs considered to estimate EBV tag genetic factors transmitted to the offspring. Moreover, the environment is also far more easily and deeply controlled in an agricultural context than it is in humans.

4.3. PRS as a Tool to Predict Individual Risk of Disease

Besides its use to classify individuals in a population according to their disease risk, PRS is also described in the litterature as a measure of disease liability; i.e., the probability of developing the disease. This is very different from the use of EBVs at the population scale, where only a ranking is performed to identify the best reproducers—not on their phenotype, but on what they could transmit to their progeny.
Indeed, when we are interested in the transmission of a trait from one generation to the next, the assumption of an additive effect of risk alleles has no impact as only one of the two alleles of the parent is transmitted to the offspring. However, this assumption can have a tremendous impact in disease risk prediction as dominance and epistatic effects cannot be ignored when assessing the absolute risk of developing a disease. This is well illustrated by the example of celiac disease for which the absolute risk is almost zero for those who are not carriers of the HLA DQ2 or HLA DQ8 heterodimer, regardless of their genotypes on the remaining part of the genome [66].
Moreover, if accounting for gene–environment interactions does not seem to improve the discrimination ability of PRS [67], this is no longer true when it comes to evaluating the absolute risk of an individual to develop a disease. The absolute risk of an individual may be high in a given environment and low or even null in another environment. As outlined by Lewis and Vassos [38], “the risk arising from one’s genes is dynamic, depending on changing factors”. Some genes may have weak (or even no) marginal effects, but an important effect through their interaction [68,69,70]. Under such “purely epistatic” models, important genes would be missed by association studies [71], and thus their effect cannot be captured by PRS, potentially leading to dramatic underestimation of disease risk.
To predict the individual absolute risk of disease, one needs to know the genotype–phenotype correspondence and this correspondence can be very complex. In breast cancer, for example, the risks of women carrying a deleterious variant differ according to the gene in which the variant is present, but also, for two of the same gene, according to the variant pathogenicity. This pathogenicity can only be assessed by functional assay and is certainly not predicted by PRS values. However, in the BOADICEA risk model, the absolute risk of a woman carrying a rare pathogenic variant is calculated by considering her PRS value as a risk modifier of the variant effect [72]. This is clearly inadequate because, when trying to explain the risk of a woman carrying a rare pathogenic variant, the phenotype of interest is not the disease status (affected/non affected), but instead, we should consider as a phenotype the clinical variability in the population of affected individuals carrying the same variant [18]. In some predictive models, a familial component is even added to PRS as if PRS was capturing a genetic component not inherited from parents (see, for example, [73,74,75]). This is a rather strange interpretation of the PAL model and it is important to recall that this model was proposed to explain recurrence risk in relatives not compatible with monogenic transmission.
Another important misconception is the interpretation of statistics such as the area under the receiver operating characteristic curve (AUC). There is a general confusion between the capacity of PRS to discriminate between affected and unaffected individuals in a population quantified by AUC and PRS predictive power of individual status (affected vs. unaffected). AUC is wrongly considered by many as a precision measure of individual risk and used to measure the clinical utility of PRS [72,75]. However, as shown by Wald and Old [76], PRS may be a very poor predictor of absolute risk in terms of specificity and sensitivity. Taking the example of coronary artery disease, where high AUC are found, Wald and Old [76] show on the data of Khera et al. [77] (AUC = 0.81) and Inouye et al. [78] (AUC = 0.79) that, for a false positive rate of 5%, 85% and 87% of true positives, respectively, would be missed. As explained by Janssen and Martens [79]: “AUC is a measure of the discriminative ability of prediction models. The assessment of prediction models should be supplemented with other metrics to assess their clinical utility”. The same questioning on the clinical utility of AUC exists in the field of imaging, where some authors have called for the use of net benefit instead of AUC [80]. It is important to realize that high AUC is not proof of the clinical utility of PRS.

5. Discussion

Forgetting the original purpose of GWAS—namely a preliminary step towards understanding the complex and unknown etiology of human diseases—many GWAS now focus on developing PRS. Since its promotion—during the 2007–2010 years—the number of studies devoted to PRS has increased exponentially.
Unfortunately, it is under erroneous assumptions that these scores are computed. As for heritability estimates, PRS depends on an underlying genetic model that is unknown for most human diseases and, in our view, PRS is another resurgence of what we previously described as the GIGO syndrome [81]. The Polygenic Additive Liability model, under which PRS is computed, was proposed sixty years ago to explain the familial segregations of a disease that cannot be explained by a monogenic transmission model. Since then, human geneticists have learned that such a model cannot account for the heterogeneity and complexity of pathophysiological processes. The adoption of this model accredits the idea that our diseases are genetically determined and that our genetic risks of contracting a disease are known at birth.
The misinterpretation of PRS has dangerous clinical and eugenic consequences. Some argue that individuals with a high PRS deserve the same attention than rare mutations carriers and defend the use of PRS in clinical practice [78,82,83]. More and more clinicians are considering using PRS in therapeutic prevention. Furthermore, after years of individual medical future selling by direct-to-consumer companies, new companies are now offering prenatal diagnosis, based on PRS, for all the diseases that parents want to avoid infecting their children [84,85]. Some already protested—from an ethical standpoint—against such eugenic drift [86,87]. However, in addition to the ethical question, one must also be aware that it is scientifically wrong. It is important to stop selling the dream that it is possible to predict disease risk with an over simplistic model. As we have shown, the estimated breeding value is not used in the same way as polygenic risk scores. When using EBVs for selection, phenotypes are measured in the offspring and under well controlled environmental conditions. This may be successful for improving a trait in livestock, but there is no equivalence by which its success might be transferred with the envisioned use of PRS as a universal measure of risk to develop a disease.
Finally, sticking with the single association information, even in very huge samples, is not sufficient to progress in complex disease etiologies. In particular, it is really strange that so many geneticists have turned away from the information provided by familial correlation. There is a huge gap between observing associations in a population and understanding the role of genes in the disease development process [88]. Recent technological advances have offered new possibilities to study the 3D genome organization and shown that chromatin conformation is important to ensure genome function by allowing interactions between different genomic regions. Protein–protein interaction networks have been described that showed that interactions exist at all scales. However, in statistical genetics, we are using a model proposed a century ago when almost no molecular data were available and that ignores all interactions. As already stressed by Nelson et al. [89], it is more than time to move forward and propose alternative models that take better account of the advances made on our understanding of the complex biological processes that drive human phenotypes. Attempts have been made in this direction with, for example, the omnigenic model [90], but it is important to continue on this path and to stop selling the dream that it is possible to predict disease risk with an over simplistic model.

Author Contributions

All authors have contributed substantially to the writing of this paper and agreed to the published version of the manuscript.

Funding

This research was funded by Inserm Cross-Cutting Project GOLD on the genomic variability in health and disease and by the French Ministry of Higher Education, Research, and Innovation for the POPGEN France Genomic Medicine pilot project.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Dausset, J.; Colombani, J. (Eds.) Histocompatibility Testing 1972; Munksgaard: Copenhagen, Denmark, 1973. [Google Scholar]
  2. Buhler, S.; Sanchez-Mazas, A. HLA DNA Sequence Variation among Human Populations: Molecular Signatures of Demographic and Selective Events. PLoS ONE 2011, 6, e14643. [Google Scholar] [CrossRef] [PubMed]
  3. Gourraud, P.-A.; Khankhanian, P.; Cereb, N.; Yang, S.Y.; Feolo, M.; Maiers, M.; Rioux, J.D.; Hauser, S.; Oksenberg, J. HLA Diversity in the 1000 Genomes Dataset. PLoS ONE 2014, 9, e97282. [Google Scholar] [CrossRef] [PubMed]
  4. Degioanni, A.; Darlu, P.; Raffoux, C. Analysis of the French National Registry of Unrelated Bone Marrow Donors, Using Surnames as a Tool for Improving Geographical Localisation of HLA Haplotypes. Eur. J. Hum. Genet. 2003, 11, 794–801. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Quintana-Murci, L. Human Immunology through the Lens of Evolutionary Genetics. Cell 2019, 177, 184–199. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Noble, J.A.; Valdes, A.M. Genetics of the HLA Region in the Prediction of Type 1 Diabetes. Curr. Diab. Rep. 2011, 11, 533–542. [Google Scholar] [CrossRef] [Green Version]
  7. Sollid, L.M.; Markussen, G.; Ek, J.; Gjerde, H.; Vartdal, F.; Thorsby, E. Evidence for a Primary Association of Celiac Disease to a Particular HLA-DQ Alpha/Beta Heterodimer. J. Exp. Med. 1989, 169, 345–350. [Google Scholar] [CrossRef]
  8. Clerget-Darpoux, F.; Bouguerra, F.; Kastally, R.; Semana, G.; Babron, M.; Debbabi, A.; Bennaceur, B.; Eliaou, J. High Risk Genotypes for Celiac Disease. Comptes Rendus Acad. Sci. III 1994, 317, 931–936. [Google Scholar]
  9. Bourgey, M.; Calcagno, G.; Tinto, N.; Gennarelli, D.; Margaritte-Jeannin, P.; Greco, L.; Limongelli, M.G.; Esposito, O.; Marano, C.; Troncone, R.; et al. HLA Related Genetic Risk for Coeliac Disease. Gut 2007, 56, 1054–1059. [Google Scholar] [CrossRef]
  10. Gail, M.H.; Brinton, L.A.; Byar, D.P.; Corle, D.K.; Green, S.B.; Schairer, C.; Mulvihill, J.J. Projecting Individualized Probabilities of Developing Breast Cancer for White Females Who Are Being Examined Annually. J. Natl. Cancer Inst. 1989, 81, 1879–1886. [Google Scholar] [CrossRef]
  11. Botstein, D.; Risch, N. Discovering Genotypes Underlying Human Phenotypes: Past Successes for Mendelian Disease, Future Approaches for Complex Disease. Nat. Genet. 2003, 33, 228–237. [Google Scholar] [CrossRef]
  12. Campion, D.; Dumanchin, C.; Hannequin, D.; Dubois, B.; Belliard, S.; Puel, M.; Thomas-Anterion, C.; Michon, A.; Martin, C.; Charbonnier, F.; et al. Early-Onset Autosomal Dominant Alzheimer Disease: Prevalence, Genetic Heterogeneity, and Mutation Spectrum. Am. J. Hum. Genet. 1999, 65, 664–670. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Froguel, P.; Vaxillaire, M.; Sun, F.; Velho, G.; Zouali, H.; Butel, M.O.; Lesage, S.; Vionnet, N.; Clément, K.; Fougerousse, F.; et al. Close Linkage of Glucokinase Locus on Chromosome 7p to Early-Onset Non-Insulin-Dependent Diabetes mellitus. Nature 1992, 356, 162–164. [Google Scholar] [CrossRef]
  14. Hall, J.M.; Lee, M.K.; Newman, B.; Morrow, J.E.; Anderson, L.A.; Huey, B.; King, M.C. Linkage of Early-Onset Familial Breast Cancer to Chromosome 17q21. Science 1990, 250, 1684–1689. [Google Scholar] [CrossRef] [Green Version]
  15. Miki, Y.; Swensen, J.; Shattuck-Eidens, D.; Futreal, P.A.; Harshman, K.; Tavtigian, S.; Liu, Q.; Cochran, C.; Bennett, L.M.; Ding, W. A Strong Candidate for the Breast and Ovarian Cancer Susceptibility Gene BRCA1. Science 1994, 266, 66–71. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Wooster, R.; Neuhausen, S.L.; Mangion, J.; Quirk, Y.; Ford, D.; Collins, N.; Nguyen, K.; Seal, S.; Tran, T.; Averill, D. Localization of a Breast Cancer Susceptibility Gene, BRCA2, to Chromosome 13q12-13. Science 1994, 265, 2088–2090. [Google Scholar] [CrossRef] [PubMed]
  17. Wooster, R.; Bignell, G.; Lancaster, J.; Swift, S.; Seal, S.; Mangion, J.; Collins, N.; Gregory, S.; Gumbs, C.; Micklem, G.; et al. Identification of the Breast Cancer Susceptibility Gene BRCA2. Nature 1995, 378, 789–792. [Google Scholar] [CrossRef] [PubMed]
  18. Génin, E.; Feingold, J.; Clerget-Darpoux, F. Identifying Modifier Genes of Monogenic Disease: Strategies and Difficulties. Hum. Genet. 2008, 124, 357–368. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  19. Margaritte, P.; Bonaiti-Pellie, C.; King, M.C.; Clerget-Darpoux, F. Linkage of Familial Breast Cancer to Chromosome 17q21 May Not Be Restricted to Early-Onset Disease. Am. J. Hum. Genet. 1992, 50, 1231–1234. [Google Scholar]
  20. Claus, E.B.; Risch, N.; Thompson, W.D. Genetic Analysis of Breast Cancer in the Cancer and Steroid Hormone Study. Am. J. Hum. Genet. 1991, 48, 232–242. [Google Scholar]
  21. Parmigiani, G.; Berry, D.A.; Aguilar, O. Determining Carrier Probabilities for Breast Cancer–Susceptibility Genes BRCA1 and BRCA2. Am. J. Hum. Genet. 1998, 62, 145–158. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Risch, H.A.; McLaughlin, J.R.; Cole, D.E.; Rosen, B.; Bradley, L.; Kwan, E.; Jack, E.; Vesprini, D.J.; Kuperstein, G.; Abrahamson, J.L.; et al. Prevalence and Penetrance of Germline BRCA1 and BRCA2 Mutations in a Population Series of 649 Women with Ovarian Cancer. Am. J. Hum. Genet. 2001, 68, 700–710. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Plon, S.E.; Eccles, D.M.; Easton, D.; Foulkes, W.D.; Genuardi, M.; Greenblatt, M.S.; Hogervorst, F.B.L.; Hoogerbrugge, N.; Spurdle, A.B.; Tavtigian, S.V.; et al. Sequence Variant Classification and Reporting: Recommendations for Improving the Interpretation of Cancer Susceptibility Genetic Test Results. Hum. Mutat. 2008, 29, 1282–1291. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Cao, Y.; Tokita, M.J.; Chen, E.S.; Ghosh, R.; Chen, T.; Feng, Y.; Gorman, E.; Gibellini, F.; Ward, P.A.; Braxton, A.; et al. A Clinical Survey of Mosaic Single Nucleotide Variants in Disease-Causing Genes Detected by Exome Sequencing. Genome Med. 2019, 11, 48. [Google Scholar] [CrossRef] [Green Version]
  25. Legrand, A.; Devriese, M.; Dupuis-Girod, S.; Simian, C.; Venisse, A.; Mazzella, J.M.; Auribault, K.; Adham, S.; Frank, M.; Albuisson, J.; et al. Frequency of de Novo Variants and Parental Mosaicism in Vascular Ehlers-Danlos Syndrome. Genet. Med. 2019, 21, 1568–1575. [Google Scholar] [CrossRef] [PubMed]
  26. Wray, N.R.; Goddard, M.E.; Visscher, P.M. Prediction of Individual Genetic Risk to Disease from Genome-Wide Association Studies. Genome Res. 2007, 17, 1520–1528. [Google Scholar] [CrossRef] [Green Version]
  27. Lande, R.; Thompson, R. Efficiency of Marker-Assisted Selection in the Improvement of Quantitative Traits. Genetics 1990, 124, 743–756. [Google Scholar] [CrossRef]
  28. Meuwissen, T.H.; Hayes, B.J.; Goddard, M.E. Prediction of Total Genetic Value Using Genome-Wide Dense Marker Maps. Genetics 2001, 157, 1819–1829. [Google Scholar] [CrossRef]
  29. Fisher, R.A. XV.—The Correlation between Relatives on the Supposition of Mendelian Inheritance. Earth Environ. Sci. Trans. R. Soc. Edinb. 1919, 52, 399–433. [Google Scholar] [CrossRef] [Green Version]
  30. Manolio, T.A.; Collins, F.S.; Cox, N.J.; Goldstein, D.B.; Hindorff, L.A.; Hunter, D.J.; McCarthy, M.I.; Ramos, E.M.; Cardon, L.R.; Chakravarti, A.; et al. Finding the Missing Heritability of Complex Diseases. Nature 2009, 461, 747–753. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  31. Génin, E. Missing heritability of complex diseases: Case solved? Human Genetics 2020, 139, 103–113. [Google Scholar] [CrossRef] [PubMed]
  32. Dudbridge, F. Power and Predictive Accuracy of Polygenic Risk Scores. PLoS Genet. 2013, 9, e1003348. [Google Scholar] [CrossRef]
  33. Yang, J.; Benyamin, B.; McEvoy, B.P.; Gordon, S.; Henders, A.K.; Nyholt, D.R.; Madden, P.A.; Heath, A.C.; Martin, N.G.; Montgomery, G.W.; et al. Common SNPs Explain a Large Proportion of the Heritability for Human Height. Nat. Genet. 2010, 42, 565–569. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Tibshirani, R. Regression Shrinkage and Selection via the Lasso. J. R. Stat. Soc. Ser. B Methodol. 1996, 58, 267–288. [Google Scholar] [CrossRef]
  35. Hoerl, A.E.; Kennard, R.W. Ridge Regression: Biased Estimation for Nonorthogonal Problems. Technometrics 1970, 12, 55–67. [Google Scholar] [CrossRef]
  36. Hoerl, A.E.; Kennard, R.W. Ridge Regression: Applications to Nonorthogonal Problems. Technometrics 1970, 12, 69–82. [Google Scholar] [CrossRef]
  37. Choi, S.W.; Mak, T.S.-H.; O’Reilly, P.F. Tutorial: A Guide to Performing Polygenic Risk Score Analyses. Nat. Protoc. 2020, 15, 2759–2772. [Google Scholar] [CrossRef] [PubMed]
  38. Lewis, C.M.; Vassos, E. Polygenic Risk Scores: From Research Tools to Clinical Instruments. Genome Med. 2020, 12, 44. [Google Scholar] [CrossRef] [PubMed]
  39. Mavaddat, N.; Michailidou, K.; Dennis, J.; Lush, M.; Fachal, L.; Lee, A.; Tyrer, J.P.; Chen, T.-H.; Wang, Q.; Bolla, M.K.; et al. Polygenic Risk Scores for Prediction of Breast Cancer and Breast Cancer Subtypes. Am. J. Hum. Genet. 2019, 104, 21–34. [Google Scholar] [CrossRef] [Green Version]
  40. Kuchenbaecker, K.B.; McGuffog, L.; Barrowdale, D.; Lee, A.; Soucy, P.; Dennis, J.; Domchek, S.M.; Robson, M.; Spurdle, A.B.; Ramus, S.J.; et al. Evaluation of Polygenic Risk Scores for Breast and Ovarian Cancer Risk Prediction in BRCA1 and BRCA2 Mutation Carriers. J. Natl. Cancer Inst. 2017, 109, djw302. [Google Scholar] [CrossRef] [Green Version]
  41. Muranen, T.A.; Mavaddat, N.; Khan, S.; Fagerholm, R.; Pelttari, L.; Lee, A.; Aittomäki, K.; Blomqvist, C.; Easton, D.F.; Nevanlinna, H. Polygenic Risk Score Is Associated with Increased Disease Risk in 52 Finnish Breast Cancer Families. Breast Cancer Res. Treat. 2016, 158, 463–469. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  42. Wand, H.; Lambert, S.A.; Tamburro, C.; Iacocca, M.A.; O’Sullivan, J.W.; Sillari, C.; Kullo, I.J.; Rowley, R.; Dron, J.S.; Brockman, D.; et al. Improving Reporting Standards for Polygenic Scores in Risk Prediction Studies. Nature 2021, 591, 211–219. [Google Scholar] [CrossRef] [PubMed]
  43. Wray, N.R.; Kemper, K.E.; Hayes, B.J.; Goddard, M.E.; Visscher, P.M. Complex Trait Prediction from Genome Data: Contrasting EBV in Livestock to PRS in Humans: Genomic Prediction. Genetics 2019, 211, 1131–1141. [Google Scholar] [CrossRef] [Green Version]
  44. Carter, C.O. The Inheritance of Congenital Pyloric Stenosis. Br. Med. Bull. 1961, 17, 251–253. [Google Scholar] [CrossRef] [PubMed]
  45. Falconer, D.S. The Inheritance of Liability to Certain Diseases, Estimated from the Incidence among Relatives. Ann. Hum. Genet. 1965, 29, 51–76. [Google Scholar] [CrossRef]
  46. Fraser, F.C. The Multifactorial/Threshold Concept—Uses and Misuses. Teratology 1976, 14, 267–280. [Google Scholar] [CrossRef] [PubMed]
  47. Gaál, Z.; Balogh, I. Monogenic Forms of Diabetes mellitus. Exp. Suppl. 2019, 111, 385–416. [Google Scholar] [CrossRef]
  48. Génin, E.; Clerget-Darpoux, F. Revisiting the Polygenic Additive Liability Model through the Example of Diabetes mellitus. Hum. Hered. 2015, 80, 171–177. [Google Scholar] [CrossRef] [PubMed]
  49. Ignatiadis, M.; Sotiriou, C. Luminal Breast Cancer: From Biology to Treatment. Nat. Rev. Clin. Oncol. 2013, 10, 494–506. [Google Scholar] [CrossRef] [PubMed]
  50. Falconer, D.S.; Mackay, T.F.C. Introduction to Quantitative Genetics, 4th ed.; Longmans Green: Harlow, UK, 1996. [Google Scholar]
  51. Bohmanova, J.; Misztal, I.; Tsuruta, S.; Norman, H.D.; Lawlor, T.J. Short Communication: Genotype by Environment Interaction Due to Heat Stress. J. Dairy Sci. 2008, 91, 840–846. [Google Scholar] [CrossRef] [PubMed]
  52. Hammami, H.; Vandenplas, J.; Vanrobays, M.-L.; Rekik, B.; Bastin, C.; Gengler, N. Genetic Analysis of Heat Stress Effects on Yield Traits, Udder Health, and Fatty Acids of Walloon Holstein Cows. J. Dairy Sci. 2015, 98, 4956–4968. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  53. Cheruiyot, E.K.; Nguyen, T.T.T.; Haile-Mariam, M.; Cocks, B.G.; Abdelsayed, M.; Pryce, J.E. Genotype-by-Environment (Temperature-Humidity) Interaction of Milk Production Traits in Australian Holstein Cattle. J. Dairy Sci. 2020, 103, 2460–2476. [Google Scholar] [CrossRef]
  54. Bohlouli, M.; Yin, T.; Hammami, H.; Gengler, N.; König, S. Climate sensitivity of milk production traits and milk fatty acids in genotyped Holstein dairy cows. J. Dairy Sci. 2021, 104, 6847–6860. [Google Scholar] [CrossRef] [PubMed]
  55. Bohlouli, M.; Alijani, S.; Naderi, S.; Yin, T.; König, S. Prediction Accuracies and Genetic Parameters for Test-Day Traits from Genomic and Pedigree-Based Random Regression Models with or without Heat Stress Interactions. J. Dairy Sci. 2019, 102, 488–502. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  56. Feldman, M.W.; Lewontin, R.C. The Heritability Hang-Up. Science 1975, 190, 1163–1168. [Google Scholar] [CrossRef]
  57. Burt, C.H. Heritability Studies: Methodological Flaws, Invalidated Dogmas, and Changing Paradigms. In Genetics, Health and Society; Advances in Medical Sociology; Emerald Group Publishing Limited: Bingley, UK, 2015; Volume 16, pp. 1–44. ISBN 978-1-78350-581-4. [Google Scholar]
  58. Gottlieb, G. Individual Development and Evolution: The Genesis of Novel Behavior; Psychology Press: London, UK, 2001. [Google Scholar]
  59. Moore, D.S.; Shenk, D. The Heritability Fallacy. Wiley Interdiscip. Rev. Cogn. Sci. 2017, 8, e1400. [Google Scholar] [CrossRef]
  60. Popa-Wagner, A.; Dumitrascu, D.; Capitanescu, B.; Petcu, E.; Surugiu, R.; Fang, W.-H.; Dumbrava, D.-A. Dietary Habits, Lifestyle Factors and Neurodegenerative Diseases. Neural Regen. Res. 2020, 15, 394. [Google Scholar] [CrossRef] [PubMed]
  61. Munafò, M.R.; Tilling, K.; Taylor, A.E.; Evans, D.M.; Davey Smith, G. Collider Scope: When Selection Bias Can Substantially Influence Observed Associations. Int. J. Epidemiol. 2018, 47, 226–235. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  62. Davey Smith, G.; Hemani, G. Mendelian Randomization: Genetic Anchors for Causal Inference in Epidemiological Studies. Hum. Mol. Genet. 2014, 23, R89–R98. [Google Scholar] [CrossRef] [Green Version]
  63. Fan, S.; Meng, Q.; Gao, B.; Grossman, J.; Yadegari, M.; Goldberg, I.D.; Rosen, E.M. Alcohol Stimulates Estrogen Receptor Signaling in Human Breast Cancer Cell Lines. Cancer Res. 2000, 60, 5635–5639. [Google Scholar]
  64. Mostafavi, H.; Harpak, A.; Agarwal, I.; Conley, D.; Pritchard, J.K.; Przeworski, M. Variable Prediction Accuracy of Polygenic Scores within an Ancestry Group. Elife 2020, 9, e48376. [Google Scholar] [CrossRef]
  65. Janssens, A.C.J.W. Validity of Polygenic Risk Scores: Are We Measuring What We Think We Are? Hum. Mol. Genet. 2019, 28, R143–R150. [Google Scholar] [CrossRef] [PubMed]
  66. Margaritte-Jeannin, P.; Babron, M.C.; Bourgey, M.; Louka, A.S.; Clot, F.; Percopo, S.; Coto, I.; Hugot, J.P.; Ascher, H.; Sollid, L.M.; et al. HLA-DQ Relative Risks for Coeliac Disease in European Populations: A Study of the European Genetics Cluster on Coeliac Disease. Tissue Antigens 2004, 63, 562–567. [Google Scholar] [CrossRef]
  67. Aschard, H.; Chen, J.; Cornelis, M.C.; Chibnik, L.B.; Karlson, E.W.; Kraft, P. Inclusion of Gene-Gene and Gene-Environment Interactions Unlikely to Dramatically Improve Risk Prediction for Complex Diseases. Am. J. Hum. Genet. 2012, 90, 962–972. [Google Scholar] [CrossRef] [Green Version]
  68. Templeton, A.R.; Weiss, K.M.; Nickerson, D.A.; Boerwinkle, E.; Sing, C.F. Cladistic Structure within the Human Lipoprotein Lipase Gene and Its Implications for Phenotypic Association Studies. Genetics 2000, 156, 1259–1275. [Google Scholar] [CrossRef] [PubMed]
  69. Moore, J.H. The Ubiquitous Nature of Epistasis in Determining Susceptibility to Common Human Diseases. Hum. Hered. 2003, 56, 73–82. [Google Scholar] [CrossRef] [PubMed]
  70. Mackay, T.F.; Moore, J.H. Why Epistasis Is Important for Tackling Complex Human Disease Genetics. Genome Med. 2014, 6, 42. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  71. Culverhouse, R.; Suarez, B.K.; Lin, J.; Reich, T. A Perspective on Epistasis: Limits of Models Displaying No Main Effect. Am. J. Hum. Genet. 2002, 70, 461–471. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  72. Lee, A.; Mavaddat, N.; Wilcox, A.N.; Cunningham, A.P.; Carver, T.; Hartley, S.; Babb de Villiers, C.; Izquierdo, A.; Simard, J.; Schmidt, M.K.; et al. BOADICEA: A Comprehensive Breast Cancer Risk Prediction Model Incorporating Genetic and Nongenetic Risk Factors. Genet. Med. 2019, 21, 1708–1718. [Google Scholar] [CrossRef] [Green Version]
  73. Garcia-Closas, M.; Gunsoy, N.B.; Chatterjee, N. Combined Associations of Genetic and Environmental Risk Factors: Implications for Prevention of Breast Cancer. J. Natl. Cancer Inst. 2014, 106, dju305. [Google Scholar] [CrossRef]
  74. Oliver, D.; Radua, J.; Reichenberg, A.; Uher, R.; Fusar-Poli, P. Psychosis Polyrisk Score (PPS) for the Detection of Individuals At-Risk and the Prediction of Their Outcomes. Front. Psychiatry 2019, 10, 174. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  75. Lambert, S.A.; Abraham, G.; Inouye, M. Towards Clinical Utility of Polygenic Risk Scores. Hum. Mol. Genet. 2019, 28, R133–R142. [Google Scholar] [CrossRef] [PubMed]
  76. Wald, N.J.; Old, R. The Illusion of Polygenic Disease Risk Prediction. Genet. Med. 2019, 21, 1705–1707. [Google Scholar] [CrossRef]
  77. Khera, A.V.; Chaffin, M.; Aragam, K.G.; Haas, M.E.; Roselli, C.; Choi, S.H.; Natarajan, P.; Lander, E.S.; Lubitz, S.A.; Ellinor, P.T.; et al. Genome-Wide Polygenic Scores for Common Diseases Identify Individuals with Risk Equivalent to Monogenic Mutations. Nat. Genet. 2018, 50, 1219–1224. [Google Scholar] [CrossRef] [PubMed]
  78. Inouye, M.; Abraham, G.; Nelson, C.P.; Wood, A.M.; Sweeting, M.J.; Dudbridge, F.; Lai, F.Y.; Kaptoge, S.; Brozynska, M.; Wang, T.; et al. Genomic Risk Prediction of Coronary Artery Disease in 480,000 Adults: Implications for Primary Prevention. J. Am. Coll. Cardiol. 2018, 72, 1883–1893. [Google Scholar] [CrossRef] [PubMed]
  79. Janssens, A.C.J.W.; Martens, F.K. Reflection on Modern Methods: Revisiting the Area under the ROC Curve. Int. J. Epidemiol. 2020, 49, 1397–1403. [Google Scholar] [CrossRef]
  80. Halligan, S.; Altman, D.G.; Mallett, S. Disadvantages of Using the Area under the Receiver Operating Characteristic Curve to Assess Imaging Tests: A Discussion and Proposal for an Alternative Approach. Eur. Radiol. 2015, 25, 932–939. [Google Scholar] [CrossRef] [Green Version]
  81. Génin, E.; Clerget-Darpoux, F. The Missing Heritability Paradigm: A Dramatic Resurgence of the GIGO Syndrome in Genetics. Hum. Hered. 2015, 79, 1–4. [Google Scholar] [CrossRef]
  82. Torkamani, A.; Wineinger, N.E.; Topol, E.J. The Personal and Clinical Utility of Polygenic Risk Scores. Nat. Rev. Genet. 2018, 19, 581–590. [Google Scholar] [CrossRef] [PubMed]
  83. Wray, N.R.; Lin, T.; Austin, J.; McGrath, J.J.; Hickie, I.B.; Murray, G.K.; Visscher, P.M. From Basic Science to Clinical Application of Polygenic Risk Scores: A Primer. JAMA Psychiatry 2021, 78, 101–109. [Google Scholar] [CrossRef]
  84. Treff, N.R.; Zimmerman, R.; Bechor, E.; Hsu, J.; Rana, B.; Jensen, J.; Li, J.; Samoilenko, A.; Mowrey, W.; Van Alstine, J.; et al. Validation of Concurrent Preimplantation Genetic Testing for Polygenic and Monogenic Disorders, Structural Rearrangements, and Whole and Segmental Chromosome Aneuploidy with a Single Universal Platform. Eur. J. Med. Genet. 2019, 62, 103647. [Google Scholar] [CrossRef]
  85. Treff, N.R.; Eccles, J.; Marin, D.; Messick, E.; Lello, L.; Gerber, J.; Xu, J.; Tellier, L.C.A.M. Preimplantation Genetic Testing for Polygenic Disease Relative Risk Reduction: Evaluation of Genomic Index Performance in 11,883 Adult Sibling Pairs. Genes 2020, 11, 648. [Google Scholar] [CrossRef]
  86. Lombardo, P.A. The Power of Heredity and the Relevance of Eugenic History. Genet. Med. 2018, 20, 1305–1311. [Google Scholar] [CrossRef] [Green Version]
  87. Lázaro-Muñoz, G.; Pereira, S.; Carmi, S.; Lencz, T. Screening Embryos for Polygenic Conditions and Traits: Ethical Considerations for an Emerging Technology. Genet. Med. 2021, 23, 432–434. [Google Scholar] [CrossRef]
  88. Bourgain, C.; Génin, E.; Cox, N.; Clerget-Darpoux, F. Are Genome-Wide Association Studies All That We Need to Dissect the Genetic Component of Complex Human Diseases? Eur. J. Hum. Genet. 2007, 15, 260–263. [Google Scholar] [CrossRef]
  89. Nelson, R.M.; Pettersson, M.E.; Carlborg, Ö. A Century after Fisher: Time for a New Paradigm in Quantitative Genetics. Trends Genet. 2013, 29, 669–676. [Google Scholar] [CrossRef]
  90. Boyle, E.A.; Li, Y.I.; Pritchard, J.K. An Expanded View of Complex Traits: From Polygenic to Omnigenic. Cell 2017, 169, 1177–1186. [Google Scholar] [CrossRef] [Green Version]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Herzig, A.F.; Clerget-Darpoux, F.; Génin, E. The False Dawn of Polygenic Risk Scores for Human Disease Prediction. J. Pers. Med. 2022, 12, 1266. https://doi.org/10.3390/jpm12081266

AMA Style

Herzig AF, Clerget-Darpoux F, Génin E. The False Dawn of Polygenic Risk Scores for Human Disease Prediction. Journal of Personalized Medicine. 2022; 12(8):1266. https://doi.org/10.3390/jpm12081266

Chicago/Turabian Style

Herzig, Anthony F., Françoise Clerget-Darpoux, and Emmanuelle Génin. 2022. "The False Dawn of Polygenic Risk Scores for Human Disease Prediction" Journal of Personalized Medicine 12, no. 8: 1266. https://doi.org/10.3390/jpm12081266

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop