A Quantitative Comparison of Exoplanet Catalogs

Bashi, Dolev; Helled, Ravit; Zucker, Shay

doi:10.3390/geosciences8090325

Open AccessArticle

A Quantitative Comparison of Exoplanet Catalogs

by

Dolev Bashi

^1,*,

Ravit Helled

²

and

Shay Zucker

¹

School of Geosciences, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, 6997801 Tel Aviv, Israel

²

Institute for Computational Science, Center for Theoretical Astrophysics and Cosmology, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland

^*

Author to whom correspondence should be addressed.

Geosciences 2018, 8(9), 325; https://doi.org/10.3390/geosciences8090325

Submission received: 30 July 2018 / Revised: 23 August 2018 / Accepted: 25 August 2018 / Published: 29 August 2018

(This article belongs to the Special Issue Detection and Characterization of Extrasolar Planets)

Download

Browse Figures

Versions Notes

Abstract

In this study, we investigated the differences between four commonly-used exoplanet catalogs (exoplanet.eu; exoplanetarchive.ipac.caltech.edu; openexoplanetcatalogue.com; exoplanets.org) using a Kolmogorov–Smirnov (KS) test. We found a relatively good agreement in terms of the planetary parameters (mass, radius, period) and stellar properties (mass, temperature, metallicity), although a more careful analysis of the overlap and unique parts of each catalog revealed some differences. We quantified the statistical impact of these differences and their potential cause. We concluded that although statistical studies are unlikely to be significantly affected by the choice of catalog, it would be desirable to have one consistent catalog accepted by the general exoplanet community as a base for exoplanet statistics and comparison with theoretical predictions.

Keywords:

methods: statistical; astronomical data bases: miscellaneous; catalogs; planetary systems; stars: statistics

1. Introduction

Since the detection of ‘51 Peg b’, the first exoplanet around a main sequence star [1], many more planets around other stars have been discovered. Currently, more than 3500 exoplanets have been detected in our galaxy. The diversity of these exoplanets in terms of orbital and physical properties is overwhelming. This diversity challenges planet formation and evolution theories, which were tuned originally to explain the planets in our Solar System [2,3].

Several groups took it upon themselves to label and classify the known exoplanets, and compile catalogs to provide the scientific community with a comprehensive working tool to access the data and perform statistical studies of the exoplanet sample (hereafter, exostatistics). These databases include information about the physical properties of the planets, as well as their host stars. Analysis of this information is constantly improving our understanding of planet formation mechanisms [4], protoplanetary disks [5], and planetary composition and internal structure [6,7]. At the moment, several exoplanet catalogs are available and are used by the community (see [8]).

The most widely-used exoplanet catalogs are:

The Extrasolar Planets Encyclopaedia, www.exoplanet.eu ([9]; hereafter, EU).
The NASA Exoplanet Archive, https://exoplanetarchive.ipac.caltech.edu ([10]; hereafter, ARCHIVE).
The Open Exoplanet Catalogue, www.openexoplanetcatalogue.com/ (hereafter, OPEN).
The Exoplanet Data Explorer, www.exoplanets.org ([11]; hereafter, ORG).

These catalogs include data from ground-based observations as well as space missions such as CoRoT, Kepler, and K2. The available data in these catalogs are comprehensive and include the physical properties of the host star, available information on the planetary physical properties, and the referenced confirmation paper or other mentioned source.

The different teams of each catalog use different criteria to include a planet, which are usually based on the physical properties of the planet or statistical thresholds (see Table 1). Furthermore, each catalog has a different approach to displaying the database. For example, ARCHIVE designates a set of default parameters for each planet. This set is extracted from a single published reference to ensure internal consistency. Additional values published in other papers can only be found by viewing the pages dedicated to individual planets, where multiple sets of parameters are displayed. As a result, the ARCHIVE table provides a self-consistent set of parameters for any system, with missing values when the information is unavailable. On the other hand, EU uses a table displaying information on specific planet extracted from different sources, thus making for a more complete parameter set, though not necessarily self-consistent.

Many exostatistics papers use one of these catalogs as their source of observational data. Nevertheless, so far, the different catalogs have not been compared in terms of their possible differences and potential biases and selection effects that might affect inferred results and conclusions.

In this work, we present a simple statistical comparison between the different exoplanet catalogs. We mainly focus on the EU, ARCHIVE and OPEN catalogs. The database of the ORG catalog contains a single and reliable set of parameters for each planet. However, since it has not been updated for a couple of years now (see website and discussion in Reference [8]), we perform only a coarse comparison. As discussed in Reference [8], there are plans to restart regular updates in the near future.

2. Methods

We have downloaded lists of confirmed planets from the following four catalogs: EU, ARCHIVE, OPEN and ORG on 3 April 2018. As discussed previously, because of the different planetary mass criteria of each catalog (see Table 1), we set

10 M_{J}

as an upper bound for the planetary mass, to strictly exclude any potential brown dwarfs. Thus, we avoided any biases that might emerge from the different mass cutoffs the catalogs use.

The parameters we use in order to compare the catalogs are the stellar mass (

M_{*}

), surface temperature (

T_{e f f}

) and metallicity (

[F e / H]

), and planetary mass (

M_{p}

), radius (

R_{p}

) and orbital period (Period). We chose this set of six parameters because they are the fundamental parameters that are most easily available from current photometric and spectroscopic detection methods [12]. Physically, these parameters provide basic, broad information about the planetary system [6]. The process of deriving the stellar properties involves a collection of literature values for atmospheric properties (temperature, surface gravity, and metallicity) derived from different observational techniques (photometry, spectroscopy, asteroseismology, and exoplanet transits), and then fitting them to stellar isochrones (e.g., References [13,14]). The stellar properties are then used in the derivation of almost all planet properties from radial velocities (RV), transits or transit timing variation (TTV) data. Thus, a reliable estimate of these parameters is crucial for the quality of the planet properties estimate (e.g., Reference [15]).

In the framework of this analysis, we compared separately (and in combination) the planetary properties of the confirmed planets from the listed catalogs. In addition, we performed a comparison between planetary systems by examining the distributions of stellar and planetary properties of the main star and each system’s first detected planet. By doing so, we were able to find the biases of the planet properties emerging from the stellar properties. There was no sense in comparing the stellar parameters of all the confirmed planets since it is possible to unintentionally give more weight to multi-planetary systems when performing the analysis. Table 2 lists the total number of confirmed planets and systems of each catalog as a function of the different stellar and planet properties. The significant variability of those numbers raised the following questions: Does the catalog with the largest number of planets include all the listed objects of the other catalogs? How different is the distribution of planets from two different catalogs?

Most of the statistical work in this analysis is based on comparing the different sets using a two-sample Kolmogorov–Smirnov test (hereafter, KS test, [16]). Broadly speaking, the p-value of the KS test indicates to what extent two samples can be considered to be drawn from the same distribution—a high p-value indicates a good agreement. It is sensitive to differences in both shape and location of the empirical distribution functions of the two samples. A KS test comparison between two catalogs would compare the distributions of all available estimates of one of the planetary properties mentioned above. If a specific object’s quantity is unavailable, we excluded the object from the comparison pertaining to this property. In cases where only lower/upper bounds were available, we set it to the listed value instead of excluding it. Thus, there were some cases in which two catalogs agreed on the planetary nature of a specific object, yet, since one of the catalogs had a missing value for some property, we excluded this planet from the test.

For each pair of catalogs, ‘A’ and ‘B’, we compiled three subsets: (1) The overlap subset, including all objects listed in both catalogs; (2) the unique ‘A’ subset, including only the planets that are unique to catalog ‘A’ and not listed in catalog ‘B’; and (3) the unique ‘B’ subset, including only the planets that are unique to catalog ‘B’ and not listed in catalog ‘A’. We then applied the KS test to compare the three subsets. We performed this analysis for each one of the six parameters separately, as well as a comparison of subsets that include information about the planetary mass, radius and orbital period. Figure 1 describes the methodology applied to compare the different catalogs.

A problem we often encountered in our analysis is the possibility of different planet names and aliases listed in each of the catalogs. The differences are typically caused by spaces, uppercase/lowercase mix-up, and numbers or unique symbols that are used in spelling the stellar and planet names. Yet, in most cases, we found a given object to have two different names in two different catalogs. To overcome this difficulty, we first cross-matched the different objects according to their stellar names using the SIMBAD database, and then identified the different planetary names that populate each system. This practice imparted us with the following unfortunate conclusion: Currently, there is no consensus on an unambiguous method to mark a specific planet and to find its aliases in a convenient way. Therefore, there is no reliable way to cross-match two planetary tables.

3. Results

In general, we found the distributions of planets and systems from the different catalogs to be quite similar. An exception was the ORG catalog, where its missing planetary mass values were derived from some theoretical mass–radius (M–R) relation. See Appendix A for the detailed and complete analysis with the quantitative results (Table A1 and Table A8; Figure A1 and Figure A7). To infer the differences between the catalogs, we compared the derived populations of the overlap and unique subsets for two catalogs at a time, as discussed above. We found the distributions of the overlapping planets between the different catalogs to be similar. However, there were significant differences between the overlap and unique subsets. In Figure 2, a kernel density estimation (KDE) [17] for the probability density functions (PDF) of the mass, radius and orbital period for the different subsets is given. The solid blue curves represent the three overlap subsets when comparing each pair of the EU-ARCHIVE-OPEN catalogs. The brown curves represent the unique subsets, where the dashed, dashed-dotted and dotted curves correspond to the unique EU, ARCHIVE and OPEN subsets, respectively. Evidently, the overlap and unique subsets seem to have very different distributions for the three planet properties. Table 3 presents the KS tests p-values for the different comparisons between KS tests. We found a low p-value for most comparison tests.

As for the property of planetary mass, we noted that a higher density of small mass planets (

M_{p} ~ 10 M_{\oplus}

) populate the unique subsets. We found most of these planets to be TTV planets, detected in multi-planetary systems. Usually, the mass of a planet detected through TTV is not resolved directly, and in practice, is degenerate with the planet’s eccentric orbit [18]. As a result, we have many small TTV planets with an upper mass limit, rather than a nominal value. However, there is no uniform approach to displaying this mass limit. At times, the catalogs choose to omit the value altogether, while at other times it is displayed as an upper limit. However, in many cases, especially with the EU catalog, the mass limit is reported as a valid nominal estimate. Due to this difference in the mass property criteria, we found many small mass planets biasing the unique subset towards smaller masses.

For the planet radii and orbital period distributions, we found the EU and OPEN unique subsets to have a relative higher density of planets in radii values of

R_{p} ~ 1 R_{J}

, and periods of

P e r i o d ~ 1000 days

. Examining the planets that comprise these subsets suggests the reason for this difference is probably related to the different inclusion criteria the catalogs use. Some examples are: Listed planets where the confirmation paper has used some theoretical M–R relations to infer the planetary radius (or mass), some unusual large radii planets suffering strong tidal forces due to their proximity to the parent stars (in short periods), planets with ‘strange’ transiting light curves that make the planetary detection more controversial, etc. As for the higher consistency, in terms of the overlap and unique parts of the ARCHIVE catalog, we found it to be somewhat artificial. By examining the ARCHIVE unique planets, we found over 75% of them to be K2 planets, suggesting the reason for the good agreement with the overlap subsets related to the simple fact: Most of the overlap planets are transiting planets (Kepler and K2), with similar properties as the K2 planets.

Performing a similar analysis for the subset of planets in which all three properties of planetary mass, radius and orbital period were available, we found the unique and overlap subsets were different again (Figure 3), although having slightly higher p-values (Table 4) than the individual property comparisons presented above.

In this case, most of the overlap planets had large masses and radii, with short periods. This in itself is a bias, caused by the combined sensitivity of the RV and transit detection methods [19]. As for the distributions of the unique subsets, we found them to be compromised by a higher number of TTV-detected planets, causing the distributions to be almost uniform in the regime of both small and large planets. Another interesting aspect we observed about these unique subsets was their relative similarity between each other, as displayed quantitative in Table A10 displaying the unique subsets p-values. In spite of this relative similarity, these subsets populate different planets, causing this resemblance to be a product of the TTV biases in the catalogs planet mass inclusion criteria.

After examining the planetary systems according to the stellar properties, we again found similarity between the overlap subsets and significant differences between the unique to overlap subsets (Figure 4). We noted that the unique distributions of stellar mass and surface temperature were biased towards smaller mass and lower temperature stars (K, M-stars). Spectroscopy of small stars (especially M-stars) is challenging because of their intrinsic faintness and high activity [20]. As a result, detection of planets around these stars is purportedly more difficult and somewhat controversial, thus explaining why we found a higher relative number of these objects in the unique subsets. As for the stellar metallicity, we found it to be an unreliable property when analyzing possible biases in the catalog comparison. Although the metallicity is an important property, providing an early record of the chemical composition of the initial protoplanetary disk [21], the planetary detection methods do not rely on this property directly. The catalogs usually reported this property with high relative errors, probably linked to the imprecise derivation that is used to determine the metallicity value. Adding to this the fact that each catalog referred to different stellar survey sources, we found the highest inconsistency between catalogs to be in this parameter (when compared with the other stellar and planetary properties). Nevertheless, we still found the following trends: The unique subsets that included larger planets, especially in terms of the OPEN catalog, was biased towards higher metallicity values, as expected from previous studies [22,23]. On the other hand, small planets, especially listed in the unique ARCHIVE catalog, had a wider distribution of metallicities [24]. As before, the p-values for the different KS tests of stellar properties in the overlap vs. unique subsets are provided in Table 5. The seemingly improved mean p-value results of the OPEN catalog are caused by its relative smaller number of listed stellar property objects.

Although it would be reasonable to detect most biases in the exoplanets catalogs in the extreme ends of the examined distributions, the analysis we have presented used a quantitative approach to study the biases the catalogs possess. We conclude that the biases in the catalogs are caused by: Some missing or obsolete information about a planets’ properties (e.g., the system of ‘Kepler 53 b,c’ is labeled in the ARCHIVE catalog with main planetary masses of

M_{P}

<

18 M_{J}

and

M_{P}

<

15 M_{J}

, for planets ‘b’, ‘c’ respectively [25], however a later paper [26] finds the masses to be much smaller

M_{P} ~ 0.18 M_{J}

,

M_{P} ~ 0.11 M_{J}

, respectively, as listed in both EU and OPEN); model dependent planetary information based on some theoretical assumption and not a direct measurement (e.g., ‘Kepler-446 b,c,d’: Both EU and OPEN display their mass property, however, according to the reference article [27], the given value is only a coarse estimate for the planets’ masses and expected RV semi-amplitude signatures using recent empirically-measured data); roughly estimated measurements, which is especially relevant for stellar parameters; or approximated upper limits as the nominal value (e.g., ‘Kepler-114 c’ is a TTV detected mass planet. The EU catalog displays its mass to be

M_{p} ~ 40 M_{\oplus}

[26]. However, since this measurement is an upper limit, we find it to be inconsistent with the nominal measurement the ARCHIVE displays of

M_{p} ~ 2.8 M_{\oplus}

[28], while the OPEN catalog does not include this planet).

4. Summary & Discussion

Our analysis suggests that, although the main exoplanet catalogs overlap significantly, which results in similar distributions for most astrophysical parameters, the small discrepancies between the subsets highlight some of the catalogs’ biases. These biases can best be seen in the extreme ends of the examined distributions of small mass, long orbital period planets or small stars (less than our sun). These biases do not only result from different numbers of confirmed planets in each catalog, but mainly from contributing factors, related to the data collection policy of each catalog, such as: The process each catalog uses to present and collect the properties of a specific planet, the decision whether to include a controversial object as a planet, or the routine maintenance each catalog team performs to its current listed planets.

Furthermore, in our analysis, we excluded planets with masses larger than

M_{p} > 10 M_{J}

. However the different catalogs use different mass boundaries, which also adds to their different biases. Unfortunately, most of the biases we found are due to the use of various subjective criteria in compiling and maintaining the database. Although all catalogs usually include in their database planets announced in peer-reviewed publications, this should not be the only criterion for a confirmed planet. We suggest that the explosive growth in the known planet population in recent years once again highlights the need for a more rigorous and objective mechanism to tag planets as confirmed. The differences among the catalogs demonstrate that there are conflicting views in the community regarding such criteria. The International Astronomical Union (IAU) is an objective and well-accepted authority by the community, and we therefore suggest that a central catalog could be maintained by Division F (Planetary Systems and Bio-astronomy) of the IAU, and specifically its Commission F2 (Exoplanets and the Solar Systems). Discussions within the commission should resolve the various differences and arrive at a system that can be agreed upon.

After performing this analysis and scrutinizing the different calculated biases, we can carefully make the following statements:

The ARCHIVE catalog is the most up-to-date catalog, with recent Kepler and K2 planet discoveries. It is also the least biased catalog in terms of the interpretation of the mass upper limit, being the true value or the adoption of a model-based value instead of a genuine measurement. Another interesting feature the catalog has is a list of “removed targets” displaying objects that had been listed in the catalog but were removed, suggesting a more rigorous process applied by the ARCHIVE team.
The EU catalog is less restrictive when listing the planetary properties, and therefore could include imprecise estimates. The EU catalog differs the most with the overlap subsets, probably due to its more permissive acceptance criteria and the use of mixed sources of information. However, it has the most wide and large coverage of planets.
The OPEN catalog is somewhere in the middle, between ARCHIVE and EU. In some cases, we find that it resembles the EU subsets, while in others the ARCHIVE. This might not be surprising, given that this catalog is an open-source catalog which is managed and updated by the astronomical community. Although its interface is elegant and user friendly, it has its drawbacks, especially the lack of detection reference and a smaller planet population.

Finally, while each catalog suffers biases, for an exostatistics work, there should not be too much difference among the databases, since the planet population (especially the one compared in this work) is large enough to wash out the small biases and discrepancies. Nevertheless, we find the fusion of catalogs (the overlap subset) a powerful tool as a starting point for increasing the reliability of exostatistics research. A promising platform seems to be the Data & Analysis Center for Exoplanets (DACE) database (https://dace.unige.ch), which includes a linked table to commonly-used exoplanet catalogs. DACE offers an accessible option to check the properties of a specific planet listed in different catalogs, and to compare its properties as they are displayed on the catalogs.

Besides a careful and detailed inspection of each exoplanet related paper confirmation, other useful techniques that can be used to increase the confidence of some exoplanet databases is to check other related parameters such as: Discovery date and update times, which can solve issues of “catch-up” times between catalogs and the rate by which they upload new exoplanets; a measure of the velocity semi-amplitude K parameter can suggest the mass measurement is truly deduced from a RV measurement and not derived from some theoretical model; a TTV flag with reported eccentricity parameter can suggest the reported mass measurement is probably not an upper limit, but some nominal value.

Author Contributions

Formal analysis, D.B.; Investigation, D.B., R.H. and S.Z.; Methodology, D.B., R.H. and S.Z.; Supervision, R.H. and S.Z.; Validation, D.B., R.H. and S.Z.; Writing–original draft, D.B.; Writing–review & editing, D.B., R.H. and S.Z.

Funding

This research was supported by the Ministry of Science, Technology and Space, Israel.

Acknowledgments

We thank the anonymous referees for valuable comments and suggestions. This research has made use of the NASA Exoplanet Archive, which is operated by the California Institute of Technology, under contract with the National Aeronautics and Space Administration under the Exoplanet Exploration Program. This research has made use of the Exoplanet Orbit Database and the Exoplanet Data Explorer at exoplanets.org.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Details of the Comparison

Appendix A.1. Comparison of Planetary Properties

We first compared the different catalogs using a KS test for the planet properties of mass (

M_{P}

), radius (

R_{P}

) and orbital period (

P e r i o d

). Table 2 summarizes the number of available objects in each catalog for the different properties. We present in Table A1 and Figure A1 the p-values of the corresponding KS tests and relevant empirical cumulative distribution functions (CDF) of each subset respectively. The distributions of the planetary properties from the different catalogs were found to be very similar, apart from the ORG catalog, which bases its missing planetary mass values on a theoretical mass-radius relation.

Table A1. p-values of the different catalogs’ KS test for various planetary properties.

Planet Property	EU-ORG	EU-ARCHIVE	ARCHIVE-ORG	EU-OPEN	OPEN-ARCHIVE	OPEN-ORG
$M_{P}$	<10⁻¹⁰	0.12	<10⁻¹⁰	0.30	0.99	<10⁻¹⁰
$R_{P}$	6 × 10⁻⁴	0.21	0.18	0.82	0.03	3 × 10⁻⁴
$P e r i o d$	0.32	0.37	0.96	0.99	0.74	0.73

Figure A1. Cumulative distribution functions (CDFs) of the planetary mass, radius and orbital period. The different colors correspond to the different catalogs: Blue: The Extrasolar Planets Encyclopaedia (EU), red: NASA Exoplanet Archive (ARCHIVE), orange: Open Exoplanet Catalogue (OPEN), and purple: Exoplanet Data Explorer (ORG).

Although the catalogs seemed similar in this comparison, to reveal their true differences, we applied the analysis discussed in Section 2 to compare the overlap and unique subsets between each pair of the EU-ARCHIVE-OPEN catalogs. In all the comparisons we also investigated whether there was any difference between the properties of the overlap parts listed in catalog ‘A’ or ‘B’. In all cases, the p-values were nearly one, suggesting that the two catalogs are comparable. In the following subsections we present the comparison for the different planetary properties.

Appendix A.1.1. Planetary Mass

We summarize the results of the planetary mass comparisons in Table A2 and Table A3. In these tables (and the following tables), each row represents two catalogs we compare, where the columns represent the number of planets in that subset (Table A2) or the p-values of the comparison between each two subsets (Table A3). We abbreviate by ‘UA’ and ‘UB’ the unique subsets ‘A’ and ‘B’ respectively. For instance, the first row in Table A3 represents (by column order from left to right): The p-value of the comparison between the overlap of the EU and OPEN catalogs with the unique part of EU catalog; the p-value of the comparison between the overlap of the EU and OPEN catalogs with the unique part of OPEN catalog; and the p-value of the comparison between the unique EU and OPEN subsets (see also Figure 1, for a graphic explanation).

The CDFs of the different subsets is shown in Figure A2. We found that the number of planets in the overlap subsets was large, yet the population of the unique subsets (in this section and the following to come) were non-negligible. Moreover, although the number of planets in each catalog was different, there was no single catalog that included all the objects from the other catalog. We found that the unique subsets were different from the overlap subsets with the exception of the unique ARCHIVE and OPEN subsets, when comparing them with the overlap subset of the EU. Correspondingly, we also found the unique parts of the ARCHIVE and OPEN to be very similar based on the comparison between these two catalogs.

Table A2. Catalogs’ subsets total number of planets with listed planetary mass.

Catalog	Overlap	UA	UB
EU-OPEN	1188	239	87
OPEN-ARCHIVE	1089	187	186
ARCHIVE-EU	1175	101	252

Table A3. p-values of the different catalogs’ KS test for the planetary mass property.

Catalog	Overlap vs. UA	Overlap vs. UB	UA vs. UB
EU-OPEN	<10⁻¹⁰	0.034	7 × 10⁻³
OPEN-ARCHIVE	<10⁻¹⁰	3 × 10⁻⁸	0.42
ARCHIVE-EU	0.69	<10⁻¹⁰	3.5 × 10⁻⁴

Figure A2. CDF of the planetary mass property for different subsets. (a) EU-OPEN; (b) OPEN-ARCHIVE; (c) ARCHIVE-EU blue: Overlap, red: UA, yellow: UB.

We found that most of the disagreement between the overlap and unique subsets to be concentrated in the region of lower-mass planets. At present, most planets with masses

< 10 M_{\oplus}

, are estimated from TTVs in multi-planetary systems. Usually, the mass of a planet detected through TTV is not resolved directly, and in practice, is degenerate with the planet’s eccentric orbit [18], following knowledge on an upper mass limit, rather than a nominal value. Consequently, we found the catalogs use different inclusion criteria for these kinds of planets with no uniform approach to displaying this mass limit. Sometimes the catalogs chose to omit the value altogether, sometimes to display it as an upper limit, but in many cases, the mass limit was reported as a legitimate nominal estimate. The derived CDF presented in Figure A2 suggest the EU catalog, as opposed to the ARCHIVE and OPEN, displays many of its TTV planets’ mass as the reported upper limit value given from their confirmation paper without any strict filtering. The modest agreement we found between the OPEN and ARCHIVE unique parts was artificial, driven from the similar number of TTV planets the two catalogs choose to include.

Appendix A.1.2. Planetary Radius and Orbital Period

Although we inspected the properties of planetary radius and orbital period separately, we found the result of the comparison analysis between their overlap and unique subsets to be similar. We present the number of planets with reported radii, calculated p-values and CDFs of the different subsets in Table A4 and Table A5 and Figure A3, respectively. We present the number of planetary orbital periods and calculated p-values and CDFs of the different subsets in Table A6 and Table A7 and Figure A4, respectively.

Table A4. Catalogs’ subsets total number of planets with listed planetary radius.

Catalog	Overlap	UA	UB
EU-OPEN	2691	116	40
OPEN-ARCHIVE	2667	64	244
ARCHIVE-EU	2703	208	104

Table A5. p-values of the different catalogs’ KS test for the planetary radius property.

Catalog	Overlap vs. UA	Overlap vs. UB	UA vs. UB
EU-OPEN	<10⁻¹⁰	0.74	0.02
OPEN-ARCHIVE	<10⁻¹⁰	0.03	<10⁻¹⁰
ARCHIVE-EU	0.01	<10⁻¹⁰	<10⁻¹⁰

Figure A3. CDF of the planetary radius property for different subsets. (a) EU-OPEN; (b) OPEN-ARCHIVE; (c) ARCHIVE-EU blue: Overlap, red: UA, yellow: UB.

Table A6. Catalogs’ subsets total number of planets with listed planetary orbital period.

Catalog	Overlap	UA	UB
EU-OPEN	3303	202	77
OPEN-ARCHIVE	3245	135	317
ARCHIVE-EU	3313	249	192

Table A7. p-values of the different catalogs’ KS test for the planetary orbital period property.

Catalog	Overlap vs. UA	Overlap vs. UB	UA vs. UB
EU-OPEN	<10⁻¹⁰	4 × 10⁻⁸	0.31
OPEN-ARCHIVE	10⁻⁹	2 × 10⁻³	5 × 10⁻¹⁰
ARCHIVE-EU	0.02	<10⁻¹⁰	<10⁻¹⁰

Figure A4. CDF of the planetary orbital period property for different subsets. (a) EU-OPEN; (b) OPEN-ARCHIVE; (c) ARCHIVE-EU blue: Overlap, red: UA, yellow: UB.

In the analysis of both properties, we found relative similarity between the overlap and unique subsets of the ARCHIVE catalog and a large difference with the EU and OPEN subsets. It seems that the OPEN and EU unique parts are shifted towards larger radii (

R_{P} > 1 R_{J}

) or longer orbital periods. However, when we examined the list of ARCHIVE unique planets we found over 75% of them to be K2 planets. We found also a similar ratio of Kepler and K2 planets that populate the overlap ARCHIVE subset. These two facts together suggest the reason for the good agreement we found between the ARCHIVE overlap and unique subsets related to the similar properties and biases that emerged from the transiting detected Kepler and K2 planets: Relative small radii (

R_{P} < 4 R_{\oplus}

) and short orbital period (

P e r i o d < 100

days) [29]. Other differences we found between the listed subsets are: Listed planets where the confirmation paper has used some theoretical M–R relations to infer the planetary radius (or mass), some unusual large radii planets suffering strong tidal forces due to their proximity to the parent stars (in short periods), planets with a ‘strange’ transiting light curves that make the planetary detection more controversial, etc.

We conclude that the reason for the disagreement between the catalogs radii and orbital period distributions is derived from: (1) Different criteria for including planets in the catalogs that do not emerge directly from the criteria presented in Table 1; (2) Different update and confirmation processes for new candidates.

Appendix A.1.3. Planetary Mass-Radius-Period (MRP)

In this subsection we compare the total number of planets in each catalog for a subset in which all three properties of planetary mass, radius and orbital period are available. Information of these three physical properties can provide important constraints for planetary characterization and formation models [7]. Most of the current MRP planets were detected with both RV and transit methods while, as of today, most confirmed mass-radius planets are not TTV detected [30]. Consequently, we expect to detect especially large mass-radius, close orbit planets (Hot-Jupiters) around bright solar-like stars. As before, we started by analyzing the MRP distributions of the different catalogs (including the ORG catalog). We present in Table A8 and Figure A5 the p-values of the corresponding KS tests and relevant empirical cumulative distribution functions (CDF) of each subset respectively.

Table A8. Same as Table A1, but for planets with combined information of planetary properties.

Planet Property	EU-ORG	EU-ARCHIVE	ARCHIVE-ORG	EU-OPEN	OPEN-ARCHIVE	OPEN-ORG
$M_{P}$	<10⁻¹⁰	0.73	<10⁻¹⁰	0.69	0.92	<10⁻¹⁰
$R_{P}$	<10⁻¹⁰	0.98	<10⁻¹⁰	0.21	0.24	<10⁻¹⁰
$P e r i o d$	<10⁻¹⁰	0.99	<10⁻¹⁰	0.81	0.58	<10⁻¹⁰

Figure A5. Same as Figure A1, but for planets with combined information of planetary properties.

Similar with the overall catalogs comparison, excluding the ORG catalog with its added theoretical supplement, all three other catalogs agreed on their distributions. We noted that as expected, the distributions were different than those displayed for the separate properties (see Figure A1), with indeed a higher number of planets in the large mass-radius and small period regimes.

By comparing the total numbers, p-values and distributions of the catalogs subsets (Table A9 and Table A10 and Figure A6), we found most of the disagreements between the overlap and unique subsets to be evident in the small planetary regime of TTVs’ detected planets. We found the unique subsets, and especially the planetary masses distributions, to be similar between the catalogs, although the listed planets are different. We conclude that most of the disagreement with the planets of listed MRP comes especially from the different approaches each catalog uses for the confirmed TTV planets.

Table A9. Catalogs’ subsets total number of planets with listed mass-radius-period (MRP) properties.

Catalog	Overlap	UA	UB
EU-OPEN	520	135	36
OPEN-ARCHIVE	465	91	110
ARCHIVE-EU	528	47	127

Table A10. p-values of the different catalogs’ KS test for the planetary MRP properties.

Catalog	Planetary Property	Overlap vs. UA	Overlap vs. UB	UA vs. UB
	$M_{P}$	2 × 10⁻⁸	6 × 10⁻³	0.49
EU-OPEN	$R_{P}$	<10⁻¹⁰	2 × 10⁻⁷	0.46
	$P e r i o d$	8 × 10⁻⁶	10⁻³	0.23
	$M_{P}$	6 × 10⁻⁵	3 × 10⁻⁵	0.34
OPEN-ARCHIVE	$R_{P}$	7 × 10⁻⁵	<10⁻¹⁰	2 × 10⁻³
	$P e r i o d$	6 × 10⁻⁴	3 × 10⁻¹¹	0.05
	$M_{P}$	0.02	3 × 10⁻⁷	0.51
ARCHIVE-EU	$R_{P}$	3 × 10⁻⁷	8 × 10⁻⁷	0.02
	$P e r i o d$	8 × 10⁻⁴	0.23	0.05

Figure A6. CDF of the MRP properties for different subsets. (a) EU-OPEN; (b) OPEN-ARCHIVE; (c) ARCHIVE-EU blue: Overlap, red: UA, yellow: UB.

Appendix A.2. Planetary Systems

We first compared the different catalogs by using a KS test for the stellar properties of the planetary systems of mass (

M_{*}

), metallicity (

[F e / H

]) and surface temperature (

T_{e f f}

). Table 2 summarizes the number of available objects in each catalog for the different properties. We present in Table A11 and Figure A7 the p-values of the corresponding KS tests and relevant empirical CDFs of each subset respectively. We found the distributions to be very similar between all catalogs. The slightly lower metallicity p-values we found was caused by the large error that follows the metallicity property measurement and does not relate to any different distribution between the catalogs.

Table A11. p-values of the different catalogs’ KS test for various stellar properties.

Stellar Property	EU-ORG	EU-ARCHIVE	ARCHIVE-ORG	EU-OPEN	OPEN-ARCHIVE	OPEN-ORG
$M_{*}$	0.77	0.59	0.56	0.76	0.33	0.93
$[F e / H]$	0.08	0.99	0.03	0.07	0.01	0.28
$T_{e f f}$	0.55	0.99	0.74	0.94	0.89	0.99

Figure A7. CDFs of the stellar mass, metallicity and surface temperature. The different colors correspond to the different catalogs: blue: EU, red: ARCHIVE, orange: OPEN purple: ORG.

Similar with the planetary properties, we next compared the overlap and unique stellar properties subsets between each pair of the EU-ARCHIVE-OPEN catalogs. We performed the following analysis for the stars of each planetary system. As mentioned in Section 2, we also compared the distributions of the first detect planet in each system to better understand the reasons for the possible differences.

Appendix A.2.1. Stellar Mass and Surface Temperature

Finding the stellar mass is an important prior when assessing a planet’s mass by the RV method [31]. We expected to detect most of our overlap planets around F, G and K stars since most of the planetary detection projects and efforts as of today have been dedicated towards searching planets around a solar mass star. We present the number of objects with reported stellar mass (and first planet’s planetary properties), calculated p-values and CDFs of the different subsets in Table A12 and Table A13 and Figure A8 and Figure A9, respectively. We found the unique vs overlap subsets to be different, with higher disagreement in the regime of small-mass stars (especially K and M-stars). The spectrum of M-stars presents a difficulty for measuring, due to its intrinsic faintness and high activity [20]. Consequently, detection of planets around these stars is supposed to be more difficult and somewhat controversial, thus explaining why we found a higher relative number of these objects in the unique subsets.

Table A12. Catalogs’ subsets total number of objects with listed stellar mass and planetary properties fractions.

Catalog	Stellar/Planet Property	Overlap	UA	UB
EU-OPEN	$M_{*}$	2306	185	157
	$M_{P}$	904	122	60
	$R_{P}$	1867	105	129
	$P e r i o d$	2268	159	157
OPEN-ARCHIVE	$M_{*}$	2290	173	282
	$M_{P}$	877	73	101
	$R_{P}$	1833	141	218
	$P e r i o d$	2253	171	263
ARCHIVE-EU	ARCHIVE-EU	$M_{*}$	2348	224	143
		$M_{P}$	1876	175	72
		$R_{P}$	894	84	113
		$P e r i o d$	2307	209	119

Table A13. Stellar mass and planet properties KS test p-values for the different catalogs.

Catalog	Stellar/Planet Property	Overlap vs. UA	Overlap vs. UB	UA vs. UB
EU-OPEN	$M_{*}$	<10⁻¹⁰	0.06	8 × 10⁻⁴
	$M_{P}$	6 × 10⁻³	0.11	0.77
	$R_{P}$	2 × 10⁻³	9 × 10⁻³	1 × 10⁻⁵
	$P e r i o d$	0.02	2 × 10⁻⁸	8 × 10⁻⁶
OPEN-ARCHIVE	$M_{*}$	0.12	<10⁻¹⁰	0.01
	$M_{P}$	0.44	0.16	0.46
	$R_{P}$	0.11	0.19	0.05
	$P e r i o d$	0.014	0.04	0.36
ARCHIVE-EU	$M_{*}$	2 × 10⁻⁶	5 × 10⁻⁷	0.05
	$M_{P}$	0.31	0.14	0.99
	$R_{P}$	0.22	<10⁻¹⁰	<10⁻¹⁰
	$P e r i o d$	2 × 10⁻³	2 × 10⁻⁸	8 × 10⁻⁶

Figure A8. CDF of the stellar mass property for different subsets. (a) EU-OPEN; (b) OPEN-ARCHIVE; (c) ARCHIVE-EU blue: Overlap, red: UA, yellow: UB.

When comparing the planetary properties of the first detected planets in these systems, we observed a bias towards small planets that were probably easier to detect around lower mass stars. We also found a small fraction of long orbital period planets (

P e r i o d ~ 1000 days

) derived from direct imaging of large, high mass, semi-major axis objects around a small mass star or brown dwarf (especially evident in the EU subsets).

Examining the stellar surface temperature (not displayed here), we found the unique vs. overlap subsets analysis to be analogous to that of the stellar mass analysis. This was no surprise, especially because of the well-known correlation the two properties possess according to some mass–luminosity relation [32]. For this reason, we have chosen not to elaborate about it here.

Figure A9. CDF of the planetary properties for different subsets of the first detected planet systems with reported stellar mass property. (a) EU-OPEN; (b) OPEN-ARCHIVE; (c) ARCHIVE-EU blue: Overlap, red: UA, yellow: UB.

Appendix A.2.2. Stellar Metallicity

We found the catalogs usually reported the metallicity property with high relative errors, probably linked to imprecise derivation that is used to determine the metallicity value. Combined with the different survey sources the exoplanet catalogs choose to present in their sites, we consequently found the highest inconsistency between catalogs to be with this parameter (as compared with the other stellar and planetary properties). We present the number of objects with reported stellar metallicity (and first planet’s planetary properties), calculated p-values and CDFs of the different subsets in Table A14 and Table A15 and Figure A10 and Figure A11, respectively. We found the unique vs. overlap subsets to be yet again different. While the overlap subsets have a clear peak around solar metallicity, the unique subsets have a wider range of metallicities values. Combining information we acquired from the system’s first detected planet, we noted the unique OPEN subset to be populate with a relatively low number of small planets: The OPEN catalog lists fewer Kepler planets with their stellar metallicity, and hence they automatically move to the opposite unique subset. We found the unique subsets of the ARCHIVE catalog to be especially distributed with a wide variance coming from its population of many K2 and Kepler small radii planets [24].

Table A14. Catalogs’ subsets total number of objects with listed stellar metallicity and planetary properties fractions.

Catalog	Stellar/Planet Property	Overlap	UA	UB
EU-OPEN	$[F e / H]$	2035	409	53
	$M_{P}$	798	97	48
	$R_{P}$	1654	103	24
	$P e r i o d$	2034	407	52
OPEN-ARCHIVE	$[F e / H]$	1937	151	487
	$M_{P}$	691	137	108
	$R_{P}$	1554	103	450
	$P e r i o d$	1936	150	486
ARCHIVE-EU	$[F e / H]$	2266	158	178
	$M_{P}$	732	67	144
	$R_{P}$	1871	133	134
	$P e r i o d$	2264	158	176

Table A15. Stellar metallicity and planet properties KS test p-values for the different catalogs.

Catalog	Stellar/ Planet Property	Overlap vs. UA	Overlap vs. UB	UA vs. UB
EU-OPEN	$[F e / H]$	<10⁻¹⁰	0.04	9 × 10⁻⁴
	$M_{P}$	5 × 10⁻⁵	0.24	0.11
	$R_{P}$	2 × 10⁻⁶	8 × 10⁻⁴	3 × 10⁻⁶
	$P e r i o d$	<10⁻¹⁰	2 × 10⁻³	10⁻⁵
OPEN-ARCHIVE	$[F e / H]$	0.02	<10⁻¹⁰	3 × 10⁻⁹
	$M_{P}$	0.13	2 × 10⁻⁵	3 × 10⁻⁴
	$R_{P}$	<10⁻¹⁰	1 × 10⁻⁶	< 10⁻¹⁰
	$P e r i o d$	3 × 10⁻⁷	<10⁻¹⁰	7 × 10⁻⁵
ARCHIVE-EU	$[F e / H]$	2 × 10⁻⁵	0.19	10⁻³
	$M_{P}$	0.48	0.12	0.33
	$R_{P}$	0.89	<10⁻¹⁰	<10⁻¹⁰
	$P e r i o d$	0.05	3 × 10⁻⁶	3 × 10⁻³

Figure A10. CDF of the stellar metallicity property for different subsets. (a) EU-OPEN; (b) OPEN-ARCHIVE; (c) ARCHIVE-EU blue: Overlap, red: UA, yellow: UB.

Figure A11. CDF of the planetary properties for different subsets of the first detected planet systems with reported stellar metallicity property. (a) EU-OPEN; (b) OPEN-ARCHIVE; (c) ARCHIVE-EU blue: Overlap, red: UA, yellow: UB.

Appendix A.3. Specific Examples of Differences between the Catalogs

Below, we present some of the examples we have come across during this analysis. These examples emphasize some of the differences between the catalogs reported planetary and stellar properties:

(I): Planetary mass inconsistency: ‘Kepler-114 c’ is a TTV detected mass planet. The EU catalog displays its mass to be $M_{p} ~ 40 M_{\oplus}$ [26]. However, since this measurement is an upper limit, we found it to be inconsistent with the nominal measurement the ARCHIVE displays of $M_{p} ~ 2.8 M_{\oplus}$ [28], while the OPEN catalog does not include this planet; The system of ‘Kepler 53 b,c’ is labeled in the ARCHIVE catalog with main planetary mass of $M_{P}$ < $18 M_{J}$ and $M_{P}$ < $15 M_{J}$ , for planets ‘b’, ‘c’ respectively [25], however a later paper [26] reports the masses to be much smaller, $M_{P} ~ 0.18 M_{J}$ , $M_{P} ~ 0.11 M_{J}$ , respectively, as listed in both EU and OPEN. Both EU and OPEN display the mass properties of ‘Kepler-446 b,c,d’, however according to the reference article [27], the given value is only a coarse estimate for the planets masses and expected semi-amplitudes radial velocities signatures using recent empirically-measured; ‘Hd181720 b’ is an object that is listed in the OPEN unique with a mass of $M_{p} ~ 0.37 M_{J}$ . On the other hand, the EU catalog lists its mass to be $M_{p} ~ 12.12 M_{J}$ (and because of a planet mass larger than our $10 M_{J}$ cutoff, dropped from our analysis). The explanation for this difference is referred in EU to be related with the planetary inclination, measured by astrometry, to be small ( $i ~ 1.75 °$ , [33]), resulting indeed with a $M_{P} \cdot s i n i ~ 0.37 M_{J}$ that was set to be the planet mass in the OPEN catalog.
(II): Planetary radius inconsistency: ‘CoRoT-21 b’ is a $R_{p} ~ 1.3 R_{J}$ planet in short orbit ( $P e r i o d ~ 2.7$ days) which exchanges extreme tidal forces with its parent star [34], and is listed in the EU and OPEN catalogs only; On the other hand, the radius of ‘HD 219134 d’ with a bottom radius limit $> 1.6 R_{\oplus}$ [35] is listed only on the ARCHIVE catalog.
(III): Planetary orbital period inconsistency: ‘51 Eri b’ is an imaged astrometric giant planet with an assumed orbital period of period~14965 days (41 years) listed on both the EU and OPEN catalogs [36], but with missing information in the ARCHIVE, displaying only the semi-major axis measurement [37]. Another difference between the catalogs for this planet is with its reported planetary mass: $M_{p} ~ 2 M_{J}$ (OPEN and ARCHIVE) $M_{p} ~ 9 M_{J}$ (EU); ’Kepler-37 e’ is a TTV detected planet reported only on the ARCHIVE and OPEN catalogs with no extra reported information except to each period of period~51.19 days [26].
(IV): Stellar mass inconsistency: ‘HIP 57050 b’ [38] is listed in all three catalogs but with a missing stellar mass measurement ( $M_{*} ~ 0.34 M_{☉}$ ) in the ARCHIVE catalog; ‘OGLE-2014-BLG-1722 b’ is a $M_{p} ~ 55.3 M_{\oplus}$ planet detected by the Microlensing method around a $M_{*} ~ 0.4 M_{☉}$ star (two-planet system), listed only on the EU catalog [39].

References

Mayor, M.; Queloz, D. A Jupiter-mass companion to a solar-type star. Nature 1995, 378, 355–359. [Google Scholar] [CrossRef]
Masset, F.S.; Papaloizou, J.C.B. Runaway Migration and the Formation of Hot Jupiters. Astrophys. J. 2003, 588, 494–508. [Google Scholar] [CrossRef]
Mordasini, C.; Alibert, Y.; Klahr, H.; Henning, T. Characterization of exoplanets from their formation I. Models of combined planet formation and evolution. Astron. Astrophys. 2012, 547, 111. [Google Scholar] [CrossRef]
Ribas, I.; Miralda-Escudé, J. The eccentricity-mass distribution of exoplanets: Signatures of different formation mechanisms? Astron. Astrophys. 2007, 464, 779–785. [Google Scholar] [CrossRef]
Williams, J.P.; Cieza, L.A. Protoplanetary Disks and Their Evolution. Annu. Rev. Astron. Astrophys. 2011, 49, 67–117. [Google Scholar] [CrossRef]
Udry, S.; Santos, N.C. Statistical Properties of Exoplanets. Annu. Rev. Astron. Astrophys. 2007, 45, 397–439. [Google Scholar] [CrossRef]
Helled, R.; Bodenheimer, P.; Podolak, M.; Boley, A.; Meru, F.; Nayakshin, S.; Fortney, J.J.; Mayer, L.; Alibert, Y.; Boss, A.P. Giant Planet Formation, Evolution, and Internal Structure. In Protostars and Planets VI; Klessen, S., Dullemond, P., Henning, K., Eds.; University of Arizona Press: Tucson, AZ, USA, 2014. [Google Scholar]
Christiansen, J.L. Exoplanet Catalogues. In Handbook of Exoplanets; Deeg, H.J., Belmonte, J.A., Eds.; Springer: New York, NY, USA, 2018; ISBN 978-3-319-30648-3. [Google Scholar]
Schneider, J.; Dedieu, C.; Sidaner, P.L.; Savalle, R.; Zolotukhin, I. Defining and cataloging exoplanets: The exoplanet.eu database. Astron. Astrophys. 2011, 532, A79. [Google Scholar] [CrossRef]
Akeson, R.L.; Chen, X.; Ciardi, D.; Crane, M.; Good, J.; Harbut, M.; Jackson, E.; Kane, S.R.; Laity, A.C.; Leifer, S.; et al. The NASA Exoplanet Archive: Data and Tools for Exoplanet Research. Publ. Astron. Soc. Pac. 2013, 125, 989–999. [Google Scholar] [CrossRef]
Wright, J.T.; Fakhouri, O.; Marcy, G.W.; Han, E.; Feng, Y.; Johnson, J.A.; Howard, A.W.; Fischer, D.A.; Valenti, J.A.; Anderson, J.; et al. The Exoplanet Orbit Database. Publ. Astron. Soc. Pac. 2011, 123, 412–422. [Google Scholar] [CrossRef]
Wright, J.T.; Gaudi, B.S. Exoplanet Detection Methods. In Planets, Stars and Stellar Systems; Springer: Dordrecht, The Netherlands, 2013; pp. 489–540. [Google Scholar]
Rowe, J.F.; Bryson, S.T.; Marcy, G.W.; Lissauer, J.J.; Jontof-Hutter, D.; Mullally, F.; Gilliland, R.L.; Issacson, H.; Ford, E.; Howell, S.B.; et al. Validation of Kepler’s Multiple Planet Candidates. III: Light Curve Analysis and Announcement of Hundreds of New Multi-planet Systems. Astrophys. J. 2014, 784, 45. [Google Scholar] [CrossRef]
Huber, C.; Silva Aguirre, V.; Matthews, J.M.; Pinsonneault, M.H.; Gaidos, E.; Garcia, R.A.; Hekker, S.; Huber, D.; García, R.A.; Mathur, S.; et al. Revised Stellar Properties Of Kepler Targets For The Quarter 1-16 Transit Detection Run Detailed Terms. Astrophys. J. 2014, 211. [Google Scholar] [CrossRef]
Johnson, J.A.; Petigura, E.A.; Fulton, B.J.; Marcy, G.W.; Howard, A.W.; Isaacson, H.; Hebb, L.; Cargile, P.A.; Morton, T.D.; Weiss, L.M.; et al. The California-Kepler Survey. II. Precise Physical Properties of 2025 Kepler Planets and Their Host Stars. Astrophys. J. 2017, 1–13. [Google Scholar] [CrossRef]
Massey, F.J. The Kolmogorov-Smirnov Test for Goodness of Fit. J. Am. Stat. Assoc. 1951, 46, 68–78. [Google Scholar] [CrossRef]
Silverman, B.W. Density estimation for statistics and data analysis. In Monographs on Statistics and Applied Probability; Chapman and Hall: London, UK, 1986. [Google Scholar]
Lithwick, Y.; Xie, J.; Wu, Y. Extracting Planet Mass and Eccentricity from TTV data. Astrophys. J. 2012, 761, 122. [Google Scholar] [CrossRef]
Steffen, J.H. Sensitivity bias in the mass-radius distribution from transit timing variations and radial velocity measurements. MNRAS 2016, 457, 4384–4392. [Google Scholar] [CrossRef]
Gao, P.; Plavchan, P.; Gagné, J.; Furlan, E.; Bottom, M.; Anglada-Escudé, G.; White, R.; Davison, C.; Beichman, C.; Brinkworth, C.; et al. Retrieval of Precise Radial Velocities from Near-Infrared High Resolution Spectra of Low Mass Stars. Publ. Astron. Soc. Pac. 2016, 128, 104501. [Google Scholar] [CrossRef]
Boss, A.P. Stellar Metallicity and the Formation of Extrasolar Gas Giant Planets. Astrophys. J. 2002, 567, L149–L153. [Google Scholar] [CrossRef]
Fischer, D.A.; Valenti, J. The Planet-Metallicity Correlation. Astrophys. J. 2005, 622, 1102–1117. [Google Scholar] [CrossRef]
Sousa Sérgio, G.; Santos Nuno, C.; Israelian, G.; Mayor, M.; Stephane, U. Spectroscopic stellar parameters for 582 FGK stars in the HARPS volume-limited sample-Revising the metallicity-planet correlation. Astron. Astrophys. 2011, 533, A141. [Google Scholar] [CrossRef]
Buchhave, L.A.; Latham, D.W.; Johansen, A.; Bizzarro, M.; Torres, G.; Rowe, J.F.; Batalha, N.M.; Borucki, W.J.; Brugamyer, E.; Caldwell, C.; et al. An abundance of small exoplanets around stars with a wide range of metallicities. Nature 2012, 486, 375–377. [Google Scholar] [CrossRef] [PubMed]
Ford, E.B.; Fabrycky, D.C.; Steffen, J.H.; Carter, J.A.; Fressin, F.; Holman, M.J.; Lissauer, J.J.; Moorhead, A.V.; Morehead, R.C.; Ragozzine, D.; et al. Transit Timing Observations from Kepler: II. Confirmation of two multiplanet systems via a non-parametric correlation. Astrophys. J. 2012, 750, 113. [Google Scholar]
Hadden, S.; Lithwick, Y. Densities and eccentricities of 139 Kepler planets from transit time variations. Astrophys. J. 2014, 787, 80. [Google Scholar] [CrossRef]
Muirhead, P.S.; Mann, A.W.; Vanderburg, A.; Morton, T.D.; Kraus, A.; Ireland, M.; Swift, J.J.; Feiden, G.A.; Gaidos, E.; Gazak, J.Z. Kepler-445, Kepler-446 and the Occurrence of Compact Multiples Orbiting Mid-M Dwarf Stars. In AAS/Division for Extreme Solar Systems Abstracts; American Astronomical Society: Washington, DC, USA, 2015; Volume 3. [Google Scholar]
Xie, J.W. Transit timing variation of near-resonance planetary pairs. II. Confirmation of 30 planets in 15 multiple planet systems. Astrophys. J. Suppl. Ser. 2013, 210, 25. [Google Scholar] [CrossRef]
Batalha, N.M. Exploring exoplanet populations with NASA’s Kepler Mission. Proc. Nati. Acad. Sci. USA 2014, 111, 12647–12654. [Google Scholar] [CrossRef] [PubMed]
Bashi, D.; Helled, R.; Zucker, S.; Mordasini, C. Two empirical regimes of the planetary mass-radius relation. Astron. Astrophys. 2017, 604, A83. [Google Scholar] [CrossRef]
Lovis, C.; Fischer, D.A. Radial Velocity. In Radial Velocity Techniques for Exoplanets; Seager, S., Ed.; University of Arizona Press: Tucson, AZ, USA, 2011. [Google Scholar]
Kuiper, G.P. The Empirical Mass-Luminosity Relation. Astrophys. J. 1938, 88, 472. [Google Scholar] [CrossRef]
Reffert, S.; Quirrenbach, A. Mass constraints on substellar companion candidates from the re-reduced Hipparcos intermediate astrometric data: Nine confirmed planets and two confirmed brown dwarfs. Astron. Astrophys. 2011, 527, A140. [Google Scholar] [CrossRef]
Pätzold, M.; Endl, M.; Csizmadia, S.; Gandolfi, D.; Jorda, L.; Grziwa, S.; Carone, L.; Pasternacki, T.; Aigrain, S.; Almenara, J.M.; et al. Transiting exoplanets from the CoRoT space mission XXIII. CoRoT-21b: A doomed large Jupiter around a faint subgiant star. Astron. Astrophys. 2012, 545, 6. [Google Scholar] [CrossRef]
Gillon, M.; Demory, B.-O.; Van Grootel, V.; Motalebi, F.; Lovis, C.; Cameron, A.C.; Charbonneau, D.; Latham, D.; Molinari, E.; Pepe, F.A.; et al. Two massive rocky planets transiting a K-dwarf 6.5 parsecs away. Nat. Astron. 2017, 1. [Google Scholar] [CrossRef]
De Rosa, R.J.; Nielsen, E.L.; Blunt, S.C.; Graham, J.R.; Konopacky, Q.M.; Marois, C.; Pueyo, L.; Rameau, J.; Ryan, D.M.; Wang, J.J.; et al. Astrometric Confirmation and Preliminary Orbital Parameters of the Young Exoplanet 51 Eridani b with the Gemini Planet Imager. Astrophys. J. 2015, 814. [Google Scholar] [CrossRef]
Macintosh, B.; Graham, J.R.; Barman, T.; De Rosa, R.J.; Konopacky, Q.; Marley, M.S.; Marois, C.; Nielsen, E.L.; Pueyo, L.; Rajan, A.; et al. Discovery and spectroscopy of the young Jovian planet 51 Eri b with the Gemini Planet Imager. Sci. Express 2015, 350, 64–67. [Google Scholar] [CrossRef] [PubMed]
Haghighipour, N.; Vogt, S.S.; Paul Butler, R.; Rivera, E.J.; Laughlin, G.; Meschiari, S.; Henry, G.W. The Lick-Carnegie Exoplanet Survey: A Saturn-Mass Planet in the Habitable Zone of the Nearby M4V Star HIP 57050. Astrophys. J. 2010, 715, 271–276. [Google Scholar] [CrossRef]
Suzuki, D.; Bennett, D.P.; Udalski, A.; Bond, I.A.; Sumi, T.; Han, C.; Abe, F.; Asakura, Y.; Barry, R.K.; Bhattacharya, A.; et al. A Likely Detection of a Two-Planet System in a Low Magnification Microlensing Event. Astrophys. J. 2018, 155, 14. [Google Scholar] [CrossRef]

Figure 1. Venn diagram describing the different compared subsets. Catalogs ‘A’, ‘B’ represents two out of the three compared catalogs. For each planet or stellar property we used a Kolmogorov–Smirnov (KS) test to compare two out of the three derived subsets: ‘Overlap’, ‘unique A’ and ‘unique B’.

Figure 2. Each catalogs’ subsets probability density functions (PDFs) of planetary mass, radius and orbital period properties, using the kernel density estimation (KDE), where overlap is represented by blue curves and unique by brown curves. In each plot three overlap subsets and six unique subsets may be noted (see text for further details).

Figure 3. Same as Figure 2, but for the subset of planets in which all three properties of planetary mass, radius and orbital period were available.

Figure 4. Catalogs’ subsets PDF’s of stellar mass, metallicity and surface temperature properties using KDE, where: overlap is given in blue and unique is given in brown. In each plot, three overlap subsets and six unique subsets may be noted (see text for further details).

Table 1. The exoplanets catalogs inclusion criteria.

Catalog	Object Mass Criterion	Reference Criteria
EU	< $60 M_{J} \pm 1 σ$	Published or submitted to peer-reviewed journals or announced in conferences by professional astronomers.
ARCHIVE	< $30 M_{J}$	Accepted, refereed paper.
OPEN	Not listed	Open-source.
ORG	< $24 M_{J}$	Carefully vetted, peer-reviewed journal papers.

Table 2. The total number of listed planets with different stellar and planet properties for each exoplanet catalog.

Number of Objects	EU	ARCHIVE	OPEN	ORG
All planets	3757	3708	3524	2950
With planetary mass	1327	1276	1275	2917
With planetary radius	2807	2912	2731	2438
With planetary orbital period	3505	3564	3371	2920
With planetary mass, radius and orbital period	655	576	558	2436
All systems	2652	2693	2556	2200
With stellar mass $M_{*}$	2465	2571	2476	1929
With stellar metallicity $[F e / H]$	2444	2581	2088	1877
With stellar surface temperature $T_{e f f}$	2503	2424	2469	2162

Table 3. The p-values from KS tests wherein the distributions of the overlap vs. unique subsets of planetary mass, radius and orbital period properties are compared. The three numbers in brackets in each cell represent the p-value according to this order: (

M_{P}, R_{P}, P e r i o d

).

Table 3. The p-values from KS tests wherein the distributions of the overlap vs. unique subsets of planetary mass, radius and orbital period properties are compared. The three numbers in brackets in each cell represent the p-value according to this order: (

M_{P}, R_{P}, P e r i o d

).

	Overlap OPEN-ARCHIVE	Overlap EU-OPEN	Overlap ARCHIVE-EU
Unique EU	---	(<10⁻¹⁰, <10⁻¹⁰, <10⁻¹⁰)	(<10⁻¹⁰, <10⁻¹⁰, <10⁻¹⁰)
Unique ARCHIVE	(3 × 10⁻⁸, 0.03, 2 × 10⁻³)	---	(0.69, 0.01, 0.02)
Unique OPEN	(<10⁻¹⁰, <10⁻¹⁰, 10⁻⁹)	(0.034, 0.74, 4 × 10⁻⁸)	---

Table 4. Same as Table 3, but for the subset of planets in which all three properties of planetary mass, radius and orbital period are available.

	Overlap OPEN-ARCHIVE	Overlap EU-OPEN	Overlap ARCHIVE-EU
Unique EU	---	(2 × 10⁻⁸, <10⁻¹⁰, 8 × 10⁻⁶)	(3 × 10⁻⁷, 8 × 10⁻⁷, 0.23)
Unique ARCHIVE	(3 × 10⁻⁵, <10⁻¹⁰, 3 × 10⁻¹¹)	---	(0.02, 3 × 10⁻⁷, 8 × 10⁻⁴)
Unique OPEN	(6 × 10⁻⁵, 7 × 10⁻⁵, 6 × 10⁻⁴)	(6 × 10⁻³, 2 × 10⁻⁷, 10⁻³)	---

Table 5. The p-values obtained from KS tests comparing the distributions of the overlap vs. unique subsets of stellar mass, metallicity and surface temperature properties. The three numbers in brackets of each cell represent the p-value according to this order: (

M_{*}, [F e / H], T_{*}

).

Table 5. The p-values obtained from KS tests comparing the distributions of the overlap vs. unique subsets of stellar mass, metallicity and surface temperature properties. The three numbers in brackets of each cell represent the p-value according to this order: (

M_{*}, [F e / H], T_{*}

).

	Overlap OPEN-ARCHIVE	Overlap EU-OPEN	Overlap ARCHIVE-EU
Unique EU	---	(<10⁻¹⁰, <10⁻¹⁰, 9 × 10⁻¹⁰)	(5 × 10⁻⁷, 0.19, 0.02)
Unique ARCHIVE	(<10⁻¹⁰, <10⁻¹⁰, <10⁻¹⁰)	---	(2 × 10⁻⁶, 2 × 10⁻⁵, 2 × 10⁻⁴)
Unique OPEN	(0.12, 0.02, 0.09)	(0.06, 0.04, 9 × 10⁻⁴)	---

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bashi, D.; Helled, R.; Zucker, S. A Quantitative Comparison of Exoplanet Catalogs. Geosciences 2018, 8, 325. https://doi.org/10.3390/geosciences8090325

AMA Style

Bashi D, Helled R, Zucker S. A Quantitative Comparison of Exoplanet Catalogs. Geosciences. 2018; 8(9):325. https://doi.org/10.3390/geosciences8090325

Chicago/Turabian Style

Bashi, Dolev, Ravit Helled, and Shay Zucker. 2018. "A Quantitative Comparison of Exoplanet Catalogs" Geosciences 8, no. 9: 325. https://doi.org/10.3390/geosciences8090325

APA Style

Bashi, D., Helled, R., & Zucker, S. (2018). A Quantitative Comparison of Exoplanet Catalogs. Geosciences, 8(9), 325. https://doi.org/10.3390/geosciences8090325

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Quantitative Comparison of Exoplanet Catalogs

Abstract

1. Introduction

2. Methods

3. Results

4. Summary & Discussion

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Appendix A. Details of the Comparison

Appendix A.1. Comparison of Planetary Properties

Appendix A.1.1. Planetary Mass

Appendix A.1.2. Planetary Radius and Orbital Period

Appendix A.1.3. Planetary Mass-Radius-Period (MRP)

Appendix A.2. Planetary Systems

Appendix A.2.1. Stellar Mass and Surface Temperature

Appendix A.2.2. Stellar Metallicity

Appendix A.3. Specific Examples of Differences between the Catalogs

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI