1. Introduction
Among the spectral types B, A, and F, we can find stars that show peculiarities in their spectra. These chemically peculiar (CP) stars are a well-established phenomenon since their discovery almost 130 years ago [
1].
Classically, one makes the distinction between the four types defined by [
2] with some additions made by [
3].
First, CP1 (Am/Fm) stars show enhanced lines of iron and other metals, whereas calcium, for example, is underabundant. The fact that many of those stars are found in binary systems [
4] and thus, are rotating rather slowly (
km
) leads to the so-called
chemical separation, where some elements get driven outwards by radiation and others start to settle down.
The second subgroup is CP2 or Bp/Ap or magnetic (mCP) stars. With (strong) magnetic fields up to tens of kG [
5], they exhibit variability in their spectra and photometric time series. This is explained by the
Oblique Rotator Theory [
6]. Currently, we are unable to explain how such strong magnetic fields can exist in those stars, given that convection is only present in their core. Nevertheless, there have been attempts made by [
7], who argue that the magnetic field is somehow frozen into the star, and by [
8], where the authors suggest a process of the core going from radiative to convective states during the pre-main-sequence phase.
In the group of CP3 (HgMn) stars, we find overabundances of mercury and/or manganese. Ref. [
9] explains these peculiarities with similar processes as in CP1 stars, suggesting a common evolution of these two objects.
The fourth group, defined by [
2], can be seen as an extension of CP4 groups to hotter (early to mid-B-type) stars. They show similar characteristics in their variability [
10]. In contrast to CP2 stars, however, the CP2 stars show primarily helium peculiarities in the form of either over- (He-strong) or underabundances (He-Weak).
In addition to [
2], a new group of CP stars has been discovered: the Lambda Bootis (Lam Boo) stars. These are stars with significant underabundances of almost all heavier elements. The anomalies can be explained by diffusion and mass-loss theories [
11]. Another proposed theory by [
12] proposes an interplay of accretion and diffusion, where the star passes through a metal-poor interstellar cloud. However, [
13] found little to no evidence for this mechanism.
A poorly researched question is the pre-main-sequence (PMS) evolution of those stars. There have been a few CP stars with characteristics of PMS stars reported in the literature [
14].
Stars in their PMS phase are characterised by accretion onto the protostar while it contracts and heats up. Commonly, two subgroups are mentioned in the literature that differ by the mass range of the protostar. Lower masses (up to ∼
) are called T Tauri stars, while the ones above this threshold are put into the category of Herbig Ae/Be stars. Ref. [
15] gives an extensive overview of how those objects are characterised and how they behave. In short, one can observe them by looking for emission lines (mostly hydrogen) as a sign of the accretion phase in those stars and via infrared excess in their spectral energy distribution (SED).
On the other hand, we have open clusters (OCs) and stellar associations. These objects are invaluable tools for various applications in astronomy, particularly in stellar astrophysics. Astronomers use them to accurately determine stellar ages using various methods, such as isochrone fitting, kinematic studies, and the lithium depletion boundary (LDB) technique, among others. With the advent of more precise and large datasets from the
mission [
16,
17], we now can use those astrometric measurements to accurately determine which stars belong to clusters and which do not. Recent examples working on open clusters using this dataset are [
18,
19].
Stellar associations are regions of low stellar density (
, [
20,
21]) which are categorised into three groups [
22,
23]:
OB associations: Mostly including hot young stars of predominantly O and B type stars (e.g., the Perseus OB1 association);
T associations: Dominated by low-mass T Tauri stars in their formation stage (e.g., Chamaeleon 1);
R associations: Stellar groups containing reflection nebulae (e.g., Monocerotis R2).
The ages of stellar associations range from a few Myr to ∼
Myr [
21] and thus are well-suited to study stars right after their formation and arrival on the main sequence [
24]. The formation of associations is believed to have been due to one of two scenarios [
24]: The first one states that the stars are formed where they are observed as part of a hierarchical star formation process, where over-densities in the interstellar medium can form at any scale, anywhere. The other scenario proposes that stars are formed in clusters that undergo processes such as UV radiation and stellar winds, leading to the cluster no longer being in virial equilibrium and resulting in its dispersal.
This paper aims to look for CP stars in open clusters and stellar associations with a special interest in detecting CP candidates that are still in their PMS phase. In
Section 2, we describe the catalogues and data used, and the quality criteria employed to filter these datasets.
Section 3 describes the transformation of the astrometry for the clustering in
Section 4. The latter also describes the process of determining membership in the aggregations, and the process of classifying stars based on their CP nature.
Section 5 provides an overview of how PMS stars are detected, and the last two sections,
Section 7 and
Section 8, present the results, a discussion, and an outlook on possible future work in this field.
3. Astrometric Transformations
After all the quality criteria described above were imposed, some astrometric transformations had to be taken into account for the planned clustering on the datasets.
At first, the parallax and galactic coordinates
l,
b were transformed into 3D Euclidean distances
X,
Y,
Z using a standard transformation:
where
R is simply calculated by taking the inverse of the parallax in mas to get to a distance of pc:
Finally, the proper motions in the galactic frame were transformed into transverse velocities (in km/s) via
This finally resulted in a five-dimensional dataset () that we then used for the subsequent membership analysis of the clusters and associations.
4. Methods
4.1. Clustering
Determining the membership of stars in open clusters and associations is not always a straightforward task, and depending on the data and methods used, one can obtain vastly different results in membership numbers and probabilities. Commonly used methods are Gaussian Mixture Models (GMMs, e.g., [
42]), Unsupervised Photometric Membership Assignment in Stellar Clusters (UPMASK, [
43]) or, more recently, Significant Mode Analysis (SiGMA, [
26]). Other methods try to use a combination of photometry, astrometry, and spectroscopy to determine which stars are members of young associations (e.g., [
44]).
To determine which stars of our list were likely members of open clusters and associations, we used the HDBSCAN algorithm [
45] that was also used by [
19,
25].
HDBSCAN is a clustering algorithm with several possible hyperparameters in its implementation. The most important ones are the following:
min_cluster_size: This is somewhat self-explanatory; it describes how many points should be at a minimum in a cluster. Larger values will result in smaller substructures that get lumped together in a bigger cluster.
min_samples: This describes how dense a cluster should be. Larger values will result in only the densest cores being considered to be clusters.
cluster_selection_method: There are two choices for this parameter:
- -
eom: This stands for “excess of mass” and searches for over-densities in the n-dimensional parameter space, looking for clusters in accordance with the other parameters. It generally favours larger and fewer clusters.
- -
leaf: This lets the user recover finer and smaller clusters in the dataset.
metric: This chooses the distance metric that the algorithm uses to calculate the distance between the points in the input set. Commonly used choices are euclidean, manhattan or malahanobis.
Of course, HDBSCAN is not perfect and has some shortcomings. According to [
19], it tends to be a bit overconfident, meaning that the algorithm assigns more data points to the cluster than there actually are. However, they used an all-sky approach, whereas this work only clusters the stars in the regions around the clusters and associations. This should result in at least the cores of the aggregations to be recovered.
We randomly selected 30% of the data in each cluster region to look for the best hyperparameter combination of the values seen in
Table 1. The different combinations were evaluated using a silhouette score, from which the highest, i.e., the best parameter combination, was chosen for the final clustering on the set. An example of the clustering result can be seen in
Figure 2. In total, the clustering resulted in a list of roughly 462.5 thousand stars that have a high (
) probability of being members of the clusters and associations in question after removing stars that are below the main sequence (i.e., White Dwarfs). From the 430 thousand stars in 2022 clusters and moving groups that resulted from the clustering in and around the catalogue of [
19], around 200 k also belong to the initial catalogue of [
19]. This is a recovery rate of 15%, a result of the difference in the clustering method: Ref. [
19] used an all-sky method, whereas we used the regions around the individual clusters and aggregations. For the catalogue from [
25] we recovered a similar fraction (13% or ∼12,500 stars in 26 associations) due to the same reasons of a different approach to the clustering. This discrepancy in membership, of course, may affect the number/fraction of CP stars detected in these groups.
4.2. Extinction Correction and Final Target Selection
This list, however, is still not sufficient to sort out all the stars that are not in question of being chemically peculiar. Since we wanted to look at stars with
K (or hotter than spectral type around mid-F), the cooler stars had to be removed from the list. Although we also wanted to look at PMS CP stars, we opted for this boundary, since there has been no evidence for PMS CP stars in T Tauri stars that we could find in the literature. There might be other classes of peculiar stars in the lower temperature region, predominately barium stars. These, however, are not in the focus of the current work. All previously discovered PMS CP stars belong to the class of Herbig Be/Ae stars; see [
46] and references therein. For this purpose, we took the StarHorse21 catalogue from [
47]. The authors of this paper compiled measurements from
EDR3, the Two-Micron All-Sky Survey (2MASS, [
48]), the Panoramic Survey Telescope and Rapid Response System (Pan-STARRS1, [
49]), and the Wide-Field Infrared Survey Explorer All-Sky Release (AllWISE, [
50]) to derive extinction values
at
Å, and stellar parameters such as
,
,
, and
-positions with respect to the galactic center. Approximately 333 thousand stars of the clustered sample are included in this catalogue. Out of those, around a third, 105,270 stars, have a temperature higher than the threshold as mentioned above. See
Figure 3 for the CMD of this final target list. We want to emphasise the point that the choice of another catalogue of stellar parameters might result in a different number of stars selected. See [
46] for a discussion of the influence of different catalogues on this matter.
4.3. Photometry
The first method we used for detecting possible CP stars was
photometry. This powerful method, developed by [
51], makes use of the flux depression in stellar spectra around
. First reported by [
52] in the spectrum of HD 221568, this depression has been proven to indicate the presence of a strong magnetic field in mCP stars. The most critical element affected here is iron, although there are also contributions of silicon, chromium, and others [
53]. The method itself is relatively simple. One takes three filters,
to the left (shorter
) of the depression,
right at the depression, and the Strömgren
y filter (see
Figure 4) at the right (longer
) of the flux depression. Then the magnitudes in each filter are measured or calculated synthetically. These magnitudes can then be converted into an index
a that is defined as
This value
a is then compared to the value of a (seemingly) normal star that has a value
to create
:
The colour term corrects for temperature differences in the stars, and the function is then called the "normality line". All stars that are a certain distance (typically, one uses 95% or confidence intervals) above this line are considered to be magnetic stars with a significant flux depression.
Before the synthetic photometry, the list of the clustered stars was matched to the catalogue of the
BP/RP (XP in short) spectra via the
DR3 IDs, of which 64 thousand stars had a spectrum. For each of those, the signal-to-noise ratio (S/N) was calculated using the quick way, described in [
28]. Basically, since the spectra are downloaded as a set of coefficients for each passband, one can estimate the S/N by the
norm of the coefficient vector divided by the
norm of the vector of errors on the coefficients:
The final estimation was done by just taking the mean of the S/N in each passband. All stars with an were considered for the subsequent analysis. This left 26,814 spectra to be analysed using synthetic photometry.
The photometry was done in the same way as described in [
31]. At first, the spectra that initially came in flux units of W
n
were normalised at a common wavelength of
Å. Additionally, they were interpolated using a third-order polynomial technique to match the resolution of the used filters, which have
Å.
Actual astronomical magnitudes in the AB system [
54], for example, for a given filter curve
and a spectrum
are calculated using the following quantity [
55]:
However, since the interest here only lies in the difference of (instrumental) magnitudes, one can take the much more uncomplicated approach where the filter and the spectrum are just multiplied and the values are then summed up:
After we calculated the synthetic magnitudes this way, the normality line of the form
, i.e., the coefficients
needed to be found. For this purpose, we used the Python 3.10 package
PyMC [
56]. This uses the theorem of Bayes
with the input vector
and
to infer the vector of coefficients
via a Monte Carlo process. The resulting normality line (see also
Figure 5) had the following form:
Additionally, PyMC calculated the standard deviation of this fit line, and it gave a value of . All stars with a distance of more than above are considered to be candidate mCP stars, which amounted to 341 individual stars.
4.4. Spectral Classification
Another method to find CP stars is the classification of stellar spectra. The
DR3 IDs of the 105 thousand candidate stars on the upper main sequence were submitted to the ninth data release of LAMOST (
http://www.lamost.org/dr9/, accessed on 8 April 2024), which contained 4143 spectra of this set. Removing all spectra with S/N lower than 50 in the
g-band and, in the case of stars with multiple spectra, keeping the ones with higher S/N left us with 1440 spectra to classify. Those were classified using the MKCLASS code [
57]. This program takes an input spectrum and compares it to a standard library to derive the final spectral type after some iterations. The results are comparable to those made by humans, with only minor deviations in the temperature and luminosity classes. It is also well-suited to classify CP stars as done by [
37], for example.
For the classifications, we used three standard libraries:
libnor36;
libr18;
libr18_225.
Out of the 1440 spectra, 608 individual stars showed CP characteristics in their spectra for at least one of these libraries. This high fraction of roughly 42% shows that either an unusual amount of stars in young open clusters and associations are CP stars or that some stars have been misclassified by one or more standard libraries of MKCLASS. Especially, libr18_225 has a high recovery rate of CP1 stars, and thus, one has to be careful with the classifications.
Of course, one has to be cautious with the spectra of lower quality, although they should still result in reasonable estimations of the spectral type.
4.5. Light Curve Analysis
A volume-limited subset of the stars with
K was also checked for light curves from TESS. This was limited in volume to the list of [
25] because of the large pixel size of TESS (
Section 2) and to minimise blending as much as possible. The light curves of 1022 stars were downloaded using the Python package
eleanor [
58]. For the process of retrieving the time series data, an aperture with a radius of one pixel was used (
Figure 6). The resulting data were cut by 3
to remove outliers and then converted from flux differences to magnitude differences via Pogson’s equation
where
is the mean flux of the observations.
After this process, we used the astropy [
59,
60,
61] implementation of the Lomb–Scargle periodogram [
62,
63] that follows the algorithm described in [
64]. Additionally, one can calculate the false-alarm probability (FAP) using the methods from [
65] to check if the found periods are significant and thus correct. We used a value of
for a period to be significant. This process was done in two frequency regions:
We then classified the light curves based on their frequency spectrum and phase-folded light curve (
Figure 7 and
Figure 8). Generally, we followed the criteria given in [
34] while analysing the light curves with the types of variability taken from the Variable Star Index (VSX, [
66]) and the notation from the General Catalogue of Variable Stars (GCVS, [
67]). The most prominent variability classes in our sample are pulsators such as Gamma Doradus (GDOR) or Delta Scuti stars (DSCT), Rotating stars (ROT) and their subgroup of
Canum Venaticorum (ACV) stars, and stars with variability of unknown origin or unclear signal (VAR). A general scheme we used can be seen in
Table 2.
Classification of light curves presents several challenges. These range from unclear signals in the periodograms, poor data, ambiguity in classification due to a lack of more detailed information about the star in question, and blending effects. Accordingly, there is a non-negligible chance that light curves will be classified incorrectly. See e.g., [
34] for a more detailed discussion on that matter.
However, 512 (50%) out of the 1022 light curves showed apparent variability and could be classified. The result can be seen in
Figure 9. A total of 22 stars had light curves with attributes from CP stars, i.e., a so-called “double wave” from the Oblique Rotator [
6]. The classes in question are ACV (after
CVn, also
Figure 8) and SXARI (after SX Arietis), which are essentially hotter analogues of the ACV variables.
Altogether, 971 individual CP stars and candidates in 217 open clusters and associations were detected from at least one of those three methods.
8. Summary and Outlook
Using different methods ( photometry, spectral, and light curve classification), we detected 971 CP stars in 217 open clusters and associations; most of those CP candidates have not been named in the literature before. The membership probabilities of the stars in the respective stellar aggregation were determined using HDBSCAN. Additionally, we used SED-fitting and emission line photometry to determine which CP stars are likely to be in their PMS phase.
We found that 12 CP stars show emission in H, whereas 92 CP stars show infrared excess. Note that the excess is relatively weak in most cases, suggesting that the stars are already in the late PMS evolution stage and close to the Zero-Age Main Sequence (ZAMS).
The stars with emission lines all belong to the category of CP2 (mCP) stars, agreeing with the suggestion that the magnetic field is at least partially responsible for the occurrence of CP stars [
14]. However, this is still a tiny sample and a much more detailed investigation is needed to confirm or disprove these ideas.
Nevertheless, the present list of about 100 new PMS CP candidates vastly expands the list of known PMS sources with CP properties, which will be a stepping stone for future research on the formation of these objects, particularly the origin and early evolution of the magnetic fields in stars on the upper main sequence and the onset of the diffusion processes in these objects. A recent study by [
86] already discusses the occurence of CP stars in open clusters and also made an analysis of the evolutionary status. They discuss a decrease in CP stars in the evolution from the ZAMS to the terminal-age main sequence (TAMS). We did not analyse the present sample in this way due to the relatively large number of unconfirmed CP candidates. This, however, is surely an opportunity for future research with higher quality of spectral data from future telescope such as the Extremely Large Telescope which is currently under construction.