College of Science
112 Symbiotic Star Candidates in Gaia Data Release 3
Samantha Ball; Benjamin Bromley; and Scott Kenyon
Faculty Mentor: Benjamin C. Bromley (Physics and Astronomy, University of Utah)
Abstract
Symbiotic Stars are some of the most elusive and exotic star systems in the universe. Their rarity and peculiarity are of key importance to our understanding of stellar evolution and the origins of elements that make life possible. They give crucial information in determining how binary systems evolve and a peek into fundamental cosmological events. Since the list of known Symbiotics is relatively small, new releases from Gaia provide an opportunity to expand upon this list. Using machine learning techniques to train and test on a multitude of stars found in Gaia Data Release 3 (DR3), the aim is to discover a new list containing potential Symbiotic Stars.
Introduction
Symbiotic Stars are a part of a small set of binaries that usually contain a white dwarf and a red giant star. Both stars were similar to the Sun but are now at an older stage of evolution. The white dwarf is a compact, Earth-size object, while the red giant is cooler and roughly a hundred times the size of the Sun. In a Symbiotic Star, the white dwarf accretes material from its red giant companion resulting in an extremely hot temperature and super bright luminosity. This then allows for ionization of a large fraction of the cool giant’s wind. In combination, this unique light pattern given at a specific wavelength allows for Symbiotics to stand out from other stars.
In order to find these systems, it’s crucial to look at a number of key factors:
- Spectroscopic data involves an interaction of matter with electromagnetic radiation over a range of wavelengths. This data is used to find the spectral lines, doppler shift, temperature, stellar classifications and compositions of celestial objects.
- Photometric data includes light measured over a broad range of colors which allows for an accurate determination of the brightness of a celestial object.
- Astrometric data refers to the measurements of the positions and motions of stars.
We focus on observations from Gaia, a space-based instrument that measured over 1 billion stars. Gaia data provides accurate astrometry, photometry, and spectra in blue and red wavebands. The spectra are provided in the form of coefficients of spectral templates derived by the Gaia team. These factors help create the parameters that are needed to search for the appropriate stars in a machine learning algorithm.
Machine Learning Techniques
To begin the machine learning process there are some steps to ensure a thorough and well-rounded classifier
- Gather a training set from Gaia DR3 of known non-Symbiotics.
- Set-up training parameters.
- Retrieve a test set of known Symbiotics.
- Create labels to represent if a star is Symbiotic or not.
- Create a list of stars in both datasets that have spectroscopic data.
- Use the first 25 coefficients of the blue spectra data and add them to the training parameters.
- Create classifier parameters to fit the datasets and make a matrix regarding the accuracy of the predicted versus misclassified stars.
- Run the classifier onto potential red giant stars gathered from the Gaia DR3 database.
In creating a training set, we took two distance cuts in kiloparsecs, [-1 – 10], [10 – 1000] and then queued Gaia to pull 875 random stars from each of these cuts. This dataset was then run through SIMBAD, a database of astronomical objects, to confirm it contained no known symbiotic stars. The dataset was then combined to form a 1750-star list which was then assigned labels equal to zero for non-symbiotic stars. The known Symbiotics, which we determined from SIMBAD, were assigned labels equal to one. These sources constitute the preliminary analysis reported here.
For machine learning we set up a list of Gaia attributes on which to train the data sets. For Symbiotic stars specifically, bp_rp, phot_g_mean_mag, parallax, parallax_over_error, astrometric_gof_al, pm, ruwe, and astrometric_excess_noise were the essential parameters in initial testing.
Toward including spectroscopic information in our machine learning search, we sought stars with Gaia’s spectral data (has_xp_continuous == True). Our data was then filtered to get a list of total stars that contained spectroscopic information. For our Symbiotic Star search, the first 25 coefficients of the blue spectra list were used. Those 25 coefficients were then added into the initial parameters for testing.
We set up our machine learning classifier with a parameter grid to fine tune the RandomForestClassifier for better accuracy. The classifier is then run on the datasets and should produce a matrix by which it correctly predicts and/or misclassifies them. Our next step will be to run the classifier on promising candidates in the Gaia database and see what turns out.
Summary
This is a work in progress. While much has been done, there is still much left to do. By leveraging machine learning techniques on Gaia’s extensive dataset, we aim to identify these elusive Symbiotic binaries and discover new members. In finding these new potential candidates, and studying them, we hope to learn more about how they work and what they can tell us about our universe. After all, the universe is a space with infinite possibilities and a vast unexplored frontier waiting for us to discover its mysteries.
References
De Angeli, F., et al. “Gaia Data Release 3: Processing and Validation of BP/RP Low-Resolution Spectral Data.” Astronomy & Astrophysics, vol. 674, June 2023, p. A2, doi:10.1051/0004- 6361/202243680
Kenyon, S. J., and R. F. Webbink. “The Nature of Symbiotic Stars.” The Astrophysical Journal, vol. 279, Apr. 1984, pp. 252–283, doi:10.1086/161888.
Prusti, T., et al. “The Gaia Mission.” Astronomy & Astrophysics, vol. 595, Nov. 2016, p. A1, doi:10.1051/0004-6361/201629272
Wenger, M., et al. “The Simbad Astronomical Database.” Astronomy and Astrophysics Supplement Series, vol. 143, no. 1, Feb. 2000, pp. 9–22, doi:10.1051/aas:2000332