Identifying Ly{alpha} emitter candidates with Random Forest: learning from galaxies in CANDELS survey. (arXiv:2307.11818v1 [astro-ph.GA])
<a href="http://arxiv.org/find/astro-ph/1/au:+Napolitano_L/0/1/0/all/0/1">L. Napolitano</a>, <a href="http://arxiv.org/find/astro-ph/1/au:+Pentericci_L/0/1/0/all/0/1">L. Pentericci</a>, <a href="http://arxiv.org/find/astro-ph/1/au:+Calabro_A/0/1/0/all/0/1">A. Calabrò</a>, <a href="http://arxiv.org/find/astro-ph/1/au:+Santini_P/0/1/0/all/0/1">P. Santini</a>, <a href="http://arxiv.org/find/astro-ph/1/au:+Castellano_M/0/1/0/all/0/1">M. Castellano</a>, <a href="http://arxiv.org/find/astro-ph/1/au:+Cassata_P/0/1/0/all/0/1">P. Cassata</a>, <a href="http://arxiv.org/find/astro-ph/1/au:+Fynbo_J/0/1/0/all/0/1">J. P. U. Fynbo</a>, <a href="http://arxiv.org/find/astro-ph/1/au:+Jung_I/0/1/0/all/0/1">I. Jung</a>, <a href="http://arxiv.org/find/astro-ph/1/au:+Kashino_D/0/1/0/all/0/1">D. Kashino</a>, <a href="http://arxiv.org/find/astro-ph/1/au:+Mascia_S/0/1/0/all/0/1">S. Mascia</a>, <a href="http://arxiv.org/find/astro-ph/1/au:+Mignoli_M/0/1/0/all/0/1">M. Mignoli</a>
The physical processes which make a galaxy a Lyman Alpha Emitter have been
extensively studied for the past 25 years. However, the correlations between
physical and morphological properties of galaxies and the strength of the
Ly$alpha$ emission line are still highly debated. Therefore, we investigate
the correlations between the rest-frame Ly$alpha$ equivalent width and stellar
mass, star formation rate, dust reddening, metallicity, age, half-light
semi-major axis, S’ersic index and projected axis ratio in a sample of 1578
galaxies in the redshift range $2 leq z leq 7.9$ from the GOODS-S, UDS and
COSMOS fields. From the large sample of Ly$alpha$ emitters (LAEs) in the
dataset we find that LAEs are typically common main sequence star forming
galaxies which show stellar mass $ leq 10^9 text{M}_{odot}$, star formation
rate $ leq 10^{0.5} text{M}_{odot}/text{yr}$, $E(B-V) leq 0.2$ and
half-light semi-major axis $leq 1 text{kpc}$. Building on these findings we
develop a new method based on Random Forest (i.e. a Machine Learning
classifier) in order to select galaxies which have the highest probability of
being Ly$alpha$ emitters. When applied to a population in the redshift range
$z in [2.5, 4.5]$, our classifier holds a $(80 pm 2)%$ accuracy and $(73 pm
4)%$ precision. At higher redshifts ($z in [4.5, 6]$), we obtain a $73%$
accuracy and a $80%$ precision. These results highlight it is possible to
overcome the current limitations in assembling large samples of LAEs by making
informed predictions that can be used for planning future large scale
spectroscopic surveys.
The physical processes which make a galaxy a Lyman Alpha Emitter have been
extensively studied for the past 25 years. However, the correlations between
physical and morphological properties of galaxies and the strength of the
Ly$alpha$ emission line are still highly debated. Therefore, we investigate
the correlations between the rest-frame Ly$alpha$ equivalent width and stellar
mass, star formation rate, dust reddening, metallicity, age, half-light
semi-major axis, S’ersic index and projected axis ratio in a sample of 1578
galaxies in the redshift range $2 leq z leq 7.9$ from the GOODS-S, UDS and
COSMOS fields. From the large sample of Ly$alpha$ emitters (LAEs) in the
dataset we find that LAEs are typically common main sequence star forming
galaxies which show stellar mass $ leq 10^9 text{M}_{odot}$, star formation
rate $ leq 10^{0.5} text{M}_{odot}/text{yr}$, $E(B-V) leq 0.2$ and
half-light semi-major axis $leq 1 text{kpc}$. Building on these findings we
develop a new method based on Random Forest (i.e. a Machine Learning
classifier) in order to select galaxies which have the highest probability of
being Ly$alpha$ emitters. When applied to a population in the redshift range
$z in [2.5, 4.5]$, our classifier holds a $(80 pm 2)%$ accuracy and $(73 pm
4)%$ precision. At higher redshifts ($z in [4.5, 6]$), we obtain a $73%$
accuracy and a $80%$ precision. These results highlight it is possible to
overcome the current limitations in assembling large samples of LAEs by making
informed predictions that can be used for planning future large scale
spectroscopic surveys.
http://arxiv.org/icons/sfx.gif