Galaxy Zoo: reproducing galaxy morphologies via machine learning☆

被引:147
作者
Banerji, Manda [1 ,2 ]
Lahav, Ofer [1 ]
Lintott, Chris J. [3 ]
Abdalla, Filipe B. [1 ]
Schawinski, Kevin [4 ,5 ]
Bamford, Steven P. [6 ]
Andreescu, Dan [7 ]
Murray, Phil [8 ]
Raddick, M. Jordan [9 ]
Slosar, Anze [10 ,11 ]
Szalay, Alex [9 ]
Thomas, Daniel [12 ]
Vandenberg, Jan [9 ]
机构
[1] UCL, Dept Phys & Astron, London WC1E 6BT, England
[2] Univ Cambridge, Inst Astron, Cambridge CB3 0HA, England
[3] Univ Oxford, Dept Phys, Oxford OX1 3RH, England
[4] Yale Univ, Dept Phys, New Haven, CT 06511 USA
[5] Yale Univ, Yale Ctr Astron & Astrophys, New Haven, CT 06520 USA
[6] Univ Nottingham, Sch Phys & Astron, Ctr Astron & Particle Theory, Nottingham NG7 2RD, England
[7] LinkLab, Bronx, NY 10471 USA
[8] Fingerprint Digital Media, Newtownards BT23 7GY, Co Down, North Ireland
[9] Johns Hopkins Univ, Dept Phys & Astron, Baltimore, MD 21218 USA
[10] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Berkeley Ctr Cosmol Phys, Berkeley, CA 94720 USA
[11] Univ Calif Berkeley, Dept Phys, Berkeley, CA 94720 USA
[12] Univ Portsmouth, Inst Cosmol & Gravitat, Portsmouth PO1 2EG, Hants, England
关键词
methods: data analysis; galaxies: general; DIGITAL-SKY-SURVEY; ARTIFICIAL NEURAL-NETWORKS; ESTIMATING PHOTOMETRIC REDSHIFTS; AUTOMATED CLASSIFICATION; STELLAR SPECTRA; COLOR;
D O I
10.1111/j.1365-2966.2010.16713.x
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
We present morphological classifications obtained using machine learning for objects in the Sloan Digital Sky Survey DR6 that have been classified by Galaxy Zoo into three classes, namely early types, spirals and point sources/artefacts. An artificial neural network is trained on a subset of objects classified by the human eye, and we test whether the machine-learning algorithm can reproduce the human classifications for the rest of the sample. We find that the success of the neural network in matching the human classifications depends crucially on the set of input parameters chosen for the machine-learning algorithm. The colours and parameters associated with profile fitting are reasonable in separating the objects into three classes. However, these results are considerably improved when adding adaptive shape parameters as well as concentration and texture. The adaptive moments, concentration and texture parameters alone cannot distinguish between early type galaxies and the point sources/artefacts. Using a set of 12 parameters, the neural network is able to reproduce the human classifications to better than 90 per cent for all three morphological classes. We find that using a training set that is incomplete in magnitude does not degrade our results given our particular choice of the input parameters to the network. We conclude that it is promising to use machine-learning algorithms to perform morphological classification for the next generation of wide-field imaging surveys and that the Galaxy Zoo catalogue provides an invaluable training set for such purposes.
引用
收藏
页码:342 / 353
页数:12
相关论文
共 29 条
  • [11] Estimating photometric redshifts with artificial neural networks
    Firth, AE
    Lahav, O
    Somerville, RS
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2003, 339 (04) : 1195 - 1202
  • [12] An artificial neural network approach to the classification of galaxy spectra
    Folkes, SR
    Lahav, O
    Maddox, SJ
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 1996, 283 (02) : 651 - 665
  • [13] A catalog of morphologically classified galaxies from the Sloan Digital Sky Survey: North equatorial region
    Fukugita, Masataka
    Nakamura, Osamu
    Okamura, Sadanori
    Yasuda, Naoki
    Barentine, John C.
    Brinkmann, Jon
    Gunn, James E.
    Harvanek, Mike
    Ichikawa, Takashi
    Lupton, Robert H.
    Schneider, Donald P.
    Strauss, Michael A.
    York, Donald G.
    [J]. ASTRONOMICAL JOURNAL, 2007, 134 (02) : 579 - 593
  • [14] Neural computation as a tool for galaxy classification: Methods and examples
    Lahav, O
    Naim, A
    Sodre, L
    Storrie-Lombardi, MC
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 1996, 283 (01) : 207 - 221
  • [15] GALAXIES, HUMAN EYES, AND ARTIFICIAL NEURAL NETWORKS
    LAHAV, O
    NAIM, A
    BUTA, RJ
    CORWIN, HG
    DEVAUCOULEURS, G
    DRESSLER, A
    HUCHRA, JP
    VANDENBERGH, S
    RAYCHAUDHURY, S
    SODRE, L
    STORRIE-LOMBARDI, MC
    [J]. SCIENCE, 1995, 267 (5199) : 859 - 862
  • [16] Galaxy Zoo: the large-scale spin statistics of spiral galaxies in the Sloan Digital Sky Survey
    Land, Kate
    Slosar, Anze
    Lintott, Chris
    Andreescu, Dan
    Bamford, Steven
    Murray, Phil
    Nichol, Robert
    Raddick, M. Jordan
    Schawinski, Kevin
    Szalay, Alex
    Thomas, Daniel
    Vandenberg, Jan
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2008, 388 (04) : 1686 - 1692
  • [17] Galaxy Zoo: morphologies derived from visual inspection of galaxies from the Sloan Digital Sky Survey
    Lintott, Chris J.
    Schawinski, Kevin
    Slosar, Anze
    Land, Kate
    Bamford, Steven
    Thomas, Daniel
    Raddick, M. Jordan
    Nichol, Robert C.
    Szalay, Alex
    Andreescu, Dan
    Murray, Phil
    Vandenberg, Jan
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2008, 389 (03) : 1179 - 1189
  • [18] AUTOMATED MORPHOLOGICAL CLASSIFICATION OF APM GALAXIES BY SUPERVISED ARTIFICIAL NEURAL NETWORKS
    NAIM, A
    LAHAV, O
    SODRE, L
    STORRIE-LOMBARDI, MC
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 1995, 275 (03) : 567 - 590
  • [19] Ripley B. D., 1988, STAT INFERENCE SPATI, V2nd, DOI DOI 10.1017/CBO9780511624131
  • [20] Ripley B. D., 1981, Spatial Statistics