Magnetic control of tokamak plasmas through deep reinforcement learning

被引:433
作者
Degrave, Jonas [1 ]
Felici, Federico [2 ]
Buchli, Jonas [1 ]
Neunert, Michael [1 ]
Tracey, Brendan [1 ]
Carpanese, Francesco [1 ,2 ]
Ewalds, Timo [1 ]
Hafner, Roland [1 ]
Abdolmaleki, Abbas [1 ]
de las Casas, Diego [1 ]
Donner, Craig [1 ]
Fritz, Leslie [1 ]
Galperti, Cristian [2 ]
Huber, Andrea [1 ]
Keeling, James [1 ]
Tsimpoukelli, Maria [1 ]
Kay, Jackie [1 ]
Merle, Antoine [2 ]
Moret, Jean-Marc [2 ]
Noury, Seb [1 ]
Pesamosca, Federico [2 ]
Pfau, David [1 ]
Sauter, Olivier [2 ]
Sommariva, Cristian [2 ]
Coda, Stefano [2 ]
Duval, Basil [2 ]
Fasoli, Ambrogio [2 ]
Kohli, Pushmeet [1 ]
Kavukcuoglu, Koray [1 ]
Hassabis, Demis [1 ]
Riedmiller, Martin [1 ]
机构
[1] DeepMind, London, England
[2] Ecole Polytech Fed Lausanne, Swiss Plasma Ctr, Lausanne, Switzerland
基金
瑞士国家科学基金会;
关键词
D O I
10.1038/s41586-021-04301-9
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Nuclear fusion using magnetic confinement, in particular in the tokamak configuration, is a promising path towards sustainable energy. A core challenge is to shape and maintain a high-temperature plasma within the tokamak vessel. This requires high-dimensional, high-frequency, closed-loop control using magnetic actuator coils, further complicated by the diverse requirements across a wide range of plasma configurations. In this work, we introduce a previously undescribed architecture for tokamak magnetic controller design that autonomously learns to command the full set of control coils. This architecture meets control objectives specified at a high level, at the same time satisfying physical and operational constraints. This approach has unprecedented flexibility and generality in problem specification and yields a notable reduction in design effort to produce new plasma configurations. We successfully produce and control a diverse set of plasma configurations on the Tokamak a Configuration Variable(1,2), including elongated, conventional shapes, as well as advanced configurations, such as negative triangularity and 'snowflake' configurations. Our approach achieves accurate tracking of the location, current and shape for these configurations. We also demonstrate sustained 'droplets' on TCV, in which two separate plasmas are maintained simultaneously within the vessel. This represents a notable advance for tokamak feedback control, showing the potential of reinforcement learning to accelerate research in the fusion domain, and is one of the most challenging real-world systems to which reinforcement learning has been applied.
引用
收藏
页码:414 / +
页数:20
相关论文
共 54 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] Data-driven profile prediction for DIII-D
    Abbate, J.
    Conlin, R.
    Kolemen, E.
    [J]. NUCLEAR FUSION, 2021, 61 (04)
  • [3] Abdolmaleki A., 2018, Relative entropy regularized policy iteration
  • [4] Abdolmaleki A., MULTIOBJECTIVE POLIC
  • [5] Akkaya I., 2019, SOLVING RUBIKS CUBE
  • [6] Plasma flux expansion control on the DIII-D tokamak
    Anand, H.
    Humphreys, D.
    Eldon, D.
    Leonard, A.
    Hyatt, A.
    Sammuli, B.
    Welander, A.
    [J]. PLASMA PHYSICS AND CONTROLLED FUSION, 2021, 63 (01)
  • [7] Real time magnetic control of the snowflake plasma configuration in the TCV tokamak
    Anand, H.
    Coda, S.
    Felici, F.
    Galperti, C.
    Moret, J-M
    Labit, B.
    Reimerdes, H.
    Maurizio, R.
    [J]. NUCLEAR FUSION, 2019, 59 (12)
  • [8] A novel plasma position and shape controller for advanced configuration development on the TCV tokamak
    Anand, H.
    Coda, S.
    Felici, F.
    Galperti, C.
    Moret, J. -M.
    [J]. NUCLEAR FUSION, 2017, 57 (12)
  • [9] Andrychowicz M, 2021, ICLR 2021 9 INT C LE
  • [10] Achievement of Reactor-Relevant Performance in Negative Triangularity Shape in the DIII-D Tokamak
    Austin, M. E.
    Marinoni, A.
    Walker, M. L.
    Brookman, M. W.
    deGrassie, J. S.
    Hyatt, A. W.
    McKee, G. R.
    Petty, C. C.
    Rhodes, T. L.
    Smith, S. P.
    Sung, C.
    Thome, K. E.
    Turnbull, A. D.
    [J]. PHYSICAL REVIEW LETTERS, 2019, 122 (11)