Difference between revisions of "RG: Machine Learning"

From gem
Jump to navigation Jump to search
 
(36 intermediate revisions by the same user not shown)
Line 1: Line 1:
 
__TOC__
 
__TOC__
  
"""""" This page is under development """""
+
==Chairs==
 +
 
 +
{| class="wikitable"
 +
|+ style="caption-side:bottom; text-align:left | Table 1: RG Chairs.
 +
! Name          !! Affiliation !! Email
 +
|-
 +
| 1. Hyunju Connor  || NASA GSFC || [mailto:hyunju.k.connor@nasa.gov ]
 +
|-
 +
| 2. Matthew Argall  || UNH || [mailto:Matthew.Argall@unh.edu ]
 +
|-
 +
| 3. Xiangning Chu  || LASP, CU Boulder || [mailto:xiangning.chu@lasp.colorado.edu ]
 +
|-
 +
| 4. Bashi Ferdousi    || AFRL || [mailto:banafsheh.ferdousi@spaceforce.edu ]
 +
|-
 +
| 5. Valluri Sai Gowtam    || UAF || [mailto:svalluri@alaska.edu ]
 +
|}
 +
 
 +
==About the ML-GEM Resource Group==
  
==Chairs==
+
Machine Learning based Geospace Environment Modeling (ML-GEM) is a new resource group selected by the GEM Steering Committee, with two primary goals: advancing system-of-systems science in Sun-Earth interaction from a data-driven perspective and developing an ML-based Geospace Environment Modeling by integrating community-wide ML efforts.
  
Hyunju Connor <br>
+
==Goals and Objectives==
Matthew Argall <br>
 
Xiangning Chu <br>
 
Bashi Ferdousi <br>
 
Valluri Sai Gowtam <br>
 
  
Contact:  
+
'''1. Invite community-wide ML efforts:''' Extend invitations to a diverse community, including GEM, CEDAR, SHINE, and ML communities, fostering broader participation and collaboration.
:[tbd]
 
:[tbd]
 
  
 +
'''2. Share ML advancements:''' Disseminate the latest ML techniques, including their advantages and disadvantages, and lessons learned. This knowledge-sharing will support community-wide geospace environment modeling efforts.
  
==Description about the Resource Group==
+
'''3. Facilitate integration discussions:''' Stimulate discussions on integrating diverse ML models across geospace systems, such as solar wind, magnetosheath, cusp, ring current, radiation belt, plasmasphere, and ionosphere.
  
Machine Learning based Geospace Environment Modeling (ML-GEM) is a new resource group selected by the GEM Steering Committee, with two primary goals: advancing system-of-systems science in Sun-Earth interaction from a data-driven perspective and developing an ML-based Geospace Environment Modeling by integrating community-wide ML efforts.
+
'''4. Develop ML-based models:''' Initiate the development of an ML-based geospace environment model and encourage the strategic creation and inclusion of ML components in ML-GEM for enhanced performance.
  
Machine Learning (ML) efforts have experienced rapid growth within the GEM community over
+
'''5. Explore systematic responses:''' Investigate system-of-systems science in the solar wind – Earth interaction by leveraging ML-GEM and other ML models.
the past few years. However, there has been no collective initiative to develop a new ML-based
 
Geospace Environment Model (ML-GEM) that comprehensively connects ML-based models from the Sun to the solar wind, magnetosphere, and upper atmosphere to systematically understand and predict Sun – Earth interactions. Our new Focus Group (FG) aims to gather and lead diverse machine-learning efforts within the GEM community, with the primary goal of developing ML-GEM and gaining insights into our geospace systems from a data-driven perspective.
 
  
Over the next four years, our diverse FG team, consisting of four early-career scientists and
+
'''6. Compile a catalog of ML-ready dataset:''' Cleaning and preparing datasets constitute a substantial portion of machine learning model development. We will curate a list of existing ML-ready datasets in heliophysics for various scientific purposes.
two females, aims to unite ML efforts across the GEM community. We will actively explore
 
innovative ways to link existing and developing ML models and establish the initial architecture
 
of a new data-driven global geospace environment model. These efforts will subsequently lead to system-of-systems research to understand the collective behavior of geospace systems in response to incoming solar wind and Interplanetary Magnetic Field (IMF) conditions. Additionally, we will share new ML techniques and ML-ready dataset, outline the pros and cons of ML approaches, and offer valuable lessons learned, providing a resource for newcomers initiating their ML projects. The main research area of this FG is Global General Circulation Modeling (GGCM). However, our topics cover all other four GEM research areas, aligning well with all current GEM FGs.
 
  
Machine Learning (ML) efforts have experienced rapid growth within the GEM community over
+
==GEM-2024 Activities==
the past few years. However, there has been no collective initiative to develop a new ML-based
 
Geospace Environment Model (ML-GEM) that comprehensively connects ML-based models from the Sun to the solar wind, magnetosphere, and upper atmosphere to systematically understand and predict Sun – Earth interactions. Our new Focus Group (FG) aims to gather and lead diverse machine-learning efforts within the GEM community, with the primary goal of developing ML-GEM and gaining insights into our geospace systems from a data-driven perspective.
 
  
Over the next four years, our diverse FG team, consisting of four early-career scientists and
+
ML-GEM chairs have scheduled 4 sessions for the upcoming 2024 GEM Summer workshop held in Fort Collins, Colorado during June 23-28:
two females, aims to unite ML efforts across the GEM community. We will actively explore
 
innovative ways to link existing and developing ML models and establish the initial architecture
 
of a new data-driven global geospace environment model. These efforts will subsequently lead to system-of-systems research to understand the collective behavior of geospace systems in response to incoming solar wind and Interplanetary Magnetic Field (IMF) conditions. Additionally, we will share new ML techniques and ML-ready dataset, outline the pros and cons of ML approaches, and offer valuable lessons learned, providing a resource for newcomers initiating their ML projects. The main research area of this FG is Global General Circulation Modeling (GGCM). However, our topics cover all other four GEM research areas, aligning well with all current GEM FGs.
 
  
==Goals and Objectives==
+
'''1. ML-GEM stand-alone session :'''  All ML efforts across the GEM research areas are invited.
  
Our overarching goals are to develop an ML-based geospace environment model and advance
+
'''2. ML-GEM joint session with the Inner Magnetosphere Focus Groups :''' ML efforts particularly in the inner magnetosphere research area are invited.  
system-of-systems science in Sun-Earth interaction. Over the 4-year focus group period, we will
 
implement the following objectives to achieve these goals:
 
  
1. Invite community-wide ML efforts: Extend invitations to a diverse community, including GEM, CEDAR, SHINE, and ML communities, fostering broader participation and collaboration.
+
'''3. ML-GEM discussion session :''' Please submit a summary of your ML model (1-2 slides) — the slide format to be announced — and join the discussion on how to integrate your model into a unified, data-driven geospace environment model.
  
2. Share ML advancements: Disseminate the latest ML techniques, including their advantages and disadvantages, and lessons learned. This knowledge-sharing will support community-wide geospace environment modeling efforts.
+
<u>'''If you are interested in giving a talk in these sessions, please submit your talk at the following website:'''</u> https://forms.gle/fmsU2eeBFFwD4tDQ8.
  
3. Facilitate integration discussions: Stimulate discussions on integrating diverse ML models across geospace systems, such as solar wind, magnetosheath, cusp, ring current, radiation belt, plasmasphere, and ionosphere.
+
'''4. ML-GEM tutorial session:'''  A hands-on tutorial on the Long Short-Term Memory (LSTM) technique that models time-series data. This tutorial will use the LSTM model and the SuperMAG geomagnetic field data, published in Blandin et al. (2022; https://doi.org/10.3389/fspas.2022.846291).
  
4. Develop ML-based models: Initiate the development of an ML-based geospace environment model and encourage the strategic creation and inclusion of ML components in ML-GEM for enhanced performance.
+
'''Instructions for the ML-GEM Tutorial Session:'''
  
5. Explore systematic responses: Investigate system-of-systems science in the solar wind – Earth interaction by leveraging ML-GEM and other ML models.
+
Welcome to the ML-GEM tutorial session! We will be using the Python programming language and the Google Colab platform for our exercises. There’s no need to download or install anything. Just follow these simple steps:
  
6. Compile a catalog of ML-ready dataset: Cleaning and preparing datasets constitute a substantial portion of machine learning model development. We will curate a list of existing ML-ready datasets in heliophysics for various scientific purposes.
+
'''Step 1:''' Download the tutorial Jupyter notebook and the data files via the provided link. [Link to be determined soon]
  
==GEM-2024 Activities==
+
'''Step 2:''' Sign in to your Google Drive account.
  
ML-GEM chairs have scheduled 4 sessions for the upcoming 2024 GEM Summer workshop held in Fort Collins, Colorado during June 23-28:
+
'''Step 3:''' Create a new folder named “LSTM_tutorial” in your Google Drive.
  
1. ML-GEM stand-alone session : All ML efforts across the GEM research areas are invited.
+
'''Step 4:''' Upload the Jupyter notebook and the data files into the “LSTM_tutorial” folder.
  
2. ML-GEM joint session with the Inner Magnetosphere Focus Groups : ML efforts particularly in the inner magnetosphere research area are invited.  
+
'''Step 5:''' Open the tutorial file by double-clicking on it. It will launch in the Google Colab platform.
  
3. ML-GEM discussion session : Please submit a summary of your ML model (1-2 slides) — the slide format to be announced — and join the discussion on how to integrate your model into a unified, data-driven geospace environment model.  
+
'''Step 6:''' By default, the code will execute using the “CPU” option. To change this, locate the dropdown menu next to the “Connect” button in the top-right corner. Select “Change runtime type,” then choose the “T4 GPU” option.
  
4. ML-GEM tutorial session:  A hands-on tutorial on the Long Short-Term Memory (LSTM) technique that models time-series data. This tutorial will use the LSTM model and the SuperMAG geomagnetic field data, published in Blandin et al. (2022; https://doi.org/10.3389/fspas.2022.846291).
+
You’re now fully prepared for the tutorial. If you run into any issues during the setup process, don’t hesitate to reach out. One of the co-chairs will be available to assist you. We’re excited to have you join us for the tutorial!

Latest revision as of 14:52, 16 May 2024

Chairs

Table 1: RG Chairs.
Name Affiliation Email
1. Hyunju Connor NASA GSFC [1]
2. Matthew Argall UNH [2]
3. Xiangning Chu LASP, CU Boulder [3]
4. Bashi Ferdousi AFRL [4]
5. Valluri Sai Gowtam UAF [5]

About the ML-GEM Resource Group

Machine Learning based Geospace Environment Modeling (ML-GEM) is a new resource group selected by the GEM Steering Committee, with two primary goals: advancing system-of-systems science in Sun-Earth interaction from a data-driven perspective and developing an ML-based Geospace Environment Modeling by integrating community-wide ML efforts.

Goals and Objectives

1. Invite community-wide ML efforts: Extend invitations to a diverse community, including GEM, CEDAR, SHINE, and ML communities, fostering broader participation and collaboration.

2. Share ML advancements: Disseminate the latest ML techniques, including their advantages and disadvantages, and lessons learned. This knowledge-sharing will support community-wide geospace environment modeling efforts.

3. Facilitate integration discussions: Stimulate discussions on integrating diverse ML models across geospace systems, such as solar wind, magnetosheath, cusp, ring current, radiation belt, plasmasphere, and ionosphere.

4. Develop ML-based models: Initiate the development of an ML-based geospace environment model and encourage the strategic creation and inclusion of ML components in ML-GEM for enhanced performance.

5. Explore systematic responses: Investigate system-of-systems science in the solar wind – Earth interaction by leveraging ML-GEM and other ML models.

6. Compile a catalog of ML-ready dataset: Cleaning and preparing datasets constitute a substantial portion of machine learning model development. We will curate a list of existing ML-ready datasets in heliophysics for various scientific purposes.

GEM-2024 Activities

ML-GEM chairs have scheduled 4 sessions for the upcoming 2024 GEM Summer workshop held in Fort Collins, Colorado during June 23-28:

1. ML-GEM stand-alone session : All ML efforts across the GEM research areas are invited.

2. ML-GEM joint session with the Inner Magnetosphere Focus Groups : ML efforts particularly in the inner magnetosphere research area are invited.

3. ML-GEM discussion session : Please submit a summary of your ML model (1-2 slides) — the slide format to be announced — and join the discussion on how to integrate your model into a unified, data-driven geospace environment model.

If you are interested in giving a talk in these sessions, please submit your talk at the following website: https://forms.gle/fmsU2eeBFFwD4tDQ8.

4. ML-GEM tutorial session: A hands-on tutorial on the Long Short-Term Memory (LSTM) technique that models time-series data. This tutorial will use the LSTM model and the SuperMAG geomagnetic field data, published in Blandin et al. (2022; https://doi.org/10.3389/fspas.2022.846291).

Instructions for the ML-GEM Tutorial Session:

Welcome to the ML-GEM tutorial session! We will be using the Python programming language and the Google Colab platform for our exercises. There’s no need to download or install anything. Just follow these simple steps:

Step 1: Download the tutorial Jupyter notebook and the data files via the provided link. [Link to be determined soon]

Step 2: Sign in to your Google Drive account.

Step 3: Create a new folder named “LSTM_tutorial” in your Google Drive.

Step 4: Upload the Jupyter notebook and the data files into the “LSTM_tutorial” folder.

Step 5: Open the tutorial file by double-clicking on it. It will launch in the Google Colab platform.

Step 6: By default, the code will execute using the “CPU” option. To change this, locate the dropdown menu next to the “Connect” button in the top-right corner. Select “Change runtime type,” then choose the “T4 GPU” option.

You’re now fully prepared for the tutorial. If you run into any issues during the setup process, don’t hesitate to reach out. One of the co-chairs will be available to assist you. We’re excited to have you join us for the tutorial!