Using the ECMWF OpenIFS model and state-of-the-art training techniques in meteorological education

The OpenIFS programme of the European Centre for Medium-Range Weather Forecasts (ECMWF) maintains a version of the ECMWF forecast model (IFS; Integrated Forecasting System) for use in education and research at universities, national meteorological services and other institutes. The OpenIFS model can be run on high-performance computing systems, desktop or laptop computers to produce weather forecasts in a similar way to the operational forecasts at ECMWF. Application of OpenIFS as a training tool is wide ranging. At several universities, masters students are taught modelling aspects via sensitivity studies, such as numerical stability, impact of spatial resolution and physical parameterisation settings on the forecast quality. The OpenIFS single column model is used to study a subset of physical processes in the atmosphere. Participants of the OpenIFS user workshops are trained through selected weather events on interpretation of different forecasts, for example ensemble forecasts, probabilistic information, seasonal forecasts. The OpenIFS user meetings and training events demonstrate advanced and easy-to-use graphical tools and training technologies. Metview is developed to analyse, visualise and evaluate the forecast outputs. OpenIFS and Metview “virtual machines” relieve the tutors from the difficulties often found in installing this software on the local computing environment. They provide data, applications and documents in a package tested in-house and deployed easily to another site. A further step on virtualisation is utilising cloud servers, ensuring the computational resources demanded by model runs are available in the cloud space. This paper shows the education activity in the OpenIFS programme with some examples.


Introduction
The European Centre for Medium-Range Weather Forecasts (ECMWF) is both a research institute and an operational service, producing and disseminating global numerical weather predictions and other data to its Member and Co-operating States as well as to the broader community. The comprehensive Earth-system model developed at ECMWF forms the basis for all data assimilation and forecasting activities through one computer software system called the Integrated Forecasting System (IFS).
Developments of ECMWF are based on European scientific and technical co-operation in meteorology and in highperformance computing. To strengthen external collaborations, ECMWF launched the OpenIFS initiative in 2011. The mission of the OpenIFS project is to increase the expertise on IFS at universities and other research institutions and to bring new scientific collaborations on research topics of interest. To support this, ECMWF develops and maintains an easy-to-use version of its operational global IFS model.
A software licensing agreement with ECMWF is required to use the OpenIFS model. License agreements are free and limited to non-commercial use for research or educational organisations. At the start of 2019 there are more than 60 licensed institutes mainly from Europe, but also from Japan, China, South Korea, India, Brazil, Peru, Morocco and the United States. OpenIFS is currently based on IFS cycle 40r1 (Bauer and Richardson, 2014) which was operational in 2013−2014. OpenIFS has the same forecast capability as Figure 1. Evolution of OpenIFS between 2012 and 2018. The "38r1" and "40r1" OpenIFS cycle numbers are taken from the corresponding IFS cycle on which the OpenIFS release is based ("40r1" refers to the IFS model release 1 of cycle 40). The "v1" and "v2" version numbers are unique to OpenIFS and used to indicate updates to OpenIFS only. HTESSEL (Balsamo et al., 2009) is the version of the Tiled ECMWF Scheme for Surface Exchanges over Land (TESSEL) and used operationally in IFS for describing the evolution of soil, vegetation and snow over the continents at diverse spatial resolutions.
IFS but it excludes the observation processing and the data assimilation.
The OpenIFS model is extensively used in meteorological education. Besides the intention to provide training on meteorological modelling and computing, the involved institutes aim to develop and improve work-relevant skills of masters and PhD students, bridging the gap between their studies and real research using a complex, state-of-the-art numerical weather prediction model with a variety of approaches. At the École Nationale de la Météorologie (ENM) of Météo-France, universities of Ghent and Oxford, a meteorological story has been built up around a selected case study to demonstrate some key features of the meteorological modelling like necessity of the ensemble approach. At the Hungarian Meteorological Service, universities of Helsinki, Reading and Stockholm, different model settings are tested on a given forecast case to teach some aspects of the numerical methods and the physical parameterisations. At the universities of Innsbruck and Perugia, the focus is on vertical column processes, the students investigate "what if" questions with the single column model (originally developed by Martin Köhler and João Teixeira in 2002) available since 2016 (Váňa and Ahlgrimm, 2018;Fig. 1). ECMWF itself also delivers courses based on OpenIFS (IFS) combined with further ECMWF software and modern training techniques like e-learning, virtual machines and cloud technology. In this paper, we show some examples for training and teaching activities based on OpenIFS.

OpenIFS user workshops
OpenIFS user workshops are organised annually or biannually jointly by a university or research institute and the OpenIFS team from ECMWF. The meetings between 2013 and 2017 were held at the universities of Helsinki, Stockholm, ECMWF (organised with the University of Oxford) and the International Centre for Theoretical Physics in Trieste, respectively. The aims of the meetings are to promote OpenIFS to the local and wider community and bring together scientists, current and prospective OpenIFS users to discuss the workshop theme. The event is devoted to a different scientific topic each year, discussed in keynote talks and explored via case studies in the practical sessions. For example, predictability issues and the importance of ensemble predictions were demonstrated with Hurricane Sandy (Carver et al., 2013); convection parameterisation was investigated with case studies of tornado formation in the USA and convection over Africa (Hannachi and Carver, 2014); seasonal forecasts and teleconnections were studied via the 2015-2016 El Niño (Carver, 2017).
The participants have the possibility to run forecasts with OpenIFS or they analyse the model outputs produced at ECMWF prior to the training course (see Fig. 2 for the 2015-2016 seasonal predictions). The OpenIFS single column model has occasionally been used in the meetings to experiment with modifying physical parameterisations. To visualise and evaluate the results, the analysis and visualisation software Metview is applied. This is developed in cooperation between ECMWF and the Brazilian National Institute for Space Research (Russell et al., 2014). It features an iconbased user interface for interactive work and a macro scripting language for batch processing. For the workshops, custom easy-to-use Metview icons and macros are prepared for each experiment with detailed tutorials on the exercises for the participants to follow.
A key feature of some meetings is the use of Linux virtual machines (VM). A virtual machine consists of a computer operating system and its applications running on top of another operating system. This is an effective way of delivering remote workshop practicals involving real models, data and analysis tools. The OpenIFS virtual machines contain a pre-installed version of Metview, pre-compiled OpenIFS and single column model executables as well as all the results of the different OpenIFS simulations run at ECMWF prior to the meeting. It is deployed via FTP or on a USB stick and the participants can use them by installing a "virtual machine player" on the classroom computers.

Training at Météo-France ENM
The practical sessions of the OpenIFS workshops inspired the masters courses at several universities for example in Reading (Plant and Gray, 2017), Oxford as well as at the École Nationale de la Météorologie in France. The OpenIFS team returned to Météo-France for the third consecutive year in 2018 to run a training course for engineers of the school. It forms a part of the teaching programme at ENM and is now an established collaboration with OpenIFS outreach activities (Carver et al., 2016). The forecast was initialised on 1 November 2015 and the ten ensemble members were generated using the stochastic scheme for the physics tendencies (Buizza et al., 1999;Leutbecher et al., 2017) and the stochastic kinetic energy backscatter scheme (Shutts, 2005) in the model dynamics. The figures represent monthly mean values (a) for December 2015 over the Pacific region and (b) for January 2016 over Europe. Although El Niño is a coupled ocean-atmosphere phenomenon centred over the tropical Pacific the key effect of which is the warming of the equatorial Pacific Ocean, it can have climate impacts also in remote parts of the globe. Fraedrich and Müller (1992) identified the colder January temperatures in northern Europe as one of such teleconnection patterns.
Fifteen ENM students took part in the course in 2018. They followed a prepared tutorial using ECMWF ensemble forecast data in a case study of hurricane Nadine during the HyMeX (Hydrological cycle in Mediterranean experiment; Drobinski et al., 2014) observational campaign in 2012. The course focused on part of Nadine's life cycle when the tropical cyclone interacted with the mid-latitude flow. The forecast uncertainty of this situation is naturally high, ideal for teaching as it demonstrates the need to establish ensemble forecasts. The exercises simulated a real forecast situation and put the students in the position of a duty forecaster supporting the HyMeX scientific director in decision making for flight planning based on probabilistic forecast information.
During the 3-day course, the students were presented with forecast information step-by-step, the same way a duty forecaster might receive the information. They explored the available analysis and the high-resolution operational forecast (HRES) first. The 16 km resolution IFS forecast predicted Nadine to move eastwards to the continent, at the same time a short-wave trough separated into a cut-off low and approached Nadine from the north (Fig. 3d). The orography of the Iberian Peninsula diverted the low-level flow over the Mediterranean Sea, resulting in strong, moist flow over southern France.
Afterwards the students looked at the differences between the HRES and control forecasts which both started from the same initial condition and differ only in the horizontal resolution. The 31 km resolution ensemble control forecast showed that Nadine made a loop back to the Atlantic and the interaction between the cyclone and the cut-off low was weak. The southwesterly wind from the Mediterranean caused large precipitation amounts over southern France. The investigations were extended with the 50-member operational ensemble forecast of ECMWF which has the same resolution as the control forecast (i.e. 31 km in 2012) and represents the uncertainties in the initial condition and the physical parameterisations. Exploring the ensemble members on stamp maps and spaghetti maps, the students concluded that the forecast of the severe weather over southern France was linked to the forecast of the interaction between Nadine and the cutoff low (Pantillon et al., 2015). They performed manual and automatic clustering of the ensemble forecast to understand the different scenarios. For automatic clustering a principal component analysis was used retaining the first two principal components. Figure 3a and b show a representative member of the two main clusters for the potential vorticity at 320 K and the geopotential at 500 hPa. Visualising probability and percentile maps brought the engineers closer to estimating the likelihood of extreme precipitation over southern France. Finally, with all the available forecast information, they gave advice to the "scientific director" to make a pre-alert decision on aircraft deployment. There was a discussion on the use Figure 3. Potential vorticity at 320 K (pvu) and geopotential at 500 hPa (dam) at 12:00 UTC on 22 September 2012 according to (a-b) two members of the operational ensemble forecast, (c) the control forecast, (d) the high-resolution forecast, each of them started at 00:00 UTC on 20 September 2012 (i.e. all are 60 h forecasts) and (e) the operational analysis. The ensemble members split to two substantially different scenarios for the interaction between Nadine and the Atlantic cut-off. Member 2 represents a scenario where Nadine and the cut-off rotate around each other tending to merge, while member 21 shows that Nadine moves westwards back to the ocean and the cut-off shifts eastwards. The HRES forecast indicated both Nadine and the cut-off being steered eastward. The control forecast predicted weak interaction and divergence between the two systems which was confirmed also by the analysis. of cost/loss ratios to turn probability of a precipitation event into an eventual decision. The students were confronted with the actual forecast performance and ECMWF analyses of the event only on the last day when they calculated some error statistics.
Linux virtual machines have been employed in the courses at ENM since the beginning in 2016. Apart from the virtual machine player, there is no need to install software and files on the host computer. At the same time, some drawbacks have been identified from experience. The VM can be compressed to a single file, but as a full operating system with data and files, its size typically exceeds 10 GB. It can freeze on computers with limited RAM when using software that is compute and memory intensive (e.g. Metview). Exporting the output files via USB, shared folders or network transfer (for presentation by the tutor to the class for example) can be problematic relying on kernel changes to the VM that do not always work. In 2018 a few virtual machines were hosted on the Copernicus Climate Data Store (CDS). The CDS is a cloud-based system to browse and combine raw data, access information about the past, present and future climate, develop and build own applications, maps and graphs online using a set of available software (Raoult et al., 2017). The virtual machines were prepared as VM "instances" on the CDS cloud server and provided to some of the students who reached them through a web browser after logging into the CDS, whilst the others used the VM installed on the classroom PCs. The system provided more stability and relied less on local computing capacity. Nevertheless, some new issues were encountered: the responsiveness of the CDS VMs could not match the local VMs; Metview animations and Metview scripts required noticeably longer times to complete despite generous resource allocation on the CDS. Although the internal cloud network eased moving the images between the CDS VMs for class presentation, copying the results to the local accounts can be still complicated if the local (firewall etc.) settings do not support that. Considering that cloud services rely on network and cloud resource allocation, they may work best for less compute intensive or slower paced workshops.

NumLab at the University of Helsinki
The Laboratory course in numerical modelling, commonly referred to as NumLab, is taught at the University of Helsinki, Finland every year as part of the 2-year masters program in atmospheric sciences. NumLab is designed to give students completing their masters degrees relevant work-life skills. Its specific learning objectives are: 1. to develop basic skills in high-performance computing relevant for meteorology; 2. to acquire practical, hands-on experience of using a state-of-the-art, complex atmospheric forecast model; 3. to learn to develop research questions, design relevant numerical experiments and apply meteorological knowledge from earlier courses and scientific reasoning to address research questions and 4. to develop transferable skills in scientific problem solving as a research group member and in presenting scientific results in a conference style presentation.
Historically, a different numerical model was selected each time NumLab was taught. However, this resulted in a large amount of preparation each year and often the lecturers had to first learn the model themselves. Since 2015, OpenIFS has been used as the numerical model in NumLab and will continue to be so for the foreseeable future. OpenIFS was selected for numerous reasons: it is based on a state-of-the-art operational model, IFS; research in the dynamic meteorology group at the University of Helsinki relies now largely on OpenIFS; climate research at the University of Helsinki is based on EC-Earth (Hazeleger et al., 2010) which has a version of IFS as its atmospheric component; operational shortterm forecasts at the Finnish Meteorological Institute are based on the HARMONIE model (HIRLAM-ALADIN Research for Mesoscale Operational NWP In Europe; Bengtsson et al., 2017) which has a dynamical core similar to IFS.
The NumLab course is worth 5 credit points in the European Credit Transfer and Accumulation System (ECTS; one year of full study is worth 60 credit points). It runs over two distinct 7-week teaching periods with a 2 h contact session per week. In the first 7 weeks, short lectures are given during the contact sessions on topics such as useful Linux commands, architecture of a supercomputer, an overview of IFS incorporating information about the space-time discretisation, the state variables, parametrised physical processes (etc.), and the output file formats from OpenIFS (gridpoint versus spectral coefficients). The students work individually and learn how to compile and run the model, post-process and analyse the model outputs and improve their basic Linux and high-performance computing skills and knowledge (for example, using batch queue systems).
During the second part of the course the participants work in groups of 3-4 students on a small research project. Each year, a different scientific topic or specific historical weather event is selected as the basis of the course. Previous topics have included the extra-tropical wind storm Lothar, the parameterisation of deep convection and forecast skill drop outs over Europe which was studied using ensemble simulations. The students design their own research questions and numerical experiments, with input and guidance from lecturers, and then conduct and analyse the simulations. One example of a research project undertaken by a group of students who studied the Lothar storm was to alter the surface roughness over land and investigate what impact this had on the forecast 10 m wind gusts. In the year when the focus was on deep convection another group explored the impact of entrainment on the amount of cloudiness and convective precipitation over central Africa.
The topic in 2019 is the sudden stratospheric warming of February 2018 which resulted in cold conditions over large parts of Europe. As the theme changes each year, it is possible for students to take the course more than once and the course can also be included in doctoral studies. To date more than 50 students have participated in the course and have learnt how to use OpenIFS. Most of them stated in their feedbacks that they learnt new skills and enjoyed having the opportunity to apply knowledge gained from earlier courses. Some students have commented that there is a very steep learning curve at the start of the course. In the future it is likely that the course will be opened up to remote participation.

Teaching numerical modelling at Eötvös Loránd University in Budapest
A two-semester course is dedicated to numerical weather prediction (NWP) and climate modelling at Eötvös Loránd University in Budapest, Hungary. In the first term, the masters students of Meteorological Department learn about theoretical basics of meteorological modelling: discretisation methods, finite difference technique, Galerkin methods, numerical stability, data assimilation, ensemble predictions etc. The lectures are given by modelling practitioners of the Hungarian Meteorological Service. The proceeding semester allows the attendees to try in practice what they learnt in the theory course. Further master students join from Department of Applied Analysis and Computational Mathematics and they form small teams with the meteorology students. The teams comprise of 2 to 4 members working on differ- ent modelling topics processing the related scientific articles and conducting simulations with numerical models available for educational purposes like the Lorenz models (Lorenz and Emanuel, 1998;Lorenz, 2005) and OpenIFS model. The students spend either an intensive week focusing solely on their project or a few hours every week during the semester consulting regularly with their supervisor from the meteorological institute. The programme closes with the teams' presentations in which they introduce the background of their chosen topic, their achieved experiments and discuss the conclusions drawn based on their results. The course strengthens the students' capabilities to work in teams. Especially beneficial is teams composed from students with a more theoretical background and those from the area of applications because that mimics the real work style in most meteorological services.
An evaluation framework was developed at ECMWF for OpenIFS version 40r1v2 in 2018 with the aim to provide a user tool for checking the meteorological performance of the model, comparing its different versions and testing the user developments. The tool is based on two weather events with severe impacts in Europe: storm Xaver from 2013 and storm Desmond from 2015. Several runs were conducted for both cases to study the effect of forecast length, initial conditions and spatial resolution on forecast quality. The experiments were initialised from the ERA-Interim and ERA5 reanalyses 1 to 5 days before the high-impact events using 3 different resolutions. The input data, the Metview-based visualisation programs, a detailed guidance about post-processing and plotting the model outputs, as well as the output figures are published online (Szépszó, 2008;Szépszó and Carver, 2018a, b).
In 2018, a group of applied mathematics students from Eötvös Loránd University worked with OpenIFS. The Hungarian students tested the evaluation tool before its release. They reconstructed the experiments for Desmond at T255L91 and T639L137 resolution, initialised from ERA-Interim on 4 December 2015, using OpenIFS 40r1 on the supercomputer of the Hungarian Meteorological Service. (The resolution of OpenIFS is specified by the spectral truncation prefixed by T for triangular truncation and associated with a linear Gaussian grid for the grid-point space computations, i.e. T255L91 and T639L137 correspond to 78 and 31 km grid spacing with 91 and 137 vertical levels, respectively.) The students visualised and evaluated the results following the guidance description step-by-step (Fig. 4). They compared the forecasts with ERA5 data (Hersbach et al., 2018) for different meteorological variables. The precipitation field was analysed with more detail, because not only OpenIFS, but also the ERA5 forecasts underestimated the rainfall with respect to the observational data. The students' feedbacks were taken into account at the final release of the evaluation package.

Summary and outlook
The courses based on the OpenIFS model help the masters and PhD students in developing and improving the knowledge and skills typically requested in scientific job announcements. Their analytical and problem solving skills are improved by the scientific mini-projects mapping the related literature and raising scientific questions based on the existing knowledge of the students. Their programming and scripting skills are enhanced by using a state-of-the-art numerical weather prediction model, analysing large datasets and handling model data in various formats. The courses give opportunities to work in a (sometimes multi-disciplinary) team as well as to discuss questions in English which both are highly desirable in meteorology where numerous developments are achieved in international collaborations. The next OpenIFS model release in 2019 will make it possible to extend the research and teaching activities based on the model. OpenIFS version 43r3 will introduce the octahedral cubic grid used in ECMWF operational forecasts since 2015 which had significant benefits in terms of computational efficiency and representation of the resolved processes (Malardel et al., 2016). An offline version of the new radiation scheme ecRad (Hogan and Bozzo, 2016) has already been available for non-commercial use under the terms of the OpenIFS licence since 2017 and it will be included in the new OpenIFS version. It is faster than the radiation scheme of OpenIFS 40r1 (Morcrette et al., 2008) and its more flexible modular structure facilitates radiative transfer research. An interactive lake model (Balsamo and Belward, 2017) will be also available in the new version, based on the FLake parametrisation developed at the German Weather Service (Mironov, 2008). A range of academic test cases such as the aquaplanet configuration (Neale and Hoskins, 2000a, b), the Held-Suarez case (Held and Suarez, 1994) is available in the current OpenIFS version to support teaching the dynamical meteorology. They will be extended with further pre-defined settings like for example baroclinic waves, idealised tropical cyclones in OpenIFS 43r3.
Data availability. Use of the OpenIFS model requires a free OpenIFS license from ECMWF which can be requested by email to openifs-support@ecmwf.int. The content of the virtual machines as well as the VM itself used in the OpenIFS workshops and the trainings introduced in Section 3 are available on request by email to openifs-support@ecmwf.int. The data and the programs mentioned in Section 5 can be retrieved from the ECMWF download server: http://download.ecmwf.int/test-data/openifs/reference_ casestudies (Szépszó and Carver, 2018b). The content is covered by the Creative Commons Attribution-Non-Commercial-No-Derivatives-4.0-Unported License, as described on https://www. ecmwf.int/en/terms-use (ECMWF, 2019). The tutorial for the exercises of Section 5 is available here: https://confluence.ecmwf.int/ x/jxwXBQ (Szépszó, 2018).
Author contributions. GS wrote the manuscript with input from all authors. GC designed the scientific and technical content of the OpenIFS workshops and the training introduced in Sect. 3. VS provided Sect. 4 and performed the work shown there. GS developed the OpenIFS evaluation package (with support of GC), carried out the work detailed in Sect. 5 and was involved in delivering some workshops discussed in the paper. The figures were made by GS. All authors contributed to the final version of the manuscript.
Competing interests. The authors declare that they have no conflict of interest.

Special issue statement.
This article is part of the special issue "18th EMS Annual Meeting: European Conference for Applied Meteorology and Climatology 2018". It is a result of the EMS Annual Meeting: European Conference for Applied Meteorology and Climatology 2018, Budapest, Hungary, 3-7 September 2018.
Acknowledgements. The authors wish to thank the reviewers for their useful comments and suggestions improving the manuscript. Victoria Sinclair was funded by the Academy of Finland (project no. 307331) and wishes to acknowledge CSC -IT Center for Science, Finland, for computational resources used by the NumLab course.
Review statement. This paper was edited by Tomas Halenka and reviewed by Robert S. Plant and one anonymous referee