METplus Use Case: GenEnsProd and EnsembleStat
METplus Use Case: GenEnsProd and EnsembleStatBoth the Ensemble-Stat tool and Gen-Ens-Prod tool have METplus wrapper versions. We will review a METplus use case that calls on both of these tools and see how they interact with and compliment each other.
Optional: Refer to the METplus Config Glossary section of the METplus Users Guide for a reference to METplus variables used in this use case.
- Review the basic wrapper configuration file: EnsembleStat.conf
Recall that the Ensemble-Stat tool is for verification against gridded and/or point observation datasets and can calculate statistical output, which is reflected in the wrapper's configuration options. Note that variables in EnsembleStat.conf reference other config variables that have been defined in other configuration files. For example:
This references INPUT_BASE which is set in the tutorial.conf configuration file. METplus config variables can reference other config variables even if they are defined in a config file that is read afterwards. This is a common practice in METplus wrappers, including the use case that we'll be running.
- Review the basic wrapper configuration file: GenEnsProd.conf
The Gen-Ens-Prod tool (and its wrapper that we're now using) is for creating ensemble products. This differs from the purpose of the Ensemble-Stat tool, so the GenEnsProd wrapper options will differ from those in the EnsembleStat wrapper.
- Review the use case configuration file: EnsembleStat_fcstHRRRE_obsHRRRE_Sfc_MultiField.conf
For the use case that we will be running, three tool wrappers are called: PB2NC, GenEnsProd, and EnsembleStat. These tools will iterate over one time step, but three separate lead times. PB2NC is used as a conversion method for the input file types and was covered in a previous session. From the order of the tools in PROCESS_LIST, we see that EnsembleStat runs first. Since that tool is for verification and statistical output, it uses observational field settings like OBS_VAR1_NAME and BOTH_VAR1_THRESH, which applies to both the forecast and observation variable 1. It also will create multiple output files for each time step processed, one for each of the requested line types:
ENSEMBLE_STAT_OUTPUT_FLAG_RPS = NONE
ENSEMBLE_STAT_OUTPUT_FLAG_RHIST = BOTH
ENSEMBLE_STAT_OUTPUT_FLAG_PHIST = BOTH
ENSEMBLE_STAT_OUTPUT_FLAG_ORANK = BOTH
ENSEMBLE_STAT_OUTPUT_FLAG_SSVAR = BOTH
ENSEMBLE_STAT_OUTPUT_FLAG_RELP = BOTH
GenEnsProd, on the other hand, will utilize the ENS_ configuration settings, and will place all output for each time step into a single netCDF output. Those file's locations and names can be seen with the following configuration settings:
GEN_ENS_PROD_OUTPUT_TEMPLATE = {init?fmt=%Y%m%d%H%M}/gen_ens_prod_{ENSEMBLE_STAT_OUTPUT_PREFIX}_{valid?fmt=%Y%m%d_%H%M%S}V_ens.nc
Note that the output directory is dependent on a different configuration setting, ENSEMBLE_STAT_OUTPUT_DIR. As mentioned above, this is common practice in METplus.
-
Run the use case:
${METPLUS_BUILD_BASE}/parm/use_cases/model_applications/short_range/EnsembleStat_fcstHRRRE_obsHRRRE_Sfc_MultiField.conf \
${METPLUS_TUTORIAL_DIR}/tutorial.conf \
config.OUTPUT_BASE=${METPLUS_TUTORIAL_DIR}/output/Ensemble
METplus is finished running when control returns to your terminal console and you see the following text:
- Review the output files:
You should have output files in the following directories:
- ensemble_stat_HRRRE_F000_ADPSFC_20180709_120000V_ecnt.txt
- ensemble_stat_HRRRE_F000_ADPSFC_20180709_120000V_orank.txt
- ensemble_stat_HRRRE_F000_ADPSFC_20180709_120000V_phist.txt
- ensemble_stat_HRRRE_F000_ADPSFC_20180709_120000V_relp.txt
- ensemble_stat_HRRRE_F000_ADPSFC_20180709_120000V_rhist.txt
- ensemble_stat_HRRRE_F000_ADPSFC_20180709_120000V_ssvar.txt
- ensemble_stat_HRRRE_F000_ADPSFC_20180709_120000V.stat
- ensemble_stat_HRRRE_F001_ADPSFC_20180709_130000V_ecnt.txt
- ensemble_stat_HRRRE_F001_ADPSFC_20180709_130000V_orank.txt
- ensemble_stat_HRRRE_F001_ADPSFC_20180709_130000V_phist.txt
- ensemble_stat_HRRRE_F001_ADPSFC_20180709_130000V_relp.txt
- ensemble_stat_HRRRE_F001_ADPSFC_20180709_130000V_rhist.txt
- ensemble_stat_HRRRE_F001_ADPSFC_20180709_130000V_ssvar.txt
- ensemble_stat_HRRRE_F001_ADPSFC_20180709_130000V.stat
- ensemble_stat_HRRRE_F002_ADPSFC_20180709_140000V_ecnt.txt
- ensemble_stat_HRRRE_F002_ADPSFC_20180709_140000V_orank.txt
- ensemble_stat_HRRRE_F002_ADPSFC_20180709_140000V_phist.txt
- ensemble_stat_HRRRE_F002_ADPSFC_20180709_140000V_relp.txt
- ensemble_stat_HRRRE_F002_ADPSFC_20180709_140000V_rhist.txt
- ensemble_stat_HRRRE_F002_ADPSFC_20180709_140000V_ssvar.txt
- ensemble_stat_HRRRE_F002_ADPSFC_20180709_140000V.stat
- gen_ens_prod_HRRRE_F000_ADPSFC_20180709_120000V_ens.nc
- gen_ens_prod_HRRRE_F001_ADPSFC_20180709_130000V_ens.nc
- gen_ens_prod_HRRRE_F002_ADPSFC_20180709_140000V_ens.nc
We can tell from the name of the files, specifically the _F00x_ string, that three different lead times were run: 0, 1, and 2. These ended up as valid times of 12z, 13z, and 14z, as indicated by the 1x0000V strings.
This stat file contains all of the statistic line types that were requested in the configuration file. If we wanted to review only one line type at a time, ECNT for example, we could instead look at the ASCII file for this lead time with the _ecnt string at the end of the file name. That file exists because the use case configuration file set:
signaling that the ecnt line type results should be in the .stat file, as well as a separate ASCII file.
- Review the log output:
Log files for this run are found in ${METPLUS_TUTORIAL_DIR}/output/Ensemble/logs. The filename contains a timestamp of the day and time that the use case was run.
-
Review the Final Configuration File:
The final configuration file is called metplus_final.conf.*, where the final string is a timestamp of the day and time the use case was run, similar to the log files. This contains all of the configuration variables used in the run. If you complete a run of METplus and are unsure how the system interpreted something from the configuration file, this is a great source of information.