Benchmarking

The GEOS-Chem Support Team uses GCPy to produce comparison plots and summary tables from GEOS-Chem benchmark simulations. In this chapter we will describe this capability of GCPy.

Location of benchmark plotting scripts

The source code for creating benchmark plots is located in the gcpy/benchmark directory tree.

Contents of the gcpy/benchmark directory

File or folder

Description

run_benchmark.py

Benchmark driver script

benchmark_slurm.sh

Bash script to submit run_benchmark,py as a SLURM batch job

cloud/

Directory containing template config files (in YAML format) for 1-hour and 1-month benchmark plot jobs on the AWS cloud.

config/

Directory containing editable config files (in YAML format) for 1-month and 1-year benchmark plot jobs.

__init__.py

Python import script

modules/

Contains Python modules imported into the run_benchmark.py script.

README.md

Readme file in Markdown format

Note

As of this writing, the benchmarking scripts still use several plotting and tabling functions from module gcpy.benchmark_funcs. We are currently in the process of moving the functions contained in gcpy.benchmark_funcs to the gcpy/benchmark/modules directory.

Steps to create benchmark plots

Follow these instructions to create comparison plots and summary tables from GEOS-Chem benchmark simulations.

  1. Copy a configuration file from the gcpy/benchmark/config directory.

    In this example we will use the configuration file that will create plots from 1-year full-chemistry benchmark simulations. (Configuration files for other benchmark types have a similar layout.)

    $ cp /path/to/GCPy/gcpy/benchmark/config/1yr_fullchem_benchmark.yml .
    


  2. Edit the paths section of the configuration file to specify the proper directory paths for your system.

    # Configuration for 1-year FullChemBenchmark
    #
    # paths:
    #   main_dir:     High-level directory containing ref & dev rundirs
    #   results_dir:  Directory where plots/tables will be created
    #   weights_dir:  Path to regridding weights
    #   spcdb_dir:    Folder in which the species_database.yml file is
    #                  located.  If set to "default", then will look for
    #                  species_database.yml in one of the Dev rundirs.
    #   obs_data_dir: Path to observational data (for models vs obs plots)
    #
    paths:
      main_dir: /path/to/benchmark/main/dir    # EDIT AS NEEDED
      results_dir: /path/to/BenchmarkResults   # EDIT AS NEEDED
      weights_dir: /n/holyscratch01/external_repos/GEOS-CHEM/gcgrid/data/ExtData/GCHP/RegriddingWeights
      spcdb_dir: default
    #
    # Observational data dirs are on Harvard Cannon, edit if necessary
    #
    obs_data:
      ebas_o3:
        data_dir: /n/jacob_lab/Lab/obs_data_for_bmk/ebas_sfc_o3_2019
        data_label: "O3 (EBAS, 2019)"
      sondes:
        data_dir: /n/jacob_lab/Lab/obs_data_for_bmk/sondes_2010-2019
        data_file: allozonesondes_2010-2019.csv
        site_file: allozonesondes_site_elev.csv
    


  3. Edit the data section to specify the directories (and labels) for the Ref and Dev versions for GEOS-Chem Classic and GCHP.

    #
    # data: Contains configurations for ref and dev runs
    #   version:         Version string (must not contain spaces)
    #   dir:             Path to run directory
    #   outputs_subdir:  Subdirectory w/ GEOS-Chem diagnostic files
    #   restarts_subdir: Subdirectory w/ GEOS-Chem restarts
    #   bmk_start:       Simulation start date (YYYY-MM-DDThh:mm:ss)
    #   bmk_end:         Simulation end date (YYYY-MM-DDThh:mm:ss)
    #   resolution:      GCHP resolution string
    #
    data:
      ref:
        gcc:
          version: GCC_ref
          dir: GCC_ref
          outputs_subdir: OutputDir
          restarts_subdir: Restarts
          logs_subdir: .
          logs_template: "GC.log"
          bmk_start: "2019-07-01T00:00:00"
          bmk_end: "2019-08-01T00:00:00"
        gchp:
          version: GCHP_ref
          dir: GCHP_ref
          outputs_subdir: OutputDir
          restarts_subdir: Restarts
          logs_subdir: .
          logs_template: "gchp.%Y%m%d_0000z.log"
          bmk_start: "2019-07-01T00:00:00"
          bmk_end: "2019-08-01T00:00:00"
          is_pre_14.0: False
          resolution: c24
      dev:
        gcc:
          version: GCC_dev
          dir: GCC_dev
          outputs_subdir: OutputDir
          restarts_subdir: Restarts
          logs_subdir: .
          logs_template: "GC.log"
          bmk_start: "2019-07-01T00:00:00"
          bmk_end: "2019-08-01T00:00:00"
        gchp:
          version: GCHP_dev
          dir: GCHP_dev
          outputs_subdir: OutputDir
          restarts_subdir: Restarts
          logs_subdir: Logs
          logs_template: "gchp.%Y%m%d_0000z.log"
          bmk_start: "2019-07-01T00:00:00"
          bmk_end: "2019-08-01T00:00:00"
          is_pre_14.0: False
          resolution: c24
    


  4. Edit the comparisons section to specify the types of comparisons you would like to perform.

    #
    # comparisons: Specifies the comparisons to perform.
    #
    comparisons:
      gcc_vs_gcc:
        run: True
        dir: GCC_version_comparison
        tables_subdir: Tables
      gchp_vs_gcc:
        run: True
        dir: GCHP_GCC_comparison
        tables_subdir: Tables
      gchp_vs_gchp:
        run: True
        dir: GCHP_version_comparison
        tables_subdir: Tables
      gchp_vs_gcc_diff_of_diffs:
        run: True
        dir: GCHP_GCC_diff_of_diffs
    


  5. Edit the outputs section to select the plots and tables that you would like to generate.

    #
    # outputs: Specifies the plots and tables to generate
    #
    outputs:
      plot_conc: True
      plot_emis: True
      emis_table: True
      plot_jvalues: True
      plot_aod: True
      plot_drydep: False  # Need to save out DryDep collection for 1-mo
      mass_table: True
      mass_accum_table: False
      ops_budget_table: False
      OH_metrics: True
      ste_table: True # GCC only
      timing_table: True
      summary_table: True
      plot_options:
        by_spc_cat: True
        by_hco_cat: True
    


  6. Edit the n_cores setting if you wish to change the number of computational cores to use. If not, leave n_cores set to -1, which will use as many cores as possible.

    #
    # n_cores: Specify the number of cores to use.
    # -1: Use $OMP_NUM_THREADS         cores
    # -2: Use $OMP_NUM_THREADS - 1     cores
    # -N: Use $OMP_NUM_THREADS - (N-1) cores
    #  1: Disable parallelization (use a single core)
    #
    n_cores: -1
    


  7. Run the run.benchmark.py script. You may do this in 2 ways:

    1. Direct execution from the command line:

      (gcpy_env) $ python -m gcpy.benchmark.run_benchmark
      1yr_fullchem_benchmark.yml
      
    2. Batch execution with the SLURM scheduler. First, copy the benchmark_slurm.sh script to your current directory:

      (gcpy_env) $ cp /path/to/GCPy/gcpy/benchmark/benchmark_slurm.sh .
      

      Next, edit your local copy of benchmark_slurm.sh to specify your SLURM partition name, number of cores, the name of your Python environment and the configuration file to use.

      #!/bin/bash
      
      #SBATCH -c 8
      #SBATCH -N 1
      #SBATCH -t 0-4:00
      #SBATCH -p seas_compute,shared
      #SBATCH --mem=100000
      #SBATCH --mail-type=END
      
      #============================================================================
      # This us a sample SLURM script that you can use to run the GCPy
      # benchmark plotting code as a SLURM batch job.
      #
      # You can modify the SLURM parameters above for your setup.
      #
      # Tip: Using less cores can reduce the amount of memory required.
      #============================================================================
      
      # Apply all bash initialization settings
      . ~/.bashrc
      
      # Make sure to set multiple threads; Joblib will use multiple
      # cores to parallelize certain plotting operations.
      export OMP_NUM_THREADS=$SLURM_CPUS_PER_TASK
      export OMP_STACKSIZE=500m
      
      # Turn on Python environment (edit for your setup)
      mamba activate gcpy_env
      
      # Specify a YAML file with benchmark options
      # Uncomment the file that you wish:
      #config="1mo_benchmark.yml"
      config="1yr_fullchem_benchmark.yml"
      #config="1yr_tt_benchmark.yml"
      
      # Call the run_benchmark script to make the plots
      python -m gcpy.benchmark.run_benchmark "${config}" > "${config/.yml/.log}" 2>&1
      
      # Turn off python environment
      mamba deactivate
      
      exit 0
      

      Lastly, start the SLURM batch execution with this command:

      $ sbatch benchmark_slurm.sh
      

Benchmark plotting functions

Module gcpy.benchmark_funcs contains several functions for creating plots and tables from GEOS-Chem benchmark simulations. The specific outputs generated have been requested by the GEOS-Chem Steering Committee in order to facilitate comparing benchmark output from different model versions.

In this section, we will describe functions that create comparison plots from GEOS-Chem benchmark simulation output. The functions to create summary tables will be described in a separate section.

Note

We are working towards moving all benchmark-related source code to the gcpy/benchmark/ directory tree. For the time being, the benchmark_funcs.py script is located in the /path/to/GCPy/gcpy/ directory.

Functions creating six-panel comparison plots

Function

Plot that it creates

make_benchmark_aod_plots

Aerosol optical depth

make_benchmark_conc_plots

Species concentrations

make_benchmark_drydep_plots

Dry deposition velocities

make_benchmark_emis_plots

Emissions (by species and catgegory)

make_benchmark_jvalue_plots

J-values (photolysis)

make_benchmark_wetdep_plots

Wet deposition of soluble species

Functions creating model vs. observation plots

Function

Plot that it creates

make_benchmark_model_vs_obs_plots

Modeled ozone vs. surface observations

make_benchmark_model_vs_sondes_plots

Vertical profiles of modeled ozone vs. ozonesondes

The functions listed above create comparison plots of most GEOS-Chem output variables divided into specific categories, e.g. species categories such as Aerosols or Bromine for the SpeciesConcVV diagnostic. In eachcategory, these function create single level PDFs for the surface and 500hPa and zonal mean PDFs for the entire atmosphere and only the stratosphere (defined a 1-100hPa). For make_benchmark_emis_plots(), only single level plots at the surface are produced. All of these plotting functions include bookmarks within the generated PDFs that point to the pages containing each plotted quantity. Thus these functions serve as tools for quickly creating comprehensive plots comparing two GEOS-Chem runs. These functions are used to create the publicly available plots for 1-month and 1-year benchmarks of new versions of GEOS-Chem.

Many of the plotting functions listed above use pre-defined lists of variables in YAML files. If one dataset includes a variable but the other dataset does not, the data for that variable in the latter dataset will be considered to be NaN and will be plotted as such.

make_benchmark_aod_plots

Located in module: gcpy.benchmark.modules.benchmark_funcs

This function creates column optical depth plots using the Aerosols diagnostic output.

def make_benchmark_aod_plots(
        ref,
        refstr,
        dev,
        devstr,
        varlist=None,
        dst="./benchmark",
        subdst=None,
        cmpres=None,
        overwrite=False,
        verbose=False,
        log_color_scale=False,
        sigdiff_files=None,
        weightsdir='.',
        n_job=-1,
        time_mean=False,
        spcdb_dir=os.path.dirname(__file__)
):
    """
    Creates PDF files containing plots of column aerosol optical
    depths (AODs) for model benchmarking purposes.

    Args:
        ref: str
            Path name for the "Ref" (aka "Reference") data set.
        refstr: str
            A string to describe ref (e.g. version number)
        dev: str
            Path name for the "Dev" (aka "Development") data set.
            This data set will be compared against the "Reference"
            data set.
        devstr: str
            A string to describe dev (e.g. version number)

    Keyword Args (optional):
        varlist: list of str
            List of AOD variables to plot.  If not passed, then all
            AOD variables common to both Dev and Ref will be plotted.
            Use the varlist argument to restrict the number of
            variables plotted to the pdf file when debugging.
            Default value: None
        dst: str
            A string denoting the destination folder where a
            PDF file  containing plots will be written.
            Default value: ./benchmark.
        subdst: str
            A string denoting the sub-directory of dst where PDF
            files containing plots will be written.  In practice,
            subdst is only needed for the 1-year benchmark output,
            and denotes a date string (such as "Jan2016") that
            corresponds to the month that is being plotted.
            Default value: None
        cmpres: string
            Grid resolution at which to compare ref and dev data, e.g. '1x1.25'
        overwrite: bool
            Set this flag to True to overwrite files in the
            destination folder (specified by the dst argument).
            Default value: False.
        verbose: bool
            Set this flag to True to print extra informational output.
            Default value: False
        log_color_scale: bool
            Set this flag to True to enable plotting data (not diffs)
            on a log color scale.
            Default value: False
        sigdiff_files: list of str
            Filenames that will contain the list of quantities having
            having significant differences in the column AOD plots.
            These lists are needed in order to fill out the benchmark
            approval forms.
            Default value: None
        weightsdir: str
            Directory in which to place (and possibly reuse) xESMF regridder
            netCDF files.
            Default value: '.'
        n_job: int
            Defines the number of simultaneous workers for parallel plotting.
            Set to 1 to disable parallel plotting. Value of -1 allows the
            application to decide.
            Default value: -1
        spcdb_dir: str
            Directory of species_datbase.yml file
            Default value: Directory of GCPy code repository
        time_mean : bool
            Determines if we should average the datasets over time
            Default value: False
    """

make_benchmark_conc_plots

Located in module: gcpy.benchmark.modules.benchmark_funcs

Generates species concentration plots using the SpeciesConc diagnostic output by default. In particular:

  • This function is the only benchmark plotting function that supports diff-of-diffs plotting, in which 4 datasets are passed and the differences between two groups of Ref datasets vs. two groups of Dev datasets is plotted (typically used for comparing changes in GCHP vs. changes in GEOS-Chem Classic across model versions).

  • This is also the only benchmark plotting function that sends plots to separate folders based on category (as denoted by the plot_by_spc_cat flag). The full list of species categories is denoted in benchmark_categories.yml (included in GCPy).

  • In this function, parallelization occurs at the species category level. In all other functions, parallelization occurs within calls to compare_single_level() and compare_zonal_mean().=

def make_benchmark_conc_plots(
        ref,
        refstr,
        dev,
        devstr,
        dst="./benchmark",
        subdst=None,
        overwrite=False,
        verbose=False,
        collection="SpeciesConc",
        benchmark_type="FullChemBenchmark",
        cmpres=None,
        plot_by_spc_cat=True,
        restrict_cats=[],
        plots=["sfc", "500hpa", "zonalmean"],
        use_cmap_RdBu=False,
        log_color_scale=False,
        sigdiff_files=None,
        normalize_by_area=False,
        cats_in_ugm3=["Aerosols", "Secondary_Organic_Aerosols"],
        areas=None,
        refmet=None,
        devmet=None,
        weightsdir='.',
        n_job=-1,
        second_ref=None,
        second_dev=None,
        time_mean=False,
        spcdb_dir=os.path.dirname(__file__)
):
    """
    Creates PDF files containing plots of species concentration
    for model benchmarking purposes.

    Args:
        ref: str
            Path name for the "Ref" (aka "Reference") data set.
        refstr: str
            A string to describe ref (e.g. version number)
        dev: str
            Path name for the "Dev" (aka "Development") data set.
            This data set will be compared against the "Reference"
            data set.
        devstr: str
            A string to describe dev (e.g. version number)

    Keyword Args (optional):
        dst: str
            A string denoting the destination folder where a PDF
            file containing plots will be written.
            Default value: ./benchmark
        subdst: str
            A string denoting the sub-directory of dst where PDF
            files containing plots will be written.  In practice,
            subdst is only needed for the 1-year benchmark output,
            and denotes a date string (such as "Jan2016") that
            corresponds to the month that is being plotted.
            Default value: None
        overwrite: bool
            Set this flag to True to overwrite files in the
            destination folder (specified by the dst argument).
            Default value: False
        verbose: bool
            Set this flag to True to print extra informational output.
            Default value: False
        collection: str
            Name of collection to use for plotting.
            Default value: "SpeciesConc"
        benchmark_type: str
            A string denoting the type of benchmark output to plot, options are
            FullChemBenchmark, TransportTracersBenchmark, or CH4Benchmark.
            Default value: "FullChemBenchmark"
        cmpres: string
            Grid resolution at which to compare ref and dev data, e.g. '1x1.25'
        plot_by_spc_cat: logical
            Set this flag to False to send plots to one file rather
            than separate file per category.
            Default value: True
        restrict_cats: list of strings
            List of benchmark categories in benchmark_categories.yml to make
            plots for. If empty, plots are made for all categories.
            Default value: empty
        plots: list of strings
            List of plot types to create.
            Default value: ['sfc', '500hpa', 'zonalmean']
        log_color_scale: bool
            Set this flag to True to enable plotting data (not diffs)
            on a log color scale.
            Default value: False
        normalize_by_area: bool
            Set this flag to true to enable normalization of data
            by surfacea area (i.e. kg s-1 --> kg s-1 m-2).
            Default value: False
        cats_in_ugm3: list of str
            List of benchmark categories to to convert to ug/m3
            Default value: ["Aerosols", "Secondary_Organic_Aerosols"]
        areas: dict of xarray DataArray:
            Grid box surface areas in m2 on Ref and Dev grids.
            Default value: None
        refmet: str
            Path name for ref meteorology
            Default value: None
        devmet: str
            Path name for dev meteorology
            Default value: None
        sigdiff_files: list of str
            Filenames that will contain the lists of species having
            significant differences in the 'sfc', '500hpa', and
            'zonalmean' plots.  These lists are needed in order to
            fill out the benchmark approval forms.
            Default value: None
        weightsdir: str
            Directory in which to place (and possibly reuse) xESMF regridder
            netCDF files.
            Default value: '.'
        n_job: int
            Defines the number of simultaneous workers for parallel plotting.
            Set to 1 to disable parallel plotting. Value of -1 allows the
            application to decide.
            Default value: -1
        second_ref: str
            Path name for a second "Ref" (aka "Reference") data set for
            diff-of-diffs plotting. This dataset should have the same model
            type and grid as ref.
            Default value: None
        second_dev: str
            Path name for a second "Ref" (aka "Reference") data set for
            diff-of-diffs plotting. This dataset should have the same model
            type and grid as ref.
            Default value: None
        spcdb_dir: str
            Directory of species_datbase.yml file
            Default value: Directory of GCPy code repository
        time_mean : bool
            Determines if we should average the datasets over time
            Default value: False
    """

make_benchmark_drydep_plots

Located in module: gcpy.benchmark.modules.benchmark_drydep

Generates plots of dry deposition velocities using the GEOS-Chem DryDep diagnostic output.

def make_benchmark_drydep_plots(
        ref,
        refstr,
        dev,
        devstr,
        collection="DryDep",
        dst="./benchmark",
        subdst=None,
        cmpres=None,
        overwrite=False,
        verbose=False,
        log_color_scale=False,
        weightsdir=".",
        sigdiff_files=None,
        n_job=-1,
        time_mean=False,
        varlist=None,
        spcdb_dir=os.path.join(os.path.dirname(__file__), "..", "..")
):
    """
    Creates six-panel comparison plots (PDF format) from GEOS-Chem
    benchmark simualtion output.  Can be used with data collections
    that do not require special handling (e.g. concentrations).

    Args:
        ref: str
            Path name for the "Ref" (aka "Reference") data set.
        refstr: str
            A string to describe ref (e.g. version number)
        dev: str
            Path name for the "Dev" (aka "Development") data set.
            This data set will be compared against the "Reference"
            data set.
        devstr: str
            A string to describe dev (e.g. version number)

    Keyword Args (optional):
        collection : str
            Name of the diagnostic collection (e.g. "DryDep")
        dst: str
            A string denoting the destination folder where a PDF
            file containing plots will be written.
            Default value: ./benchmark
        subdst: str
            A string denoting the sub-directory of dst where PDF
            files containing plots will be written.  In practice,
            subdst is only needed for the 1-year benchmark output,
            and denotes a date string (such as "Jan2016") that
            corresponds to the month that is being plotted.
            Default value: None
        benchmark_type: str
            A string denoting the type of benchmark output to plot, options are
            FullChemBenchmark, TransportTracersBenchmark, or CH4Benchmark.
            Default value: "FullChemBenchmark"
        overwrite: bool
            Set this flag to True to overwrite files in the
            destination folder (specified by the dst argument).
            Default value: False.
        verbose: bool
            Set this flag to True to print extra informational output.
            Default value: False.
        n_job: int
            Defines the number of simultaneous workers for parallel plotting.
            Set to 1 to disable parallel plotting. Value of -1 allows the
            application to decide.
            Default value: -1
        spcdb_dir: str
            Directory of species_datbase.yml file
            Default value: Directory of GCPy code repository
        time_mean : bool
            Determines if we should average the datasets over time
            Default value: False
        varlist: list of str
            List of variables to plot.  If varlist is None, then
            all common variables in Ref & Dev will be plotted.
    """

make_benchmark_emis_plots

Located in module: gcpy.benchmark.modules.benchmark_funcs

Generates plots of total emissions using output from HEMCO_diagnostics.* (for GEOS-Chem Classic) and/or GCHP.Emissions.* output files.

def make_benchmark_emis_plots(
        ref,
        refstr,
        dev,
        devstr,
        dst="./benchmark",
        subdst=None,
        plot_by_spc_cat=False,
        plot_by_hco_cat=False,
        benchmark_type="FullChemBenchmark",
        cmpres=None,
        overwrite=False,
        verbose=False,
        flip_ref=False,
        flip_dev=False,
        log_color_scale=False,
        sigdiff_files=None,
        weightsdir='.',
        n_job=-1,
        time_mean=False,
        spcdb_dir=os.path.dirname(__file__)
):
    """
    Creates PDF files containing plots of emissions for model
    benchmarking purposes. This function is compatible with benchmark
    simulation output only. It is not compatible with transport tracers
    emissions diagnostics.

    Args:
        ref: str
            Path name for the "Ref" (aka "Reference") data set.
        refstr: str
            A string to describe ref (e.g. version number)
        dev: str
            Path name for the "Dev" (aka "Development") data set.
            This data set will be compared against the "Reference"
            data set.
        devstr: str
            A string to describe dev (e.g. version number)

    Keyword Args (optional):
        dst: str
            A string denoting the destination folder where
            PDF files containing plots will be written.
            Default value: './benchmark
        subdst: str
            A string denoting the sub-directory of dst where PDF
            files containing plots will be written.  In practice,
            and denotes a date string (such as "Jan2016") that
            corresponds to the month that is being plotted.
            Default value: None
        plot_by_spc_cat: bool
            Set this flag to True to separate plots into PDF files
            according to the benchmark species categories (e.g. Oxidants,
            Aerosols, Nitrogen, etc.)  These categories are specified
            in the YAML file benchmark_species.yml.
            Default value: False
        plot_by_hco_cat: bool
            Set this flag to True to separate plots into PDF files
            according to HEMCO emissions categories (e.g. Anthro,
            Aircraft, Bioburn, etc.)
            Default value: False
        benchmark_type: str
            A string denoting the type of benchmark output to plot, options are
            FullChemBenchmark, TransportTracersBenchmark, or CH4Benchmark.
            Default value: "FullChemBenchmark"
        cmpres: string
            Grid resolution at which to compare ref and dev data, e.g. '1x1.25'
        overwrite: bool
            Set this flag to True to overwrite files in the
            destination folder (specified by the dst argument).
            Default value: False
        verbose: bool
            Set this flag to True to print extra informational output.
            Default value: False
        flip_ref: bool
            Set this flag to True to reverse the vertical level
            ordering in the "Ref" dataset (in case "Ref" starts
            from the top of atmosphere instead of the surface).
            Default value: False
        flip_dev: bool
            Set this flag to True to reverse the vertical level
            ordering in the "Dev" dataset (in case "Dev" starts
            from the top of atmosphere instead of the surface).
            Default value: False
        log_color_scale: bool
            Set this flag to True to enable plotting data (not diffs)
            on a log color scale.
            Default value: False
         sigdiff_files: list of str
            Filenames that will contain the lists of species having
            significant differences in the 'sfc', '500hpa', and
            'zonalmean' plots.  These lists are needed in order to
            fill out the benchmark approval forms.
            Default value: None
        weightsdir: str
            Directory in which to place (and possibly reuse) xESMF regridder
            netCDF files.
            Default value: '.'
        n_job: int
            Defines the number of simultaneous workers for parallel plotting.
            Set to 1 to disable parallel plotting.
            Value of -1 allows the application to decide.
            Default value: -1
        spcdb_dir: str
            Directory of species_datbase.yml file
            Default value: Directory of GCPy code repository
        time_mean : bool
            Determines if we should average the datasets over time
            Default value: False

    Remarks:
        (1) If both plot_by_spc_cat and plot_by_hco_cat are
            False, then all emission plots will be placed into the
            same PDF file.

        (2) Emissions that are 3-dimensional will be plotted as
            column sums.
           column sums.
"""

make_benchmark_jvalue_plots

Located in module: gcpy.benchmark.modules.benchmark_funcs

Generates plots of J-values using the GEOS-Chem JValues diagnostic output.

def make_benchmark_jvalue_plots(
        ref,
        refstr,
        dev,
        devstr,
        varlist=None,
        dst="./benchmark",
        subdst=None,
        local_noon_jvalues=False,
        cmpres=None,
        plots=["sfc", "500hpa", "zonalmean"],
        overwrite=False,
        verbose=False,
        flip_ref=False,
        flip_dev=False,
        log_color_scale=False,
        sigdiff_files=None,
        weightsdir='.',
        n_job=-1,
        time_mean=False,
        spcdb_dir=os.path.dirname(__file__)
):
    """
    Creates PDF files containing plots of J-values for model
    benchmarking purposes.

    Args:
        ref: str
            Path name for the "Ref" (aka "Reference") data set.
        refstr: str
            A string to describe ref (e.g. version number)
        dev: str
            Path name for the "Dev" (aka "Development") data set.
            This data set will be compared against the "Reference"
            data set.
        devstr: str
            A string to describe dev (e.g. version number)

    Keyword Args (optional):
        varlist: list of str
            List of J-value variables to plot.  If not passed,
            then all J-value variables common to both dev
            and ref will be plotted.  The varlist argument can be
            a useful way of restricting the number of variables
            plotted to the pdf file when debugging.
            Default value: None
        dst: str
            A string denoting the destination folder where a
            PDF file  containing plots will be written.
            Default value: ./benchmark.
        subdst: str
            A string denoting the sub-directory of dst where PDF
            files containing plots will be written.  In practice,
            subdst is only needed for the 1-year benchmark output,
            and denotes a date string (such as "Jan2016") that
            corresponds to the month that is being plotted.
            Default value: None
        local_noon_jvalues: bool
            Set this flag to plot local noon J-values.  This will
            divide all J-value variables by the JNoonFrac counter,
            which is the fraction of the time that it was local noon
            at each location.
            Default value: False
        cmpres: string
            Grid resolution at which to compare ref and dev data, e.g. '1x1.25'
        plots: list of strings
            List of plot types to create.
            Default value: ['sfc', '500hpa', 'zonalmean']
        overwrite: bool
            Set this flag to True to overwrite files in the
            destination folder (specified by the dst argument).
            Default value: False.
        verbose: bool
            Set this flag to True to print extra informational output.
            Default value: False
        flip_ref: bool
            Set this flag to True to reverse the vertical level
            ordering in the "Ref" dataset (in case "Ref" starts
            from the top of atmosphere instead of the surface).
            Default value: False
        flip_dev: bool
            Set this flag to True to reverse the vertical level
            ordering in the "Dev" dataset (in case "Dev" starts
            from the top of atmosphere instead of the surface).
            Default value: False
        log_color_scale: bool
            Set this flag to True if you wish to enable plotting data
            (not diffs) on a log color scale.
            Default value: False
        sigdiff_files: list of str
            Filenames that will contain the lists of J-values having
            significant differences in the 'sfc', '500hpa', and
            'zonalmean' plots.  These lists are needed in order to
            fill out the benchmark approval forms.
            Default value: None
        weightsdir: str
            Directory in which to place (and possibly reuse) xESMF regridder
            netCDF files.
            Default value: '.'
        n_job: int
            Defines the number of simultaneous workers for parallel plotting.
            Set to 1 to disable parallel plotting. Value of -1 allows the
            application to decide.
            Default value: -1
        spcdb_dir: str
            Directory of species_datbase.yml file
            Default value: Directory of GCPy code repository
        time_mean : bool
            Determines if we should average the datasets over time
            Default value: False

    Remarks:
         Will create 4 files containing J-value plots:
            (1 ) Surface values
            (2 ) 500 hPa values
            (3a) Full-column zonal mean values.
            (3b) Stratospheric zonal mean values
         These can be toggled on/off with the plots keyword argument.

         At present, we do not yet have the capability to split the
         plots up into separate files per category (e.g. Oxidants,
         Aerosols, etc.).  This is primarily due to the fact that
         we archive J-values from GEOS-Chem for individual species
         but not family species.  We could attempt to add this
         functionality later if there is sufficient demand.
    """

make_benchmark_wetdep_plots

Located in module: gcpy.benchmark.modules.benchmark_funcs

Generates plots of wet deposition using the GEOS-Chem WetLossConv and WetLossLS diagnostic outputs. It is currently primarily used for 1-Year Transport Tracer benchmarks, plotting values for the following species as defined in benchmark_categories.yml

def make_benchmark_wetdep_plots(
        ref,
        refstr,
        dev,
        devstr,
        collection,
        dst="./benchmark",
        cmpres=None,
        datestr=None,
        overwrite=False,
        verbose=False,
        benchmark_type="TransportTracersBenchmark",
        plots=["sfc", "500hpa", "zonalmean"],
        log_color_scale=False,
        normalize_by_area=False,
        areas=None,
        refmet=None,
        devmet=None,
        weightsdir='.',
        n_job=-1,
        time_mean=False,
        spcdb_dir=os.path.dirname(__file__)
):
    """
    Creates PDF files containing plots of species concentration
    for model benchmarking purposes.

    Args:
        ref: str
            Path name for the "Ref" (aka "Reference") data set.
        refstr: str
            A string to describe ref (e.g. version number)
        dev: str
            Path name for the "Dev" (aka "Development") data set.
            This data set will be compared against the "Reference"
            data set.
        devstr: str
            A string to describe dev (e.g. version number)
        collection: str
            String name of collection to plot comparisons for.

    Keyword Args (optional):
        dst: str
            A string denoting the destination folder where a PDF
            file containing plots will be written.
            Default value: ./benchmark
        datestr: str
            A string with date information to be included in both the
            plot pdf filename and as a destination folder subdirectory
            for writing plots
            Default value: None
        benchmark_type: str
            A string denoting the type of benchmark output to plot, options are
            FullChemBenchmark, TransportTracersBenchmark, or CH4Benchmark.
            Default value: "FullChemBenchmark"
        overwrite: bool
            Set this flag to True to overwrite files in the
            destination folder (specified by the dst argument).
            Default value: False.
        verbose: bool
            Set this flag to True to print extra informational output.
            Default value: False.
        plots: list of strings
            List of plot types to create.
            Default value: ['sfc', '500hpa', 'zonalmean']
        normalize_by_area: bool
            Set this flag to true to enable normalization of data
            by surfacea area (i.e. kg s-1 --> kg s-1 m-2).
            Default value: False
        areas: dict of xarray DataArray:
            Grid box surface areas in m2 on Ref and Dev grids.
            Default value: None
        refmet: str
            Path name for ref meteorology
            Default value: None
        devmet: str
            Path name for dev meteorology
            Default value: None
        n_job: int
            Defines the number of simultaneous workers for parallel plotting.
            Set to 1 to disable parallel plotting. Value of -1 allows the
            application to decide.
            Default value: -1
        spcdb_dir: str
            Directory of species_datbase.yml file
            Default value: Directory of GCPy code repository
        time_mean : bool
            Determines if we should average the datasets over time
            Default value: False
    """

make_benchmark_model_vs_obs_plots

Located in module: gcpy.benchmark.modules.benchmark_models_vs_obs

Gnerates plots of monthly-averaged modeled surface ozone concentrations (using the GEOS-Chem SpeciesConc diagnostic outputs) vs. the EBAS 2019 observations.

Note

Model vs. observation plots are only available in 1-year full-chemistry benchmarks.

def make_benchmark_models_vs_obs_plots(
        obs_filepaths,
        obs_label,
        ref_filepaths,
        ref_label,
        dev_filepaths,
        dev_label,
        varname="SpeciesConcVV_O3",
        dst="./benchmark",
        verbose=False,
        overwrite=False
):
    """
    Driver routine to create model vs. observation plots.

    Args:
    obs_filepaths : str|list : Path(s) to the observational data.
    obs_label     : str      : Label for the observational data
    ref_filepaths : str      : Paths to the Ref model data
    ref_label     : str      : Label for the Ref model data
    dev_filepaths : str      : Paths to the Dev model data
    dev_label     : str      : Label for the Dev model data
    varname       : str      : Variable name for model data
    dst           : str      : Destination folder for plots
    verbose       : bool     : Toggles verbose output on/off
    overwrite     : bool     : Toggles overwriting contents of dst

make_benchmark_model_vs_sondes_plots

Located in module: gcpy.benchmark.modules.benchmark_models_vs_sondes

Note

Model vs. ozonesonde plots are only available in 1-year full-chemistry benchmarks.

Generates vertical profiles of modeled ozone concentrations (using the GEOS-Chem SpeciesConc diagnostic outputs) vs. ozonesonde observations.

def make_benchmark_models_vs_sondes_plots(
        obs_data_file,
        obs_site_file,
        ref_filepaths,
        ref_label,
        dev_filepaths,
        dev_label,
        dst="./benchmark",
        overwrite=False,
        varname="SpeciesConcVV_O3",

    ):
    """
    Creates plots of sonde data vs. GEOS-Chem output.  For use in the
    1-year benchmark plotting workflow.

    Args
    obs_data_file : str      : File containing sonde data
    obs_site_file : str      : File containing sonde site metadata
    ref_filepaths : str|list : Files for the GEOS-Chem Ref version
    ref_label     : str      : GEOS-Chem Ref version label
    dev_filepaths : str|list : Files for the GEOS-Chem Dev version
    dev_label     : str      : GEOS-Chem Dev version label

    Keyword Args
    dst           : str      : Folder where PDF w/ plots will be created
    overwrite     : bool     : Overwrite contents of dst folder?
    varname       : str      : GEOS-Chem diagnostic variable name
    verbose       : bool     : Activate verbose printout?
    """

Benchmark tabling functions

Functions creating summary tables

Function

Table that it creates

global_ox_budget

Ox budget (1yr benchmarks only)

make_benchmark_aerosol_tables

Global aerosol burdens (1yr benchmarks only)

make_benchmark_emis_tables

Emissions (by species & inventory)

make_benchmark_gcclassic_timing_table

GEOS-Chem Classic timers output

make_benchmark_gchp_timing_table

GCHP timers output

make_benchmark_mass_tables

Total mass of each species

make_benchmark_mass_accumulation_tables

Mass accumulation for each species

make_benchmark_mass_conservation_table

Timeseries of the PassiveTracer species

make_benchmark_oh_metrics

Global OH metrics

make_benchmark_operations_budget

Species mass after each operation

transport_tracers_budgets

Rn222, Pb210, Be7 budgets (1yr benchmarks only

The functions listed above create summary tables for quantities such as total mass of species, total mass of emissions, and OH metrics.

Many of these functions use pre-defined lists of variables in YAML files. If one dataset includes a variable but the other dataset does not, the data for that variable in the latter dataset will be considered to be NaN and will be plotted as such.

global_ox_budget

Located in module: gcpy.benchmark.modules.budget_ox

Generates a budget table for the Ox (odd ozone) family from 1-year full-chemistry benchmark output.

def global_ox_budget(
        devstr,
        devdir,
        devrstdir,
        year,
        dst='./1yr_benchmark',
        overwrite=True,
        spcdb_dir=None,
        is_gchp=False,
        gchp_res="c24",
        gchp_is_pre_14_0=False
):
    """
    Main program to compute Ox budgets

    Arguments:
        maindir: str
            Top-level benchmark folder
        devstr: str
            Denotes the "Dev" benchmark version.
        year: int
            The year of the benchmark simulation (e.g. 2016).

    Keyword Args (optional):
        dst: str
            Directory where budget tables will be created.
            Default value: './1yr_benchmark'
        overwrite: bool
            Denotes whether to ovewrite existing budget tables.
            Default value: True
        spcdb_dir: str
            Directory where species_database.yml is stored.
            Default value: GCPy directory
        is_gchp: bool
            Denotes if data is from GCHP (True) or GCC (false).
            Default value: False
        gchp_res: str
            GCHP resolution string (e.g. "c24", "c48", etc.)
            Default value: None
        gchp_is_pre_14_0: bool
            Denotes if the version is prior to GCHP 14.0.0 (True)
            or not (False).
            Default value: False
    """

make_benchmark_aerosol_tables

Located in module: gcpy.benchmark.modules.benchmark_funcs

Generates a table of global aerosol budgets and burdens from GEOS-Chem 1-year full-chemistry benchmark simulation output.

def make_benchmark_aerosol_tables(
        devdir,
        devlist_aero,
        devlist_spc,
        devlist_met,
        devstr,
        year,
        days_per_mon,
        dst='./benchmark',
        overwrite=False,
        is_gchp=False,
        spcdb_dir=os.path.dirname(__file__)
):
    """
    Compute FullChemBenchmark aerosol budgets & burdens

    Args:
        devdir: str
            Path to development ("Dev") data directory
        devlist_aero: list of str
            List of Aerosols collection files (different months)
        devlist_spc: list of str
            List of SpeciesConc collection files (different months)
        devlist_met: list of str
            List of meteorology collection files (different months)
        devstr: str
            Descriptive string for datasets (e.g. version number)
        year: str
            The year of the benchmark simulation (e.g. '2016').
        days_per_month: list of int
            List of number of days per month for all months

    Keyword Args (optional):
        dst: str
            Directory where budget tables will be created.
            Default value: './benchmark'
        overwrite: bool
            Overwrite burden & budget tables? (default=True)
            Default value: False
        is_gchp: bool
            Whether datasets are for GCHP
            Default value: False
        spcdb_dir: str
            Directory of species_datbase.yml file
            Default value: Directory of GCPy code repository

    """

make_benchmark_emis_tables

Located in module: gcpy.benchmark.modules.benchmark_funcs

Generates tables of emissions (by species and by inventory) from the the HEMCO_diagnostics* outputs.

def make_benchmark_emis_tables(
        reflist,
        refstr,
        devlist,
        devstr,
        dst="./benchmark",
        benchmark_type="FullChemBenchmark",
        refmet=None,
        devmet=None,
        overwrite=False,
        ref_interval=[2678400.0],
        dev_interval=[2678400.0],
        spcdb_dir=os.path.dirname(__file__)
):
    """
    Creates a text file containing emission totals by species and
    category for benchmarking purposes.

    Args:
        reflist: list of str
             List with the path names of the emissions file or files
             (multiple months) that will constitute the "Ref"
             (aka "Reference") data set.
        refstr: str
            A string to describe ref (e.g. version number)
        devlist: list of str
             List with the path names of the emissions file or files
             (multiple months) that will constitute the "Dev"
             (aka "Development") data set
        devstr: str
            A string to describe dev (e.g. version number)

    Keyword Args (optional):
        dst: str
            A string denoting the destination folder where the file
            containing emissions totals will be written.
            Default value: ./benchmark
        benchmark_type: str
            A string denoting the type of benchmark output to plot, options are
            FullChemBenchmark, TransportTracersBenchmark or CH4Benchmark.
            Default value: "FullChemBenchmark"
        refmet: str
            Path name for ref meteorology
            Default value: None
        devmet: str
            Path name for dev meteorology
            Default value: None
        overwrite: bool
            Set this flag to True to overwrite files in the
            destination folder (specified by the dst argument).
            Default value: False
        ref_interval: list of float
            The length of the ref data interval in seconds. By default, interval
            is set to [2678400.0], which is the number of seconds in July
            (our 1-month benchmarking month).
            Default value: [2678400.0]
        dev_interval: list of float
            The length of the dev data interval in seconds. By default, interval
            is set to [2678400.0], which is the number of seconds in July
            (our 1-month benchmarking month).
            Default value: [2678400.0]
        spcdb_dir: str
            Directory of species_datbase.yml file
            Default value: Directory of GCPy code repository

    """

make_benchmark_gcclassic_timing_table

Located in module: gcpy.benchmark.modules.benchmark_scrape_gcclassic_timers

Generates a comparison table of GEOS-Chem Classic timer values. This can be used to determine if computational bottlenecks have been introduced.

def make_benchmark_gcclassic_timing_table(
        ref_files,
        ref_label,
        dev_files,
        dev_label,
        dst="./benchmark",
        overwrite=False,
):
    """
    Creates a table of timing information for GEOS-Chem Classic
    benchmark simulations given one or more JSON and/or text files
    as input.

    Args
    ref_files : str|list : File(s) with timing info from the "Ref" model
    ref_label : str      : Version string for the "Ref" model
    dev_files : str|list : File(s) with timing info from the "Ref" model
    dev_label : str      : Version string for the "Dev" model

    Kwargs
    dst       : str      : Directory where output will be written
    overwrite : bool     : Overwrite existing files? (default: False)
    """

make_benchmark_gchp_timing_table

Located in module: gcpy.benchmark.modules.benchmark_scrape_gchp_timers

Generates a comparison table of GCHP Classic timer values. This can be used to determine if computational bottlenecks have been introduced.

def make_benchmark_gchp_timing_table(
        ref_files,
        ref_label,
        dev_files,
        dev_label,
        dst="./benchmark",
        overwrite=False,
):
    """
    Creates a table of timing information for GCHP benchmark
    simulations given one or more text files as input.

    Args
    ref_files : str|list : File(s) with timing info from the "Ref" model
    ref_label : str      : Version string for the "Ref" model
    dev_files : str|list : File(s) with timing info from the "Ref" model
    dev_label : str      : Version string for the "Dev" model

    Kwargs
    dst       : str      : Directory where output will be written
    overwrite : bool     : Overwrite existing files? (default: False)
    """

make_benchmark_mass_tables

Located in module: gcpy.benchmark.modules.benchmark_funcs

Generates a comparison table of total mass for each GEOS-Chem species, using the GEOS-Chem restart file output.

def make_benchmark_mass_tables(
        ref,
        refstr,
        dev,
        devstr,
        varlist=None,
        dst="./benchmark",
        subdst=None,
        overwrite=False,
        verbose=False,
        label="at end of simulation",
        spcdb_dir=os.path.dirname(__file__),
        ref_met_extra=None,
        dev_met_extra=None
):
    """
    Creates a text file containing global mass totals by species and
    category for benchmarking purposes.

    Args:
        reflist: str
            Pathname that will constitute
            the "Ref" (aka "Reference") data set.
        refstr: str
            A string to describe ref (e.g. version number)
        dev: list of str
            Pathname that will constitute
            the "Dev" (aka "Development") data set.  The "Dev"
            data set will be compared against the "Ref" data set.
        devstr: str
            A string to describe dev (e.g. version number)

    Keyword Args (optional):
        varlist: list of str
            List of variables to include in the list of totals.
            If omitted, then all variables that are found in either
            "Ref" or "Dev" will be included.  The varlist argument
            can be a useful way of reducing the number of
            variables during debugging and testing.
            Default value: None
        dst: str
            A string denoting the destination folder where the file
            containing emissions totals will be written.
            Default value: ./benchmark
        subdst: str
            A string denoting the sub-directory of dst where PDF
            files containing plots will be written.  In practice,
            subdst is only needed for the 1-year benchmark output,
            and denotes a date string (such as "Jan2016") that
            corresponds to the month that is being plotted.
            Default value: None
        overwrite: bool
            Set this flag to True to overwrite files in the
            destination folder (specified by the dst argument).
            Default value: False
        verbose: bool
            Set this flag to True to print extra informational output.
            Default value: False.
        spcdb_dir: str
            Directory of species_datbase.yml file
            Default value: Directory of GCPy code repository
        ref_met_extra: str
            Path to ref Met file containing area data for use with restart files
            which do not contain the Area variable.
            Default value: ''
        dev_met_extra: str
            Path to dev Met file containing area data for use with restart files
            which do not contain the Area variable.
            Default value: ''
    """

make_benchmark_mass_accumulation_tables

Located in module: gcpy.benchmark.modules.benchmark_funcs

Generates a comparison table of mass accumulation (i.e. mass difference between the start and end of Ref and Dev benchmark simulations), using GEOS-Chem restart files.

def create_mass_accumulation_table(
        refdatastart,
        refdataend,
        refstr,
        refperiodstr,
        devdatastart,
        devdataend,
        devstr,
        devperiodstr,
        varlist,
        met_and_masks,
        label,
        trop_only=False,
        outfilename="GlobalMassAccum_TropStrat.txt",
        verbose=False,
        spcdb_dir=os.path.dirname(__file__)
):
    """
    Creates a table of global mass accumulation for a list of species in
    two data sets.  The data sets, which typically represent output from two
    different model versions, are usually contained in netCDF data files.

    Args:
        refdatastart: xarray Dataset
            The first data set to be compared (aka "Reference").
        refdataend: xarray Dataset
            The first data set to be compared (aka "Reference").
        refstr: str
            A string that can be used to identify refdata
            (e.g. a model version number or other identifier).
        refperiodstr: str
            Ref simulation period start and end
        devdatastart: xarray Dataset
            The second data set to be compared (aka "Development").
        devdataend: xarray Dataset
            The second data set to be compared (aka "Development").
        devstr: str
            A string that can be used to identify the data set specified
            by devfile (e.g. a model version number or other identifier).
        devperiodstr: str
            Ref simulation period start and end
        varlist: list of strings
            List of species concentation variable names to include
            in the list of global totals.
        met_and_masks: dict of xarray DataArray
            Dictionary containing the meterological variables and
            masks for the Ref and Dev datasets.
        label: str
            Label to go in the header string.  Can be used to
            pass the month & year.

    Keyword Args (optional):
        trop_only: bool
            Set this switch to True if you wish to print totals
            only for the troposphere.
            Default value: False (i.e. print whole-atmosphere totals).
        outfilename: str
            Name of the text file which will contain the table of
            emissions totals.
            Default value: "GlobalMass_TropStrat.txt"
        verbose: bool
            Set this switch to True if you wish to print out extra
            informational messages.
            Default value: False
        spcdb_dir: str
            Directory of species_datbase.yml file
            Default value: Directory of GCPy code repository

    Remarks:
        This method is mainly intended for model benchmarking purposes,
        rather than as a general-purpose tool.

        Species properties (such as molecular weights) are read from a
        YAML file called "species_database.yml".
    """

make_benchmark_mass_conservation_table

Located in module: gcpy.benchmark.modules.benchmark_mass_cons_table

Generates a timeseries table of the global mass of the PassiveTracer species. Usually used with output from 1-year TransportTracers benchmark simulations. This is an important check for mass conservation in GEOS-Chem Classic and GCHP.

def make_benchmark_mass_conservation_table(
        datafiles,
        runstr,
        dst="./benchmark",
        overwrite=False,
        areapath=None,
        spcdb_dir=os.path.dirname(__file__)
):
    """
    Creates a text file containing global mass of the PassiveTracer
    from Transport Tracer simulations across a series of restart files.

    Args:
        datafiles: list of str
            Path names of restart files.
        runstr: str
            Name to put in the filename and header of the output file
        refstr: str
            A string to describe ref (e.g. version number)
        dev: str
            Path name of "Dev" (aka "Development") data set file.
            The "Dev" data set will be compared against the "Ref" data set.
        devmet: list of str
            Path name of dev meteorology data set.
        devstr: str
            A string to describe dev (e.g. version number)

    Keyword Args (optional):
        dst: str
            A string denoting the destination folder where the file
            containing emissions totals will be written.
            Default value: "./benchmark"
        overwrite: bool
            Set this flag to True to overwrite files in the
            destination folder (specified by the dst argument).
            Default value: False
        areapath: str
            Path to a restart file containing surface area data.
            Default value: None
        spcdb_dir: str
            Path to the species_database.yml
            Default value: points to gcpy/gcpy folder
    """

make_benchmark_oh_metrics

Located in module: gcpy.benchmark.modules.oh_metrics

Generates a table of OH metrics (mean OH concentration, methyl chloroform lifetime, CH4 lifetime) from the GEOS-Chem Metrics diagnostic outputs.

def make_benchmark_oh_metrics(
        ref,
        refmet,
        refstr,
        dev,
        devmet,
        devstr,
        dst="./benchmark",
        overwrite=False,
):
    """
    Creates a text file containing metrics of global mean OH, MCF lifetime,
    and CH4 lifetime for benchmarking purposes.

    Args:
        ref: str
            Path name of "Ref" (aka "Reference") data set file.
        refmet: str
            Path name of ref meteorology data set.
        refstr: str
            A string to describe ref (e.g. version number)
        dev: str
            Path name of "Dev" (aka "Development") data set file.
            The "Dev" data set will be compared against the "Ref" data set.
        devmet: list of str
            Path name of dev meteorology data set.
        devstr: str
            A string to describe dev (e.g. version number)

    Keyword Args (optional):
        dst: str
            A string denoting the destination folder where the file
            containing emissions totals will be written.
            Default value: "./benchmark"
        overwrite: bool
            Set this flag to True to overwrite files in the
            destination folder (specified by the dst argument).
            Default value: False
    """

make_benchmark_operations_budget

Located in module: gcpy.benchmark.module.benchmark_funcs

Creates a table with the change in species mass after each GEOS-Chem operation, using diagnostic output from GEOS-Chem benchmark simulations.

def make_benchmark_operations_budget(
        refstr,
        reffiles,
        devstr,
        devfiles,
        ref_interval,
        dev_interval,
        benchmark_type=None,
        label=None,
        col_sections=["Full", "Trop", "PBL", "Strat"],
        operations=[
             "Chemistry", "Convection", "EmisDryDep",
             "Mixing", "Transport", "WetDep"
        ],
        compute_accum=True,
        compute_restart=False,
        require_overlap=False,
        dst='.',
        species=None,
        overwrite=True,
        verbose=False,
        spcdb_dir=os.path.dirname(__file__)
):
    """
    Prints the "operations budget" (i.e. change in mass after
    each operation) from a GEOS-Chem benchmark simulation.

    Args:
        refstr: str
            Labels denoting the "Ref" versions
        reffiles: list of str
            Lists of files to read from the "Ref" version.
        devstr: str
            Labels denoting the "Dev" versions
        devfiles: list of str
            Lists of files to read from "Dev" version.
        interval: float
            Number of seconds in the diagnostic interval.

    Keyword Args (optional):
        benchmark_type: str
            A string denoting the type of benchmark output to plot, options are
            FullChemBenchmark, TransportTracersBenchmark, or CH4Benchmark.
            Default value: None
        label: str
            Contains the date or date range for each dataframe title.
            Default value: None
        col_sections: list of str
            List of column sections to calculate global budgets for. May
            include Strat eventhough not calculated in GEOS-Chem, but Full
            and Trop must also be present to calculate Strat.
            Default value: ["Full", "Trop", "PBL", "Strat"]
        operations: list of str
            List of operations to calculate global budgets for. Accumulation
            should not be included. It will automatically be calculated if
            all GEOS-Chem budget operations are passed and optional arg
            compute_accum is True.
            Default value: ["Chemistry","Convection","EmisDryDep",
                            "Mixing","Transport","WetDep"]
        compute_accum: bool
            Optionally turn on/off accumulation calculation. If True, will
            only compute accumulation if all six GEOS-Chem operations budgets
            are computed. Otherwise a message will be printed warning that
            accumulation will not be calculated.
            Default value: True
        compute_accum: bool
            Optionally turn on/off accumulation calculation. If True, will
            only compute accumulation if all six GEOS-Chem operations budgets
            are computed. Otherwise a message will be printed warning that
            accumulation will not be calculated.
            Default value: True
        compute_restart: bool
            Optionally turn on/off calculation of mass change based on restart
            file. Only functional for "Full" column section.
            Default value: False
        require_overlap: bool
            Whether to calculate budgets for only species that are present in
            both Ref or Dev.
            Default value: False
        dst: str
            Directory where plots & tables will be created.
            Default value: '.' (directory in which function is called)
        species: list of str
            List of species for which budgets will be created.
            Default value: None (all species)
        overwrite: bool
            Denotes whether to overwrite existing budget file.
            Default value: True
        verbose: bool
            Set this switch to True if you wish to print out extra
            informational messages.
            Default value: False
    """

make_benchmark_ste_table

Located in module: gcpy.benchmark.modules.ste_flux

Generates a table with the stratosphere-troposphere flux of ozone from GEOS-Chem benchmark simulation output.

Note

This table is only available for GEOS-Chem Classic benchmarks.

def make_benchmark_ste_table(devstr, files, year,
                             dst='./1yr_benchmark',
                             bmk_type="FullChemBenchmark",
                             species=["O3"],
                             overwrite=True,
                             month=None):
    """
    Driver program.  Computes and prints strat-trop exchange for
    the selected species and benchmark year.

    Args:
        devstr: str
            Label denoting the "Dev" version.
        files: str
            List of files containing vertical fluxes.
        year: str
            Year of the benchmark simulation.

    Keyword Args (optional):
        dst: str
            Directory where plots & tables will be created.
        bmk_type: str
            FullChemBenchmark or TransportTracersBenchmark.
        species: list of str
            Species for which STE fluxes are desired.
        overwrite: bool
            Denotes whether to ovewrite existing budget tables.
        month: float
            If passed, specifies the month of a 1-month benchmark.
            Default: None (denotes a 1-year benchmark)
    """

transport_tracers_budgets

Located in module: gcpy.benchmark.modules.budget_tt

Generates a budget table for Rn222, Pb210, and Be7 species from 1-year TransportTracers benchmark output.

def transport_tracers_budgets(
        devstr,
        devdir,
        devrstdir,
        year,
        dst='./1yr_benchmark',
        is_gchp=False,
        gchp_res="c00",
        gchp_is_pre_14_0=False,
        overwrite=True,
        spcdb_dir=os.path.dirname(__file__)):
    """
    Main program to compute TransportTracersBenchmark budgets

    Args:
        maindir: str
            Top-level benchmark folder
        devstr: str
            Denotes the "Dev" benchmark version.
        year: int
            The year of the benchmark simulation (e.g. 2016).

    Keyword Args (optional):
        dst: str
            Directory where budget tables will be created.
            Default value: './1yr_benchmark'
        is_gchp: bool
            Denotes if data is from GCHP (True) or GCC (false).
            Default value: False
        gchp_res: str
            A string (e.g. "c24") denoting GCHP grid resolution.
            Default value: "c00".
        gchp_is_pre_14_0: bool
            Logical to indicate whether or not the GCHP data is prior
            to GCHP 14.0.0.  Needed for restart files only.
            Default value: False
        overwrite: bool
            Denotes whether to ovewrite existing budget tables.
            Default value: True
        spcdb_dir: str
            Directory where species_database.yml is stored.
            Default value: GCPy directory
    """