
[FEATURE]: Essential diagnostic variables #57

Closed · jthielen opened this issue Mar 6, 2022 · 3 comments · Fixed by #59
Labels: enhancement (New feature or request)
Milestone: v0.0.1

jthielen (Collaborator) commented Mar 6, 2022

Description

A key feature that was formerly part of #14, but was stripped out while we were still working out #20, was calculating the key diagnostic variables needed to use WRF output in most analysis workflows:

  • 'T' converts to potential temperature by adding a magic-number offset of 300 K
  • 'P' and 'PB' combine to form pressure
  • 'PH' and 'PHB' combine to form geopotential
  • Geopotential (see previous) converts to geopotential height using a particular value of g (9.81 m s**-2) that may not match the value used elsewhere

Implementation

This was implemented previously in #14 as

def calc_base_diagnostics(dataset, drop=True):
    """Calculate the four basic fields that WRF does not have in physically meaningful form.

    Parameters
    ----------
    dataset : xarray.Dataset
        Dataset representing WRF data opened via normal backend, with chunking.
    drop : bool
        Decide whether to drop the components of origin after creating the diagnostic fields
        from them.

    Notes
    -----
    This operation should be called before destaggering.
    """
    # Potential temperature
    dataset['air_potential_temperature'] = dataset['T'] + 300
    dataset['air_potential_temperature'].attrs = {
        'units': 'K',
        'standard_name': 'air_potential_temperature'
    }
    if drop:
        del dataset['T']

    # Pressure
    dataset['air_pressure'] = dataset['P'] + dataset['PB']
    dataset['air_pressure'].attrs = {
        'units': dataset['P'].attrs.get('units', 'Pa'),
        'standard_name': 'air_pressure'
    }
    if drop:
        del dataset['P'], dataset['PB']

    # Geopotential and geopotential height
    dataset['geopotential'] = dataset['PH'] + dataset['PHB']
    dataset['geopotential'].attrs = {
        'units': 'm**2 s**-2',
        'standard_name': 'geopotential'
    }
    dataset['geopotential_height'] = dataset['geopotential'] / 9.81
    dataset['geopotential_height'].attrs = {
        'units': 'm',
        'standard_name': 'geopotential_height'
    }
    if drop:
        del dataset['PH'], dataset['PHB']

    return dataset
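
For context, a minimal usage sketch (the wrfout filename and chunk choice here are hypothetical):

import xarray as xr

# Hypothetical filename/chunking; opening with `chunks` gives Dask-backed
# variables, so the arithmetic in calc_base_diagnostics builds lazy task graphs.
ds = xr.open_dataset('wrfout_d01_2022-03-06_00:00:00', chunks={'Time': 1})
ds = calc_base_diagnostics(ds, drop=True)
ds['air_potential_temperature']  # still lazy until .compute()/.values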

Tests

These are pretty straightforward calculations, so creating tests using our existing "raw wrfout" (i.e., not the "dummy" or "geo_em" files) should also be straightforward.
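
For instance, something along these lines (the raw_wrfout fixture name is a placeholder, not an actual fixture in the test suite):

import numpy as np

def test_calc_base_diagnostics(raw_wrfout):
    """Check the derived fields against the raw components (hypothetical fixture)."""
    ds = calc_base_diagnostics(raw_wrfout.copy(), drop=False)
    np.testing.assert_allclose(
        ds['air_potential_temperature'].values, raw_wrfout['T'].values + 300
    )
    np.testing.assert_allclose(
        ds['air_pressure'].values, (raw_wrfout['P'] + raw_wrfout['PB']).values
    )
    np.testing.assert_allclose(
        ds['geopotential_height'].values,
        (raw_wrfout['PH'] + raw_wrfout['PHB']).values / 9.81
    )
    assert ds['air_potential_temperature'].attrs['units'] == 'K'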

Questions

  • How should we handle lazy loading? With the straightforward implementation above, it should "just work" with Dask for delayed evaluation, but will otherwise evaluate eagerly. Is this okay?
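
To illustrate the lazy-vs-eager distinction (a sketch; the filename is hypothetical):

import xarray as xr

# With `chunks`, variables are Dask arrays: `T + 300` just records a task
# graph, and nothing is computed until .compute() or .values is requested.
lazy = xr.open_dataset('wrfout_d01', chunks={'Time': 1})
theta = lazy['T'] + 300       # no computation yet

# Without `chunks`, the same arithmetic loads the data and evaluates eagerly.
eager = xr.open_dataset('wrfout_d01')
theta_now = eager['T'] + 300  # computed in memory immediately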
@jthielen jthielen added the enhancement New feature or request label Mar 6, 2022
@jthielen jthielen added this to the v0.0.1 milestone Mar 6, 2022
@jthielen jthielen mentioned this issue Mar 6, 2022
andersy005 (Member) commented
Will this calculation be part of the xwrf.postprocess() or will it be a standalone function to be invoked by users explicitly?

kmpaul (Contributor) commented Mar 7, 2022

  • How should we handle lazy loading? With the straightforward implementation above, it should "just work" with Dask for delayed evaluation, but will otherwise evaluate eagerly. Is this okay?

I think that would be expected behavior for something like this. I like the idea of the diagnostic variables "being ready for access" (i.e., already set with a Python ds['variable'] = ... statement), and initializing the dataset with Dask makes it lazy-loading by default. I think that's the right approach.

jthielen (Collaborator, Author) commented Mar 7, 2022

Will this calculation be part of the xwrf.postprocess() or will it be a standalone function to be invoked by users explicitly?

Yes, since I'd view it as needed for most workflows (and without it, users could be misled by what is there). Still, it'd be easy to make it conditional, like decode_times.
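
A rough sketch of how that opt-out could look (the diagnostics keyword is hypothetical, named by analogy with decode_times):

def postprocess(dataset, diagnostics=True):
    """Hypothetical signature: apply xwrf post-processing, optionally
    computing the diagnostic fields."""
    # ... earlier post-processing steps (e.g., decoding times) ...
    if diagnostics:
        # Per the Notes above, this must run before any destaggering.
        dataset = calc_base_diagnostics(dataset)
    # ... later steps (e.g., destaggering) ...
    return dataset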

I think that would be expected behavior for something like this. I like the idea of the diagnostic variables "being ready for access" (i.e., already set with a Python ds['variable'] = ... statement), and initializing the dataset with Dask makes it lazy-loading by default. I think that's the right approach.

Sounds good, that's enough confirmation for me to make a PR based on this approach!
