⇐ ⇒

[CF-metadata] Storing multiple NWP model runs in a NetCDF - CFfile [SEC=UNCLASSIFIED]

From: Timothy Hume <T.Hume>
Date: Fri, 9 Jan 2009 10:09:34 +1100

Hi Doug,

How do you handle requests for "all the data from a particular model run", or "all the 24 hour forecasts from all runs"? I notice that your reftime and leadtime variables have repeated values, which can make it more difficult to do such commonly requested extractions. This is why I chose to use two dimensions. For example, if someone requests all the 24->72 hour forecasts from all the model runs between 00Z 1 January 2009 and 00Z 5 January 2009 I use the NetCDF operator "ncks" like this:

ncks -d basetime,"2009-01-01 00:00:00".,"2009-01-05 00:00:00". -d forecast,24.,72. bigfile.nc subset.nc

Is there an easy way to do such extractions using your format?

Cheers,

Tim.

________________________________

From: cf-metadata-bounces at cgd.ucar.edu [mailto:cf-metadata-bounces at cgd.ucar.edu] On Behalf Of Douglas Schuster
Sent: Friday, 9 January 2009 00:41
To: Timothy Hume
Cc: John Caron; cf-metadata at cgd.ucar.edu
Subject: Re: [CF-metadata] Storing multiple NWP model runs in a NetCDF - CFfile [SEC=UNCLASSIFIED]


Hi Tim, John,

NCAR has been using a structure similar to option 2 for TIGGE based NetCDF files that is not
technically CF compliant, but was reviewed by colleagues at ECMWF, UK MetOffice, and BADC
(Dec 10, 2007, ECMWF workshop) and deemed acceptable. The structure is designed to handle multiple
ensemble forecast systems, and multiple init times with varying forecast periods.


i.e.

        int reftime(reftime) ;
                reftime:data_type = "int" ;
                reftime:units = "hours since 1950-01-01 00:00:00" ;
                reftime:standard_name = "forecast_reference_time" ;
                reftime:long_name = "Time of model initialization" ;
        int leadtime(reftime) ;
                leadtime:data_type = "int" ;
                leadtime:units = "hours" ;
                leadtime:standard_name = "forecast_period" ;
                leadtime:long_name = "Hours since forecast_reference_time" ;

The structure was based on
the proposal by Francisco J. Doblas-Reyes for the "Ensembles" project. A thread related to this proposal
can be found at:


http://mailman.cgd.ucar.edu/pipermail/cf-metadata/2007/001470.html

A sample file is available for download at:

http://dss.ucar.edu/download/doug/data.nc

The metadata "CDL" dump of the file can be accessed at:

http://dss.ucar.edu/download/doug/data.dump

Doug


On Jan 7, 2009, at 7:19 PM, Timothy Hume wrote:


        Hi John,
        
        This seems a reasonable proposal to me. Often when storing NWP forecasts option (2) will suffice. Most centres will always include the same forecast lead times each time new NWP model data are disseminated (for example some centres will send out +6, +12, +18, +24, +36, +42, +48, +60 and +72 hour forecasts for two model runs per day, ever day). Data users will then often want to extract particular forecast lead times (e.g. +48 hours). Option (2) makes this a trivial task.
        
        On the other hand option (1) is also useful, even when the time offsets are fixed. For example, one can construct an auxilliary coordinate variable called valid_time which is two dimensional. This aids in the cases when the user wants to extract all forecasts which are valid at a particular time. In fact, using both options (1) and (2) often yields the most useful files.
        
        Tim.
        
        
        -----Original Message-----
        From: John Caron [mailto:caron at unidata.ucar.edu]
        Sent: Thursday, 8 January 2009 12:56
        To: Karl Taylor
        Cc: Timothy Hume; cf-metadata at cgd.ucar.edu
        Subject: Re: [CF-metadata] Storing multiple NWP model runs in a NetCDF - CF file [SEC=UNCLASSIFIED]
        
        Hi Karl, Tim:
        
        I think the original wording doesnt fully anticipate multiple datetime coordinates:
        
        Section 4.4:
        " A time coordinate is identifiable from its units string alone. The Udunits routines utScan() and utIsTime() can be used to make this determination. Optionally, the time coordinate may be indicated additionally by providing the standard_name attribute with an appropriate value, and/or the axis attribute with the value T. "
        
        In Tim's example, his "forecast/valid time" is actually an offset from the base time, in units of time, not a udunit datetime. This allows it to be a 1D coordinate variable. But in the general case, if the forecast time spacing depends on the basetime, it must be 2D. We see this a lot in NCEP model output.
        
        In general, when storing multiple forecast model runs in the same file there are two options:
        
        1. A 1D "basetime/runtime" coordinate variable identified by standard name "forecast_reference_time" which holds udunit datetimes, and a 2D "forecast/valid time" auxiliary coordinate variable with standard name "time" which also holds udunit datetimes. It must be dimensioned by the runtime and the time dimensions. Data variables will need to use the "coordinates" attribute to reference the time auxiliary coordinate, as usual.
        
        2. A 1D "basetime/runtime" coordinate variable identified by standard name "forecast_reference_time" which holds udunit dates, and a 1D or 2D "forecast offset time" auxiliary coordinate variable identified by standard name "forecast_period" which holds a udunit time unit (eg hours), which is added to the basetime to get the forecast datetime. If 1D, it will be dimensioned by time, if 2d it will be dimensioned by runtime and time, and must be referenced by the "coordinates" attribute
        
        As an aside, the utIsTime() function is ambiguous as to whether we have a time unit or a datetime unit (which udunits calls "having an offset"). I think we need to carefully distinguish time and datetime, and specify where each is allowed.
        
        If this seems reasonable, I can write up a proposal.
        
        
        Karl Taylor wrote:
        

                Hi Tim,
                


                Section 4.4 (just before section 4.1) states "The methods of identifying
                

                coordinate types described in this section apply both to coordinate
                

                variables and to auxiliary coordinate variables named by the coordinates
                

                attribute." Since the axis attribute is one of the methods used to
                

                identify a vertical coordinate, it would seem to be allowed for the
                

                level in your file.
                


                On the other hand, in the 4th paragraph of section 5.7, it says that
                

                "The axis attribute is not allowed for auxiliary coordinate variables".
                

                This seems to contradict the earlier statement.
                


                We need to revise the document to be internally consistent. Does anyone
                

                recall why we would want to prohibit the use of the axis attribute for
                

                auxiliary coordinate variables?
                


                cheers,
                

                Karl
                


                Timothy Hume wrote:
                

                        Hi Karl,
                        


                        I have just checked one of my files, and get two errors:
                        


                        The first error is:
                        


                        ------------------
                        

                        Checking variable: forecast
                        

                        ------------------
                        

                        ERROR (4.4): Invalid units and/or reference time
                        


                        This is the error I am aware of. My file should become compliant if I
                        

                        switch the "T" axis to basetime.
                        



                        The second error is:
                        


                        ------------------
                        

                        Checking variable: level
                        

                        ------------------
                        

                        ERROR (4): Axis attribute is not allowed for auxillary coordinate
                        

                        variables.
                        


                        I was unaware that the axis attribute should not be used for scalar
                        

                        coordinate variables (as describe in Section 5.7). Is this intended?
                        

                        In any case, I don't think this small error should cause the NetCDF
                        

                        viewers I tried (IDV and Joe Sirott's web application) to not work.
                        


                        Cheers,
                        


                        Tim.
                        


                        -----Original Message-----
                        

                        From: Karl Taylor [mailto:taylor13 at llnl.gov] Sent: Thursday, 8 January
                        

                        2009 11:03
                        

                        To: Timothy Hume
                        

                        Cc: cf-metadata at cgd.ucar.edu
                        

                        Subject: Re: [CF-metadata] Storing multiple NWP model runs in a NetCDF
                        

                        - CF file [SEC=UNCLASSIFIED]
                        


                        Hi Tim,
                        


                        Other than the axis attribute for time, I didn't see any issues. You
                        

                        might run the CF compliance checker on the file (http://
                        

                        cf-pcmdi.llnl.gov/conformance), but it might not run if the other
                        

                        utilities stumbled. Maybe someone else has some ideas.
                        


                        cheers,
                        

                        Karl
                        



                        Timothy Hume wrote:
                        

                                Hi,
                                


                                I am writing NetCDF files which hold surface fields (2m temperature
                                

                                etc) from multiple NWP model runs (all the runs from the same model
                                

                                for the last month or so). The files are for operational use, so I
                                

                                want them to strictly follow the CF conventions. I am running into a
                                

                                couple of problems, where the conventions don't seem to be ideally
                                

                                suited for storing more than a single model run.
                                


                                Here is what I do:
                                


                                I have four dimensions and associated coordinate variables:
                                


                                basetime: The base time for the model run (units: seconds since
                                

                                1970-01-01 00:00:00.0 +0:00)
                                

                                forecast: The forecast lead time, relative to the basetime (units:
                                

                                hours)
                                

                                latitude
                                

                                longitude
                                


                                There is no need for a vertical dimension, because I am using surface
                                

                                fields. Never-the-less I make use of a scalar vertical coordinate
                                

                                variable as described in Section 5.7 of the CF-1.3 metadata
                                

                                conventions document.
                                


                                The use of two dimensions to store the time information (basetime and
                                

                                forecast) seems to be a natural way to store multiple NWP model runs,
                                

                                and is the standard way used in the very old NUWG conventions. As far
                                

                                as I can tell, what I am doing is supported by the CF conventions,
                                

                                provided the time axis is taken to be the basetime dimension. This is
                                

                                because Section 4.4 of the conventions specifies the units of the
                                

                                time coordinate must include a reference time. Obviously the units of
                                

                                the forecast coordinate cannot include a reference time, because the
                                

                                reference time varies, and is determined by the value of the basetime
                                

                                coordinate.
                                


                                The difficulty I am encountering is that some applications which read
                                

                                NetCDF/CF files (such as the Unidata IDV and Joe Sirott's very nice
                                

                                web based NetCDF data viewer) seem to choke on my data. I suspect
                                

                                (but am not 100% certain in the case of the IDV) that the reason is
                                

                                because of the way I handle the time information in my files.
                                


                                To illustrate in more detail what my files look like, I am attaching
                                

                                the CDL from an example file. The CDL is non-CF compliant, because I
                                

                                have specified the "T" axis (via the axis variable attribute) to be
                                

                                the forecast coordinate, and the forecast coordinate has invalid
                                

                                units (no reference time). I am planning on switching the "T" axis to
                                

                                the base time coordinate, which as far as I can determine should make
                                

                                the file CF compliant.
                                


                                My question is: is there a better (or more standard) way of storing
                                

                                multiple NWP model runs in a single file than what I am doing?
                                

                                Cheers,
                                


                                Tim Hume
                                

                                Centre for Australian Weather and Climate Research
                                

                                Australian Bureau of Meteorology
                                

                                Melbourne
                                

                                Australia
                                


                                --- Example CDL follows ---
                                


                                netcdf gasp_1p0deg_ocf_t2m_rtdb_opn {
                                

                                dimensions:
                                

                                   forecast = 69 ;
                                

                                   basetime = UNLIMITED ; // (100 currently)
                                

                                   latitude = 96 ;
                                

                                   longitude = 121 ;
                                

                                   bounds = 2 ;
                                

                                variables:
                                

                                   double forecast(forecast) ;
                                

                                       forecast:long_name = "Time of model forecast, relative to the
                                

                                basetime" ;
                                

                                       forecast:units = "hours" ;
                                

                                       forecast:standard_name = "forecast_period" ;
                                

                                       forecast:axis = "T" ;
                                

                                       forecast:bounds = "forecast_bounds" ;
                                

                                   double forecast_bounds(forecast, bounds) ;
                                

                                       forecast_bounds:long_name = "forecast interval" ;
                                

                                       forecast_bounds:units = "hours" ;
                                

                                   int basetime(basetime) ;
                                

                                       basetime:long_name = "Model basetime" ;
                                

                                       basetime:units = "seconds since 1970-01-01 00:00:00.0 +0:00" ;
                                

                                       basetime:calendar = "gregorian" ;
                                

                                       basetime:standard_name = "forecast_reference_time" ;
                                

                                   double latitude(latitude) ;
                                

                                       latitude:long_name = "latitude" ;
                                

                                       latitude:units = "degrees_north" ;
                                

                                       latitude:bounds = "latitude_bounds" ;
                                

                                       latitude:valid_min = -90. ;
                                

                                       latitude:valid_max = 90. ;
                                

                                       latitude:standard_name = "latitude" ;
                                

                                       latitude:axis = "Y" ;
                                

                                   double latitude_bounds(latitude, bounds) ;
                                

                                       latitude_bounds:long_name = "grid cell latitude boundaries" ;
                                

                                       latitude_bounds:units = "degrees_north" ;
                                

                                       latitude_bounds:valid_min = -90. ;
                                

                                       latitude_bounds:valid_max = 90. ;
                                

                                   double longitude(longitude) ;
                                

                                       longitude:long_name = "longitude" ;
                                

                                       longitude:units = "degrees_east" ;
                                

                                       longitude:bounds = "longitude_bounds" ;
                                

                                       longitude:valid_min = -360. ;
                                

                                       longitude:valid_max = 360. ;
                                

                                       longitude:standard_name = "longitude" ;
                                

                                       longitude:axis = "X" ;
                                

                                   double longitude_bounds(longitude, bounds) ;
                                

                                       longitude_bounds:long_name = "grid cell longitude boundaries" ;
                                

                                       longitude_bounds:units = "degrees_east" ;
                                

                                       longitude_bounds:valid_min = -360. ;
                                

                                       longitude_bounds:valid_max = 360. ;
                                

                                   float temperature_2m(basetime, forecast, latitude, longitude) ;
                                

                                       temperature_2m:long_name = "Air temperature 2m above the
                                

                                surface" ;
                                

                                       temperature_2m:units = "K" ;
                                

                                       temperature_2m:_FillValue = 9.96921e+36f ;
                                

                                       temperature_2m:missing_value = 9.96921e+36f ;
                                

                                       temperature_2m:valid_min = 180.f ;
                                

                                       temperature_2m:valid_max = 330.f ;
                                

                                       temperature_2m:standard_name = "air_temperature" ;
                                

                                       temperature_2m:cell_methods = "lat: lon: mean (area weighted)" ;
                                

                                       temperature_2m:coordinates = "level" ;
                                

                                   double level ;
                                

                                       level:long_name = "Height above the surface" ;
                                

                                       level:units = "m" ;
                                

                                       level:positive = "up" ;
                                

                                       level:standard_name = "height" ;
                                

                                       level:axis = "Z" ;
                                


                                // global attributes:
                                

                                       :Conventions = "CF-1.3" ;
                                

                                       :history = "File created by the Gridded OCF data ingest
                                

                                system" ;
                                

                                       :institution = "Australian Bureau of Meteorology" ;
                                

                                       :source = "model" ;
                                

                                       :title = "GASP forecasts of temperature_2m; resolution: 1.0
                                

                                degree; source: rtdb" ;
                                

                                       :topography = "MSAS" ;
                                

                                }
                                

                                _______________________________________________
                                

                                CF-metadata mailing list
                                

                                CF-metadata at cgd.ucar.edu
                                

                                http:// mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata
                                







                _______________________________________________
                

                CF-metadata mailing list
                

                CF-metadata at cgd.ucar.edu
                

                http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata
                

        _______________________________________________
        CF-metadata mailing list
        CF-metadata at cgd.ucar.edu
        http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata
        
Received on Thu Jan 08 2009 - 16:09:34 GMT

This archive was generated by hypermail 2.3.0 : Tue Sep 13 2022 - 23:02:40 BST

⇐ ⇒