⇐ ⇒

[CF-metadata] Proposal for better handling vector quantities in CF (and the role of libCF)

From: Thomas Lavergne <t.lavergne>
Date: Thu, 15 Dec 2011 01:18:34 +0000 (UTC)

Dear all,

Sorry for not posting earlier, I was away from my e-mail. I am glad the discussion is lively around this issue.

It seems we could go for 4 types of approaches: the umbrella variable I introduced, the addition of a vector dimension (John), the heavy use of auxiliary_variables and the introduction of "groups".

It seems obvious "groups" (in the sense of HDF5, netCDF-4 or, more generally, CDMv4) would be the most elegant solution. It could solve the problem at hand (vectors), but would open for much more (like what Seth was asking for). But introducing groups is not just a matter of introducing a new section in CF and I am not sure being able to better handle vectors is a strong enough driver for such a change.

I will be away from my emails and these topics for about a month. I plan to create a trac account and submit a proposal (with the umbrella) in January, and we can then discuss in more details what this all implies.

On a different (but related) topic: once we have created a new standard_name for a vector umbrella variable (for example "wind_vector" or "sea_ice_displacement_vector"), then I would advocate that the standard names for the components are built using standard name modifiers like "wind_vector magnitude", "wind_vector direction", "sea_ice_displacement_vector x", etc... I know standard names already exist for these components but they are generally not built consistently between, e.g., winds and sea currents. The current standard names could be aliased to the new constructs using standard name modifiers.

But this is a different topic. I'll wrap-up things in January when I get back.

Cheers,
Thomas

----- Original Message -----
> >I agree with your reasoning. It is worth considering the use of
> >Groups, but
> >the approach should be weighed against the best proposals that can be
> >generated that stick to the classic model. Fundamentally the need is
> >for 2
> >bit of semantics:
> >
> > 1. associate components together so they form a conceptual N-vector
> > object
> > 2. associate metadata with the N-vector object
>
> Having these two bits of semantics would be really valuable because it
> also provides a generic method for handling tabular data within
> netcdf,
> which is something I think we need.
>
> If you want to represent a set of events (e.g., earthquakes) in
> netcdf, I
> believe the best way to do it is to have a bunch of variables (e.g.,
> time,
> lat, lon, magnitude) with a common "ID" dimension. Having a common
> dimension links them together implicitly, but there's currently no
> good
> way to say explicitly "these variables are different features of this
> set
> of entities" or to provide metadata about the collection, like "this
> is a
> catalog of Japanese earthquakes of magnitude 5.0 and greater".
>
> Something simple along the lines of the coordinates attribute would
> probably suffice. I'm not sure whether it would be better to attach it
> to the ID dimension, and use that as the holding point for metadata
> associated with the table, or whether it would be preferable to simply
> have a dummy variable, as is used for map_projection metadata.
>
> (I had more conclusions about some of these issues, but lost the
> document where I collected all my thoughts in a hard-drive crash...)
>
> This would also provide a stepping-stone to better handling of spatial
> categorical data. Currently, CF says to use flag_values and
> flag_meanings for categorical data, but that gets really awkward and
> human-unfriendly if you have more than a handful of categories. It
> would be much more elegant to have the meanings of the numeric
> values defined by reference to a table. (This also solves an issue I
> posted about a while back: we could have one set of standard
> names for use with controlled vocabularies like the Area Type and
> Region Names tables, and another for user-provided categorizations,
> which would then be enumerated in the same file.)
>
> Cheers,
>
> --Seth
> _______________________________________________
> CF-metadata mailing list
> CF-metadata at cgd.ucar.edu
> http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata

-- 
========================================== 
Thomas Lavergne 
Norwegian Meteorological Institute 
P.O.BOX 43, Blindern, NO-0313 OSLO, Norway 
Phone: (+47) 22963364  Fax: (+47) 22963380 
Email: t.lavergne at met.no 
OSISAF HL Portal:     http://osisaf.met.no 
========================================== 
Received on Wed Dec 14 2011 - 18:18:34 GMT

This archive was generated by hypermail 2.3.0 : Tue Sep 13 2022 - 23:02:41 BST

⇐ ⇒