⇐ ⇒

[CF-metadata] [CF Metadata] #107: CF Data Model 1.7

From: Little, Chris <chris.little>
Date: Wed, 16 Apr 2014 12:14:44 +0000

Dear CF community,

I would like to add some more considerations to Mark Hedley and Jim Biard's detailed and well founded technical comments over the debate about some fundamental concepts of abstract data models for geospatial interoperability. I apologise if I am teaching anyone to suck eggs etc, but it does help to get my thoughts in order.

The UK Met Office, and other national met services and institutions, such as NCAR, NWS, BoM, DWD and M?t?o-France, have put significant efforts over the last five years, at the level of the equivalent of several full time staff per institution per year to make meteorological data and services, for both weather forecasting and climate prediction, more readily accessible to a much wider, and richer, world of geospatial services, both now and for decades into the future.

Significant effort has gone into making meteorological and aviation data have a common, shared conceptual model to ensure interoperability well into the future. A simple version of such a model has existed in peoples' heads for decades, but many modern software tools (which many meteorologists tend not to use yet) are predicated on a formal version of a conceptual model, whether expressed in UML, XML or other languages.

Discovery and use of data and services are also predicated on search engines and similar mechanisms and these are increasingly relying on underpinning conceptual models and their semantics to ensure good results.

Such formal models tend to have more layers of abstraction than na?ve users expect. This gives 'loose coupling' enabling flexible software, and also tight semantics, giving clear meaning. The models also tend to be richer than expected because they are encompassing the full domain, of which only a subset may be implemented at any one time. Providing the conceptual components are stable, this gives some measure of future proofing too.

Processing such conceptual models is not particularly resource intensive, as, for example, most modern phones contain such a software kernel to handle internet activities.

To summarise, I do not see why a conceptual model should not have two concepts that are different, and modular, even if implementations of current software conflate the two for reasons of history or efficiency. In the future, software implementers will still have the opportunity to conflate concepts for the sake of efficiency. This is a well recognised pattern in IT, e.g. 3rd Normal Form and denormalised databases, XML and lots of tools, shared application memory, etc.

Another IT pattern, that I would like to break from, is that of embedding backward compatibility for too long, distorting future software and imposing undue costs of maintenance and complexity.

Chris

-----Original Message-----
From: owner-cf-metadata at lists.llnl.gov [mailto:owner-cf-metadata at lists.llnl.gov] On Behalf Of Mark Hedley
Sent: Tuesday, April 15, 2014 4:51 PM
Subject: Re: [CF Metadata] #107: CF Data Model 1.7

This message came from the CF Trac system. Do not reply. Instead, enter your comments in the CF Trac system at http://kitt.llnl.gov/trac/.

#107: CF Data Model 1.7
-----------------------------+------------------------------
  Reporter: markh | Owner: cf-conventions_at_?
      Type: task | Status: new
  Priority: medium | Milestone:
 Component: cf-conventions | Version:
Resolution: | Keywords:
-----------------------------+------------------------------
\
\
\
\
\
\

Comment (by markh):

 Replying to [comment:79 jonathan]:
 Hello Jonathan

> Both `grid_mapping` and `formula_terms` can be described logically by the [http://kitt.llnl.gov/trac/ticket/107?replyto=55#comment:55 text which David posted]. For instance,

 I can see that this can be done, the important question for me is should it be done?

 Where are the benefits and where are the costs?

 I do not perceive the benefits of this conflation to form a new `geolocation` type, based on the comments to date.

 This conflation is quite a jump from the current CF conventions terminology, which concerns me; I think it is difficult to explain.

> What is to be gained in clarity or simplicity by regarding these two things as distinct concepts? We don't talk about transforms or coordinate reference systems because these terms have technical and sometimes restrictive meanings, and therefore may make the description less clear.

 A particular benefit I see in having two separate concepts in the CF data model is interoperability with other domains and concepts.

 If a concept in the CF data model cleanly and clearly relates to a concept in another domain then this is a big supporting factor for interoperability.

 I think this is the case here: the georeferencing capability provided by a coordinate reference system type of thing in CF is very closely aligned to the ISO19111 definition of a coordinate reference system. This brings huge benefits in providing interoperability with other ISO aware communities, such as the OGC, as highlighted by [comment:80 edavis]. I think this is the approach Stefano has taken with the OGC to date. The documents he has written link grid_mapping variables to OGC CRS instances.

 The derivation of coordinate values, perhaps for georeferencing purposes, based on bespoke defined algorithms and reference data sets is a much more specialised field, which is not widely used in the ISO and OGC communities (for example).

 Conflating the well known georeferencing function of coordinate reference systems with these derived coordinate functionality in a single data model type is making it very hard for that type to be understood and used effectively by other communities.

 It also makes it much harder for CF to adopt useful building blocks for other communities, as the interfaces are all in different places.

 So, the cost of the approach of conflating these concepts is it isolates the CF community from other communities, just when we and they stand to benefit so much from better interoperability.

 The benefit has to significantly outweigh this cost, and I am afraid I cannot see this.

 With this in mind I strongly favour the separation of these concepts within the data model. Whilst they can be logically conflated, I think the cost is prohibitive and the benefit minimal.

 all the best
 mark
\
\
\

--
Ticket URL: <http://kitt.llnl.gov/trac/ticket/107#comment:82>
CF Metadata <http://cf-convention.github.io/> CF Metadata This message came from the CF Trac system.  To unsubscribe, without unsubscribing to the regular cf-metadata list, send a message to "majordomo at lists.llnl.gov" with "unsubscribe cf-metadata" in the body of your message.
Received on Wed Apr 16 2014 - 06:14:44 BST

This archive was generated by hypermail 2.3.0 : Tue Sep 13 2022 - 23:02:41 BST

⇐ ⇒