Table of Contents

9.1. Features and feature types
9.2. Collections, instances and elements
9.3. Representations of collections of features in data variables
9.3.1. Orthogonal multidimensional array representation
9.3.2.  Incomplete multidimensional array representation
9.3.3.  Contiguous ragged array representation
9.3.4. Indexed ragged array representation
9.4. The featureType  attribute
9.5. Coordinates and metadata
9.6. Missing Data

Note: For greater readability, the conventional highlighting for new material. has not been applied to the whole chapter.

This chapter provides representations for discrete sampling geometries, such as time series, vertical profiles and trajectories. Discrete sampling geometry datasets are characterized by a dimensionality that is lower than that of the space-time region that is sampled; discrete sampling geometries are typically “paths” through space-time.  

9.1. Features and feature types

Each type of discrete sampling geometry (point, time series, profile or trajectory) is defined by the relationships among its spatiotemporal coordinates.  We refer to the type of discrete sampling geometry as its featureType.  The term “feature” refers herein to a single instance of the discrete sampling geometry (such as a single time series).  The representation of such features in a CF dataset was supported previous to the introduction of this chapter using a particular convention, which is still supported (that described by section 9.3.1).  This chapter describes further conventions which offer advantages of efficiency and clarity for storing a collection of features in a single file.  When using these new conventions, the features contained within a collection must always be of the same type; and all the collections in a CF file must be of the same feature type. (Future versions of CF may allow mixing of multiple feature types within a file.)  Table 9.1 presents the feature types covered by this chapter.

featureType

Description of a single feature with this discrete sampling geometry

 

Form of a data variable containing values defined on a collection of these features

Mandatory space-time coordinates for a collection of these features                                               

point

a single data point (having no implied coordinate relationship to other points)

 

      data(i)

x(i) y(i)  t(i)

timeSeries

a series of data points at the same spatial location with monotonically increasing times

 

     data(i,o)

x(i) y(i) t(i,o)

trajectory

a series of data points along a path through space with monotonically increasing times

 

       data(i,o)   

x(i,o) y(i,o) t(i,o)

profile

an ordered set of data points along a vertical line at a fixed horizontal position and fixed time

 

       data(i,o)   

x(i) y(i) z(i,o) t(i)

timeSeriesProfile

a series of profile features at the same horizontal position with monotonically increasing times

 

       data(i,p,o)     

x(i) y(i) z(i,p,o) t(i,p)

trajectoryProfile

a series of profile features located at points ordered along a trajectory

 

       data(i,p,o)         

x(i,p) y(i,p) z(i,p,o) t(i,p)

Table 9.1. Logical structure and mandatory coordinates for discrete sampling geometry featureTypes.

In Table 9.1 the spatial coordinates x and y typically refer to longitude and latitude but other horizontal coordinates could also be used (see sections 4 and 5.6).   The spatial coordinate z refers to vertical position.  The time coordinate is indicated as t.  The space-time coordinates that are indicated for each feature are mandatory.  However a featureType may also include other space-time coordinates which are not mandatory (notably the z coordinate).  The array subscripts that are shown illustrate only the logical structure of the data.  The subscripts found in actual CF files are determined by the specific type of representations (see section 9.3).

The designation of dimensions as mandatory precludes the encoding of data variables where geo-positioning cannot be described as a discrete point location.  Problematic examples include:  

  • time series that refer to a geographical region (e.g. the northern hemisphere), a volume (e.g. the troposphere), or a geophysical quantity in which geolocation information is inherent (e.g. the Southern Oscillation Index (SOI) is the difference between values at two point locations);

  • vertical profiles that similarly represent geographically area-averaged values;  and

  • paths in space that indicate a geographically located feature, but lack a suitable time coordinate (e.g. a meteorological front).

Future versions of CF will generalize the concepts of geolocation to encompass these cases.  As of CF version 1.6 such data can be stored using the representations that are documented here by two means: 1) by utilizing the orthogonal multidimensional array representation and omitting the featureType attribute; or 2) by assigning arbitrary coordinates to the mandatory dimensions.  For example a globally-averaged latitude position (90s to 90n) could be represented arbitrarily (and poorly) as a latitude position at the equator.