BBC2 SUPPORTED DATA FORMATS


The data format for the BBC2 campaign will be similar to the format used for the previous BBC campaign. See the 4D-CLOUDS format page for more information.

However, we encourage the use of the netCDF format for any data, and especially for large-volume data such as data from radars and lidars. Data stored as ASCII and netCDF files should be compressed with gzip. The CLIWA-NET data format is described below in two sections, one on the file names and one on the file format. Data providers are encouraged to provide quicklooks together with the data. Quicklooks may be provided in any common graphics format such as gif, jpg or png. If you provide gif-formatted image files, please use (preferably) only compressed gif files. Quicklooks to be displayed on the results page of this website should preferably be limited to 100 kB.

We have not yet defined the format of the netCDF files, but we try to stay as close as possible to the CloudNET format, which is mainly intended for radar and lidar data. For other data types you can use the Climate and Forecast (CF) metadata convention.

File naming standard

The proposed filename convention is: SS_IIIIIIII_YYMMDDNN.EXT
where:
SS=station/aircraft code (see table 1)
I..I=instrument, variable or model (see table 2 for an alphabetical list)
YY=year
MM=month
DD=day
NN=hour (UTC), number or alphanumerical code (NN may be extended to more than 2 characters)
EXT=extension: "DAT" for CLIWA-NET formatted files, "NC" for netCDF formatted files

An example: "CA_MRADMICC_03050100.DAT" contains data from the MICCY microwave radiometer at Cabauw on May 1, 2003, starting at 0 hours UTC.
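The naming convention can be checked mechanically. The following Python sketch is only an illustration and not part of the standard: the function name parse_filename is made up, and it assumes that station and instrument codes contain only letters and digits (no underscores), that two-digit years fall in 2000-2099, and that any ".gz" suffix has already been removed.

```python
import re
from datetime import date

# Pattern for SS_I..I_YYMMDDNN.EXT.
# Assumption: codes consist of letters and digits only, NN has 2 or more characters.
FILENAME_RE = re.compile(
    r"^(?P<station>[A-Za-z0-9]{2})_"
    r"(?P<instrument>[A-Za-z0-9]+)_"
    r"(?P<yy>\d{2})(?P<mm>\d{2})(?P<dd>\d{2})"
    r"(?P<nn>[A-Za-z0-9]{2,})"
    r"\.(?P<ext>DAT|NC)$",
    re.IGNORECASE,
)


def parse_filename(name):
    """Split a file name into its parts; raise ValueError if it does not match."""
    match = FILENAME_RE.match(name)
    if match is None:
        raise ValueError(f"not a valid file name: {name!r}")
    # Assumption: two-digit years belong to 2000-2099.
    day = date(2000 + int(match["yy"]), int(match["mm"]), int(match["dd"]))
    return {
        "station": match["station"],
        "instrument": match["instrument"],
        "date": day,
        "nn": match["nn"],
        "ext": match["ext"].upper(),
    }


info = parse_filename("CA_MRADMICC_03050100.DAT")
print(info)
```

The NN part is returned as a string because it may be an hour, a number or an alphanumerical code.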

If you want to propose a new variable/abbreviation for this list, please send an email to Victor Venema and Henk Klein Baltink, so that all codes used in the BBC-campaign will be listed on the Bonn BBC2 pages and here.

File format standard

A data file consists of three parts: a header, an optional line with the y-axis or variables, and a data segment. The header gives basic information on the variables and some comments. The line with the y-axis or variables is included to make processing in some software packages easier. After the header comes the data section. The measurements are stored one row per time step. Columns contain time (in decimal hours), variable 1, variable 2, ...
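As an illustration of the row layout, the Python sketch below turns one data row into a UTC time stamp plus a list of values. It is not part of the standard. It assumes that the time column counts hours from 00:00 UTC of the day the file starts (the standard only says "decimal hours"), that columns are separated by tabs, and the row itself is made up; parse_data_row is a hypothetical helper name.

```python
from datetime import datetime, timedelta, timezone


def parse_data_row(line, day_start):
    """Split one tab-separated data row into a UTC time stamp and its values."""
    fields = [float(item) for item in line.split("\t")]
    # Assumption: the first column counts hours from 00:00 UTC of day_start.
    stamp = day_start + timedelta(hours=fields[0])
    return stamp, fields[1:]


# Made-up row: time 13.5 h, followed by two variable columns.
day_start = datetime(2003, 5, 1, tzinfo=timezone.utc)
stamp, values = parse_data_row("13.5\t0.25\t1.75", day_start)
print(stamp.isoformat(), values)
```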

At the moment there are two file formats in use: Format 2.3 and 3.0. Format 2.3 was made for 1 or 2 dimensional data. For 4D-CLOUDS more dimensions were necessary, so a new format was made: 3.0. If possible, use the newest format, 3.0. Data providers, please adhere to the standard as closely as possible, so that other people do not have to rewrite their reading routines when using your data. There is a lot of redundancy in the format. This can make the files bigger, but it helps to find errors and can make using the data easier for new users.

CLIWA-NET File format 2.3

- The number of header lines including this line
- Header format version: 2.3
- Data version
- Instrument name
- Latitude and longitude of the instrument
- Elevation of the instrument above the ground
- Starting date and time of the data
- Ending date and time of the data
- Info about the time axis of the data
- Info about the other axis, or for 2D data: 0,0,'null','null'
- The number of variables used in the data
- x lines with the description of the variables: 'name', 'unit', number of columns
- Line marking the beginning of the comment lines
- x lines with comments
- Line marking the end of the header
- Optional y-axis or variables
- Many lines with data

CLIWA-NET File format 3.0

- The number of header lines including this line
- Header format version: 3.0
- Data version
- Instrument name
- Latitude and longitude of the instrument
- Elevation of the instrument above the ground
- Starting date and time of the data
- Ending date and time of the data
- Number of dimensions
- Number of datalines
- D lines with information about the axis of the D dimensions
- The number of variables used in the data
- x lines with the description of the variables: 'name', 'unit', number of columns
- Line marking the beginning of the comment lines
- x lines with comments
- Line marking the end of the header
- Optional y-axis or variables
- Many lines with data

Explanations

'# HD LINES',x
number of header lines: total number of lines of the header (x=integer), including this line itself, but excluding the line with the y-axis or variables.
'FORMAT VERS',x.x
Header format version: version number of the file format, x.x is either 2.3 or 3.0.
'DATA VERS',x.x
Data version: this number identifies updates of the data. The first version will be 1.0. Minor updates of the data (e.g. new calibrations, small errors corrected, or time synchronized) update the last number: 1.1, 1.2. Significant changes (e.g. new processing methods) update the first number: 2.0, 3.0.
'INS NAME','name'
Instrument name: in case of higher level products the instrument name can also be the tool(s) or the algorithm used.
'LAT/LON',f1,f2
Latitude (f1=float) and longitude (f2) of the instrument; if not relevant or if changing (aircraft): 0, 0.
'ELEV',f,'unit'
Elevation of the instrument above the ground: f in "unit"; if it is changing, as for the aircraft: -9999 m. Please give the heights in the height axis of the products relative to the ground, and/or specify this relation in the comments.
'START',YYYY:MM:DD,HH:MM:SS
Starting date and time of the data: year (YYYY) in four digits; month (MM), day (DD), hour (HH), minutes (MM) and seconds (SS) in two digits each. Time should always be in UTC.
'STOP',YYYY:MM:DD,HH:MM:SS
Ending date and time of the data: year (YYYY) in four digits; month (MM), day (DD), hour (HH), minutes (MM) and seconds (SS) in two digits each.
'DIM',x
Dimension of the data product, which also specifies the number of '* INFO' lines in the header. This is only used in Format version 3.0.
'DATA LINES',x
The number of lines (x) with data. This is only used in Format version 3.0.
'U INFO',f1,f2,f3,'name','unit',dt
These lines provide the axis information. The number of these lines is equal to the dimension given in DIM, and they are called U, V, W, X, Y, Z, A, B, ... INFO, respectively. f1 is the number of data bins, f2 the value of the starting bin, f3 the value of the last bin. If the bins are not equidistant, please specify this in the comments. The first dimension should be time. Only this dimension has to specify a time shift (dt), which is the difference between the time the instrument indicated originally and the time used in this data, synchronized with other instruments. The last dimension specifies the data bins of the y-axis. If no line with y-axis or variables is given after the header, then f1,f2,f3,'name','unit' should be: 0.0,0.0,0.0,'null','null'. If a line with variables is present, the last dimension INFO should be 1.0,0.0,0.0,'null','null'.
'X INFO',x,y,z,'name','unit',dt
'Y INFO',x,y,z,'name','unit'
See 'U INFO' for comments. These names are used in format version 2.2.
'# VAR TYPES',x
The number of variables (x), which is also the number of header lines with variable information that follow. Note that this does not have to be the same as the number of columns, as the number of columns for a variable can be more than one.
variables
'variable','unit',x
Name of the variable ('variable'), its unit ('unit') and how many columns (x) this variable has. A variable with several columns could, for example, be drop counts for each diameter bin of an FSSP or radiances at a number of frequencies. If x is larger than one, please make sure that the user knows exactly what the columns are, either in the y-axis information (which can only be done for one variable) or in the comment lines.
%% BEGIN COMMENT
%% END COMMENT
Between these two lines you can put everything else you want to say. It is a good idea to give e-mail and web addresses for more information and cooperation.
y-axis or variables
This line is optional, although its presence should correspond with the information in the last dimension specified in the header, see 'U INFO'. This line can be used if there is one dimension that is not sampled equidistantly. It also makes plotting 2D data easier in packages like Transform, and 1D data in spreadsheets and many graph packages. This line is not part of the header.
Data lines
The data lines should start with a time, then other data separated by tabs. Please do not include any non-numeric data in here, but specify values for NaN, +INF, -INF, etc. in the comments.
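The header description above can be turned into a small reading routine. The Python sketch below is only an illustration, not a reference implementation. It assumes that the quoted keywords appear literally at the start of each header line, that fields are separated by commas, and that no y-axis or variables line follows the header. The sample file is entirely made up, and split_fields and parse_header are hypothetical helper names.

```python
import csv

# Made-up format 3.0 file for illustration only; real files may differ in detail.
SAMPLE_LINES = [
    "'# HD LINES',17",
    "'FORMAT VERS',3.0",
    "'DATA VERS',1.0",
    "'INS NAME','EXAMPLE'",
    "'LAT/LON',0,0",
    "'ELEV',0,'m'",
    "'START',2003:05:01,00:00:00",
    "'STOP',2003:05:01,01:00:00",
    "'DIM',2",
    "'DATA LINES',2",
    "'U INFO',2,0.0,0.5,'time','h',0",
    "'V INFO',0.0,0.0,0.0,'null','null'",
    "'# VAR TYPES',1",
    "'example variable','unit',1",
    "%% BEGIN COMMENT",
    "Made-up values for illustration only.",
    "%% END COMMENT",
    "0.0\t1.5",
    "0.5\t2.5",
]


def split_fields(line):
    """Split one header line on commas, honoring single-quoted strings."""
    return next(csv.reader([line.strip()], quotechar="'", skipinitialspace=True))


def parse_header(lines):
    """Return (header dict, remaining lines), using the '# HD LINES' count."""
    n_header = int(split_fields(lines[0])[1])
    header = {"axes": [], "variables": [], "comments": []}
    i = 1
    while i < n_header:
        line = lines[i].strip()
        if line.startswith("%% BEGIN COMMENT"):
            i += 1
            # Collect free text up to the end-of-comment marker.
            while i < n_header and not lines[i].startswith("%% END COMMENT"):
                header["comments"].append(lines[i].rstrip())
                i += 1
        else:
            key, *values = split_fields(line)
            if key == "# VAR TYPES":
                n_vars = int(values[0])
                for var_line in lines[i + 1 : i + 1 + n_vars]:
                    name, unit, n_columns = split_fields(var_line)
                    header["variables"].append((name, unit, int(n_columns)))
                i += n_vars
            elif key.endswith(" INFO"):
                header["axes"].append((key, values))
            else:
                header[key] = values
        i += 1
    return header, lines[n_header:]


header, body = parse_header(SAMPLE_LINES)
# Assumption: no optional y-axis line, so every remaining line is a data line.
rows = [[float(item) for item in line.split("\t")] for line in body if line.strip()]

# Use the redundancy in the header to catch truncated or damaged files.
if int(header["DATA LINES"][0]) != len(rows):
    raise ValueError("number of data lines does not match 'DATA LINES'")
print(header["INS NAME"], header["variables"], rows)
```

Header values are kept as strings here; converting them (for example START and STOP into dates) is left to the reader. A file that does contain a y-axis or variables line would need one extra line skipped before the data rows are read.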