New Dataset Naming Convention for ISCCP (2001)COMMENTS: The old dataset naming convention concerned data TAPES of a nearly universal form and their manipulation with mainframe operating systems. The new convention concerns data FILES for ftp over the Internet, their storage on disc drives and a wide variety of media, and their manipulation with UNIX workstations. There will be no duplicate file names: that is, if two files are found to have the same name, they will be identical in content. The proposed new naming convention assumes groupings of data files into directories at monthly time steps and by satellite where appropriate (there is one very small dataset for which the directory will cover a whole years). The proposed new naming convention tries to preserve as much of the old convention as possible and tries to be as informative as possible. To that end, each transmission of a directory will be accompanied by a number of ancillary meta-data files (many of which are fixed) that, together with the data files, provide a set of files that resembles the structure of the old data tapes. In addition, each ftp transmission session will include a LOG file that gives a one-line summary of each data file sent: file name, file size (number of bytes), checksum and md5 output. This file will be used to verify proper transmission of the data and then discarded. AC and BC data will not use this naming convention since an Internet form for these data already exists and does not need to be changed. Stage A data is not included in this convention and is assumed to remain in the original format with the original labeling for each data center. The C-series cloud products will not be modified to change to this new naming convention. FILE Name = ISCCP.TTTTTT.V.SATIDNN.YYYY.MM.DD.HHMM.DCN (maximum length = 44 characters), where TTTTTT, SATIDNN and DC are of variable length and all other fields are fixed length and the number of fields is fixed at 9.
ISCCP ftp ProtocolsEach SPC will "push" data to the GPC and ICA and the GPC will "push" data to the ICA (and NASA Langley ASDC). For this purpose and to allow checking and revision of files, each center will have an account on the target center's ftp server. The data will be transmitted uncompressed (unless this is absolutely necessary in specific cases). Each file transmitted will be named using the filename convention described above. Each ftp session may transmit a single or multiple files. Data files will be organized in monthly directories. Directory names will follow the filename convention, except that the DD and HHMM fields will be dropped: DIRECTORY Name = ISCCP.TTT.V.SATID.YYYY.MM.DCN The version number will be reserved for use to re-transmit the same dataset if it is changed at a later date. For IS data, MM = 99, that is this directory will contain a whole year of data. TTT in these directories will only include actual data types (B1, B2, B3, BT, SN, SI, TOVS, IS, TV, DX, D1, D2 ), not the meta-data types (XXOA, XXTOC, B3LWMAP, XXGRID, XXREAD, XXREADME). The files transmitted in each ftp session will be accompanied by a LOG file that is named: ISCCP.XXFTPLOG.YYYY.MM.DD.HHMM.DCN Where XX is the data type and the date and time fields can be filled with 9's as appropriate to the frequency of transmission sessions. The LOG file will contain a one-line summary for each file in the session: filename, filesize (number of bytes), checksum, md5 output Inclusion of md5 is optional but desired. The contents of the LOG file will refer to the uncompressed data, if data compression is used. If ftp data transmissions from the SPC are routine and on schedule, then no notification of data transmission is required. All data for a given month must be transmitted to the GPC and ICA by the 15th of the following month and a report (the current monthly report matrix) sent by e-mail by the same date. If data will be transmitted late or replaced after the cutoff date, then notification is required. The GPC and ICA will cross-check data files received against the monthly report matrices and notify each SPC within one-two working days that all files have been successfully received. The GPC will notify both the ICA and NASA Langley ASDC when data products are ready for transmission; this notification will also go to all data centers so that anyone may collect copies of the ISCCP data products. Contact Us: ISCCP Webmaster http://isccp.giss.nasa.gov/docs/NameConv.html Last updated: 2003:02:04 @ 14:02:51 | |