UNAVCO Home
   |    |   |  
Promoting Earth science by advancing high-precision techniques for the measurement of crustal deformation.

· Publications & Reports · Brochures · Community Bibliography · Periodic Reports · Proposals · Staff Publications · Workshop Publications
UNAVCO 1996 Annual Report
[Next] [Previous] [Contents]

3.0 Data Management and Archive Support


GPS data management encompasses the range of activities necessary to ensure that high-quality GPS data and its accompanying meta-data (descriptive data) are collected, transported, organized and safely stored in designated GPS archives, and then made readily available from these archives to the rest of the GPS community. The UNAVCO GPS Community Archive at the UNAVCO Boulder Facility (henceforth referred to as the "UNAVCO Archive" or the "Archive") and NASA's Crustal Dynamics Data Information System (CDDIS) at the Goddard Space Flight Center represent two such designated archives.

At UNAVCO, the Data Management and Archiving Group (DMAG) has long recognized that there is considerable overlap between the acquisition, transport, archival, and retrieval phases of GPS data management. The long-term goals of a GPS data archive should ensure the safe long-term storage of, and ready access to, accurate and correct data and all necessary meta-data. GPS data archiving is greatly simplified and improved when all phases of data handling match these long-term archival goals. UNAVCO is dedicated to the task of making this a reality for the GPS community and central to this task is the UNAVCO Archive.

The purpose of the Archive is three-fold. First, it serves as physical storage for the PI who is primarily responsible for the GPS data collection. Second, it serves as an on-line repository of GPS data for the present GPS community. Third, it serves as a long-term archive for future researchers who may want to search, identify, and recover validated GPS data and meta-data at earlier precise positions, in a specific spatial or temporal frame of interest. Not coincidentally, these three functions translate into the three primary components of the UNAVCO Archive (see Figure 3-1):

  • Physical Repository: Physical media (e.g. floppy disks, tapes, etc.), field log sheets, maps, photographs, and other physical records turned in to UNAVCO by the PI are organized, checked-in and stored in a dedicated room. Data are duplicated and stored on backup media. Data in the Physical Repository can be requested from the DMAG staff by telephone or email and sent using an agreed-upon physical format.
  • On-line Repository: Data and associated meta-data copied from the Physical Repository, or obtained electronically over the Internet, are first catalogued and stored on a 15-Gbyte RAID disk system and later migrated to a 500-Gbyte digital linear tapedrive jukebox. These data and meta-data are only minimally verified but are readily available to the GPS Community via the Internet.
  • Archive Database: Data and meta-data in the On-line Repository are completely validated using log sheets, shipping invoices, PI notes, and other sources of information using an interactive data-entry interface to an Oracle relational database system. This validated meta-data and links to the data in the Physical and On-line Repositories are stored in a relational database especially designed for GPS data and meta-data; this is the Archive Database. Once data and its meta-data have been validated (and possibly corrected) and this information entered into the Archive Database, the GPS Community may search and retrieve selected data using a search-and-retrieve user-interface on the World Wide Web to the same relational database system. Data are sent electronically over the Internet to the person making the request.

Figure 3-1. UNAVCO Archive Operations

To facilitate the GPS community's ability to collect, transfer and retrieve correct GPS data and meta-data, the DMAG also engages in other data management tasks peripherally related to the central task of maintaining the Archive, but which directly compliment and simplify archival goals. These related tasks include:

  • design and standardization of log sheets, both physical and electronic, for recording of necessary meta-data (see "Log Sheets" below)
  • development of software for the extraction and validation of data and meta-data from raw native binary data formats from various GPS receivers (see "TEQC" below)
  • co-development of safe GPS data transfer protocols (see "LDM/IDD" below)
  • co-design and -development of the "Seamless Archive", allowing for the GPS Community to access data and meta-data at any of the designated GPS archives without the need for knowing details about a specific archival site or where or how the data are stored
  • accomplishment of all these goals, as well as the primary task of maintaining the Archive, in a fashion consistent with the wide heterogeneity of the GPS Community which:
    • is involved in both episodic (campaign) and permanent station projects
    • uses a multiplicity of receiver types (e.g. Trimble, Ashtech, Rogue/TurboRogue/TurboStar, Leica, TI-4100, etc.)
    • accesses these receivers using a wide variety of communications, downloading or data-logging methods
    • uses a wide variety of computer hardware and operating systems
    • has a wide variety of data-release constraints (e.g. non-restricted availability, restricted availability, availability after time lapse from data collection, etc.)

The sections below discuss both the primary archiving tasks and these associated activities.

3.1 - Archival Process
3.2 - Current Archive Status
3.3 - Data Management and Archive Tasks in FY97
3.4 - Data Management and Archiving Group (DMAG) Staff and Budget


1996 Annual Report - 23 SEP 1997

[Next] [Previous] [Contents]


Last modified Tuesday, 08-Nov-2005 02:34:54 UTC

 

Home | About Us | Contact Us | Support | Search | Facility | PBO | Education & Outreach

Comments: webmasterATunavco.org
© 2009 UNAVCO, Inc.