DOI - FAQs
Q1. I am familiar with DOI's for journal articles. How does the DOI system work for data sets?
A1. The system works in the same way as for journal articles. A third party registration agent manages the DOI metadata and provides the mechanism for the DOI resolver service (dx.doi.org) to work. There are several registration agents; UNAVCO uses the one provided by the California Digital Library (CDL). The registration agent tracks the DOI metadata (author, publisher, publication date, etc.), and forwards the information to the resolver service. UNAVCO maintains a citation summary page; the URL to this page is uploaded to the registration service. Because the DOI system is meant to provide a way to recover the data set in question as well as other information to make the data set useful through time, UNAVCO has committed through a service agreement with CDL to maintaining this information and access to the data sets indefinitely.
Q2. Which data sets in the UNAVCO Archive are "publishable" using DOIs?
A2. UNAVCO aims to keep extensive metadata about each archived data set. Data sets that have complete metadata (including principal investigator(s), funding, and sufficient observational metadata such as monument, location, instrumentation), are publishable. Data sets that are incomplete in any way must be withheld from publication.
Q3. What metadata are stored in a third party registry when UNAVCO publishes a data set?
A3. The DataCite system and the CDL Registry allows a variety of metadata to be stored to uniquely define and document the associated data set. The DataCite metadata schema includes mandatory and optional items. The mandatory metadata are: Identifier (the DOI), Creator (the authors), Title, Publisher, Publication Year. In addition, UNAVCO also provides the optional metadata to the CDL and DataCite systems including subtitles and related DOIs.
Q4. Where do I find citation information (author, publication date, title, etc.) for published data sets?
A4. This information is found on the DOI citation summary page for each data set that has been published. Use the UNAVCO DOI search tool to find published data sets.
Q5. What data set types does UNAVCO publish using DOIs?
A5. UNAVCO has implemented DOI publication for GPS/GNSS datasets. UNAVCO publishes (assigns DOIs) for four different data set types, all with associated GPS/GNSS data that have been archived to certain quality standards. The types are GPS/GNSS Campaign Data Sets; GPS/GNSS Continuous Station Data Sets; Aggregated Data Sets; and Composite Data Sets. The first two are considered primary data set types. The third and fourth types are derived or secondary data set types because they are composed of two or more data sets of the primary type.
GPS/GNSS Campaign Data Set - This will be a data set defined between UNAVCO and the Principal Investigators at the time of archiving, and generally will include observations and metadata from GPS/GNSS data collection at a number of recoverable monuments. Data collection is temporally limited at each monument. Typically the physical association of the monument and the observation system GPS/GNSS antenna is through a tripod/tribrach, spike mount, or other temporary setup.
GPS/GNSS Continuous Station Data Set – Observations and metadata from GPS/GNSS data collection at a single recoverable monument. Data collection typically consists of daily 24 hour observation files, or sub-daily files, or streamed data that are captured as files for archiving. Collection is often continuous for years. Typically the physical association of the monument and the observation system GPS/GNSS antenna is through a permanent setup such as a shallow or deep drilled-braced monument. Unlike the campaign data set type, which is complete and unchanging through time, the Continuous Station Data Set is open ended (until the station is retired). The DOI will be associated with an increasing data set through time; because of this aspect of this data set type it is important when citing this data to qualify the citation with an access date of the data and the temporal window of data used in the research. See Citation Guidance for a permanent/continuous station data set.
Aggregated GPS/GNSS Data Sets - These will often be an associated group of campaign data sets or a network of stations. A campaign example is the Mammoth/Mojave 1994 campaign, which consists of three individual primary data sets: Mammoth, Mojave, and Combined Sites). For permanent/continuous stations, networks or sub-networks of stations may be aggregated into one aggregated data set and assigned a DOI. In this case, the purpose of the aggregated data set is for conveniently designating multiple associated stations for citing in a journal article (ie, in order to avoid citation lists containing tens or hundreds of data set references).
Composite GPS/GNSS Data Sets - A composite data set DOI is one that is comprised of two or more subset DOIs that together make up what would normally be considered to be a single data set. The most common example is a permanent (continuous) GPS/GNSS station where the principal investigator (author) changed at a particular point in time. The existing network (Nucleus) stations that were adopted by UNAVCO as part of PBO are examples. The entire data set is one DOI and is comprised of a separate DOI for each time period with a different author or set of authors.
Q6. When I use the DOI search, my data sets are not shown. What is happening?
A6. We are working to assign the DOIs for all publishable data sets as quickly as possible. If you would like to have your data sets advanced in the queue, please email archive-gpsunavco.org.
Q7. It would be useful to me to define an aggregated data set with a DOI consisting of a number of primary data sets that I have in mind. Is there a way to request this?
A7. This can be done on a case by case basis; please email archive-gpsunavco.org.
Please submit additional questions for this FAQ to archive-gpsunavco.org.
Last modified: Tuesday, 15-Nov-2016 14:29:38 UTC