페이지 트리

ENVRIplus Theme 2:

Requirements information gathering exercise

Your RI name here

RI representative(s):

  • Firstname Lastname,

Institute

Version date

1. Identification and citation

While some of the questions should be quite easy to answer, others could require a bit of thinking and effort, and then there are also a few that are probably not applicable to your Research Infrastructure.

In any case, it is preferable that you give a rough estimate or a very schematic response, than leaving a question unanswered. If you get stuck, either get in touch with me (Maggie Hellström at margareta.hellstrom@nateko.lu.se), or simply write "can't answer", "don't understand" or "not applicable"...

Please send the completed questionnaire to Maggie by 18:00 on Tuesday, December 15 2015 at the very latest!

IDENTIFICATION

1)     What granularity do your RI’s data products have:

a)      Content-wise (all parameters together, or separated e.g. by measurement category)?

<your answer here>

b)      Temporally (yearly, monthly, daily, or other)?

<your answer here>

c)      Spatially (by measurement station, region, country or all together)?

<your answer here>

2)     How are the data products of your RI stored - as separate “static” files, in a database system, or a combination?

<your answer here>

3)     How does your RI treat the “versioning” of data - are older datasets simply replaced by updates, or are several versions kept accessible in parallel? How do you identify different version of the same dataset?

<your answer here>

4)     Is it important to your data users that:

a)      Every digital data object is tagged with a unique & persistent digital identifier (PID)?

<your answer here>

b)      The metadata for data files contains checksum information for the objects?

<your answer here>

c)      Metadata (including any documentation about the data object contents) is given its own persistent identifier?

<your answer here>

d)      Metadata and data objects can be linked persistently by means of PIDs?

<your answer here>

5)     Is your RI currently using, or planning to use, a standardized system based on persistent digital identifiers (PIDs) for:

a)      “Raw” sensor data?

<your answer here>

b)      Physical samples?

<your answer here>

c)      Data undergoing processing (QA/QC etc.)?

<your answer here>

d)      Finalized “publishable” data?

<your answer here>

6)     Please indicate the kind of identifier system that are you using - e.g. Handle-based (EPIC or DOI), UUIDs or your own RI-specific system?

<your answer here>

7)     If you are using Handle-based PIDs, are these handles pointing to “landing pages”? If so, are these pages maintained by your RI or an external organization (like the data centre used for archiving)?

<your answer here>

8)     Are costs associated with PID allocation and maintenance (of landing pages etc.) specified in your RI’s operational cost budget?

<your answer here>

 

CITATION

9)     How does your “designated scientific community” (typical data users) primarily use your data products? As input for modelling, or for comparisons?

<your answer here>

10)            Do your primary user community traditionally refer to datasets they use in publications:

a)      By providing information about producer, year, report number if available, title or short description in the running text (e.g. under Materials and Methods)?

<your answer here>

b)      By adding information about producer, year, report number if available, title or short description in the References section?

<your answer here>

c)      By DOIs, if available, in the References section?

<your answer here>

d)      By using other information?

<your answer here>

e)      By providing the data as supplementary information, either complete or via a link

<your answer here>

11)            Is it important to your data users to be able to refer to specific subsets of the data sets in their citation? Examples:

a)      Date and time intervals

<your answer here>

b)      Geographic selection

<your answer here>

c)      Specific parameters or observables

<your answer here>

d)      Other

<your answer here>

12)            Is it important to be able to refer to many separate datasets in a collective way, e.g. having a collection of “all data” from your RI represented by one single DOI?

<your answer here>

13)            What strategy does your RI have for collecting information about the usage of your data products?

<your answer here>

a)      Downloads/access requests

<your answer here>

b)      Visualization at your own data portal

<your answer here>

c)      Visualization at other data portals

<your answer here>

d)      References in scientific literature

<your answer here>

e)      References in non-scientific literature

<your answer here>

f)        Scientific “impact”

<your answer here>

14)            Who receives credit when a dataset from your RI is cited?

a)      The RI itself

<your answer here>

b)      The RI’s institutional partners (all or in part, depending on the dataset contents)

<your answer here>

c)      Experts in the RI’s organization (named individuals)

<your answer here>

d)      “Principal investigators” in charge of measurements or data processing (named individuals)

<your answer here>

e)      Staff (scientists, research engineers etc.) performing the measurements or data processing (named individuals)

<your answer here>

15)            What steps in tooling, automation and presentation do you consider necessary to improve take up of identification and citation facilities and to reduce the effort required for supporting those activities?

<your answer here>