AnaEE is still in its preparatory phase, so, as the AnaEE representatives pointed out, many of the questions on this topic could only be answered in a very preliminary way.
The following information was contributed via e-mail by the RI representatives directly to the topic coordinator Maggie Hellström.
The data are collected in distributed site databases, some of which may be gathered at the national level. A query interface allows data to be retrieved flexibly at different levels (from a single parameter to all the data of a given site or experiment, or even data from different sites). We have two kinds of data sets:
- from long-term experiments, where the data are collected in a site database.
- from short-term experiments, such as those run under controlled conditions in ECOTRON, where the data are gathered in a project database.
Yearly, monthly, daily, hourly and sometimes at higher temporal resolution.
By measurement station or by network of stations. We do not produce data products that are representative of an area, at any scale.
Mainly in database information systems.
Not yet addressed. We are starting data production at the national level, in France. The intention is to expose the latest updates in the database system. Published data, however, will be versioned whenever their content is updated.
Not yet. The intention is to have PIDs at the data set level (a site, an experiment). The approach is not yet mature for finer-grained descriptions (e.g. parameter, variable …). However, we are working on annotating the data using an ontological approach, which would lead to unique identification of every parameter.
Not yet applicable.
Not yet. DOIs will be used.
Not yet applicable.
Not yet decided. However, it is very likely that raw data will be stored together with processed data, so that users can reprocess them.
Not yet implemented. The plan is to persistently annotate the different objects on which observations are made (soil sample, soil layer, plot, tree, animal …).
Not yet implemented. The intention is to define different levels of processing (L0, L1, L2, L3 …) and to have an array with quality codes. Some of the levels (not necessarily all) will need a PID.
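As a purely illustrative sketch of this intention, and not an AnaEE design, the snippet below pairs a series of values with a parallel array of quality codes and an optional PID per processing level. The class name `LevelledSeries`, the integer code scheme (0 meaning "good") and the PID string are all hypothetical assumptions; the response does not define any of them.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional


class Level(Enum):
    # Level names come from the response; their exact meaning is not yet defined.
    L0 = 0
    L1 = 1
    L2 = 2
    L3 = 3


@dataclass
class LevelledSeries:
    level: Level
    values: List[float]
    quality: List[int]         # one code per value; the code scheme is hypothetical
    pid: Optional[str] = None  # only some levels are expected to carry a PID

    def __post_init__(self) -> None:
        # The quality array must stay aligned with the values it describes.
        if len(self.values) != len(self.quality):
            raise ValueError("values and quality codes must have the same length")

    def accepted(self, max_code: int = 0) -> List[float]:
        """Return the values whose quality code is at most max_code."""
        return [v for v, q in zip(self.values, self.quality) if q <= max_code]


# Raw (L0) series without a PID; the second value is flagged as suspect.
raw = LevelledSeries(Level.L0, [12.1, 99.9, 12.4], [0, 2, 0])

# A higher-level series derived from it, carrying a placeholder PID.
checked = LevelledSeries(Level.L2, raw.accepted(), [0, 0], pid="hypothetical-pid-0001")
```

Keeping the raw series alongside the derived one reflects the stated aim that users should be able to reprocess the data themselves.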
Not yet decided.
Not yet. Our plan is to use DOIs for published data sets and/or our own specific system for descriptions at the parameter level.
Not yet decided.
Not yet addressed.
Both
Yes, in the Materials and Methods section, with appropriate references and acknowledgements.
See previous answer.
Not widely yet, but could be used.
No other known practices
Yes
Yes
Yes
Yes
Data quality, accuracy.
Yes, at the site level or for an experiment that produced several datasets. Not necessarily for the whole RI.
Not yet fully defined
We expect to have user registration (an account in the information system), download tracking, identification in scientific publications, and citation (DOI, transmission of publications/reports …).
Yes, access requests
Not yet defined
No.
Yes
Yes, if easily collected.
To be defined
Yes
Yes
No
Yes
Yes
How to deal with incremental datasets?
How to link ontology-based annotation and PIDs?
| Go-between | Questionnaire response received by topic coordinator Maggie Hellström |
|---|---|
| RI representative | Christian Pichot and André Chanzy |
| Period of requirements collection | March 2016 |
| Status | Information gathered, no analysis done yet |