To be completed by the go-between with help from the RI-Rep.
Cover the stages of the data life-cycle in which the RI is involved, that pertain to the <topic> with references to more detail if the RI has them. Include quantitative and timeliness information, intended uses and so on - if such information is available.
Insert a summary of the main requirements for this RI for the current topic. Point out any unusual features, and comment on the extent to which these requirements are fixed or evolving.
1. Do you use catalogues or require using catalogues for the following items?
Metadata catalogue for specification of data products.
2. For each used or required catalogue, consider the following questions:
-> Which fields do you use for describing items?
ESGF: use metadata (MD); in preparation: citation MD.
LTA: use MD, citation MD, contacts, rights, access and storage; in preparation: provenance.
->Which standards do you apply for these fields (format or standard model)?
ISO, DIF (Directory Interchange Format), DC (Dublin Core), etc.
->Do you use controlled vocabularies for these fields? If so, please cite the vocabulary service providers.
ESGF: NetCDF-CF (www.cfconventions.org) and lists held in a central repository (a remake is in progress; see github.com/ES-DOC/esdoc-cim-cv, ES-DOC/esdoc-cv and others).
LTA: Just internal lists.
->Do you maintain a cross-link or inter-links between:
Work is in progress on cross-links between data and simulation metadata and model metadata.
LTA: cross-links to various publications (with DOI).
CMS: Plone, CKAN.
LTA: Oracle DB, JavaSP.
ESGF: Lucene indexing, PostgreSQL DB.
->Human inputs: Do you provide/need facilities for editors/reviewers to maintain the metadata in the catalogues (e.g. forms, validation workflow, etc.)? If so, please describe them briefly.
LTA: Oracle SQL Developer.
->Machine inputs: Do you use/need automated harvesting to populate your catalogues? If so, which protocol do you use (e.g. CSW, OAI-PMH, other, specific)?
ESGF: 1) automatic metadata harvesting from netCDF file headers into the DB; 2) Lucene/SolrCloud for metadata aggregation and presentation to the user; 3) external harvesting from the DB is possible.
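
Step 1 of the ESGF answer above (header attributes copied into a database) can be illustrated with a minimal sketch. It is not the ESGF publisher: the attribute names, the `dataset_md` table and the `harvest` helper are all hypothetical, a plain dictionary stands in for the attributes a netCDF reader would return, and SQLite stands in for the production database.

```python
import sqlite3

# Hypothetical global attributes, standing in for what a netCDF reader
# would return from a file header (no netCDF library is used here).
header_attrs = {
    "tracking_id": "hypothetical-0001",
    "institution": "Example Institute",
    "experiment": "historical",
    "frequency": "mon",
}

# Assumed subset of attributes that the catalogue stores.
HARVESTED_FIELDS = ("tracking_id", "institution", "experiment", "frequency")


def harvest(conn, attrs):
    """Copy the selected header attributes into the catalogue table."""
    row = tuple(attrs.get(name) for name in HARVESTED_FIELDS)
    # The primary key on tracking_id makes re-harvesting idempotent.
    conn.execute("INSERT OR REPLACE INTO dataset_md VALUES (?, ?, ?, ?)", row)
    conn.commit()


conn = sqlite3.connect(":memory:")  # SQLite as a stand-in for the real DB
conn.execute(
    "CREATE TABLE dataset_md ("
    "tracking_id TEXT PRIMARY KEY, institution TEXT, "
    "experiment TEXT, frequency TEXT)"
)
harvest(conn, header_attrs)
harvest(conn, header_attrs)  # harvesting the same file again adds no row
rows = conn.execute("SELECT * FROM dataset_md").fetchall()
print(rows)
```

Keying the table on a per-file identifier is one simple way to keep repeated harvesting runs from creating duplicate rows.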
->How do you manage duplicates? i.e. Do you apply governance rules in a network of catalogues, do you use unique identifiers, or take other actions?
Checksums and unique IDs.
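
As a rough illustration of how checksums and unique IDs can work together to catch duplicates, the sketch below keeps two indexes and classifies each incoming record. The `Catalogue` class, the status strings and the choice of SHA-256 are assumptions made for the example; the answer above does not state which checksum algorithm or ID scheme is used.

```python
import hashlib


def checksum(payload: bytes) -> str:
    """Content checksum (SHA-256 chosen here only for illustration)."""
    return hashlib.sha256(payload).hexdigest()


class Catalogue:
    """Hypothetical catalogue that rejects duplicates by ID or by content."""

    def __init__(self):
        self.by_id = {}        # unique ID -> checksum
        self.by_checksum = {}  # checksum -> unique ID

    def register(self, uid: str, payload: bytes) -> str:
        digest = checksum(payload)
        if uid in self.by_id:
            # Same ID seen before: identical content is a harmless repeat,
            # different content is a conflict that needs human attention.
            return "duplicate-id" if self.by_id[uid] == digest else "conflict"
        if digest in self.by_checksum:
            # New ID, but the content is already catalogued under another ID.
            return "duplicate-content"
        self.by_id[uid] = digest
        self.by_checksum[digest] = uid
        return "added"


cat = Catalogue()
results = [
    cat.register("a", b"x"),  # first sighting
    cat.register("a", b"x"),  # same ID, same content
    cat.register("b", b"x"),  # new ID, same content
    cat.register("a", b"y"),  # same ID, different content
]
print(results)
```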
->What specific features are provided/required in your web discovery function (multi-criteria search, graphical selection components (e.g. map, calendar), facets, keyword or natural language)?
ESGF: faceted search.
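
For readers unfamiliar with the term, a faceted search filters records by field constraints and reports, for each field, how many of the remaining hits carry each value. The sketch below shows the idea on an in-memory list; it does not call the ESGF search service, and the records, field names and `faceted_search` helper are illustrative assumptions.

```python
from collections import Counter

# Illustrative records; field names and values are assumptions.
records = [
    {"project": "CMIP5", "variable": "tas", "frequency": "mon"},
    {"project": "CMIP5", "variable": "pr", "frequency": "day"},
    {"project": "CORDEX", "variable": "tas", "frequency": "day"},
]


def faceted_search(records, **constraints):
    """Return matching records plus per-field value counts over the hits."""
    hits = [
        r for r in records
        if all(r.get(field) == value for field, value in constraints.items())
    ]
    facets = {}
    for r in hits:
        for field, value in r.items():
            facets.setdefault(field, Counter())[value] += 1
    return hits, facets


hits, facets = faceted_search(records, variable="tas")
print(len(hits), dict(facets["project"]))
```

The counts returned alongside the hits are what a user interface would display next to each facet value to guide further narrowing.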
->Do you evaluate the accessibility, quality, and usage of your catalogue by using a dashboard or value-added products? If so, do you provide/need:
No.
If so, please describe them briefly.
All metadata are free and open.
4. Machine outputs:
Metadata: via OAI-PMH, in ISO, DC, DIF, etc.
Partially, e.g. to provide metadata as part of the World Data Center federation.
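
To make the OAI-PMH machine output concrete, the following sketch builds a `ListRecords` request for Dublin Core (`oai_dc`) and extracts identifier and title from a response. The endpoint URL, the sample response and the helper names are hypothetical; only the verb, the `metadataPrefix` parameter and the XML namespaces come from the OAI-PMH 2.0 and Dublin Core specifications.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

BASE_URL = "https://catalogue.example.org/oai"  # hypothetical endpoint

NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hand-written sample shaped like a ListRecords response (not real data).
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:ds-1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example dataset</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""


def list_records_url(base_url, metadata_prefix="oai_dc"):
    """Build the URL of an OAI-PMH ListRecords request."""
    query = urlencode({"verb": "ListRecords", "metadataPrefix": metadata_prefix})
    return base_url + "?" + query


def parse_records(xml_text):
    """Return (identifier, title) pairs from a ListRecords response."""
    root = ET.fromstring(xml_text)
    pairs = []
    for rec in root.iterfind(".//oai:record", NS):
        ident = rec.findtext("oai:header/oai:identifier", namespaces=NS)
        title = rec.findtext(".//dc:title", namespaces=NS)
        pairs.append((ident, title))
    return pairs


url = list_records_url(BASE_URL)
parsed = parse_records(SAMPLE_RESPONSE)
print(url)
print(parsed)
```
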
| Go-between | Yin Chen |
|---|---|
| RI representative | Sylvie Joussaume <sylvie.joussaume@lsce.ipsl.fr> Francesca Guglielmo <francesca.guglielmo@lsce.ipsl.fr> |
| Period of requirements collection | Oct-Nov 2015 |
| Status | Completed |
Add additional rows to the above table if you have covered this topic with this RI by holding discussions with several people, or if you have delegated some discussions; to show the full authorship and duration.