The collection of raw scientific data requires coordination between the CV Data Acquisition phase (which extracts the raw data from instruments) and the CV Data Curation phase (which packages and stores the data).

 

For the notation used, see Notation of Computational Viewpoint Models (notation_cv_objects).

The delivery of raw data into a research infrastructure is driven by collaboration between an acquisition service and a data transfer service (both CV service objects). This process can be configured using a field laboratory (a CV presentation object), subject to authorisation by the AAAI service via its authorise action interface. In either case, the acquisition service identifies the instruments that act as data sources and provides information on their output behaviour, whilst the data transfer service provides a data transporter (a CV component object) that can establish (multiple, persistent) data channels between instruments and data stores. The data transporter, here a raw data collector, can initiate data transfer by requesting data from one or more instrument controllers and preparing one or more data store controllers to receive the data.
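The interaction described above can be sketched in Python. This is only an illustrative sketch: every class and method name below (for example `identify_instruments`, `create_raw_data_collector`, `retrieve_data`, `import_data`) is an assumption made for the example and is not defined by the reference model, which specifies objects and interfaces rather than code.

```python
class InstrumentController:
    """Hypothetical stand-in for an instrument controller (a raw data source)."""

    def __init__(self, name, readings):
        self.name = name
        self._readings = list(readings)

    def retrieve_data(self):
        # Hand over whatever the instrument has produced so far.
        data, self._readings = self._readings, []
        return data


class DataStoreController:
    """Hypothetical stand-in for a data store controller."""

    def __init__(self):
        self.prepared = False
        self.received = []

    def prepare(self):
        self.prepared = True

    def import_data(self, source, data):
        if not self.prepared:
            raise RuntimeError("data store was not prepared for transfer")
        self.received.append((source, data))


class RawDataCollector:
    """A data transporter linking instrument controllers to data stores."""

    def __init__(self, instruments, stores):
        self.instruments = instruments
        self.stores = stores

    def transfer(self):
        # Prepare every store first, then request data from each instrument.
        for store in self.stores:
            store.prepare()
        for instrument in self.instruments:
            data = instrument.retrieve_data()
            for store in self.stores:
                store.import_data(instrument.name, data)


class AcquisitionService:
    """Knows which instruments act as data sources."""

    def __init__(self, instruments):
        self._instruments = list(instruments)

    def set_instruments(self, instruments):
        self._instruments = list(instruments)

    def identify_instruments(self):
        return list(self._instruments)


class DataTransferService:
    """Provides data transporters on request."""

    def create_raw_data_collector(self, instruments, stores):
        return RawDataCollector(instruments, stores)


class AAAIService:
    def __init__(self, allowed_users):
        self._allowed = set(allowed_users)

    def authorise(self, user):
        return user in self._allowed


class FieldLaboratory:
    """Optional configuration front end; changes require authorisation."""

    def __init__(self, aaai):
        self._aaai = aaai

    def configure(self, user, acquisition, instruments):
        if not self._aaai.authorise(user):
            raise PermissionError(f"{user} may not configure acquisition")
        acquisition.set_instruments(instruments)


def deliver_raw_data(acquisition, transfer_service, stores):
    # Delivery itself is driven by the two services working together.
    instruments = acquisition.identify_instruments()
    collector = transfer_service.create_raw_data_collector(instruments, stores)
    collector.transfer()
    return collector
```

In this sketch only the configuration step is gated by the AAAI service; delivery proceeds with whatever instruments the acquisition service currently reports, mirroring the text's distinction between optional configuration and the delivery itself.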

The raw data collector is responsible for packaging any raw data obtained into a format suitable for curation; this may entail chunking data streams, assigning persistent identifiers and associating metadata with the resulting datasets. To assist in this, a raw data collector may acquire identifiers from a PID service. It may also register the presence of new data, and any immediately apparent data characteristics, in infrastructure data catalogues by invoking an update operation on the catalogue service.
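As a rough illustration of the packaging step, the sketch below chunks a data stream, obtains an identifier for each chunk, attaches minimal metadata and registers the result in a catalogue. The names `PIDService.acquire_identifier`, `CatalogueService.update` and `package_raw_data`, the metadata fields, and the UUID-based identifiers are all assumptions for the example; a real PID service would mint resolvable persistent identifiers rather than local UUIDs.

```python
import itertools
import uuid


class PIDService:
    """Hypothetical PID service; UUIDs are placeholders for real PIDs."""

    def acquire_identifier(self):
        return f"pid:{uuid.uuid4()}"


class CatalogueService:
    """Hypothetical catalogue service exposing an update operation."""

    def __init__(self):
        self.entries = {}

    def update(self, pid, metadata):
        self.entries[pid] = metadata


def chunk(stream, size):
    """Split an iterable data stream into lists of at most `size` records."""
    iterator = iter(stream)
    while True:
        block = list(itertools.islice(iterator, size))
        if not block:
            return
        yield block


def package_raw_data(stream, source, chunk_size, pid_service, catalogue):
    """Package a raw stream into identified, catalogued datasets."""
    datasets = []
    for index, block in enumerate(chunk(stream, chunk_size)):
        pid = pid_service.acquire_identifier()
        # Only immediately apparent characteristics are recorded here.
        metadata = {"source": source, "sequence": index, "records": len(block)}
        catalogue.update(pid, metadata)
        datasets.append({"pid": pid, "metadata": metadata, "data": block})
    return datasets
```

For example, `package_raw_data(range(5), "thermo", 2, PIDService(), CatalogueService())` would yield three datasets holding two, two and one records respectively, each with its own identifier and catalogue entry.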