Principal Investigators: Miklós Bán. University of Debrecen, Dept. of Evolutionary Zoology. banm@vocs.unideb.hu

Shepherds: Miguel Caballer. Universitat Politècnica de València. micafer1@upv.es

Entry in the community requirement database: OpenBioMaps data management service for biological sciences and biodiversity conservation

About the pilot

We would like to create a service with EOSC that allows multiple users to run tasks that are above the level of a PC through the same interface. In fact, we would like to develop a “service in service” - specifically for projects that collect nature conservation and biodiversity data.

The most common computing tasks come from the following areas:

To serve these diverse tasks we need a fully configurable VM which let us deploy our service interface (API) which will be available in the OpenBioMaps Network and provide computation capacity access to the involved projects.

According to our recent experiences in our PC based local computational cluster, the number of processors is the most important in these ecological analyzes. A “typical” analysis is now running at an acceptable rate on 16 threads. The parallel computing requirements of image analysis can be much higher, and GPU usage can be interesting there. Some analyzes, for example, genetic analyzes or larger spatial analyzes require a lot of memory.

Description of supported work

General

We would like to create an interface that allows user-level access to data computing from OBM-based databases and to run custom analyzes so that the system supports users in the preparation of new analyzes based on the analysis performed.

Use Cases

Team

ParticipantRoleName and Surname

PIMiklós Bán

ShepherdMiguel Caballer

Technical support






Technical Plan

The full technical plan can be found here: 

Work planned for Q1

  • Integration with EGI Cloud Compute.
  • Deploy and Configure OBM node in test environment manually.

Work planned for Q2

  • Create TOSCA Recipes and Ansible roles needed to deploy the application automatically using IM.

Work planned for Q3

  • Deploy OBM node to production environment using the developed recipes.
  • Analyse EOSC data services to be used by the application:
    1. EGI DataHub.
    2. B2 services (Drop, Find, Handle, Share)
    3. EGI Services (Training Infrastructure, Data Transfer)
    4. EOSC Marketplace services (GeoDAB, D4Science spatial services, Alien and Invasive Species Virtual Research Environment, Biodiversity, EODC JupyterHub for global Copernicus data)

Work planned for Q4

  • Performance test of all nodes.

EOSC services and providers

Providers

Services

Services that will be explored during project lifetime::