Short descriptionCLARIN
Type of community

Thematic Services

Community contact
Interviewer
Date of interview

 

User stories

Requirements are based on a user story, which is  is an informal, natural language description of one or more features of a software system. User stories are often written from the perspective of an end user or user of a system. Depending on the community, user stories may be written by various stakeholders including clients, users, managers or development team members. They facilitate sensemaking and communication, that is, they help software teams organize their understanding of the system and its context. Please do not confuse user story with system requirements. A user story is an informal description of a feature; a requirement is a formal description of need (See section later).

User stories may follow one of several formats or templates. The most common would be:

"As a <role>, I want <capability> so that <receive benefit>"

"In order to <receive benefit> as a <role>, I want <goal/desire>"

"As <persona>, I want <what?> so that <why?>" where a persona is a fictional stakeholder (e.g. user). A persona may include a name, picture; characteristics, behaviours, attitudes, and a goal which the product should help them achieve.

Example:

“As provider of the Climate gateway I want to empower researchers from academia to interact with datasets stored in the Climate Catalogue, and bring their own applications to analyse this data on remote cloud servers offered via EGI.”


Note with respect to roles:

‘Researcher’ is a non-specific researcher


‘CLARIN user’ is a user working in the CLARIN infrastructure environment. Typical CLARIN users are linguists.


‘CLARIN data’ is data specific for the CLARIN community and stored and avaliable from CLARIN centers


‘Community manager’ is responsible for some part of community services e.g. a repository or a community specific service contributed to EOSC


No.

User stories

US1

A researcher wants to find relevant resources by the available metadata, using keywords or other search dimensions (facets) such as date, location, language, format, etc. to use in their work. Many of such resources are available through the CLARIN infrastructure.

US2

A linguist wants to be able to find (software) tools that can be used to process the data that they have found. For instance, they want to find a tokenizer for the Dutch language.

US3

A repository manager wants to make a repository and its resources findable for researchers. There may be various forms of resources which may have anywhere from no metadata to well-defined elaborate metadata based on specific schema.

US4

A community manager wants to make some language technology tools findable for researchers. The tools have minimal metadata.

US5

A researcher wants to be able to access the content of a resource that he has found using a search engine or other means. He does not want to need separate credentials to access resource and prefers to use his institute’s credentials for access.

US6

A researcher wants to manage a group of resources (not limited to a single existing collection or site) that are relevant for her in a way that they are easily findable, accessible, and citable.

US7A community manager wants to group related resources from their repository in citable collections.
US8A researcher wants to know what tools can be used to process a given resource that they have. They may have found the resource through an online repository or have produced it themselves. The researcher would like to have an overview quickly showing a selection of tools that are relevant and useful.
US9A researcher or software engineer has developed a tool for processing resources. They want to make this tool available, findable, and accessible to as many researchers and users as possible. They prefer if they can make the tool available and maintain it themselves without having to ask help from a middle layer
US10A user of an EOSC-hub compatible repository or data discovery tool wants to be able to find and access CLARIN data resources and collections from within the service they are using.
US11A linguist using one of the EOSC-hub compatible discovery or repository services wants to be able to see what linguistic tools they can use to process a given data object, without leaving the environment of the service that they are using.



Use cases

A use case is a list of actions or event steps typically defining the interactions between a role (known in the Unified Modeling Language as an actor) and a system to achieve a goal.

Include in this section any diagrams that could facilitate the understanding of the use cases and their relationships.




Step

Description of action

Dependency on 3rd party services (EOSC-hub or other)

UC1


  • The VLO is a search tool that can be used to find relevant data by searching the metadata
  • The search can be done using in various facets, e.g. date, location, genre, collection
  • The primary users are researchers, mainly linguists
  • Community data managers can use the VLO to make their data available to a larger audience
  • B2FIND is for non-specific researchers an easier to use, but less specific tool to find resources


VLO, B2FIND

(US1-4)

UC2
  • The VCR is a repository of collection metadata
  • In the VCR virtual collections are defined that contain links to resources stored in other repositories
  • Researchers  can use the VCR to group data into virtual collections of their choosing and make these collections  findable and citable
  • Researchers can seamlessly access virtual collection content

VCR, B2ACCESS0

(US5-7)

UC3
  • The LR switchboard can provide a set of relevant tools for specific data types

  • It is not meant as a stand-alone service, but as a feature that can be integrated  with other services.

  • Users can use this feature to find relevant tools for processing data objects directly from within the data discovery or hosting service.

  • Tool and service providers can make their tools available via the LR switchboard by specifying its features (tool metadata).

LR Switchboard

(US8-9)

UC4

Reverse integration of VCR with B2SHARE  (B2DROP/B2STAGE?).

Users can access and use the VCR from within relevant EOSC services to download or copy the data to a new destination.

VCR,

B2SHARE, B2DROP

(US10)

UC5

Reverse integration of LR Switchboard with B2FIND/B2SHARE/B2DROP

The LR Switchboard can be accessed by users from within relevant EOSC services to show what tools can be applied on a given resource

LR Switchboard,

B2SHARE, B2DROP

(US11)

UC6

CLARIN community manager can host the CLARIN thematic services with a suitable hosting service

VCR, LR Switchboard, VLO

B2HOST, FederatedCloud?

(US12)



Requirements

Technical Requirements


Requirement ID

EOSC-hub service

GAP (Yes/No) + description

Requirement description

Source Use Case

Example

EOSC-hub AAI

Yes: EOSC-hub AAI doesn’t support the Marine IdP

EOSC-hub AAI should accept Marine IDs

UC1

RQ1VLO, B2FIND

Yes,

not all CLARIN metadata is harvested (scaling and mapping problem)


Make VLO metadata also findable in B2FIND. At least the available collection metadata should be harvested through a managed VLO OAI-endpoint


UC1
RQ2

VLO, B2FIND

No

(Is the language-related data already harvested?)

Make B2FIND data accessible in VLO

DBR: unsure if this is really wanted

UC1

RQ3

VCR, B2ACCESS

Yes.

AAI integration between CLARIN and EOC-hub

UC2

RQ4

VCR, B2FIND, B2SHARE

Yes.

Link from VCR to EOSC-hub services

Make metadata resources available in relevant EOSC-hub services, e.g. B2SHARE, B2FIND, etc., accessible from the VCR and in suitable format for building virtual collections

UC2

RQ5

VCR, B2SHARE, B2FIND, EGI DataHub, …

Yes.

Link from EOSC-hub services to VCR

Make the VCR available when using EOSC-hub repository and discovery services, such that this data can be downloaded e.g. used to make new collections

UC4

RQ6

LR Switchboard, B2DROP, B2SHARE, ...

Yes.

Link from EOSC-hub services to LR Switchboard

Adding a link to the GUI of relevant EOSC-hub services for accessing the LR-Switchboard

UC3, UC5


Capacity Requirements


EOSC-hub services

Amount of requested resources

Time period

hostinghosting services for VLO, VCR and LR-Switchboardindefinite if suitable SLA and costs