Skip to content

kuleuven/iRODS-Dataverse

Repository files navigation

iRODS-Dataverse

This is an implementation for programmatic publication of data from iRODS into a Dataverse installation.

Prerequisites

  1. Being an iRODS user with data in an iRODS zone.
  2. Authenticate to Dataverse:
  • Sign up with individual account
  • Get the API Token which is valid for a certain amount of time (demo: one year)
  1. Run the scripts with internet access

Set up the virtual environment

python -m venv venv
source venv/bin/activate
pip install -r requirements.txt

When finished, deactivate the virtual environment

deactivate

User script

After the virtual environment is activated run:

python src/userScript.py

This will trigger an interactive terminal that will take you through the following steps:

  1. Authentication to iRODS - this happens automatically by reading your local "irods_environment.json".

  2. Identify the data object(s) to send to Dataverse. First it will search for data objects with the metadata field dv.publication: initiated. If none exist, you can input the absolute path(s) of the data object(s) to be sent to Dataverse.

  3. Identify a Dataverse installation. First it will go through the selected data object(s) and retrieve the metadata field dv.installation. If it is not valid or missing, you can input it from a selection.

  4. Authenticate to the Dataverse installation. For this, you need to input your API Token.

  5. Gather the metadata needed to create a draft in the selected Dataverse installation. ManGO users can use a metadata schema to add the metadata to any object of the list. Alternatively, users can copy the metadata template and fill it in, or create a shorter JSON file with the minimal metadata. For example, the text below shows the contents of the short JSON file, with metadata for the Demo installation:

{
    "author": {
        "authorAffiliation": "My university",
        "authorName": "Surname, Given Name"
    },
    "datasetContact": {
        "datasetContactEmail": "username@domain.edu",
        "datasetContactName": "Surname, Given Name"
    },
    "dsDescription": [
        {
            "dsDescriptionValue": "This is the first dataset I send from iRODS"
        }
    ],
    "subject": [
        "Demo Only"
    ],
    "title": "My dataset"
}

For RDR, the short JSON file would have, for example, the following contents:

{
    "access": {
        "accessRights": "open",
        "dateAvailable": "",
        "legitimateOptout": "other"
    },
    "author": [
        {
            "authorAffiliation": "My university",
            "authorName": "Surname, Given Name"
        }
    ],
    "datasetContact": [
        {
            "datasetContactEmail": "username@domain.edu",
            "datasetContactName": "Surname, Given Name"
        }
    ],
    "dsDescription": [
        {
            "dsDescriptionValue": "This is the first dataset I send from iRODS"
        }
    ],
    "keyword": [
        {
            "keywordValue": "required-keyword"
        }
    ],
    "technicalFormat": "json",
    "title": "My dataset"
}

To work with the short JSON file, copy the text above and adapt the values into a text file.

For the RDR long template, when the access rights are open, omit the fields regarding available date and legitimate opt-out.

For either the short JSON or the long JSON matching the template, provide the path to that file to complete this step.

  1. Validate the metadata.

  2. Deposit the draft with its metadata in the selected Dataverse installation.

  3. Update the metadata of the data objects with the DOI provided by Dataverse.

Visual overview of the available pipeline options

overview-pipeline-options

Configuring another installation

If you want to configure this script to work with other Dataverse installations, look at the custom classes or contact us.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages