-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Enforce Namespaced UUID and check if landing page is present and correct-ish #124
Comments
This is the epic, so just the collection of issues, but yeah the name is maybe misleading |
ah I see |
We should have namespace uri prefix for the different environments. I have tested this with solr-drupal and it is working on the portals. So the uri namespace and landingpage mapping should be as follows:
this apply for both metadata_identifier and related_dataset |
so for having the right dataset landing page url in dmci the parameter added to the config is
for the name space we need something similar, like a
in the config... does this sound ok or does anyone see a better way? |
MMD files that are sent to dmci now have no.met: in the metadata_identifier. |
@magnarem I think we can fix that in dmci but see my question in the chat to see if I got it right |
It should be enough to update the ids of the parents with the namespace included. This can be done manually. But it means we also need to update all the MMD files that refer to the parents. DMCI should reject all MMD files without namespaced ids, as far as I have understood. |
we just need to merge this now https://gitlab.met.no/tjenester/s-enda/-/merge_requests/2758/ |
this Epic, at least with the design choices made so far (i.e. doing most of the stuff in dmci), has been completed, as far as I can tell |
@magnarem - is this required by solr? In principle, I think we should just naively use what we get. It shouldn't matter where the parent is present. In principle, it could also be on another "data center", couldn't it? |
@mortenwh - it will be required by both solr and csw. We have now defined that the metadata_identifier element in the MMD-file have identifiers on the form naming_authority:uuid. Since child-datasets have a related_dataset element witch have an identifier pointing to a parent, this id will also need to be on the form naming_authority:uuid, or else the children will point to a non-existing parent. The parent-child relationship in csw is as far as I know are using standard database relations, so the identifiers will have to match. For solr it is required for the search interface to know about parent-child relationships. In solr which is not a relational database, we have two flags that help solr figure out if a dataset is a parent, and then to know the children. example: a parent dataset in solr will get a flag isParent=true. Then when searching if a dataset have isParent=true, then solr looks for all other datasets in the index which have a related_dataset id which points to the metadata_identifier of the given parent. So the parent dataset does not know about its children, but the children knows which are the parent. So if we have a parent dataset with metadata_identifier=no.met:1234abcd-aa33-ffaa-aaff-fffffffff, and then we get a new dataset with metadata_identfier=no.niva:uuuid, that will point to the parent, then this child need to have related_dataset=no.met:1234abcd-aa33-ffaa-aaff-fffffffff. I have not come across parent/child relationships that comes from different "data centers" /"naming_authorities" that have a parent-child relationships across datacenters/naming_authotities in ADC at least, both internal and external datasets. Maybe Lara or Øystein can answer that. |
Ok. Then we need another issue under this epic. I'll create it now, then it would be good if you could review if I understand correctly :) |
No description provided.
The text was updated successfully, but these errors were encountered: