Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SSPN] Records in need of curation #270

Open
12 tasks done
kpsherva opened this issue Nov 27, 2024 · 1 comment
Open
12 tasks done

[SSPN] Records in need of curation #270

kpsherva opened this issue Nov 27, 2024 · 1 comment
Assignees
Milestone

Comments

@kpsherva
Copy link
Contributor

kpsherva commented Nov 27, 2024

Hello!
During migration we have found some irregularities in metadata which we would like to ask you to help us curate:

  • 2278612, File is restricted, could we handle it in the same way as the other cases and assign SSO restriction on the file explicitly?
    • Set the restriction status of bibdocid:1341638 to SSO.
  • 2876930, There is a 906__p entry with empty value
    • Deleted empty 906__p value
  • 1583277, There is a 700__u:"-" entry which needs to be cleaned as creators are required
    • The missing user was "Fantechi, Riccardo"
    • Added the missing information from another record in the system that
      they are author
  • 2779427, We miss 100__a and it is required
    • Added 100__a: Juan Pablo Garcés taken from the submitted PDF
  • 2779432, 690__a:DarkSide/ProtoDune needs to be fixed as it is not found in greybook
    • Modified DarkSide/ProtoDune to 693__e:DarkSide-20K. From the metadata of the record and our controlled values from the greybook, this seems to refer to https://greybook.cern.ch/experiment/detail?id=RE37. The record has already as keywords the values DarkSide and ProtoDune.
  • 2060710, 520__a:"-" is not a valid description and it is required
    • Copied the title to 520__a
  • 2044560, 906__p is empty but it is required
    • Removed empty value
    • Added 100__a: Saied, Ahmed and 100__0: AUTHOR|(SzGeCERN)773472 as the 859__f: [email protected]. Related record in CERN People collection.
  • 2913067, the record has field 300__a (number of pages) as the only one. is it needed as a part of metadata for the summer student notes?
    • Decided to ignore for SSPN
  • 2684720, 690__a:DarkSide/ProtoDune needs to be fixed as it is not found in greybook
    • Modified DarkSide/ProtoDune to 693__e:DarkSide-20K. From the metadata of the record and our controlled values from the greybook, this seems to refer to https://greybook.cern.ch/experiment/detail?id=RE37. The record has already as keywords the values DarkSide and ProtoDune.
  • 2636046, There is no 100 field and creators are required
    • We added the 100__a: Giovinazzo, Dena and 100__0: AUTHOR|(SzGeCERN)830332 as the user was found in https://cds.cern.ch/record/2316876?ln=en and the 859__f points to the email they had while at CERN.
  • 2283978, 6531_9:"author" should be cleaned? We strictly check the scheme to be szgecern or cern
    • We will add it as a keyword
  • 2215919, the affiliation of University College Dublin matches with multiple universities in ROR.
    • The correct one was identified: https://ror.org/05m7pjf47
    • The record was edited to uniquely identify the affiliation. Affiliation changes to University College Dublin – National University of Ireland, Dublin
@kpsherva
Copy link
Contributor Author

kpsherva commented Dec 5, 2024

I found a new record corrupted data:
https://cds.cern.ch/record/1748461/

  • 001748461 693__ $$a$$eCMS PH-CMG-PS -> a subfield missing and department is incorrect (PH-CMG-PS value does not match any vocabulary we have), it will be mapped as CMS and department PH

  • 1748423, 1749912, 1755214 - ATS is not a department, how shall we handle this value?

  • 2634714 SHINE is not a valid experiment, it will be omitted

  • 2046076 COMPASS is not a valid experiment, it will be omitted

  • 2779426 keywords are in one field, should be divided into separate entries

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: For review
Development

No branches or pull requests

2 participants