created on 2014-10-17+02:00
An initial Schematron Schema for any EAD to validate for EHRI-preprocess
Version: 0.1
this is before any EHRI preprocess. after validating according to the ead.xsd, this
establishes some rules, according to Deliverable 17.3, for ead's provided to EHRI.
It may be used as a starting point for data providers.
Version History:
0.1 (2014-10-07) : Initial rules
0.2 (2014-10-17) DEFINITIONS MSCW:ADVICE
MUST-WP19: mandatory for import process according to WP19
MUST-EAD: mandatory according to the EAD specification of LoC
SHOULD-WP17: mandatory for description process according to WP17
SHOULD-WP19: desirable for description process according to WP19
COULD: desirable for description process to WP17
- EAD header
- did's
- unitdate's
- level attribute
- type attribute
- archdesc's and c-levels
EAD header
the EAD header contains useful information, like language of description, eadid etc.
- MUST-WP19:
eadid MUST contain text
- SHOULD-WP17:
eadid SHOULD contain a mainagencycode attribute
- MUST-WP19:
eadheader MUST contain a profiledesc
- MUST-WP19:
eadheader MUST specify a langusage
- SHOULD-WP17:
eadheader SHOULD specify a creation
- COULD:
eadheader COULD have a non-empty creation-date
- SHOULD-WP17:
eadheader SHOULD specify a publisher
- SHOULD-WP17:
a revisiondesc SHOULD have a non-empty date
did's
- SHOULD-WP17:
a did SHOULD have a unitdate, according 17.3
- COULD:
unitdates COULD have a label, describing the type of date, according 17.3
- SHOULD-WP17:
a did SHOULD have a non-empty physdesc-extent, according to 17.3
- MUST-WP19:
a did MUST have a unitid, according 17.3 and WP19
- SHOULD-WP17:
a did SHOULD have a unittitle, according 17.3
- MUST-WP19:
a did MUST have at least one non-empty unittitle
- MUST-WP19:
unitid's MUST be unique within one eadfile, according 17.3
- SHOULD-WP17:
a unitid SHOULD not be empty
unitdate's
the xsd imposes a regexp on the 'normal' attribute. the unitdate itself however is
free-text.
- SHOULD-WP17:
unitdate SHOULD be non-empty or have a non-empty @normal attribute
- SHOULD-WP19:
unitdate SHOULD have a non-empty @normal attribute
level attribute
- MUST-EAD:
archdesc/c-level with @level 'otherlevel' MUST have an @otherlevel attribute describing
the level
- SHOULD-WP19:
ONLY the archdesc can be fonds level
- SHOULD-WP19:
recordgrp SHOULD be a child of another recordgrp
- SHOULD-WP19:
subgrp SHOULD be a child of another subgrp or a recordgrp
- SHOULD-WP19:
subseries SHOULD be a child of another subseries or a series
type attribute
TYPE – required? (according to the LoC DTD information, this is not required, however
they state in their text that it is required.)
- MUST-EAD:
dsc MUST have a @type attribute
- MUST-EAD:
dsc with @type 'othertype' MUST have an @othertype attribute describing the type
archdesc's and c-levels
- MUST-WP19:
archdesc MUST have a level-attribute
- COULD:
archdesc COULD have a non-empty origination
- SHOULD-WP17:
archdesc-processinfo SHOULD have a date
- SHOULD-WP17:
archdesc-level SHOULD be 'fonds', 'recordgrp', 'collection' or 'otherlevel'
- COULD:
archdesc COULD have a langmaterial
- COULD:
archdesc COULD have a custodhist
- COULD:
archdesc COULD have an otherfindaid
- COULD:
archdesc COULD have an originalsloc
- COULD:
archdesc COULD have an altformavail
- COULD:
archdesc COULD have a bibliography
- COULD:
archdesc COULD have an odd
- COULD:
archdesc COULD have a note
- COULD:
archdesc COULD have a scopecontent
- COULD:
archdesc COULD have a controlaccess
- COULD:
controlaccess COULD have a subject
- COULD:
controlaccess COULD have a place
- COULD:
controlaccess COULD have a persname
- COULD:
controlaccess COULD have an orgname
- SHOULD-WP17:
archdesc SHOULD have a non-empty processinfo
- SHOULD-WP17:
archdesc SHOULD have a non-empty processinfo-date
- MUST-WP19:
language MUST contain a langcode attribute
- SHOULD-WP17:
language SHOULD contain a scriptcode attribute
- MUST-WP19:
c01 MUST have a level-attribute
- MUST-WP19:
c02 MUST have a level-attribute
- MUST-WP19:
c03 MUST have a level-attribute
- MUST-WP19:
c04 MUST have a level-attribute
- MUST-WP19:
c05 MUST have a level-attribute
- MUST-WP19:
c06 MUST have a level-attribute