-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy path00000015.xml
1815 lines (1782 loc) · 103 KB
/
00000015.xml
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="echoProfile2html.xsl" ?>
<!--
********************************************************************************
$Header: /ECHODEP/METS/BaseMETSProfile.xml 7 11/28/06 1:28p Thabing $
Authors/Contributors/Reviewers:
Janet Eke (UIUC)
Jay Goodkin (OCLC)
Jerome McDonough (UIUC)
Leah Houser (OCLC)
Morgan Cundiff (LOC)
Rebecca Guenther (LOC)
Richard Urban (UIUC)
Shifra Raffel (UCOP)
Shweta Rani (OCLC)
Tom Habing (UIUC)
Tracy Seneca (UCOP)
**********************************************************************************
-->
<METS_Profile xmlns="http://www.loc.gov/METS_Profile/"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://www.loc.gov/METS_Profile/
http://www.loc.gov/standards/mets/profile_docs/mets.profile.v1-2.xsd">
<URI LOCTYPE="URL">http://www.loc.gov/mets/profiles/00000015.xml</URI>
<title> ECHO Dep Generic METS Profile for Preservation and Digital Repository Interoperability</title>
<abstract> The primary focus of this profile is to enable repository interoperability and
digital preservation of repository content. Because of the strong
focus on preservation and not access, this profile is relatively noncommittal regarding file
formats or structures. However, special attention has been given to administrative and
technical metadata, particularly on integrating the PREMIS data model and schema into METS.
Because this profile is agnostic regarding file formats or structures, it is anticipated that this profile may be
overlaid on top of or inherited from other profiles which better define these aspects, but require the preservation
or interoperability support that this profile can support. As an example refer to the Related Profiles section below.
</abstract>
<date>2005-11-27T00:00:00</date>
<contact>
<name>Thomas G. Habing</name>
<institution> University of Illinois at Urbana-Champaign, Grainger Engineering Library
Information Center </institution>
<address>1301 W. Springfield Ave., Urbana, IL, 61801</address>
<phone>(217) 244-4425</phone>
<email>[email protected]</email>
</contact>
<related_profile ID="RELATED_PROFILES" RELATIONSHIP="Inherited by" URI="WebMETSProfile.xml">ECHO Dep METS Profile for Web Site Captures</related_profile>
<extension_schema>
<name>Metadata Object Description Schema (MODS)</name>
<URI>http://www.loc.gov/standards/mods/</URI>
<context>MODS descriptive metadata occur embedded in the dmdSec element. The METS package as
a whole must have one primary MODS dmdSec describing the entire package.</context>
</extension_schema>
<extension_schema>
<name>Digital Library Federation / Aquifer Implementation Guidelines for Shareable MODS Records</name>
<URI>http://www.diglib.org/aquifer/dlfmodsimplementationguidelines_finalnov2006.pdf</URI>
<context>The Primary descriptive MODS metadata record must conform to these guidelines.
In this case conformance means that all requirements of the Aquifer guidelines which are listed
as "REQUIRED" or "REQUIRED IF APPLICABLE" must be adhered to by METS files confromant to this
profile.</context>
</extension_schema>
<extension_schema>
<name>PREMIS Preservation Metadata Schema: Object</name>
<URI>http://www.loc.gov/standards/premis/v1/Object-v1-1.xsd</URI>
<context>PREMIS Object elements occur embedded in the techMD section. Each file or bitstream
which occurs in the file section must have a corresponding PREMIS Object in a techMD
section. METS sections which correspond to PREMIS representations, such as structMap
element or the METS file overall, may also have an associated PREMIS object
element.</context>
</extension_schema>
<extension_schema>
<name>PREMIS Preservation Metadata Schema: Event</name>
<URI>http://www.loc.gov/standards/premis/v1/Event-v1-1.xsd</URI>
<context>PREMIS Event elements occur embedded in the digiprovMD section. PREMIS Events can
be associated with nearly any other METS sections, including descriptive metadata,
files, structural maps, and possibly others. See the Structural Requirements sections
below for details.</context>
</extension_schema>
<extension_schema>
<name>PREMIS Preservation Metadata Schema: Agent</name>
<URI>http://www.loc.gov/standards/premis/v1/Agent-v1-1.xsd</URI>
<context>PREMIS Agent elements occur embedded in either the digiprovMD section or rightsMD
sections depending on whether the Agent is associated with PREMIS Events or PREMIS
Rights.</context>
</extension_schema>
<extension_schema>
<name>PREMIS Preservation Metadata Schema: Rights</name>
<URI>http://www.loc.gov/standards/premis/v1/Rights-v1-1.xsd</URI>
<context>PREMIS Rights elements occur embedded in the rightsMD section. Rights can be
expressed for files or bitstreams or for descriptive metadata.</context>
</extension_schema>
<extension_schema>
<name>Text Metadata Schema</name>
<URI>http://dlib.nyu.edu/METS/textmd.xsd</URI>
<context>Any textMD element may be embedded within the administrative techMD
element.</context>
</extension_schema>
<extension_schema>
<name>NISO Data Dictionary: Technical Metadata for Digital Still Images (MIX)</name>
<URI>http://www.loc.gov/standards/mix/mix.xsd</URI>
<context>Any MIX element may be embedded within the administrative techMD element.</context>
</extension_schema>
<extension_schema>
<name>VIDEOMD: Video Technical Metadata Extension Schema</name>
<URI>http://lcweb2.loc.gov/mets/Schemas/VMD.xsd</URI>
<context>The VIDEOMD element may be embedded within the administrative techMD
element. The VIDEOMD element should include the file_data and physical_data elements,
but may include other child elements as well.
</context>
</extension_schema>
<extension_schema>
<name>Audio Technical Metadata Extension Schema </name>
<URI>http://lcweb2.loc.gov/mets/Schemas/AMD.xsd</URI>
<context>The AUDIOMD element may be embedded within the administrative techMD
element. The AUDIOMD element should include the file_data and physical_data elements,
but may include other child elements as well.
</context>
</extension_schema>
<extension_schema>
<name>JHOVE XML Handler Output Schema</name>
<URI>http://hul.harvard.edu/ois/xml/xsd/jhove/jhove.xsd</URI>
<context>JHoVE XML output may be included within the administrative techMD element.</context>
<note>For details on JHOVE see http://hul.harvard.edu/jhove/.</note>
</extension_schema>
<description_rules>
<head>Purpose of this Profile</head>
<p> This profile is generally not concerned with rendering or making accessible any
particular representation of an object, but it is concerned with preserving the object
and its representations, including the history of how those have changed over periods of
time. In this context preservation refers to both short-term interoperability,
preserving the representations and metadata as a digital package is moved between two
different repositories, but it also refers to the long-term preservation of the package
and its history as it exists in various repositories for long periods of time and
undergoes various "preservation actions" such as fixity checks, normalizations, or
format migrations. </p>
<p> In general this profile is agnostic about almost all aspects of a digital object's
representation. We have made some pragmatic concessions, such as mandating at least MODS
for the dmdSec, but otherwise don't have many requirements for these sections. However,
this profile is very prescriptive when it comes to administrative metadata which can be
associated with almost all of the sections that make up the representation, particularly
technical and provenance metadata because we feel that those are important to
preservation. </p>
<head>Definitions/Assumptions</head>
<p> This profile draws on many of the concepts from the PREMIS data model. In particular,
METS documents that conform to this profile are assumed to be "representations" of
"intellectual entities." In addition to the overall "representation" that is the METS
document itself, there may be subordinate "representations" that correspond to specific
structural maps contained in the METS document. In general, the "representation" of an
"intellectual entity" is taken to be the combination of the structural maps (structMap),
structural map linkings (structLink), descriptive metadata (dmdSec), content files or
bitstreams (fileSec), and behaviors (behaviorSec) encapsulated by the METS document. If
a METS document contains multiple structural maps, each of those structural maps and
associated structural map linkings, descriptive metadata, content files or bitstreams,
and behaviors is treated as an alternate representation of the intellectual entity. The
administrative metadata (amdSec), including the METS header (metsHdr), is not considered
part of the actual representation; however, these sections are critical for interpreting
and utilizing the sections that are considered part of the representation. In general,
this profile treats data embedded in or referenced from the following METS elements as
comprising the representation of the object being preserved: dmdSec, fileSec, structMap,
structLink, and behaviorSec. This implies that any of these sections may have associated
administrative metadata, particularly entities from PREMIS or one of the technical
metadata standards called out in this profile. </p>
<p> The definition of intellectual entities is left for different communities of
practice, and will vary across those communities. </p>
<p> One ECHODEP METS document represents a single intellectual entity as defined by the
community of practice for which the intellectual entity applies. For example, for one
community of practice the intellectual entity of interest may be a web site, but for a
different community of practice the intellectual entity of interest may be individual
web pages. This profile can be used for either.</p>
<p> Omission of a particular METS section or attribute in this profile does not imply that
the section is prohibited by this profile. However, it does imply that the section or
attribute may be ignored by a METS processor that claims to conform to this profile. In
general, processors that act on METS documents that conform to this profile should
attempt to preserve any undefined sections or attributes during any operations performed
on the files, such as transformations, submissions, disseminations, or archiving.
However, this profile makes no guarantees regarding these undefined sections or
attributes. </p>
<head ID="DESCR_ID">METS Identifier</head>
<p> Each METS document must be assigned a persistent and globally unique identifier. The
only requirement for these identifiers is that they should conform to a widely accepted
standard identifier scheme, and they must be locally resolvable, that is the local
system must be able to retrieve its local representation of the package using only this
identifier. It is desirable that these identifiers also be globally resolvable. However,
global resolution is not defined. Acceptable identifier schemes include: OCLC Purls,
CNRI Handles, DOIs, and others. </p>
<p> If the METS document is being used as a Submission Information Package (SIP), the
identifier is optional. It is assumed that the repository to which the package is being
submitted will assign its own identifier. Even if the SIP does have its own identifier,
the repository to which it is being submitted may assign a new identifier in which case
the SIPs original identifier must be listed as an alternate identifier in the
altRecordID section of the header. </p>
<p> As this unique identifier is assumed to be the primary identifier for the METS instance,
it must be recorded in the OBJID attribute of the mets element. It is assumed that the
OBJID will be assigned at the time the METS file is completed and therefore the
CREATEDATE attribute must be the date on which the unique identifier was assigned to the
object. </p>
<p> The primary identifier and identifier scheme must also be included in a PREMIS object
element as a representation-level description of the entire set of files that are part
of the METs object. This PREMIS object element is recorded in a special techMD section
with a status of PRIMARY_REPRESENTATION. This techMD section must be referenced via the
root div element of the PRIMARY_STRUCTMAP structural map.</p>
<head>PREMIS Identifiers</head>
<p> PREMIS identifiers, namely objectIdentifier, eventIdentifier, agentIdentifier, and
permissionStatementIdentifier, may be assigned persistent and globally unique identifier
values which conform to a widely accepted standard identifier scheme, similar to the
requirement for the METS Identifier. However, there is no requirement that these
identifiers be resolvable either globally or by the local system. It is recognized that
many systems will not be capable of authoritatively tracking and assigning identifiers
to all of the PREMIS entities. Therefore, the only absolute requirement for these
identifiers within this profile is that they be consistent within a single METS
document, similar to the Rules for XML Identifiers which are described below. If a
system is capable of assigning globally unique and persistent identifiers for some
PREMIS entities, for example if it maintains an authority file for agents, it should use
those identifiers, but this is not a requirement. </p>
<p> Similar to the METS Identifier, an individual repository may reassign its own local
identifiers to certain PREMIS entities. If this occurs the original identifier of the
entity must be preserved for PREMIS object and agent entities. Both of these entities
support multiple identifiers. PREMIS rights and event entities support only a single
identifier. Therefore, identifiers which are preassigned to these entities should not be
changed. </p>
<head>Descriptive Metadata</head>
<p> This profile makes several assumptions about the descriptive metadata. 1) The most
significant assumption is that the descriptive metadata about the entity represented by
the METS document are part of the representation of the intellectual entity and not just
metadata about the representation. 2) It is also assumed that the descriptive metadata
standards supported by different repositories will vary and that preservation of
descriptive metadata can help establish the provenance of the METS object. 3) We also
assume that the descriptive metadata will change during the life-cycle of the object. </p>
<p> Because of the above assumptions, this profile supports multiple descriptive metadata
sections, as well as provenance metadata about each of those sections. However, to
facilitate interoperability, this profile requires one primary MODS metadata section. </p>
<p> A potential usage scenario might be a digital object whose original source descriptive
metadata is in the MARCXML format. Because this profile requires MODS as the primary
descriptive metadata, the MARCXML will be transformed into MODS, and the MODS will be
stored in the METS document along with a provenance statement with some details about
the transformation, especially identifying the source metadata format. However, because
descriptive metadata are considered to be a significant part of the representation of
an entity and because transformations between metadata formats are often imperfect, the
original MARCXML format is also stored in the METS document as an alternate metadata
format. Now assume that the digital object is to be ingested into DSpace. However,
DSpace does not have native support for the MODS metadata format; therefore, as part of
the ingest process the MODS must be transformed into the idiosyncratic Dublin Core
metadata format that is supported by DSpace. This metadata format is also added as
another alternate descriptive metadata format to the METS documents, along with a
provenance statement describing how this new DC format was derived from the primary MODS
format. Now imagine that the object exists in DSpace for some period of time during
which the descriptive metadata undergoes some revision, such as the addition of new
subject terms or the addition of an abstract. Also imagine that the object is being
disseminated from DSpace for ingest into some new repository. This could trigger the
addition of another alternate descriptive metadata section to the METS document. This
alternate format would conform to the idiosyncratic DSpace Dublin Core format, but the
provenance statement would specify that this DC format represents a newer version of the
descriptive metadata than was originally ingested into DSpace. The above scenario would
produce a chain of descriptive metadata formats, such as MARCXML (original) -> MODS
(primary) -> DC (version 1) -> DC (version 2), with the provenance statements adequate
to determine the sequence of events that led to this chain. As part of this profile we
also envision future processes that might reconcile later metadata revisions and merge
those revisions back into a new primary MODS descriptive metadata section. </p>
<p> It must be noted that even though this profile can accommodate multiple different
versions of the same descriptive metadata, it does not mandate that every change to the
descriptive metadata must be preserved as an alternate version. The tools that we are
developing (the Hub and Spoke) will take advantage of this feature especially for the
descriptive metadata which our tool may transform in significant ways. Just as you would
probably not delete a source content file after doing a format migration to a more
'preservable' format (say from TIF to JPEG2000), we also assume that you will not delete
descriptive metadata or a structural map just because you have migrated to a new format
or version (say MARC to MODS or even from MODS to MODS with a new revised abstract). </p>
<p> It will be up to a particular system developer and/or collection curator as to what
extent to implement this. In general, you would not want to preserve an old version of
the descriptive metadata every time a new subject term was added or a spelling mistake
was corrected, but if the descriptive metadata was substantially rewritten because of
new scholarly discoveries about the object, this probably would require preserving the
old metadata along with a provenance statement describing the rational for the changes
in the new metadata. Likewise, a transformation into an entirely new metadata format
should require a new alternate metadata version with the original also preserved for
posterity.</p>
<p> Part of the assumptions of this profile is that the objects being packaged are in some
sort of "preservation state" -- a deliberate decision was made by some agent to put the
object into that state, and the object is not expected to be undergoing frequent
changes. </p>
<p> For more details see the structural requirements section for descriptive metadata below. </p>
<head ID="STRUCT_MAP">Structural Map </head>
<p> This profile also makes assumptions about the structural maps which are similar to the
assumptions about descriptive metadata. 1) The structural maps of the object represented
by the METS document are also considered a significant parts of the representation
itself. 2) In addition, the level of support for complex structural maps can vary
greatly across different repositories, meaning that complex structural maps will often
need to be transformed or simplified in order to ingest the object into a specific
repository. 3) To a lesser extent, we also assume that structural maps can change over
time. </p>
<p> Just as with the descriptive metadata, this implies that this profile must support
multiple structural maps along with provenance metadata about each of those structural
maps. Unlike the descriptive metadata, we cannot specify a single interoperable standard
that a primary structural map must follow. This is because there are currently no
standards for structural maps which are rich enough to represent all possible structures
to which an object might conform. Because of this,and also because access and rendering
are not a primary focus for this profile, we are deliberately vague with respect to
structural maps (Other profiles which inherit from this profile may be more prescriptive with regard to
structural maps. See the Related Profiles section above for an example.). In general we cannot be any more
prescriptive about structural maps than we can be for content files. </p>
<p> As an example, if a digital object representation requires the Indiana METS Navigator
that would be the structural map it should use; alternately it could require the Harvard
Page Delivery Service (PDS), or it might have a separate structural map for each
application. At some future point a new and better page turning application may emerge
that requires a new structural map. Now the METS document would contain both the old
structural map and the new with appropriate technical and provenance metadata, so that a
future curator could make informed decisions about the object and its possible
representations. </p>
<p> However, this profile does still specify that there must be one structural map which is
considered the primary structural map for the object. This primary structural map is
defined to be that structure that provides the best representation for the digital
object as determined by the original creator of the structural map. If the METS document
contains multiple structural maps, a processing system may choose amongst the various
structural maps that best meets its needs; however, preference should be given to the
primary structMap if no other clearly meets the requirements of the system. The primary
structural map should also have a reference to every file in the file section.</p>
<p> Since our profile is not so concerned with the actual structural map, but with how that
structural map can be described for preservation, thus unlike many other profiles, we
recommend that structural maps have associated administrative metadata, both technical
and provenance, to facilitate the long term preservation of the structural map,
regardless of what the structure actually is. </p>
<head>Structural Map Linking</head>
<p> Because this profile interprets structural maps as representations of intellectual
entities, and multiple structural maps in the same METS document as alternate
representations of the same entity, we have imposed a restriction that structural map
linking must only link divisions within a single structural map. Using structural map
linking to link divisions across different structural maps is forbidden. This allows a
single structural map linking section to be associated with a single representation and
by extension any administrative metadata associated with a structural map is assumed to
also apply to the associated structural map linking section. </p>
<head>Rules for the XML</head>
<p> Every METS XML file conforming to this profile must be UTF-8 encoded Unicode. </p>
<p> Every METS XML file conforming to this profile must begin with the standard XML
declaration: </p>
<p> <?xml version="1.0" encoding="UTF-8"?>. </p>
<head ID="XML_IDS_RULE">Rules for XML Identifiers</head>
<p> By XML identifiers we mean any XML attribute of type ID and by extension attributes of
type IDREF or IDREFS. This includes those attributes which occur in the METS namespace
itself and any other extension schema that may be used within the METS document. This
profile makes no requirements for those types of identifiers beyond the requirements
that are already specified by the XML specification itself. All values of ID-type
attributes must be unique within the METS document. However, they may be reused in
different METS documents, i.e. they need not be globally unique. They might also be
different in different instances of the METS document for the same object, i.e. they
need not be globally persistent. In other words, attributes of type ID (also IDREF and
IDREFS) have no meaning outside the METS document. </p>
<p> Also, the syntax used for those identifiers is not specified beyond what is already
required by the XML specification. It may be convenient for systems that are creating
METS documents to adopt naming standards for ID-type attributes, such as all amdSec ID
attributes must begin with "AMD_" followed by a sequential number. However, this is not
a requirement of this profile, and systems which are processing METS documents based on
this profile should make no assumptions in this regard. </p>
<p> Finally, the schema for METS specifies that certain ID attributes are required on
certain elements, and those requirements must be followed. However, in general this
profile only requires an ID-type attribute on an element if that element is referred to
via an IDREF or IDREFS-type attribute elsewhere in the file. </p>
<head>Date Values</head>
<p>Unless otherwise specified (or constrained by the XML Schema) all date values must
conform to the W3C-DTF format with a granularity of at least days, such as YYYY-MM-DD.
Finer granularities are acceptable, such as YYYY-MM-DDTHH:MM:SS, but lesser
granularities are not acceptable, such as YYYY-MM or YYYY. Note that all date attributes
specified in the METS profile itself require the xsd:dateTime format which is
YYYY-MM-DDTHH:MM:SS with an optional time zone indicator.</p>
<head ID="LINK_VS_EMBED">Linking Versus Embedding</head>
<p> In general this profile encourages metadata to be embedded within the METS document via
the mdWrap element. However, in some cases, if there is a good reason, for example if
the metadata is too large to conveniently embed, it may be referenced via an mdRef. If
mdRef is used the xlink:href must be a relative URL which is relative to the location of
the METS document itself. The same metadata must not be both embedded and referenced.
Each metadata section must have either an mdWrap or an mdRef but not both.</p>
<p> Conversely, this profile encourages content to be referenced via the FLocat element. The
FLocat xlink:href must be a relative URL which is relative to the location of the METS
document itself. However, if embedding the content would make the package easier to move
around and share and would not cause the METS document to become too large then it may
be embedded via the FContent element. The same content may not be both embedded and
referenced. Each file element must have either an FLocat or an FContent but not
both.</p>
</description_rules>
<controlled_vocabularies>
<!-- TODO: Should all controlled vocabulary terms should be assigned URI values, possibly in the info: namespace? -->
<vocabulary>
<name>PREMIS Suggested Event Types</name>
<maintenance_agency>Library of Congress</maintenance_agency>
<URI>http://www.loc.gov/standards/premis/</URI>
<values>
<value>CAPTURE = the process whereby a repository actively obtains an object</value>
<value>COMPRESSION = the process of coding data to save storage space or
transmission time</value>
<value>DEACCESSION = the process of removing an object from the inventory of a
repository</value>
<value>DECOMPRESSION = the process of reversing the effects of compression</value>
<value>DECRYPTION = the process of converting encrypted data to plain text</value>
<value>DELETION = the process of removing an object from repository storage</value>
<value>DIGITAL_SIGNATURE_VALIDATION = the process of determining that a decrypted
digital signature matches an expected value</value>
<value>DISSEMINATION = the process of retrieving an object from repository storage
and making it available to users</value>
<value>FIXITY_CHECK = the process of verifying that an object has not been changed
in a given period</value>
<value>INGESTION = the process of adding objects to a preservation repository</value>
<value>MESSAGE_DIGEST CALCULATION = the process by which a message digest ("hash")
is created</value>
<value>MIGRATION = a transformation of an object creating a version in a more
contemporary format</value>
<value>NORMALIZATION = a transformation of an object creating a version more
conducive to preservation</value>
<value>REPLICATION = the process of creating a copy of an object that is, bit-wise,
identical to the original</value>
<value>VALIDATION = the process of comparing an object with a standard and noting
compliance or exceptions</value>
<value>VIRUS_CHECK = the process of scanning a file for malicious programs</value>
</values>
<context>
<p>PREMIS Suggested Event Types are used as values for the eventType element which
is used for the METS digiprovMD sections.</p>
</context>
</vocabulary>
<vocabulary>
<name>Event Types for Descriptive Metadata</name>
<maintenance_agency>The ECHO DEPository Project</maintenance_agency>
<URI>http://www.ndiipp.uiuc.edu/...</URI>
<values>
<value>METADATA_TRANSFORMATION = the transformation of one metadata format into
another</value>
<value>METADATA_CREATION = the creation of a new metadata record</value>
<value>METADATA_MODIFICATION = the modification of a metadata record that does not
change the format</value>
<value>METADATA_DELETION = the deletion of a metadata record </value>
</values>
<context>
<p>Event types which are used for describing actions performed on the descriptive
metadata of a package.</p>
</context>
</vocabulary>
<vocabulary>
<name>Event Types for Structural Maps</name>
<maintenance_agency>The ECHO DEPository Project</maintenance_agency>
<URI>http://www.ndiipp.uiuc.edu/...</URI>
<values>
<value>STRUCTMAP_TRANSFORMATION = changes to a structural map which effect the
compatibility with processing systems</value>
<value>STRUCTMAP_CREATION = the creation of a new structural map</value>
<value>STRUCTMAP_MODIFICATION = changes to a structural map which do not effect the
compatibility with processing systems</value>
<value>STRUCTMAP_DELETION = the deletion of a structural map</value>
</values>
<context>
<p>Event types which are used for describing actions performed on the structural map
of a package.</p>
</context>
<description>
<p> The difference between STRUCTMAP_TRANSFORMATION and STRUCTMAP_MODIFICATION is
dependent on the system which will process the structural map. Changes that do
not effect the ability of systems that could process the old version to process
this version should be given an eventType of STRUCTMAP_MODIFICATION. However,
changes that make the structural map no longer compatible with systems which
were able to process the original structural map should be given an eventType of
STRUCTMAP_TRANSFORMATION. </p>
</description>
</vocabulary>
<vocabulary>
<name>PREMIS Suggested Object Categories</name>
<maintenance_agency>Library of Congress</maintenance_agency>
<URI>http://www.loc.gov/standards/premis/</URI>
<values>
<value>REPRESENTATION</value>
<value>FILE</value>
<value>BITSTREAM</value>
</values>
<context>
<p>PREMIS Suggested Object Categories are used as values for the objectCategory
element which is used for the METS techMD sections.</p>
</context>
</vocabulary>
<vocabulary>
<name>PREMIS Suggested Agent Types</name>
<maintenance_agency>Library of Congress</maintenance_agency>
<URI>http://www.loc.gov/standards/premis/</URI>
<values>
<value>PERSON</value>
<value>ORGANIZATION</value>
<value>SOFTWARE</value>
</values>
<context>
<p>PREMIS Suggested Agent Types are used as values for the PREMIS agent/agentType
element which is used for either METS digiprovMD or rightsMD sections.</p>
</context>
</vocabulary>
<vocabulary>
<name>Descriptive Metadata Status</name>
<values>
<value>PRIMARY_DMDSEC</value>
<value>ALTERNATE_DMDSEC</value>
</values>
<context>
<p>Used with the STATUS attribute of dmdSec elements. There must be only one dmdSec
element with a STATUS of PRIMARY_DMDSEC, but there can be any number of dmdSec elements
with a STATUS of ALTERNATE_DMDSEC.</p>
</context>
</vocabulary>
<vocabulary>
<name>Structural Map Type</name>
<values>
<value>PRIMARY_STRUCTMAP</value>
</values>
<context>
<p>Used with the TYPE attribute of structMap element to indicate that this structMap
should be considered the primary structural map for the METS file. There must be
only one structMap element with a TYPE of PRIMARY_STRUCTMAP.</p>
</context>
</vocabulary>
<vocabulary ID="PRIMARY_REPRESENTATION">
<name>Technical Metadata Status</name>
<values>
<value>PRIMARY_REPRESENTATION</value>
</values>
<context>
<p>Used with the STATUS attribute of techMD elements. Used to indicate that a techMD
section contains a PREMIS object element which describes the entire METS
document as a whole. There should only be one techMD with this status, and it
must contain a PREMIS object element which describes a representation.</p>
</context>
</vocabulary>
<vocabulary>
<name>PREMIS Identifier Types</name>
<values>
<value>LOCAL</value>
<value>ARK</value>
<value>URN</value>
<value>URL</value>
<value>PURL</value>
<value>HANDLE</value>
<value>DOI </value>
</values>
<context>
<p>These values should be used for any PREMIS *IdentifierType. The value LOCAL
should be used whenever the corresponding *IdentifierValue contains an
identifier which is local to just this METS file, usually this value will be the
same as the XML ID attribute value. These values do not constitute an exhaustive
list, but they correspond to the METS LOCTYPE attribute values and should be
used whenever applicable. The value OTHER should not be used; instead just name
the other type.</p>
</context>
</vocabulary>
<vocabulary>
<name>PREMIS linkingAgentRole Values</name>
<values>
<value>EVENT_INITIATOR = the agent who requested or initiated the event</value>
<value>SOFTWARE_USED = the software system used to carry out the event</value>
<value>DATA_SOURCE = the source of the data or metadata used by the event</value>
<value>DATA_DESTINATION = where the data or metadata resides after the event
completes</value>
</values>
<context>
<p>These values do not constitute an exhaustive list but should be used when they
apply. Other values may be used if a value given above does not seem to
apply.</p>
</context>
</vocabulary>
</controlled_vocabularies>
<structural_requirements>
<metsRootElement>
<requirement ID="ROOT_OBJID" RELATEDMAT="DESCR_ID APP1_METS_SAMPLE_1">
<head>OBJID</head>
<p> As previously described, the OBJID attribute must be the primary, persistent,
and globally unique identifier for the file. This attribute is required; all
METS files which are conformant to this profile must have a persistent and
globally unique identifier, unless they are Submission Information Packages that
will be assigned an identifier upon ingestion. </p>
<p> Computing systems which process files conformant to this profile must preserve
this identifier through any transformations, submissions, disseminations,
archiving, or other operations on the file. If a system does reassign a new
primary identifier to the METS document, the old identifier must be listed as an
altRecordID in the metsHdr. The alternate identifiers must also be recorded in
the PRIMARY_REPRESENTATION techMD section.</p>
</requirement>
<requirement>
<head>LABEL</head>
<p> The LABEL attribute must contain a human-readable title string that would be
suitable for distinguishing the METS document from others when a person is
browsing through a list of METS documents. This value could be a title element
or variation thereof taken from the dmdSec. This attribute is required. </p>
</requirement>
<requirement>
<head>PROFILE</head>
<p> The PROFILE attribute must contain the URI assigned to this profile when it was
registered with the Library or Congress. The value is 'http://www.loc.gov/mets/profiles/00000015.xml'. This attribute is
required. </p>
</requirement>
</metsRootElement>
<metsHdr>
<requirement RELATEDMAT="DESCR_ID APP1_METS_SAMPLE_2">
<head>CREATEDATE</head>
<p> This date must be the date on which the METS document was assigned its first
persistent, globally unique identifier, even if the actual file itself was
created prior to this date, or even if the file was not yet created when the
identifier was coined. This attribute is required. Once it is assigned it must
not be changed, even if the OBJID is changed. </p>
<p> Computing systems which process files conformant to this profile must preserve
this date through any transformations, submissions, disseminations, archiving,
or other operations on the file. </p>
</requirement>
<requirement>
<head>LASTMODDATE</head>
<p> This date must reflect the date on which any changes to the METS document itself
occur. This attribute is required, even for newly created METS documents, in
which case the CREATEDATE and the LASTMODDATE should match. Subsequently, the
LASTMODDATE must always be after the CREATEDATE. </p>
</requirement>
<requirement RELATEDMAT="DESCR_ID ROOT_OBJID">
<head>altRecordID</head>
<p> There can be altRecordID elements. If the primary identifier of the METS
document as recorded in the OBJID element of the mets root element ever changes
for any reason, the previous identifier must be recorded as an alternate
identifier in the altRecordID element. </p>
<p> Computing systems which process files conformant to this profile must preserve
all altRecordID elements through any transformations, submissions,
disseminations, archiving, or other operations on the file. </p>
</requirement>
</metsHdr>
<dmdSec>
<requirement RELATEDMAT="DMD_DIGIPROV LINK_VS_EMBED">
<head>All Descriptive Metadata</head>
<p> All descriptive metadata should be embedded in the dmdSec via the mdWrap
element, but may be referenced via the mdRef if it is too large to conveniently
embed. The same metadata must not be both embedded and referenced.</p>
<p> Each dmdSec, including the primary MODS dmdSec, must have a CREATED attribute
which reflects the date on which description was created. </p>
<p> Each dmdSec, including the primary one, must have an ADMID attribute that
references a digiprovMD section. See the requirements for the amdSec for
details. Additional administrative metadata sections may be referenced, but the
digiprovMD reference is required. </p>
<p> The STATUS attributes defined by this profile are 'PRIMARY_DMDSEC' or 'ALTERNATE_DMDSEC'.
There must be one and only one dmdSec with a STATUS='PRIMARY_DMDSEC'. There may be zero
or more dmdSec elements with a STATUS='ALTERNATE_DMDSEC'.</p>
<p> All dmdSec elements with a STATUS of PRIMARY_DMDSEC or ALTERNATE_DMDSEC must be considered
descriptive of the entire intellectual entity represented by the METS document.
The first div child in all structMap elements must have a DMDID attribute that
references all the PRIMARY_DMDSEC and ALTERNATE_DMDSEC dmdSec elements.</p>
<p> Computing systems which process files conformant to this profile must preserve
the PRIMARY_DMDSEC and ALTERNATE_DMDSEC dmdSec through any transformations, submissions,
disseminations, archiving, or other operations on the file. Although, the
PRIMARY_DMDSEC dmdSec may be modified so long as previous states are preserved in
alternate dmdSec elements along with appropriate provenance metadata.</p>
<p> There may be other dmdSec elements with STATUS values other than PRIMARY_DMDSEC or
ALTERNATE_DMDSEC; however, these are undefined by this profile. Computing systems which
process files conformant to this profile should preserve these dmdSec through
any transformations, submissions, disseminations, archiving, or other operations
on the file, but this behavior is not guaranteed. </p>
</requirement>
<requirement>
<head>Primary Descriptive Metadata</head>
<p> Every METS document must contain exactly one dmdSec with a STATUS attribute of 'PRIMARY_DMDSEC'. This
primary dmdSec must have an embedded MODS XML
metadata record. The MODS must be embedded via mdWrap and xmlData elements. The
primary MODS metadata must not be referenced via an mdRef.</p>
<p> The primary dmdSec describes the entire intellectual entity represented by the
METS document.</p>
<p> The primary MODS descriptive metadata must
conform to the Digital Library Federation / Aquifer Implementation Guidelines for Shareable MODS Records.
[http://www.diglib.org/aquifer/dlfmodsimplementationguidelines_finalnov2006.pdf]. All requirements
in the Aquifer guidelines listed as "REQUIRED" or "REQUIRED IF APPLICABLE" must be adhered to by
METS files confromant to this profile.</p>
<p> Computing systems which process files conformant to this profile must be able to
process the MODS descriptive metadata in the primary dmdSec. The primary dmdSec
should be a processing system's first source for any required descriptive
metadata about the object. However, if a processing system is able to determine
that for its purposes more suitable descriptive metadata is available from one
of the alternate dmdSec elements it may use that instead. </p>
<p> The primary descriptive metadata about an object is expected to change over time
as objects are submitted, subsequently transformed, or disseminated by different
repositories. However, anytime that the primary descriptive metadata for a
repository is changed or replaced with new data, the original primary dmdSec
should be preserved and its STATUS changed to 'ALTERNATE_DMDSEC', and the
newer version should be given a STATUS of 'PRIMARY_DMDSEC'. In other words,
at any one time there must be exactly one primary dmdSec element with embedded MODS. However,
there may be any number of alternate dmdSec elements with embedded MODS which may represent
earlier versions of the descriptive metadata. See the following section on Alternate Descriptive Metadata
for more details on alternate dmdSec elements.
</p>
<p> Any time the primary
dmdSec is modified the referencing digiprovMD must be modified to describe the
the nature of the changes.</p>
<p> If the primary object being represented by this METS document contains
constituent parts which also must have some level of descriptive metadata, the
MODS element should contain relatedItem elements for each subpart. Each
relatedItem element must have an ID attribute and a type="constituent"
attribute. The relatedItem element must be referenced via a DMDID attribute on
the associated structMap div elements which represent the constituent part. A
typical use for relatedItem elements might be a web capture where the individual
resources making up the web capture each have there own descriptive metadata,
such as the title of the web page. </p>
</requirement>
<requirement>
<head>Alternate Descriptive Metadata</head>
<p> A METS document may contain one or more alternate dmdSec. These should be
embedded via mdWrap, but may be referenced via mdRef.</p>
<p> The STATUS attribute of the alternate dmdSec must be 'ALTERNATE_DMDSEC'. </p>
<p> Alternate dmdSec describe the entire intellectual entity represented by the METS
document.</p>
<p> Alternate dmdSec are intended primarily to provide an historical record of the
descriptive metadata for an object. This includes historical versions of the
primary MODS metadata, and also historical versions of alternate metadata
formats that might exist for the object at different points in time. For
example, if the object was originally described using MARCXML, that MARCXML can
be transformed into MODS which becomes the primary dmdSec, but the original
MARCXML can be preserved in an alternate dmdSec. If the METS document is
subsequently ingested and later disseminated from a repository whose native
metadata format is Dublin Core, the Dublin Core metadata would be added to the
METS document as another alternate dmdSec.</p>
<p> Because of how the alternate dmdSec are intended to provide historical versions
of the descriptive metadata, computing systems which process files conformant to
this profile must preserve the alternate dmdSec through any transformations,
submissions, disseminations, archiving, or other operations on the file.</p>
</requirement>
</dmdSec>
<amdSec>
<requirement>
<head>General Requirements for the Organization of Administrative Metadata</head>
<p> In general this profile assumes a single amdSec element for the entire METS
document. However, this is not an absolute requirement. How techMD, digiprovMD,
sourceMD, and rightsMD elements are arranged beneath one or more admSec elements
is not significant and can be at the convenience of the original creator of the
file so long as the result conforms to the METS XML schema.</p>
<p> The reason that the arrangement of sub-elements beneath one or more admSec
elements is not significant is that this profile requires all ADMID attributes
(of type IDREFS) to point directly to techMD, digiprovMD, sourceMD, or rightsMD
elements via those elements' ID attributes. An ADMID attribute must not point to
an enclosing admSec, but directly to the relevant sub-element. In addition, if
there are relationships between the sub-elements themselves, such as a
digiprovMD that contains a PREMIS event and another digiprovMD that contains a
PREMIS agent associated with that event, the relationships between those
sub-elements must be explicitly defined via identifier and referencing
mechanisms explicit in the markup itself, such as the PREMIS event
linkingAgentIdentifier and corresponding PREMIS agent agentIdentifier. These
relationships cannot be implied merely because of the nesting or sequence of
sub-elements beneath one or more amdSec elements. These linking mechanisms will
be more fully described in separate sections below. </p>
</requirement>
<requirement>
<head>General Requirements for the Use of PREMIS</head>
<p> PREMIS object, event, agent, and rights elements may all be used in various
different amdSec sub-elements. However, the PREMIS container element <premis> is
never used. Furthermore, only a single PREMIS object, event, agent, or rights
element may ever be embedded within a single amdSec sub-element. For example,
a techMD may only contain a single object element; it may not contain multiple object elements or
an object element and some element like mix. Even, if a single file needs to reference multiple
technical metadata sections, each of those metadata sections must be wrapped in its own
techMD section. The same applies to other administrative metadata sections.
In general,
PREMIS object elements are embedded under techMD, PREMIS event elements are
embedded under digiprovMD, PREMIS rights elements are embedded under rightsMD,
and PREMIS agent elements are embedded under either digiprovMD or rightsMD
depending on whether the agent is associated with an event or with rights. If
the same agent is associated with multiple events or rights it must occur only
once. If the same agent is associated with both an event and a rights element,
it may occur under either the digiprovMD or rightsMD section; it does not matter
which. </p>
<p> The linkage between a METS section, such as file, and a PREMIS entity, such as
object, is via the METS ID attribute of the amdSec sub-element, such as techMD,
containing the PREMIS entity and ADMID attribute of the section which is linking
to the PREMIS entity. For example, a file element which references a PREMIS
object entity would have a ADMID attribute containing the ID of the techMD
section which contains the PREMIS object entity. </p>
<p> The linkages between related PREMIS entities, such as events and agents or
rights and agents, must be via the PREMIS IDREF-type (LinkAgentXmlID,
GrantAgentXmlID, etc.) attributes and the METS ID of the amdSec sub-element
which contains the agent. Specifically, the relationship between an event and
the associated agents is made by assigning the value of the ID attribute of the
amdSec sub-element containing the agent to the LinkAgentXmlID attribute of the
event, and similarly for rights and agents. There are several reasons that these
linkages are made via the ID-type and IDREF-type attributes instead of between
the identifier elements (objectIdentifier, eventIdentifier, agentIdentifier, and
permissionStatementIdentifier) of the various PREMIS entities. One reason is for
consistency with the METS way of linking items which relies on ID-type and
IDREFS-type attributes. A second reason is that the PREMIS object and agent
entities may have more than one identifier which could complicate the processing
of the associations. A third reason is that by using the ID-type and IDREF-type
attributes you can rely on the XML validation mechanisms to ensure that the
usage of the ID-type and IDREF-type attributes are consistent. An XML validator
can ensure that each ID-type attribute used within a single XML document is
unique, and it can also ensure that each IDREF-type or IDREFS-type attribute
value has a corresponding ID-type attribute somewhere in the same XML document.
</p>
</requirement>
<requirement>
<head>Technical Metadata (techMD)</head>
<p> This profile supports several different types of technical metadata. PREMIS
object elements with an objectCategory of FILE or BITSTREAM should be associated
with file elements in the fileSec. PREMIS object elements with an objectCategory
of REPRESENTATION may be associated with div elements in a structMap. In
addition to PREMIS object elements, various format-specific technical metadata
schema have been adopted for different formats of files or bitstreams. These
should be associated with file elements in the fileSec, depending on the format
of the file. Finally, the XML output of JHOVE may be associated with file
elements in the fileSec. See the following requirements for more detail. </p>
</requirement>
<requirement>
<head> Technical Metadata for Files and Bitstreams </head>
<p> Each file or bitstream which is referenced or embedded in the fileSec must have
a corresponding techMD section. </p>
<p> The content of the techMD section must be a PREMIS object element embedded in
the techMD via the mdWrap element. The PREMIS object element must conform to the
XML Schema of PREMIS. </p>
<p> The PREMIS objectIdentifier elements must have values that correspond to the
OWNERID attribute of the referencing file element. </p>
<p> The PREMIS objectCategory element must have a value of 'FILE' or 'BITSTREAM'. </p>
<p> One PREMIS objectCharacteristics element is required with a compositionLevel
element value of 0. Composed objects must be decomposed prior to being included
in packages conforming to this profile. Therefore, compositionLevel elements
other than 0 are prohibited. Note that in keeping with recommendations from
PREMIS both ARC files and documents contained in ARC files are all described at
compositionLevel 0.</p>
<p>The objectCharacteristics element must have at least one fixity sub-element which
has a messageDigestAlgorithm of 'SHA-1'. The value contained in the
messageDigest must match the value of the CHECKSUM attribute of the referencing
file element. There may be additional fixity sub-elements with different
messageDigestAlgorithm values. </p>
<p>The objectCharacteristics element must have a size sub-element. The value of this
sub-element must be a positive integer number representing the size in bytes of
the file or bitstream. This value must match the SIZE attribute of the
referencing file element. </p>
<p> The objectCharacteristics element must have a format sub-element which has a
formatDesignation child element which in turn has a formatName child element.
The value of the formatName element must be the MIME type value of the file or
bitstream. It must match the MIMETYPE attribute of the referencing file element
and follow the same rules. It is recommended that if there are format registries
describing the format of the file or bitstream that the format sub-element may
have one or more formatRegistry sub-elements referencing the appropriate
registry key. </p>
<p> A conforming METS document may contain additional PREMIS object entity elements. </p>
<p> Technical metadata for files or bitstreams must conform to the specific
technical metadata for its root MIME type specified in MIME Type technical
metadata requirements.</p>
</requirement>
<requirement>
<head>Technical Metadata for Files with a Root MIME Type of 'Text'</head>
<p> In addition to required elements specified in Technical Metadata for Files and
Bitstreams, all files with a root MIME type of 'text' should provide a second
techMD section in addition to the PREMIS section previously mentioned. This
second techMD section must wrap a textMD description of the file. </p>
<p> It is strongly recommended that text MIME types, specifically text/xml, that
conform to external schema or DTDs record schema information using PREMIS
Dependency elements. </p>
</requirement>
<requirement>
<head>Technical Metadata for Files with a Root MIME Type of 'Image'</head>
<p> In addition to required elements specified in Technical Metadata for files and
bitstreams, all files with a root MIME type of "image" should provide a second
techMD section in addition to the PREMIS section previously mentioned. This
second techMD section must wrap a MIX Schema description of the file.</p>
</requirement>
<requirement>
<head>Technical Metadata for Files with a Root MIME Type of 'Audio'</head>
<p>In addition to required elements specified in Technical Metadata for files and
bitstreams, all files with a root MIME type of 'audio' should provide a second
techMD section in addition to the PREMIS section previously mentioned. This
second techMD section must wrap a AUDIOMD element from the Library of
Congress' Audio Metadata Schema. The AUDIOMD element must contain a
file_data child element, should contain a physical_data element,
but may contain other elements as well as allowed by the XML schema.</p>
</requirement>
<requirement>
<head>Technical Metadata for Files with a Root MIME Type of 'Video'</head>
<p>In addition to required elements specified in Technical Metadata for files and
bitstreams, all files with a root MIME type of 'video' should provide a second
techMD section in addition to the PREMIS section previously mentioned. This
second techMD section must wrap a VIDEOMD element from the Library of Congress
Video Metadata Schema. The VIDEOMD element must contain a
file_data child element, and should contain a physical_data element,
but may contain other elements as well as allowed by the XML schema.</p>
</requirement>
<requirement>
<head>Technical Metadata for Files with a Root MIME Type of 'Application'</head>
<p>In addition to required elements specified in Technical Metadata for files and
bitstreams, all files with a root MIME type of 'application' must include in the
previously described PREMIS techMD section a PREMIS creatingApplication,
software and environment elements.</p>
</requirement>
<requirement>
<head>JHOVE</head>
<p> As an alternative or in addition to the technical metadata described above which
is specific to a file format, the XML output of the JHOVE application may be
embedded in a techMD section which is referenced from a file.
[http://hul.harvard.edu/jhove/]</p>
</requirement>
<requirement>
<head>Technical Metadata Associated with Representations</head>
<p> In addition to the technical metadata associated with files or bitstreams, this
profile also allows technical metadata to be associated with representations.
This profile considers the entire METS file itself to be one 'super'
representation, but it also considers each structMap in the METS file to be a
representation as well. A special techMD with a status of PRIMARY_REPRESENTATION
must be present in each METS file. This techMD section must embed a PREMIS
object element with an objectCategory of REPRESENTATION. This techMD section
must be referenced via the root div element of the primary structMap. </p>
<p> In addition to the PRIMARY_REPRESENTATION, other representation-level technical
metadata sections with embedded PREMIS object elements with an objectCategory of
REPRESENTATION may be included and referenced from other structMap div elements.
In certain cases it may also be useful to reference these representation-level
technical metadata sections from file elements.</p>
</requirement>
<requirement>
<head>Rights Metadata (rightsMD)</head>
<p> This profile does not require any particular rights metadata. However, if rights
statements are used, it is recommended to use the PREMIS rights element.
Regardless of how rights are specified, this profile does require systems
processing files which are conformant to this profile to preserve all rights
statements and their linkages to other sections during any actions performed on
the METS file.</p>
</requirement>
<requirement>
<head>Source Metadata (sourceMD)</head>
<p> This profile has no requirements for this section, but processing systems which
conform to this profile should preserve any sourceMD elements along with their
linkages to other sections during any actions performed on the METS file. </p>
</requirement>
<requirement>
<head>Digital Provenance Metadata (digiprovMD)</head>
<p> In general, any element with an ADMID attribute may reference a digiprovMD
element. This profile requires provenance for the primary and alternate
descriptive metadata sections. It encourages provenance for files and bitstreams
and structural maps, and it allows provenance in other sections. All provenance
statements must be expressed using the PREMIS event element and possibly
associated PREMIS agent elements.</p>
</requirement>
<requirement ID="DMD_DIGIPROV">
<head> Provenance for Descriptive Metadata </head>
<p> As previously noted in the structural requirements for dmdSec elements, each
dmdSec which has a STATUS of 'PRIMARY_DMDSEC' or 'ALTERNATE_DMDSEC' must have a corresponding
digiprovMD section. </p>
<p> The content of this digiprovMD must be a PREMIS event element embedded in the
digiprovMD via the mdWrap element. The PREMIS event must have an eventType of
'METADATA_TRANSFORMATION', 'METADATA_CREATION', 'METADATA_MODIFICATION', or
METADATA_DELETION. It should have an eventDetail which describes how the
metadata was derived, transformed, or modified and it should have an
linkingAgentIdentifier which identifies the agent responsible for the
transformation or creation of the descriptive metadata. The PREMIS event and
agent elements must conform to the XML Schema of PREMIS.</p>
<p> There are two ways to delete descriptive metadata using the METADATA_DELETION
eventType. In the first way the metadata is just marked as deleted, but it
continues to be maintained in the METS document. This is accomplished by
referencing a digiprovMD with a PREMIS event element with an eventType of
METADATA_DELETION. Even though the metadata is still contained in the document
for all intents and purposes it should be treated as if were deleted -- it would
not appear in any disseminations of the entity, except possibly for privileged
administrative users. The second way is to actually delete the metadata. In this
case just the mdRef or mdWrap elements are deleted from the dmdSec, leaving it
empty. The dmdSec element needs to be maintained so that it can reference the
digiprovMD with a PREMIS event element with an eventType of METADATA_DELETION.
The dmdSec also needs to be maintained to avoid dangling IDREF attributes which
may have pointed to the now deleted metadata. Because of the possibility of
deleted metadata, processing systems will need to take care to check any provenance
statements associated with the metadata before disseminating it to users or other
systems.
</p>
</requirement>
<requirement>
<head> Provenance for Files and Bitstreams </head>
<p> Each file or bitstream which is referenced or embedded in the fileSec may have
one or more corresponding digiprovMD sections. </p>
<p> The content of this digiprovMD must be a PREMIS event element embedded in the
digiprovMD via the mdWrap element. The PREMIS event should use an eventType from
the list of PREMIS Suggested Event Types when ever possible. It should have an
linkingAgentIdentifier which identifies the agent responsible for the event. The
PREMIS event and agent elements must conform to the XML Schema of PREMIS.</p>
<p> Just as with descriptive metadata, files may be deleted, and file deletions should be
indicated in the same way as for descriptive metadata. Files may be kept in the METS
document but marked as deleted by referencing a digiprovMD with a PREMIS event
with an eventType of DELETION. The file may also be deleted by removing all child
elements below the file element, but keeping the file element itself so that it can reference
a digiprovMD with a PREMIS event element with an eventType of DELETION.
In either case, processing systems must take care to check the DELETION status of any
files before attempting to disseminate or process them in any way.
</p>
</requirement>
<requirement>
<head> Provenance for Structural Maps </head>
<p> The first div child element of each structMap should reference a digiprovMD
section. </p>
<p> The content of this digiprovMD must be a PREMIS event element embedded in the
digiprovMD via the mdWrap element. The PREMIS event must have an eventType of
'STRUCTMAP_TRANSFORMATION', 'STRUCTMAP_CREATION', 'STRUCTMAP_MODIFICATION', or
'STRUCTMAP_DELETION'. It should have an eventDetail which describes how the
structural map was derived, transformed, or modified and it should have an