Encoding Facsimiles
The metadata for the facsimiles for each volume is contained within
a separate file in the volume's data directory (e.g. the facsimile
file for volume 1 is located in
data/volumes/vol01/vol1_facs.xml). The facsimile file for each volumes contains information about various pagination schemes as well as pointing to the facsimile image file in SFU's Islandora collection. This information was originally compiled from a spreadsheet and should be edited with caution to avoid breaking any links to the page images.
The facsimile element contains all
of the information about the pages with a
corresp pointing to the Islandora collection
landing page:
<facsimile corresp="islandora:4593">
<!-- ... -->
</facsimile>
Within the facsimile, each page is
represented by a surface, which at
minimum contains an xml:id to provide an
identifier. All pages that have a digital representation (i.e. not
missing or lost pages), also contain a
graphic that points to the Islandora
PID.
<surface n="1" xml:id="vol1_001">
<graphic url="islandora:4806"/>
</surface>
Paratextual pages (i.e. front covers, etc) are direct children of
the facsimile and contain a
desc that describes the page:
<surface n="1" xml:id="vol1_001">
<graphic url="islandora:4806"/>
<desc>front cover</desc>
</surface>
For pages that are part of the manuscript proper, we group the
pages into folios (i.e. front and back) in order to capture the
National Library of Scotland's folio number. Each
surface within a
surfaceGrp is categorized as
"recto" or "verso" through the
type attribute. Since there are multiple
pagination schemes for the manuscript, each
surface also contains a set of
milestones that specify the page number
under a specific counting system. For instance:
<surfaceGrp n="1">
<surface n="7" xml:id="vol2_007" type="recto">
<graphic url="islandora:4783"/>
<milestone unit="page" type="Forbes" n="199"/>
<milestone unit="page" type="volume" n="1"/>
<milestone unit="page" type="collection" n="199"/>
</surface>
<surface n="8" xml:id="vol2_008" type="verso">
<graphic url="islandora:4782"/>
<milestone unit="page" type="Forbes" n="200"/>
<milestone unit="page" type="volume" n="2"/>
<milestone unit="page" type="collection" n="200"/>
</surface>
</surfaceGrp>
This encoding says the following:
The
surfaceGrp/@nspecifies that this is Folio 1 in Volume 2
The first surface is:
n: The 7th page image (this does
not mean the 7th page overall, but the 7th digital image)
type: recto (i.e. the right hand
page)
graphic: The Islandora PID
is 4783
milestone unit="page" type="Forbes" n="199"/:
The number Forbes writes at the top of the page is 199
(continuous across the volumes)
milestone unit="page" type="volume" n="1":
The page number from the beginning of this volume that is
paginated (i.e. this is the first page in the second volume
that Forbes put a page number on)
milestone unit="page" type="collection" n="199"/:
The idealized page number.
This is the page number we use across the project for
pointers, ids, et cetera. In almost all cases, this is the
same as the Forbes number; however, in Volume 9, Forbes skips
from 2099 to 3000, making the last fifty pages of the
manuscript offset by 991.