Digital Initiatives Home About the Digital Initiatives Services Research and Development Metadata Reports Ask Questions Virgo Catalog
University of Virginia
University of Virginia Library
Digital Initiatives: Research and Development

Digital Library Implementation - Book TEI Content Model

Digital Library Implementation Home > Content Models

Input Master

TEI P4 XML file, with local modifications; follow the guidelines and DTD available at http://text.lib.virginia.edu/bin/cgi-dl/dlps/doco/text/kb/markup_guide/

TEI file contains full header, transcribed text, and links to page images

Datastreams

  • static TEI XML file with full headers and links to page images, but no text transcription

    Phase 2 content for this model: Selections from DLPS 2003 Q3, 2003 Q4, 2004 Q1, and American Studies texts

uvaBook Behaviors (Transcribed texts with page images)

  • uvaDefault Disseminator / uvaDefaultTEI Mechanism
    • getPreview - returns plain text from DescMeta
    • getLabel - returns plain text label from DescMeta
    • getDescription - returns plain text description from DescMeta
    • getFullView(pid)
    • getDefaultContent - returns raw xml
  • uvaMeta Disseminator
    • getDescMeta - returns raw xml
    • getAdminMeta - returns raw xml
    • getDC - returns Dublin Core elements in raw xml
    • viewDescMeta(style)
    • viewAdminMeta(style)
    • viewDC(style)
  • uvaGenText Disseminator
    • getLabel - returns xml label
    • getLabelSTX
    • getTreeView - returns xml table of contents
    • getTreeViewSTX
    • getChunk(id, page, pageNum) - return xml chunk specified by id
    • getChunkSTX(id, page, pageNum)
    • getDynamicView(pid*, behav*, refb, id, page, pageNum) - returns cocoon app of dynamic view
    • getXML - returns raw xml
  • uvaPageBook Disseminator
    • getLabel - returns xml label
    • getLabelSTX
    • getTreeView - returns xml table of contents
    • getTreeViewSTX
    • getPageTurner(pid*, behav*, refb, id, page, pageNum, snum, sblk) - returns cocoon app of page turner
    • getXML - returns raw xml
  • uvaOKI Disseminator
    • getAssetDefs(childPid, parentPid) - returns asset definitions as xml
  • uvaRelation Disseminator / uvaRelationTEI behavior
    • getChildDescMeta(childPid) - returns descriptive metadata for childPid
  • Other
    • getCitation - returns a true bibliographic citation, plus an actionable, persistent link to the text object.   This contains additional elements than are included in the getPreview behavior in the uvaDefault disseminator.  Scheduled for later implementation.
    • getHeader - returns the header from the TEI datastream file, rather than metadata from the DescMeta elements in the Fedora object.  Scheduled for later implementation.

Recommendation:  Three TEI models are required to handle the great variation between fully transcribed texts without page images, fully transcribed texts with page images, and page image-only texts without transcriptions.

The existing mechanisms for transcription display can be used on variations of TEI encoding provided we are willing to accept that some elements may be rendered (or not) with the different encoding versions.  To have a single TEI content model we must either update the markup to bring all to same encoding standard (over and above parsing against the same DTD), or make the mechanisms more flexible to handle the variations.  There are costs in terms of people resources and time to do either.

February 18, 2004; revised April 27, May 19, July 23, August 27, and September 17, 2004

Digital Initiatives
University of Virginia
PO Box 400112
Charlottesville, VA 22904-4112

Digital Initiatives Home • UVa Library Home
Search the Library Site • UVa Home
Maintained by: dl@virginia.edu
Last Modified: Monday, August 03, 2009
© The Rector and Visitors of the University of Virginia