![]() |
![]() |
![[ornament]](/images/horzorn1.gif)
The TEI guidelines draw a distinction between two broadly defined classes of structure: the major "structural division" markers (such as a chapter) and smaller "elements" (such as paragraphs, italicized phrases, verse lines, or individual speeches in a play).
This category includes the major partitions of the text beginning with descriptions of the largest partitions: the "TEI" document, the "TEI" header, the "text", the "body" of the text, "front" and "back" matter; and ending with descriptions of partitions within the textual body itself: volumes, chapters, sections, acts, etc.
The TEI Tag and Its Two Divisions
All of the texts we prepare share the same basic set of large-scale divisions. Each text is bound in its entirety first by a pair of tags -- <TEI.2> </TEI.2> -- that mark it as conforming to the Text Encoding Initiative rules. The <TEI.2> tag pair, which must be given an "id" identical to the "text id," encloses two major sections: a <teiHeader> and a <text>. The <teiHeader> records information about the print source, about the creator of the electronic version, about changes we have made, and so on (The <teiHeader> is actually generated by a Header Template; it is not manually encoded):<TEI.2 id="AusEmma">
<teiHeader>
[Source and processing information goes here]
</teiHeader>
<text id="AusEmma">
[All of the material that is part of Emma goes here]
</text>
</TEI.2>
<TEI.2 id="AusEmma">
<teiHeader>
[Source and processing information goes here]
</teiHeader>
<text id="AusEmma"><body></text>
[text goes here]
</body>
</TEI.2>
<TEI.2>
<teiHeader>
[Source and processing information goes here]
</teiHeader>
<text id="AusEmma"><front></text>
[preface, etc. goes here]
</front>
<body>
[main body of the text goes here]
</body>
<back>
[appendices, etc goes here]
</back>
</TEI.2>
<TEI.2 id="AusEmma">A chapter is not inherently numbered as <div1> in every work: if the work is a multi-volume novel, then the volume, and not the chapter, is the largest internal structural division -- the volume becomes the <div1> for those texts, and the chapter becomes <div2>:
<teiHeader>
[Source and processing information goes here]
</teiHeader>
<text id="AusEmma"><body></text>
<div1 type="chapter" n="1"></body>
<head>Chapter 1</head>
[text of Chapter 1 goes here]
</div1>
<div1 type="chapter" n="2">
<head>Chapter 2</head>
[text of Chapter 2 goes here]
</div1>
<div1 type="chapter" n="3">
<head>Chapter 3</head>
[text of Chapter 3 goes here]
</div1>
<div1 type="chapter" n="4">
<head>Chapter 4</head>
[text of Chapter 4 goes here]
</div1>
</TEI.2>
<TEI.2 id="xxxxxxx">
<teiHeader>
[Source and processing information goes here]
</teiHeader>
<text id="xxxxxxx">
<body></text>
<div1 type="volume" n="1"> [Volume 1 starts here, including chapters]</body>
<div2 type="chapter" n="1.1"> [Chapter 1.1 goes here] </div2></div1>
<div2 type="chapter" n="1.2"> [Chapter 1.2 goes here] </div2>
<div2 type="chapter" n="1.3"> [Chapter 1.3 goes here] </div2>
<div1 type="volume" n="2"> [Volume 2 starts here, including chapters]
<div2 type="chapter" n="2.1"> [Chapter 2.1 goes here] </div2></div1>
<div2 type="chapter" n="2.2"> [Chapter 2.2 goes here] </div2>
<div2 type="chapter" n="2.3"> [Chapter 2.3 goes here] </div2>
</TEI.2>
Tags can be further expanded and defined through the use of "attributes", which
are descriptive components within the opening tag.
Formally, the lang attribute is an IDREF; a reference to the id value of a <language> element in the TEI header. It is a requirement of the TEI scheme that the lang attribute point to a <language> element. This means that each language used in the document should be declared in the TEI header using the <language> element. In the Header Template there is an "AdBlock" button to bring up further language fields that will then, once filled out, be inserted into the header. Once in the header, the field will look like the following:
<langUsage>
<language id="fre">French</language>
</langUsage>
The following are typical "type" attributes for major divisional structures in prose and poetry:
Unlike the tags that mark the larger structural hierarchy, this second class of tags identify individual aspects of a text. They are not part of a numbered hierarchy. Common examples include tags to mark typographical elements, titles, paragraphs, or lines. Examples of element tags are listed in the sections below, and a far more extensive list can be found in the Text Encoding Initiative's Guidelines for Electronic Text Encoding and Interchange.
Examples of the use of these smaller-scale elements can be found throughout the other sections of this guide.