Conceptual Description of Repository Workflow
March 21, 2003
I. Preliminary Resource Selection
- Selection information gathering process by faculty, selector, and/or
eCenters(s).
- Evaluate the resource for appropriateness in the Librarys collection
if not already approved by a selector.
- Create or receive a proposal for a new digital resource to be included
in the Librarys collection, including any special delivery functional
requirements or requests for special media file processing.
- Prepare a high level assessment of the costs associated with collecting
the resource, processing costs, and technical difficulties it may bring.
II. Resource Assessment
- Obtain a sample of the media and metadata for review, if possible
- Determine if the resource fits into an existing collection or class of
resources, based on type and format of the files included and needs for
delivery and management.
- For a new type of resource, perform a feasibility study to determine
the amount of work that mounting it will take, any technical problems that
it raises, and who will do it.
- Decide on new formats or standards, if necessary.
- Determine the time frame, operating system, system resources, specialized
software, licensing requirements, and how much disk will be needed for mounting
the resource.
- Determine the access policies that must be enforced and human resources
necessary to use and manage the resource.
- Inform the selector about ongoing support and preservation requirements
and costs.
- If the resource is determined to be problematic, make a recommendation
back to the selector.
- If the resource is deemed OK, then approve and prioritize.
III. Final Resource Acquisition and Prioritization
- Identifying funding source for processing costs
- Complete purchase or negotiation for resource.
- Make arrangements for the acquisition of the files and associated data.
- Add project into prioritized processing queue.
IV. Content Preparation
- Prepare the infrastructure for the processing of the files, including
workspace, cenrepo storage space, and potential higher-level volume and
processor management.
- Prepare delivery versions of files, if appropriate. For example, create
delivery resolutions of image files
- Convert file names to our standards.
- Create objects that describe the resource as a collection, if appropriate.
- For resources that encode metadata into the content, such as in the TEI
header, ascertain that the minimally required metadata is available.
- Put all files in the appropriate place Cenrepo file system.
- Initiate archiving with the creation of archival master files, if appropriate.
V. Metadata Preparation
- Identify the metadata supplied by the vendor, creator, or selector.
- Map new metadata structures to descmeta.
- For resources with metadata contained in the content, prepare the extraction
process; for example, with a TEI file, make sure that an existing XSL stylesheet
will work or create a new one.
- For resources with no metadata or for which only partial metadata is
available, identify the appropriate metadata standard and cataloging tool,
assign cataloging resources, and create the metadata.Organize the metadata
into the form required by the batch creation process and put it in the appropriate
place in the Cenrepo file system.
- Ensure that any policy requirements are reflected appropriately in the
metadata for the whole batch.
VI. Delivery Preparation
- For new classes of resources, create the behavior objects needed.
- If the new resource needs a new search or browse system, create it.
- Have the selector review the delivery prototype.
VII. Quality Assurance for resources
- Check to make sure that standards and practices have been followed for
content and metadata.
- Check to make sure that files are all in their places
- Determine that policies are appropriately registered in the metadata
VIII. Repository Installation
- Confirm that server and system resource infrastructure is in place.
- Create the batch template for the child objects in the resource.
- Add that batch to the repository .
- Use the PID list returned from the batch to update the IDs in the parent
objects with the PID for the appropriate child objects.
- Create the batch template for the parent objects.
- Add that batch to the repository. Note that they may be complex batches
where the first parents are child objects to other parents.
- Use the PID list returned from the batch to update the parent pointer
in the child objects, if appropriate.
- Update the appropriate search and browse index(es).
- Add any hard links to the web site, as appropriate.
IX. Quality assurance for delivery
- Make sure that the disseminators work appropriately.
- Make sure that the search and browse functions work appropriately.
- Ensure that policies associated with delivery are enforced.
- Have the selector review the delivery application.
X. Quality assurance for management
- Ensure that policies associated with the management of the resource have
been enforced.
- Ensure that the resource has been properly archived, if appropriate.
|
|