[Home] [Teaching] [Projects] [Research] [Publications] [Curriculum Vitae]

HUMANITIES COMPUTING: Electronic Text - 2002-2003
Last Revision: 27/04/2003
new version of DALF.clb (07/04/03)
New version of schrey.zip (14/04/03)

B33080 - 30 contact hours - 4 credits



[Week 1] [Week 2] [Week 3] [Week 4] [Week 5] [Week 6] [Week 7] [Week 8] [Week 9] [Week 10]

Group Project

Description:

Make an electronic edition of the letters of Julius J.B. Schrey.

  • Encode the letters using DALF.
  • Encode the introductory essays using TeiLite.
  • Create XSL and CSS stylesheets for the vizualisation of the XML files.
  • Generate an XHTML output using XSLT.
  • Document your encoding practice.

Materials for download:

Documentation:

  • Edward Vanhoutte & Ron Van den Branden, DALF guidelines for the description and encoding of modern correspondence material Version 1.0 (discussion draft). [html] [pdf]
  • TEILite. "TEI U5: Encoding for Interchange: an introduction to the TEI." [html] [sgml] [xml] [pdf]
Deadline

The group assignment and the documentation of the markup guidelines should be handed in no later than 20 June (12 a.m.) 2003 both on CD-Rom and per email <evanhoutte@kantl.be>. The CD-Rom should be sent to:

Edward Vanhoutte
CTB - Centrum voor Teksteditie en Bronnenstudie
Koningstraat 18
b-9000 Gent

Top

Lecturer: Edward Vanhoutte
CTB - Centrum voor Teksteditie en Bronnenstudie
Koningstraat 18 / b-9000 Gent
tel: +32 (0)9 265.93.51 / fax: +32 (0)9 265.93.49
evanhoutte@kantl.be
Time: Monday 9-12u.30. - 2nd semester 2002-2003
Room: Computer room A9.01 (Library building A)
Contents: The use of electronic texts in all areas of current society and all disciplines of both the Humanities and the hard sciences is increasing enormously. Together with this trend, the problems attached to the use and interchange of electronic texts become more prominent: software- and platform-incompatibility, loss of data in converting files, problems of arciving, creation, use, etc. This course addresses these problems and focuses on the problematic position of electronic texts in the humanities. The student can also expect an introduction in the history and evolution of electronic publication media such as the Internet. In lectures, seminars, and workshops, we draw the attention to the creation and publication of electronic texts, and gain hands-on experience in using internationally accepted standards for text-encoding and markup - SGML, XML, (X)HTML, XSL, CSS, TEI... This course introduces tools and techniques which will be used by the students to produce an electronic publication. This year, we will concentrate on a new method for the encoding of modern manuscript material (DALF), and the students will prepare an electronic edition of some letters from the 19th and 20th century.
Pre-required knowledge: An elementary computer literateness is required (know how to work with multiple windows, work with the mouse, create folders and files, download files from the internet), but an introductory session may be organised for students who are not up to elementary standards.
This course is taught in English. Foreign students are most welcome.
Format: Seminars and workshops with preparation.
Examination: Permanent evaluation and group assignment. Only students who take part in both parts of the assesment will be eligible to receive marks on this course.
Required reading:
  • Tim Berners-Lee (1999). Weaving the Web - The original design and ultimate destiny of the World Wide Web by its inventor. London: Orion Business Press / San Francisco: Harper.
  • Tim Berners-Lee (2000). De wereld van het World Wide Web. Het oorspronkelijke ontwerp en de uiteindelijke bestemming van het World Wide Web, beschreven door zijn uitvinder. Amsterdam: Nieuwezijds.
  • Hockey, Susan (2000). Electronic Texts in the Humanities. Oxford: Oxford University Press.
  • The journals Literary & Linguistic Computing, Computers and the Humanities, Markup Languages: Theory and Practice and Human IT.
  • Further required and advised readings are posted on the course website.
Credits: This course counts for 4 ECTS credits, which equals a 120 hour workload. This is organized as follows:
  • Lectures: 30h.
  • Weekly preparation: 20h.
  • Group assignment: 60h.
  • Report: 10u.

Programme

Week 1 (17 February): Introduction Humanities Computing - History of the Internet - Hypertext.

Format Formal lecture
Preparation
  • Know how to surf the internet, look and find information.
  • Know how to email.
Required reading
  • Susan Hockey (2000). Electronic Texts in the Humanities. Oxford: Oxford University Press. Chapter 1: "Why Electronic Texts?". p. 1-10.
Further reading

Top


Week 2 (24 February): Document Analysis - Markup - Text Encoding - SGML/XML - DTD.

Format Seminar
Preparation
  • Mail me (evanhoutte@kantl.be) a list of seven different internet browsers.
  • Read Alan Morrison, Michael Popham & Karen Wikander (2000). Creating and Documenting Electronic Texts: A Guide to Good Practice. (Oxford: OTA): chapter 2: "Document Analysis."
  • Choose a document and analyse it.
Required reading
  • Susan Hockey (2000). Electronic Texts in the Humanities. Oxford: Oxford University Press. Chapter 3: "Text Encoding". p. 24-48.
Course material
Further reading

Top


Week 3 (3 March): XML - TEI Hands-on - CSS - XSL

Format Seminar
Assignment Copy your favourite poem to a plain text file (ASCII) *.txt and bring it along on a disk.
Required reading
  • P4 TEI Guidelines for Electronic Text Encoding and Interchange. A Gentle Introduction to XML. [html] [xml] [pdf]
    • For the Linguists: Susan Hockey (2000). Electronic Texts in the Humanities. Oxford: Oxford University Press. Chapter 6: "Linguistic Analysis". p. 85-103.
    • For the Literaturists: Susan Hockey (2000). Electronic Texts in the Humanities. Oxford: Oxford University Press. Chapter 5: "Literary Analysis". p. 66-84.
Data files
Downloads
Installation

NoteTab Light is a very complete plain text editor which allows you to create SGML, XML, (X)HTML, CSS etc. documents.

Download the software on your computer and unzip the file with an Unzip programma (e.g. WinZip). Double click the Setup.exe file and follow the install shield guidance. Once installed, run the programma and select View > Options > File Filters. Select "New", and add the next details

  • Description: "xml"
  • Wildcards: "*.xml"
  • Click the OK button. Now you can save XML instances with the extension ".xml".

Repeat this operation for each file format you want to add to the software, e.g. CSS.

Download teixlite.clb and save (with .clb extension!) in NoteTab Light/Libraries. The Tab "teixlite" will now appear in the tab-bar at the bottom of the programme window. Click to activate the library which will appear in the left margin.

Reference material
TEI U5 and teixlite Please note that TEI U5 defines teilite in terms of SGML. Teixlite is XML, this means that you will have to adapt your encoding practice to next rules of thumb:
  1. Element names: use the exact element names. XML is case sensitive!
  2. Empty elements: don't tag empty elements as e.g. <pb>, but as <pb />
  3. Attributes: lower case.
  4. Attribute values: always quoted. e.g. <pb id="1.01.002" n="2" />
  5. Tag minimization: not allowed in XML. All tags which are opened must be closed and must nest correctly.

Top


Week 4 (10 March): XML - TEI Hands-on, Guest appearance of Vincent Neyt &Documentary "Into the Future".

Format Seminar & Movie
Assignment Try to encode a poem, a piece of prose etc. using teixlite. Mail me (evanhoutte@kantl.be) if you experience any problems or for feedback.
Required reading TEILite. "TEI U5: Encoding for Interchange: an introduction to the TEI." [html] [sgml] [xml] [pdf]
Data files
Downloads
Installation

Download the binaries for Windows 95 en Windows NT and unzip and extract in a SP folder which you create. The setup creates three folders: bin, doc and pubtext. You can find the parser (nsgmls) in the bin folder.

Next, download the Runsp2 windows interface for nsgmls. Unzip the file in the bin folder of SP. By running runsp2.exe, runsp2 wil find nsgmls. Read runsp.txt carefully.

Copy the next files in the same bin folder:

Specify where nsgmls can find the catalog file under Options in the toolbar of runsp2.

Specify where nsgmls can find xml.dcl under Options in the toolbar of runsp2.

Download and save teixlite.clb (with .clb extension!) in NoteTab Light/Libraries. The Tab "teixlite" will now appear in the tab-bar at the bottom of the programme window. Click to activate the library which will appear in the left margin.

Top


Week 5 (17 March) St. Patrick's Day: XML DALF Hands-on. Guest appearance of Ron Van den Branden.

Format Seminar
Assignment Check, correct and validate the file error.xml which contains 109 errors, and explain how you correct this file in 8 steps (one of which is combined). Mail me (evanhoutte@kantl.be) your report before Friday March 14th, 12 a.m.
Required Reading
  • Edward Vanhoutte & Ron Van den Branden, DALF guidelines for the description and encoding of modern correspondence material Version 1.0 (discussion draft). [html] [pdf]
Downloads

Top


Week 6 (24 March): XML - DALF Hands-on - Group project

Format Seminar
Assignment Complete and validate the Schrey01 letter (1893.10.23) and encode the Schrey03 letter (1893.11.05). Mail me (evanhoutte@kantl.be) your files before Friday March 21st, 12 a.m. Please take following points into consideration:
  • All PCDATA inside <notes>, explanatory <p>s and metadata tags should be in Dutch.
  • The value of the attribute "value" of <date> should have the form "yyyy-mm-dd"
Required Reading
  • Edward Vanhoutte & Ron Van den Branden, DALF guidelines for the description and encoding of modern correspondence material Version 1.0 (discussion draft). [html] [pdf]
Downloads

Top


Week 7 (31 March): XML - DALF hands-on - Group project

Format Seminar

Week 8 (21 April): No Class - Easter Monday

Assignment Group assignment

Week 9 (28 April): Digitization. What, why, how, OCR & imaging

Required reading
  • Susan Hockey (2000). Electronic Texts in the Humanities. Oxford: Oxford University Press. Chapter 2: "Creating and Acquiring Electronic Texts". p. 11-23.
  • Stuart D. Lee (2001). Digital Imaging. A practical handbook. London: Facet Publishing. Chapter 3: "How do you digitize?" p. 35-75.
Further reading
Downloads

Top


Week 9bis (2 May): Guest Lecture: Martha Nell-Smith: "Dickinson, a User's Guide."

Format Lecture
Place & time Koninklijke Academie voor Nederlandse Taal- en Letterkunde, Koningstraat 18, Gent - 10u.30.
Further information http://www.kantl.be/ctb/seminar/03/seminar-nellsmith.htm

Top



XHTML author: Edward Vanhoutte
Last Revision: 27/04/2003


[Home] [Teaching] [Projects] [Research] [Publications] [Curriculum Vitae]

Valid XHTML 1.0!