B33080 Humanities Computing: Electronic Text

University of Antwerp, Campus Drie Eiken

Second Term 2006

Edward Vanhoutte

edward.vanhoutte@kantl.be

http://www.edwardvanhoutte.org

TOC | First

B33080 Humanities Computing: Electronic Text

Week 1: Introduction to this course - Humanities Computing - History of the Internet - Hypertext.

University of Antwerp, Campus Drie Eiken

Monday 13 February

Edward Vanhoutte

TOC | First


I. Monday 13 February Introductions

Congratulations

previous table of contents next
1 of 5 [36]
congratulationscongratulationscongratulationscongratulations congratulationscongratulationscongratulationscongratulationscon gratulationscongratulationscongratulationscongratulationscongra tulationscongratulationscongratulationscongratulationscongratul ationscongratulationscongratulationscongratulationscongratulati onscongratulationscongratulationscongratulationscongratulations congratulationscongratulationscongratulationscongratulationscon gratulationscongratulationscongratulationscongratulationscongra tulationscongratulationscongratulationscongratulationscongratul ationscongratulationscongratulationscongratulationscongratulati onscongratulationscongratulationscongratulationscongratulations congratulationscongratulationscongratulationscongratulationscon gratulationscongratulationscongratulationscongratulationscongra tulationscongratulationscongratulationscongratulationscongratul ationscongratulationscongratulationscongratulationscongratulati onscongratulationscongratulationscongratulationscongratulations congratulationscongratulationscongratulationscongratulationscon gratulationscongratulationscongratulationscongratulationscongra

Congratulations

previous table of contents next
2 of 5 [36]
  • Unique in Belgium
  • Of direct relevance to the job market
  • You managed to find this course

University Courses (a.o.)

previous table of contents next
3 of 5 [36]
  • Australia
  • Belgium
  • Denmark
  • Finland
  • France
  • Germany
  • Italy
  • Netherlands
  • Norway
  • Spain
  • Sweden
  • Czech Republic
  • UK
  • US

Short courses and summer/winter schools

previous table of contents next
4 of 5 [36]
  • ESSLI, European Summer School in Logic, Language and Information, Edinburgh, UK
  • Digital Humanities / Humanities Computing Summer Institute, University of Victoria, Canada
  • Maryland Institute for Technology in the Humanities (MITH) University of Maryland; USA
  • Manuscript & Elektronische Tekst Academie META04 The Centre for Scholarly Editing and Document Studies of the Royal Academy of Dutch Language and Literature, Gent, Belgium
  • Book & Text Studies: Humanities Computing, Rhodes University, South Africa
  • http://www.tei-c.org/Activities/SIG/Training/short_courses.html

Monday 13 February

previous table of contents next
5 of 5 [36]
  1. Introduction to this course
    1. Objectives of this course
    2. (Non-)Assumptions
    3. Me & You
    4. Housekeeping Rules
    5. Overview of the Course
    6. Test Elementary Computer Skills
  2. Introduction to Humanities Computing
    1. Humanities Computing: definitions
    2. Humanities Computing: a field, a discipline
      1. Associations involved
      2. Journals and mailinglists
      3. Publications
      4. Institutions
    3. Humanities Computing: short history
  3. Computing
    1. Hardware
    2. Graphical Interface

1.1.a. Explicit Objectives of this Course

previous table of contents next
1 of 18 [36]
  • Digitization
  • XML
  • XSL
  • Project management

1.1.a. Implicit Objectives of this Course

previous table of contents next
2 of 18 [36]
  • Team work
  • Project management
  • Problem solving thinking

1.1.a. Contents of this Course

previous table of contents next
3 of 18 [36]
  • Humanities Computing
  • Digitization of text and images
  • XML
  • XSL

1.1.a. Humanities Computing

previous table of contents next
4 of 18 [36]
After this course, you should be aware of:
  • what humanities computing is
  • why it matters
  • what it can do for us
  • what it can do for the humanities

1.1.a. Digitization of Texts and Images

previous table of contents next
5 of 18 [36]
After this course, you should be aware of:
  • what a digital image is
  • what a digital text is
  • what digitization is
  • why you would digitize an object
  • ways of obtaining texts
  • how digitization works
  • tools, techniques, and standards
  • files and formats

1.1.a. XML: eXtensible Markup Language

previous table of contents next
6 of 18 [36]
XML is a metalanguage by which one can create separate markup languages for seperate purposes. After this course you should know how to:
  • analyse a document
  • create a well formed XML document
  • create a valid XML document
  • parse an XML document for validation
  • read and interpret an XML document
  • create and read small DTDs
  • use and understand TEI-Lite
  • use and understand DALF

1.1.a. XSL: eXtensible Stylesheet Language

previous table of contents next
7 of 18 [36]
XSLT is a tool for processing XML documents. After this course, you should know how to:
  • transform XML from one datatype to another
  • style XML for display in off-the-shelf Web browsers
  • use XPath to denote specific parts of a document
  • use XSLT as a search engine
  • segment your data
  • retag your data
  • express recursive algorithms in XSLT

1.1.a. Not covered in the course

previous table of contents next
8 of 18 [36]
  • (X)HTML, CSS
  • ECDL
  • Web design, page design, typography
  • XSL formatting objects
  • Tricks of the trade (production use)
  • Digital audio and video
  • Three dimensional digitization

1.1.b. Assumptions

previous table of contents next
9 of 18 [36]
I assume that:
  • You have elementary computer skills:
    • know how to work with multiple windows
    • work with the mouse
    • create folders and files
    • download files from the internet
  • You are interested in literary and historical texts, databases, and images.
  • You expect to work with XML data, and need to:
    • markup texts
    • display information
    • extract information
    • reformat information
  • You expect to work with digitized texts and images
  • Understand English
  • (and a little bit of Scottish and Dutch)

1.1.b. Non-Assumptions

previous table of contents next
10 of 18 [36]
I do not assume that:
  • You know something about programming
  • You know something about markup, transformations, and digitization
  • You know XML and can edit XML documents
  • You have used XSLT
  • You are a programmer

1.1.c. The Lecturer: Edward Vanhoutte

previous table of contents next
11 of 18 [36]

1.1.c. You

previous table of contents next
12 of 18 [36]
I want to know:
  • Who you are (name)
  • What you do
  • What you study
  • Why you're here
  • What you expect of this course

1.1.d. Housekeeping Rules

previous table of contents next
13 of 18 [36]
  • Marking:
    1. Permanent Evaluation: presence and participation in the weekly lectures
    2. Weekly Assignments
    3. Group Assignment: electronic edition of the correspondence of John T. Whalen and Theodore Bissell
  • Lectures:
    • Academic 15 min for checking email
    • Always bring a disk
    • Preparation & exercises
    • Required readings
    • Suggested readings
    • → Reader
  • Course website:

1.1.d. Communication

previous table of contents next
14 of 18 [36]
  • A1.13
  • Communication is via email
  • I can be reached at all times (and I mean at all times) at edward.vanhoutte@kantl.be

1.1.d. What I Need Now

previous table of contents next
15 of 18 [36]
  • → A copy of your student card
  • → A mail with your email address to edward.vanhoutte@kantl.be

1.1.e. Overview of the course

previous table of contents next
16 of 18 [36]
  • Monday 13 February: 1. Introduction to this course - Humanities Computing - History of modern computing
  • Monday 20 February: 2.
    • History of the Internet - Hypertext
    • XML theory and practice: Text & Computers - Text Encoding & Markup - Document Analysis - DTD
  • Monday 27 February : 3. XML theory and practice: HTML - SGML/XML - TEI - well formed XML - DTD - Schema
  • Monday 6 March: 4. XML theory and practice: Valid XML - Parsing/Validating - TeixLite
  • Monday 13 March: 5. XML theory and practice: TEI - DALF
  • Monday 20 March: 6. XSL theory and practice: basics, XPath, function
  • Monday 27 March: 7. XSL theory and practice: Real XSLT
  • Monday 17 April: 8 Easter Monday - No Class
  • Monday 24 April: 9. Guest Lecture Dr. Melissa Terras
  • Monday 1 May.: 10. 1 May - No Class
  • Group Project - Documentary "Into the Future"

1.1.f. Test Elementary Computer Skills

previous table of contents next
17 of 18 [36]
  • Make three folders on your C drive, name them "Man", "Woman", "Baby".
  • Download the following files from http://www.kantl.be/ctb/vanhoutte/teach/hc2006.htm
    • Week 1 > The Field of Humanistic Informatics and its Relation to the Humanities (HI.html): put HI.html in folder "Man"
    • Week 4 > teixlite.dtd: put teixlite.dtd in folder "Woman"
    • Bottom > XHTML 1.0 logo (valid-xhtml10.png): put valid-xhtml10.png in folder "Baby"
    • Put the contents of "Baby" in "Woman" and delete "Baby"
    • Copy the contents of "Woman" to "Man"
    • Save the remaining folders to a disk
    • Pass the disk on to your right hand neighbour
    • Explore the contents of the disk and show me

Possible problems

previous table of contents next
18 of 18 [36]
  • Your webbrowser doesn't know files with extensions .dtd / .xml / etc.
    → save as "all files" and add extension yourself
    → don't follow the link but use right click to "save target as"

1.2. Introduction to Humanities Computing

previous table of contents next
1 of 13 [36]
  • a. Humanities Computing: definitions
  • b. Humanities Computing: a field, a discipline
    • a. Associations
    • b. Journals
    • c. Mailinglists
    • d. Publications
    • e. Institutions
  • c. Humanities Computing: short history

1.2.a. Humanities Computing: definitions

previous table of contents next
2 of 13 [36]
Humanities Computing is an academic field concerned with the application of computing tools to arts and humanities data or their use in the creation of these data.
Willard McCarty

1.2.a. Humanities Computing: definitions

previous table of contents next
3 of 13 [36]
To apply Computing solutions to valid, and generally complex, Humanities based problems to provide access and answers that otherwise would have been impossible.
Melissa Terras

1.2.a. Humanities Computing: definitions

previous table of contents next
4 of 13 [36]
Humanities Computing or Digital Humanities is an interesting field at the intersection of a wide array of disciplines which can be roughly identified as belonging to either the Humanities or to Computer Science (Informatics).
Edward Vanhoutte


Wen wir die Sache so verstehen, so leuchtet sofort ein, dass die geisteswissenschaftliche Informationsverarbeitung keine eigenständige Wissenschaft ist und unmöglich eine werden kann. Den jedes Problem, das wir haben, ist entweder mit einem Problem der Informatik oder mit einem Problem der Geisteswissenschaften vermunden, jede Methode gleichfalls.
Michael Sperberg-McQueen

1.2.a. Humanities Computing: definitions

previous table of contents next
5 of 13 [36]
Required reading:

1.2.b. Humanities Computing: a field, a discipline?

previous table of contents next
6 of 13 [36]
Required reading
  • Marilyn Deegan (2000). "Introduction." Frances Condron, Michael Fraser & Stuart Sutherland (eds.), Guide to Digital Resources for the Humanities 2000. Oxford: CTI.
Further reading
  • Michael Sperberg-McQueen (2002). "Geisteswissenschaften und Informatik. Zur aktuellen Situation und zu künftigen Aufgaben." In Thomas Burch, Johannes Fournier, Kurt Gärtner and Andrea Rapp (eds.), Standards und Methoden der Volltextdigitalisierung. Mainz/Stuttgart: Akedemie der Wissenschaften und der Literatur/Franz Steiner Verlag. 27-37.
  • Willard McCarty (25 November 2005). Tree, Turf, Centre, Archipelago--or Wild Acre? Metaphors and Stories for Humanities Computing. Literary and Linguistic Computing, doi:10.1093/llc/fqi066 http://llc.oxfordjournals.org/cgi/content/abstract/fqi066v1

1.2.b.a. Associations: A Selection

previous table of contents next
7 of 13 [36]

1.2.b.b. Journals (Print): A Selection

previous table of contents next
8 of 13 [36]
  • Literary & Linguistic Computing
  • Computers and the Humanities [discontinued]
  • Human IT
  • Markup Languages: Theory and Practice [discontinued]
  • Computers and Texts
  • Revue. Informatique et Statistiques dans les Sciences Humaines.
  • Text Technology. The Journal of Computer Text Processing.
  • Cultural & Heritage Informatics Quarterly
Institutional models for humanities computing
http://www.kcl.ac.uk/humanities/cch/allc/imhc/

1.2.b.b. Journals (on-line): A Selection

previous table of contents next
9 of 13 [36]
Institutional models for humanities computing
http://www.kcl.ac.uk/humanities/cch/allc/imhc/

1.2.b.c. Mailing Lists: A Selection

previous table of contents next
10 of 13 [36]
  • Humanist
    an international electronic seminar on the application of computers to the humanities whose primary aim is to provide a forum for discussion of intellectual, scholarly, pedagogical, and social issues and for exchange of information among members.
    http://www.kcl.ac.uk/humanities/cch/humanist/
  • TEI-L
    a discussion for newbies and advanced users of the TEI. → http://listserv.brown.edu/archives/tei-l.html
  • E-DOCS
    a discussion list and website for professionals involved in the production, distribution, and organization of historical documents on the Internet. Offers an index of "Best Practices and Exemplary Sites."
    http://history.furman.edu/edocs/

1.2.b.d. Publications: A Selection

previous table of contents next
11 of 13 [36]
  • Susan Hockey (2000). Electronic Texts in the Humanities. Oxford: Oxford University Press.
  • Jahrbuch für Computerphilologie - print (München, Germany) & online
    http://computerphilologie.uni-muenchen.de/ejournal.html
  • Teresa Numerico and Arturo Vespignani (eds.) (2003). Informatica per le scienze umanistiche. Bologna: Il Mulino.
  • Office for Humanities Communication Publications (King's College London, currently 16 titles in print)
  • Susan Schreibman, Ray Siemens and John Unsworth (eds.) (2004). A Companion to Digital Humanities. Malden, MA/Oxford/Carlton: Blackwell Publishing.
  • Willard McCarty (2005). Humanities Computing. London: Palgrave.

1.2.b.e. Institutions

previous table of contents next
12 of 13 [36]
Institutional models for humanities computing
http://www.kcl.ac.uk/humanities/cch/allc/imhc/

1.2.c. Humanities Computing: short history

previous table of contents next
13 of 13 [36]
Required reading
  • Susan Hockey (2004). 'The History of Humanities Computing.' in Susan Schreibman, Ray Siemens and John Unsworth (eds.), A Companion to Digital Humanities. Malden, MA/Oxford/Carlton: Blackwell Publishing.
Further reading: