View on GitHub

NEH Institute materials

July 2017

Don’t touch that keyboard! “Making your edition”: starting with research questions
Where data modeling belongs in the work flow
Markup as an expression of the data model; making the implicit explicit and machine-actionable
The relationships among model, syntax, and markup semantics
How modeling reduces iterations of the document analysis → schema development → markup pipeline
Reconciling community-driven (prescriptive) and research-driven (descriptive) analysis; “how do I do this in X?” vs “how should I model this?”
Modular development: the digital edition as a computational pipeline
Modeling in plain text and in XML

Understanding modeling perspectives (tree, ranges, graph) and communities
Modular development: thinking about digital edition development as a computational pipeline
Beginning to tokenize texts
Beginning to normalize texts

Understanding the principles of basic text transformations like normalization and how they serve different objectives
Bringing together tokenization and normalization as individual pipeline steps and seeing how they can be implemented in the act of collation
Normalize, tokenize, and collate text
Fundamentals of TAG: hypergraph Modeling discontinuity

Grasping the concept of modelling text as trees and graphs Understanding annotation as a form of adding layers to text Varieties of layered editions
Deeper discussion of the alignment step in the GM
An awareness of computation to understand that we do near-matching late (in the pipeline) for reasons of efficiency

Visualization as part of the text processing pipeline: making decisions, selecting formats, interacting with, and producing visualizations
Hands-on: a departure from Word Cloud
Review of XQuery