Digital Thoreau and Parallel Segmentation
By Dana Wheeles on February 26, 2013
[Cross-posted at juxtasoftware.org]
Every now and then I like to browse the project list at DHCommons.org, just to get an idea of what kind of work is being done in digital scholarship around the world. This really paid off recently, when I stumbled upon Digital Thoreau, an engaging and well-structured site created by a group from SUNY-Geneseo. This project centers around a TEI-encoded edition of Walden, which will, to quote their mission statement, “be enriched by annotations links, images, and social tools that will enable users to create conversations around the text.” I highly recommend that anyone interested in text encoding take a look at their genetic text demo of “Solitude,” visualized using the Versioning Machine.
What really caught my attention, however, is that they freely offer a toolkit of materials from their project, including XML documents marked up in TEI Parallel Segmentation. This allowed me to take a closer look at how they encoded the text featured in the demo, and try visualizing it, myself.
This embed shows the same text featured on the Digital Thoreau site, now visualized in Juxta Commons. It is possible to import a file encoded in TEI Parallel Segmentation directly into Juxta Commons, and the software will immediately break down the file into its constituent witnesses (see this example of their base witness from Princeton) and visualize them as a comparison set.
Once you’ve successfully added the file to your account, you have access to the heat map visualization (where changes are highlighted blue on the chosen base text), the side-by-side option, and a histogram to give you a global view if the differences between the texts in the set. In this way, the Juxta Commons R&D hope to enable the use of our software in concert with other open-source tools.
I should also note that Juxta Commons allows the user to export any other sets they have created as a parallel-segmented file. This is a great feature for starting an edition of your own, but it no way includes the complexity of markup one would see in files generated by a rigorous project like Digital Thoreau. We like to think of it the Parallel Segmentation, and the (very new, experimental) edition builder export as building blocks for future scholarly editions.
Many thanks to the team at Digital Thoreau for allowing us to make use of their scholarship!
Posted in american studies, collation, digital humanities, juxtacommons, scholarship, text, text encoding | Tagged tei, thoreau | Leave a response