
EOASkripts

This is a set of scripts that forms the central part of the conversion workflow of Edition Open Access (EOA).

The first part uses tralics to convert the TeX source to XML. The original DocBook output is enriched by various EOA-specific elements.

This intermediate XML file is subsequently used by three additional programs which turn it into TEI-XML, EPUB and Django-XML, respectively. The Django-XML format is ingested into the database of the EOA site where it will show up as an online publication.

The EPUB files can be put together to form an ebook. The script data/misc/epub.sh performs the required steps.

The conversion to TEI is still a work in progress.

The EOA workflow

Exemplary workflow

To install the whole toolchain, clone at least this repository and the 'advanced' branch of the EOA sample project. Then follow the installation instructions in INSTALL.md.

In eoa_sample_project, run xelatex once, then biber (the version included in your TeX distribution), then xelatex two more times. This produces the PDF version of the document.

Next, comment out line 9 in EOASample.tex (the EOA preamble), uncomment line 10 (the XML preamble), and run the older version of biber (biber v2.1):

biber_2.1 EOASample
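The preamble switch described above can also be scripted instead of edited by hand. The following is only a sketch: the function name switch_preamble is hypothetical, and it assumes that lines 9 and 10 of EOASample.tex are ordinary single-line statements that are disabled with a leading LaTeX comment character (%).

```python
def switch_preamble(tex_source: str, pdf_line: int = 9, xml_line: int = 10) -> str:
    """Comment out the EOA (PDF) preamble line and uncomment the XML preamble line.

    Line numbers are 1-based, as in the instructions above.
    Assumption: each preamble is a single line toggled with a leading '%'.
    """
    lines = tex_source.splitlines()

    # Comment out the EOA preamble unless it is already commented.
    i = pdf_line - 1
    if not lines[i].lstrip().startswith("%"):
        lines[i] = "% " + lines[i]

    # Uncomment the XML preamble by dropping the leading '%'.
    j = xml_line - 1
    stripped = lines[j].lstrip()
    if stripped.startswith("%"):
        lines[j] = stripped[1:].lstrip()

    return "\n".join(lines) + "\n"
```

To apply it, read EOASample.tex, pass its contents through the function and write the result back. Keeping a copy of the original file makes it easy to return to the PDF preamble later.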

Now, you are ready to run eoaconvert.py:

eoaconvert.py -f EOASample

If everything went well, you can also try and run

tralics2django.py
tralics2epub.py
tralics2tei.py

These scripts don't take any arguments and will produce output in the CONVERT directory.
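For orientation, the whole example workflow can be written down as an ordered list of commands. This is a dry-run sketch that only prints the commands and does not execute them. The function name workflow_commands is hypothetical, and passing the base name EOASample to xelatex and to the bundled biber is an assumption, since the text above only spells out the arguments for biber_2.1 and eoaconvert.py.

```python
def workflow_commands(basename="EOASample"):
    """Return the commands of the example workflow as (pdf_pass, xml_pass)."""
    # First pass: build the PDF (run inside eoa_sample_project).
    # Assumption: xelatex and biber are called with the document base name.
    pdf_pass = [
        ["xelatex", basename],
        ["biber", basename],
        ["xelatex", basename],
        ["xelatex", basename],
    ]
    # Second pass: run only after switching from the EOA to the XML preamble.
    xml_pass = [
        ["biber_2.1", basename],
        ["eoaconvert.py", "-f", basename],
        ["tralics2django.py"],
        ["tralics2epub.py"],
        ["tralics2tei.py"],
    ]
    return pdf_pass, xml_pass


if __name__ == "__main__":
    pdf_pass, xml_pass = workflow_commands()
    for command in pdf_pass + xml_pass:
        print(" ".join(command))
```

Each entry is a list of arguments, so it could be handed to subprocess.run(command, check=True) to stop at the first failing step. The preamble switch in EOASample.tex still has to happen between the two passes.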

Convert TEI to EOADjango

A suite of functions for bringing TEI-encoded XML into the workflow of Edition Open Access. The main output is an XML file called IntermediateXML.xml, which can subsequently be processed with tralics2django, a tool found in the EOASkripts repository.

The code is written in Python 3.

External dependencies

  • lxml
  • BeautifulSoup
  • pandoc
  • pandoc-citeproc
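A quick way to see whether these dependencies are available is a small check like the one below. It is a sketch under two assumptions that the list above does not state: that BeautifulSoup means BeautifulSoup 4, which is imported as bs4, and that pandoc and pandoc-citeproc are command-line programs expected on the PATH. The function name missing_dependencies is hypothetical.

```python
import importlib.util
import shutil

# Display name -> importable module name.
# Assumption: BeautifulSoup 4, whose import name is "bs4".
PYTHON_MODULES = {"lxml": "lxml", "BeautifulSoup": "bs4"}

# Assumption: these are executables looked up on the PATH.
EXECUTABLES = ["pandoc", "pandoc-citeproc"]


def missing_dependencies():
    """Return the names of the dependencies that could not be found."""
    missing = [
        name
        for name, module in PYTHON_MODULES.items()
        if importlib.util.find_spec(module) is None
    ]
    missing += [exe for exe in EXECUTABLES if shutil.which(exe) is None]
    return missing


if __name__ == "__main__":
    missing = missing_dependencies()
    if missing:
        print("Missing: " + ", ".join(missing))
    else:
        print("All external dependencies were found.")
```

The check only confirms that the modules and programs can be located. It does not verify their versions.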