# EOASkripts
This is a set of scripts that forms the central part of the conversion workflow of the Edition Open Access (EOA).
The first part uses
[tralics](https://www-sop.inria.fr/marelle/tralics/) to convert the
TeX source to XML. The original DocBook output is enriched by various
EOA-specific elements.
This intermediate XML file is subsequently used by three additional
programs which turn it into TEI-XML, EPUB and Django-XML,
respectively. The Django-XML format is ingested into the database of
the EOA site where it will show up as an online publication.
The EPUB files can be put together to form an ebook. The script
`data/misc/epub.sh` performs the required steps.
The conversion to TEI is still a work in progress.
![The EOA workflow](data/img/workflow.png)
## Example workflow
To install the whole toolchain, clone at least this repository as well
as the 'advanced' branch of
[EOA sample project](https://github.molgen.mpg.de/EditionOpenAccess/eoa_sample_project).
Follow the installation instructions in `INSTALL.md`.
In `eoa_sample_project`, run `xelatex`, `biber` (the version included
in your TeX distribution) and `xelatex` two more times. This will give
you the PDF version of the document.
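
The build sequence above can also be scripted. The following is a minimal sketch and not part of EOASkripts: the function names `pdf_build_commands` and `run_commands` are hypothetical, and it assumes that `xelatex` and `biber` are on the `PATH` and accept the base name of `EOASample.tex` without the suffix.

```python
import subprocess


def pdf_build_commands(basename="EOASample"):
    """Return the command sequence for the PDF build described above.

    Assumption: both tools accept the base name without the .tex suffix.
    """
    # One xelatex pass, one biber pass, then xelatex two more times.
    return [
        ["xelatex", basename],
        ["biber", basename],
        ["xelatex", basename],
        ["xelatex", basename],
    ]


def run_commands(commands, cwd="."):
    """Run each command in turn inside cwd, stopping at the first failure."""
    for command in commands:
        # check=True raises CalledProcessError on a non-zero exit status.
        subprocess.run(command, cwd=cwd, check=True)
```

Calling `run_commands(pdf_build_commands(), cwd="eoa_sample_project")` would then run the four steps inside the sample project directory.
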
Next, comment out line 9 in `EOASample.tex` (the EOA preamble),
uncomment line 10 (the XML preamble) and run the older version of
biber (biber v2.1):

    biber_2.1 EOASample

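
The preamble switch described above (comment out line 9, uncomment line 10) could be automated along these lines. This is only a sketch: the helper names `set_commented` and `switch_preamble` are hypothetical, and it assumes that a line counts as commented when its first non-blank character is the TeX comment sign `%`.

```python
def set_commented(line, commented):
    """Return line with a leading TeX comment sign (%) added or removed."""
    is_commented = line.lstrip().startswith("%")
    if commented and not is_commented:
        return "%" + line
    if not commented and is_commented:
        # Remove only the first % so the rest of the line is unchanged.
        return line.replace("%", "", 1)
    return line


def switch_preamble(lines, comment_no=9, uncomment_no=10):
    """Comment out one line and uncomment another (1-based line numbers)."""
    result = list(lines)
    result[comment_no - 1] = set_commented(result[comment_no - 1], True)
    result[uncomment_no - 1] = set_commented(result[uncomment_no - 1], False)
    return result
```

The function works on a list of lines, for instance the result of `Path("EOASample.tex").read_text().splitlines(keepends=True)`, and leaves all other lines untouched. Running it twice gives the same result as running it once.
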
Now, you are ready to run `eoaconvert.py`:

    eoaconvert.py -f EOASample

If everything went well, you can also try and run

    tralics2django.py
    tralics2epub.py
    tralics2tei.py

These scripts don't take any arguments and will produce output in the
`CONVERT` directory.
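
The XML side of the workflow can be chained in the same order as the steps above. The sketch below is an illustration under assumptions, not a script shipped with this repository: `conversion_commands` and `run_pipeline` are hypothetical names, and all five programs are assumed to be on the `PATH` and started from the project directory.

```python
import subprocess


def conversion_commands(basename="EOASample"):
    """Return the XML-side commands in the order given in this README."""
    return [
        ["biber_2.1", basename],
        ["eoaconvert.py", "-f", basename],
        # The three follow-up scripts take no arguments.
        ["tralics2django.py"],
        ["tralics2epub.py"],
        ["tralics2tei.py"],
    ]


def run_pipeline(commands, runner=subprocess.call):
    """Run commands in order; return the first failing command, or None."""
    for command in commands:
        # runner must return the exit status; 0 means success.
        if runner(command) != 0:
            return command
    return None
```

Stopping at the first non-zero exit status mirrors the "if everything went well" condition: the later scripts only make sense once `eoaconvert.py` has succeeded. The `runner` parameter exists so that the chain can be tried out with a stand-in function before real programs are started.
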
# Convert TEI to EOADjango #
Suite of functions to get from TEI-encoded XML into the workflow of Edition Open Access. The main output file is an XML file called `IntermediateXML.xml`, which can subsequently be processed with `tralics2django`, a tool found in the `EOASkripts` repository.
The code is written in Python 3.
External dependencies
---------------------
- lxml
- BeautifulSoup
- pandoc
- pandoc-citeproc
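
Whether the dependencies listed above are available can be checked before running the conversion. This is a minimal sketch with a hypothetical function name, `check_dependencies`. It assumes that BeautifulSoup is installed as the `bs4` package and that `pandoc` and `pandoc-citeproc` are used as command-line programs found on the `PATH`.

```python
import importlib.util
import shutil

# Assumption: BeautifulSoup is installed as the bs4 package (BeautifulSoup 4).
PYTHON_MODULES = {"lxml": "lxml", "BeautifulSoup": "bs4"}
# Assumption: pandoc and pandoc-citeproc are used as command-line programs.
EXECUTABLES = ["pandoc", "pandoc-citeproc"]


def check_dependencies():
    """Map each dependency name to True if it was found, else False."""
    found = {}
    for name, module in PYTHON_MODULES.items():
        # find_spec returns None when the module cannot be located.
        found[name] = importlib.util.find_spec(module) is not None
    for program in EXECUTABLES:
        # shutil.which returns None when the program is not on the PATH.
        found[program] = shutil.which(program) is not None
    return found
```

Printing the returned dictionary shows at a glance which of the four dependencies still needs to be installed.
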