The separation of collation into stages
As Collate0-2 developed, I learnt that one had to break the collation process into stages. At first, Collate 0-2 simply collated, and identified the variants as it found them. I soon learned that the complex requirements of scholarly collation demanded the adjustment of the collation at various points. To do this, it became clear that one had to separate out the stages of collation to permit intervention at various points.
However, this separation was grafted onto Collate0-2 in a a piecemeal fashion. I propose that from the beginning, CollateXML separate out what appear to me now as the following fundamental stages of collation:
- text alignment, one witness at a time against the base
- storage of alignment information for all witnesses against the base
- adjustment of alignment information for all witnesses against each other
- variant identification within the aligned texts.