From the account of the collation, we are dealing with something very different from ’string comparison’. Indeed, the base unit of the collation is the word: we collate words, not strings. Words may be concatenated, or divided: but words are the basis of it all. (This was the form used by Collate).
For each witness, we need the following information:
- Its sigil
- Its location (in Collate0-2 this was simply a file name; in CollateXML it might be a url, an xquery or xpath expression, etc)
- Collation specifications for this witness. See below.
- For each collateable block: two collateable object arrays. See below
- For each collateable block: an array of correspondences with the base. See below.