A generic tool to recognize the logical structure of documents from an OCR stream
Anyone who has to encode electronic documents from the original source material and needs to take into account the hierarchical structure represented in the digitized document.
The system recognizes the logical structure of documents from an OCR stream according to the description given in a model (DTD, XML Schema). The result is a hierarchically structured stream. The model encodes knowledge of both the macro-structure of the documents and the micro-structure of their content.
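As an illustration only, the idea of turning a flat OCR stream into a hierarchically structured result can be sketched as follows. This is not the tool's actual implementation or interface: the `MODEL` table, the `structure` function, the style labels and the element names are all hypothetical, and a small dictionary stands in for a real DTD or XML Schema. The sketch covers only the macro-structure (nesting of sections), not the micro-structure of the content.

```python
import xml.etree.ElementTree as ET

# Hypothetical model: element name and nesting depth for each OCR line style.
# A real model would be a DTD or an XML Schema; this table is a stand-in.
MODEL = {
    "title": ("section", 1),
    "subtitle": ("subsection", 2),
    "body": ("para", None),  # None: leaf attached to the innermost open element
}

def structure(ocr_lines, root_name="document"):
    """Turn a flat list of (style, text) OCR lines into a nested XML tree."""
    root = ET.Element(root_name)
    stack = [(0, root)]  # (depth, element) for the currently open containers
    for style, text in ocr_lines:
        name, depth = MODEL[style]
        if depth is None:
            ET.SubElement(stack[-1][1], name).text = text
            continue
        # Close containers at the same or a deeper level before opening a new one.
        while stack[-1][0] >= depth:
            stack.pop()
        element = ET.SubElement(stack[-1][1], name, {"heading": text})
        stack.append((depth, element))
    return root

# Hypothetical OCR output: one (style, text) pair per recognized line.
ocr_lines = [
    ("title", "Introduction"),
    ("body", "First paragraph."),
    ("subtitle", "Background"),
    ("body", "Second paragraph."),
    ("title", "Method"),
    ("body", "Third paragraph."),
]
tree = structure(ocr_lines)
print(ET.tostring(tree, encoding="unicode"))
```

The stack keeps track of which containers are still open, so a new heading closes every container at the same or a deeper level before it is attached to its parent.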
Any POSIX-compliant system