A generic tool for classifying documents based on a hybrid learning technique
Anyone who needs to classify documents and already has a large set of classified documents to learn from.
The system is fully automatic and relies on linguistic resources extracted from already classified documents.
On a 100-class patent preclassification task, it achieves 85% precision, which is 5% better than human operators on the same task.
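
The general idea of learning from already classified documents can be illustrated with a minimal sketch. It is not the tool's actual hybrid technique, which is not described here. The function names (`extract_resources`, `classify`, `precision`), the whitespace tokenizer, and the term-weighting formula are all assumptions chosen for illustration. Precision is computed for the case where each document receives exactly one class.

```python
from collections import Counter, defaultdict
import math


def extract_resources(labeled_docs):
    """Build per-class term weights from (text, label) pairs.

    Hypothetical stand-in for "linguistic resources": each class gets a
    dictionary mapping terms to a weight that favours terms frequent in
    the class and rare across classes.
    """
    term_counts = defaultdict(Counter)  # label -> term frequencies
    for text, label in labeled_docs:
        term_counts[label].update(text.lower().split())

    class_freq = Counter()  # term -> number of classes using it
    for counts in term_counts.values():
        class_freq.update(counts.keys())

    n_classes = len(term_counts)
    resources = {}
    for label, counts in term_counts.items():
        total = sum(counts.values())
        resources[label] = {
            term: (freq / total) * math.log(1 + n_classes / class_freq[term])
            for term, freq in counts.items()
        }
    return resources


def classify(text, resources):
    """Return the class whose term weights best match the text."""
    tokens = text.lower().split()
    scores = {
        label: sum(weights.get(tok, 0.0) for tok in tokens)
        for label, weights in resources.items()
    }
    return max(scores, key=scores.get)


def precision(predicted, gold):
    """Fraction of assigned classes that match the reference classes."""
    correct = sum(p == g for p, g in zip(predicted, gold))
    return correct / len(gold)


# Toy training set (invented for illustration, not real patent data).
training = [
    ("engine piston combustion fuel", "mechanics"),
    ("piston valve engine cylinder", "mechanics"),
    ("circuit transistor voltage current", "electronics"),
    ("diode circuit resistor voltage", "electronics"),
]
resources = extract_resources(training)
```

With these toy resources, `classify("engine valve fuel", resources)` picks the class whose vocabulary overlaps the text, and `precision` compares a list of predictions against reference labels.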
Any POSIX-compliant system