What is PoCoTo?
PoCoTo is a tool for the postcorrection of OCR'd documents. It was developed by the IMPACT working group at the Centrum für Informations- und Sprachverarbeitung, University of Munich. Additional information can be found here.
PLEASE NOTE: The development of PoCoTo will continue at https://github.com/cisocrgroup
FAQ
-
How is it implemented?
PoCoTo is written in Java. It is built as a NetBeans Platform application.
-
What do I need to run it?
All you need is a Java runtime on your machine (version > 7).
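To check which Java runtime is installed, a small stand-alone program along the following lines can help. This is only a sketch and not part of PoCoTo: the class name JavaVersionCheck and the method majorVersion are made up for illustration. It relies on the standard system property java.specification.version, which reads "1.7" or "1.8" for older runtimes and "9", "11", "17", ... for newer ones.

```java
public class JavaVersionCheck {

    // Hypothetical helper: turns a java.specification.version value
    // ("1.8", "11", ...) into a plain major version number (8, 11, ...).
    public static int majorVersion(String spec) {
        // Runtimes up to Java 8 report "1.x"; later ones report "x".
        String s = spec.startsWith("1.") ? spec.substring(2) : spec;
        int dot = s.indexOf('.');
        if (dot >= 0) {
            s = s.substring(0, dot);
        }
        return Integer.parseInt(s);
    }

    public static void main(String[] args) {
        int major = majorVersion(System.getProperty("java.specification.version"));
        // The FAQ states the requirement as "version > 7".
        if (major > 7) {
            System.out.println("Java " + major + " detected: meets the stated requirement.");
        } else {
            System.out.println("Java " + major + " detected: a newer runtime is needed.");
        }
    }
}
```

Compile it with javac JavaVersionCheck.java and run it with java JavaVersionCheck. Running java -version on the command line gives the same information without any code.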
-
What types of OCR'd files can be loaded?
At the moment PoCoTo can deal with Google hOCR files as well as with ABBYY XML output.
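If you are unsure which of the two formats a given file is, a rough content check can tell them apart before you load it. The sketch below is not how PoCoTo itself detects formats; the class OcrFormatSniffer and its detect method are hypothetical. It assumes that hOCR files mark pages with the class name ocr_page and that ABBYY FineReader XML declares a namespace containing abbyy.com/FineReader_xml. Both markers are heuristics and may miss unusual files.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

public class OcrFormatSniffer {

    public enum Format { HOCR, ABBYY_XML, UNKNOWN }

    // Heuristic only: looks for a marker string typical of each format.
    public static Format detect(String content) {
        if (content.contains("abbyy.com/FineReader_xml")) {
            return Format.ABBYY_XML; // assumed ABBYY namespace fragment
        }
        if (content.contains("ocr_page")) {
            return Format.HOCR;      // assumed hOCR page class name
        }
        return Format.UNKNOWN;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java OcrFormatSniffer <file>");
            return;
        }
        // Reads the whole file as UTF-8 text; fine for single-page OCR output.
        byte[] bytes = Files.readAllBytes(Paths.get(args[0]));
        String content = new String(bytes, StandardCharsets.UTF_8);
        System.out.println(detect(content));
    }
}
```

A file reported as UNKNOWN is not necessarily unusable. It only means that neither marker was found.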
-
What types of image files can be used?
Right now, only .tiff files are supported.
-
Where can I get the software?
See: https://github.com/cisocrgroup/PoCoTo for the latest version.
If you want to use this old version, you are on your own ...
-
Where can I find additional information?
See the GitHub wiki that belongs to this project, or email @thorstenv or @ciskristof.