OCR / HTML / SGML / XML conversion
Sometimes it is useful to have paper forms or documents in electronic form so they can be edited or published on CD or the Web. The process of conversion into accurate documents can be difficult, especially where formatting of the page is complex with graphics and tables.
DCS have developed our own sophisticated OCR and HTML conversion software which can accurately and cheaply convert large volumes of paper information into electronic form.
Typical conversion formats include Microsoft Office (DOC, XLS Files), ASCII text files, SGML and HTML format.
You can specify different levels of accuracy and quality checks dependant upon requirements. Our team of proof readers / typists can cope with large volumes of difficult technical and multi-language documents as well.
Some recent projects include:
- A magazine publishers past publication archive into HTML format
- Publication in SGML of scientific research data
- Implementation of a photographic web library
- Conversion of millions of pages of financial reports into Word
- Conversion of 2 million pages of form/report data into XML format
