Transcript
Integration and Availability
Integration with Back-End Systems
Recognition Server can easily be connected to external applications, including Enterprise Content Management and workflow systems, expanding existing applications and automated workflows. Companies looking to implement web services or other service-oriented applications can also easily integrate Recognition Server. Recognition Server Extended Edition offers a number of options for more advanced integration on the server. Functionality includes:
• XML-Ticket Support*
• Web Service API*
• COM-based API*
• Microsoft SharePoint Export*
System Requirements:
• PC with Intel® Pentium® or compatible processor, at least 500 MHz, 512 MB RAM
• Operating system: Microsoft Windows Server 2008/2003, Windows Vista, Windows XP or Windows 2000
PRODUCT INFORMATION
ABBYY Recognition Server 2.0 Professional Edition & Extended Edition
Server-based OCR and PDF Conversion Solution
ABBYY Recognition Server is a robust server-based solution for automating the recognition and document/PDF conversion process in enterprise environments. It is a scalable, reliable and rapidly deployable solution for high-performance delivery of optical character recognition (OCR) functionality in environments where centralised processing management and more flexible integration are needed. OCR functions can be used together with existing infrastructures and third-party applications.
Input Formats:
• BMP
• PCX
• JPEG
• JPEG 2000
• PNG
• TIFF
• PDF (up to PDF 1.6)
• DjVu
Output Formats:
• DOC (Microsoft Word)
• DOCX (Microsoft Word 2007)
• RTF
• PDF, PDF/A, PDF (Version 1.6)
• HTML
• CSV (comma separated values)
• TXT
• XLS (Microsoft Excel)
• XLSX (Microsoft Excel 2007)
• Original image files
• TIFF
• JPEG
• JPEG 2000
OCR via a Service-Based Architecture
Automated OCR and document conversion accessible from multiple points
ABBYY Recognition Server 2.0 can be accessed by many users, MFPs and workgroup scanners. Scanned documents and PDFs can be processed from a variety of sources, including network and FTP folders and Microsoft® Exchange mailboxes. Employees can use the server without special training. Processing a document is as simple as dropping a file into a folder. This functionality is available only for ABBYY Recognition Server 2.0 Extended Edition.
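The idea of dropping a file into a folder can be illustrated with a short sketch. This is not ABBYY code and does not use the Recognition Server API. It is a generic polling routine written under assumptions: the function name `scan_new_files`, the `seen` set and the mapping of the listed input formats to file suffixes are all invented for illustration.

```python
from pathlib import Path

# Suffixes guessed from the brochure's input-format list (an assumption of this sketch).
INPUT_SUFFIXES = {".bmp", ".pcx", ".jpg", ".jpeg", ".jp2", ".png",
                  ".tif", ".tiff", ".pdf", ".djvu"}


def scan_new_files(folder, seen):
    """Return supported files in `folder` that are not yet in `seen`.

    Each returned file name is added to `seen`, so calling the function
    again reports only files that arrived since the previous call.
    """
    new_files = []
    for path in sorted(Path(folder).iterdir()):
        supported = path.suffix.lower() in INPUT_SUFFIXES
        if path.is_file() and supported and path.name not in seen:
            seen.add(path.name)
            new_files.append(path)
    return new_files
```

Calling `scan_new_files` at regular intervals with the same `seen` set would yield each newly dropped file exactly once, which is the behaviour a watched folder needs before files are queued for recognition.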
Availability
Recognition Server is sold in two product versions:
• Recognition Server 2.0 Professional Edition, for fast installation and automated OCR. The Professional Edition offers standard functionality for enterprises that need only partial adjustment and almost no integration with other applications.
• Recognition Server 2.0 Extended Edition, which offers complete integration with external applications and implementation as part of a web service architecture. The Extended Edition also offers additional OCR languages (Thai and Hebrew), XML-Export, Web Service API, COM-based API as well as support of XML-Tickets and Microsoft SharePoint Export.
Both versions can be extended by adding further Processing and Verification Stations. FineReader XIX (recognition of Black letter script) and OCR for Chinese, Japanese and Korean are available only for Recognition Server 2.0 Extended Edition. One year of maintenance, support and updates is included. After this period customers have the option to purchase an extended support and maintenance agreement. ABBYY offers fully functional test versions with time and quantity limits. More information about the above-mentioned product versions, pricing, supported formats and technologies can be obtained through ABBYY or one of its partners.
Independent, server-based processing
Recognition and conversion take place remotely on the server as a background process, working independently without affecting other tasks running on client workstations. Processing can take place unattended and “instantly” upon placement of a document in a “watched folder”. Administrators can also schedule processing for specific times.
High-volume processing, easy scalability
Recognition Server 2.0 delivers parallel processing and automatically distributes the workload to maximise its capacity. Processing performance can easily be increased by using additional workstations and CPU cores. Scheduling functions make it easy to process high volumes of documents overnight or at scheduled times.
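How the product balances its workload internally is not described here. As a rough illustration of automatic workload distribution, the sketch below uses a common greedy rule: each job goes to the station with the fewest pages assigned so far. The function name `distribute`, the station names and the use of page counts as the load measure are assumptions made for this example.

```python
import heapq


def distribute(jobs, station_names):
    """Assign each job to the station with the fewest pages assigned so far.

    `jobs` is a list of (job_name, page_count) pairs. Returns a dict that
    maps each station name to the list of job names it received.
    """
    # Heap entries are (pages_assigned, station_name); the lightest load pops first.
    heap = [(0, name) for name in station_names]
    heapq.heapify(heap)
    assignment = {name: [] for name in station_names}
    # Placing the largest jobs first gives a more even split with this greedy rule.
    for job_name, pages in sorted(jobs, key=lambda job: -job[1]):
        load, name = heapq.heappop(heap)
        assignment[name].append(job_name)
        heapq.heappush(heap, (load + pages, name))
    return assignment
```

With this rule, adding a station to `station_names` spreads the same jobs over more workers, which mirrors the statement that performance increases with additional workstations and CPU cores.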
ABBYY Europe GmbH, Elsenheimerstr. 49, 80687 Munich, Germany. Tel: +49 89 51 11 59 - 0, Fax: +49 89 51 11 59 - 59
Centralised management
Set-up and administration of OCR processing and workloads are all managed centrally and processing parameters are defined remotely via the Management Console. Recognition Server administrators are able to set up individual workflows for workgroups or projects.
International and flexible
Recognition Server 2.0 recognises text in more than 190 languages and processes multilingual documents as well as 1D and 2D barcodes. It also offers document separation functions through the use of barcodes and blank pages.
Multiple output formats, flexible export
Scanned documents, PDFs and image files can be converted into multiple file formats, such as plain text, Word, Excel®, HTML, PDF (including searchable PDFs), PDF/A and MRC compressed PDFs. Converted documents can be saved on the network, sent via e-mail or published to a Microsoft SharePoint® Server library*.
Fault tolerance
Integrated functions, such as job and server logging and auto-start of OCR and management processes after a restart, ensure stability and fault tolerance.
Simple SOA-compatible integration
Recognition Server is flexible and adapts to the changing requirements of enterprises. The software can be used either as a stand-alone solution or together with external applications. For integration, XML-Ticket Support*, COM-based API* and Web Service (SOAP) API* are available.
* This function is only available in ABBYY Recognition Server 2.0 Extended Edition.
Product advantages at a glance:
• Central OCR: enterprise-level processing
• High-volume processing and scalability
• Unsurpassed recognition quality and layout retention
• Ongoing system stability and data safety
• Scalable backend, centralised administration
• Easy integration with additional systems
• Multiple output formats and operations sequences
Target Audience
Mid- to large-sized companies that wish to digitise large volumes of documents automatically and continually while remaining efficient and saving costs. Documents can be processed, converted or extracted automatically or by users to PDFs or other Office formats, e.g.:
• Companies, publishing houses
• Public authorities
• Agencies and consultancies
• Service providers and service bureaus
www.ABBYY.com © 2008 ABBYY. All rights reserved. © 1987-2003 Adobe Systems Incorporated. Adobe® PDF Library is licensed from Adobe Systems Incorporated. Fonts Newton, Pragmatica, Courier © 2001 ParaType, Inc. Font OCR-v-GOST © 2003 ParaType, Inc. © 1999-2000 Image Power, Inc. and the University of British Columbia, Canada. © 2001-2002 Michael David Adams. All rights reserved. © 2001-2004 NewSoft Technology Corporation. All rights reserved. Portions of this computer program are copyright © 1996-2007 LizardTech, Inc. All rights reserved. DjVu is protected by U.S. Patent No. 6,058,214. Foreign Patents Pending. ABBYY, the ABBYY Logo are registered trademarks or trademarks of ABBYY Software Ltd. Adobe, the Adobe Logo, the Adobe PDF Logo and Adobe PDF Library are either registered trademarks or trademarks of Adobe Systems Incorporated in the United States and/or other countries. Microsoft, Excel, Outlook, Windows, Windows Vista, SharePoint are either registered trademarks or trademarks of Microsoft Corporation in the United States and/or other countries. Unicode is a trademark of Unicode, Inc. All other trademarks are the property of their respective owners.
ABBYY Recognition Server 2.0
Functionality Overview
Setup and Management
Administration
ABBYY Recognition Server is administered remotely via the Microsoft Management Console (MMC). All system settings, including workflows, job lists, Processing Station properties, licences and server log files, can be edited at a central location.
Scalability
ABBYY Recognition Server can satisfy the needs of every type of enterprise. The Server Manager and Processing Stations can be installed on a multi-core system or they can be distributed in the network. In each scenario, the flexible architecture scales in an almost linear way. Thus, systems which process hundreds of pages per minute can easily be set up.**
Processing Features
Document Separation and Naming
Recognition Server can separate scanned batches into different documents. Batches can be divided by a fixed number of pages, or at blank pages or barcodes. Documents in a sub-folder can also be merged into a single document. Date, time, specific text and barcode values can be used for naming documents.
Quality Control and Verification
Verification Stations enable employees or operators to manually check or verify processing results. Verifiers can, for example, resize and check block types (image, text, tables) or manually change the OCR results. Verification Stations can be installed on multiple workstations. The Server Manager controls the number of simultaneously used stations through concurrent licensing.
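The separation and naming rules described under Document Separation and Naming can be pictured with a small sketch. It is an illustration only, not the product's implementation: the function names, the `is_blank` callback and the file-name pattern are assumptions, and pages are represented by plain Python values rather than scanned images.

```python
from datetime import datetime


def split_fixed(pages, pages_per_document):
    """Split a batch into documents of a fixed number of pages."""
    return [pages[i:i + pages_per_document]
            for i in range(0, len(pages), pages_per_document)]


def split_on_blank_pages(pages, is_blank):
    """Split a batch at blank separator pages; the blank pages are dropped."""
    documents, current = [], []
    for page in pages:
        if is_blank(page):
            if current:  # close the document collected so far
                documents.append(current)
                current = []
        else:
            current.append(page)
    if current:
        documents.append(current)
    return documents


def name_document(prefix, created, barcode_value=None):
    """Build a file name from fixed text, date/time and an optional barcode value."""
    parts = [prefix, created.strftime("%Y%m%d_%H%M%S")]
    if barcode_value:
        parts.append(barcode_value)
    return "_".join(parts) + ".pdf"
```

Separation by barcode would follow the same pattern as `split_on_blank_pages`, with a callback that reports whether a page carries a separator barcode.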
[Chart: pages per minute (100 to 600) plotted against number of CPUs (20 to 80).]
Workflow and Job Distribution
A Recognition Server workflow is the smallest administrative unit. The workflow includes all processing parameters, like input sources from which the documents are taken for processing. A job can include a single page image, a multiple page image or PDF file. The administrator sets priorities for each workflow in advance so that all jobs are automatically loaded throughout the entire system.
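Workflow priorities of the kind described under Workflow and Job Distribution are often implemented with a priority queue. The sketch below shows that general technique and makes no claim about how Recognition Server handles priorities internally. The class name `JobQueue`, its methods and the convention that a lower number means a higher priority are assumptions of this example.

```python
import heapq
import itertools


class JobQueue:
    """Jobs from higher-priority workflows are taken first; ties keep arrival order."""

    def __init__(self):
        self._heap = []
        self._arrival = itertools.count()  # running counter used as a tie-breaker

    def add(self, job_name, workflow_priority):
        # Lower number = higher priority (a convention chosen for this sketch).
        heapq.heappush(self._heap, (workflow_priority, next(self._arrival), job_name))

    def next_job(self):
        """Return the name of the next job, or None when the queue is empty."""
        if not self._heap:
            return None
        return heapq.heappop(self._heap)[2]
```

Because the priority is attached to the workflow rather than to the individual file, every job added from the same workflow would be queued with the same `workflow_priority` value.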
Scheduled and Efficient Processing
Particular workflows can be set up to run at specific times or on a regular basis (i.e. daily, weekly or monthly). This kind of scheduling increases efficiency, especially when processing is scheduled overnight or at times of the lowest workload. Recognition Server 2.0 is very flexible: administrators can also configure the system so that different Processing Stations are used at different times.
Recognition and PDF Conversion
ABBYY Recognition Server is based on ABBYY’s award-winning recognition technology, which is known for its accuracy and stability. Recognition Server provides automated document optimisation and pre-processing functions, which include splitting dual pages (for book scans), converting colour and grey to black and white, clearing background noise, image deskewing and despeckling. Formats and functions included:
• Print Type: normal text, typewriter, dot-matrix, OCR-A, OCR-B, and MICR (E13b).
• Languages: more than 190 languages, including multilingual documents.
• Special Fonts***: Black letter, Schwabacher and most other Gothic fonts printed between 1700 and 1937 in English, German, French, Italian and Spanish.
• Barcodes: most popular 1D and 2D barcodes, positioned at any angle on a document.
• Processing Speed: three OCR modes: precision, speed and balanced.
Output Formats
Recognition Server 2.0 offers a variety of output formats: DOC, DOCX, RTF, XML*, XLS, XLSX, HTML, TIFF, JPEG, JPEG 2000 and more. It is possible to generate multiple output formats for one input document. The results can be sent by e-mail or placed in different locations, i.e. a folder or a SharePoint Server*. Recognition Server supports export to “simple” searchable PDFs, linear and tagged PDFs, PDF files with security options and encryption, and PDF/A for long-term archiving. It is also possible to generate highly compressed MRC-PDFs, which can effectively manage colour documents.
Overview Processing Steps
1. Document Import: During the first processing step, the Server Manager imports files from the input source (i.e. shared folder, FTP folder, or mailbox folder) and arranges them in a queue for processing.
2. Recognition: Next, files are evenly distributed among the available Processing Stations for recognition.
3. Verification (optional): When exact accuracy is required, an optional Verification Station can be set up where results can be checked manually.
4. Export: Following recognition and verification, the Server Manager delivers the output document to its destination, which can be a network folder, a SharePoint® library*, an e-mail address or an application that uses the API.
The Server Manager and Processing Station are Windows® services. Each component can be installed separately in the network or on an individual computer. The Server Manager administers and monitors processing on all available Processing Stations.
[Diagram: Input Folder, Server Manager, Processing Station, Verification Station, Output Folder.]
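The four processing steps (import, recognition, optional verification, export) can be summarised as a simple pipeline. The sketch below is only a schematic of that sequence. The function `run_pipeline` and its callback parameters `recognise`, `verify` and `export` are hypothetical stand-ins and are not part of any ABBYY API.

```python
def run_pipeline(input_files, recognise, export, verify=None):
    """Run each file through import, recognition, optional verification and export.

    `recognise(name)` returns a result for one file, `verify(name, result)`
    returns the (possibly corrected) result, and `export(name, result)`
    delivers it and returns whatever the destination reports back.
    """
    queue = list(input_files)                  # 1. Document Import: build the queue
    delivered = []
    for name in queue:
        result = recognise(name)               # 2. Recognition
        if verify is not None:                 # 3. Verification (optional)
            result = verify(name, result)
        delivered.append(export(name, result)) # 4. Export
    return delivered
```

Leaving `verify` unset models a workflow without a Verification Station: results pass straight from recognition to export.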
* This functionality is available only for ABBYY Recognition Server 2.0 Extended Edition.
** The diagram reflects the results of tests conducted by ABBYY. System performance can vary depending on the quality of images as well as hardware performance and network configuration.
*** Upon request.