FutureTech Systems, Inc.
FutureTech Logo
 
             Leading you to the Future.
Our Mission & Our Profile Solutions & Services Provided By Us List of FutureTech customers Our Business Development Partners Research & Glossary Get Support for FutureTech products Future At FutureTech
Document Capture
Document Mgmt
Forms Processing
Web Portals
Document Storage
Glossary of Terms
 
Research & Glossary
Glossary
FutureTech Systems
 
This glossary of terms, as defined by FutureTech Systems, covers important terms and acronyms used in the document technology industry. 

The purpose of this glossary is help customers interested in document technologies gain a better understanding of industry terms. 
Term
Acronym
Definition
Automated Data Entry
-
Process of validating documents, checking for missing documents, and assigning indexes or keywords, prior to storing them in a document management system. 
Automatic Document Feeder
ADF
A scanner accessory which feeds a paper into the scanner. 
Black Border Removal
-
A software or hardware algorithm which removes the border around a scanned document.  A slight border of black may be created during scanning because scanners may read over the edge of the document.  Black Border Removal corrects thisand is an important feature of a tools category called Image Clean Up.
Bar Code Recognition
BCR
A graphical representation of lines of varying widths representing data.  Bar Codes are not readable by the human eye, but provide a high degree of recognition using Bar Code scanning technology.
Business Process
BP
A set of rules or definitions, written or un-written, governing information flow.
CCITT Group 4 Compression
G4
A two dimensional compression algorithm that uses patterns to compress binary images, black and white text and line art.
Certified Document Imaging Architect
CDIA
Industry certification for document management and imaging professionals.  Certification requires passing a test covering concepts on: capture, systems sizing, protocols, archival, storage and performance.
Charge-Coupled Device
CCD
Sensor used in scanners to read documents being scanned into memory.
Color Dropout
-
The use of specially colored lamps during scanning so the color form disappears leaving only data. 
Compression
-
An algorithm which runs via software or hardware reducing a document's file size.  Compression is used to reduce storage space.
Compression Ratio
-
The ratio of uncompressed to compressed files size.  Theoretical limit of G4 compression is 20 to 1.  Typical results vary based on the document type.  10-15 to 1 is common.
Database Management System
DBMS
Software offering database services, such as Oracle and Microsoft SQL Server.
Decompression
-
The opposite of compression, decompression is an algorithm extracting the compressed document's data.
Deskew
-
A software or hardware algorithm correcting scanned or slightly rotated documents.  Deskew is an important feature of a tools category called Image Clean Up.
Despeckle
-
Similar to copying except an electronic document image is created and stored on a computer.
Digitizing
-
The act of processing documents through the scanner creating digital files or images of the paper being processed.
Document Capture
-
Similar to copying except a document's electronic image is created and stored on a computer.
Document Management
DM/EDM
A system storing and retrieving both digital document images and electronic documents such as spreadsheets and word processing files.  Document Management systems often possess ‘librarian’-type functions such as check in/out or version control.
Document Preparation
-
Pre-scanning operation removing staples, notes, clips, bindings then sorts and inserts separator pages.
Dots Per Inch
DPI
A measurement of scanner resolution relative to the original document.  Each dot represents a pixel.  The more dots in a given inch, the better the clarity and higher the resolution. Resolution is also a function of file size.  Because compression algorithms are  based on patterns, higher resolution documents compress less, resulting in larger file sizes.
Duplex
-
A scanner capable of reading both the front and back sides of document simultaneously.  This decreases the effort to capture both images.
Duty Cycle
-
Recommended use level by the scanner manufacture measured in pages per month.  This is not the same as rated speed.
Endorser
-
Scanner accessory which stamps documents with a number or data key either pre or post scan.
Fat Client
-
Client software installed at each computer which communicates with another location's server.  The term "fat" is used because all client components are installed at the computer and the user typically operates a vendor specific interface.  The opposite of "Thin Client".
Forms Processing
-
The act of extracting information from documents after being captured.  Forms Processing software uses histogram technology to determine the form being processed, then applies a form specific template to aid the data extraction from the form.  Data extraction may be via OCR for machine print, Intelligent Character Recognition (ICR) for hand written data and Optical Mark Recognition (OMR) for check boxes. 
Full Page OCR
-
The process of converting graphical character representation on a page to text (ASCII) data.  See also: Optical Character Recognition
Grayscale
-
A scanner feature resulting in outputting multiple shades of gray.  While file size increases, it may create a more readable document.
Image Clean Up
-
A software or hardware algorithm correcting a variety of issues encountered as a result of the scanning process.  Typical features include: Desckew, Despeckle, Black Border Removal, Shading Removal, Line Removal.
Image Enhancement
-
A software or hardware algorithm correcting a variety of issues encountered as a result of the scanning process.  Typical features include: Deskew, Despeckle, Black Border Removal, Shading Removal, Line Removal.
Imprinter
-
Scanner accessory which stamps documents with a number or data key either pre or post scan.
ISIS
-
A scanner communication standard developed by Pixel Translations.
Joint Photographic Experts Group
JPEG
A compression algorithm primarily used for color documents.  Currently the dominant compression technique used on the internet and web today.
Landscape
-
Scanning or display of documents rotated 90 degrees.  That is, 11" wide by 8.5" tall.
Line Removal
-
A software or hardware algorithm removing vertical or horizontal lines from scanned documents, such as boxes on forms.  Line removal is an important feature of a category of tools called Image Clean Up.
Open Database Connectivity
ODBC
Microsoft standard for communicating with databases.  Applications supporting ODBC communicate with any database management system that is also ODBC compliant.
Operating System
OS
Computer software that communicates with other devices and networks connected to a computer and also provides an interface for users to operate. Such as Windows 98, NT, 2000 or XP.
Optical Character Recognition
OCR
Conversion of a character image representation to ASCII text representation. 
Optical Mark Sense
OMR
A technology used by Forms Processing applications to specific boxes checked on a from.  A checked box's data value may be "yes," and a blank box's data value may be "no." 
Oracle
-
A commercial database engine many software applications integrate with for database services.
Pages Per Minute
PPM
Unit of scanner performance.  Pages or documents scanned in a given minute, either simplex or duplex.  See also: Rated Speed
Patch Code
-
A unique graphic, developed by Kodak,  placed on separator sheet to indicate document separation and other scanning functions.
Pixel Element
Pixel
The smallest unit of measure represents all on-screen data.  Each character, image, picture or color is made up of a collection of pixels.
Portable Document Format
PDF
Document file format developed by Adobe storeing textual and graphical information in a single format.  While it is a proprietary format, the viewer is available for free.
Portal
-
A website functioning either as part of an intranet or the internet consolidating information from a variety of sources unbeknownst to the user.  This popular interface provides one location for finding information and is familiar to the user.
Portrait
-
Scanning or display of documents in typical orientation.  That is, 8.5" wide by 11" tall.
Rated Speed
-
Measurement of scanner performance based on scanning 200 DPI images through the scanner landscape measured in pages per minute.
Resolution
-
The density of pixels in a scanned document.  See also: Dots Per Inch
Retrieval
-
The act of finding a document in a document management or portal solution, equivalent to running a specific criteria search to locate the information required.
Scanning
-
Processing documents through the scanner to create digital files or images of the paper being processed.
Separator Sheet
-
A page inserted at the time of document preparation indicating the start of a new document to the scanning software.
Shade Removal
-
A software or hardware algorithm removing shaded areas on scanned documents.  Shade Removal is an important feature of a category of tools called Image Clean Up.
Simplex
-
A scanner capable of reading only one side of a document at a time.  Simplex scanners can scan double sided documents, but the document stack must be flipped, scanned and collated.  Presently, nearly all production level scanners are duplex.
Small Computer Systems Interface
SCSI
Industry standard for connecting and controlling peripheral devices such as scanners, printers and tape drives.
SQL Server
-
Microsoft's commercial database engine  many software applications integrate with for database services.
Structure Query Language
SQL
Standard language to communicate with database servers such as Oracle and SQL Server.
Tagged Image File Format
TIFF
Standard file format used by most document management systems.  While standard, there are many variations of this file format.  The most popular being a multi-page G4 TIFF file.
Thin Client
-
Web based application that communicates with a server.  The term thin is used because all that is needed on the client is the web browser, versus all client components. 
TWAIN
TWAIN
A software interface for controlling scanners and digital cameras.  Competes with ISIS technology.
Virtual ReScan
VRS
Technology developed by Kofax greatly improving the quality of post-scan documents using advanced enhancement techniques.
Workflow
-
An automated electronic document-routing system enabling users to quickly process work with greater efficiency and faster accuracy than traditional paper processing. Workflow instantly and electronically transmits documents through the business process as each increment or user work is completed. Workflow can be scheduled to run at the most convenient time and run completely unattended.
Zonal Optical Character Recognition
OCR
The process of converting graphical character representation on a page to text (ASCII) data.  See also: Full Page OCR
 
Contact us to learn more about these and other FutureTech products and services.
 
 

Need more information about this, just complete the form and a FutureTech consultant will contact you soon.

Name

Company

Email

Comments

 
Industry Info.
Knowledge management can increase productivity and profits.
 
 
Newsletter
Click here to subscribe for the newsletter and simply click on send button of your outlook.
 
Feedback
Click here to send your online opinion about this page.

Privacy PolicyLegal Copyright PolicyHow to reach us?Sitemap Perform a keyword Search             ©2009 FutureTech Systems, Inc.