Ten Things You Should Know About Document Classification Methods |
||||
| By Manuel J. Montesino |
||||
For example, you classify a document as a sales order, as an order from a peculiar customer, for a peculiar product and of a peculiar date. With this data, you may retrieve a peculiar order, all orders from a peculiar customer, or for a sure product, and so on. 1. It is how users tend to search for a document that determines how it are going to be classified. In the example of the sale order above, the dissimilar classification criteria are all ways users tend to ask for order details. They may want to review a peculiar order, or all orders from a customer, or all orders for a product and so on. 2. Structured documents suchlike sales orders are stored in databases with specified structures. Database queries may then retrieve them by desired criteria and generate reports offering desired data. 3. Unstructured selective information suchlike communication exchange, e-mails, reports, etc. Can’t be so without apparent effort stored in structured databases. Instead, they feel inclined to be indexed by document metadata that integrate brief data in regards to the topic covered. For example, you may want to retrieve all reports on market conditions for a particular product. 4. Indexing by metadata will in truth work only whether or not there are galore standards for attaching the metadata. It will have to integrate common data, suchlike date of creation, author, and topic covered by the document. Secondly, alike documents will have to be described likewise by all people. To achieve this, choice lists are quintessentially standardized and users are provided drop-down selection boxes to choose one of these common selections. 5. Metadata may be extracted mechanically by the scheme when a document is developed, suchlike the date of creation, or entered manually by the user, as for the topic selection. 6. Full text search enables documents to be chosen by words in the document content. However, this is likely to provide inadequate results as the same words may take place in numerous documents and the search will lead to too numerous documents. 7. One resolution to having too numerous search results is to combine a hierarchical directory structure with search capablenesses. Documents are stored in directories and subdirectories with significant names, and you browse to the applicable subdirectory before invoking a search command fixed to that directory. 8. Classification and tagging of documents may serve intentions other than retrieval. For example, meta-tagging documents with their retire-by dates may aid programs to retrieve all documents that have expired and even dispose them as instructed by another meta-tag. This may reduce storage media costs by releasing storage space. 9. Documents may likewise be tagged by their business-sensitivity. Documents tagged as highly sensible may then be made accessible subject to particular limitations utilized mechanically. 10. Document classification may thence serve multiple goals intended to be attained. A microsoft blog (http://blogs. Technet. Com/filecab/archive/2009/05/11/windows -server-2008-r2-file-classification-infrastructure-managing- selective information-based-on-business-value. Aspx) reports that the most general tagging necessaries are personal information (yes/no), business criticality, confidentiality, project, and retention period. If documents are assigned properties therefore, schemes may automate assorted document-affiliated tasks leading to the kinds of business gains brought up in the blog. Document classification can’t be an ad-hoc practice carried out by the document creators. Instead, it will have to follow common conventions that have been created with particular attention to desired goals intended to be attained. These goals intended to be attained may include retrieval, retention and confidentiality goals intended to be attained. . |
||||
| Article Source: http://mowspace.co.za | ||||
| About The Author About Author: Ademero, Inc. develops paperless office software . Based largely on user experience, the company's flagship product, Content Central™, is a browser-based document management software system created to provide businesses and other organizations with a convenient way to capture, retrieve, and manage information originating in hard copy or digital form. |
||||
|
||||
| © 2010 mowspace.co.za |