Abstract
The current role of computers in automatic document processing is briefly outlined, and some reasons are given why the early promise of library automation and of the mechanization of documentation processes has not been fulfilled.
A new dynamic document environment is then outlined in which clustered files are searched and information is retrieved following an interactive user-controlled search process. Methods are described for an automatic query modification based on user needs, and for a continuous reorganization of the stored information as a function of earlier file processing and of normal collection growth. The proposed procedures provide powerful tools for information retrieval and for the control of dynamic library collections in which new items are continually added and ones are retired.
- 1 Salton, G. Computers and libraries--a reply. Library J. 96, 18 (Oct. 1971), 3277-3282.Google Scholar
- 2 Locke, W.N. Computer costs for large libraries. Datamation 16, 2 (Feb. 1970), 69-74.Google Scholar
- 3 Vickery, B.C. Techniques of Information Retrieval. Archon Books, Hamden, Conn., 1970.Google Scholar
- 4 Salton, G. Automatic text analysis. Science 168, 17 (Apr. 1970), 335-343.Google ScholarCross Ref
- 5 Swanson, D.R. Searching natural language text by computer. Science 132, 21 (Oct. 1960), 1099-1104.Google ScholarCross Ref
- 6 Salton, G. The SMART Retrieval System--Experiments in Automatic DocumentProcessing. Prentice-Hall, Englewood Cliffs, N.J., 1971. Google ScholarDigital Library
- 7 Salton, G. A new comparison between conventional indexing (Medlars) and automatic text processing (SMART). J. ASIS 23, 2 (Mar.-Apr. 1972).Google Scholar
- 8 Lancaster, F.W. Evaluation of the operating efficiency of Medlars. Final Rep., 1968. Nat. Library of Medicine.Google Scholar
- 9 Lancaster, F.W. An evaluation of EARS (Epilepsy Abstracts Retrieval System) and factors governing its effectiveness. Report to N1NDS. U. of Illinois, Oct. 1971.Google Scholar
- 10 Salton, G. Automatic Information Organization and Retrieval. McGraw-Hill, New York, 1968. Google ScholarDigital Library
- 11 Salton, G. Search strategy and the optimization of retrieval effectiveness. In Mechanized Information Storage, Retrieval, and Dissemination. K. Samuelson, Ed. North Holland Pub. Co., Amsterdam, 1968, pp. 73-107.Google Scholar
- 12 Murray, D.M. Document Retrieval based on Clustered Files. Doer. Thesis, Cornell U. Dept. of Computer Science, 1972. Google ScholarDigital Library
- 13 Lesk, M.E., and Salton, G. Interactive search and retrieval methods using automatic information displays. Proc. AFIPS SJCC, AFIPS Press, Montvale, N.J., 1969, pp. 435--446.Google ScholarDigital Library
- 14 Salton, G. The performance of interactive information retrieval. Information Processing Letters 1, (1971), 35--41.Google ScholarDigital Library
- 15 Brauen, T.L. Document vector modification. In The SMART Retrieval System--Experiments in Automatic Document Processing. G. Salton, Ed. Ch. 24, Prentice-Hall, Englewood Cliffs, N.J., 1971, pp. 456--484.Google Scholar
- 16 Kerchner, M.D. Dynamic document processing in clustered Collections. Doct. Thesis, Scientific Report No. ISR-19, Dept. of Computer Science, Cornell U. Oct. 1971. Google ScholarDigital Library
- 17 Johnson, D.B., and Lafuente, J.M. A controlled single-pass classification algorithm with application to multilevel clustering. Scientific Report No. ISR-18, Section XII, Dept. of Computer Science, Cornell U., Oct. 1970.Google Scholar
- 18 Lesser, V.R. A modified two-level search algorithm using request clustering. Scientific Report N. ISR-11, Section VII, Dept. of Computer Science, Cornell U., June 1966.Google Scholar
- 19 Worona, S. Query clustering in a large document space. In The SMART Retrieval System--Experiments in Automatic Document Processing. G. Salton, Ed. Ch. 13, Prentiec-Hall, Englewood Cliffs, N.J., pp. 298-310.Google Scholar
- 20 Burton, R.E., and Kebler, R.W. The half-life of some scientific and technical literatures. American Documentation 11 (1960), 18-22.Google ScholarCross Ref
- 21 Sandison, A. The use of older literature and its obseolescene. J Documentation 27, 3 (Sept. 1971), 167-183.Google ScholarCross Ref
- 22 Grant, R.S. Predicting the need for multiple copies of books. J. Library Automation 4, 2 (June 1971).Google Scholar
- 23 Brookes, B.C. Obsolescence of special library periodicals: Sampling errors and utility contours J. ASIS (Sept.-Oct. 1970), 320-329.Google Scholar
- 24 Brookes, B.C. The growth, utility and obsolescence of scientific periodical literature. J. Documentation 26, 4 (Dec. 1970), 283-294.Google Scholar
- 25 Sage, C.R., Anderson, R.R., and Fitzwater, D.R. Adaptive information dissemination. Am. Documentation 16, 3 (July 1965), 185-200.Google ScholarCross Ref
- 26 Kelly, J. Negative response relevance feedback. In The SMART Retrieval System--Experiments in Automatic Document Processing. G. Salton, Ed. Ch. 20, Prentice-Hall, Englewood Cliffs, N.J. 1971, pp. 403--411.Google Scholar
- 27 Salton, G. Experiments in automatic thesaurus construction. Proc. IFIP Congress 71. In Information Processing 71, North Holland Pub. Co., Amsterdam, 1972.Google Scholar
Index Terms
- Dynamic document processing
Recommendations
Natural language processing versus content-based image analysis for medical document retrieval
One of the most significant recent advances in health information systems has been the shift from paper to electronic documents. While research on automatic text and image processing has taken separate paths, there is a growing need for joint efforts, ...
Comments