Spaces:
Runtime error
Runtime error
.I 1 | |
.W | |
What problems and concerns are there in making up descriptive titles? | |
What difficulties are involved in automatically retrieving articles from | |
approximate titles? | |
What is the usual relevance of the content of articles to their titles? | |
.I 2 | |
.W | |
How can actually pertinent data, as opposed to references or entire articles | |
themselves, be retrieved automatically in response to information requests? | |
.I 3 | |
.W | |
What is information science? Give definitions where possible. | |
.I 4 | |
.W | |
Image recognition and any other methods of automatically | |
transforming printed text into computer-ready form. | |
.I 5 | |
.W | |
What special training will ordinary researchers and businessmen need for proper | |
information management and unobstructed use of information retrieval systems? | |
What problems are they likely to encounter? | |
.I 6 | |
.W | |
What possibilities are there for verbal communication between computers and | |
humans, that is, communication via the spoken word? | |
.I 7 | |
.W | |
Describe presently working and planned systems for publishing and printing | |
original papers by computer, and then saving the byproduct, articles coded in | |
data-processing form, for further use in retrieval. | |
.I 8 | |
.W | |
Describe information retrieval and indexing in other languages. | |
What bearing does it have on the science in general? | |
.I 9 | |
.W | |
What possibilities are there for automatic grammatical and contextual analysis | |
of articles for inclusion in an information retrieval system? | |
.I 10 | |
.W | |
The use of abstract mathematics in information retrieval, e.g. group theory. | |
.I 11 | |
.W | |
What is the need for information consolidation, evaluation, and retrieval in | |
scientific research? | |
.I 12 | |
.W | |
Give methods for high speed publication, printing, and distribution of | |
scientific journals. | |
.I 13 | |
.W | |
What criteria have been developed for the objective evaluation of information | |
retrieval and dissemination systems? | |
.I 14 | |
.W | |
What future is there for automatic medical diagnosis? | |
.I 15 | |
.W | |
How much do information retrieval and dissemination systems, as well as | |
automated libraries, cost? | |
Are they worth it to the researcher and to industry? | |
.I 16 | |
.W | |
What systems incorporate multiprogramming or remote stations in information | |
retrieval? What will be the extent of their use in the future? | |
.I 17 | |
.W | |
Means of obtaining large volume, high speed, customer usable | |
information retrieval output. | |
.I 18 | |
.W | |
What methods are there for encoding, automatically matching, | |
and automatically drawing structures extended in two dimensions, | |
like the structural formulas for chemical compounds? | |
.I 19 | |
.W | |
Techniques of machine matching and machine searching systems. | |
Coding and matching methods. | |
.I 20 | |
.W | |
Testing automated information systems. | |
.I 21 | |
.W | |
The need to provide personnel for the information field. | |
.I 22 | |
.W | |
Automated information in the medical field. | |
.I 23 | |
.W | |
Amount of use of books in libraries. | |
Relation to need for automated information systems . | |
.I 24 | |
.W | |
Educational and training requirements for personnel in the information field. | |
Possibilities for this training. Needs for programs providing this training. | |
.I 25 | |
.W | |
International systems for exchange and dissemination of information. | |
.I 26 | |
.W | |
Cost and determination of cost associated with systems of automated information. | |
.I 27 | |
.W | |
Computerized information retrieval systems. Computerized indexing systems. | |
.I 28 | |
.W | |
Computerized information systems in fields related to chemistry. | |
.I 29 | |
.W | |
Specific advantages of computerized index systems. | |
.I 30 | |
.W | |
Information dissemination by journals and periodicals. | |
.I 31 | |
.W | |
Information systems in the physical sciences. | |
.I 32 | |
.W | |
Attempts at computerized and mechanized systems for general libraries. | |
Problems and methods of automated general author and title indexing systems. | |
.I 33 | |
.W | |
Retrieval systems which provide for the automated transmission of information | |
to the user from a distance. | |
.I 34 | |
.W | |
Methods of coding used in computerized index systems. | |
.I 35 | |
.W | |
Government supported agencies and projects dealing with information dissemination. | |
.I 36 | |
.W | |
What are some of the theories and practices in computer translating of | |
texts from one national language to another? How can machine translating | |
compete with traditional methods of translating in comprehending nuances | |
of meaning in languages of different structures? | |
.I 37 | |
.W | |
What lists of words useful for indexing or classifying material are | |
available? Wanted are lists of terms that are descriptive vocabularies | |
of particular fields or schedules of words that are related to each other | |
in meaningful schemes. Wanted are lists that have been tested, at least to | |
some extent, and found useful for organizing material and for retrieving it. | |
.I 38 | |
.W | |
How can access words in an information retrieval system be kept up to date? | |
Word meanings and usage often change and lists must be dynamic to be current. | |
What definitions of the problem and progress toward solutions have been made | |
in providing necessary flexibility in systems of subject headings, index | |
words, or other symbols used for getting at stored data? | |
.I 39 | |
.W | |
The progress of information retrieval presents problems of maladjustment | |
and dislocation of personnel. Training and retraining of people to use | |
the new equipment is important at all levels. Librarians, assistants, | |
technicians, students, researchers, and even executives will need education | |
to learn the purpose, values, and uses of information systems and hardware. | |
What programs have been developed to change the attitudes and skills of | |
traditional workers and help them to learn the newer techniques? | |
.I 40 | |
.W | |
What is the status of machine translation? What progress has been | |
made in the use of computers to transfer from one language to another | |
with some degree of automation? What problems and stumbling blocks | |
have been found and are they considered to be insurmountable limitations | |
or only challenging to the field of documentation on an international scale? | |
.I 41 | |
.W | |
Is alphabetical ordering of material considered to be a useful tool in | |
information retrieval? What studies have been done to compare the | |
effectiveness of alphabetical order with other organization schemes? | |
Is there a generally accepted form of arranging material in | |
alphabetical order, and is there an easy way of achieving this form | |
without going to a great amount of effort? | |
.I 42 | |
.W | |
The average student or researcher has difficulty in comprehending the | |
vocabulary of information retrieval. It appears important that this | |
new field be understood before it is to be fully accepted. What basic | |
articles would provide an understanding of the various important aspects | |
of the information storage and retrieval? | |
.I 43 | |
.W | |
The difficulties encountered in information retrieval systems are often | |
less related to the equipment used than to the failure to plan | |
adequately for document analysis, indexing, and machine coding. The | |
position of the programmer is to take a problem and write it in a way | |
in which the equipment will understand. What articles have been written | |
describing research in maximizing the effectiveness of programming? | |
.I 44 | |
.W | |
There are presently fifty to one hundred technical journals being | |
published. On the average, two new journals appear every day. In | |
the many journals published, one to two million articles appear every | |
year. What attempts have been made to cope with this amount of | |
scientific and technical publication in terms of analysis, control, | |
storage, and retrieval? | |
.I 45 | |
.W | |
I am looking for information about the impact of automation on | |
libraries and its significance for libraries in general. This includes | |
the increasing importance of automation in view of the proliferation of | |
information today, and how automation can help libraries cope with | |
this problem. How will automation affect libraries and how should they | |
react to the idea of automation? | |
.I 46 | |
.W | |
I am seeking information on the use of data processing in libraries and | |
the mechanization of routine library processes and procedures. I would | |
like descriptions of both general and specific applications of | |
automation in such areas as circulation, cataloging, acquisitions, | |
serial records, and other record-keeping. Examples should be based on | |
the operation of a conventional public or university library, or | |
practices in a special library which could also be applied in a public | |
or university library. Give descriptions of equipment and operations, | |
both present and projected. | |
.I 47 | |
.W | |
Is there any established means at present for an international exchange | |
of material about information retrieval? If there is, does it take | |
the form of an international agency or center which regularly | |
distributes information retrieval methods and research results? If | |
there is not, in what ways has this material crossed national | |
boundaries? What seem to have been some of the problems blocking a | |
better international exchange, and is any effort being made to solve | |
some of those problems? | |
.I 48 | |
.W | |
Information retrieval is still such a new and experimental field that a | |
line distinguishing research and practice is often difficult - even | |
impossible - to draw. Are there, however, actual centers of research | |
on information retrieval? If so, in which countries are they | |
located? Who supports them - government, business, universities, or | |
libraries? Can information retrieval as a specialized research | |
discipline be said to be emerging, or is it still an amalgam of skills | |
from other fields, such as mathematics, engineering, and library | |
science? In other words, tell me about information retrieval research. | |
.I 49 | |
.W | |
Most resources have been spent on applying information retrieval | |
techniques to the physical and medical sciences. But, has information | |
retrieval been used at all in the natural sciences, social sciences, | |
and humanities? If so, what have been some of the problems which have | |
been encountered with these subject areas and how have they been | |
solved, if at all? Have the characteristics of these subject areas | |
necessitated the development of new information retrieval techniques? | |
What are the prospcts for future machine control in these areas? | |
.I 50 | |
.W | |
Is there any use for traditional classification schemes - DDC, UDC, LC, | |
etc. - in information retrieval systems? If there is, which scheme | |
appears most suited to machine use and where has it been applied? | |
If there is not, why are these classification schemes irrelevant? | |
Has research shown that a subject classification of knowledge is | |
completely unnecessary in machine systems? Or, have new schemes | |
been devised which appear to be more suited to machine use? | |
.I 51 | |
.W | |
Coordinate indexing utilizes descriptors for controlled language. Of | |
what use are descriptors in the construction of an index? How can | |
descriptors be used for searching in an information retrieval system? | |
.I 52 | |
.W | |
What are the characteristics of MEDLARS (Medical Literature Analysis | |
and Retrieval System) project which has been undertaken by the | |
National Library of Medicine? How does it index current medical | |
journals and of what relation is this indexing system to Index Medicus? | |
What are the major components of the MEDLARS project and its major operating | |
details? | |
.I 53 | |
.W | |
How can the computer be used in medical science for diagnostic and | |
clinical record keeping purposes? Have any programs of automation | |
been tried in hospitals? If so, what have been the results? | |
What problems have been encountered in the use of automation in | |
medicine? For what purposes can an automated system of clinical | |
records be used? What are other possible uses of the computer in medicine? | |
.I 54 | |
.W | |
What is the effect on librarians of automation? Note the new types | |
of technology to be used in the library which will have an effect on | |
the status, position, and function of the librarians. What changes | |
are being contemplated or have been initiated to introduce automation | |
into the education of librarians? | |
.I 55 | |
.W | |
What are the aims and objectives of the medical literature analysis | |
and retrieval system (MEDLARS)? How does MEDLARS operate? What are | |
the possible applications of MEDLARS to future information retrieval | |
systems? | |
.I 56 | |
.W | |
The standard method of finding information in today's libraries is | |
through the use of the alphabetically arranged card catalog or the | |
classified catalog based on a classification system such as the DC or | |
LC. Can these systems be modified for use with automated information | |
retrieval? | |
.I 57 | |
.W | |
In catalogs which are either arranged alphabetically or arranged by | |
classification number, the LC entry, printed in readable language, is | |
ultimately important because the individual looking for information | |
has a definite author, title, or subject phrase in his language | |
(probably English in our case) in mind. Will LC entries and subject | |
headings be used in the same manner in automated systems? | |
.I 58 | |
.T | |
Directions in Library Networking | |
.A | |
Avram, H.D. | |
McCallum, S.H. | |
.W | |
Bibliographic control before and after MARC is reviewed. The capability | |
of keying into online systems brought an interdependence among libraries, | |
the service centers that mediate between them, and the large utilities that | |
process and distribute data. From this has developed the basic network | |
structure among libraries in the United States. The independent development | |
of major networks has brought problems in standardization and coordination. | |
The authors point out that while technology has led toward centralization | |
of automated library services, new developments are now pushing toward | |
decentralization. Coordination is a requirement to avoid fragmentation in | |
this new environment. | |
.B | |
(JASIS, Vol. 31, No. 6, November 1980, pp. 438-444) | |
.I 59 | |
.T | |
Performance Testing of a Book and Its Index as a Information Retrieval | |
System | |
.A | |
Bennion, B.C. | |
.W | |
The retrieval performance of book indexes can be measured in terms of | |
their ability to direct a user selectively to text material whose identity | |
but not location is known. The method requires human searchers to base | |
their searching strategies on actual passages from the book rather than on | |
test queries, natural or contrived. It circumvents the need for relevance | |
judgement, but still yields performance indicators that correspond | |
approximately to the recall and precision ratios of large document retrieval | |
system evaluation. A preliminary application of the method to the subject | |
indexing of two major encyclopedias showed one encyclopedia apparently | |
superior in both the finding and discrimination abilities of retrieval | |
performance. The method is presently best suited for comparative testing | |
since its ability to yield absolute or reproducible measures is as yet not | |
established. | |
.B | |
(JASIS, Vol. 31, No. 4, July 1980, pp. 264-270) | |
.I 60 | |
.T | |
The Combined Use of Bibliographic Coupling and Cocitation for Document | |
Retrieval | |
.A | |
Bichteler, J. | |
Eaton, E.A. III | |
.W | |
A linkage similarity measure which takes into account both the bibliographic | |
coupling of documents and their cocitations (both cited and citing papers) | |
produced improved document retrieval over a measure based only on | |
bibliographic coupling. The test collection consisted of 1712 papers whose | |
relevance to specific queries had been judged by users. To evaluate the | |
effect of using cocitation data, we calculated for each query two measures | |
of similarity between each relevant paper and every other paper retrieved. | |
Papers were then sorted by the similarity measures, producing two ordered | |
lists. We then compared the resulting predictions of relevance, partial | |
relevance, and non-relevance to the user's evaluations of the same papers. | |
Overall, the change from the bibliographic coupling measure to the linkage | |
similarity measure, representing the introduction of cocitation data, | |
resulted in better retrieval performance. | |
.B | |
(JASIS, Vol. 31, No. 4, July 1980, pp. 278-282) | |
.I 61 | |
.T | |
Searching Biases in Large Interactive Document Retrieval Systems | |
.A | |
Blair, D.C. | |
.W | |
The way that individuals construct and modify search queries on a | |
large interactive document retrieval system is subject to systematic biases | |
similar to those that have been demonstrated in experiments on judgements | |
under uncertainty. These biases are shared by both naive and sophisticated | |
subjects and cause the inquirer searching for documents on a large interactive | |
system to construct and modify queries inefficiently. A searching algorithm | |
is suggested that helps the inquirer to avoid the effect of these biases. | |
.B | |
(JASIS, Vol. 31, No. 4, July 1980, pp. 271-277) | |
.I 62 | |
.T | |
Fuzzy Requests: An Approach to Weighted Boolean Searches | |
.A | |
Bookstein, A. | |
.W | |
This article concerns the problem of how to permit a patron to | |
represent the relative importance of various index terms in a Boolean | |
request while retaining the desirable properties of a Boolean system. | |
The character of classical Boolean systems is reviewed and related to the | |
notion of fuzzy sets. The fuzzy set concept then forms the basis of the | |
concept of a fuzzy request in which weights are assigned to index terms. | |
Ther properties of such a system are discussed, and it is shown that such | |
systems retain the manipulability of traditional Boolean requests. | |
.B | |
(JASIS, Vol. 31, No. 4, July 1980, pp. 240-247) | |
.I 63 | |
.T | |
Feature Comparison of an In-House Information Retrieval System With a | |
Commercial Search Service | |
.A | |
Boyle, S.O. | |
Miller, A.P. | |
.W | |
A commercially available online search was used as a standard for | |
comparative searching and evaluation of an in-house information system | |
based on automatic indexing. System features were identified and | |
evaluated on the basis of their usefulness in various kinds of searching, | |
their ease in implementation, and how they are influenced by differences | |
in user type or specific applications. Some common features of the | |
commercial system, such as online instruction, user-specified print formats, | |
dictionary display, and truncation, are seen to be unnecessary or | |
impractical for the in-house system. In designing the in-house system, | |
therefore, detald consideration must be given to the applications, | |
operating environment, and real user needs. While a commercial system can | |
serve as a useful standard for comparative evaluation, one must be | |
careful not to attempt to duplicate it blindly in-house. | |
.B | |
(JASIS, Vol. 31, No. 5, September 1980, pp. 309-317) | |
.I 64 | |
.T | |
Measurement in Information Science: Objective and Subjective Metrical Space | |
.A | |
Brookes, B.C. | |
.W | |
It is argued that in information science we have to distinguish | |
physical, objective, or document space from perspective, subjective, or | |
information space. These two spaces are like maps and landscapes: each | |
is a systematic distortion of the other. However, transformation can be | |
easily made once the two spaces are distinguished. If the transformations | |
are omitted we only get unhelpful physical solutions to information problems. | |
.B | |
(JASIS, Vol. 31, No. 4, July 1980, pp. 248-255) | |
.I 65 | |
.T | |
A Model of Cluster Searching Based on Classification | |
.A | |
Croft, W.B. | |
.W | |
The use of document clusters has been suggested as an efficient file | |
organization for a document retrieval system. It is possible that by | |
using this information about the relationships between documents that the | |
effectiveness of the system (i.e., its ability to distinguish relevant | |
from non-relevant documents) may also be improved. In this paper a | |
probabilistic model of cluster searching based on query classification is | |
described. This model is tested with retrieval experiments which indicate | |
that it can be more effective than heuristic cluster searches and cluster | |
searches based on other models. It can also be more effective than a full | |
search in which every document is compared to the query. The efficiency | |
aspects of the implementation of the model are discussed. | |
.B | |
(Inform. Systems, Vol. 5, No. 3, 1980, pp. 189-195) | |
.I 66 | |
.T | |
The Technology of Library and Information Networks | |
.A | |
Epstein, H. | |
.W | |
Current online library network technology is described, including the | |
physical and functional aspects of networks. Three types of networks are | |
distinguished: search service (e.g., SDC, Lockheed), customized service | |
that provide bibliographic files (e.g., OCLC, Inc., RLIN), and service | |
center (e.g., NELINET, INCOLSA). It is predicted that as technology | |
evolves more services will be provided outside the library directly to the | |
user through his home or office. | |
.B | |
(JASIS, Vol. 31, No. 6, November 1980, pp. 425-437) | |
.I 67 | |
.T | |
The Use of Titles for Automatic Document Classification | |
.A | |
Hamill, K.A. | |
Zamora, A. | |
.W | |
An experimental computer program has been developed to classify | |
documents according to the 80 sections and five major section groupings of | |
Chemical Abstracts (CA). The program uses pattern recognition techniques | |
supplemented by heuristics. During the "training" phase, words from | |
pre-classified documents are selected, and the probability of occurrence | |
of each word in each section of CA is computed and stored in a reference | |
dictionary. The "classification" phase matches each word of a document | |
title against the dictionary and assigns a section number to the document | |
using weights derived from the probabilities in the dictionary. Heuristic | |
techniques are used to normalize word variants such as plurals, past | |
tenses, and gerunds in both the training phase and the classification | |
phase. The dictionary lookup technique is supplemented by the analysis of | |
chemical nomenclature terms into their component word roots to influence | |
the section to which the documents are assigned. Program performance and | |
human consistency have been evaluated by comparing the program results | |
against the published sections of CA and by conducting an experiment with | |
people experienced in the assignment of documents to CA sections. The | |
program assigned approximately 78% of the documents to the correct major | |
section groupings of CA and 67% of the correct sections or cross-references | |
at a rate of 100 documents per second. | |
.B | |
(JASIS, Vol. 31, No. 6, November 1980, pp. 396-402) | |
.I 68 | |
.T | |
Brief Communications | |
.A | |
Harding, A.G. | |
Willett, P. | |
.W | |
Some of the automatic classification procedures used in information | |
retrieval derive clusters of documents from an intermediate similarity | |
matrix, the computation of which involves comparing each of the documents | |
in the collection with all of the others. It has recently been suggested | |
that many of these comparisons, specifically those between documents | |
having no terms in common, may be avoided by means of the uyse of an inverted | |
file to the document collection. This communication shows that the | |
approach will effect reductions in the number of interdocument comparisons | |
only if the documents are each indexed by a limited number of indexing | |
terms; if exhaustive indexing is used, many document pairs will be compared | |
several times over and the computation will be greater than when | |
conventional approaches are used to generate the similarity matrix. | |
.B | |
(JASIS, Vol. 31, No. 4, July 1980, pp. 298-299) | |
.I 69 | |
.T | |
The Application of a Minicomputer to Thesaurus Construction | |
.A | |
Kazlauskas, E.J. | |
Holt, T.D. | |
.W | |
The Use of a minicomputer in various phases of creating the thesaurus | |
for the National Information Center for Special Education Materials | |
(NICSEM) database is described. The minicomputer is used to collect, | |
edit, and correct candidate thesaurus terms. The use of the minicomputer | |
eases the process of grouping terms into files of similar concepts and | |
facilitates the generation of products useful in vocabulary review and in | |
term structuring. Syndetic relations, indicated by assigning coded | |
identification numbers, are altered easily in the design phase to reflect | |
restructuring requirements. Because thesaurus terms are already in machine- | |
readable form, it is simple to prepare print programs to provide permuted, | |
alphabetic, hierarchical, and chart formatted term displays. Overall, the | |
use of the minicomputer facilitates initial thesaurus entry development by | |
reducing clerical effort, editorial staff decisions, and overall processing | |
times. | |
.B | |
(JASIS, Vol. 31, No. 5, September 1980, pp. 363-368) | |
.I 70 | |
.T | |
Adaptive Design for Decision Support Systems | |
.A | |
Keen, P.G.W. | |
.W | |
Decision Support Systems (DSS) represent a concept of the role of | |
computers within the decision making process. The term has become a | |
rallying cry for researchers, practitioners, and managers concerned that | |
Management Science and Management Information Systems fields have become | |
unnecessarily narrow in focus. As with many rallying cries, the term is | |
not well defined. For some writers, DSS simply mean interactive systems | |
for use by managers. To others, the key issue is support, rather than | |
system. They focus on understanding and improving the decision process; | |
a DSS is then designed using any available and suitable technology. Some | |
researchers view DSS as a subfield of MIS, while others regard it as an | |
extension of Management Science techniques. The former define Decision | |
Support as providing managers with access to data and the latter as giving | |
them access to analytic models. | |
The key argument of this paper is that the term DSS is relevant to | |
situations where a "final" system can be developed only through an | |
adaptive process of learning and evolution. The design strategy must | |
then focus on getting finished; this is very different from Management | |
Science and Data Processing approaches. The research issued for DSS | |
center around adaption and evolution; they include managerial learning | |
representation of tasks and user behavior, design architecture and | |
strategies for getting started. | |
.B | |
(Database, Vol. 12, No. 1-2, Fall 1980) | |
.I 71 | |
.T | |
An Automatic Method for Extracting Significant | |
Phrases in Scienfific or Technical Documents | |
.A | |
Maeda, T. | |
Momouchi, J. | |
Sawamura, H. | |
.W | |
A new method is described to extract significant phrases in the title | |
and the abstreact of scientific or technical documents. The method is | |
based upon a text structure analysis and uses a relatively small dictionary. | |
The dictionary has been constructed based on the knowledge about concepts | |
in the field of science or technology and some lexical knowledge. For | |
significant phrases and their component items may be used in different | |
meanings among the fields. A text analysius approach has been applied to | |
select significant phrases as substantial and semantic information carriers | |
of the contents of the abstract. | |
The results of the experiment for five sets of documents have shown | |
that the significant phrases are effectively extracted in all cases, and | |
the number of them for every document and the processing time is fairly | |
satisfactory. The information representation of the document, partly | |
using the method, is discussed with relation to the construction of the | |
document information retrieval system. | |
.B | |
(Info. Proc. & Management, Vol. 16, No. 3, 1980, pp.119-127) | |
.I 72 | |
.T | |
Answer-Passage Retrieval by Text Searching | |
.A | |
O'Connor, J. | |
.W | |
Passage retrieval (already operational for lawyers) has advantages in | |
output form opver references retrieval and is economically feasible. | |
Previous experiments in passage retrieval for scientists have demonstrated | |
recall and false retrieval rates as good or better than those of present | |
reference retrieval services. The present experiment involved a greater | |
variety of forms of retrieval question. In addition, search words were | |
selected independently by two different people for each retrieval question. | |
The search words selected, in combination with the computer procedures used | |
for passage retrieval, produced average recall ratios of 72 and 67%, | |
respectively, for the two selectors. The false retrieval rates were (except | |
for one predictably difficult question) respectively 13 and 10 falsely | |
retrieved sentences per answer-paper retrieved. | |
.B | |
(JASIS, Vol. 31, No. 4, July 1980, pp. 227-239) | |
.I 73 | |
.T | |
Partial-Match Retrieval Using Indexed Descriptor Files | |
.A | |
Pfaltz, J.L. | |
Berman, W.J. | |
Cagley, E.M. | |
.W | |
In this paper we describe a practical method of partial-match retrieval | |
in very large data files. A binary code word, called a descriptor, is | |
associated with each record of the file. These record descriptors are | |
then used to form a derived descriptor for a block of several records, | |
which will serve as an index for the block as a whole; hence, the name | |
"indexed descriptor files." | |
First the structure of these files is described and a simple, efficient | |
retrieval algorithm is presented. Then its expected behavior, in terms of | |
storage accesses, is analyzed in detail. Two different file creation | |
procedures are sketched, and a number of ways in which the file organization | |
can be "tuned" to a particular application is suggested. | |
.B | |
(Commun. ACM, Vol. 23, No. 9, September 1980, pp. 522-528) | |
.I 74 | |
.T | |
Cooperation and Competition Among Library Networks | |
.A | |
Robinson, B.M. | |
.W | |
Recenty technological advances and the success of OCLC, Inc. has led | |
to the emergence of three additional nonprofit library networks: the | |
Research Libraries Information Network (RLIN) of the Research Libraries | |
Group, Inc., the University of Toronto Library Automation System (UTLAS), | |
and the Washington Library Network (WLN). This paper examines the economic | |
and technological factors affecting the evolution of these networks and | |
also explores the role of those state and regional (multistate) networks | |
that broker OCLC services. The competitive and cooperative nature of | |
network relationships is a major theme of the discussion. | |
.B | |
(JASIS, Vol. 31, No. 6, November 1980, pp. 413-424) | |
.I 75 | |
.T | |
An Integrated Understander | |
.A | |
Schank, R.C. | |
Lebowitz, M. | |
Birnbarim, L. | |
.W | |
A new type of natural language parser is presented. The idea behind | |
this parser is to map input sentences into the deepest form of the | |
representation of their meaning and inferences, as is appropriate. The | |
parser is not distinct from an entire understanding system. It uses an | |
integrated conception of inferences, scripts, plans and other knowledge to | |
aid in the parse. Furthermore, it does not attempt to parse everything it | |
sees. Rather, it determines what is most interesting and concentrates on | |
that, ignoring the rest. | |
.B | |
(Am. J. of Computational Linguistics, Vol. 6, No. 1, January-March 1980, | |
pp. 13-30) | |
.I 76 | |
.T | |
Library Networks and Resource Sharing in the United States: | |
An Historical and Philosophical Overview | |
.A | |
Stevens, N.D. | |
.W | |
This paper discusses the origins of library networks and traces their | |
development in the United States in the late 1960s through the present. | |
The concept of resource sharing, with particular attention to the inter- | |
library loan and programs for the cooperative acquisition and storage of | |
materials, is examined in relationship to library networks. In particular, | |
attention is given to the question of how these two major components of | |
library cooperation, which have tended to be separate, might become more | |
closely integrated. | |
.B | |
(JASIS, Vol. 31, No. 6, November 1980, pp. 405-412) | |
.I 77 | |
.T | |
Normalization of Titles and Their Retrieval | |
.A | |
Takamatsu, S. | |
Fujita, UY. | |
Nishida, F. | |
.W | |
This paper presents a method of normalizations of English titles and | |
their retrieval. The title expressed by a noun phrase or a noun clause | |
is converted to a function-expression by parsing. For the retrieval with | |
a reasonable recall rate as well as a high precision rate, the function- | |
expression is transformed to a predicate-governor form, and then normalized | |
to a standard form. Therefrom, various items are extracted and recorded | |
in a hierarchical tree-like inverted file. | |
In order to keep the recall rate in a reasonable value, several | |
retrieval stages are implemented based on the key-term and case-label | |
matching. The retrieval is controlled by the preciseness of the specification | |
of case-labels for each key-term. | |
.B | |
(Info. Proc. & Management, Vol. 16, No. 3, 1980, pp. 155-167) | |
.I 78 | |
.T | |
Cascaded ATN Grammars | |
.A | |
Woods, W.A. | |
.W | |
A generalization of the notion of ATN grammar, called a cascaded ATN | |
(CATN), is prescribed. CATN's permit a decomposition of complex language | |
understanding behavior into a sequence of cooperating ATN's with separate | |
domain of responsibility, where each stage (called an ATN transducer) | |
takes its input from the output of the previous stage. The paper includes | |
an extensive discjussion of the principles of factoring-conceptual | |
factoring reduces the number of places that a given fact needs to be | |
represented in a grammar, and hypothesis factoring reduces the number | |
of distinct hypotheses that have to be considered during parsing. | |
.B | |
(Am. of Computational Linguistics, Vol. 6, No. 1, January-March 1980) | |
.I 79 | |
.T | |
Algorithms for Processing Partial Match Queries Using Word Fragments | |
.A | |
Alagar, V.S. | |
.W | |
Algorithms are given to process partially specified queries in a | |
compressed database system. The proposed methods handle effectively | |
queries that use either whole words or word fragments as language elements. | |
The methods are compared and critically evaluated in terms of the design | |
and retrieval costs. The analyses show that the method which exploits the | |
interdependence of fragments as well as the relevance of fragments to | |
records in the file has maximum design cost and least retrieval cost. | |
.B | |
(Inform. Systems, Vol. 5, No. 4, April 1980, pp. 323-332) | |
.I 80 | |
.T | |
A General Formulation of Bradford's Distribution: The Graph-Oriented | |
Approach | |
.A | |
Asai, I. | |
.W | |
From the detailed analysis of eight previously published mathematical | |
models, a general formulation of Bradford's distribution can be deduced as | |
follows: y = a log(x + c) + b, where y is the ratio of the cumulative | |
frequency of articles to the total number of articles and x is the ratio | |
of the rank of journals to the total number of journals. The parameters a, b, | |
and c are the slope, the intercept, and the shift in a straight line to log rank, | |
respectively. Each of the eight models is a special case of the general | |
formulation and is one of five types of formulation. In order to estimate | |
three unknown parameters, a statistical method using root-weighted square | |
error is proposed. A comparative experiment using 11 databases suggests that | |
the fifth type of formulation with three unknown parameters is the best fit | |
to the observed data. A further experiment shows that the deletion of the | |
droop data leads to a more accurate value of parameters and less error. | |
.B | |
(JASIS, Vol. 32, No. 2, March 1981, pp. 113-121) | |
.I 81 | |
.T | |
Lexical Problems in Large Distributed Information Systems | |
.A | |
Berkovich, S.Y. | |
Shneiderman, B. | |
.W | |
The lexical problems in large information systems are created by the | |
necessity of handling a great number of names and their interrelations. | |
Such lexical problems are not covered completely by the concept data | |
dictionaries, which are mostly concerned with database scheme design rather | |
than the execution of operations. In this paper we introduce our view of a | |
lexical subsystem as a separate component in an information system architecture, | |
to deal with linguistic and control functions concerning the lexical problems | |
in local and network environments. The lexical suybsystem is a special | |
efficiently organized program package, which plays the role of a "linguistic | |
filter" in a broad sense for lexically incorrect queries, promotes integration | |
of databases and information retrieval systems, and facilitates the creation | |
of local information systems. We hope that lexical subsystems can become | |
productive for any large, especially distributed, information system. | |
.B | |
(Information Processing & Management, Vol. 16, February 1980, pp. 259-267) | |
.I 82 | |
.T | |
The Relational Model in Information Retrieval | |
.A | |
Crawford, R.G. | |
.W | |
The relational model has received increasing attention during the | |
past decade. Its advantages include simplicity, consistency, and a sound | |
theoretical basis. In this article, the naturalness of viewing information | |
retrieval relationally is demonstrated. The relational model is presented, | |
and the relational organization of a bibliographical database is shown. | |
The notion of normalization is introduced and first, second, third, and | |
fourth normal forms are demonstrated. Relational languages are discussed, | |
including the relational calculus, relational algebra, and SEQUEL. | |
Numerous examples pertinent to information retrieval are presented in these | |
relational languages. Advantages of the relational approach to information | |
retrieval are noted. | |
.B | |
(JASIS, Vol. 32, No. 1, January 1981, pp. 51-64) | |
.I 83 | |
.T | |
Electronic Information Interchange in an Office Environment | |
.A | |
DeSousa, M.R. | |
.W | |
This paper describes an architectural approach that provides information | |
exchange across a broad spectrum of user applications and office automation | |
offerings. Some of the architectures described herein are currently | |
implemented in existing IBM products. These and other architectures will | |
provide the basis for document interchange capability between products | |
such as the IBM 5520 Administrative System, the IBM System/370 Distributed | |
Office Support System (DISOSS), and the IBM Displaywriter System. | |
Specifically described is a document distribution architecture and its | |
associated data streams and others. | |
A general overview of the architectures as opposed to a detailed | |
technical description is provided. The architectures described are | |
protocols for interchange between application processes; they do not | |
address the specific user interface. The document distribution | |
architectures utilize SNA for data transmission and communications control | |
facilities. | |
.B | |
(IBM Systems Journal, Vol. 20, No. 1, 1981, pp. 4-22) | |
.I 84 | |
.T | |
The Use of Automatic Relevance Feedback in Boolean Retrieval Systems | |
.A | |
Dillon, M. | |
Desper, J. | |
.W | |
A technique is described for automatic reformulation of boolean | |
queries. Based on patron relevance judgements of an initial retrieval, | |
prevalence measures are derived for terms appearing in the retrieved set | |
of documents that reflect a term's distribution among the relevant and | |
non-relevant documents. These measures are then used to guide the | |
construction of a boolean query for a subsequent retrieval. To illustrate | |
the technique, a series of tests is described of its application to a small | |
data base in an experimental environment. Results compare favourably with | |
feedback as employed in a SMART-type system. MOre extensive testing is | |
suggested to validate the technique. | |
.B | |
(Journal of Documentation, Vol. 36, No. 3, September 1980, pp. 197-208) | |
.I 85 | |
.T | |
Interacting in Natural Language With Artificial Systems: The Donau Project | |
.A | |
Guida, G. | |
Somalvico, M. | |
.W | |
This paper is intended to propose a new methodological approach to | |
the conception and development of natural language understanding systems. | |
This new contribution is supported by the design, implementation, and | |
experimentation of DONAU: a general purpose domain oriented natural | |
language understanding system developed and presently running at the Milan | |
Polytechnic Artificial Intelligence Project. The system is based on a two | |
level modular architecture intended to overcome the lack of flexibility and | |
generality often pointed out in many existing systems, and to facilitate | |
the exchange of results and actual experiences between different projects. | |
The horizontal level allows an independent and parallel development of the | |
single segments of the system (syntactic analyser, information extractor, | |
legality controller). The vertical level ensures the possibility of changing | |
(enlarging or redefining) the definition of the semantic domain on which each | |
particular version of the system is oriented and specialized in a simple, | |
incremental, and user-oriented way. In the paper the general architecture of | |
the system and the mode of operation of each segment are illustrated in | |
detail. Linguistic models, knowledge representation, and parsing algorithms | |
are described and illustrated by means of selected examples. Performance | |
evaluations of the system in the application version on data base inquiry are | |
reported and discussed. Promising directions for future research are presented | |
in the conclusions. | |
.B | |
(Inform. Systems, Vol. 5, NO. 4, February 1980, pp. 333-344) | |
.I 86 | |
.T | |
Approximate String Matching | |
.A | |
Hall, P.A.V. | |
Dowling, G.R. | |
.W | |
Approximate matching of strings is reviewed with the aim of | |
surveying techniques suitable for finding an item in a database when | |
there may be a spelling mistake or other error in the keyword. The | |
methods found are classified as either equivalence or similarity problems. | |
Equivalence problems are seen to be readily solved using canonical forms. | |
For similarity problems difference measures are surveyed, with a full | |
description of the well-established dynamic programming method relating | |
this to the approach using probabilities and likelihoods. Searches for | |
approximate matches in large sets using a difference function are seen to | |
be an open problem still, though several promising ideas have been | |
suggested. Approximate matching (error correction) during parsing is | |
briefly reviewed. | |
.B | |
(Computing Surveys, Vol. 12, No. 4, December 1980, pp. 381-402) | |
.I 87 | |
.T | |
Using an Online Microfiche Catalog for Technical Service and Retrieval of | |
Bibliographic Data | |
.A | |
Hayes, R.M. | |
Borko, H. | |
.W | |
A prototype system is created that integrates a microfiche catalog | |
into an online computer system for bibliographic control. Costs and | |
operational data are collected and analyzed. The system permits the more | |
economical microfiche storage of catalog records than would be feasible | |
for comparable online magnetic disk storage. Experimental tests | |
demonstrate the feasibility of the online microfiche catalog system for use | |
in library technical services and retrieval of bibliographic data. The | |
primary result of the project is the creation of a completely operational | |
facility, including all equipment, software, procedures, and data bases | |
necessary to demonstrate the system. A second set of results is derived | |
from the experimental use of the system and the evaluation of costs and | |
times for various operations. The cost effectiveness of the online microfiche | |
catalog is demonstrated. | |
.B | |
(Information Processing and Management, Vol. 16, No. 6, February 1980, | |
pp. 277-289) | |
.I 88 | |
.T | |
Natural Language Access to Information Systems. An Evaluation Study | |
of Its Acceptance by End Users | |
.A | |
Krause, J. | |
.W | |
The question is asked whether it is feasible to use subsets of | |
natural languages as query languages for data bases in actual applications | |
using the question answering system "USER SPECIALTY LANGUAGES" (USL). | |
Methods of evaluating a natural language based information system will | |
be discussed. The results (error and language structure evaluation) | |
suggest how to form the general architecture of application systems which | |
use a subset of German as query language. | |
.B | |
(Inform. Systems, Vol. 5, No. 4, May 1980, pp. 297-318) | |
.I 89 | |
.T | |
Some Considerations Relating to the Cost-Effectiveness of Online Services | |
in Libraries | |
.A | |
Lancaster, F.W. | |
.W | |
In 1978 Collier presented some hypothetical data on economic aspects | |
of the use of online services as compared with subscriptions to printed | |
services in libraries. Collier's view of the economics of online searching | |
seems misleadingly pessimistic because: | |
1. It looks only at costs but not at effectiveness in comparing the two | |
modes of access and searching. An analysis combining cost and | |
effectiveness aspects (i.e., a cost-effectiveness analysis) would | |
give a completely different picture. | |
2. The way the cost data are presented is grossly unfair to the online | |
mode of access and use. | |
This work contains corrected information regarding online and printed | |
services in libraries. | |
.B | |
(Aslib Proceedings, 33(1), January 1981, pp. 10-14, printed in Great Britain) | |
.I 90 | |
.T | |
Co-Citation Context Analysis and the Structure of Paradigms | |
.A | |
Small, H. | |
.W | |
Many information scientists are concerned with the operation of | |
document retrieval systems serving scientists in various fields. The | |
scientists served by these systems are often members of what have been called | |
invisible colleges, groups of scientists in frequent communication with | |
one another and involved with highly specialized subject matters. Often | |
such groups are considered to share an intellectual perspective regarding | |
this subject matter, which is sometimes referred to as a paradigm. | |
The purpose of this paper is to show how it is possible to identify | |
paradigms, using the techniques of citation analysis. I will operationalize | |
the notion of paradigm as a 'consensual structure of concepts in a field.' | |
Suppose we have obtained a set of papers pertaining to some topic. Already | |
knowing something about the field, we read each text and mark passages in | |
which certain specific concepts are used or discussed. For example, we | |
might find that a concept designated 'A' appears in some sub-set of the | |
papers. Suppose further that we identify those papers in which concepts 'A' | |
and 'B' are used together in the same papers in a certain specified manner. | |
Clearly not all concepts will combine in a natural way, and not all authors | |
combining concepts 'A' and 'B' will do so in the same way, though some | |
predominant mode may emerge. For a set of n concepts their structure is | |
given by the totality of admissible combinations of concepts taken from | |
two to n at a time. The frequency with which a given combination occurs | |
in the sample of papers on the topic is a measure of the degree of consensus | |
regarding the particular concept combination within the corpus. For | |
concepts taken two at a time, the structure can be displayed as a graph with | |
concepts as nodes and the relations between them represented as lines (arcs) | |
connecting the nodes. This definition of concept structure is | |
similar to the semantic network of artificial intelligence except that in | |
our approach a measure of consensus weights each arc of the graph. | |
.B | |
(Journal of Documentation, Vol. 36, No. 3, September 1980, pp. 183-196) | |
.I 91 | |
.T | |
Cocited Author Retrieval Online: | |
An Experiment with the Social Indicators Literature | |
.A | |
White, H.D. | |
.W | |
One mode of online retrieval in Scisearch or Social Scisearch involves | |
entering pairs of authors' names believed to be jointly cited by | |
subsequent writers and retrieving papers in which cocitations occur. Six | |
pairs were formed with the names of four authors prominent in the social | |
indicators movement (Bauer, Duncan, Land, and Sheldon). Documents by the | |
four were not specified. It was thought that the pair Duncan and Land | |
would retrieve papers in which indicator-type data would be integrated with | |
path-analytic causal modeling. All other pairs seemed likely to retrieve a | |
"general social indicators" literature. The 298 retrieved papers confirmed | |
expectattions. It was found that 121 papers generally cited social indicators | |
(SI) documents by the input authors and frequently had SI language in | |
their titles. Other signs of content also identified them as papers of | |
the SI movement. The 177 papers retrieved on Duncan and Land generally | |
cited causal modeling documents by the input pair and were path-analytic | |
in nature. As expected, they were relatively "harder" than the first | |
group of papers, although the two groups are akin and are formally linked | |
through citations in certain papers. An additional result is that papers | |
citing at least three of the input authors tend to be overviews of the SI | |
movement. | |
.B | |
(JASIS, Vol. 32, No. 1, January 1981, pp. 16-21) | |
.I 92 | |
.T | |
Database and Online Statistics for 1979 | |
.A | |
Williams M.E. | |
.W | |
The number of databases, records contained in databases and the online | |
use of databases has increased dramatically over the past several years, | |
bringing the 1979 totals for bibliographic, bioliographic-related, and | |
natural language databases to 528. These 528 databases contain 148 million | |
records. Some 4 million online searches were conducted via the major U.S. | |
and Canadian systems in 1979. | |
.B | |
(Bulletin of ASIS, Vol. 7, No. 2, December 1980, pp. 27-29) | |
.I 93 | |
.T | |
Experiments in Local Metrical Feedback in Full-Text Retrieval Systems | |
.A | |
Attar, R. | |
Fraenkel, A.S. | |
.W | |
A method of iterative searching, using the results of one iteration search | |
to formulate the next iteration search, was applied to a full-text database | |
consisting of some 2400 documents and 1,3000,000 text-words of Hebrew and | |
Aramaic. The iterative method consists of clustering the documents returned | |
in an iteration, using weighting by proximity and by frequency simultaneously. | |
The process produces searchonyms, which are terms synonymous to keywords in the | |
context of a single query. Augumenting or replacing keywords by searchonyms | |
via manual or automatic feedback leads to the formulation of the next iteration | |
search. The results of the experiment are consistent with those of an earlier | |
small-scale experiment on an English database, and indicate that in contrast | |
to global clustering where the size of matrices limits applications to small | |
databases and improvements are doubtful, local metrical methods appear to be | |
well suited to arbitrarily large databases, improving precision and recall | |
simultaneously. Further experiments using more test-queries run on even | |
larger databases should be made to collect further evidence as to the | |
performance of these methods. | |
.B | |
(Information Processing & Management, Vol. 17, No. 3, 1981, pp. 115-126) | |
.I 94 | |
.T | |
A Microcomputer Alternative for Information Handling: Refles | |
.A | |
Bivins, K.T. | |
Palmer, R.C. | |
.W | |
REFLES is a microcomputer-based system for data retrieval in library | |
environments. The problem of information retrieval is discussed from a | |
theoretical point of view, followed by an analysis of the reference process | |
and data thereby gathered, leading to a description of REFLES in terms of | |
its hardware and software. REFLES, a prototype system at present, currently | |
functions in a test environment. Examples of data contained in the system | |
and of its use are presented. Future considerations and speculations on | |
other versions of the system conclude the paper. | |
.B | |
(Information Processing & Management, Vol. 17, No. 2, 1981, pp. 93-101) | |
.I 95 | |
.T | |
A Comparison of Two Systems of Weighted Boolean Retrieval | |
.A | |
Bookstein, A. | |
.W | |
A major deficiency of traditional Boolean systems is their inability to | |
represent the varying degrees to which a document may be written on a subject. | |
In this article we isolate a number of criteria that should be met by any | |
Boolean system generalized to have a weighting capability. It is proven that | |
only one weighting rule satisfies these conditions--that associated with fuzzy- | |
set theory--and that this weighting scheme satisfies most of the other | |
properties associated with Boolean algebra as well. Probabilistic weighting | |
is then introduced as an alternative approach and the two systems compared. | |
In the limit of zero/one weights, all systems considered converge to | |
traditional Boolean retrieval. | |
.B | |
(JASIS, Vol. 32, No. 4, July 1981) | |
.I 96 | |
.T | |
Threshold Values and Boolean Retrieval Systems | |
.A | |
Buell, D.A. | |
Kraft, D.H. | |
.W | |
Several papers have appeared that have analyzed recent developments | |
in the problem of processing, in a document retrieval system, queries expressed | |
as Boolean expressions. The purpose of this paper is to continue that analysis. | |
We shall show that the concept of threshold values resolves the problems | |
inherent with relevance weights. Moreover, we shall explore possible evaluation | |
mechanisms for retrieval of documents, based on fuzzy-set-theoretic | |
considerations. | |
.B | |
(Information Processing & Management, Vol. 17, No. 3, 1981, pp. 127-136) | |
.I 97 | |
.T | |
A Model for a Weighted Retrieval System | |
.A | |
Buell, D.A. | |
Kraft, D.H. | |
.W | |
There has been a good deal of work on information retrieval systems that | |
have continuous weights assigned to the index terms that describe the records | |
in the database, and/or to the query terms that describe the user queries. | |
Recent articles have analyzed retrieval systems with continuous weights of | |
either type and/or with a Boolean structure for the queries. They have also | |
suggested criteria which such systems ought to satisfy and record evaluation | |
mechanisms which partially satisfy these criteria. We offer a more careful | |
analysis, based on a generalization of the discrete weights. We also look | |
at the weights from an entirely different approach involving thresholds, and | |
we generate an improved evaluation mechanism which seems to fulfill a larger | |
subset of the desired criteria than previous mechanisms. This new mechanism | |
allows the user to attach a "threshold" to the query term. | |
.B | |
(JASIS, Vol. 32, No. 3, May 1981, pp. 211-216) | |
.I 98 | |
.T | |
A Translating Computer Interface for End-User Operation of Heterogeneous | |
Retrieval Systems. I. Design | |
.A | |
Marcus, R.S. | |
Reintjes, J.F. | |
.W | |
Online retrieval systems may be difficult to use, especially by end | |
users, because of heterogeneity and complexity. Investigations have concerned | |
the concept of a translating computer interface as a means to simplify access | |
to, and operation of, heterogeneous bibliographic retrieval systems and | |
databases. The interface allows users to make requests in a common language. | |
These requests are translated by the interface into the appropriate commands | |
for whatever system is being interrogated. System responses may also be | |
transformed by the interface into a common form before being given to the | |
users. Thus, the network of different systems is made to look like a single | |
"virtual" system to the user. The interface also provides instruction and | |
other search aids for the user. The philosophy, design, and implementation | |
of an experimental interface named CONIT are described. | |
.B | |
(JASIS, Vol. 32, No. 4, July 1981, pp. 287-303) | |
.I 99 | |
.T | |
A Translating Computer Interface for End-User Operation of | |
Heterogeneous Retrieval Systems. II. Evaluations | |
.A | |
Marcus, R.S. | |
Reintjes, J.F. | |
.W | |
The evaluation of the concept of a translating compuyter interface for | |
simplifying operation of multiple, heterogenous online bibliographic | |
retrieval systems has been undertaken. An experimental retrieval system, | |
named CONIT, was built and tested under controlled conditions with | |
inexperienced end users. A detailed analysis of the experimental usages | |
showed that users were able to master interface operation sufficiently well | |
to find relevant document references. Success was attributed, in part, | |
to a simple command language, adequate online instruction, and a simplified | |
natural-language, keyword/stem approach to searching. It is concluded that | |
operational interfaces of the type studied can provide for increased usability | |
of existing system in a cost effective manner, especially for searchers. | |
Furthermore, more advanced interfaces based on improved instruction and | |
automated search strategy techniques could further enhance retrieval | |
effectiveness for a wide class of users. | |
.B | |
(JASIS, Vol. 32, No. 4, July 1981, pp. 304-317) | |
.I 100 | |
.T | |
The Interface Between Computerized Retrieval Systems and Micrographic | |
Retrieval Systems | |
.A | |
McMurdo, G. | |
.W | |
This paper notes the benefits accruing from interaction between computerized | |
retrieval systems and micrographic retrieval systems. It reviews current state | |
of automated micrographic retrieval technology. The conclusion is that with a | |
combination of advances in communications technology, and sophisticated indexing | |
input from libraries and information scientists, the new generation of automated | |
micrographs devices may constitute the on-line document retrieval systems of the | |
future. | |
.B | |
(Journal of Information Science I, 1980, pp. 345-349) | |
.I 101 | |
.T | |
Parallel Computations in Information Retrieval | |
.A | |
Salton, G. | |
Bergmark, D. | |
.W | |
Conventional information retrieval processes are largely based on data | |
movement, pointer manipulations and integer arithmetic; more refined retrieval | |
algorithms may in addition benefit from substantial computational power. | |
In the present study a number of parallel processing methods are described | |
that serve to enhance retrieval services. In conventional retrieval | |
environments parallel list processing and parallel search facilities are of | |
greatest interest. In more advanced systems, the use of array processors | |
also proves beneficial. Various information retrieval processes are examined | |
and evidence is given to demonstrate the usefulness of parallel processing | |
and fast computational facilities in information retrieval. | |
.B | |
(In Lecture Notes in Computer Science, No. III, W. Handler, Ed., Springer | |
Verlag, Berlin-New York, 1981, pp. 328-342) | |
.I 102 | |
.T | |
The Measurement of Term Importance in Automatic Indexing | |
.A | |
Salton, G. | |
Wu, H. | |
Yu, C.T. | |
.W | |
The frequency characteristics of terms in the documents of a collection | |
have been used as indicators of term importance for content analysis and | |
indexing purposes. In particular, very rare or very frequent terms are | |
normally believed to be less effective than medium-frequency terms. Recently | |
automatic indexing theories have been devised that use not only the term | |
frequency characteristics but also the relevance properties of the terms. | |
The major term-weighting theories are first briefly reviewed. The term | |
precision and term utility weights that are based on the occurrence | |
characteristics of the terms in the relevant, as opposed to the nonrelevant, | |
documents of a collection are then introduced. Methods are suggested for | |
estimating the relevance properties of the terms based on their overall | |
occurrence characteristics in the collection. Finally, experimental | |
evaluation results are shown comparing the weighting systems using the term | |
relevance properties with the more conventional frequency-based methodologies. | |
.B | |
(JASIS, Vol. 32, No. 3, May 1981, pp. 175-186) | |
.I 103 | |
.T | |
NDX-100: An Electronic Filing Machine for the Office of the Future | |
.A | |
Slonim, J. | |
MacRae, L.J. | |
Mennie, W.E. | |
Diamond, N. | |
.W | |
This paper describes the design and implementation of an "electronic filing | |
machine," a machine which is capable of storing large numbers of "unstructured" | |
documents in such a way a particular document may be easily and quickly | |
retrieved. A functional distributed architecture permits the implementation | |
of the system in a mixture of hardware and software. | |
.B | |
(Computer, Vol. 14, No. 5, May 1981, pp. 24-36) | |
.I 104 | |
.T | |
The Selection of Good Search Terms | |
.A | |
van Rijsbergen, C.J. | |
Harper, D.J. | |
Porter, M.F. | |
.W | |
This paper tackles the problem of how one might select further search terms, | |
using relevance feedback, given the search terms in the query. These search | |
terms are extracted from a maximum spanning tree connecting all the terms in the | |
index term vocabulary. A number of different spanning trees are generated from | |
a variety of association measures. The retrieval effectiveness for the | |
different spanning trees is shown to be approximately the same. Effectiveness | |
is measured in terms of precision and recall, and the retrieval tests are done | |
on three different test collections. | |
.B | |
(Information Processing & Management, Vol. 17, No. 2, 1981, pp. 77-91) | |
.I 105 | |
.T | |
Indexing Consistency, Quality and Efficiency | |
.A | |
Rolling, L. | |
.W | |
Indexing quality determines whether the information content of an indexed | |
document is accurately represented. Indexing effectiveness measures whether | |
an indexed document is correctly retrieved every time it is relevant to a | |
query. Measurement of these criteria is cumbersome and costly; data base | |
producers therefore prefer inter-indexer consistency as a measure of indexing | |
quality or effectiveness. The present article assesses the validity of this | |
substitution in various environments. | |
.B | |
(Information Processing & Management, Vol. 17, No. 2, 1981, pp. 69-76) | |
.I 106 | |
.T | |
Text Passage Retrieval Based on Colon Classification: Retrieval Performance | |
.A | |
Shepherd, M.A. | |
.W | |
A set of experiments was conducted to determine the suitability of the | |
Colon Classification as a foundation for the automated analysis, representation | |
and retrieval of primary information from the full text of documents. Primary | |
information is that information embodied in the text of a document, as opposed | |
to secondary information which is generally in such forms as: an abstract, a | |
table of contents, or an index. | |
Full text databases were created in two subject areas and queries solicited | |
from specialists in each area. An automated full text indexing system, along | |
with four automated passage retrieval systems, was created to test the various | |
features of the Colon Classification. Two Boolean-based systems and one simple | |
word occurrence system were created in order to compare the retrieval results | |
against types of systems which are in more common use. The systems' retrieval | |
performances were measured using recall and precision and the mean expected | |
search length reduction factors. | |
Overall, it was found that the Colon Classification-based systems did not | |
perform significantly better than the other systems. | |
.B | |
(Journal of Documentation, Vol. 37, No. I, March 1981, pp. 25-35) | |
.I 107 | |
.T | |
User-Responsive Subject Control in Bibliographic Retrieval Systems | |
.A | |
Tague, J.M. | |
.W | |
A study was carried out of the relationship between the vocabulary of | |
user queries and the vocabulary of documents relevant to the queries, and | |
the value of adding to the document description record in a retrieval system | |
keywords from previous queries for which the document had proved useful. | |
Two test databases incorporating user query keywords were implemented at | |
the School of Library and Information Science, University of Western | |
Ontario. Clustering of the documents via title and user keywords, a | |
statistical analysis of title-user keyword co-occurrences, and retrieval | |
tests were used to examine the effect of the added keywords. Results | |
showed the impracticality of the procedure in an operational setting, but | |
indicated the value of analyses with sample data in the development and | |
maintenance of keyword dictionaries and thesauri. | |
.B | |
(Information Processing & Management, Vol. 17, No. 3, 1981, pp. 149-159) | |
.I 108 | |
.T | |
A Program for Machine-Mediated Searching | |
.A | |
Toliver, D. | |
.W | |
A technique of online instruction and assistance to bibliographic data | |
base searchers called Individualized Instruction for Data Access (IIDA) is | |
being developed by Drexel University. IIDA assists searchers by providing | |
feedback based on real-time analysis while searches are being performed. | |
Extensive help facilities which draw on this analysis are available to | |
users. Much of the project's experimental work, as described elsewhere, | |
is concerned with the process of searching and the behavior of searchers. | |
This paper will largely address itself to the project's computer system, which | |
is being developed by subcontract with the Franklin Institute's Science | |
Information Services. | |
.B | |
(Information Processing & Management, Vol. 17, No. 2, 1981, pp. 61-68) | |
.I 109 | |
.T | |
Author Cocitation: A Literature Measure of Intellectual Structure | |
.A | |
White, H.D. | |
Griffith, B.C. | |
.W | |
It is shown that the mapping of a particular area of science, in this | |
case information science, can be done using authors as units of analysis and | |
the cocitations of pairs of authors as the variable that indicates their | |
"distances" from each other. The analysis assumes that the more two authors | |
are cited together, the closer the relationship between them. The raw data | |
are cocitation counts drawn online from Social Scisearch (Social Sciences | |
Citation Index) over the period 1972-1979. GThe resulting map shows | |
(1) identifiable author groups (akin to "schools") of information science, | |
(2) locations of these groups with respect to each other, (3) the degree of | |
centrality and peripherality of authors within groups, (4) proximities of | |
authors within group and across group boundaries ("border authors" who seem | |
to connect various areas of research), and (5) positions of authors with | |
respect to the map's axes, which were arbitrarily set spanning the most | |
divergent groups in order to aid interpretation. Cocitation analysis of | |
authors offers a new technique that might contribute to the understanding of | |
intellectual structure in the sciences and possibly in other areas to the | |
extent that those areas rely on serial publications. The technique | |
establishes authors, as well as documents, as an effective unit in | |
analyzing subject specialties. | |
.B | |
(JASIS, Vol. 32, No. 3, May 1981, pp. 163-171) | |
.I 110 | |
.T | |
Progress in Documentation. Word Processing: | |
An Introduction and Appraisal | |
.A | |
Whitehead, J. | |
.W | |
The "Office of the Future," "Office Technology," "Word Processing," | |
"Electronic Mail," "Electronic Communications," "Convergence," "Information | |
Management." These are all terms included in the current list of buzz words | |
used to describe current activities in the office technology area. The high | |
level of investment in factories and plants and the ever-increasing fight to | |
improve productivity by automating the dull, routine jobs are usually quoted | |
and compared with the extremely low investment in improving and automating | |
the equally tedious routine jobs in the office environment; the investment | |
in the factory is quoted as being ten times greater per employee than in the | |
office. This, however, is changing rapidly and investment on a large scale | |
is already taking place in manhy areas as present-day inflation bites hard, | |
forcing many companies and organizations to take a much closer look at their | |
office operations. | |
.B | |
(Journal of Documentation, Vol. 36, No. 4, December 1980, pp. 313-341) | |
.I 111 | |
.T | |
Document Clustering Using an Inverted File Approach | |
.A | |
Willett, P. | |
.W | |
An automated document clustering procedure is described which does not | |
require the use of an inter-document similarity matrix and which is independent | |
of the order in which the documents are processed. The procedure makes use of | |
an initial set of clusters which is derived from certain of the terms in the | |
indexing vocabulary used to characterise the documents in the file. The | |
retrieval effectiveness obtained using the clustered file is compared with that | |
obtained from serial searching and from use of the single-linkage clustering | |
method. | |
.B | |
(Journal of Information Science, 2, 1980, pp. 222-231) | |
.I 112 | |
.T | |
A Fast Procedure for the Calculation of Similarity Coefficients in | |
in Automatic Classification | |
.A | |
Willett, P. | |
.W | |
A fast algorithm is described for comparing the lists of terms representing | |
documents in automatic classification experiments. The speed of the procedure | |
arises from the fact that all of the non-zero-valued coefficicents for a given | |
document are identified together, using an inverted file to the terms in the | |
document collection. The complexity and running time of the algorithm are | |
compared with previously described procedures. | |
.B | |
(Information Processing & Management, Vol. 17, No. 2, 1981, pp. 53-60) | |