INFORMATION RETRIEVAL LANGUAGE

From the Big Medical Encyclopedia

INFORMATION RETRIEVAL LANGUAGE is an artificial language intended for describing the semantic content of texts and for expressing the semantic content of queries, so that information search can be carried out in an information retrieval system. Any abstract information retrieval language is defined by three components: an alphabet (a set of initial symbols), formation rules, and interpretation rules. The formation rules define the procedures for constructing the words and expressions of the language. The interpretation rules assign a definite meaning to these words and expressions; in effect they specify a bilingual dictionary in which each lexical unit of the information retrieval language is put in correspondence with a word or expression of a natural language, and vice versa.

An information retrieval language is built on the basis of some natural language. In the process, the semantic ambiguity of the vocabulary is eliminated, so that in such a language each concept is expressed by one and only one word, and each word in turn expresses one and only one concept. The grammar is constructed strictly formally, so that each expression of the language admits only one interpretation. Constructing the language in this way makes it convenient for the algorithmic comparison and matching of records of the main semantic content of documents with the semantic content of queries. When a specific information retrieval language is developed, account is taken of the specifics of the field for which the language is created, the nature of the information needs that the information retrieval system (see) is designed to satisfy, and the features of the texts that form its search array.
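The one-to-one requirement between words and concepts can be checked mechanically. The following is a minimal sketch in Python, not part of the original article; the function name find_ambiguities and the sample vocabulary are hypothetical and serve only to illustrate the idea.

```python
from typing import Dict, Iterable, Set, Tuple


def find_ambiguities(
    pairs: Iterable[Tuple[str, str]]
) -> Tuple[Dict[str, Set[str]], Dict[str, Set[str]]]:
    """Report violations of the one-word-one-concept rule.

    `pairs` holds (word_of_the_language, concept) entries of the
    interpretation dictionary. Returns two mappings:
    words that denote several concepts, and concepts that are
    expressed by several words.
    """
    word_to_concepts: Dict[str, Set[str]] = {}
    concept_to_words: Dict[str, Set[str]] = {}
    for word, concept in pairs:
        word_to_concepts.setdefault(word, set()).add(concept)
        concept_to_words.setdefault(concept, set()).add(word)

    # A word with more than one concept is ambiguous (homonymy).
    homonyms = {w: c for w, c in word_to_concepts.items() if len(c) > 1}
    # A concept with more than one word is redundant (synonymy).
    synonyms = {c: w for c, w in concept_to_words.items() if len(w) > 1}
    return homonyms, synonyms


# Hypothetical vocabulary, deliberately containing both kinds of defect.
vocabulary = [
    ("HEPATITIS", "inflammation of the liver"),
    ("RENAL FAILURE", "kidney failure"),
    ("KIDNEY FAILURE", "kidney failure"),  # second word for one concept
    ("CULTURE", "microbial culture"),
    ("CULTURE", "human culture"),          # one word, two concepts
]

homonyms, synonyms = find_ambiguities(vocabulary)
```

A vocabulary satisfies the requirement described above only when both returned mappings are empty.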

Three main types of information retrieval language are in use: library and bibliographic classifications, alphabetical subject classifications, and descriptor-type languages (see Descriptor). The first two types do not allow data to be searched by an arbitrary combination of features (characteristics) that was not specified in advance. For this reason, descriptor languages have been widely adopted since the early 1950s. They are based on the method of coordinate indexing, which rests on the idea that the main semantic content of a document or a query can be expressed with sufficient accuracy and completeness by a list of descriptors.
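The idea of coordinate indexing can be illustrated with a short Python sketch. It is an assumption-laden illustration, not a description of any particular system: the document identifiers, the descriptors, and the function names build_index and search are all hypothetical. Each document is represented by its list of descriptors, and a query is answered by intersecting the sets of documents assigned to each query descriptor, so any combination of descriptors can be searched without being fixed in advance.

```python
from typing import Dict, Iterable, Set


def build_index(documents: Dict[str, Set[str]]) -> Dict[str, Set[str]]:
    """Invert document -> descriptors into descriptor -> documents."""
    index: Dict[str, Set[str]] = {}
    for doc_id, descriptors in documents.items():
        for descriptor in descriptors:
            index.setdefault(descriptor, set()).add(doc_id)
    return index


def search(index: Dict[str, Set[str]], query: Iterable[str]) -> Set[str]:
    """Return documents indexed with every descriptor in the query."""
    postings = [index.get(descriptor, set()) for descriptor in query]
    if not postings:
        return set()  # an empty query matches nothing in this sketch
    return set.intersection(*postings)


# Hypothetical search array: each document is a list of descriptors.
documents = {
    "doc1": {"HEART", "SURGERY", "CHILD"},
    "doc2": {"HEART", "PHARMACOLOGY"},
    "doc3": {"SURGERY", "CHILD"},
}
index = build_index(documents)

heart_surgery = search(index, ["HEART", "SURGERY"])
child_surgery = search(index, ["SURGERY", "CHILD"])
no_match = search(index, ["HEART", "UNKNOWN"])
```

Here the query HEART and SURGERY selects only doc1, while SURGERY and CHILD selects doc1 and doc3, although neither combination was defined beforehand as a classification heading.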

One of the main difficulties encountered in information search is translating users' requests into the information retrieval language. This difficulty arises because requesters express their needs inadequately in words; it can be reduced considerably by creating a dictionary that makes it possible to find, from the sense of a request, the words (descriptors) needed to express that sense. Such dictionaries are called thesauri. A thesaurus is a reference dictionary that reflects the semantic relations between the lexical units of an information retrieval language. In it, the descriptors are systematized by meaning and all the semantic relations between them are indicated.
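A thesaurus of this kind can be sketched as a small data structure in Python. This is only an illustration under stated assumptions: the class names, the single broader/narrower relation, and the sample medical terms are hypothetical, and a real thesaurus records more kinds of relations. The sketch maps a requester's natural-language terms to descriptors and keeps the hierarchical links between descriptors.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List, Set, Tuple


@dataclass
class ThesaurusEntry:
    descriptor: str
    broader: Set[str] = field(default_factory=set)
    narrower: Set[str] = field(default_factory=set)


class Thesaurus:
    def __init__(self) -> None:
        self.entries: Dict[str, ThesaurusEntry] = {}
        # Natural-language term (lower case) -> descriptor.
        self.lead_in: Dict[str, str] = {}

    def add_descriptor(
        self,
        descriptor: str,
        synonyms: Iterable[str] = (),
        broader: Iterable[str] = (),
    ) -> None:
        entry = self.entries.setdefault(descriptor, ThesaurusEntry(descriptor))
        self.lead_in[descriptor.lower()] = descriptor
        for term in synonyms:
            self.lead_in[term.lower()] = descriptor
        for parent in broader:
            # Record the link in both directions.
            entry.broader.add(parent)
            parent_entry = self.entries.setdefault(parent, ThesaurusEntry(parent))
            parent_entry.narrower.add(descriptor)
            self.lead_in.setdefault(parent.lower(), parent)

    def translate(self, terms: Iterable[str]) -> Tuple[List[str], List[str]]:
        """Map request terms to descriptors; return (descriptors, unknown)."""
        found: List[str] = []
        unknown: List[str] = []
        for term in terms:
            descriptor = self.lead_in.get(term.strip().lower())
            if descriptor is None:
                unknown.append(term)
            elif descriptor not in found:
                found.append(descriptor)
        return found, unknown


# Hypothetical entries for illustration only.
thesaurus = Thesaurus()
thesaurus.add_descriptor(
    "MYOCARDIAL INFARCTION",
    synonyms=["heart attack"],
    broader=["HEART DISEASES"],
)

descriptors, unknown = thesaurus.translate(
    ["Heart attack", "heart diseases", "flu"]
)
```

In this example the everyday term "heart attack" is translated to the descriptor MYOCARDIAL INFARCTION, and the term "flu", which has no entry, is returned separately so that the requester can rephrase it.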


Bibliography: Blochin Yu. I. Classification and Coding of Technical and Economic Information, Moscow, 1976; Kopylov V. A. Creation of Automated Information Retrieval Systems, Moscow, 1974; Mikhaylov A. I., Chernyi A. I. and Gilyarevsky R. S. Fundamentals of Information Science, Moscow, 1968, bibliogr.; idem, Scientific Communications and Informatics, Moscow, 1976; Moskovich V. A. Information Languages, Moscow, 1971, bibliogr.

L. Mikulich.
