Báo cáo khoa học: "Studies in Machine Translation—8: Manual for Postediting Russian Text"
lượt xem 2
download
The present study is a practical guide to editors who refine partially machine-translated text as a basis for linguistic analysis. The posteditors' tasks are: to code preferred English equivalents, to code English structural symbols, to resolve grammatic properties, and to code syntactic connections (dependencies).
Bình luận(0) Đăng nhập để gửi bình luận!
Nội dung Text: Báo cáo khoa học: "Studies in Machine Translation—8: Manual for Postediting Russian Text"
- [Mechanical Translation, Vol.6, November 1961] Studies in Machine Translation—8: Manual for Postediting Russian Text * by H. P. Edmundson†, K. E. Harper, D. G. Hays, and B. J. Scott Mathematics Division, The RAND Corporation The present study is a practical guide to editors who refine partially machine-translated text as a basis for linguistic analysis. The post- editors' tasks are: to code preferred English equivalents, to code English structural symbols, to resolve grammatic properties, and to code syntactic connections (dependencies). A general introduction to the field of ma- chine translation is contained in The RAND Corporation RM-2060. English equivalent (E 1, E 4) 1. Introduction Word number 1.1 GENERAL Special codes Alternative English equivalents (E 2, E 5, E 3, E 6) The present paper is one in a series which describes the methods now in use for research on machine trans- lation (MT) at The RAND Corporation. Postediting TABLE1 follows mechanical partial translation in the research process; the editor encodes changes to yield an accurate, THE RAND PUNCTUATION CODE readable English text, and encodes the structure of each sentence in preparation for linguistic analysis. The Symbol Printed Before an Occurrence Punctuation Mark present manual is based on studies of Russian physics and mathematics, but is presumably applicable to other textual materials, within the framework of the RAND 1 Start paragraph 2 Start sentence methodology. 3 Capital / Open parenthesis 1.2 WORKSHEET FORMAT 8 Open quotation The posteditor works from a text listing prepared on an IBM printer; a sample list is shown in Table 2. Each Symbol Printed occurrence in the Russian text occupies one line of the After an Occurrence Punctuation Mark listing; the following items of information are given for each occurrence: . Question mark Sequence number (S), consisting of: . Exclamation point - Hyphen Page number (PG) . Period Line number (L) , Comma Occurrence number (O) Dash a -- Punctuation before the occurrence (P 1) 9 Colon Russian form of the occurrence (may be transliter- ; Semicolon / Close parenthesis ated) 8 Close quotation Punctuation after the occurrence (P 2) Russian inflectional grammar code (G) a This mark must sometimes be interpreted as an arithmetic Sentence-sequence number (S'), consisting of: symbol (minus) or as the verb be. Sentence number (SN) Occurrence number in the sentence (ON) The RAND printing of punctuation adheres to gener- Coding space, for insertion of: ally accepted standards, within the limits set by the Dependency code (DC) number of characters on an IBM printer. Original English structural symbols (ESS) punctuation marks appear only after the Russian oc- Preferred English equivalent (PE) currence and are not repeated after English translation. Translation order (TO) The Russian grammar code is fully described in The RAND Corporation MT Study 6; the posteditor should * The research herein reported was performed with the support of USAF Project RAND. be familiar with it. † Presently at Ramo-Wooldridge, a Division of Thompson Ramo The first English equivalent (E 1) is generally an Wooldridge, Inc. H. P. Edmundson has co-authored all revisions of this manual up to but not including the present revision. accurate translation of the Russian form; the rough 63
- TABLE 2 POSTEDITOR'S WORKSHEET translation can be read by following this column down posteditor has examined text and selected the desired the page. English equivalent for the homographic occurrence, he The reader will note that alternative English equiva- replaces the original grammar-code symbol with the lents are sometimes printed in fields adjacent to the first symbol corresponding to his choice of English equiva- English equivalent. The alternative equivalents printed lent. to the right are sometimes preferable to the first; sub- When an idiom, i.e., a group of Russian forms trans- stitution is made by the reader or posteditor as necessary. lated as a group when they occur together in fixed On the worksheet, a homograph, i.e., Russian form sequence, is recognized by the computer program, the with two different grammar codes and corresponding English equivalent of the idiom is printed next to the English equivalents, occupies one line. The grammar- first form in the idiom. The English-equivalent fields of code symbol of a homograph contains ++ in the first subsequent forms within the idiom are blank. two positions; English equivalents appear in fields The special codes, printed at the right, convey in- E 1—E 4 and E 3—E 6, while appropriate individual formation about English grammar, inflection, etc.; see grammar codes appear in fields E 2 and E 5. After the The RAND Corporation MT Study 7 for details. 64
- The coding spaces are filled in by the computer and the right-hand margin, opposite the first form of the idiom. If an idiom is recognized by the computer, “—” the posteditors; Sections 2, 3 and 4 of the present Study is printed beside each form of the idiom. describe their content in detail. If, in non-idiomatic combinations, the posteditor wishes to omit translation of an occurrence, he writes 2. Choice of English Equivalents the numeral О in the coding space. (Examples of all coding rules are illustrated in Table 2.1 SELECTION 2.) For each occurrence in the Russian text, the posteditor selects an English equivalent. The posteditor must be 3. English Structural Symbols guided, first, by the customary criteria for translation: accuracy and readability. However, variation for the 3.1 ENGLISH MORPHOLOGY AND SYNTAX sake of stylistic excellence is not allowed; the posteditor The computer program that prints the worksheet also must expect the finished translation to be clear but dull. begins the conversion of Russian structural symbolism The order of the English equivalents on each line, from into English, but the posteditor must complete this task. 1 to 6, is such that E 1 is preferred more often than any The translation, when it leaves the posteditor, must be other. The posteditor must accept E 1 whenever it gives clear and readable in construction as well as in diction; a clear, accurate translation of the original text. The the main tools to be used are inflection and the inser- alternative equivalents are listed because they are oc- tion of English function words. casionally essential for accuracy; when one of the al- Inflections are rarely stored in the glossary; that is, ternative equivalents is definitely more accurate, it must the English equivalents stored in the glossary are usually be selected. The posteditor can also, when necessary, in canonical form. For example, the singular forms of insert new alternative equivalents and recognize new nouns, the infinitive forms of verbs (without to), and idioms or homographs. similar uninflected forms are usually stored. The English 2.2 CODING equivalent of a genitive Russian noun does not include, in the glossary, the preposition of, nor does the English The column of the coding space marked PE is reserved equivalent of a reflexive Russian verb include the for a one-position English-equivalent code. If the editor auxiliary verb is or are. chooses the first English equivalent, he does not mark As studies of Russian-English translation progress, the space. the computer program is improved; the work of the If he selects an alternative English equivalent, the posteditor diminishes correspondingly. The following posteditor writes 2, 3, etc., in the coding space, as ap- description of the posteditors' task assumes no modifi- propriate, using the numbers printed as column headings cation of glossary entries by the computer. Whatever on the worksheet. part of the work has been performed correctly by the To add an English equivalent, the posteditor writes computer is omitted by the posteditor, while any errors an asterisk (*) in the English-equivalent coding space, that the computer program has introduced are corrected and writes the new English equivalent in BLOCK by the insertion of accurate entries in the coding space. CAPITAL LETTERS in the right-hand margin. When the computer performs an inflection, it also prints When the posteditor selects the first form of a homo- a mark in the coding space. graph, he leaves the PE coding space blank. If he selects the English equivalent appearing in a field other 3.2 ENGLISH INFLECTIONS than E 1, the number of the field from which the equivalent was chosen is inserted in the coding space. The posteditor inflects the English equivalents, as To identify a new homograph, the posteditor writes necessary for accuracy and clarity, in any of the follow- H in the English equivalent coding space beside the ing ways: homographic form; no further coding is required. A new Nouns: plural. When a Russian noun occurs in plural English equivalent may be written in the margin if number, the English equivalent is coded to show necessary. plurality. When the posteditor accepts the first English equiv- Verbs: past tense. When a Russian verb occurs in the alent of an idiom, he leaves the PE coding space blank. simple past tense, its English equivalent is coded to If the second English equivalent is desired, the post- show past tense, except in constructions with бы. editor writes P beside each word in the idiom; if the Verbs: third person singular, present tense. When a third English equivalent is desired, the posteditor writes Russian verb occurs in third person singular, present Q beside each word in the idiom. tense, active voice, its English equivalent is coded to To identify a new idiom, i.e., one which is not show that s must be added. recognized by the computer program, the editor writes Verbs: present participle. When a Russian present A in the English equivalent coding space beside the first active participle occurs, or when the English equivalent form in the idiom, В beside the second, С beside the of a verb must be given in progressive form (e.g., is third, and so forth. He also writes the English equiv- going), the English equivalent is coded to show present alent of the idiom in BLOCK CAPITAL LETTERS in participle inflection. 65
- TABLE 3 Verbs: past (passive) participle. When a Russian verb occurs in a form which must be translated into passive SYMBOLS REPRESENTING PRONOUN SUBJECT INSERTION voice, the English equivalent is coded to show past Position 1 participle inflection. This category includes most Russian reflexive constructions, passive and reflexive participles, Pronoun Insertion Symbol and constructions with бы. Adjectives: comparative. When a Russian adjective It 1 occurs in comparative degree, the English equivalent There 2 is coded to show comparative inflection. I 3 Adjectives: superlative. When a Russian adjective We 4 occurs in superlative degree, the English equivalent is They 5 coded to show superlative inflection, unless the English Let Us 6 Who 7 equivalent is listed in superlative form. One 8 Adjectives: adverb. When a Russian adjective-adverb homograph occurs as an adverb, the English equivalent is coded to show adverbial inflection. Adjectives: comparative adverb. When a Russian adjective-adverb homograph occurs in comparative de- TABLE 4 gree as an adverb, the English equivalent is coded to show comparative-adverb inflection. CODE SYMBOLS FOR AUXILIARY VERB INSERTIONS Position 2 3.3 ENGLISH INSERTIONS The posteditor codes the insertion of an additional Code Symbol English word whenever necessary for accuracy or clarity, Auxiliary Verb Insertion choosing from the following list: Pronoun subjects: it, there, 1, we, they, let us, who, Are 1 one. One of these pronoun subjects is inserted whenever Was 2 a Russian sentence construction includes a verb with Were 3 Do 4 no subject, unless context makes the omission definitely Does 5 preferable in English. Did 6 Verb auxiliaries: are, was, were, do, does, did, will, Will 7 will be, be, am, being, is, to, to be. The verb auxiliaries Will Be 8 Be 9 are inserted to construct passive voice, negation, past Am A tense, future tense, or progressive form in English, as Being В required by the construction of the Russian sentence. Is С Articles: a, an, the. An article is inserted in English To D whenever it contributes to accuracy or clarity. The To Be E articles a and an are not distinguished. Connections: of, to, by, with, than, as, in, on. English connecting words must be inserted in the absence of a Russian equivalent, in two kinds of context situations: when the (oblique) case of a noun in Russian expresses TABLE 5 a relationship which can best be expressed by a pre- CODE SYMBOLS REPRESENTING INSERTION OF ARTICLES position or a conjunction in English; and when a Russian verb without a preposition requires a noun ob- AND ENGLISH CONNECTING WORDS ject which must be connected to the English-equivalent Position 3 verb by a preposition. In either case, the connecting word must be inserted by the posteditor. Article Insertion 3.4 ENGLISH STRUCTURAL-SYMBOL CODE Preposition Insertion None A, An The A four-position space is included on the worksheet for None + — coding English structural symbols. In the first position, Of 1 A J the posteditor codes pronoun subject insertions (see To 2 В К Table 3). In the second position, the posteditor codes By 3 С L With 4 D M auxiliary verb insertions (see Table 4). In the third Than 5 E N position, the posteditor codes insertions of articles and φ As 6 F prepositions (see Table 5). In the fourth position, the In 7 G P posteditor codes miscellaneous inflections: verbs, par- On 8 H Q ticiples, noun plurals, and adjective inflections (see From 9 I R Table 6). 66
- based on the assumption that every occurrence in a TABLE 6 sentence depends on some other occurrence in the same CODE SYMBOLS FOR MISCELLANEOUS INFLECTIONS sentence (except that one occurrence in each sentence Position 4 is independent). The concept of dependency is partly syntactic, partly semantic; the posteditor must have a Inflection Code Symbol good understanding of Russian grammar and a general familiarity with the subject matter of the scientific Short-form, neuter adjective/adverb perform- articles that are being analyzed if he is to do an accurate ing adverbial function (381) - job of coding sentence structure. The posteditor must Noun plural 3 adhere, as closely as possible, to the rules laid down in Positive comparative for adjectives and ad- this section, since the work of several posteditors is to verbs (modified by более) (addition of be compared. er) 4 Positive superlative for adjectives and ad- Syntactically, one occurrence depends on another verbs (modified by наиболее) (addi- if the inflection of the first depends on the nature of tion of est) 5 the second. Thus, it is generally said that a preposition Negative comparative for adjectives and ad- governs the case of its noun object; hence, a noun used verbs (modified by менее) (addition of as the object of a preposition depends on that preposi- er or less) 6 Negative superlative for adjectives and ad- tion. Semantically, one occurrence depends on another verbs (modified by наименее) (addition if the meaning of the first complements or modifies the of est) 7 meaning of the second. These definitions are related in Third person singular present tense for verbs a natural language, so it is not important to keep them ( addition of s) A distinct and to choose one or the other as a guide to Past tense for verbs (addition of ed) В Present participle verb from (addition of postediting. Both definitions can serve as guides to the ing) С task. Past participle verb form (addition of en) D The one general rule to be observed in postediting is that every occurrence must be coded as depending on one and only one other occurrence in the same sentence The tabulations of these codes are readily understood, —an exception to this rule is made for relative with the possible exception of Table 5. Whereas the clauses. One and only one occurrence in every sentence insertion of the article a or an is represented by the is independent. The style of Russian technical articles symbol “+”, and the insertion of the preposition of is sometimes permits two or more independent clauses to represented by the number 1, when insertion of both be joined without conjunctions, so that, in effect, two the article and preposition are required for the same sentences can be compressed into one. In such instances, occurrence, the symbol is not + 1 but A. This method the posteditor is free to establish two independent of symbol combination derives from the properties of occurrences in one sentence. IBM machines; when the letter A is punched into an IBM card, it is represented by two punches, “+” and 1 4.2 RESULTANT CODING in a single card column. Because usage is the factor finally determining the The posteditor must be careful to distinguish the properties of a word, the posteditor is required to re- characters G, С and 6 from one another; the numeral О solve grammar-code symbols appearing with Russian from the letter φ; the numeral 1 from the letter I; the occurrences on the print-out sheets. letters U and V from each other; and the numeral 5 Original grammar-code symbols are those appearing from the letter S. in the RAND glossary with each Russian form. Indi- The line on which the codes are written must be vidual words possess varying degrees of morphological determined in accordance with the following rules: and semantic ambiguity; further they may be capable of Verb inflections, pronoun-subject insertions, and fulfilling a multiplicity of syntactic functions. The ori- auxiliary-verb insertions must be coded on the line on ginal grammar-code symbol is designed to reflect the which the verb occurs. intrinsic ambiguity of a given form. Preposition insertions, article insertions, and noun Resultant grammar-code symbols are the symbols plural inflections must be coded on the line on which applied immediately above the original grammar code the noun occurs, even though the preposition or article symbol on the print-out sheet after ambiguity has been must actually be inserted before an adjective, for ex- resolved. Resolution is achieved mechanically whenever ample. possible, but final responsibility for the task must rest Adjective or adverb inflections must be coded on the with the posteditor. Only after examination of text is it line on which the inflected word appears. possible to determine the unique function of a given 4. Structural Coding occurrence. Resultant grammar-code symbols presently fall into 4.1 DEPENDENCY the following major categories: Sentence structure can be analyzed in many ways; one (a) Resultant symbols for nouns, pronouns, adjec- plan, which is convenient for the present research, is tival pronouns (part of speech A), and homographs. 67
- For example, the feminine substantive линии can only Assuredly, existing resultant codes will not suffice be imprecisely identified as a singular noun in the gen- for every possible function of an occurrence; their num- itive, dative or prepositional case, or as a plural noun in ber will continue to grow as more text is analyzed. either the nominative or accusative case. Assuming that 4.3 examination of text has allowed the editor to determine TENTATIVE DEPENDENCY RULES that an occurrence of линий is used as a singular noun The following rules are furnished as a guide; the list in the genitive case, the original symbol 23D is changed is not complete, since more rules will surely be added in to 230, a precise identification of both case and number the course of postediting large volumes of Russian text. of the occurrence. When necessary, the posteditor deviates from these In the case of homographs, after the posteditor has rules in order to adhere to the more general principles examined text and selected the desired English equiva- of completeness and syntactic-semantic consistency. lent for the entry, he replaces the ambiguous “+ +” Within the structure of a phrase or clause, it is use- by which the form is originally identified, with the ap- ful to distinguish the single occurrence which serves as propriate individual grammar code. its representative, or principal, element. This element (b) Resultant symbols for parts of speech serving as we shall call simply the main clement. As outlined governors of substantives. Included in this category are below, for prepositional phrases, the main element is verbs or participles acting as governors of substantives, the preposition. For clauses, the main element is norm- and substantives acting as governors of other substan- ally the verb or other verbal element (short-form ad- tives. The symbols are designed to show satisfaction jective or participle). In coding the dependency of of a function for which the word was originally coded. phrases and subordinate clauses, it is important to note Their application serves to establish complementation that the relationship is indicated through the depen- of the governing occurrence. dency of the main element of the governed structure on For example, assume that a verb originally coded to the most closely related element of the governing struc- take objects in both the accusative and dative cases is ture. found to be complemented only by a substantive in the Dependency-coding rules arc classified according to accusative case. The original symbol is changed to show part of speech. that possible complementation has been partially ful- Cardinal numbers. A cardinal number is generally filled. If, on the other hand, the occurrence is found to treated as an adjective; see below. Cardinals can also be have both direct and indirect objects the resultant used as nouns, e.g., Three were chosen. In such in- grammar-code symbol shows complete satisfaction of stances, they are assigned nominal dependency. the verbal complementation code. Ordinal numbers. An ordinal number is treated as an (c) English-equivalent selection-code symbols for adjective. prepositions, degree-marking adverbs and adjectives, Particles. Generally, a particle depends on the occur- auxiliary verbs and particles. The symbols determine rence whose meaning it modifies or intensifies. Modi- the selection of one among several possible English fying particles (бы, нибудь, будто, etc.) usually de- translations and indicate the syntactic function of the pend on the preceding word, while intensifying occurrence. For example, a preposition serving as the particles (даже, вплоть, же, etc.) may depend on the head word of a simple prepositional phrase may well preceding or the following element. be translated differently and serve a different syntactic When the particle пусть appears with a finite verb function than the same preposition serving as the head or short form adjective, it is said to be the independent word of an idiomatic occurrence. Similarly, when быть element. acts as an independent verb, its grammar-code symbol Pronouns. In general, pronouns are treated as nouns; must show it to have different properties than when it see below. However, relative pronouns (который, serves as an auxiliary. что, какой, etc.) have twofold functions. A relative (d) Resultant-code symbols for conjunctions and pronoun serves as a noun within a subordinate clause, relative pronouns (both as governor and dependent). and its nominal dependency must be encoded. The same Conjunction grammar-code symbols are refined to iden- pronoun also serves to connect the subordinate clause tify the occurrence as coordinating or subordinating, as with an element of the main clause of the sentence, and well as simple or compound. Certain words are capable the connection must be coded as well. Relative pro- of performing a conjunctive function alone, and they nouns, therefore, generate double dependencies. can also be used as one member of a larger conjunctive The first dependency of a relative pronoun is upon frame. Similarly, certain conjunctions may be used with the word that determines its case. For example, in the a given class of governor, or their use may have no such fragment . . . которая подтверждается, the relative restriction. It should be pointed out that the posteditor pronoun is in the nominative case since it is used as the must apply a resultant code to establish the governor of subject of the verb: 'which is confirmed.' Again, in the a subordinate conjunction, while governorship of a fragment у которого имеется, the relative pronoun is coordinate conjunction is not indicated by a grammar- object of the preposition у, and the prepositional phrase code symbol. Specific examples of resultant grammar modifies the verb: 'for which there exists.' The relative codes appear in 4.3, and are more completely attested pronoun can even modify a noun in the subordinate in the grammar code manual. clause: сущность которого хорошо известна = 68
- 'the substance of which is well known' The relative homographs are resolved as either adjectives or nouns, pronoun depends first upon the verb, the preposition, depending upon the function of the occurrence. the noun, etc., that governs its nominal function within A short-form predicate adjective can serve as the the subordinate clause. independent element of the sentence. For example, in The second dependency of a relative pronoun is upon a sentence of the type человек умен — 'the man is wise' its antecedent outside the subordinate clause. For ex- the adjective is independent and receives a resultant ample, in the fragment фосфора, у которого имеется grammar-code symbol to indicate its subject-taking = 'phosphorus, for which there exists', the pronoun de- function. pends on фосфора as antecedent. The pronoun 'который Long-form adjectival predicates depend on the must agree with its antecedent in number and gender. nouns which they modify. The antecedent of что, when this word is used as a Participles. Active and passive participles acting as relative pronoun, is an entire clause, so agreement is noun modifiers are usually treated as adjectives. When irrelevant: Кривая принимает новый вйд, что an active reflexive participle modifies a noun, its gram- указывает на ...--= 'the curve assumes a new form, mar-code symbol is converted to that of a passive parti- which indicates . . .'.The first governor of что is the ciple, while an active participle that follows the noun it main element of the subordinate clause; the second gov- modifies is classed as a gerund. This transformation is ernor of что is the main element of the independent effected to indicate more clearly the syntactic function of clause. The second dependency of any relative pronoun, such occurrences. however, ties the subordinate clause into the sentence. Short-form passive participles appearing with быть Nouns. A noun in the nominative case, serving as the in modal constructions are considered to be indepen- subject of a sentence, can depend upon a finite verb, a dent. However, long-form passive participles appearing shortform adjective or participle, or other predicate in the same construction are dependent on быть. element. In a sentence such as X—функция = 'X is a Verbs. A verb is normally the independent element function, for example, the symbol X is treated as a noun of the sentence, or the main element of a dependent in the nominative case and is the independent element. clause. In the latter instance, it depends secondarily A noun in the genitive case, and occasionally in an- on a subordinate conjunction such as если = 'if’ or other oblique case, can serve as the complement of an- хотя = 'although' or on a relative adverb. other noun. For example, частиц depends on поле in In constructions utilizing a modal (e.g., можно, the phrase поле неподвижных частиц = 'field of the легко) plus an infinitive, the infinitive is considered the fixed particles'; частиц and нуклонами depend on main element in the clause and is said to govern the рассеяние in the phrase рассеяние частиц modal. In such constructions, and in impersonal con- нуклонами = 'diffusion of the particles by nucleons'. structions, a direct object is said to depend upon the Several nouns have been given grammar codes which infinitive. Thus, условие depends upon the infinitive in indicate they can act as governors of other nouns. For следует определить условие = 'it is necessary to example, рассеяние is coded to take complements in determine the condition’ as well as in мы можем both genitive and instrumental cases. When genitive определить условие = 'we can determine the con- complementation has occurred, the symbol is changed ditions' Also, мы depends on определить rather than to show complementation has been partially satisfied; on можем. when both genitive and instrumental modifiers have In constructions utilizing a modal, the auxiliary occurred, the complementation code is blanked out. A infinitive быть, and a short form past passive participle, complete list of noun complementation types and re- sultant symbols appears in the grammar code manual. the modal depends upon быть, which depends upon A noun in an oblique case can be governed by a verb, the participial form—the independent element of the an active or passive participle, a preposition, a short- chain. If, however, the auxiliary is used in either past form or comparative adjective, etc. Note that several or future form (e.g., было or будет), it serves simply nouns in a given sentence can be governed by the same as a tense marker and is made to depend upon the verb; one in the nominative case, one in the accusative, modal. etc. However, if two or more nouns are used as subjects, Original grammar-code symbols of verbs are con- direct objects, etc., of the same verb, the rules of con- verted to resolve subject-taking and complementation junction apply; see below. When the original grammar- functions of the occurrence. These code symbols are code symbol of the noun is ambiguous, it is resolved to attested in the grammar code manual. show the actual function of the occurrence in text (i.e., Prepositions. A preposition and its noun complement subject, object, etc.). (together with any dependents of the noun) form a Adjectives. normally, a long-form adjective depends prepositional phrase; the phrase is a modifier and is on the noun with which it agrees. It should be pointed similar in function to an adjective or an adverb. The out that several adjective/pronoun homograph forms preposition is said to depend on the occurrence that is have been formally designated as part of speech A. The modified by the phrase; this element can be a noun, grammar-code symbols of such words are converted verb, active or passive participle, adjective, adverb, to show the adjectival or pronominal qualities of the oc- pronoun, or cardinal number. When a prepositional currence, as the case may be. Long-form adjective/noun phrase appears to modify an entire sentence or clause, 69
- the preposition depends on the main element of the in turn shows dependency on the main element in the sentence or clause. independent clause. When the title of an article is a prepositional phrase, Compound subordinating conjunctions consist usually e.g., О взаимодействии антипротонов с ядрами of two words так как = 'since', так что = 'so that', = 'On the interaction of antiprotons with nuclei', the тогда как = 'whereas'; or of a unit involving a pre- preposition is said to be the independent element. positional phrase: для того, чтобы — 'in order', в The posteditor is expected to resolve the 4th and 5th том, что = 'in/of the fact that', после того, как position grammar code of prepositions if this has not = 'after'. Each element of the combination is said to been correctly done by machine. depend on the preceding element within the combina- Adverbs. Ordinarily, an adverb depends on a verb, tion; the first element depends on the element of the adjective, or other adverb. Relative adverbs introduce modified clause to which it is most directly related, dependent clauses (the clause can modify a noun, verb, and the main element of the subordinate clause depends etc.); the relative adverb depends first on the main on the last element of the combination. element in the dependent clause, second on the proper The conjunctions тот же . . . что и = 'the same as', element in the modified clause. The main element in the and такой же ... как и = 'the same ... as' are dependent clause is primarily independent, but sec- idiomatic and generally indicate ellipsis of elements ondarily it depends on the relative adverb. within the sentence structure. RAND studies have de- Conjunction. Coordinate conjunctions, such as и = termined that the construction is used to conjoin two 'and', или = 'or', connect elements of the sentence that subjects of a single verb, two clausal modifiers or a are similar in structure and identical in function. The clausal modifier and a transform of the clausal modifier conjunction is said to join two words, two phrases, or used as the subject. Dependency is most conveniently two clauses. In such instances, the sections joined must established through что—и appears to have little syn- be developed so that the conjunction depends on the tactic significance for the construction. main element in the following section, and the main Resultant grammar-code symbols identify conjunc- element in the preceding section depends on the con- tions as coordinating or subordinating, as a single junction. occurrence or part of an idiomatic frame, etc. Inter- In a sequence of coordinate elements (e.g., A, B, phrasal/inter-clausal behavior of this part of speech is and C), all the elements except the last depend on the more fully documented in the grammar code manual. conjunction, and the conjunction depends on the last Symbols. A symbol that is hyphenated to a noun element. If there is no conjunction in the sequence, as (e.g., х-функция] depends on the noun. Otherwise, a in a series of equations, all elements except the last de- symbol is treated in a manner consistent with its be- pend on the last element. havior in the sentence. Such coordinating conjunctions as либо ... либо Equations. An equation can be used as a noun, as a = 'either . . . or', ни . .. ни = 'neither . . . nor', and clause, etc.; the editor determines the function of each 'как . . . так и . . . = 'both . . . and' form idiomatic occurrence and treats it as required by the foregoing conjunctive frames, connecting semantically parallel rules. words or phrases. The main elements, which are of 4.4 ELLIPSIS similar form and identical function and follow the two units of the conjunction, must be located. Then the first A common construction in Russian, especially fre- unit of the conjunction and the main element of the quent in the scientific text for which this handbook is first phrase depend on the second unit of the conjunc- to be used, is the conjunction of two or more phrases or tion, which in turn depends on the main element of clauses with omission, or ellipsis, of key words in the second phrase. For example, in the construction, как repetition. For example, the author may write в линия, так и точка = 'both the line and the point', результате столкновения нуклона с дейтроном и как and линия depend upon так, which depends on дейтрона с ядрами = 'as a result of the collision of точка. и here is functionally little more than a particle a nucleon with a deuteron and of a deuteron with depending on так. Similarly, in the construction не nuclei', omitting столкновения after the conjunction. только, Х, но и Y, не depends on только, which de- Another example is функции А, В нормированы на pends on но; X depends on но, as does the particle u, единицу объема, функция С—на единицу = 'The and но depends on Y. functions А, В are normalized to unit volume, function Simple subordinating conjunctions, such as хотя = С to unity'. In the latter sentence, ellipsis of 'although', если = 'if', причем = 'whereas', introduce нормирована is indicated by the dash. dependent clauses. The conjunction depends on the The importance of the ellipsis is suggested by the main element in the modified clause, and the main ele- fact that на must be referred to its governor and to its ment in the subordinate clause depends on the con- dependent for accurate translation. junction. The structure of a sentence containing an ellipsis is The double conjunction если . . ., mo ... = 'If..., restored by the posteditor to non-elliptic form. The then . . .' conjoins two clauses of unequal value. The missing word or phrase is re-entered and dependencies main element in the dependent clause depends on если; are described as if it were present. Thus, in the first если is made to depend on the conjunction mo, which example above, the conjunction и joins two occurrences 70
- of столкновения one real and one fictitious. The real N1. The posteditor writes OO in the coding space of occurrence governs нуклона and с дейтроном, while the independent occurrence in each sentence. the fictitious occurrence governs дейтрона and с In the case of a subordinate clause, the posteditor is ядрами. In the second example, there is a conjunction of required to reflect the dual dependency of both the two clauses: функции А, В нормированы на единицу introductory element and the verbal element in the объема and функция С нормирована на единицу. clause that the relative introduces. He does so by writ- Once the omitted element has been restored, the struc- ing an asterisk in the coding space for each such occur- ture is obvious; it can be determined by the rules of rence, and by writing two dependency symbols on the Section 4. 3. extreme right-hand margin of the sheet; the occurrence number of the first governor is written first and followed 4.5 CODING by the occurrence number of the second governor. The The first portion of the coding space (DC) is used for a same plan is followed in every subordinate clause. two-position dependency code. For all but one of the To restore an elliptically omitted word, the posteditor occurrences in a sentence, the posteditor indicates de- adds a line on the worksheet at the end of the sentence. pendence on another occurrence in the same sentence. Page number, line number, Russian form, Russian in- One occurrence in each sentence is coded as indepen- flectional grammar-code symbol, Russian resultant dent, except in a complex sentence or a sentence con- grammar-code symbol, sentence number, occurrence sisting of two or more complete clauses separated by number (1E, 2E, 3E, etc., for several ellipses within a commas. sentence), dependency-code symbol and word number Within each article, the computer assigns sequence must all be filled in. The dependency-code symbol for a numbers to sentences, and within each sentence, it as- restored word is the occurrence number of the word on signs sequence numbers to occurrences. The two-digit which it would have depended if it had actually oc- occurrence-within-sentence number is used for depen- curred. The words depending on the restored word have dency coding. If occurrence N1 depends on occurrence dependency symbols 1E (2E, 3E, etc., if they depend N2, the posteditor writes N2 in the coding space on line on the second, third, etc., restored words). Received January 18, 1960 71
CÓ THỂ BẠN MUỐN DOWNLOAD
-
báo cáo khoa học: " Study of ‘Redhaven’ peach and its white-fleshed mutant suggests a key role of CCD4 carotenoid dioxygenase in carotenoid and norisoprenoid volatile metabolism"
14 p | 65 | 7
-
Báo cáo khoa học: Study protocol for the translating research in elder care (TREC): building context through case studies in long-term care project (project two)
10 p | 55 | 5
-
Báo cáo khoa học: "Neoangiogenesis in early cervical cancer: Correlation between color Doppler findings and risk factors. A prospective observational study"
7 p | 47 | 5
-
Báo cáo khoa học: "Changes in appetite related gut hormones in intensive care unit patients: a pilot cohort study"
9 p | 62 | 5
-
báo cáo khoa học:" Oynophagia in patients after dental extraction: surface electromyography study"
10 p | 48 | 5
-
báo cáo khoa học: " Study protocol for the translating research in elder care (TREC): building context – an organizational monitoring program in long-term care project (project one)"
13 p | 61 | 4
-
báo cáo khoa học:" Pain in castration-resistant prostate cancer with bone metastases: a qualitative study"
11 p | 55 | 4
-
báo cáo khoa học: " Study protocol: Cost effectiveness of two strategies to implement the NVOG guidelines on hypertension in pregnancy: An innovative strategy including a computerised decision support system compared to a common strategy of professional audit and feedback, a randomized controlled trial"
7 p | 47 | 4
-
báo cáo khoa học:" Study of quality of life and its determinants in patients after urinary stone fragmentation"
6 p | 43 | 4
-
báo cáo khoa học:" Disability in activities of daily living, depression, and quality of life among older medical ICU survivors: a prospective cohort study"
10 p | 46 | 4
-
báo cáo khoa học: " Transformational leadership, transnational culture and political competence in globalizing health care services: a case study of Jordan's King Hussein Cancer Center"
13 p | 122 | 3
-
Báo cáo khoa học: "Relapse in resected lung cancer revisited: does intensified follow up really matter? A prospective study"
11 p | 40 | 3
-
báo cáo khoa học: "Metabolome in schizophrenia and other psychotic disorders: a general population-based study"
14 p | 46 | 3
-
Báo cáo khoa hoc:" Study of the impact of perilipin polymorphisms in a French population"
6 p | 37 | 3
-
báo cáo khoa học: " Study protocol: the DESPATCH study: Delivering stroke prevention for patients with atrial fibrillation - a cluster randomised controlled trial in primary healthcare"
13 p | 56 | 3
-
báo cáo khoa học: "Safety in home care: A research protocol for studying medication management"
9 p | 40 | 3
-
Báo cáo khoa học: "The in vivo study on the radiobiologic effect of prolonged delivery time to tumor control in C57BL mice implanted with Lewis lung cancer"
6 p | 51 | 3
-
Báo cáo khoa học: " Genetic predisposition to chikungunya – a blood group study in chikungunya affected families"
3 p | 60 | 3
Chịu trách nhiệm nội dung:
Nguyễn Công Hà - Giám đốc Công ty TNHH TÀI LIỆU TRỰC TUYẾN VI NA
LIÊN HỆ
Địa chỉ: P402, 54A Nơ Trang Long, Phường 14, Q.Bình Thạnh, TP.HCM
Hotline: 093 303 0098
Email: support@tailieu.vn