Analýza statistických dat při korpusovém výzkumu víceslovných předložkových jednotek s místním významem (na materiálu korpusu SYN2010)
Aksana Mikalayenka
Filozofická fakulta Univerzity Karlovy v Praze
<kust_alive@mail.ru>
Abstract:
Analysis of the statistical data obtained through the corpus research of multi-word prepositional units with spatial relations (based on examples from Corpus SYN2010)
The article deals with analysis of the statistical data obtained through the corpus research of multi-word prepositional units requiring thorough study. These multi-word units consist of spatial adverbs and simple prepositions and are used in speech as spatial prepositions (for example, such units as blízko k, blízko u, daleko od, nedaleko od, vysoko nad). By means of excerpting from Czech grammar guides a list of spatial adverbs has been compiled. These spatial adverbs can be components of multi-word prepositional units. At the present moment this list includes more than a hundred adverbs. By using the search system of Corpus SYN2010 it has been found out if these spatial adverbs are used or not with simple prepositions the given adverbs are used with and we can answer the question Are these combinations coincidental or fixed? So a number of the specific spatial prepositional units adverb + preposition has been found. At the present moment this number is equal to 80 units. Moreover, the information about absolute and relative frequency of the given units has been obtained. The given units have been analysed with relation to their quantity. Finally, the conclusion on the role of quantitative analysis in the research of these prepositional units has been drawn.
Klíčová slova / key words:
Český národní korpus, lingvistická statistika, víceslovné předložky
Czech national corpus, multi-word prepositions, statistical linguistics