
This page pertains to UD version 2.

UD Russian Poetry

Language: Russian (code: ru)
Family: Indo-European, Slavic

This treebank has been part of Universal Dependencies since the UD v2.13 release.

The following people have contributed to making this treebank part of UD: Olga Lyashevskaya, Natalia Vlasova, Dmitri Sitchinava.

Repository: UD_Russian-Poetry
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.13

License: CC BY-SA 4.0

Genre: poetry

Questions, comments? General annotation questions (either Russian-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on GitHub. If you want to collaborate, please contact [olesar (æt) yandex • ru]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.

Annotation Source
Lemmas: annotated manually
UPOS: annotated manually, natively in UD style
XPOS: assigned by a program, not checked manually
Features: annotated manually, natively in UD style
Relations: annotated manually, natively in UD style

Description

UD_Russian-Poetry contains samples of Russian poetry written from the 19th to the early 21st century. The treebank is based on the Poetry Corpus of the Russian National Corpus (RNC; https://ruscorpora.ru/s/elRGl). The initial annotation, following the RNC/UD-ext morphological schema and the UD dependency schema, was created with the Rubic BERT-based transformer (Lyashevskaya et al. 2023) and then manually corrected. The annotations were converted into UD 2.0 format and additionally checked. The treebank retains the original versological annotation of the RNC Poetry Corpus, covering rhyme zones and metrical properties of the verse (see the MISC column).
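
Since the versological annotation is stored in the MISC column, a reader may want to pull that column out of a CoNLL-U file. The following is a minimal sketch, assuming the standard 10-column CoNLL-U layout with MISC as the last column and `Key=Value` pairs separated by `|`. The attribute names `KeyA` and `KeyB` and the sample annotation are placeholders invented for illustration; the actual keys used in this treebank should be checked in the data itself.

```python
from typing import Dict, Iterator, List, Tuple


def parse_misc(misc: str) -> Dict[str, str]:
    """Split a CoNLL-U MISC cell ("A=1|B=2") into a dict; "_" means empty."""
    if not misc or misc == "_":
        return {}
    pairs: Dict[str, str] = {}
    for item in misc.split("|"):
        key, sep, value = item.partition("=")
        pairs[key] = value if sep else ""
    return pairs


def iter_tokens(conllu_text: str) -> Iterator[List[str]]:
    """Yield the 10 columns of every token line, skipping comments and blanks."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) == 10:
            yield cols


def misc_by_token(conllu_text: str) -> List[Tuple[str, Dict[str, str]]]:
    """Return (word form, parsed MISC attributes) for every token."""
    return [(cols[1], parse_misc(cols[9])) for cols in iter_tokens(conllu_text)]


# Hypothetical sample; KeyA/KeyB are placeholder attribute names.
SAMPLE = "\n".join([
    "# text = Мороз и солнце",
    "\t".join(["1", "Мороз", "мороз", "NOUN", "_", "_", "0", "root", "_", "_"]),
    "\t".join(["2", "и", "и", "CCONJ", "_", "_", "3", "cc", "_", "_"]),
    "\t".join(["3", "солнце", "солнце", "NOUN", "_", "_", "1", "conj", "_", "KeyA=x|KeyB=y"]),
])

for form, attrs in misc_by_token(SAMPLE):
    print(form, attrs)
```

Parsing MISC into a plain dictionary avoids hard-coding any attribute names, so the same helper works regardless of how the rhyme and metre properties are labelled.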

Acknowledgments

We wish to thank all of the contributors to the RNC Poetry Corpus collection and annotation effort, and especially Vladimir Plungian and Kirill Korchagin.

References

Statistics of UD Russian Poetry

POS Tags

ADJ, ADP, ADV, AUX, CCONJ, DET, INTJ, NOUN, NUM, PART, PRON, PROPN, PUNCT, SCONJ, SYM, VERB, X

Features

Abbr, Animacy, Aspect, Case, Degree, Foreign, Gender, InflClass, Mood, NameType, Number, NumForm, NumType, Person, Polarity, Poss, PronType, Reflex, Tense, Typo, Variant, VerbForm, Voice

Relations

acl, acl:relcl, advcl, advmod, amod, appos, aux, aux:pass, case, cc, ccomp, compound, conj, cop, csubj, csubj:pass, dep, det, discourse, dislocated, expl, fixed, flat, flat:foreign, flat:name, goeswith, iobj, list, mark, nmod, nsubj, nsubj:outer, nsubj:pass, nummod, nummod:gov, obj, obl, obl:agent, obl:depict, obl:float, obl:pronmod, obl:tmod, orphan, parataxis, parataxis:discourse, punct, root, vocative, xcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).
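
The restriction above (verb parent, noun or pronoun child) can be sketched as a simple filter over CoNLL-U data. This is an illustrative sketch, not the script used to produce the UD statistics: it assumes that "nouns or pronouns" means the UPOS tags NOUN and PRON (PROPN is left out, which is an assumption), and the function names and the sample sentence with its annotation are hypothetical.

```python
from collections import Counter

# Assumption: only NOUN and PRON count as "nouns or pronouns"; PROPN is excluded.
NOMINAL_UPOS = {"NOUN", "PRON"}


def sentences(conllu_text):
    """Yield each sentence as a list of 10-column token rows."""
    rows = []
    for line in conllu_text.splitlines():
        if not line.strip():
            if rows:
                yield rows
                rows = []
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep plain word lines only; skip multiword ranges ("1-2") and empty nodes ("1.1").
        if len(cols) == 10 and cols[0].isdigit():
            rows.append(cols)
    if rows:
        yield rows


def verb_nominal_relations(conllu_text):
    """Count relation labels on NOUN/PRON tokens whose head is a VERB."""
    counts = Counter()
    for rows in sentences(conllu_text):
        upos_by_id = {cols[0]: cols[3] for cols in rows}
        for cols in rows:
            if cols[3] in NOMINAL_UPOS and upos_by_id.get(cols[6]) == "VERB":
                counts[cols[7]] += 1
    return counts


# Hypothetical sample sentence with an illustrative annotation.
EXAMPLE = "\n".join([
    "# text = Я помню чудное мгновенье",
    "\t".join(["1", "Я", "я", "PRON", "_", "_", "2", "nsubj", "_", "_"]),
    "\t".join(["2", "помню", "помнить", "VERB", "_", "_", "0", "root", "_", "_"]),
    "\t".join(["3", "чудное", "чудный", "ADJ", "_", "_", "4", "amod", "_", "_"]),
    "\t".join(["4", "мгновенье", "мгновенье", "NOUN", "_", "_", "2", "obj", "_", "_"]),
    "",
])

print(verb_nominal_relations(EXAMPLE))
```

In the sample, the adjective is ignored because it is not a nominal, and the root verb is ignored because its head is not a verb, so only the subject and object relations are counted.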

Verbs with Reflexive Core Objects

Relations Overview