home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Poetry: POS Tags: VERB

There are 2806 VERB lemmas (28%), 5440 VERB types (30%) and 8234 VERB tokens (13%). Out of 17 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: знать, быть, нет, идти, любить, мочь, жить, стать, видеть, петь

The 10 most frequent VERB types: нет, знаю, может, надо, стоит, быть, жить, есть, люблю, вижу

The 10 most frequent ambiguous lemmas: знать (VERB 111, NOUN 1), быть (AUX 236, VERB 102), нет (VERB 85, PART 33), мочь (VERB 81, NOUN 1), стать (VERB 66, NOUN 1), надо (VERB 33, ADP 10), пасть (VERB 11, NOUN 5), пора (NOUN 37, VERB 10), пропасть (VERB 4, NOUN 2), лень (NOUN 4, VERB 2)

The 10 most frequent ambiguous types: нет (VERB 73, PART 14), надо (VERB 30, ADP 10), быть (VERB 25, AUX 16), есть (VERB 21, AUX 2), был (AUX 50, VERB 5), будет (AUX 31, VERB 8), жил (VERB 9, NOUN 1), пора (VERB 9, NOUN 5), было (AUX 27, VERB 8), стали (VERB 7, NOUN 5)

Morphology

The form / lemma ratio of VERB is 1.938703 (the average of all parts of speech is 1.831021).

The 1st highest number of forms (18) was observed with the lemma “забыть”: Забыты, забудем, забудешь, забуду, забудут, забудь, забыв, забывший, забыл, забыла, забыли, забытая, забытое, забытой, забытые, забытый, забытых, забыть.

The 2nd highest number of forms (17) was observed with the lemma “знать”: Знала, знавшие, знаем, знает, знаете, знаешь, знай, знайте, знал, знали, знать, знаю, знают, знающей, знающем, знающие, зная.

The 3rd highest number of forms (15) was observed with the lemma “идти”: Идучи, Идущей, Идя, Идёт, идем, идет, иди, идти, иду, идут, идущих, шел, шла, шли, шёл.

VERB occurs with 14 features: VerbForm (8051; 98% instances), Voice (8051; 98% instances), Aspect (7949; 97% instances), Tense (6766; 82% instances), Number (6720; 82% instances), Mood (5758; 70% instances), Person (3682; 45% instances), Gender (2263; 27% instances), Case (720; 9% instances), Variant (247; 3% instances), Polarity (114; 1% instances), Animacy (88; 1% instances), Reflex (35; 0% instances), Typo (4; 0% instances)

VERB occurs with 34 feature-value pairs: Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Reflex=Yes, Tense=Fut, Tense=Past, Tense=Pres, Typo=Yes, Variant=Short, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Mid, Voice=Pass

VERB occurs with 183 feature combinations. The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (944 tokens). Examples: может, стоит, поет, знает, пахнет, проходит, идет, значит, смотрит, зовет

Relations

VERB nodes are attached to their parents using 21 different relations: root (3271; 40% instances), conj (2212; 27% instances), advcl (750; 9% instances), parataxis (447; 5% instances), acl (353; 4% instances), amod (325; 4% instances), xcomp (313; 4% instances), acl:relcl (162; 2% instances), csubj (140; 2% instances), ccomp (136; 2% instances), parataxis:discourse (66; 1% instances), nsubj (11; 0% instances), obj (11; 0% instances), csubj:pass (8; 0% instances), nmod (8; 0% instances), iobj (5; 0% instances), fixed (4; 0% instances), obl (4; 0% instances), obl:depict (4; 0% instances), appos (3; 0% instances), obl:agent (1; 0% instances)

Parents of VERB nodes belong to 10 different parts of speech: VERB (3493; 42% instances), (3271; 40% instances), NOUN (931; 11% instances), ADJ (308; 4% instances), PRON (96; 1% instances), ADV (65; 1% instances), DET (50; 1% instances), PROPN (13; 0% instances), NUM (5; 0% instances), PART (2; 0% instances)

474 (6%) VERB nodes are leaves.

623 (8%) VERB nodes have one child.

1188 (14%) VERB nodes have two children.

5949 (72%) VERB nodes have three or more children.

The highest child degree of a VERB node is 11.

Children of VERB nodes are attached using 36 different relations: punct (7523; 26% instances), nsubj (3966; 14% instances), obl (3745; 13% instances), advmod (2657; 9% instances), obj (2342; 8% instances), conj (2260; 8% instances), cc (1550; 5% instances), iobj (1178; 4% instances), advcl (728; 3% instances), parataxis (507; 2% instances), mark (464; 2% instances), xcomp (412; 1% instances), vocative (185; 1% instances), ccomp (176; 1% instances), nsubj:pass (176; 1% instances), obl:agent (129; 0% instances), obl:tmod (113; 0% instances), aux (111; 0% instances), parataxis:discourse (96; 0% instances), csubj (79; 0% instances), discourse (72; 0% instances), obl:float (36; 0% instances), aux:pass (22; 0% instances), obl:depict (20; 0% instances), expl (19; 0% instances), case (8; 0% instances), csubj:pass (8; 0% instances), det (4; 0% instances), acl (3; 0% instances), amod (3; 0% instances), appos (2; 0% instances), cop (2; 0% instances), nmod (2; 0% instances), acl:relcl (1; 0% instances), dislocated (1; 0% instances), nummod:gov (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (9074; 32% instances), PUNCT (7523; 26% instances), VERB (3493; 12% instances), PRON (2579; 9% instances), ADV (1895; 7% instances), CCONJ (1547; 5% instances), PART (929; 3% instances), ADJ (499; 2% instances), SCONJ (436; 2% instances), PROPN (210; 1% instances), DET (163; 1% instances), AUX (138; 0% instances), INTJ (57; 0% instances), NUM (36; 0% instances), ADP (15; 0% instances), X (5; 0% instances), SYM (2; 0% instances)