
This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Ruthenian: POS Tags: ADV

There are 286 ADV lemmas (5%), 488 ADV types (3%) and 2405 ADV tokens (2%). Out of 17 observed tags, the rank of ADV is: 5 in number of lemmas, 6 in number of types and 10 in number of tokens.

The 10 most frequent ADV lemmas: какъ, тежъ, тамъ, потомъ, такъ, вечно, также, напотомъ, тогды, вышей

The 10 most frequent ADV types: какъ, как, там, теж, тежъ, вечно, потомъ, тогды, тамъ, так

The 10 most frequent ambiguous lemmas: какъ (ADV 235, SCONJ 80, CCONJ 8), тежъ (ADV 159, CCONJ 43, DET 3), такъ (ADV 99, CCONJ 16, SCONJ 2), также (CCONJ 112, ADV 78), вышей (ADV 53, ADP 4), где (ADV 40, SCONJ 3), уже (ADV 35, PART 1), такежъ (ADV 23, CCONJ 12), первей (ADV 13, ADP 9), болшъ (ADV 12, NUM 1)

The 10 most frequent ambiguous types: какъ (ADV 123, SCONJ 37, CCONJ 5), как (ADV 107, SCONJ 41, CCONJ 3), теж (ADV 76, CCONJ 6, DET 1), тежъ (ADV 56, CCONJ 4), так (ADV 48, CCONJ 6, SCONJ 1), такъ (ADV 47, CCONJ 10, SCONJ 1), вышеи (ADV 36, ADP 1), також (ADV 28, CCONJ 1), где (ADV 26, SCONJ 1), также (ADV 22, CCONJ 3)

Morphology

The form / lemma ratio of ADV is 1.706294 (the average of all parts of speech is 2.909188).
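The reported ratio appears to be consistent with dividing the number of ADV types by the number of ADV lemmas given at the top of this page (488 and 286). The following is a minimal sketch of that arithmetic; the function name `form_lemma_ratio` is hypothetical, and reading the ratio as types divided by lemmas is an assumption based only on the numbers matching.

```python
# Counts copied from this page: 488 distinct ADV word forms (types), 286 ADV lemmas.
ADV_TYPES = 488
ADV_LEMMAS = 286


def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Average number of distinct forms per lemma (hypothetical helper)."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_forms / n_lemmas


ratio = form_lemma_ratio(ADV_TYPES, ADV_LEMMAS)
print(f"{ratio:.6f}")  # prints 1.706294
```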

The highest number of forms (9) was observed with the lemma “также”: Такожь, также, такжо, такжь, такжѣ, також, такоже, такъже, такъжо.

The 2nd highest number of forms (8) was observed with the lemma “болше”: бол(ь)ше, бол(ь)шеи, бол(ь)ши, бол(ь)шы, бол(ь)шыи, болшеи, болши, большеи.

The 3rd highest number of forms (8) was observed with the lemma “такежъ”: так(е)жь, такеж, такежъ, такежь, такежѣ, такѣж, такѣжъ, такѣжь.

ADV occurs with 3 features: Degree (2401; 100% instances), Polarity (21; 1% instances), Typo (1; 0% instances)

ADV occurs with 4 feature-value pairs: Degree=Cmp, Degree=Pos, Polarity=Neg, Typo=Yes

ADV occurs with 5 feature combinations. The most frequent feature combination is Degree=Pos (2234 tokens). Examples: какъ, как, там, теж, вечно, тежъ, потомъ, тогды, тамъ, так

Relations

ADV nodes are attached to their parents using 13 different relations: advmod (2130; 89% instances), fixed (125; 5% instances), conj (94; 4% instances), orphan (22; 1% instances), advcl (9; 0% instances), root (9; 0% instances), cc (5; 0% instances), obl (4; 0% instances), case (2; 0% instances), reparandum (2; 0% instances), acl (1; 0% instances), ccomp (1; 0% instances), mark (1; 0% instances)
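A relation distribution like the one above can be recomputed from a CoNLL-U file by counting the DEPREL column of every word whose UPOS is ADV. The sketch below is not the script that produced this page; the helper names and the toy sentence (placeholder forms `w1`, `w2`, ...) are invented for illustration and are not taken from the treebank.

```python
from collections import Counter

# Invented toy sentence in CoNLL-U (10 tab-separated columns:
# ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC). Not treebank data.
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    "\t".join(["1", "w1", "l1", "ADV", "_", "Degree=Pos", "2", "advmod", "_", "_"]),
    "\t".join(["2", "w2", "l2", "VERB", "_", "_", "0", "root", "_", "_"]),
    "\t".join(["3", "w3", "l3", "ADV", "_", "Degree=Pos", "2", "advmod", "_", "_"]),
    "\t".join(["4", "w4", "l4", "CCONJ", "_", "_", "5", "cc", "_", "_"]),
    "\t".join(["5", "w5", "l5", "ADV", "_", "Degree=Pos", "3", "conj", "_", "_"]),
    "",
])


def read_words(text):
    """Yield the column lists of regular word lines, skipping comments,
    blank lines, multiword-token ranges (1-2) and empty nodes (1.1)."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            yield cols


def relation_counts(text, upos="ADV"):
    """Count the DEPREL (column 8) of every word with the given UPOS (column 4)."""
    return Counter(cols[7] for cols in read_words(text) if cols[3] == upos)


counts = relation_counts(SAMPLE)
total = sum(counts.values())
for rel, n in counts.most_common():
    # Same layout as the page: relation (count; percent instances)
    print(f"{rel} ({n}; {round(100 * n / total)}% instances)")
```

To run this on a real file, the `SAMPLE` string would be replaced by the contents of the treebank's `.conllu` file read as UTF-8.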

Parents of ADV nodes belong to 13 different parts of speech: VERB (1848; 77% instances), NOUN (166; 7% instances), CCONJ (124; 5% instances), ADJ (116; 5% instances), ADV (89; 4% instances), DET (18; 1% instances), PRON (15; 1% instances), PROPN (10; 0% instances), no part of speech, i.e. the artificial root (9; 0% instances), AUX (5; 0% instances), NUM (3; 0% instances), PART (1; 0% instances), SCONJ (1; 0% instances)

1994 (83%) ADV nodes are leaves.

297 (12%) ADV nodes have one child.

57 (2%) ADV nodes have two children.

57 (2%) ADV nodes have three or more children.

The highest child degree of an ADV node is 9.
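The child-degree figures above (leaves, one child, two children, three or more, maximum) can be derived by counting, within each sentence, how many words name a given ADV node in their HEAD column. The sketch below assumes plain CoNLL-U input; the helper names and the toy sentence are invented for illustration and do not come from the treebank or from the tool that generated this page.

```python
from collections import Counter

# Invented toy sentence in CoNLL-U (ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC).
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    "\t".join(["1", "w1", "l1", "ADV", "_", "_", "2", "advmod", "_", "_"]),
    "\t".join(["2", "w2", "l2", "VERB", "_", "_", "0", "root", "_", "_"]),
    "\t".join(["3", "w3", "l3", "ADV", "_", "_", "2", "advmod", "_", "_"]),
    "\t".join(["4", "w4", "l4", "CCONJ", "_", "_", "5", "cc", "_", "_"]),
    "\t".join(["5", "w5", "l5", "ADV", "_", "_", "3", "conj", "_", "_"]),
    "",
])


def sentences(text):
    """Yield each sentence as a list of word rows; HEAD ids are sentence-local,
    so children must be counted one sentence at a time."""
    sent = []
    for line in text.splitlines():
        if not line.strip():
            if sent:
                yield sent
                sent = []
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():  # skip ranges and empty nodes
            sent.append(cols)
    if sent:
        yield sent


def child_degrees(text, upos="ADV"):
    """Return the number of children of every node with the given UPOS."""
    degrees = []
    for sent in sentences(text):
        n_children = Counter(cols[6] for cols in sent)  # HEAD id -> child count
        degrees.extend(n_children[cols[0]] for cols in sent if cols[3] == upos)
    return degrees


degrees = child_degrees(SAMPLE)
# Bucket the degrees the same way the page does: 0, 1, 2, and 3 or more.
buckets = Counter(min(d, 3) for d in degrees)
print("leaves:", buckets[0], "one child:", buckets[1],
      "two children:", buckets[2], "three or more:", buckets[3])
print("highest child degree:", max(degrees))
```

In the toy sentence, the first ADV is a leaf and the other two each have one child, so the buckets are 1 leaf and 2 one-child nodes.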

Children of ADV nodes are attached using 26 different relations: advmod (117; 18% instances), cc (96; 15% instances), punct (80; 13% instances), obl (79; 12% instances), conj (70; 11% instances), advcl (28; 4% instances), nsubj (25; 4% instances), iobj (24; 4% instances), fixed (22; 3% instances), cop (20; 3% instances), nmod (16; 3% instances), case (11; 2% instances), mark (11; 2% instances), appos (6; 1% instances), obj (6; 1% instances), aux (4; 1% instances), orphan (3; 0% instances), xcomp (3; 0% instances), acl:relcl (2; 0% instances), amod (2; 0% instances), parataxis (2; 0% instances), reparandum (2; 0% instances), ccomp (1; 0% instances), det (1; 0% instances), expl (1; 0% instances), goeswith (1; 0% instances)

Children of ADV nodes belong to 14 different parts of speech: NOUN (108; 17% instances), CCONJ (107; 17% instances), ADV (89; 14% instances), PART (89; 14% instances), PUNCT (80; 13% instances), PRON (41; 6% instances), VERB (37; 6% instances), AUX (26; 4% instances), SCONJ (17; 3% instances), PROPN (13; 2% instances), DET (9; 1% instances), ADJ (8; 1% instances), ADP (8; 1% instances), X (1; 0% instances)