home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Ruthenian: POS Tags: ADJ

There are 829 ADJ lemmas (15%), 2665 ADJ types (16%) and 6427 ADJ tokens (7%). Out of 17 observed tags, the rank of ADJ is: 4 in number of lemmas, 4 in number of types and 8 in number of tokens.

The 10 most frequent ADJ lemmas: полоцкий, рижский, великий, божий, святый, добрый, милый, будучий, путный, старый

The 10 most frequent ADJ types: полоцкии, полоцкого, великии, милым, полоцког(о), Бож(ъ)ю, ризког(о), полоцких, ризкого, великого

The 10 most frequent ambiguous lemmas: святый (ADJ 161, NOUN 1), 14 (ADJ 28, NUM 8), 12 (ADJ 24, NUM 9), 11 (ADJ 18, NUM 5), 9 (ADJ 18, NUM 3), 13 (ADJ 16, NUM 2), 2 (ADJ 15, NUM 3), 5 (NUM 19, ADJ 14), 15 (ADJ 13, NUM 7), детский (ADJ 13, NOUN 3)

The 10 most frequent ambiguous types: 14 (ADJ 28, NUM 7), 12 (ADJ 24, NUM 8), 11 (ADJ 18, NUM 3), 9 (ADJ 18, NUM 1), 13 (ADJ 16, NUM 1), ради (ADJ 8, ADP 2), 2 (ADJ 15, NUM 1), 5 (NUM 16, ADJ 14), 15 (ADJ 13, NUM 6), 10 (NUM 19, ADJ 12)

Morphology

The form / lemma ratio of ADJ is 3.214717 (the average of all parts of speech is 2.909188).

The 1st highest number of forms (92) was observed with the lemma “полоцкий”: Пол]оцкомъ, Полоцкая, Полоцкѡг(о), Полоцого, Полоцъкое, Полоцъкои, Полоцъком, Полоцъкомъ, Полоцькая, Полоцькое, Полочькаѧ, Полочьки, Полочьскаѧ, Полочьскую, Полѡтьцкыи, по[лоцк]ых, пол(о)цких, пол(о)цког(о), пол(о)цкые, пол(о)цкыи, пол(о)цкых, пол(о)цьког(о), пол(о)цькых, пол(оцкии), пол(оцкому), пололоцких, полотскии, полотског(о), полотского, полотьского, полоц(кии), полоцкаг(о), полоцкаѧ, полоцкго, полоцкие, полоцкии, полоцкий, полоцким, полоцкими, полоцкимъ, полоцких, полоцкихъ, полоцкиѣ, полоцког(о), полоцкого, полоцкогѡ, полоцкое, полоцкои, полоцком, полоцкому, полоцкомъ, полоцкомꙋ, полоцкою, полоцкую, полоцкые, полоцкыи, полоцкым, полоцкымъ, полоцкых, полоцкыхъ, полоцкіи, полоцъкаѧ, полоцъкие, полоцъкии, полоцъкиие, полоцъкими, полоцъкимъ, полоцъких, полоцъкихъ, полоцъкого, полоцъкому, полоцъкомꙋ, полоцъкою, полоцъкую, полоцъкы, полоцьки, полоцькии, полоцькими, полоцьких, полоцькихъ, полоцьког(о), полоцького, полоцькыи, полоцькых, полоцькыхъ, полочькии, полочькиих, полочьког(о), полочького, полочьскы, полѡтцкымъ, поцькыи.

The 2nd highest number of forms (69) was observed with the lemma “божий”: Б(о)ж(ъ)ю, Б(о)жег(о), Б(о)жего, Б(о)жиа, Б(о)жиего, Б(о)жиеи, Б(о)жии, Б(о)жиимъ, Б(о)жию, Б(о)жия, Б(о)жою, Б(о)жье, Б(о)жьег(о), Б(о)жьего, Б(о)жьее, Б(о)жьеи, Б(о)жьем, Б(о)жьемъ, Б(о)жьемь, Б(о)жьею, Б(о)жьи, Б(о)жью, Б(о)жьѧ, Б(о)жіи, Б(о)жію, Б(о)жія, Б(о)жіѧ, Б(о)зъ, Б(ож)ии, Б(ож)ье, Б(ож)ьею, Б(ож)ьими, Б(ож)ью, Бож(ъ)е, Бож(ъ)его, Бож(ъ)ее, Бож(ъ)еи, Бож(ъ)емъ, Бож(ъ)ею, Бож(ъ)и, Бож(ъ)имъ, Бож(ъ)ю, Бож(ъ)ѧ, Бож(ь)е, Божего, Божее, Божеи, Божею, Божии, Божия, Божое, Божъе, Божъего, Божъеи, Божъю, Божыю, Божьего, Божьем, Божьею, Божьи, Божьим, Божьимъ, Божью, Божіею, Бѡж(ъ)ю, Бѡж(ь)его, Бѡж(ь)ю, Бѡж(ь)я, бож[ъ]ю.

The 3rd highest number of forms (69) was observed with the lemma “рижский”: Резкого, Ри[з]ког(о), Рижьского, Ризкаг(о), Ризкаго, Ризко, Ризкогѡ, Ризком, Ризкѡгѡ, Ризского, Ризъког(о), Ризькаго, Ризько, Ризького, Ризьского, Рызког(о), Рызкого, Рызкомꙋ, Рызьког(о), Рызького, Рꙋзког(о), рижьскым, рижьскыми, ризкие, ризкии, ризким, ризкимъ, ризког(о), ризкого, ризком(у), ризкомоу, ризкому, ризкомꙋ, ризкым, ризкымъ, ризкых, ризские, ризскии, ризским, ризскимъ, ризском(у), ризскым, ризскꙋю, ризъким, ризъскии, ризькии, ризькиим, ризьким, ризькими, ризькимъ, ризьког(о), ризькое, ризькому, ризькыи, ризькым, ризькымъ, ризьским, ризьскимъ, ризьских, ризьског(о), ризьскым, ризьскымъ, рикимъ, риским, рискомоу, рискым, рискымъ, рызскии, рыским.

ADJ occurs with 9 features: Case (6415; 100% instances), Gender (6415; 100% instances), Number (6415; 100% instances), Degree (5902; 92% instances), Variant (677; 11% instances), NumForm (524; 8% instances), NumType (524; 8% instances), Animacy (135; 2% instances), Abbr (1; 0% instances)

ADJ occurs with 22 feature-value pairs: Abbr=Yes, Animacy=Anim, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Masc, Gender=Neut, NumForm=Combi, NumForm=Digit, NumForm=Word, NumType=Ord, Number=Dual, Number=Plur, Number=Sing, Variant=Short

ADJ occurs with 124 feature combinations. The most frequent feature combination is Case=Nom|Degree=Pos|Gender=Masc|Number=Sing (727 tokens). Examples: великии, полоцкии, троцкии, жомоитскии, виленскии, дворныи, полоцъкии, вил(енскии), пол(ь)скии, витебъскии

Relations

ADJ nodes are attached to their parents using 23 different relations: amod (5157; 80% instances), conj (584; 9% instances), obl (140; 2% instances), acl (102; 2% instances), root (86; 1% instances), nmod (82; 1% instances), obj (82; 1% instances), advcl (38; 1% instances), xcomp (35; 1% instances), ccomp (24; 0% instances), flat (19; 0% instances), nsubj (13; 0% instances), flat:name (12; 0% instances), parataxis (11; 0% instances), appos (10; 0% instances), iobj (8; 0% instances), acl:relcl (7; 0% instances), dislocated (5; 0% instances), orphan (4; 0% instances), reparandum (4; 0% instances), dep (2; 0% instances), list (1; 0% instances), vocative (1; 0% instances)

Parents of ADJ nodes belong to 11 different parts of speech: NOUN (5211; 81% instances), ADJ (514; 8% instances), VERB (386; 6% instances), PROPN (185; 3% instances), (86; 1% instances), DET (20; 0% instances), PRON (13; 0% instances), ADV (8; 0% instances), NUM (2; 0% instances), PART (1; 0% instances), X (1; 0% instances)

4893 (76%) ADJ nodes are leaves.

902 (14%) ADJ nodes have one child.

271 (4%) ADJ nodes have two children.

361 (6%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 12.

Children of ADJ nodes are attached using 36 different relations: conj (591; 20% instances), cc (550; 18% instances), punct (482; 16% instances), case (190; 6% instances), obl (172; 6% instances), advmod (158; 5% instances), nsubj (138; 5% instances), cop (121; 4% instances), iobj (105; 4% instances), det (83; 3% instances), mark (54; 2% instances), compound (52; 2% instances), xcomp (40; 1% instances), advcl (35; 1% instances), flat:name (35; 1% instances), amod (31; 1% instances), nmod (24; 1% instances), acl:relcl (22; 1% instances), csubj (22; 1% instances), appos (17; 1% instances), parataxis (17; 1% instances), obj (15; 1% instances), aux (9; 0% instances), ccomp (8; 0% instances), orphan (4; 0% instances), reparandum (4; 0% instances), dep (3; 0% instances), dislocated (3; 0% instances), nsubj:outer (3; 0% instances), acl (2; 0% instances), obl:tmod (2; 0% instances), vocative (2; 0% instances), discourse (1; 0% instances), nsubj:pass (1; 0% instances), nummod (1; 0% instances), nummod:gov (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: CCONJ (550; 18% instances), ADJ (514; 17% instances), PUNCT (482; 16% instances), NOUN (358; 12% instances), ADP (189; 6% instances), PRON (174; 6% instances), VERB (154; 5% instances), AUX (130; 4% instances), DET (128; 4% instances), ADV (116; 4% instances), PROPN (58; 2% instances), SCONJ (53; 2% instances), PART (48; 2% instances), NUM (39; 1% instances), SYM (5; 0% instances)