home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Veps-VWT: POS Tags: VERB

There are 82 VERB lemmas (21%), 141 VERB types (24%) and 183 VERB tokens (14%). Out of 13 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: eläda, tehta, rata, pagišta, el’geta, ajada, abutada, kaita, tahtoida, tulda

The 10 most frequent VERB types: tehta, eläba, eläda, radoin, ajoin, el’geta, rata, seižub, Išttes, abutab

The 10 most frequent ambiguous lemmas: olda (AUX 47, VERB 3), sada (VERB 2, AUX 1), pidada (AUX 7, VERB 1)

The 10 most frequent ambiguous types: Om (AUX 1, VERB 1), oli (AUX 8, VERB 1), pidab (AUX 4, VERB 1)

Morphology

The form / lemma ratio of VERB is 1.719512 (the average of all parts of speech is 1.526854).

The 1st highest number of forms (7) was observed with the lemma “eläda”: eliba, elin, eläb, eläba, eläda, eläiži, elämaha.

The 2nd highest number of forms (7) was observed with the lemma “pagišta”: Pagištihe-ik, pagišta, pagištihe, pagižeba, pagižem, pagižiba, pagižižiba.

The 3rd highest number of forms (6) was observed with the lemma “rata”: radab, radaba, radmaha, radoiba, radoin, rata.

VERB occurs with 10 features: VerbForm (183; 100% instances), Voice (130; 71% instances), Tense (123; 67% instances), Mood (118; 64% instances), Number (107; 58% instances), Person (105; 57% instances), Case (7; 4% instances), Connegative (6; 3% instances), Clitic (3; 2% instances), Typo (1; 1% instances)

VERB occurs with 20 feature-value pairs: Case=Ill, Clitic=Ik, Connegative=Yes, Mood=Cnd, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Past, Tense=Pres, Typo=Yes, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=Sup, Voice=Act, Voice=Pass

VERB occurs with 31 feature combinations. The most frequent feature combination is VerbForm=Inf (50 tokens). Examples: tehta, eläda, el’geta, kaita, pagišta, panda, rata, vajehtada, vastatas, abutada

Relations

VERB nodes are attached to their parents using 10 different relations: root (74; 40% instances), conj (49; 27% instances), xcomp (20; 11% instances), ccomp (13; 7% instances), acl:relcl (10; 5% instances), advcl (10; 5% instances), parataxis (3; 2% instances), csubj (2; 1% instances), acl (1; 1% instances), csubj:cop (1; 1% instances)

Parents of VERB nodes belong to 6 different parts of speech: VERB (85; 46% instances), (74; 40% instances), NOUN (15; 8% instances), ADJ (4; 2% instances), ADV (4; 2% instances), PROPN (1; 1% instances)

10 (5%) VERB nodes are leaves.

11 (6%) VERB nodes have one child.

24 (13%) VERB nodes have two children.

138 (75%) VERB nodes have three or more children.

The highest child degree of a VERB node is 9.

Children of VERB nodes are attached using 19 different relations: punct (144; 21% instances), obl (129; 19% instances), nsubj (94; 14% instances), advmod (64; 9% instances), obj (64; 9% instances), conj (47; 7% instances), aux (42; 6% instances), cc (28; 4% instances), xcomp (21; 3% instances), ccomp (16; 2% instances), mark (13; 2% instances), advcl (7; 1% instances), case (3; 0% instances), nmod (3; 0% instances), parataxis (2; 0% instances), acl:relcl (1; 0% instances), cop (1; 0% instances), csubj (1; 0% instances), nsubj:cop (1; 0% instances)

Children of VERB nodes belong to 12 different parts of speech: NOUN (207; 30% instances), PUNCT (144; 21% instances), VERB (85; 12% instances), PRON (75; 11% instances), ADV (69; 10% instances), AUX (43; 6% instances), CCONJ (27; 4% instances), PROPN (13; 2% instances), SCONJ (11; 2% instances), ADJ (3; 0% instances), ADP (2; 0% instances), PART (2; 0% instances)