home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Veps-VWT: POS Tags: NUM

There are 9 NUM lemmas (2%), 10 NUM types (2%) and 11 NUM tokens (1%). Out of 13 observed tags, the rank of NUM is: 8 in number of lemmas, 9 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: 40, kaks’, 15, 2017, 23., kahesa, koume, üks, üks’

The 10 most frequent NUM types: 40, 15, 2017, 23., kahesa, kaht, kaks’, koume, ühtes, üks’

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NUM is 1.111111 (the average of all parts of speech is 1.526854).

The 1st highest number of forms (2) was observed with the lemma “kaks’”: kaht, kaks’.

The 2nd highest number of forms (1) was observed with the lemma “15”: 15.

The 3rd highest number of forms (1) was observed with the lemma “2017”: 2017.

NUM occurs with 3 features: Case (11; 100% instances), NumForm (11; 100% instances), NumType (11; 100% instances)

NUM occurs with 8 feature-value pairs: Case=Ade, Case=Ine, Case=Nom, Case=Par, NumForm=Digit, NumForm=Word, NumType=Card, NumType=Ord

NUM occurs with 5 feature combinations. The most frequent feature combination is Case=Nom|NumForm=Digit|NumType=Card (4 tokens). Examples: 40, 15, 2017

Relations

NUM nodes are attached to their parents using 1 different relations: nummod (11; 100% instances)

Parents of NUM nodes belong to 2 different parts of speech: NOUN (10; 91% instances), PRON (1; 9% instances)

8 (73%) NUM nodes are leaves.

3 (27%) NUM nodes have one child.

The highest child degree of a NUM node is 1.

Children of NUM nodes are attached using 1 different relations: advmod (3; 100% instances)

Children of NUM nodes belong to 1 different parts of speech: ADV (3; 100% instances)