This page pertains to UD version 2.

Treebank Statistics: UD_Yakut-YKTDT: POS Tags: NUM

There are 8 NUM lemmas (2%), 10 NUM types (2%) and 45 NUM tokens (3%). Out of 14 observed tags, the rank of NUM is: 7 in number of lemmas, 7 in number of types and 8 in number of tokens.

The most frequent NUM lemmas (all 8 observed): биир, икки, үс, хас, уон, алта, биэс, онус

The 10 most frequent NUM types: биир, икки, үс, хас, уон, алта, биэс, иккини, иккитэ, онус

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NUM is 1.250000 (the average of all parts of speech is 1.502358).
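As a rough illustration of how the NUM ratio above can be reproduced, the sketch below divides the number of distinct forms by the number of distinct lemmas. The mapping `forms_by_lemma` is assembled by hand from the lemma and type lists on this page; the variable names are illustrative and this is not the script that generates these statistics.

```python
# Forms observed per NUM lemma, assembled by hand from the lists on this page.
# Only "икки" has more than one form; every other lemma has a single form.
forms_by_lemma = {
    "биир": {"биир"},
    "икки": {"икки", "иккини", "иккитэ"},
    "үс": {"үс"},
    "хас": {"хас"},
    "уон": {"уон"},
    "алта": {"алта"},
    "биэс": {"биэс"},
    "онус": {"онус"},
}

n_lemmas = len(forms_by_lemma)                                   # 8 lemmas
n_forms = sum(len(forms) for forms in forms_by_lemma.values())   # 10 types
ratio = n_forms / n_lemmas

print(f"{ratio:.6f}")  # 1.250000
```

The result agrees with the counts reported earlier on the page: 10 NUM types over 8 NUM lemmas.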

The 1st highest number of forms (3) was observed with the lemma “икки”: икки, иккини, иккитэ.

The 2nd highest number of forms (1) was observed with the lemma “алта”: алта.

The 3rd highest number of forms (1) was observed with the lemma “биир”: биир.

NUM occurs with 3 features: NumType (41; 91% instances), PronType (5; 11% instances), Case (2; 4% instances)
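The percentages above appear to be each feature's token count divided by the 45 NUM tokens. A minimal sketch of that arithmetic follows, assuming rounding to the nearest whole percent; the rounding rule and the helper name `share` are assumptions, not something this page documents.

```python
TOTAL_NUM_TOKENS = 45  # NUM token count reported on this page

# Number of NUM tokens carrying each feature, as reported above.
feature_counts = {"NumType": 41, "PronType": 5, "Case": 2}

def share(count: int, total: int) -> int:
    """Return count as a whole-number percentage of total (assumed rounding)."""
    return round(100 * count / total)

shares = {feat: share(n, TOTAL_NUM_TOKENS) for feat, n in feature_counts.items()}
print(shares)  # {'NumType': 91, 'PronType': 11, 'Case': 4}
```

The counts add up to 48, more than the 45 NUM tokens, so at least some tokens carry more than one feature. This is why the percentages do not sum to 100.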

NUM occurs with 4 feature-value pairs: Case=Acc, Case=Par, NumType=Card, PronType=Int

NUM occurs with 6 feature combinations. The most frequent feature combination is NumType=Card (37 tokens). Examples: биир, икки, үс, уон, алта, биэс, онус, хас, иккини, иккитэ

Relations

NUM nodes are attached to their parents using 2 different relations: nummod (44; 98% instances), nmod (1; 2% instances)

Parents of NUM nodes belong to 2 different parts of speech: NOUN (42; 93% instances), NUM (3; 7% instances)

42 (93%) NUM nodes are leaves.

3 (7%) NUM nodes have one child.

The highest child degree of a NUM node is 1.
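To make "leaf" and "child degree" concrete, the sketch below counts the children of each NUM node in a single dependency tree. The four-token sentence is an invented toy example, not taken from UD_Yakut-YKTDT, and the function name `num_child_degrees` is hypothetical.

```python
from collections import Counter

# Hypothetical toy sentence (NOT from the treebank):
# (token id, head id, UPOS); head 0 marks the root.
tokens = [
    (1, 2, "NUM"),   # numeral attached to another numeral
    (2, 3, "NUM"),   # numeral attached to a noun
    (3, 4, "NOUN"),
    (4, 0, "VERB"),
]

def num_child_degrees(tokens):
    """Map each NUM token id to the number of tokens whose head it is."""
    child_counts = Counter(head for _, head, _ in tokens)
    return {tid: child_counts.get(tid, 0)
            for tid, _, upos in tokens if upos == "NUM"}

degrees = num_child_degrees(tokens)
leaves = sum(1 for d in degrees.values() if d == 0)  # NUM nodes with no children
max_degree = max(degrees.values())                   # highest child degree

print(degrees, leaves, max_degree)  # {1: 0, 2: 1} 1 1
```

In this toy tree, token 1 is a leaf and token 2 has one child. Those are the only two configurations reported for NUM nodes on this page.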

Children of NUM nodes are attached using 1 relation: nummod (3; 100% instances)

Children of NUM nodes belong to 1 part of speech: NUM (3; 100% instances)