This page pertains to UD version 2.

Treebank Statistics: UD_Cebuano-GJA: POS Tags: NUM

There are 6 NUM lemmas (1%), 7 NUM types (1%) and 12 NUM tokens (1%). Out of 14 observed tags, the rank of NUM is: 10 in number of lemmas, 10 in number of types and 11 in number of tokens.

The 10 most frequent NUM lemmas: tulo, duha, usa, sayis, thirty, tres

The 10 most frequent NUM types: tulo, usa, Duhay, duha, sayis, thirty, tres

The 10 most frequent ambiguous lemmas: usa (NUM 2, PART 2)

The 10 most frequent ambiguous types: usa (NUM 2, PART 2)

Morphology

The form / lemma ratio of NUM is 1.166667 (the average of all parts of speech is 1.142523).

The 1st highest number of forms (2) was observed with the lemma “duha”: Duhay, duha.

The 2nd highest number of forms (1) was observed with the lemma “sayis”: sayis.

The 3rd highest number of forms (1) was observed with the lemma “thirty”: thirty.
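
The form / lemma ratio above can be reproduced with a minimal sketch. It assumes the ratio is the number of distinct forms divided by the number of distinct lemmas, and that every NUM type other than "Duhay" is identical to its lemma (which the 7 types and 6 lemmas listed above suggest). The function name `form_lemma_ratio` is hypothetical and is not part of the UD tooling.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """Return (ratio, forms_by_lemma) for an iterable of (form, lemma) pairs."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma), dict(forms_by_lemma)


# (form, lemma) pairs for NUM; the identity mappings are an assumption
# inferred from the lemma and type lists on this page.
num_pairs = [
    ("tulo", "tulo"),
    ("usa", "usa"),
    ("Duhay", "duha"),
    ("duha", "duha"),
    ("sayis", "sayis"),
    ("thirty", "thirty"),
    ("tres", "tres"),
]

ratio, forms_by_lemma = form_lemma_ratio(num_pairs)
print(f"{ratio:.6f}")                   # 1.166667
print(sorted(forms_by_lemma["duha"]))   # ['Duhay', 'duha']
```

With 7 forms over 6 lemmas the sketch gives 7 / 6, which rounds to the 1.166667 reported above.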

NUM occurs with 2 features: Foreign (1; 8% of instances), Neutral (1; 8% of instances)

NUM occurs with 2 feature-value pairs: Foreign=Yes, Neutral=Yes

NUM occurs with 3 feature combinations. The most frequent feature combination is _, i.e. no features (10 tokens). Examples: tulo, usa, duha, sayis, tres

Relations

NUM nodes are attached to their parents using 2 different relations: nummod (11; 92% of instances), amod (1; 8% of instances)

Parents of NUM nodes belong to 3 different parts of speech: NOUN (10; 83% of instances), PROPN (1; 8% of instances), VERB (1; 8% of instances)
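
As a quick check of the percentages above, the following sketch turns the raw counts into rounded shares. It assumes the percentages are each count divided by the total number of NUM tokens, rounded to the nearest integer; `share_table` is a hypothetical helper name.

```python
def share_table(counts):
    """Map each label to (count, percentage rounded to the nearest integer)."""
    total = sum(counts.values())
    return {label: (n, round(100 * n / total)) for label, n in counts.items()}


# Counts taken from the statistics on this page.
relations = {"nummod": 11, "amod": 1}
parents = {"NOUN": 10, "PROPN": 1, "VERB": 1}

relation_shares = share_table(relations)
parent_shares = share_table(parents)
print(relation_shares)  # {'nummod': (11, 92), 'amod': (1, 8)}
print(parent_shares)    # {'NOUN': (10, 83), 'PROPN': (1, 8), 'VERB': (1, 8)}
```

Because each share is rounded independently, the parent percentages sum to 99 rather than 100.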

5 (42%) NUM nodes are leaves.

7 (58%) NUM nodes have one child.

The highest child degree of a NUM node is 1.

Children of NUM nodes are attached using 2 different relations: mark (6; 86% of instances), case (1; 14% of instances)

Children of NUM nodes belong to 2 different parts of speech: PART (6; 86% of instances), ADP (1; 14% of instances)
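
The leaf, child-degree and child-relation figures above could be gathered along the following lines. This is only a sketch: the row layout (dicts with `id`, `upos`, `head`, `deprel`, mirroring the corresponding CoNLL-U columns), the helper name `child_stats`, and the toy tree are all hypothetical and are not taken from UD_Cebuano-GJA.

```python
from collections import Counter


def child_stats(rows, upos="NUM"):
    """Summarize the children of all nodes with the given UPOS in one sentence."""
    # Start every target node at degree 0 so that leaves are counted too.
    degree = {r["id"]: 0 for r in rows if r["upos"] == upos}
    child_relations = Counter()
    child_upos = Counter()
    for r in rows:
        if r["head"] in degree:  # head 0 (root) never matches a token id
            degree[r["head"]] += 1
            child_relations[r["deprel"]] += 1
            child_upos[r["upos"]] += 1
    return {
        "leaves": sum(1 for d in degree.values() if d == 0),
        "max_degree": max(degree.values(), default=0),
        "child_relations": child_relations,
        "child_upos": child_upos,
    }


# Hypothetical toy tree (not real treebank data): two NUM nodes modify a NOUN,
# and one of them has a PART child attached as mark.
toy_rows = [
    {"id": 1, "upos": "NUM", "head": 3, "deprel": "nummod"},
    {"id": 2, "upos": "PART", "head": 1, "deprel": "mark"},
    {"id": 3, "upos": "NOUN", "head": 0, "deprel": "root"},
    {"id": 4, "upos": "NUM", "head": 3, "deprel": "nummod"},
]

stats = child_stats(toy_rows)
print(stats["leaves"], stats["max_degree"])  # 1 1
print(dict(stats["child_relations"]))        # {'mark': 1}
```

Summing these per-sentence results over a whole treebank would give totals of the kind reported above.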