home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Georgian-GLC: POS Tags: NUM

There are 11 NUM lemmas (1%), 15 NUM types (1%) and 21 NUM tokens (1%). Out of 13 observed tags, the rank of NUM is: 9 in number of lemmas, 9 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: ერთი, ორი, ცოტა, 100.000, 1907, 2003, 2004-2005, 30, 363, სამი

The 10 most frequent NUM types: ერთი, მეორე, ცოტა, 100.000, 1907, 2003, 2004-2005, 30, 363, ერთ

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NUM is 1.363636 (the average of all parts of speech is 1.235874).

The 1st highest number of forms (3) was observed with the lemma “ერთი”: ერთ, ერთი, პირველივე.

The 2nd highest number of forms (3) was observed with the lemma “ორი”: მეორე, ორ, ორი.

The 3rd highest number of forms (1) was observed with the lemma “100.000”: 100.000.

NUM occurs with 5 features: NumType (21; 100% instances), Case (15; 71% instances), Number (15; 71% instances), NumForm (6; 29% instances), PartType (1; 5% instances)

NUM occurs with 9 feature-value pairs: Case=Dat, Case=Gen, Case=Ins, Case=Nom, NumForm=Digit, NumType=Card, NumType=Ord, Number=Sing, PartType=Emp

NUM occurs with 8 feature combinations. The most frequent feature combination is NumForm=Digit|NumType=Card (6 tokens). Examples: 100.000, 1907, 2003, 2004-2005, 30, 363

Relations

NUM nodes are attached to their parents using 2 different relations: nummod (20; 95% instances), nmod (1; 5% instances)

Parents of NUM nodes belong to 2 different parts of speech: NOUN (19; 90% instances), VERB (2; 10% instances)

20 (95%) NUM nodes are leaves.

1 (5%) NUM nodes have one child.

The highest child degree of a NUM node is 1.

Children of NUM nodes are attached using 1 different relations: advmod (1; 100% instances)

Children of NUM nodes belong to 1 different parts of speech: ADV (1; 100% instances)