
This page pertains to UD version 2.

Treebank Statistics: UD_Irish-TwittIrish: POS Tags: NUM

There are 313 NUM lemmas (3%), 329 NUM types (3%) and 962 NUM tokens (2%). Out of 17 observed tags, the rank of NUM is: 7 in number of lemmas, 7 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: 2, 1, céad, dó, 3, 0, míle, 8, 4, 7

The 10 most frequent NUM types: 2, 1, dhá, 0, 3, 8, míle, 4, chéad, 10

The 10 most frequent ambiguous lemmas: céad (NUM 35, ADJ 7, ADV 1), (NUM 33, ADP 1), míle (NUM 22, NOUN 2), 4 (NUM 24, ADP 1, NOUN 1), 7 (NUM 23, CCONJ 8, NOUN 1, PROPN 1), 10 (NUM 22, PROPN 2), 6 (NUM 22, NOUN 1), 9 (NUM 12, NOUN 1), dara (NUM 12, ADJ 1), 24 (NUM 11, PROPN 1)

The 10 most frequent ambiguous types: 2 (NUM 53, PRON 1), 1 (NUM 49, ADJ 1), dhá (NUM 18, ADP 3), 4 (NUM 23, ADP 1, NOUN 1), chéad (NUM 21, ADJ 3, ADV 1), 10 (NUM 22, PROPN 2), 7 (NUM 22, CCONJ 8, PROPN 1), 6 (NUM 21, NOUN 1), 24 (NUM 11, PROPN 1), 9 (NUM 11, NOUN 1)

Morphology

The form / lemma ratio of NUM is 1.051118 (the average of all parts of speech is 1.212231).
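The ratio above appears to be the number of distinct NUM types divided by the number of distinct NUM lemmas, since the counts reported earlier on this page (329 types, 313 lemmas) reproduce it. A minimal sketch of that arithmetic follows; the function name is illustrative and is not part of any UD tool.

```python
# Counts taken from the statistics reported on this page.
num_types = 329   # distinct NUM word forms
num_lemmas = 313  # distinct NUM lemmas


def form_lemma_ratio(types: int, lemmas: int) -> float:
    """Average number of distinct forms per lemma (assumed definition)."""
    return types / lemmas


ratio = form_lemma_ratio(num_types, num_lemmas)
print(f"{ratio:.6f}")  # prints 1.051118
```

A value close to 1 means most NUM lemmas occur in a single form, which fits a class dominated by digit strings.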

The 1st highest number of forms (5) was observed with the lemma “céad”: chead, chèad, chéad, céad, gcéad.

The 2nd highest number of forms (4) was observed with the lemma “dó”: dhá, dhó, dá, dó.

The 3rd highest number of forms (3) was observed with the lemma “1”: 1, 1ú, 2.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 19 different relations: nmod (284; 30% instances), nummod (223; 23% instances), obl:tmod (178; 19% instances), flat (76; 8% instances), amod (57; 6% instances), parataxis (38; 4% instances), conj (29; 3% instances), obl (16; 2% instances), appos (12; 1% instances), root (12; 1% instances), parataxis:sentence (10; 1% instances), obj (6; 1% instances), nsubj (5; 1% instances), parataxis:url (4; 0% instances), flat:name (3; 0% instances), xcomp:pred (3; 0% instances), compound (2; 0% instances), nmod:tmod (2; 0% instances), vocative:mention (2; 0% instances)
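A tally like the one above could be reproduced from the treebank's CoNLL-U files roughly as follows. This is a sketch only: the two-token sample is a hypothetical toy fragment, not a sentence from UD_Irish-TwittIrish, and `relation_counts` is an illustrative helper, not the script that generated this page.

```python
from collections import Counter

# Hypothetical toy CoNLL-U fragment (NOT taken from the treebank).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
SAMPLE = "\n".join([
    "# text = 2 leabhar",
    "\t".join(["1", "2", "2", "NUM", "_", "_", "2", "nummod", "_", "_"]),
    "\t".join(["2", "leabhar", "leabhar", "NOUN", "_", "_", "0", "root", "_", "_"]),
    "",
])


def relation_counts(conllu: str, upos: str = "NUM") -> Counter:
    """Count the DEPREL values of all nodes whose UPOS equals `upos`."""
    counts = Counter()
    for line in conllu.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            counts[cols[7]] += 1
    return counts


counts = relation_counts(SAMPLE)
for relation, n in counts.most_common():
    print(f"{relation} ({n})")
```

Run over the full treebank files, the same loop would yield one count per relation; the percentages are then each count divided by the 962 NUM tokens.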

Parents of NUM nodes belong to 12 different parts of speech: NOUN (498; 52% instances), PROPN (157; 16% instances), NUM (140; 15% instances), VERB (113; 12% instances), SYM (19; 2% instances), no parent, i.e. the NUM node is the sentence root (12; 1% instances), ADJ (8; 1% instances), X (7; 1% instances), ADV (4; 0% instances), PRON (2; 0% instances), ADP (1; 0% instances), PART (1; 0% instances)

473 (49%) NUM nodes are leaves.

288 (30%) NUM nodes have one child.

134 (14%) NUM nodes have two children.

67 (7%) NUM nodes have three or more children.

The highest child degree of a NUM node is 14.
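The child-degree figures above can be checked against the total of 962 NUM tokens. The sketch below only re-derives the percentages from the counts reported on this page; the dictionary labels are illustrative.

```python
# Counts reported on this page for NUM nodes, grouped by number of children.
total_num_tokens = 962
degree_counts = {
    "leaves": 473,
    "one child": 288,
    "two children": 134,
    "three or more children": 67,
}

# The four groups should account for every NUM token.
covered = sum(degree_counts.values())

# Percentage of NUM nodes in each group, rounded to whole percent.
percentages = {
    label: round(100 * count / total_num_tokens)
    for label, count in degree_counts.items()
}

for label, count in degree_counts.items():
    print(f"{label}: {count} ({percentages[label]}%)")
```

The four groups sum to 962, so every NUM token falls into exactly one of them.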

Children of NUM nodes are attached using 29 different relations: punct (232; 29% instances), case (165; 20% instances), nmod (125; 15% instances), flat (101; 12% instances), mark:prt (30; 4% instances), conj (20; 2% instances), vocative:mention (18; 2% instances), cc (17; 2% instances), det (16; 2% instances), parataxis (15; 2% instances), obl (10; 1% instances), parataxis:sentence (8; 1% instances), advmod (7; 1% instances), appos (7; 1% instances), nummod (7; 1% instances), obl:tmod (6; 1% instances), xcomp:pred (5; 1% instances), parataxis:url (4; 0% instances), amod (3; 0% instances), compound (3; 0% instances), cop (3; 0% instances), nsubj (3; 0% instances), parataxis:hashtag (3; 0% instances), acl:relcl (1; 0% instances), advcl (1; 0% instances), csubj:cleft (1; 0% instances), nmod:tmod (1; 0% instances), parataxis:rt (1; 0% instances), vocative (1; 0% instances)

Children of NUM nodes belong to 16 different parts of speech: PUNCT (232; 29% instances), ADP (165; 20% instances), NUM (140; 17% instances), NOUN (85; 10% instances), PROPN (72; 9% instances), PART (29; 4% instances), CCONJ (17; 2% instances), DET (17; 2% instances), X (17; 2% instances), ADJ (10; 1% instances), ADV (8; 1% instances), VERB (8; 1% instances), SYM (7; 1% instances), AUX (3; 0% instances), PRON (3; 0% instances), SCONJ (1; 0% instances)