home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Maghrebi_Arabic_French-Arabizi: POS Tags: NUM

There are 62 NUM lemmas (1%), 65 NUM types (1%) and 174 NUM tokens (1%). Out of 16 observed tags, the rank of NUM is: 10 in number of lemmas, 12 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: 1, 0, 2, 4, 3, 100, 10, 2010, 30, 5

The 10 most frequent NUM types: 1, 0, 2, 4, 3, 100, 10, 2010, 30, 5

The 10 most frequent ambiguous lemmas: 1 (NUM 16, INTJ 8), 0 (NUM 15, NOUN 1), 2 (NUM 15, INTJ 8, PRON 1), 4 (NUM 10, NOUN 1), 3 (NUM 9, INTJ 8, DET 1), 10 (NUM 7, PRON 1), 5 (NUM 6, DET 1), 7 (NUM 6, PRON 1), un (DET 50, INTJ 3, NOUN 1, NUM 1)

The 10 most frequent ambiguous types: 1 (NUM 17, INTJ 9, ADJ 1, DET 1, NOUN 1), 0 (NUM 15, NOUN 1), 2 (NUM 13, INTJ 9, ADP 3, ADJ 1, PRON 1), 4 (NUM 10, NOUN 1), 3 (INTJ 9, NUM 9, DET 3), 100 (NUM 8, NOUN 1), 10 (NUM 7, PRON 1), 5 (NUM 6, DET 1), 7 (NUM 5, PRON 1)

Morphology

The form / lemma ratio of NUM is 1.048387 (the average of all parts of speech is 1.474223).

The 1st highest number of forms (3) was observed with the lemma “2”: 02, 02_, 2.

The 2nd highest number of forms (2) was observed with the lemma “7”: 07/, 7.

The 3rd highest number of forms (2) was observed with the lemma “8”: 08, 8.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 12 different relations: nummod (81; 47% instances), nmod (43; 25% instances), obj (16; 9% instances), obl (12; 7% instances), parataxis (8; 5% instances), amod (4; 2% instances), conj (3; 2% instances), flat (3; 2% instances), dep (1; 1% instances), nsubj (1; 1% instances), root (1; 1% instances), vocative (1; 1% instances)

Parents of NUM nodes belong to 6 different parts of speech: NOUN (91; 52% instances), VERB (41; 24% instances), NUM (26; 15% instances), PROPN (12; 7% instances), ADJ (3; 2% instances), (1; 1% instances)

103 (59%) NUM nodes are leaves.

35 (20%) NUM nodes have one child.

29 (17%) NUM nodes have two children.

7 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 4.

Children of NUM nodes are attached using 15 different relations: nmod (25; 21% instances), case (23; 20% instances), punct (23; 20% instances), nummod (17; 15% instances), advmod (4; 3% instances), cc (4; 3% instances), det (4; 3% instances), discourse (4; 3% instances), conj (3; 3% instances), flat (3; 3% instances), nsubj (2; 2% instances), parataxis (2; 2% instances), acl (1; 1% instances), amod (1; 1% instances), mark (1; 1% instances)

Children of NUM nodes belong to 14 different parts of speech: NUM (26; 22% instances), ADP (23; 20% instances), PUNCT (23; 20% instances), NOUN (16; 14% instances), PROPN (8; 7% instances), CCONJ (4; 3% instances), DET (4; 3% instances), INTJ (4; 3% instances), ADV (3; 3% instances), PRON (2; 2% instances), ADJ (1; 1% instances), PART (1; 1% instances), SCONJ (1; 1% instances), VERB (1; 1% instances)