home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_South_Levantine_Arabic-MADAR: POS Tags: NUM

There are 15 NUM lemmas (4%), 15 NUM types (4%) and 18 NUM tokens (2%). Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 11 in number of tokens.

The 10 most frequent NUM lemmas: وَاحِد, اِتنَان, أَربَعَة, أَلف, تسعين, تلاتين, تمنية, تنتين, تَسعمِيَة, تِسعَة

The 10 most frequent NUM types: واحد, اتنين, أربعة, الألف, تسعة, تسعمية, تسعين, تلاتين, تمنية, تنتين

The 10 most frequent ambiguous lemmas: وَاحِد (NUM 3, NOUN 2)

The 10 most frequent ambiguous types: واحد (NUM 3, NOUN 2)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 0.992347).

The 1st highest number of forms (1) was observed with the lemma “أَربَعَة”: أربعة.

The 2nd highest number of forms (1) was observed with the lemma “أَلف”: الألف.

The 3rd highest number of forms (1) was observed with the lemma “اِتنَان”: اتنين.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 3 different relations: nummod (14; 78% instances), conj (3; 17% instances), obj (1; 6% instances)

Parents of NUM nodes belong to 3 different parts of speech: NOUN (10; 56% instances), NUM (6; 33% instances), VERB (2; 11% instances)

8 (44%) NUM nodes are leaves.

8 (44%) NUM nodes have one child.

2 (11%) NUM nodes have two children.

The highest child degree of a NUM node is 2.

Children of NUM nodes are attached using 5 different relations: conj (4; 33% instances), nummod (3; 25% instances), cc (2; 17% instances), nmod (2; 17% instances), obl (1; 8% instances)

Children of NUM nodes belong to 3 different parts of speech: NUM (6; 50% instances), NOUN (4; 33% instances), CCONJ (2; 17% instances)