home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Maghrebi_Arabic_French-Arabizi: POS Tags: PUNCT

There are 43 PUNCT lemmas (1%), 44 PUNCT types (1%) and 251 PUNCT tokens (1%). Out of 16 observed tags, the rank of PUNCT is: 11 in number of lemmas, 15 in number of types and 12 in number of tokens.

The 10 most frequent PUNCT lemmas: ,, (, ?, ), !, -, /, ;, !!!, !!

The 10 most frequent PUNCT types: ,, (, ), ?, -, !!!, !!, ;, /, !!!!

The 10 most frequent ambiguous lemmas: _ (X 70, DET 11, PUNCT 5, NOUN 1), plus (ADV 34, PUNCT 5, ADJ 2)

The 10 most frequent ambiguous types: / (PUNCT 8, ADP 1), + (PUNCT 5, ADV 4)

Morphology

The form / lemma ratio of PUNCT is 1.023256 (the average of all parts of speech is 1.474223).

The 1st highest number of forms (7) was observed with the lemma “!”: !, !!, !!!, !!!!, !!!!!, !!!!!!, !!!!!!!!.

The 2nd highest number of forms (5) was observed with the lemma “?”: ?, ??, ???, ????, ?????.

The 3rd highest number of forms (1) was observed with the lemma “!!”: !!.

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 1 different relations: punct (251; 100% instances)

Parents of PUNCT nodes belong to 10 different parts of speech: VERB (139; 55% instances), NOUN (41; 16% instances), NUM (23; 9% instances), PROPN (17; 7% instances), ADJ (14; 6% instances), PRON (9; 4% instances), INTJ (4; 2% instances), ADV (2; 1% instances), ADP (1; 0% instances), SCONJ (1; 0% instances)

251 (100%) PUNCT nodes are leaves.

The highest child degree of a PUNCT node is 0.