home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-CINTIL: POS Tags: SYM

There are 7 SYM lemmas (0%), 7 SYM types (0%) and 66 SYM tokens (0%). Out of 15 observed tags, the rank of SYM is: 13 in number of lemmas, 14 in number of types and 15 in number of tokens.

The 10 most frequent SYM lemmas: /, b, pub, (s), -, jb, jihad

The 10 most frequent SYM types: /, B, pub, (s), -, JB, jihad

The 10 most frequent ambiguous lemmas: / (SYM 52, PUNCT 17), b (SYM 6, PROPN 2), - (PUNCT 284, SYM 1), jihad (PROPN 3, SYM 1)

The 10 most frequent ambiguous types: / (SYM 52, PUNCT 17), B (SYM 6, PROPN 2), - (PUNCT 284, SYM 1)

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.389383).

The 1st highest number of forms (1) was observed with the lemma “(s)”: (s).

The 2nd highest number of forms (1) was observed with the lemma “-”: -.

The 3rd highest number of forms (1) was observed with the lemma “/”: /.

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 5 different relations: flat (50; 76% instances), dep (8; 12% instances), fixed (4; 6% instances), obl (3; 5% instances), root (1; 2% instances)

Parents of SYM nodes belong to 7 different parts of speech: PROPN (49; 74% instances), NOUN (10; 15% instances), DET (2; 3% instances), NUM (2; 3% instances), ADV (1; 2% instances), (1; 2% instances), VERB (1; 2% instances)

61 (92%) SYM nodes are leaves.

0 (0%) SYM nodes have one child.

0 (0%) SYM nodes have two children.

5 (8%) SYM nodes have three or more children.

The highest child degree of a SYM node is 6.

Children of SYM nodes are attached using 8 different relations: punct (7; 35% instances), case (4; 20% instances), det (4; 20% instances), advmod (1; 5% instances), cc (1; 5% instances), dep (1; 5% instances), nsubj (1; 5% instances), obl (1; 5% instances)

Children of SYM nodes belong to 6 different parts of speech: PUNCT (7; 35% instances), ADP (5; 25% instances), DET (4; 20% instances), PROPN (2; 10% instances), ADV (1; 5% instances), CCONJ (1; 5% instances)