
This page pertains to UD version 2.

Treebank Statistics: UD_Javanese-CSUI: POS Tags: SYM

There is 1 SYM lemma (6%), 2 SYM types (0%) and 12 SYM tokens (0%). Out of 17 observed tags, the rank of SYM is: 15 in number of lemmas, 17 in number of types and 17 in number of tokens.

The 10 most frequent SYM lemmas: _

The 10 most frequent SYM types: %, $

The 10 most frequent ambiguous lemmas: _ (NOUN 2867, PUNCT 2233, VERB 1952, PROPN 1573, PRON 961, ADV 798, ADP 748, ADJ 736, DET 701, NUM 362, AUX 340, SCONJ 314, CCONJ 306, PART 234, X 175, INTJ 32, SYM 12)

The 10 most frequent ambiguous types: none listed.

Morphology

The form / lemma ratio of SYM is 2.000000 (the average of all parts of speech is 238.235294).

The highest number of forms (2) was observed with the lemma “_”: $, %.
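The lemma, type and token counts and the form / lemma ratio can be reproduced along the following lines. This is a minimal sketch, not the script that generated this page: it assumes the ratio is the number of distinct forms divided by the number of distinct lemmas (which agrees with 2 types / 1 lemma = 2.000000 above), and both `SAMPLE` and `tag_counts` are hypothetical names with invented data rather than sentences from UD_Javanese-CSUI.

```python
# Hypothetical CoNLL-U fragment, invented for illustration only.
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
SAMPLE = """\
1\t50\t_\tNUM\t_\t_\t2\tnummod\t_\t_
2\t%\t_\tSYM\t_\t_\t0\troot\t_\t_

1\t$\t_\tSYM\t_\t_\t0\troot\t_\t_
2\t10\t_\tNUM\t_\t_\t1\tnummod\t_\t_
"""


def tag_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, forms, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            tokens += 1
            forms.add(cols[1])
            lemmas.add(cols[2])
    return len(lemmas), len(forms), tokens


lemmas, types, tokens = tag_counts(SAMPLE, "SYM")
# Assumed definition: distinct forms per distinct lemma.
ratio = types / lemmas
print(lemmas, types, tokens, f"{ratio:.6f}")
```

Because every lemma in the sample is the placeholder “_”, all forms collapse onto a single lemma, which is the same effect that produces the very high average ratio reported for this treebank.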

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 4 different relations: appos (5; 42% of instances), nmod (4; 33% of instances), obl (2; 17% of instances), conj (1; 8% of instances)

Parents of SYM nodes belong to 4 different parts of speech: NOUN (6; 50% of instances), PROPN (3; 25% of instances), VERB (2; 17% of instances), SYM (1; 8% of instances)
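The two parent-side tallies (incoming relation and part of speech of the parent) can be gathered in one pass over a CoNLL-U file. The sketch below is an illustration under stated assumptions: `SAMPLE`, `sentences` and `attachment_stats` are hypothetical names, the data is invented, and the HEAD column is assumed to always hold an integer.

```python
from collections import Counter

# Hypothetical CoNLL-U fragment with placeholder word forms (not treebank data).
SAMPLE = """\
1\tword1\t_\tNOUN\t_\t_\t0\troot\t_\t_
2\t10\t_\tNUM\t_\t_\t3\tnummod\t_\t_
3\t%\t_\tSYM\t_\t_\t1\tappos\t_\t_

1\tword2\t_\tVERB\t_\t_\t0\troot\t_\t_
2\t$\t_\tSYM\t_\t_\t1\tobl\t_\t_
3\t5\t_\tNUM\t_\t_\t2\tnummod\t_\t_
"""


def sentences(conllu_text):
    """Yield each sentence as a dict: token id -> (upos, head, deprel)."""
    sent = {}
    for line in conllu_text.splitlines():
        if not line.strip():
            if sent:  # a blank line closes the current sentence
                yield sent
                sent = {}
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if cols[0].isdigit():  # plain tokens only
            sent[int(cols[0])] = (cols[3], int(cols[6]), cols[7])
    if sent:
        yield sent


def attachment_stats(conllu_text, upos):
    """Count incoming relations and parent UPOS tags for nodes tagged `upos`."""
    relations, parent_tags = Counter(), Counter()
    for sent in sentences(conllu_text):
        for tag, head, deprel in sent.values():
            if tag != upos:
                continue
            relations[deprel] += 1
            if head != 0:  # head 0 is the artificial root, which has no tag
                parent_tags[sent[head][0]] += 1
    return relations, parent_tags


relations, parent_tags = attachment_stats(SAMPLE, "SYM")
total = sum(relations.values())
for rel, n in relations.most_common():
    print(f"{rel} ({n}; {round(100 * n / total)}%)")
```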

2 (17%) SYM nodes are leaves.

1 (8%) SYM node has one child.

1 (8%) SYM node has two children.

8 (67%) SYM nodes have three or more children.

The highest child degree of a SYM node is 3.
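The leaf / one / two / three-or-more breakdown is a histogram of how many tokens name each SYM node as their head. A minimal sketch of that count follows; `SAMPLE` and `degree_buckets` are hypothetical names, the sentences are invented, and sentences are assumed to be separated by exactly one blank line.

```python
from collections import Counter

# Hypothetical CoNLL-U fragment with placeholder word forms (not treebank data).
SAMPLE = """\
1\tword1\t_\tNOUN\t_\t_\t0\troot\t_\t_
2\t(\t_\tPUNCT\t_\t_\t4\tpunct\t_\t_
3\t10\t_\tNUM\t_\t_\t4\tnummod\t_\t_
4\t%\t_\tSYM\t_\t_\t1\tappos\t_\t_
5\t)\t_\tPUNCT\t_\t_\t4\tpunct\t_\t_

1\tword2\t_\tVERB\t_\t_\t0\troot\t_\t_
2\t$\t_\tSYM\t_\t_\t1\tobl\t_\t_
"""


def degree_buckets(conllu_text, upos):
    """Return (Counter over '0', '1', '2', '3+', highest child degree)."""
    buckets = Counter()
    max_degree = 0
    # Assumption: one blank line between sentences.
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        rows = [r for r in rows if r[0].isdigit()]  # plain tokens only
        # Number of dependents per head id within this sentence.
        children = Counter(int(r[6]) for r in rows)
        for r in rows:
            if r[3] == upos:
                degree = children[int(r[0])]
                buckets["3+" if degree >= 3 else str(degree)] += 1
                max_degree = max(max_degree, degree)
    return buckets, max_degree


buckets, max_degree = degree_buckets(SAMPLE, "SYM")
print(dict(buckets), max_degree)
```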

Children of SYM nodes are attached using 7 different relations: nummod (10; 37% of instances), punct (10; 37% of instances), nmod (3; 11% of instances), advmod (1; 4% of instances), case (1; 4% of instances), cc (1; 4% of instances), conj (1; 4% of instances)

Children of SYM nodes belong to 8 different parts of speech: NUM (10; 37% of instances), PUNCT (10; 37% of instances), NOUN (2; 7% of instances), ADP (1; 4% of instances), ADV (1; 4% of instances), CCONJ (1; 4% of instances), PROPN (1; 4% of instances), SYM (1; 4% of instances)
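The child-side figures can be cross-checked against each other. Since the highest child degree is 3, every node in the "three or more children" group has exactly three, so the degree distribution implies 1·1 + 1·2 + 8·3 = 27 children, which should equal the totals of both child tallies. The sketch below only re-adds numbers copied from this page; the variable names and the `pct` helper are hypothetical.

```python
# Figures copied from the statistics on this page.
degree_distribution = {0: 2, 1: 1, 2: 1, 3: 8}  # children per node -> SYM nodes
child_relations = {"nummod": 10, "punct": 10, "nmod": 3, "advmod": 1,
                   "case": 1, "cc": 1, "conj": 1}
child_tags = {"NUM": 10, "PUNCT": 10, "NOUN": 2, "ADP": 1, "ADV": 1,
              "CCONJ": 1, "PROPN": 1, "SYM": 1}

sym_nodes = sum(degree_distribution.values())
total_children = sum(d * n for d, n in degree_distribution.items())

# All three views of the same children must agree.
assert sym_nodes == 12
assert total_children == sum(child_relations.values()) == sum(child_tags.values())


def pct(count, total):
    """Percentage rounded to the nearest integer, as displayed on this page."""
    return round(100 * count / total)


print(total_children, pct(10, total_children), pct(3, total_children))
```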