home en/pos edit page issue tracker

SYM: symbol

The English SYM covers PTB tags NFP (except for lines of separators, which become PUNCT), #, $, SYM, and for the percent sign (%).


Treebank Statistics (UD_English)

There are 84 SYM lemmas (0%), 84 SYM types (0%) and 759 SYM tokens (0%). Out of 17 observed tags, the rank of SYM is: 11 in number of lemmas, 11 in number of types and 17 in number of tokens.

The 10 most frequent SYM lemmas: $, -, :), %, /, +, |, :(, :-), :d

The 10 most frequent SYM types: $, -, :), %, /, +, |, :(, :-), :D

The 10 most frequent ambiguous lemmas: $ (SYM 294, NOUN 4), - (PUNCT 1651, SYM 117, X 11), :) (SYM 58, PUNCT 2), % (SYM 46, X 1), / (PUNCT 242, SYM 32, X 2), + (SYM 25, CONJ 1), | (SYM 20, PUNCT 1), x (NOUN 10, SYM 6, X 2, ADP 1), (PUNCT 325, SYM 5), (: (SYM 4, PUNCT 1)

The 10 most frequent ambiguous types: $ (SYM 294, NOUN 4), - (PUNCT 1651, SYM 117, X 11), :) (SYM 58, PUNCT 2), % (SYM 46, X 1), / (PUNCT 242, SYM 32, X 2), + (SYM 25, CONJ 1), | (SYM 20, PUNCT 1), x (NOUN 5, SYM 5, X 1), (PUNCT 325, SYM 5), (: (SYM 4, PUNCT 1)

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.173588).

The 1st highest number of forms (1) was observed with the lemma “###”: ###.

The 2nd highest number of forms (1) was observed with the lemma “$”: $.

The 3rd highest number of forms (1) was observed with the lemma “%”: %.

SYM occurs with 1 features: en-feat/Number (48; 6% instances)

SYM occurs with 1 feature-value pairs: Number=Sing

SYM occurs with 2 feature combinations. The most frequent feature combination is _ (711 tokens). Examples: $, -, :), /, +, |, :(, :-), :D, x

Relations

SYM nodes are attached to their parents using 24 different relations: en-dep/case (117; 15% instances), en-dep/discourse (112; 15% instances), en-dep/root (107; 14% instances), en-dep/nmod (78; 10% instances), en-dep/punct (71; 9% instances), en-dep/dobj (65; 9% instances), en-dep/compound (56; 7% instances), en-dep/nmod:npmod (24; 3% instances), en-dep/cc (23; 3% instances), en-dep/appos (21; 3% instances), en-dep/list (19; 3% instances), en-dep/conj (18; 2% instances), en-dep/advmod (16; 2% instances), en-dep/parataxis (13; 2% instances), en-dep/nsubjpass (5; 1% instances), en-dep/acl:relcl (2; 0% instances), en-dep/advcl (2; 0% instances), en-dep/ccomp (2; 0% instances), en-dep/nsubj (2; 0% instances), en-dep/nummod (2; 0% instances), en-dep/amod (1; 0% instances), en-dep/goeswith (1; 0% instances), en-dep/reparandum (1; 0% instances), en-dep/xcomp (1; 0% instances)

Parents of SYM nodes belong to 13 different parts of speech: NOUN (203; 27% instances), VERB (196; 26% instances), NUM (114; 15% instances), ROOT (107; 14% instances), ADJ (40; 5% instances), PROPN (34; 4% instances), SYM (30; 4% instances), X (17; 2% instances), ADV (11; 1% instances), DET (3; 0% instances), CONJ (2; 0% instances), ADP (1; 0% instances), PRON (1; 0% instances)

399 (53%) SYM nodes are leaves.

122 (16%) SYM nodes have one child.

89 (12%) SYM nodes have two children.

149 (20%) SYM nodes have three or more children.

The highest child degree of a SYM node is 11.

Children of SYM nodes are attached using 26 different relations: en-dep/nummod (303; 33% instances), en-dep/punct (167; 18% instances), en-dep/case (80; 9% instances), en-dep/appos (65; 7% instances), en-dep/nmod (56; 6% instances), en-dep/compound (53; 6% instances), en-dep/advmod (38; 4% instances), en-dep/cop (23; 3% instances), en-dep/nsubj (22; 2% instances), en-dep/cc (20; 2% instances), en-dep/det (20; 2% instances), en-dep/conj (19; 2% instances), en-dep/advcl (9; 1% instances), en-dep/amod (7; 1% instances), en-dep/nmod:npmod (5; 1% instances), en-dep/parataxis (4; 0% instances), en-dep/acl:relcl (3; 0% instances), en-dep/mark (3; 0% instances), en-dep/acl (2; 0% instances), en-dep/discourse (2; 0% instances), en-dep/nmod:poss (2; 0% instances), en-dep/aux (1; 0% instances), en-dep/dobj (1; 0% instances), en-dep/goeswith (1; 0% instances), en-dep/nmod:tmod (1; 0% instances), en-dep/xcomp (1; 0% instances)

Children of SYM nodes belong to 17 different parts of speech: NUM (355; 39% instances), PUNCT (163; 18% instances), NOUN (130; 14% instances), ADP (81; 9% instances), VERB (40; 4% instances), SYM (30; 3% instances), ADV (29; 3% instances), DET (25; 3% instances), CONJ (20; 2% instances), ADJ (14; 2% instances), PRON (11; 1% instances), PROPN (4; 0% instances), SCONJ (2; 0% instances), AUX (1; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances), X (1; 0% instances)


SYM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]