home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-Porttinari: POS Tags: PROPN

There are 3967 PROPN lemmas (28%), 3967 PROPN types (19%) and 10903 PROPN tokens (6%). Out of 16 observed tags, the rank of PROPN is: 2 in number of lemmas, 3 in number of types and 6 in number of tokens.

The 10 most frequent PROPN lemmas: Brasil, Paulo, São, EUA, Rio, Temer, JBS, Folha, Estado, Doria

The 10 most frequent PROPN types: Brasil, Paulo, São, EUA, Rio, Temer, JBS, Folha, Estado, Doria

The 10 most frequent ambiguous lemmas: the (PROPN 4, X 1), in (PROPN 18, X 3), a (ADP 2152, PROPN 3, NOUN 2), PM (PROPN 6, NOUN 1), and (PROPN 4, X 1), to (PROPN 4, X 1), 5 (NUM 25, PROPN 3), like (NOUN 2, PROPN 1), 55 (NUM 5, PROPN 2), se (PRON 732, SCONJ 274, PROPN 1)

The 10 most frequent ambiguous types: São (PROPN 135, AUX 35, VERB 4), Justiça (PROPN 48, NOUN 1), Polícia (PROPN 30, NOUN 2), Copa (PROPN 27, NOUN 4), the (PROPN 4, X 1), Norte (PROPN 22, NOUN 1), Brasileiro (PROPN 20, NOUN 1), in (PROPN 18, X 3), União (PROPN 19, NOUN 2), Exército (PROPN 17, NOUN 2)

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.495837).

The 1st highest number of forms (1) was observed with the lemma “#Tamojunto”: #Tamojunto.

The 2nd highest number of forms (1) was observed with the lemma “1-Azul”: 1-Azul.

The 3rd highest number of forms (1) was observed with the lemma “157”: 157.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 22 different relations: flat:name (3350; 31% instances), nmod (3039; 28% instances), nsubj (1826; 17% instances), obl (990; 9% instances), conj (583; 5% instances), obj (325; 3% instances), appos (296; 3% instances), obl:agent (139; 1% instances), parataxis (118; 1% instances), root (82; 1% instances), nsubj:pass (61; 1% instances), advcl (38; 0% instances), xcomp (13; 0% instances), list (9; 0% instances), vocative (9; 0% instances), ccomp:speech (6; 0% instances), acl:relcl (4; 0% instances), dislocated (4; 0% instances), orphan (4; 0% instances), acl (3; 0% instances), ccomp (3; 0% instances), csubj (1; 0% instances)

Parents of PROPN nodes belong to 12 different parts of speech: PROPN (4230; 39% instances), NOUN (3184; 29% instances), VERB (3063; 28% instances), ADJ (131; 1% instances), (82; 1% instances), PRON (79; 1% instances), ADV (71; 1% instances), NUM (30; 0% instances), SYM (15; 0% instances), X (14; 0% instances), AUX (3; 0% instances), INTJ (1; 0% instances)

3902 (36%) PROPN nodes are leaves.

2204 (20%) PROPN nodes have one child.

2677 (25%) PROPN nodes have two children.

2120 (19%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 10.

Children of PROPN nodes are attached using 25 different relations: case (4002; 26% instances), det (3392; 22% instances), flat:name (3351; 22% instances), punct (2164; 14% instances), conj (616; 4% instances), cc (450; 3% instances), appos (388; 2% instances), nmod (341; 2% instances), parataxis (180; 1% instances), acl:relcl (162; 1% instances), amod (142; 1% instances), cop (90; 1% instances), acl (66; 0% instances), nsubj (65; 0% instances), advmod (59; 0% instances), mark (41; 0% instances), list (17; 0% instances), nummod (16; 0% instances), orphan (11; 0% instances), advcl (7; 0% instances), ccomp (1; 0% instances), discourse (1; 0% instances), dislocated (1; 0% instances), expl (1; 0% instances), obj (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: PROPN (4230; 27% instances), ADP (4011; 26% instances), DET (3392; 22% instances), PUNCT (2164; 14% instances), NOUN (555; 4% instances), CCONJ (449; 3% instances), VERB (211; 1% instances), NUM (160; 1% instances), ADJ (153; 1% instances), AUX (90; 1% instances), ADV (73; 0% instances), PRON (33; 0% instances), SYM (19; 0% instances), SCONJ (18; 0% instances), X (7; 0% instances)