home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Kaapor-TuDeT: POS Tags: PROPN

There are 16 PROPN lemmas (8%), 18 PROPN types (8%) and 20 PROPN tokens (5%). Out of 14 observed tags, the rank of PROPN is: 5 in number of lemmas, 5 in number of types and 6 in number of tokens.

The 10 most frequent PROPN lemmas: Maíra, Purutu, _, ʃaʔe, Ana, Arauxu, Kaninde, Oropo, Tuti, kaitā

The 10 most frequent PROPN types: Xõru, ʃaʔe, Anake, Arauxu, Kaninde, Maiɾ, Mataru, Maíra, Maírake, Oropo

The 10 most frequent ambiguous lemmas: _ (VERB 3, NOUN 2, PROPN 2, ADV 1, NUM 1, PRON 1), kamarar (NOUN 1, PROPN 1)

The 10 most frequent ambiguous types: kamarar (NOUN 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.125000 (the average of all parts of speech is 1.154229).

The 1st highest number of forms (2) was observed with the lemma “Maíra”: Maíra, Maírake.

The 2nd highest number of forms (2) was observed with the lemma “Purutu”: Purutu, Purutuke.

The 3rd highest number of forms (1) was observed with the lemma “Ana”: Anake.

PROPN occurs with 3 features: Animacy (2; 10% instances), Case (2; 10% instances), Number (2; 10% instances)

PROPN occurs with 3 feature-value pairs: Animacy=Hum, Case=Aff, Number=Sing

PROPN occurs with 3 feature combinations. The most frequent feature combination is _ (16 tokens). Examples: Xõru, Anake, Arauxu, Kaninde, Maiɾ, Maíra, Maírake, Oropo, Purutu, Tuti

Relations

PROPN nodes are attached to their parents using 4 different relations: nsubj (14; 70% instances), obj (3; 15% instances), nmod (2; 10% instances), obl (1; 5% instances)

Parents of PROPN nodes belong to 2 different parts of speech: VERB (18; 90% instances), NOUN (2; 10% instances)

15 (75%) PROPN nodes are leaves.

5 (25%) PROPN nodes have one child.

The highest child degree of a PROPN node is 1.

Children of PROPN nodes are attached using 3 different relations: discourse (3; 60% instances), case (1; 20% instances), nmod (1; 20% instances)

Children of PROPN nodes belong to 3 different parts of speech: PART (3; 60% instances), ADP (1; 20% instances), PRON (1; 20% instances)