
This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-Porttinari: POS Tags: DET

There are 31 DET lemmas (0%), 88 DET types (0%) and 24325 DET tokens (14%). Out of 16 observed tags, the rank of DET is: 10 in number of lemmas, 10 in number of types and 3 in number of tokens.
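As a rough illustration of how lemma, type and token counts like these could be derived, the sketch below tallies them for one UPOS tag from CoNLL-U text. The function name `upos_counts` and the toy sentence are hypothetical and not taken from the treebank. The sketch also assumes that types are compared case-sensitively, which may differ from how the official statistics were produced.

```python
def upos_counts(conllu_text, upos):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence breaks) and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep only regular word lines: 10 columns and a plain integer ID.
        # This skips multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:          # column 4 is UPOS
            tokens += 1
            types.add(cols[1])       # column 2 is FORM (case-sensitive here)
            lemmas.add(cols[2])      # column 3 is LEMMA
    return len(lemmas), len(types), tokens


# Hypothetical toy sentence, not from UD_Portuguese-Porttinari.
SAMPLE = "\n".join([
    "# text = o menino viu a casa e o gato",
    "1\to\to\tDET\t_\t_\t2\tdet\t_\t_",
    "2\tmenino\tmenino\tNOUN\t_\t_\t3\tnsubj\t_\t_",
    "3\tviu\tver\tVERB\t_\t_\t0\troot\t_\t_",
    "4\ta\to\tDET\t_\t_\t5\tdet\t_\t_",
    "5\tcasa\tcasa\tNOUN\t_\t_\t3\tobj\t_\t_",
    "6\te\te\tCCONJ\t_\t_\t8\tcc\t_\t_",
    "7\to\to\tDET\t_\t_\t8\tdet\t_\t_",
    "8\tgato\tgato\tNOUN\t_\t_\t5\tconj\t_\t_",
])

print(upos_counts(SAMPLE, "DET"))  # (1, 2, 3): one lemma, two forms, three tokens
```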

The 10 most frequent DET lemmas: o, um, seu, esse, este, outro, todo, meu, algum, mesmo

The 10 most frequent DET types: o, a, os, as, um, uma, sua, seu, esse, essa

The 10 most frequent ambiguous lemmas: o (DET 18898, PRON 801), um (DET 2281, NUM 309, PRON 48), seu (DET 758, PRON 8), esse (DET 531, PRON 60), este (DET 294, PRON 29), outro (DET 239, PRON 83), todo (DET 220, PRON 37, NOUN 4, ADJ 3), meu (DET 150, PRON 4), algum (DET 145, PRON 31), mesmo (DET 125, ADV 99, PRON 16)

The 10 most frequent ambiguous types: o (DET 7184, PRON 481), a (DET 6456, ADP 2024, PRON 122, PROPN 3), os (DET 1856, PRON 80), as (DET 1324, PRON 48), um (DET 1200, NUM 155, PRON 32), uma (DET 973, NUM 116, PRON 8), sua (DET 280, PRON 3), seu (DET 212, PRON 4), esse (DET 189, PRON 13), essa (DET 158, PRON 14)

Morphology

The form / lemma ratio of DET is 2.838710 (the average of all parts of speech is 1.495837).
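The reported ratio agrees with dividing the number of DET types by the number of DET lemmas given earlier on this page. The page does not spell out the formula, so this reading is an assumption, and the variable names are illustrative only.

```python
# Counts reported on this page for DET.
det_types = 88    # distinct word forms
det_lemmas = 31   # distinct lemmas

# Assumed definition: form/lemma ratio = types / lemmas.
ratio = det_types / det_lemmas
print(f"{ratio:.6f}")  # 2.838710
```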

The 1st highest number of forms (4) was observed with the lemma “algum”: algum, alguma, algumas, alguns.

The 2nd highest number of forms (4) was observed with the lemma “aquele”: aquela, aquelas, aquele, aqueles.

The 3rd highest number of forms (4) was observed with the lemma “cujo”: cuja, cujas, cujo, cujos.

DET occurs with 6 features: PronType (24325; 100% instances), Number (24207; 100% instances), Gender (24025; 99% instances), Definite (21179; 87% instances), Person (1012; 4% instances), Poss (1012; 4% instances)

DET occurs with 15 feature-value pairs: Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=1, Person=3, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel

DET occurs with 35 feature combinations. The most frequent feature combination is Definite=Def|Gender=Masc|Number=Sing|PronType=Art (8070 tokens). Examples: o

Relations

DET nodes are attached to their parents using 6 different relations: det (24084; 99% instances), fixed (183; 1% instances), advmod (47; 0% instances), mark (5; 0% instances), conj (4; 0% instances), obj (2; 0% instances)
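The percentages above can be reproduced from the raw counts, assuming each share is the count divided by the total number of DET tokens and rounded to the nearest whole percent. That is why relations with a handful of instances show as 0%. The dictionary below simply restates the counts from this page.

```python
# Relation counts for DET nodes, as reported above.
det_relations = {
    "det": 24084, "fixed": 183, "advmod": 47,
    "mark": 5, "conj": 4, "obj": 2,
}

total = sum(det_relations.values())  # should equal the DET token count

# Assumed rounding rule: nearest whole percent.
shares = {rel: round(100 * n / total) for rel, n in det_relations.items()}

for rel, pct in shares.items():
    print(f"{rel}: {det_relations[rel]} ({pct}%)")
```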

Parents of DET nodes belong to 11 different parts of speech: NOUN (20422; 84% instances), PROPN (3392; 14% instances), ADP (179; 1% instances), PRON (122; 1% instances), X (82; 0% instances), NUM (59; 0% instances), VERB (25; 0% instances), ADJ (20; 0% instances), SYM (11; 0% instances), DET (7; 0% instances), ADV (6; 0% instances)

24264 (100%) DET nodes are leaves.

23 (0%) DET nodes have one child.

35 (0%) DET nodes have two children.

3 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 4.

Children of DET nodes are attached using 4 different relations: fixed (89; 86% instances), punct (6; 6% instances), cc (4; 4% instances), conj (4; 4% instances)
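As a sanity check, the child-degree figures and the child-relation counts above can be cross-checked against each other. The sketch below only restates numbers from this page. The bounds rely on the stated maximum degree of 4, and the variable names are illustrative.

```python
# Child-degree distribution of DET nodes, as reported above.
leaves, one_child, two_children, three_plus = 24264, 23, 35, 3
max_degree = 4

# Relations attaching children to DET nodes, as reported above.
child_relations = {"fixed": 89, "punct": 6, "cc": 4, "conj": 4}

total_nodes = leaves + one_child + two_children + three_plus
total_children = sum(child_relations.values())

# Nodes with "three or more" children have between 3 and max_degree each.
min_children = one_child + 2 * two_children + 3 * three_plus
max_children = one_child + 2 * two_children + max_degree * three_plus

consistent = min_children <= total_children <= max_children
print(total_nodes, total_children, (min_children, max_children), consistent)
```

The node total matches the 24325 DET tokens, and the 103 children fall inside the range implied by the degree distribution.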

Children of DET nodes belong to 6 different parts of speech: NOUN (51; 50% instances), ADV (30; 29% instances), DET (7; 7% instances), PUNCT (6; 6% instances), SCONJ (5; 5% instances), CCONJ (4; 4% instances)