home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Bororo-BDT: POS Tags: DET

There are 8 DET lemmas (1%), 14 DET types (1%) and 144 DET tokens (2%). Out of 16 observed tags, the rank of DET is: 13 in number of lemmas, 11 in number of types and 9 in number of tokens.

The 10 most frequent DET lemmas: nowy, awy, ia, _, cewy, ecewy, cowu, mowy

The 10 most frequent DET types: nowy, awy, ia, ecewy, cewy, aia, awu, cowu, Awyre, Iage

The 10 most frequent ambiguous lemmas: nowy (DET 62, PRON 2), awy (DET 33, PRON 1), ia (DET 28, NOUN 3, NUM 2, PRON 1, VERB 1), _ (NOUN 201, VERB 142, ADV 84, PUNCT 64, X 56, ADP 44, PRON 42, PROPN 36, DET 10, PART 6, SCONJ 6, CCONJ 2, ADJ 1), ecewy (DET 3, PRON 1)

The 10 most frequent ambiguous types: ia (DET 17, NUM 2, NOUN 1), ecewy (DET 6, PRON 1), aia (DET 3, NOUN 1), ce (ADP 2, DET 1)

Morphology

The form / lemma ratio of DET is 1.750000 (the average of all parts of speech is 1.661916).

The 1st highest number of forms (6) was observed with the lemma “_”: Jiwy, awu, awyge, ecewy, ia, nowy.

The 2nd highest number of forms (3) was observed with the lemma “awy”: Awyre, awu, awy.

The 3rd highest number of forms (3) was observed with the lemma “ia”: Iage, aia, ia.

DET occurs with 5 features: Deixis (102; 71% instances), PronType (39; 27% instances), Number (26; 18% instances), Definite (23; 16% instances), Mood (1; 1% instances)

DET occurs with 9 feature-value pairs: Definite=Ind, Deixis=Med, Deixis=Prox, Deixis=Remt, Mood=Ind, Number=Plur, Number=Sing, PronType=Art, PronType=Dem

DET occurs with 12 feature combinations. The most frequent feature combination is Deixis=Med (39 tokens). Examples: nowy, no

Relations

DET nodes are attached to their parents using 7 different relations: det (125; 87% instances), nmod (6; 4% instances), nsubj (4; 3% instances), obj (4; 3% instances), obl (3; 2% instances), ccomp (1; 1% instances), dep (1; 1% instances)

Parents of DET nodes belong to 6 different parts of speech: NOUN (117; 81% instances), VERB (14; 10% instances), PROPN (7; 5% instances), PRON (3; 2% instances), ADV (2; 1% instances), X (1; 1% instances)

140 (97%) DET nodes are leaves.

3 (2%) DET nodes have one child.

1 (1%) DET nodes have two children.

The highest child degree of a DET node is 2.

Children of DET nodes are attached using 3 different relations: case (3; 60% instances), parataxis (1; 20% instances), punct (1; 20% instances)

Children of DET nodes belong to 4 different parts of speech: ADP (2; 40% instances), ADV (1; 20% instances), NOUN (1; 20% instances), PUNCT (1; 20% instances)