home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Icelandic-PUD: POS Tags: DET

There are 3 DET lemmas (0%), 11 DET types (0%) and 22 DET tokens (0%). Out of 17 observed tags, the rank of DET is: 14 in number of lemmas, 12 in number of types and 14 in number of tokens.

The 10 most frequent DET lemmas: hinn, annar, þessi

The 10 most frequent DET types: hinn, hinna, hins, hið, hinum, Hin, aðrar, hinnar, hinni, hinu

The 10 most frequent ambiguous lemmas: hinn (DET 20, PRON 8), annar (PRON 45, DET 1, NUM 1), þessi (PRON 124, DET 1)

The 10 most frequent ambiguous types: hinn (DET 4, PRON 4), hins (DET 3, PRON 1), hinum (DET 2, PRON 1), aðrar (PRON 3, DET 1), þetta (PRON 29, DET 1)

Morphology

The form / lemma ratio of DET is 3.666667 (the average of all parts of speech is 1.365967).

The 1st highest number of forms (9) was observed with the lemma “hinn”: Hin, hinn, hinna, hinnar, hinni, hins, hinu, hinum, hið.

The 2nd highest number of forms (1) was observed with the lemma “annar”: aðrar.

The 3rd highest number of forms (1) was observed with the lemma “þessi”: þetta.

DET occurs with 4 features: Case (22; 100% instances), Gender (22; 100% instances), Number (22; 100% instances), PronType (2; 9% instances)

DET occurs with 11 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, PronType=Dem, PronType=Ind

DET occurs with 16 feature combinations. The most frequent feature combination is Case=Acc|Gender=Masc|Number=Sing (3 tokens). Examples: hinn

Relations

DET nodes are attached to their parents using 1 different relations: det (22; 100% instances)

Parents of DET nodes belong to 4 different parts of speech: NOUN (10; 45% instances), ADJ (9; 41% instances), PRON (2; 9% instances), PROPN (1; 5% instances)

22 (100%) DET nodes are leaves.

The highest child degree of a DET node is 0.