home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-Porttinari: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

64566 tokens (38%) have a non-empty value of Gender. 9711 types (51%) occur at least once with a non-empty value of Gender. 6645 lemmas (52%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (29451; 18% instances), DET (24025; 14% instances), ADJ (5439; 3% instances), PRON (2760; 2% instances), VERB (2260; 1% instances), NUM (569; 0% instances), AUX (62; 0% instances).

NOUN

29451 NOUN tokens (94% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (21254; 72%).

NOUN tokens may have the following values of Gender:

Paradigm filhoMascFem
Number=Singfilhofilha
Number=Plurfilhosfilhas

Gender seems to be lexical feature of NOUN. 98% lemmas (4562) occur only with one value of Gender.

DET

24025 DET tokens (99% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (21179; 88%), Number=Sing (19613; 82%), Definite=Def (18898; 79%).

DET tokens may have the following values of Gender:

Paradigm oMascFem
Number=Singoa
Number=Plurosas

ADJ

5439 ADJ tokens (64% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: VerbForm=EMPTY (4511; 83%), Number=Sing (3808; 70%).

ADJ tokens may have the following values of Gender:

Paradigm novoMascFem
Number=Singnovonova
Number=Plurnovosnovas

PRON

2760 PRON tokens (43% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (2174; 79%), Person=3 (1824; 66%), Case=EMPTY (1643; 60%).

PRON tokens may have the following values of Gender:

Paradigm eleMascFem
Number=Singeleela
Number=Plureleselas

VERB

2260 VERB tokens (13% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2260; 100%), Person=EMPTY (2260; 100%), Tense=EMPTY (2260; 100%), VerbForm=Part (2259; 100%), Number=Sing (1599; 71%).

VERB tokens may have the following values of Gender:

Paradigm terMascFem
Number=Singtido
Number=Plur|Voice=Passtidas

NUM

569 NUM tokens (18% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (559; 98%).

NUM tokens may have the following values of Gender:

Paradigm umMascFem
umuma

AUX

62 AUX tokens (1% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (62; 100%), Number=Sing (62; 100%), Person=EMPTY (62; 100%), Tense=EMPTY (62; 100%), VerbForm=Part (62; 100%).

AUX tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (18909; 93%), NOUN –[amod]–> ADJ (3738; 62%), NOUN –[conj]–> NOUN (770; 50%), NOUN –[acl]–> VERB (616; 50%), VERB –[nsubj:pass]–> NOUN (471; 91%), ADJ –[nsubj]–> NOUN (222; 53%), PRON –[amod]–> ADJ (141; 56%), NUM –[nmod]–> NOUN (122; 56%), PRON –[nmod]–> NOUN (99; 58%), PRON –[det]–> DET (75; 62%).