home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Poetry: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

25353 tokens (40%) have a non-empty value of Gender. 13575 types (75%) occur at least once with a non-empty value of Gender. 7152 lemmas (72%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (15618; 24% instances), ADJ (4419; 7% instances), VERB (2263; 4% instances), DET (1243; 2% instances), PRON (1122; 2% instances), PROPN (529; 1% instances), AUX (105; 0% instances), NUM (54; 0% instances).

NOUN

15618 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (14003; 90%), Number=Sing (11309; 72%).

NOUN tokens may have the following values of Gender:

Paradigm полчасаMascFemNeut
полчасаполчасаполчаса

Gender seems to be lexical feature of NOUN. 99% lemmas (3988) occur only with one value of Gender.

ADJ

4419 ADJ tokens (73% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (4414; 100%), Degree=Pos (4339; 98%), Variant=EMPTY (3736; 85%).

ADJ tokens may have the following values of Gender:

Paradigm белыйMascFemNeut
Animacy=Inan|Case=Accбелый
Case=Accбелыйбелуюбелое
Case=Genбелогобелой
Case=Insбелымбелою, белойбелым
Case=Locбеломбелойбелом
Case=Nomбелыйбелаябелое
Variant=Shortбелбела

VERB

2263 VERB tokens (27% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (2263; 100%), Number=Sing (2262; 100%), Tense=Past (2117; 94%), Mood=Ind (1589; 70%), VerbForm=Fin (1589; 70%), Voice=Act (1511; 67%), Aspect=Perf (1471; 65%).

VERB tokens may have the following values of Gender:

Paradigm бытьMascFemNeut
былбылабыло

DET

1243 DET tokens (69% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (1243; 100%), Animacy=EMPTY (1100; 88%), Poss=EMPTY (658; 53%).

DET tokens may have the following values of Gender:

Paradigm мойMascFemNeut
Animacy=Inan|Case=Accмой
Case=Accмоюмое
Case=Datмоемумоеймоему
Case=Genмоегомоеймоего
Case=Insмоиммоей
Case=Locмоеммоеймоем
Case=Nomмоймоямое

PRON

1122 PRON tokens (32% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1122; 100%), Person=EMPTY (659; 59%), Case=Nom (634; 57%).

PRON tokens may have the following values of Gender:

Paradigm чтоMascNeut
Animacy=Anim|Case=Nom|PronType=Relчто
Animacy=Inan|Case=Acc|PronType=Intчто
Animacy=Inan|Case=Acc|PronType=Negчто
Animacy=Inan|Case=Acc|PronType=Relчто
Animacy=Inan|Case=Dat|PronType=Intчему
Animacy=Inan|Case=Dat|PronType=Relчему
Animacy=Inan|Case=Gen|PronType=Intчего
Animacy=Inan|Case=Gen|PronType=Relчего
Animacy=Inan|Case=Ins|PronType=Intчем
Animacy=Inan|Case=Ins|PronType=Relчем
Animacy=Inan|Case=Loc|PronType=Intчем, чём
Animacy=Inan|Case=Loc|PronType=Relчем
Animacy=Inan|Case=Nom|PronType=Excчто
Animacy=Inan|Case=Nom|PronType=Intчто
Animacy=Inan|Case=Nom|PronType=Relчто

PROPN

529 PROPN tokens (90% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (519; 98%), Animacy=Anim (327; 62%).

PROPN tokens may have the following values of Gender:

Paradigm ЛакримозаMascFem
Animacy=Anim|Case=Nom|NameType=GivЛакримоза
Animacy=Inan|Case=Gen|NameType=GeoЛакримоза

Gender seems to be lexical feature of PROPN. 100% lemmas (350) occur only with one value of Gender.

AUX

105 AUX tokens (31% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=Ind (105; 100%), Number=Sing (105; 100%), Person=EMPTY (105; 100%), Tense=Past (105; 100%), VerbForm=Fin (105; 100%), Voice=Act (105; 100%).

AUX tokens may have the following values of Gender:

Paradigm бытьMascFemNeut
былбылабыло

NUM

54 NUM tokens (21% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (54; 100%), NumType=Card (51; 94%), Case=Nom (33; 61%).

NUM tokens may have the following values of Gender:

Paradigm дваMascFemNeut
Animacy=Anim|Case=Accдвух
Animacy=Inan|Case=Accдвадве
Case=Nomдвадведва

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (3037; 71%), NOUN –[det]–> DET (921; 65%), ADJ –[conj]–> ADJ (315; 90%), ADJ –[nsubj]–> NOUN (255; 68%), NOUN –[amod]–> VERB (197; 62%), NOUN –[acl]–> VERB (185; 63%), NOUN –[appos]–> NOUN (123; 71%), VERB –[nsubj:pass]–> NOUN (74; 61%), PROPN –[amod]–> ADJ (58; 97%), ADJ –[det]–> DET (44; 100%).