Treebank Statistics: UD_Indonesian-GSD: Features: Number
This feature is universal.
It occurs with 2 different values: Plur, Sing.
24507 tokens (20%) have a non-empty value of Number.
3886 types (20%) occur at least once with a non-empty value of Number.
2552 lemmas (16%) occur at least once with a non-empty value of Number.
The feature is used with 3 part-of-speech tags: NOUN (21190; 17% instances), PRON (2859; 2% instances), DET (458; 0% instances).
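Per-tag counts like these can be reproduced from a CoNLL-U file by reading the UPOS and FEATS columns. The following is a minimal sketch, not the script that generated this page; the three-token `SAMPLE` fragment and the helper names `parse_feats` and `count_feature_by_upos` are assumptions made for illustration.

```python
from collections import Counter

# Hypothetical three-token CoNLL-U fragment (illustrative, not copied from the treebank).
SAMPLE = """\
1\tanak-anak\tanak\tNOUN\t_\tNumber=Plur\t2\tnsubj\t_\t_
2\tmembaca\tbaca\tVERB\t_\t_\t0\troot\t_\t_
3\tbuku\tbuku\tNOUN\t_\tNumber=Sing\t2\tobj\t_\t_
"""


def parse_feats(feats):
    """Turn a FEATS string such as 'Number=Sing|Person=3' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def count_feature_by_upos(conllu_text, feature="Number"):
    """Count, per UPOS tag, all tokens and those with a non-empty `feature`."""
    with_feature = Counter()
    totals = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        totals[upos] += 1
        if feature in parse_feats(cols[5]):
            with_feature[upos] += 1
    return with_feature, totals


with_feature, totals = count_feature_by_upos(SAMPLE)
for upos, n in with_feature.items():
    print(f"{upos}: {n} of {totals[upos]} tokens have Number")
```

Running the same function over the full treebank files would give the numerators listed above, provided the page counts syntactic words in the same way.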
NOUN
21190 NOUN tokens (80% of all NOUN tokens) have a non-empty value of Number.
NOUN tokens may have the following values of Number:
Plur (682; 3% of non-empty Number): orang-orang, anak-anak, negara-negara, undang-undang, lagu-lagu, kata-kata, kitab-kitab, kota-kota, raja-raja, kapal-kapal
Sing (20508; 97% of non-empty Number): tahun, orang, desa, nama, kota, bagian, bahasa, wilayah, saat, film
EMPTY (5241): tanggal, sepak, luas, band, atas, pusat, gelar, km, serial, sekarang
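The percentages for NOUN follow directly from the counts listed. A short check, using only the numbers given on this page (the helper name `share` is an assumption for illustration):

```python
# Counts for NOUN as listed on this page.
counts = {"Plur": 682, "Sing": 20508}
empty = 5241


def share(part, whole):
    """Return part/whole as a whole-number percentage."""
    return round(100 * part / whole)


non_empty = sum(counts.values())  # tokens with a non-empty Number
for value, n in counts.items():
    print(f"{value}: {share(n, non_empty)}% of non-empty Number")
print(f"non-empty: {share(non_empty, non_empty + empty)}% of all NOUN tokens")
```

The two values sum to the 21190 NOUN tokens with a non-empty Number, and adding the EMPTY tokens gives the denominator for the 80% figure.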
Paradigm tahun

Sing | Plur |
---|---|
tahun, tahunan | tahun-tahun |
PRON
2859 PRON tokens (45% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: PronType=Prs (2819; 99%), Person=3 (2473; 86%).
PRON tokens may have the following values of Number:
Plur (466; 16% of non-empty Number): mereka, kita, kami, kalian, apa-apa, beberapa
Sing (2393; 84% of non-empty Number): nya, ia, dia, ku, kamu, aku, mu, engkau, seseorang, beliau
EMPTY (3558): yang, apa, diri, siapa, mana, itu, demikian, semua, ini, sini
Number seems to be a lexical feature of PRON. 100% of lemmas (21) occur with only one value of Number.
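One plausible reading of the "lexical feature" observation is that tokens are grouped by lemma and each lemma is checked for having a single Number value. The sketch below follows that reading; the exact rule used to generate this page is not stated here, and the `OBSERVATIONS` list and the function name `single_valued_lemmas` are hypothetical.

```python
from collections import defaultdict

# Hypothetical (lemma, Number) observations; not real treebank counts.
OBSERVATIONS = [
    ("mereka", "Plur"),
    ("dia", "Sing"),
    ("mereka", "Plur"),
    ("kita", "Plur"),
    ("dia", "Sing"),
]


def single_valued_lemmas(observations):
    """Return (lemmas with exactly one value, all lemmas seen with a value)."""
    values_by_lemma = defaultdict(set)
    for lemma, value in observations:
        values_by_lemma[lemma].add(value)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)


single, total = single_valued_lemmas(OBSERVATIONS)
print(f"{single} of {total} lemmas occur with only one value of Number")
```

When the two numbers are equal, as with the 21 PRON lemmas reported above, the value is fully predictable from the lemma.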
DET
458 DET tokens (12% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: Definite=EMPTY (458; 100%), PronType=Ind (458; 100%).
DET tokens may have the following values of Number:
Plur (457; 100% of non-empty Number): beberapa, para, berbagai, banyak, sejumlah, kebanyakan, serangkaian, aneka, beragam, sekelompok
Sing (1; 0% of non-empty Number): sesuatu
EMPTY (3212): ini, itu, sebuah, tersebut, nya, seorang, suatu, semua, setiap, seluruh
Number seems to be a lexical feature of DET. 100% of lemmas (13) occur with only one value of Number.
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[compound]–> NOUN (4091; 66%),
NOUN –[nmod]–> NOUN (1524; 66%),
NOUN –[nmod:poss]–> PRON (964; 71%),
NOUN –[conj]–> NOUN (963; 69%),
NOUN –[nsubj]–> NOUN (122; 63%),
NOUN –[amod]–> NOUN (79; 62%),
NOUN –[nmod:tmod]–> NOUN (50; 74%),
NOUN –[acl]–> NOUN (31; 72%),
NOUN –[clf]–> NOUN (11; 100%),
NOUN –[advcl]–> NOUN (8; 53%).
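Agreement counts of this kind can be sketched by comparing each node's Number with that of its head. The code below is an illustration for a single sentence, not the page's own method: the `SAMPLE` fragment is hypothetical, and the choice of denominator (pairs in which both parent and child have a non-empty Number) is an assumption about how the percentages are defined.

```python
from collections import Counter

# Hypothetical single-sentence CoNLL-U fragment (illustrative only).
SAMPLE = """\
1\tanak-anak\tanak\tNOUN\t_\tNumber=Plur\t0\troot\t_\t_
2\tdesa\tdesa\tNOUN\t_\tNumber=Sing\t1\tnmod\t_\t_
3\tmereka\tmereka\tPRON\t_\tNumber=Plur|Person=3|PronType=Prs\t1\tnmod:poss\t_\t_
"""


def number_of(feats):
    """Return the Number value from a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Number":
            return value
    return None


def agreement_counts(sentence_text):
    """Count agreeing and comparable parent-child pairs in one sentence."""
    tokens = {}
    for line in sentence_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        tokens[int(cols[0])] = (cols[3], number_of(cols[5]), int(cols[6]), cols[7])

    agree, comparable = Counter(), Counter()
    for upos, number, head, deprel in tokens.values():
        parent = tokens.get(head)  # None for the root, whose head is 0
        if parent is None or number is None or parent[1] is None:
            continue
        key = (parent[0], deprel, upos)  # (parent UPOS, relation, child UPOS)
        comparable[key] += 1
        if number == parent[1]:
            agree[key] += 1
    return agree, comparable


agree, comparable = agreement_counts(SAMPLE)
for key, n in comparable.items():
    print(f"{key[0]} -[{key[1]}]-> {key[2]}: {agree[key]} of {n} agree")
```

For a whole treebank the function would be applied sentence by sentence and the counters summed, since token IDs restart in every sentence.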