
This page pertains to UD version 2.

Treebank Statistics: UD_Javanese-CSUI: POS Tags: NOUN

There is 1 NOUN lemma (6%), 1147 NOUN types (28%) and 2867 NOUN tokens (20%). Out of 17 observed tags, NOUN ranks 8th in number of lemmas, 1st in number of types and 1st in number of tokens.

The 10 most frequent NOUN lemmas: _ (the only NOUN lemma; "_" is the CoNLL-U placeholder for an unspecified value)

The 10 most frequent NOUN types: taun, wong, warga, bathik, tembang, basa, bocah, pusaka, tembung, ati

The 10 most frequent ambiguous lemmas: _ (NOUN 2867, PUNCT 2233, VERB 1952, PROPN 1573, PRON 961, ADV 798, ADP 748, ADJ 736, DET 701, NUM 362, AUX 340, SCONJ 314, CCONJ 306, PART 234, X 175, INTJ 32, SYM 12)

The 10 most frequent ambiguous types: warga (NOUN 32, X 1), dolanan (NOUN 15, VERB 2), tradhisi (NOUN 11, PROPN 1), wujud (NOUN 11, VERB 1), gambar (NOUN 10, VERB 2), teges (NOUN 11, VERB 3), Tari (PROPN 11, NOUN 9), sekolah (NOUN 9, VERB 3), dalem (NOUN 8, ADJ 5), pentas (NOUN 5, VERB 3)

Morphology

The form / lemma ratio of NOUN is 1147.000000 (the average of all parts of speech is 238.235294).
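The ratio above is consistent with dividing the number of distinct NOUN forms (1147) by the number of distinct NOUN lemmas (1). Since the only lemma is the placeholder "_", the high value appears to reflect missing lemmatization rather than morphology. The following is a minimal Python sketch of that reading, assuming tokens are available as (form, lemma, UPOS) triples. The function name `form_lemma_ratio` and the sample tokens are illustrative and are not taken from the UD tooling.

```python
def form_lemma_ratio(tokens, upos):
    """Distinct forms divided by distinct lemmas for one UPOS tag.

    Assumption: forms are compared case-sensitively, as the form list
    on this page suggests (e.g. "Tari" is listed as its own type).
    """
    forms = {form for form, _, tag in tokens if tag == upos}
    lemmas = {lemma for _, lemma, tag in tokens if tag == upos}
    if not lemmas:
        return 0.0  # the tag does not occur at all
    return len(forms) / len(lemmas)


# Hypothetical mini-sample; every lemma is the placeholder "_".
tokens = [
    ("taun", "_", "NOUN"),
    ("wong", "_", "NOUN"),
    ("taun", "_", "NOUN"),
    ("Tari", "_", "PROPN"),
    ("warga", "_", "NOUN"),
]

noun_ratio = form_lemma_ratio(tokens, "NOUN")  # 3 distinct forms / 1 lemma
```

With a single lemma per tag, the ratio simply equals the number of types for that tag, which is why it matches the type count reported at the top of the page.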

The 1st highest number of forms (1147) was observed with the lemma “_”: 1910-an, 1990-an, 1990an, Bageyan, Bojo, Cacahe, Carikan, Daging, Dekik, Dhudha, Esem, Eseming, Fakultas, Gegambaran, Geladhen, Generasi, Geolog, Gethuk, Gur, Idham-idhaman, Jambu, Janji, Jodho, Judhul, Karakter-karakter, Kedhokan, Kekarepan, Kelinci, Kelurahan, Kung, Larangan, Le, Lokasi, Luar, Mantri, Markas, Mas, Maskape, Mbah, Mbakyu, Momotan, Mongkoging, Mula, Ngarsaning, Pak, Pamaburan, Pamarentahan, Panambahan, Pangarsa, Pekalongan, Penjajah, Permainan, Piring, Priya, Rancangan, Rp100.000, Rudal, Sabisa, Sakabehan, Sarasehan, Sauran, Sawah-sawah, Sepisan, Sesasi, Solo, Surah, Tali, Tanah, Tantangan, Tari, Temon, Tetandhingan, Tetenger, Timur, Tujuwan, Tuladha, Tuladhan, Ula, Umpama, Umpami, Urang, Uti, Wabah, Weteng, abad, abdi, acara, adat, adeging, adhi, adhik, adicara, agama, ajang, ajaran, aji, akhirat, akibat, akir, aksara, aksi, akuntan, alangan, album, alesan, amal, amalan-amalan, ambal, ambegan, ambruk, amor, anak, ancas, angen-angen, anggitan, anggon, anggota, angin, angkah, angkatan, antara, antawis, apem, apotek, ara-ara, arah, aran, arane, area, arepa, arisan, arta, artis-artis, asil, asma, asrama, asta, asteroid, ati, atletik, atmosfer, atur, aturan, awak, awal, awan, ayah, azan, azola, bababagan, babaran, babon, badan, bagean, bahan, bak, bakal, bakpao, baku, bakul, bala, balapan, bancakan, bandha, bangsa, bangsa-bangsa, bantuan, banyu, bapak, barang, barat, barong, barongan, basa, bathik, bathin, bayaran, bebek, bebingah, bedhengan, bekel, beksan, bendungan, bengi, bengokan, bensin, bentelan, beras, bestik, beya, bibit, biologis, blangkon, blanjan, bledheg, bleger, blegere, blek, blog, bocah, bocah-bocah, boncengan, bordir, boy-boy-an, brana, bronjong, bubuk, budaya, budhaya, budi, bujana, buka, buku, buku-buku, bumi, buruh, burung, bus, busana, cacah, cagak, cahyaning, cakepan, campuran, campursari, cangkir, canthing, cara, carik, cathetan, cawuh, cemanthel, cerna, cet, 
clana, conto, crita, cuaca, dakon, dalan, dalem, dalu, dampak, dana, daptar, darah, darbe, data, dawa, daya, degan, desa, dewan, dhadha, dhaerah, dhaharan, dhasar, dhedhasar, dhele, dhiri, dhoktrin, dhosen, dhukun, dhuwit, dhuwur, dikir, dina, dinten, dipan, dlamakan, dolanan, dolar, dollar, donga, donga-donga, donya, dosa, drajating, efek, ekonomi, emas, emperan, emprit, endhut, endi-endi, energi, episode, era, es, esuk, etan, euro, eyang, film, fragmen, gadaha, galengan, gaman, gambar, gambar-gambar, gambaran, gamelan, gampa, gandheng, gang, garapan, garwa, gas, gawean, gaweyan, gaya, gayeng, gedhang, gedhong, gedhung, gegambaraning, geger, gegujengan, gelar, gelas, gencatan, gerakan, gerji, gesang, getih, ginastel, gobak, godhong, gol, golekan, goncengan, gorengan, gotong, grana, grimis, griya, grup, gudel, gulung, gunem, guneman, gunggung, guru, guyub, halalbihalal, harafiah, hasil, hawa, hik, hotel, huru-hara, ibadah, ibu, ibukitha, impen, implikasi, indhustri, informasi, infrastruktur, ingah-ingah, ingajenan, ingkung, injet, institusi, interpretasi, isi, istri-istri, iwak, jabatan, jadwal, jahe, jahitan, jala, jalur, jam, jaman, jamane, jangan, janganan, jantung, jarak, jasa, jati, jebulan, jejuluk, jembar, jenang, jeneng, jeneng-jeneng, jengki, jero, jeruk, jinis, jinising, jiwa, jok, jujugan, juragan, jurnal, jurnale, juru, jurusan, kabahagyan, kabar, kabecikan, kabegjan, kabudaya, kabudayaan, kabudayan, kabungahan, kadadosan, kahana, kahanan, kain, kaji, kajian, kakang, kakung, kala, kalen, kali, kalung, kalungguhan, kamar, kamardikan, kamenangan, kampung, kampung-kampung, kampus, kanca, kanca-kanca, kangen, kangmas, kanker, kantin, kantor, kapentingan, kapracayan, karak, karang, karep, karesidenan, karir, karung, karya, karya-karya, kasarasaning, kasil, kasrakahan, kasuksesan, kasunyatan, kasus, katentreman, katentremaning, katering, kates, kathok, katrampilan, katrangan, kaum, kautaman, kawruh, kawula, kayu, kebaya, kebo, kebon, kegiyatan, kejuaraan, 
kekayaan, kekirangan, kekiyatan, kelas, kemampuan, kemanusiaan, kembang, kembul, kempol, kenalan, kenangan, kendhi, kentekan, kenya, kepala, keprajuritan, keraton, kerdhus, keris, kesenian, ketua, kewan, kewan-kewan, khutbah, kidul, kinurmatan, kitha, klambi, klenengan, kluwarga, kolam, koleksi, kolor, komentar, komputer, komunikasi, komunitas, konsorsium, konsultasi, konten, kopi, koran, koridor, korupsi, kostum, kothak, kraman, kranjang, kraton, kreditor, kreket-kreket, kreteg, kridha, krikil, kualitas, kuburan, kudeta, kula, kulanuwun, kulawarga, kulit, kuliyah, kulon, kuluk, kumleyang, kumpulaning, kuping, kurs, kursi, kutha, kuwasa, lagu, lagu-lagu, lair, lakon, laku, laladan, laman, lambe, lanang, lancingan, langit, lap, lapangan, laporan, laptop, latar, lawang, lawuh, layanan, lebo, lele, leluhur, lemari, lemari-lemari, lembaga, lendhang, lendhut, lenga, lengen, les, lingkungan, listrik, lomba, lor, lumah, lungsuran, lurah, macanan, macem-macem, madu, majalah, makluk, makna, mandiri, mangetan, mangsa, mangsak, manten, mantenan, manuk, manungsa, mas-masan, masarakat, masjid, masyarakat, mataun-taun, materi, mburi, medan, meja, mejid, memedi, menara, mendhung, mendhuwur, menit, menyan, meranggi, mesin, mesjid, meter, milis, militan, militer, minggu, misi, modhal, momentum, montor, mopok, motif, muntabing, mupangat, mural, murid, murid-muride, musala, musik, mêdhar, nabuh, naga, nagara, nagari, nakas, napas, naskah, ndalu, ndesa, ndhisik, ndhuwur, ndhuwure, negara, negeri, nem-neman, netra, ngarep, nggone, ngidul, ngisor, ngulon, nipis, njero, njeron, nom-noman, nomer, nominasi, nyadran, nyamikan, nyamuk, obahan, obahing, obat, objek, oleh-oleh, omah, ombak, omongan, onthel, opah, opera, operasi, organ, organisasi, orkestra, pabrik, pace, pacoban, padhepokan, padina, padunung, pageblug, pagelaran, pahlawan, pajek, pakan, pakarya, pakaryan, pakurmatan, pamarentah, pambangunan, pambukaning, pamekaran, pamerangan, pametu, pamikiran, pamor, pamulangan, panah, 
panantang, pandemen, pandhemen, panemu, panen, pangajab, panganan, panganggep, panganggit, panganggo, pangapunten, pangapuran, pangeling-eling, pangembanganipun, panggaweyan, panggo, panggona, panggonan, panggunaan, panggung, pangripta, pangudi, panguripan, panitya, panjaluk, panjalukane, panjupukan, panuwun, panyarta, panyawang, papan, para-para, parapan, pari, pasar, pasareyan, pasrawungan, pate, pathok, pathokan, patrap, paviliun, pawang, pawelinge, pawitan, pawongan, payung, pecel, pedesaan, pedunungan, pegawe, pehak, pejabat-pejabat, pekarangan, pelabuhan, pelamar, pembangunan, pembina, pembuluh, pemilu, pendhak, pendhapa, pendhidhikan, pendhiri, penembak, pengangen-angen, pengetan, penggalih, pentas, penthung, penulis, penyanyi, pepanthan, pepesthe, perang, perang-perang, perangan, perantau, perawan, peringkat, perkembangan, perkutut, permata, pernah, pertandhingan, perusahaan, pesangon, pesenan, peso, petani, pete, pewarang, pidhato, piguna, pikir, pikiran, pilihan, pimpinan, pindhahan, pinggir, pipi, piranti, pit, pitakon, pitakonan, pithik, pitik, piwulang, plang, plastik, plataran, polah, ponakan, populasi, prajurit, prakara, prakawis, pranata, prapatan, prasaja, prasasti, prasetya, pratandha, praupan, prejurit, prekawis, presidhen, pribadi, pribumi, pring, produk, prototipe, provinsi, proyek, pulisi, pulitik, puncak, pundhak, pungkasan, pungkasaning, punjering, pusaka, pusat, puseran, putra, putri, putu, putusan, putusan-putusan, radio, rafia, raga, ragad, raja, rakyat, rama, rambut, randha, rasa, rasanan, rasul, rata-rata, ratu, rawuh, rayi, referensi, rega, rekor, relief, rembulan, remukan, rencang, resep, residen, resik-resik, respon, reuni, revisi, revisian, revolusi, reyog, rikala, rikma, rintisan, rombongan, royong, ruh, rukun, rumah, rumata, sabuk, sadawaning, sadhuwure, sajroning, sakabeh, sakaratul, sakdhuwure, sakgarapan, sakiwa, sakpinggire, sakubarampene, sakulawarga, salib, saluran, samangke, sambel, sambutan, samesti, samir, sampah, 
sampeyan, sandhangan, sanding, sandinge, sangu, santri, sapi, sara, saran, sarana, sarananing, sarujuke, sarupaning, sasaran, sasi, sastra, satelit, sato, saubengipun, sawah, sebaran, sebutan, sedasa, sedina, sedulur-sedulur, sedyane, sega, segara, sekolah, sekolahan, sel, semangat, semenit, semester, seminari, seminggu, seni, seniman, senjata, sepur, serangan, serial, sesajen, sesaji, sesambungan, sesawangan, sesepuh, setang, setaun, setingkat, siang, sikil, simbah, simbok, simbol, simbol-simbol, singa, sir, sirah, sisa, sisi, sisih, sistem, skripsi, slira, sodagar, sonten, sore, sorok, sorot, sosial, spanduk, srek, srengenge, status, stelane, studhio, suara, sujarah, sukarelawan, suket, sukses, sumbangan, sumber, sumsum, sumur, sunaring, sunatan, surat, susu, sutresna, suwara, swara, swasana, syaraf, tabuh, takir, taman, tambahan, tamu, tancepan, tandur, tanduran, tangan, tangga, tanggal, tanggapan, tanggung, tangis, tani, tapel, taruna, tata, tataran, taun, tebusan, teges, teh, tekstil, telon, tembang, tembok, tembung, tembung-tembung, tempe, tengah, tengah-tengah, tengene, tenger, teritori, ternak, tetanen, tetulung, teyeng, teyeng-teyeng, thekem, tilas, tim, timbangan, ting, tingkat, tinuku, titikan, tivi, tiyang, tlatah, toko, toko-toko, topeng, topi, tradhisi, traktor, trataban, tratag, tresna, tresnane, trontong, tuan, tuduhan, tugas, tujuwa, tukang, tulis, tulisan, tumbak, tumetesing, tumindak, tumpeng, ucapan, udan, ukara, ukiran-ukiran, ukuran, umur, undangan, undha-usuk, unggulan, unit, universitas, unjukan, unsur, untel-untelan, upama, urip, urusan, urutan, usaha, usus-usus, utang, uwang, versi, virus, visi, wadhah, wadon, wadya, waja, wanci, wangsulan, wangun, wanita, wanodya, warangka, warga, warisan, warna, warok, warsa, wartawan, warung, waspa, watak, wates, watu, wayah, wayangan, wedang, wekdal, wektu, welas, weling, wengi, wereng, werna, wesi, wetan, wilah, windu, winih, wit, wiwit, woh, wolu, wong, wong-wong, wujud, wulan, wulu, yen.

NOUN occurs with 3 features: Number (2862; 100% instances), Polite (421; 15% instances), Typo (2; 0% instances)

NOUN occurs with 6 feature-value pairs: Number=Plur, Number=Sing, Polite=Elev, Polite=Form, Polite=Infm, Typo=Yes

NOUN occurs with 11 feature combinations. The most frequent feature combination is Number=Sing (2407 tokens). Examples: taun, bathik, tembang, basa, pusaka, tembung, buku, rasa, tanggal, wong

Relations

NOUN nodes are attached to their parents using 29 different relations: nmod (720; 25% instances), obl (601; 21% instances), nsubj (487; 17% instances), obj (418; 15% instances), conj (152; 5% instances), obl:tmod (112; 4% instances), root (95; 3% instances), nsubj:pass (81; 3% instances), nmod:lmod (42; 1% instances), nmod:poss (27; 1% instances), appos (21; 1% instances), xcomp (20; 1% instances), nmod:tmod (14; 0% instances), advcl (13; 0% instances), obl:agent (13; 0% instances), vocative (8; 0% instances), clf (7; 0% instances), compound (7; 0% instances), nsubj:outer (7; 0% instances), parataxis (5; 0% instances), acl:relcl (4; 0% instances), acl (3; 0% instances), iobj (3; 0% instances), flat:name (2; 0% instances), amod (1; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances), fixed (1; 0% instances), nummod (1; 0% instances)
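The percentages above appear to be each count's share of the 2867 NOUN tokens, rounded to a whole number, which is why relations with fewer than about 15 instances show as 0%. A short Python sketch that reproduces a few of the listed figures under that assumption (the helper name `percent` is illustrative):

```python
def percent(count, total):
    """Share of `total`, rounded to a whole-number percentage."""
    return round(100 * count / total)


noun_tokens = 2867  # total NOUN tokens reported on this page

# A few (relation, count) pairs copied from the list above.
relation_counts = {
    "nmod": 720,
    "obl": 601,
    "nsubj": 487,
    "obj": 418,
    "nmod:tmod": 14,
    "amod": 1,
}

shares = {rel: percent(n, noun_tokens) for rel, n in relation_counts.items()}
```

Here `shares["nmod:tmod"]` comes out as 0 because 14 of 2867 is just under half a percent.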

Parents of NOUN nodes belong to 12 different parts of speech (counting the untagged parent of root-attached nodes as one): VERB (1542; 54% instances), NOUN (907; 32% instances), ADJ (177; 6% instances), [no tag: artificial root] (95; 3% instances), PROPN (57; 2% instances), PRON (28; 1% instances), X (26; 1% instances), NUM (21; 1% instances), ADV (8; 0% instances), ADP (2; 0% instances), DET (2; 0% instances), SYM (2; 0% instances)

710 (25%) NOUN nodes are leaves.

1049 (37%) NOUN nodes have one child.

690 (24%) NOUN nodes have two children.

418 (15%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 12.
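The four counts above (710 + 1049 + 690 + 418) add up to the 2867 NOUN tokens, so each NOUN node falls into exactly one bucket. As a rough sketch of how such a breakdown could be computed, the Python below counts children per node from head indices, assuming each token is given as an (id, UPOS, head) triple with 1-based ids and head 0 for the root. The function name, the bucket labels and the toy sentence are hypothetical.

```python
from collections import Counter


def child_degree_buckets(sentence, upos):
    """Bucket nodes with the given UPOS by their number of children."""
    # Number of dependents per head id (head 0 is the artificial root).
    n_children = Counter(head for _, _, head in sentence)
    buckets = Counter()
    for tok_id, tag, _ in sentence:
        if tag != upos:
            continue
        degree = n_children[tok_id]
        if degree == 0:
            buckets["leaf"] += 1
        elif degree == 1:
            buckets["one"] += 1
        elif degree == 2:
            buckets["two"] += 1
        else:
            buckets["three_or_more"] += 1
    return buckets


# Toy dependency tree: token 2 is the root verb; token 3 has two children.
sentence = [
    (1, "NOUN", 2),
    (2, "VERB", 0),
    (3, "NOUN", 2),
    (4, "DET", 3),
    (5, "NOUN", 3),
]

buckets = child_degree_buckets(sentence, "NOUN")
```

In the toy tree, tokens 1 and 5 are NOUN leaves and token 3 is a NOUN with two children.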

Children of NOUN nodes are attached using 33 different relations: nmod (990; 25% instances), det (537; 14% instances), case (453; 11% instances), punct (317; 8% instances), nmod:poss (275; 7% instances), acl:relcl (218; 5% instances), amod (205; 5% instances), nummod (183; 5% instances), conj (156; 4% instances), advmod (114; 3% instances), cc (106; 3% instances), acl (100; 3% instances), nsubj (75; 2% instances), appos (53; 1% instances), advcl (34; 1% instances), nmod:lmod (32; 1% instances), aux (21; 1% instances), obl (20; 1% instances), cop (15; 0% instances), flat:name (13; 0% instances), mark (11; 0% instances), nmod:tmod (8; 0% instances), advmod:emph (7; 0% instances), compound (7; 0% instances), csubj (6; 0% instances), obl:tmod (6; 0% instances), parataxis (4; 0% instances), discourse (3; 0% instances), flat:foreign (2; 0% instances), goeswith (2; 0% instances), obj (2; 0% instances), vocative (1; 0% instances), xcomp (1; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: NOUN (907; 23% instances), DET (539; 14% instances), ADP (453; 11% instances), PROPN (384; 10% instances), PUNCT (317; 8% instances), VERB (316; 8% instances), ADJ (263; 7% instances), PRON (256; 6% instances), NUM (187; 5% instances), ADV (106; 3% instances), CCONJ (106; 3% instances), X (74; 2% instances), AUX (36; 1% instances), PART (13; 0% instances), SCONJ (11; 0% instances), SYM (6; 0% instances), INTJ (3; 0% instances)