
This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-Kenet: POS Tags: ADJ

There are 4977 ADJ lemmas (24%), 6468 ADJ types (14%) and 22856 ADJ tokens (13%). Out of 15 observed tags, the rank of ADJ is: 2 in number of lemmas, 3 in number of types and 3 in number of tokens.

The 10 most frequent ADJ lemmas: bir, ol, bütün, var, büyük, yeni, et, uzun, iyi, çok

The 10 most frequent ADJ types: bir, bütün, var, büyük, yeni, olan, uzun, çok, ne, güzel

The 10 most frequent ambiguous lemmas: bir (DET 4522, ADJ 879, NUM 388, ADV 256, NOUN 142, VERB 16), ol (VERB 1083, NOUN 618, ADJ 421, ADV 216), bütün (ADJ 360, NOUN 5, PROPN 4, VERB 1), var (VERB 443, ADJ 287, NOUN 25, ADV 12), büyük (ADJ 231, NOUN 19, VERB 5, ADV 1), yeni (ADJ 199, NOUN 5, VERB 1), et (VERB 909, NOUN 299, ADJ 185, ADV 105), uzun (ADJ 173, VERB 3, NOUN 1), iyi (ADJ 169, ADV 25, NOUN 19, VERB 6), çok (ADV 220, ADJ 165, NOUN 36, ADP 7, VERB 6, DET 5)

The 10 most frequent ambiguous types: bir (DET 4162, ADJ 707, NUM 297, ADV 234), var (ADJ 271, VERB 19), çok (ADV 195, ADJ 143, ADP 7, DET 4), ne (ADV 169, CCONJ 140, ADJ 133, PRON 72), son (ADJ 86, NOUN 6), başka (ADJ 100, ADP 50), ilk (ADJ 104, ADV 35), böyle (ADV 82, ADJ 66), türlü (ADJ 86, NOUN 13), biraz (ADV 110, ADJ 70)

Morphology

The form / lemma ratio of ADJ is 1.299578 (the average of all parts of speech is 2.284446).
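The reported ratio appears to match the number of distinct ADJ word forms (types) divided by the number of distinct ADJ lemmas. A minimal sketch of that arithmetic follows, using the counts given on this page. The variable names are illustrative, and the interpretation of "forms" as types is an assumption.

```python
# Counts copied from the ADJ statistics on this page.
adj_lemmas = 4977   # distinct ADJ lemmas
adj_types = 6468    # distinct ADJ word forms (types)

# Assumption: "form / lemma ratio" means distinct forms per distinct lemma.
form_lemma_ratio = adj_types / adj_lemmas

print(f"{form_lemma_ratio:.6f}")
```

A ratio this close to 1 suggests that most ADJ lemmas occur in a single form, with the larger paradigms coming from verbal lemmas such as the ones listed below.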

The 1st highest number of forms (32) was observed with the lemma “et”: edebildiğim, edecek, edeceği, edeceğim, edemedikleri, edemeyeceği, edemez, eden, eder, edici, edildiği, edilebilen, edilecek, edilemeyen, edilemez, edilen, edilir, edilmeyen, edilmiş, etli, etmeyen, etmez, etmiş, etsel, etsiz, ettikleri, ettireceğim, ettiren, ettiği, ettiğim, ettiğimiz, ettiğin.

The 2nd highest number of forms (26) was observed with the lemma “ol”: olabilir, olacak, olacağı, olamadığı, olamayacak, olamaz, olan, olduk, oldukları, olduğu, olduğum, olduğumuz, olduğun, olduğunuz, olmadığı, olmalı, olmamış, olması, olmayacak, olmayan, olmaz, olmuş, olunan, olunmayan, olunur, olur.

The 3rd highest number of forms (26) was observed with the lemma “çık”: çıkabilecek, çıkacak, çıkacağım, çıkamayacak, çıkan, çıkar, çıkaracak, çıkaran, çıkardığı, çıkardığım, çıkarmamalı, çıkartacak, çıkartacağım, çıkarılamaz, çıkarılan, çıkarılır, çıkmadığınız, çıkmamış, çıkmaz, çıkmış, çıktık, çıktıkları, çıktığı, çıktığım, çıktığımız, çıkılmaz.

ADJ occurs with 1 feature: NumType (183; 1% instances)

ADJ occurs with 3 feature-value pairs: NumType=Card, NumType=Dist, NumType=Ord

ADJ occurs with 4 feature combinations. The most frequent feature combination is _ (no features; 22673 tokens). Examples: bir, bütün, var, büyük, yeni, olan, uzun, çok, ne, güzel

Relations

ADJ nodes are attached to their parents using 26 different relations: amod (12194; 53% instances), acl (3431; 15% instances), advmod (2016; 9% instances), compound (1184; 5% instances), conj (1066; 5% instances), root (906; 4% instances), nsubj (358; 2% instances), advcl (321; 1% instances), parataxis (321; 1% instances), nmod (233; 1% instances), obj (225; 1% instances), ccomp (150; 1% instances), obl (134; 1% instances), xcomp (117; 1% instances), csubj (61; 0% instances), list (59; 0% instances), discourse (29; 0% instances), flat (22; 0% instances), fixed (9; 0% instances), vocative (7; 0% instances), appos (3; 0% instances), clf (3; 0% instances), dep (3; 0% instances), iobj (2; 0% instances), dislocated (1; 0% instances), orphan (1; 0% instances)
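Counts like these can be approximated by tallying the DEPREL column of every token whose UPOS is ADJ in a CoNLL-U file. The sketch below is not the script that generated this page. The function name `adj_relation_counts` and the four-token sample sentence are hypothetical and only illustrate the idea.

```python
from collections import Counter

def adj_relation_counts(conllu_text):
    """Count the incoming dependency relation of every ADJ token."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        if cols[3] == "ADJ":        # column 4 is UPOS
            counts[cols[7]] += 1    # column 8 is DEPREL
    return counts

# Hypothetical sample sentence (not taken from UD_Turkish-Kenet).
rows = [
    ("1", "Büyük", "büyük", "ADJ", "_", "_", "2", "amod", "_", "_"),
    ("2", "ev", "ev", "NOUN", "_", "_", "3", "nsubj", "_", "_"),
    ("3", "güzel", "güzel", "ADJ", "_", "_", "0", "root", "_", "_"),
    ("4", ".", ".", "PUNCT", "_", "_", "3", "punct", "_", "_"),
]
sample = "\n".join("\t".join(r) for r in rows)

relation_counts = adj_relation_counts(sample)
total = sum(relation_counts.values())
for relation, n in relation_counts.most_common():
    print(f"{relation} ({n}; {round(100 * n / total)}% instances)")
```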

Parents of ADJ nodes belong to 13 different parts of speech: NOUN (15328; 67% instances), VERB (3111; 14% instances), ADJ (2897; 13% instances), no parent, i.e. root (906; 4% instances), ADV (298; 1% instances), PROPN (152; 1% instances), PRON (65; 0% instances), DET (41; 0% instances), NUM (24; 0% instances), X (19; 0% instances), ADP (11; 0% instances), SCONJ (3; 0% instances), CCONJ (1; 0% instances)

12435 (54%) ADJ nodes are leaves.

6319 (28%) ADJ nodes have one child.

2397 (10%) ADJ nodes have two children.

1705 (7%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 8.
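The child-degree figures above can be reproduced in principle by counting, for each ADJ token, how many tokens in the same sentence name it as their HEAD. The following is a minimal sketch under that assumption. The function name `adj_child_degrees` and the sample sentence are hypothetical, and the code is not the generator used for this page.

```python
from collections import Counter

def adj_child_degrees(conllu_text):
    """Return a histogram {number_of_children: number_of_ADJ_nodes}."""
    histogram = Counter()
    # Sentences in CoNLL-U are separated by blank lines.
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep only regular tokens (integer IDs, ten columns).
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        # Column 7 is HEAD: count how often each ID is used as a head.
        children_per_head = Counter(r[6] for r in rows)
        for r in rows:
            if r[3] == "ADJ":
                histogram[children_per_head[r[0]]] += 1
    return histogram

# Hypothetical sample sentence (not taken from UD_Turkish-Kenet).
rows = [
    ("1", "Büyük", "büyük", "ADJ", "_", "_", "2", "amod", "_", "_"),
    ("2", "ev", "ev", "NOUN", "_", "_", "3", "nsubj", "_", "_"),
    ("3", "güzel", "güzel", "ADJ", "_", "_", "0", "root", "_", "_"),
    ("4", ".", ".", "PUNCT", "_", "_", "3", "punct", "_", "_"),
]
sample = "\n".join("\t".join(r) for r in rows)

degrees = adj_child_degrees(sample)
leaves = degrees[0]                 # ADJ nodes with no children
max_degree = max(degrees)           # highest child degree observed
print(dict(degrees), leaves, max_degree)
```

In the sample, the attributive adjective is a leaf and the predicative adjective has two children. On this page, the four reported buckets (12435, 6319, 2397 and 1705) add up to the 22856 ADJ tokens.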

Children of ADJ nodes are attached using 30 different relations: obl (2636; 15% instances), punct (2506; 15% instances), advmod (1838; 11% instances), nsubj (1437; 8% instances), obj (1264; 7% instances), compound (1212; 7% instances), conj (1204; 7% instances), nmod (984; 6% instances), amod (862; 5% instances), cc (531; 3% instances), det (427; 3% instances), advcl (397; 2% instances), case (282; 2% instances), nummod (242; 1% instances), parataxis (213; 1% instances), ccomp (188; 1% instances), aux (187; 1% instances), xcomp (124; 1% instances), acl (113; 1% instances), mark (84; 0% instances), list (65; 0% instances), flat (57; 0% instances), csubj (51; 0% instances), discourse (45; 0% instances), iobj (44; 0% instances), vocative (21; 0% instances), fixed (16; 0% instances), dep (3; 0% instances), appos (2; 0% instances), clf (2; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: NOUN (6846; 40% instances), ADJ (2897; 17% instances), PUNCT (2506; 15% instances), ADV (1730; 10% instances), CCONJ (626; 4% instances), PRON (471; 3% instances), VERB (457; 3% instances), DET (446; 3% instances), ADP (319; 2% instances), NUM (250; 1% instances), PROPN (219; 1% instances), AUX (192; 1% instances), SCONJ (39; 0% instances), INTJ (26; 0% instances), X (13; 0% instances)