
This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-Kenet: POS Tags: ADV

There are 1420 ADV lemmas (7% of all lemmas), 2002 ADV types (4% of all types) and 11026 ADV tokens (6% of all tokens). Out of 15 observed tags, ADV ranks 4th in number of lemmas, 4th in number of types and 5th in number of tokens.
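The difference between lemmas, types and tokens can be sketched as follows. The rows are invented for illustration and are not treebank data, `pos_counts` is a hypothetical helper, and treating a type as a distinct, case-sensitive word form is an assumption about how this page counts.

```python
# Toy (word form, lemma, UPOS) rows, invented for illustration only;
# they are NOT taken from UD_Turkish-Kenet.
SAMPLE = [
    ("daha", "daha", "ADV"),
    ("daha", "daha", "ADV"),
    ("ederek", "et", "ADV"),
    ("edip", "et", "ADV"),
    ("bir", "bir", "DET"),
    ("sonra", "sonra", "ADP"),
]


def pos_counts(rows, upos):
    """Return (lemmas, types, tokens) for one UPOS tag.

    Assumption: a "type" is a distinct word form, compared as-is.
    """
    selected = [(form, lemma) for form, lemma, tag in rows if tag == upos]
    lemmas = {lemma for _, lemma in selected}   # distinct dictionary forms
    types = {form for form, _ in selected}      # distinct surface forms
    tokens = len(selected)                      # every occurrence counts
    return len(lemmas), len(types), tokens


lemmas, types, tokens = pos_counts(SAMPLE, "ADV")
print(lemmas, types, tokens)
```

In this toy sample the four ADV tokens share three word forms and two lemmas, which is the same kind of relationship as the 11026 tokens, 2002 types and 1420 lemmas reported above.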

The 10 most frequent ADV lemmas: daha, en, sonra, bir, bile, çok, içinde, ol, ne, şimdi

The 10 most frequent ADV types: daha, en, sonra, bir, bile, çok, içinde, ne, şimdi, hiç

The 10 most frequent ambiguous lemmas: en (ADV 328, NOUN 5, ADJ 3), sonra (ADV 282, ADP 129, NOUN 24, ADJ 1), bir (DET 4522, ADJ 879, NUM 388, ADV 256, NOUN 142, VERB 16), bile (ADV 247, VERB 32, NOUN 1), çok (ADV 220, ADJ 165, NOUN 36, ADP 7, VERB 6, DET 5), ol (VERB 1083, NOUN 618, ADJ 421, ADV 216), ne (ADV 212, CCONJ 172, ADJ 155, PRON 118, VERB 60), hiç (ADV 185, NOUN 9), artık (ADV 163, ADJ 14, NOUN 6), hep (ADV 130, PRON 24, VERB 2)

The 10 most frequent ambiguous types: en (ADV 265, NOUN 1), sonra (ADV 214, ADP 129, NOUN 17), bir (DET 4162, ADJ 707, NUM 297, ADV 234), bile (ADV 245, VERB 7), çok (ADV 195, ADJ 143, ADP 7, DET 4), içinde (ADV 213, NOUN 106), ne (ADV 169, CCONJ 140, ADJ 133, PRON 72), hiç (ADV 150, NOUN 5), artık (ADV 104, ADJ 11, NOUN 2), hep (ADV 111, PRON 2)

Morphology

The form / lemma ratio of ADV is 1.409859 (the average of all parts of speech is 2.284446).
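The ratio above appears to be the number of ADV types divided by the number of ADV lemmas. That reading is an assumption, but it reproduces the reported figure from the counts given earlier on this page. The function name is hypothetical.

```python
# Counts reported on this page for ADV.
ADV_LEMMAS = 1420
ADV_TYPES = 2002


def form_lemma_ratio(types, lemmas):
    """Average number of distinct word forms per lemma (assumed definition)."""
    return types / lemmas


ratio = form_lemma_ratio(ADV_TYPES, ADV_LEMMAS)
print(f"{ratio:.6f}")
```

A value well below the all-POS average of 2.284446 suggests that most ADV lemmas occur in a single form, while converb-like forms of verb lemmas such as "et" account for the lemmas with many forms.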

The highest number of forms (16) was observed with the lemma “et”: edebilip, edemeyerek, edemeyince, ederek, ederken, edilerek, edilmedikçe, edince, edip, etken, etmeden, etmedikçe, etmişçesine, ettikçe, ettirerek, ettirirken.

The second highest number of forms (also 16) was observed with the lemma “çık”: çıkalı, çıkarak, çıkararak, çıkarken, çıkarmadan, çıkartıp, çıkartırken, çıkarılınca, çıkarıp, çıkarırken, çıkmadan, çıkmışken, çıktıkça, çıkınca, çıkıp, çıkışınca.

The third highest number of forms (15) was observed with the lemma “gör”: gördükçe, göremeyince, görerek, görmeden, görmeksizin, görmemişçesine, görmeyeli, görmeyerek, görmüşcesine, görülmeden, görülünce, görünce, görüp, görürken, görüyormuşçasına.

ADV occurs with 2 features: Degree (774; 7% instances), PronType (438; 4% instances)

ADV occurs with 4 feature-value pairs: Degree=Cmp, Degree=Sup, PronType=Ind, PronType=Int

ADV occurs with 5 feature combinations. The most frequent feature combination is _ (9814 tokens). Examples: sonra, bir, bile, çok, içinde, şimdi, hiç, artık, pek, hemen
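As a quick consistency check on the feature counts: if each ADV token carries at most one of the two features (which the five feature combinations suggest but the page does not state), the feature counts plus the featureless tokens should add up to the 11026 ADV tokens. The names below are hypothetical.

```python
# Counts reported on this page for ADV.
ADV_TOKENS = 11026
NO_FEATURES = 9814                              # tokens with the combination "_"
WITH_FEATURE = {"Degree": 774, "PronType": 438}


def share(count, total):
    """Percentage of total, rounded to a whole number as on this page."""
    return round(100 * count / total)


# Assumption: Degree and PronType never co-occur on the same ADV token.
accounted_for = NO_FEATURES + sum(WITH_FEATURE.values())
print(accounted_for == ADV_TOKENS)

for feature, count in WITH_FEATURE.items():
    print(feature, f"{share(count, ADV_TOKENS)}%")
```

The totals match, so the data are consistent with the two features being mutually exclusive on ADV tokens.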

Relations

ADV nodes are attached to their parents using 25 different relations: advmod (6569; 60% instances), advcl (2782; 25% instances), amod (383; 3% instances), obl (382; 3% instances), compound (302; 3% instances), case (170; 2% instances), root (89; 1% instances), conj (87; 1% instances), discourse (45; 0% instances), obj (44; 0% instances), acl (32; 0% instances), nmod (32; 0% instances), cc (24; 0% instances), parataxis (19; 0% instances), iobj (13; 0% instances), ccomp (12; 0% instances), nsubj (12; 0% instances), fixed (10; 0% instances), mark (6; 0% instances), csubj (4; 0% instances), xcomp (3; 0% instances), appos (2; 0% instances), list (2; 0% instances), clf (1; 0% instances), dislocated (1; 0% instances)

Parents of ADV nodes belong to 13 different parts of speech: VERB (6221; 56% instances), NOUN (2116; 19% instances), ADJ (1730; 16% instances), ADV (745; 7% instances), no parent, i.e. the ADV node is the root (89; 1% instances), PRON (52; 0% instances), NUM (22; 0% instances), DET (17; 0% instances), ADP (15; 0% instances), PROPN (12; 0% instances), AUX (3; 0% instances), X (3; 0% instances), INTJ (1; 0% instances)

6711 (61%) ADV nodes are leaves.

2959 (27%) ADV nodes have one child.

982 (9%) ADV nodes have two children.

374 (3%) ADV nodes have three or more children.

The highest child degree of an ADV node is 8.
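The child-degree figures above partition the ADV tokens, so they can be cross-checked against the token count. A minimal sketch, with hypothetical names, that recomputes the rounded percentages:

```python
# Counts reported on this page for ADV nodes, grouped by number of children.
ADV_TOKENS = 11026
DEGREE_BUCKETS = {
    "leaf": 6711,
    "one child": 2959,
    "two children": 982,
    "three or more children": 374,
}


def percentages(buckets):
    """Map each bucket to its share of the total, rounded to a whole percent."""
    total = sum(buckets.values())
    return {name: round(100 * count / total) for name, count in buckets.items()}


shares = percentages(DEGREE_BUCKETS)
print(sum(DEGREE_BUCKETS.values()) == ADV_TOKENS)
for name, pct in shares.items():
    print(f"{name}: {pct}%")
```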

Children of ADV nodes are attached using 29 different relations: obl (1174; 19% instances), obj (1115; 18% instances), advmod (720; 12% instances), nmod (638; 10% instances), punct (554; 9% instances), compound (544; 9% instances), nsubj (438; 7% instances), advcl (127; 2% instances), case (123; 2% instances), amod (101; 2% instances), ccomp (98; 2% instances), conj (91; 1% instances), det (72; 1% instances), cc (66; 1% instances), iobj (52; 1% instances), nummod (50; 1% instances), xcomp (44; 1% instances), fixed (37; 1% instances), mark (32; 1% instances), aux (27; 0% instances), acl (19; 0% instances), parataxis (15; 0% instances), csubj (13; 0% instances), discourse (8; 0% instances), vocative (3; 0% instances), dislocated (2; 0% instances), flat (2; 0% instances), clf (1; 0% instances), dep (1; 0% instances)

Children of ADV nodes belong to 15 different parts of speech: NOUN (3638; 59% instances), ADV (745; 12% instances), PUNCT (554; 9% instances), ADJ (298; 5% instances), CCONJ (274; 4% instances), PRON (197; 3% instances), VERB (102; 2% instances), DET (94; 2% instances), PROPN (77; 1% instances), ADP (74; 1% instances), NUM (53; 1% instances), AUX (29; 0% instances), SCONJ (14; 0% instances), INTJ (11; 0% instances), X (7; 0% instances)
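The two breakdowns of children (by relation and by part of speech) describe the same set of nodes, so their totals should agree. The sketch below, with hypothetical variable names, sums both lists as given on this page and derives the mean number of children per ADV node.

```python
# Children of ADV nodes by relation, as reported on this page.
CHILD_RELATIONS = {
    "obl": 1174, "obj": 1115, "advmod": 720, "nmod": 638, "punct": 554,
    "compound": 544, "nsubj": 438, "advcl": 127, "case": 123, "amod": 101,
    "ccomp": 98, "conj": 91, "det": 72, "cc": 66, "iobj": 52, "nummod": 50,
    "xcomp": 44, "fixed": 37, "mark": 32, "aux": 27, "acl": 19,
    "parataxis": 15, "csubj": 13, "discourse": 8, "vocative": 3,
    "dislocated": 2, "flat": 2, "clf": 1, "dep": 1,
}

# Children of ADV nodes by part of speech, as reported on this page.
CHILD_POS = {
    "NOUN": 3638, "ADV": 745, "PUNCT": 554, "ADJ": 298, "CCONJ": 274,
    "PRON": 197, "VERB": 102, "DET": 94, "PROPN": 77, "ADP": 74,
    "NUM": 53, "AUX": 29, "SCONJ": 14, "INTJ": 11, "X": 7,
}

ADV_TOKENS = 11026

total_by_relation = sum(CHILD_RELATIONS.values())
total_by_pos = sum(CHILD_POS.values())
mean_children = total_by_pos / ADV_TOKENS  # average children per ADV node

print(total_by_relation, total_by_pos)
print(f"{mean_children:.2f}")
```

The 745 ADV children also match the 745 ADV parents listed under the parents of ADV nodes, since both count the same ADV-to-ADV attachments.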