
This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-CINTIL: POS Tags: ADJ

There are 2814 ADJ lemmas (10%), 4566 ADJ types (12%) and 25069 ADJ tokens (5%). Out of 15 observed tags, the rank of ADJ is: 4 in number of lemmas, 4 in number of types and 7 in number of tokens.

The 10 most frequent ADJ lemmas: outro, novo, grande, português, último, bom, mesmo, primeira, político, próprio

The 10 most frequent ADJ types: grande, novo, outro, nova, outros, primeira, primeiro, outra, grandes, outras

The 10 most frequent ambiguous lemmas: outro (ADJ 1022, PRON 39, DET 1), novo (ADJ 974, PROPN 18), grande (ADJ 708, PROPN 50, NOUN 2), português (ADJ 390, NOUN 110, PROPN 17), último (ADJ 386, PROPN 2, NOUN 1), bom (ADJ 360, ADV 5, PROPN 4, NOUN 2), mesmo (ADJ 355, ADV 275, DET 1), primeira (ADJ 278, PROPN 2), político (ADJ 256, NOUN 30, PROPN 1), primeiro (ADJ 240, ADV 43, NOUN 11)

The 10 most frequent ambiguous types: outro (ADJ 266, PRON 37, DET 1), outros (ADJ 253, PRON 36, DET 1), primeiro (ADJ 239, ADV 20), outra (ADJ 203, PRON 14), grandes (ADJ 218, NOUN 1), outras (ADJ 188, PRON 20), mesmo (ADV 201, ADJ 174, DET 1), segunda (ADJ 140, NOUN 1), mesma (ADJ 144, DET 1), português (ADJ 118, NOUN 25)

Morphology

The form / lemma ratio of ADJ is 1.622601 (the average of all parts of speech is 1.389383).
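The reported ratio is consistent with dividing the number of ADJ types (4566) by the number of ADJ lemmas (2814), both given above. The following is a minimal sketch of that arithmetic; the function name `form_lemma_ratio` is hypothetical, and the assumption that the page computes the ratio exactly this way is inferred from the numbers, not stated by the source.

```python
def form_lemma_ratio(num_types: int, num_lemmas: int) -> float:
    """Average number of distinct word forms per lemma (assumed definition)."""
    if num_lemmas <= 0:
        raise ValueError("num_lemmas must be positive")
    return num_types / num_lemmas


# Counts taken from this page: 4566 ADJ types, 2814 ADJ lemmas.
adj_ratio = form_lemma_ratio(4566, 2814)
print(f"{adj_ratio:.6f}")
```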

The 1st highest number of forms (8) was observed with the lemma “baixo”: baixa, baixas, baixo, baixos, baixíssimo, inferior, inferiores, ínfimo.

The 2nd highest number of forms (7) was observed with the lemma “alto”: alta, altas, alto, altos, altíssima, superior, superiores.

The 3rd highest number of forms (7) was observed with the lemma “rico”: rica, ricas, rico, ricos, riquíssima, riquíssimo, riquíssimos.

ADJ occurs with 4 features: Number (25058; 100% instances), Gender (21937; 88% instances), NumType (1162; 5% instances), Degree (124; 0% instances)

ADJ occurs with 6 feature-value pairs: Degree=Abs, Gender=Fem, Gender=Masc, NumType=Ord, Number=Plur, Number=Sing

ADJ occurs with 17 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing (7527 tokens). Examples: novo, outro, mesmo, grande, último, português, próximo, próprio, único, bom

Relations

ADJ nodes are attached to their parents using 15 different relations: amod (20505; 82% instances), root (2324; 9% instances), dep (654; 3% instances), nsubj (338; 1% instances), fixed (223; 1% instances), obl (214; 1% instances), conj (207; 1% instances), ccomp (138; 1% instances), parataxis (131; 1% instances), advcl (113; 0% instances), obj (110; 0% instances), flat (93; 0% instances), csubj (14; 0% instances), xcomp (3; 0% instances), advmod (2; 0% instances)
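The percentages in these lists appear to be each count divided by the total number of ADJ tokens (25069), rounded to a whole number, which is why rare relations show as 0%. A short sketch of that calculation follows; the helper `percent_share` is hypothetical, and the rounding rule is an assumption based on the figures shown.

```python
def percent_share(count: int, total: int) -> int:
    """Return count as a whole-number percentage of total."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100 * count / total)


ADJ_TOKENS = 25069  # total ADJ tokens reported on this page

amod_share = percent_share(20505, ADJ_TOKENS)   # amod attachments
advcl_share = percent_share(113, ADJ_TOKENS)    # advcl attachments, below 0.5%
print(amod_share, advcl_share)
```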

Parents of ADJ nodes belong to 14 different categories, counting the no-parent case for root nodes: NOUN (20175; 80% instances), no parent, i.e. the ADJ is the root (2324; 9% instances), VERB (1163; 5% instances), PROPN (658; 3% instances), ADJ (329; 1% instances), ADP (181; 1% instances), PRON (86; 0% instances), AUX (52; 0% instances), DET (31; 0% instances), ADV (27; 0% instances), NUM (19; 0% instances), SCONJ (16; 0% instances), CCONJ (6; 0% instances), INTJ (2; 0% instances)

19020 (76%) ADJ nodes are leaves.

2071 (8%) ADJ nodes have one child.

868 (3%) ADJ nodes have two children.

3110 (12%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 13.
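The child-degree figures above count, for each ADJ node, how many other nodes name it as their head. The sketch below shows one way such a tally could be computed; the toy sentence, the tuple layout, and the function `child_degree_buckets` are all hypothetical illustrations, not the tooling or data behind this page.

```python
from collections import Counter

# Toy sentence (hypothetical, not from the treebank): "a casa é muito grande".
# Each token is (id, form, upos, head); head 0 marks the root.
tokens = [
    (1, "a", "DET", 2),
    (2, "casa", "NOUN", 5),
    (3, "é", "AUX", 5),
    (4, "muito", "ADV", 5),
    (5, "grande", "ADJ", 0),
]


def child_degree_buckets(tokens, upos="ADJ"):
    """Bucket nodes with the given UPOS tag by their number of children."""
    # Number of children per node id: one count for every token pointing at it.
    children = Counter(head for _, _, _, head in tokens)
    buckets = Counter()
    for tok_id, _, tok_upos, _ in tokens:
        if tok_upos != upos:
            continue
        n = children[tok_id]
        if n == 0:
            key = "leaf"
        elif n == 1:
            key = "one"
        elif n == 2:
            key = "two"
        else:
            key = "three_or_more"
        buckets[key] += 1
    return buckets


buckets = child_degree_buckets(tokens)
print(dict(buckets))
```

In the toy sentence, "grande" is the only ADJ and has three dependents (the subject, the copula and the adverb), so it falls into the three-or-more bucket.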

Children of ADJ nodes are attached using 22 different relations: punct (3887; 22% instances), advmod (2957; 17% instances), cop (2752; 16% instances), nsubj (2251; 13% instances), obl (1572; 9% instances), det (961; 5% instances), dep (760; 4% instances), cc (740; 4% instances), case (425; 2% instances), conj (314; 2% instances), amod (183; 1% instances), parataxis (147; 1% instances), mark (142; 1% instances), advcl (114; 1% instances), csubj (105; 1% instances), flat (47; 0% instances), obj (44; 0% instances), nummod (27; 0% instances), ccomp (25; 0% instances), det:poss (13; 0% instances), fixed (6; 0% instances), cc:preconj (3; 0% instances)

Children of ADJ nodes belong to 14 different parts of speech: PUNCT (3887; 22% instances), ADV (2947; 17% instances), NOUN (2904; 17% instances), AUX (2786; 16% instances), DET (1124; 6% instances), VERB (893; 5% instances), CCONJ (738; 4% instances), PROPN (683; 4% instances), ADP (470; 3% instances), PRON (331; 2% instances), SCONJ (331; 2% instances), ADJ (329; 2% instances), NUM (46; 0% instances), INTJ (6; 0% instances)