home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Maghrebi_Arabic_French-Arabizi: POS Tags: INTJ

There are 138 INTJ lemmas (3%), 234 INTJ types (3%) and 1018 INTJ tokens (5%). Out of 16 observed tags, the rank of INTJ is: 7 in number of lemmas, 7 in number of types and 9 in number of tokens.

The 10 most frequent INTJ lemmas: oh, inchaAllah, ô, wallah, paix, si_Dieu_le_veut, merci, 123, ah, InchaAllah

The 10 most frequent INTJ types: ya, nchalah, inchallah, nchallah, salam, 123, wallah, inchalah, merci, walah

The 10 most frequent ambiguous lemmas: paix (INTJ 39, NOUN 11, PROPN 1), si_Dieu_le_veut (INTJ 37, VERB 1), merci (INTJ 27, NOUN 1), félicitation (INTJ 11, ADJ 1, NOUN 1), 1 (NUM 16, INTJ 8), 2 (NUM 15, INTJ 8, PRON 1), 3 (NUM 9, INTJ 8, DET 1), salut (INTJ 7, NOUN 3), non (INTJ 7, PART 4, ADV 2), dommage (INTJ 4, NOUN 4, ADV 1)

The 10 most frequent ambiguous types: ya (INTJ 296, CCONJ 2, SCONJ 1, VERB 1), salam (INTJ 28, NOUN 2, PROPN 1), 1 (NUM 17, INTJ 9, ADJ 1, DET 1, NOUN 1), 2 (NUM 13, INTJ 9, ADP 3, ADJ 1, PRON 1), 3 (INTJ 9, NUM 9, DET 3), ha (INTJ 7, PRON 1), y (PRON 12, INTJ 7, VERB 1), in (INTJ 6, SCONJ 3), a (ADP 46, DET 41, VERB 29, AUX 17, INTJ 5, PROPN 1, X 1), mabrouk (INTJ 4, NOUN 1)

Morphology

The form / lemma ratio of INTJ is 1.695652 (the average of all parts of speech is 1.474223).

The 1st highest number of forms (28) was observed with the lemma “inchaAllah”: anchalah, anchallah, anchlah, enchala, enchalah, in, incha, incha’Allah, inchaalah, inchalah, inchallah, inchallahe, inchalllah, inshallah, n’challah, nachallah, ncahalah, nchaalh, nchalah, nchalahe, nchaleh, nchallah, nchallh, nchlah, nchlh, nchllh, nshalah, nshallah.

The 2nd highest number of forms (12) was observed with the lemma “oh”: a, aw, iooo, y, y’a, ya, ya3ni, yaaaaaa, yah, yakh, yaw, ye.

The 3rd highest number of forms (10) was observed with the lemma “ah”: a, ah, awah, ih, ya, yak, yakh, yekhi, yewwwwwwwwwwwwwww, yék.

INTJ occurs with 1 features: Typo (21; 2% instances)

INTJ occurs with 1 feature-value pairs: Typo=Yes

INTJ occurs with 2 feature combinations. The most frequent feature combination is _ (997 tokens). Examples: ya, nchalah, inchallah, nchallah, salam, 123, wallah, inchalah, merci, walah

Relations

INTJ nodes are attached to their parents using 10 different relations: discourse (969; 95% instances), fixed (11; 1% instances), parataxis (10; 1% instances), root (7; 1% instances), vocative (6; 1% instances), conj (5; 0% instances), dep (4; 0% instances), obj (3; 0% instances), flat (2; 0% instances), nsubj (1; 0% instances)

Parents of INTJ nodes belong to 11 different parts of speech: VERB (404; 40% instances), PROPN (267; 26% instances), NOUN (229; 22% instances), PRON (48; 5% instances), ADJ (32; 3% instances), INTJ (19; 2% instances), (7; 1% instances), DET (4; 0% instances), NUM (4; 0% instances), ADV (2; 0% instances), PART (2; 0% instances)

905 (89%) INTJ nodes are leaves.

84 (8%) INTJ nodes have one child.

24 (2%) INTJ nodes have two children.

5 (0%) INTJ nodes have three or more children.

The highest child degree of a INTJ node is 4.

Children of INTJ nodes are attached using 15 different relations: cc (38; 25% instances), obl (29; 19% instances), goeswith (22; 15% instances), det (13; 9% instances), fixed (11; 7% instances), conj (7; 5% instances), vocative (7; 5% instances), amod (6; 4% instances), ccomp (4; 3% instances), punct (4; 3% instances), parataxis (3; 2% instances), flat (2; 1% instances), nmod (2; 1% instances), advmod (1; 1% instances), discourse (1; 1% instances)

Children of INTJ nodes belong to 11 different parts of speech: CCONJ (38; 25% instances), PRON (24; 16% instances), X (22; 15% instances), INTJ (19; 13% instances), DET (13; 9% instances), NOUN (11; 7% instances), ADJ (6; 4% instances), PROPN (6; 4% instances), VERB (6; 4% instances), PUNCT (4; 3% instances), ADV (1; 1% instances)