A Computational Morphological Analysis of Marathi Negative Markers

ABSTRACT

This paper aims to present the computational morphological analysis in the negation of Marathi using ruled base approach. Morphological analysis and generation are necessary steps in any NLP application for analyzing the given word and generator for generating word of stem and its features like affixes. It provides information about word semantics and the syntactic role of the stage in a sentence. Marathi grammarian has discussed on some amount of Marathi negation. This paper is attempting to discuss all the negation marker and suffixes in morphological analysis, through primary data collected from Marathi books and secondary data from several Marathi newspapers. The classified data is analyzed into stems and affixes for equally categorizes and the inflectional process has been treated separately.

INTRODUCTION

In linguistics, morphology is the identification, analysis and description of the structure of a given language’s morpheme and other linguistic units, such as words, affixes. Morphological typology represents a method for classifying languages according to the ways by which morphemes are used in a language from the analytic that use only isolated morphemes, through the agglutinative (“stuck-together”) and fusional languages that use bound morphemes (affixes), up to the polysynthetic, which compress lots of separate morphemes into single words. While words are generally accepted as being with qualities the smallest units of syntax, it is clear that in most languages, if not all, words can be related to other words by rules. Negation pervades ontology, logic, metaphysics, mathematics, science, psychology and several other disciplines. Most of all it is a basic element of human thought and a universal feature of all languages. The addition of a negative element creates a contrast with positive statements. Negating a statement is like creating an opposite polarity in any domain. Negation interacts with all of them, creating several shades of meaning.

This paper presents the morphological analysis in Negation of Marathi, which is an official language of Maharashtra and co-official language of Goa states of Western India. It is an Indo-Aryan language spoken by about 73 million people as a pet census of India 2001, ranking 19th in the list of most spoken languages in the world and fourth among the native speakers in India.

LITERATURE REVIEW

Many Marathi grammarian like Damle (1911), Walimbe (2010), etc. have written on Negation but did not give a descriptive analysis. Pandharipande, R. (1997), In the book Marathi, she has given the markers sense of negation in sentential level without the presence of negative markers but with plenty of examples. She has not given a complete morphological paradigm of negation, but basic information. Wali, K. and Dhongde, R. V. (2009) also discussed negation and have given a limited morphological analysis as negative auxiliary. Mhaiskar, R. (2012), has discussed negation in contact situation but not fully described Marathi negation, though it is a comparative type of study. Nobody has given a complete morphological paradigm which could help for computational analysis.

Computationally, Bharati et al. (1998) suggested a paradigm based algorithm for morphological analysis for Hindi. In Hindi, the inflected forms of roots do not allow further attachment of any other suffixes. In contrast, in Marathi once the root is transformed into its inflected form it is followed by suffixes to show its agreement with the other words in the sentence. Some postpositions derive new words which themselves may undergo inflection and allow attachment of other suffixes. This makes the simple paradigm-based model proposed in this work unfit for Marathi morphological analysis (Bharati et al. 1998:5). Dixit et al. (2006) developed a morphological analyzer with the purpose of using it for spell checking. Though their analyzer successfully analyzes the words with a single suffix, its scope is restricted in the handling of only first level suffixes for simple word forms. Shambhavietal (2001) Introduced Kannada morphology analyzer and generator by using a tire. Ramanathan et. al. (2004) a lightweight stemmer for Hindi. In this research, words conflate terms by suffix removal for information retrieval. Willet.P (2006) proposed the porter stemming algorithm for electronic library and information system. Zahurul, M.D. et al. (2009) developed a lightweight stemmer for Bengali for Bengali language spell checker. Qurat-Ul-Ain Akram and etal. (2009) Assas-band, an affix exception list based Urdu stemmer. It stems the Urdu words using lexical lookup method (Assas-band). Dinesh Kumar and Prince Rana (2010) developed design and development of stemmer for Punjabi. It uses a Brute Force algorithm for stemming the Punjabi words. Vijay Sundar et al. (2010) Introduced Malayalam stemmer for information retrieval. The finite state automata method is used to stem the Malayalam words. In these work researches have given morphological analysis of their respective languages and they have accounted paradigm of negation.

NEGATION OF MARATHI

Negation is a common and important phenomenon in the world languages. Marathi has नाही nāhī and the particle न for negation. Indo-Aryan languages share the surface word-order with adverbs, negation particles appear in the preverbal position or as a particle.

The addition of a negative element creates a contrast with positive statements. Negating a statement is like creating an opposite polarity in any area. Negation interacts with all of them, creating several shades of meaning.

Negation is expressed at both sentential and basic stages. Sentential negation is expressed by a variety of negative verbs, some of which as auxiliaries.

Marathi has a tendency of AUX deletion. Marathi has singular and plural conjugations of negation auxiliaries with gender, number and person agreement. त t is a plural marker for negation. स s is a second person marker and च ʦ as a conformation maker in the present, past and future tense.

Examples: nāhīʦ suffix

मुंबईकरांना टोलमुक्त नाहीच. (Loksatta. 1st Nov 2015. Page no. 1)

mumba īkarānnā  olmukta nāhī ʦ

‘Mumbai people exempted from toll.’

Past negative Aux + े e (pl)+ च ʦ(conformation maker)

बार बंद नव्हतेच. (Loksatta. 16th Nov 2015. Page no. 1)

bār band nwhateʦ

‘Bar was not closed’

धोनिसोबत मतभेद नव्हतेच. (Maharashtra Times. 9th Nov 2015. Page no. 1)

dhonisobat matbhed nwhate ʦ

‘There was no dissension with Dhoni.’

Affixes in Negative forms of आहे āhe. In present tense negation marker is नाही nāhī, नये naye and नव्हे nwhe.

Examples:

त pl suffix

च ʦ suffix

Present negative नाही nāhī+, नव्हे nwhe + नये naye + त –t (pl).

Present negative नाही nāhī+, नव्हे nwhe + च ʦ (conformation maker)

स्मिता वेडी नाही

smitā we  ī nāhī

‘Smita is not mad’

“वर्षा’ बंगल्यावर दिवाळीचे फटाके नाहीत

warṣā banglyāwar diwāḷīʦe phaṭāke nāhīt

‘varsha’ Diwali crackers are not there in bungalow.’

 मोदी ‘पीएम’ होऊ नयेत अशी त्यांची इच्छा होती. (Sakal. 29th Oct 2015. Page no. 1)

modī ‘pīem’ hoū nayet aśī tyānʦī iʦʦhā hotī.

‘They do not want Modi to become PM.’

 महिलांनी उतेजक कपडे घालू नयेत. (Loksatta. 21st Jul 2012. Page no. 1)

mahilānī uteʣak kapaḍe ghālū nayet

‘Women should not wear enticing clothes.’

शपथ नव्हे अग्निपथ. (Maharashtra Times. 31st Oct 2014. Page no. 1)

śapath nawhe agnipath

‘This is not my owe, this is my path of difficulties/ fire’

दादरी मधील ते गोमांस नव्हे तर होते मटण. (Sakal. 9th Oct 2015. Page no. 1)

dādrī madhīl te gomans nawhe tar hote ma  aṇ

‘In the Dadri is not beef meat it was hircine meat’

Present Tense Negation Analysis:

Present tense Negation analysis Followed by Number and Person.

General rules:

नाही nāhī <stem>

नाही nāhī < sg 1st> < sg 2nd> < sg 3rd> < pl 1st> < pl 2nd> < pl 3rd >

त –t < pl 1st> < pl 2nd> < pl 3rd>

स -s < sg 2nd>

Analysis of present tense negation and conformation marker च ʦ.

च ʦ < sg 1st> < sg 2nd> < sg 3rd > < pl 1st > <pl 2nd > <pl 3rd >

तच –taʦ < pl 1st > <pl 2nd > <pl 3rd >

General rules:

नय nay <stem>

नय-े nay-e <Masc sg 1st> <Fem sg 1st> <Masc sg 2nd> <Fem sg 2nd > <Masc sg 3rd> <Fem sg 3rd> <Nut sg 3rd> <Masc pl 2nd> <Fem pl 2nd> <Masc pl 3rd> <Fem pl 3rd> <Nut pl 3rd>

नय-ेत nay-et <Masc pl 2nd> <Fem pl 2nd> <Masc pl 3rd> <Fem pl 3 rd> <Nut pl 3rd>

General rules:

नव्ह nwh <stem>

नव्ह-े nwh-e <Masc sg 1st> <Fem sg 1st> <Masc sg 2nd> <Fem sg 2nd> <Masc sg 3rd> <Fem sg 3rd> <Nut sg 3rd> <Masc pl 1st > <Fem pl 1st> <Masc pl 2nd> <Fem pl 2nd> <Masc pl 3rd> <Fem pl 3rd> <Nut pl 3rd>

नव्ह-ेच nwh-eʦ <Masc sg 1st> <Fem sg 1st> <Masc sg 2nd> <Fem sg 2nd> <Masc sg 3rd> <Fem sg 3rd> <Nut sg 3rd> <Masc pl 1 st > <Fem pl 1st> <Masc pl 2nd> <Fem pl 2nd> <Masc pl 3rd > <Fem pl 3rd> <Nut pl 3rd>

Past tense negation marker नव्हतो nawhto.

Example:

बार बंद नव्हतेच.

b ā r band nwhate ʦ

‘Bar was not closed’

धोनिसोबत मतभेद नव्हतेच

dhonisobat matbhed nwhate ʦ

‘There was no dissension with Dhoni.’

Affixes in Negative forms of नव्हता nawhtā.

Past tense Negation analysis Followed by Gender, Number and Person.

General rules:

नव्हत nawht <stem>

नव्हत-ं nawht-a <Nut sg 3rd >

नव्हत-ो nawht-o <Masc sg 1st> <Masc pl 1st> <Fem pl 1st>

नव्हत-े nawht-e <Fem sg 1st> <Masc pl 2nd > <Masc pl 3rd> <Nut sg 3 rd>

नव्हत-ी nawht-ī <Fem sg 1st> <Fem sg 2nd> <Fem sg 3rd> <Nut pl 3 rd>

नव्हत-ा nawht-ā <Masc sg 2nd> <Masc sg 3rd> <Masc pl 2nd>

नव्हत-ास nawht-ās <Masc sg 2nd>

नव्हत-ात nawht-āt <Masc pl 2nd>

नव्हत-ेत nawht-et <Masc pl 2nd>

नव्हत-ीस nawht-īs <Fem sg 2nd>

नव्हत्-यात nawht-yāt <Fem pl 2nd>

नव्हत्-या nawht-yā <Fem pl 2nd> <Fem pl 3rd>

Analysis of past negation with conformation marker च ʦ.

नव्हत-ोच nawht-oʦ <Masc sg 1st> <Masc pl 1st> <Fem pl 1st>

नव्हत-ेच nawht-eʦ <Fem sg 1st> <Masc pl 3rd > <Nut sg 3rd>

नव्हत-ीच nawht-īʦ <Fem sg 1st> <Fem sg 2nd> <Fem sg 3rd> <Nut pl 3rd>

नव्हत-ाच nawht-āʦ <Masc sg 2nd> <Masc sg 3rd ><Masc pl 2nd > <Fem pl 3rd >

नव्हत्-याच nawht-yāʦ <Fem pl 3rd >

The particle न also function as a vowel change to negate verb.

And future negation marker is नस-ेल nas-el. But in the future tense negation is नसेल nas-el as well future tense अस-ेल asel.

Eg. अस-ेल as-el > नस-ेल nas-el, etc.

कामे होत नसतील, तर तरतूद कशाला?. (Sakal. 23rd Jun 2015. Page no. 1)

kāme hot nastīl, tar tartūd kaśālā?

‘Why such provisions, if such works cannot be completed?’

आपण नसू तेव्हा हे जग तितकेच सुंदर असेल. (Maharashtra Times. 1st Aug 2012. Page no. 1)

āpan nasū tewhā he ʣag titke ʦ sundar asel.

‘Even when we do not exist, even then the world be equally beautiful’

‘This world remains equally beautiful, even after our existence’

Future tense Negation analysis Followed by Number and Person.

General rules:

नस nas <stem>

नस-ेल nas-el <sg 1st> <sg 3rd>

नसू nasū <pl 1st>

नस-शील nas-śīl <sg 2nd>

नस-ाल nas-āl <pl 2nd>

नस-तील nas-tīl <pl 3rd>

In future tense conformation negation marker च ʦ.

म्हातारपणाच्या खुणा नसतीलच (डॉ. सतीश पत्की)

mhātārpaṇāʦyā khuṇā nastīlaʦ (dā satīś patkī)

‘There are no signs of age old (Dr. Satish Patki).’

Future negative नसतील nastīl + च -ʦ (conformation maker)

General rules:

नस nas <stem>

नस-ेल-च nas-el-ʦ <sg 1st> <sg 3rd>

नसू-च nasū-ʦ <pl 1st>

नस-शील-च nas-śīl-ʦ <sg 2nd>

नस-ाल-च nas-āl-ʦ <pl 2nd>

नस-तील-च nas-tīl-ʦ <pl 3rd>

Imperatives:

The first person negative hortative नको nako ‘do not do X’ is placed after his main verb, which is in the imperative.

मी जा-ऊ नको ? / मी नको जा-ऊ / मी नको का जा-ऊ?

mī jā-ū nako? / mī nako jā-ū?/ mī nako kā jā-ū?

‘Shouldn’t I go?’ ‘Don’t you want me to go?’

आपण त्या कविता म्हण-ऊ या नको.

āpa  tyā kawitā mha  -ū yā nəko.

‘Let we don’t sing those poems.’

General rules:

नक nak <stem>

नक-ो nak-o <Masc sg 1st> <Fem sg 1st><Masc pl 1st> <Fem pl 1st > <Masc sg 2nd> <Fem sg 2nd> <Masc sg 3rd > <Fem sg 3rd> <Nut sg 3rd> <Masc pl 3rd > <Fem pl 3rd > <Nut pl 3rd>

नका nak-ā <Masc pl 2nd> <Fem pl 2nd>

नक-ोस-nak-os <Masc sg 2nd> <Fem sg 2nd> <Nut sg 2nd>

The first person negative hortative नको nako. With conformation marker च ʦ.

गुणवतेसाठी ‘ग्रेडेशन’ पद्धती नकोच. (Lokamat. 24th Oct 2015. Page no. 1)

guṇwates ā  h ī greeśan’ paddht nākoʦ

Talent for gradation method

for talent not gradation method

Gradation is not a method to assess talent

For the talent the gradation methods are not

शाळा देऊ नकाच. (Maharashtra Times. 21st Jul 2015. Page no. 1)

śāḷā deū nakā ʦ

Do not give School

General rules:

नक nak <stem>

नक-ोच nak-oʦ <Masc sg 2nd> <Fem sg 2nd> <Masc sg 3rd > <Fem sg 3rd> <Nut sg 3rd> <Masc pl 3rd > <Fem pl 3rd > <Nut pl 3 rd>

नकाच nk-aʦ <Masc pl 2nd> <Fem pl 2nd>

CONCLUSION:

From the above analysis/ discussion, it is understood or understandable that the morphological processing improves the recovery performance for Marathi Language. An important observation is that the suffixes in Marathi can also contribute to the semantics of the document and hence improves the retrieval performance.

The current morphological analysis does not handle derivational morphology. In Marathi, derivational morphology is a very productive way of forming words. Handling derivational morphology can also increase the system performance. This paper presented a highly accurate morphological analysis of negation in Marathi which is very efficiently finds the Root word of a given word and recognizes the Gender of the sentence with the inputs.

त is plural suffix, नव्हे is present tense negation, but when त attached with नव्हे than is past tense negation, past tense suffix according to GNP. च confirmation marker, but confirmation marker is not all time possible, when is conform think that time use confirmation marker.

References:

  • Walmbe, M. R. 2010. Sugam Marathi Wyakran Lekhan. Pune, Nitin prakashan.
  • Bhagwat, S. 2003. Tumache Amche Marathi wyakran. Latur, Widyabharti prakashan.
  • Ramesh V. Dhongde and Kashi Wali 2009. Marathi. Northa Amrica, John Benjiamins Publishing Company.
  • Rajeshwari V. Pandharipande 1997. Marathi. London and New York.
  • Kelkar, A. R. 1958. The Phonology and Morphology of Marathi A Thesis. Presented to the Faculty of the Graduate School of Cornell University for the Degree of Doctor of Philosophy.
  • Andrew Carstairs McCarthy. 1992. Current Morphology. London and New York, Routledge.
  • Khandekar, D. 2012. Verb Dictionary. Pune.
  • Bhatia, T. K. 1979. Negation in South Asian Languages. Urbana-Champaign, South Asian Languages Analysis Vol. 1, Dept. of Linguistics, University of Illinois. 1-12.
  • Bhatia, T. K. 1993. Punjabi: a cognitive-descriptive grammar. Descriptive Grammars, Routledge, London.
  • Bhatt, R. and Munshi, S. 2002. Negation Movement and Optional Verb Second in Kashmiri. Handout of talk given at SALA 22 at the University of Iowa.
  • Bhatt, R. M. 1999. Verb Movement and the Syntax of Kashmiri. Studies in Natural Language and Linguistic Theory 46, Kluwer, Dordrecht.
  • Cardona, G. 1965. A Gujarati Reference Grammar. Philadelphia,The University of Pennsylvania Press.
  • Dasgupta, P. 1987. Sentence Particles in Bangla. Selected Papers from SALA 7, Indiana University Linguistics Club, Bloomington, 49-75.
  • Kumar, R. 2003. The Syntax of Negation in Hindi. Doctoral dissertation. University of Illinois-Urbana, Urbana-Champaign, Illinois. Forthcoming.
  • Lahiri, U. 1998. Focus and Negative Polarity in Hindi. Natural Language Semantics. 6:1, 57-123.
  • Mahajan, A. K. 1990. LF Conditions on Negative Polarity Licensing. Lingua 80:4, 333-348.
  • Masica, C. 1991. The Indo-Aryan Languages. Cambridge Language Surveys, Cambridge University Press, Cambridge, England.
  • Prado, Z. N. and Gair, J. 1994. The position of negation in Bengali: An account of synchronic and diachronic variation,” in A. Davison and F. M. Smith, eds., Papers from the Fifteenth SALA Roundtable Conference 1993, South Asian Studies Program, University of Iowa, Iowa City, 234-250.
  • Rahman, S. 2002. Verb Movement and Negation in Bengali. Handout of a talk given at SALA 22 at the University of Iowa.
  • Ramchand, G. C. 2001. Tense and Negation in Bengali. Linguistic Structure and Language Dynamics in South Asia: Papers from the Proceedings of the SALA XVIII Roundtable, MLBD Series in Linguistics 15, Motilal Banarsidass, Delhi, 308-326.
  • Ramchand, G. C. 2003. Two Types of Negation in Bengali. Clause Structure in South Asian Languages,
  • Loksatta
  • Maharashtra Times
  • Sakal
  • Tarun Bharat
  • Lokmat

Tags:

Computational Linguistics, Grammar, Marathi, Syntax