need and competition in word formation and where to find...

1
Need and competition in word formation and where to find data to study them Anke Lüdeling, Humboldt-University Berlin & Marco Baroni, University of Bologna & Stefan Evert, University of Osnabrück Word formation Morphological theory is concerned not only with the structure of existing complex words but also with morphological productivity (Baayen 2001, Bauer 2001, Nishimoto 2004, Lüdeling/Evert 2005, Meibauer/Guttropf/Scherer 2004 and many others). There are two aspects of morphological productivity the ‘availability’ or ‘qualitative productivity’: Which morphological processes are available in a language? the ‘profitability’ or ‘quantitative potential’: How many complex words have been produced by a given process P? How likely is it that P will produce more words in the future? How does the potential of a rule change over time? Studies of the quantitative potential use frequency data from corpora, mostly focussing on the frequency distribution of the words formed by a given process, and especially on low-frequency data. Need – extralinguistic “Die Möglichkeit zur Bildung von Zuss. aus zwei Substantiven ist unbegrenzt. Ob solche aber wirklich gebildet werden, hängt natürlich vom Bedürfnis ab.” (Paul 1920, 15) “The possibility to form noun-noun compounds is unlimited. Whether they are actually formed, however, depends on the need.” “Words are only formed as and when there is a need for them [...]” (Bauer 2001, 143). Competition – linguistic The need can often be satisfied in several ways. All morphological and syntactic ways of expressing the need compete with each other: ‘Y-ing too much’ – {Y-besessenheit, Y-hysterie, Y-itis, Y-sucht, Y- wahn, Y-zwang} Ich spreche bewusst von Arbeits-Sucht und nicht von Workaholism. In dem Ausspruch: "Ich bin ein Workaholic" fehlt in der Regel die Einsicht in die Krankheit. “I consciously speak of work-addiction and not about workaholism. Someone who says ‘I‘m a workaholic’ typically does not understand that he is ill” […] im Plädoyer für den Müßiggang können die LeserInnen im Diskussionsforum "Arbeit und Leben" Stellung beziehen und persönliche Alternativen zur Arbeitsbesessenheit aufzeigen. “The readers of the discussion forum ‘Work and Life’ can add their comments on the ‘Plea for Idleness’ and discuss personal alternatives to work obsession.” Deswegen fördern wir als Arbeitsamt in Zukunft nur noch folgende Kurse: […] fortgeschrittenes Faulenzen - psychologisches Training zum Abbau des verinnerlichten Arbeitszwanges. “For this reason the employment centre will in the future only support the following courses: […] advanced idling – psychological training in the reduction of the internalized work obsession” Arbeitssucht, Spielsucht, Kaufsucht, Telefonitis die Zahl der bekannten Abhängigkeiten wächst ständig. “Work addiction, gambling addiction, shopping addiction, telephonitis – the number of known addictions is growing constantly.” Aber Vorsicht: hier lauern Telefonsucht, Fernsehsucht und Lesesucht. “But be careful: telephone addiction, TV addiction and reading addiction are waiting.” In einer anderen Sequenz wird der Telefonwahn in Szene gesetzt und karikiert. “In another sequence telephone mania is staged and caricatured.” All data from a German Web corpus collected by the WaCky initiative, see http://wacky.sslmit.unibo.it/ References Baayen, R. H. (2001) Word Frequency Distributions. Kluwer, Dordrecht. Bauer, L. (2001) Morphological Productivity. Cambridge University Press, Cambridge. Lüdeling, A. & Evert, S. (2005) The emergence of productive non-medical -itis. Corpus evidence and qualitative analysis. In: Kepser, S. & Reis, M. (eds) Linguistic Evidence. Empirical, Theoretical, and Computational Perspectives. Mouton de Gruyter, Berlin, 351-370. Meibauer J.; Guttropf, A.; Scherer, C. (2004) Dynamic aspects of German -er-nominals: a probe into the interrelation of language change and language acquisition. In: Linguistics 42, 155-193. Nishimoto, E. (2004) A corpus-based delimitation of new words: cross-segment comparison and morphological productivity. PhD Thesis. City University of New York. Paul, H. (1920) Deutsche Grammatik. Teil V: Wortbildungslehre. Max Niemeyer Verlag, Tübingen (Unveränderter Nachdruck der ersten Auflage, 1968) Corpus data reflect both need and competition. Measures of productivity computed from corpus data: cannot be interpreted as purely linguistic quantities are based on the misconception that it is possible to describe a single word-formation process without taking its competitors into account Our current approach Control for influence of the need by studying a (nearly) exhaustive set of (nearly) synonymous affixes that can be used to express a certain need. Study relative productivity of competing affixes. Analysis of qualitative restrictions and contexts of usage for each of the competing processes helps to predict differences in relative productivity under certain conditions (such as stylistic factors or precise semantics of the need). , Liftboys , Berühmtheiten , über <Telefonitis> und Kinder reicher Leute . Mascha Kaléko " hasse " : Die sich ausbreitende <Telefonitis> . Vor allem am Mt.Blanc , in Biel und au en Vortrag über Sexsucht ! Ob die <Telefonitis> wirklich mit der Edelsteintherapie zu be der Universität eine zunehmende " <Telefonitis> " zu beobachten . Sicher hat auch nieman keiten wie Raserei , Alkohol oder <Telefonitis> gibt es noch richtigen keine Vorstrafen en diese Szene bekannt vorkommt : <Telefonitis> ist ein Phänomen , das in den meisten Fa Telefon Telefonitis -itis Telefonwahn -wahn Telefonsucht -sucht Telefonzwang -zwang “too much” too much

Upload: duongtu

Post on 07-Jun-2018

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Need and competition in word formation and where to find ...clic.cimec.unitn.it/marco/publications/LingEvi06LuedelingBaroni... · Need and competition in word formation and where

Need and competition in word formation and where to find data to study them

Anke Lüdeling, Humboldt-University Berlin & Marco Baroni, University of Bologna & Stefan Evert, University of Osnabrück

Word formationMorphological theory is concerned not only with thestructure of existing complex words but also withmorphological productivity (Baayen 2001, Bauer 2001,Nishimoto 2004, Lüdeling/Evert 2005,Meibauer/Guttropf/Scherer 2004 and many others).

There are two aspects of morphological productivity

the ‘availability’ or ‘qualitative productivity’:Which morphological processes are available in alanguage?

the ‘profitability’ or ‘quantitative potential’:How many complex words have been produced bya given process P?How likely is it that P will produce more words inthe future?How does the potential of a rule change overtime?

Studies of the quantitative potential use frequencydata from corpora, mostly focussing on thefrequency distribution of the words formed by agiven process, and especially on low-frequency data.

Need – extralinguistic“Die Möglichkeit zur Bildung von Zuss. aus zweiSubstantiven ist unbegrenzt. Ob solche aber wirklichgebildet werden, hängt natürlich vom Bedürfnis ab.”(Paul 1920, 15)“The possibility to form noun-noun compounds isunlimited. Whether they are actually formed,however, depends on the need.”

“Words are only formed as and when there is a needfor them [...]” (Bauer 2001, 143).

Competition – linguisticThe need can often be satisfied in several ways. Allmorphological and syntactic ways of expressing theneed compete with each other: ‘Y-ing too much’ –{Y-besessenheit, Y-hysterie, Y-itis, Y-sucht, Y-wahn, Y-zwang}

Ich spreche bewusst von Arbeits-Sucht und nicht vonWorkaholism. In dem Ausspruch: "Ich bin einWorkaholic" fehlt in der Regel die Einsicht in dieKrankheit.“I consciously speak of work-addiction and not aboutworkaholism. Someone who says ‘I‘m a workaholic’typically does not understand that he is ill”

[…] im Plädoyer für den Müßiggang können dieLeserInnen im Diskussionsforum "Arbeit und Leben"Stellung beziehen und persönliche Alternativen zurArbeitsbesessenheit aufzeigen.“The readers of the discussion forum ‘Work and Life’can add their comments on the ‘Plea for Idleness’ anddiscuss personal alternatives to work obsession.”

Deswegen fördern wir als Arbeitsamt in Zukunft nurnoch folgende Kurse: […] fortgeschrittenes Faulenzen- psychologisches Training zum Abbau desverinnerlichten Arbeitszwanges.“For this reason the employment centre will in thefuture only support the following courses: […]advanced idling – psychological training in thereduction of the internalized work obsession”

Arbeitssucht, Spielsucht, Kaufsucht, Telefonitis –die Zahl der bekannten Abhängigkeiten wächstständig.“Work addiction, gambling addiction, shoppingaddiction, telephonitis – the number of knownaddictions is growing constantly.”

Aber Vorsicht: hier lauern Telefonsucht,Fernsehsucht und Lesesucht.“But be careful: telephone addiction, TV addiction andreading addiction are waiting.”

In einer anderen Sequenz wird der Telefonwahn inSzene gesetzt und karikiert.“In another sequence telephone mania is staged andcaricatured.”

All data from a German Web corpus collected by theWaCky initiative, see http://wacky.sslmit.unibo.it/

References

Baayen, R. H. (2001) Word Frequency Distributions. Kluwer, Dordrecht.

Bauer, L. (2001) Morphological Productivity. Cambridge University Press, Cambridge.

Lüdeling, A. & Evert, S. (2005) The emergence of productive non-medical -itis. Corpus evidence andqualitative analysis. In: Kepser, S. & Reis, M. (eds) Linguistic Evidence. Empirical, Theoretical, andComputational Perspectives. Mouton de Gruyter, Berlin, 351-370.

Meibauer J.; Guttropf, A.; Scherer, C. (2004) Dynamic aspects of German -er-nominals: a probe into theinterrelation of language change and language acquisition. In: Linguistics 42, 155-193.

Nishimoto, E. (2004) A corpus-based delimitation of new words: cross-segment comparison andmorphological productivity. PhD Thesis. City University of New York.

Paul, H. (1920) Deutsche Grammatik. Teil V: Wortbildungslehre. Max Niemeyer Verlag, Tübingen(Unveränderter Nachdruck der ersten Auflage, 1968)

Corpus data reflect both need and competition.

Measures of productivity computed from corpus data: cannot be interpreted as purely linguistic quantities are based on the misconception that it is possible todescribe a single word-formation process without taking

its competitors into account

Our current approachControl for influence of the need by studying a (nearly) exhaustive set of(nearly) synonymous affixes that can be used to express a certain need.

Study relative productivity of competing affixes.

Analysis of qualitative restrictions and contexts of usage for each of thecompeting processes helps to predict differences in relative productivity undercertain conditions (such as stylistic factors or precise semantics of the need).

, Liftboys , Berühmtheiten , über <Telefonitis> und Kinder reicher Leute . Mascha Kaléko" hasse " : Die sich ausbreitende <Telefonitis> . Vor allem am Mt.Blanc , in Biel und auen Vortrag über Sexsucht ! Ob die <Telefonitis> wirklich mit der Edelsteintherapie zu beder Universität eine zunehmende " <Telefonitis> " zu beobachten . Sicher hat auch niemankeiten wie Raserei , Alkohol oder <Telefonitis> gibt es noch richtigen keine Vorstrafenen diese Szene bekannt vorkommt : <Telefonitis> ist ein Phänomen , das in den meisten Fa

Telefon Telefonitis

-itis

Telefonwahn

-wahn

Telefonsucht

-sucht

Telefonzwang

-zwang

“too much”

too much