methods and resources

28
hd d Methods and Resources

Upload: duygu-asiklar

Post on 02-Jul-2015

136 views

Category:

Education


0 download

TRANSCRIPT

Page 1: methods and resources

h d d Methods and Resources

Page 2: methods and resources

Comprehension Questionsp Q

1 Write the stages followed during the dictionary-writing process1. Write the stages followed during the dictionary-writing process.2. What is the aim of compiling the database from the corpus?3. State the function of the database entry.4. What are the benefits of a well-designed dictionary database?5. What are the goals of transfer and synthesis stages?6. Write the software types needed for the stages of the dictionary f yp f g f y

writing process.7. What does a ‘lexical profile’ means and what is the benefit of ‘lexical

profiling’?profiling ?8. Write the components that constitute the dictionary writing

system.

Page 3: methods and resources

Comprehension Questionsp Q

9. What is a ‘Style Guide’ and what is the purpose of using it?10. Which areas are covered by a Style Guide?11. Write the issues that Style Guides for teams without a DWS

should include.12 Make a comparison between the earlier and the contemporary 12. Make a comparison between the earlier and the contemporary

forms of Style Guides.13. What are the benefits of a Style Guide?14. What is a ‘template entry’?15. State the significance of using templates.16 Which stages are involved when compiling the templates?16. Which stages are involved when compiling the templates?

Page 4: methods and resources

4.1 Preliminaries

This chapter describes the role the corpus plays in a dictionary This chapter describes the role the corpus plays in a dictionary project and the environment the lexicographers work in.

4 2 Th di ti n itin p4.2 The dictionary-writing processA publishing house owning a dictionary will want you to start from the text of that dictionary, adapt it to suit the new

ifi ti d d t it di t id specifications, and update it according to corpus evidence, editing the wordlist itself as well as the entries.Lexicographers differ in what they do best: some are better at

l l d d analysis, some at translating, and some at dictionary-entry writing.

Page 5: methods and resources

4.1 Preliminaries

4.2.1 RationaleIn a contemporary publishing environment, a good dictionary

itin s st m p id s l f m k f dit s t k writing system provides a clear framework for editors to work within, and offers a lot of guidance on the content of dictionary entries.A li i th h th d t i ti l t f A preliminary pass through the data is an essential part of writing a bilingual dictionary entry: it’s impossible to supply adequate target language equivalents without knowing a great deal more about the contexts in which the headword is found deal more about the contexts in which the headword is found than can eventually be included in the actual entry.

Page 6: methods and resources

The advantages of storing the facts about the headword:The advantages of storing the facts about the headword:• The structure of the database guides the analysis process.• The completed database holds a comprehensive record of

h h d d b h s in th p show your headword behaves in the corpus.• The database allows editors to scan the material in a

systematic way.• The database speeds up the editing process.• The database is re-usable.

Page 7: methods and resources

4.2.2 Analysis: compiling the database from the corpusThe purpose of this database is to store selected facts about th d in s st m ti s th t b s nnin th m n the word in a systematic way, so that by scanning them you can quickly and efficiently get a fix on the headword and extract the information you need for the final dictionary entry.Th f t f th d t b t fl t th t f di ti The format of the database entry reflects that of a dictionary entry, but is much more detailed.The more detail the better, but avoid redundancies.It can hold a rich selection of corpus examples showing the headword in use in its various meanings and patterns.

Page 8: methods and resources

A well-designed and well-populated dictionary database A well-designed and well-populated dictionary database represents a valuable piece of intellectual property. As well as being usable as a basis for all kinds of dictionary and

f f l h l k l as an information-source for linguistic research, it is likely to be attractive to builders of computer applications.All the corpus searching, sense finding, collocate noting, and All the corpus searching, sense finding, collocate noting, and grammar coding is done in the analysis stage.The value of the database lies in the fact that it is an unbiased record of what is happening in the one single language unbiased record of what is happening in the one single language it is describing.

Page 9: methods and resources

4.2.3 Transfer: translating the databaseThe purpose of this stage is to build up a body of target l i l f h h d d i language equivalents of the headword in as many contexts as possible, so that when the entry editors come to extract the final entry they have all their options assembled for them in y y pone place.This work is best done by experienced translators with an excellent knowledge of both languages and native speakers of excellent knowledge of both languages, and native speakers of the target language.

Page 10: methods and resources

4 2 4 Synthesis: editing the entry4.2.4 Synthesis: editing the entryThe purpose of this stage is to produce the final entry, the one most appropriate for the typical user of your dictionary.This work is best done by skilled and experienced lexicographers.Each entry will be drafted by a native speaker of the SL Each entry will be drafted by a native speaker of the SL, checked through by a native speaker of the TL, then finalized by the SL speaker (for bilingual dictionaries).

Page 11: methods and resources

4.3 SoftwareTh t d b t t f ftThe processes are supported by two types of software:

• A corpus query system (CQS): a computer program that enables you to analyze the data in a corpus in various ways.y y p y

• A dictionary writing system (DWS): a program that enables lexicographers to compile and edit dictionary text, as well as facilitating project management and typesetting as well as facilitating project management and typesetting and output to printed or electronic media.

Page 12: methods and resources

4.3.1 The corpus query system (CQS)3 e co pus que y syste (CQS)4.3.1.1 Lexicographic needs and CQS functionality

A good way of evaluating a CQS is to start from the categories of information you want to include in your dictionary but how quickly and easily you can retrieve the

finformation by using your CQS is the question.4.3.2.1 The KWIC concordance

It is the basic tool of corpus lexicography.Th ft t k d t f th f ll i f t f th BNCThe software takes advantage of the following features of the BNC:

• Lemmatization• POS-tagging• Document headers

If the KWIC display doesn’t provide enough information for a given line, you can see more of the source text by clicking on the node word.

4.3.1.3 How to frame a queryThe drop-down list next to the lemma box allows you to select any of the main wordclasses

Page 13: methods and resources

The ‘context’ boxes in the lower half of the screen are used for specifying the e co te t bo es t e o e a o t e sc ee a e used o spec y g t etext to the left of right of the node word, and you can either enter a specify word or simply select a wordclass.

4.3.1.4 CQS functionalityf f fA powerful CQS has a great many functions, the majority of which will be

employed only rarely and a few will be in regular use.4.3.1.5 Lexical profiling: the ‘Word Sketch’

A lexical profile is a kind of statistical summary which reveals the salient factsA lexical profile is a kind of statistical summary which reveals the salient facts about the way a word most typically combines with other words.Lexical profiling offers a solution that maximizes the value of a large corpus while reducing the effort required by the human user.The Word Sketch is a type of lexical profile.The Word Sketch provides collocate lists for a wide range of grammatical relations and achieves this by collecting every corpus instance of the search term and then subjecting this data to a further round of processingand then subjecting this data to a further round of processing.Lexical-profiling software adds a valuable resource to the lexicographer’s repertoire.

Page 14: methods and resources

It was originally seen as a useful supplementary tool well-adapted for identifying g y pp y p y gcollocational patterns – important information for pedagogical dictionaries.Word Sketches provide a compact and revealing snapshot of a word’s bahaviour and uses.Graphic representations of word behaviour offer faster and more reliable ways ofGraphic representations of word behaviour offer faster and more reliable ways of indicating lexicographically relevant facts.

4.3.1.6 The CQS: some conclusionsThe CQS provides the link between raw corpus data and the dictionary.

ll h d d d ll b dAll the data you need to write your dictionary will be present in a good corpus, and a powerful and well-designed CQS allows you to retrieve relevant information efficiently and view it in a variety of ways.Additional functionality in the CQS can extend the scope of what dictionaries are able to do.As new functions become available, lexicographers’ search strategies evolve.A good CQS offers a number of ways of uncovering information, and one of the skills a lexicographer develops over time is to know which functions to use in askills a lexicographer develops over time is to know which functions to use in a given situation in order to get the most out of the corpus with least effort.

Page 15: methods and resources

4.3.2 The dictionary writing system (DWS) 3 e d ct o a y t g syste ( S)Lexicographers compile dictionary text onscreen, and the software that allows them to do this is generally referred to as DWS.The simplest form of text-input software is a generic XML editing tool.A commercial DWS program is designed to manage the entire process of producing a dictionary, from compiling the first entry to outputting the final product for publication in print or electronic media.A typical DWS consists of three main components:A typical DWS consists of three main components:

• A text-editing interface• A database• A set of administrative tools

4.3.2.1 The lexicographer’s interface: the editing toolThis is where dictionary text is compiled and edited.The DWS will generally offer a number of ways of viewing the data.A good DWS maximizes the lexicographer’s productivity by streamlining routine tasks and automating many of the administrative procedures that used to be done manually.

Page 16: methods and resources

4.3.2.2 The database3 e databaseText compiled and edited in the front end of the DWS is stored in the dictionary’s database.The DWS’s database component makes it possible to run complex searches over the entire text.

4.3.2.3 Administrative toolsA DWS program will usually include housekeeping tools that facilitate the management of a large dictionary projectmanagement of a large dictionary project.Systems like this ensure that only one person works on a given entry at any one time, and that individual fields in the database are lockable.

4.3.2.4 BenefitsThe DWS facilitates the entire process of creating and publishing a dictionary.A good DWS streamlines the editorial process and allows lexicographers to focus on lexicographyThe system makes it easy for senior editors to review the text as it develops, monitor its quality, and give feedback to the editorial team.

Page 17: methods and resources

As with CQS software, the trend is for the user’s interface to be accessed online, and this environment makes it possible for widely dispersed editorial teams to work efficiently on the same project.The benefits include productivity gains, a smoother transfer of text to its eventual delivery format, and opportunities for re-using expensively created dictionary y , pp g p y ytext.The various features of the DWS help to deliver higher levels of quality, accuracy, and internal consistency.

4.4 The Style GuideFor each entry component, the editorial team needs a set of guidelines.These guidelines show how the dictionary’s style policies should be applied in

d d l dindividual dictionary entries.The style guide – essentially a book of instructions for lexicographers – is the document in which all these guidelines are assembled.

4.4.1 What kinds of information does a Style Guide include?yThe Style Guide will show how each entry component should be dealth with and will cover the following areas:

Page 18: methods and resources

• Morphology• Variant forms• Grammar• Labels and their use• Definitions• Examples• Derived forms• Cross-references

4.4.2 Style Guides past and present4.4.2 Style Guides past and presentStyle Guides for teams without a DWS must include detailed instructions on issues such as:

• The correct order of the various parts of an entryThe correct font to be used for each element• The correct font to be used for each element

• The precise designation of a recurring elementData is entered in the form of plain text, and the software takes care of its eventual representation on page or screen.Th i f h d i hi h h lThe question of the order in which the entry components may appear almost ceases to be an issue because the writing system won’t allow you to enter elements in the wrong order.

Page 19: methods and resources

A good dictionary writing system will provide a list to choose from items valid for good d ct o a y t g syste p o de a st to c oose o te s a d oany field where there is a finite set of options.The correct forms are thus hardwired into the system.Contemporary Style Guides no longer concern themselves with trivial issues like

f fthe correct form of a grammar code.However, they need to include instructions for inputting data in the right field.The traditional Style Guide was a printed document, and could easily run to several hundred pagesseveral hundred pages.Nowadays, the Style Guide is an electronic document.

4.4.3 Why you need a Style GuideA clear well-structured Style Guide resolves uncertainty in cases whereA clear, well structured Style Guide resolves uncertainty in cases where straightforward rulings can be given, and provides advice in situations where lexicographers have to use their own judgment.All of this enhances editors’ confidence and improves the efficiency of the

il icompilation process.This in turn brings benefits to the dictionary user.

Page 20: methods and resources

Users gradually get to know how their dictionary works, and if it is well organized Use s g adua y get to o o t e d ct o a y o s, a d t s e o ga edand internally consistent, they should find that unsatisfactory look-ups are relatively rare.A good Style Guide will provide the information you need in the great majority of casescases. But the dynamic nature of human languages means that there will always be situations where the Style Guide can’t give a ruling.

4.5 Template entriesTemplate entries are ‘pro forma’ entries for use by the lexicographers writing either database or dictionary.

4.5.1 What they are and how they are usedA lexical set is any group of words that share a common element of meaning such as the days of the week or months of the year.A l i f k d i d f ili i i i f d hA template entry is a framework designed to facilitate writing entries for words that belong to lexical sets. Most of the dictionaries on your shelves will have been produced without the benefit of lexical set compiling or template entries.

Page 21: methods and resources

The members of a lexical set pose the same kinds of lexicographic problem and e e be s o a e ca set pose t e sa e ds o e cog ap c p ob e a dshould be handled in the same way in a dictionary.Once you’ve planned the entry for one word in the set, you can benefit from that work next time you meet another word from the same set.

fThe template is designed to hold, in an ordered way, the essential facts about any word belonging to a specific category.

4.5.1.1 Entry structure and content template Whenever you come to a headword belonging to a lexical set a good first move isWhenever you come to a headword belonging to a lexical set, a good first move is to check the relevant template entry.

4.5.2 Why templates are usefulWriting template entries is a useful form of lexicographic training for g p g p glexicographers because it streamlines the editorial process and ensures systematic and comprehensive coverage of the lexical units involved.

4.5.3 How templates are compiledE h h d f h l i l b i di d d ilEach person chooses one word from the lexical set being studied, and compiles the richest corpus-based entry they can for that word.

Page 22: methods and resources

These entries are then compared and collated in a discussion session, and the ese e t es a e t e co pa ed a d co ated a d scuss o sess o , a d t efinal version of the template entry is drawn up, with all possible relevant lexical units included in it.

• First choose some ‘sample’ words in the lexical set.Share these words out amongst the lexicography team• Share these words out amongst the lexicography team.

• Each person studies the corpus data for their particular word and outlines an entry.• Everyone completes their own entry.• The group assembles the facts into a template entry.• The resulting template can be used as a checklist entry for any word in the set.

Page 23: methods and resources

Turkish Summaryy

Bu bölümde bir bütüncenin sözlük oluşturma sürecindeki işlevi ve sözlükbilimcilerin nasıl ş şbir ortamda çalıştığı anlatılmaktadır. Sözlük oluşturma sürecinde temel prensipler, analiz, aktarım ve sentez aşamaları takip edilir. Bu süreçte veri tabanının oluşturulması sözlüğün tek dilli veya iki dilli olmasına göre farklılık gösterir. Tek dilli sözlüklerde veri tabanının analizlerle oluşturulması ve elde edilen bilginin veri tabanıyla sentezlenmesi ş g ysonucunda sözlük oluşturulurken, iki dilli sözlüklerde bu süreçlere oluşturulan veri tabanının tercüme edilmesi süreci de eklenir. Sözlüğü oluştururken bazı yazılımlardan yardım alınabilir. Bu yazılımlar ‘bütünce sorgulama sistemi (CQS)’ ve ‘sözlük yazma sistemi (DWS)’ olarak adlandırılırlar. Sözlükteki tekrarlanan öğelerin istikrarlı bir ( ) ğbiçimde nasıl ele alınacağı temel bir ilkedir ve bunun için bazı kurallar dizisine ihtiyaç duyulur. Bu kurallar dizisi de sözlükbilimciler için bir talimat kitabı niteliğinde olan ve ‘Biçem Klavuzu’ adı verilen bir el kitabında bir araya getirilir. Bu el kitabı her bir madde başlığının nasıl ele alınacağı konusunda bilgiler verir ve biçimbilim, değişik sözcük ş ğ ğ g ç , ğ şbiçimleri, dilbilgisi, etiketler ve onların kullanımları, tanımlamalar, örnekler, türetilmiş sözcük biçimleri ve çapraz gönderimler alanlarını kapsar. Anlam açısından ortak bir öğeye sahip olan bir sözcük dizisi oluşturulur. Bu sözcük dizisine ait olan sözcüklerin madde başlıklarını yazmayı kolaylaştırmak için tasarlanmış çerçeve ise ‘şablon madde ş y y y ş ç ş ç ç şbaşlığı’ olarak adlandırılır. Şablon madde başlıkları yazı sürecini verimli hale getirir ve sözcük dizisinin sistematik ve anlaşılabilir bir içeriğe sahip olmasını sağlar.

Page 24: methods and resources

Write the stages followed during the dictionary-writing process.te t e stages o o ed du g t e d ct o a y t g p ocessThese stages are rationale, analysis, transfer and synthesis.

What is the aim of compiling the database from the corpus?The purpose of this database is to store selected facts about the word in a systematic wayThe purpose of this database is to store selected facts about the word in a systematic way, so that by scanning them you can quickly and efficiently get a fix on the headword and extract the information you need for the final dictionary entry.State the function of the database entry.

h ld h l f l h h h d dIt can hold a rich selection of corpus examples showing the headword in use in its various meanings and patterns.What are the benefits of a well-designed dictionary database?A well-designed and well-populated dictionary database represents a valuable piece ofA well designed and well populated dictionary database represents a valuable piece of intellectual property. As well as being usable as a basis for all kinds of dictionary and as an information-source for linguistic research, it is likely to be attractive to builders of computer applications. The value of the database lies in the fact that it is an unbiased record of what is happening in the one single language it is describing.What are the goals of transfer and synthesis stages?

Page 25: methods and resources

The purpose of transfer stage is to build up a body of target language equivalents of the p p g p y g g g qheadword in as many contexts as possible, so that when the entry editors come to extract the final entry they have all their options assembled for them in one place. The purpose of synthesis stage is to produce the final entry, the one most appropriate for the typical user of your dictionary.Write the software types needed for the stages of the dictionary writing process.These are ‘corpus query system’ and ‘dictionary writing system’.What does a ‘lexical profile’ means and what is the benefit of ‘lexical profiling’?What does a lexical profile means and what is the benefit of lexical profiling ?A lexical profile is a kind of statistical summary which reveals the salient facts about the way a word most typically combines with other words. Lexical profiling offers a solution that maximizes the value of a large corpus while reducing the effort required by the human user Lexical profiling software adds a valuable resource to the lexicographer’s repertoireuser. Lexical-profiling software adds a valuable resource to the lexicographer’s repertoire. It was originally seen as a useful supplementary tool well-adapted for identifying collocational patterns – important information for pedagogical dictionaries.Write the components that constitute the dictionary writing system.These components are a text-editing interface, a database, and a set of administrative tools.

Page 26: methods and resources

What is a ‘Style Guide’ and what is the purpose of using it?at s a Sty e Gu de a d at s t e pu pose o us g tThe style guide – essentially a book of instructions for lexicographers – is the document in which all these guidelines are assembled. For each entry component, the editorial team needs a set of guidelines. These guidelines show how the dictionary’s style policies should be applied in individual dictionary entries.be applied in individual dictionary entries.

• Which areas are covered by a Style Guide?The Style Guide will show how each entry component should be dealth with and will cover the following areas:

• Morphology• Variant forms• Grammar• Labels and their useLabels and their use• Definitions• Examples• Derived forms

C f• Cross-referencesWrite the issues that Style Guides for teams without a DWS should include.

Page 27: methods and resources

Style Guides for teams without a DWS must include detailed instructions on issues such as:y• The correct order of the various parts of an entry• The correct font to be used for each element• The precise designation of a recurring elementMake a comparison between the earlier and the contemporary forms of StyleMake a comparison between the earlier and the contemporary forms of Style Guides.Contemporary Style Guides no longer concern themselves with trivial issues like the correct form of a grammar code. However, they need to include instructions for inputting data in the right field. The traditional Style Guide was a printed document, and could easily run to several hundred pages. Nowadays, the Style Guide is an electronic document.

• What are the benefits of a Style Guide?A clear well-structured Style Guide resolves uncertainty in cases where straightforwardA clear, well structured Style Guide resolves uncertainty in cases where straightforward rulings can be given, and provides advice in situations where lexicographers have to use their own judgment. All of this enhances editors’ confidence and improves the efficiency of the compilation process. This in turn brings benefits to the dictionary user. Users gradually get to know how their dictionary works, and if it is well organized and internally consistent, g y , g y ,they should find that unsatisfactory look-ups are relatively rare. A good Style Guide will provide the information you need in the great majority of cases.

Page 28: methods and resources

What is a ‘template entry’?at s a te p ate e t yA template entry is a framework designed to facilitate writing entries for words that belong to lexical sets. State the significance of using templates.Writing template entries is a useful form of lexicographic training for lexicographers because it streamlines the editorial process and ensures systematic and comprehensive coverage of the lexical units involved.Which stages are involved when compiling the templates?g p g p

• First choose some ‘sample’ words in the lexical set.• Share these words out amongst the lexicography team.• Each person studies the corpus data for their particular word and outlines an entry.

Everyone completes their own entry• Everyone completes their own entry.• The group assembles the facts into a template entry.• The resulting template can be used as a checklist entry for any word in the set.