the practice of terminology work · don’t forget the maintenance ... fr "indicateur [visuel]...

88
May 2014 Tekom Belgium - © D. O. G. GmbH 1 The Practice of Terminology work Dr. François Massion, D. O. G. Dokumentation ohne Grenzen GmbH ([email protected])

Upload: truongbao

Post on 13-Sep-2018

213 views

Category:

Documents


0 download

TRANSCRIPT

May 2014 Tekom Belgium - © D. O. G. GmbH 1

The Practice of Terminology work

Dr. François Massion, D. O. G. Dokumentation ohne Grenzen GmbH ([email protected])

May 2014 Tekom Belgium - © D. O. G. GmbH 2

The most important aspects of terminology work

Use of terminology

Terminology extraction

Term entry: Models and concepts

Management and distribution

Programs and standards

Overview

May 2014 Tekom Belgium - © D. O. G. GmbH 3

This is Mr. Smith, design engineer

Mr. Smith has

designed a "thing"

Mr. Smith

I am the "thing"

May 2014 Tekom Belgium - © D. O. G. GmbH 4

Journey of a concept

I need 300 timingcircuits

CustomerWe will start a sales

campaign for thetimer A-12

Sales

Now thetime emitter A-12 is being produced.

Supplier

I get a quote for theproduction of the electronic

timer

Purchasing

This is my new thing. I call it A-12 Timer

Developper

May 2014 Tekom Belgium - © D. O. G. GmbH 5

Case study

Let's show something ...

May 2014 Tekom Belgium - © D. O. G. GmbH 6

Test

How do you call this thing “N”? "soap dispenser" "detergent dispenser" "detergent drawer" "detergent tray" "soap drawer" "洗涤剂抽屉"

"清洁剂抽屉"

"洗洁精抽屉"

May 2014 Tekom Belgium - © D. O. G. GmbH 7

50 Years agoHand made documentation

May 2014 Tekom Belgium - © D. O. G. GmbH 8

What has changed since?Modularization and re-use

May 2014 Tekom Belgium - © D. O. G. GmbH 9

On the authoring side

Written by Mr. Smith, June 2007 Written by Mr. Jones, April 2009Modules

May 2014 Tekom Belgium - © D. O. G. GmbH 10

On the translation side

Translated by Mr. Martinez, May 2010

Machine translated, April 2007

(Change Hidden Key : Input the new hidden key .)

Translated by Ms. Fernandez, July 2010

Translated by Mr. Caballero, Oct. 2008

Sentences (or "segments")

May 2014 Tekom Belgium - © D. O. G. GmbH 11

Growing importance of terminology

Globalization of the world economy means: More trade and competition More products and shorter

product development cycles(example: iPhone)

More laws and regulations This leads to:

More documents in shorter time intervals More translators and authors More frequent updates More Languages

May 2014 Tekom Belgium - © D. O. G. GmbH 12

Data bases

Docu-ments

Content Management System

Machine Translation (MT)

Translation MemorySystem

"Single Source Publishing"

New translation processes

<XML>

May 2014 Tekom Belgium - © D. O. G. GmbH 13

Terminology and the outside world

Within the company: Standardize With the outside world: Understand and be

understood Customers (sales, support, user-friendly documentation) Government & institutions Suppliers Public at large

Know and use the language of the outside world

May 2014 Tekom Belgium - © D. O. G. GmbH 14

Terminological influence (in %)

100%

50%

75%

30%10%

0%

20%

40%

60%

80%

100%

120%

Ownpublications

Customers Suppliers Govt. Agencies Public

May 2014 Tekom Belgium - © D. O. G. GmbH 15

Terminology and knowledge

May 2014 Tekom Belgium - © D. O. G. GmbH 16

Terminology and knowledge

The philosophy of the inductive sciences, Band 1, 1840

William Whewell (1794-1866)

Whewell was a co-founder and president of the British Association for the Advancement of Science. He was a Master of Trinity College, Cambridge. He wrote two important studies on the history and philosophy of science. He coined the word "scientist" in 1833.

May 2014 Tekom Belgium - © D. O. G. GmbH 17

Terminology and knowledge

Knowledge is a dynamic development process Terminology helps ordering and transfer

knowledge Break down knowledge in its components:

Creation of concepts Concepts may be different between languages! Relationships between concepts

(e.g. "Belongs to") Add clarity with definitions Work out semantic characteristics

(with respect to terminology gap)

May 2014 Tekom Belgium - © D. O. G. GmbH 18

Use of terminology

Terminology extraction

Term entry: Models and concepts

Management and distribution

Programs and standards

May 2014 Tekom Belgium - © D. O. G. GmbH 19

Types of extraction

Terminology extraction

Manual

Monolingual Multilingual

Automatic

Monolingual Multilingual

Interactive

Monolingual Multilingual

May 2014 Tekom Belgium - © D. O. G. GmbH 20

Exercise 1: Extraction(downloaded from the Internet) on 28.04.2014

May 2014 Tekom Belgium - © D. O. G. GmbH 21

Small exercise

Which words do you want to extract?

May 2014 Tekom Belgium - © D. O. G. GmbH 22

Extraction in the source language

Which words do you want to extract? Which word categories are considered?

– Grammatically: • Nouns, verbs and adjectives• Bi-grams, tri-grams and collocations

– Content: • Specific technical term of a company• General technical terms• Abbreviations• General terms

May 2014 Tekom Belgium - © D. O. G. GmbH 23

How many terms shall be extracted?

Examples: Manufacturing company: 3-10,000 terms per

language Steel plant: 58,000 entries! Open biomedical ontologies: 1.5 million concepts!

Optimal scope Depends on goals, time, and resources Don’t forget the maintenance If little time: only the very specific terms for a

domain / company

May 2014 Tekom Belgium - © D. O. G. GmbH 24

Terminology and goals

Goals have an impact on terminology work Examples of goals:

1. Reduce costs of translation through standardization

2. Facilitate trade between countries

3. Reach more customers4. Organize knowledge5. Improve communication

Tools Data model

Type ofcontent # of entries

GOALS

May 2014 Tekom Belgium - © D. O. G. GmbH 25

Terminology "at the press of a button"

Our vision

Start

Terminologyentries

May 2014 Tekom Belgium - © D. O. G. GmbH 26

But ...

Text = highly complex system of lexical entities Automatic recognition of:

Various appearances of same term ("lemmatize") Collocations: "user interface" vs. „some interface" Lexical variants:

„program", „application", „software", „tool "... Correct relations (ambiguity):

„Check settings" = „Settings of the check" or „Check the settings"?

Correct grammatical category: "Copy" = verb or a noun?

May 2014 Tekom Belgium - © D. O. G. GmbH 27

Learning from computational linguistics

Methods

Nothingcomes

out of it!

May 2014 Tekom Belgium - © D. O. G. GmbH 28

Computational linguistics

Objective: Understand natural languages in order to process them with programs

Natural Language Processing (NLP) Create a language model, e.g. recognize

components and their relations Extraction of patterns such as Adj + Noun, Noun +

Noun. Application examples:

Machine translation (MT) Terminology extraction Information mining

May 2014 Tekom Belgium - © D. O. G. GmbH 29

Word analysis: Morphology

Analytical languages (such as Chinese) and synthetic languages with affixes and inflections (case, number, gender, etc. ). Example: Drukontluchtingsklep / 减压阀 / pressure relief valve / vanne de dépressurisation

Stemming: Reduce to root form of a word (=stem): Activated, activation, actively, actives activ (stem)

Lemmatization: Reduction to base form of a word, lexical entry (=lemma) activated activate (verb)

N-grams: bi-gram, tri-gram = sequences of N consecutive words. Often adjacent, but not always: EN "[visual] clogging indicator" FR "indicateur [visuel] de colmatage"

Vorführender
Präsentationsnotizen
Exercise: Text und Schwierigceiten für die Terminologieextraction aufzeigen: Flexionen collocationen Ambiguität (grammaticalisch/semantisch) Schreibfehler Schreibvarianten Außerdem: Übung Wörter marcieren, die für die Übersetzung-QA relevant wären (lincs/rechts….)

May 2014 Tekom Belgium - © D. O. G. GmbH 30

Extraction steps

Corpus Import•Remove format and punctuation•Create list of words (single words, n-grams)

Clean-up•Remove duplicates, unwanted words, stop words• Lemmatize

Enter data•Add information according to data model (status, etc. )•Add equivalents in other languages

May 2014 Tekom Belgium - © D. O. G. GmbH 31

Which types of words? Raw corpus

Belangrijk!Zorg voor voldoende plaats voor onderhoud en instandhouding!Voldoende ruimte inplannen voor het openen van de aansluitkast en de elektrische aansluiting en de eventuele frequentieomzetter inplannen.Het betonfundament moet- geheel zijn uitgehard,- voldoende stijfheid hebben (minimaal klasse X0 conform DIN EN 206),- een horizontaal en effen oppervlak hebben,- vibraties, krachtinvloeden en stoten kunnen opnemen- zo zijn bemeten, dat de stervormige grepen (004) op de filterdeksel (003) handmatig moeten kunnen worden geopend.Plaats de pomp als de vloer aan deze voorwaarden voldoet.

May 2014 Tekom Belgium - © D. O. G. GmbH 32

Which types of words? First categorization

Belangrijk!Zorg voor voldoende plaats voor onderhoud en instandhouding!Voldoende ruimte inplannen voor het openen van de aansluitkast en de elektrische aansluiting en de eventuele frequentieomzetter inplannen.Het betonfundament moet- geheel zijn uitgehard,- voldoende stijfheid hebben (minimaal klasse X0 conform DIN EN 206),- een horizontaal en effen oppervlak hebben,- vibraties, krachtinvloeden en stoten kunnen opnemen- zo zijn bemeten, dat de stervormige grepen (004) op de filterdeksel (003) handmatig moeten kunnen worden geopend.Plaats de pomp als de vloer aan deze voorwaarden voldoet.

NewTerminology Known Terminology Stop words Words to be qualified Abbreviations/Acronyms/Product names

May 2014 Tekom Belgium - © D. O. G. GmbH 33

The Word/Excel method

Quick help in urgent situations Word:

1. Save file as a text file (=remove formatting)2. Remove superfluous characters3. Generate a list of words (replace blanks with

return) Excel:

1. Delete duplicates2. Remove stop words3. Reduce to basic form („lemmatize")

May 2014 Tekom Belgium - © D. O. G. GmbH 34

Remove superfluous words (Excel)

Stop words (MATCH function) General words (is, the, always... ) Existing terms in database Save rejected words for later

clean-up actions “recycling”

Stop words become more efficient over time

Special stop words for group of words (phrases)

Include existing terminology in stop words list

May 2014 Tekom Belgium - © D. O. G. GmbH 35

Sorting out in Excel

step 1:Insert an empty column

step 2:Enter symbol for terms you want to reject (or accept)

step 3:Sort list with this column step 4:

Edit terms

step 5:Save list of rejected entries for later re-use

May 2014 Tekom Belgium - © D. O. G. GmbH 36

Lemmatization

New column with formulaRIGHT(A1;2)

Sort on endings Edit inflectional endings

May 2014 Tekom Belgium - © D. O. G. GmbH 37

Used Excel functions

Comparison of entries (here "A1 ") with lists=MATCH(A1;MYSTOPWORDS;0) or =MATCH(A1;$C$1:$C$1000;0)

[on German OS: "VERGLEICH"]

"MYSTOPWORDS" is a variable, self-defined name Length of an entry (e.g. cell „A1")

=LEN(A1) Two last characters from the right (of „A1")

=RIGHT(A1;2)

May 2014 Tekom Belgium - © D. O. G. GmbH 38

What are N-grams? Set groups of 1, 2, n words Example: „elektrische aansluiting" More frequent in some languages (English, Italian

etc. ) Challenge: How do you recognize them?

Bi-grams, tri-grams, n-grams (1/4)

May 2014 Tekom Belgium - © D. O. G. GmbH 39

Bi-grams, tri-grams, n-grams (2/4)

Copy the unsorted list of words with an offset of one cell up

May 2014 Tekom Belgium - © D. O. G. GmbH 40

Sort in a text editor (Word, Notepad etc. ) + replace tabs with spaces

Copy in a column in XL and evaluate (partial results)

Bi-grams, tri-grams, n-grams (3/4)

May 2014 Tekom Belgium - © D. O. G. GmbH 41

Excel result:

Review list manually

Bi-grams, tri-grams, n-grams (4/4)

May 2014 Tekom Belgium - © D. O. G. GmbH 42

No one-to-one symmetry:

1) Different morphology: Dutch: frequentieomzetter (1 word); English: frequency converter (2 words);

Chinese: 目录(2 characters)

Chinese: No plural in Chinese symbols: treaty / treaties = 条约

2) Unmatching word categories

3) Different syntactical patterns

4) Semantical differences (terminology gap): Different nomenclatures (e.g. international classification of diseases (ICD) but differing

categories in many countries)

Bilingual extraction: Issues

May 2014 Tekom Belgium - © D. O. G. GmbH 43

Approaches to extract bilingual terminology:Option #1 Align texts (e.g. translate with CAT) Extract separately key terms in both

languages and highlight them Bilingual term alignmentOption #2 Align texts Sort according to length Shorter entries (up to 30 characters)

may be matching termsOption #3 Use extraction software like

Multiterm Extract

Methodology

May 2014 Tekom Belgium - © D. O. G. GmbH 44

Equivalence and standards

Standard ISO 12620 describes equivalence categories: narrower equivalent quasi-equivalent (or near-equivalent) broader equivalent phrase

• Qualifier assigned to a phraseological unit in one language that expresses the same semantic content as a phraseological unit in another language.

Automatic translation of "time" in various languages

May 2014 Tekom Belgium - © D. O. G. GmbH 45

Tips & tricks (1)

Where do you find terminology? Databases (IATE = Inter-Active Terminology for Europe, LEO) Online dictionaries (e.g. : "inurl:terminology") Collections of dictionaries Discussion groups (www.proz.com)

Search engines Helpful search methods

Searched phrase in quotation marks Search in multiple languages "socket" + "ist ein" Search in multilingual sites: "site:ch" term + "de-en" (example: Schraube de-en)

May 2014 Tekom Belgium - © D. O. G. GmbH 46

Tips & tricks (2)

Use the Internet

(a) To validate an expression

(b) To find the frequency of alternatives

Example: "Schakelschema" in French? Schéma de câblage Schéma des connexions Schéma des circuits Schéma électrique

May 2014 Tekom Belgium - © D. O. G. GmbH 47

Tips & tricks (3)

Use the image search feature

Frequency converter = 变频器 ???

May 2014 Tekom Belgium - © D. O. G. GmbH 48

Use of terminology

Terminology extraction

Term entry: Models and concepts

Management and distribution

Programs and standards

May 2014 Tekom Belgium - © D. O. G. GmbH 49

Organizing the terminology

Classical dictionaries are sorted in alphabetical order.

No way to group words with the same meaning: Synonyms such as "application" and "software" Abbreviations and full names: "afz." and "afzender",

Classical dictionaries list all meanings of one word. "bank": what are the meanings of this word?

Same with multilingual dictionaries

May 2014 Tekom Belgium - © D. O. G. GmbH 50

The „semiotic triangle"

" Electric socket "

socket jack wall outlet junction box

Term (= designation)„stopcontact“

Mental representation = "concept"

(actual) object

May 2014 Tekom Belgium - © D. O. G. GmbH 51

What is the difference?

May 2014 Tekom Belgium - © D. O. G. GmbH 52

Concept / Term

Term-based terminologies Traditional dictionaries Encyclopedias

Concept-based terminologies The basic entry is a concept Can include multiple terms

(e.g. synonyms) and languages

May 2014 Tekom Belgium - © D. O. G. GmbH 53

But what is really a concept?

Reality can be looked at in different ways

Different visions and perceptions: „drawer“ vs. „schuiflade“„exit“ „uitgang“ / „afrit“

Influence of culture, educational system, languagecommunity, classification systems … Translation issues

Why is this the same?Why is this different?

May 2014 Tekom Belgium - © D. O. G. GmbH 54

Componential Analysis (CA): relevantcharacteristics

Useful to define the degree of equivalence between words in a language or between languages

Concepts, terms and synonyms

May 2014 Tekom Belgium - © D. O. G. GmbH 55

Meaning relationships

Synonyms - Homonyms

Concept Term

Synonymy

Polysemy

Homonymy

1

123

123

TTT

TaTbTc

T

softwareprogramapplication

foot

mouse (pl. "mice")mouse (pl. "mouses", "mice")

May 2014 Tekom Belgium - © D. O. G. GmbH 56

Exercise 2: Data model

Which information should be added to the terms?

What are the hierarchical levels in the data model?

Which levels the additional information should be allocated to?

May 2014 Tekom Belgium - © D. O. G. GmbH 57

Data categories

ISO 12620:2009-12 defines more than 200 data categories

"Terminology and other language and content resources - Specification of data categories and management of a Data Category Registry for language resources" Concept oriented categories Administrative categories Term related categories

www.isocat.org

May 2014 Tekom Belgium - © D. O. G. GmbH 58

A data model (example)

• Definition• Source of definition• Image• Attribute• Comment

Concept Level

• Homonym available (Y/N)Language level

• Status• Word Type• Usage• Example• Term comment

Term level

ApprovedTo be checked

NounVerbAdjectiveAdverbProper NameAbbreviationShort FormDisplay Text

AuthorizedProhibitedUnclearNot requiredTo be deleted

May 2014 Tekom Belgium - © D. O. G. GmbH 59

Semantic enrichment

Adds important information to the meaning of the concept

Particularly important for complex or ambiguous terms: "device", "disk" ...

Instruments: Definition Reference Image Contextual data (example) Remarks

May 2014 Tekom Belgium - © D. O. G. GmbH 60

Definitions must: Have a coherent structure Give unique essential characteristics

(function, appearance, consistence) Be as short as possible

Definitions must not: Consist of synonyms ("a dog is a hound") Be circular (A is B and B is A) Be negative (XYZ is no ABC) Be too narrow / broad

What is a good definition?

May 2014 Tekom Belgium - © D. O. G. GmbH 61

Define a touch screen

Touch screenBij een aanraakscherm of touch screen kan je met de vinger of een speciale pen (stylus genoemd) een handeling op de monitor uitvoeren. Er komt dus geen toetsenbord of muis bij kijken.

touch screenaanraakgevoelig schermType beeldscherm dat reageert bij het aanwijzen met je vinger. Veel gebruikt bij infokiosken. Maakt een toetsenbord quasi overbodig.

Aanraakschermbeeldscherm waarmee je je computer bedient door bepaalde plaatsen aan te raken computers

Aanraakschermeen beeldscherm dat ook als invoerapparaat voor een computer kan worden gebruikt door het scherm aan te raken.

AanraakschermBij een aanraakscherm of touchscreen wijs je met de vinger of een speciale pen op de monitor om een handeling uit te voeren. Hier komt dus geen muis of toetsenbord aan te pas. Het grootste voordeel hiervan is dat de gebruiker in principe een oneindig aantal (virtuele) knoppen kan aansturen met hetzelfde oppervlakte.

AanraakschermEen aanraakscherm (vaak met het Engelse touchscreen aangeduid) is een beeldscherm dat ook als invoerapparaat voor een computer of embedded system (zoals een smartphone of tablet) kan worden gebruikt door het scherm aan te raken.

May 2014 Tekom Belgium - © D. O. G. GmbH 62

Usage of terminology

Terminology extraction

Term entry: Models and concepts

Management and distribution

Programs and standards

May 2014 Tekom Belgium - © D. O. G. GmbH 63

Distribution of terminology

Usage: What / whom for? Work: Translator, interpreter, author, programmer… Quality check Information and knowledge

Applications Translation Memories Authoring tools,

quality assurance tools Data bases etc…

Central repository Settle responsibilities and rights Stores all modification from various users

May 2014 Tekom Belgium - © D. O. G. GmbH 64

Exchange formats

Excel: List with information in columns List with information in cells

PDF, TXT, CSV, MS-Word XML based standards:

TBX (TermBase eXchange) OLIF (Open Lexicon Interchange Format) MARTIF

(Machine readable terminological interchange format) UTX (universal terminology eXchange)

[designed by AAMT (Asia-Pacific Association for Machine Translation)]

May 2014 Tekom Belgium - © D. O. G. GmbH 65

List with term data in columns

Problem with variable number of columns

Many languages, many attributes: very large tables

Def. Image EN1 Attr. EN2 Attr. DE Attr. FR ...

XXX PC Computer Rechner

May 2014 Tekom Belgium - © D. O. G. GmbH 66

List term data in rows

IDs must be allocated Difficult exchange of data:

Database is three-dimensional List is two-dimensional

ID Def. Image Language Term Attr. 1 Attr. 2

1 DE Rechner

1 EN PC

1 EN Computer

2 DE Maus

..

N

May 2014 Tekom Belgium - © D. O. G. GmbH 67

Maintenance of terminology

Terminology is never complete: Status Optional fields Control the use of synonyms Detection of polysemous terms Use experience gained from Quality Assurance (texts,

translations, terminology). Depending on the structure of the dictionary, efforts vary

(definitions or not, additional information, etc. ) Approx. 10% of the time needed to create new terminology

entries

May 2014 Tekom Belgium - © D. O. G. GmbH 68

Challenge: Manage changes

May 2014 Tekom Belgium - © D. O. G. GmbH 69

LookUp – a short introduction

Access via the internet or an intranet Definition of the different user rights

Read Comment Write Also for individual languages (notification by mail)

Free definition of data model Definable queries Suggestion system Recognition of homonyms Various export and import formats (XLS, TBX)

May 2014 Tekom Belgium - © D. O. G. GmbH 70

New dictionary

May 2014 Tekom Belgium - © D. O. G. GmbH 71

Access Data

http://88.79.119.98 User: Tekom_May14 Password: Antwerp_011

May 2014 Tekom Belgium - © D. O. G. GmbH 72

Exercise 3: Terminology quality

Which entries are not correct? Entry rules

Singular without article Uniform spelling One type/piece of information per data field

May 2014 Tekom Belgium - © D. O. G. GmbH 73

Is terminology always recognized?

Mit elektrischem Handrad,Tasten für Achs-Richtung,Vorschub, Eilgang, Not-Aus-und Zustimmtaste, sowie drei belegbaren Funktionstasten.

With electrical handwheel, keys for axial direction axis direction, feed, rapid traverse, Emergency buttonEmergency Stop button and acceptance key as well as three assignable functional keys function keys.

In database and recognized In database and not recognized Not entered in database

Entries in the terminology database: Not-Aus-Taste Achsrichtung (Bindestrich bitte prüfen)

3 unrecognized errors!

May 2014 Tekom Belgium - © D. O. G. GmbH 74

Can you always rely on terminology?

10%

15%

15%60%

Checking a project: Terminology is….

SuperfluousMissingNot usableCorrect

May 2014 Tekom Belgium - © D. O. G. GmbH 75

1. Grammar and language usage ("apple(s)")2. Unambiguiousness:

Resistance = value or hardware?Control = action or hardware?

3. Consistent spelling (timeout / time out / time-out). (Consistency of terms, ISO standard 704:2000)

4. Comprehensibility ("Hippopotomonstrosesquippediliophobia" = "the fear of long words“[Hip-po-pot-o-mon-stro-ses-quip-pe-da-li-o-phob-i-a])

5. Natural order (sealing, elastic)6. Singular without article

Quality of terminologyLinguistic criteria

May 2014 Tekom Belgium - © D. O. G. GmbH 76

Quality of terminologyTechnical criteria

Format, no parenthesis, no question marks

Data elementarity(information is separated into individual data categories)

Term autonomy in concept based dictionaries

Comprehensiveness and correctness of information(e.g. status, mandatory fields)

May 2014 Tekom Belgium - © D. O. G. GmbH 77

Overview of the problems to be solved

Detection of synonyms Special case: wrongly separated concepts Special case: contextual synonyms (access point/point)

Detection of homonyms Obsolete entries Missing, wrong, contradictory attributes Definitions: Structure and content, homonyms

without definition Organization: References, relationships

May 2014 Tekom Belgium - © D. O. G. GmbH 78

Quality of terminologyAutonomy of terms

Adapted from: Birgit Wöllbrink – Tools für Terminologiemanagement in Tekom – Schriften zur Technischen Kommunikation Nr. 12, 2008, S. 81

May 2014 Tekom Belgium - © D. O. G. GmbH 79

Translation-friendly terminology

Takes into account the international use, "neutral with regard to culture" Always use the complete form

Takes into account the issue of equivalence (e.g. narrower/broader meaning): Definitions, fields for translation related information

Terms should be understandable for readers, who do not see the actual object (machine, software, device, service).

respiratie-apparaathuishoudelijk apparaat

espresso-apparaat massage apparaat

politie apparaat iOS-apparaat

respiratie-apparaatrespiration device

household applianceespresso machine

massagerpolice apparatus

iOS deviceiOS-apparaat

apparaat

May 2014 Tekom Belgium - © D. O. G. GmbH 80

Usage of terminology

Terminology extraction

Term entry: Models and concepts

Management and distribution

Programs and standards

May 2014 Tekom Belgium - © D. O. G. GmbH 81

Terminology programs

May 2014 Tekom Belgium - © D. O. G. GmbH 82

Translation window

May 2014 Tekom Belgium - © D. O. G. GmbH 83

Multiterm Extract

May 2014 Tekom Belgium - © D. O. G. GmbH 84

Synchroterm

May 2014 Tekom Belgium - © D. O. G. GmbH 85

Some software packages…Software Description Web

acrolinx® Terminology Lifecycle Management

Terminology extraction and management, terminology platform www.acrolinx.com

Across crossTerm System for the extraction, management and review of terminology. www.across.net

AntConc3.2 Statistical analysis of text, free www.antlab.sci.waseda.ac.jpConcord Wordsmith: concordance and keywords www.lexically.net

Concordance Monolingual terminology extraction with sentence examples. Price value. www.concordancesoftware.co.uk

Extphr33 Monolingual terminology extraction, free http://publish.uwo.ca/~craven/freeware.htmHeartsome Dictionary Editor XML-based dictionary www.heartsome.net

KWIC Concordance Corpus statistics, concordance, free www.chs.nihon-u.ac.jp/eng_dpt/tukamoto/kwic_e.html

KWICKWIC Concordance tool www.kwickwic.com

Lookup Internet-enabled platform terminology - purchase + rental software version www.dog-gmbh.de

MultiTerm 2011 und MultiTerm Extract Offered by SDL / Trados. Integrated in Trados technology. Multilingual. www.sdl.com

qTerm Web-enabled system for managing terminology. www.kilgray.com

SDLX Phrase Finder Terminology extraction with linguistic skills. Available for seven languages. www.sdl.com

Simple Concordance Program (SCP) Concordance and extraction program (monolingual) - Freeware www.textworld.com

Synchroterm Bilingual terminology extraction, semi-automatic, efficient www.terminotix.com

Terminology Extractor Extraction of Monolingual terminology www.chamblon.com/terminologyextractor.htm

TermStar Terminology module of Star Transit www.star-ag.ch

Textanz Monolingual terminology extraction with context specification www.cro-code.com

Textstat Statistical analysis of text, free www.niederlandistik.fu-berlin.de/textstat/

May 2014 Tekom Belgium - © D. O. G. GmbH 86

ISO Standards

ISO Technical Committee (ISO/TC 37). Since 1951: ISO 704:2009 Terminology work – Principles and methods ISO 1087:2000 Terminology work – Vocabulary – Part 1: Theory and

application ISO 12620:1999 Computer applications in terminology - Data

CategoriesISO 12620:2009 is now an open category database("Data Category Registry"): www.isocat.org

ISO 16642:2003 Computer applications in terminology –Terminological markup framework

ISO 15188:2001 Project management guidelines for terminology standardization

ISO/DIS 22274 Systems to manage terminology, knowledge and content - Concept-related aspects for developing and internationalizing classification systems

May 2014 Tekom Belgium - © D. O. G. GmbH 87

Bibliography

Best Practices in der Terminologiearbeit. 2.0 Akten des Symposions. Deutscher Terminologie-Tag e.V., Heidelberg, (2014) Price: € 50

Cabré, M. Teresa. – Terminology : Theory, Methods and Applications. – John Benjamins Publishing; 1999

COTSOES: Recommendations for Terminology Work. Bern; 2002 (can be downloaded: http://termcoord.files.wordpress.com/2013/07/here.pdf)

Sager, Juan C. - A Practical Course in Terminology Processing. John Benjamins Publishing; 1990

Wright, S.E./Budin, G. comp. (1997/2001). Handbook of Terminology Management. John Benjamins Publishing;Volume 1: 1997; Volume 2: 2001

May 2014 Tekom Belgium - © D. O. G. GmbH 88

Thank you for your attention!

Dr. F. MassionD. O. G. Dokumentation ohne Grenzen GmbHNeue Ramtelstr. 1271229 Leonberg, [email protected]