course 12_computer aided terminology
TRANSCRIPT
-
8/10/2019 Course 12_Computer Aided Terminology
1/23
Computer-aided terminology
Electronic tools, databases
-
8/10/2019 Course 12_Computer Aided Terminology
2/23
Computer-aided terminology
The role of computers in terminology/terminography:
Documentation access to databanks, specialized
texts in electronic format prior to beginning work
Creation of corpus
automatic term extraction Terminological records in electronic format writing
the entry
Checking the information in the entry
Editing glossaries in electronic format
(ap. Rucareanu & Pavel, Cabre)
-
8/10/2019 Course 12_Computer Aided Terminology
3/23
Computer-aided terminology
Problems related to using electronic tools in terminologywork:
Lack of integrating computer resources in work methods
Lack of compatibility among the resources
Limited degree of computer processing available by eachresource, human intervention constantly required (?)
The lack of an operative user friendly interface betweenhumans and computers (resulting from difficulties relatedto communication in natural languages)
Limited number of corpora, especially in languages otherthan the major international languages (English, French)(ap. MT Cabre1999)
-
8/10/2019 Course 12_Computer Aided Terminology
4/23
Computer-aided terminology
Data banks = information organized in
records, subdivided into data fields
Steps for creating a databank
Identification of needs
Defining parameters (by group of
experts) Choosing the working group
-
8/10/2019 Course 12_Computer Aided Terminology
5/23
Computer-aided terminology
Design of the databank
Choosing the hardware and software
Entry
type of information, sources,structure of the information etc.
Storage type of records, relationship
among records, structure of records Retrievaltypes of queries, formats of
retrieved information (ap MT Cabre)
-
8/10/2019 Course 12_Computer Aided Terminology
6/23
Computer-aided terminology
Data banks of interest to terminology
Document databases (Eurlex)
Specialized text data banks (corpora)
Economic neology in Romance languages (Romanian among them) glossary of economic terms extracted from the press (useful for a
comparative approach to terminology and for the study of neology in a
specific domain.
http://obneo.iula.upf.edu/economia/esp/index.html
Ordilex.com, Lexiques multilingueshttp://www.ordilex.com/documents/index.html
Grand dictionnaire terminologiquehttp://w3.granddictionnaire.com/btml/fra/r_motclef/index1024_1.asp
http://obneo.iula.upf.edu/economia/esp/index.htmlhttp://www.ordilex.com/documents/index.htmlhttp://w3.granddictionnaire.com/btml/fra/r_motclef/index1024_1.asphttp://w3.granddictionnaire.com/btml/fra/r_motclef/index1024_1.asphttp://www.ordilex.com/documents/index.htmlhttp://obneo.iula.upf.edu/economia/esp/index.htmlhttp://obneo.iula.upf.edu/economia/esp/index.html -
8/10/2019 Course 12_Computer Aided Terminology
7/23
BNCat http://corpus.byu.edu/bnc/
(a good page on corpora at http://corpus.leeds.ac.uk/list.html)
Corpus of Professional English(CPE) a major research project of PERC(Professional English Research Consortium) currently underway that, when finished, will consist
of a 100-million-word computerized database of Eng used by professionals in science,
engineering, technology, law, medicine, finance & other fields.
Wolverhampton Business English Corpus10,186,259 wds in the generaldomain of business, collected from 23 different web sites around the world (from six months within
the period 1999-2000), covering a wide variety of categories including product descriptions,
company press releases, annual financial reports, business journalism, academic research
papers, political speeches & government reports. POS-tagged. Alternatively you can see &
compare frequency lists & ngrams for various subcorpora/text genres (including business texts).
Knowledge bases (Protga free open source ontology editorand knowledge-base framework at
http://protege.stanford.edu/index.html)
http://corpus.byu.edu/bnc/http://corpus.leeds.ac.uk/list.htmlhttp://www.perc21.org/cpe_project/index.htmlhttp://www.perc21.org/menu.htmlhttp://www.elda.org/catalogue/en/text/W0028.htmlhttp://www.elda.org/catalogue/en/text/W0028.htmlhttp://protege.stanford.edu/index.htmlhttp://protege.stanford.edu/index.htmlhttp://www.elda.org/catalogue/en/text/W0028.htmlhttp://www.elda.org/catalogue/en/text/W0028.htmlhttp://www.perc21.org/menu.htmlhttp://www.perc21.org/cpe_project/index.htmlhttp://corpus.leeds.ac.uk/list.htmlhttp://corpus.byu.edu/bnc/ -
8/10/2019 Course 12_Computer Aided Terminology
8/23
Computer-aided terminology
Terminological databank: a structuredcollection of information about the units of
meaning and designation of a specialized
field addressed to the needs of a specificgroup of users
Consists of:
A main database (including the terms) Other databanks related to each other (containing
information on some aspect of the terms)
-
8/10/2019 Course 12_Computer Aided Terminology
9/23
Computer-aided terminology
Classification of databanks:
By objectives: informative (they disseminateterminology); prescriptive (they intervene in term
usage); By entries: based on terms; based on concepts;
by subject matter: specialized in a subject field;specialized in several related subject fields;
By size: large banks (administrative bodies, ex.IATE); terminology minibanks (developed by a
professional /centre specializing in a subject field)
-
8/10/2019 Course 12_Computer Aided Terminology
10/23
Computer-aided terminology
By type of data (term banks, with definitions,phrase banks, encyclopedic banks, visual banks)
By number of languages (mono-, bi-
multilingual) By type of data organization (organized by
document, organized by terms without context)
(M.T. Cabre)
-
8/10/2019 Course 12_Computer Aided Terminology
11/23
Computer-aided terminology
In addition to the dictionaries which can be
purchased on CD-ROM, and glossaries built
up by the translator himself, there is now the
possibility of accessing term banks publiclyavailable through the Internet. Many national
and international organisations and
institutions are making their term banksavailable in this way, and their availability will
soon become the primary reason for
translators to use modems.
-
8/10/2019 Course 12_Computer Aided Terminology
12/23
Computer-aided terminology
Organisational term banks of this type have the
advantage of being more up-to-date than paper-
based dictionaries, of constituting an
authoritative source, and of being instantlyavailable from the work station.
One problem with these term banks is the lack of
standardization in the layout of the term records
and search engines used to access thetermbank. This lack of standardization is adding
yet another learning experience for the modern,
professional translator.
-
8/10/2019 Course 12_Computer Aided Terminology
13/23
Computer-aided terminology
(Pavel&Nolet: 2001, 61-84) limits its role to a
simple set of tools the terminologist has to
use in different moments of terminological
workflow. The authors mention onlycategories and subcategories of software
used in terminology such as documentary
search tools, term extraction tools,terminology research tools, data recording
tools, electronic-publishing tools, and
database management tools.
-
8/10/2019 Course 12_Computer Aided Terminology
14/23
Computer-aided terminology
Tools for term extraction:
Term extractor Termostatweb
http://idefix.ling.umontreal.ca/~drouinp/termo
stat_web/
http://idefix.ling.umontreal.ca/~drouinp/termostat_web/http://idefix.ling.umontreal.ca/~drouinp/termostat_web/http://idefix.ling.umontreal.ca/~drouinp/termostat_web/http://idefix.ling.umontreal.ca/~drouinp/termostat_web/ -
8/10/2019 Course 12_Computer Aided Terminology
15/23
-
8/10/2019 Course 12_Computer Aided Terminology
16/23
Computer-aided terminology
Ordilex.com, Lexiques multilingues
http://www.ordilex.com/documents/index.html
LEXIQUE FRANAIS-ANGLAIS DE L'IMMOBILIER : UNE VRITABLEBASE DE CONNAISSANCES.
La version en ligne de notre lexique franais-anglais des termes de
l'immobiliervous permet d'utiliser de nombreuses fonctionnalits.
http://www.ordilex.com/documents/index.htmlhttp://d/My%20Documents/terminologie/MA%20an%20II/%20http:/www.ordilex.com/documents/immobilier/index.htmlhttp://d/My%20Documents/terminologie/MA%20an%20II/%20http:/www.ordilex.com/documents/immobilier/index.htmlhttp://d/My%20Documents/terminologie/MA%20an%20II/%20http:/www.ordilex.com/documents/immobilier/index.htmlhttp://d/My%20Documents/terminologie/MA%20an%20II/%20http:/www.ordilex.com/documents/immobilier/index.htmlhttp://d/My%20Documents/terminologie/MA%20an%20II/%20http:/www.ordilex.com/documents/immobilier/index.htmlhttp://d/My%20Documents/terminologie/MA%20an%20II/%20http:/www.ordilex.com/documents/immobilier/index.htmlhttp://www.ordilex.com/documents/index.html -
8/10/2019 Course 12_Computer Aided Terminology
17/23
Computer-aided terminology
-
8/10/2019 Course 12_Computer Aided Terminology
18/23
Computer-aided terminology
Alignment tools:
Alignment tool in Terminotix
at http://www.youalign.com/Default.aspx
http://www.youalign.com/Default.aspxhttp://www.terminotix.com/?lang=enhttp://www.youalign.com/Default.aspx -
8/10/2019 Course 12_Computer Aided Terminology
19/23
Computer-aided terminology
Tools for corpus analysis (providing concordances, wordlist, n-grams, collocations, +/- contexts)
AntConc athttp://www.antlab.sci.waseda.ac.jp/antconc_index.html
Linguateca at http://www.linguateca.pt/corpografo/
More about corpora and tools athttp://courses.washington.edu/englhtml/engl560/corpli
ngresources.htm
http://www.antlab.sci.waseda.ac.jp/antconc_index.htmlhttp://www.linguateca.pt/corpografo/http://courses.washington.edu/englhtml/engl560/corplingresources.htmhttp://courses.washington.edu/englhtml/engl560/corplingresources.htmhttp://courses.washington.edu/englhtml/engl560/corplingresources.htmhttp://courses.washington.edu/englhtml/engl560/corplingresources.htmhttp://www.linguateca.pt/corpografo/http://www.antlab.sci.waseda.ac.jp/antconc_index.html -
8/10/2019 Course 12_Computer Aided Terminology
20/23
Computer-aided terminology
Textalyser This text analysis tool provides information on thereadability and complexity of a text, as well as statistics on word
frequency and character count. It can be of assistance to translators
when calculating quotes for clients.
http://www.lexicool.com/text_analyzer.asp
http://www.lexicool.com/text_analyzer.asphttp://www.lexicool.com/text_analyzer.asp -
8/10/2019 Course 12_Computer Aided Terminology
21/23
-
8/10/2019 Course 12_Computer Aided Terminology
22/23
Computer-aided terminology
the virtualization of terminology describes the new set of
integrated solutions using cloud-based terminology tools
allowing terminologists to work in an interactive and
collaborative environment which facilitates real-time
communication and collective classification of data, easyaccess to the information from anywhere and with almost
any electronic device, continuous evaluation, update,
and revision of processed data. These tools open new
perspective in the further development of the terminologytools to the implementation of terminology as a service
(TaaS). (Cristina Varga, Virtualization of Research in
Terminology. Cloud-based Terminology Management
Tools, 2013)
-
8/10/2019 Course 12_Computer Aided Terminology
23/23
Computer-aided terminology
http://www.termwiki.com/
Termw ik ia cloud-based terminology
management. A strong aspect of Termwik iis
represented by the complex term record formincluding main term, equivalents, definition, part
of speech, synonyms, domain, sub-domain,
product/producer which allows terminologist to
create complex glossaries. Termwik iterminology tools allow the user to create
multimedia glossaries too because they support
image and sound files.