Course 12: Computer-aided terminology

Upload: stefania-balan

Post on 02-Jun-2018


  • 8/10/2019 Course 12_Computer Aided Terminology

    1/23

    Computer-aided terminology

    Electronic tools, databases


    The role of computers in terminology/terminography:

    Documentation: access to databanks and specialized texts in electronic format prior to beginning work

    Creation of a corpus

    Automatic term extraction

    Terminological records in electronic format: writing the entry

    Checking the information in the entry

    Editing glossaries in electronic format

    (ap. Rucareanu & Pavel, Cabré)


    Problems related to using electronic tools in terminology work:

    Lack of integration of computer resources into work methods

    Lack of compatibility among the resources

    Limited degree of computer processing available from each resource; human intervention is constantly required (?)

    Lack of an operative, user-friendly interface between humans and computers (resulting from difficulties related to communication in natural languages)

    Limited number of corpora, especially in languages other than the major international languages (English, French)

    (ap. M. T. Cabré 1999)


    Data banks = information organized in records, subdivided into data fields

    Steps for creating a databank

    Identification of needs

    Defining parameters (by a group of experts)

    Choosing the working group


    Design of the databank

    Choosing the hardware and software

    Entry: type of information, sources, structure of the information, etc.

    Storage: type of records, relationships among records, structure of records

    Retrieval: types of queries, formats of retrieved information

    (ap. M. T. Cabré)
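
    The three design aspects listed above (entry, storage, retrieval) can be illustrated with a very small sketch. It assumes an in-memory SQLite database; the table names, the field names and the sample records are hypothetical choices made for illustration, not a layout prescribed by the course or by Cabré.

```python
import sqlite3


def build_databank():
    """Storage: a main table of terms plus a related table of sources."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE term (
            id INTEGER PRIMARY KEY,
            term TEXT NOT NULL,
            language TEXT NOT NULL,
            domain TEXT,
            definition TEXT
        );
        CREATE TABLE source (
            term_id INTEGER REFERENCES term(id),
            reference TEXT NOT NULL
        );
        """
    )
    return conn


def add_entry(conn, term, language, domain, definition, sources=()):
    """Entry: write one record and attach its sources to it."""
    cur = conn.execute(
        "INSERT INTO term (term, language, domain, definition) VALUES (?, ?, ?, ?)",
        (term, language, domain, definition),
    )
    term_id = cur.lastrowid
    conn.executemany(
        "INSERT INTO source (term_id, reference) VALUES (?, ?)",
        [(term_id, ref) for ref in sources],
    )
    return term_id


def find_by_domain(conn, domain):
    """Retrieval: one type of query, returning (term, language) pairs."""
    rows = conn.execute(
        "SELECT term, language FROM term WHERE domain = ? ORDER BY term",
        (domain,),
    )
    return rows.fetchall()


# Illustrative sample data only.
conn = build_databank()
add_entry(conn, "term bank", "en", "terminology",
          "A structured collection of terminological records.",
          sources=["course notes"])
add_entry(conn, "corpus", "en", "terminology",
          "A collection of texts in electronic format.")
print(find_by_domain(conn, "terminology"))
```

    Keeping sources in a separate table is one possible way to model relationships among records; a real databank would normally need more record types and more query formats.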


    Data banks of interest to terminology

    Document databases (EUR-Lex)

    Specialized text data banks (corpora)

    Economic neology in Romance languages (Romanian among them): a glossary of economic terms extracted from the press, useful for a comparative approach to terminology and for the study of neology in a specific domain.

    http://obneo.iula.upf.edu/economia/esp/index.html

    Ordilex.com, Lexiques multilingues

    http://www.ordilex.com/documents/index.html

    Grand dictionnaire terminologique

    http://w3.granddictionnaire.com/btml/fra/r_motclef/index1024_1.asp


    BNC at http://corpus.byu.edu/bnc/

    (a good page on corpora at http://corpus.leeds.ac.uk/list.html)

    Corpus of Professional English (CPE): a major research project of PERC (Professional English Research Consortium), currently underway, that, when finished, will consist of a 100-million-word computerized database of English used by professionals in science, engineering, technology, law, medicine, finance and other fields.

    http://www.perc21.org/cpe_project/index.html (PERC: http://www.perc21.org/menu.html)

    Wolverhampton Business English Corpus: 10,186,259 words in the general domain of business, collected from 23 different web sites around the world (from six months within the period 1999-2000), covering a wide variety of categories including product descriptions, company press releases, annual financial reports, business journalism, academic research papers, political speeches and government reports. POS-tagged.

    http://www.elda.org/catalogue/en/text/W0028.html

    Alternatively you can see and compare frequency lists and n-grams for various subcorpora/text genres (including business texts).

    Knowledge bases (Protégé, a free open-source ontology editor and knowledge-base framework, at http://protege.stanford.edu/index.html)

    Terminological databank: a structured collection of information about the units of meaning and designation of a specialized field, addressed to the needs of a specific group of users

    Consists of:

    A main database (including the terms)

    Other databanks related to each other (containing information on some aspect of the terms)


    Classification of databanks:

    By objectives: informative (they disseminate terminology); prescriptive (they intervene in term usage);

    By entries: based on terms; based on concepts;

    By subject matter: specialized in a subject field; specialized in several related subject fields;

    By size: large banks (administrative bodies, e.g. IATE); terminology minibanks (developed by a professional or a centre specializing in a subject field)


    By type of data (term banks, with definitions, phrase banks, encyclopedic banks, visual banks)

    By number of languages (mono-, bi-, multilingual)

    By type of data organization (organized by document, organized by terms without context)

    (M. T. Cabré)


    In addition to the dictionaries which can be purchased on CD-ROM, and glossaries built up by the translator himself, there is now the possibility of accessing term banks publicly available through the Internet. Many national and international organisations and institutions are making their term banks available in this way, and their availability will soon become the primary reason for translators to use modems.


    Organisational term banks of this type have the advantage of being more up-to-date than paper-based dictionaries, of constituting an authoritative source, and of being instantly available from the work station.

    One problem with these term banks is the lack of standardization in the layout of the term records and search engines used to access the term bank. This lack of standardization is adding yet another learning experience for the modern, professional translator.


    Pavel & Nolet (2001: 61-84) limit the computer's role to a simple set of tools the terminologist has to use at different moments of the terminological workflow. The authors mention only categories and subcategories of software used in terminology, such as documentary search tools, term extraction tools, terminology research tools, data recording tools, electronic-publishing tools, and database management tools.


    Tools for term extraction:

    Term extractor TermoStat Web

    http://idefix.ling.umontreal.ca/~drouinp/termostat_web/
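
    To give an idea of what automatic term extraction involves, here is a minimal sketch. It is not the method used by the tool above: it assumes a simple frequency-ratio heuristic in which a word is a term candidate if it is relatively more frequent in a specialized text than in a general reference text. The function names, the crude smoothing and the two sample texts are illustrative assumptions.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())


def term_candidates(specialized, reference, top_n=5, min_count=2):
    """Rank words by relative frequency in the specialized text
    divided by their (smoothed) relative frequency in the reference text."""
    spec = Counter(tokenize(specialized))
    ref = Counter(tokenize(reference))
    spec_total = sum(spec.values())
    ref_total = sum(ref.values())
    scores = {}
    for word, count in spec.items():
        if count < min_count:
            continue  # ignore words seen too rarely to be reliable
        spec_rate = count / spec_total
        # Crude add-one smoothing so unseen reference words do not divide by zero.
        ref_rate = (ref[word] + 1) / (ref_total + 1)
        scores[word] = spec_rate / ref_rate
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_n]


# Illustrative sample texts only.
specialized = ("The term bank stores term records. "
               "Each term record links a term to a concept.")
reference = ("The cat sat on the mat. Each dog links a ball to a game. "
             "The bank is closed.")
print(term_candidates(specialized, reference))
```

    The output is only a list of candidates: the terminologist still has to validate them, and real extractors usually also handle multi-word terms and part-of-speech patterns.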


    Ordilex.com, Lexiques multilingues

    http://www.ordilex.com/documents/index.html

    FRENCH-ENGLISH REAL ESTATE LEXICON: A GENUINE KNOWLEDGE BASE.

    The online version of our French-English lexicon of real estate terms gives you access to numerous features.

    http://www.ordilex.com/documents/immobilier/index.html

    Alignment tools:

    Alignment tool from Terminotix (http://www.terminotix.com/?lang=en) at http://www.youalign.com/Default.aspx

    Tools for corpus analysis (providing concordances, word lists, n-grams, collocations, +/- contexts)

    AntConc at http://www.antlab.sci.waseda.ac.jp/antconc_index.html

    Linguateca at http://www.linguateca.pt/corpografo/

    More about corpora and tools at http://courses.washington.edu/englhtml/engl560/corplingresources.htm
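
    Two of the outputs mentioned above, concordances and n-grams, can be sketched in a few lines. This is only an illustration of the idea, not how the listed tools are implemented; the function names, the context width and the sample text are assumptions.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def concordance(tokens, keyword, width=3):
    """Keyword-in-context lines: (left context, keyword, right context)."""
    lines = []
    for i, tok in enumerate(tokens):
        if tok == keyword:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append((left, tok, right))
    return lines


def ngrams(tokens, n=2):
    """Count every sequence of n consecutive tokens."""
    return Counter(zip(*(tokens[i:] for i in range(n))))


# Illustrative sample text only.
text = "A term bank stores records. A term bank serves translators."
tokens = tokenize(text)

for left, key, right in concordance(tokens, "bank", width=2):
    print(f"{left:>12} [{key}] {right}")

print(ngrams(tokens, 2)[("term", "bank")])
```

    Recurrent n-grams such as "term bank" are a first hint at collocations and multi-word term candidates, while the concordance lines supply the contexts needed for a terminological record.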


    Textalyser: this text analysis tool provides information on the readability and complexity of a text, as well as statistics on word frequency and character count. It can be of assistance to translators when calculating quotes for clients.

    http://www.lexicool.com/text_analyzer.asp
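
    The kind of statistics described above can be approximated with a short sketch. It does not reproduce the measures computed by the tool itself; the chosen statistics, the naive sentence splitting and the per-word rate used for the quote are all assumptions made for illustration.

```python
import re
from collections import Counter


def text_stats(text):
    """Basic counts: characters, words, sentences and most frequent words."""
    words = re.findall(r"\w+", text.lower())
    # Naive sentence split on ., ! and ? (abbreviations are not handled).
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    return {
        "characters": len(text),
        "words": len(words),
        "sentences": len(sentences),
        "avg_words_per_sentence": len(words) / len(sentences) if sentences else 0.0,
        "top_words": Counter(words).most_common(3),
    }


def estimate_quote(word_count, rate_per_word):
    """Price of a job billed per source word (the rate is hypothetical)."""
    return round(word_count * rate_per_word, 2)


# Illustrative sample text only.
sample = "Term banks help translators. Term banks are updated often."
stats = text_stats(sample)
print(stats)
print(estimate_quote(stats["words"], rate_per_word=0.10))
```

    Average sentence length is only a rough indicator of complexity; established readability formulas also take word length or syllable counts into account.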


    The virtualization of terminology describes the new set of integrated solutions using cloud-based terminology tools, allowing terminologists to work in an interactive and collaborative environment which facilitates real-time communication and collective classification of data, easy access to the information from anywhere and with almost any electronic device, and continuous evaluation, update, and revision of processed data. These tools open a new perspective in the further development of terminology tools, towards the implementation of terminology as a service (TaaS). (Cristina Varga, Virtualization of Research in Terminology. Cloud-based Terminology Management Tools, 2013)


    http://www.termwiki.com/

    TermWiki: cloud-based terminology management. A strong aspect of TermWiki is the complex term record form, including main term, equivalents, definition, part of speech, synonyms, domain, sub-domain, and product/producer, which allows terminologists to create complex glossaries. TermWiki terminology tools also allow the user to create multimedia glossaries because they support image and sound files.
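
    The fields of such a term record can be pictured as a simple data structure. The sketch below is only an assumption about how a record of this kind might be modelled; the class, the field types and the sample values are hypothetical and do not describe TermWiki's actual data format or API.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class TermRecord:
    """One term record with the fields listed above (hypothetical layout)."""
    main_term: str
    definition: str
    part_of_speech: str
    domain: str
    sub_domain: str = ""
    product: str = ""
    producer: str = ""
    equivalents: dict = field(default_factory=dict)  # language code -> equivalent
    synonyms: list = field(default_factory=list)
    media_files: list = field(default_factory=list)  # image or sound file paths


# Illustrative sample record only.
record = TermRecord(
    main_term="term bank",
    definition="A structured collection of terminological records.",
    part_of_speech="noun",
    domain="terminology",
    equivalents={"fr": "banque de terminologie"},
    synonyms=["terminological databank"],
)

# A glossary is then just a list of records that can be exported, e.g. as JSON.
glossary = [record]
print(json.dumps([asdict(r) for r in glossary], ensure_ascii=False, indent=2))
```

    Storing media files as a list of paths is one simple way to allow for multimedia glossaries; an actual platform would more likely store uploaded files and link them to the record.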