![Page 1: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/1.jpg)
CLARIN a European ResearchCLARIN - a European Research Infrastructure
Peter WittenburgMax-Planck Institut für
Psycholinguistik, Nijmegen
![Page 2: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/2.jpg)
eResearch - InfrastructuresBozen,
16.9.2010
www.clarin.eu
J. Taylor“eScience is about global collaboration in key areas of science and the next generation of infrastructures that willgeneration of infrastructures that will enable it”
Requires new persistent platformsRequires new persistent platforms- to enable researchers to combine resourcesand tools to solve the big challenges of today (global migration crisis of cultures and minds)(global migration, crisis of cultures and minds)
- to increase the efficiency of researchers in the many small tasks- 40 % of the time of "knowledge workers" is spent, to find
useful material (Forrester Research)useful material (Forrester Research)
![Page 3: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/3.jpg)
CLARIN GoalBozen,
16.9.2010
www.clarin.eu
What: How: Offer a distributed Research Infrastructure of
allow the combination of existing and web-accessible digitalInfrastructure of
integrated and interoperable
accessible digital centers hosting resources in a common federationLanguage
Resources and Tools that serves
common federationoffer language tools and services as distrib ted ser icesTools that serves
researchers and students in the SSH
distributed services with a common web interface
![Page 4: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/4.jpg)
Key Application/Mission Bozen,
16.9.2010
www.clarin.eu
A researcher authenticates at his own organization and creates a virtual collection of resources from different repositories pand executing a virtual pipeline of processes on them.
![Page 5: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/5.jpg)
CLARIN is pan-European
CLARIN:CLARIN:• 3 Jahre Prep-Phase• ~ 200 members • ~ 25 centre candidates
![Page 6: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/6.jpg)
CLARIN Work Dimensions
at least IT oriented aspects
how to come to how to come to how to make all how to come to how to get it all
... at least IT oriented aspects
a persistent and stable
infrastructure?
a federation and how to get
access?
how to make all of their LRT
visible?
how to come to interoperable
services?
how to get it all together for
user services?
community service CMDI future & service pan-European community centres provider
federationshort term solution
oriented architecture
demo cases
CLARIN has other very important aspects:• Relation with SSH disciplines - mainly driven by national funds• Education/Training, Help/Support/Advice, Dissemination
Harmoni ation of licencing and Code of Cond cts• Harmonization of licencing and Code of Conducts• Specification of the ERIC legal framework to ensure persistency
![Page 7: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/7.jpg)
Community Centres
25 Centre Candidates
all are busy with restructuring plans
2 already give long-term preservation service
how to come to a persistent and stable
how to come to a federation
and how to get
how to make all of their LRT
visible?
how to come to interoperable
services?
how to get it all together for
user services?infrastructure? access?visible? services? user services?
community centres
service provider
federation
CMDI future & short term solution
service oriented
architecture
pan-European demo cases
CLARIN Centres
CentresCriteria
Long-termPreservation
REPLIX Replication
![Page 8: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/8.jpg)
Service Provider Federation
• Service Provider Federation
• Agreement 1
setup federation technology
build initial federation
setup EPIC service
central user attribute server
g• n centers members
• Link up with nationalIdFs
• Agreement 2
how to come to a persistent and stable
how to come to a federation
and how to get
how to make all of their LRT
visible?
how to come to interoperable
services?
how to get it all together for
user services?
• Agreement 2• DFN De• HAKA Fi• SURFnet Nl
infrastructure? access? visible? services? user services?
community centres
service provider
federation
CMDI future & short term solution
service oriented
architecture
pan-European demo cases
• 1 Mio pot. Users-id
• currently more countries and centers coming
h // id ihttp://www.pidconsortium.eu
Trust Domain
Initial Federation
PID Service
![Page 9: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/9.jpg)
Metadata Domain
ISOcat concept registry
myprofile
CLARIN component registry
component registration
CMDI Infra
ISOcat development
setup OAI PMH machinery
Category Definition
LRT Inventory
Virtual Language World
ARBIL MD Editor component
editor
ypg y
how to come to a persistent and stable
how to come to a federation
and how to get
how to make all of their LRT
visible?
how to come to interoperable
services?
how to get it all together for
user services?metadata
user area
infrastructure? access? visible? services? user services?
community centres
service provider
federation
CMDI future & short term solution
service oriented
architecture
pan-European demo cases
editorconcept registration
?
metadata
descriptions
Component Metadata
Metadata now
Virtual Collection
ISOcat Registry
VLO Observatory
![Page 10: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/10.jpg)
Service Oriented Architecture
Stuttgart Tübingen Leipzig
Service Framework Specification
Web Service and Processing Chains
Standards and Best Practices
Web 2.0 Application for RepositoryStandard-conformant
how to come to a persistent and stable
how to come to a federation
and how to get
how to make all of their LRT
visible?
how to come to interoperable
services?
how to get it all together for
user services?
Tool Chainingand Execution
Text Corpus Encoding
infrastructure? access? visible? services? user services?
community centres
service provider
federation
CMDI future & short term solution
service oriented
architecture
pan-European demo cases
Stuttgart Tübingen Berlin Leipzig FinlandRomania
Service Oriented
Infrastructure
Web Services Interoperability
Standards & Best
Practices
![Page 11: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/11.jpg)
Demo Cases (just started)
EU Identity Index Case
Multimedia/multimodal Case
Folkstory Case
C4/WebLicht Corpus Case
how to come to a persistent and stable
how to come to a federation
and how to get
how to make all of their LRT
visible?
how to come to interoperable
services?
how to get it all together for
user services?infrastructure? access? visible? services? user services?
community centres
service provider
federation
CMDI future & short term solution
service oriented
architecture
pan-European demo cases
![Page 12: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/12.jpg)
not alone ...
EUDAT
Meta-Net
![Page 13: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/13.jpg)
need to take care of data ...
Data UsersUser functionalitiesData capture & transfer
generators Users
ion
Virtual Research EnvironmentsCLARIN, DARIAH etc
Community Support Servicesa
Cur
at Data discovery & navigationWorkflow generationAnnotation,Tr
ust
Services
Dat
a Annotation, Interpretability
Safe & persistent storage
Daten e-Infrastructure
Common Data ServicesSafe & persistent storageIdentifiers, Authenticity, Workflow execution, Mining
Architecture created by EC High Level Expert Groupwill be a guideline for coming decades
![Page 14: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/14.jpg)
why European?Bozen,
16.9.2010
www.clarin.eu
live in a multilingual sharing costs in all gEurope with a joint historical tradition
grespects is more efficient
and need to exploit this strength
h
finally it's about global competition
l i SSHmany research questions are cross-national
also in SSH
nationalrequired standards cannot be nationalcannot be national
![Page 15: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/15.jpg)
Why now?Bozen,
16.9.2010
www.clarin.eu
there is the ESFRI we need to organize our process and all countries are synchronized which is a
resource domain due to huge increase of data (MPI: 200 TB)synchronized which is a
unique chance to build infrastructures
(MPI: 200 TB)we need to take care to not loose our cultural
in total 44 initiatives on the ESFRI roadmap and there is the
and scientific memorythere is a huge uptake of RI and there will be
potential of gain by an eco system of RI
of RI and there will be many funding streams!!!
![Page 16: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/16.jpg)
who and when?Bozen,
16.9.2010
www.clarin.eu
current EU CLARIN consortium in prep phase (08-10): 32 partners from 24 countries
CLARIN construction phase from 2011; main funds byCLARIN construction phase from 2011; main funds by national programs - but additional funding streams by EC connected to RI
legal issue: foundation of a European Research Infrastructure Consortiums (ERIC) as basis for future withInfrastructure Consortiums (ERIC) as basis for future with automatic qualification to participate in programs
![Page 17: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/17.jpg)
Organisation of the CLARIN ERIC
CLARINUtrecht
![Page 18: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/18.jpg)
who seems to be on board?Bozen,
16.9.2010
www.clarin.eu
Belgium Bulgaria Germany Denmark EstoniaBelgium, Bulgaria, Germany, Denmark, Estonia,
Latvia, Finland, Croatia, Netherlands, Norwegen,
Austria, Portugal, Spain, Czech Republic, Hungary,
South Tirol ?South Tirol, ?
Some are discussing: FR, SW, GR?, etc.
![Page 19: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/19.jpg)
Advantage of membershipBozen,
16.9.2010
www.clarin.eu
privilaged access to CLARIN federationp gnetworked with CLARIN centres (direct technology transfer)technology transfer)a word when discussing priorities, agreements best practicesagreements, best practicesaccess to EC funding streams
t d ti d t i iaccess to education and training programs to make our young generation competitive
![Page 20: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/20.jpg)
Weitere InformationenBozen,
16.9.2010
www.clarin.eu
CLARIN web site: http://www.clarin.eupCLARIN office: [email protected]
CLARIN Newsletter:http://www.clarin.eu/newsletter
CLARIN members:http://www.clarin.eu/members
![Page 21: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/21.jpg)
Thanks for your attention.
![Page 22: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/22.jpg)
CLARIN Usage Scenario
Scenario: A Serbian and a German PhD student want to study language variation in the Balkan areastudy language variation in the Balkan area
Resource: via VLO they find all relevant language variation data for that area
Tools/Services: Modern clustering methods available via the web allow to quickly build dialect continua on top of a geographic map; visualization services allow to pipeline this to get a nice outputto get a nice output
![Page 23: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/23.jpg)
Visualization of Dialect Data: Clustering
![Page 24: CLARIN - a European Researcha European Research …CLARIN Goal Bozen, 16.9.2010 What: How: Offer a distributed Research Infrastructure of allow the combination of existing and web-Infrastructure](https://reader035.vdocument.in/reader035/viewer/2022071411/610639c70ce3d877bb36b203/html5/thumbnails/24.jpg)
CLARIN Usage Scenario
Scenario: Linguists, sociologists and ethnologists want to study the cultural and linguistic differences of parliamentstudy the cultural and linguistic differences of parliament debates in SE, DE and GR about the swine flue and compare how such global problems are dealt with
Resource: building a virtual collections of all debates (Audio, Video, Transkription)
Tools/Services: allowing researchers to analyse and annotate gestures, intonation, word choices, timing etc
h tl f l t d b i dwhere partly powerful computers need being used
Vision: in 2011/12 such computational services will be d il bl i CLARIN 2011made available in CLARIN 2011