Continuing Evolution of the
NDLTD http://www.ndltd.org
18 May 1999
ETD Workshop
Virginia Tech, Blacksburg, VA, USA
http://fox.cs.vt.edu [email protected]
Announcements
Turn in edited attendance sheets
Informal lunch
New user / implementation / Q&A session here 1-2pm
Gail McMillan over to Library
CD-ROM? FTP copies available
Request visits for summer
OUTLINE
Digital libraries
CRIM
NDLTD
Collaboration (NUDL)
Decisions/Straw Polls
Digital Libraries --- Objectives
World Lit.: 24hr / 7day / from desktop
Integrated “super” information systems: 5S: streams, structures, spaces, scenarios, societies
Ubiquitous, Higher Quality, Lower Cost
Education, Knowledge Sharing, Discovery
Disintermediation -> Collaboration
Universities Reclaim Property
Interactive Courseware, Student Works
Scalable, Sustainable, Usable, Useful
DLs: Why of Global Interest?
National projects can preserve antiquities and heritage: cultural, historical, linguistic, scholarly
Knowledge and information are essential to economic and technological growth, education
DL - a domain for international collaboration
– wherein all can contribute and benefit
– which leverages investment in networking
– which provides useful content on Internet & WWW
– which will tie nations and peoples together more strongly and through deeper understanding
Why of Interest in Computing?
Next step in fields of DBMS, HT, IR, MM
Efficiency requires advances in, e.g.,
– algorithms and data structures (ex., MPHF)
– networking (ex., HTTP-NG)
– OS (ex., support for streams)
Effectiveness requires advances in, e.g.,
– AI (ex., multilingual texts, user adaptation)
– HCI (ex., visualization, DLs embedded in activities)
CS Educ. can benefit; CS can aid Dist. Educ.
SMETE Library (from www.dlib.org)
Context: Global movement toward Digital Libraries (see April 1998 CACM)
NSF effort: Science, Mathematics, Engineering, and Technology Education Digital Library (focused on undergraduates)
– 3 workshops, yearly increasing funds / new calls
SMETE Library likely to operate as distributed federation, with separate parts for each key discipline, and to lead to a global effort
ENVISION
A User-Centered Database from the Computer Science Literature (1991-93)
Collected bib/typesetter data, converted to SGML
Scanned thousands of page images
MARIAN search engine - can be made available (also applied to the Virginia Tech library catalog)
Used as part of a prototype object-based DL, with tailored visualization interface (L. Nowell dissertation)
Envision Results Window
CS -> CSTC -> CRIM
NSF and ACM Education Committee are funding a 2 year project “A Computer Science Teaching Center” - CSTC - http://www.cstc.org/~cstc/
College of NJ, U. Ill. Springfield, Virginia Tech
Focus initially on labs, visualization, multimedia
Multimedia part is also supported by a 2nd grant to Virginia Tech and The George Washington University: http://www.cstc.org/~crim/
German Project Members
Humboldt University (lead institution)
3 other universities
5 learned societies
1 computing center
2 major libraries
Student Defends and Finalizes ETD
Student Gets Committee Signatures and Submits ETD
Graduate School Approves ETD; Student is Graduated
Library Catalogs ETD and New Students Have Access to the New Research (WWW)
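The submission steps on this slide can be read as a small linear workflow. The following is a minimal sketch of that reading, not the NDLTD software itself: the `Etd` class, the `EtdState` names, and the rule that access opens only after cataloging are all assumptions made for illustration.

```python
from enum import Enum


class EtdState(Enum):
    """Hypothetical state names, one per step on the slide."""
    DRAFT = "draft"
    DEFENDED = "defended and finalized"
    SUBMITTED = "signed and submitted"
    APPROVED = "approved by graduate school"
    CATALOGED = "cataloged by library"


# Allowed transitions: each state has exactly one successor.
NEXT_STATE = {
    EtdState.DRAFT: EtdState.DEFENDED,
    EtdState.DEFENDED: EtdState.SUBMITTED,
    EtdState.SUBMITTED: EtdState.APPROVED,
    EtdState.APPROVED: EtdState.CATALOGED,
}


class Etd:
    def __init__(self, title):
        self.title = title
        self.state = EtdState.DRAFT
        self.history = [EtdState.DRAFT]

    def advance(self):
        """Move to the next workflow step; fail once cataloged."""
        if self.state not in NEXT_STATE:
            raise ValueError(f"{self.title!r} is already cataloged")
        self.state = NEXT_STATE[self.state]
        self.history.append(self.state)
        return self.state

    @property
    def publicly_accessible(self):
        # Assumption: access opens only after the library catalogs the ETD.
        return self.state is EtdState.CATALOGED
```

Calling `advance()` four times on a new `Etd` walks it from draft to cataloged; a real deployment would also need the access choices from the Approval Form, which this sketch omits.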
NDLTD
Support Offered
Software, documentation, tech support
Email, listservs ([email protected], ...)
Donations passed on: Adobe, Microsoft
Evaluation: instruments, analysis
http://scholar.lib.vt.edu - solutions/statistics
Multilingual federated search: prototype
Temporary storage / archiving, aid - in setting up an int’l service & archive
PetaPlex
Parallel computer / storage utility for scale of 1000 to 1,000,000 gigabytes (terabyte/petabyte)
Knowledge Systems Incorporated will supply VT-PetaPlex-1 with 2.5 terabytes, > 100 processors, high speed backbone connection
High-performance “superstore” available to help with NUDL
Relationship with publishers
Concern of faculty and students that still wish to publish books or journal articles, voiced: campus, Chronicle, NPR, Times
Solution: Approval Form gives students, faculty choices on access, when to change access condition; use IPR controls in DL
Solution: by case, work with publishers and publisher associations to increase access
– AAP, AAUP
– AAAS, ACM, ACS, Elsevier, ...
Some responses from publishers
ACM: need to acknowledge copyright
Elsevier: need to acknowledge copyright
IEEE-CS: need to acknowledge copyright
ACS: wants right of first publication, but then can be freely available
U. Mich. Press (Colin Day): think seriously about effect on market of monographs
In general: restricting access to local campus will not cause any problems
Future Work
Work with publishers to increase level of access, develop stds, ensure author education
Interoperability tests among universities and with publishers to provide integrated services
Study with testbed that emerges, to improve information retrieval, browsing, interface, and other types of user support
Evaluation, improving learning experience, spread to worldwide initiative, sustainable support and coordination
Education
Retain ABDs
Improve writing (including 2nd language)
Improve organization
Improve presentation’s technical quality
Support above in libraries, media centers
Support above “anytime, anywhere”
Support faculty as well as students
Professional Societies
Like “writing across the curriculum”
Besides writing: computing/communications, information literacy, personal digital library management, tool use, research methods, collaboration, archiving/preservation
Data sets, communities of users of them
Classification systems / browsing / searching
NRC’s “On becoming a researcher”
Interoperability
IBM DL: donated equipment, powerful IPR (see TOIS, D-Lib) - in use at Jena, Essen
Z39.50: OCLC SiteSearch / VT tailored s/w
– university libraries w. catalogs of freely shared MARC/DC records pointing to archival copies
– via URNs: handles & PURLs
Dienst / NCSTRL - www.ncstrl.org: CS depts., DARPA, NSF, CNRI, Cornell - UVA is working on extensions for ETDs - Portugal is studying use for Europe
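The idea of a freely shared DC record that points to an archival copy through a persistent identifier can be illustrated with a short sketch. It uses only the Python standard library and the Dublin Core element namespace; the function names, the plain `record` wrapper element, and the handle `hdl:0000/example-etd-1` are hypothetical and are not taken from any NDLTD system.

```python
import xml.etree.ElementTree as ET

# Namespace of the Dublin Core Metadata Element Set, version 1.1.
DC_NS = "http://purl.org/dc/elements/1.1/"


def build_dc_record(title, creator, date, identifier):
    """Return a minimal Dublin Core record as an XML string.

    The wrapper element name "record" is an assumption for illustration.
    """
    ET.register_namespace("dc", DC_NS)
    record = ET.Element("record")
    for name, value in (
        ("title", title),
        ("creator", creator),
        ("date", date),
        ("type", "Electronic Thesis or Dissertation"),
        ("identifier", identifier),  # handle or PURL of the archival copy
    ):
        child = ET.SubElement(record, f"{{{DC_NS}}}{name}")
        child.text = value
    return ET.tostring(record, encoding="unicode")


def archival_identifier(xml_text):
    """Extract the persistent identifier from a shared record."""
    record = ET.fromstring(xml_text)
    return record.findtext(f"{{{DC_NS}}}identifier")


if __name__ == "__main__":
    xml_text = build_dc_record(
        "My Thesis", "A. Student", "1999-05-18", "hdl:0000/example-etd-1"
    )
    print(xml_text)
    print(archival_identifier(xml_text))
```

Because the record carries only an identifier rather than a location, a catalog that harvests it keeps working if the archival copy moves, provided the handle or PURL resolver is updated.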
Accessibility
Interface design (simple, 3D, VR)
Usability studies
Generic multi-lingual support
Support for those with disabilities
Hybrid collection (paper, MARC, abstracts, full-text, multimedia)
Disciplinary classifications, tools
Visualization of results, collection
SPIRES Visualization
Research Goals: User Aims
Record all work with NDLTD, return to any prior situation, prepare bibliography
Powerful (multilingual, text, image, video) searching; browsing (with categories, in cluster hierarchies); using citation links; SDI
Supported collaboration across one’s field: help with literature review, sharing tools and data sets, applying others’ methods
Research Goals: Testbed
Heterogeneous:
Language, terminology
Writing style, “structure”
Subjects / topics
Communities of use, “scenarios”/services
Types of related publications and co-uses
Cataloging, classification “spaces” - info. viz.
Personalization
Multimedia “streams”
Variation in size: MARC record, title + abstract, full-text, “container object”
“Societies”: dept., college, univ., state, sports group, nation, language group, ...
Key: Red: Investigators White: NDLTD (besides investigators)
Institutions - Investigators Proposing to NSF and JISC
Virginia Tech, USA: Edward A. Fox, Brian M. Kleiner, John L. Eaton, Gail McMillan
City University, London, UK: Maggie Cooper, Alistair Sutcliffe
NUDL Partners
Ricardo A. Baeza-Yates, Universidad de Chile, Chile
José Luis Brinquete Borbinha, Biblioteca Nacional, Portugal
José Hilario Canós Cerdá, Universidad Politécnica de Valencia, Spain
Stavros Christodoulakis, Technical University of Crete, Greece
Lautaro Guerra Genskowsky, Universidad Técnica Federico Santa Maria, Chile
Juan José Goldschtein, Universidad de Belgrano, Argentina
Peter Diepold, Humboldt University, Germany
Francisco Javier Jaén Martinez, Spain
Sung Hyon Myaeng, Chungnam National University, Korea
Ana Maria Beltran Pavani, Prédio Cardeal Leme, Brazil
Lim Ee Peng, Nanyang Technological University, Singapore
Alexander I. Plemnek, St.-Petersburg State Technical University, Russia
J. Alfredo Sánchez, Universidad de las Américas-Puebla, Mexico
NUDL
1/15/99 NUDL proposal to NSF under DLI2 international program
– VT: Fox, Kleiner, McMillan, Eaton
– Partners: UK (2), Singapore, Russia, Korea, Greece, Germany, plus Iberoamerican group (Spain, Portugal, Argentina, Brazil, Chile, Mexico)
– Problems: Multilingual search, multimedia submissions, requirements/usability, …
Start with ETDs, then expand to other student works, portfolios, data sets, (CS) courseware, ...
CONCLUSIONS
Digital libraries may provide powerful support for learners if properly developed and supported by suitable, scalable, sustainable infrastructure.
CSTC and CRIM will help us explore how learning about computing can be enhanced by a large number of well-crafted modules that illustrate key concepts and can be “glued” together in a variety of fashions to suit local needs.
NDLTD will have a dramatic impact on graduate education if institutions participate, which is a “win-win situation”. NUDL will broaden this.
Areas of Concern
Dissemination
Policies/Procedures/Workflow
Publishers
Standards/Preservation
Statistics/Reporting
Systems/Software/Project-Repository
Training
Governance
Areas of Concern
Dissemination
– State, Region, Nation, Continent
– Who, What Roles
– Certification/Sharing Tools
Areas of Concern
Policies/Procedures/Workflow
– Grad School Documentation
– XML-based
– MSQL-based
– Oracle, Lotus Notes, ...
Areas of Concern
Publishers
– Concerted effort?
– Record findings of individual inquiries?
Areas of Concern
Standards/Preservation
– MARC
– Dublin Core
– XML
– 3rd party helpers?
– Role of Virginia Tech?
Areas of Concern
Statistics/Reporting
– When: Training, Submission, Access, Alumni
– How to collect?
– How to combine/analyze?
– Measure educational impact
Areas of Concern
Systems/Software/Project-Repository
– FreeWAIS, OpenText, IBM DL, SiteSearch
– Add interface to collect tools (like CSTC)?
– Use of Z39.50, Dienst, Harvest, ...
Areas of Concern
Training
– Sourcebook
– Other books, booklets
– Online training
– Video, other multimedia
– References to others’ training resources
Areas of Concern
Governance
– Steering Committee
– Technical Advisory Committee
– Annual Workshop Chair
– Committee Chairs: Standards, (Multilingual) Federated Search, Training, Policies/Publisher Relations