lazerow lecture, utk, 20041 university of tennessee, knoxville, school of information sciences,...
TRANSCRIPT
Lazerow Lecture, UTK, 2004 1
University of Tennessee, Knoxville,School of Information Sciences,
What, Where, When, and Who:
Redesigning the Reference Environment
In Digital Libraries.
Michael Buckland
Samuel Lazerow Lecture, March 31, 2004
Lazerow Lecture, UTK, 2004 2
In the old days, one could visit a the reference collection and make notes on. . .What, When, Where, Who, Why, and Howusing specialist genres of reference work:• Dictionaries and encyclopedias• Atlases and gazetteers• Chronologies and time-lines• Biographical dictionaries• etc. This has become more difficult in a digital environment.
Lazerow Lecture, UTK, 2004 3
Some related issues:
Confusion over “genre”: What kind of a document?
Simplistic notions of “multimedia.”
Have digital libraries have been designed backwards?
How could we re-design the functionality of a reference collection in a digital environment? Form should follow function.
Based on work at Berkeley on designing search support, especially the challenge of searching across different media: text, image, numeric data, sound,…
Lazerow Lecture, UTK, 2004 4
WHAT? Searching by topic, e.g. Dewey, LCSH.
Two kinds of mapping in every search:
• Documents are assigned to topic categories.
• Queries have to map to topic categories
Also mapping between topic systems.
Obviously one would like to search seamlessly across multiple media, e.g. text corpora and socio-economic numeric data series. Is that possible?
Lazerow Lecture, UTK, 2004 5
Text
Numeric datasets
It is difficult to move between different kinds of document
Lazerow Lecture, UTK, 2004 6
Text THESAURUS
Captions Numeric datasets
Different media can be linked indirectly via metadata, but in this case you need to specify place also.
Lazerow Lecture, UTK, 2004 7
Text THESAURUS
Maps GAZETTEER Captions Numeric datasets
Proper name control requires a gazetteer -- and latitude and longitude allow points on maps.
Lazerow Lecture, UTK, 2004 8
WHERE?
People want to search by place!
For –
• Mammals in Madagascar
• Castles in Quercy
• Hikes in the Himalayas . . .
. . . but libraries provide only weak support.
Lazerow Lecture, UTK, 2004 9
Geographical search in library catalogs:
• Place name in title – if present - and
• Place names as / in Subject Headings (MARC 6XX$z, 651)
Search of other geographical clues not supported, e.g.:
• Geographical scope note (MARC 043: n-us-id = Idaho)
• Geographical codes in classification numbers, e.g.
the 7946 for San Francisco Bay in Dewey 917.94604
No spatial relationships: Within / near / next / between.
Map interfaces not yet provided
Lazerow Lecture, UTK, 2004 10
Place names are problematic:
- Variant forms: St. Petersburg, Санкт Петербург, Saint-Pétersbourg, . . .
- Multiple names: Cluj, in Romania / Roumania / Rumania, is also called Klausenburg and Kolozsvar.
- Names changes: Bombay Mumbai.
- Homographs:Vienna, VA, and Vienna, Austria; 50 Springfields.
- Anachronisms: No Germany before 1870
- Vague, e.g. Midwest, Silicon Valley
- Unstable boundaries: 19th century Poland; Balkans; USSR.
Lazerow Lecture, UTK, 2004 11
BUT places have coordinates: latitude and longitude.
. . . and a GAZETTEER links places and spaces!
A gazetteer is a place name authority file and
. . . indicates what kinds of place: “Feature type” and
. . . objectively specifies latitude and longitude and
. . . disambiguates similar place names and
. . . brings variant names together and
. . . allows places to be displayed on maps.
Lazerow Lecture, UTK, 2004 12
Project: “Going Places in the Catalog: Improved Geographical Access” (IMLS)
http://ecai.org/imls2002/
Project objectives
(1) Better use of data already in library catalog records for clarification of place and space;
(2) Link online catalogs with online gazetteers;
(3) Map display of search results;
(4) Map interface for spatial queries;
(5) Extend spatial queries beyond library to other resources relating to the same locality.
Lazerow Lecture, UTK, 2004 13
Geo-temporal search interface. Place names found in documents. Gazetteer provided lat. & long. Places displayed on map.
Timebar
Lazerow Lecture, UTK, 2004 14
Zoom on map. Click on place brings list of records. Click on record displays text.
Lazerow Lecture, UTK, 2004 15
BUT better standards for gazetteer content and format needed. - Multilingual and multiscript entries - Specialists need specialized Feature Type Thesauri, e.g.
Medieval Chinese administrative units;200 feature types in British canal archaeology.Different kinds of Buddhist temple.
- Always declare which thesaurus is being used - Short generic standard thesaurus for upward compatibility - “Preferred name” always a matter of local choice. - Time codes on records because places and names unstable - Harmonize geotemporal metadata across standards families
Based on: “A Multilingual Gazetteer System for Integrating Spatial and Cultural Resources” (NSF-ITR funded)http://ecai.org/projects/gazetteer/
Lazerow Lecture, UTK, 2004 16
WHEN? Places and place names have temporal aspects.
Time period names resemble place names.
- Ambiguous: Civil war, Renaissance, . . . which?
- Unstable: The European War, The Great War, World War I
- Periods have objective calendar dates as well as name
- Dates can display in time-lines, chronologies.
So Time period directory design resembling a gazetteer
Place name Kind of place Where (lat./long.) When
Period name Kind of period When (dates) Where
Lazerow Lecture, UTK, 2004 17
Geographical subject headings with
"Civil war" as chronological subdivision Geographical heading Chronological subdivision
Great Britain Civil war, 1642-1649
United States Civil war, 1861-1865
Spain Civil war, 1936-1939
China Civil war, 1945-1949
Nigeria Civil war, 1967-1970
Lazerow Lecture, UTK, 2004 18
Catalog search for Civil War. Geo-temporal display of sets of results. Click on choice to retrieve documents.
Lazerow Lecture, UTK, 2004 19
Text THESAURUS
Maps GAZETTEER Captions Numeric datasets
TIME PERIOD DIRECTORY Timeline Chronology
Lazerow Lecture, UTK, 2004 20
Place (and time) are broadly important across numerous tools and genres including, e.g.
Language atlases.Library catalogsBiographical dictionaries.BibliographiesArchival finding listsMuseum records, etc., etc.
Biographical dictionaries are heavy on place and time: Emanuel Goldberg, Born Moscow 1881. PhD under Wilhelm Ostwald, Univ. of Leipzig, 1906. Director, Zeiss Ikon, Dresden, 1926-33. Moved to Palestine 1937. Died Tel Aviv, 1970.
Lazerow Lecture, UTK, 2004 21
BIOG. DICT. Text THESAURUS
Maps GAZETTEER Captions Numeric datasets
TIME PERIOD DIRECTORY Timeline Chronology
Lazerow Lecture, UTK, 2004 22
BIOG. DICT. 2 BIOG. DICT. THESAURUS 3
Text 2 THESAURUS 2Text THESAURUS
Maps GAZETTEER Captions Numeric GAZETTEER 2 etc datasets GAZETTEER 3
TIME PERIOD DIRECTORY Time line TIME PERIOD DIRECTORY 2 Chronology TIME PERIOD DIRECTORY 3
Lazerow Lecture, UTK, 2004 23
Linking webpages to library catalogs, gazetteers, etc.
Websites often contain bibliographies.
Another option is to generate a live searches from the a webpage, using the Z39.50 protocol, to search for latest available resources about that topic or place.
Example: See historic sites pages of ECAI Iraq portal of internet accessible resources relating to Iraqi antiquities. Clicking on link generates searches of major U.S. and U.K. research libraries for resources relating to that site.
http://ecai.org/iraq/
Lazerow Lecture, UTK, 2004 24
Through
- standards
- good practice
- interoperability
an “intermediate infrastructure” like a traditional reference collection could be built and shared.
Thank-you!
Acknowledgments: National Science Foundation, Institute for Museum & Library Services, DARPA, and helpful discussions with Academia Sinica, Alexandria Digital Library project, and others.