
acc9cf31a7f786234de6ada48072163a.ppt
- Количество слайдов: 25
ISKO 2008, Montréal 4 W Vocabulary Mapping Across Diverse Reference Genres Michael Buckland Ryan Shaw (& others) Electronic Cultural Atlas Initiative and School of Information, Univ. of California, Berkeley Work supported by the Institute for Museum and Library Studies and by the National Endowment for the Humanities. July 7, 2008 ISKO Montréal 1
Currently: -- Distinct reference genres -- Vocabulary mapping across similar vocabularies -- Codex-like infrastructure Need: -- Interlinked reference genres -- Vocabulary mapping across dissimilar vocabularies -- Union index infrastructure July 7, 2008 ISKO Montréal 2
Five ideas about use of digital corpora. . 1. Understanding requires knowing the context. Context determines understanding! July 7, 2008 ISKO Montréal 3
Five ideas about use of digital corpora. . 1. Understanding requires knowing the context. 2. Using Internet resources should be like using a library reference collection – and as easy and as reliable. 3. Design: Find the context of any museum object, document, or performance: What is related to it in what it is, where it came from, when it originated, and who is associated with it? 4. WHAT, WHERE, WHEN, and WHO (“ 4 W”) as a structure. 5. Make better use of existing descriptive metadata. July 7, 2008 ISKO Montréal 4
Context and relationships: Ireland Irish Studies – Project diagram. Any word, name, document, or event Connect it with its context – and other resources. Facet Vocabulary Displays WHAT Thesaurus e. g. LCSH WHERE Gazetteer WHEN Period directory Timeline WHO Biograph. dict. Personal e. g. Who’s Who relations Any catalog: Archives, Libraries, Museums, TV, Publishers July 7, 2008 Crossreferences Map ISKO Montréal Any resource: Audio, Images, Texts, Numeric data, Objects, Virtual reality, Webpages 5
WHAT Subject headings Cross-references in & between vocabularies Kung fu movies SEE Martial Arts films FORMERLY Hand-to-hand fighting, oriental, in motion pictures NEED TO MAP TO & BETWEEN UNFAMILIAR VOCABULARIES “Automobile” in four dialects: - PASS MOT VEH, SPARK IGN ENG (U. S. Import/Export statistics) - TL 205 (Library of Congress Classification) - 180/280 (US Patent classification) - 3711 (Standard Industrial Classification) “HS 847120 Digital auto data proc mach contng in the same housing a CPU and input & output device. ”(International Harmonized Commodity Classification System). = Computer! July 7, 2008 ISKO Montréal 6
WHEN? What happened in IRELAND in 1690 s? Time Period Directory records in Google Earth. Zoom to Ireland 1690 s. Icon for siege of Limerick, 1690. Click link for library search. Catalog records list books and show context. July 7, 2008 ISKO Montréal 7
WHO Biographical Dictionary Complex relationships Life events metadata But ideally we need external links to the best WHAT: Actions prisoner WHERE: Places Holstein WHEN: Times 1261 -1262 resources! WHO: People Margaret Sambiria Current project: Context finding for biographical texts. Example: Electronic search engine pioneer. July 7, 2008 ISKO Montréal 8
Emanuel Goldberg, b. Moscow, 1881; son of Grigorii Goldberg; Univ. of Moscow, 1900 -04; Ph. D w. Robert Luther, Leipzig Univ. , 1906; Assistant, Adolf Miethe, TU Charlottenburg, 1906 -07; Prof, Akad. f. graphische Künste, Leipzig, 1907 -17; ICA, Zeiss Ikon, Dresden, 1917 -1933; Kinamo cine camera, 1921; microdots, 1925; search engine, 1927; Contax 35 mm camera 1932; kidnapped by Nazi SA; refugee in Paris, 1933 -37; Laboratory, Palestine, Israel, 1937; d. 1970. WHO? Click a name to search for an internet resource. July 7, 2008 ISKO Montréal 9
Emanuel Goldberg, b. Moscow, 1881; son of Grigorii Goldberg; Univ. of Moscow, 1900 -04; Ph. D w. Robert Luther, Leipzig Univ. , 1906; Assistant, Adolf Miethe, TU Charlottenburg, 1906 -07; Prof, Akad. f. graphische Künste, Leipzig, 1907 -17; ICA, Zeiss Ikon, Dresden, 1917 -1933; Kinamo cine camera, 1921; microdots, 1925; search engine, 1927; Contax 35 mm camera 1932; kidnapped by Nazi SA; refugee in Paris, 1933 -37; Laboratory, Palestine, Israel, 1937; d. 1970. WHERE? Trace a life-path. July 7, 2008 ISKO Montréal 10
Emanuel Goldberg, b. Moscow, 1881; son of Grigorii Goldberg; Univ. of Moscow, 1900 -04; Ph. D w. Robert Luther, Leipzig Univ. , 1906; Assistant, Adolf Miethe, TU Charlottenburg, 1906 -07; Prof, Akad. f. graphische Künste, Leipzig, 1907 -17; ICA, Zeiss Ikon, Dresden, 1917 -1933; Kinamo cine camera, 1921; microdots, 1925; search engine, 1927; Contax 35 mm camera 1932; kidnapped by Nazi SA; refugee in Paris, 1933 -37; Laboratory, Palestine, Israel, 1937; d. 1970. WHAT? July 7, 2008 ISKO Montréal 11
Initial sketch for “Context Finding / Building” interface. Insert / block text Ranked lists of suggested resources for each facet chosen Define facet Save search path Save link & notes as “stand-off” markup. Display of search result July 7, 2008 Save link & notes as embedded mark-up. ISKO Montréal 12
Scanned text July 7, 2008 ISKO Montréal Named Entities 13
Hovering over a named entity highlights the areas where it appears in the text. July 7, 2008 ISKO Montréal 14
Named entities are linked to specific resources or dynamic searches over relevant databases. July 7, 2008 ISKO Montréal 15
Initially, named entities are linked to keyword searches at the appropriate name authorities and metadata services. Here we see a number of possible candidates for “Henry V”. July 7, 2008 ISKO Montréal 16
Now that it has been disambiguated, the named entity links directly to the appropriate record. July 7, 2008 ISKO Montréal 17
Named entities not detected automatically can be added manually. July 7, 2008 ISKO Montréal 18
Edmund Hogan’s Onomasticon Goedelicum : Locorum et Tribuum Hiberniae et Scotiae = An Index, with Identifications, to the Gaelic Names of Places and Tribes If searchable online, one could, when reading an Irish studies text: 1. Search it (Context finder) 2. Markup text with links to it (Context builder); 3. Markup Hogan with reverse links to the Irish studies text (Context provider) – with rich consequences. July 7, 2008 ISKO Montréal 19
July 7, 2008 ISKO Montréal 20
July 7, 2008 ISKO Montréal 21
July 7, 2008 ISKO Montréal 22
Facet genres include other facets Library subject headings Topic – Geographic subdivision – Chronological subdivision Place name gazetteer Place name – Type – Spatial markers (Lat & long) – When Time Period Directory Period name – Type – Time markers (Calendar) – Where Biographical Dictionary Person – Activity type – Time – Where – Who else July 7, 2008 ISKO Montréal 23
Facet genres with facets realigned. WHAT (LCSH) What X Where When Who X X X WHERE (Place Gazet. ) X X X - WHEN (Period dir. ) X X X - WHO (Biogr dict. ) X X From LCSH “Lighthouses” to NGA Gazetteer Geographic Description Code “Lthse” (Lighthouse). Gazetteer entries give locations of instances. Vertical mappings extend semantic links vocabularies, e. g. Horizontal links provide additional context. July 7, 2008 ISKO Montréal 24
Paper-based reference collection: Codex determines structure and use. Reference Genre Dictionary, encyclopedia Atlas, gazetteer Almanac, chronology Biogr. Dict. , Who’s Who Vocabulary Topics Places Time Persons Displays Cross-refs Maps Timelines Personal relationships Facet WHAT WHERE WHEN WHO Reversed in a digital environment: Metadata forms infrastructure. Facet WHAT WHERE WHEN WHO Vocabulary Topics Places Periods Persons Displays Cross-references Maps Timeline Personal relationships Reference Genre Dictionary, Encyclopedia Atlas, gazetteer Almanac, Chronology Biogr. dictionary, Whos Who And, better, build a union index, so you know where too look! July 7, 2008 ISKO Montréal 25
acc9cf31a7f786234de6ada48072163a.ppt