522a264ec37c0d96c4a48b55aaf0eac8.ppt
- Количество слайдов: 29
The Virtual International Authority File Thomas Hickey ACIG 2009 July 12 ALA, Chicago IL
VIAF participants § § § § Bibliothèque nationale de France Deutsche Nationalbibliothek Library of Congress/NACO OCLC National Library of the Czech Republic Egypt (Bibliotheca Alexandrina) National Library of Australia National Library of Israel Italy (ICCU) National Library of Portugal National Library of Spain National Library of Sweden Swiss National Library Vatican Library ALA 2009
Goals of the Virtual International Authority File § Link national-level authority records § Expand the concept of universal bibliographic control § Allow national or regional variations in authorized form to co-exist § Support needs for variations in preferred language, script and spelling § Play a role in the emerging semantic web ALA 2009
Scope of VIAF § § § Personal names Geographic Corporate Title Family Events § Everything but concepts are considered in scope § National level, but willing to consider other sources ALA 2009
A standard problem: One name, multiple people Fournier, Marcel, ‡ 1945 - Fournier, Marcel, ‡ 1946 ALA 2009
Another standard problem: One person, multiple personas Roberts, Nora Elly Wilder Robb, J. D. , 1950 - ALA 2009
Fundamental to VIAF: One persona, many representations viaf. org/viaf/29541064 ALA 2009
Matching process ALA 2009
Brief LC authority 010 n 84044261 040 DLC $c DLC $d DLC 100 1 Larson, Jack. 670 Thomson, V. The cat, c 1982: $b t. p. (Jack Larson) ALA 2009
Enhancing the authorities Derived Authority Bibliographic Record Enhanced Authority Record ALA 2009
Mining the bibliographic record LDR 00826 ccm 2200289 a 4500 1 ocm 10025532 5 20031229650847. 0 8 840627 s 1982 nyuuua n eng 10 $a 84758340 40 $a DLC $c DLC 19 $a 17706440 20 $c $2. 95 28 22 $a 48418 $b G. Schirmer 45 2 $b d 198006 $b d 198007 48 $b va 01 $b ve 01 $a ka 01 50 00 $a M 1529. 3 $b. T 100 1 $a Thomson, Virgil, $d 1896245 14 $a The cat : $b duet for soprano and baritone / $c Virgil Thomson ; [words by Jack Larson]. 260 $a New York : $b G. Schirmer, $c c 1982. 300 $a 1 score (11 p. ) ; $c 31 cm. 500 $a For soprano, baritone, and piano. 650 0 $a Vocal duets with piano. 600 10 $a Larson, Jack $x Musical settings. 700 1 $a Larson, Jack. Language LC Control Number LC Classification Usage Title Publisher Place of Publicati Date of Material Type Authors Publicati ALA 2009
Information in bibliographic records § He is a lyricist § His primary subject area is music § He was published in the 80 s and 90 s by G. Schirmer and Belwin Mills in New York § Worked with Virgil Thomson and Gerhard Samuel § Jack Larson is the only name he has used on his publications § Etc. ALA 2009
VIAF data flow Bibs Auths Deduplication/ Disambiguation Bibs Auths VIAF History Auths ALA 2009 VIAF
Current state § Personal names from 16 files § Names are clustered § 10. 4 million names § 8. 7 million clusters § Identifiers assigned: § http: //viaf. org/viaf/77390479 § Preliminary work done on geographic names § Unicode throughout § UNIMARC and MARC-21 supported ALA 2009
VIAF interface is built on top of SRU § SRU grew out of Z 39. 50 § VIAF is SRU plus URL-rewrite rules and contentnegotiation § Also modified to allow the return records without SRU XML wrapper § New query parameter HTTP Accept § http: //viaf. org/search? query=cql. any+all+"dempsey"+& http: accept=application/rss+bxml § Allows support of Open. Search (RSS returned) ALA 2009
URI Patterns and ‘Linked Data’ § VIAF Record Default http: //viaf. org/viaf/9855044 Real World Object http: //viaf. org/viaf/9855044. rwo HTML http: //viaf. org/viaf/9855044. html XML http: //viaf. org/viaf/9855044. viaf RDF (FOAF) http: //viaf. org/viaf/9855044. rdf MARC 21 http: //viaf. org/viaf/9855044. m 21 UNIMARC http: //viaf. org/viaf/9855044. unimarc § Content negotiation: § HTTP headers or SRU extension ALA 2009
SRU Searching § Retrieve record by internal control number § http: //viaf. org/search? query=cql. any+all+"NKC|jn 19990008936“ § Results list for George Washington § http: //viaf. org/search ? query=local. main. Heading. El+all+"george%20 washington“ &stylesheet=xsl/results. xsl &sort. Keys=holdingscount ALA 2009
Matching ALA 2009
What makes a match? 1, 705, 555 Title 846, 722 Double date 123, 487 Joint author 71, 851 LCCN 24, 587 Partial date and partial title 11, 010 Partial date and publisher 9, 179 Partial title and publisher 6, 415 Name as subject 3, 168 Standard number ALA 2009
Consensus ALA 2009
Little consensus ALA 2009
Date variations are common ALA 2009
Occasional long chain ALA 2009
Example ALA 2009
Search results for Sharabi ALA 2009
ALA 2009
Next steps § More participants § More name types (geographics, corporates, …) § More variety of sources § Rights agencies, ISNI § Regional files § Specialized files ALA 2009
Possible applications within OCLC § FRBR matching § Better matching of non-English metadata § Uniform identifier across all languages § Authority control for cataloging § Better regionalization of World. Cat. org § Minimize differences across languages of cataloging ALA 2009
Discussion § How would you use VIAF? § How important is VIAF? § Will anyone use linked-data URIs? ALA 2009


