

  • Number of slides: 29

The Virtual International Authority File
Thomas Hickey
ACIG
2009 July 12
ALA, Chicago IL

VIAF participants
§ Bibliothèque nationale de France
§ Deutsche Nationalbibliothek
§ Library of Congress/NACO
§ OCLC
§ National Library of the Czech Republic
§ Egypt (Bibliotheca Alexandrina)
§ National Library of Australia
§ National Library of Israel
§ Italy (ICCU)
§ National Library of Portugal
§ National Library of Spain
§ National Library of Sweden
§ Swiss National Library
§ Vatican Library
ALA 2009

Goals of the Virtual International Authority File
§ Link national-level authority records
§ Expand the concept of universal bibliographic control
§ Allow national or regional variations in authorized form to co-exist
§ Support needs for variations in preferred language, script and spelling
§ Play a role in the emerging semantic web

Scope of VIAF
§ Personal names
§ Geographic
§ Corporate
§ Title
§ Family
§ Events
§ Everything but concepts is considered in scope
§ National level, but willing to consider other sources

A standard problem: One name, multiple people
Fournier, Marcel, ‡ 1945-
Fournier, Marcel, ‡ 1946

Another standard problem: One person, multiple personas
Roberts, Nora
Elly Wilder
Robb, J. D., 1950-

Fundamental to VIAF: One persona, many representations
viaf.org/viaf/29541064

Matching process

Brief LC authority
010 n 84044261
040 DLC $c DLC $d DLC
100 1 Larson, Jack.
670 Thomson, V. The cat, c1982: $b t.p. (Jack Larson)

Enhancing the authorities
[Diagram: Derived Authority Bibliographic Record Enhanced Authority Record]

Mining the bibliographic record
LDR 00826ccm 2200289 a 4500
001 ocm10025532
005 20031229650847.0
008 840627s1982 nyuuua n eng
010 $a 84758340
040 $a DLC $c DLC
019 $a 17706440
020 $c $2.95
028 22 $a 48418 $b G. Schirmer
045 2 $b d198006 $b d198007
048 $b va01 $b ve01 $a ka01
050 00 $a M1529.3 $b .T
100 1 $a Thomson, Virgil, $d 1896-
245 14 $a The cat : $b duet for soprano and baritone / $c Virgil Thomson ; [words by Jack Larson].
260 $a New York : $b G. Schirmer, $c c1982.
300 $a 1 score (11 p.) ; $c 31 cm.
500 $a For soprano, baritone, and piano.
650 0 $a Vocal duets with piano.
600 10 $a Larson, Jack $x Musical settings.
700 1 $a Larson, Jack.
[Callout labels: Language LC Control Number LC Classification Usage Title Publisher Place of Publicati Date of Material Type Authors Publicati]
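
A minimal sketch of the mining step, assuming the record is available as text with one field per line in the "TAG INDICATORS $a ..." form shown on this slide. The helper names parse_field and name_headings are hypothetical and belong to no VIAF or MARC library; the sketch only pulls the personal-name headings (fields 100, 600 and 700) out of a few of the fields above.

```python
import re

# A few fields from the slide, one field per line (assumed text layout).
RECORD = """\
100 1 $a Thomson, Virgil, $d 1896-
245 14 $a The cat : $b duet for soprano and baritone / $c Virgil Thomson ; [words by Jack Larson].
260 $a New York : $b G. Schirmer, $c c1982.
650 0 $a Vocal duets with piano.
600 10 $a Larson, Jack $x Musical settings.
700 1 $a Larson, Jack.
"""

# Fields that carry a personal name as main entry, subject, or added entry.
NAME_TAGS = {"100", "600", "700"}


def parse_field(line):
    """Split one text line into (tag, [(subfield code, value), ...])."""
    tag, _, rest = line.partition(" ")
    # A subfield is "$", a one-character code, a space, then its value,
    # which runs until the next " $x " marker or the end of the line.
    subfields = re.findall(r"\$(\w) (.*?)(?= \$\w |$)", rest)
    return tag, [(code, value.strip()) for code, value in subfields]


def name_headings(record_text):
    """Return the distinct $a name strings found in the name fields."""
    names = []
    for line in record_text.splitlines():
        tag, subfields = parse_field(line)
        if tag not in NAME_TAGS:
            continue
        for code, value in subfields:
            if code == "a":
                name = value.rstrip(" .,")  # drop trailing punctuation
                if name not in names:
                    names.append(name)
    return names


print(name_headings(RECORD))
```

The same loop could collect titles (245), publishers (260 $b) or dates in the same way; which fields are actually mined for VIAF beyond what the slides show is not stated here.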

Information in bibliographic records
§ He is a lyricist
§ His primary subject area is music
§ He was published in the 80s and 90s by G. Schirmer and Belwin Mills in New York
§ Worked with Virgil Thomson and Gerhard Samuel
§ Jack Larson is the only name he has used on his publications
§ Etc.

VIAF data flow
[Diagram: Bibs Auths Deduplication/Disambiguation Bibs Auths VIAF History Auths VIAF]

Current state
§ Personal names from 16 files
§ Names are clustered
§ 10.4 million names
§ 8.7 million clusters
§ Identifiers assigned:
§ http://viaf.org/viaf/77390479
§ Preliminary work done on geographic names
§ Unicode throughout
§ UNIMARC and MARC-21 supported

VIAF interface is built on top of SRU
§ SRU grew out of Z39.50
§ VIAF is SRU plus URL-rewrite rules and content negotiation
§ Also modified to allow the return of records without the SRU XML wrapper
§ New query parameter: HTTP Accept
§ http://viaf.org/search?query=cql.any+all+"dempsey"+&http:accept=application/rss+xml
§ Allows support of OpenSearch (RSS returned)
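
A minimal sketch of building such a search URL, assuming the media type requested is application/rss+xml. A literal "+" in a query string is read as a space, so the media type has to be percent-encoded; urlencode does that. The endpoint and the http:accept parameter name are taken from the slide, the function name build_search_url is hypothetical, and whether the service still honours this parameter has not been checked.

```python
from urllib.parse import urlencode, parse_qs, urlsplit

SEARCH_ENDPOINT = "http://viaf.org/search"  # endpoint as written on the slide


def build_search_url(cql_query, accept=None):
    """Return a search URL, optionally asking for a media type via http:accept."""
    params = {"query": cql_query}
    if accept is not None:
        params["http:accept"] = accept
    # urlencode turns spaces into "+" and a literal "+" into "%2B".
    return SEARCH_ENDPOINT + "?" + urlencode(params)


url = build_search_url('cql.any all "dempsey"', accept="application/rss+xml")
print(url)

# Decoding the query string again recovers the original values.
print(parse_qs(urlsplit(url).query))
```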

URI Patterns and ‘Linked Data’
§ VIAF Record
  Default: http://viaf.org/viaf/9855044
  Real World Object: http://viaf.org/viaf/9855044.rwo
  HTML: http://viaf.org/viaf/9855044.html
  XML: http://viaf.org/viaf/9855044.viaf
  RDF (FOAF): http://viaf.org/viaf/9855044.rdf
  MARC 21: http://viaf.org/viaf/9855044.m21
  UNIMARC: http://viaf.org/viaf/9855044.unimarc
§ Content negotiation:
§ HTTP headers or SRU extension
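
A minimal sketch of the two ways to pick a representation described on this slide: appending a suffix to the record URI, or sending an HTTP Accept header. The suffixes come from the slide; the function names and the application/rdf+xml media type are assumptions for illustration, and the request is only prepared, not sent.

```python
from urllib.request import Request

VIAF_BASE = "http://viaf.org/viaf/"

# Suffixes as listed on the slide; the empty suffix is the default record URI.
SUFFIXES = {
    "default": "",
    "real_world_object": ".rwo",
    "html": ".html",
    "xml": ".viaf",
    "rdf": ".rdf",
    "marc21": ".m21",
    "unimarc": ".unimarc",
}


def record_uri(viaf_id, representation="default"):
    """Return the URI for one representation of a VIAF record."""
    return f"{VIAF_BASE}{viaf_id}{SUFFIXES[representation]}"


def negotiated_request(viaf_id, media_type):
    """Prepare (but do not send) a request that asks for a format via Accept."""
    return Request(record_uri(viaf_id), headers={"Accept": media_type})


print(record_uri(9855044, "rdf"))

# Assumed media type; the slide does not list which types the server maps.
req = negotiated_request(9855044, "application/rdf+xml")
print(req.full_url, req.get_header("Accept"))
```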

SRU Searching
§ Retrieve record by internal control number
§ http://viaf.org/search?query=cql.any+all+"NKC|jn19990008936"
§ Results list for George Washington
§ http://viaf.org/search?query=local.mainHeadingEl+all+"george%20washington"&stylesheet=xsl/results.xsl&sortKeys=holdingscount
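
The two searches above follow the CQL pattern of index, relation and quoted term. A minimal sketch of assembling them, assuming the index names and parameters shown on the slide; the helper names cql_clause and search_url are hypothetical, and the escaping follows the CQL convention of a backslash before an embedded double quote.

```python
from urllib.parse import quote


def cql_clause(index, relation, term):
    """Build 'index relation "term"', escaping backslashes and double quotes."""
    escaped = term.replace("\\", "\\\\").replace('"', '\\"')
    return f'{index} {relation} "{escaped}"'


def search_url(clause, sort_keys=None, stylesheet=None):
    """Return a search URL with the clause percent-encoded."""
    parts = ["query=" + quote(clause, safe="")]
    if stylesheet:
        parts.append("stylesheet=" + quote(stylesheet, safe="/"))
    if sort_keys:
        parts.append("sortKeys=" + quote(sort_keys, safe=""))
    return "http://viaf.org/search?" + "&".join(parts)


# Lookup by a source file's control number (value taken from the slide).
print(search_url(cql_clause("cql.any", "all", "NKC|jn19990008936")))

# Heading search sorted by holdings count, rendered with a stylesheet.
print(search_url(
    cql_clause("local.mainHeadingEl", "all", "george washington"),
    sort_keys="holdingscount",
    stylesheet="xsl/results.xsl",
))
```

Here spaces are encoded as %20 rather than "+"; both forms appear in the slide's own URLs.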

Matching

What makes a match?
1,705,555  Title
  846,722  Double date
  123,487  Joint author
   71,851  LCCN
   24,587  Partial date and partial title
   11,010  Partial date and publisher
    9,179  Partial title and publisher
    6,415  Name as subject
    3,168  Standard number
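
To put these counts in proportion, the sketch below computes each evidence type's share of the total. It assumes every match is counted under exactly one evidence type, which the slide does not state; if the categories overlap, the shares are only indicative.

```python
# Counts as given on the slide.
MATCH_COUNTS = {
    "Title": 1_705_555,
    "Double date": 846_722,
    "Joint author": 123_487,
    "LCCN": 71_851,
    "Partial date and partial title": 24_587,
    "Partial date and publisher": 11_010,
    "Partial title and publisher": 9_179,
    "Name as subject": 6_415,
    "Standard number": 3_168,
}

total = sum(MATCH_COUNTS.values())
shares = {reason: count / total for reason, count in MATCH_COUNTS.items()}

# Print the evidence types from most to least frequent.
for reason, count in sorted(MATCH_COUNTS.items(), key=lambda item: -item[1]):
    print(f"{reason:32s}{count:>11,d}{shares[reason]:>8.1%}")
print(f"{'Total':32s}{total:>11,d}")
```

Under that assumption, title and double-date evidence together account for roughly nine in ten of the listed matches.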

Consensus

Little consensus

Date variations are common

Occasional long chain

Example

Search results for Sharabi


Next steps
§ More participants
§ More name types (geographics, corporates, …)
§ More variety of sources
§ Rights agencies, ISNI
§ Regional files
§ Specialized files

Possible applications within OCLC
§ FRBR matching
§ Better matching of non-English metadata
§ Uniform identifier across all languages
§ Authority control for cataloging
§ Better regionalization of WorldCat.org
§ Minimize differences across languages of cataloging

Discussion
§ How would you use VIAF?
§ How important is VIAF?
§ Will anyone use linked-data URIs?