
811b4d4e7c71cc51f80a8e294a5b046d.ppt
- Количество слайдов: 33
OCLC / ISKO-NA Preconference Everything Need Not Be Miscellaneous: Controlled Vocabularies And Classification In A Web World Sponsored by OCLC, ISKO-NA and Université de Montréal 5 August 2008 FAST (Faceted Application of Subject Terminology): A Vocabulary to Facilitate Faceted Browsing Eric Childress Consulting Project Manager OCLC Programs & Research
Outline • A Changing World • About FAST • Illustrative uses of FAST
A Changing World • Search & Retrieval evolves • Interface expectations are set by others • Presentation and navigation patterns flow… • Data/Metadata production sources expand • Machine services are increasingly important
Search & Retrieval evolves Library catalogs 19 th Century Author, Title, Subject Card Catalogs Manual Complex, Non-intuitive Controlled Vocabulary high unit costs 21 st Century Key Word, Relevance Weighted Computers/Web Automated & Social Simpler, User-Friendly Uncontrolled Vocabulary low unit costs
Interface expectations are set by others Top Web properties by visitor June 2008 (SEW)
Presentation and navigation patterns flow… Losing favor: Gaining favor: • Text-heavy interfaces • Single-search box simplicity • All options, all the time • White-space and eye-catching graphics • Jargon-laden labels • Side-bar navigation • Tagclouds
Closer to, but not libraries…
Traditional library interfaces
Newer library system interfaces
Tagclouds
Data/Metadata production sources expand Institution-based Sophisticated rules Expert-built Authority Formal distribution channels Institutional networks cost accounting Chiefly individuals No/lightweight rules UGC (User-Generated Content) Wisdom of the crowds “Metadata in the wild” The Network cost diffusion
Machine services are increasingly important
About FAST • Arose from expert recommendations • Project of OCLC Research in consultation with LC • Complement of faceted vocabularies based on LCSH • Suitable for non-expert application • Machine-to-machine friendly
A New Frontier for Controlled Vocabularies Expert group studied issues related to controlled vocabularies and the Web environment • American Library Association’s ALCTS/SAC/Subcommittee on Metadata and Subject Analysis (1997 -2001) Conclusion: For certain circumstances a controlled vocabulary was needed which was: • Web-friendly • Low-learning curve/non-expert user-friendly Existing major vocabularies were not ready as-is Adapting an existing vocabulary might be a pragmatic option
Requirements for a New Vocabulary • Simple in structure and syntax • Usable by non-catalogers and in non-library environments • Compatible with MARC, Dublin Core, and other popular metadata schemas • Easy maintainability • Machine-compatible
Launching FAST OCLC Research project launched Team: in 1998 OCLC: Eric Childress, Kay Advisory group: ALA/ALCTS/SAC Clapton, Becky Dean, Anya Subcommittee on FAST (Faceted Dyer, Kerre Kammerer, Ed Application of Subject O’Neill (Lead), Diane Vizine. Terminology) Goetz LC CSPO: Lynn El-Hoshy (now retired), Janice Young Consultant: Lois Mai Chan (University of Kentucky)
Why Adapt LCSH? • Rich vocabulary covering all subject areas • Synonym and homograph control • Extensive hierarchical and associative references among terms • De facto standard controlled vocabulary, extensively used by libraries, contained in millions of bibliographic records • Long and well-documented history • Strong institutional support of the Library of Congress
What is FAST? • OCLC FAST (Faceted Application of Subject Terminology) • A faceted vocabulary based on LCSH • Modular – each facet may be used independently • Supports post-coordinate search & retrieval • Designed for use by non-expert assigners • Machine-friendly controlled vocabulary
Authority Control: LCSH vs. FAST LCSH FAST Very large number (billions plus) of possible headings Faceting limits the number of possible headings to a few million Common headings are established; most assigned headings are synthesized by catalogers based on rules All headings (except chronological) are established Most headings are distinct (based on NACO normalization rules*); some conflicts occur particularly with $x & $v All normalized headings are distinct; tagging and subfield coding provides no unique information (with the exception of forms) *http: \www. loc. gov/catdir/pcc/naco/normrule. html
FAST - Eight Facets Topical Subject headings ―Evaluation Geographic New York (State) ―New York Form (Genre) Guidebooks Chronological 1939 - 1945 Personal Names Kilgour, Frederick G. Corporate Names Oregon Library Association Events Olympic Games Uniform Titles Dead Sea scrolls
How is FAST built? • OCLC Research-built software processes LCSH authority file and LCSH present in World. Cat bibliographic records to automatically build: • FAST MARC authority file covering 8 facets • FAST MARC authority reference records to assist with LCSH conversion
Sample Authority Record - Geographic 001 2130675 003 OCo. LC 005 20040512160245. 0 008 040512 nneanz||babn n ana d 040 OCo. LC $b eng $c OCo. LC $f fast 043 n-us-ak 151 Pacific Ocean $z Rowan Bay 670 GNIS, Feb. 10, 2004 $b (Rowan Bay; bay; 7 mi. N of Tebenkof Bay, on W coast of Kuiu I. , Alex. Arch. ; Wrangell-Petersburg Census Area, Alaska; 56º 40'02" N, 134º 14'34" W; another Rowan Bay, pop. place in Wrangell-Petersburg Census Area) 688 LC subject usage: 0 (2006) 688 WC subject usage: 2 (2006) 751 0 Rowan Bay (Alaska : Bay)$0 (DLC)sh 2004005090
Sources of FAST Headings Library of Congress Subject Authority File LC headings that combine different facets are deconstructed into discrete headings, each containing only one facet. Headings assigned to bibliographic records in OCLC’s World. Cat Many complex headings, i. e. , those containing more than one element in the heading string, are based on literary warrant. They are derived from subject fields in the records in OCLC’s World. Cat. Headings created for FAST In some cases, faceting has required FAST headings to be created when no LCSH equivalents exists.
Example of a FAST-only heading – Events Facet In LCSH, it is common to established events as a combination of a geographic heading and a chronological ($y) subdivision: Buffalo (N. Y. ) $x History $y Civil War, 1861 -1865 Grenada $x History $y American Invasion, 1983 For these subdivisions, a FAST topical heading is also created: American Civil War, 1861 -1865 American Invasion of Grenada, 1983
LCSH to FAST Comparison FAST LCSH 600 Lincoln, Abraham, $d 1809 -1865 650 Political leadership $z United States $v Case studies 650 Genius $v Case studies 600 Lincoln, Abraham, $d 1809 -1865 $x Friends and associates 650 Presidents $z United States $v Biography 651 United States $x Politics and government $y 1861 -1865 600 Lincoln, Abraham, $d 1809 -1865 648 1861 - 1865 650 Political leadership 650 Genius 650 Friendship 650 Presidents 650 Political science 651 United States 655 Case studies 655 Biography
Current FAST Authority File Personal name headings 699, 200 Corporate name headings 351, 494 Topical headings 407, 772 Geographic name headings 148, 952 Chronological headings 676 Event headings 12, 225 Title headings 48, 245 Form headings 711 Total FAST authorities July 2008 1, 669, 375
Future Development Plans • Update and resynchronize all FAST headings with LCSH (In process) • Improve the LCSH to FAST conversion (In process) • Complete the FAST manual (In process) • Expand the geographic names based on usage data and add information from the Geographic Names Information System (GNIS) • Revise and expand the form (genre) facet
Sample OCLC applications of FAST World. Cat Identities Fiction. Finder
FAST headings
FAST headings
http: //fictionfinder. oclc. org/
More information OCLC FAST • Project page: http: //www. oclc. org/research/projects/fast/ • Search interface: http: //fast. oclc. org/ OCLC World. Cat Identities • http: //orlabs. oclc. org/Identities/ OCLC Fiction. Finder • http: //fictionfinder. oclc. org/
811b4d4e7c71cc51f80a8e294a5b046d.ppt