b37c55a5f8a5eb9a99831a99f2f4c665.ppt
- Количество слайдов: 20
Building the Universal Library: The Promise and Challenges of Hathi. Trust John Wilkin 2 April 2009
What is Hathi. Trust? • • origins intentions size and growth projections aspirations www. hathitrust. org
current members • • • • California Digital Library Indiana University Michigan State University Northwestern University The Ohio State University Penn State University Purdue University UC Berkeley UC Davis UC Irvine UCLA UC Merced UC Riverside • • • UC San Diego UC San Francisco UC Santa Barbara UC Santa Cruz The University of Chicago University of Illinois at Chicago The University of Iowa University of Michigan University of Minnesota University of Wisconsin-Madison University of Virginia www. hathitrust. org
Preservation: OAIS Reference Model GROOVE (JHOVE) MARC record extensions (Aleph) Rights DB Page Turner Hathi. Trust API OAI Geo. IP DB CNRI Handles [Solr] Google [OCA] In-house Conversion GRIN Internal Data Loading METS/PREMIS object TIFF G 4/JPEG 2000 OCR MD 5 checksums Isilon Site Replication TSM MD 5 checksum validation www. hathitrust. org METS object PNG OCR PDF
Mission and Goals • to contribute to the common good by collecting, organizing, preserving, communicating, and sharing the record of human knowledge – materials converted from print – improve access …to meet the needs of the co-owning institutions – reliable and accessible electronic representations – coordinate shared storage strategies – “public good” … free-riders. – simultaneously …centralized …open www. hathitrust. org
growth trajectory www. hathitrust. org
accomplishments to date 1. 25 partners 2. successful ingest and millions of vols online 3. mirroring and backup www. hathitrust. org
Flow of Materials web mass digitization project network or media delivery web nfs ingest Isilon @UM nfs Sync. IQ ingest www. hathitrust. org Isilon @IU
accomplishments to date 1. 2. 3. 4. 25 partners successful ingest and millions of vols online mirroring and backup rich access www. hathitrust. org
books and journals online? www. hathitrust. org
Search inside in-copyright www. hathitrust. org
accomplishments to date 1. 2. 3. 4. 5. 25 partners successful ingest and millions of vols online mirroring and backup rich access “collection builder” www. hathitrust. org
Collection Builder
accomplishments to date 1. 2. 3. 4. 5. 6. 25 partners successful ingest and millions of vols online mirroring and backup rich access collection builder soon, full text search and data API www. hathitrust. org
Hathi. Trust Project Website Page images Comment 1 Local OPAC Catalog records Title Wasīlat al-ṭullāb li-ma‘rifat a‘māl al-layl wa-alnahār bi-ṭarīq al-ḥisāb : ﻭﺳﻴﻠﺔ ﺍﻟﻄﻼﺏ ﻝ ﻣﻌﺮﻓﺔ ﺃﻌﻤﺎﻝ ﺍﻟﻠﻴﻞ ﻭﺍﻟﻨﻬﺎﺭ ﺑﻄﺮﻳﻖ ﺍﻟﺤﺴﺎﺏ Comment 2 manuscript [between 1525? and 1861] Author Ḥaṭṭāb, Yaḥyá ibn Muḥammad, 1496 or 7 -1586 or 7. Comments Enriched records . ﻳﺤﻴﻰ ﻳﻦ ﻣﺤﻤﺪ ﺍﻟﺤﻄﺎﺏ Project staff review comments and enrich cataloging records. Comment 3
next up … • non-Google ingest (OCA & local digitization) • corpus research support – SEASR – Data export – Research center • openness strategies • binding together shared print and digital in strategy to manage local print www. hathitrust. org
Universal Library? • collaborative work around collaborative problem • preserving the published record • comprehensiveness through consolidation and sense-making • commitment to perpetuity www. hathitrust. org
opportunities economies of scale comprehensive collection combining print and digital strategies more effective digital preservation stepping stone to preserving other forms of digital content • platform for new methods of discovery • non-consumptive research • • • www. hathitrust. org
challenges • • digital preservation collaboration understanding what the right services are The Silence of the Archive: The USPS problem www. hathitrust. org
thank you! • http: //www. hathitrust. org/ • hathitrust-info@umich. edu • jpwilkin@umich. edu www. hathitrust. org
b37c55a5f8a5eb9a99831a99f2f4c665.ppt