- Number of slides: 34
ETANA-ADD: An Interactive Tool for Integrating Archaeological DL Collections
JCDL 2006, Chapel Hill, NC, June 13, 2006
Naga Srinivas Vemuri, Rao Shen, Sameer Tupe, Weiguo Fan, Edward A. Fox
fox@vt.edu http://fox.cs.vt.edu
Acknowledgements (Selected)
• Sponsors: NSF grant ITR-0325579, ASOR, CWRU, ETANA, Vanderbilt U., Virginia Tech
• VT students: Vidhya Vijayaraghavan, other DLRL members
• Others: Umm el-Jimal Dig Team
Acknowledgements (Selected)
• Karen Borstad, MPP
• Giorgio Buccellati, UCLA
• Douglas Clark, Walla Walla College
• Joanne Eustis, CWRU
• Nick Fischio, CWRU
• Israel Finkelstein, Tel-Aviv University
• Paul Gherman, Vanderbilt U.
• Andrew Graham, U. Toronto
• Tim Harrison, U. Toronto
• Larry Herr, Canadian University College
• Christopher Holland, LRP
• Paul Jacobs, Mississippi State U.
• Douglas Knight, Vanderbilt U.
• Stan LaBianca, Andrews U.
• David McCreery, Willamette U.
• Eric Meyers, Duke U.
• Adam Porter, Illinois College
• Jack Sasson, Vanderbilt U.
• Tom Schaub, Indiana U. of Penn.
• Randall Younker, Andrews U.
Outline
• Introduction
• Related Work
• ETANA-ADD Tool
• Conclusions
Introduction
• ETANA, ETANA-DL, 5S
• What issues arise when integrating new collections into a DL whose metadata schema evolves (i.e., bottom-up schema evolution)?
• Can the process of integrating new collections be partially automated in such situations?
ETANA-DL
• Heterogeneity: 8 archaeological sites, 13 different artifact types
• Example artifact types: Bone, Burial, Figurine, Locus, Pottery, Seed, etc.
• Union services: Multidimensional Browsing, Searching, Recommendation, Annotation, etc.
ETANA-DL (Cont.)
• Individual (archaeological) site approach
  - Local conventions for metadata
  - Custom-built services
• ETANA-DL
  - Provides union services across sites
  - Uses a global schema that is built incrementally
The Mapping Process in ETANA
• Mapping process: the global schema defines the collections (metadata) in the system and is built incrementally
• Adding a new artifact collection
  - If the artifact type is already defined, perform mapping
  - If the artifact type is not defined, extend the global schema, then perform mapping
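The two cases for adding an artifact collection can be sketched as follows. This is a minimal illustration, not the ETANA-ADD implementation: the global schema is assumed to be a plain dictionary, and the function names and field names (`brl_site`, `brl_age`, and so on) are hypothetical.

```python
def add_collection(global_schema, artifact_type, field_mapping):
    """Map a local collection onto the global schema, extending it if needed.

    global_schema: dict of artifact type -> set of global element names.
    field_mapping: dict of local field name -> global element name.
    Returns True if the global schema had to be extended.
    """
    extended = artifact_type not in global_schema
    if extended:
        # Artifact type not yet defined: extend the global schema first.
        global_schema[artifact_type] = set(field_mapping.values())
    # Assumption of this sketch: an already-defined type is not widened here.
    unknown = set(field_mapping.values()) - global_schema[artifact_type]
    if unknown:
        raise ValueError(f"no global element for: {sorted(unknown)}")
    return extended


def map_record(record, field_mapping):
    """Rename the keys of one local record to their global element names."""
    return {field_mapping[k]: v for k, v in record.items() if k in field_mapping}


# Hypothetical example: Pottery is already defined, Burial is new.
schema = {"Pottery": {"site", "ware"}}
burial_mapping = {"brl_site": "site", "brl_age": "age"}
print(add_collection(schema, "Burial", burial_mapping))  # True: schema extended
print(map_record({"brl_site": "Umm el-Jimal", "brl_age": "adult"}, burial_mapping))
```

In the tool itself the mapping is chosen interactively by the user; the dictionary above only stands in for that choice.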
The Whole Integration Process
• Conversion process: custom DB to XML format
  - Needs to identify metadata elements in the DB
  - Results in local XML data and a local XML schema
• Mapping process
  - Needs to perform schema mapping
  - Results in global schema extension and the evolution of a new global collection
• Integration process
  - New site is "published" as an OAI provider
  - OAI harvesting results in integration of the new collection
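The conversion step (custom DB to XML) can be illustrated with a small sketch. It assumes an SQLite database and uses only the Python standard library; the table names, column names, and the `collection` root element are hypothetical and do not come from the Umm el-Jimal database or from ETANA-ADD.

```python
import sqlite3
import xml.etree.ElementTree as ET


def table_join_to_xml(conn, query, record_tag):
    """Run a query and return its rows as an XML string, one element per row."""
    cursor = conn.execute(query)
    columns = [d[0] for d in cursor.description]
    root = ET.Element("collection")
    for row in cursor:
        record = ET.SubElement(root, record_tag)
        for name, value in zip(columns, row):
            ET.SubElement(record, name).text = "" if value is None else str(value)
    return ET.tostring(root, encoding="unicode")


# Hypothetical two-table database standing in for a site's custom DB.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE burial (id INTEGER PRIMARY KEY, locus_id INTEGER, age TEXT);
    CREATE TABLE locus (id INTEGER PRIMARY KEY, area TEXT);
    INSERT INTO locus VALUES (1, 'A');
    INSERT INTO burial VALUES (10, 1, 'adult');
""")
xml_text = table_join_to_xml(
    conn,
    "SELECT burial.id AS burial_id, locus.area AS area, burial.age AS age "
    "FROM burial JOIN locus ON burial.locus_id = locus.id",
    "burial",
)
print(xml_text)
```

The join mirrors the idea that one artifact record may be spread over several tables; the column aliases become the local XML element names that the mapping step later relates to the global schema.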
The Integration Problem
• The integration process requires both technical and domain expertise
• Proposal: partially automate the process to minimize the need for technical skills
• Solution: ETANA-ADD tool
Related Work
• Gatherer: a tool used in Greenstone for adding new collections
  - Tightly coupled with Greenstone
  - Its ability to handle an evolving schema and its content is not known
• Database to OAI provider: OAICat and OAI PMH2 Perl
  - Do not accommodate the mapping process
Related Work (Cont.)
• OCHRE proposed ArchaeoML to define DL collections
  - Does not automate the integration process
  - Its ability to handle heterogeneous data is not known
• Altova MapForce
  - Does not support incremental mapping
ETANA-ADD Tool
• An interactive tool for end users
  - Partially automates the integration process
  - Minimizes the need for technical skills
  - Reuses existing tools to some extent by providing an easy GUI wrapper on top of them
ETANA-ADD Tool (Cont.)
• Process flow when using the tool: DB2XML → Schema Mapper → OAI XMLFile Data Provider
An Integration Scenario
• Adding the burial artifact collection to ETANA-DL
  - Perform the DB2XML process using ETANA-ADD
  - Perform schema mapping
  - Publish the Burial collection as an OAI data provider
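Once a collection is published as an OAI data provider, a harvester retrieves it with OAI-PMH requests. The sketch below only builds a `ListRecords` request URL and does not contact any server; the base URL and the `burial` set name are hypothetical, while `verb`, `metadataPrefix`, and `set` are standard OAI-PMH request arguments.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build an OAI-PMH ListRecords request URL (no network access)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec is not None:
        # Optional selective harvesting of one set, e.g. one artifact type.
        params["set"] = set_spec
    return f"{base_url}?{urlencode(params)}"


# Hypothetical data provider address for the newly published collection.
url = list_records_url("http://example.org/oai", set_spec="burial")
print(url)
```

A harvester would issue this request and follow any resumption tokens in the response; that part is omitted here.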
Initial Screen with Umm el-Jimal Database Open
Tables corresponding to Burial artifact selected
Performing join on tables for burial artifact
DB2XML Process Complete
Invoking Schema Mapper
Opening Global Schema
Performing Mapping Process
Extending Global Schema to Integrate Burial Artifact
Mapping Complete, Generating Global XML Collection
Complete Global XML Generation, Publishing as OAI Provider
Publishing as OAI Provider
Results
• Integrated the Umm el-Jimal site with the help of ETANA-ADD
  - Bone, Burial, Locus, Miscellaneous Artifact, Pottery Bucket
• No additional code written
• A comparison with an earlier integrated site, Megiddo (7 artifact collections)
Results (Cont.)
                          Umm el-Jimal   Megiddo
Additional LOC required   0              1350
Human hours               2              20
Conclusions
• Target users: administrators handling archaeology data (to be invited in the fall for usability studies)
• Developed ETANA-ADD to minimize the technical expertise needed to integrate new archaeological collections
• Willing to share our software, which may be applicable to other domains with similar problems (i.e., with an evolving global schema)
Questions/Comments?