Скачать презентацию ETANA-ADD An Interactive Tool for Integrating Archaeological DL Скачать презентацию ETANA-ADD An Interactive Tool for Integrating Archaeological DL

b2f7389ef04564cf674f55792304c942.ppt

  • Количество слайдов: 34

ETANA-ADD: An Interactive Tool for Integrating Archaeological DL Collections JCDL 2006, Chapel Hill, NC ETANA-ADD: An Interactive Tool for Integrating Archaeological DL Collections JCDL 2006, Chapel Hill, NC June 13, 2006 Naga Srinivas Vemuri, Rao Shen, Sameer Tupe, Weiguo Fan, Edward A. Fox fox@vt. edu http: //fox. cs. vt. edu

Acknowledgements (Selected) • Sponsors: NSF grant ITR-0325579, ASOR, CWRU, ETANA, Vanderbilt U. , Virginia Acknowledgements (Selected) • Sponsors: NSF grant ITR-0325579, ASOR, CWRU, ETANA, Vanderbilt U. , Virginia Tech • VT Students: Vidhya Vijayaraghavan, other DLRL members • Others: Umm el-Jimal Dig Team

Acknowledgements (Selected) • Karen Borstad, MPP • Christopher Holland, LRP • Giorgio Buccellati, UCLA Acknowledgements (Selected) • Karen Borstad, MPP • Christopher Holland, LRP • Giorgio Buccellati, UCLA • Paul Jacobs, Mississippi State U. • Douglas Clark, Walla College • Douglas Knight, Vanderbilt U. • Joanne Eustis, CWRU • Stan La. Bianca, Andrews U. • Nick Fischio, CWRU • David Mc. Creery, Willamette U. • Israel Finkelstein, Tel-Aviv University • Eric Meyers, Duke U. • Paul Gherman, Vanderbilt U. • Adam Porter, Illinois College • Andrew Graham, U. Toronto • Jack Sasson, Vanderbilt U. • Tim Harrison, U. Toronto • Tom Schaub, Indiana U. of Penn. • Larry Herr, Canadian University College • Randall Younker, Andrews U.

Outline ¡ Introduction ¡ Related Work ¡ ETANA-ADD Tool ¡ Conclusions Outline ¡ Introduction ¡ Related Work ¡ ETANA-ADD Tool ¡ Conclusions

Introduction ¡ ¡ ¡ ETANA, ETANA-DL, 5 S What are the issues involved in Introduction ¡ ¡ ¡ ETANA, ETANA-DL, 5 S What are the issues involved in integrating new collections into a DL, with evolving metadata schema (i. e. , bottom up schema evolution)? Can we partially automate the process of integrating new collections in such situations?

ETANA-DL Heterogeneity: 8 archaeological sites, 13 different artifact types l Example artifact types: Bone, ETANA-DL Heterogeneity: 8 archaeological sites, 13 different artifact types l Example artifact types: Bone, Burial, Figurine, Locus, Pottery, Seed, etc. l Union services: Multidimensional Browsing, Searching, Recommendation, Annotation, etc. l

ETANA-DL (Cont. ) ¡ Individual (archaeological) site approach l Local conventions for metadata l ETANA-DL (Cont. ) ¡ Individual (archaeological) site approach l Local conventions for metadata l Custom built services ¡ ETANA-DL l Provides union services across sites l A global schema based on incremental approach

The Mapping Process in ETANA Mapping Process: Global schema defines collections (metadata) in the The Mapping Process in ETANA Mapping Process: Global schema defines collections (metadata) in the system using an incremental approach. ¡ Adding a new artifact collection l if artifact type is already defined, perform mapping l if artifact type is not defined, then extend global schema and perform mapping ¡

The Whole Integration Process ¡ Conversion process: custom DB to XML format l l The Whole Integration Process ¡ Conversion process: custom DB to XML format l l ¡ Mapping process l l ¡ Needs to identify metadata elements in DB Results in local XML data, local XML schema Needs to perform schema mapping Results in global schema extension, and the evolution of a new global collection Integration process l l New site to be “published” as OAI Provider OAI harvesting results in integration of the new collection.

The Integration Problem: The integration process requires both technical and domain expertise. ¡ Propose: The Integration Problem: The integration process requires both technical and domain expertise. ¡ Propose: Partially automate the process to minimize the need for technical skills ¡ Solution: ETANA-ADD Tool ¡

Outline ¡ Introduction ¡ Related Work ¡ ETANA-ADD Tool ¡ Conclusions Outline ¡ Introduction ¡ Related Work ¡ ETANA-ADD Tool ¡ Conclusions

Related Work ¡ ¡ Gatherer: A tool used in Greenstone for adding new collections. Related Work ¡ ¡ Gatherer: A tool used in Greenstone for adding new collections. l Tightly coupled with Greenstone l No knowledge of its ability to handle evolving schema and its content Database to OAI Provider: OAICat and OAI PMH 2 Perl l Doesn’t accommodate mapping process

Related Work (Cont. ) ¡ ¡ OCHRE proposed archaeo. ML to define DL collections Related Work (Cont. ) ¡ ¡ OCHRE proposed archaeo. ML to define DL collections l Doesn’t automate integration process l Ability to handle heterogeneous data is not known Altova Map. Force l Doesn’t support incremental mapping

Outline ¡ Introduction ¡ Related Work ¡ ETANA-ADD Tool ¡ Conclusions Outline ¡ Introduction ¡ Related Work ¡ ETANA-ADD Tool ¡ Conclusions

ETANA-ADD Tool ¡ An interactive tool for end users Partially automates the integration process ETANA-ADD Tool ¡ An interactive tool for end users Partially automates the integration process l Minimizes the need for technical skills l Reuses existing tools to some extent, by providing easy GUI wrapper on top of them l

ETANA-ADD Tool (Cont. ) ¡ The process flow involved while using the tool: DB ETANA-ADD Tool (Cont. ) ¡ The process flow involved while using the tool: DB 2 XML Schema Mapper OAI XML File Data Provider

An Integration Scenario ¡ Adding burial artifacts collection to ETANA DL l Perform DB An Integration Scenario ¡ Adding burial artifacts collection to ETANA DL l Perform DB 2 XML process using ETANA-ADD l Perform Schema Mapping l Publish Burial Collection as OAI Data Provider

Initial Screen with Umm el-Jimal Database Open Initial Screen with Umm el-Jimal Database Open

Tables corresponding to Burial artifact selected Tables corresponding to Burial artifact selected

Performing join on tables for burial artifact Performing join on tables for burial artifact

DB 2 XML Process Complete DB 2 XML Process Complete

Invoking Schema Mapper Invoking Schema Mapper

Opening Global Schema Opening Global Schema

Performing Mapping Process Performing Mapping Process

Extending Global Schema to Integrate Burial Artifact Extending Global Schema to Integrate Burial Artifact

Mapping Complete, Generating Global XML Collection Mapping Complete, Generating Global XML Collection

Complete Global XML Generation, Publishing as OAI Provider Complete Global XML Generation, Publishing as OAI Provider

Publishing as OAI Provider Publishing as OAI Provider

Results ¡ Integrated Ummm el-Jimal site with the help of ETANA-ADD l Bone, Burial, Results ¡ Integrated Ummm el-Jimal site with the help of ETANA-ADD l Bone, Burial, Locus, Miscellaneous Artifact, Pottery Bucket. No additional code written ¡ A comparison with earlier integrated site, Megiddo (7 artifact collections) ¡

Results (Cont. ) Umm el. Jimal Megiddo Additional LOC Required 0 1350 Human Hours Results (Cont. ) Umm el. Jimal Megiddo Additional LOC Required 0 1350 Human Hours 2 20

Outline ¡ Introduction ¡ Related Work ¡ ETANA-ADD Tool ¡ Conclusions Outline ¡ Introduction ¡ Related Work ¡ ETANA-ADD Tool ¡ Conclusions

Conclusions Target users: Administrators handling archaeology data (to be invited in fall for usability Conclusions Target users: Administrators handling archaeology data (to be invited in fall for usability studies) ¡ Developed ETANA-ADD to minimize technical expertise in integrating new archaeological collections ¡ Willing to share our software, which may be applicable to other domains with similar problems (i. e. , with evolving global schema) ¡

References ¡ ¡ Brainbridge, D. , Thompson, J. , and Witten, I. H. Assembling References ¡ ¡ Brainbridge, D. , Thompson, J. , and Witten, I. H. Assembling and enriching digital library collections. In Proc. JCDL 2003: 323 -334. Raghavan, A. , Vemuri, N. S. , Shen, R. , Gonçalves, M. A. , Fan, W. and Fox, E. A. Incremental, Semiautomatic, Mapping-Based Integration of Heterogeneous Collections into Archaeological Digital Libraries: Megiddo Case Study. In Proc. ECDL 2005: 139 -150. Ravindranathan, U. , Shen, R. , Gonçalves, M. A. , Fan, W. , Fox, E. A. , Flanagan, J. W. , ETANA-DL: a digital library for integrating heterogeneous archaeological data. In Proc. JCDL 2004: 76 -77. Suleman, H. Open Digital Libraries, Ph. D. Dissertation, Dept. Comp. Sci. , Virginia Tech, http: //scholar. lib. vt. edu/theses/available/etd 11222002 -155624, 2002.

Questions/Comments ? Questions/Comments ?