Скачать презентацию Un Modelo Formal para la Biblioteca Digital Flujos Скачать презентацию Un Modelo Formal para la Biblioteca Digital Flujos

8a15b84728b17dd0a65f6ba9727b925e.ppt

  • Количество слайдов: 127

Un Modelo Formal para la Biblioteca Digital: Flujos, Estructuras, Espacios, Escenarios, y Sociedades Universidad Un Modelo Formal para la Biblioteca Digital: Flujos, Estructuras, Espacios, Escenarios, y Sociedades Universidad de Buenos Aires May 19, 2004 Edward A. Fox fox@vt. edu http: //fox. cs. vt. edu

Acknowledgements (Selected) • Sponsors: ACM, Adobe, AOL, IBM, Microsoft, NASA, NLM, NSF, OCLC, SUN, Acknowledgements (Selected) • Sponsors: ACM, Adobe, AOL, IBM, Microsoft, NASA, NLM, NSF, OCLC, SUN, US Dept. of Ed. • VT Faculty/Staff: Debra Dudley, Weiguo Fan, Gail Mc. Millan, Manuel Perez, Naren Ramakrishnan, Layne Watson, … • VT Students: Yuxin Chen, Shahrooz Feizabadi, Marcos Gonçalves, Nithiwat Kampanya, S. H. Kim, Bing Liu, Paul Mather, Fernando Das Neves, Unni. Ravindranathan, Ryan Richardson, Rao Shen, Ricardo Torres, Wensi Xi, Baoping Zhang, Qinwei Zhu, …

ACKNOWLEDGEMENTS (NDLTD) • NDLTD Board of Directors, previous Steering Committee + other NDLTD committees; ACKNOWLEDGEMENTS (NDLTD) • NDLTD Board of Directors, previous Steering Committee + other NDLTD committees; those running Electronic Thesis & Dissertation (ETD) initiatives in universities, regions, countries • Helpful sponsorship by many organizations, especially Adobe (new initiative!), CONACy. T, DFG, FIPSE (US Dept. Education), IBM, Microsoft, NSF (IIS-9986089, 0086227, 0080748, 0325579; DUE 0121679, 0136690, 0121741, 0333601), OCLC, SOLINET, SUN, SURA, UNESCO, VTLS, many governments (Australia, Germany, India, …), … • Colleagues at Virginia Tech (faculty, staff, students), and collaborators at many universities • Slides included from: Vinod Chachra, Thom Hickey, Joan Lippincott, Gail Mc. Millan, Axel Plathe, Hussein Suleman, …

Other Collaborators (Selected) • • • Brazil: FUA, UFMG, UNICAMP Case Western Reserve University Other Collaborators (Selected) • • • Brazil: FUA, UFMG, UNICAMP Case Western Reserve University Emory, Notre Dame, Oregon State Germany: Univ. Oldenburg Mexico: UDLA (Puebla), Monterrey College of NJ, Hofstra, Penn State, Villanova University of Arizona University of Florida, Univ. of Illinois University of Virginia • Endowment: VTLS

UNESCO • Cláudio Menezes [cmenezes@unesco. org. uy] • Purpose: • Reinforce local solutions, commitments UNESCO • Cláudio Menezes [cmenezes@unesco. org. uy] • Purpose: • Reinforce local solutions, commitments • Emphasize: • • • ETD does not need many resources. Open source and free software is available. International cooperation can help. Local training is crucial. => Inclusion of ETD in practices, processes => Schedule for ETD projects

Part 2 The 5 S Model: A Formal Model for the Digital Library Part 2 The 5 S Model: A Formal Model for the Digital Library

Motivation • DLs are not benefiting from formal theories as have other CS fields: Motivation • DLs are not benefiting from formal theories as have other CS fields: DB, IR, PL, etc. • DL construction: difficult, ad-hoc, lacking support for tailoring/customization • Conceptual modeling, requirements analysis, and methodological approaches are rarely supported in DL development. • Lack of specific DL models, formalisms, languages

5 S Layers Societies Scenarios Spaces Structures Streams 5 S Layers Societies Scenarios Spaces Structures Streams

Definition: Digital Libraries are complex systems that • • • help satisfy info needs Definition: Digital Libraries are complex systems that • • • help satisfy info needs of users (societies) provide info services (scenarios) organize info in usable ways (structures) present info in usable ways (spaces) communicate info with users (streams)

DL Student Research: Gonçalves • 5 S as a basis for developing digital libraries DL Student Research: Gonçalves • 5 S as a basis for developing digital libraries • Theory • Syntax, Semantics; Definitions, Relationships • Specification of requirements • Generation of systems • Quality

DL Services/Activities Taxonomy (Gonçalves) Infrastructure Services Repository-Building Creational Preservational Acquiring Cataloging Crawling (focused) Describing DL Services/Activities Taxonomy (Gonçalves) Infrastructure Services Repository-Building Creational Preservational Acquiring Cataloging Crawling (focused) Describing Digitizing Federating Harvesting Purchasing Submitting Conserving Converting Copying/Replicating Emulating Renewing Translating (format) Add Value Annotating Classifying Clustering Evaluating Extracting Indexing Measuring Publicizing Rating Reviewing (peer) Surveying Translating (language) Information Satisfaction Services Browsing Collaborating Customizing Filtering Providing access Recommending Requesting Searching Visualizing

Defining Quality in Digital Libraries DL Concept Dimensions of Quality Digital object Metadata specification Defining Quality in Digital Libraries DL Concept Dimensions of Quality Digital object Metadata specification Collection Catalog Repository Structures for Navigation Services Accessibility Pertinence (*) Preservability (*) Relevance Similarity Significance Timeliness (*) Accuracy Completeness Conformance Completeness Impact Factor Completeness Consistency Navigability (*) Composability Efficiency Effectiveness Extensibility Reusability Reliability

5 S Model: Examples, Objectives Models Examples Objectives Stream Text; video; audio; image Describes 5 S Model: Examples, Objectives Models Examples Objectives Stream Text; video; audio; image Describes properties of the DL content such as encoding and language for textual material or particular forms of multimedia data Structures Collection; catalog; hypertext; document; metadata; organization tools Specifies organizational aspects of the DL content Spatial Measure; measurable, topological, vector, probabilistic Defines logical and presentational views of several DL components Scenarios Searching, browsing, recommending, Details the behavior of DL services Societies Service managers, learners, Teachers, etc. Defines managers, responsible for running DL services; actors, that use those services; and relationships among them

Document Models, Representations, and Accesses • Doc = stream + structure + use-scenario; hybrid Document Models, Representations, and Accesses • Doc = stream + structure + use-scenario; hybrid (paper/electronic), digital only • Multilingual: content, summary, metadata • Multimedia: structure, quality (o. S), search • Structured: MARC, SGML, by user: MVD • Distributed collection: Kleisli, CIMI, Z 39. 50 • Federated search: collecting, picking site(s), parallel search / fall-back, fusing results • Access: IPR, payment, security, scenarios

Architectural Issues • Internet middleware • Independent system / part of federation • Decompositions Architectural Issues • Internet middleware • Independent system / part of federation • Decompositions vary • search engine, browser, DBMS, MM support • repository, handle server, client • information resources + mediators, bus or agent collection + client with workspace/environment • Metrics: e. g. , for federated search

Standards • Protocols/federation • Z 39. 50, CIMI • Dienst, NCSTRL • OAI protocol Standards • Protocols/federation • Z 39. 50, CIMI • Dienst, NCSTRL • OAI protocol • Metadata • TEI: inline, detailed (structure in stream) • MARC: two-level, fine-grained • Dublin Core: high-level, 15 elements • RDF: describing resources/collections, annotation • OAMS -> DC and others used in OAI

Digital Library Courseware • http: //ei. cs. vt. edu/~dlib/ • WWW pages or large Digital Library Courseware • http: //ei. cs. vt. edu/~dlib/ • WWW pages or large PDF copy files • Online quizzes based on book by Michael Lesk (Morgan Kaufmann Publishers) • Contents based on book, with several other popular topics added (e. g. , agents) • Separate pages to supplement: Definitions, Resources (People, Projects), and References • UNC-CH proposal; book plans for 2005

Topical Outline - Foundations • • • Early visions Definitions Resources References Projects Topical Outline - Foundations • • • Early visions Definitions Resources References Projects

Topical Outline – IR Areas • • Search, Retrieval, Resource Discovery Information storage and Topical Outline – IR Areas • • Search, Retrieval, Resource Discovery Information storage and retrieval Boolean vs. natural language Search engines Indexing, phrases, thesauri, concepts Federated search and harvesting, OAI Integrating links and ratings Crawlers, spiders, metasearch, fusion • Details following – Li Wang indep. study

What is a Crawler? • • • A Program An Important Module For Web What is a Crawler? • • • A Program An Important Module For Web Search Engine Crawls On The Web According To Its Algorithm Retrieves Web Pages Gets Useful Information Stores The Web Pages For Future Refining

Jobs For Threads Get A New URL From Buffer Put New URLs Into Buffer Jobs For Threads Get A New URL From Buffer Put New URLs Into Buffer Contact The Server For File Type Parse The Web Page Download The File

Advanced Functions • Backward Linkage Information Collector A Web Page Advanced Functions • Backward Linkage Information Collector A Web Page

Topical Outline - Multimedia • • • Multiple media types, representations Text, audio, image, Topical Outline - Multimedia • • • Multiple media types, representations Text, audio, image, video, graphics, animation Capture, digitization, standards, interchange Compression, content-based retrieval Playback (Real), SMIL, Qo. S JPEG, MPEG (and versions)

Topical Outline - Architectures • • • Distributed, centralized Modular, componentized Bus (Info. Bus), Topical Outline - Architectures • • • Distributed, centralized Modular, componentized Bus (Info. Bus), hierarchical, star Mediators, wrappers (TSIMMIS) Light weight protocols Architecture of OAI and XOAI

Topical Outline – Interfaces • • • Taxonomy of interface components Workflow Visualization Environments Topical Outline – Interfaces • • • Taxonomy of interface components Workflow Visualization Environments Design Usability testing

Topical Outline – Metadata • • MARC Dublin Core RDF IMS OAI (Open Archives Topical Outline – Metadata • • MARC Dublin Core RDF IMS OAI (Open Archives Initiative) Crosswalks, mappings Ontologies Topics maps, concept maps

Topical Outline – Epub, SGML, XML • • Authoring Rendering, presenting Structure Tagging, Markup, Topical Outline – Epub, SGML, XML • • Authoring Rendering, presenting Structure Tagging, Markup, DOM Semi-structured information Dual-publishing, e. Books Styles (XSL, XSLT) Structure queries

Topical Outline – Databases • • • Extending database technology Structured and unstructured info Topical Outline – Databases • • • Extending database technology Structured and unstructured info Multimedia databases Link databases Performance Replicated storage, I 2 -DSI (details following)

Topical Outline – Agents • • • Protocols Knowledge interchange Negotiation, registries Distributed issues Topical Outline – Agents • • • Protocols Knowledge interchange Negotiation, registries Distributed issues Ontologies (standard upper) Webbots (automatic indexing)

Topical Outline – Economics • E-commerce • Sustainability • Preservation and archiving • DLF, Topical Outline – Economics • E-commerce • Sustainability • Preservation and archiving • DLF, Besser, Lorie, Gladney • Self-archiving • Open collections • Economic models, business plans

Topical Outline – IPR • • Intellectual property rights (IPR) Legal issues Terms and Topical Outline – IPR • • Intellectual property rights (IPR) Legal issues Terms and conditions Copyright Patents, trademarks Distributed rights management Security

Topical Outline – Social Issues • • • Cooperation, collaboration Annotation, ratings Digital divide Topical Outline – Social Issues • • • Cooperation, collaboration Annotation, ratings Digital divide Educational applications Cultural heritage Museums (AMICO) Organizational acceptance Personalization Internationalization

5 S Model: Definitions 5 S Definition Streams Sequences of elements of an arbitrary 5 S Model: Definitions 5 S Definition Streams Sequences of elements of an arbitrary type Structures Labeled directed graphs Spatial Sets and operations on those sets Scenarios Sequences of events that modify states of a computation in order to accomplish some functional requirement. Societies Sets of communities and relationships among them

Overview of 5 S and DL formal definitions and compositions (Gonçalves) Overview of 5 S and DL formal definitions and compositions (Gonçalves)

Semantic relationships among DL concepts: Partial concept map (Gonçalves) Semantic relationships among DL concepts: Partial concept map (Gonçalves)

5 S Framework and DL Development (Gonçalves) 5 S Framework and DL Development (Gonçalves)

5 SLGen: Automatic DL Generation 5 SLGen: Automatic DL Generation

MARIAN DL Generation Component Pool 5 SL Design XML PARSERS: DOM, SAX MARIAN API MARIAN DL Generation Component Pool 5 SL Design XML PARSERS: DOM, SAX MARIAN API MARIAN Digital Library Generator Class managers Indexing Classes Loader User interfaces Resource Manager Configuration and Processing Classes

Challenges with Approach • The designer should know the 5 S theory very well Challenges with Approach • The designer should know the 5 S theory very well and be very familiar with the syntax and semantics of 5 SL to be able to write correct 5 SL files. • It is difficult to get the big picture of a digital library just from a textual 5 SL file.

5 SGraph: A DL Modeling Tool (Qinwei Zhu MS Thesis) • Overall objective of 5 SGraph: A DL Modeling Tool (Qinwei Zhu MS Thesis) • Overall objective of 5 SGraph: Help users model their own instances of a digital library (DL) in the 5 S language (5 SL). A simple modeling process which enables rapid generation of digital libraries is needed. • • Support non-expert users. Speed-up development process. Increase the quality of final product.

Goals of 5 SGraph • To help digital library designers understand the 5 S Goals of 5 SGraph • To help digital library designers understand the 5 S model quickly and easily • To help digital library designers build their own digital libraries without difficulty • To help digital library designers transform their models into 5 SL files automatically • To help digital library designers understand, maintain, and upgrade existing digital library models conveniently

5 SGraph How does 5 SGraph work? • 5 SGraph loads and displays a 5 SGraph How does 5 SGraph work? • 5 SGraph loads and displays a metamodel in a structured toolbox. • The structured editor of 5 SGraph provides a topdown visual environment for the DL designer. • 5 SGraph produces correct 5 SL files according to the visual model built by the designer.

Overview of 5 SGraph Workspace (instance model) Structured toolbox (metamodel) Overview of 5 SGraph Workspace (instance model) Structured toolbox (metamodel)

Overview of 5 SGraph(cont. ) • Structured toolbox • Show the available concepts in Overview of 5 SGraph(cont. ) • Structured toolbox • Show the available concepts in metamodel and the relationships between those concepts. • Visualize the Metamodel • Concepts in structured toolbox can be added into workspace. • Workspace • Visualize the model • The place where the user creates his/her model.

Visualization Features • The structured toolbox • Visualization of the metamodel • Visual components Visualization Features • The structured toolbox • Visualization of the metamodel • Visual components that can be added • Truncated display of trees • Node-link representation • Deep-node problem • Icons • Type/Instance relationship • Cardinality

Component Reuse • Components can be loaded/saved. • Load and save sub-trees • Component Component Reuse • Components can be loaded/saved. • Load and save sub-trees • Component reuse saves time and effort. • Full reuse from component pool • Partial reuse: adapting components

Functionalities of 5 SGraph • • Load/Close a metamodel Load/Save/Close a model Explore the Functionalities of 5 SGraph • • Load/Close a metamodel Load/Save/Close a model Explore the structure of metamodel/model Add concepts from metamodel to model Delete concepts from model Change the properties of concepts Load/Save a existing concept Specifying inter-model constraints

Open/Close metamodel Open/Close metamodel

Load/Save/Close a model Load/Save/Close a model

Explore the structure of metamodel and model Explore the structure of metamodel and model

Add a concept to user model • Top-down: Before you want to add a Add a concept to user model • Top-down: Before you want to add a concept, make sure you have added its parent. • You can only add a concept to its parent node • Make sure the parent node is chosen before you add a new concept. • If the highlight color is blue, the concept has satisfied all the requirements and can be added. • If the highlight color is yellow, click the parent node in workspace and then add the concept.

Add a concept to user model(cont. ) • Double-click the concept in toolbox or Add a concept to user model(cont. ) • Double-click the concept in toolbox or • Right-click and choose the item in the popup menu

Add a concept to user model(cont. ) Add a concept to user model(cont. )

Add a concept to user model(cont. ) Add a concept to user model(cont. )

Add a concept to user model(cont. ) Add a concept to user model(cont. )

Delete a concept • If the concept has no child concepts, click the concept Delete a concept • If the concept has no child concepts, click the concept first, then press “Delete” key. • If the concept has child concepts, delete the child concept first, and then delete this concept.

Change the name and properties of concepts Change the name and properties of concepts

Change the name and properties of concepts (cont. ) Change the name and properties of concepts (cont. )

Change the name and properties of concepts (cont. ) Change the name and properties of concepts (cont. )

Load/Save concepts Load/Save concepts

Semantic Constraints • There are inherent semantic constraints in the hierarchical structure of the Semantic Constraints • There are inherent semantic constraints in the hierarchical structure of the 5 S model. • 5 SGraph maintains the constraints and enforces these constraints over the instance model to ensure correctness.

Example 1 (Constraint Enforcement) • An actor can only participate in the services that Example 1 (Constraint Enforcement) • An actor can only participate in the services that have been defined in the Scenario Model.

Example 2 (Constraint Enforcement) • A catalog has descriptive metadata for digital objects in Example 2 (Constraint Enforcement) • A catalog has descriptive metadata for digital objects in a specific collection. • Therefore, a catalog must have a 1: 1 relationship with an existing collection. • Thus, a catalog is not independent.

The Preliminary Test of 5 SGraph • Research Questions • Does the tool help The Preliminary Test of 5 SGraph • Research Questions • Does the tool help users understand use the 5 S model to build their own digital libraries? • Does the tool help users efficiently describe digital library models in the 5 SL language? • Are users satisfied with the tool?

The Preliminary Test of 5 SGraph: Experimental Design • Three tasks 1. Build a The Preliminary Test of 5 SGraph: Experimental Design • Three tasks 1. Build a simple digital library using existing components. 2. Complete the partial model for CITIDEL. 3. Build a model for NDLTD from scratch. • Three measures • Effectiveness • Efficiency • User satisfaction • 17 subjects

Measures • Effectiveness • • Completion rate Goal achievement • Efficiency • • Task Measures • Effectiveness • • Completion rate Goal achievement • Efficiency • • Task completion time Closeness to expertise: minimum task time divided by task time • Satisfaction • Subjective rating

Test Results Task 1 Task 2 Task 3 Completion Rate (%) 100 100 Mean Test Results Task 1 Task 2 Task 3 Completion Rate (%) 100 100 Mean Task Time (min) 11. 3 11. 4 15. 1 Mean Closeness to Expertise 0. 483 0. 752 0. 712 Mean Goal Achievement (%) 97. 4 98. 2

Satisfaction and Usefulness • The average rating of user satisfaction is 91%. • The Satisfaction and Usefulness • The average rating of user satisfaction is 91%. • The average rating of usefulness of the tool is 92%. • Statistical analysis shows that the mean value of post-understanding of the 5 S model is significantly greater than that of preunderstanding.

Educational Use Educational Use

Learnability Learnability

Semantic Modeling of Digital Library with Concept Maps • Customized “plugin” tool to model Semantic Modeling of Digital Library with Concept Maps • Customized “plugin” tool to model scenarios and societies • Tools with common principles, abstractions, graphical notations, and operations • Solution: Concepts Maps • Conceptual tools for organizing knowledge and representation

Conclusions • Presented a domain specific visual modeling tool for DLs. • Evaluated the Conclusions • Presented a domain specific visual modeling tool for DLs. • Evaluated the tool and proved efficiency, effectiveness, and learnability. • Built new tools based on concept maps for scenario and societies modeling.

Future work on 5 SGraph • Integration of tools • Further usability studies with Future work on 5 SGraph • Integration of tools • Further usability studies with “digital librarians” • Usiing the tools as educational aids for teaching about digital libraries

Motivating Problems – Toward 5 SLGen (MS Thesis of Rohit Kelapure) • Lack of Motivating Problems – Toward 5 SLGen (MS Thesis of Rohit Kelapure) • Lack of general models for Digital Libraries (DLs) • Little focus on simplifying the process of modeling and building DLs • Divergent DL architectures • • Monolithic: Tightly integrated and generally inflexible Componentized: A network of interoperable components aggregated without a design methodology

Problems (contd. ) • Lack of DL-specific modeling languages, software toolkits, prototyping and CASE Problems (contd. ) • Lack of DL-specific modeling languages, software toolkits, prototyping and CASE tools • Lack of a scenario-based requirements analysis and design approach to DLs • Implication: Problems with • Interoperability • Customizability

Approach • Based on the formal 5 S theory • Streams, Structures, Spaces, Scenarios Approach • Based on the formal 5 S theory • Streams, Structures, Spaces, Scenarios and Societies • Use of • Domain-specific declarative languages (5 SL) • Scenario-based requirements analysis and design • Componentized architectures • Automatic transformations/mappings from models to code • Special attention paid to issues of flexibility, reusability, and extensibility

Approach: 5 SLGen • 5 SLGen is a new generic digital library generator. • Approach: 5 SLGen • 5 SLGen is a new generic digital library generator. • It has been developed, implemented, and deployed in several applications. • 5 SLGen yields implementations of digital library services from models of DL “societies” and “scenarios” (and from the other “Ss”).

5 S Model/ 5 SL Model Objective Primitives in 5 SL Streams Describes properties 5 S Model/ 5 SL Model Objective Primitives in 5 SL Streams Describes properties of the DL content text, audio, video, pictures, … Structures Specifies organizational aspects of the DL content digital object, metadata schema, collection, … Spaces Defines logical properties and presentational views of a DL vector, probabilistic, boolean, … Scenarios Details the behavior of DL services service, event, message, condition, action, state, … Societies Defines managers, (responsible for Service Managers, actors (e. g. , running DL services); actors (those learners, teachers, naïve users) who use services) and their relationships

5 SLGen: Model 5 SLGen: Model

5 SLSocieties Model • Service Manager characteristics: • • Name, attributes, operations, type, visibility 5 SLSocieties Model • Service Manager characteristics: • • Name, attributes, operations, type, visibility Service Manager relationships: • Associations, generalizations (extends), dependencies

5 SLScenarios Model 5 SLScenarios Model

Overview Architecture for DL Modeling and Generation 5 S Meta Model DL Expert component Overview Architecture for DL Modeling and Generation 5 S Meta Model DL Expert component 5 SGraph 5 SL DL Models DL Designer Practitioner Teacher pool ODLSearch, ODLBrowse, ODLRate, ODLReview, ……. Researcher 5 SLGen Tailored DL Services

DL Designer 5 SLGen: Architecture 5 SLSocieties Model XMI Serialized 5 SLSocieties model ODL DL Designer 5 SLGen: Architecture 5 SLSocieties Model XMI Serialized 5 SLSocieties model ODL Browse Wrapping Synthesized Statechart 5 SLGen import ODL Search Wrapping Java Scenarios converter Societies converter Component Pool Java 5 SLScenarios Model Java Classes Model Java Controller Class 5 SFramework import JSP User Interface View DLServices Implementation Web Designer

Societies-converter: Workflow DL Designer 5 SLSocieties Model Societies -converter Java Representation. JDOM Transform Java Societies-converter: Workflow DL Designer 5 SLSocieties Model Societies -converter Java Representation. JDOM Transform Java Mapper Java Classes Model XMI Serializer XMI: Class Model Xmi 2 Java 5 SLGen: Architecture

Scenarios-Converter: Workflow 5 SLScenarios Model DL Designer JDOM Transform Scenario Synthesizer Scenarios-converter Java Controller Scenarios-Converter: Workflow 5 SLScenarios Model DL Designer JDOM Transform Scenario Synthesizer Scenarios-converter Java Controller Class State Machine Compiler State-design pattern Synthesized Statechart

Relevance Feedback Search Service UML Sequence Diagram Event seq. no. = 3 5 SLScenari Relevance Feedback Search Service UML Sequence Diagram Event seq. no. = 3 5 SLScenari os instance

Scenarios-converter: Scenario-Synthesis Scenarios-converter: Scenario-Synthesis

Scenarios-converter: Scenario-Synthesis (contd. ) Scenarios-converter: Scenario-Synthesis (contd. )

Synthesized-Statechart Component statecharts Synthesized-Statechart Component statecharts

Generated DLs • Union Catalog • Simple DL with maximum reuse • 2 components Generated DLs • Union Catalog • Simple DL with maximum reuse • 2 components used: Search and Browse • CITIDEL, including VIADUCT • Aggregates all the 5 SLSocieties and 5 SLScenarios models for its elementary services

Generated DL Services • CITIDEL: Relevance Feedback Search Service • Demonstrate extensibility with the Generated DL Services • CITIDEL: Relevance Feedback Search Service • Demonstrate extensibility with the ODL Search component • CITIDEL: Profile Based Filtering Service • Demonstrate reusability with the ODL Browse component • CITIDEL: Multi-Classification Browsing Service • Generate complex services without any component reuse • CITIDEL: Binding Service • Complete the set of CITIDEL services

Profile Based Filtering (PBF) Service 5 SFramework ODL-Browse componenent Controlle r Model View Profile Based Filtering (PBF) Service 5 SFramework ODL-Browse componenent Controlle r Model View

Conclusion • Introduced a scenario-based approach to the generation of componentized DLs • Applied Conclusion • Introduced a scenario-based approach to the generation of componentized DLs • Applied the 5 SFramework for generation of DLs • Partially validated theory of 5 S • Demonstrated that complex DLs can be built on the basis of a formal theory • Adherence to open standards (OAI-PMH, ODL, XMI, UML) and established design patterns (MVC, GOF’s State) ensures relevance and extensibility of our work.

Future Work • Integration of 5 SLGen with 5 SGraph • Improvements to 5 Future Work • Integration of 5 SLGen with 5 SGraph • Improvements to 5 SFramework architecture • • • Scalability of the generated DLs and DL services Automated construction of user-interfaces with statecharts Support for transaction scoping and error handling Web services support Incorporating the u. Portal framework • Model Validation • Personalization of the 5 S approach using PIPE

DL Student Research: Torres • • Search in collections of fish images using combination DL Student Research: Torres • • Search in collections of fish images using combination of image properties (CBIR) and textual descriptions

Textual information retrieval Query on Google using Sunset and Rio de Janeiro Query result Textual information retrieval Query on Google using Sunset and Rio de Janeiro Query result

Content Based Information Retrieval Content Based Information Retrieval

Torres: Visualizations Concentric Rings Pattern Spiral Pattern Torres: Visualizations Concentric Rings Pattern Spiral Pattern

DL Student Research: Shen • 5 S and component architecture to allow handling of DL Student Research: Shen • 5 S and component architecture to allow handling of very complex DL applications: archaeology • Information visualization, clustering • Mappings across streams, structure, spaces

Case Study (Archaeology): ETANA • NSF ITR with CWRU (and Vanderbilt …) • Faster Case Study (Archaeology): ETANA • NSF ITR with CWRU (and Vanderbilt …) • Faster DL development • for complex application domains, • with suitable tailoring • Approach • ODL – pool of components • 5 S – theory-based generation of systems

ETANA Website ETANA Website

Lahav Website Lahav Website

Megiddo Opening Screen Megiddo Opening Screen

Locus Screen: Pictures View all Locus Screen: Pictures View all

Area Screen: Distribution of Artifacts Area Screen: Distribution of Artifacts

ETANA-DL Website ETANA-DL Website

Archaeology DL – Approach • Solve the following DL problems: • interoperability, • making Archaeology DL – Approach • Solve the following DL problems: • interoperability, • making primary data available, • data preservation • Modeling archaeological information systems • using 5 S theory to design system and services • Rapidly prototyping DLs that handle • heterogeneous archaeological data using • componentized frameworks

ETANA-DL Schema Design Owner Locus Collection Partition ETANA-DL Object ID Bone Subpartition Seed Figurine ETANA-DL Schema Design Owner Locus Collection Partition ETANA-DL Object ID Bone Subpartition Seed Figurine Container Animal Name Dimensions Count Species Description …… ……

Data Mapping Data Mapping

ETANA-DL Architecture Dig. Kit Users Services Dig. Base Data ETANA-DL Union Services Users ETANA-DL Architecture Dig. Kit Users Services Dig. Base Data ETANA-DL Union Services Users

ETANA-DL Architecture Dig. Base and Dig. Kit Lahav Nimrin Umayri Hisban Megiddo Jalul … ETANA-DL Architecture Dig. Base and Dig. Kit Lahav Nimrin Umayri Hisban Megiddo Jalul … D A T A B A S E W R A P P E R S Search U S E R Browse Recommend ETANA-DL UNION CATALOG Note Personalize Review Visualizations Archaeology Specific I N T E R F A C E New Sites Work in progress

Architecture Dig. Base XOAI Search Component In de OAI Union Catalog x Web Interface Architecture Dig. Base XOAI Search Component In de OAI Union Catalog x Web Interface Browse DB de In Configure OAI Data Provider DB Data Mapping Component Inverted Files x Dig. Kit Browse Component Services DB Archaeological Site ETANA-DL Other ETANA-DL Services XOAI

Searching – Search Results Searching – Search Results

Searching – Advanced Search Searching – Advanced Search

Searching – Advanced Search Results Searching – Advanced Search Results

Review of Gonçalves Achievements in Past Year • Book Chapters 1. 2. • Fox, Review of Gonçalves Achievements in Past Year • Book Chapters 1. 2. • Fox, E. A. , Gonçalves, M. A. , Luo, M. , Chen, Y. , Krowne, A. , Zhang, B. , Mc. Devitt, , K. Pérez-Quiñones, M. , Cassel, L. N. Harvesting: Broadening the Field of Distributed Information Retrieval. In Multimedia Distributed Information Retrieval, eds. Fabio Crestani, Mark Sanderson, and Jamie Callan, 2003. Fox, E. , Mc. Millan, G. , Suleman, H. , Gonçalves, M. , Networked Digital Library of Theses and Dissertations. Invited chapter for “Digital Libraries: Policy, Planning, and Practice”, eds. Judith Andrews and Derek Law, Ashgate Publishing, 2003 Journal papers 1. 2. 3. 4. 5 S TOIS paper (April 2004, issue) S. Perugini, M. A. Gonçalves, and E. A. Fox. A Connection-Centric Survey of Recommender Systems Research. Journal of Intelligent Information Systems, Jun, 2004. Zhu, Q. , Gonçalves, M. A. , Fox, E. A. . 5 SGraph: A Domain-Specific Visual Modeling Tool for Digital Libraries. Journal of the American Society for Information Science and Technology, submitted 2003, in revision Baoping Zhang, Marcos Andre Goncalves, Yuxin Chen, Edward A. Fox, and Pavel Calado, "Combining Support Vector Machines and Structural Rules for Effective Filtering of OAI-Based Repositories", submitted to Journal of Digital Libraries (Springer Verlag) Special Issue on Asian Digital Libraries, 2004

 • Conference papers 1. Pável P. Calado, Marcos André Gonçalves, Edward A. Fox, • Conference papers 1. Pável P. Calado, Marcos André Gonçalves, Edward A. Fox, Berthier Ribeiro-Neto, Alberto H. F. Laender, Altigran S. da Silva, Davi C. Reis, Pablo A. Roberto, Monique V. Vieira, and Juliano P. Lage. The Web-DL Environment for Building Digital Libraries from the Web. JCDL'2003, Third Joint ACM / IEEE-CS Joint Conference on Digital Libraries, May 27 -31, 2003, Houston. Marcos André Gonçalves, Ganesh Panchanathan, Unnikrishnan Ravindranathan, Aaron Krowne, Edward A. Fox, Filip Jagodzinski, and Lillian Cassel. The XML Log Standard for Digital Libraries: Analysis, Evolution, and Deployment. Proc. JCDL'2003, Third Joint ACM / IEEE-CS Joint Conference on Digital Libraries, May 27 -31, 2003, Houston. Qinwei Zhu, Marcos André Gonçalves, Rao Shen, Lillian Cassel, Edward A. Fox. Visual Semantic Modeling of Digital Libraries. ECDL'2003, 7 th European Conference on Research and Advanced Technology for Digital Libraries, 17 -22 August, 2003, Trondheim, Norway. Rohit Kelapure, Marcos André Gonçalves, Edward A. Fox. Scenario-Based Generation of Digital Library Services. ECDL'2003, 7 th European Conference on Research and Advanced Technology for Digital Libraries, 17 -22 August, Trondheim, Norway Marco Cristo, Pavel Calado, Edleno Moura, Nivio Ziviani, Berthier Ribeiro-Neto, and Marcos André Gonçalves. Combining Link-Based and Content-Based Methods for Web Document Classification. CIKM 2003, 3 -8 November, New Orleans, Louisiana, USA, 2003. Baoping Zhang, Marcos Andre Goncalves, and Edward A. Fox. An OAI-based Filtering Service for CITIDEL from NDLTD. ICADL 2003, 6 th International Conference of Asian Digital Libraries, 8 -11 December, Kuala Lumpur, Malaysia, 2003 U. Ravindranathan, R. Shen, M. A. Goncalves, W. Fan, E. A. Fox, and J. W. Flanagan. ETANA -DL: A Digital Library for Integrated Handling of Heterogeneous Archaeological Data. To be presented at ACM-IEEE Joint Conference on Digital Libraries (JCDL 2004), Tucson, AZ, June 7 -11, 2004. 2. 3. 4. 5. 6. 7.

 • Conference papers 8. U. Ravindranathan, R. Shen, M. A. Goncalves, W. Fan, • Conference papers 8. U. Ravindranathan, R. Shen, M. A. Goncalves, W. Fan, E. A. Fox, and J. W. Flanagan. ETANA-DL: A Digital Library for Integrated Handling of Heterogeneous Archaeological Data. To be presented at ACM-IEEE Joint Conference on Digital Libraries (JCDL 2004), Tucson, AZ, June 7 -11, 2004. M. A. Goncalves, E. A. Fox, A. Krowne, P. Calado, A. H. F. Laender, A. S. da Silva, and B. Ribeiro-Neto. The Effectiveness of Automatically Structured Queries in Digital Libraries. To be presented at ACM-IEEE Joint Conference on Digital Libraries (JCDL 2004), Tucson, AZ, June 7 -11, 2004. Alberto H. F. Laender, M. A. Goncalves, Pablo A. Roberto. BDBComp: Building a Digital Library for the Brazilian Computer Science Community. To be presented at ACM-IEEE Joint Conference on Digital Libraries (JCDL 2004), Tucson, AZ, June 7 -11, 2004. U. Ravindranathan, R. Shen, M. A. Goncalves, W. Fan, E. A. Fox, and J. W. Flanagan. Prototyping Digital Libraries Handling Heterogeneous Data Sources - The ETANA-DL Case Study. European Conference on Digital Libraries (ECDL 2004), Bath, UK, September 12 -17, 2004. (submitted) 9. 10. 11. q Other publications 1. 2. 3. 4. 5. R. da S. Torres, C. B. Medeiros, M. A. Goncalves, and E. A. Fox. An OAI-based Digital Library Framework for Biodiversity Information Systems. Department of Computer Science, Virginia Tech, Technical Report No. TR-04 -01, 2004. R. da S. Torres, C. B. Medeiros, M. A. Goncalves, and E. A. Fox. An OAI Compliant Content-Based Image Search Component. Demo to be presented at ACM-IEEE Joint Conference on Digital Libraries (JCDL 2004), Tucson, AZ, June 7 -11, 2004. R. da S. Torres, C. B. Medeiros, Renata Q. Dividino, Mauricio A. Figueiredo, M. A. Goncalves, E. A. Fox, and R. Richardson. Using Digital Library Components for Biodiversity Systems. Poster to be presented at ACM-IEEE Joint Conference on Digital Libraries (JCDL 2004), Tucson, AZ, June 7 -11, 2004. U. Ravindranathan, R. Shen, M. A. Goncalves, W. Fan, E. A. Fox, and J. W. Flanagan. ETANA-DL: Managing Complex Information Applications – An Archaeology Digital Library. Demo to be presented at ACM-IEEE Joint Conference on Digital Libraries (JCDL 2004), Tucson, AZ, June 7 -11, 2004. Qinwei Zhu, Marcos André Gonçalves, E. Fox. 5 SGraph Demo: A Graphical Modeling Tool for Digital Libraries. Proc. JCDL'2003, Third Joint ACM / IEEE-CS Joint Conference on Digital Libraries, May 27 -31, 2003, Houston.

Proposed Outline of Dissertation (Marcos André Gonçalves) • • • Chapter 1 – Introduction Proposed Outline of Dissertation (Marcos André Gonçalves) • • • Chapter 1 – Introduction and Motivation Chapter 2 – Background and Related Work Chapter 3 – Streams, Structures, Spaces, Scenarios and Societies: the 5 S Formal Model for Digital Libraries Chapter 4 – Towards a Digital Library Theory: A Formal Digital Library Ontology based on 5 S Chapter 5 – Applications of the 5 S Model/Ontology • • 5. 1 Declarative Specification of DLs: the 5 S Language 5. 2 Semantic Visual Modeling of DLs: the 5 SGraph Tool 5. 3 (Semi-) Automatic Generation of Componentized DLs: The 5 SGen Tool 5. 4 Evaluating DLs: The XML Log Standard for DLs 5. 5 Formally comparing Architectures: Fedora and Buckets (time permitting) Chapter 6 – Defining Quality in Digital Libraries Chapter 7 – Conclusions and Future Work Appendix 1 - Mathematical Preliminaries

Questions/Discussion? Questions/Discussion?