Скачать презентацию Perspectives of information science in the digital age Скачать презентацию Perspectives of information science in the digital age

2faf28abee092cd501685862c126c8e8.ppt

  • Количество слайдов: 49

Perspectives of information science in the digital age Tefko Saracevic, Ph. D Rutgers University Perspectives of information science in the digital age Tefko Saracevic, Ph. D Rutgers University USA http: //www. scils. rutgers. edu/~tefko © Tefko Saracevic, Rutgers University

Information science: “the science dealing with the efficient collection, storage, and retrieval of information” Information science: “the science dealing with the efficient collection, storage, and retrieval of information” Webster © Tefko Saracevic, Rutgers University 2

Organization 1. 2. 3. 4. 5. 6. 7. 8. Big picture – problems, solutions, Organization 1. 2. 3. 4. 5. 6. 7. 8. Big picture – problems, solutions, social place Underlying stuff – theories, phenomena Structure – what is inside stuff Systems stuff – information retrieval, relevance People stuff – users, use, seeking, context Alliances, competition – the OUCH stuff Digital libraries – whose are they anyhow? Conclusions – Will we have a field stuff? © Tefko Saracevic, Rutgers University 3

1. The big picture Problems addressed Bit of history: Vannevar Bush (1945): l Problem: 1. The big picture Problems addressed Bit of history: Vannevar Bush (1945): l Problem: “. . . the massive task of making more accessible of a bewildering store of knowledge. ” l still with us & growing Basic problem of information science: Information explosion today: PLUS Communication explosion © Tefko Saracevic, Rutgers University 4

… solution Bush: “Memex. . . association of ideas. . . duplicate mental processes … solution Bush: “Memex. . . association of ideas. . . duplicate mental processes artificially. ” Technological fix to problem Still with us: technological determinant l tail that wags the dog © Tefko Saracevic, Rutgers University 5

Problems & solutions: SOCIAL CONTEXT Professional practice AND scientific inquiry related to: l l Problems & solutions: SOCIAL CONTEXT Professional practice AND scientific inquiry related to: l l Effective communication of knowledge records ‘literature’ - among humans in the context of social, organizational, & individual need for and use of information. “modeling the world of publications with a practical goal of being able to deliver their content to inquirers [users] on demand. ” White & Mc. Cain Taking advantage of modern information technology © Tefko Saracevic, Rutgers University 6

Elaboration Knowledge records = texts, sounds, images, multimedia. . . literature in given domains Elaboration Knowledge records = texts, sounds, images, multimedia. . . literature in given domains l l content-bearing structures symbol manipulations are content neutral - infrastructural to inf. sc. Communication = human-computer-literature interface l study of inf. science is the interface between people & literatures Inf. need, seeking, and use = reason d'être Effectiveness = relevance, utility © Tefko Saracevic, Rutgers University 7

General characteristics - leitmotifs ¶ Intedisciplinarity - relations with a number of fields · General characteristics - leitmotifs ¶ Intedisciplinarity - relations with a number of fields · Technological imperative - driving force, as in many modern fields ¸ Information society - social context and role in evolution - shared with many fields © Tefko Saracevic, Rutgers University 8

2. Underlying stuff What is information? Intuitively well understood, but formally? ? l Several 2. Underlying stuff What is information? Intuitively well understood, but formally? ? l Several viewpoints, models Shannon: source-channel-destination l grapes into wine Cognitive: changes in cognitive structures l water into wine Social: context is the king l whatever into wine to get drunk © Tefko Saracevic, Rutgers University 9

K(S) + I = K(S + S) (Brookes) Information [structured information] when operating on K(S) + I = K(S + S) (Brookes) Information [structured information] when operating on a knowledge structure produces an effect whereby the knowledge structure is changed Potential information added (Ingwersen) Actually, it states the problem – l l l “unoperational” in information systems involves mental events only constructivists rejected it © Tefko Saracevic, Rutgers University 10

Information in inf science: Three senses (from narrowest to broadest) ¶ Inf. in terms Information in inf science: Three senses (from narrowest to broadest) ¶ Inf. in terms of decision involving little or no cognitive processing l signals, bits, straightforward data - e. g. . inf. theory, economics · Inf. involving cognitive processing & understanding l understanding, matching texts ¸ Inf. also as related to situation, task, problem-athand : USERS, USE For information science (incl. information retrieval): l third, broadest interpretation © Tefko Saracevic, Rutgers University 11

The biggest problem MEASUREMENT © Tefko Saracevic, Rutgers University 12 The biggest problem MEASUREMENT © Tefko Saracevic, Rutgers University 12

3. Structure Specialties (White & Mc. Cain) In desc. order of author co-citation; (120 3. Structure Specialties (White & Mc. Cain) In desc. order of author co-citation; (120 authors, 24 years): l experimental retrieval l citation analysis l practical retrieval l bibliometrics l library systems, automation l user studies and theory l scientific communication l OPAC’s l general - other disciplines l indexing theory l communication theory © Tefko Saracevic, Rutgers University 13

Structure or oeuvres Two large sub-disciplines: Ê “Domain” cluster: analytical study of literatures, their Structure or oeuvres Two large sub-disciplines: Ê “Domain” cluster: analytical study of literatures, their structure, communication, social context, uses Ë Retrieval cluster: human-literature interface: IR systems (largest); interaction; library systems, OPACs, user studies l within each sub-clusters, eras l e. g. . Salton & post-Salton era Largely not connected l l some authors in both, migrating BUT: lacking integrating works, authors, texts - big payout © Tefko Saracevic, Rutgers University 14

Paradigm split in retrieval cluster Split from early 80’s to date ¶ System-centered algorithms, Paradigm split in retrieval cluster Split from early 80’s to date ¶ System-centered algorithms, TREC l continue traditional IR model l · Human-(user)-centered cognitive, situational, user studies l interaction models, some started in TREC l Calls for user-centered approaches & evaluation But: most support for system work l in the digital age support is for digital © Tefko Saracevic, Rutgers University 15

Human vs. system Human (user) side: l l l often highly critical, even one-sided Human vs. system Human (user) side: l l l often highly critical, even one-sided mantra of implications for design but does not deliver concretely System side: l l mostly ignores user side & studies ‘tell us what to do & we will’ Issue NOT H or S approach l l l even less H vs. S but how can H AND S work together major challenge for the future © Tefko Saracevic, Rutgers University 16

4. Systems stuff Information Retrieval “ IR: . . . intellectual aspects of description 4. Systems stuff Information Retrieval “ IR: . . . intellectual aspects of description of inf. , . . . search, . . . & systems, machines. . . ” Calvin Mooers, 1951 How to provide users with useful information effectively? For that objective: 1. How to organize information intellectually? 2. How to specify the search & interaction intellectually? 3. What techniques & systems to use effectively? © Tefko Saracevic, Rutgers University 17

Streams in IR Res. & Dev. 1. Information science: l l l Services, users, Streams in IR Res. & Dev. 1. Information science: l l l Services, users, use; Human-computer interaction; Cognitive aspects 2. Computer science: l l Algorithms, techniques Systems aspects 3. Information industry: l l Products, services, Web Market aspects Problems: . . . relative isolation. . . inadequate cooperation, transfer © Tefko Saracevic, Rutgers University 18

IR successfully effected: Emergence & growth of the INFORMATION INDUSTRY Evolution of IS as IR successfully effected: Emergence & growth of the INFORMATION INDUSTRY Evolution of IS as a PROFESSION & SCIENCE Many APPLICATIONS in many fields l including on the Web – search engines Improvements in HUMAN - COMPUTER INTERACTION Evolution of INTEDISCIPLINARITY IR has a long, proud history © Tefko Saracevic, Rutgers University 19

Broadening of IR OPACs (Online Public Access Catalogs) Natural language processing Summarization Metadata representations Broadening of IR OPACs (Online Public Access Catalogs) Natural language processing Summarization Metadata representations Text “understanding” Hypertext, hypermedia Multimedia - images, sounds. . . l image IR, music IR Many human-computer interactions Web search engines © Tefko Saracevic, Rutgers University 20

5. People stuff Quite a few areas Professional services l l in organization – 5. People stuff Quite a few areas Professional services l l in organization – moving toward knowledge management, competitive intelligence in industry – vendors, aggregators, Internet, Research l l l user & use studies interaction studies broadening to information seeking studies, social context, collaboration relevance studies social informatics © Tefko Saracevic, Rutgers University 21

User & use studies Oldest area covers many topics, methods, orientations l many studies User & use studies Oldest area covers many topics, methods, orientations l many studies related to IR l l e. g. searching, multitasking, browsing, navigation Branching into Web use studies quantitative & qualitative studies l emergence of webmetrics l © Tefko Saracevic, Rutgers University 22

Interaction Traditional IR model concentrates on matching not user side & interaction Several interaction Interaction Traditional IR model concentrates on matching not user side & interaction Several interaction models suggested l l Ingwersen’s cognitive, Belkin’s episode, Saracevic’s stratified model hard to get experiments & confirmation Considered key to providing basis for better design l understanding of use of systems l Web interactions a major new area © Tefko Saracevic, Rutgers University 23

Relevance Effectiveness in IR = relevance l thus, relevance became a key notion l Relevance Effectiveness in IR = relevance l thus, relevance became a key notion l and a key headache A number of studies & reviews on: Nature: Framework, base? l Manifestations: Contexts? Typologies? l Behavior: Variables? Observations? l Effects: Use? Evaluation? l © Tefko Saracevic, Rutgers University 24

Manifestations (types) of relevance System or algorithmic relevance l relation between query & objects Manifestations (types) of relevance System or algorithmic relevance l relation between query & objects (‘texts’) retrieved or failed to retrieve Topical or subject relevance Cognitive relevance or pertinence Situational relevance or utility l relation between the situation, task or problem at hand & texts Motivational or affective relevance l intent, goals, & motivation of user & “texts” Manifestations interact dynamically © Tefko Saracevic, Rutgers University 25

Information seeking Concentrates on broader context not only IR or interaction, people as they Information seeking Concentrates on broader context not only IR or interaction, people as they move in life & work Number of models provided l e. g. Kuhlthau’s stages, Vakkari’s problem situation, task complexity Includes studies of ‘life in the round, ’ making sense, information encountering, work life, information discovery Based on concept of social construction of information © Tefko Saracevic, Rutgers University 26

6. Alliances, competition Relations With a number of fields. . . Strongest: 1. Librarianship 6. Alliances, competition Relations With a number of fields. . . Strongest: 1. Librarianship 2. Computer science © Tefko Saracevic, Rutgers University 27

Librarianship [Library is]. . . “contributing to the total communication system in society. Created Librarianship [Library is]. . . “contributing to the total communication system in society. Created to maximize the utility of graphic record for the benefits of society. . . it achieves that goal by working with the individual and through the individual it reaches society. ” J. H. Shera, 1972 © Tefko Saracevic, Rutgers University 28

Common grounds IS & librarianship share: Social role in information society Concern with effective Common grounds IS & librarianship share: Social role in information society Concern with effective utilization of graphic & other types of records Research problems related to a number of topics Transfer to & from information retrieval © Tefko Saracevic, Rutgers University 29

Differences IS & librarianship differ in: Selection & definition of many problems addressed Theoretical Differences IS & librarianship differ in: Selection & definition of many problems addressed Theoretical questions & framework Nature & degree of experimentation Tools and approaches used Nature & strength of interdisciplinary relations © Tefko Saracevic, Rutgers University 30

One field or two? Point of many debates Suggest: TWO fields in strong interdisciplinary One field or two? Point of many debates Suggest: TWO fields in strong interdisciplinary relations Not a matter of “better” or “worse” - matters little l common arguments between many fields Differences matter in: l l problem selection & definition agenda, paradigms theory, methodology practical solutions, systems Best example: IR & library automation © Tefko Saracevic, Rutgers University 31

Which? Librarianship. Information science Library and information science Libraryandinformationscience Information sciences Information l like Which? Librarianship. Information science Library and information science Libraryandinformationscience Information sciences Information l like in the “Information School” © Tefko Saracevic, Rutgers University 32

Computer science “systematic study of algorithmic processes that describe and transfer information. . The Computer science “systematic study of algorithmic processes that describe and transfer information. . The fundamental question in computing is: ‘What can be (efficiently) automated’. ” Denning et al. , 1989 © Tefko Saracevic, Rutgers University 33

IS & computer science CS primarily about algorithms IS primarily about information and its IS & computer science CS primarily about algorithms IS primarily about information and its users and use Not in competition, but complementary Growing number of computer scientists active in IS – particularly in IR and digital libraries Concentrating on l l l advanced IR algorithms & techniques digital library infrastructure & various domains human computer interaction © Tefko Saracevic, Rutgers University 34

Human-computer interaction (HCI) “ Human computer interaction is a discipline concerned with the design, Human-computer interaction (HCI) “ Human computer interaction is a discipline concerned with the design, evaluation and implementation of interactive computing systems for human use and with the study of major phenomena surrounding them. ” ACM SIGCHI, 1993 Another interdisciplinary area l computers sc. , cognitive sc. , ergonomics, . . . © Tefko Saracevic, Rutgers University 35

Interaction and IS Two streams: l l computer-human interaction human-computer interaction Modern IR is Interaction and IS Two streams: l l computer-human interaction human-computer interaction Modern IR is interactive l BUT: difference between retrieval engine & retrieval interface Many studies on: l l machine aspects of interaction human variables in interaction Problem: little feedback between Interaction very hard to evaluate - few methods yet © Tefko Saracevic, Rutgers University 36

7. Digital libraries LARGE & growing area “Hot” area in R&D a number of 7. Digital libraries LARGE & growing area “Hot” area in R&D a number of large grants & projects in the US, European Union, & other countries l but “DIGITAL” big & “libraries“ small l “Hot” area in practice building digital collections, hybrid libraries, l many projects throughout the world l © Tefko Saracevic, Rutgers University 37

Technical problems Substantial - larger & more complex than anticipated: l representing, storing & Technical problems Substantial - larger & more complex than anticipated: l representing, storing & retrieving of library objects l l l operationally managing large collections - issues of scale dealing with diverse & distributed collections l l l particularly if originally designed to be printed & then digitized interoperability assuring preservation & persistence incorporating rights management © Tefko Saracevic, Rutgers University 38

Digital Library Initiatives in the US (DLI) Research consortia under National Science Foundation l Digital Library Initiatives in the US (DLI) Research consortia under National Science Foundation l l DLI 1: 1994 -98, 3 agencies, $24 M, six large projects DLI 2: 1999 -2006, 8 agencies, $60+M, 77 large & small projects in various categories ‘digital library’ not defined to cover many topics & stretch ideas l not constrained by practice © Tefko Saracevic, Rutgers University 39

European Union DELOS Network of Excelence on Digital Libraries l many projects throughout European European Union DELOS Network of Excelence on Digital Libraries l many projects throughout European Union l heavily technological many meetings, workshops l resembles DLIs in the US l well funded, long range l © Tefko Saracevic, Rutgers University 40

Research issues l understanding objects in DL l l l l l representing in Research issues l understanding objects in DL l l l l l representing in many formats non-textual materials metadata, cataloging, indexing conversion, digitization organizing large collections managing collections, scaling preservation, archiving interoperability, standardization accessing, using, © Tefko Saracevic, Rutgers University 41

DL projects in practice Heavily oriented toward institutions Assoc of Res Libraries (ARL) database: DL projects in practice Heavily oriented toward institutions Assoc of Res Libraries (ARL) database: 427 DL projects in 13 countries l 374 in the US l 51% in universities; 24% fed govmt; 9% hist societies; 6% regional … l 84% are explicitly retrospective; 16% technological l 1 listed from DLI (Illinois) l no connection with DLI projects l © Tefko Saracevic, Rutgers University 42

Agendas Most DL research agenda is set from top down l l from funding Agendas Most DL research agenda is set from top down l l from funding agencies to projects imprint of the computer science community's interest & vision Most DL practice agendas are set from bottom up l l from institutions, incl. many libraries imprint of institutional missions, interests & vision l l providing access to specialized materials and collections from an institution (s) that are otherwise not accessible covering in an integral way a domain with a range of sources © Tefko Saracevic, Rutgers University 43

Connection? DL research & DL practice presently are conducted mostly independent of each other, Connection? DL research & DL practice presently are conducted mostly independent of each other, l minimally informing each other, l & having slight, or no connection l Parallel universes with little connections & interaction © Tefko Saracevic, Rutgers University 44

8. Conclusions IS contributions IS effected handling of inf. in society Developed an organized 8. Conclusions IS contributions IS effected handling of inf. in society Developed an organized body of knowledge & professional competencies Applied interdisciplinarity IR reached a mature stage IR penetrated many fields & human activities Stressed HUMAN in human-computer interaction © Tefko Saracevic, Rutgers University 45

Challenges Adjust to the growing & changing social & organizational role of inf. & Challenges Adjust to the growing & changing social & organizational role of inf. & related infrastructure Play a positive role in globalization of information Respond to technological imperative in human terms Respond to changes from inf. to communication explosion - bringing own experiences to resolutions, particularly to the INTERNET Join competition with quality Join DIGITAL with LIBRARIES © Tefko Saracevic, Rutgers University 46

Juncture IS is at a critical juncture in its evolution Many fields, groups. . Juncture IS is at a critical juncture in its evolution Many fields, groups. . . moving into information l l l big competition entrance of powerful players fight for stakes To be a major player IS needs to progress in its: l l research & development professional competencies educational efforts interdisciplinary relations Reexamination necessary © Tefko Saracevic, Rutgers University 47

© Tefko Saracevic, Rutgers University 48 © Tefko Saracevic, Rutgers University 48

© Tefko Saracevic, Rutgers University 49 © Tefko Saracevic, Rutgers University 49