
fdf4f93615390b083e029402e6a6218a.ppt
- Количество слайдов: 25
Large-scale digital archives of endangered Asia-Pacific languages Pacific And Regional Archive for Digital Sources in Endangered Cultures Linda Barwick, University of Sydney Presentation to APAN E-science workshop, Honolulu, 28 Jan 2004
Endangered regional languages Approx. 2500 of the world’s 6000 languages in Australia’s region (Oceania, E and SE Asia) l Majority of these 2500 are endangered - number of languages likely to fall to a few hundred by 2100 (UNESCO) l Loss of language -> loss of cultural knowledge (e. g. ecological knowledge) and expressions (e. g. songs) -> loss of human diversity l
Why digital archives? l Salvaging materials recorded in endangered analogue formats – Only means of ensuring long-term preservation and access to audio Optimal format for transcription and analysis l Distributed management & access (including authentication) via broadband R&E networks l Participation in international consortia for resource discovery and advice l Quality-controlled citeable primary data resource to support research results l
The coming revolution … l Quality-controlled citeable primary data resource to support research results requires: – Authenticated resource creation path – Finegrained description of resource • • Metadata Transcript Timecoding (Translation… ) – Sustainability, security, discoverability and accessibility of resource (i. e. needs to be online) – Instantiation of links between research results and primary data (e. g. via electronic publication)
Other regional digital language and music archives l l l l l Archive of Maori and Pacific Music, U. Auckland Tjibaou Cultural Centre, New Caledonia Vanuatu Cultural Centre Institute of Papua New Guinea Studies Music Archive, Port Moresby Australian Institute of Aboriginal and Torres Strait Islander studies audiovisual archive Alaskan Native Languages Center Archive of Indigenous Languages of Latin America Formosan Language archive Others … e. g. Malaysia ….
Some European archives hosting Asia-Pacific region material l l l Do. Be. S (Documentation of Endangered Languages) Archive, Max Planck Institute, Nijmegen, Holland Endangered Languages Programme Archive, SOAS, UK Vienna Phonogrammarchiv Berlin Phonogrammarchiv LACITO, France Musée de l’homme, France British National Sound Archive …
About PARADISEC Established 2003 to preserve and make accessible Australian researchers’ field recordings of endangered languages and musics from the Asia-Pacific region l Collaborative project funded by Australian Research council, participants Universities of Sydney, Melbourne and ANU l Does not include Australian languages - these managed via AIATSIS l Present focus on audio recordings - plan to include and integrate other digital resources l
Collection status Jan 2004 l 1324 assessed records, covering approx. 150 regional languages from 14 countries – (Australia, Burma, Fiji, Indonesia, Japan, Laos, Malaysia, Micronesia, New Zealand, Papua New Guinea, Singapore, Taiwan, Vanuatu, Vietnam) 392 hours ingested and online via password, APAC store account - on target for 500 hours (1 terabyte) in first year l Metadata quality control via registration with Open Language Archive Community (6/03) and OAI l First collections digitised and returned to depositors l
Metadata - shared online database l l l For description, assessment, rights, access Filemaker Pro while in development Currently moving to My. SQL/PHP Created & managed online in shared server space Public access to catalogue planned for 2004 Will link to collection Nick Thieberger, Melbourne unit PARADISEC project manager (for authorised users)
PARADISEC audio standards l 24 -bit 96 khz Broadcast Wave Format (uncompressed PCM audio with encapsulated metadata) 2 GB/h l. Ingestion managed via Quadriga system (also used by National Library of Australia, Screensound, etc) l. CD-audio and Mp 3 browser copies via batch processing Frank Davey, audio engineer, Sydney unit
Depositor and user liaison l PARADISEC digital archive only provides temporary storage while objects are digitised – originals returned to originating institution /depositor with CD-audio copy – depositors have online password-protected Amanda Harris, project access to full-resolution administration, Sydney unit digital files – we provide advice on archiving of originals if requested – born-digital originals will revolutionise work practices
PARADISEC structure password authentication metadata/ database design (Melbourne) data entry/ administration digitisation (Sydney) depositor APAC national facility (Canberra) “Azoulay” archive space owner cultural centre authorised general user working space Usyd MSS
Rights l Depositor and user agreement forms online l Rights information embedded in the processing system for eventual automated access or restriction of access l Trial password access currently implemented on APAC store and shared database
Access (audio online) Download whole files from data store (e. g. for authorised community use) l Streaming MP 3 (browsing) l – Audition section of file (in development 2004) – Transcript, dictionaries, maps, images etc as point of entry to collection (in development 2004) Effective access depends on transcripts with translations and timecoding l Need ‘timecoding for dummies’ tools l Encouragement for users to add value to repository by lodging transcripts, indexes etc. l
Training & Resources Demand for practical workshops for researchers and communities l Researcher training to archive in everyday practice not just as end point l Website as gateway for online resources l Potential for online collaboration with users and stakeholder communities in adding value to collection through timecoding and metadata l
PARADISEC’s communities International disciplinerelated digital entities Regional stakeholders and cultural centres PARADISEC Australian Higher Education Sector National media archives
PARADISEC Regional community Regional stakeholders l Speakers/performers and their inheritors l Local and national cultural centres – Vanuatu Kaljoral Senta – Institute of PNG Studies – Etc… l Must be involved for ethical and rights reasons l Significant user community
PARADISEC Issues l Differentials in infrastructure l Differentials in funding l Training and career structures l Technical support l Local language access interface Regional stakeholders
PARADISEC Wishlist Regional stakeholders Effective international networking links to stakeholder communities l User-friendly, cost-effective and open-source database, indexation and annotation software l More opportunities for user workshops and skillsharing within the Asia-Pacific region l Greater awareness of potential for cultural heritage applications in the planning/feasibility study stages of regional infrastructure projects l
PARADISEC OLAC http: //www. language-archives. org Sub-community of Open Archives Initiative l Worldwide virtual library of language resources l PARADISEC one of 27 participating archives l International discipline-related digital entities AIMS l develop consensus on best current practice for digital archiving of language resources l develop network of interoperating repositories & services for housing & accessing such resources
PARADISEC DELAMAN http: //www. delaman. org International discipline-related digital entities Other participants include: l Alaska Native Language Center Archives (University of Alaska Fairbanks, USA) Archive of Indigenous Languages of Latin America (University of Texas, USA) l Archive of Maori and Pacific Music (University l of Auckland, New Zealand) l Do. Be. S archive (Max Planck Institut für Psycholinguistik, Holland) l ELAR archive (School of Oriental and African Studies, UK)
PARADISEC Issues l Differentials International discipline-related digital entities in scope and mission of participants l Differential IP and rights protocols across international boundaries l Differentials in data structures, standards and system architectures
PARADISEC Wishlist l Networking, International discipline-related digital entities ethical agreements and standards to allow mirroring of data between participating archives to provide secure backup and efficiencies in data provision to global user communities
Linkages l Support and advice from. . . – – l ANU Internet Futures, APAC, Grangenet Screen. Sound National Library AIATSIS Collaborations. . . – EMELD (Electronic Metastructures for Endangered Languages Data) – DELAMAN and OLAC – Regional cultural organisations – Strategic partnerships with other digital archives
Contacts Please visit our website http: //www. paradisec. org. au Director (Sydney unit) lb@paradisec. org. au Project manager (Melbourne) nickt@paradisec. org. au