Скачать презентацию Developing a Net CDF-4 Interface to HDF 5 Скачать презентацию Developing a Net CDF-4 Interface to HDF 5

da29b62cc43abcc17436b1f0d2234543.ppt

  • Количество слайдов: 23

Developing a Net. CDF-4 Interface to HDF 5 Data Russ Rew (PI), UCAR Unidata Developing a Net. CDF-4 Interface to HDF 5 Data Russ Rew (PI), UCAR Unidata Mike Folk (Co-PI), NCSA/UIUC Ed Hartnett, UCAR Unidata Quincey Kozial, NCSA/UIUC John Caron, UCAR Unidata Robert E. Mc. Grath, NCSA/UIUC NASA award AIST-02 -0071

Unidata: A Community Endeavor • Community of educators and researchers at 120 universities, 30 Unidata: A Community Endeavor • Community of educators and researchers at 120 universities, 30 other institutions, international in scope • Managed by the University Corporation for Atmospheric Research • Mission: providing data, tools, support, and community leadership for enhanced earth-system education and research • Atmospheric science community, expanding to oceanography, hydrology, other geosciences • Unidata Program Center: 25 staff, 15 2

Overview • • • What is net. CDF? What is HDF 5? Why develop Overview • • • What is net. CDF? What is HDF 5? Why develop a net. CDF interface to HDF 5? What is the current project status? What still needs to be done? Do we have the necessary resources? What are the prospects for success? 3

Net. CDF-3 and HDF 5 Ad hoc standards are useful standards • Standard Data Net. CDF-3 and HDF 5 Ad hoc standards are useful standards • Standard Data Models for scientific data and data abstractions • Standard Interfaces between data providers and data users • Standard Libraries for data access from various languages • Standard Formats for portable binary data • Users need not know about the format 4

Data Models net. CDF-3 HDF 5 Variables Dimensions Attributes Datasets Dataspaces Attributes Coordinates Element Data Models net. CDF-3 HDF 5 Variables Dimensions Attributes Datasets Dataspaces Attributes Coordinates Element types Datatypes Groups Links References Property Lists 5

Libraries net. CDF-3 HDF 5 one interface level serial I/O C, C++ Fortran-77, -90 Libraries net. CDF-3 HDF 5 one interface level serial I/O C, C++ Fortran-77, -90 Java (pure) high- and low-level interfaces serial. parallel (MPI) I/O C, C++ Fortran-90 Java (native) Perl Python Ruby IDL Matlab 6

Formats net. CDF-3 HDF 5 XDR direct access efficiently extendible 32 -bit file offsets Formats net. CDF-3 HDF 5 XDR direct access efficiently extendible 32 -bit file offsets XDR and native direct access efficiently extendible 64 -bit file offsets chunked access compound structures nested structures compression efficient schema changes 7

Other Characterisitics Net. CDF-3 HDF 5 Availability free Development and maintenance UCAR Unidata NCSA Other Characterisitics Net. CDF-3 HDF 5 Availability free Development and maintenance UCAR Unidata NCSA HDF Group Primary funding NSF NASA, DOE ASCI Advantages popular, simple, lots of tools, multiple implementations powerful, highperformance, storage efficiency, extensibility 8

Goals of Net. CDF/HDF Combination • Create net. CDF-4, combining desirable • characteristics of Goals of Net. CDF/HDF Combination • Create net. CDF-4, combining desirable • characteristics of net. CDF-3 and HDF 5, while taking advantage of their separate strengths • Widespread use and simplicity of net. CDF-3 • Generality and performance of HDF 5 Make net. CDF more suitable for highperformance computing • Provide simple high-level interface for HDF 5 • Demonstrate benefits of combination in advanced Earth science modeling efforts 9

Net. CDF-4 Features Enabled by HDF 5 • • Large file support Parallel I/O Net. CDF-4 Features Enabled by HDF 5 • • Large file support Parallel I/O Multiple dynamic dimensions Packed data, compression New data types Dynamic schema modifications Other possibilities: groups, user-defined types, better coordinate support, … 10

Approach • Implement net. CDF-3 over HDF 5, to • • • demonstrate backward Approach • Implement net. CDF-3 over HDF 5, to • • • demonstrate backward compatibility with • Programming interface • Format Design net. CDF-4 interface Implement net. CDF-4 over HDF 5 to add enhancements made possible with HDF 5 Foster continued collaboration between Unidata and NCSA in design, development, testing, and support 11

Net. CDF-4 Architecture net. CDF-3 Interface net. CDF-4 Library HDF 5 Library • Access Net. CDF-4 Architecture net. CDF-3 Interface net. CDF-4 Library HDF 5 Library • Access to net. CDF-3, net. CDF-4, and HDF 5 data created through net. CDF-4 interface 12

User View of Net. CDF-4 • Net. CDF-4 library accesses either the net. CDF-3 User View of Net. CDF-4 • Net. CDF-4 library accesses either the net. CDF-3 or HDF 5 library to read or write data 13

Current Technical Status Implement net. CDF-3 over HDF 5, to demonstrate backward compatibility with Current Technical Status Implement net. CDF-3 over HDF 5, to demonstrate backward compatibility with API and format done Determine needed HDF 5 enhancements done Prepare net. CDF-3 for incorporation with net. CDF-4 nearly done Design net. CDF-4 interface to add enhancements made possible with HDF 5 in progress Implement needed HDF 5 enhancements in progress Implement net. CDF-4 over enhanced HDF 5 not started yet 14

Net. CDF-3 Interface Using HDF 5 • • • 13, 000 lines of C Net. CDF-3 Interface Using HDF 5 • • • 13, 000 lines of C code Passes all net. CDF-3 tests Demonstrates HDF 5 practical for net. CDF-4 Identifies HDF 5 enhancements needed Shows read/write times and file sizes satisfactory • Validates approach to backward compatibility • API compatibility: only recompilation and relinking needed for existing net. CDF-4 programs • Format compatibility: accesses all current net. CDF files as well as new HDF 5 files transparently 15

Net. CDF-3 Enhancements for Net. CDF -4 • To provide • • • stable Net. CDF-3 Enhancements for Net. CDF -4 • To provide • • • stable foundation for incorporating net. CDF-4 • smooth transition for current users Automated multi-platform testing Documentation converted to maintainable form, new language-independent Users Guide Added large file support with backward compatibility Added default format interfaces Better Windows and. Net support 16

HDF 5 Additions for Supporting Net. CDF 4 • HDF 5 enhancements • • HDF 5 Additions for Supporting Net. CDF 4 • HDF 5 enhancements • • numeric type conversions zero-dimensional datasets overflow handling improvements flexible parallel I/O • HDF 5 design specifications • dimension scales for coordinate systems • shared object proposal 17

Project Schedule Currently on schedule for a July 2005 release • July 2004: version Project Schedule Currently on schedule for a July 2005 release • July 2004: version 3. 6. 0 - revised documentation, 64 -bit file offsets, default format functions • October 2004: version 3. 7. 0 - use of autotools • January 2005: version 3. 7. 1: net. CDF-4 prototype included, support for multiple unlimited dimensions • March 2005: version 4. 0. 0_beta - test relelase • July 2005: version 4. 0. 0 - first net. CDF-4 production release 18

Net. CDF-4 Design Issues • Issue: support for coordinate systems in net. CDF and Net. CDF-4 Design Issues • Issue: support for coordinate systems in net. CDF and HDF 5 data models? under consideration • Issue: addition of HDF 5 Groups abstraction to net. CDF data model? yes, tentatively • subset of HDF 5 Group features • constrained by backward compatibility with net. CDF-3 • no Group aliases but try to support Variable aliases and Dimension scoping? • Issue: can we just adopt Northwestern/Argonne pnet. CDF interface for adding parallel I/O? 19

What remains to be done? • Next for net. CDF-4: interface additions for multiple What remains to be done? • Next for net. CDF-4: interface additions for multiple unlimited dimensions, group interfaces, dynamic schema modification, new data types, packed data, parallel I/O, compression • HDF 5 enhancements • zero-length attributes • shared dimensions • creation order access for objects • Testing in models (CCSM, WRF, ESMF, . . . ) 20

Papers, Posters, Presentations 1. R. Rew, M. Folk, E. Hartnett, and R. Mc. Grath: Papers, Posters, Presentations 1. R. Rew, M. Folk, E. Hartnett, and R. Mc. Grath: Plans for an Enhanced Net. CDF-4 Interface to HDF 5 Data. HDF/HDF-EOS Workshop VII, Silver Springs, September 2003. Poster and presentation. 2. M. Folk, R. Rew, K. Yang, R. Mc. Grath: Net. CDF-4: Combining net. CDF and HDF 5 Data. AGU Fall Meeting, San Francisco, December 2003. Poster. 3. R. Rew and E. Hartnett: Merging Net. CDF and HDF 5. 20 th International Conference on Interactive Information Processing Systems (IIPS) for Meteorology, Oceanography, and Hydrology, Seattle, January 2004. Paper and poster. 4. E. Hartnett: Merging the Net. CDF and HDF 5 Libraries to Achieve Gains in Performance and Interoperability. 2004 Earth Science Technology Conference, Palo Alto, June 2004. Paper and presentation. 21

Excellent Prospects for Success • More software engineering than research • Net. CDF-4 web Excellent Prospects for Success • More software engineering than research • Net. CDF-4 web site just announced: • www. unidata. ucar. edu/packages/netcdf-4/ • Unidata and NCSA developers collaborating via email, teleconferences • On schedule for July 2005 release: • www. unidata. ucar. edu/packages/netcdf/release_schedul e. html • Great interest in status of project! Ultimate goal to make earth science researchers more productive. . . 22

Questions? ? 23 Questions? ? 23