Скачать презентацию Technical Status of the Project Bob Jones Скачать презентацию Technical Status of the Project Bob Jones

d992a9d05818ab2443b0cf5128f9c924.ppt

  • Количество слайдов: 18

Technical Status of the Project Bob Jones – 3/16/2018 - n° 1 Technical Status of the Project Bob Jones – 3/16/2018 - n° 1

Overview u Testbed status u Application u Project n status retreat Issues and actions Overview u Testbed status u Application u Project n status retreat Issues and actions for software process, current (EDG 1. 2) and future releases u Tutorials u Summary Bob Jones – 3/16/2018 - n° 2

Testbed Status u Application n testbed Running EDG 1. 2 on 5 core sites Testbed Status u Application n testbed Running EDG 1. 2 on 5 core sites (and a couple of others) s Since first week of August (several months later than initially planned) n Users guide and release notes available (installation guide will come later) n Being used for application tests n Current “show-stopper” issues found: s s Long job status problem (can’t retrieve output) Long file transfers problem (20 mins limit) u Development n testbed Testing urgent updates to EDG 1. 2 s s More recent beta release of Globus 2 Various fixes for data management chain Bob Jones – 3/16/2018 - n° 3

Application Status u WP 8: High Energy Physics n LHC experiments doing tests now Application Status u WP 8: High Energy Physics n LHC experiments doing tests now n ATLAS task force u WP 9: Earth Observation n Installation of EDG 1. 2 at ESA done n Testing to start in September u WP 10: n Biology Initial tests made with EDG 1. 2 u Overall comments: n General confusion about how best to use data mgmt tools n Software not yet stable enough and insufficient diagnostics information available n Too difficult to configure n Concern that EDG 1. 2 in its current configuration will not scale easily to ~40 sites Bob Jones – 3/16/2018 - n° 4

ATLAS Task Force u Task n for with ATLAS & EDG people (lead by ATLAS Task Force u Task n for with ATLAS & EDG people (lead by Oxana Smimova) http: //cern. ch/smirnova/atlas-edg u ATLAS is eager to use Grid tools for the Data Challenges n ATLAS Data Challenges are already on the Grid (Nordu. Grid, i. VDGL) n The DC 1/phase 2 (to start in October) is expected to be done mostly using the Grid tools u By September 16 (ATLAS SW week) evaluate the usability of EDG for the DC tasks u The task: to process 5 input partitions of the Dataset 2000 at the EDG Testbed + one non-EDG site (Karlsruhe) u Intensive activity has meant they could process some partitions but problems with long running jobs is still an issue u Data u Need Management chain is proving difficult to use and sometime unreliable to clarify policy for distribution/installation of applications software On-going activity with very short-timescale: highest priority task Bob Jones – 3/16/2018 - n° 5

Project Retreat u Project u ~45 n retreat held last week (27 & 28 Project Retreat u Project u ~45 n retreat held last week (27 & 28 August) at Chevannes participants work package managers, architecture group, quality group, applications groups, mware experts, representatives from LCG, Data. TAG, Globus & Condor u Agenda and material on the web: n n u 3 http: //documents. cern. ch/age? a 021130 Photos by Jeff Templon http: //www. nikhef. nl/~templon/chavannes/index. html sessions addressing most important aspects of projects current work: n Software Release Process n Release 1. 2 n Testbed 2 Bob Jones – 3/16/2018 - n° 6

Software Process u Over-simplification of the current situation: 1. Mware groups develop software in Software Process u Over-simplification of the current situation: 1. Mware groups develop software in isolation 2. ITeam assembles it as best it can 3. Site managers are asked to install it 4. Application groups are asked to test it Problems: No place for the mware groups to integrate software before delivering it to the ITeam Inadequate software testing – leads to installation/configuration/execution faults We are running blind – no way to control or reliably plan software delivery Bob Jones – 3/16/2018 - n° 7

Software Process: Autobuild u. A release manager will be nominated with overall responsibility for Software Process: Autobuild u. A release manager will be nominated with overall responsibility for ensuring the procedure is followed u Make autobuild tools the basis of the daily work of the mware groups and ITeam n Nightly build from CVS repository for all software s n Mware groups give ITeam CVS tags instead of RPMs s n Problems must be fixed ASAP – checked by Quality Group reps Tagged software must be documented Mware group must perform and supply unit tests s Integrated with nightly build u Tagged software that fails the integration, testing or is inadequately documented will be rejected n Mware group is responsible for fixing it Bob Jones – 3/16/2018 - n° 8

Software Process: Quality Group u Recently formed Quality Group, convened by Gabriel Zaquine, is Software Process: Quality Group u Recently formed Quality Group, convened by Gabriel Zaquine, is responsible for ensuring quality issues are addressed within the WPs n Ensure unit test plans are complete and followed n Follow-up on problems reported bugzilla & nightly builds n Organise running of code checking tools on all EDG software n Agree on adopted project developer-guidelines etc. u http: //eu-datagrid. web. cern. ch/eu-datagrid/QAG/default. htm Bob Jones – 3/16/2018 - n° 9

Software Process: Testing u Strengthen the Testing Group n Identify leader and a small Software Process: Testing u Strengthen the Testing Group n Identify leader and a small number of full-time testers n Assemble and maintain test suite integrated with autobuild tools u Automate n n n installation and configuration of software releases To permit auto testing need to be able to auto install & configure a release on a pre-defined small example site Needs improvements by mware WPs to simplify and complete installation & configuration of their sw Site managers have good overview about how to do this Need to clarify the work involved during this conference u Set-up certificate testbed n Used for testing activities n Involves several sites Bob Jones – 3/16/2018 - n° 10

Technical Management u Architecture group documenting testbed 2 architecture n draft: http: //doc. cern. Technical Management u Architecture group documenting testbed 2 architecture n draft: http: //doc. cern. ch/archive/electronic/other/agenda/a 021130 s 4 t 1/TB 2 Arch_v 0_1. doc n Meets once a month (next meeting tomorrow) u Project Tech. Board addresses deliverables and relationships with other projects n Meets once per quarter (next meeting 2 nd October @ CERN) n http: //documents. cern. ch/AGE/current/display. Level. php? fid=3 l 131 u Need more frequent technical management forum n Authority to make technical & architectural decisions affecting sw development in WPs n Include WP managers, chaired by the Technical Coordinator s n Can call on mware experts according to needs of themed agenda Meets frequently to ensure issues are addressed rapidly s Associate with WP managers weekly meeting u Relationship with Architecture Group and Project Tech. Board needs to be clarified Bob Jones – 3/16/2018 - n° 11

Testbed Support u Strengthen user support group n Ensure people involved have sufficient knowledge Testbed Support u Strengthen user support group n Ensure people involved have sufficient knowledge of the software n Emphasis on the accurate and usefulness of the responses provided s Tools used for support are a secondary issue n Federate with equivalent groups from other projects n Provides support on the application testbed u Clarify n Creating a new CA (CA group) s n Need to reduce time involved (currently 3 months) Site Installation (site managers & ITeam) s n & document procedures Steps for system manager and requirements for a site to join the testbed Creating & Managing a Virtual Organisation (site managers & ITeam) s Steps involved and tasks of a VO manager Bob Jones – 3/16/2018 - n° 12

Release Development Current “show-stoppers” fixed 1. 2 CVS Autobuild Nightly build Continuous support branch Release Development Current “show-stoppers” fixed 1. 2 CVS Autobuild Nightly build Continuous support branch 2. 0 patches Application tests satisfied Incremental Improvements Continuous Support branch Changes foreseen for 1. 3 & 1. 4 become “incremental improvements” Migrate sites 2. x patches Incremental Improveme Etc. Bob Jones – 3/16/2018 - n° 13

Incremental Steps from EDG 1. 2 1. 2. 3. 4. End 5. Sept 2002 Incremental Steps from EDG 1. 2 1. 2. 3. 4. End 5. Sept 2002 6. 7. Fix “show-stoppers” for application groups – mware WPs (continuous) Build EDG 1. 2. x with autobuild tools Iteam Integrate testing framework and limited automatic tests with autobuild tools - testing group Automatic installation & configuration procedure for predefined site (can’t auto test without it) Start autobuild server for RH 7. 2 and attempt build of release 1. 2 – Yannick Patois 7. 8. 9. 10. 11. Giggle & Reptor – WP 2 LCAS with dynamic plug-in modules – WP 4 Network. Cost Function – WP 7 Integrate mapcentre (nordugrid? ) and R-GMA – WP 3 GLUE modified info providers/consumers – WP 1, 4, 5 12. Res. Broker – WP 1 13. LCFG for RH 7. 2 – WP 4 Integration with Condor as batch system – WP 4 New LCFG - WP 4 Expect this list to be Grid. FTP server access to MSS discussed/updated WP 5 this week What do we do about: Space mgmt, VOMS, slashrgrid? 14. Bob Jones – 3/16/2018 - n° 14

EDG Tutorial The tutorials are aimed at users wishing to EDG Tutorial The tutorials are aimed at users wishing to "gridify" their applications using EDG software and are organized over 2 full consecutive days http: //hep-proj-grid-tutorials. web. cern. ch/hep-proj-grid-tutorials/dry. asp user: griduser passwd: tutorials 123 DAY 1 DAY 2 u Tutorial introduction u Introduction to Grid computing and overview of the Data. Grid project u Security u Testbed u Job overview Submission lunch u hands-on exercises: job submission u Data Management u LCFG, fabric mgmt & sw distribution & installation u Applications u Future and Use cases Directions lunch u hands-on exercises: data mgmt Bob Jones – 3/16/2018 - n° 15

Tutorial rehearsal u Rehearsal n 19 participants (members of project or closely related) to Tutorial rehearsal u Rehearsal n 19 participants (members of project or closely related) to check material & approach u Lessons n at CERN, 29 & 30 August learnt Can’t cover as much material as we hoped (goes to fast) s s Explain why not just how Avoid details – can read them from references afterwards n Need as many helpers as possible for hands-on exercises n Participants have difficulties with certificate management s All participants must have a certificate ready for them and be in the same VO u Generated a lot of enthusiasm in the participants and EDG people doing the hands-on n n Recommend mware WPs send developers to help with hands-on exercises n Ø Found genuine bugs during hands-on exercises New project people should follow the tutorial Thanks to: Mario Reale, Elisabetta Ronchieri, Akos Frohner, Erwin Laure, Peter Kunszt, Antony Wilson, Steve Fisher, Maite Barroso Lopez, Owen Synge, Emanuele Leonardi, Steve Traylen, Frank Bonnassieux, Christophe Jacquet, Sophie Nicoud, Karin Burghauser & CERN training people Bob Jones – 3/16/2018 - n° 16

Tutorial Schedule u CERN school of Computing, Naples, 23 -27 September n 80 participants. Tutorial Schedule u CERN school of Computing, Naples, 23 -27 September n 80 participants. Hands-on exercises only (presentations by Carl Kesselman & Ian Foster) n ALL EDG people attending should do exercises first and help others at the school u CERN, October 3 & 4 u Ne. SC, Edinburgh, December n Dates still moving. Maximum 30 participants (more for the presentations) u We n could accommodate more sites in December, January etc. Sites must provide support and handle logistics s Organisers/helpers must attend tutorial at another site first u The tutorial does represent some load on the testbed (own VO & cert. creation) u For the future n Hands-on exercises are a test suite - automate and run with the nightly checks n The material must be kept up to date with each public release of the software s We need to nominate people responsible for the different chapters of the tutorial to be responsible for ensuring the slides and exercises are kept up to date Bob Jones – 3/16/2018 - n° 17

Summary u Addressing the serious bugs found by the application groups on the testbed Summary u Addressing the serious bugs found by the application groups on the testbed is the task with the highest priority u Testing activities need more resources u Test-bed u Future u We support is becoming a more important task releases must continue to address the needs of the application groups need to clarify the following points during this conference: n Autobuild status & how automate installation & configuration n Contents of the test-suites n Release plans until the next EU review In short: What we are doing is right, we are just going about it in a sloppy manner Need to go one step at a time and ensure each step works Bob Jones – 3/16/2018 - n° 18