Скачать презентацию SAMGrid Future Plans Rick St Denis University of Скачать презентацию SAMGrid Future Plans Rick St Denis University of

e61e0acee91a5e2ec7551bb1ea1c7536.ppt

  • Количество слайдов: 32

SAMGrid: Future Plans Rick St. Denis, University of Glasgow • CDF Accepts the Need SAMGrid: Future Plans Rick St. Denis, University of Glasgow • CDF Accepts the Need for the Grid – Requirements • D 0 Relies on the Grid – Requirements • How to Meet the Need – Status of SAMGrid – The Grid Tools 19 February 2004 SAMGrid Project Review

Spokespersons’ Requirements for CDF Maximize physics output @ low Lumi –L 3 output rate: Spokespersons’ Requirements for CDF Maximize physics output @ low Lumi –L 3 output rate: 80 -> 360 Hz by 06 CDF needs the Grid Finance Director’s review, International Committee: 50% computing outside FNAL CDFGrid supported by FNAL PAC 19 February 2004 SAMGrid Project Review

Scale of CDF Requirements THz FY 04 3. 7 %offsite CPU Speed 25% 3 Scale of CDF Requirements THz FY 04 3. 7 %offsite CPU Speed 25% 3 GHz #duals FY 05 9. 0 50% 5 GHz +360 FY 06 16. 5 50% 8 GHz +220 150 6 -7 sites, 100 Duals each, by 2006 + 700 @FNAL 19 February 2004 SAMGrid Project Review

What can 20 duals and 6 TB do? Stream Events Days Input Size A: What can 20 duals and 6 TB do? Stream Events Days Input Size A: Top, W/Z 20. 5 M 10. 3 4. 5 TB H: Hadronic 156 M B and charm 34. 2 TB 78. 3 Need to transfer 0. 6 GB/min or 1 TB/Day 19 February 2004 SAMGrid Project Review

CDF Computing Model • Develop Analysis on desktop – Access to all CDF data CDF Computing Model • Develop Analysis on desktop – Access to all CDF data from anywhere • Large scale processing on batch clusters – Submission from anywhere Implemented Now with – interactive tools: ls, top, head/tail/cat CAF – Output to scratch space or desktop 19 February 2004 SAMGrid Project Review

Use Cases for Summer 2004 • User Level MC Production – All CDF Users Use Cases for Summer 2004 • User Level MC Production – All CDF Users have access – No data on site -> SAM write SAM Essential for Summer 2004 • User Level Data Access – All users have access – Selected samples on site: Full SAM Support 19 February 2004 SAMGrid Project Review

Medium Term Vision • Many Sites • Fully transparent submission to all of CDF Medium Term Vision • Many Sites • Fully transparent submission to all of CDF resources: 75% FNAL, 25% outside • Fully transparent input and output of data • Farm future: not as a special facility 19 February 2004 SAMGrid Project Review

Summer 04 Functionality • User selects submission site, saying what dataset they will use Summer 04 Functionality • User selects submission site, saying what dataset they will use • System checks they can do this (privileges) • User access with SAM/d. Cache • User registers output with SAM 19 February 2004 SAMGrid Project Review

CDF Grid from a a User Perspective CDFGrid from User Perspective CAF Gui/CLI Uses CDF Grid from a a User Perspective CDFGrid from User Perspective CAF Gui/CLI Uses SAM AC++ Grid Italy Toronto Korea 19 February 2004 Only. Grid. Lab Outside Fermilab Taiwan Fermi. CAF SAMGrid Project Review UK

October 04 • To extend beyond 25% outside computing JIM is essential: JIM Test October 04 • To extend beyond 25% outside computing JIM is essential: JIM Test for CDF June 04, production October 04 • HOWEVER: It already seems that the 25% resources are not sufficient for the production passes: will want JIM earlier. 19 February 2004 SAMGrid Project Review

CDF Grid Strategy • 25% of CDF Computing from external resources. All CDF computing CDF Grid Strategy • 25% of CDF Computing from external resources. All CDF computing on CDF Grid by April 15: Utilize resources fully controlled by CDF: Kerberos/fbsng: d. CAF + SAM • October 15, 2004: JIM to capture shared resources • June 2005: 50% of Computing resources external 19 February 2004 SAMGrid Project Review

D 0 Priorities • Mc production using JIM • Reprocessing using as many DH D 0 Priorities • Mc production using JIM • Reprocessing using as many DH tools as possible • Analysis • Remote is 10% now. • All MC and 20% of reprocessing is Now offsite 19 February 2004 SAMGrid Project Review

Meeting the Needs • • • Progress in SAM JIM Status Run. Job CDFGrid. Meeting the Needs • • • Progress in SAM JIM Status Run. Job CDFGrid. Workshop: “Nerd’s Paradise” Strict Project Management and process to respond to operational issues 19 February 2004 SAMGrid Project Review

In the near term future: JIM Adding Grid Standard Tools 19 February 2004 SAMGrid In the near term future: JIM Adding Grid Standard Tools 19 February 2004 SAMGrid Project Review

Anywhere @ each site Desktop Simple JIM Private LAN Globus GK CAF Submitter SAM Anywhere @ each site Desktop Simple JIM Private LAN Globus GK CAF Submitter SAM Station @regional centers Condor Submitter WN Private LAN d. Cache @FNAL SAM DB Condor Matchmaker 19 February 2004 SAMGrid Project Review June 2004 testing June 2005 required

Detailed JIM User Interface Flow of: job User Interface Submission Global Job Queue data Detailed JIM User Interface Flow of: job User Interface Submission Global Job Queue data meta-data User Interface Submission Resource Selector Match Making Global DH Services Info Gatherer SAM Naming Server Info Collector Grid Client SAM Log Server Resource Optimizer MSS Cluster Data Handling Grid Gateway SAM DB Server Site RC Local Job Handling SAM Station (+other servs) Cache SAM Stager(s) Local Job Handler (CAF, D 0 MC, BS, . . . ) JIM Advertise Dist. FS 19 AAA February 2004 Worker Nodes Meta. Data Catalog Bookkeeping Service Info Manager MDS Info Providers Web Serv Grid Monitoring XML DB server Site Conf. Glob/Loc JID map SAMGrid Project Review. . . User Tools Site

Progress in SAM • Dbserver, the database server between applications and Oracle, was upgraded Progress in SAM • Dbserver, the database server between applications and Oracle, was upgraded to use a common schema for CDF and D 0. • All CDF data files are in SAM • Sam in is in beta testing on the CDF CAF (1200 cpus): passed 20 TB/Day delivery • Minos uses SAM for its Data Handling • Steve Mrenna (Phenomenology) depositing ALPGEN files in SAM for common CDF/D 0 use. 19 February 2004 SAMGrid Project Review

Planned Sam Projects Not yet started… 19 February 2004 SAMGrid Project Review Planned Sam Projects Not yet started… 19 February 2004 SAMGrid Project Review

Planned Sam Project • MC/farm Requests – Merge systems of MC request with Farm Planned Sam Project • MC/farm Requests – Merge systems of MC request with Farm Request: Eliminate double work. • Autodestinations – Awkward to use, being discussed in design. 19 February 2004 SAMGrid Project Review

Assimilating/Disseminating the Grid SAMGrid, ARDA, GRIDPP 2, Grid 3+, PPDG 4/5 and making them Assimilating/Disseminating the Grid SAMGrid, ARDA, GRIDPP 2, Grid 3+, PPDG 4/5 and making them aware of us 19 February 2004 SAMGrid Project Review

How to Go Grid Standard • GGF participation • Internal implementation of interfaces and How to Go Grid Standard • GGF participation • Internal implementation of interfaces and standards • Workshop participation in our strong areas where the Grid has a vacuum • Projects to deploy well-defined standards. 19 February 2004 SAMGrid Project Review

GGF • Programme committee for workshop on nuclear and particle physics applications: Paper exists GGF • Programme committee for workshop on nuclear and particle physics applications: Paper exists with catalog of GGF groups and how their interests overlap with ours • Workshop on use cases: paper failed acceptance, learned too late: prepare for next time 19 February 2004 SAMGrid Project Review

Grid. PP 2 • Bid will reward us with positions; recognized a need for Grid. PP 2 • Bid will reward us with positions; recognized a need for a metadata task force : strong interest in SAM solution. • ARDA problematic; agree there is a standard, and mine is the right one to start from. • There is more to grid than ARDA. 19 February 2004 SAMGrid Project Review

Project to Grid. Project • Chains and Links/SBIRII • Caching • Schema rationalized 19 Project to Grid. Project • Chains and Links/SBIRII • Caching • Schema rationalized 19 February 2004 SAMGrid Project Review

Chains and Links/SBIR-II • • • Query language to pursue for the Grid, SBIRII Chains and Links/SBIR-II • • • Query language to pursue for the Grid, SBIRII forces well-defined interface Within the Metadata workshop context: Priority for Grid high, Oversubscription of key CDF/D 0 personnel: great opportunity, could lose it. 19 February 2004 SAMGrid Project Review

Caching • SRM interface within SAM to SAM cache • SRM interface to dcache Caching • SRM interface within SAM to SAM cache • SRM interface to dcache • Application of caching according to requirements: multiple local sam caches, multiple dcache-linked caches as a sam cache, capitalize on redundant distributed cache (nb. worker nodes) 19 February 2004 SAMGrid Project Review

Dcache and SAM • Dcache shapes traffic into disk: If a SAM cache is Dcache and SAM • Dcache shapes traffic into disk: If a SAM cache is large, need to use Dcache instead of nfs mounts • Dcache gives the user what is requested. 1 TB gets same priority as 1 GB: CDF users must send email requesting data to be staged. • SAM examines consumption rate before staging next files – No EMAIL needed. • SAM uses Dcache for its Caching at FNAL. • This needs further work with SRM 19 February 2004 SAMGrid Project Review

SAM Schema Modularization • • • Modularize the schema, Modularize the API, Define interfaces, SAM Schema Modularization • • • Modularize the schema, Modularize the API, Define interfaces, Migrate, Easier project management. 19 February 2004 SAMGrid Project Review

Link of Schema to Cache • Dcache without tape but with postgres becomes Local Link of Schema to Cache • Dcache without tape but with postgres becomes Local Replica Catalog • Need protocol to connect to central (virtual) database (performance) • CSS-DSG - CCF Group strong interactions • Awareness of Grid • Pull out local replica API, cache in SAM schema 19 February 2004 SAMGrid Project Review

Authorization and Accounting • Work with CMS • Gabriele and Virginia Tech • Virtual Authorization and Accounting • Work with CMS • Gabriele and Virginia Tech • Virtual Organizations/VOX – can we have a solution soon? 19 February 2004 SAMGrid Project Review

Monitoring • XML-based information gathering • Grid Mechanisms : MONAlisa • Look at making Monitoring • XML-based information gathering • Grid Mechanisms : MONAlisa • Look at making components with hooks: – C++ API – Caching interfaces – DBServer 19 February 2004 SAMGrid Project Review

Conclusions • D 0 and CDF reliant on Grid • JIM deployment on Track Conclusions • D 0 and CDF reliant on Grid • JIM deployment on Track for March and June milestones. • FNAL has a huge role to play in the Grid if it can work together and technically address problems. 19 February 2004 SAMGrid Project Review