e61e0acee91a5e2ec7551bb1ea1c7536.ppt
- Количество слайдов: 32
SAMGrid: Future Plans Rick St. Denis, University of Glasgow • CDF Accepts the Need for the Grid – Requirements • D 0 Relies on the Grid – Requirements • How to Meet the Need – Status of SAMGrid – The Grid Tools 19 February 2004 SAMGrid Project Review
Spokespersons’ Requirements for CDF Maximize physics output @ low Lumi –L 3 output rate: 80 -> 360 Hz by 06 CDF needs the Grid Finance Director’s review, International Committee: 50% computing outside FNAL CDFGrid supported by FNAL PAC 19 February 2004 SAMGrid Project Review
Scale of CDF Requirements THz FY 04 3. 7 %offsite CPU Speed 25% 3 GHz #duals FY 05 9. 0 50% 5 GHz +360 FY 06 16. 5 50% 8 GHz +220 150 6 -7 sites, 100 Duals each, by 2006 + 700 @FNAL 19 February 2004 SAMGrid Project Review
What can 20 duals and 6 TB do? Stream Events Days Input Size A: Top, W/Z 20. 5 M 10. 3 4. 5 TB H: Hadronic 156 M B and charm 34. 2 TB 78. 3 Need to transfer 0. 6 GB/min or 1 TB/Day 19 February 2004 SAMGrid Project Review
CDF Computing Model • Develop Analysis on desktop – Access to all CDF data from anywhere • Large scale processing on batch clusters – Submission from anywhere Implemented Now with – interactive tools: ls, top, head/tail/cat CAF – Output to scratch space or desktop 19 February 2004 SAMGrid Project Review
Use Cases for Summer 2004 • User Level MC Production – All CDF Users have access – No data on site -> SAM write SAM Essential for Summer 2004 • User Level Data Access – All users have access – Selected samples on site: Full SAM Support 19 February 2004 SAMGrid Project Review
Medium Term Vision • Many Sites • Fully transparent submission to all of CDF resources: 75% FNAL, 25% outside • Fully transparent input and output of data • Farm future: not as a special facility 19 February 2004 SAMGrid Project Review
Summer 04 Functionality • User selects submission site, saying what dataset they will use • System checks they can do this (privileges) • User access with SAM/d. Cache • User registers output with SAM 19 February 2004 SAMGrid Project Review
CDF Grid from a a User Perspective CDFGrid from User Perspective CAF Gui/CLI Uses SAM AC++ Grid Italy Toronto Korea 19 February 2004 Only. Grid. Lab Outside Fermilab Taiwan Fermi. CAF SAMGrid Project Review UK
October 04 • To extend beyond 25% outside computing JIM is essential: JIM Test for CDF June 04, production October 04 • HOWEVER: It already seems that the 25% resources are not sufficient for the production passes: will want JIM earlier. 19 February 2004 SAMGrid Project Review
CDF Grid Strategy • 25% of CDF Computing from external resources. All CDF computing on CDF Grid by April 15: Utilize resources fully controlled by CDF: Kerberos/fbsng: d. CAF + SAM • October 15, 2004: JIM to capture shared resources • June 2005: 50% of Computing resources external 19 February 2004 SAMGrid Project Review
D 0 Priorities • Mc production using JIM • Reprocessing using as many DH tools as possible • Analysis • Remote is 10% now. • All MC and 20% of reprocessing is Now offsite 19 February 2004 SAMGrid Project Review
Meeting the Needs • • • Progress in SAM JIM Status Run. Job CDFGrid. Workshop: “Nerd’s Paradise” Strict Project Management and process to respond to operational issues 19 February 2004 SAMGrid Project Review
In the near term future: JIM Adding Grid Standard Tools 19 February 2004 SAMGrid Project Review
Anywhere @ each site Desktop Simple JIM Private LAN Globus GK CAF Submitter SAM Station @regional centers Condor Submitter WN Private LAN d. Cache @FNAL SAM DB Condor Matchmaker 19 February 2004 SAMGrid Project Review June 2004 testing June 2005 required
Detailed JIM User Interface Flow of: job User Interface Submission Global Job Queue data meta-data User Interface Submission Resource Selector Match Making Global DH Services Info Gatherer SAM Naming Server Info Collector Grid Client SAM Log Server Resource Optimizer MSS Cluster Data Handling Grid Gateway SAM DB Server Site RC Local Job Handling SAM Station (+other servs) Cache SAM Stager(s) Local Job Handler (CAF, D 0 MC, BS, . . . ) JIM Advertise Dist. FS 19 AAA February 2004 Worker Nodes Meta. Data Catalog Bookkeeping Service Info Manager MDS Info Providers Web Serv Grid Monitoring XML DB server Site Conf. Glob/Loc JID map SAMGrid Project Review. . . User Tools Site
Progress in SAM • Dbserver, the database server between applications and Oracle, was upgraded to use a common schema for CDF and D 0. • All CDF data files are in SAM • Sam in is in beta testing on the CDF CAF (1200 cpus): passed 20 TB/Day delivery • Minos uses SAM for its Data Handling • Steve Mrenna (Phenomenology) depositing ALPGEN files in SAM for common CDF/D 0 use. 19 February 2004 SAMGrid Project Review
Planned Sam Projects Not yet started… 19 February 2004 SAMGrid Project Review
Planned Sam Project • MC/farm Requests – Merge systems of MC request with Farm Request: Eliminate double work. • Autodestinations – Awkward to use, being discussed in design. 19 February 2004 SAMGrid Project Review
Assimilating/Disseminating the Grid SAMGrid, ARDA, GRIDPP 2, Grid 3+, PPDG 4/5 and making them aware of us 19 February 2004 SAMGrid Project Review
How to Go Grid Standard • GGF participation • Internal implementation of interfaces and standards • Workshop participation in our strong areas where the Grid has a vacuum • Projects to deploy well-defined standards. 19 February 2004 SAMGrid Project Review
GGF • Programme committee for workshop on nuclear and particle physics applications: Paper exists with catalog of GGF groups and how their interests overlap with ours • Workshop on use cases: paper failed acceptance, learned too late: prepare for next time 19 February 2004 SAMGrid Project Review
Grid. PP 2 • Bid will reward us with positions; recognized a need for a metadata task force : strong interest in SAM solution. • ARDA problematic; agree there is a standard, and mine is the right one to start from. • There is more to grid than ARDA. 19 February 2004 SAMGrid Project Review
Project to Grid. Project • Chains and Links/SBIRII • Caching • Schema rationalized 19 February 2004 SAMGrid Project Review
Chains and Links/SBIR-II • • • Query language to pursue for the Grid, SBIRII forces well-defined interface Within the Metadata workshop context: Priority for Grid high, Oversubscription of key CDF/D 0 personnel: great opportunity, could lose it. 19 February 2004 SAMGrid Project Review
Caching • SRM interface within SAM to SAM cache • SRM interface to dcache • Application of caching according to requirements: multiple local sam caches, multiple dcache-linked caches as a sam cache, capitalize on redundant distributed cache (nb. worker nodes) 19 February 2004 SAMGrid Project Review
Dcache and SAM • Dcache shapes traffic into disk: If a SAM cache is large, need to use Dcache instead of nfs mounts • Dcache gives the user what is requested. 1 TB gets same priority as 1 GB: CDF users must send email requesting data to be staged. • SAM examines consumption rate before staging next files – No EMAIL needed. • SAM uses Dcache for its Caching at FNAL. • This needs further work with SRM 19 February 2004 SAMGrid Project Review
SAM Schema Modularization • • • Modularize the schema, Modularize the API, Define interfaces, Migrate, Easier project management. 19 February 2004 SAMGrid Project Review
Link of Schema to Cache • Dcache without tape but with postgres becomes Local Replica Catalog • Need protocol to connect to central (virtual) database (performance) • CSS-DSG - CCF Group strong interactions • Awareness of Grid • Pull out local replica API, cache in SAM schema 19 February 2004 SAMGrid Project Review
Authorization and Accounting • Work with CMS • Gabriele and Virginia Tech • Virtual Organizations/VOX – can we have a solution soon? 19 February 2004 SAMGrid Project Review
Monitoring • XML-based information gathering • Grid Mechanisms : MONAlisa • Look at making components with hooks: – C++ API – Caching interfaces – DBServer 19 February 2004 SAMGrid Project Review
Conclusions • D 0 and CDF reliant on Grid • JIM deployment on Track for March and June milestones. • FNAL has a huge role to play in the Grid if it can work together and technically address problems. 19 February 2004 SAMGrid Project Review