Milano Tier 2: Components and Monitoring
Luca Vaccarossa
Milano, 14 December 2007
User Interface (UI)
• The machine that provides the commands for submitting jobs to the Grid
• voms-proxy-init / grid-proxy-init
• edg-job-submit
User Interface (UI)
• atlfarm008.mi.infn.it
• atlfarm010.mi.infn.it
• grid008.mi.infn.it
Computing Element
• t2-ce-01.mi.infn.it
• Grid gateway
• PBS server (TORQUE)
• MAUI scheduler
Computing Element
The farm's batch system is Torque + Maui. The queues enabled for local users are:
• local (max CPU time 48 h, max walltime 72 h)
• short (short queue with reserved CPUs, max CPU time 40 min, max walltime 2 h)
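As an illustration of how the two queue limits interact, the following is a minimal sketch that picks the most restrictive queue a job fits into. The limits come from the slide; the `Queue` class and the `pick_queue` helper are hypothetical names introduced here, not part of Torque or Maui.

```python
from dataclasses import dataclass
from datetime import timedelta
from typing import Optional


@dataclass(frozen=True)
class Queue:
    name: str
    max_cpu: timedelta   # maximum CPU time allowed in the queue
    max_wall: timedelta  # maximum walltime allowed in the queue


# Limits as stated on the slide, ordered from most to least restrictive.
QUEUES = [
    Queue("short", timedelta(minutes=40), timedelta(hours=2)),
    Queue("local", timedelta(hours=48), timedelta(hours=72)),
]


def pick_queue(cpu: timedelta, wall: timedelta) -> Optional[str]:
    """Return the first queue whose limits cover the job, or None if none fits."""
    for queue in QUEUES:
        if cpu <= queue.max_cpu and wall <= queue.max_wall:
            return queue.name
    return None


if __name__ == "__main__":
    print(pick_queue(timedelta(minutes=30), timedelta(hours=1)))
    print(pick_queue(timedelta(hours=10), timedelta(hours=12)))
    print(pick_queue(timedelta(hours=50), timedelta(hours=60)))
```

The chosen name could then be passed to Torque at submission time, for example `qsub -q short job.sh`, where `job.sh` is a placeholder script name.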
Worker Nodes (WN)
• grid009.mi.infn.it
• grid012.mi.infn.it
• grid016.mi.infn.it
• grid017.mi.infn.it
• grid018.mi.infn.it
• grid019.mi.infn.it
• grid021.mi.infn.it
• grid022.mi.infn.it
• grid023.mi.infn.it
• grid024.mi.infn.it
• grid025.mi.infn.it
• grid026.mi.infn.it
• t2-wn-02.mi.infn.it
• t2-wn-03.mi.infn.it
• t2-wn-04.mi.infn.it
• t2-wn-05.mi.infn.it
Worker Nodes (WN)
• t2-wn-06.mi.infn.it
• t2-wn-07.mi.infn.it
• t2-wn-08.mi.infn.it
• t2-wn-09.mi.infn.it
• t2-wn-13.mi.infn.it
• t2-wn-14.mi.infn.it
• t2-wn-15.mi.infn.it
• t2-wn-16.mi.infn.it
• t2-wn-17.mi.infn.it
• t2-wn-18.mi.infn.it
• t2-wn-19.mi.infn.it
• t2-wn-21.mi.infn.it
• t2-wn-22.mi.infn.it
• t2-wn-23.mi.infn.it
• t2-wn-24.mi.infn.it
PBS Commands
• showq: show job status and some job info
• showbf [-v]: check for immediately available CPUs and nodes
• checkjob [-v]
PBS Commands
• pbsnodes -a | less
• Shows which WNs have no jobs
• Report them to grid-help@mi.infn.it
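Scanning the `pbsnodes -a` listing by eye can be automated. The sketch below assumes the usual Torque layout, in which each node name starts at column 0, its attributes follow as indented `key = value` lines, and a node with no running jobs has no `jobs` line. The sample text, including the job IDs, is hand-written for illustration and is not real output from this farm; `parse_pbsnodes` and `idle_nodes` are hypothetical helper names.

```python
# Hand-written sample in the style of `pbsnodes -a` output (not real data).
SAMPLE = """\
t2-wn-02.mi.infn.it
     state = free
     np = 4
     ntype = cluster
     jobs = 0/1201.t2-ce-01.mi.infn.it, 1/1202.t2-ce-01.mi.infn.it

t2-wn-03.mi.infn.it
     state = free
     np = 4
     ntype = cluster

t2-wn-04.mi.infn.it
     state = down,offline
     np = 4
     ntype = cluster
"""


def parse_pbsnodes(text):
    """Parse node blocks into {node_name: {attribute: value}}."""
    nodes = {}
    current = None
    for line in text.splitlines():
        if not line.strip():
            continue  # blank line between node blocks
        if not line[0].isspace():
            # An unindented line starts a new node block.
            current = line.strip()
            nodes[current] = {}
        elif current is not None and "=" in line:
            key, _, value = line.partition("=")
            nodes[current][key.strip()] = value.strip()
    return nodes


def idle_nodes(nodes):
    """Return (name, state) for every node that lists no jobs."""
    return [
        (name, attrs.get("state", "unknown"))
        for name, attrs in nodes.items()
        if not attrs.get("jobs")
    ]


if __name__ == "__main__":
    for name, state in idle_nodes(parse_pbsnodes(SAMPLE)):
        print(f"{name}: no jobs (state: {state})")
```

To run it against a live farm, the sample string could be replaced with the captured output of the real command, for instance via `subprocess.run(["pbsnodes", "-a"], capture_output=True, text=True).stdout`. Reporting the state alongside the name helps distinguish a node that is merely idle from one that is down.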
Priority and Fair Share
• Priority: diagnose -p
  http://tier2.mi.infn.it/priorita.txt
• Fair share (FS): diagnose -f
  http://tier2.mi.infn.it/fairshare.txt
Who am I?
• "/C=IT/O=INFN/OU=Personal Certificate/L=Milano/CN=Silvia Resconi/Email=Silvia.Resconi@mi.infn.it" resconi
• "/C=IT/O=INFN/OU=Personal Certificate/L=Milano/CN=Tommaso Lari" lari
Who am I?
• "/C=IT/O=INFN/OU=Personal Certificate/L=Milano/CN=Attilio Andreazza" andreazz
• "/C=IT/O=INFN/OU=Personal Certificate/L=Milano/CN=Clara Troncon" troncon
• "/C=IT/O=INFN/OU=Personal Certificate/L=Milano/CN=Leonardo Carminati" lcarmina
Who am I?
• "/C=IT/O=INFN/OU=Personal Certificate/L=Milano/CN=Donatella Cavalli" cavalli
• "/C=IT/O=INFN/OU=Personal Certificate/L=Milano/CN=Caterina Pizio" pizio
Who am I?
• "/C=IT/O=INFN/OU=Personal Certificate/L=Milano/CN=Umberto De Sanctis" atlas012
• "/C=IT/O=INFN/OU=Personal Certificate/L=Milano/CN=Simone Montesano" atlas020
Who am I?
• "/C=IT/O=INFN/OU=Personal Certificate/L=Milano/CN=Chiara Tamarindi" atlas033
• "/C=IT/O=INFN/OU=Personal Certificate/L=Genova/CN=Fabrizio Parodi" parodi
• "/C=IT/O=INFN/OU=Personal Certificate/L=Genova/CN=Bianca Osculati" osculati
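The pairs listed on these slides have the shape of a quoted certificate subject (DN) followed by a local account name. Reading them as entries of a grid-mapfile-style mapping is an assumption, since the slides do not name the file. Under that assumption, a minimal sketch for looking up which account a DN maps to could be the following; `parse_mapfile` and `common_name` are hypothetical helper names, and the sample reuses three entries from the slides.

```python
import shlex

# Three entries copied from the slides, one "DN" account pair per line.
SAMPLE_MAPFILE = '''\
"/C=IT/O=INFN/OU=Personal Certificate/L=Milano/CN=Tommaso Lari" lari
"/C=IT/O=INFN/OU=Personal Certificate/L=Milano/CN=Donatella Cavalli" cavalli
"/C=IT/O=INFN/OU=Personal Certificate/L=Genova/CN=Fabrizio Parodi" parodi
'''


def parse_mapfile(text):
    """Return {certificate DN: local account} from '"DN" account' lines."""
    mapping = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blanks and comment lines
        # shlex handles the double-quoted DN, which contains spaces.
        parts = shlex.split(line)
        if len(parts) != 2:
            raise ValueError(f"unexpected mapping line: {line!r}")
        dn, account = parts
        mapping[dn] = account
    return mapping


def common_name(dn):
    """Extract the CN field from a slash-separated DN, or None if absent."""
    for field in dn.strip("/").split("/"):
        key, _, value = field.partition("=")
        if key == "CN":
            return value
    return None


if __name__ == "__main__":
    for dn, account in parse_mapfile(SAMPLE_MAPFILE).items():
        print(f"{common_name(dn)} -> {account}")
```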
GridView
• http://gridview.cern.ch/GRIDVIEW/
• Monitoring and Visualization Tool for LCG
• Data Transfer
• Job Status
• Service Availability
SAM Tests
• https://lcg-sam.cern.ch:8443/sam.py
• Certificate in the browser
• Automatic tests
• SAM on demand? https://cic.gridops.org/index.php?section=rc&page=samadmin
Ganglia
• http://ganglia.sourceforge.net/
• "Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and Grids."
Ganglia
• It relies on a multicast-based listen/announce protocol to monitor state within clusters and uses a tree of point-to-point connections amongst representative cluster nodes to federate clusters and aggregate their state.
• It leverages widely used technologies such as XML for data representation, XDR for compact, portable data transport, and RRDtool for data storage and visualization.
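To make the "XML for data representation" point concrete, here is a minimal sketch that reads per-host metrics from a Ganglia-style XML document. The sample is hand-written and heavily simplified (real gmond output carries many more attributes and metrics), the host names and values are invented, and `host_metrics` and `mean_metric` are hypothetical helper names.

```python
import xml.etree.ElementTree as ET

# Simplified, hand-written sample in the style of gmond's XML report.
SAMPLE_XML = """\
<GANGLIA_XML VERSION="3.0.5" SOURCE="gmond">
  <CLUSTER NAME="example-cluster">
    <HOST NAME="wn-a.example.org" IP="192.0.2.11">
      <METRIC NAME="load_one" VAL="0.50" TYPE="float" UNITS=""/>
      <METRIC NAME="cpu_num" VAL="4" TYPE="uint16" UNITS="CPUs"/>
    </HOST>
    <HOST NAME="wn-b.example.org" IP="192.0.2.12">
      <METRIC NAME="load_one" VAL="1.50" TYPE="float" UNITS=""/>
      <METRIC NAME="cpu_num" VAL="4" TYPE="uint16" UNITS="CPUs"/>
    </HOST>
  </CLUSTER>
</GANGLIA_XML>
"""


def host_metrics(xml_text):
    """Return {host name: {metric name: raw string value}}."""
    root = ET.fromstring(xml_text)
    result = {}
    for host in root.iter("HOST"):
        result[host.get("NAME")] = {
            metric.get("NAME"): metric.get("VAL")
            for metric in host.findall("METRIC")
        }
    return result


def mean_metric(metrics, name):
    """Average a numeric metric over the hosts that report it."""
    values = [float(m[name]) for m in metrics.values() if name in m]
    return sum(values) / len(values) if values else None


if __name__ == "__main__":
    metrics = host_metrics(SAMPLE_XML)
    for host, values in metrics.items():
        print(host, values)
    print("mean load_one:", mean_metric(metrics, "load_one"))
```

On a live installation the XML would typically be read from a gmond daemon over a TCP connection (port 8649 is the usual default, though a site may configure it differently) rather than from a string.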