Скачать презентацию Solaris status and plans HEPIX Autumn 2003 Ignacio Скачать презентацию Solaris status and plans HEPIX Autumn 2003 Ignacio

df9845035692b37fe15319abd58a8ace.ppt

  • Количество слайдов: 20

Solaris status and plans HEPIX Autumn 2003 Ignacio Reguero, Michel Manent, Carlos Ungil presented Solaris status and plans HEPIX Autumn 2003 Ignacio Reguero, Michel Manent, Carlos Ungil presented by Sebastian Lopienski 21 October 2003 CERN IT-PS-UI Solaris status and plans 1

Executive Summary • Current Status – Some figures • SUNINST 0 Network Installation Server Executive Summary • Current Status – Some figures • SUNINST 0 Network Installation Server • CAE Server Upgrade • SUNDEV Technology Refresh – New 10 Sun 210 s • Implementation of EDG WP 4 Quattor fabric management on Solaris – System administration view • Solaris 9 Certification • Sun Blade Server 1600 and N 1 Management 21 October 2003 CERN IT-PS-UI Solaris status and plans 2

Current Status of Solaris Usage at CERN • Second platform for LHC physics – Current Status of Solaris Usage at CERN • Second platform for LHC physics – Mostly for validation purposes (numerical software) • Total population of 663 Active nodes – Figures from Lan. DB network database • Around 300 on Solaris 8 • Rest: about a half running Solaris 2. 6 and a half on Solaris 7 – Problem: Most of these machines cannot upgrade OS without hardware upgrade (disks and memory) 21 October 2003 CERN IT-PS-UI Solaris status and plans 3

SUNINST 0 Network Installation Server • Jumpstart server • Network configurations + responsible now SUNINST 0 Network Installation Server • Jumpstart server • Network configurations + responsible now fully extracted from LANDB network database – With single fetch procedure • After router fix Sun DHCP server is stable – On request of SM 18 LHC Magnet Test had to demonstrate boot of exotic devices (like data acquisition devices) from it • However, still working with CS group to replace DHCP server with the one of CS – SOAP interface to an Oracle DB will allow us to update • Similar than the one in place for Print DNS hierarchy 21 October 2003 CERN IT-PS-UI Solaris status and plans 4

CAE Server Upgrade (1) • CAE - Electronic design cluster • To serve the CAE Server Upgrade (1) • CAE - Electronic design cluster • To serve the electronics design community • New server: V 480 – 4 x 900 MHz CPU – 8 Gb SDRAM – Gigabit ethernet • A 1000 RAID Disk box – 436 Gb with RAID 5 – space for users • Had to coordinate IDPROM change with disk movement – Hit technical and sociological problems – IDPROM change: a need to keep old Cadence licenses on new server - but SUN reluctant to do it for new HW models – At the end provided ad hoc solution that does not support OBP upgrade • Found the hard way! • And need OBP upgrade for A 1000 support 21 October 2003 CERN IT-PS-UI Solaris status and plans 5

CAE Server Upgrade (2) • IDPROM problem not solved yet – Considering to use CAE Server Upgrade (2) • IDPROM problem not solved yet – Considering to use other machine • Cannot be too new (works on V 220 or 280 R) • Also lots of A 1000 RAID box problems – RAID manager software has to be coordinated with firmware level in the controller and OBP • So lots of upgrades required before connecting old A 1000 s to new server – After first installation, additional A 1000 s are not seen unless adding entries by hand to /kernel/drv/sd. conf with the relevant SCSI ID 21 October 2003 CERN IT-PS-UI Solaris status and plans 6

CAE Server Upgrade 21 October 2003 CERN IT-PS-UI Solaris status and plans 7 CAE Server Upgrade 21 October 2003 CERN IT-PS-UI Solaris status and plans 7

SUNDEV Technology Refresh • A cluster for physics development • 10 Sun Fire V SUNDEV Technology Refresh • A cluster for physics development • 10 Sun Fire V 210 – State of the art SPARC machines – Thin Rack mountable servers • 1 unit on 19” racks • They all fit in a single CERN rack together with Gigabit switch and Sun blade server – – Dual 1 GHz Ultra. SPARC-IIIi 2 Gb memory 2 x 36 GB Disk drives 4 x GIGABIT Ethernet on the motherboard • They are being installed on Solaris 8. 7. 3 (latest required for this hardware), later Solaris 9 • Performance improvement at least 120% over the current SUNDEV machines 21 October 2003 CERN IT-PS-UI Solaris status and plans 8

SUNDEV Technology Refresh 21 October 2003 CERN IT-PS-UI Solaris status and plans 9 SUNDEV Technology Refresh 21 October 2003 CERN IT-PS-UI Solaris status and plans 9

Implementation of EDG WP 4 Quattor fabric management on Solaris (1) • We plan Implementation of EDG WP 4 Quattor fabric management on Solaris (1) • We plan to use Quattor to manage all Solaris systems – From Solaris 9 onwards • What does it mean for us? – Central Configuration Data. Base (CDB) • Configuration information • Software to be installed – Both applications and system • A cache manager provided for the client accessing the DB – To avoid dependency on the DB server or on the network • The configuration database is linked to the network installation server – The Jumpstart profile is generated from the database 21 October 2003 CERN IT-PS-UI Solaris status and plans 10

Implementation of EDG WP 4 Quattor fabric management on Solaris (2) – Node Configuration Implementation of EDG WP 4 Quattor fabric management on Solaris (2) – Node Configuration Manager (NCM) • A la SUE • For configuration “components” – Simplified SUE features • NCM components are simplified SUE features • They have single action: “configure” • They access Configuration DB through the cache manager – SPMA software distributor (package level) • Replaces ASIS software distribution (file level) • For Linux it uses RPMs, for Solaris implemented with Solaris PKG • Allows to install packages from various SW repositories • Several protocols supported: HTTP, file system (AFS), FTP, etc. 21 October 2003 CERN IT-PS-UI Solaris status and plans 11

Implementation of EDG WP 4 Quattor fabric management on Solaris (3) • Still working Implementation of EDG WP 4 Quattor fabric management on Solaris (3) • Still working on – Creation of Solaris NCM Components from existing SUE features (Juan Pelegrin) – DB Access Control • For delegation – Behavior with “unmanaged” software xml CDB pan Host NCM REPOSITORY PKG REPOSITORY SPMA REPOSITORY target. cf 21 October 2003 CERN IT-PS-UI Solaris status and plans 12

Solaris 9 Certification • Validating Solaris 9 – running all SW on the new Solaris 9 Certification • Validating Solaris 9 – running all SW on the new system • Timescale for the end of 2003 – Refsol 9 reference machine now available • Not big changes in terms of Solaris, but new features: – “Web Start Flash Archives”: system images for installation • Nice for farms (but for same HW) – Resource pools • Guaranteed resources for an application on large shared systems – Gnome 2. 0 is the standard desktop environment – We deliver Mozilla 1. 4 (instead of Netscape recomm. by Sun) – Sun ONE Studio 8 as default compiler • Replacement of ASIS and SUE with Quattor • More Open Source software packaged with the system – Perl, Bash, … – Some of these products supported on same basis as SUN native ones – Probably occasion to reduce the number of products maintained by us 21 October 2003 CERN IT-PS-UI Solaris status and plans 13

Solaris Reference machines + Installation Server 21 October 2003 CERN IT-PS-UI Solaris status and Solaris Reference machines + Installation Server 21 October 2003 CERN IT-PS-UI Solaris status and plans 14

Sun Blade Server 1600 • Sun Blade server 1600 – Packaged farm – Fits Sun Blade Server 1600 • Sun Blade server 1600 – Packaged farm – Fits in 3 units of a 19” rack – SSC Controller with gigabit switch that manages up to 16 CPUs • • Several Gigabit Ethernet external connections VLAN with 16 Gigabit Ethernet Interface Protection attack by Packet Filter configuration Console through Serial Port for each Blade • Received 12 X 650 MHz Ultra. SPARC-IIe • Waiting for 4 “Intel Compatible” CPUs – AMD Athlon XP-M 1. 2 GHz • Other Specialized Blades supported on hardware level – SSL Encryptor – Load Balancer 21 October 2003 CERN IT-PS-UI Solaris status and plans 15

Sun Blade Server 1600 system chassis External Switch SSC 0 Switch Fabric (active) (standby) Sun Blade Server 1600 system chassis External Switch SSC 0 Switch Fabric (active) (standby) Slot 0……s 15 137. 138. x. x (ce 0) SSC 1 Slot 0……s 15 137. 138. x. x (ce 1) 137. 138. x. x (ce 0) 137. 138. x. x (ce 1) Blades 0……. 15 21 October 2003 CERN IT-PS-UI Solaris status and plans 16

Sun Blade Server 1600 21 October 2003 CERN IT-PS-UI Solaris status and plans 17 Sun Blade Server 1600 21 October 2003 CERN IT-PS-UI Solaris status and plans 17

Sun Blade Server 1600 Installation and Configuration • Fully automated network installation (DHCP) using Sun Blade Server 1600 Installation and Configuration • Fully automated network installation (DHCP) using Jumpstart from SUNINST 0 – Initial configuration, installation & application software • One private IP address for each System Controller • One IP address for each Blade • Ongoing Test of Web Start Flash Archives – Quick replicate one Blade’s operating environment & application software on other Blades intended • Sun VTS (Validation Test Suite) online diagnostics tool – verifies configuration and functionality of hardware controllers, devices and platforms 21 October 2003 CERN IT-PS-UI Solaris status and plans 18

Sun N 1 System Management Framework • Sun N 1 Provisioning server 3. 0 Sun N 1 System Management Framework • Sun N 1 Provisioning server 3. 0 Blade Edition being tested – Automates configuration and deployment different kinds of servers • Including specialized servers • Assignment may vary according to a schedule or other input – dynamic management of clusters • To compare N 1 with Quattor functionality • Question: could N 1 manage heterogeneous farms out the Blade server scope? 21 October 2003 CERN IT-PS-UI Solaris status and plans 19

Questions? Unix Infrastructure section: http: //cern. ch/product-support/UI Ignacio. Reguero@cern. ch Michel. Manent@cern. ch Carlos. Questions? Unix Infrastructure section: http: //cern. ch/product-support/UI Ignacio. Reguero@cern. ch Michel. Manent@cern. ch Carlos. Ungil@cern. ch 21 October 2003 CERN IT-PS-UI Solaris status and plans 20