Transcript
®
Tiered Adaptive Storage™ Hardware Guide
HR85-8500-A
Cray Private
© 2014 Cray Inc. All Rights Reserved. This manual or parts thereof may not be reproduced in any form unless permitted by contract or by written permission of Cray Inc. U.S. GOVERNMENT RESTRICTED RIGHTS NOTICE: The Computer Software is delivered as “Commercial Computer Software” as defined in DFARS 48 CFR 252.227-7014. All Computer Software and Computer Software Documentation acquired by or for the U.S. Government is provided with Restricted Rights. Use, duplication or disclosure by the U.S. Government is subject to the restrictions described in FAR 48 CFR 52.227-14 or DFARS 48 CFR 252.227-7014, as applicable. Technical Data acquired by or for the U.S. Government, if any, is provided with Limited Rights. Use, duplication or disclosure by the U.S. Government is subject to the restrictions described in FAR 48 CFR 52.227-14 or DFARS 48 CFR 252.227-7013, as applicable. The following are trademarks of Cray Inc. and are registered in the United States and other countries: CRAY and design, SONEXION, URIKA, and YARCDATA. The following are trademarks of Cray Inc.: ACE, APPRENTICE2, CHAPEL, CLUSTER CONNECT, CRAYPAT, CRAYPORT, ECOPHLEX, LIBSCI, NODEKARE, THREADSTORM. The following system family marks, and associated model number marks, are trademarks of Cray Inc.: CS, CX, XC, XE, XK, XMT, and XT. The registered trademark LINUX is used pursuant to a sublicense from LMI, the exclusive licensee of Linus Torvalds, owner of the mark on a worldwide basis. Other trademarks used in this document are the property of their respective owners. Direct requests for copies of publications to: Mail:
Cray Inc. Logistics PO Box 6000 Chippewa Falls, WI 54729-0080 USA
E-mail: Fax:
[email protected] +1 715 726 4602
Direct comments about this publication to: Mail:
Cray Inc.. Customer Documentation and Training P.O. Box 6000 Chippewa Falls, WI 54729-0080 USA
E-mail: Fax:
[email protected] +1 715 726 4991
Record of Revision Publication Number
Description
HR85-8500-A
April 2014 Revision 0 June 2014 Revision A: Added section on TAS Connector.
HR85-8500-A
HR85-8500-A
Cray Private
3
Tiered Adaptive Storage™ Hardware Guide
4
Cray Private
HR85-8500-A
Contents
Contents Chapter 1: Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 About this Document . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1 Purpose and Scope . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1 Definition of a Cray TAS System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1 Product Naming Conventions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .2 Additional Information Sources . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .3 Safety Precautions and Hazards . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .4 Chapter 2: System Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Software Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .7 Hardware Configurations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .7 TAS Gateway System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .7 TAS Connector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .10 Full TAS Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .12 Full TAS Basic Configuration Cabling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .15 MDC PCIe Slot Layout . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .15 DM PCIe Slot Layout . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .15 Metadata Connections . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .16 FC Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .16 IB Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .18 Basic Tape Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .19 FC Full TAS Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .20 IB Full TAS Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .24 Hardware Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .28 Cray Integrated Management Server (CIMS) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .28 Metadata Controller (MDC) and Data Mover (DM) Nodes . . . . . . . . . . . . . . . . . . . . . . .28 Administrative Block Storage (ABS) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .29 Data Cache Block Storage (DCBS) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .29 Data Archive Storage Tape (DAST) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .30 Chapter 3: Site Preparation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 Site Planning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .31 System Specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .31 Shipping Configurations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .33 Site Access Requirements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .34 Computer Room Environment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .35 Electrical Requirements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .37 Equipment Grounding . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .37 Floor Requirements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .38 Chapter 4: Hardware Implementation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 Rack and PDUs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .39 TAS System Rack . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .39 Power Distribution Units (PDUs) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .39 Cray Integrated Management Server (CIMS) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .39 Specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .39 CIMS Controls and Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .41 CIMS IO Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .43
HR85-8500-A
Cray Private
1
Tiered Adaptive Storage™ Hardware Guide
Metadata Controller (MDC) and Data Mover (DM) Nodes . . . . . . . . . . . . . . . . . . . . . . . . . . 44 Specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44 MDC and DM Controls and Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 MDC and DM IO Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48 CIMS, MDC, and DM Nodes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49 Keyboard, Video, and Mouse (KVM) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52 Hotkey Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52 TAS Metadata Block Storage Devices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54 Controller Types . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54 Administrative Block Storage (ABS) SAS Interface, Controls, and Indicators . . . . 54 Data Cache Block Storage (DCBS) FC Interface, Controls, and Indicators . . . . . . . 63 Data Cache Block (DCBS) IB Interface, Controls, and Indicators . . . . . . . . . . . . . . 70 ABS and DCBS LED Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78 Drive Channels . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80 ABS Drive Channels . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80 DCBS (FC) Drive Channels . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81 DCBS (IB) Drive Channels . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82 Disk Drive Trays . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82 ABS 24-bay Drive Tray Controls and Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . . 82 DCBS 60-bay Drive Tray Controls and Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . 86 Tape Archive Storage Tier (TAST) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95 Data Network Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95 InfiniBand Data Network . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96 Switches and Cables (Mellanox SX6036) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96 Switch Platform Hardware Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96 Status LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96 Switch Status LED . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97 Power Supply Status LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98 Bad Port LED . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100 UID LED Switch Identifier . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100 Port Connector LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101 Fibre Channel Data Network . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 Switches and Cables (QLogic 5802V) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 Chassis Controls and LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 Heartbeat LED Blink Patterns . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103 FC Ports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103 Logged-In LED Indications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 Ethernet Port . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 Serial Port . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105 Power Supplies and Fans . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105 Ethernet Management Network . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107 Switches and Cables (Dell PowerConnect 5548) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107 Dell PowerConnect 5548 Ports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107 Buttons and LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108 Gigabit Ports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110 HDMI Port LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110 SFP LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111 Chapter 5: Block Storage Tray ID Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113 ABS (NetApp 2700) and DCBS (NetApp 5500) Tray ID Indicator Codes . . . . . . . . . . . . . . 113
2
Cray Private
HR85-8500-A
Contents
Tray ID Failure Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .113 Controller-Drive Tray Sequence Code Definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . .114 Lock-Down Codes for the Controller-Drive Tray . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .114 Tray ID Indicator Display Use Cases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .116 Tray ID Indicator Display for the Drive Trays . . . . . . . . . . . . . . . . . . . . . . . . . . . .117 DCBS (NetApp 5400) Tray ID Indicator Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .118 Sequence Code Definitions for the Controller-Drive Trays . . . . . . . . . . . . . . . . . . .118 Lock-Down Codes for the Controller-Drive Trays . . . . . . . . . . . . . . . . . . . . . . . . .119 Diagnostic Code Sequences for the Controller-Drive Trays . . . . . . . . . . . . . . . . . .120 Tray ID Indicator Display for the ESMs on the Drive Trays . . . . . . . . . . . . . . . . . .121
HR85-8500-A
Cray Private
3
Tiered Adaptive Storage™ Hardware Guide
4
Cray Private
HR85-8500-A
Contents
List of Figures Figure 2-1: TAS Gateway Minimum Hardware Configuration . . . . . . . . . . . . . . . . . . . . . . . . .8 Figure 2-2: TAS Gateway (Front View) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .8 Figure 2-3: ETHSW1, TAS Gateway . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .9 Figure 2-4: CIMS, TAS Gateway . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .9 Figure 2-5: Secondary MDC 2, TAS Gateway . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .9 Figure 2-6: Primary MDC 1, TAS Gateway . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .9 Figure 2-7: Administrative Block Storage (ABS), TAS Gateway . . . . . . . . . . . . . . . . . . . . . . .9 Figure 2-8: TAS Connector Manager Node with IB Lustre . . . . . . . . . . . . . . . . . . . . . . . . . . .10 Figure 2-9: TAS Connector Agent on a TAS DM with IB Lustre and IB TAS . . . . . . . . . . . .10 Figure 2-10: TAS Connector Manager SAS Connections, Secondary TCM2 . . . . . . . . . . . . .10 Figure 2-11: TAS Connector Manager SAS Connections, Primary TCM 1 . . . . . . . . . . . . . .11 Figure 2-12: TAS Connector Manager SAS Connections, Administrative Block Storage . . .11 Figure 2-13: Full TAS System (Front View) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .13 Figure 2-14: Full TAS System (Rear View) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .14 Figure 2-15: MDC with FC Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .15 Figure 2-16: MDC with IB Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .15 Figure 2-17: DM with FC Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .15 Figure 2-18: DM with IB Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .15 Figure 2-19: Secondary MDC 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .16 Figure 2-20: Primary MDC 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .16 Figure 2-21: Administrative Block Storage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .16 Figure 2-22: FCSW2, Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .16 Figure 2-23: FCSW1, Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .16 Figure 2-24: Secondary MDC 2, FC Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .17 Figure 2-25: Primary MDC 1, FC Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .17 Figure 2-26: Secondary DM 2, FC Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .17 Figure 2-27: Primary DM 1, FC Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .17 Figure 2-28: DCBS, FC Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .17 Figure 2-29: IBSW2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .18 Figure 2-30: IBSW2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .18 Figure 2-31: Secondary MDC 2, IB Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .18 Figure 2-32: Primary MDC 1, IB Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .18 Figure 2-33: Secondary DM 2, IB Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .18 Figure 2-34: Primary DM 1, IB Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .18 Figure 2-35: Primary DM 1, IB Data Cache . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .19 Figure 2-36: FCSW2, Tape Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .19 Figure 2-37: FCSW1, Tape Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .19 Figure 2-38: Secondary MDC 2, Tape Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .19 Figure 2-39: Primary MDC 1, Tape Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .19
HR85-8500-A
Cray Private
1
Tiered Adaptive Storage™ Hardware Guide
Figure 2-40: Tape Library . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20 Figure 2-41: ETHSW2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20 Figure 2-42: ETHSW1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 Figure 2-43: FCSW2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 Figure 2-44: FCSW1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 Figure 2-45: CIMS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 Figure 2-46: Secondary MDC 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 Figure 2-47: Primary MDC 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22 Figure 2-48: Secondary DM 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22 Figure 2-49: Secondary DM 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22 Figure 2-50: KVM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22 Figure 2-51: Administrative Block Storage (ABS) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 Figure 2-52: Data Cache Block Storage (DCBS) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 Figure 2-53: Tape Library . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 Figure 2-54: ETHSW2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 Figure 2-55: ETHSW1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 Figure 2-56: FCSW2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 Figure 2-57: FCSW1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 Figure 2-58: IBSW2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 Figure 2-59: IBSW2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 Figure 2-60: CIMS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 Figure 2-61: Secondary MDC 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 Figure 2-62: Primary MDC 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 Figure 2-63: Secondary DM 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 Figure 2-64: Primary DM 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 Figure 2-65: KVM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 Figure 2-66: Administrative Block Storage (ABS) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 Figure 2-67: Data Cache Block Storage (DCBS) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27 Figure 2-68: Tape Library . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27 Figure 2-69: TAS Architecture - CIMS to Ethernet . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28 Figure 2-70: TAS Architecture - Ethernet to MDC and DM Nodes . . . . . . . . . . . . . . . . . . . . 29 Figure 2-71: TAS Architecture - MDC/DM Node Connections . . . . . . . . . . . . . . . . . . . . . . . 29 Figure 2-72: TAS Architecture - MDC/DM Nodes to Switches, DCBS, and DAST . . . . . . . 30 Figure 3-1: TAS System Cabinet Dimensions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33 Figure 3-2: TAS Shipping Configurations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34 Figure 3-3: TAS System Floor Cutout . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38 Figure 4-1: System Board Block Diagram . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40 Figure 4-2: CIMS Front Panel Features and Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 Figure 4-3: CIMS Rear Panel Features and Indicators, . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42 Figure 4-4: CIMS Components . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43 Figure 4-5: MDC and DM Node System Block Diagram . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45 Figure 4-6: MDC and DM Front Panel Features and Indicators . . . . . . . . . . . . . . . . . . . . . . . 46 Figure 4-7: MDC and DM Rear Panel Features and Indicators . . . . . . . . . . . . . . . . . . . . . . . 47
2
Cray Private
HR85-8500-A
Contents
Figure 4-8: MDC and DM Inside the System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .48 Figure 4-9: CIMS, MDC, and DM LCD Panel Features . . . . . . . . . . . . . . . . . . . . . . . . . . . . .49 Figure 4-10: CIMS, MDC, and DM Hard Drive Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . .50 Figure 4-11: CIMS, MDC, and DM AC Power Supply Status Indicators . . . . . . . . . . . . . . . .50 Figure 4-12: CIMS, MDC, and DM DC Power Supply Status Indicators . . . . . . . . . . . . . . . .51 Figure 4-13: Rear View of an 8-port Rack Mount LCD KVM Assembly . . . . . . . . . . . . . . . .52 Figure 4-14: ABS Controller-Drive Tray - Front View . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .54 Figure 4-15: ABS Controller-Drive Tray - Rear View . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .55 Figure 4-16: ABS Right-Rear Subplate with No Host Interface Card . . . . . . . . . . . . . . . . . . .55 Figure 4-17: ABS Right-Rear Subplate with a Two-Port 12-Gb/s SAS Host Interface Card .56 Figure 4-18: ABS Host Channels on the Controller with a Two-Port SAS HIC - Rear View .56 Figure 4-19: ABS LEDs on the Left End Cap . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .57 Figure 4-20: ABS LEDs on the Drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .58 Figure 4-21: ABS LEDs on the Controller Canister Main Faceplate . . . . . . . . . . . . . . . . . . . .59 Figure 4-22: ABS LEDs on the Controller Canister Two-Port SAS Host Interface Card Subplates* . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .61 Figure 4-23: ABS LEDs on the Power-Fan Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .62 Figure 4-24: DCBS (FC) Controller-Drive Tray - Front View . . . . . . . . . . . . . . . . . . . . . . . .63 Figure 4-25: DCBS (FC) Controller-Drive Tray - Rear View . . . . . . . . . . . . . . . . . . . . . . . . .63 Figure 4-26: DCBS (FC) Host Channels on the Controller-Drive Tray - Rear View . . . . . . .64 Figure 4-27: DCBS (FC) LEDs on the Left End Cap . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .65 Figure 4-28: DCBS (FC) LEDs on the Controller Canister Main Faceplate . . . . . . . . . . . . . .66 Figure 4-29: DCBS (FC) Right-Rear Subplate with an FC Host Interface Card . . . . . . . . . . .68 Figure 4-30: DCBS (FC) LEDs on the Power Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .70 Figure 4-31: DCBS (IB) Host Channels on the Controllers with InfiniBand HIC - Rear View . 71 Figure 4-32: DCBS (IB) LEDs on the Controller Canister Main Faceplate . . . . . . . . . . . . . .72 Figure 4-33: DCBS (IB) LEDs on the Controller Canister InfiniBand Host Interface Card Subplates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .73 Figure 4-34: DCBS (IB) LEDs on the Left Bezel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .75 Figure 4-35: DCBS (IB) LEDs on the Power Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .76 Figure 4-36: DCBS (IB) LEDs on the Fan Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .77 Figure 4-37: ABS Drive Channel Ports on the Controller-Drive Tray - Rear View . . . . . . . .81 Figure 4-38: ABS Controller with Base Host Channel Ports (CH1 and CH2) . . . . . . . . . . . . .81 Figure 4-39: Drive Channel Ports on the Controller-Drive Tray . . . . . . . . . . . . . . . . . . . . . . .81 Figure 4-40: DCBS (IB) Drive Channel Ports on the Controller-Drive Tray . . . . . . . . . . . . .82 Figure 4-41: DCBS (IB) Controller with Labeled Drive Channel Ports (CH1 and CH2) . . . .82 Figure 4-42: ABS Drive Tray - Front View . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .83 Figure 4-43: ABS Drive Tray - Rear View . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .83 Figure 4-44: ABS Drive Tray with AC Power Option - Rear View . . . . . . . . . . . . . . . . . . . .83 Figure 4-45: ABS LEDs on the ESM Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .84 Figure 4-46: ABS LEDs on the AC Power-Fan Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . .85 Figure 4-47: DCBS DE6600 Drive Tray - Rear View . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .86 Figure 4-48: DCBS Maximum Capacity E-Series Array Configuration using DE6600 Shelves
HR85-8500-A
Cray Private
3
Tiered Adaptive Storage™ Hardware Guide
86 Figure 4-49: DCBS LEDs on the Left Bezel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87 Figure 4-50: DCBS LEDs on the ESM Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88 Figure 4-51: DCBS LEDs on the Power Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89 Figure 4-52: DCBS LEDs on the Fan Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90 Figure 4-53: DCBS LEDs on the Drawer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91 Figure 4-54: DCBS LEDs on the Drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92 Figure 4-55: Mellanox SX6036 Switch System Power Side Panel . . . . . . . . . . . . . . . . . . . . . 96 Figure 4-56: Mellanox SX6036 Switch System Connector Side Panel (Internally Managed) 96 Figure 4-57: Mellanox Switch Status LED . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97 Figure 4-58: Fan Status LED Connector Side . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98 Figure 4-59: Power Status LED Connector Side . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98 Figure 4-60: Power Side Panel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 Figure 4-61: PS Unit Status LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 Figure 4-62: Bad Port LED . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100 Figure 4-63: Identifier LED . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100 Figure 4-64: Port LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101 Figure 4-65: QLogic 5802V Chassis Controls and LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 Figure 4-66: QLogic 5802V Fibre Channel Ports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103 Figure 4-67: QLogic 5802V Port LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103 Figure 4-68: QLogic 5802V Ethernet Port . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 Figure 4-69: QLogic 5802V Serial Port . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105 Figure 4-70: Model 5802V Power Supplies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105 Figure 4-71: PowerConnect 5548 Ports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107 Figure 4-72: Dell PowerConnect 5548 Front LED Panel . . . . . . . . . . . . . . . . . . . . . . . . . . . 108 Figure 4-73: Giga Port LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110
4
Cray Private
HR85-8500-A
Contents
List of Tables Table 1-1: TAS Naming Conventions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .2 Table 3-2: TAS System Specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .31 Table 3-3: Access Route Requirements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .35 Table 3-4: TAS Air Quality, Temperature, and Relative Humidity Levels . . . . . . . . . . . . . . .36 Table 3-5: Electrical Service Requirements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .37 Table 4-1: CIMS Front Panel Features and Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .41 Table 4-2: CIMS Rear Panel Features and Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .42 Table 4-3: MDC and DM Front Panel Features and Indicators . . . . . . . . . . . . . . . . . . . . . . . .46 Table 4-4: MDC and DM Rear Panel Features and Indicators . . . . . . . . . . . . . . . . . . . . . . . . .47 Table 4-5: CMS, MDC, and DM Drive-Status Indicator Patterns . . . . . . . . . . . . . . . . . . . . . .50 Table 4-6: CIMS, MDC, and DM Power Indicator Pattern . . . . . . . . . . . . . . . . . . . . . . . . . . .51 Table 4-7: KVM Monitor Push-button Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .52 Table 4-8: KVM Hotkey Commands and Descriptions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .52 Table 4-9: ABS LEDs on the Left End Cap . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .57 Table 4-10: ABS LEDs on the Drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .58 Table 4-11: ABS Drive State LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .58 Table 4-12: ABS LEDs on the Controller Canister Main Faceplate . . . . . . . . . . . . . . . . . . . .59 Table 4-13: ABS LEDs on the Controller Canister Two-Port SAS Host Interface Card Subplates* . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .61 Table 4-14: ABS LEDs on the Power-Fan Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .62 Table 4-15: DCBS (FC) LEDs on the Left End Cap . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .65 Table 4-16: DCBS (FC) LEDs on the Controller Canister Main Faceplate . . . . . . . . . . . . . . .66 Table 4-17: DCBS (FC) LEDs on the Controller Canister Fibre Channel Host Interface Card Subplates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .69 Table 4-18: DCBS (FC) LEDs on the Power Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .70 Table 4-19: DCBS (IB) LEDs on the Controller Canister Main Faceplate . . . . . . . . . . . . . . .72 Table 4-20: DCBS (IB) LEDs on the Controller Canister InfiniBand Host Interface Card Subplates* . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .74 Table 4-21: DCBS (IB) LEDs on the Left Bezel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .75 Table 4-22: DCBS (IB) LEDs on the Power Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .76 Table 4-23: DCBS (IB) LEDs on the Fan Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .77 Table 4-24: ABS and DCBS LED Symbols and General Behavior . . . . . . . . . . . . . . . . . . . . .78 Table 4-25: ABS LEDs on the ESM Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .84 Table 4-26: ABS LEDs on the Power-Fan Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .85 Table 4-27: DCBS LEDs on the Left Bezel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .87 Table 4-28: DCBS LEDs on the ESM Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .88 Table 4-29: DCBS LEDs on the Power-Fan Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .89 Table 4-30: DCBS LEDs on the Fan Canister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .90 Table 4-31: DCBS LEDs on the Drive Tray Drawer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .91 Table 4-32: DCBS Drive State Represented by the LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . .91 Table 4-33: LEDs on the Drives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .92
HR85-8500-A
Cray Private
1
Tiered Adaptive Storage™ Hardware Guide
Table 4-34: DCBS Drive State Represented by the LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . 92 Table 4-35: DCBS LED Symbols and General Behavior . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93 Table 4-36: Mellanox SX6036 System Status LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96 Table 4-37: Switch Status LED Assignments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97 Table 4-38: Fan Status LED Assignments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98 Table 4-39: PS Unit Status LED Assignments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 Table 4-40: Bad Port LED Assignments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100 Table 4-41: Port Connector Physical and Logical Link Assignments for Ethernet Mode . . 101 Table 4-42: Connector Physical and Logical Link Assignments for IB Mode . . . . . . . . . . . 101 Table 4-43: QLogic 5802V Chassis Controls and LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 Table 4-44: QLogic 5802V Port LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 Table 4-45: QLogic 5802V Power Supply ED . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105 Table 4-46: QLogic 5802V Power Supply LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106 Table 4-47: Dell PowerConnect 5548 Front Panel LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . 108 Table 4-48: Giga Ports on non-POE Enabled Devices LEDs . . . . . . . . . . . . . . . . . . . . . . . . 110 Table 4-49: HDMI (Stacking) Port LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110 Table 4-50: SFP Port LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111 Table 5-51: Storage Controller Tray ID Failure Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113 Table 5-52: Controller Drive Tray Sequence Code Definitions . . . . . . . . . . . . . . . . . . . . . . 114 Table 5-53: Supported Diagnostic Lock-Down Codes on the Tray ID Indicator Display . . 115 Table 5-54: Tray ID Indicator Display Uses Cases with Repeating Sequences . . . . . . . . . . 116 Table 5-55: Supported Diagnostic Codes on the ESM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117 Table 5-56: Sequence Code Definitions for the 5400 Controller-Drive Trays . . . . . . . . . . . 118 Table 5-57: Supported Diagnostic Lock-Down Codes on the Tray ID Indicator Display . . 119 Table 5-58: Diagnostic Code Sequences for the E5400 Controller-Drive Trays . . . . . . . . . 120 Table 5-59: Supported Diagnostic Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121
2
Cray Private
HR85-8500-A
Introduction [1] About this Document Purpose and Scope This document is intended for Cray employees who need hardware overview and configuration information for Cray TAS systems. This document provides the following information: Chapter 1 - provides an high-level introduction Chapter 2 - provides an architectural overview of the system Chapter 3 - provides site planning and system specifications Chapter 4 - describes the hardware implementation of the storage system Chapter 5 - details block storage tray ID codes for the ABS and DCBS (FC and IB)
Definition of a Cray TAS System A Cray TAS system is a configuration of server, disk, tape and network hardware that runs the Versity Storage Manager (VSM) software, a Linux version of SAM-QFS that has been developed by Versity, Inc. SAM/QFS was originally developed by LSC, Sun and now Oracle for Solaris. VSM is a port of the open source version of SAM-QFS to Linux. Cray will supply first and second level support for VSM and Versity will provide third level support. Cray will supply first, second and third level support for the rest of TAS. Cray will sell and support TAS for use on XC30, XE6, CCS and other non-Cray systems running Linux.
HR85-8500-A
•
The Cray TAS system has two major functional capabilities which are:
•
A POSIX shared file system with good support for small files
•
An archive capability that manages data between disk and/or tape tiers via a policy engine capable of managing up to 4 copies of files
Cray Private
1
Tiered Adaptive Storage™ Hardware Guide
TAS has at its core: •
A metadata management system (MMS)
•
A Gigabit Ethernet (GigE) or 10 Gigabit Ethernet (10GbE) metadata network VLAN
•
A GigE management network (which is usually a separate VLAN on the metadata network)
•
An 8Gb/s Fiber Channel(FC8) or IB QDR storage network (change to FC16 and IB FDR in late 2014)
•
An optional FC8 archive network (change to FC16 in late 2014)
•
Connectivity to a public network
•
A fast data disk cache
•
An optional capability to support one or more disk or tape archive tiers
•
A policy engine that controls movement of data between storage tiers
Cray will supply and support TAS with redundant (default) and non-redundant networks and will support connectivity to a customer's incumbent networks and storage provided they follow the TAS rule conventions documented in the configuration rules
Product Naming Conventions Product naming conventions for standard configurations are based on customer requirements. In this guide, the following terminology refers to the standard configuration. Table 1-1: TAS Naming Conventions
2
TAS Naming
TAS Abbreviation Naming
Tiered Adaptive Storage system
TAS system
Cray Integrated Management System
CIMS
Dell 620
Metadata Controller
MDC
Dell 720
Data mover node
DM
Dell 720
Data Archive Storage Tape
DAST
Tape library
Data Archive Storage Disk
DASD (dsk_archive0 -dsk_archiveX)
Any storage sold (Cray or customer owned)
Data Cache Block Storage
DCBS (dc0000 -- dc000x)
NetApp 5400 or NetApp 5500
Metadata Block Storage
MDBS (md0)
NetApp 2700
Cray Private
Product
HR85-8500-A
Introduction [1]
Table 1-1: TAS Naming Conventions (continued) TAS Naming
TAS Abbreviation Naming
Product
Administrative Block Storage
ABS (adm0 FS = tas_admin)
NetApp 2700
File System Storage
tasfs1
NetApp 5500
Archive storage
DAST or DASD Data Archive Storage Tape Data Archive Storage Disk
Tape library Any storage sold (Cray or customer owned)
Additional Information Sources The following TAS document are available under the Data Management Platform (DMP) section of CrayDoc: •
Installing Cray Tiered Adaptive Storage (TAS) Software, S-2539-10
•
Tiered Adaptive Storage Software Administrator’s Guide, S-2540-10
All current Dell, NetApp, SpectraLogic, and Versity documents, along with information on virtual box server, lustre HSM connector, and TAS administrative information is located on the TAS wiki.
HR85-8500-A
Cray Private
3
Tiered Adaptive Storage™ Hardware Guide
Safety Precautions and Hazards Failure to observe the warnings and cautions throughout this document could result in death or serious injury.
Warning: All procedures in this document must be performed by Cray trained service providers or authorized site personnel. Failure to observe this warning could result in death or serious injury.
Warning: Remove watches and jewelry when installing this equipment. Keep fingers and conductive tools away from high-voltage areas as well as the highcurrent areas on the system. High-voltage areas are indicated with high-voltage warning labels.
Warning: Wait for the system to power down completely before you touch any components associated with the high-voltage circuits. Failure to do so will result in death or serious injury.
Warning: Invisible laser radiation ma be emitted from the end of the unterminated fiber cable or connector. Do not view directly with optical instruments. Viewing the laser output with certain optical instruments (for example, eye loupes, manifiers, and microscopes) within a distance of 100 mm may pose an eye hazard.
Caution: Remove all tools from the system cabinets and replace all covers and panels removed from the system during servicing. Failure to do so could result in serious injury or equipment damage.
Caution: Halt the operation system before removing power from the system. Failure to do so can result customer data loss or corruption.
4
Cray Private
HR85-8500-A
Introduction [1]
Caution: Observe electrostatic discharge (ESD) precautions during the entire installation process. Required apparel includes an ESD smock and an ESD wrist strap. Failure to do so can result in damage to the equipment.
Caution: Other recommended safety apparel are gloves and certified reinforced-toe safety shoes.
Caution: Some procedures require the assistance of qualified personnel wearing proper safety apparel.
HR85-8500-A
Cray Private
5
Tiered Adaptive Storage™ Hardware Guide
6
Cray Private
HR85-8500-A
System Architecture [2] Software Architecture For more information on the software architecture, refer to the Cray Tiered Adaptive Storage (TAS) Administrator’s Guide. S-2540-10, available from CrayDoc.
Hardware Configurations TAS Gateway System The TAS gateway system is the minimum supported hardware configuration. It comprises of the Cray Integrated Management Services (CIMS) node, metadata controller (MDC) nodes, management Ethernet switch, and administrative block storage (ABS). Figure 2-1 shows the TAS Gateway minimum hardware configuration, and Figure 2-2 through Figure 2-7 show the TAS Gateway rear view and cabling.
HR85-8500-A
Cray Private
7
Tiered Adaptive Storage™ Hardware Guide
Figure 2-1: TAS Gateway Minimum Hardware Configuration Cray Integrated Maintence Sevices (CIMS) node(s) eth0
eth2
eth1
DRAC ipmi0 site-admin-net
site-admin-net
ipmi-net
esmaint-net
Bright
metadata-net VLAN 4
site-admin-net VLAN 3
ipmi-net VLAN 2
esmaint-net VLAN 1
Ethernet switch Customer network customer-net
10.141 10.148
10GbE
metadata-net site-admin-net ipmi-net esmaint-net 10.50
failover-net
eth3 eth0
DRAC
eth1 eth2 eth5 eth4 10GbE Primary MDC Node
eth3 eth0 DRAC eth1 eth2 5eth5 hte eth4 10GbE Secondary MDC Node Port B
Port A
Port A
Port B SAS
SAS Cntrl B
Cntrl A
Disks or SSDs
Administrative Storage (MABs)
Figure 2-2: TAS Gateway (Front View)
Ethernet Switch CIMS
EST
EST
MDCs EST
ABS
8
Cray Private
8
16
9
17
HR85-8500-A
System Architecture [2]
Figure 2-3: ETHSW1, TAS Gateway ETHSW1 VLAN1
VLAN2
esmaint-net 1
3
5
7
9
11
13
VLAN3
ipmi-net
15
17
19
21
23
25
27
29
VLAN4
site-admin-net metadata-net 31
33
35
37
39
41
43
45
LNK /ACT
FDX
DIAG
RPS
FAN
TEMP
PWR
47 /HDX COMBO PORTS
RESET LNK/ACT
2
4
6
8
10
12
14
16
18
20
22
24
26
28
30
32
34
36
38
40
42
44
46
LNK/ACT
LNK/ACT
LNK/ACT
48
Figure 2-4: CIMS, TAS Gateway CIMS 1 1
750 W
2 1
2
3
750 W
4
{
iDRAC
eth3 failover-net to second CIMS node eth2 to VLAN 2 - ipmi-net
To KVM
eth1 to VLAN 3 - site-admin-net iDRAC to VLAN 2 - ipmi-net
eth0 to VLAN 1 - esmaint-net
Figure 2-5: Secondary MDC 2, TAS Gateway Slot 1, port 2 to ABS Cntr B Port 2
10GbE to Customer Network
{
Slot 1, port 1 to ABS Cntr A Port 2
4
1
ST
Slot 2, ports 1 and 2 to customer FC network Slot 3, ports 1 and 2 to customer IB network
Slot 7, ports 1 and 2 to customer FC network
Secondary MDC 2
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
eth3 failover-net to MDC1
iDRAC to VLAN 2 - ipmi-net
eth2 to VLAN 4 - metadata-net eth1 to VLAN 3 - site-admin-net eth0 to VLAN 1 - esmaint-net
Figure 2-6: Primary MDC 1, TAS Gateway Slot 1, port 2 to ABS Cntr B Port 1
10GbE to Customer Network
{
Slot 1, port 1 to ABS Cntr A Port 1
4
1
ST
Slot 2, ports 1 and 2 to customer FC network Slot 3, ports 1 and 2 to customer IB network
Slot 7, ports 1 and 2 to customer FC network
Primary MDC 1
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
eth3 failover-net to MDC2
iDRAC to VLAN 2 - ipmi-net
eth2 to VLAN 4 - metadata-net eth1 to VLAN 3 - site-admin-net eth0 to VLAN 1 - esmaint-net
Figure 2-7: Administrative Block Storage (ABS), TAS Gateway Administrative Block Storage Ch 1 Lnk
Ch 2
Lnk
Lnk
Port 1
Port 2
Lnk Lnk Port 1
Port 2
Lnk
Lnk
Ch 3
Ch 4
Lnk Lnk
Ch 5
Ch 6
Lnk
Lnk Lnk
Port 1
Lnk
Port 2
Lnk
Lnk
Lnk
AC
DC
Cntrl A Ethernet Port 1 to esmaint-net
Lnk
Ch 3
Ch 4
Lnk Lnk
Ch 5
Ch 6
Port 1
Port 2
Lnk Lnk
Lnk
Drive Expansion
ID/ Diag Lnk
SAS Cntrl A Ch2 to MDC2 SAS Cntrl A Ch1 to MDC1
HR85-8500-A
Ch 2
Lnk Lnk
Drive Expansion Lnk
DC
Ch 1
Lnk Lnk
Lnk
ID/ Diag
Lnk Lnk
Lnk
AC
SAS Cntrl B Ch2 to MDC2 SAS Cntrl B Ch1 to MDC1 Cntrl B Ethernet Port 1 to esmaint-net
Cray Private
9
Tiered Adaptive Storage™ Hardware Guide
TAS Connector The TAS Connector contains three software services, which are required for proper operation of the system. The three services are RobinHood, Migration Manager , and Migration Agent. RobinHood and Migration Manager are co-located on a single OS node called the Manager Nodes. There are two manager nodes in the HA failover architecture; the manager nodes run on two Dell 720’s. The two manager nodes connect to the NetApp E2700, otherwise known as the administrative block storage (ABS). The NetApp E2700 has 4 600GB disk drives, RAID 10, owned by controller B. RobinHood and Migration Manager nodes have no FC capability. The nodes can only connect to the TAS HSM by InfiniBand or SAS. Refer to Figure 2-8 through Figure 2-12 for the TAS Connector Manager (TCM) Lustre PCIe slots and administrative block storage (ABS) connections. The final service, the Migration Agent, exists on a DM node in the TAS HSM. The Migration Agent connects the TAS HSM to the Lustre client through an InfiniBand connection. If a Migration Agent fails, the Migration Manager will restart the work on another agent. TAS Connector 1.0 HA failover is manually initiated. All services on RobinHood and the Migration Manager must be shutdown in case a user initiates a failover. If a system failure occurs, a secondary node must be brought up so data stores can be checked. Figure 2-8: TAS Connector Manager Node with IB Lustre TCM Node with IB Lustre 1
4
SAS ST
2
IB Disk Lustre
5
6 7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
Figure 2-9: TAS Connector Agent on a TAS DM with IB Lustre and IB TAS TAS Connector Agent on a TAS DM with IB Lustre and IB TAS 4
1
ST
2
IB Disk Lustre
5
6 7 2
1 3
IB Disk TAS HSM 750 W
1
2
3
750 W
4
iDRAC
Figure 2-10: TAS Connector Manager SAS Connections, Secondary TCM2 Port 2 to ABS Cntr B Port 2 Port 1 to ABS Cntr A Port 2
Secondary TCM2 4
1
ST
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
10
Cray Private
HR85-8500-A
System Architecture [2]
Figure 2-11: TAS Connector Manager SAS Connections, Primary TCM 1 Port 2 to ABS Cntr B Port 1
Primary TCM1
Port 1 to ABS Cntr A Port 1
4
1
ST
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
Figure 2-12: TAS Connector Manager SAS Connections, Administrative Block Storage Administrative Block Storage Ch 1 Lnk
Ch 2
Lnk
Lnk
Port 1
Port 2
Lnk Lnk Port 1
Port 2
Lnk
Lnk
Ch 3
Ch 4
Lnk Lnk
Ch 5
Ch 6
Lnk
Lnk Lnk
Lnk
Lnk
Lnk
Port 2
Lnk
Lnk
Ch 3
Ch 4
Lnk Lnk
Ch 5
Ch 6
Port 2
DC
Lnk Lnk
Lnk
Drive Expansion
ID/ Diag Lnk
AC
Cntrl A Ethernet Port 1 to esmaint-net
Port 1
Lnk Lnk Port 1
SAS Cntrl A Ch2 to TCM2 SAS Cntrl A Ch1 to TCM1
HR85-8500-A
Ch 2
Lnk
Drive Expansion Lnk
DC
Ch 1
Lnk
Lnk
ID/ Diag
Lnk Lnk
Lnk
AC
SAS Cntrl B Ch2 to TCM2 SAS Cntrl B Ch1 to TCM1 Cntrl B Ethernet Port 1 to esmaint-net
Cray Private
11
Tiered Adaptive Storage™ Hardware Guide
Full TAS Configuration Cray Tiered Adaptive Storage (TAS) systems are flexible in hardware configurations and architecture. A full TAS configuration includes all disks, MDC’s, and DM’s. A full TAS system could include a CIMS, two metadata controllers (MDC’s) and data mover nodes (DM’s), management switches (Ethernet, FC, and IB), administrative block storage (ABS), and data cache block storage (DCBS). A tape library is optional because the customer can provide their own. Figure 2-13 and Figure 2-14 show example front and review views of a full TAS system. These examples include a tape library. The tape library is in the TAS cabinet for this example, but the tape library can be located in another cabinet. Product naming conventions can be found in “Product Naming Conventions” on page 2.
12
Cray Private
HR85-8500-A
System Architecture [2]
Figure 2-13: Full TAS System (Front View) 42
42
41
LOCATOR
DC IN
RPS 12V,15A
LOCATOR
DC IN
RPS 12V,15A
41
42
Zoned Ethernet switches
42 100V-240VAC, 1.6A
40
40
41
41 100V-240VAC, 1.6A
39
39
38
38
40
40
FC switches 39
39
12C !
37
IB switches
! !
37 38
12C !
36
37
CIMS
!
Mellanox
38
!
Mellanox
! !
36
35
35
36
EST
34
34
33
33
32
32
31
31
35
37
36
35
34
34
MDCs 33
32
0
30
1
2
3
4
5
6
7
8
9
10
11
12
13
14
30
29
29
30
DMs
0
28
1
2
3
4
5
6
7
8
9
10
11
12
13
14
28
27
27
26
26
25
25
24
24
28
27
26
KVM
25
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
MABS
22
23
23
22
22
23
21
6
9 12
2
5
8 11
1
2
3
4
5
6
1
3
4
7
6
9 12
2
5
8 11
1
2
3
4
5
6
1
4
7
22
10
21 7
8
9
10 11 12
7
8
9
10 11 12
7
8
9
10 11 12
30
29
28
27
26
25
24
23
24
31
15
EST
29
32
15
EST
31
33
24
23
22
Front
3
20
21
DCBS
10
20
Front
3
1
2
3
4
5
6
6
2
5
1
4
3
1
2
3
4
5
6
21
9 12
8 11
7
10
Front
19
19
20
20
9 12
2
5
6
1
4
6
9 12
2
5
8 11
6
1
4
8 11
7
7
10
7
8
9
10 11 12
Front
18
3
19
1
2
3
4
5
10
18 7
8
9
19
10 11 12
Front
17
17
16
16
15
15
1514
14
13
13
12
12
1211
11
10
10
9
9
98
8
9
87
7
8
6
6
65
5
6
54
4
5
3
3
4
32
2
3
21
1
2
18
17
16
14
13
Tape
11
10
7
4
1
18
17
16
15
14
13
12
11
10
7
1
220VAC PDU North America: NEMA L6-30R International: IEC 309 32A 2P+E
HR85-8500-A
Cray Private
13
Tiered Adaptive Storage™ Hardware Guide
Figure 2-14: Full TAS System (Rear View) 42
42
3
5
7
9
11
13
15
17
19
21
23
25
27
29
31
33
35
37
39
41
43
45
tus Sta R PW
1
LNK 47 ACT
41
Reset
42
2
4
6
8
10
12
14
16
18
20
22
24
26
28
30
32
34
36
38
40
42
44
46
48
1
3
5
7
9
11
13
15
17
19
21
23
25
27
29
31
33
35
37
39
41
43
45
LNK 47 ACT
ACT
1
LNK
ACT
Stack No. LNK
2
Stacking HDMI
1
40
Reset
41
LNK
2
4
6
8
10
12
14
16
18
20
22
24
26
2-3>
0-1>
28
30
32
34
4-5>
36
38
40
42
44
46
48
8-9>
6-7>
ACT
1
LNK
Stacking HDMI
ACT
Stack No. LNK
2
20
0
41
M RPS Fan
42
ACT
SFP+
2
tus Sta R PW
Zoned Ethernet switches
LNK
1
40
M RPS Fan
41
ACT
SFP+
2
22
39
39
40
FC switches
SANbox 5800 10-11>
12-13>
14-15>
16-17>
18-19>
0-1>
2-3>
4-5>
6-7>
8-9>
0
21
23
20
22
40
38
38
39
39 SANbox 5800
21 10-11>
12-13>
23
18-19>
16-17>
14-15>
SX6036
MGT 1
37 38
3
2
4
5
7
6
8
9
10
11
13
12
15
14
17
16
19
18
21
20
23
22
25
24
26
27
29
28
31
30
33
32
34
35
CONSOLE
36
37
PS1
38
PS2
IB switches
UID RST
SX6036
MGT 1
36
37
3
2
4
5
7
6
8
9
10
11
13
12
15
14
17
16
19
18
21
20
23
22
25
24
26
27
29
28
31
30
33
32
34
35
CONSOLE
36
36
PS1 PS2
37
UID RST
CIMS
35
35
36
Gb 1
Gb 2
Gb 3
36
Gb 4
34
34
33
33
32
32
31
31
35
35
34
34
MDCs 33
32
0
30
1
2
3
4
5
6
7
8
9
10
11
12
13
14
30
29
29
30
DMs
0
28
1
2
3
4
5
6
7
8
9
10
11
12
13
14
28
27
27
26
26
25
25
28
27
26
100-240V~, 0.5A, 50/60 Hz
KVM
24
25
1
3
5
7
2
4
6
8
24
LAN
23
Lnk
24
Ch 1
Ch 2
Lnk
Lnk
Port 1
Port 2
Lnk
MABS
Lnk
22
Port 1
Lnk
Port 2
Lnk
Ch 3
Ch 4
Lnk
23
29
28
27
26
25
24
Lnk
Drive Expansion Lnk
30
Lnk
ID/ Diag
23
31
15
EST
29
32
15
EST
31
33
22
Lnk
23
2
21
21
1
DC
AC
PSU A
2
3
4
PSU B
1
5
2
1
6
6
9 12
5
8
11
4
7
10
22
PSU A
7
8
9
PSU B
3
22
10 11 12
Front
Lnk
Lnk
Lnk
Lnk
3
2
ESM B
2
3
4
5
Ch 1
6
Ch 2
1
2
3
Port 2
4
Lnk
5
Lnk
Lnk
Lnk
2
1
6 Ch 1
9 12
8
11
7
10 IB Host Card
Front
3
Port 1
1
6
5
4
Act
Lnk 7
Act
Lnk
ESM A
8 Act9
8
7
10
7 Lnk
IB Host Card
8
9
19 ESM B
ESM A
3
4
5
3
6
9 12
2
5
8
11
7
10
1
6
4
2
7
8
9
AC
PSU B
18
PSU A
PSU A 3
2
2
20
10 11 12
Front
1
DC
PSU B
19
1
21
10 11 12
Act
ESM A
2
20
11
CH 2
1
18
10 11 12
9 12
Front
Ch 2
ID/ Diag
ESM B
Lnk
CH 1
19
20
6
5
4
CH 2
ID/ Diag
CH 1
DCBS
Port 2
ESM A
1
ESM B
Port 1
20
21
3
4
5
6
1
6
9 12
5
8
11
4
7
10
7
8
9
19
10 11 12
Front
17
17
16
16
15
15
18
17
16
Fibre B
14
Drive Pwr
Fibre A
Status
Drive Pwr
Status
Fibre A
13
Fibre B
11
Drive Pwr
Fibre A
Status
Drive Pwr
Status
Fibre A
10
11
Fibre B
Drive Pwr
Fibre A
Status
Drive Pwr
Status
Fibre A
14
13
12
11
10
8
9
7
8
6
6
Fibre B
7
Fibre B
Drive Pwr
Fibre A
Status
Status
Fibre A
Drive Pwr
65
54
15
Fibre B
98
7
10
9
9
87
16
Fibre B
1211
10
13
12
12
13
Tape
17
Fibre B
14
15
14
18
5
6
4
5
3
3
32
2
3
21
1
2
4
1
4
1
220VAC PDU North America: NEMA L6-30R International: IEC 309 32A 2P+E
14
Cray Private
HR85-8500-A
System Architecture [2]
Full TAS Basic Configuration Cabling The following figures show the full TAS configuration cabling based on the full TAS configuration examples. The cabling will vary with every TAS system due to varying configurations. For any TAS configuration, if there are more than 12 tape drives, add FCSW3 and FCSW4. If there are more than 2 data mover (DM) nodes, add FCSW3 and FCSW4.
MDC PCIe Slot Layout Figure 2-15: MDC with FC Data Cache MDC with FC Data Cache 1
SAS MDT
1
FC Tape
1
3
FC Disk
1
ST
2
4 2 2
0
10GigE - Customer Net
1
2
FC Disk
1
2
FC Tape
1
5
6 7 2
1 2 750 W
1
2
3
750 W
4
iDRAC
Figure 2-16: MDC with IB Data Cache MDC with IB Data Cache 1
3
1
SAS MDT
1
FC Tape
1
IB Disk
4
6
2
ST
2
0
2
5
1
10GigE - Customer Net
2
FC Tape
7 1
2
1 2 750 W
1
2
3
750 W
4
iDRAC
DM PCIe Slot Layout Figure 2-17: DM with FC Data Cache DM with FC Data Cache 4
1
2
ST
2 3
1
FC Disk
0
1
FC Disk
5
10GigE - Customer Net
6 1
7 2
1 2 750 W
1
2
3
750 W
4
iDRAC
Figure 2-18: DM with IB Data Cache DM with IB Data Cache 4
1
ST
2 3
1
IB Disk
0
6
5
1
10GigE - Customer Net
7 2
1 2 750 W
1
2
3
750 W
4
iDRAC
HR85-8500-A
Cray Private
15
Tiered Adaptive Storage™ Hardware Guide
Metadata Connections Figure 2-19: Secondary MDC 2 Port 2 to ABS Cntr B Port 2 Port 1 to ABS Cntr A Port 2
Secondary MDC 2 4
1
ST
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
Figure 2-20: Primary MDC 1 Port 2 to ABS Cntr B Port 1 Port 1 to ABS Cntr A Port 1
Primary MDC 1 4
1
ST
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
Figure 2-21: Administrative Block Storage Administrative Block Storage Ch 1
Ch 2
Lnk
Lnk
Lnk
Port 1
Port 2
Lnk Lnk Port 1
Lnk
Port 2
Lnk
Ch 3
Ch 4
Lnk Lnk
Ch 5
Ch 6
DC
Lnk Lnk
Lnk
Ch 2
Lnk
Lnk
Lnk
Port 1
Port 2
Lnk Lnk
Drive Expansion Lnk
Ch 1
Lnk
Lnk
ID/ Diag
Port 1
Port 2
Lnk
Lnk
Lnk
Ch 3
Ch 4
DC
SAS Cntrl A Ch2 to MDC2 SAS Cntrl A Ch1 to MDC1
Ch 5
Ch 6
Lnk Lnk
Lnk
Drive Expansion Lnk
AC
Lnk Lnk
ID/ Diag Lnk Lnk
Lnk
AC
SAS Cntrl B Ch2 to MDC2 SAS Cntrl B Ch1 to MDC1 Cntrl B Ethernet Port 1 to esmaint-net
Cntrl A Ethernet Port 1 to esmaint-net
FC Data Cache Figure 2-22: FCSW2, Data Cache To Primary and Secondary MDC, Slot 3, Port 2 and Slot 6, Port 1
FCSW2 MDC/DM to FCSW - Data Cache
To Primary and Secondary DM Slot 3, Port 2 and Slot 6, Port 1 0-1> > 0
SANbox 5800 10-11>
12-13>
14-15>
16-17>
To DCBS Cntr A, Port 2
Data Cache to FCSW MDC/DM
To DCBS Cntr B, Port 2 20
22
21
23
18-19>
Figure 2-23: FCSW1, Data Cache To Primary and Secondary MDC, Slot 3, Port 1 and Slot 6, Port 2
FCSW1 MDC/DM to FCSW - Data Cache
To Primary and Secondary DM Slot 3, Port 1 and Slot 6, Port 2 0-1> > 0
SANbox 5800 10-11>
16
12-13>
Cray Private
14-15>
16-17>
To DCBS Cntr A, Port 1
Data Cache to FCSW MDC/DM
To DCBS Cntr B, Port 1 20
22
21
23
18-19>
HR85-8500-A
System Architecture [2]
Figure 2-24: Secondary MDC 2, FC Data Cache Port 2 - MDC/DM to FCSW1 - Data Cache Port 1 - MDC/DM to FCSW2 - Data Cache
Secondary MDC 2 4
1
ST
2
Port 2 - MDC/DM to FCSW2 - Data Cache Port 1 - MDC/DM to FCSW1 - Data Cache
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
Figure 2-25: Primary MDC 1, FC Data Cache Port 2 - MDC/DM to FCSW1 - Data Cache Port 1 - MDC/DM to FCSW2 - Data Cache
Primary MDC 1 4
1
ST
Port 2 - MDC/DM to FCSW2 - Data Cache Port 1 - MDC/DM to FCSW1 - Data Cache
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
Figure 2-26: Secondary DM 2, FC Data Cache Port 2 - MDC/DM to FCSW1 - Data Cache Port 1 - MDC/DM to FCSW2 - Data Cache
Secondary DM 2 4
1
ST
Port 2 - MDC/DM to FCSW2 - Data Cache Port 1 - MDC/DM to FCSW1 - Data Cache
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
Figure 2-27: Primary DM 1, FC Data Cache Port 2 - MDC/DM to FCSW1 - Data Cache Port 1 - MDC/DM to FCSW2 - Data Cache
Primary DM 1 4
1
ST
Port 2 - MDC/DM to FCSW2 - Data Cache Port 1 - MDC/DM to FCSW1 - Data Cache
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
Figure 2-28: DCBS, FC Data Cache Cntrl A, Port 2 - Data Cache to FCSW2 - MDC/DM Cntrl A, Port 1 - Data Cache to FCSW1 - MDC/DM
DCBS (FC) 2 1
DC
AC
PSU A
PSU A PSU B
PSU B
Port 1
Port 2
2
4
2
4
8
2
8
4
2
8
4 Lnk
8
Lnk
ESM A
ESM B
Ch 2 FCHost Ch 3
Ch 1
Drive Expansion
Ch 4
ESM B
ESM A
ID/Diag
Port 2
2
4
2
8
4 8
2
4
Ch 2 FCHost Ch 3
Ch 1
ESM B
2
8
4 8
Ch 4
ID/Diag
Lnk
Lnk
Drive Expansion ESM B ESM A
Port 1
ESM A 2 1
DC
AC
PSU B PSU A
PSU B PSU A
Cntrl B, Port 2 - Data Cache to FCSW2 - MDC/DM Cntrl B, Port 1 - Data Cache to FCSW1 - MDC/DM
HR85-8500-A
Cray Private
17
Tiered Adaptive Storage™ Hardware Guide
IB Data Cache Figure 2-29: IBSW2 To Primary DM 1, Slot 3, Port 2
To Secondary MDC 2, Slot 3, Port 2
To DCBS Cntr A, Port 1
IBSW2 SX6036
MGT 1
3
2
5
4
0-1>
0 7
6
9
8
2-3>
11
10
12
13
4-5>
15
14
17
16
19
18
21
20
6-7>
23
22
8-9>
25
24
27
26
20
29
28
30
31
33
32
34
22 35
CONSOLE
36
PS1
To Primary MDC 1, Slot 3, Port 2 To Secondary DM 2, Slot 3, Port 2 PS2
UID
RST
SANbox 5800
21 12-13>
10-11>
23
18-19>
16-17>
14-15>
To DCBS Cntrl B, Port 1
Figure 2-30: IBSW2 To Primary DM 1, Slot 3, Port 1
To Secondary MDC 2, Slot 3, Port 1
To DCBS Cntr A, Port 2
IBSW1 SX6036
MGT 1
2
3
4
5
6
0-1>
0 7
8
9
10
2-3>
11
12
13
15
14
4-5>
17
16
19
18
20
6-7>
21
22
23
24
25
8-9> 26
27
28
29
20 30
31
32
33
34
22 35
CONSOLE
36
PS1
To Primary MDC 1, Slot 3, Port 1 To Secondary DM 2, Slot 3, Port 1 PS2
UID
RST
SANbox 5800
21 12-13>
10-11>
16-17>
14-15>
23
18-19>
To DCBS Cntrl B, Port 2
Figure 2-31: Secondary MDC 2, IB Data Cache Secondary MDC 2 4
1
ST
2
Slot 3, Port 2 to IBSW2 Slot 3, Port 1 to IBSW1
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
Figure 2-32: Primary MDC 1, IB Data Cache Primary MDC 1 4
1
ST
Slot 3, Port 2 to IBSW2 Slot 3, Port 1 to IBSW1
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
Figure 2-33: Secondary DM 2, IB Data Cache Secondary DM 2 4
1
ST
Slot 3, Port 2 to IBSW2 Slot 3, Port 1 to IBSW1
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
Figure 2-34: Primary DM 1, IB Data Cache Primary DM 1 4
1
ST
Slot 3, Port 2 to IBSW2 Slot 3, Port 1 to IBSW1
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
18
Cray Private
HR85-8500-A
System Architecture [2]
Figure 2-35: Primary DM 1, IB Data Cache Cntr A, Port 1 to IBSW2 Cntr A, Port 2 to IBSW1
DCBS (IB) 2 1
DC
AC
PSU A
PSU A PSU B
PSU B
Lnk
Port 1
Port 2
Lnk
Lnk
ESM A
Lnk
Ch 1
Lnk
Ch 2
Lnk
ESM B
ID/ Diag
Ch 1
Lnk
IB Host Card
Act
Lnk
Act
Act
Lnk
Act
ESM A ESM B
Lnk
CH 2
Port 2
CH 1
Port 1
Lnk
Ch 2
Lnk
CH 2
IB Host Card
CH 1
ID/ Diag
ESM B
ESM B ESM A
ESM A 2 1
DC
PSU B
AC
PSU B PSU A
PSU A
Cntr B, Port 2 to IBSW1 Cntr B, Port 1 to IBSW2
Basic Tape Configuration The tape library cabling is determined by the site. Half the cables must go to FCSW1 and half must go to FCSW2. Figure 2-40 shows a possible tape library cabling configuration. Figure 2-36: FCSW2, Tape Configuration FCSW2 2-3>
0-1>
4-5>
8-9>
6-7>
0
SANbox 5800
20
22
21
23
10-11> >
MDC to FCSW - Tape
To Primary and Secondary MDC, Slot 2, Port 2 and Slot 7, Port 2
Tape Library to FCSW MDC/DM
Tape Library, Fibre B Port
Figure 2-37: FCSW1, Tape Configuration FCSW1 2-3>
0-1>
8-9>
6-7>
4-5>
0
SANbox 5800
20
22
21
23
10-11> >
To Primary and Secondary MDC, Slot 2, Port 1 and Slot 7, Port 1
MDC to FCSW - Tape
Tape Library to FCSW MDC/DM
Tape Library, Fibre A Port
Figure 2-38: Secondary MDC 2, Tape Configuration Port 2 - MDC to FCSW2 - Tape Port 1 - MDC to FCSW1 - Tape
Secondary MDC 2 4
1
ST
Port 2 - MDC to FCSW2 - Tape Port 1 - MDC to FCSW1 - Tape
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
Figure 2-39: Primary MDC 1, Tape Configuration Port 2 - MDC to FCSW2 - Tape Port 1 - MDC to FCSW1 - Tape
Primary MDC 1 4
1
ST
Port 2 - MDC to FCSW2 - Tape Port 1 - MDC to FCSW1 - Tape
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
HR85-8500-A
Cray Private
19
Tiered Adaptive Storage™ Hardware Guide
Figure 2-40: Tape Library Tape Library
Status
Fibre A Status
Fibre B
Fibre B
Fibre A
Fibre A Status
Status
Fibre A port to FCSW2
Status
Fibre A
Fibre A port to FCSW1
Drive Pwr
Fibre B
Fibre A port to FCSW1
Drive Pwr
Fibre B
Status
Fibre A port to FCSW2
Fibre A port to FCSW1
Drive Pwr
Fibre A
Drive Pwr
Fibre B
Fibre A Status
Fibre B
Drive Pwr
Fibre A port to FCSW2
Fibre A port to FCSW1
Drive Pwr
Fibre A
Drive Pwr
Fibre B
Fibre A Status
Fibre B
Drive Pwr
Fibre A port to FCSW2
FC Full TAS Configuration There are two types of full TAS configurations. The Fibre Channel (FC) full TAS configuration consists of two management Ethernet switches and two FC switches. The data cache block storage (DCBS) is a NetApp 5400. Figure 2-41 through Figure 2-53 show the FC full TAS configuration components and cabling. Figure 2-41 shows a second Ethernet switch; however, a second Ethernet switch is only used if the system uses more than 2 DMs.
The tape library cabling is determined by the site. Half the cables must go to FCSW1 and half must go to FCSW2. Figure 2-53 shows a possible tape library cabling configuration. Figure 2-41: ETHSW2 ETHSW2 VLAN1
VLAN2
esmaint-net 1
3
5
7
9
11
13
15
VLAN3
ipmi-net 17
19
21
23
25
27
29
VLAN4
site-admin-net metadata-net 31
33
35
37
39
41
43
45
LNK /ACT
FDX
DIAG
RPS
FAN
TEMP
PWR
47 /HDX COMBO PORTS
RESET LNK/ACT
2
20
4
6
8
10
12
14
16
18
20
22
24
Cray Private
26
28
30
32
34
36
38
40
42
44
46
LNK/ACT
LNK/ACT
LNK/ACT
48
HR85-8500-A
System Architecture [2]
Figure 2-42: ETHSW1 ETHSW1 VLAN1
VLAN2
esmaint-net 1
3
5
7
9
11
13
VLAN3
ipmi-net
15
17
19
21
23
25
27
29
VLAN4
site-admin-net metadata-net 31
33
35
37
39
41
43
45
LNK /ACT
FDX
DIAG
RPS
FAN
TEMP
PWR
47 /HDX COMBO PORTS
RESET LNK/ACT
2
4
6
8
10
12
14
16
18
20
22
24
26
28
30
32
34
36
38
40
42
44
46
LNK/ACT
LNK/ACT
LNK/ACT
48
Figure 2-43: FCSW2 FCSW2
To Primary and Secondary MDC, Slot 3, Port 2 and Slot 6, Port 2 To Primary and Secondary DM Slot 3, Port 2 and Slot 6, Port 2
To DCBS Cntr A, Port 2
Data Cache to FCSW MDC/DM
MDC/DM to FCSW - Data Cache
To DCBS Cntr B, Port 2
0-1> > 0
SANbox 5800
20
22
21
23
10-11> >
MDC to FCSW - Tape
To Primary and Secondary MDC, Slot 2, Port 2 and Slot 7, Port 2
Tape Library to FCSW MDC/DM
Tape Library, Fibre B Port
Figure 2-44: FCSW1 FCSW1
To Primary and Secondary MDC, Slot 3, Port 1 and Slot 4, Port 1 To Primary and Secondary DM Slot 3, Port 1 and Slot 4, Port 1
To DCBS Cntr A, Port 1
Data Cache to FCSW MDC/DM
MDC/DM to FCSW - Data Cache 0-1> > 0
SANbox 5800
To DCBS Cntr B, Port 1 20
22
21
23
10-11> >
MDC to FCSW - Tape
To Primary and Secondary MDC, Slot 2, Port 1 and Slot 5, Port 1
Tape Library to FCSW MDC/DM
Tape Library, Fibre A Port
Figure 2-45: CIMS CIMS 1 1
750 W
2 1
2
3
750 W
4
{
iDRAC
eth3 failover-net to second CIMS node eth2 to VLAN 2 - ipmi-net
To KVM
eth1 to VLAN 3 - site-admin-net iDRAC to VLAN 2 - ipmi-net
eth0 to VLAN 1 - esmaint-net
Figure 2-46: Secondary MDC 2 Slot 1, port 2 to ABS Cntr B Port 2 Slot 1, port 1 to ABS Cntr A Port 2
{
Slot 2, Port 1 MDC to FCSW1 - Tape
Secondary MDC 2 4
1
ST
Slot 2, Port 2 MDC to FCSW2 - Tape
Slot 6, Port 2 - MDC/DM to FCSW1 - Data Cache Slot 6, Port 1 - MDC/DM to FCSW2 - Data Cache
10GbE to Customer Network
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
iDRAC to VLAN 2 - ipmi-net
Slot 3, Port 1 - Slot 3, Port 2 MDC/DM to MDC/DM to FCSW2 - Data Cache FCSW1 - Data Cache
HR85-8500-A
eth3 failover-net to MDC0 eth2 to VLAN 4 - metadata-net eth1 to VLAN 3 - site-admin-net eth0 to VLAN 1 - esmaint-net
Slot 7, Port 1 - Slot 7, Port 2 MDC to MDC to FCSW2 - Tape FCSW1 - Tape
Cray Private
21
Tiered Adaptive Storage™ Hardware Guide
Figure 2-47: Primary MDC 1 Slot 1, port 2 to ABS Cntr B Port 1 Slot 1, port 1 to ABS Cntr A Port 1
{
Slot 2, Port 2 MDC to FCSW2 - Tape
Slot 6, Port 2 - MDC/DM to FCSW1 - Data Cache Slot 6, Port 1 - MDC/DM to FCSW2 - Data Cache
10GbE to Customer Network
Primary MDC 1 4
1
ST
2
Slot 2, Port 1 MDC to FCSW1 - Tape
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
eth3 failover-net to MDC0
iDRAC to VLAN 2 - ipmi-net
eth2 to VLAN 4 - metadata-net eth1 to VLAN 3 - site-admin-net
Slot 3, Port 1 - Slot 3, Port 2 MDC/DM to MDC/DM to FCSW2 - Data Cache FCSW1 - Data Cache
eth0 to VLAN 1 - esmaint-net
Slot 7, Port 1 - Slot 7, Port 2 MDC to MDC to FCSW2 - Tape FCSW1 - Tape
Figure 2-48: Secondary DM 2 Slot 3, Port 2 - MDC/DM to FCSW2 - Data Cache
Slot 6, Port 2 - MDC/DM to FCSW1 - Data Cache Slot 6, Port 1 - MDC/DM to FCSW2 - Data Cache
10GbE to Customer Network
{
Slot 3, Port 1 - MDC/DM to FCSW1 - Data Cache
Secondary DM 2 4
1
ST
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
eth3 failover-net to MDC0 iDRAC to VLAN 2 - ipmi-net
Port 1
Port 2
Lnk
Lnk
Lnk
Ch 1
eth2 to VLAN 4 - metadata-net Lnk
Ch 2
Lnk
Act
Lnk
Act
IB Host Card
CH 2
ID/ Diag
CH 1
eth1 to VLAN 3 - site-admin-net eth0 to VLAN 1 - esmaint-net
Figure 2-49: Secondary DM 1 Slot 3, Port 2 - MDC/DM to FCSW2 - Data Cache
Slot 6, Port 2 - MDC/DM to FCSW1 - Data Cache Slot 6, Port 1 - MDC/DM to FCSW2 - Data Cache
10GbE to Customer Network
{
Slot 3, Port 1 - MDC/DM to FCSW1 - Data Cache
Primary DM 1 4
1
ST
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
eth3 failover-net to MDC0 iDRAC to VLAN 2 - ipmi-net
eth2 to VLAN 4 - metadata-net eth1 to VLAN 3 - site-admin-net eth0 to VLAN 1 - esmaint-net
Figure 2-50: KVM KVM 100-240V~, 0.5A, 50/60 Hz
1
3
5
7
2
4
6
8
LAN
Video esmaint-net or site-admin-net
22
Cray Private
Keyboard and Mouse
Maintenance dongle to connect KVM to another server
HR85-8500-A
System Architecture [2]
Figure 2-51: Administrative Block Storage (ABS) Administrative Block Storage Ch 1 Lnk
Ch 2
Port 1
Lnk
Lnk
Port 2
Lnk Lnk Port 1
Port 2
Lnk
Lnk
Ch 3
Ch 4
Lnk Lnk
Ch 5
Ch 6
Lnk Lnk
Ch 2
Lnk
Lnk
Lnk
Lnk
Port 1
Port 2
Lnk Lnk
Drive Expansion Lnk
DC
Ch 1
Lnk
Lnk
ID/ Diag
Port 1
Lnk
Port 2
Lnk
Ch 3
Ch 4
Ch 5
Ch 6
AC
Lnk Lnk
Lnk
Drive Expansion Lnk
DC
SAS Cntrl A Ch2 to MDC2 SAS Cntrl A Ch1 to MDC1
Lnk Lnk
ID/ Diag
Lnk
Lnk Lnk
Lnk
AC
SAS Cntrl B Ch2 to MDC2 SAS Cntrl B Ch1 to MDC1 Cntrl B Ethernet Port 1 to esmaint-net
Cntrl A Ethernet Port 1 to esmaint-net
Figure 2-52: Data Cache Block Storage (DCBS) Cntrl A, Port 2 - Data Cache to FCSW2 - MDC/DM Cntrl A, Port 1 - Data Cache to FCSW1 - MDC/DM
DCBS (FC) 2 1
DC
AC
PSU A
PSU A PSU B
PSU B
Port 1
Port 2
2
4
2
4
8
2
8
4
2
8
4 Lnk
8
Lnk
ESM A
ESM B
Ch 2 FCHost Ch 3
Ch 1
Drive Expansion
Ch 4
ESM B
ESM A
ID/Diag
Port 1
Port 2
2
4
2
8
4 8
2
4 8
Ch 2 FCHost Ch 3
Ch 1
ESM B
2
4 Lnk
8
Lnk
Drive Expansion
Ch 4
ESM B
ID/Diag
ESM A
ESM A 2 1
DC
PSU B
AC
PSU B PSU A
PSU A
Cntrl B, Port 2 - Data Cache to FCSW2 - MDC/DM Cntrl B, Port 1 - Data Cache to FCSW1 - MDC/DM
Figure 2-53: Tape Library
HR85-8500-A
Status
Fibre A
Fibre B
Fibre B
Fibre A
Fibre A
Cray Private
Drive Pwr
Status
Fibre A
Status
Fibre B
Drive Pwr
Fibre B
Drive Pwr
Fibre A
Drive Pwr
Fibre B
Fibre A Status
Fibre B
Drive Pwr
Drive Pwr
Status
Fibre A
Status
Fibre A port to FCSW2
Fibre A
Status
Fibre A port to FCSW2
Fibre B
Drive Pwr
Fibre A port to FCSW2
Fibre B
Status
Fibre A port to FCSW2
Drive Pwr
Tape Library
Fibre A port to FCSW1
Fibre A port to FCSW1
Fibre A port to FCSW1
Fibre A port to FCSW1
23
Tiered Adaptive Storage™ Hardware Guide
IB Full TAS Configuration There are two types of full TAS configurations. The InfiniBand (IB) full TAS configuration consists of two management Ethernet switches, two FC switches, and two IB switches. The data cache block storage (DCBS) is a NetApp 5500. Figure 2-54 through Figure 2-67 show the FC full TAS configuration components and cabling. Figure 2-54 shows a second Ethernet switch; however, a second Ethernet switch is only used if
the system uses more than 2 DMs. The tape library cabling is determined by the site. Half the cables must go to FCSW1 and half must go to FCSW2. Figure 2-68 shows a possible tape library cabling configuration. Figure 2-54: ETHSW2 ETHSW2 VLAN1
VLAN2
esmaint-net 1
3
5
7
9
11
13
VLAN3
ipmi-net
15
17
19
21
23
25
27
VLAN4
site-admin-net metadata-net
29
31
33
35
37
39
41
43
LNK /ACT
45
FDX
DIAG
RPS
FAN
TEMP
PWR
47 /HDX COMBO PORTS
RESET LNK/ACT
2
4
6
8
10
12
14
16
18
20
22
24
26
28
30
32
34
36
38
40
42
44
46
LNK/ACT
LNK/ACT
LNK/ACT
48
Figure 2-55: ETHSW1 ETHSW1 VLAN1
VLAN2
esmaint-net 1
3
5
7
9
11
13
VLAN3
ipmi-net
15
17
19
21
23
25
27
VLAN4
site-admin-net metadata-net
29
31
33
35
37
39
41
43
45
LNK /ACT
FDX
DIAG
RPS
FAN
TEMP
PWR
47 /HDX COMBO PORTS
RESET LNK/ACT
2
4
6
8
10
12
14
16
18
20
22
24
26
28
30
32
34
36
38
40
42
44
46
LNK/ACT
LNK/ACT
LNK/ACT
48
Figure 2-56: FCSW2 FCSW2 2-3>
0-1>
8-9>
6-7>
4-5>
0
SANbox 5800
20
22
21
23
10-11> >
MDC to FCSW - Tape
To Primary and Secondary MDC, Slot 2, Port 2 and Slot 7, Port 2
Tape Library to FCSW MDC/DM
Tape Library, Fibre B Port
Figure 2-57: FCSW1 FCSW1 2-3>
0-1>
8-9>
6-7>
4-5>
0
SANbox 5800
20
22
21
23
10-11> >
MDC to FCSW - Tape
To Primary and Secondary MDC, Slot 2, Port 1 and Slot 7, Port 1
Tape Library to FCSW MDC/DM
Tape Library, Fibre A Port
Figure 2-58: IBSW2 To Primary DM 1, Slot 3, Port 2
To Secondary MDC 2, Slot 3, Port 2
To DCBS Cntr A, Port 1
IBSW2 MGT
1
2
3
4
5
6
0 7
0-1> 8
9
10
11
2-3> 12
13
14
15
4-5>
16
17
18
19
20
21
6-7> 22
23
24
25
8-9> 26
27
28
29
20 30
31
32
33
34
22 35
36
SX6036 CONSOLE
PS1
To Primary MDC 1, Slot 3, Port 2 To Secondary DM 2, Slot 3, Port 2 PS2
UID
RST
24
SANbox 5800
21 10-11>
Cray Private
12-13>
14-15>
16-17>
23
18-19>
To DCBS Cntrl B, Port 1
HR85-8500-A
System Architecture [2]
Figure 2-59: IBSW2 To Primary DM 1, Slot 3, Port 1
To Secondary MDC 2, Slot 3, Port 1
To DCBS Cntr A, Port 2
IBSW1 SX6036
MGT 1
2
3
4
5
6
0-1>
0 7
9
8
10
2-3>
11
13
12
14
4-5>
15
16
17
18
19
21
20
6-7> 22
23
24
25
8-9> 26
27
20
29
28
30
31
32
33
34
22 35
CONSOLE
36
PS1
To Primary MDC 1, Slot 3, Port 1 To Secondary DM 2, Slot 3, Port 1 PS2
UID
RST
SANbox 5800
21 12-13>
10-11>
23
18-19>
16-17>
14-15>
To DCBS Cntrl B, Port 2
Figure 2-60: CIMS CIMS 1 1
750 W
2 1
2
3
750 W
4
{
iDRAC
eth3 failover-net to second CIMS node eth2 to VLAN 2 - ipmi-net
To KVM
eth1 to VLAN 3 - site-admin-net iDRAC to VLAN 2 - ipmi-net
eth0 to VLAN 1 - esmaint-net
Figure 2-61: Secondary MDC 2 Slot 1, port 2 to ABS Cntr B Port 2
10GbE to Customer Network
{
Slot 1, port 1 to ABS Cntr A Port 2 Slot 2, port 2 to FCSW2 - Tape Slot 2, port 1 to FCSW1 - Tape
Slot 7, port 2 to FCSW1 - Tape Slot 7, port 1 to FCSW2 - Tape
Secondary MDC 2 4
1
ST
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
eth3 failover-net to MDC1
iDRAC to VLAN 2 - ipmi-net
Slot 3, Port 1 to IBSW1
eth2 to VLAN 4 - metadata-net eth1 to VLAN 3 - site-admin-net
Slot 3, Port 2 to IBSW2
eth0 to VLAN 1 - esmaint-net
Figure 2-62: Primary MDC 1 Slot 1, port 2 to ABS Cntr B Port 1
10GbE to Customer Network
{
Slot 1, port 1 to ABS Cntr A Port 1
4
1
ST
Slot 2, port 2 to FCSW2 - Tape Slot 2, port 1 to FCSW1 - Tape
Slot 7, port 2 to FCSW1 - Tape Slot 7, port 1 to FCSW2 - Tape
Secondary MDC 2
2
6
5
7 2
1 3 750 W
1
2
3
750 W
4
iDRAC
iDRAC to VLAN 2 - ipmi-net
Slot 3, Port 1 to IBSW1
HR85-8500-A
Slot 3, Port 2 to IBSW2
eth3 failover-net to MDC1 eth2 to VLAN 4 - metadata-net eth1 to VLAN 3 - site-admin-net eth0 to VLAN 1 - esmaint-net
Cray Private
25
Tiered Adaptive Storage™ Hardware Guide
Figure 2-63: Secondary DM 2 10GbE to Customer Network
{
Secondary DM 2 4
1
ST
2
6
5
7 2
1
Slot 3, Port 2 to IBSW2 Slot 3, Port 1 to IBSW1
3 750 W
1
2
3
750 W
4
iDRAC
iDRAC to VLAN 2 - ipmi-net
Port 1
Port 2
Lnk
Lnk
Ch 1
Lnk
eth2 to VLAN 4 - metadata-net Lnk
Ch 2
Lnk
Act
Lnk
Act
IB Host Card
CH 2
ID/ Diag
CH 1
eth1 to VLAN 3 - site-admin-net eth0 to VLAN 1 - esmaint-net
Figure 2-64: Primary DM 1 10GbE to Customer Network
{
Primary DM 1 4
1
ST
2
6
5
7 2
1
Slot 3, Port 2 to IBSW2 Slot 3, Port 1 to IBSW1
3 750 W
1
2
3
750 W
4
iDRAC
iDRAC to VLAN 2 - ipmi-net
eth2 to VLAN 4 - metadata-net eth1 to VLAN 3 - site-admin-net eth0 to VLAN 1 - esmaint-net
Figure 2-65: KVM KVM 100-240V~, 0.5A, 50/60 Hz
1
3
5
7
2
4
6
8
LAN
Video Keyboard and Mouse
esmaint-net or site-admin-net
Maintenance dongle to connect KVM to another server
Figure 2-66: Administrative Block Storage (ABS) Administrative Block Storage Ch 1 Lnk
Ch 2
Lnk
Lnk
Port 1
Port 2
Lnk Lnk Port 1
Port 2
Lnk
Lnk
Ch 3
Ch 4
Lnk Lnk
Ch 5
Ch 6
DC
Lnk Lnk
Lnk
AC
SAS Cntrl A Ch2 to MDC2 SAS Cntrl A Ch1 to MDC1 Cntrl A Ethernet Port 1 to esmaint-net
26
Ch 1 Lnk
Cray Private
Ch 2
Lnk
Lnk
Port 1
Port 2
Lnk Lnk
Drive Expansion Lnk
Lnk Lnk
Lnk
ID/ Diag
Port 1
Port 2
Lnk
Lnk
Ch 3
Ch 4
Lnk Lnk
Ch 5
Ch 6
Lnk
Drive Expansion Lnk
DC
Lnk
Lnk
ID/ Diag Lnk Lnk
Lnk
AC
SAS Cntrl B Ch2 to MDC2 SAS Cntrl B Ch1 to MDC1 Cntrl B Ethernet Port 1 to esmaint-net
HR85-8500-A
System Architecture [2]
Figure 2-67: Data Cache Block Storage (DCBS) Cntr A, Port 1 to IBSW2 Cntr A, Port 2 to IBSW1
DCBS (IB) 2 1
DC
AC
PSU A
PSU A PSU B
PSU B
Lnk
Port 1
Port 2
Lnk
Lnk
ESM A
Lnk
Ch 1
Lnk
Ch 2
Lnk
ESM B
ID/ Diag
Ch 1
Lnk
Act
Lnk
Act
Act
Lnk
Act
ESM A
Lnk
Ch 2
Lnk
CH 2
IB Host Card
CH 1
ID/ Diag
ESM B
IB Host Card
ESM B
Lnk
CH 2
Port 2
CH 1
Port 1
ESM B ESM A
ESM A 2 1
DC
PSU B
AC
PSU B PSU A
PSU A
Cntr B, Port 2 to IBSW1 Cntr B, Port 1 to IBSW2
Figure 2-68: Tape Library
HR85-8500-A
Status
Fibre A
Fibre B
Fibre B
Fibre A
Fibre A
Cray Private
Drive Pwr
Status
Fibre A
Status
Fibre B
Drive Pwr
Fibre B
Drive Pwr
Fibre A
Drive Pwr
Drive Pwr
Status
Fibre A
Drive Pwr
Fibre B
Status
Fibre B
Status
Fibre A port to FCSW2
Fibre A
Status
Fibre A port to FCSW2
Fibre B
Fibre A
Drive Pwr
Fibre A port to FCSW2
Fibre B
Status
Fibre A port to FCSW2
Drive Pwr
Tape Library
Fibre A port to FCSW1
Fibre A port to FCSW1
Fibre A port to FCSW1
Fibre A port to FCSW1
27
Tiered Adaptive Storage™ Hardware Guide
Hardware Architecture Cray Integrated Management Server (CIMS) A Cray Integrated Management Services (CIMS) node is the centralized management node that manages slave node power, software provisioning, reporting, and health status. The CIMS runs the Bright software and provides a centralized platform to manage the system hardware and software. Nodes that are managed by the CIMS are called slave nodes. The TAS CIMS node is a Dell PowerEdge R620 rack server. Figure 2-69: TAS Architecture - CIMS to Ethernet Cray Integrated Maintence Sevices (CIMS) node(s)
Port B
Port A
eth3
ipmi0 DRAC
eth2 eth3 NIC 1 NIC 1
CIMS node connections e eth0
eth2
eth1
DRAC ipmi0
failover-net
site-admin-net
failover-net fa
ipmi-net site-admin-net esmaint-net
Bright GUI
10.50.x.x
site-admin-net
site-admin-net
site-admin-net
ipmi-net
esmaint-net
10.149.x.x IB switch subnet mgt
eth1 eth0 NIC 1 NIC 1
metadata-net VLAN 4
site-admin-net VLAN 3
ipmi-net VLAN 2
esmaint-net VLAN 1
Ethernet switch(es) or VLANs
Metadata Controller (MDC) and Data Mover (DM) Nodes In each TAS system, the Dell R720's function as metadata controllers. These systems provide management of the VSM cache file system as well as managing the VSM policy engine. They also are responsible for the data movement between the cache file system and the tape archive media. Additional R720's function as data movers. These systems function as the ingress and egress of data into the TAS system. They may have file serving applications, HSM data mover tool, or other copy tool applications running on them. Refer to Figure 2-70 and Figure 2-71 for the MDC and DM architecture diagrams. The Ethernet labeling may differ from what is shown in the figures.
28
Cray Private
HR85-8500-A
System Architecture [2]
Figure 2-70: TAS Architecture - Ethernet to MDC and DM Nodes
metadata-net VLAN 4
site-admin-net VLAN 3
ipmi-net VLAN 2
esmaint-net VLAN 1
Ethernet switch(es) or VLANs Customer network 10GbE
metadata-net
10.141 10.148
customer-net metadata-net
site-admin-net ipmi-net esmaint-net 10.50
eth3 eth0
failover-net
DRAC
eth1 eth2 eth5 eth4 10GbE Primary VSM MDC FC8 IB/FC SAS
hte eth4 eth3 eth0 DRAC eth2 eth1 5eth5 10GbE Secondary VSM MDC Node SAS IB/FC FC8
eth0 DRAC eth1 eth2 eth4 eth5 TAS Client 10GbE (Data Mover Node) IB/FC
eth0 DRAC eth1 eth2 eth4 eth5 TAS Client 10GbE (Data Mover Node) IB/FC Port A
Port B
Port A
Port B
Port A
Port B
Port A
Port B
Port A
Port B
Port A
Port B
Port A
Port B
Port A
Port B
esmaint-net network to IB and FC switches, RAID and tape controllers, and other devices managed by Bright 10.141.0.x Server Nodes 10.141.50.x Ethernet Switches 10.141.100.x RAID controllers 10.141.150.x FC/SAS switches 10.141.200.x IB switches
Figure 2-71: TAS Architecture - MDC/DM Node Connections 1GbE ipmi0 DRAC
eth1 eth0 NIC 1 NIC 1
eth2 NIC 1
10GbE eth3 NIC 1
eth5 eth4 NIC 2 NIC 2
TAS MDC/DM node connections ipmi-net
failover-net (MDC only) metadata-net site-admin-net esmaint-net
customer-net
Administrative Block Storage (ABS) TBD Refer to Figure 2-72 for the ABS architecture diagram. The Ethernet labeling may differ from what is shown in the figure.
Data Cache Block Storage (DCBS) TBD Refer to Figure 2-72 for the DCBS architecture diagram. The Ethernet labeling may differ from what is shown in the figure.
HR85-8500-A
Cray Private
29
Tiered Adaptive Storage™ Hardware Guide
Data Archive Storage Tape (DAST) TBD Refer to Figure 2-72 for the DAST architecture diagram. The Ethernet labeling may differ from what is shown in the figure. Figure 2-72: TAS Architecture - MDC/DM Nodes to Switches, DCBS, and DAST
eth3 eth0
eth3 eth0 DRAC eth2 eth1 5eth5 hte eth4 10GbE Secondary VSM MDC Node SAS IB/FC FC8
DRAC
eth1 eth2 eth5 eth4 10GbE Primary VSM MDC IB/FC SAS FC8
Port A
Cntrl B
Cntrl A
To DCBS via IB switch
eth0 DRAC eth1 eth2 eth4 eth5 TAS Client 10GbE (Data Mover Node) FC Port B
Port A
Port B
Port A
Port B
Port A
Port B
Port A
Port B
Port B
Port A
Port A
Port B
Port A
Port B
Disks or SSDs
eth0 DRAC eth1 eth2 eth4 eth5 TAS Client 10GbE (Data Mover Node) FC
To DCBS via IB switch
Administrative Storage (MABs) 10.149 IB Switch
IB Switch
Cntrl B
Cntrl A
Cntrl B
Cntrl A
LUNs Data cache block storage (DCBS)
LUNs Data Archive Storage Tier (DAST)
FC8 Switch FC8 Switch S
Tape Archive S Storage Tier (TAST)
30
Cray Private
HR85-8500-A
Site Preparation [3] Site Planning Proper site planning and preparation is important for the successful installation of the TAS (tiered adaptive storage) system; Cray site planning personnel (
[email protected]) will assist with the site planning process. Each site has different site planning characteristics to consider. You can guarantee customer’s effective site planning by identifying their initial system configuration as well as any upgrade plans. Qualified electrical and mechanical facility engineers should be involved early in the site planning process. Before customers can begin site preparation activities, they must prepare electrical and mechanical design drawings for approval by Cray site planning personnel. The following subsections describe various considerations and requirements that the site planning and preparation process includes.
System Specifications Refer to Table 3-2 for the TAS system rack specifications and to Figure 3-1 for the dimensions and clearance requirements. Table 3-2: TAS System Specifications Characteristic
HR85-8500-A
Specification
Height
78.40 in. (1991 mm)
Width
23.60 in. (600 mm)
Depth
47.25 in. (1200 mm)
Weight (maximum)
Varies
Shipping weight (maximum)
Varies
Cray Private
31
Tiered Adaptive Storage™ Hardware Guide
Table 3-2: TAS System Specifications Access Requirements Front Rear Side Acoustical noise level
Varies
Heat dissipation to air (maximum)
Varies
Cooling requirement
Air cooled
Air flow
Intake: front Exhaust: rear
Input voltage Nominal Tolerance
32
36.00 in. (914 mm) 36.00 in. (914 mm) none
North American/Japan 200/208Vac 185-264Vac
Frequency
60 or 50 Hz
Phases
1 phase
Power requirement (maximum)
7.89 kVA (7.50 kW)
Hold-up time
8 milliseconds
Power circuit (4 required)
North American/Japan Hubbell L6-30R
Power circuit size (4 required)
30 amp at 200/208Vac 32 amp at 230Vac
Power cord length
8 ft (2.4 m)
Cray Private
International 230Vac 185-264Vac
International Hubbell 332C6W
HR85-8500-A
Site Preparation [3]
Figure 3-1: TAS System Cabinet Dimensions Plan view 11.80 (300 mm)
23.60 (600 mm)
Front view
Side view
23.60 (600 mm)
47.25 (1200 mm)
78.40 (1991 mm)
Shipping Configurations Figure shows the domestic and international TAS cabinet shipping configurations. Each TAS
cabinet is shipped on a separate shipping pallet. Cray provides an attached ramp to facilitate the removal of the cabinet from the shipping pallet. You must provide a pallet jack to move each shipping pallet to the system location. International shipments include a wooden crate.
HR85-8500-A
Cray Private
33
Tiered Adaptive Storage™ Hardware Guide
Figure 3-2: TAS Shipping Configurations
U.S. domestic shipping configuration
International shipping configuration
Site Access Requirements Before system installation, the customer site must meet certain site access requirements. This subsection explains these requirements and provides specifications. For computer system delivery, the loading dock should be approximately 46 in. to 50 in. (1168 mm to 1270 mm) high. Take special precautions when moving equipment if the loading dock or access route has an engraved floor pattern; an engraved pattern can cause vibration damage to computer equipment on casters. The equipment access route should have no slope greater than 10 degrees. If no loading dock exists or if the loading dock does not meet the requirements, the customer must provide a forklift to unload the computer equipment. Refer to Forklift Size Requirements for the Handling of Cray Inc. Equipment, Site Engineering document 10658374, for forklift requirements. If the computer room is on a different level than the loading dock and the building does not have an elevator, the customer may need a crane or other special handling equipment to lift the computer equipment to the same level as the computer room.
34
Cray Private
HR85-8500-A
Site Preparation [3]
The access route must be maintained for initial equipment delivery and for parts delivery during equipment maintenance. The entire access route from the loading dock to the computer room must meet the requirements in Table 3-3. Table 3-3: Access Route Requirements Shipping Specifications Cabinet
Height
Width
Depth
Weight
Tiered Adaptive Storage (TAS)
78.40 in. (1991 mm)
23.60 in. (600 mm)
47.24 in. (1200 mm)
Varies
Computer Room Environment Cray designs computer equipment to operate within specific ranges of air quality, temperature, and relative humidity levels. Significant variations of these levels in a computer room environment can cause disruptions in equipment operation and decrease the life of the equipment. The TAS system must operate in a controlled computer room environment. The following requirements refer to the entire computer room, as they also affect air-cooled devices such as disk drives and servers. Therefore, the design and layout of the environmental control equipment (such as computer room air-conditioning units) must ensure that inlet air to the aircooled device meets these environmental requirements.
HR85-8500-A
Cray Private
35
Tiered Adaptive Storage™ Hardware Guide
The computer system requires a computer room environment that is monitored and remains within the parameters in Table 3-4. Table 3-4: TAS Air Quality, Temperature, and Relative Humidity Levels Temperature (Ambient Air) Operating1 Nonoperating
60°F to 80°F (16°C to 27°C) 32°F to 140°F (0°C to 60°C)
Temperature (Underfloor Air) Range
60°F to 80°F (16°C to 27°C)
Relative Humidity (Ambient Air) Operating* Nonoperating
30% to 65% noncondensing 5% to 80% noncondensing
Relative Humidity (Underfloor Air) Operating
30% to 90% noncondensing
Altitude Operating Nonoperating
0 to 10,000 ft (0 to 3,048 m) 0 to 40,000 ft (0 to 12,192 m)
1. Cray recommends an operating temperature range of 70°F to 77°F (21°C to 25°C) and a relative humidity of 30% to 50% noncondensing.
36
Cray Private
HR85-8500-A
Site Preparation [3]
Electrical Requirements Cray makes every effort to minimize the effects of power failures and interruptions to the hardware. However, if the computer system experiences repeated power interruptions and fluctuations, it may also experience a higher component failure rate than with a stable power source. If your input power is unstable, Cray encourages you to install a reliable power source, such as a UPS, to reduce the possibility of component failures. The TAS system requires one of the following types of power: North America/Japan: • 200/208Vac, 1 phase, 50 or 60Hz International: • 230Vac, 1 phase, 50 or 60Hz Table 3-5 provides additional electrical service requirements. Table 3-5: Electrical Service Requirements Electrical Service
Requirement
Phase imbalance
5% maximum (line-to-line, line-to-line neutral)
Voltage harmonics
5% maximum total, 3% largest
Voltage deviation from sine wave
5% to -10%
Voltage modulation
3% maximum
Transient voltage surges
+5%
Transient voltage sags
-5%
Frequency tolerance
5%
Frequency rate of change
Less than 1.0 Hz during any 10-cycle period
Total kilowatt power requirements depend on system configuration and equipment upgrade plans. Cray will provide documentation that customers can use to estimate the power requirements for their specific system configuration by contacting Site Engineering at
[email protected].
Equipment Grounding All Cray computer equipment requires a protective power safety-ground system. The power safety-ground system protects personnel from shock hazards and protects the computer equipment from damage caused by electrical malfunctions. Local and national electrical codes regulate the power safety-ground system.
HR85-8500-A
Cray Private
37
Tiered Adaptive Storage™ Hardware Guide
All Cray computer equipment also requires a signal reference grid. The signal reference grid establishes an equipotential reference plane for high-frequency digital signals between interconnected computer equipment. Cray supplies braided ground straps with all equipment. Customers are responsible for connecting the ground straps to the signal reference grid. Customers can obtain Equipment Grounding for Cray Inc. Computer Systems, document 10658002, by contacting Site Engineering at
[email protected]. This document describes the grounding system requirements and identifies alternative methods for providing the signal reference grid. In addition, the document describes electrostatic discharge (ESD) precautions and maintenance of the facility’s grounding systems. Customers must provide, install, and maintain the approved grounding systems as described in the equipment grounding document and in this subsection.
Floor Requirements Refer to Figure 3-3 for the location of the floor cutouts for the TAS system. Each TAS system requires a floor cutout for the entry of data and power cables. Cray recommends that you place a perforated floor panel in front of each RAID cabinet to provide additional cooling air. Figure 3-3: TAS System Floor Cutout
38
Cray Private
HR85-8500-A
Hardware Implementation [4] Rack and PDUs TAS System Rack The TAS system is stored in an APC 42U rack. More information on the rack can be found in Chapter 3, "Site Preparation," on page 31.
Power Distribution Units (PDUs) Four 220VAC power distribution units (PDUs) are mounted to the rear of the rack. Each PDU can be supplied with power from a different source for power redundancy. A circuit breaker at the bottom of each PDU applies power to the components in the rack.
Cray Integrated Management Server (CIMS) Marketing Code: XC-TAS01-CIMS-01 A Cray Integrated Management Services (CIMS) node is the centralized management node that manages slave node power, software provisioning, reporting, and health status. The CIMS runs the Bright software and provides a centralized platform to manage the system hardware and software. Nodes that are managed by the CIMS are called slave nodes. The TAS CIMS node is a Dell PowerEdge R620 rack server.
Specifications The Cray TAS CIMS node includes a 4 hard drive bay.
HR85-8500-A
•
Dell R620 - 2.5in chassis with up to 8 hard drives
•
Two Intel Xeon E5-2609v2 2.5 GHz, 10 MB cache, 6.4 GT/s QPI, No Turbo, 4-core, 80 Watts, 1333 MHz max memory
•
Eight 8-GB RDIMM, 1333 MHz RDIMMs (64 GB total memory)
•
One PERC H310 RAID controller
Cray Private
39
Tiered Adaptive Storage™ Hardware Guide
•
Four 1-TB 7.2K RPM SATA 3 Gbps 2.5" hard drives
•
RAID 5 3+1
•
One DVD ROM SATA drive
•
ReadyRails sliding rails without cable management arm
•
Two 495 Watt (1+1) power supplies
•
iDRAC 7 Enterprise
Figure 4-1: System Board Block Diagram
40
Cray Private
HR85-8500-A
Hardware Implementation [4]
CIMS Controls and Indicators Figure 4-2: CIMS Front Panel Features and Indicators USB connector (2) System identification button NMI button
vFlash media card slot LCD panel
Optical drive (optional)
LCD menu buttons
Information tag Video connector
Power-on Indicator, power button
Hard drives (8)
Table 4-1: CIMS Front Panel Features and Indicators Control/Indicator
Description
Power-on indicator
Lights when the system power is on
NMI button
Used to troubleshoot software and device driver errors when running certain operating systems
System identification button
Used to locate a particular system within a rack
USB connector (2) Optical drive (optional)
SATA DVD-ROM drive or DVD+/-RW drive
vFlash media card slot LCD menu buttons
Used to navigate the LCD menu
LCD panel
Displays system ID, status information, and system error messages. The LCD lights blue during normal system operation
Information tag
Records the service tag, NIC, and MAC address
VGA video connector Hard drives (8)
HR85-8500-A
Supports up to four 2.5 hot-swappable hard drives
Cray Private
41
Tiered Adaptive Storage™ Hardware Guide
Figure 4-3: CIMS Rear Panel Features and Indicators,
System identification button
System identification connector
iDRAC7 enterprise port
Serial connector
Ethernet connectors (4)
VGA video connector PCIe expansion card slot (riser 2)
PCIe expansion card slot (riser 3)
Power supply (PSU1)
Power supply (PSU2)
USB connectors (2)
Table 4-2: CIMS Rear Panel Features and Indicators Control/Indicator System identification button
Description Used to locate a particular system within a rack
System identification connector iDRAC7 enterprise port Serial connector PCIe expansion card slot (riser 2) VGA video connector USB connectors (2) Ethernet connectors (4)
Four Ethernet or NIC connectors
PCIe expansion card slot (riser 3)
42
Power supply (PSU1)
Learn what AC/DC power we use
Power supply (PSU2)
Learn what AC/DC power we use
Cray Private
HR85-8500-A
Hardware Implementation [4]
Figure 4-4: CIMS Components
CIMS IO Configuration Network interface controller 1 (NIC), which is built into the motherboard, provides Ethernet connections for the TAS networks. CIMS nodes are configured with the following networks. The Ethernet labeling may differ from what is stated: •
esmaint-net Private network (eth0) for system administration and maintenance. This network is used to provision slave nodes with software images and is used by Bright Cluster Management software.
•
ipmi-net Private maintenance network (eth2) for power control and monitoring.
•
site-admin-net System administration network (eth1 and impi0).
•
HR85-8500-A
The iDRAC port (management controller) is used for system administration and remote power control.
Cray Private
43
Tiered Adaptive Storage™ Hardware Guide
Metadata Controller (MDC) and Data Mover (DM) Nodes Marketing code: XC-TAS01-CMCS-01 The metadata controller (MDC) and data mover (DM) nodes are built from generic Lustre File System by Cray (CLFS) nodes. The metadata controller nodes are redundant and are configured for high-availability. DM nodes move data from customer file systems to the TAS system.
Specifications The MDC and DM nodes contain the following hardware components. Figure 4-5 shows the system board block diagram for the nodes:
44
•
Two (2) Intel Xeon E5-2650v2 2.6 GHz, 20 MB cache, 8.0 GT/s QPI, Turbo, HT, 8-core, 95 Watts, 1866 MHz max memory
•
MDC: 16 8-GB RDIMM, 1866 MHz RDIMMs (128 GB total memory)
•
DM: 8 8-GB RDIMM, 1866 MHz RDIMMs (64 GB total memory)
•
One (1) PERC H710 RAID controller 512 GB cache
•
Two (2) 600-GB 10K RPM SAS 6 Gbps 2.5" hard drives
•
RAID 1
•
One (1) DVD ROM SATA drive
•
ReadyRails sliding rails without cable management arm
•
Two (2) 750 Watt (1+1) power supplies
•
iDRAC 7 Enterprise
Cray Private
HR85-8500-A
Hardware Implementation [4]
Figure 4-5: MDC and DM Node System Block Diagram
USB2.0 Key
RJ45
DELL Se lect Networ k Adapte r Front Video Rear Video
Two Rear USB2.0 Ports
USB2.0
IDSDM
One Front USB2.0 Ports
Serial Port
To Front Panel 2x30
DIMM3
DIMM3
DIMM3
DIMM3
DIMM3
DIMM3
DIMM3
DIMM3
DIMM2
DIMM2
DIMM2
DIMM2
DIMM2
DIMM2
DIMM2
DIMM2
DIMM1
DIMM1
DIMM1
DIMM1
DIMM1
DIMM1
DIMM1
DIMM1
DDR3
DDR3
DDR3
DDR3
DDR3
DDR3
DDR3
DDR3
CH2
CH3
CH0
CH1
CH2
CH3
TPM
Ie
Ge n2
QPI0
SAS A/B (To Backplane)
(Port 2C, 2D) B) x8 ,1 1A ort (P
x8
x2
, 2A ort (P
2B
x8
DELL Select Network Adapter Slot
Center Riser (Slot 5)
)
) B) 3D ,3 C, 3A rt 3 ort (P (Po
x8
x8
QPI 1.1
QPI0
QPI 1.1
QPI1
CPU2 rt 1 (Po
A,
) 1B
Storage Connector
X16 Left Riser (Slot 6)
ort
, 2A
2B
) (P
x8
x8
x8
Storage Connector
Rear External SD
(P
Right Riser (Slot 1)
ort
Center Riser PCIe x16 Connector (Slot 4)
2C
,2
D) ort (P
x8
, 3A
) 3B
(Po
C, rt 3
) 3D
x8 PCIe Gen3 x8
PC
CPU1
PCIe Gen3 x8
PCIe x1 port 3 + port 4
Port 0(DMI)
PCIe Gen3 x8
QPI1 DMI 2GB/s
PCIe Gen3 x8
TPM
South Bridge
PCIe Gen3 x8
PCIe
LPC
PCIe Gen3 x8
SD
LPC
PCIe Gen3 x8
LPC
PCIe Gen3 x8
PCIe
MAC
CH1
PCIe Gen3 x8
PCIe
CH0 MiniSAS Conn
USB
PCIe Gen3 x8
iDRAC
Coin Cell
SAS x8
USB
SATA
USB
MGMT MAC
LPC
Video UART SPI
Right Riser (Slot 2) & (Slot 3)
Internal PERC
System Power Connectors
BP Power Conns. 4X
Power Supply 1
Power Supply 2
(495W /750W/ 1100W )
(495W /750W/ 1100W )
HR85-8500-A
GPGPU Power Conn.
BP Signal Conns. 4X
Cray Private
45
Tiered Adaptive Storage™ Hardware Guide
MDC and DM Controls and Indicators Figure 4-6: MDC and DM Front Panel Features and Indicators
Table 4-3: MDC and DM Front Panel Features and Indicators Control/Indicator
Description
Power-on indicator
Lights when the system power is on
NMI button
Used to troubleshoot software and device driver errors when running certain operating systems
System identification button
Used to locate a particular system within a rack
VGA video connector LCD menu buttons
Used to navigate the LCD menu
Information tag
Records the service tag, NIC, and MAC address
LCD panel
Displays system ID, status information, and system error messages. The LCD lights blue during normal system operation
Optical drive (optional)
SATA DVD-ROM drive or DVD+/-RW drive
Hard drives
Supports eight 2.5 hot-swappable hard drives
vFlash media card slot USB connector (2) Tape drive slot (optional)
46
Supports one optional 3.5 inch tape backup unit
Cray Private
HR85-8500-A
Hardware Implementation [4]
Figure 4-7: MDC and DM Rear Panel Features and Indicators
Table 4-4: MDC and DM Rear Panel Features and Indicators Control/Indicator
Description
System identification button
Used to locate a particular system within a rack
System identification connector iDRAC7 enterprise port PCIe expansion card slots low-profile (3) Serial connector VGA video connectors (2) USB connectors (2) Ethernet connectors (4)
Four Ethernet or NIC connectors
PCIe expansion card slots full height (4)
HR85-8500-A
Power supply (PSU1)
Learn what AC/DC power we use
Power supply (PSU2)
Learn what AC/DC power we use
Cray Private
47
Tiered Adaptive Storage™ Hardware Guide
Figure 4-8: MDC and DM Inside the System
MDC and DM IO Configuration Network interface controller 1 (NIC), which is built into the motherboard, provides Ethernet connections for the TAS networks. In addition to NIC1, a second NIC (10GbE) provides network connections for the customer network. MDC and DM nodes are configured with the following networks. The Ethernet labeling may differ from what is stated: •
esmaint-net Private network (eth0) for system administration and maintenance. This network is used to provision slave nodes with software images and is used by Bright Cluster Management software.
•
ipmi-net iDRAC private maintenance network (impi0) for power control and monitoring.
•
site-admin-net System administration network (eth1).
•
metadata-net Metadata network (eth2) for storage management software.
48
Cray Private
HR85-8500-A
Hardware Implementation [4]
CIMS, MDC, and DM Nodes The following indicators are identical for the CIMS, MDC, and DM nodes. Figure 4-9: CIMS, MDC, and DM LCD Panel Features
The LCD panel provides status and error messages. •
The LCD backlight lights blue during normal operating conditions and lights amber to indicate an error condition.
•
The LCD backlight is off when the system is in standby mode and can be turned on by pressing either the Select, Left, or Right button on the LCD panels.
•
The LCD backlight remains off if LCD messaging is turned off through the iDRAC utility, the LCD panel, or another tools.
1.
The left button moves the cursor back in one-step increments.
2.
The select button selects the menu item highlighted by the cursor.
3.
The right button moves the cursor forward in one-step increments. During message scrolling: • Press once to increase scrolling speed • Press again to stop • Press again to return to default scrolling speed • Press again to repeat the cycle
HR85-8500-A
Cray Private
49
Tiered Adaptive Storage™ Hardware Guide
Figure 4-10: CIMS, MDC, and DM Hard Drive Indicators
Table 4-5: CMS, MDC, and DM Drive-Status Indicator Patterns Drive-Status Indicator Patterns (RAID Only)
Condition
Blinks green two times per second
Identifying drive or preparing for removal
Off
Drive ready for insertion or removal
Blinks green, amber, and off
Predicted drive failure
Blinks amber four times per second
Drive failed
Blinks green slowly
Drive rebuilding
Steady green
Drive online
Blinks green three seconds, amber three seconds, and off six seconds
Rebuild aborted.
Each AC power supply has an illuminated translucent handle and each DC power supply (when available) has an LED that serves as an indicator to show whether power is present or whether a power fault has occurred. Figure 4-11: CIMS, MDC, and DM AC Power Supply Status Indicators
50
Cray Private
HR85-8500-A
Hardware Implementation [4]
Figure 4-12: CIMS, MDC, and DM DC Power Supply Status Indicators
Table 4-6: CIMS, MDC, and DM Power Indicator Pattern Power indicator pattern
HR85-8500-A
Condition
Not lit
Power is not connected
Green
The LED indicator lights green indicating that a valid power source is connected to the power supply and that the power supply is operational.
Flashing amber
Indicates a problem with the power supply or power supply mismatch
Flashing green
Power supply is mismatched
Cray Private
51
Tiered Adaptive Storage™ Hardware Guide
Keyboard, Video, and Mouse (KVM) Figure 4-13: Rear View of an 8-port Rack Mount LCD KVM Assembly KVM 100-240V~, 0.5A, 50/60 Hz
1
3
5
7
2
4
6
8
LAN
Video esmaint-net or site-admin-net
Keyboard and Mouse
Maintenance dongle to connect KVM to another server
Table 4-7: KVM Monitor Push-button Functions Button/LED
Description
AUTO EXIT
Exits from the OSD function or returns to previous menu
SL (+ or -)
Moves the selector right or left on the OSD menu
MENU SELECT
Enables the OSD menu and controls the function to be adjusted
(Insert Power Symbol)
Turns your rack mount LCD KVM assembly on or off
LED (status light)
Green = on Flashing green = power save mode or disconnection of signal cable Red = off
Hotkey Functions The 8-port version of the Rack Mount LCD KVM Assembly features hotkey commands that allow you to easily access various functions of your assembly. The following table explains the hotkey functions available. Note: To utilize the hotkey functions in the following table, press the Ctrl key on the left side of the keyboard twice, then press the keys for the desired command. Table 4-8: KVM Hotkey Commands and Descriptions Function
Keystrokes
Description
Left cntl + left ctrl + 1-8
Selects the desired port channel by number (port 1-8)
Left cntl + left ctrl + F1-F8
Selects port channels 9-16
Left ctrl + left cntl + up or down arrow
The up arrow selects the next channel; the down arrow selects the previous channel
Beep
Left cntl + left ctrl + b
Turns the beep function on or off in scan mode (default is on)
Auto scan
Left ctrl + left ctrl + s
Allows you to exit Auto Scan by pressing any key
Reset assembly
Left ctrl + left ctrl + r
Resets the LCD KVM assembly
Port switch
52
Cray Private
HR85-8500-A
Hardware Implementation [4]
Table 4-8: KVM Hotkey Commands and Descriptions (continued) Function
Keystrokes
Description
Reset mouse
Left ctrl + left ctrl + m
Resets the mouse
Reset keyboard
Left ctrl + left ctrl + k
Reset the keyboard
OSD menu
Left ctrl + left ctrl + spacebar (doubleclick the right mouse button)
Displays the OSD menu
The two consecutive left ctrl keystrokes and the following command key(s) should be pressed within two seconds or the hotkey sequence will not be validated. Port number selections must be made using the numeric keys on the keyboard. Numeric keys on the keypad are not available as hotkey commands.
HR85-8500-A
Cray Private
53
Tiered Adaptive Storage™ Hardware Guide
TAS Metadata Block Storage Devices Introduction Marketing Code: NG6-DE60-C2708-4FC16 The NetApp 2700-24 is known as the Administrative Block Storage (ABS). It is used for cache file system metadata, and also an administrative file system (name = tas_admin). The admin data consists of data including logs and reports, and config files that have to be kept in sync across the two metadata controllers. The NetApp 5400-60 and 5500-60 are used as Data Cache Block Storage (DCBS) devices. The NetApp 5400-60 supports Fibre Channel and the NetApp 5500-60 supports InfiniBand. A NetApp 2700 with 2 IB RAID controllers could be used as a small metadata block storage (MDBS) instead of a NetApp 5500-60 storage device.
Controller Types Administrative Block Storage (ABS) SAS Interface, Controls, and Indicators •
The ABS (administrative block storage) is a NetApp E2700-24 storage device with metadata solid state disk drives. The admin LUN is four 600 GB drives, RAID 5 3+1.
•
The configuration of the host ports might appear different on your system depending on which host interface card configuration is installed. The controls and indicators might appear in different locations depending on which model is used for the CIMS, MDS, etc.
Figure 4-14: ABS Controller-Drive Tray - Front View End cap locate LED
Drive canister
End cap service action required LED End cap over-temperature LED End cap power LED End cap standby power LED
54
Cray Private
HR85-8500-A
Hardware Implementation [4]
Figure 4-15: ABS Controller-Drive Tray - Rear View SAS host 1 link up LED Serial port USB port
SAS host 1 link fault LED
SAS host 2 link up LED
1GbE ethernet management port 2
Tray ID indicator display
SAS host 2 link fault LED SAS expansion link 1 fault LED
Ethernet port 2 link active led
Controller canister
Power-fan canister
SFF-8644 SAS host connector 1 SFF-8466 SAS host connector 2
Ethernet port 1 link rate LED
AC power connector and switch
SAS expansion link 1 up LED
SFF-8644 SAS connector 1 (expansion)
Cache active led Controller service action required LED
DC power connector and switch
Controller service action allowed LED Battery charging LED Battery Service Action Required LED
Figure 4-16: ABS Right-Rear Subplate with No Host Interface Card Expansion SFF-8088 port connector ESM expansion active LED ESM expansion fault LED
Lnk
Port 1 Port 2
Lnk
Lnk
Lnk
Drive Expansion S A S
HR85-8500-A
Cray Private
55
Tiered Adaptive Storage™ Hardware Guide
Figure 4-17: ABS Right-Rear Subplate with a Two-Port 12-Gb/s SAS Host Interface Card 3. SFF-8644 SAS port 1 (expansion) SAS expansion link 1 up LED SAS expansion link 1 fault LED
Lnk
Port 1 Port 2
Lnk
Lnk Lnk
Ch 3
Ch 4
Lnk
Lnk
S A S
Drive Expansion Lnk
Lnk
S A S
ABS Host Channels
Caution: To prevent ESD damage to the tray, use proper ESD protection when you handle tray components.
•
Each controller has two native host ports.
•
In addition, you can have additional host ports a located on a host interface card (HIC). The following types of HICs are supported for TAS: •
Two SAS connectors at 6 Gb/s
Figure 4-18: ABS Host Channels on the Controller with a Two-Port SAS HIC - Rear View HD mini-SAS host interface card channel 4 HD mini-SAS host interface card channel 3
56
Cray Private
HR85-8500-A
Hardware Implementation [4]
LEDs on the ABS Controller-Drive Tray Figure 4-19: ABS LEDs on the Left End Cap
Controller-drive tray locate LED Service action required LED Controller-drive tray overtemperature LED Power LED
Standby Power LED
Table 4-9: ABS LEDs on the Left End Cap Location
HR85-8500-A
LED
Color
On
Off
1
Controllerdrive tray locate
White
Identifies a controller-drive tray that you are trying to find.
Normal status
2
Service action required
Amber
A component within the controller-drive tray needs attention.
Normal status
3
Controllerdrive tray overtemperature
Amber
The temperature of the controller-drive tray has reached an unsafe level
Normal status
4
Power
Green
Power is present
Power is not present
5
Standby power
Green
The controller-drive tray is in Standby Power mode.
The controller-drive tray is not in Standby Power mode.
Cray Private
57
Tiered Adaptive Storage™ Hardware Guide
Figure 4-20: ABS LEDs on the Drive
Drive power LED Drive service action required LED Drive service action allowed LED
Table 4-10: ABS LEDs on the Drive Location
LED
Color
On
Blinking Drive I/O activity is taking place.
Off The power is turned off.
1
Drive power
Green
The power is turned on, and the drive is operating normally.
2
Drive service action required
Amber
An error has occurred.
Normal status.
3
Drive service action required
Blue
The drive canister can be removed safely from the controller-drive tray.
The drive canister cannot be removed safely from the controller-drive tray.
Table 4-11: ABS Drive State LEDs
58
Drive State
Drive Power LED (Green)
Drive Service Action Required LED (Amber)
Power is not applied.
Off
Off
Normal operation - The power is turned on, but drive I/O activity is not occurring.
On
Off
Normal operation - Drive I/O activity is occurring
Blinking
Off
Service action required - A fault condition exists, and the drive is offline
On
On
Cray Private
HR85-8500-A
Hardware Implementation [4]
Figure 4-21: ABS LEDs on the Controller Canister Main Faceplate Ethernet connector 1 link active LED
Host link 1 service action required LED
Ethernet connector 2 link rate LED
Host link 1 service action allowed LED
Ethernet connector 1 link active LED
Host link 2 service action required LED Host link 2 service action allowed LED
Ethernet connector 1 link rate LED
Lnk
Ch 1
S A S
Ch 2
Lnk
Lnk Port 1
Lnk
Port 2
ID/ Diag
Battery service action required LED
Tray ID indicator Cache active LED Controller service action required LED
Battery charging LED
Controller service action allowed LED
Table 4-12: ABS LEDs on the Controller Canister Main Faceplate Location
HR85-8500-A
LED
Color
On
Off
1
Ethernet connector 1 link rate LED
Green
There is a 1000 GB/s rate.
There is a 10BASE-T rate.
2
Ethernet connector 1 link active LED
Green
The link is up (LED blinks when there is activity).
The link is not active.
3
Ethernet connector 2 link rate LED
Green
There is a 1000 Gb/s rate.
There is a 10BASE-T rate.
4
Ethernet connector 2 link active LED
Green
The link is up (the LED blinks when there is activity).
The link is not active.
Cray Private
59
Tiered Adaptive Storage™ Hardware Guide
Table 4-12: ABS LEDs on the Controller Canister Main Faceplate Location
60
LED
Color
On
Off
5
Host link 1 service action required LED
Amber
At least one of the four PHYs is working, but another PHY cannot establish the same link to the device connected to the host interface port connector.
No link error has occurred.
6
Host link 1 service action allowed LED
Green
At least one of the four PHYs in the host interface port is working and a link exists to the device connected to the IN port connector.
A link error has occurred.
7
Host link 2 service action required LED
Amber
At least one of the four PHYs is working, but another PHY cannot establish the same link to the device connected to the host interface port connector.
No link error has occurred.
8
Host link 2 service action allowed LED
Green
At least one of the four PHYs in the host interface port is working and a link exists to the device connected to the IN port connector.
A link error has occurred.
9
Battery service action required LED
Amber
The battery in the controller canister has failed.
Normal status.
10
Battery charging LED
Green
The battery is fully charged. The LED blinks when the battery is charging.
The controller canister is operating without a battery or the existing battery has failed.
11
Controller service action allowed LED
Blue
The controller canister can be removed safely from the controllerdrive tray.
The controller canister cannot be removed safely from the controller-drive tray.
12
Controller service action required LED
Amber
A fault exists within the controller canister.
Normal status.
Cray Private
HR85-8500-A
Hardware Implementation [4]
Table 4-12: ABS LEDs on the Controller Canister Main Faceplate Location
LED
Color
On
Off
13
Cache active LED
Green
Cache is active*
Cache is inactive or the controller canister has been removed from the controllerdrive tray.
14
Tray ID indicator display
Green
For more information, refer to “Controller-Drive Tray Sequence Code Definitions” on page 114.
* After an AC power failure, this LED blinks while cache offload is in process.
Figure 4-22: ABS LEDs on the Controller Canister Two-Port SAS Host Interface Card Subplates* Host interface card link 4 up LED
Host interface card link 5 up LED
Host interface card link 4 active LED
Host interface card link 5 active LED
Host interface card link 3 active LED
Host interface card link 6 up LED
Host interface card link 3 up LED
Host interface card link 6 active LED Lnk
S A S Lnk
Ch 3
Ch 4
Port 2
Lnk Lnk
Lnk
Ch 5
Lnk
Lnk
Lnk Lnk
Ch 6 Lnk
Lnk
Port 1
Lnk
Drive Expansion 12Gb/s
S A S
Table 4-13: ABS LEDs on the Controller Canister Two-Port SAS Host Interface Card Subplates*
HR85-8500-A
Location
LED
Color
On
Off
1
Host interface card link 3 up LED
Green
The Ethernet link has auto-negotiated to 1 GB/s.
The Ethernet link is down or does not auto-negotiate to 1 GB/s.
2
Host interface card link 3 active LED
Green
The link is up (LED blinks when there is activity).
The link is not active.
3
Host interface card link 4 up LED
Green
The Ethernet link has auto-negotiated to 1 GB/s.
The Ethernet link is down or does not auto-negotiate to 1 GB/s.
4
Host interface card link 4 active LED
Green
The link is up (LED blinks when there is activity).
The link is not active.
5
Host interface card link 5 up LED
Green
The Ethernet link has auto-negotiated to 1 GB/s.
The Ethernet link is down or does not auto-negotiate to 1 GB/s.
Cray Private
61
Tiered Adaptive Storage™ Hardware Guide
Table 4-13: ABS LEDs on the Controller Canister Two-Port SAS Host Interface Card Subplates* Location
LED
Color
On
Off
6
Host interface card link 5 active LED
Green
The link is up (LED blinks when there is activity).
The link is not active.
7
Host interface card link 6 up LED
Green
The Ethernet link has auto-negotiated to 1 Gb/s.
The Ethernet link is down or does not auto-negotiate to 1 GB/s.
8
Host interface card link 6 active LED
Green
The link is up (LED blinks when there is activity).
The link is not active.
* The figure shows a 4-port SAS host interface card subplate
Figure 4-23: ABS LEDs on the Power-Fan Canister
Standby power LED
Power-fan AC power LED
Power-fan DC power LED
Power-fan service action required LED
Power-fan service action allowed LED
Table 4-14: ABS LEDs on the Power-Fan Canister
62
Location
LED
Color
On
Off
1
Standby power
Green
The controller-drive tray is in Standby mode, and Dc power is not available.
The controller-drive tray is not in Standby mode, and DC power is available.
2
Power-fan DC power
Green
DC power from the powerfan canister is available.
DC power from the powerfan canister is not available.
3
Power-fan service action allowed
Blue
The power-fan canister can be removed safely from the controller-drive tray.
The power-fan canister cannot be removed safely from the controller-drive tray.
4
Power-fan service action required
Amber
A fault exists within the power-fan canister.
Normal status.
5
Power-fan AC power
Green
AC power to the power-fan canister is present.
AC power to the power-fan canister is not present.
Cray Private
HR85-8500-A
Hardware Implementation [4]
Data Cache Block Storage (DCBS) FC Interface, Controls, and Indicators Marketing Code: NG6-DE60-C5512-8FC16 •
The data cache block storage (DCBS) FC is a NetApp 5400-60 with optional expansion trays.
•
The configuration of the host ports might appear different on your system depending on which host interface card configuration is installed. The controls and indicators might appear in different locations depending on which model is used for the CIMS, MDCs, etc.
Figure 4-24: DCBS (FC) Controller-Drive Tray - Front View
Drive drawer Locate LED Service action required LED Overtemperature LED Power LED Standby power LED
Figure 4-25: DCBS (FC) Controller-Drive Tray - Rear View Ethernet link 2 active LED Ethernet link 1 rate LED Ethernet connector 1
Fibre channel link 1 up LED
Fibre channel link 1 active LED
Ethernet link 2 rate LED Ethernet connector 2
Fibre channel link 2 up LED
Fibre channel connector 1
Fibre channel connector 2
Mini USB connector
Fibre channel link 2 active LED
Expansion fault LED
Fibre channel connector 4
Ethernet link 1 active LED
Expansion active LED Expansion SFF-8088 port connector Controller A
USB connector Fan canister Fan power LED Fan service action required LED Fan service action allowed LED
Battery service action required LED
Second tray ID indicator display field
Battery charging LED Controller A service action allowed LED Controller A service action required LED Cache active LED
HR85-8500-A
First tray ID indicator display field
Power canister standby power LED Power canister DC power LED Power canister service action LED
Controller B Power canister
Power canister AC power LED Power canister service required LED
Cray Private
63
Tiered Adaptive Storage™ Hardware Guide
DCBS (FC) Host Channels
Caution: To prevent electrostatic discharge damage to the tray, use proper ESD protection when you handle tray components.
•
Each controller in the controller-drive tray has four standard 8-GB/s Fibre Channel host ports.
•
Each controller in the controller-drive tray has the following type of host interface card (HIC): •
Four 8-Gb/s Fibre Channel host ports
Figure 4-26: DCBS (FC) Host Channels on the Controller-Drive Tray - Rear View Host interface card (HIC) Standard fibre channel host connectors
Standard fibre channel host connectors Host interface card (HIC)
64
Cray Private
SAS expansion connector InfiniBand host connectors
InfiniBand host connectors SAS expansion connector
HR85-8500-A
Hardware Implementation [4]
LEDs on the DCBS (FC) Controller-Drive Tray Figure 4-27: DCBS (FC) LEDs on the Left End Cap
Power LED Standby power LED Locate LED Overtemperature LED Service action required LED
Table 4-15: DCBS (FC) LEDs on the Left End Cap Location
HR85-8500-A
LED
Color
On
Off
1
Power
Green
Power is present.
Power is not present.
2
Standby power
Green
The controller-drive tray is in Standby Power mode.
The controller-drive tray is not in Standby Power mode.
3
Locate
White
Identifies a controllerdrive tray that you are trying to find.
Normal status.
4
Overtemperature
Amber
The temperature of the controller-drive tray has reached an unsafe level.
Normal status.
5
Service action required
Amber
A component within the controller-drive tray needs attention.
Normal status.
Cray Private
65
Tiered Adaptive Storage™ Hardware Guide
Figure 4-28: DCBS (FC) LEDs on the Controller Canister Main Faceplate Fibre channel link 1 LED 0 Fibre channel link 1 LED 1
Ethernet connector 2 link active LED
Fibre channel link 2 LED 0 Fibre channel link 2 LED 1
Ethernet connector 2 link rate LED
Fibre channel link 3 LED 0 Fibre channel link 3 LED 1
Ethernet connector 1 link active LED
Fibre channel link 4 LED 0 Fibre channel link 4 LED 1
Ethernet connector 1 link rate LED
Cache active LED
Battery service action required LED
Controller service action required LED Controller service action allowed LED
Battery charging LED
Table 4-16: DCBS (FC) LEDs on the Controller Canister Main Faceplate Location
Color
On
Off
1
Ethernet connector 1 link rate LED
Green
There is a 1000BASET rate.
There is a 10/100BASE-T rate.
2
Ethernet connector 1 link active LED
Green
The link is up (the LED blinks when activity is occurring).
The link is not active.
3
Ethernet connector 2 link rate LED
Green
There is a 1000BASET rate.
There is a 10/100BASE-T rate.
4
Ethernet connector 2 link active LED
Green
The link is up (the LED blinks when there is activity)
The link is not active.
5 and 6
Host connect 1 link active and data rate LED 0 off and LED 1 off
Green
The link is not active.
LED 0 on and LED 1 off
Green
Data rate 2 GB/s.
LED 0 off and LED 1 on
Green
Data rate 4 GB/s.
LED 0 on and LED 1 on
Green
Data rate 8 Gb/s.
7 and 8
66
LED
Host connection 2 link active and data rate
Cray Private
HR85-8500-A
Hardware Implementation [4]
Table 4-16: DCBS (FC) LEDs on the Controller Canister Main Faceplate Location
Color
On
Off
LED 0 off and LED 1 off
Green
The link is not active.
LED 0 on and LED 1 off
Green
Data rate 2 Gb/s.
LED 0 off and LED 1 on
Green
Data rate 4 Gb/s.
LED 0 on and LED 1 on
Green
Data rate 8 Gb/s.
LED 0 off and LED 1 of
Green
The link is not active.
LED 0 on and LED 1 off
Green
Data rate 2 Gb/s.
LED 0 on and LED 1 off
Green
Data rate 4 Gb/s.
LED 0 on and LED 1 on
Green
Data rate 8 Gb/s.
LED 0 off and LED 1 of
Green
The link is not active.
LED 0 on and LED 1 off
Green
Data rate 2 Gb/s.
LED 0 on and LED 1 off
Green
Data rate 4 Gb/s.
LED 0 on and LED 1 on
Green
Data rate 8 Gb/s.
13
Battery service action required LED
Amber
The battery in the controller canister has failed.
Normal status.
14
Battery charging LED
Green
The battery is fully charged. The LED blinks when the battery is charging.
The controller canister is operating without a battery or the existing battery has failed.
15
Controller service action allowed LED
Blue
The controller canister can be removed safely from the controllerdrive tray.
The controller canister cannot be removed safely from the controller-drive tray.
9 and 10
11 and 12
HR85-8500-A
LED
Host connection 3 link active and data rate
Host connection 4 link active and data rate
Cray Private
67
Tiered Adaptive Storage™ Hardware Guide
Table 4-16: DCBS (FC) LEDs on the Controller Canister Main Faceplate Location
LED
Color
On
Off
16
Controller service action required LED
Amber
A fault exists within the controller canister.
Normal status.
17
Cache active LED
Green
Cache is active.*
Cache is inactive the controller canister has been removed from the controller-drive tray.
* After an AC power failure, this LED blinks while cache offload is in process.
Figure 4-29: DCBS (FC) Right-Rear Subplate with an FC Host Interface Card SAS expansion active LED
FC Host Card
Lnk
2
8
4
2
Host interface card link 5 LED 1
68
8
7 FC host Host interface card interface card connector 7 link 6 LED 1 Host interface card link 6 LED 0
FC host interface card connector 6
Cray Private
4
Lnk
2
8
Lnk
4 Ch 8
FC host interface card connector 5
2
4 Ch 6
Ch 5 Host interface card link 5 LED 0
8
SAS expansion SFF-8088 port connector
SAS expansion fault LED
Mini USB
Drive Expansion 6Gb/s
S A S
FC host interface card connector 8 Host interface card link 8 LED 1 Host interface card link 8 LED 0 Host interface card link 7 LED 1 Host interface card link 7 LED 0
HR85-8500-A
Hardware Implementation [4]
Table 4-17: DCBS (FC) LEDs on the Controller Canister Fibre Channel Host Interface Card Subplates Location
Color
On
Off
2
Expansion fault LED
Amber
At least one of the four PHY is working, but another PHY cannot establish the same link to the device connected to the Expansion OUT connector.
Normal status.
3
Expansion active LED
Green
At least one of the four PHys in the OUt connector is working and a link has been made to the device connected to the Expansion connector.
The link is not active.
6 and 7
LED 0 off and LED 1 off
Green
The link is not active.
LED 0 on and LED 1 off
Green
Data rate 2 Gb/s.
LED 0 off and LED 1 on
Green
Data rate 4 Gb/s.
LED 0 on and LED 1 on
Green
Data rate 8 Gb/s.
LED 0 off and LED 1 off
Green
The link is not active.
LED 0 on and LED 1 off
Green
Data rate 2 Gb/s.
LED 0 off and LED 1 on
Green
Data rate 4 Gb/s.
LED 0 on and LED 1 on
Green
Data rate 8 Gb/s.
LED 0 off and LED 1 off
Green
The link is not active.
LED 0 on and LED 1 off
Green
Data rate 2 Gb/s.
LED 0 off and LED 1 on
Green
Data rate 4 Gb/s.
LED 0 on and LED 1 on
Green
Data rate 8 Gb/s.
LED 0 off and LED 1 off
Green
The link is not active.
9 and 10
12 and 13
15 and 16
HR85-8500-A
LED
Cray Private
69
Tiered Adaptive Storage™ Hardware Guide
Table 4-17: DCBS (FC) LEDs on the Controller Canister Fibre Channel Host Interface Card Subplates Location
LED
Color
On
LED 0 on and LED 1 off
Green
Data rate 2 Gb/s.
LED 0 off and LED 1 on
Green
Data rate 4 Gb/s.
LED 0 on and LED 1 on
Green
Data rate 8 Gb/s.
Off
Figure 4-30: DCBS (FC) LEDs on the Power Canister
Standby power LED DC power LED
AC power LED
Power service action allowed LED
Power service action required LED
Table 4-18: DCBS (FC) LEDs on the Power Canister Location
LED
Color
On
Off
1
Standby power
Green
The drive tray is in Standby mode and DC power is not available.
The drive tray is not in Standby mode and DC power is available.
2
DC power
Green
DC power from the power canister is available.
DC power from the power canister is not available.
3
Power service action allowed
Blue
The power canister can be removed safely from the drive tray.
The power canister cannot be removed safely from the drive tray.
4
Power service action required
Amber
A fault exists within the power canister.
Normal status.
5
AC power
Green
AC power to the power canister is present.
AC power to the power canister is not present.
Data Cache Block (DCBS) IB Interface, Controls, and Indicators •
70
The Data Cache Block Storage (DCBS) can be configured using a NetApp 5500-60 storage array with optional expansion trays or a standalone Netapp 2700-24 storage arrays with optional expansion trays (known as Metadata Block Storage (MDBS)). The controls and indicators show the NetApp 5500-60 only.
Cray Private
HR85-8500-A
Hardware Implementation [4]
•
The configuration of the host ports might appear different on your system depending on which host interface card configuration is installed. The controls and indicators might appear in different locations depending on which model is used for the CIMS, MDCs, etc.
•
The host ports are located on the host interface card (HIC). The following types of HICs are supported: •
Two InfiniBand connectors at 40 Gb/s
Figure 4-31: DCBS (IB) Host Channels on the Controllers with InfiniBand HIC - Rear View QSFP channels 1 and 2
QSFP channels 1 and 2
HR85-8500-A
Cray Private
71
Tiered Adaptive Storage™ Hardware Guide
LEDs on the DCBS (IB) Controller-Drive Tray Figure 4-32: DCBS (IB) LEDs on the Controller Canister Main Faceplate SAS channel 1 service action required LED
Ethernet port 2 link active LED
SAS channel 1 link up LED
Ethernet port 2 link rate LD
SAS channel 2 service action required LED
Ethernet port 1 link active LED
SAS channel 2 link up LED
Ethernet port 1 link rate LED
Tray ID indicator
Battery service action required LED
Cache active LED Controller service action required LED
10. Battery charging LED
Controller service action allowed LED
Table 4-19: DCBS (IB) LEDs on the Controller Canister Main Faceplate Location
72
LED
Color
On
Off
1
Ethernet port 1 link rate LED
Green
There is a 100BASE-T rate.
There is a 10BASE-T rate.
2
Ethernet port 1 link active LED
Green
The link is up (LED blinks when there is activity)
The link is not active.
3
Ethernet port 2 link rate LED
Green
there is a 100BASE-T rate.
There is a 10BASE-T rate.
4
Ethernet port 2 link active LED
Green
The link is up (the LED blinks when there is activity.)
The link is not active.
5
SAS channel 1 service action required LED
Amber
At least one of the four PHYs is working, but another PHY cannot establish the same link to the device connected to the host interface port connector.
No link error has occurred.
Cray Private
HR85-8500-A
Hardware Implementation [4]
Table 4-19: DCBS (IB) LEDs on the Controller Canister Main Faceplate Location
LED
Color
On
Off
6
SAS channel 1 link up LED
Green
At least one of the four PHYs in the host interface port is working and a link exists to the device connected to the interface port connector.
A link error has occurred.
7
SAS channel 2 service action required LED
Amber
At least one four PHYs is working, but another PHY cannot establish the same link to the device connected to the host interface port connector.
No link error has occurred.
8
SAS channel 2 link up LED
Green
At least one of the four PHYs in the host interface port is working and a link exists to the device connected to the interface port connector.
A link error has occurred.
9
Battery service action required LED
Amber
The battery in the controller canister has failed.
Normal status.
10
Battery charging LED
Green
The battery is fully charged. The LED blinks when the battery is charging.
The controller canister is operating without a battery or the existing battery has failed.
11
Controller service action allowed LED
Blue
The controller canister can be removed safely from the controllerdrive tray.
The controller canister cannot be removed safely from the controller-drive tray.
12
Controller service action required LED
Amber
A fault exists within the controller canister.
Normal status.
13
Cache active LED
Green
Cache is active.*
Cache is inactive or the controlled canister has been removed from the controllerdrive tray.
14
Tray ID indicator
N/A
N/A
N/A
* After an AC power failure, this LED blinks while cache offload is in process.
Figure 4-33: DCBS (IB) LEDs on the Controller Canister InfiniBand Host Interface Card
HR85-8500-A
Cray Private
73
Tiered Adaptive Storage™ Hardware Guide
Subplates QSFP channel 1 physical link up LED
QSFP channel 2 logical link up LED
QSFP channel 1 logical link up LED
QSFP channel 2 physical link up LED
Table 4-20: DCBS (IB) LEDs on the Controller Canister InfiniBand Host Interface Card Subplates* Location
74
LED
Color
On
Off
1
QSFP channel 1 logical link up LED
Amber
The link is up (LED blinks when there is activity.)
The controller has not yet loaded the driver for the ConnectX Host Channel Adapter.
2
QSFP channel 1 physical link up LED
Green
The physical link is active.
The physical link is not active.
3
QSFP channel 2 logical link up LED
Amber
The link is up (LED blinks when there is activity).
The controller has not yet loaded the driver for the ConnectX Host Channel Adapter.
4
QSFP Channel 2 physical link up LED
Green
The physical link is active.
The physical link is not active.
Cray Private
HR85-8500-A
Hardware Implementation [4]
Figure 4-34: DCBS (IB) LEDs on the Left Bezel
Power LED Standby power LED Controller-drive tray locate LED Controller drive tray overtemperature LED Service action required LED
Table 4-21: DCBS (IB) LEDs on the Left Bezel Location
HR85-8500-A
LED
Color
On
Off
1
Controllerdrive tray locate
White
Identifies a controller-drive tray that you are trying to find.
Normal status.
2
Service action required
Amber
A component within the controller-drive tray needs attention.
Normal status.
3
Controllerdrive tray overtemperature
Amber
The temperature of the controller-drive tray has reached an unsafe level.
Normal status.
4
Power
Green
Power is present.
Power is not present.
5
Standby power
Green
The controller-drive tray is in Standby Power mode.
The controller-drive tray is not in Standby Power mode.
Cray Private
75
Tiered Adaptive Storage™ Hardware Guide
Figure 4-35: DCBS (IB) LEDs on the Power Canister
Standby power LED DC power LED
AC power LED
Power service action allowed LED
Power service action required LED
Table 4-22: DCBS (IB) LEDs on the Power Canister Location
76
LED
Color
On
Off
1
Standby power
Green
The controller-drive tray is in Standby mode and DC power is not available.
The controller-drive tray is not in Standby mode and DC power is available.
2
DC power
Green
DC power from the powerfan canister is available.
DC power from the powerfan canister is not available.
3
Power service action allowed
Blue
The power-fan canister can be removed safely from the drive tray.
The power-fan canister cannot be removed safely from the drive tray.
4
Power service action required
Amber
A fault exists within the power-fan canister.
Normal status.
5
AC power
Green
AC power to the power-fan canister is present.
AC power to the power-fan canister is not present.
Cray Private
HR85-8500-A
Hardware Implementation [4]
Figure 4-36: DCBS (IB) LEDs on the Fan Canister
Fan power LED
Fan service action required ED
Fan service action allowed LED
Table 4-23: DCBS (IB) LEDs on the Fan Canister Location
HR85-8500-A
LED
Color
On
Off
1
Fan power
Green
Power to the fan canister is available.
Power to the fan customer-replaceable unit (CRU) is not available.
2
Fan service action required
Amber
A fault exists within the fan canister.
Normal status.
3
Fan service action allowed
Blue
The fan canister can be removed safely from the drive tray.
The fan canister cannot be removed safely from the drive tray.
Cray Private
77
Tiered Adaptive Storage™ Hardware Guide
ABS and DCBS LED Indicators Table 4-24: ABS and DCBS LED Symbols and General Behavior LED
Symbol
Location (Canisters)
Function
Power
Power-fan Interconnect-battery**
• On: The controller has power. • Off: The controller does not have power. Note: The controller canisters do not have a power LED. They receive their power from the power supplies inside the power-fan canisters.
Battery Fault
Battery
• On: The battery is missing or has failed. • Off: The battery is operating normally. • Blinking: The battery is charging.
Service Action Allowed
Drive (left LED, no symbol) Power-fan Controller Battery
On: You can remove the canister safely.
Service Action Required (Fault)
Drive
On: When the controller-drive tray LED is on, the cable is attached and at least one lane has a link up status, but at least one lane has a link down status. Off: One of the following conditions exists: • No cable is attached. • A cable is attached, and all lanes have a link up status. • A cable is attached, and all lanes have a link down status.
Service Action Required (Fault)
Controller Power-fan canister
On: The controller or the power-fan canister needs attention. Off: The controller and the power-fan canister are operating normally.
Service Action Required (Fault)*
FC Host Card
On: The FC cable is attached and at least one lane has a link up status, but at least one lane has a link down status. Off: One of the following conditions exists: • No cable is attached. • A cable is attached, and all lanes have a link up status. • A cable is attached, and all lanes have a link down status.
Locate
Front bezel on the controllerdrive tray* Front frame**
On: Assists in locating the tray.
78
Cray Private
HR85-8500-A
Hardware Implementation [4]
Table 4-24: ABS and DCBS LED Symbols and General Behavior LED
Symbol
Location (Canisters)
Function
InfiniBand Host Channel Connection* Host channel Connection (iSCSI)**
Controller
The status of the host channel is indicated: • “L” LED on: A link is established. • “A” LED on: Activity (data transfer) is present.
Fibre Channel Host Channel Connection*
Controller
The status of the host channel is indicated: • “2” LED on: A link is established. • “4” LED on: Activity (data transfer) is present.
Cache Active
Controller
The activity of the cache is indicated: • On: Data is in the cache. • Off: No data is in the cache.
Controller-Drive Tray Over-Temperature
Front bezel on the controllerdrive tray
On: The temperature of the controller-drive tray has reached an unsafe condition. Off: The temperature of the controller-drive tray is within operational range.
Controller-Drive Tray Standby Power
Front bezel on the controllerdrive tray
On: The controller-drive tray is in standby mode and the main AC/DC power is off. Off: The controller-drive tray is not in standby mode and the main AC/DC power is on.
Tray ID Indicator Diagnostic Display
Controller
The tray ID or a diagnostic code is indicated (see “Diagnostic Code Sequences for the Controller-Drive Trays” on page 120.) For example, if some of the cache memory dual inline memory modules (DIMMs) are missing in a controller, error code L8 appears in the diagnostic display (see “Lock-Down Codes for the Controller-Drive Trays” on page 119.)
AC Power
Power-fan Note: The LED is directly above or below the AC power switch and the AC power connector.
Indicates that the power supply is receiving AC power input.
HR85-8500-A
Cray Private
79
Tiered Adaptive Storage™ Hardware Guide
Table 4-24: ABS and DCBS LED Symbols and General Behavior LED
Symbol
Location (Canisters)
Function
DC Power (not available)
Power-fan Note: The LED is directly above or the below the DC power switch and the DC power connector.
Indicates that the power supply is receiving DC power input.
Ethernet Speed and Ethernet Activity
Controller
The speed of the Ethernet ports and whether a link has been established are indicated: • Left LED On: 1-GB/s speed. • Left LED Off: 100BASE-T or 10BASE-T speed. • Right LED On: A link is established. • Right LED Off: No link exists. • Right LED Blinking: Activity is occurring.
* Applicable to the NetApp 5460 only. ** Applicable to the NetApp 2724 and 5560 only.
Service Action Allowed LEDs Each controller canister, power-fan canister, and battery canister has a Service Action Allowed LED. The Service Action Allowed LED lets you know when you can remove a canister safely.
Caution: Never remove a controller canister, a power-fan canister, or a battery canister unless the appropriate Service Action Allowed LED is on.
Drive Channels ABS Drive Channels Each controller have levers for removing the controller from the controller-drive tray. The close spacing between the handle and the host connector might make it difficult to remove a cable that is attached to the host connector. If this problem occurs, use a flat-blade screwdriver to push in the release component on the cable connector.
80
•
The controller-drive tray supports the DE5600 drive tray for expansion.
•
Each controller has one dual-ported SAS expansion connector to connect to the drive trays.
Cray Private
HR85-8500-A
Hardware Implementation [4]
Figure 4-37: ABS Drive Channel Ports on the Controller-Drive Tray - Rear View Controller canister A
Controller canister B
SAS base host port (CH1 and CH2) SAS base host port (CH1 and CH2)
Figure 4-38: ABS Controller with Base Host Channel Ports (CH1 and CH2)
DCBS (FC) Drive Channels The E5400 controller-drive trays supports the DE6600 drive tray for expansion. •
Each controller has two SAS expansion connectors that must both be connected (one to the ESM A side and the other to the ESM B side) to the drive tray to ensure path redundancy.
Figure 4-39: Drive Channel Ports on the Controller-Drive Tray Controller canister A
Controller canister B
HR85-8500-A
Cray Private
Drive channel port (SAS expansion connector)
Drive channel port SAS drive expansion connector)
81
Tiered Adaptive Storage™ Hardware Guide
DCBS (IB) Drive Channels The E5500 controller-drive trays supports the DE6600 drive tray for expansion. •
Each controller has two SAS expansion connectors that must both be connected (one to the ESM A side and the other to the ESM B side) to the drive tray to ensure path redundancy.
Figure 4-40: DCBS (IB) Drive Channel Ports on the Controller-Drive Tray Controller canister A
SFF-8088 SAS channels (drive expansion)
Controller canister B
SFF-8008 SAS channels (drive expansion)
Figure 4-41: DCBS (IB) Controller with Labeled Drive Channel Ports (CH1 and CH2)
Disk Drive Trays ABS 24-bay Drive Tray Controls and Indicators Drive Trays The following disk drive types are used in the DE5600 drive enclosures:
82
•
600GB 10K SAS 2.5” HDDs
•
400GB or 800GB MLC 2.5” SSD drives
Cray Private
HR85-8500-A
Hardware Implementation [4]
Figure 4-42: ABS Drive Tray - Front View Left end cap (has the drive tray LEDs)
Drives
Right end cap
Figure 4-43: ABS Drive Tray - Rear View
ESM A
SAS IN connector
SAS expansion (out) connector
SAS IN connector
ESM B
Figure 4-44: ABS Drive Tray with AC Power Option - Rear View Expansion Port SFF-8088 Connector 2 (IN)
Seven-Segment Display Indicators Serial Connector
Expansion Port SFF-8088 Connector 1 (IN)
Ethernet Connector Expansion Port SFF-8088 Connector (OUT)
ESM A Canister
Power-Fan Canister
HR85-8500-A
Power Connector
Cray Private
ESM B Canister
Power Switch
83
Tiered Adaptive Storage™ Hardware Guide
LEDs on the DE5400 Figure 4-45: ABS LEDs on the ESM Canister Host link 2 active LED
Host link 2 fault LED Host link 1 active LED
Ethernet link active LED Ethernet link rate LED
Host link 1 fault LED
ESM expansion link fault LED ESM expansion link active LED
ESM service action allowed LED Tray ID indicator
ESM service action required LED ESM power LED
Table 4-25: ABS LEDs on the ESM Canister Location
84
LED
Color
On
Off
1
Host link 1 fault
Amber
At least one PHY of the four connectors is working, but another PHY cannot establish the same link to the device connected to the host interface port connector.
No link error has occurred.
2
Host link 1 active
Green
At least one of the four PHYs in the In port is working, and a link exists to the device connected to the host interface connector.
A link error has occurred.
3
Host link 2 fault
Amber
At least one PHY of the four connections is working, but another PHY cannot establish the same link to the device connected to the host interface port connector.
No link error has occurred.
4
Host link 2 active
Green
At least one of the four PHYs in the In port is working, and a link exists to the device connected to the host interface connector.
A link error has occurred.
5
Ethernet link active
Green
The link is up. (The LED blinks when there is activity.)
The link is not active.
6
Ethernet link rate
Green
There is a 100BASE-T rate.
There is a 10BASE-T rate.
7
ESM expansion link fault
Amber
At least one of the four PHYs in the Out port is working, but another PHY cannot establish the same link to the Expansion Out connector.
Normal status.
8
ESM expansion link active
Green
At least one of the four PHYs in the Out port is working, and a link exists to the device connected to the Expansion Out connector.
A link error has occurred.
Cray Private
HR85-8500-A
Hardware Implementation [4]
Table 4-25: ABS LEDs on the ESM Canister Location
LED
Color
On
Off
9
ESM service action allowed
Blue
The ESM can be removed safely from the drive tray.
The ESM cannot be removed safely from the drive tray.
10
ESM service action required
Amber
A fault exists within the ESM. (This LED defaults on at power up. This LED turns off after the software has completed its power up self-test sequence.)
Normal status.
11
ESM power
Green
12V power to the ESM is present.
Power is not present to the ESM.
12
Tray ID indicator
Green
See note.
Not applicable.
For more information about the tray ID indicators, see Chapter
5, "Block Storage Tray ID Codes," on
page 113. Figure 4-46: ABS LEDs on the AC Power-Fan Canister
Power-fan service action required LED Power-fan input AC power LED
Standby power LED Power-fan output DC power LED
Power-fan service action allowed LED
Table 4-26: ABS LEDs on the Power-Fan Canister Location
HR85-8500-A
LED
Color
On
Off
1
Standby power
Green
The drive tray is in Standby mode, and DC power is not available.
The drive tray is not in Standby mode, and DC power is available.
2
Power-fan DC power
Green
DC power from the powerfan canister is available.
DC power from the powerfan canister is not available.
3
Power-fan service action allowed
Blue
The power-fan canister can be removed safely from the drive tray.
The power-fan canister cannot be removed safely from the drive tray.
4
Power-fan service action required
Amber
A fault exists within the power-fan canister.
Normal status.
5
Power-fan AC power
Green
AC power to the power-fan canister is present.
AC power to the power-fan canister is not present.
Cray Private
85
Tiered Adaptive Storage™ Hardware Guide
DCBS 60-bay Drive Tray Controls and Indicators Figure 4-47: DCBS DE6600 Drive Tray - Rear View ESM A
SAS IN connectors
SAS expansion (out) connectors
ESM B
Figure 4-48: DCBS Maximum Capacity E-Series Array Configuration using DE6600 Shelves
86
Cray Private
HR85-8500-A
Hardware Implementation [4]
Figure 4-49: DCBS LEDs on the Left Bezel
Power LED Standby power LED Controller-drive tray locate LED Controller drive tray overtemperature LED Service action required LED
Table 4-27: DCBS LEDs on the Left Bezel Location
HR85-8500-A
LED
Color
On
Off
1
Power
Green
Power is present.
Power is not present.
2
Standby power
Green
The drive tray is in Standby Power mode.
The drive tray is not in Standby Power mode.
3
Locate
White
Identifies a drive tray that you are trying to find.
Normal status.
4
Overtemperature
Amber
The temperature of the drive tray has reached an unsafe level.
Normal status.
5
Service action required
Amber
A component within the drive tray needs attention.
Normal status
Cray Private
87
Tiered Adaptive Storage™ Hardware Guide
Figure 4-50: DCBS LEDs on the ESM Canister Host link 2 active LED
Host link 2 fault LED Host link 1 active LED
Ethernet link active LED Ethernet link rate LED
Host link 1 fault LED
ESM expansion link fault LED ESM expansion link active LED
ESM service action allowed LED Tray ID indicator
ESM service action required LED ESM power LED
Table 4-28: DCBS LEDs on the ESM Canister Location
88
LED
Color
On
Off
1
Host link 1 fault
Amber
At least one PHY of the four connectors is working, but another PHY cannot establish the same link to the device connected to the host interface port connector.
No link error has occurred.
2
Host link 1 active
Green
At least one of the four PHYs in the In port is working, and a link exists to the device connected to the host interface connector.
A link error has occurred.
3
Host link 2 fault
Amber
At least one PHY of the four connections is working, but another PHY cannot establish the same link to the device connected to the host interface port connector.
No link error has occurred.
4
Host link 2 active
Green
At least one of the four PHYs in the In port is working, and a link exists to the device connected to the host interface connector.
A link error has occurred.
5
Ethernet link active
Green
The link is up. (The LED blinks when there is activity.)
The link is not active.
6
Ethernet link rate
Green
There is a 100BASE-T rate.
There is a 10BASE-T rate.
7
ESM expansion link fault
Amber
At least one of the four PHYs in the Out port is working, but another PHY cannot establish the same link to the Expansion Out connector.
Normal status.
8
ESM expansion link active
Green
At least one of the four PHYs in the Out port is working, and a link exists to the device connected to the Expansion Out connector.
A link error has occurred.
9
ESM service action allowed
Blue
The ESM can be removed safely from the drive tray.
The ESM cannot be removed safely from the drive tray.
Cray Private
HR85-8500-A
Hardware Implementation [4]
Table 4-28: DCBS LEDs on the ESM Canister Location
LED
Color
On
Off
10
ESM service action required
Amber
A fault exists within the ESM. (This LED defaults on at power up. This LED turns off after the software has completed its power up self-test sequence.)
Normal status.
11
ESM power
Green
12V power to the ESM is present.
Power is not present to the ESM.
12
Tray ID indicator
Green
See note.
Not applicable.
For more information about the tray ID indicators, see “Tray ID Indicator Display for the ESMs on the Drive Trays” on page 121.
Figure 4-51: DCBS LEDs on the Power Canister
Standby power LED DC power LED
AC power LED
Power service action allowed LED
Power service action required LED
Table 4-29: DCBS LEDs on the Power-Fan Canister Location
HR85-8500-A
LED
Color
On
Off
1
Standby power
Green
The controller-drive tray is in Standby mode, and AC/DC power is not available.
The controller-drive tray is not in Standby mode, and AC/DC power is available.
2
Power-fan DC power
Green
DC power from the powerfan canister is available.
DC power from the powerfan canister is not available.
3
Power-fan service action allowed
Blue
The power-fan canister can be removed safely from the controller-drive tray.
The power-fan canister cannot be removed safely from the controller-drive tray.
4
Power-fan service action required
Amber
A fault exists within the power-fan canister.
Normal status.
5
Power-fan AC/DC power
Green
AC/DC power to the powerfan canister is present.
AC/DC power to the powerfan canister is not present.
Cray Private
89
Tiered Adaptive Storage™ Hardware Guide
Figure 4-52: DCBS LEDs on the Fan Canister
Fan power LED
Fan service action required ED
Fan service action allowed LED
Table 4-30: DCBS LEDs on the Fan Canister Location
90
LED
Color
On
Off
1
Fan power
Green
Power to the fan canister is available.
Power to the fan customerreplaceable unit (CRU) is available.
2
Fan service action required
Amber
A fault exists within the fan canister.
Normal status.
3
Fan service action allowed
Blue
The fan canister can be removed safely from the drive tray.
The fan canister cannot be removed safely from the drive tray.
Cray Private
HR85-8500-A
Hardware Implementation [4]
LEDs on the DE6600 Drive Drawers Figure 4-53: DCBS LEDs on the Drawer Drive drawer status service action required LED
Drive 4 activity LED Drive 5 activity LED
Drive drawer status service action allowed LED
Drive 6 activity LED
Drive 1 activity LED
Drive 7 activity LED
Drive 10 activity LED
Drive 2 activity LED
Drive 8 activity LED
Drive 11 activity LED
Drive 3 activity LED
Drive 9 activity LED
Drive 12 activity LED
Table 4-31: DCBS LEDs on the Drive Tray Drawer Location
LED
Color
On
Blinking
Off
1
Drive drawer service action required
Amber
An error has occurred.
Normal status.
2
Drive drawer service action allowed
Blue
The drive canister can be removed safely from the drive drawer in the drive tray.
The drive canister cannot be removed safely from the drive drawer in the drive tray.
3-14
Drive activity for drives 1 through 12 in the drive drawer
Green
The power is turned on, and the drive is operating normally.
Drive I/O activity is taking place.
The power is turned off.
Table 4-32: DCBS Drive State Represented by the LEDs Drive Activity LED (Green)
Drive State
HR85-8500-A
Drive Service Action Required LED (Amber)
Power is not applied.
Off
Off
Normal operation - The power is turned on, but drive I/O activity is not occurring.
On
Off
Normal operation - Drive I/O activity is occurring.
Blinking
Off
Service action required - A fault condition exists, and the drive is offline.
On
On
Cray Private
91
Tiered Adaptive Storage™ Hardware Guide
Figure 4-54: DCBS LEDs on the Drive
Drive service action allowed LED
Drive service action required LED
Table 4-33: LEDs on the Drives Location
LED
Color
On
Off
1
Drive service action allowed
Blue
The drive canister can be removed safely from the drive drawer in the drive tray.
The drive canister cannot be removed safely from the drive drawer in the drive tray.
2
Drive service action required
Amber
An error has occurred
Normal status.
Table 4-34: DCBS Drive State Represented by the LEDs Drive Activity LED (Green)
Drive State
92
Drive Service Action Required LED (Amber)
Power is not applied.
Off
Off
Normal operation - The power is turned on, but drive I/O activity is not occurring.
On
Off
Normal operation - Drive I/O activity is occurring.
Blinking
Off
Service action required - A fault condition exists, and the drive is offline.
On
On
Cray Private
HR85-8500-A
Hardware Implementation [4]
General Behavior of the LEDs on the DE6600 Drive Tray Table 4-35: DCBS LED Symbols and General Behavior LED
Symbol
Location (Canisters)
Function
Power
Drive tray (front bezel) ESM canister Power-fan canister
• On: Power is applied to the drive tray or the canister. • Off: Power is not applied to the drive tray or the canister.
Locate
Front bezel on the controller-drive tray
On or blinking: Indicates the drive tray that you are trying to find.
Controller-Drive Tray Over-Temperature
Front bezel on the controller-drive tray
On: The temperature of the drive tray has reached an unsafe condition. Off: The temperature of the drive tray is within operational range.
Controller-Drive Tray Standby Power
Front bezel on the controller-drive tray
On: The drive tray is in standby mode and the main DC power is off. Off: The drive tray is not in standby mode and the main DC power is on.
Service Action Allowed
ESM canister Power-fan canister Drive
On: It is safe to remove the ESM canister, the power-fan canister, or the drive. Off: Do not remove the ESM canister, the power-fan canister, or the drive. The drive has an LED but no symbol.
Service Action Required (Fault)
ESM canister Power-fan canister Drive
On: When the drive tray LED is on, a component within the drive tray needs attention. On: The ESM canister, the power-fan canister, or the drive needs attention. Off: The ESM canister, the power-fan canister, and the drive are operating normally. The drive has an LED but no symbol.
AC Power
ESM canister Power-fan canister
On: AC power is present. Off: AC power is not present.
Output DC power
Power-fan canister
On: Regulated DC output power from the power canister and the fan canister is present. Off: Regulated DC output power from the power-fan canister is not present.
HR85-8500-A
Cray Private
93
Tiered Adaptive Storage™ Hardware Guide
Table 4-35: DCBS LED Symbols and General Behavior LED
Symbol
Location (Canisters)
Function
Input DC power
Power-fan canister
On: Regulated DC input power from the power canister and the fan canister is present. Off: Regulated DC input power from the powerfan canister is not present.
Link Service Action Required (Fault)
ESM canister
On: The cable is attached and at least one lane has a linkup status, but one lane has a link-down status. Off: The cable is not attached, the cable is attached and all lanes have a link-up status, or the cable is attached and all lanes have a linkdown status.
ESM canister
On: The cable is attached and at least one lane has a linkup status. Off: The cable is not attached, or the cable is attached and all lanes have a link-down status.
Link up
94
Two LEDs above each expansion connector
Cray Private
HR85-8500-A
Hardware Implementation [4]
Tape Archive Storage Tier (TAST) TAS is normally configured with a TAST but it is not required. The Cray TAS sells and supports Spectralogic T50e, T200, T380, T680, and T950 tape libraries. However, TAS will support certain other tape libraries from Oracle(STK), Quantum, and IBM. Although Spectralogic tape drives are supported by Cray, they are serviced by Spectralogic. The TAS TAST is connected by FC-8 cables to the storage network. It supports T10k, TS1140, LTO-5, or LTO-6 media. The default offering for new Spectralogic tape library sales will be LTO-6. Cray will supply Spectralogic LTO-6 Terapacks. The TAST will support a minimum of 4 and a maximum of 16 tape drives based on the current MDC I/O. Each tape drives requires a FC8 connection to the SAN. Each group of 4 tape drives requires an FC8 port on each MDC.
Data Network Overview There are 2 types of switches, SAN and Network. The SAN switches may be fibre channel (QLogic) or infiniband (Mellanox). These switches provide a data storage fabric to connect the MDC's and DM's to the cache block storage, and to connect the MDC's to the robotic tape library. The network switches provide administrative communication between the CIMS and the MDC's and DM's. They provide access for system admins and all the servers (CIMS, MDC, and DM). In addition, the network switches provide metadata communication between the MDC’s and DM’s.
HR85-8500-A
Cray Private
95
Tiered Adaptive Storage™ Hardware Guide
InfiniBand Data Network Switches and Cables (Mellanox SX6036) Switch Platform Hardware Overview Figure 4-55: Mellanox SX6036 Switch System Power Side Panel
Figure 4-56: Mellanox SX6036 Switch System Connector Side Panel (Internally Managed)
Status LEDs The system status LEDs are located to the left of the connectors on the connector side panel, and on the power side at the far right. Table 4-36: Mellanox SX6036 System Status LEDs Symbol
96
Name
Description
Normal Conditions
Switch status LED
Shows the health of the switch
Solid green after 5 seconds
Fan status LED
Shows the health of the fans
Green
Power supply 1
Shows the health of the right side power supply unit
Green
Cray Private
HR85-8500-A
Hardware Implementation [4]
Table 4-36: Mellanox SX6036 System Status LEDs Symbol
Name
Description
Normal Conditions
Power supply 2
Shows the health of the left side power supply unit
Green, if installed for redundancy and connected to power source
Bad port LED
Lights up when a symbol error is detected
Off
Unit identifier LED
Lights up on command through the CLI
Off
Switch Status LED Figure 4-57: Mellanox Switch Status LED
Table 4-37: Switch Status LED Assignments LED Configuration
HR85-8500-A
Status/System Health LED
Solid green
The system is up and running
Flashing green
The system is booting up.This assignment is valid on managed systems only
Solid yellow
To be implemented in a future SW release
Solid red
Possible damage can result to the switch. Turn off immediately. e.g. bad firmware, can’t boot, overheated. If the system is booting up, it can take up to 5 minutes for the status LED to change to green.
Off
The system has no power.
Cray Private
97
Tiered Adaptive Storage™ Hardware Guide
Figure 4-58: Fan Status LED Connector Side
Table 4-38: Fan Status LED Assignments LED Configuration
Fan LED
Solid green
All fans are up and running
Solid red
Fan malfunction. The system should be powered down and troubleshoot the fan module.
Off
The fan unit is not receiving any power. Check that the fan unit is properly and completely inserted.
Power Supply Status LEDs Figure 4-59: Power Status LED Connector Side
This switch comes with two power supply units for redundancy. The SX6036 switch is available with one factory installed Power Supply Unit. A second Power Supply Unit can be added to support hot-swap ability and to add redundancy.
98
Cray Private
HR85-8500-A
Hardware Implementation [4]
Figure 4-60: Power Side Panel
The primary power supply unit (PS1) is located on the right side of the power side panel, with PS2 on the left side. Each PS unit has a single 2 color LED on the right side of the PS unit, that indicates the internal status of the unit. Figure 4-61: PS Unit Status LEDs
Table 4-39: PS Unit Status LED Assignments LED Color
HR85-8500-A
Status
Solid green
The power supply is delivering the correct voltage (12VDC)
Solid red
The PS unit is not operational
Off
There is no power to the system (neither PS unit is receiving power). If one PS unit is showing green and the second PS unit is unplugged, it will show a red indication
Cray Private
99
Tiered Adaptive Storage™ Hardware Guide
Bad Port LED Figure 4-62: Bad Port LED
Table 4-40: Bad Port LED Assignments LED Configuration
Description
Off
No symbol errors have bene received in the last few seconds
Flashing orange
One or possibly more ports have just received a symbol error. Possible causes are • Bad cable • Bad connection • Bad connector
UID LED Switch Identifier The UID LED is a debug feature that lists a blue LED on the switch connector side panel for ease in finding a particular switch within a cluster. Figure 4-63: Identifier LED
100
Cray Private
HR85-8500-A
Hardware Implementation [4]
Port Connector LEDs Above the downlink pots are two LEDs: one for the upper port and one for the lower port. Figure 4-64: Port LEDs
Table 4-41: Port Connector Physical and Logical Link Assignments for Ethernet Mode LED Configuration
LED Description
Off
Physical link is down/default
Solid green
Physical link is up with no traffic
Flashing green
Physical link is up with traffic
Flashing orange
Physical errors
Table 4-42: Connector Physical and Logical Link Assignments for IB Mode LED Status
LED Description
Off
No power to the port
Solid green
Logical link is up
Flashing green
Data activity flashing speed is proportional to data transfer speed
Solid orange
Physical link is up
Flashing orange
A problem with the physical link. Usually the SM is down or not enabling the port
In IB mode, the LED indicator, corresponding to each data port, will light orange when the physical connection is established (when the unit is powered on and a cable is plugged into the port with the other end of the connector plugged into a functioning port). When a logical connection is made, the LED will change to green. When data is transferred, the light will blink green.
HR85-8500-A
Cray Private
101
Tiered Adaptive Storage™ Hardware Guide
Fibre Channel Data Network Switches and Cables (QLogic 5802V) Chassis Controls and LEDs Figure 4-65: QLogic 5802V Chassis Controls and LEDs Input Power LED (Green)
Heartbeat LED (Green) System Fault LED (Amber)
Table 4-43: QLogic 5802V Chassis Controls and LEDs LED
Color
On
Off
Input power
Green
Switch logic circuitry is receiving the correct DC voltages
Maintenance mode
Heartbeat
Green
Refer to “Heartbeat LED Blink Patterns” on page 103
Refer to “Heartbeat LED Blink Patterns” on page 103
System fault
Amber
A failure exists in the switch firmware or hardware (POST errors, over-temperature conditions or power supply malfunctions)
Normal status
•
Maintenance Button
Press the maintenance button to reset the switch, or to place the switch in maintenance mode. Maintenance mode sets the IP address to 10.0.0.1 and provides access to the switch for maintenance purposes when flash memory or the resident configuration file has been corrupted.
102
Cray Private
HR85-8500-A
Hardware Implementation [4]
Heartbeat LED Blink Patterns The Heartbeat LED indicates the operational status of the switch. When the POST completes with no errors, the Heartbeat LED blinks at steady rate of once per second. When the switch is in maintenance mode, the Heartbeat LED remains lit. All other blink patterns indicate critical errors. In addition to producing a heartbeat error blink patterns, a critical error also illuminates the System Fault LED. The Heartbeat LED shows an error blink pattern for the following conditions: •
1 blink—Normal operation
•
2 blinks—Internal Firmware Failure Blink Pattern
•
3 blinks—Fatal POST Error Blink Pattern
•
4 blinks—Configuration File System Error Blink Pattern
•
5 blinks—Over-Temperature Blink Pattern
FC Ports Figure 4-66: QLogic 5802V Fibre Channel Ports SFP Ports 0 1
10 11
2
12 13
3
4 5
14 15
XPAK Ports 6 7
16 17
8
18 19
9
20
21
22
23
Each SFP port is capable of 1, 2, 4, or 8Gbps transmission, depending on the SFP. SFP ports are hot-pluggable and can self-discover both the port type and transmission speed when connected to devices or other switches. Setting an SFP port to 1 Gbps that has an 8Gbps SFP transceiver will bring down the port. Figure 4-67: QLogic 5802V Port LEDs Logged-In LED Activity LED
HR85-8500-A
Logged-In LED Activity LED
Cray Private
103
Tiered Adaptive Storage™ Hardware Guide
Table 4-44: QLogic 5802V Port LEDs LED
Color
On
Off
Port logged-in
Green
Port initialized or logged in Refer to Logged-In LED Indications
Port connection is broken or an error occurs that disables the port Refer to Logged-In LED Indications
Port activity
Green
Data is passing through the port
Data is not passing through the port
Logged-In LED Indications The Logged-In LED has three indications: •
Continuous illumination: A device is logged in to the port.
•
Flashing once per second: A device is logging in to the port, or the port is in the diagnostics state.
•
Flashing twice per second: The port is down, offline, or an error has occurred.
If a Logged-In LED is flashing twice per second, review the event browser for alarm messages regarding the affected port. You can also inspect the alarm log using the Show Alarm command. If there is an error, alarm messages may point to one or more of the following conditions: •
E_Port Isolation
•
Excessive Port Errors
Ethernet Port The Ethernet port is an RJ-45 connector that provides a connection to the management workstation through a 10/100 Base-T Ethernet cable. Figure 4-68: QLogic 5802V Ethernet Port Activity LED
104
Link Status LED
Cray Private
HR85-8500-A
Hardware Implementation [4]
Table 4-45: QLogic 5802V Power Supply ED LED
Color
On
Off
Link status
Green
Ethernet connection is established
No ethernet connection is established
Activity
Green
Data is transmitted and received over Ethernet connection
No data is transmitted and received over Ethernet connection
Serial Port Figure 4-69: QLogic 5802V Serial Port
1 6
5 9
RS-232 Connector Pin Identification
Serial Port
Power Supplies and Fans The model 5802V switch has two, hot-pluggable power supplies that convert 100–240 VAC to DC voltages for the various switch circuits. Each power supply has an AC power receptacle and two status LEDs. Figure 4-70: Model 5802V Power Supplies Status LED (Green)
AC Power Receptacle
Fault LED (Amber)
Power Supply 1
HR85-8500-A
Power Supply 2
Cray Private
105
Tiered Adaptive Storage™ Hardware Guide
Table 4-46: QLogic 5802V Power Supply LEDs LED
106
Color
On
Off
Power supply fault
Amber
A power supply fault exists and requires attention
Normal status
Power supply status
Green
Power supply is receiving AC voltage and producing the correct DC voltages
Power supply is not receiving AC voltage or not producing the correct DC voltages
Cray Private
HR85-8500-A
Hardware Implementation [4]
Ethernet Management Network Switches and Cables (Dell PowerConnect 5548) Dell PowerConnect 5548 Ports Figure 4-71: PowerConnect 5548 Ports
HR85-8500-A
Cray Private
107
Tiered Adaptive Storage™ Hardware Guide
Buttons and LEDs Figure 4-72: Dell PowerConnect 5548 Front LED Panel
Table 4-47: Dell PowerConnect 5548 Front Panel LEDs LED
Color
Power supply (PWR)
Green
Static: The switch is on Flashing: The Locator function is enabled
Status
Green or Red
Green static: The switch is operating normally Green flashing: The switch is booting Red static: A critical system error has occurred Red flashing: A noncritical system error has occurred
Stacking number Modular/redunda ncy power supply (MPS/RPS)
108
On
Off The switch is turned off
Indicates the unit ID of the device in stack Green or Red
Cray Private
Green static: The MPS/RPS is currently operating Red static: The MPS/RPS failed
The MPS/RPS is not plugged in
HR85-8500-A
Hardware Implementation [4]
Table 4-47: Dell PowerConnect 5548 Front Panel LEDs LED
HR85-8500-A
Color
On
Locator
Green
Flashing: Locator function is enabled Static: Locator function is disabled
Master
Green
Static: The device is a master unit
Fan (FAN)
Green or Red
Green static: All device fans are operating normally Red static: One or more of the device fans are not operating
Cray Private
Off
The device is not a master unit
109
Tiered Adaptive Storage™ Hardware Guide
Gigabit Ports Figure 4-73: Giga Port LEDs
LNK
ACT/PoE
LNK
ACT/PoE
Table 4-48: Giga Ports on non-POE Enabled Devices LEDs LED
Color
On
Off
LNK
Green, Yellow, or Amber
Green flashing: Link is up and the port is either transmitting or receiving at 1000 Mbps Solid green: Link is up at high speeds Yellow flashing: Link is up and the pot is either transmitting or receiving data at 100 Mbps Solid amber: Link is up at lower speeds
The port is currently not operating
ACT
Green
Green flashing: There is activity on the port
There is no activity on the port
HDMI Port LEDs The HDMI ports have a Speed/link (LNK) LED on their left side and an activity (ACT) LED on their right side. Table 4-49: HDMI (Stacking) Port LEDs LED
110
Color
On
Off
Speed/link
Green
Solid green: Port is linked to device
Port is currently not operating
ACT
Green
Flashing green: Port is either transmitting or receiving
Port is not transmitting or receiving
Cray Private
HR85-8500-A
Hardware Implementation [4]
SFP LEDs The SFP+ ports each have two LEDs, marked as LNK and ACT, associated with them. Table 4-50: SFP Port LEDs LED
HR85-8500-A
Color
On
LNK
Green or Amber
Solid green: Link is at highest speed Solid amber: Link is at lowest speed
ACT
Green
Flashing green: Port is either transmitting or receiving
Cray Private
Off Port is currently not linked
111
Tiered Adaptive Storage™ Hardware Guide
112
Cray Private
HR85-8500-A
Block Storage Tray ID Codes [5] ABS (NetApp 2700) and DCBS (NetApp 5500) Tray ID Indicator Codes Tray ID Failure Codes The tray ID indicator displays values that can identify a failing component. Table 5-51: Storage Controller Tray ID Failure Codes Component
Tray ID Indicator Display Used in this Controller State
(Code) (See footnote 1)
Power-on Diagnostics
Suspended
Operational
Processor DIMM (dx +) (See footnote 2)
yes
yes
no
Cache DIMM (CX +)
yes
yes
no
Host card (HX +)
yes
yes
yes
Flash drive (FX +)
no
no
yes
Category delimiter (--) After (--), a “CF” (for Component Failure) is displayed, followed by one of the codes specified above.
The separator between categorydetail code pairs is used when more than one category detail code pair exists in the sequence.
End-of-sequence delimiter Appears as a blank space. (See footnote 3)
The end-of-sequence delimiter is automatically inserted by the hardware at the end of a code sequence.
1. A two-digit code that starts a dynamic display sequence. 2. A two-digit code that follows the category code with more specific information 3. The plus (+) sign indicates that a two-digit code display with the Diagnostic LED on.
HR85-8500-A
Cray Private
113
Tiered Adaptive Storage™ Hardware Guide
Controller-Drive Tray Sequence Code Definitions During normal operation, the tray ID display on each controller canister displays the controller-drive tray ID. The Diagnostic LED (lower-digit decimal point) comes on when the display is used for diagnostic codes and goes off when the display is used to show the tray ID. Table 5-52: Controller Drive Tray Sequence Code Definitions Category
Category Code (See Footnote 1)
Detail Codes (See Footnote 2)
Startup error
SE+ (See footnote 3)
• 88+ Power-on default • dF+ Power-on diagnostic fault • Sx Power-on validation error
Operational error
OE+
Lx+ Lock-down codes. (See “Lock-Down Codes for the Controller-Drive Tray” on page 114 and “Tray ID Indicator Display Use Cases” on page 116.)
Operational state
OS+
• • • •
OL+ = Offline. bb+ = Battery backup (operating on batteries). OH + CPU temperature exceeds the warning level. CF+ = Component failure.
Component failure
CF+
• • • • • •
dx+ = Processor or cache DIMM. Cx = Cache DIMM. Px+ = Processor DIMM. Hx+ = Host interface card. Fx+ = Flash drive. bl + Base controller card.
Diagnostic failure
dE+
Lx+ = Lock-down codes (See “Lock-Down Codes for the Controller-Drive Tray” on page 114 and “Tray ID Indicator Display Use Cases” on page 116.)
Category delimiter
dash+
The separator between category-detail code pairs is used when more than one category detail code pair exists in the sequence.
End-of-sequence delimiter
blank - (See footnote 4)
The end-of-sequence delimiter is automatically inserted by the hardware at the end of a code sequence.
1.A two-digit code that starts a dynamic display sequence. 2. A two-digit code that follows the category code with more specific information. 3. The plus (+) sign indicates that a two-digit code displays with the Diagnostic LED on. 4. No codes display, and the Diagnostic LED is off.
Lock-Down Codes for the Controller-Drive Tray Use the following table to determine the diagnostic lock-down code definitions on the tray ID indicator display in the controller canister. In general, these diagnostics appear only when the controller is in a non-operational state. The controller can be in a non-operation state as a result
114
Cray Private
HR85-8500-A
Block Storage Tray ID Codes [5]
of a configuration problem (such as mismatched controller types) or as a result of a hardware fault. If the amber Controller Service Action Required LED is on, the non-operational state is the result of a hardware fault. Table 5-53: Supported Diagnostic Lock-Down Codes on the Tray ID Indicator Display Diagnostic Code
Description
-- --
The firmware is booting.
L0
The controller types are mismatched, which result in a suspended controller state.
L2
A persistent memory error has occurred, which results in a suspended controller state.
L3
A persistent hardware error has occurred, which results in a suspended controller state.
L4
A persistent data protection error has occurred, which results in a suspended controller state.
L5
An autocode synchronization (ACS) failure has been detected, which results in a suspended controller state.
L6
An unsupported host interface card has been detected, which results in a suspended controller state.
L7
A sub-model identifier either has not been set or has been mismatched, which results in a suspended controller state.
L8
A memory configuration error has been occurred, which results in a suspended controller state.
L9
A link speed mismatch condition detected in either the ESM or power supply, causing suspended controller state.
Lb
A host interface card configuration error has been detected, which results in a suspended controller state.
LC
A persistent cache backup configuration error has been detected, which results in a suspended controller state.
Ld
A mixed cache memory DIMMs condition has been detected, which results in a suspended controller state.
LE
Uncertified cache memory DIMM sizes have been detected, which result in a suspended controller state.
LF
The controller has locked down in a suspended state with limited symbol support.
LH
A controller firmware mismatch has been detected, which results in a suspended controller state.
LJ
The controller does not have enough memory to support the configuration, causing a suspended controller state.
LL
The controller cannot access either midplane SBB EEPROM, which results in a suspended controller state.
Ln
A canister is not valid for a controller, which results in a suspended controller state.
LP
Drive port mapping tables are not detected, which results in a suspended controller state.
Lr
A non field replaceable unit has been replaced, which results in a suspended controller state.
Lt
A configuration data base corruption, which results in a suspended controller state.
LU
The start-of-day (SOD) reboot limit has been exceeded which results in a suspended controller state.
S1*
The controller detects a checksum failure in EEPROM, which results in a suspended controller state.e
S2*
The SBB Signature/Revision is invalid, which results in a suspended controller state.
S3*
An unsupported enclosure is detected in a storage array, which results in a suspended controller state.
S4*
The power supply is incapable of powering the controller, which results in a suspended controller state.
S5*
The SBB pairing fails, which results in a suspended controller state.
* This lockdown code is for the NetApp 2700 only.
HR85-8500-A
Cray Private
115
Tiered Adaptive Storage™ Hardware Guide
Tray ID Indicator Display Use Cases Use the following table displays both startup errors and operation errors that can be used to point to a component failure. Table 5-54: Tray ID Indicator Display Uses Cases with Repeating Sequences Uses Cases
Repeating Sequence
Use Case: Controller power-on • Normal power-on controller insertion. • Controller inserted while held in reset.
SE+ 88+ blank-
Uses Cases
Repeating Sequence
Use Case: Operational states • Normal operation.
xy- (static controller tray ID)
• Start-of-day (SOD) processing
OS+ Sd+ blank-
• The controller is placed in reset while displaying the tray ID.
OS+ OL+ blank-
• The controller is operating on batteries (cache backup).
OS+ bb+ blank-
• The CPU temperature has exceeded the warning level.
OS+ OH+ blank-
Uses Cases
Repeating Sequence
Use Case: Component failure when the controller is operational. • Failed host interface card
OS+ CF+ HX+ blank-
• Failed flash drive
OS+ CF+ Fx+ blank-
Uses Cases
Repeating Sequence
Use case: Power-on diagnostic failure • A component that is not a field replaceable unit failure has been detected.
SE+ dF+ blank-
• A processor DIMM failure has been detected.
SE+ dF+ “- -”+ CF+ Px+ blank-
• A cache memory DIMM failure has been detected.
SE+ dF+ “- -”+ CF+ Cx+ blank-
• A processor DIMM or a cache memory DIMM failure has been detected.
SE+ dF+ “- -”+ CF+ dx+ blank-
• A host interface card failure has been detected.
SE+ dF+ “- -”+ CF+ Hx+ blank-
• An incorrect number of cache backup devices have been detected.
SE+ LC+ “- -”+ CF+ FX+ blank-
Uses Cases
Repeating Sequence
Use Case: Controller is suspended and there are no other errors to report. • All lockdown conditions
116
OE+ Lx+ blank-
Cray Private
HR85-8500-A
Block Storage Tray ID Codes [5]
Table 5-54: Tray ID Indicator Display Uses Cases with Repeating Sequences (continued) Uses Cases
Repeating Sequence
Use Case: The controller is suspended because of component errors. • Persistent processor DIMM error correcting code (ECC) errors have been detected.
OE+ L2+ “- -”+ CF+ CX+ blank-
• Persistent cache DIMM ECC errors have been detected.
OE+ L2+ “- -”+ CF+ CX+ blank-
• Persistent processor or cache DIMM ECC errors have been detected.
OE+ L2+ “- -”+ CF+ CX+ blank-
Uses Cases
Repeating Sequence
Use Case: The controller has been suspended as a result of persistent cache backup configuration errors. • The write-protect switch is set during cache restore.
OE+ LC+ blank-
• The memory size changed with dirty data in the flash drives.
OE+ LC+ dd+ blank-
Uses Cases
Repeating Sequence
Use case: The controller has been suspended as a result of diagnostic errors. • A cache memory diagnostic failure has been detected.
dE+ L2+ “- -”+ CF+ Cx+ blank-
• A base controller diagnostic failure has been detected
dE+ L3+ “- -”+ CF+ b1+ blank-
• A base controller IOC diagnostic failure has been detected.
dE+ L3+ “- -”+ CF+ b2+ blank-
Tray ID Indicator Display for the Drive Trays During normal operation, the tray ID display on each ESM displays the drive tray ID. The diagnostic LED (lower-digital decimal point) comes on when the display is used for diagnostic codes and goes off when the display is used to show the tray ID. Table 5-55: Supported Diagnostic Codes on the ESM Diagnostic Code
HR85-8500-A
Description
-- --
The firmware is booting.
.8, 8., 88
The ESM is being held in reset by another ESM.
AA
ESM A firmware is in the process of booting (the diagnostic indicator is not yet set).
bb
ESM B firmware is in the process of booting (diagnostic indicator is not yet set).
L0
The ESM types are mismatched.
L2
A persistent memory error has occurred.
L3
A persistent hardware error has occurred.
L9
An over-temperature condition has been detected in either the ESM or the power supply.
LL
The midplane SSB VPD EEPROM cannot be accessed.
Ln
The ESM canister is not valid for this drive tray.
Cray Private
117
Tiered Adaptive Storage™ Hardware Guide
Table 5-55: Supported Diagnostic Codes on the ESM (continued) Diagnostic Code
Description
LP
Drive port mapping tables are not found.
H2
The ESM configuration is invalid or incomplete, and it operates in Degraded state.
H3
The maximum number of ESM reboot attempts has been exceeded.
H4
This ESM cannot communicate with the alternate ESM.
H5
A midplane harness failure has been detected in the drive tray.
H6
An ESM firmware failure has been detected.
H9
A non-catastrophic hardware failure has occurred. The ESM is operating in a Degraded state.
J0
The ESM canister is incompatible with the drive tray firmware.
DCBS (NetApp 5400) Tray ID Indicator Codes Sequence Code Definitions for the Controller-Drive Trays During normal operation, the tray ID display on each controller canister displays the controllerdrive tray ID. The Diagnostic LED (lower-digit decimal point) comes on when the display is used for diagnostic codes and goes off when the display is used to show the tray ID. Table 5-56: Sequence Code Definitions for the 5400 Controller-Drive Trays Category
Category Code (See Footnote 1)
Detail Codes (See Footnote 2)
Startup error
SE+ (see footnote 3)
• 88+ Power-on default • dF+ Power-on diagnostic fault • Sx Power-on Validation error
Operational error
OE+
Lx+ Lock-down codes (See footnote 3.)
Operational state
OS+
• • • •
OL+ = Offline. (See footnote 11.) bb+ = Battery backup (operating on batteries). OH+ = CPU temperature exceeds the warning level. CF+ = Component failure. (See footnote 12.)
Component failure
CF+
• • • • • •
dx+ = Processor or cache DIMM. (See footnote 6.) Cx = Cache DIMM (See footnote 7.) Px+ = Processor DIMM. (See footnote 8.) Hx+ = Host interface card. Fx+ = Flash drive. bl + Base Controller Card
Diagnostic failure
dE+
Lx+ = Lock-down code. (See footnote 3.)
118
Cray Private
HR85-8500-A
Block Storage Tray ID Codes [5]
Table 5-56: Sequence Code Definitions for the 5400 Controller-Drive Trays Category Code (See Footnote 1)
Category
Detail Codes (See Footnote 2)
Category delimiter
dash+
the separator between categorydetail code pairs is used when more than one category detail code pair exists in the sequence. (See footnote 4 and footnote 9.)
End-of-sequence delimiter
blank -
The end-of-sequence delimiter is automatically inserted by the hardware at the end of a code sequence. (see footnote 5 and footnote 10.)
1. xy+ 2-digit code with the Diagnostic light ON. 2. xy- 2-digit code with the Diagnostic light OFF. 3. Lx+ Lock-down codes (see ____________ for a list of the lock-down codes.) 4. dash+ All segments off except for the middle segments and with the Diagnostic light ON. 5. blank- All segments off with the Diagnostic light OFF. 6. dx+ USed when there is a single memory system for processor and data cache. 7. Cx+ USed when there are separate processor and data cache memory systems. 8. Px+ USed when there are separate processor and data cache memory systems. 9. Category- Detail separator used when there is more than one category-detail code pair in the sequence. 10. End-of-sequence indicator automatically inserted by hardware at the end of the sequence. Example: SE+ 88+ blank- (repeat) 11. If a tray ID is being displayed, this sequence is programmed to display if the controller is subsequently held in reset. 12. The tray ID is normally displayed during normal operation. This operational state is displayed if an internal controller component failure occurs while the controller is online. An additional detail code identifies the failed component as defined for the Component Failure category. This sequence continues to be displayed even if the controller is subsequently placed offline (held in reset) to service the failed component.
Lock-Down Codes for the Controller-Drive Trays Use the following table to determine the diagnostic lock-down code definitions on the tray ID indicator display in the controller canister. In general, these diagnostics display only when the controller is in a non-operational state. The controller can be in a non-operation state as a result of a configuration problem (such as mismatched controller types) or as a result of a hardware fault. If the amber Controller Service Action Required LED is on, the non-operational state is the result of a hardware fault. Table 5-57: Supported Diagnostic Lock-Down Codes on the Tray ID Indicator Display Diagnostic Code
Description
L0
The controller types are mismatched, which result in a suspended controller state.
L1
Missing interconnect battery canister. (This code is not applicable for the E5400.)
L2
A persistent memory error has occurred, which results in a suspended controller state.
L3
A persistent hardware error has occurred, which results in a suspended controller state.
L4
A persistent data protection error has occurred, which results in a suspended controller state.
L5
An auto-code synchronization (ACS) failure has been detected, which results in a suspended controller state.
L6
An unsupported host interface card has been detected, which results in a suspended controller state.
L7
A sub-model identifier either has not been set or has been mismatched, which results in a suspended controller state.
L8
A memory configuration error has occurred, which results in a suspended controller state.
L9
A link speed mismatch condition has been detected in either the ESM or the power supply, which results in a suspended controller state.
HR85-8500-A
Cray Private
119
Tiered Adaptive Storage™ Hardware Guide
Table 5-57: Supported Diagnostic Lock-Down Codes on the Tray ID Indicator Display Diagnostic Code
Description
LA
Reserved
Lb
A host interface card configuration error has been detected, which results in a suspended controller state.
LC
A persistent cache backup configuration error has been detected, which results in a suspended controller state.
Ld
A mixed cache memory DIMMs condition has been detected, which results in a suspended controller state.
LE
Uncertified cache memory DIMM sizes have been detected, which result in a suspended controller state.
LF
The controller has locked down in a suspended state with limited symbol support.
LH
A controller firmware mismatch been detected, which results in a suspended controller state.
LL
The controller cannot access either midplane SBB EEP-ROM, which results in a suspended controller state.
Ln
A canister is not valid for a controller, which results in a suspended controller state.
LP
Drive port mapping tables are not detected, which results in a suspended controller state.
Lr
The controller has detected a component that is not field replaceable unit (FRU). Please contact your Technical Support representative.
LU
The start-of-day (SOD) reboot limit has been exceeded, which results in a suspended controller state.
Diagnostic Code Sequences for the Controller-Drive Trays Use the following table to determine the code sequences on the tray ID indicator display in the controller canister for the E5400 controller-drive trays. These repeating sequences can be used to diagnose potential problems with the controller tray. Table 5-58: Diagnostic Code Sequences for the E5400 Controller-Drive Trays Displayed Diagnostic Code Sequences
Description
SE+ 88+ blank-
One of the following power-on conditions exists: • Controller power-on • Controller insertion • Controller inserted while held in reset
xy -
Normal operation.
OS+ Sd+ blank-
Start-of-day (SOD) processing.
OS+ OL+ blank-
The controller is placed in reset while displaying the tray ID.
OS+ bb+ blank-
The controller is operating on batteries (cache backup).
OS+ OH+ blank-
The CPU temperature exceeds the warning level.
OS+ CF+ Hx+ blank-
A failed host interface card has been detected (See footnote 1 and footnote 2.)
OS+ CF+ Fx+ blank-
A failed flash drive has been detected (see footnote 1 and footnote 2.)
SE+ dF+ blank-
A non-replaceable component failure has been detected (See footnote 1.)
SE+ dF+ dash+ CF+ Px+ blank-
A processor DIMM failure has been detected. (See footnote 1.)
SE+ dF+ dash+ CF+ Cx+ blank-
A cache memory DIMM failure has been detected. (See footnote 1.)
SE+ dF+ dash+ CF+ dx+ blank-
A processor or cache DIMM failure has been detected. (See footnote 1.)
120
Cray Private
HR85-8500-A
Block Storage Tray ID Codes [5]
Table 5-58: Diagnostic Code Sequences for the E5400 Controller-Drive Trays Displayed Diagnostic Code Sequences
Description
SE+ dF+ dash+ CF+ Hx+ blank-
A host card failure has been detected. (See footnote 1.)
SE+ LC+ dash+ CF+ Fx+ blank-
A lockdown condition has been detected.
OE+ L2+ dash+ CF+ Px+ blank-
Persistent processor DIMM ECC errors have been detected, which result in a suspended controller state.
OE+ L2+ dash+ CF+ Cx+ blank-
Persistent cache DIMM ECC errors have been detected, which result in a suspended controller state.
OE+ L2+ dash+ CF+ dx+ blank-
Persistent processor or cache DIMM ECC errors have been detected, which result in a suspended controller state.
OE+ LC+ blank-
The write-protect switch is set during cache restore, which results in a suspended controller state.
OE+ LC+ dd+ blank-
The memory size is changed from bad data in the flash drives, which results in a suspended controller state.
DE+ L2+ dash+ CF+ Cx+ blank-
A cache memory diagnostic has failed, which results in a suspended controller state.
DE+ L3+ dash+ CF+ bl+ blank-
A base controller diagnostic has failed, which results in a suspended controller state.
DE+ L3+ dash+ CF+ b2+ blank-
An I/O controller (IOC) diagnostic has failed for the chip on the controller board, which results in a suspended controller state.
1. If the controller detects more than one component failure, only the first component failure detected appears in the tray ID indicator display. 2. If this component failure displays in the tray ID indicator while the controller is operational, the storage management software will also generate event notifications, such as a MEL event, for the same failure condition.
Tray ID Indicator Display for the ESMs on the Drive Trays During normal operation, the tray ID display on each ESM displays the drive tray ID. The Diagnostic LED (lower-digit decimal point) comes on when the display is used for diagnostic codes and goes off when the display is used to show the tray ID. Note: If a power-on or reset occurs, the Diagnostic LED, the Heartbeat LED (upper-digit decimal point), and all tray ID indicators of both digits come on. The Diagnostic LED remains on until the drive tray ID appears. Table 5-59: Supported Diagnostic Codes Diagnostic Code
ESM State
Description
-- --
Suspended
The firmware is booting.
.8, 8., or 88
Suspended
This ESM is being held in reset by another ESM
AA
Suspended
ESM A firmware is in the process of booting (the diagnostic indicator is not yet set).
bb
Suspended
ESM B firmware is in the process of booting (the diagnostic indicator is not yet set).
L0
Suspended
The ESM types are mismatched.
L2
Suspended
A persistent memory error has occurred.
L3
Suspended
A persistent hardware error has occurred.
HR85-8500-A
Cray Private
121
Tiered Adaptive Storage™ Hardware Guide
Table 5-59: Supported Diagnostic Codes Diagnostic Code
ESM State
Description
L9
Suspended
An over-temperature condition has been detected in either the ESM or the power supply.
LL
Suspended
The midplane SBB VPD EEPROM cannot be accessed.
Ln
Suspended
The ESM canister is not valid for this drive tray.
LP
Suspended
Drive port mapping tables are not found.
H2
Suspended
The ESM configuration is invalid or incomplete, and it operates in a Degraded state.
H3
Suspended
The maximum number of ESM reboot attempts have been exceeded.
H4
Suspended
This ESM cannot communicate with the alternate ESM.
H5
Suspended
A midplane harness failure has been detected in the drive tray.
H6
Suspended
An ESM firmware failure has been detected.
H7
ESM is operating as a rate other than the switch setting.
H9
A non-catastrophic hardware failure has occurred. The ESM is operating in a Degraded state.
J0
122
Suspended
The ESM canister is incompatible with the drive tray firmware.
Cray Private
HR85-8500-A