S&C Infrastructure Servers

 

STAR Database, Web Servers and other core servers at BNL

(see $DB_SERVER_LOCAL_CONFIG for default offline STAR XML configuration)

 

NAME BASE HARDWARE LINUX DISTRO PROCESSOR(S) RAM NETWORK INFO DISKS                       PORTS: PURPOSES LOCATION HEALTH MONITORING                        

ORIGINAL PURCHASE DATE

Notes
                       
robinson Dell PowerEdge R420 Sc.Linux 7.x 2x Intel Xeon E5-2430 (Hex-core, 2.2GHz, 15MB cache, HT available but disabled) 64GB + 32GB swap 130.199.148.90:
LACP: 2 x 1Gb/s
PERC H710P RAID controller with 8 600GB 10K SAS drives (2.5")

6 drives form a RAID 10, with the remaining 2 drives set as hot spares

/boot (1GB)
/ (150GB)
swap (32GB)
/db01 (1.5TB)
 
3306: rh-mysql57-mysql-server, offline production/analysis master
 
BCF Rack 4-10 smartd (starsupport)

disk space mon (starsupport)

Ganglia

MegaRAID Storage Manager (17.05.00.02) (starsupport)

Logwatch (WB)

SKM

OCS Inventory

Osiris (starsupport)

No Icinga
May 2014 (inherited from RACF in 2018) offline database master
 db01  Dell PowerEdge R410 Sc.Linux 7.x 2x Intel Xeon X5650 (Hex-core, 2.67GHz, 12MB cache, HT available but disabled)  64GB + 24GB swap 130.199.148.101:
LACP: 2 x 1Gb/s
PERC H700 with four 600GB 15K 3.5" SAS drives in a RAID 10, with four partitions:

/boot: 1GB
/: 200GB
swap: 24GB
/db01: 900GB
(approx.)
3316: rh-mysql57-mysql-server, offline production/analysis slave  BCF smartd (starsupport)

disk space mon (starsupport)

Ganglia

MegaRAID Storage Manager (17.05.00.02) (starsupport)

Logwatch (WB)

SKM

OCS Inventory

Osiris (starsupport)

No Icinga
September 2011
(inherited from RACF in 2018)
 
 db02  Dell PowerEdge R710 Sc.Linux 7.x 2x Intel Xeon X5660 (Hex-core, 2.80GHz, 12 MB cache, HT available but disabled)  96GB + 24GB swap 130.199.148.102:
LACP: 2 x 1Gb/s
PERC H700 RAID controller with 6 2TB GB 7200RPM SAS drives in a RAID 10, partitioned as

/boot (1GB)
/db01 (5.3TB)
/ (187GB)
swap (24GB)
3316: rh-mysql57-mysql-server, offline production/analysis slave  BCF

smartd (starsupport)

disk space mon. (starsupport)

MegaRAID Storage Manager (17.05.00.02) (starsupport)

Ganglia

Osiris (starsupport)

LogWatch (WB)

SKM

OCS Inventory

No Icinga

February 2012  
 db04 Dell PowerEdge R410 Sc.Linux 7.x 2x Intel Xeon X5650 (Hex-core, 2.67GHz, 12 MB cache, HT available but disabled)  64GB + 32GB swap 130.199.148.104:
LACP: 2 x 1Gb/s
PERC H700 RAID controller with 4 2TB GB 7200RPM SAS drives in a RAID 10, partitioned as

/boot (1GB)
/db01 (934GB)
/ (150GB)
swap (32GB)
3316: rh-mysql57-mysql-server (el7.x86_64) offline production/analysis slave  BCF

smartd (starsupport)

disk space mon. (starsupport)

MegaRAID Storage Manager (17.05.00.02) (starsupport)

Ganglia

Osiris (starsupport)

LogWatch (WB)

SKM

OCS Inventory

No Icinga

February 2012  
ovirt-sdmz5 (old db05) Dell PowerEdge R410 Sc.Linux 7.x 2x Intel Xeon X5650 (Hex-core, 2.67 GHz, 12MB cache, HT available but disabled)  64GB + 32GB swap 130.199.148.116:
1 x 1Gb/s
PERC H700 RAID controller with 4 1 TB 7200RPM SAS drives, 2 in a RAID 1, partitioned as

/boot (1GB)
/data (794GB)
/ (99GB)
swap

2 drives each in their own RAID 0 for GlusterFS bricks
Ovirt
GlusterFS
 
 BCF smartd (starsupport)

disk space mon (starsupport)

MegaRAID Storage Manager (17.05.00.02) (starsupport)

Ganglia

Logwatch (WB)

SKM

OCS Inventory

Osiris (starsupport)

No Icinga
 
October 2011 (inherited from RACF in 2018)  Ovirt Hypervisor
 Gluster storage host
 
db06 Dell PowerEdge R610 Sc. Linux 7.x 2x Intel Xeon X5670 (Hex-core, 2.93 GHz, 12MB cache, HT available but disabled) 48 GB + 24GB swap 130.199.148.106:
LACP: 2 x 1Gb/s
PERC H700 RAID controller with 6 2.5" 600 GB 10K SAS drives in a RAID 10, partitioned as

/boot (1GB)
/db01 (1.5TB)
/ (187GB)
swap
3316: rh-mysql57-mysql-server (el7.x86_64) offline production/analysis slave BCF smartd (starsupport)

disk space mon (starsupport)

Ganglia

MegaRAID Storage Manager (17.05.00.02) (starsupport)

Logwatch (WB)

SKM

OCS Inventory

Osiris (starsupport)

No Icinga
June 2011 1. part of the dbx.star.bnl.gov DNS round-robin

 db07

Dell PowerEdge R610 Sc.Linux 7.x 2x Intel Xeon X5670 (Hex-core, 2.93 GHz, 12 MB cache, HT available but disabled)  48 GB + 24GB swap 130.199.148.107:
LACP: 2 x 1Gb/s
PERC H700 RAID controller with 6 2.5" 600 GB 10K SAS drives in a RAID 10, partitioned as

/boot (1GB)
/db01 (1.5TB)
/ (187GB)
swap

3316: rh-mysql57-mysql-server (el7.x86_64) offline production/analysis slave

 BCF smartd (starsupport)

disk space mon (starsupport)

Ganglia

MegaRAID Storage Manager (17.05.00.02) (starsupport)

Logwatch (WB)

SKM

OCS Inventory

Osiris (starsupport)

No Icinga
February 2011  1. part of the dbx.star.bnl.gov DNS round-robin
 db08 Dell PowerEdge R610 Sc.Linux 7.x 2x Intel Xeon X5670 (Hex-core, 2.93 GHz, 12 MB cache, HT available but disabled)  48 GB + 24GB swap 130.199.148.108:
LACP: 2 x 1Gb/s

PERC H700 RAID controller with 6 2.5" 600 GB 10K SAS drives in a RAID 10, partitioned as

/boot (1GB)
/db01 (1.5TB)
/ (187GB)
swap

 

3316: rh-mysql57-mysql-server (el7.x86_64) offline production/analysis slave

 BCF smartd (starsupport)

disk space mon (starsupport)

Ganglia

MegaRAID Storage Manager (17.05.00.02) (starsupport)

Logwatch (WB)

SKM

OCS Inventory

Osiris (starsupport)

No Icinga
February 2011  1. part of the dbx.star.bnl.gov DNS round-robin
 db10 Dell PowerEdge R610 Sc.Linux 7.x 2x Intel Xeon X5660 (Hex-core, 2.80GHz, 12MB cache, HT available but disabled)  96GB + 32GB swap 130.199.148.110:
LACP: 2 x 1Gb/s
PERC H700 RAID controller with 6 300 GB 15K SAS drives.  Two form a RAID1 with /boot (500MB), swap, and / (244GB)

Four drives are in a RAID 10 for /db01 (549GB)
 
3316: rh-mysql57-mysql-server, offline production/analysis slave  BCF smartd (starsupport)

disk space mon (starsupport)

Ganglia

MegaRAID Storage Manager (17.05.00.02) (starsupport)

Logwatch (WB)

SKM

OCS Inventory

Osiris (starsupport)

No Icinga
 
November 2012 (inherited from RACF in 2018)

 1. Core team offline db server (see STAR DB XML)

db11 (removed from service 9/16/2019) Dell PowerEdge R710 Sc.Linux 7.x 2x Intel Xeon X5550 (Quad-core, 2.67GHz, 8MB cache, HT available but disabled)  48GB + 24GB swap 130.199.148.111:
LACP: 2 x 1Gb/s

PERC H200 RAID controller

2 1TB 3.5" 7200 RPM SAS drives in a RAID 1 with /, /boot and swap

4 2TB 3.5" 7200 RPM SAS drives in a RAID 10 (/db01, 3.6TB)

3316: rh-mysql57-mysql-server, offline production/analysis slave  BCF smartd (starsupport)

disk space mon (starsupport)

Ganglia

MegaRAID Storage Manager (17.05.00.02) (starsupport)

Logwatch (WB)

SKM

OCS Inventory

Osiris (starsupport)

No Icinga
April 2010  
ovirt-sdmz1 (old db12) Dell PowerEdge R410 Sc.Linux 7.x 2x Intel Xeon X5650 (Hex-core, 2.67GHz, 12MB cache, HT available but disabled)  64GB + 24GB swap 130.199.148.105:
1 x 1Gb/s



PERC H700 RAID controller with 4 1 TB 7200RPM SAS drives, 2 in a RAID 1, partitioned as

/boot (1GB)
/data (794GB)
/ (99GB)
swap

2 drives each in their own RAID 0 for GlusterFS bricks


 

Ovirt
GlusterFS
 BCF

smartd (starsupport)

disk space mon. (starsupport)

Ganglia

Osiris (starsupport)

LogWatch (WB)

SKM

OCS Inventory

No Icinga

February 2012  Ovirt Hypervisor
 Gluster storage host
ovirt-sdmz2 (old db13) Dell PowerEdge R410 Sc.Linux 7.x 2x Intel Xeon X5650 (Hex-core, 2.67GHz, 12MB cache, HT available but disabled)  64GB + 24GB swap 130.199.148.112:
 1 x 1Gb/s
PERC H700 RAID controller with 4 1 TB 7200RPM SAS drives, 2 in a RAID 1, partitioned as

/boot (1GB)
/data (794GB)
/ (99GB)
swap

2 drives each in their own RAID 0 for GlusterFS bricks
Ovirt
GlusterFS
 BCF

smartd (starsupport)

disk space mon. (starsupport)

Ganglia

Osiris (starsupport)

LogWatch (WB)

SKM

OCS Inventory

No Icinga

February 2012  Ovirt Hypervisor
 Gluster storage host
ovirt-sdmz3 (old db15) Dell PowerEdge R410 Sc.Linux 7.x 2x Intel Xeon X5650 (Hex-core, 2.67GHz, 12MB cache, HT available but disabled)  64GB + 24GB swap 130.199.148.113:
1 x 1Gb/s
PERC H700 RAID controller with 4 1 TB 7200RPM SAS drives, 2 in a RAID 1, partitioned as

/boot (1GB)
/data (794GB)
/ (99GB)
swap

2 drives each in their own RAID 0 for GlusterFS bricks
Ovirt
GlusterFS
 BCF

smartd (starsupport)

disk space mon. (starsupport)

Ganglia

Osiris (starsupport)

LogWatch (WB)

SKM

OCS Inventory

No Icinga

February 2012  Ovirt Hypervisor
 Gluster storage host
ovirt-sdmz4 (old db16) Dell PowerEdge R410 Sc.Linux 7.x 2x Intel Xeon X5650 (Hex-core, 2.67GHz, 12MB cache, HT available but disabled)  64GB + 24GB swap 130.199.148.115:
1 x 1Gb/s
 
PERC H700 RAID controller with 4 1 TB 7200RPM SAS drives, 2 in a RAID 1, partitioned as

/boot (1GB)
/data (794GB)
/ (99GB)
swap

2 drives each in their own RAID 0 for GlusterFS bricks
Ovirt
GlusterFS
 BCF  

smartd (starsupport)

disk space mon. (starsupport)

Ganglia

Osiris (starsupport)

LogWatch (WB)

SKM

OCS Inventory

No Icinga

February 2012  Ovirt Hypervisor
 Gluster storage host
db17 (removed from service 9/16/2019) IBM System x3650 M3 -[7945AC1]- Sc.Linux 7.x 2x Intel Xeon E5645 (Hex-core, 2.40GHz, 12MB cache, HT available but disabled)  96GB + 24GB swap 130.199.148.117:
LACP: 2 x 1Gb/s
ServeRAID M1015 SAS/SATA Controller

2 160GB SATA 2.5" drives in a RAID 1 with:

/boot: 1GB
/: 125GB
swap: 24GB

4 147GB SAS 15K RPM 2.5" drives in a RAID 10 with:

/db01: 268GB
3316: rh-mysql57-mysql-server, offline production/analysis slave  BCF  

smartd (starsupport)

disk space mon. (starsupport)

MegaRAID Storage Manager (17.05.00.02) (starsupport)

Ganglia

Osiris (starsupport)

LogWatch (WB)

SKM

OCS Inventory

No Icinga

August 2011  
db18 (removed from service 9/16/2019) IBM System x3650 M3 -[7945AC1]- Sc.Linux 7.x 2x Intel Xeon E5645 (Hex-core, 2.40GHz, 12MB cache, HT available but disabled)  96GB + 24GB swap 130.199.148.118:
LACP: 2 x 1Gb/s
ServeRAID M1015 SAS/SATA Controller

2 160GB SATA 2.5" drives in a RAID 1 with:

/boot: 1GB
/: 125GB
swap: 24GB

4 147GB SAS 15K RPM 2.5" drives in a RAID 10 with:

/db01: 268GB
3316: rh-mysql57-mysql-server, offline production/analysis slave  BCF

smartd (starsupport)

disk space mon. (starsupport)

MegaRAID Storage Manager (17.05.00.02) (starsupport)

Ganglia

Osiris (starsupport)

LogWatch (WB)

SKM

OCS Inventory

No Icinga

August 2011  
duvall (and db09 is an alias for duvall) ASA/SuperMicro 1U, X9DRW motherboard RHEL Workstation 6.x (64-bit) 2x Intel Xeon E5-2620 v2
(6 cores each, 2.1GHz, 16MB cache, w/HT)
32 GB+ 16GB swap 130.199.148.93:
LACP: 2 x 1Gb/s
10 2.5" drive bays
(0-7 on one controller, 8 & 9 on a different controller)

3x 1TB , each with 3 partitions for
RAID 1 arrays (/, /boot and swap) - 2 active drives, 1 hot spare

2x 240GB SSD with 1 partition each.  RAID1 array for /db01
 3306: mysql-5.1.73 (RH), offlineQA, nova, LibraryJobs, OCS Inventory database (tbc), etc. BCF smartd (starsupport)

disk space mon (starsupport)

Ganglia

Logwatch (WB)

SKM

OCS Inventory

Osiris (starsupport)

LogWatch (WB)

No Icinga
February 2015  

heston

 Penguin Relion 2600SA RHEL Workstation 6.x (64-bit) 2x Intel 
Xeon E5335 (Quad-core, 2.00GHz, 8MB cache)
 8GB + 10GB swap

130.199.148.91:
LACP: 2 x 1Gb/s

 Six 750 GB (or 1TB) SATA drives identically partitioned:

/boot: 190MB, ext4, RAID1 (6 drives)

/: 12GB, ext4, RAID5 (5 drives + 1 spare)

/db01: 2.6TB, ext4, RAID5 (5 drives + 1 spare)

swap: 10GB, RAID5 (5 drives + 1 spare)

3306: mysqld-5.1.73-8.el6_8 (RH)

 BCF

mdmonitor (starsupport)

smartd (starsupport)

disk space mon. (starsupport)

Ganglia (yes)

Osiris (starsupport)

LogWatch (WB)

SKM

OCS Inventory

DB start-up scripts (yes, "mysqld_sofi")

August or November 2007  former duvall, now a slave to duvall
 omega
(removed from service in fall 2018 - no direct replacement)
        130.199.148.92:
LACP: 2 x 1Gb/s


 

         
                       
onldb IBM System x3650 M3 -[7945AC1]- Sc.Linux 7.x 2x Intel Xeon E5645 (hex-core, 2.4GHz, 12 MB cache, HT disabled at BIOS) 96GB + 24GB swap

130.199.60.70: 1Gb/s

172.16.0.10: 1Gb/s

ServeRAID M1015 SAS/SATA controller with 12 disks:

2 x 150 GB SATA in a RAID 1 with:
 - /boot (1GB)
 - / (125GB)
 - swap

4 x 146 GB 15K SAS in a RAID 10 with /db01 (268GB)

6 x 900 GB 10K SAS in RAID 10 with /db02 (2.5TB)
online current run DB master server DAQ Room MSM 17.05.00.02 (starsupport)

smartd (starsupport)

disk space mon. (starsupport)

Ganglia

Osiris (starsupport)

LogWatch (WB)

OCS Inventory (link needed)

Icinga
August 2011  
 onldb5 (former onldb)  Dell PowerEdge R510  RHEL 6.x (64-bit)  Intel Xeon E5620 (quad-core, 2.4GHz, 12MB cache w/HT)  8GB + 8GB swap  

130.199.60.29: 1Gb/s

172.16.128.10: 1Gb/s

 2 x 147 GB SAS (10K):

 -RAID 1, 8GB swap

 -RAID 1, 200MB /boot

 -RAID 1, 125 GB /

6 x 300 GB SAS (15K):

 -RAID 5, 550GB /mysqldata01

 -RAID 5, 550GB /mysqldata02

 

former online current run DB server
 
DAQ Room

mdmonitor (starsupport)

smartd (starsupport)

disk space mon. (starsupport)

Ganglia

Osiris (no)

LogWatch (WB)

SKM (no)

OCS Inventory

Icinga

DB start-up scripts (yes)

 December 2011  
onldb2 Dell Power Edge R310 RHEL 6.x (64-bit) Quad core (plus HT) Intel Xeon X3460 @ 2.80 GHz 8GB + 8GB swap em1: 130.199.60.89: 1Gb/s 4x 2 TB SATA drives with four active partitions each.

Four software RAID arrays across the four disks, used as follows:

/boot: RAID1 (477MB ext4)
swap: RAID10 (8GB)
/: RAID10 (192GB ext4)
/mysqldata00: RAID10 (3.4TB ext4)

 
 current run online DB server slave DAQ Room    October 2010  
onldb3 Dell PowerEdge 2950 Sc.Linux 7.x 2x Quad core Intel Xeon E5440 @ 2.83 GHz 16GB + 8GB swap 130.199.60.165: 1Gb/s 6x 1TB 3.5" SATA drives with 4 partitions each, forming four software RAID arrays mounted as:

/boot: 1GB
/: 140GB
/db01: 2.6TB
swap: 8GB
3316: rh-mysql57-mysql-server, offline production/analysis slave

docker container with ScLinux 6 environment and mysql-5.1.73 for online database slaves
DAQ Room

smartd (starsupport)

disk space mon. (starsupport)

Ganglia

Osiris (starsupport)

LogWatch (WB)

SKM

OCS Inventory

Icinga

 January 2009  
onldb4 Dell PowerEdge 2950 Sc.Linux 7.x 2x Quad core Intel Xeon E5440 @ 2.83 GHz 16GB + 8GB swap 130.199.60.203: 1Gb/s 6x 1TB 3.5" SATA drives with 4 partitions each, forming four software RAID arrays mounted as:

/boot: 1GB
/: 140GB
/db01: 2.6TB
swap: 8GB
3316: rh-mysql57-mysql-server, offline production/analysis slave

docker container with ScLinux 6 environment and mysql-5.1.73 for online database slaves
DAQ Room

smartd (starsupport)

disk space mon. (starsupport)

Ganglia

Osiris (starsupport)

LogWatch (WB)

SKM

OCS Inventory

Icinga

 January 2009  
dbbak Dell PowerEdge R320 RHEL 6.x (64-bit) Quad Core Intel E5-1410 @2.8GHz, w/HT  24GB + 8GB swap

dbbak:
130.199.60.88: 1Gb/s

192.168.60.1: 1Gb/s (for direct connection to NAS unit)

  online past run DB server  DAQ   December 2013 Osiris master
mongodev
removed from service
Dell OptiPlex 755 Sc.Linux 6.x (64-bit) Core2 Duo E8300 @ 2.83GHz  4GB + 2GB swap 130.199.60.168: 1Gb/s Intel RAID Matrix Storage Manager with two 500GB disks mirrored   DAQ   June 2008  
mongodev01-03                      
dashboard1 Dell PowerEdge R320 RHEL 6.x (64-bit) Quad Core Intel E5-1410 @2.8GHz, w/HT  24GB + 8GB swap 130.199.60.91: 1Gb/s   S&C operations monitoring host  DAQ   December 2013  Icinga and sFlow
 mq01-mq03                      
cephmon01-03                      
                       
fc1 IBM System x3650 M3 -[7945AC1]- Sc.Linux 7.x 2x Intel Xeon E5645 (Hex-core, 2.40GHz, 12MB cache, HT available but disabled) 96GB + 24GB swap 130.199.148.86:
LACP: 2 x 1Gb/s
ServeRAID M1015 SAS/SATA Controller

2x 147GB 2.5" 15K RPM SAS in a RAID 1 with:

/boot: 1GB
/: 113GB
swap: 24GB


6x 300GB 2.5" 15K RPM SAS in a RAID 10 with:

/db01: 823GB
 
3336: rh-mysql57-mysql-server, master File Catalog server BCF

smartd (starsupport)

disk space mon. (starsupport)

MegaRAID Storage Manager (17.05.00.02) (starsupport)

Ganglia

Osiris (starsupport)

LogWatch (WB)

SKM

OCS Inventory

No Icinga

August 2011  
fc2 IBM System x3650 M3 -[7945AC1]- Sc.Linux 7.x 2x Intel Xeon E5649 (Hex-core, 2.53GHz, 12MB cache, HT available but disabled) 96GB + 24GB swap 130.199.148.87:
LACP: 2 x 1Gb/s
ServeRAID M1015 SAS/SATA Controller

4x 300GB SAS 15K RPM 2.5" drives in a RAID 10 with:

/boot: 1GB
/: 120GB
swap: 24GB
/db01: 411GB
3336: rh-mysql57-mysql-server, File Catalog slave BCF

smartd (starsupport)

disk space mon. (starsupport)

MegaRAID Storage Manager (17.05.00.02) (starsupport)

Ganglia

Osiris (starsupport)

LogWatch (WB)

SKM

OCS Inventory

No Icinga

August 2011  
fc3 Dell PowerEdge R610 Sc.Linux 7.x 2x Intel Xeon X5660 (Hex-core, 2.80GHz, 12MB cache, HT available but disabled) 96GB + 8GB swap 130.199.148.88:
LACP: 2 x 1Gb/s
PERC H700 RAID Controller with 6 600GB 10K SAS (2.5")
in a RAID 10 with

/ (150GB)
/boot (1GB)
swap (8GB)
/db01 (1.5TB)

rh-mysql57-mysql-server

unused FC slave(?)

UCM monitoring node?

BCF smartd (starsupport)

disk space mon (starsupport)

Ganglia

MegaRAID Storage Manager (17.05.00.02) (starsupport)

Logwatch (WB)

SKM

OCS Inventory

Osiris (starsupport)

No Icinga
February 2012 (inherited from RACF in 2018)  
fc4 IBM System x3650 M3 -[7945AC1]- Sc.Linux 7.x 2x Intel Xeon E5649 (Hex-core, 2.53GHz, 12MB cache, HT available but disabled) 96GB + 24GB swap 130.199.148.89:
LACP: 2 x 1Gb/s
ServeRAID M1015 SAS/SATA Controller

4x 300GB SAS 15K RPM 2.5" drives in a RAID 10 with:

/boot: 1GB
/: 120GB
swap: 24GB
/db01: 411GB
3336: rh-mysql57-mysql-server, File Catalog slave  BCF

smartd (starsupport)

disk space mon. (starsupport)

MegaRAID Storage Manager (17.05.00.02) (starsupport)

Ganglia

Osiris (starsupport)

LogWatch (WB)

SKM

OCS Inventory

No Icinga

August 2011  
dean
stiers
Dell PowerEdge R310 RHEL Workstation 7.x (64-bit) Intel Xeon X3460 (quad core) 2.8GHz, w/HT  8GB + 6GB swap 130.199.162.175: 1Gb/s

4 software RAID arrays spread over four identically partitioned 2 TB SATA disks:

swap: 6 GB, RAID 10

/boot: xfs, 236 MB, RAID 1

/: xfs, 46 GB, RAID 5

/export: xfs, 5.5 TB, RAID 5

80, 443: former primary online web server (replaced by ovirt virtual machine in late 2017)  DAQ  

mdmonitor (starsupport)

smartd (no)

disk space mon. (no)

Ganglia (yes)

Osiris (starsupport)

LogWatch (no)

SSH Key Mgmt. (yes)

DB start-up scripts (N/A)

December 2010 online Ganglia gmetad and web interface

OCS Inventory

Tomcat for ESL and SUMS scheduler stats

NFS server for online web content generation
 
                       
stardns1 Dell PowerEdge SC440 Sc.Linux 6.x (64-bit) Dual Core Intel Pentium D 2.80GHz  2GB + 4GB swap 130.199.60.150: 1Gb/s  One 200 GB and one 250 GB SATA disk, partitioned identically (so the larger disk is not fully used):

53: BIND (named, DNS server)

 

 DAQ   December 2006  secondary DNS server, supplementing daqman and onlldap (slave to daqman for all zones)
sun (aka drupal and www) Dell PowerEdge R610 RHEL WS 6.x (64-bit) Quad Core Intel Xeon E5520, 2.26GHz, 8M cache, w/HT  12GB + 10GB swap 130.199.59.200: 1Gb/s

Six 15K SAS drives of 73GB each on a PERC 6/i integrated (LSI megaraid) controller. 

first two are in a RAID 1, which contains /boot, / and swap (8GB)

remaining four are in RAID 5 mounted under /data (200GB)

80, 443: STAR primary webserver

25: postfix (SMTP server)
 
 BCF  

mdmonitor (N/A - HW RAID)

smartd (starsupport)

disk space mon.  (WB)

Ganglia (yes)

Osiris (starsupport)

LogWatch (no)

SSH Key Mgmt. (yes)

DB start-up scripts (N/A)

July 2009  STAR Webserver, eg:
 -- Drupal
 -- RT
 -- Ganglia

STAR Hypernews
 
 sunbelt Penguin Relion
2600SA
RHEL Workstation 6.x (64-bit) 2x Intel Xeon E5335 (Quad-core, 2.00GHz, 8MB cache)  16GB + 4GB swap 130.199.59.199: 1Gb/s

/boot 200MB RAID1

/ on 20GB RAID5

/db01 on 3.3TB RAID5

4GB swap on RAID5 array

Five 750GB, 7200 RPM SATA - RAID 5. Plus one spare 750GB (6 disks total)
 

   BCF

mdmonitor (WB, MA, DA)

smartd (WB, MA, DA)

disk space mon. (WB, DA)

Ganglia (yes)

Osiris (WB, MA)

LogWatch (WB)

SSH Key Mgmt. (yes)

DB start-up scripts (yes)

August or November 2007  sun Webserver & MYSQL emergency backup use
                       
stargw3 Dell Precision WorkStation T3400 Sc.Linux 6.x (64-bit)  Intel Core2 Quad CPU    Q9550  @ 2.83GHz  8GB + 6GB swap  130.199.60.93: 1Gb/s Two 500GB SATA drives partitioned identically with RAID 1 arrays for:

/boot: 239 MB, ext4

/: 453 GB, ext4

swap: 6GB
22: OpenSSH server

9619-9620: Condor-CE
 DAQ    May 2009 Online Grid gatekeeper
stargw4  Dell OptiPlex 755 Sc.Linux 6.x (64-bit)  Intel Core2 Duo CPU E8400 @ 3.00GHz  6GB + 6GB swap  130.199.60.74: 1Gb/s One 250GB and one 400GB SATA drive.  The drives are partitioned identically with RAID 1 arrays (so only 250 GB is being used from the 400 GB disk):

/boot: 500MB, ext4

/: 223 GB, ext4

swap: 6GB
 
 22: OpenSSH server  DAQ    January 2009  SSH gateway to starp, part of "stargw.starp.bnl.gov" DNS round-robin
stargw5  Dell OptiPlex 755 Sc.Linux 6.x (64-bit)  Intel Core2 Duo CPU E8500 @ 3.16GHz  4GB + 6GB swap  130.199.60.76: 1Gb/s  Two 160 GB SATA drives partitioned identically with RAID 1 arrays:

/boot: 500MB, ext4

/: 141 GB, ext4

swap: 6GB
 
 22: OpenSSH server  DAQ    September 2008  SSH gateway to starp, part of "stargw.starp.bnl.gov" DNS round-robin
onlldap  Dell PowerEdge R310 Sc.Linux 6.x (64-bit)  Intel Xeon X3440 (quad-core) @ 2.53GHz  8GB + 8GB swap  130.199.60.57: 2Gb/s (bonded 1Gb/s NICs)  Four 1.2TB SAS (10K, 2.5") HDD, identically partitioned with RAID arrays:

/boot: 388MB, ext4, RAID1

/: 118GB, ext4, RAID5

/ldaphome: 3.1TB, ext4, RAID5

swap: 8GB, RAID1
 
 53: named (BIND/DNS)

NFS

NIS
 DAQ    December 2011  Online Linux Pool home directory NFS server

online NIS server master
 
onlam3  Dell PowerEdge R310 Sc.Linux 6.x (64-bit)  Intel Xeon X3440 (quad-core) @ 2.53GHz  8GB + 8GB swap  130.199.60.153: 1Gb/s  Four 1.2TB SAS (10K, 2.5") HDD, identically partitioned with RAID arrays:

/boot: 388MB, ext4, RAID1

/: 118GB, ext4, RAID5

/ldaphome: 3.1TB, ext4, RAID5

swap: 8GB, RAID1
   DAQ    December 2011  backup Online Linux Pool home directories (cron'ed rsyncs)

online NIS server slave
 
online HTCondor Central Manager (collector/negotiator)
cephnfs                      
dean and dean2  oVirt Virtual Machines                    
ovirt1, ovirt2, ovirt3                      
onlcs                      
onlcs2                      
onlhome                      
onlhome2                      

 


There is a fairly comprehensive monitoring system for the database servers at http://online.star.bnl.gov/Mon/

 

Notes about the health and configuration monitoring items listed in the table:
 

(If a particular tool notifies anybody directly (email), then the initials of the notified parties are included.)

Failing disks and filling disks have led to most of the db problems that this writer is aware of.  Towards that end, we have several basic monitoring tools:

1. smartd -- if it starts indicating problems, the safest thing to do is replace the disk.  However, SMART frequently assesses a disk as healthy when it is not.  Also, the configurations in use have yet to be demonstrated to actually detect anything - I've no way to simluate a gradually failing disk.  Also, SMART's abilities and usefulness are highly dependent on the disk itself - even 2 similar disks from the same manufacturer can have very different SMART capabilities.  In any case, if we do have more disk failures, it will be interesting to learn if smartd gives us any warning.  At this point, it is a bit of crossing-the-fingers and hoping.  Any warning is a good warning...

2. mdmonitor or MegaRAID Storage Manager -- monitors software or hardware RAID configurations.

3. disk space monitoring -- We have a perl disk space monitoring script run via cron job.  The iniital warning point is any partition more than 90% full.

 

Other monitoring and configuration details:

Ganglia -- doesn't need much explanation here.

Osiris -- change detection system, in the manner of TripWire, but with additional details that can be monitored, such as users and network ports.

SSH Key management -- doesn't need much explanation here.