2014 Shutdown activities
In this blog entry, I will try to keep thoughts on necessary or suggested activities for the 2014 shutdown period. It will likely be somewhat jumbled for a while, as initially it is just a brain dump of thoughts:
Slow Controls computers all should get some attention:
- sc and sc5: upgrade to Sc.Linux 6.x. (The computers were bought in June 2013, so the hardware is relatively good.)
- alh, softioc3 and barbados: replace hardware (average age of this hardware is 9-10 years) and upgrade to Sc. Linux 6.
replace at least one stargw machine, bring stargw machines up to SL/RHEL 6 (7?)
migrate online web server to RHEL 6 (RHEL 7?). Note also that dean still has dbbak.starp.bnl.gov defined in /etc/hosts as 130.199.59.204, which is really db04.star.bnl.gov. This should be removed, regardless of the upgrade status (done).
replace tofcontrol computer - primarily to have a system with easy-to-swap hard drives - current one requires removing case from the rack, opening the case, removing multiple mounting screws and connectors. Would prefer something in which drives can be swapped with a 5 minute access.
Jack E. mentioned an interest in upgrading startrg to SL 6.x
remove fpdcontrol?
replace or find alternative for the eemc-testdaq laptop (primarily used by Will Jacobs for EEMC work during shutdown and access days)
evp3 setup (finally) and possible replacement for old evp, which is unstable
replace as many network switches as possible with switches that have sFlow and a MIB that includes MAC address to port lists for easy SNMP extraction
Develop (or deploy 3rd party) MAC port mapping software.
DAQ network upgrade
start using ldap or some alternative to NIS for stargw and OLP + possible expansion to other systems
re-assess our backup systems and what we are backing up, possibly introduce our own general purpose backup service going beyond current dean and onlldap rsyncs to dean3 and onlam3 respectively. (Lots of storage will be required for this)
onlldap to onlam3 failover test/improve/document
dean to dean3 failover test/improve/document
expand OCS inventory
restore widespread use of osiris, with tuned configuration to reduce repetitive change notices
go through all Linux systems and update to latest 5.x or 6.x.
Further spread of SKM client (eg. daqman et al.)
DAQ Room temperature monitoring (South Platform too?) RACF uses product(s) from Synapsense; other vendors have similar products.
Windows policy / group membership check - make better split between online and offline machines
Initial tests with RHEL/Sc. Linux 7. What about CentOS?
Additional storage for the trgscratch machine - Hank has suggested 20TB of additional storage for coming run. (NB. 11TB were added during the week of August 10.)
- wbetts's blog
- Login or register to post comments