We are experiencing some problems with our job scheduler. It is running
but accepts new jobs only sporadically, if at all, and will time out when you
try to access job information (for example, with checkjob). Running jobs
will not be impacted.
The cause is not obvious (most of the administrative commands time out,
so diagnosing is difficult), but there appears to be a problem with the
communication between the resource manager and the scheduler. We are
trying everything possible to get this working.
We are pleased to welcome a new member to our team, Terry Ward.
Terry has 43 years of experience working in systems administration, mostly in large data center environments. His experience encompasses a broad range of IT activities, including system architecture, network management, and software development.
Terry will join our support team and take over a number of critical systems management tasks, including HPC scheduler management and storage operations.
As a reminder, FSU will be closed for the Winter holiday, starting tomorrow, Dec 24, through Jan 4. We will re-open on January 5.
During this break, our systems will remain online and functional. RCC staff will respond to any critical support requests sent to firstname.lastname@example.org as soon as we are able. All non-critical support requests will be answered when we return on January 5, 2015.
On behalf of the staff here at the RCC, we wish you a fun, festive, and high-performance holiday!
Researchers at major Florida universities can now share data faster and more easily than ever with a recently deployed state-wide storage system. This system, implemented as an initiative of the Sunshine State Education and Research Computing Alliance (SSERCA), uses DDN's Web Object Storage technology. RCC's NoleStor system is part of this new state-wide network.