RCC Active Incidents

RCC Active Incidents

For information about the FlashLite and Euramoo clusters, please consult QRIScloud portal.

This page lists the currently active or recently resolved incidents that have impacted other HPC services.

Active cases

There are 4 active cases.

#9828: Tinaroo HPC: Interactive job submissions do not respect pmem and pvmem settings. (Incident Logged)

Logged 2017-06-21 09:06:12 +1000
Last updated 2017-06-21 09:09:06 +1000

Until further notice, when requesting memory for an interactive job (qsub -I ...), you should request mem and vmem and not use pmem and pvmem figures.

#9819: Tinaroo Remote Desktop access ... use tinaroo2.rcc.uq.edu.au (Incident Solved)

Logged 2017-06-19 12:36:00 +1000
Last updated 2017-06-27 13:40:40 +1000

Access to the Tinaroo Remote Desktop Facility is failing under some circumstances.
Please use Tinaroo2 directly to access the Remote Desktop until further notice.

SOLVED
Web based access to Tinaroo has been restored.
Please use the generic Tinaroo Remote Desktop Portal and follow the link.

#9809: Tinaroo HPC: Problem with MultiNode MPI jobs (Incident Solved)

Logged 2017-06-15 19:09:12 +1000
Last updated 2017-06-16 14:38:08 +1000

There appears to be a problem with the handling of multi-node jobs on Tinaroo.
A number of jobs seem to have left running processes on compute nodes.
The Multiple and Long queue has been stopped temporarily.

Update: Rogue processing has been stopped on the impacted nodes and the queues restarted.

#9261: Tinaroo: Job related emails are not working (Incident Active)

Logged 2016-11-10 10:54:28 +1000
Last updated 2016-11-10 10:54:28 +1000

Emails out of the Tinaroo PBS server do not work.

Recent cases

There are no recent cases.