EAST-1 power failure (resolved)

At 00:57 CEST on Monday, May 29th, a power outage caused the cooling system at Ångström Laboratory to shut down, leading to a rapid increase in temperature within the compute hall. To prevent the temperature from rising further and to safeguard the equipment, all systems in the compute hall were forcibly powered off. The cooling system was restored at approximately 05:00.

Due to the elevated temperatures experienced during the outage, additional inspections are required to ensure that the compute hall and its compute, storage, and network hardware are functioning as expected. So far, we have identified an issue with one of the two UPS units.

Throughout the day, we will provide regular updates regarding the progress of the recovery efforts and the status of the affected equipment. We are working diligently to resolve any issues and restore normal operations as soon as possible.

Update 2023-05-29 11:00

The compute hall is fully operational again. We are now working on restoring systems.

Shutdown of all systems on 2 February at 07:00 CET

The UPPMAX compute hall hosting EAST-1 will be partially shut down on 2 February between 07:00 and 11:00 CET while Akademiska Hus performs work on the cooling circuit. The shutdown has been planned to coincide with our February maintenance day. We will try to provide some level of access, but expect all compute capability to be unavailable until the work is completed.

Maintenance with downtime in the HPC2N region

Downtime is planned in the HPC2N region on Monday the 20th of April between 06:00 and 12:00 and on Tuesday the 21st of April between 11:00 and 17:00, due to urgent electrical work. All running instances will be suspended before the outage and restarted afterwards.

The other regions will not be affected. If you can, we suggest moving your workloads to the new WEST-1 region, which is running a much more recent version of OpenStack on new hardware.