Dear ScienceCloud users,
we are pleased to announce that ScienceCloud is back in production.
You can login again on the web interface: https://cloud.s3it.uzh.ch
What happened to your running instances ?
- All of your running instances experienced a power interruption - pretty much like when your own laptop suddenly loses battery power.
- All data that was in use at the moment of the power off could be in an inconsistent state.
- All memory content has been lost.
- All running applications/processes/services have been interrupted.
What scenario will you find ?
- You should be able to see all of your previously running instances.
- All of your previously running instances should be in 'Shutoff' state.
- All data stored on your previously running instances - being attached volume or simply on the root filesystem - should be available except for the very data being written at the time of the shutdown.
- All Volumes, Networks, Security groups, Keypairs, ObjectStore Containers should be available as before.
- You should *not* be able to connect to any of your previously running instances - because they are in 'Shutoff' state.
What steps you will need to take in order to continue working on ScienceCloud ?
- Restart your previously running instances by logging in to https://cloud.s3it.uzh.ch and selecting "Instances" -> "Start instance" on the right-most menu on the corresponding instance's row.
- If the instance status is reported as 'Active', connect to your running instance as before.
- If data volumes were attached and mounted, you will have to check the consistency of the data on your volume *before* using it:
- make sure the volume is attached but *not* mounted - in case:
- List currently mounted devices
- Record all /dev/vdX (e.b. /dev/vdb) - except /dev/vda - and their corresponding mount point (e.g. /data)
- For each /dev/vdX run
$ sudo umount /dev/vdX
$ sudo fsck.ext4 -p /dev/vdX
- Mount the volume as before
$ sudo mount /dev/vdX [Path where the volume was mounted]
What about my SLURM cluster ?
If you had a running virtual slurm cluster started with elasticluster, please get in touch with us at: email@example.com for more detailed instructions.
Do not hesitate to contact us in case of you have any issue with the restart of your ScienceCloud instances.
We apologize again for the inconvenience this service interruption has caused you.
With best regards,
Your ScienceCloud sysadmin team