Power outage at IFCA

A brief power outage affected some services of the IFCA Datacenter last night. Some of them have already recovered; the rest are waiting for a manual check.

OpenStack update

Over the next few weeks we will update all the OpenStack (Cloud) services, taking advantage of the holiday period when many of you are away. The services will be upgraded gradually to newer versions. They include compute, network, block storage (volumes) and portal access.

Update (08-11-2022): Some services have been successfully updated and are being monitored for problems (the Identity Service in particular, since most other services depend on it). The remaining services on the list are either currently failing (Dashboard) or not yet updated (compute and network).

Update (19-08-2022): Update tasks performed. The system will be monitored for a few days.

Update (01-09-2022): All services have been updated except compute. We will now proceed with this last update.

IFCA Computing status page

Please refer to this page in order to get the most accurate status of the available resources of the IFCA Advanced Computing and e-Science group.
Follow us on Twitter or subscribe to RSS to be alerted of new events.

Some services are degraded.
Glance multibackend configuration

Add new RBD backend support for Glance images

OpenStack Image catalog (Glance) is Disrupted

+ Networking
Up and running
+ Authentication system
Up and running
+ Cloud Infrastructure
Disrupted
+ Storage systems
Up and running
+ Altamira supercomputer
Up and running
+ Grid and HTC
Up and running
+ Web and miscellaneous services
Up and running

Past events

Miscellaneous services not connecting

Good morning,

We have detected that some services are not working properly and are working on it.

We apologize for the inconvenience.

Update: “Machines with attached volumes may need to be restarted (use stop/start rather than reboot). After the restart there may still be brief interruptions or added latency in disk reads.”
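For users who prefer to script the stop/start cycle recommended in the update, the following is a minimal sketch, not an official IFCA procedure. It assumes the openstacksdk Python library and its compute proxy (`conn.compute`); the helper name `stop_then_start`, the cloud entry `ifca` and the instance name `my-instance` are hypothetical placeholders.

```python
def stop_then_start(compute, server, wait=300):
    """Power-cycle a server with stop/start instead of a reboot.

    `compute` is assumed to be an openstacksdk compute proxy (conn.compute);
    `server` is a server object or ID accepted by that proxy.
    """
    # Ask Nova to shut the instance down, then wait until it is really off.
    compute.stop_server(server)
    server = compute.wait_for_server(server, status="SHUTOFF", wait=wait)

    # Start it again and wait until it reports ACTIVE.
    compute.start_server(server)
    return compute.wait_for_server(server, status="ACTIVE", wait=wait)


# Example usage (assumes openstacksdk is installed and a clouds.yaml entry
# named "ifca" exists; both names below are placeholders):
#
#   import openstack
#   conn = openstack.connect(cloud="ifca")
#   server = conn.compute.find_server("my-instance")
#   stop_then_start(conn.compute, server)
```

Waiting for SHUTOFF before calling start keeps the two operations from overlapping, which is the point of preferring stop/start over a single reboot request.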

OpenStack Compute Nodes was Monitored

Confluence instances unavailable

Atlassian, the vendor of Confluence (our wiki system), has disclosed a CRITICAL vulnerability. Our Confluence instances will remain unavailable until a patched version is available.

Problem solved

Wiki (Confluence) was Down

Nextcloud Upgrade
Internal networking was Maintenance

Upgrade Slurm to the latest version to fix security issues
Batch System was Disrupted

[OpenStack] Problem launching new instances

OpenStack returns an error when a new instance is created. We are monitoring the OpenStack services to find the cause of the issue.

The problem has been solved.

OpenStack Compute (nova), OpenStack Networking (Neutron), OpenStack Cloud Public APIs were Disrupted

[Cloud] Failure launching new instances

Since the last version upgrade of some OpenStack components, launching new instances fails intermittently with the error:

Build of instance d47XXXXX aborted: Failed to allocate the network(s), not rescheduling.

We are monitoring the system and trying to resolve it ASAP.

OpenStack Cloud Public APIs, OpenStack Networking (Neutron), OpenStack Compute (nova), OpenStack Compute Nodes were Monitored

[OpenStack] Upgrade version of services

An upgrade of the OpenStack Cloud Computing services is scheduled to begin on Jan 24 and will continue until all services are updated.

The compute and networking services have been successfully updated.

OpenStack Compute (nova), OpenStack Networking (Neutron) were Up and running

IFCA Network IP Movement

IFCA IP address move affecting two hostnames: repo.ifca.es and portal.cloud.ifca.es

IFCA repository mirror, OpenStack Dashboard (Horizon) were Disrupted

Data transfer system upgrade
Computing Elements was Disrupted

IFCA-RedIris network topology change

Network connectivity was successfully restored at 8:35 am (local time).

Services seem to be working fine.

OpenStack Dashboard (Horizon), Login nodes, User Interfaces, GitLab, Indico Agenda pages, Wordpress pages, Wiki (Confluence), External network connection were Disrupted


IFCA Advanced Computing and e-Science group.
