NTA - Network Issues – Incident details

All systems operational

Network Issues

Resolved
Major outage
Started 4 days agoLasted about 17 hours

Affected

Inbound Calls

Major outage from 3:42 PM to 4:55 PM, Operational from 4:55 PM to 12:00 AM

Outbound Calls

Major outage from 3:42 PM to 4:55 PM, Operational from 4:55 PM to 12:00 AM

Web Interface

Major outage from 3:42 PM to 4:55 PM, Operational from 4:55 PM to 12:00 AM

Busy Lamps

Major outage from 3:42 PM to 4:55 PM, Degraded performance from 4:55 PM to 5:38 PM, Partial outage from 5:38 PM to 12:00 AM

API

Major outage from 3:42 PM to 4:55 PM, Operational from 4:55 PM to 12:00 AM

Secondary Services

Major outage from 3:42 PM to 4:55 PM, Operational from 4:55 PM to 12:00 AM

Updates
  • Postmortem
    Postmortem

    At 15:42, our upstream connectivity supplier experienced a service outage,  causing interface instability (“flapping”) and subsequent service disruption within our network environment.

    At 15:50, investigations confirmed that the primary customer impact was related to call completion. The instability affected SIP signalling, causing a core service to become unresponsive and preventing calls from completing successfully. Also, preventing access to disable the affected routes and allow the Disaster Recovery to kick in

    At 16:21, NTA engineers managed to disable the affected interfaces to stabilise the platform and initiate established disaster recovery procedures.

    At 16:55, service stability was progressively restored following supplier recovery actions, and all services were fully operational and performing within normal parameters.

    NTA has already scheduled a programme of infrastructure improvements for April, including the migration from the current connectivity supplier to an alternative provider that will deliver enhanced redundancy while improving service resilience. This programme also includes increased rack capacity, deployment of new server infrastructure, and a major platform upgrade to support modern technologies, including AI capabilities and future business growth

  • Resolved
    Resolved

    This incident has been resolved. If your handset is still offline, please power cycle it. A reason for the outage will be issued after a thorough investigation into the cause has been conducted. Sorry for any inconvenience this may have caused you

  • Update
    Update

    Whilst services are fully restored, we are waiting for an update from our suppliers as to the reason for the outage. Once we have that,a reason for outage will be issued.

  • Monitoring
    Monitoring

    We now have a restricted service back-up and running, but services such as BLF will not be fully operational for a while as we concentrate on getting things running back as normal

  • Update
    Update

    Whilst we do have failover mechanisms in place, these only apply when the primary connection is hard down.
    As it currently flapping, this prevents the system moving into failover as the connection is not hard down.
    Unfortunately, this has put the system in a state where we will have to manually disconnect the link in order to utilise the other links. Our engineers are at site at the datacentre and working on the servers

  • Update
    Update

    Engineers have been dispatched to one of the datacentres to work on the issue

  • Identified
    Identified
    We are continuing to work on a fix for this incident.
  • Investigating
    Investigating

    We are currently investigating this incident at our datacentres, which is affecting our network and the abilility to make and receive calls.