CORTEX enhances network reliability through active monitoring and fault management. It continuously oversees systems and network elements, polling them periodically to detect faults. To do this, CORTEX communicates with network elements over a range of protocols, including SSH, Telnet, and HTTPS/SOAP requests, so it can work with a wide variety of equipment.
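As an illustration of periodic polling, the following is a minimal sketch of a single polling pass. It is not CORTEX's implementation: the function `poll_once` and the `probe` callable are hypothetical names, and `probe` stands in for whatever protocol-specific check (SSH, Telnet, HTTPS/SOAP) would be used in practice.

```python
def poll_once(elements, probe):
    """Run one polling pass and return a list of (element, reason) faults.

    `probe` is a hypothetical callable that returns True when the element
    is healthy, returns False when it is not, or raises on a failed check.
    """
    faults = []
    for element in elements:
        try:
            healthy = probe(element)
        except Exception as exc:
            # An unreachable or erroring element is itself treated as a fault.
            faults.append((element, f"probe error: {exc}"))
            continue
        if not healthy:
            faults.append((element, "unhealthy"))
    return faults
```

A scheduler would call `poll_once` at a fixed interval and pass any returned faults on for enrichment and correlation.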
In addition to initiating communication, CORTEX can receive SNMP Traps (v1, v2c, and v3) from network elements. These traps, which are notifications of network events or issues, are then parsed and validated by CORTEX for accuracy and relevance. CORTEX also integrates with existing Fault Management applications, from which it can receive notifications of network faults.
Upon receiving a fault report, CORTEX undertakes an enrichment process. It queries data from Inventory systems and network elements to add context and depth to the fault report. This enrichment is critical in providing a comprehensive view of the fault. CORTEX also excels in correlating a network fault report with other reported faults, considering factors like the type of fault, reporting devices, and fault hierarchy relationships. This correlation is key in identifying patterns and potential widespread issues.
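The hierarchy-based part of correlation can be sketched as follows. This is an assumed, simplified model rather than CORTEX's own logic: the `Fault` class, the `correlate` function, and the `parent_of` mapping (child device to parent device) are hypothetical, and only the hierarchy factor is shown, not fault type.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Fault:
    fault_id: str
    device: str
    fault_type: str


def correlate(faults, parent_of):
    """Group fault IDs under the highest ancestor device that also reported a fault."""
    faulty_devices = {f.device for f in faults}
    groups = defaultdict(list)
    for fault in faults:
        root = fault.device
        node = fault.device
        seen = {node}
        # Walk up the hierarchy, remembering the top-most faulty ancestor.
        while node in parent_of:
            node = parent_of[node]
            if node in seen:  # guard against cycles in the inventory data
                break
            seen.add(node)
            if node in faulty_devices:
                root = node
        groups[root].append(fault.fault_id)
    return dict(groups)
```

With this grouping, a port fault reported beneath a router that is itself faulty is attached to the router's group, which hints that the router fault may be the wider issue.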
When a fault is identified, CORTEX creates a new Trouble Ticket or updates an existing one with detailed information about the fault. This action triggers additional activities outlined in the Incident Management use case, ensuring a proactive and systematic approach to fault resolution.
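The create-or-update behaviour described above might look like the sketch below. It uses an in-memory store purely for illustration; `TroubleTicket`, `TicketStore`, `raise_fault`, and the idea of a string correlation key are all assumptions, not part of any CORTEX or ticketing-system API.

```python
from dataclasses import dataclass, field


@dataclass
class TroubleTicket:
    ticket_id: int
    key: str
    notes: list = field(default_factory=list)


class TicketStore:
    """Hypothetical in-memory store of open tickets, keyed by correlation key."""

    def __init__(self):
        self._open = {}
        self._next_id = 1

    def raise_fault(self, key, detail):
        """Append `detail` to the open ticket for `key`, creating one if needed.

        Returns (ticket, created), where `created` is True for a new ticket.
        """
        ticket = self._open.get(key)
        created = ticket is None
        if created:
            ticket = TroubleTicket(self._next_id, key)
            self._next_id += 1
            self._open[key] = ticket
        ticket.notes.append(detail)
        return ticket, created
```

Returning the `created` flag lets the caller decide whether to start the follow-on Incident Management activities or simply record an update.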
Throughout this entire process, CORTEX maintains a high level of communication, issuing progress notifications at key stages. These updates are crucial for keeping internal teams and external stakeholders informed about the status of network faults and the ongoing efforts to resolve them. This comprehensive approach to fault management by CORTEX ensures prompt detection and resolution of network issues and enhances overall network performance and reliability.
Recognising faults in delivered services is an essential aspect of Service Assurance, a critical process ensuring network operations’ reliability and efficiency. Typically, network elements initially report faults, signalling potential issues that need immediate attention. However, the process goes beyond fault detection. Each reported fault undergoes a systematic classification, prioritisation, and correlation with other faults. This structured approach is vital to understanding the severity and impact of each fault within the broader network context.
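A simple way to picture prioritisation is to order faults by severity and then by impact. The sketch below assumes four severity labels and an `affected_services` count on each fault; both the labels and the field names are illustrative choices, not values taken from CORTEX.

```python
# Assumed severity labels, most urgent first (lower rank = higher priority).
SEVERITY_RANK = {"critical": 0, "major": 1, "minor": 2, "warning": 3}


def prioritise(faults):
    """Return faults ordered by severity, then by number of affected services.

    Each fault is a dict with hypothetical keys 'id', 'severity' and
    'affected_services'.
    """
    return sorted(
        faults,
        key=lambda f: (SEVERITY_RANK[f["severity"]], -f["affected_services"]),
    )
```

Faults of equal severity are ordered so that the one affecting more services is handled first.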
Once a fault is identified and assessed, the next crucial step is root cause analysis. This involves a meticulous collection of data and evidence, followed by applying heuristics and logical reasoning to pinpoint the underlying cause of the fault. Identifying the root cause is key to determining the most effective rectification actions. These actions are then carefully selected and applied to address and resolve the fault.
When immediate rectification is not feasible, Service Assurance strategies focus on minimising the fault’s impact. This could involve temporary measures such as re-routing traffic to maintain service continuity. Such interim solutions keep the network functioning while buying time for a more permanent rectification to be planned and executed later.
Fault and Event management also sit across the Life Cycle Management line, allowing for reliable network operations.
+44 23 8254 8990
Mon–Fri from 9am to 5pm
(GMT)
22 Kings Park Road
Southampton
SO15 2AT