‎01-10-2022 4:14 PM
‎01-10-2022 4:33 PM
‎01-18-2022 10:06 PM
8 days later and another outage. SAP better check their data center for loose cables.
1/18/2022 (1 Hour 34 Minutes)
1/10/2021 (6 hours)
12/30/2021 (8 minutes)
12/10/2021 (5 minutes)
11/18/2021 (3 hours 46 minutes)
‎01-10-2022 4:42 PM
‎01-10-2022 7:06 PM
‎01-10-2022 7:17 PM
‎01-10-2022 7:11 PM
‎01-10-2022 7:29 PM
Where did you get this Update ? On a support ticket? Because CAC has the same message as before
We are continuing to investigate the issue impacting your SuccessFactors Applications. Recovery efforts are in progress to restore application access. We are investigating the issue and will check back with you as we learn more information.
‎01-10-2022 9:53 PM
‎01-10-2022 10:02 PM
‎01-13-2022 1:07 PM
‎01-13-2022 2:58 PM
‎01-13-2022 3:16 PM
‎01-18-2022 6:08 PM
Our users are receiving the same error when trying to access our Production environment as we were last week during the outage. The Cloud Availability Center isn't showing any issues. Is anyone else experiencing any issues?
‎01-18-2022 6:10 PM
‎01-18-2022 6:12 PM
‎01-18-2022 6:11 PM
‎01-18-2022 6:12 PM
‎01-18-2022 6:31 PM
‎01-18-2022 6:59 PM
‎01-18-2022 6:14 PM
‎01-18-2022 6:18 PM
‎01-18-2022 6:40 PM
‎01-18-2022 6:50 PM
‎01-18-2022 6:59 PM
‎01-18-2022 7:05 PM
I think the SLA for root cause analysis is 10 days (working days). so we should have something by end of week or Monday).
‎01-19-2022 2:04 PM
‎01-21-2022 7:39 PM
Per KBA 2116114 - Root Cause Analysis (RCA) Policy for SuccessFactors Cloud:
Timeline: The target timeline for the RCA to be published in CAC is within 20 days of the incident resolution.
The Jan 10th event EV8166899 has RCA already published to CAC ahead of this timeline.
‎01-21-2022 7:57 PM
this came on 1/20
Dear Customer, We have completed the root cause analysis for this incident and would like to share it with you. Incident Description: On January 10, 2022, customers hosted in the Ashburn (DC08) Data Center experienced a service disruption while accessing multiple SuccessFactors applications. The disruption was caused by a network connectivity issue that continuously caused multiple network switches to restart. The constant restarting of the switches did not allow application requests to process. As a result, the vCenter (central management utility used to manage virtual machines (VMs) from a centralized location) lost connectivity to the underlying storage devices causing all the VMs hosted in DC8 to become non-responsive. Incident Resolution: The network team disabled an authentication configuration on the impacted switches to resolve the issue, which recovered the switches and restored network connectivity. After the network resolution, Cloud Operations recovered the Center, then restarted all VMs in sequential order. The recovery process was complex due to several service interdependencies. After appropriate validation checks, Cloud Operations confirmed service restoration on January 10, 2022. Root Cause Analysis: The root cause of this incident has been attributed to a software defect on a specific version of the network vendor switch firmware. The vendor had recommended an iterative upgrade approach using the interim version of firmware as a first step, followed by an upgrade to the final firmware release. It was during this iterative approach/interim version, the defect was introduced and led to the network connectivity issue/service disruption for all SuccessFactors applications. Preventive Measures: Measure #1: Upgrade the network vendor switch firmware to the final firmware release. (Completed: January 15, 2022) Measure #2: Audit checks of the firmware version on all remaining network devices not affected by the issue, and an upgrade to final firmware version where required. (Target Date: February 25, 2022) Measure #3: Future network switch upgrades to be performed after thorough verifications and checks by the network vendor. (Target Date: February 25, 2022) Measure. #4: Create an automated service restoration process for data center level outages to restore services as quickly as possible if server restarts are required for resolution. (Target Date: February 17, 2022) |