Akeyless Partners with MoovingON to Enhance Platform Reliability
Introduction
Akeyless simplifies the deployment, access, and management of secrets without the cost and complexity of managing vaults. Their innovative technology and cloud-native architecture enable enterprises to secure DevOps cloud workloads and legacy environments, meeting compliance and regulatory requirements.
The Challenge
As a company that prioritizes platform uptime and minimizing client impact, Akeyless was looking to maintain its high standards for platform reliability. MoovingON’s NOC service already offered Akeyless exceptional 24/7 tier 1 remediation, but Akeyless needed continuous sanity checks for its WebUI and CLI during incidents to ensure platform stability. Each playbook for these sanity tests took between 10 and 15 minutes to execute per environment, making the request impractical to fulfill manually across multiple monitored production environments.
The Solution
In response to this challenge, MoovingON automation professionals developed a robust solution to meet Akeyless’ requirements efficiently and effectively. The project was led by MoovingON’s customer success team, who designed a three-phase process.
In phase one, MoovingON developed a script to trigger sanity tests immediately upon receiving an incident alert. By utilizing various webhooks, the solution enabled parallel execution of WebUI and CLI tests. The test results were sent to a Slack channel, providing real-time updates with step-by-step statuses labeled “Success” or “Failed.”
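The parallel execution described in phase one can be illustrated with a minimal Python sketch. It is not MoovingON’s actual script: the suite names, step names, and the helper functions `run_suite`, `run_suites_in_parallel`, and `format_status_message` are all hypothetical, and the sketch only builds the status text rather than posting it to Slack.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List, Tuple

# A step is (step name, check function that raises an exception on failure).
Step = Tuple[str, Callable[[], None]]
SuiteResult = List[Tuple[str, str]]


def run_suite(steps: List[Step]) -> SuiteResult:
    """Run each step in order and label it "Success" or "Failed"."""
    results = []
    for name, check in steps:
        try:
            check()
            results.append((name, "Success"))
        except Exception:
            results.append((name, "Failed"))
    return results


def run_suites_in_parallel(suites: Dict[str, List[Step]]) -> Dict[str, SuiteResult]:
    """Run every suite on its own worker thread and collect results by suite name."""
    with ThreadPoolExecutor(max_workers=max(1, len(suites))) as pool:
        futures = {name: pool.submit(run_suite, steps) for name, steps in suites.items()}
        return {name: future.result() for name, future in futures.items()}


def format_status_message(results: Dict[str, SuiteResult]) -> str:
    """Build the step-by-step status text that a chat notification could carry."""
    lines = []
    for suite, steps in results.items():
        lines.append(f"{suite} sanity test")
        lines.extend(f"  {step}: {status}" for step, status in steps)
    return "\n".join(lines)


# Hypothetical placeholder checks; real checks would drive the WebUI and the CLI.
def _ok() -> None:
    pass


def _broken() -> None:
    raise RuntimeError("simulated failure")


EXAMPLE_SUITES: Dict[str, List[Step]] = {
    "WebUI": [("login", _ok), ("create secret", _ok)],
    "CLI": [("authenticate", _ok), ("get secret", _broken)],
}

if __name__ == "__main__":
    print(format_status_message(run_suites_in_parallel(EXAMPLE_SUITES)))
```

Running the suites on separate threads means the total wall-clock time is bounded by the slowest suite rather than the sum of both, which matters when each playbook takes 10 to 15 minutes per environment.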
In phase two, to enhance the value provided to Akeyless, MoovingON integrated latency checks, measuring the time taken for each step to execute. The results were pushed to moovingon.ai, MoovingON’s CloudOps management platform, allowing the collection of status and time data to build a comprehensive daily report. This proactive approach not only allowed for automated checks every 30 minutes but also scaled rapidly across five different environments.
In the last phase, Akeyless requested detailed reports with screenshots for failed execution steps. MoovingON automation professionals built an additional process using Allure Report, Selenium, and Pytest to create a user-friendly interface for these reports. Upon failure, the system generated detailed reports with all steps and screenshots, which were then pushed to Akeyless’ AWS S3 bucket and Bitbucket repositories.
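The per-step latency measurement and the failure-triggered capture described above can be sketched in plain Python. This is an illustration under stated assumptions, not the production implementation: `StepResult`, `run_timed_step`, `summarize`, and the `on_failure` callback are hypothetical names. In a Selenium-based setup the callback might wrap a call such as `driver.save_screenshot(path)`, but the sketch uses a simple list so it runs without a browser.

```python
import time
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class StepResult:
    name: str
    status: str      # "Success" or "Failed"
    seconds: float   # wall-clock duration of the step itself


def run_timed_step(
    name: str,
    check: Callable[[], None],
    on_failure: Optional[Callable[[str], None]] = None,
) -> StepResult:
    """Time a single step and invoke on_failure(name) if the step raises."""
    start = time.perf_counter()
    try:
        check()
        status = "Success"
    except Exception:
        status = "Failed"
    # Stop the clock before the failure hook so capture time is not counted as latency.
    elapsed = time.perf_counter() - start
    if status == "Failed" and on_failure is not None:
        on_failure(name)
    return StepResult(name, status, elapsed)


def summarize(results: List[StepResult]) -> dict:
    """Aggregate step results into the kind of figures a daily report could show."""
    if not results:
        return {"runs": 0, "failed": 0, "mean_seconds": 0.0, "slowest": None}
    return {
        "runs": len(results),
        "failed": sum(r.status == "Failed" for r in results),
        "mean_seconds": sum(r.seconds for r in results) / len(results),
        "slowest": max(results, key=lambda r: r.seconds).name,
    }


if __name__ == "__main__":
    failed_steps: List[str] = []  # stand-in for a screenshot capture hook

    def simulated_failure() -> None:
        raise RuntimeError("simulated failure")

    results = [
        run_timed_step("login", lambda: time.sleep(0.01)),
        run_timed_step("get secret", simulated_failure, on_failure=failed_steps.append),
    ]
    print(summarize(results))
    print("steps needing a screenshot:", failed_steps)
```

Keeping the timing and the failure hook in one wrapper means every step yields both a status and a duration, so the same records can feed a real-time notification and a later aggregate report.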
Results and Benefits
Partnering with MoovingON automation professionals allowed Akeyless not only to enhance the value of the NOC service but also to achieve several key benefits in its reliability strategy:
- Increased Efficiency: Automated sanity tests reduced the manual effort required, ensuring timely and accurate monitoring.
- Enhanced Reporting: Detailed, automated reports with screenshots provided clear insights into failures, facilitating quicker resolution.
- Improved Uptime: Continuous, proactive monitoring and real-time alerts minimized client impact and ensured platform stability.
“MoovingON’s team has been instrumental in ensuring our platform’s reliability and performance. Their innovative solutions and proactive approach have allowed us to maintain our high standards without significant engineering overhead. The real-time alerts and detailed reporting have made a huge difference in our incident management process, truly integrating them as a part of the Akeyless team.” – Ori Mankali, SVP Engineering
Summary
Partnering with MoovingON was a strategic move for Akeyless, aimed at enhancing responsiveness and operational efficiency. This collaboration enabled Akeyless to foster transparency and drive innovation, positioning the company for continuous growth. By focusing on customer-centric solutions, Akeyless is set for ongoing success and development.