SECURITY BREACH? CALL 888.234.5990 EXT 9999

OLX Group Standardizes Incident Management with Customized Tools

About OLX Group

OLX Group operates one of the fastest-growing networks of trading platforms globally. Serving 322 million people every month in 30+ countries around the world, OLX Group helps users buy and sell cars, find housing, get jobs, buy and sell household goods, and much more. With more than 20 well-loved local brands including Avito, OLX, OLX Autos, Otomoto, and Property24, its solutions are built to be safe, smart, and convenient for its customers. 

OLX Group is powered by a team of 11,000+ people around the world. OLX Group is a division of Prosus, a global consumer internet group and one of the largest technology investors in the world. 

The Challenge

OLX previously created a Slack bot called the IMP (Incident Management Process) for managing incidents on various transaction networks. The organization needed to revamp the IMP in order to centralize incident reporting, but the SRE team did not have dedicated developers. Finally, OLX needed to quickly implement the IMP process that had already been defined. 

Difficulties with Incident Management

“Worldwide, we have many different teams, countries, cultures, and even different technical capabilities. Teams were taking different approaches to solving technical issues. There were teams that had multiple incidents every day, and there were others with less than a few incidents in a month, but no one had that information organized. We wanted a way to keep all the teams in sync,” explained Paulo Oliveira, SRE Manager, OLX Group. 

“Nearly every month, an urgent incident was occurring and woke up a team member in the middle of the night. There was a lack of insight into how issues were progressing, and an overall lack of visibility and standardization of incidents. Additionally, there was unclear ownership of documentation, which led to teams tackling issues in different ways, as well as insufficient documentation. 

“We needed to standardize the incident response and management process. We had to ensure each incident was reported appropriately and followed up on using standard procedures, as well as be able to analyze metrics generated by the IMP tool based on the incidents. For making informed decisions, accurate metrics are essential,” said Mr. Oliveira. 

The Solution

OLX had worked with Netrix in the past and based on that previous success, chose to work with Netrix to build a new solution. 

The Results

Netrix moved quickly and developed a new version of the IMP, an application that integrates with numerous other systems. After that success, Netrix supported OLX by building two additional apps. 

Automatically Generate Incident Tickets & Kick Off Resolution

When an incident occurs, the on-call tech is alerted and must confirm that the incident is valid. IMP will then create a Jira ticket automatically with the known information and a channel in Slack for the relevant team members. 

Using Jira to store incident information also alleviates the need for an additional database to manage. IMP uses OpsLevel, Jira, and Statuspage and will soon also integrate with Zoom. 

A significant value of IMP is that, beyond generating the ticket, it allows users to clearly and intuitively follow the workflow of an incident. Each status transition is accompanied by the correct buttons, helpful suggestions, and relevant references. IMP supports the entire incident workflow from start to finish, making every step easily accessible, thus simplifying tracking and enabling clear visualization of statistics. 

Organization-Wide Visibility

Engineering leadership can now view detailed analytics for each team in Europe. IMP features dashboards in OpsLevel with charts and graphs showing metrics such as the total number of incidents for each team.  

“The tool is heavily used. In the absence of the Netrix team, we wouldn’t have had the resources to do it in such a small time frame. Netrix’s expertise complements our skills, for instance knowledge in UX/UI and front-end development, which the SRE team lacked at the time. There is a great deal of speed in the development process of Netrix. As a result of their work along with the effort our developer teams have put into improving the code, we have had fewer and fewer incidents per team over time. The ideas Netrix contributed are well thought out, and we have productive discussions together. Even though Netrix is an external service provider, we treat them like a member of our team,” said Mr. Oliveira. 

Standardized Tools & Processes

“Netrix also helped the SRE team develop the front end of the Service Shaper tool, which is currently in production and is one of the most important projects on the team right now. We have a lot of internal procedures to create services. Every time we implement a new service, such as cloud procedures, databases, and security, there is a set of procedures and documentation that must be done. With Service Shaper, developers can focus on their specific programming tasks, while ensuring they’re compliant with numerous requirements. The developer selects the language template, answers a few questions, and Service Shaper creates a GIT project and all the required infrastructure. The aim is to minimize the developer’s repetitive tasks,” explained Mr. Oliveira. 

This project has begun to unify work processes. With the three applications Netrix has helped develop, OLX teams in Europe now use the same standard of tools and processes. 

Helpful Documentation and Training

Netrix provided documentation which now serves as the OLX guidelines for onboarding and training. The valuable documentation provides clarity for the standards and processes across geographies and teams with detailed information such as how to create an incident. 

AWS Technologies Powering OLX’s Incident Management and Automation

To meet OLX’s business and technical requirements, Netrix architected solutions using a suite of AWS services that enabled scalability, automation, and observability across the organization: 

  • Amazon EKS (Elastic Kubernetes Service) was used to orchestrate containerized workloads for several core systems, including the Orbit user management platform, the IMP Slack bot infrastructure, PDMT engine workloads, and Cora bot and web services. 
  • Amazon DynamoDB served as the NoSQL database backbone for storing user roles and team data (Orbit), caching incident data (IMP), tracking metrics and query history (PDMT), and logging conversations and FAQs (Cora). 
  • Amazon SQS (Simple Queue Service) facilitated asynchronous communication between services, powering incident notifications between IMP and Jira, background tasks in PDMT, Cora’s async requests, and syncing with Google Groups. 
  • Amazon SNS (Simple Notification Service) enabled real-time alerts and pub/sub messaging for role changes in Orbit, report triggers in PDMT, and query responses in Cora. 
  • AWS Lambda provided serverless compute capabilities to automate data processing, trigger workflows from SQS/SNS events, and execute lightweight backend logic for Cora and IMP. It also supported integrations with Google Sheets and Confluence. 
  • Amazon Q Developer, an AI-powered development assistant, accelerated feature development, integrated dashboards with New Relic, and supported troubleshooting and codebase optimization. 

These technologies allowed OLX to build a resilient, event-driven architecture that supports real-time incident response, seamless integrations, and scalable analytics. 

Netrix Global: Certified AWS MSP Partner Driving Observability, Resilience, and Secure Cloud Transformation

Netrix Global’s role in OLX’s transformation journey extended far beyond technical implementation. As a certified AWS Managed Service Provider (MSP), Netrix brought a unique blend of strategic advisory and hands-on expertise that was critical to OLX’s success. Their deep understanding of cloud-native architectures and observability practices enabled OLX to standardize incident management, unify service creation workflows, and build resilient systems that scale across geographies. 

The MSP designation is not just a credential—it reflects Netrix’s ability to deliver lifecycle-based cloud solutions that prioritize security, agility, and operational excellence. Their approach includes designing Well-Architected cloud environments and solutions, implementing real-time monitoring and automation, and ensuring compliance with evolving regulatory standards. These capabilities were instrumental in helping OLX build a robust incident response framework powered by AWS technologies. 

Ultimately, Netrix’s certified MSP status validated their ability to act as trusted advisors—guiding OLX through complex architectural decisions, accelerating development cycles, and embedding resilience and security into every layer of the solution. Their partnership exemplifies how strategic cloud expertise can drive measurable impact in modern infrastructure and operations. 

Experience The Impact Of Continuous Improvement

No matter what challenge you’re facing today, our team of technical experts can get you started on a path to a better solution, whether you’re modernizing finance operations or rethinking resource allocation through incremental improvements. We’ll partner with you to:

  • Understand your current technology environment.
  • Interview key stakeholders and external consultants to understand the root of the business issue(s), identify bottlenecks in workflows, and prioritize cost reduction opportunities.
  • Propose a solution with projected timelines, budget, and dependencies, ensuring even inefficient processes can be replaced with a structured approach via standardized processes that drive measurable business success, rolled out in manageable phases.
  • Deliver tangible results by tackling the redesign process head-on.