EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
We are actively seeking an experienced Senior Observability DevOps Engineer to join our team. In this role, you will contribute to the development of cutting-edge operational intelligence solutions for our engineering community, enabling teams to deliver quality customer value quickly and autonomously.
Responsibilities
- Design and implement monitoring solutions using Azure Application Insights, Log Analytics, and Grafana for comprehensive system performance analysis
- Utilize GitHub Actions for continuous integration and deployment to enhance engineering autonomy and productivity
- Develop automation scripts using Python and Bash to streamline operational intelligence processes
- Leverage Kubernetes and Terraform for efficient container orchestration and infrastructure provisioning
- Implement and manage Grafana Loki, Mimir, and Tempo for advanced observability solutions
- Collaborate with cross-functional teams to ensure the successful implementation of observability solutions
- Provide technical mentorship and guidance to junior team members in monitoring and observability engineering
Requirements
- At least 3 years of relevant work experience in monitoring and observability engineering
- Expertise in Azure Application Insights and Log Analytics for comprehensive monitoring and analysis
- Proficiency in GitHub Actions for continuous integration and deployment
- Experience with Grafana for visualization and monitoring of system performance
- In-depth knowledge of Kubernetes and Terraform for efficient container orchestration and infrastructure provisioning
- Familiarity with Grafana Loki, Mimir, and Tempo for advanced observability solutions
- Knowledge of Python / Bash
- Strong problem-solving skills and the ability to work autonomously
- Excellent communication and collaboration skills for effective teamwork
- Willingness to participate in a 24/7 rotation
- English language skills at an Upper-Intermediate level or higher
Nice to have
- Experience working in a platform team
- Proficiency with the Grafana LGTM stack
- Experience with infrastructure end-to-end testing (Python, Pytest)
We offer
We believe that the greatest strength of the company is its people. EPAM is fully committed to helping its employees reach their full potential and achieve their professional goals through continuous learning. With this in mind, we would like to introduce a few of the many opportunities and services that we believe will help you expand your current knowledge:
- Full access to cutting-edge tools and technologies
- Competitive compensation depending on experience and skills
- All-around Social package: professional & soft skills training, medical & family care programs, sports
- Relocation opportunities
- Free English classes
- Unlimited access to LinkedIn learning solutions
- Continuous experience exchange with experts and professionals worldwide
- Friendly team and comfortable working environment
- Engineering, corporate, and social events within and outside the Company
- Flexible working schedule
- Opportunities for self-realization