Senior Site Reliability Engineer I – Domain

9 يوليو، 2023

0 0 2 دقائق

Careem is building the Everything App for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million Captains, simplified the lives of over 50 million customers, and built a platform for the region’s best talent to thrive and for entrepreneurs to scale their businesses. Careem operates in over 70 cities across 10 countries, from Morocco to Pakistan.
About The Role
We are looking for engineers who will work within the SRE team to focus on monitoring, automation, improving the reliability of the high scale distributed systems, performance, and availability , and taking a holistic view of system health in addition to enabling Kubernetes and taking cloud-native technology to the next level within Careem. We need expert, execution-focused engineers to help shape the future of the Careem platform and to help us scale our already sizable effort greatly. As an SRE in Careem, you'll architect, build and maintain Careem’s cloud native infrastructure and its corresponding ecosystem required to ensure resilience, reliability of our services and speed up deployments with the aim of improving our products used by millions of customers every day. Key responsibilities include:

Make an impact from design phase, through development and operation of , Cloud Infrastructure and Kubernetes clusters and its ecosystem on AWS
Build core services, tooling and create technical processes that simplify and enable engineers across multiple services
Identifying and automating and scale systems without compromising on security and reliability
Building monitoring that alerts on symptoms rather than on outages
Participate in on-call rotations and help improve incident response

Qualifications

Expertise in architecting, developing, operating and troubleshooting Cloud Infrastructure & Kubernetes clusters and/or other production highly available systems at scale
Good experience with any high level programming language such as Go, Python, Java
Experience with centralized infrastructure automation, IAC And Governance techniques and technologies, Terraform – Terragrunt
Strong Unix or Linux background, including concepts such as processes, network stack, and memory allocation
Experience with cloud-native services on AWS/GCP/Oracle/Azure
Incident response and/or incident management experience
Experience on DevOps topics such as monitoring, CI/CD
Effective communication and collaboration skills: have the ability to drive and promote technical partnerships across teams

What We’ll Provide You
We offer colleagues the opportunity to drive impact in the region while they learn and grow. As a Careem colleague you will be able to:

Work and learn from great minds by joining a community of inspiring colleagues.
Put your passion to work in a purposeful organisation dedicated to creating impact in a region with a lot of untapped potential.
Explore new opportunities to learn and grow every day.
Enjoy the flexibility that comes with the trust of being an owner; work in a hybrid style with a mix of days at the office and at home, and remotely from any country in the world for 30 days a year with unlimited vacation days per year.
Access to healthcare benefits and fitness reimbursements for health activities including: gym, health club and training classes.

للتقدم على الوظيفة

9 يوليو، 2023

0 0 2 دقائق