We are seeking enthusiastic and motivated Site Reliability Engineers who possess a solid foundation in Software Engineering and a keen interest in Infrastructure and Security. This role is ideal for engineers who understand that reliability, scalability, and security are inseparable pillars of modern infrastructure. You'll work in an environment that values continuous learning and technical excellence.
Key Responsibilities :
- Design, build, and maintain secure, scalable, and highly available infrastructure in cloud environments, ensuring security is integrated at every layer.
- Manage the development and deployment of software releases with zero-downtime strategies, implementing comprehensive security controls throughout the deployment pipeline.
- Monitor and optimize system performance, proactively identifying security vulnerabilities, performance bottlenecks, and reliability issues before they impact production.
- Support, troubleshoot, and debug application and infrastructure issues.
- Implement security hardening measures across the infrastructure stack, including network security, access controls, secrets management, and vulnerability remediation.
- Develop robust infrastructure-as-code solutions and automation tools that embed security best practices by default.
- Drive incident response efforts, conduct thorough post-mortems, and implement preventive measures to enhance both reliability and security posture.
- Participate in on-call rotation, serving as an escalation point for complex infrastructure and security incidents.
- Implement security initiatives including threat modeling, security audits, compliance requirements, and implementation of defense-in-depth strategies.
- Help team members on implementing security best practices, infrastructure patterns, and operational excellence.
- Automate operational processes and develop the infrastructure with theinfrastructure-as-code approach to improve workflow efficiency.
Qualifications :
- Bachelor's degree in Computer Science, Engineering, or related field OR equivalent practical experience demonstrating advanced technical capabilities.
- Proven track record in site reliability, DevOps, or infrastructure engineering roles with increasing responsibility.
- Strong software engineering foundation with expertise in building production-grade systems.
- Deep understanding of security principles including authentication, authorization, encryption, network security, and common attack vectors.
- Solid knowledge of web application security, OWASP Top 10, and practical experience implementing security controls (OWASP WSTG knowledge highly valued).
- Experience with databases (PostgreSQL preferred), computer networking, web servers, and Linux system administration.
- Production experience with containerization technologies, with strong understanding of container security.
- Hands-on experience with Kubernetes in production environments; candidates with demonstrable expertise will stand out, while strong engineers eager to deepen their K8s knowledge are also encouraged to apply.
- Proficiency in at least one scripting/programming language, with Python or Go preferred; Django framework experience is a significant plus.
- Experience implementing and maintaining CI/CD pipelines using GitLab or similar platforms, with emphasis on secure software delivery practices.
- Strong analytical and debugging skills with the ability to troubleshoot complex, distributed systems under pressure.
- Experience with scripting languages, preferably Python or Go, and familiarity with Django is a plus.
- Experience with GitLab or similar CI/CD tools is highly desirable.
- Excellent communication skills and collaborative mindset, with ability to work effectively across engineering, security, and product teams.