About the Role:
This is an on-site, full-time position. We are seeking a talented DevOps Engineer to join our team and help streamline our deployment processes. The ideal candidate will have a strong background in system administration and automation, with a passion for building scalable and reliable infrastructure.
Responsibilities:
- Design, implement, and maintain automated deployment pipelines to ensure fast and reliable delivery of software.
- Collaborate with development teams to optimize application performance and ensure scalability, reliability, and security.
- Manage and maintain existing infrastructure, mostly based on Linux environments.
- Monitor system health and performance, troubleshooting issues as they arise.
- Implement and manage containerization and orchestration technologies such as Docker and Kubernetes.
- Continuously research and evaluate new tools and technologies to improve DevOps processes.
Qualifications & Experience:
- 5+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
- Strong coding skills in Bash for automation and scripting.
- Proven experience in designing, deploying, and maintaining Kubernetes clusters in production environments.
- Familiarity with Rancher Labs products such as RKE and Rancher.
- Proficiency in Linux (Ubuntu) system administration, including performance tuning, security hardening, and troubleshooting.
- Strong knowledge of the Elastic Stack (Elasticsearch, Logstash, Kibana, Beats family, APM).
- Hands-on experience with monitoring tools like Zabbix and Prometheus.
- Strong mindset in automation using tools such as Terraform and Ansible.
- Familiarity with load balancing solutions, such as HAProxy.
- Experience with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI).
- Experience in setting up and configuring Apache Kafka for large-scale data streaming and processing.
- Deployment and management of Redis for caching and messaging systems using Kubernetes.
- Proficient in configuring and maintaining PostgreSQL for relational databases and scalable operations.
Preferred Qualifications:
- Strong background in distributed systems and database reliability engineering, including PostgreSQL and MongoDB.
- Familiarity with Rancher Labs products, including RKE, RKE2, and K3s.
- Knowledge of chaos engineering and fault injection testing practices.
- Comfortable working with Helm charts for Kubernetes deployment.
Why Join Us?
- Work with the latest cloud and DevOps technologies in a high-performance, scalable environment.
- Thrive in a culture that prioritizes continuous learning, innovation, and an automation-first mindset.
- Enjoy a competitive salary and benefits package, including comprehensive insurance coverage (with complementary insurance).
- Take ownership of high-impact reliability projects that influence the entire organization.