
Senior DevOps Engineer
Rain is the world's first AI Financial Health Platform, serving 3.5 million employees at leading organizations like McDonald's, Marriott, and T-Mobile. Rain works in the background to optimize every employee's financial life to prevent shortfalls and build long-term stability. Backed by top investors including QED and Prosus, Rain has raised $150M in venture funding to fuel our next stage of hypergrowth.
As a Senior DevOps Engineer at Rain, you will play a central role in designing, building, and operating our cloud infrastructure as we continue to scale to millions of users globally. You will work alongside a small, high-performing cloud team to drive automation, improve observability, and ensure the reliability and security of our platform. This role goes beyond keeping the lights on — you will actively shape how we build and operate infrastructure and influence architectural decisions.
What You’ll Do
Design, build, and maintain scalable, secure cloud infrastructure on AWS, managed as Infrastructure as Code (IaC) with Terraform and Terragrunt
Manage and evolve our Kubernetes (EKS) clusters — including node group management, autoscaling with Karpenter, and workload reliability
Own and improve our CI/CD pipelines (GitLab CI), ensuring fast, reliable, and secure delivery
Drive observability initiatives: metrics, logging, alerting, and dashboards using Prometheus, Grafana, and related tooling
Support and evolve our Kafka infrastructure in collaboration with backend engineering teams
Champion infrastructure-as-code practices, ensuring consistent, reviewed, and well-documented changes
Respond to production incidents, lead post-mortems, and drive improvements in incident response processes
Collaborate with backend, security, and product engineering teams to support their infrastructure needs
Leverage AI-assisted tooling (e.g., GitHub Copilot, AI-powered incident analysis, LLM-based automation) to increase productivity and quality
Who You Are
You bring 5+ years of experience managing large-scale production environments and aren't afraid of architectural complexity
You have a "code everything" mindset, replacing manual tasks with scalable, DRY Infrastructure as Code (IaC)
You understand the "why" behind Kubernetes internals and cloud networking, not just the "how" of deployment
You communicate complex infrastructure concepts clearly to both engineering peers and business stakeholders
You treat security, secrets management, and observability as core features, not afterthoughts
Required Technical Qualifications
Advanced AWS & EKS: Deep proficiency in EC2, RDS, S3, IAM, and VPC networking, specifically within multi-account EKS environments
Kubernetes Internals: Hands-on experience with CNI, RBAC, Affinity/Taints, and managing complex workloads (StatefulSets/DaemonSets)
IaC Mastery: Proven ability to scale infrastructure using Terraform and Terragrunt with modular, reusable patterns
CI/CD & Helm: Expertise in designing secure GitLab CI pipelines and managing versioned Helm charts across environments
Observability: Proficiency in building dashboards and alerting logic using Prometheus and Grafana
Linux & Scripting: Strong Bash skills for environment management (Python proficiency is a significant plus)
Diversity, Equity and Inclusion Commitments
As part of our dedication to the diversity of our workforce, Rain is committed to Equal Employment Opportunity and does not discriminate based on race, religion, color, national origin, ethnicity, gender, sex (including pregnancy), protected veteran status, age, disability, sexual orientation, gender identity, gender expression, or any unlawful criterion existing under applicable federal, state, or local laws. If you need assistance or accommodation due to a disability, you may contact us at HR-US@rain.us.
What’s Next
Ensuring a smooth and enjoyable candidate experience is critical for us. Our interview process tends to take about 4 weeks to complete, but may fluctuate depending on the role. Learn more about our hiring process here. Don’t be afraid to let us know if you need more flexibility.
