
DevOps Engineer
About ThriveCart:
ThriveCart is the leading no-code sales platform for digital course creators, coaches, entrepreneurs, and online businesses looking to boost revenue, drive conversions, and scale audiences. ThriveCart powers over 65,000 businesses and 12 million enrolled students, generating over $2 billion in annual sales. The platform provides all the tools businesses need to create high-converting checkout experiences, manage powerful affiliate campaigns, and deliver seamless student experiences with its built-in learning management system, Learn/Learn+.
Location:
Remote, in the EU or South America
You must have full authorization to work in your country of residence, which must be in the EU or South America.
The Role
We're looking for a capable, hands-on DevOps engineer who loves being in the trenches. You'll spend your days building CloudWatch dashboards, writing Terraform code, investigating production issues, and making our infrastructure more observable and reliable.
This is NOT a planning or coordination role - it's about doing the technical work that keeps a revenue-generating platform running smoothly.
What you'll do:
Build monitoring systems - Create CloudWatch dashboards and alarms that catch real problems (payment failures, database issues, API errors) before customers notice
Write infrastructure code - Use Terraform to manage AWS resources (EC2, RDS, CloudWatch, IAM) with code review and automation
Improve deployments - Make releases faster and safer through CI/CD improvements and automated rollbacks
Handle production issues - Investigate anomalies, respond to alerts, find root causes, and prevent future occurrences
Analyze performance - Identify bottlenecks in payment processing, database queries, and API responses
What We're Looking For
Required Experience (3-6 years)
You must have:
⭐ Terraform production experience - managing real infrastructure with IaC
⭐ AWS CloudWatch experience - building dashboards, configuring alarms, writing custom metrics
Production operations - supporting high-traffic web applications where downtime costs money
Database operations - monitoring, performance tuning, and backups for database systems
Docker & containers - Docker Compose, multi-container applications
Linux system administration - comfortable with shell scripting and troubleshooting
Web servers - Nginx or Apache configuration
Nice to Have
E-commerce or payment processing infrastructure experience
PCI compliance knowledge
Multi-database architecture experience
On-call experience for revenue-critical systems
CI/CD pipeline construction
Personal Qualities
Proactive - You spot problems before they become incidents
Systematic - You troubleshoot methodically with data, not guesses
Clear communicator - You explain complex issues in simple terms
Detail-oriented - You catch edge cases and think about failure modes
Calm under pressure - Production incidents don't rattle you
Documentation-focused - You write runbooks because you know future-you will thank present-you
Technical Environment
Infrastructure you'll work with:
AWS CloudWatch
Terraform for infrastructure as code
Docker for development environments
MariaDB 10.11 (multi-database architecture)
PHP & Node/TypeScript application stack
Nginx web server
Memcached caching
AWS: EC2, RDS, Route53, IAM
You'll be responsible for:
Production monitoring and alerting
Infrastructure as code (Terraform)
CI/CD pipeline improvements
Database operations and optimization
Container orchestration
Security and compliance support
What Makes This Role Different
High impact - Your observability work will directly affect how fast we find and fix issues
Real stakes - When payments fail, merchants lose money. Your work matters.
Hands-on - You'll write code and build systems, not just coordinate or plan
Complex system - Multi-database, legacy + modern, payment processing - interesting technical challenges
Visibility - Your work is immediately visible to the entire engineering team
What Success Looks Like
In 30 days:
You've responded to your first production alert
You understand the architecture and data flows
You've identified gaps in our current monitoring
In 90 days:
You've built 2+ dashboards the team relies on
You've implemented 5+ alarms covering previous blind spots
You handle production incidents independently
In 6 months:
You've proactively prevented multiple production issues
The team depends on your monitoring systems
You've led an infrastructure improvement project
You're comfortable on the on-call rotation
On-Call
Shared rotation (about 1 week per month)
Focus on critical alerts (payment processing, database issues)
Most scenarios have documented runbooks
Team backup available for complex issues
Compensated with time off or additional pay
Why Join ThriveCart
✅ Direct impact - Your work directly affects revenue and reliability
✅ Modern practices - Terraform, IaC, CloudWatch, Docker
✅ Interesting problems - Complex distributed systems with real-world challenges
✅ Learning opportunities - Diverse tech stack (legacy + modern)
✅ Collaborative team - Work with experienced engineers
✅ Remote flexibility - Fully remote role
Our Values are:
Commit to Excellence
Find a Way
Help Each Other
Keep Growing
Data First
Don't wait, tomorrow could be too late.
