Descripción de la oferta
You're the engineer who maintains uptime across 50+ SaaS products when no one else has the answer. We need DevOps professionals who can step into unfamiliar AWS environments, restore order from disorder, and drive availability beyond 99.9% using proven monitoring, automation, and root-cause analysis. You'll break down complex projects into single-day tasks, deliver production-ready Python or JavaScript, and leverage AI as a force multiplier. Most organizations talk about "cloud maturity" while manually nursing individual systems. We're building industrial-grade reliability across a portfolio of acquired products where original teams have departed and documentation is incomplete. The challenge: you'll deploy agents and current tooling to explore unfamiliar systems 5–10x faster, document your findings, and automate fixes so repeat incidents become impossible. Rather than judging you on certifications and vendor badges, we'll observe how you troubleshoot in real time, author a genuine 5‑Whys that isolates one preventable root cause, and construct automations that hold up under production load. This is not a tier‑two "execute the runbook" position. Here, you author the runbooks, architect the deployment path from dev through staged to 10% and full rollout with soak periods and rollback conditions, and configure the monitoring that surfaces corner cases. You block risky changes before they reach production. You distinguish infrastructure failures under your domain from application bugs owned by Engineering, and you route permanent remediations to the correct team. You'll operate at the engineering heart of reliability, managing infrastructure initiatives, incident triage and RCAs, and change requests backed by copy‑paste‑ready runbooks. If you've already shepherded a serious SaaS platform and want to extend that rigor across an entire fleet, this is your opportunity. Bring expert AWS knowledge, production‑quality coding ability, uncompromising scope discipline, and daily, mission‑critical use of AI tooling. If you're prepared to safeguard operations, apply now. What You Will Be Doing Leading complex infrastructure migrations, consolidations, production‑grade automation builds, and monitoring enhancements Investigating production incidents, deploying immediate remediations, and producing root cause analyses that assign lasting fixes to the appropriate teams Authoring, reviewing, and executing production changes, including assessing whether a proposed change meets safety criteria What You Won’t Be Doing Spending your day in Jira and status calls - we reward engineers who deliver solutions, not those who merely document problems Keeping legacy systems alive indefinitely - you'll be authorized to pursue substantive improvements Waiting for multi‑layer approval processes - you'll possess the authority to deploy immediate fixes during incidents Cloud Architect Key Responsibilities Champion reliability and standardization of cloud infrastructure across our expanding product portfolio by deploying comprehensive monitoring, automation, and AWS best practices. Basic Requirements Deep AWS infrastructure expertise (this is our primary platform - other cloud experience alone won't cut it) Experience managing production infrastructure at a scale of 1,000+ containers Experience scripting with Python and Bash for day‑to‑day administration operations Experience managing and migrating production databases with multiple engines (including MySql, Postgres, Oracle, MS‑SQL) Experience with infrastructure automation (Terraform, Ansible, or CloudFormation) About Trilogy Hundreds of software businesses run on the Trilogy Business Platform. For three decades, Trilogy has been known for relentlessly seeking top talent, innovating new technology, and incubating new businesses. Our technological innovation is spearheaded by a passion for simple customer‑facing designs. Our incubation of new businesses ranges from entirely new moon‑shot ideas to rearchitecting existing projects for today's modern cloud‑based stack. Working with us This is a full‑time (40 hours per week), long‑term position. The position is immediately available and requires entering into an independent contractor agreement with Crossover as a Contractor of Record. The compensation level for this role is $50 USD/hour, which equates to $100,000 USD/year assuming 40 hours per week and 50 weeks per year. The payment period is weekly. Consult crossover.com/help-and-faqs for more details on this topic.
#J-18808-Ljbffr