Sr DevOps Architect
Stem
About Stem - Driven by human and artificial intelligence – Stem is unlocking energy intelligence.
Stem is a global leader reimagining technology to support the energy transition. Turning complexity into clarity, and potential into performance.
We help asset owners, operators and stakeholders benefit from the full value of their energy portfolio by enabling the intelligent development, deployment, and operation of clean energy assets. Our integrated software suite, PowerTrack, is the industry standard and best-in-class for asset monitoring, supported by professional and managed services, under one roof. Meant to tackle challenges as seamlessly as possible, Stem shows the information needed clearly and accurately and helps harness raw data to inform actionable insight. With global projects managed in 55 countries – from Germany to Japan and across North America – customers have relied on Stem for nearly 20 years to maximize the value of their clean energy projects.
Stem’s culture embodies diversity & inclusion beyond the traditional facets of gender, ethnicity, age, disabilities, and sexual orientation to include experience, personality, communication, workstyles, and more. At our core, Stem is at the momentous intersection of clean energy and software technology where diverse ideas, experiences, and professional skills converge to make the inclusive culture we have today. Together, we are turning old school thoughts about software and energy into progressive, collaborative, and innovative solutions. By joining our team, you will be collaborating with data scientists, energy experts, skilled salespeople, thought-leading executives and more from a range of backgrounds. This intersection of ideas, beliefs, and skills is what makes us unique enough to lead the world’s largest network of digitally connected energy storage systems.
What We're Looking For
We are seeking a Senior DevOps Architect to lead the design and evolution of our cloud-native infrastructure. The ideal candidate is a strategic thinker who can drive architectural decisions across multiple platforms and geographies while maintaining hands-on technical expertise. They will have strong technical aptitude, excellent communication skills, and the ability to influence and mentor engineering teams.
This role will lead DevOps architecture with a primary focus on PowerTrack while also driving alignment across the Athena and Locus platforms. A core mission of this position is unifying, standardizing, and consolidating our environments, frameworks, and toolsets to reduce complexity, improve operational efficiency, and enable teams to move faster. You will be responsible for establishing standards, improving observability and reliability, optimizing cloud costs, and ensuring security across our enterprise platforms. Our technology stack includes:
- Languages/Frameworks: Python, Java, C#/.NET
- Databases: DynamoDB, MySQL, MS-SQL, PostgreSQL, MongoDB, InfluxDB, TimescaleDB
- Cloud Platform: AWS
- Observability: Datadog, Grafana, Prometheus, OpenSearch, CloudWatch
Responsibilities
Platform Unification & Standardization
- Drive the consolidation of environments, frameworks, and toolsets across PowerTrack, Athena, and Locus platforms
- Develop and execute a roadmap for platform standardization, reducing technical debt and operational complexity
- Establish unified CI/CD pipelines, deployment patterns, and release processes across teams
- Standardize Infrastructure-as-Code practices, module libraries, and configuration management approaches
- Consolidate observability tooling and establish consistent monitoring, logging, and alerting standards across all platforms
- Define and enforce common security baselines, compliance controls, and operational procedures
- Create reference architectures and golden paths that teams can adopt for common use cases
- Lead migration efforts to move legacy or divergent systems onto standardized platforms
- Document architectural decisions (ADRs) and maintain living documentation for platform standards
Architecture & Leadership
- Lead DevOps architecture strategy across geographies, with primary focus on PowerTrack and collaboration with Athena and Locus platform teams
- Define and drive architectural standards, patterns, and best practices across teams
- Mentor and guide DevOps engineers; conduct architecture reviews and provide technical direction
- Evaluate emerging technology trends and make recommendations to enable evolving business and operating models
- Collaborate with product managers on platform lifecycle decisions including maintenance, modernization, and retirement
- Facilitate evaluation and selection of software products, services, and tooling standards
- Build consensus across teams and drive adoption of unified approaches
Infrastructure & Operations
- Design, deploy, automate, and manage AWS cloud-based production systems ensuring availability, performance, scalability, and security
- Design durable and consistent patterns for distributed systems; recommend architecture and process improvements
- Troubleshoot and solve complex problems across AWS infrastructure and application domains
- Lead incident response for critical issues; conduct blameless post-mortems and drive systemic improvements
- Analyze and resolve complex infrastructure and application deployment issues
Observability, Reliability & Cost Optimization
- Architect comprehensive observability solutions including metrics, centralized logging, and distributed tracing for full-stack visibility
- Design alerting strategies that minimize noise, reduce alert fatigue, and enable rapid incident detection
- Establish SLOs/SLIs and error budgets; drive reliability improvements based on data
- Develop automated remediation workflows and self-healing infrastructure to reduce MTTR
- Analyze cloud spend and architect cost-efficient solutions; drive adoption of Reserved Instances, Savings Plans, right-sizing, and resource lifecycle management
- Build dashboards and reporting for infrastructure cost visibility
- Identify cost savings opportunities through platform consolidation and elimination of redundant tooling
Security & Compliance
- Ensure critical system security using industry-leading cloud security solutions
- Integrate security practices into CI/CD pipelines and infrastructure automation
- Support compliance requirements including NIST, SOC 2, SOX, and FedRAMP
Quality & Delivery
- Oversee pre-production acceptance testing to assure quality of products and services
- Collaborate across functional and technical teams to deliver projects on time per the roadmap
Requirements
- 8+ years of overall experience, with 5+ years in enterprise environments
- 5+ years building and managing cloud platforms supporting large, highly available, enterprise-grade applications
- 5+ years working extensively with AWS technologies (e.g., EC2, EKS, ECS, S3, Redshift, VPC, Glacier, IAM, CloudWatch, SQS, Lambda, CloudTrail, Systems Manager, KMS, Kinesis) with emphasis on the AWS Well-Architected Framework
- Demonstrated experience leading platform consolidation, standardization, or modernization initiatives across multiple teams or business units
- Proven ability to build consensus and drive adoption of unified tooling and practices in organizations with diverse or legacy systems
- Demonstrated experience leading architectural decisions and driving technical strategy across teams
- Strong experience implementing enterprise observability solutions including metrics, logging, and distributed tracing (e.g., OpenTelemetry, Jaeger, X-Ray)
- Proven ability to design effective alerting systems, establish SLOs/SLIs, and drive reliability improvements
- Track record of identifying and implementing AWS cost optimization strategies
- Strong Infrastructure-as-Code expertise using Terraform, Ansible, Python, and Shell scripting
- Hands-on experience with containerization and orchestration (Docker, Kubernetes, AWS EKS, ECS)
- Solid experience in 24x7 production AWS environments including CI/CD pipelines (Jenkins, AWS CodePipeline, GitLab CI, etc.)
- Strong understanding of Site Reliability Engineering principles, error budgets, and chaos engineering
- Linux and Windows server administration
- Experience with observability platforms (Datadog, Grafana, Prometheus, OpenSearch/Elastic Stack, CloudWatch, PagerDuty)
- Understanding of network topologies and protocols (DNS, HTTP/HTTPS, SSH, SFTP, SMTP)
- Experience with IT compliance and risk management frameworks (NIST, SOC 2, SOX, FedRAMP)
- Excellent communication and influencing skills; ability to collaborate with client IT organizations and drive technical decisions across organizational boundaries
Preferred Qualifications
- AWS Solutions Architect Professional certification
- FinOps certification or demonstrated expertise in cloud financial management
- Experience with AIOps or ML-driven anomaly detection
- Experience architecting multi-region or hybrid cloud environments
- Background in IoT platforms and edge computing architectures
- Experience with platform engineering and internal developer platforms (IDPs)
Salary Range
$140,960.00 - $211,440.00What We Offer:
At Stem, you will work in a growing, innovative, mission-driven company with talented colleagues that have a passion for building renewable energy systems. Stem offers competitive compensation as well as a comprehensive set of benefits to support the health and wellness of our employee including:
- A competitive compensation package, including eligibility for a bonus or commission based on the role, and equity
- Full health benefits on the first day of employment (several medical plan options-HDHP and PPO, dental plans, FSA/HSA-with employer contribution, employer paid vision/LTD/STD/Life, variety of voluntary coverage)
- 401k (pre- or post-tax) on first day of employment
- 12 paid calendar holidays per year
- Flexible time-off
Learn More
To learn more about Stem, visit our stem.com where you’ll find information about our solutions, technology, partners, case studies, resources, latest news and more. Here are some relevant links:
Stem, Inc. is an equal opportunity employer committed to diversity in the workplace and does not discriminate against any employee or applicant for employment because of race, color, sex, pregnancy, religion, national origin, ethnicity, citizenship, sexual orientation, gender identity, age, marital status, disability, genetic information, military status, protected veteran status or any other factor protected by applicable federal, state or local laws.