Sr. Site Reliability Engineer - CPO
Addepar
Job Description
Who We Are
Addepar is a global data and AI platform empowering investment professionals to turn complex financial information into actionable intelligence. Addepar unifies portfolio, market and client data in a total portfolio view and delivers AI-powered insights within investment and client workflows. More than 1,400 firms in nearly 60 countries use Addepar to manage and advise on nearly $9 trillion in assets. Its open platform integrates with nearly 650 software, data and consulting partners to power end-to-end investment operations across firms of all sizes and complexity. Addepar supports clients worldwide with offices in New York City, Salt Lake City, London, Edinburgh, Pune, Dubai, Geneva and São Paulo.
The Role
We are looking to add a highly experienced and impactful colleague to the organization to drive the transformation of Addepar’s Production Engineering and SRE team. This role focuses on evolving our platform towards enabling high-level declarative infrastructure orchestration and its operations. This platform closely integrates our Compute, Network, and Storage control planes, allowing us to develop highly efficient and fast-to-iterate-on services tailored to various product areas within the company, abstracting our developers from the nuances of underlying infrastructure.
The ideal candidate will play a senior leading role in implementing, maintaining, and strategically evolving Addepar’s Production Infrastructure. You will combine a track record of leading innovative solutions across functional teams with extensive hands-on development experience in AWS/cloud, Linux/Unix, networking, advanced scripting, containerization, Kubernetes, Terraform, Information Security, deep debugging, and comprehensive monitoring/observability. This includes designing, deploying, monitoring, automating, and optimizing all operational aspects of Addepar's platform with a focus on reliability, scalability, and efficiency.
Applicants must have legal authorization to work in the country where this role is based on the first day of employment. Visa sponsorship is not available for this position.
What You’ll Do
- Lead the design, implementation, and operationalization of container infrastructure using Kubernetes (k8s), ensuring high availability, performance, and security
- Build and maintain advanced, automated CI/CD pipelines using Jenkins, ArgoCD, AWS CodeBuild/Pipeline, GitHub Actions, or similar, establishing best practices for deployment strategies (e.g., blue/green, canary)
- Drive the adoption and evangelism of Infrastructure as Code (IaC) principles using Terraform, focusing on scaling the Addepar Platform across regions with a focus on cost optimization and operational efficiency
- Develop deep application-level knowledge to proactively inform and influence infrastructure requirements and constraints for Developers, QA, and Management, including implementing sophisticated dashboards for Cost and Inventory management, performance analysis, and capacity planning
- Perform advanced monitoring and troubleshooting of our infrastructure and application stack using a wide array of logging/monitoring tools, driving root cause analysis and implementing preventative measures
- Initiate and lead collaborations with cross-functional teams to identify and resolve complex application or infrastructure issues, serving as a technical subject matter expert
- Serve as a primary on-call responder for critical incidents, demonstrating strong problem-solving skills under pressure and contributing to post-incident reviews to improve system resilience
Who You Are
- Extensive, progressive experience in the SRE/DevOps/Systems Engineering field, with a track record of taking on increasing responsibility
- Expert-level understanding of Cloud Infrastructure fundamentals (AWS preferred), including advanced networking, security, and managed services
- Exceptional Programming/Scripting skills in various common languages (Python, Bash, and general Linux tools are essential; Java is a strong plus), with an emphasis on building scalable, maintainable automation and tools
- Broad expertise with UNIX/BSD/Linux internals (Ubuntu preferred), including performance tuning, kernel-level debugging, and advanced system administration
- Extensive Containerization experience with k8s (KOPS, EKS, ECS preferred), including cluster management, custom resource definitions (CRDs), and advanced deployment strategies
- Proficiency with comprehensive monitoring, logging, and alerting tools such as Prometheus, Grafana, Sentry, Sumo Logic, or advanced AWS cloud-native tools, with a focus on observability strategy
- Excellent interpersonal and communication skills to effectively collaborate with multi-functional teams, articulate complex technical concepts, and influence outcomes
- Demonstrable experience writing and contributing to significant systems automation tooling or open-source projects is a strong plus
- Exposure to industry practices in financial services is a plus
Must-have Skills:
- Extensive experience with Java, Python, Go, or similar, particularly in developing robust automation and platform components.
- Deep and proven expertise with Terraform and Infrastructure as Code (IaC) in large-scale, complex cloud environments, including best practices for modularity, reusability, and state management.
- Demonstrated experience designing, building & operating highly reliable, fault-tolerant distributed systems in a cloud environment (AWS preferred), including resilience patterns and disaster recovery.
- Exceptional passion for technology, pragmatic thinking, and a proven ability to independently navigate ambiguous areas, define solutions, and break down and solve highly complex, cross-functional problems.
- Strong understanding of system design principles and the ability to influence architectural decisions for large-scale, highly available systems.
Our Values
- Act Like an Owner - Think and operate with intention, purpose and care. Own outcomes.
- Build Together - Collaborate to unlock the best solutions. Deliver lasting value.
- Champion Our Clients - Exceed client expectations. Our clients’ success is our success.
- Drive Innovation - Be bold and unconstrained in problem solving. Transform the industry.
- Embrace Learning - Engage our community to broaden our perspective. Bring a growth mindset.
In addition to our core values, Addepar is proud to be an equal opportunity employer. We seek to bring together diverse ideas, experiences, skill sets, perspectives, backgrounds and identities to drive innovative solutions. We commit to promoting a welcoming environment where inclusion and belonging are held as a shared responsibility.
We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
PHISHING SCAM WARNING: Addepar is among several companies recently made aware of a phishing scam involving con artists posing as hiring managers recruiting via email, text and social media. The imposters are creating misleading email accounts, conducting remote “interviews,” and making fake job offers in order to collect personal and financial information from unsuspecting individuals. Please be aware that no job offers will be made from Addepar without a formal interview process. Additionally, Addepar will not ask you to purchase equipment or supplies as part of your onboarding process. If you have any questions, please reach out to ta-operations@addepar.com.