Senior Site Reliability Engineer, Lithuania
Confirmed live in the last 24 hours
DriveWealth
Job Description
About Us
DriveWealth is on a mission to make investing easier. We believe that everyone should have the ability to control their financial future, and that access to financial markets should not be limited by geography, wealth, or legacy systems. We are a global B2B financial technology organization dedicated to democratizing access to financial independence around the world. Our mission is realized through an API-based platform, empowering our partners to offer seamless investing and trading experiences to clients worldwide, all from their mobile devices. Our technology provides partners with a modern, extensible toolkit, enabling traditional investment workflows and innovative techniques like fractional share ownership. DriveWealth has evolved into a global platform offering trading of US equities, mutual funds, ETFs, fixed income, and options.
There’s never been a better time to build a category-defining business and there has rarely been a team better positioned for this opportunity. Our culture blends the pace and agility of a fintech start-up with the impact, stability, and discipline of Wall Street. We encourage creativity and experimentation while ensuring institutional-grade execution and regulatory compliance in everything we do. Join us and help build the future of global investing!
About The Role
As a Senior Site Reliability Engineer based in Lithuania, you will enhance the reliability and performance of our Brokerage-as-a-Service platform during critical 7/24 operations. This role demands a proactive approach to managing technical challenges and system optimizations aligned with our global operational strategies.
What You’ll Do
- Support the SRE team in developing and implementing enhancements to support workflows, focusing on automation and efficiency improvements
- Handle technical escalations, troubleshoot complex FIX and API connectivity issues, and actively participate in on-call rotations during non-traditional hours to ensure rapid response and resolution
- Adhere to and administer incident and change management policies
- Coordinate incident resolution efforts and implement change management protocols to maintain and enhance system reliability
- Work closely with the Lithuania office to ensure smooth operation and alignment of SRE practices across time zones
- Coordinate Incident Post Mortems and RCA analysis
- Design, implement, and maintain comprehensive monitoring, logging, and tracing solutions (observability stack) to provide deep insights into system performance and user experience
- Partner with product and engineering teams to define clear Service Level Indicators (SLIs) and Service Level Objectives (SLOs), managing error budgets to ensure service reliability meets business needs
What You'll Need
- 3+ years in a senior SRE role or a similar position, demonstrating deep knowledge and expertise in site reliability engineering and operations
- Knowledge of FIX protocol and messages, ability to read FIX logs
- Familiarity with REST APIs and a strong understanding of API integration
- Proficient in Python and scripting for automation and system management, with a proven track record of developing and implementing automation solutions
- Expertise in SQL and transactional databases, including querying and troubleshooting
- Strong analytical and troubleshooting skills with a proven ability to identify and resolve technical issues through root cause analysis
- In-depth knowledge of core networking concepts including TCP/IP, routing, and DNS
- Familiarity with maintaining and troubleshooting systems within both cloud (AWS) and co-location (colo)
- Availability for flexible work hours and willingness to cover US markets trading sessions, including L2 on-call coverage
- Knowledge of change management processes and risk management
Nice to Have, But No Required
- Experience in the brokerage or financial industry
- Proficient with cloud services, particularly AWS, and knowledgeable about cloud architecture best practices, including IAM, EC2, S3, and DynamoDB
- Experience maintaining and supporting containerized systems, with familiarity in orchestration tools
- Knowledge of Infrastructure
Similar Jobs
Caterpillar
Senior Embedded Software Engineer
Caterpillar
Software Developer - Angular
Caterpillar
Software Engineer- Mobile Application Development
Caterpillar
Senior Software Engineer ( Sr .Net Developer)
Fresenius Medical Care
Pflegefachkraft oder Medizinische Fachangestellte (m/w/d) für die Dialyse
Workato