Site Reliability Engineering Manager
Task information:
As a Manager of Site Reliability Engineering based in Lithuania, you will oversee our Brokerage-as-a-Service platform's reliability and operational efficiency during critical end-of-day and start-of-day operations. You will lead a team that is an extension of our New York office, ensuring seamless integration and continuous operational coverage across time zones.
What You’ll Do
- Lead the Site Reliability Engineering team to enhance support workflows using ticketing systems and tools
- Manage and mentor a team of SREs, facilitating collaboration between internal product, engineering, and client-facing teams
- Oversee partner escalations and ensure operational stability, including monitoring partner channels, troubleshooting issues, and coordinating with partners on remediation
- Adhere to the DriveWealth Incident Management Policy for incident resolution and documentation, working with the engineering team on ongoing product improvements
- Oversee the entire incident response lifecycle
- Administer the DriveWealth Change Management Policy to ensure minimal disruption to services
- Collaborate closely with the SRE team and other teams operating in the Eastern Standard Time zone to align on strategic initiatives and daily operations, ensuring adherence to global standards and practices. This includes monitoring critical operations outside conventional hours to support our global platform’s scalability and reliability
Requirements:
What You’ll Need:
- 5+ years of experience in software engineering or site reliability engineering
- 3+ years of proven leadership experience and the ability to manage teams
- Availability for flexible work hours and willingness to cover US market trading sessions
- Expertise in incident and change management processes
- Knowledge of alerting and automation frameworks
Nice to Have:
- Strong background in technical cloud services, particularly AWS, including expertise with IAM, EC2, S3, and DynamoDB
- Experience with Infrastructure as Code (IaC) tools such as Terraform or CloudFormation
- Experience with job orchestration and scheduling tools such as Apache Airflow or Rundeck
- Experience maintaining and supporting containerized systems using Kubernetes and OpenShift
- Knowledge of Confluent Cloud for managing Kafka streams in a production environment
- Scripting capability in Python or similar languages
- Experience with SQL and transactional database querying
Company offers:
- Monthly Fitness/Wellness Reimbursement: €70 per month expense
- Medical Reimbursement: €100 per month expense
- Professional Development: €2,300 per year expense
- Vacation: 20 days annual leave per year
- Parental Leave: Statutory leave (required by law)
- Employee Referral Program: Eligible to receive a €900 referral bonus per referral policy
- Hybrid work arrangement that allows for flexibility
Contacts
Contact person:
Reda Maumevičienė
Address:
K. Donelaičio str. 62-320, BLC Business Centre, Kaunas, Lithuania
Confidentiality guaranteed. Only selected candidates will be informed.