Overview
Cloud Platform Site Reliability Engineer – Barings.
We are seeking a highly motivated and skilled professional to design, implement, and maintain Cloud infrastructure solutions for enterprise-level organizations.
The role combines cloud engineering and operations with a focus on reliability, performance, monitoring, security, and cloud platform management.
The Cloud Platform Site Reliability Engineer will work with application and Data Platform Teams to design, implement and maintain the company's cloud platform, applying software engineering principles to infrastructure and operations.
Responsibilities
Support and maintenance of business-critical applications.
Collaborate with application and data teams to implement robust monitoring and automation to reduce service outages and eliminate manual toil.
Collaborate with Infrastructure teams, including architecture and engineering, to implement scalable and highly automated solutions.
Develop automation processes using scripting and automation tools.
Maintain Barings alert and notification systems to detect failures and notify appropriate teams with service level objectives aligned with business goals.
Troubleshoot complex issues by analyzing logs, network traffic, and application configurations.
Identify and recommend improvements to enhance performance, security, or reliability.
Work with application support teams to ensure application services are compliant and running efficiently.
Participate in the migration from on-premises to cloud-native services, including design, testing, and operations.
Create documentation and artifacts to assist knowledge transfer within the team.
Provide comprehensive support for all server and infrastructure resources, including on-premises and cloud-based platforms, ensuring optimal performance, availability, and security.
Stay updated with industry advancements.
This role may occasionally require weekend work, depending on business needs or project timelines.
Qualifications
5+ years of experience as a Cloud Engineer or Cloud SRE with demonstrable skills in Azure.
Proven ability to manage hybrid infrastructure environments (on-premises and Microsoft Azure).
Experience implementing and supporting backup, archival, and disaster recovery services in hybrid environments.
Manage and optimize resource provisioning across hybrid platforms for cost efficiency.
Experience with Azure Policy for compliance.
Strong understanding of TCP/IP networking concepts and protocols (e.g., FTPS, SSH/SFTP, TLS).
Experience with scripting languages (Bash, Python, or PowerShell).
Experience with logging and monitoring tools across hybrid infrastructure to ensure services are available and within SLOs.
Experience working in a highly automated financial services environment.
Administer and optimize Microsoft Windows Server environments (Active Directory, DNS, DHCP, Group Policy).
Strong experience in building fault-tolerant, scalable, and secure systems.
Ability to learn new technologies and lead technological change.
Strong oral and written communication skills.
Production systems support experience.
Troubleshooting skills with ability to perform root cause analysis.
A continuous learning mindset.
Beneficial
Experience with Nutanix Hyperconverged technologies
Experience in AWS and GCP cloud technologies
Requisite Skills / Additional Skills
Barings is an Equal Employment Opportunity employer; Minority/Female/Age/Sexual Orientation/Gender Identity/Individual with Disability/Protected Veteran.
We welcome all persons to apply.
#J-18808-Ljbffr