Runtime Platforms
We develop and support runtime platforms and services for our engineers worldwide, enabling them to run workloads across various compute platforms.
As part of the firm's strategic move to increase the usage of containerization to package, deliver, and run software, the runtime team plays a critical role in ensuring that we offer container platforms that are
secure, reliable, and operationally inexpensive .
This allows business teams to focus on delivering value to shareholders.
The runtime team designs, builds, and operates large-scale scheduling and compute platforms supporting the firm's most critical businesses and services, both on-premises and in public cloud.
The Kubernetes Team
The Kubernetes team offers a managed multi-tenant solution comprising redundant clusters in various data centers worldwide.
The platform includes many enhancements to enable tighter integration with the Goldman Sachs technology ecosystem, designed to reduce toil and operational overhead for tenants.
These include third-party technologies and bespoke services built by the Kubernetes team.
Given the presence of many business-critical and sensitive applications on Kubernetes, there is a strong focus on reliability engineering to ensure our SLOs are met.
As a financial services institution, we enforce stringent security requirements, both as good practice and regulatory obligation, which must not add to operational overhead for consumers.
Our Culture
At Goldman Sachs, our culture emphasizes teamwork, innovation, and meritocracy.
We believe our people are our greatest asset and support each colleague both professionally and personally.
You will join a talented, globally distributed team passionate about delivering the best experience to our users.
What You Will Do
Design and implement enhancements for the platform
Improve the reliability and monitoring of existing services
Upgrade cluster lifecycle management workflows and tooling
Collaborate with tech risk teams to enforce security controls
Essential Skills
Programming and scripting experience, preferably in Golang and/or shell scripting
Good understanding of Linux and containers
Understanding of Kubernetes architecture and minimum of 1 year (for Analyst) or 3 years (for Associate) of hands-on experience managing or using Kubernetes
Knowledge of common network protocols such as gRPC, TCP, DNS
Experience with reliability engineering practices and tooling, e.g., Prometheus, Grafana
Familiarity with security protocols
Experience with CI/CD pipelines using Git
#J-18808-Ljbffr