**Job Description**
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence.
Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services.
Design and develop architectures, standards, and methods for large-scale distributed systems.
Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.
As a Principal Engineer within NRE, you will be responsible for ensuring the reliability, scalability, and security of OCI's network infrastructure.
You will apply engineering principles to measure and automate the network’s reliability, aligning it with Oracle’s service-level objectives.
This role will involve resolving complex network issues, collaborating across teams, and driving automation efforts that enhance the overall operational efficiency of the OCI network.
You'll work with a team dedicated to proactively preventing network disruptions, performing root-cause analysis, and delivering innovative solutions that ensure the smooth operation of a global network environment.
**Lead Network Reliability Efforts** : Develop, automate, and optimize network services that ensure high availability and performance across OCI’s global infrastructure.
**Network Lifecycle Management** : Drive key programs to manage and maintain the network lifecycle, defining objectives and coordinating delivery milestones to meet organizational goals.
**Troubleshoot and Resolve Complex Network Issues** : Serve as the technical expert for network events, providing Tier 2 support and leading efforts to quickly restore services.
**Drive Automation** : Develop scripts and automation tools to improve operational efficiency, reduce manual interventions, and support a rapidly evolving network environment.
**Collaborate Across Teams** : Work closely with cross-functional teams—including engineering, product, and vendor partners—to design, implement, and optimize network solutions that meet the needs of both the business and end-users.
**Mentor and Lead** : Provide technical leadership and mentorship to junior engineers, helping them develop their skills and grow within the organization.
**Innovate and Influence** : Contribute to the roadmap for new network technologies, tools, and methodologies that enhance OCI’s network performance and reliability.
**What You’ll Need to Succeed:**
**Technical Expertise** : Extensive experience in network engineering, with a strong background in protocols like **MPLS, BGP, OSPF, IS-IS, TCP/IP, IPv4, IPv6, DNS**, and **DHCP**.
Experience with **VXLAN**, **EVPN**, and **SDN technologies** is a plus.
**Automation Skills** : Proficiency in scripting or programming, ideally with **Python**, to develop solutions that automate network operations and troubleshooting.
**Deep Understanding of Networking** : Strong knowledge of networking protocols, monitoring tools, telemetry solutions, and network modeling techniques (e.g., **YANG, OpenConfig, NETCONF**).
**Experience in Cloud or ISP Environments** : Proven track record in large-scale cloud or ISP network environments, ideally supporting complex, multi-cloud infrastructures.
**Problem-Solving Mindset** : Excellent analytical and troubleshooting skills, with a focus on proactive identification and resolution of network issues.
**Collaboration and Leadership** : Ability to work effectively in a fast-paced, cross-functional team environment.
Experience leading technical teams or projects is highly desirable.
**Preferred Experience:**
Experience with **network modeling** and **automation frameworks** for large-scale networks.
Familiarity with **cloud-native network architectures** and modern network management tools.
Experience with **network monitoring**, **telemetry** systems, and **telemetry-based decision-making**.
**Responsibilities**
Work with the Site Reliability Engineering (SRE) team on shared full-stack ownership of a collection of services and/or technology areas.
Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services.
Take responsibility for the design and delivery of the mission-critical stack, with a focus on security, resiliency, scale, and performance.
Serve as the authority for end-to-end performance and operability.
Partner with development teams in defining and implementing improvements in service architecture.
Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio.
Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack.
Demonstrate clear understanding of automation and orchestration principles.
Act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs).
Use a deep understanding of service topologies and their dependencies to troubleshoot issues and define mitigations.
Understand and explain the effect of product architecture decisions on distributed systems.
Bring professional curiosity and a desire to develop a deep understanding of services and technologies.
Career Level - IC4
**About Us**
As a world leader in cloud solutions, Oracle uses tomorrow’s technology to tackle today’s challenges.
We’ve partnered with industry leaders in almost every sector, and we continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute.
That’s why we’re committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes.
We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options.
We also encourage employees to give back to their communities through our volunteer programs.
We’re committed to including people with disabilities at all stages of the employment process.
If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling +1 888 404 2494 in the United States.
Oracle is an Equal Employment Opportunity Employer.
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law.
Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.