Overview
The System Administrator/Engineer (SA/E) is responsible for effective provisioning, installation/configuration, operation, and maintenance of systems hardware and software and related infrastructure.
This individual participates in technical research and development to enable continuing innovation within the infrastructure.
This individual ensures that system hardware, operating systems, software systems, and related procedures adhere to organizational values, enabling staff, volunteers, and Partners.
Responsibilities
- Infrastructure Management: Configure, deploy, and manage cloud resources like virtual machines, storage, and databases.
- Security: Implement and enforce security policies, monitor for threats, and manage user access controls.
- Performance Optimization: Monitor system performance, identify bottlenecks, and optimize resource utilization.
- Troubleshooting: Diagnose and resolve technical issues with cloud infrastructure and applications.
- Automation: Automate routine tasks using scripting and automation tools.
- Collaboration: Work with other IT teams, including development, operations, and security, to support cloud deployments.
- Compliance: Ensure compliance with relevant regulations and industry standards.
- Cost Optimization: Identify and implement cost-saving measures for cloud resources.
- Software and System Deployment: Plan and deploy new application software upgrades and reports as required.
- Incident Management: Perform incident management, vendor management and reporting.
- Disaster Recovery: Involve in the Disaster Recovery planning and execution of DR procedures during simulated dry runs and actual disaster runs.
Requirements
- Working knowledge of cloud infrastructure is essential
- Technical Expertise and Knowledge:
- OS: Windows, Linux, AIX
- Middleware: WebSphere Application Servers, WebSphere Message Queue, Solace
- Datawarehouse: Snowflake
- Data Integration Tools: Fivetran
- Cloud Platform Expertise: AWS, Azure, Google Cloud, or other similar providers
- Infrastructure as Code (IaC): Terraform, CloudFormation, or Ansible
- Cloud Resource Management: Deploying, managing, and monitoring cloud resources (VMs, storage, networking)
- Automation & Scripting: Python, PowerShell, Bash
- Cloud Security: IAM roles, security groups, firewall configurations
- Monitoring & Performance: Cloud monitoring tools (e.g., CloudWatch, Azure Monitor)
- Scalability & Cost Management: Dynamic scaling and cost optimization (reserved/spot instances, autoscaling)
#J-18808-Ljbffr