Job Description
Manage and maintain Red Hat Enterprise Linux (RHEL) systems across development, testing, and production environments.
Design, implement, and operate OpenShift container platforms, including bare-metal deployments.
Design and optimize Logstash and rsyslog pipelines for ingesting logs from infrastructure devices (network, storage, servers).
Onboard diverse infrastructure systems into ELK stack and maintain device-specific parsing configurations.
Develop and manage observability dashboards using Kibana; implement alerting for critical infrastructure events.
Automate configuration, patching, provisioning, and monitoring tasks using Ansible, Python, and shell scripts.
Integrate log pipelines and monitoring tools with ITSM systems (e.g., ServiceNow) for automated incident handling.
Maintain documentation, runbooks, and SOPs related to platform setup, observability, and log onboarding.
Participate in 24/7 on-call rotation and support incident response and root cause analysis.
Qualifications
Bachelor’s degree in Computer Science, Information Systems, or related field.
5~8 years of hands-on experience in Linux system administration, with expertise in RHEL and OpenShift.
Proven experience with the ELK Stack (Elasticsearch, Logstash, Kibana) in high-volume environments.
Strong scripting skills in Python, Bash, or similar languages.
Experience onboarding logs from infrastructure devices and integrating with ITSM platforms.
Proficient in using Red Hat Satellite and configuration management tools (e.g., Ansible, Puppet).
Familiarity with infrastructure concepts and components such as network, storage, firewall, virtualization and cloud.
Good understanding of networking protocols, SNMP, syslog, and security best practices.
Strong problem-solving skills and ability to work independently or in cross-functional teams.
Experience with version control systems (e.g., Git) and infrastructure-as-code tools is a plus.
Experience working with various monitoring and logging tools such as Zabbix, Datadog, and similar platforms.