Raffles Quay, Singapore | Posted on 04/18/2025
KLAARA provides AI-driven solutions tailored to the needs of customers in capital markets, insurance, and reinsurance, as well as other organizations.
Our cutting-edge platform empowers businesses to transform their unstructured data into actionable intelligence, fuelling better decision-making and driving competitive advantage.
Headquartered in Singapore, KLAARA was launched in 2023 after three years of research and development, as a spin-off of Upskills, a capital markets consultancy.
KLAARA is made up of creative thinkers and problem solvers whose ultimate mission is breakthrough innovation. All of us are fully engaged in delivering cutting-edge solutions built on state-of-the-art AI.
We are seeking a Senior DevOps Engineer to join our dynamic team at Klaara.ai, an AI-enabled software vendor providing advanced solutions for enterprise customers, particularly in the financial and banking sectors.
In this role, you will be responsible for managing the deployment and integration of our application suite within the firm and at customer sites, ensuring optimal instrumentation, scalability, and compliance with security standards.
You will collaborate closely with developers, business analysts, and customers to ensure seamless delivery, performance testing, and documentation of our solutions.
This is a hands-on role requiring strong technical acumen, cross-functional teamwork, and occasional code fixes. Key responsibilities are listed below:
Manage the build and deployment of multiple trading applications.
Lead and execute the implementation of software deployment packages at customer sites, ensuring robust instrumentation and scalable performance.
Design, develop, and execute load and scalability tests to validate system reliability under various conditions.
Implement and maintain monitoring and alerting systems to proactively identify and resolve issues in development, staging, and production environments.
Participate in incident response and root cause analysis, and contribute to the continuous improvement of system reliability.
Develop and maintain automation scripts and tools to streamline deployment, infrastructure provisioning, and routine operational tasks.
Create and maintain clear, comprehensive documentation for customers on deployment, configuration, and operations.
Collaborate with internal development teams, business analysts, and customers to align technical solutions with business requirements.
Troubleshoot and resolve deployment and integration issues, occasionally contributing fixes in TypeScript or Rust.
Ensure compliance with security and regulatory requirements, particularly in banking environments.
Maintain and optimize CI/CD pipelines and deployment automation using Jenkins and Atlassian tools.
Manage and maintain development, staging, and production environments, ensuring consistency and high availability.
Support and manage infrastructure components using Linux, Docker, and Kubernetes.
Integrate and manage application components such as Kong, OpenTelemetry, Kafka, RabbitMQ, and MySQL.
Work with systems designed using microservice architecture, ensuring seamless deployment and integration across services.
Utilize Infrastructure as Code (IaC) tools such as Terraform, Ansible, or Helm to provision and manage infrastructure effectively.
Requirements
Master's or Bachelor's Degree in Information Technology, Computer Science, Engineering, or related field.
Proven experience with CI/CD tools such as Jenkins and the Atlassian suite (Bitbucket, Jira, Confluence).
Hands-on experience with Linux, Docker, Kubernetes, and cloud-native deployment practices.
Solid understanding of microservice architecture and experience deploying and managing distributed systems.
Experience with monitoring and observability tools, especially OpenTelemetry, and with managing alerts and incident response processes.
Proficiency in automation and scripting (e.g., Bash, Python, or similar) for deployment and infrastructure tasks.
Experience with Infrastructure as Code (IaC) tools such as Terraform or OpenTofu.
Familiarity with API gateways (e.g., Kong), message brokers (Kafka, RabbitMQ), and databases (MySQL, PostgreSQL).
Ability to design and execute performance, load, and scalability tests.
Basic proficiency in TypeScript and/or Rust, sufficient for reading code and implementing minor fixes.
Strong understanding of security, compliance, and risk management practices in financial services or banking.
Excellent communication and collaboration skills; able to interact with technical and non-technical stakeholders.
Willingness to travel to customer sites as needed for deployment and support.