We are seeking a skilled professional with strong expertise in AIOps, operational analytics, and AWS services to drive innovation in IT operations. The role involves building intelligent solutions for monitoring, automation, and integration across hybrid environments, leveraging machine learning and observability tools. The ideal candidate will collaborate with cross-functional teams to improve system efficiency, enable proactive operations, and promote AIOps adoption.
Key Responsibilities:
Design and implement AIOps solutions leveraging AWS and machine learning to deliver actionable operational insights.
Integrate and optimize IT operations tools including QuantumMetrics, Dynatrace, ServiceNow CMDB, and Riverbed.
Develop and manage APIs (REST, GraphQL, event-driven) for seamless integrations across systems.
Enhance observability and monitoring capabilities to improve performance and reliability.
Drive automation initiatives through discovery and CMDB-driven frameworks.
Collaborate with cross-functional teams (Ops, Cloud, Security) to identify opportunities for operational innovation.
Translate complex data into clear, data-driven stories to promote AIOps adoption across stakeholders.
Requirements
Experience with AIOps platforms and operational analytics
Knowledge of machine learning for IT operations (MLOps/AIOps) to generate actionable insights
Strong hands-on expertise in AWS services (Lambda, Glue, S3, Kinesis, SageMaker, CloudWatch)
Familiarity with hybrid integrations between on-prem systems and AWS cloud
Experience with IT Operations tools: QuantumMetrics, Dynatrace (DT), ServiceNow CMDB, Riverbed, etc
API development and integration skills (REST, GraphQL, event-driven architectures)
Knowledge of observability platforms and performance monitoring tools
Strong problem-solving and systems-thinking skills to identify new opportunities from existing tools/data
Familiarity with discovery and automation frameworks (e.g., CMDB-driven auto-discovery)
Strong communication skills to tell a data-driven story and evangelize AIOps adoption
Experience working in cross-functional innovation teams (Ops, Cloud, Security)
AIOps platforms, operational analytics, MLOps, AWS Lambda, AWS Glue, Amazon S3, AWS Kinesis, AWS SageMaker, AWS CloudWatch, hybrid integrations (on-prem + cloud), QuantumMetrics, Dynatrace (DT), ServiceNow CMDB, Riverbed, REST APIs, GraphQL, event-driven architecture, observability tools, performance monitoring, problem-solving, systems thinking, discovery frameworks, automation frameworks, communication, cross-functional collaboration