Job Posting
CW Application Operation Engineer IV
Exp- 8 to 12years
Np- 15days to Immediate
Responsible for driving operational excellence for the connected services that a business offers to its customers to deliver an "always on" operation, year-round, at the right cost.
Drive the design, development and implementation of operational standards and capabilities for connected services that enable highly available, scalable & reliable customer experiences. Analyzes and synthesizes a variety of inputs to drives the end-to-end incident management process for multiple offerings.
Includes creating, developing & managing the deployment architecture for the application.
Developing the monitoring architecture and implementing monitoring agents, dashboards, escalations and alerts.
Developing and driving incident management processes, playbooks and stakeholder communication mechanisms.
Overseeing change management & configuration management operating mechanisms.
Driving root cause analysis (RCA) and risk management processes.
Driving ongoing improvements and efficiencies in operational practices, tools & processes BU and Client, SRE with expertise in managing middle tier and app tier and thorough understanding of 3 tier architecture (Mandatory).
Monitoring tools setup and configuration (Mandatory). Troubleshooting production issues and experience in working on production environments (Mandatory)
Skills:
6+ years of experience working in an enterprise hosting complex systems AWS experience (VPC, ELB/ALB, ASG, CFN, Cloudwatch, S3, KMS, SNS, SQS, CodePipeline, Route53, Lambda, IAM, AWS Inspector, DynamoDB, VPN etc) Automation - Expertise in CICD and configuration tools like Chef/Terraform/Spinnaker/Packer etc Programming language experience is a must have e.g Go, Ruby, Python, Java etc ( preferably Go).
Familiarity with jar and war structures helpful Experience with git and github repositories Experience with continuous integration tools (e.g. Jenkins, Maven or Gradle).
Ability to code using groovy Experience with and Strong understanding of container systems (Docker) and container orchestration (e.g. ECS/EKS, Kubernetes) Experience with Nexus and Artifactory.
Experience using package managers, including yum, rpm and apt Experience with securing services (Scanning, PenTest) and artifacts (Signing) in AWS Strong Linux/unix background and Advanced bash scripting knowledge Strong understanding of security best practices in coding and operations.
Vulnerability management experience is a plus Experience with metrics, monitoring and alerting tools such as Splunk, Wavefront, AppDynamics, Prometheus, Pagerduty Experience with hosting and