Job Description

Job Description:
Our client's Observability Engineering team is looking for a well-qualified Enterprise Observability Monitoring Engineer. The qualified candidate will be helping to transform the monitoring landscape for the organization, by enabling techniques and practices that align closely with today's observability and site reliability engineering best practices. The candidate will have a strong familiarity with measuring and monitoring infrastructures and applications across all layers of the Open Systems Interconnection (OSI). The candidate will have a strong familiarity with reliability engineering practices and established criteria associated with monitoring, measuring, and reporting on service level objectives and key performance indicators. A strong track record with observability framework development and use is required, the preferred candidate will have this experience with multiple frameworks.

Job Responsibilities Include:
• Splunk developer who can work in partnership across the enterprise developing custom and standard and custom monitoring, visualization, and reporting solutions.
• Onboard new apps into splunk.
• Create in depth custom dashboards to use day to day and display KPI metrics etc.
• Is a splunk search language expert and can write complex searches.
• Can create custom splunk alerts and help troubleshoot search performance to optimize searches.
• Implement custom scripts, custom/application specific splunk data stores.
• Automate some application configuration management using splunk.
• App Dynamics developer that can create/enable and manage APM, MRUM, BRUM, infrastructure, network and database visibility.
• Familiarity with playbook creation and automations using Ansible.
• Understands application tracing technologies and can work with development and application support teams to enable these technologies, e.g. OpenTelemetry/Jaeger, SignalFx.
• Strong understanding of synthetic transaction creation using frameworks such as Selenium, Apica, Cucumber.
• Understanding of Git/Bitbucket and source code management.
• Scripting/programming to facilitate innovation in observability technology.
• Strong Agile skills and is able to use Jira day to day.
• Understands Enterprise operating systems and technologies and can develop observability standards for them, e.g. RedHat Enterprise OS, Solaris, Windows Server, Oracle and MS SQL DB, VMWare, etc.
• AWS architecture expert, and knowledgeable in instrumenting observability, log aggregation and monitoring technology in cloud infrastructures.
• Familiarity with OpenShift and monitoring OpenShift environments.
• Strong collaboration skills and is able to partner with multiple teams and coordinate meetings, follow through on tasks, and maintain status on workflow items.
• Work closely with application development on early lifecycle initiatives
• Adhere to standard ITIL processes

Basic Qualifications:
• 3-5 years experience as an Enterprise Observability Monitoring Engineer or previous experience in a similar role with extensive knowledge of associated processes
• 3-5 years Shell/Perl/Python Scripting/Programming experience
• 3 years experience in RDBMS database technologies and DML/SQL
• 3 years experience with Java EE
• 1-2 years Splunk Developer or App Dynamics development and support (or other)
• 1-2 years experience with AWS EC2/ECS/EKS and Private Cloud
• Fundamental understanding of the Agile SDLC
• Experience debugging in a distributed system/environment
• Applied experience with Linux/Unix and Windows platforms

Preferred Skills:
• Excellent verbal and written communication skills.
• 1-2 years IT Service Intelligence (ITSI) Developer experience
• 1-2 years Git and Bitbucket source code management experience
• 1-2 years Confluence documentation management experience
• 1-2 years Jira project and work management experience
• 1-2 years Jenkins build and execution management experience
• 1-2 years Apica/Selenium scripting and support experience
• 1-2 years Ansible playbook and automation development experience
• 1-2 years Jaeger/OpenTelemetry development and support experience
• 1-2 years SignalFx development experience
• OEM support
• SCOM support
• Experience working with multiple teams and stakeholders
• Bachelor of Science in a related field (e.g. Computer Science or Information Systems)


Careers at NTT DATA

Innovation is at the heart of what we do. Innovation that makes an impact and improves business performance. Innovation that improves our clients’ bottom line.

We are always on the lookout for talented innovators to join us - especially for people who can use their creativity to drive value for our clients worldwide. Discover more career opportunities that can help you make the most of your skills.


We are one of the largest global IT services company with operations in more than 40 countries. We offer an advanced portfolio of application, business process, cloud, and infrastructure services to businesses and governments worldwide.

Our roots cross continents and cultures, dating back five decades. We’ve grown organically and decisively by acquiring some of the best IT services providers across the globe. This pedigree yields a characteristic special to NTT DATA: the opportunity of a global brand with the creative energy of a start-up.

Similar jobs