Prometheus SME (8year )

Remote: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

Minimum of 8 years of experience in Enterprise monitoring., At least 4 years of hands-on experience with the Prometheus Platform., Strong understanding of infrastructure network concepts and protocols., Experience with programming languages like Python or NodeJS is a plus..

Key responsibilities:

  • Manage day-to-day maintenance and evolution of Prometheus monitoring and alerting infrastructure.
  • Design, develop, and implement IT Operations monitoring solutions.
  • Coordinate with support teams to resolve issues and ensure system functionality.
  • Configure dashboards, reports, and integrate Prometheus with other tools and applications.

CodersBrain logo
CodersBrain SME https://www.codersbrain.com/
201 - 500 Employees
See all jobs

Job description

Minimum of 8 years of experience in the area of Enterprise monitoring.
Minimum 4 years of hands-on experience in Prometheus Platform
Should have good experience in Design, development, and implementation of IT Operations monitoring solutions with integration into other ITSM applications
Expertise in Installing, configuring Prometheus components.
Experience in time-series databases
Experience with managing large amounts of product analytics
Manage day-to-day maintenance and evolution of Prometheus monitoring and alerting infrastructure
Experience in Grafana is must
Expertise in install and configure required exporter in the Targets
Expertise in configuring the Thresholds for Servers, Network, Storage, Backup, Databases
Experience in configuring the Dashboard & Reports
Experience in Prometheus integration with other tools
Experience in custom exporter
Experience in event management functionality. Netcool Omnibus, ScienceLogic, LogicMonitor, Zabbix and other event management tools are added advantage.
Experience in integration with Service Management tools like ServiceNow, BMC Remedy
Experience in integration with Notification, Collaboration and automation tools like -xMatters, Everbridge, Slack, Ansible, etc.
Knowledge in third party discovery tools like ServiceNow and BMC discovery is an added advantage.
Strong understanding of Infrastructure network concepts and protocols.
Experience in remediation of discovery and monitoring issues in the infrastructure
Good analytical, problem solving, logical thinking. Standby support during non-office hour is required.
Co-ordinating with support teams in resolving issues.
Should Collects, generates, or helps refine high level requirements and creates implementation strategy, acceptance criteria (with input from the customer) and test cases
Knowledge in programming languages like Python, NodeJS, etc. will be additional advantage.
Interested to work in multi skilled environment and adapted to learn new technologies that supports any Enterprise Infrastructure
Customer facing experience is a must.
Prometheus Certifications are an added advantage

Required profile

Experience

Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Logical Reasoning
  • Analytical Thinking
  • Problem Solving

Related jobs