Match score not available

Site Reliability Engineer

Remote: 
Full Remote
Work from: 

Rapinno Health Care logo
Rapinno Health Care SME https://rapinnohealthcare.com/index
11 - 50 Employees
See more Rapinno Health Care offers

Job description

Role: Site Reliability Engineer

Location: Piscataway, NJ

Duration: Long Term Contract

Domain: Largest Enterprise Telecom Client

Middleware tech WebSphere/WebLogic/tomcat, Shell scripting, AWS, Ansible/jenkins - Must have Some Production support exp

Description
As a member of the Platform as a Service team, you will be responsible for the design and development of medium to highly complex systems. This includes the design and implementation of infrastructure from specifications, configuration and deployment of applications, connecting to back-end resources, and advanced troubleshooting of moderately complex software applications. Deployment, middleware administration and operational support of (production, staging, test and development) environments for multiple projects using WebSphere, Weblogic, and Tomcat Application Server. Monitors systems capacity and performance, plans and executes disaster recovery procedures, and provides Tier 2 technical support.

In addition, this role requires the candidate to be highly flexible in hours of work because of its customer-facing, highly available infrastructure requirements. Work closely with Dev, QA and production support team members to align and orchestrate resolutions on open issues/defects. Provides high level written communications to upper management regarding production issues.

Required Skills

3-5 years managing and administrating middleware technologies(Weblogic, Websphere, Tomcat).
3+ years hands-on experience with Solaris, Linux (RHEL, CentOS, Ubuntu), in bare-metal and Cloud-based infrastructure (AWS, OpenStack)
Experience with cloud platforms AWS( Auto scaling , AVI, security, EC2 , EFS , EBS , S3 , KMS)
Strong experience with Installing IBM WebSphere MQ and creating multi instance Queue manager in AWS by using EBS/EFS volumes, creating MQ objects, clusters, channels etc.
Experience with configuring the clustered Queue managers for HA and load-balancing as well troubleshooting in clustered environment
Installing open source Rabbit MQ on AWS EC2 instances with the use of CFTs/ansible and automating it by using Jenkins. Also creating Classic Load balancer to distribute traffic among those Rabbit MQ instances
Experience with migrating applications from monolithic to kubernetes container platform
Experience with APIGEE Proxy configurations and troubleshooting
Hands on experience with CI/CD tools such as Jenkins, Ansible
Working knowledge of monitoring tools like CA Wily, New Relic, and Datadog
Experience with Elasticsearch, Kibana, and Logstash
Execution on all release engineering aspects of DevOps including the configuration management , Build and Deployment Management, Continuous Integration and Delivery
Ansible based deployment and configuration automation solutions.
Experience with web based services and protocols ( HTTP , HTTPS, REST , Apache , Tomcat)
Experience with micro-service architectures and deployment.
Knowledge on L2/L3 protocols , IPv4/IPv6 and TCP/IP stack .
Proficiency in high level script languages (Python preferred) as well as script environments like bash Experience with DevOps workflow automation (Jenkins, Ansible, Puppet)
Strong analytical & troubleshooting skills.
Experience with tools like JIRA, Confluence, Stash
M.S. or relevant experience required.

Preferred to have:
AWS Certification

Required profile

Experience

Spoken language(s):
English
Check out the description to know which languages are mandatory.

Site Reliability Engineer (SRE) Related jobs