Engineer, Site Reliability

Addison, TX, United States

Job Description


Overview:

The Site Reliability Engineer (SRE) is responsible for ensuring that the underlying infrastructure and critical systems are working as expected and running smoothly. They also monitor critical
applications and services to minimize downtime and ensure their availability. The SRE plays a large part in improving core system stability and successfully implementing DevOps practices.
SRE Engineer works under the supervision of the Sr. Manager of DevOps. Responsibilities:

  • Develop software to automate processes like analyzing logs, testing production environments, and responding to any issues
  • Embody the philosophy of DevOps and provide a prescriptive way of measuring and achieving reliability through engineering and operations work
  • Develop software tasks in accordance with Information System standards and methodologies
  • Possess very deep knowledge about the whole technology stack of the system
  • Evolve the architecture to support future requirements and defines its SLAs, SLOs and SLIs
  • Mentor others to accelerate their career-growth and encourages them to participate
  • Challenge the team processes, looking for ways to improve them
  • Ensure management awareness of problems that are severe in nature or that are exceeding documented targets
  • Review outstanding issues daily to assure that troubleshooting and resolutions are current
  • Recognize potential areas where policies and procedures require change, or where new ones need to be developed, especially regarding future business expansion. Submit recommendations as appropriate.
  • Participate in relevant information-sharing activities
  • Monitor and report on any security violations related to the unwarranted access to corporate data
  • Ensure that all problems are resolved in a timely and efficient manner
  • Help build team spirit by assisting other staff members and promoting a positive workplace
  • Maintain awareness of the rapidly changing environment and recommend cost efficient techniques
  • Support the mission and direction of Concentra, both within the Information Services department and throughout the corporation
  • Create, maintain, and update disaster recovery procedures when changes in hardware or applications occur
  • Ensure all changes comply with change management policies and procedures
  • Responsible for reporting to and completing work at assigned times
  • Provide technical mentoring to the more junior developers
  • Work with end users and other Information Services staff to develop criteria for specific phases of assigned projects, and ensure proper testing of all developed solutions
  • This job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities, and activities may change at any time with or without notice.
Qualifications:
  • Customarily has at least five or more years of DevOps experience
  • Eight years of total work experience out of which at least 3 years as Site Reliability Engineer, with hands on experience building AWS and/or Azure infrastructure.
  • Proficiency in at least one programming language e.g., Python or Java.
  • Experience in CI/CD tools like Jenkins and Bamboo for automated deployments.
  • Experience in source code repositories like Bitbucket, GitHub, GitLab etc.
  • Experience in JIRA or similar code tagging system
  • Expertise in implementing methodologies for Automation, Continuous Integration, Continuous Delivery, High Availability, High Scalability, Monitoring, Logging, Security and Governance.
  • Experience with Application Performance Management (APM) tools like AppDynamics, NewRelic or AWS CloudWatch.
  • Experience with network monitoring tools like PRTG, Datadog etc.
  • Proficient in Administering Linux, and Windows based platforms.
  • Hands on Experience in containerization using Docker and deploying the microservices using Kubernetes.
  • Experience in creating Docker Images and managing Docker hub.
  • Experience with Terraform or ARM templates for creation of Infrastructure and good understanding of Infrastructure as Code principles.
  • Hands on Experience in launching, configuring, and maintaining AWS or Azure cloud resources.
  • Experience in troubleshooting builds & deployments of applications developed in varied technologies from
  • desktop to web and cloud-based applications via Jenkins, Atlassian Bamboo, Bitbucket, GitLab etc.
  • Strong scripting knowledge using languages such as PowerShell, Bash, Python, Groovy, etc.
  • Must understand complex multi-tiered environments and how they interact and integrate with DevOps Toolsets
  • Must have experience in Administering Atlassian Jira (i.e., workflows, issues management, user administration).
  • Must have experience in Business Process Improvement, Problem Management/Preventive Maintenance and Analytical & Conceptual Problem Solving
  • Experience supporting production enterprise applications.
  • Preferred (not mandatory) experience includes AWS or Microsoft Azure Certifications
Job-Related Skills/Competencies
  • Concentra Core Competencies of Service Mentality, Attention to Detail, Sense of Urgency, Initiative and Flexibility
  • Ability to make decisions or solve problems by using logic to identify key facts, explore alternatives, and propose quality solutions
  • Outstanding customer service skills as well as the ability to deal with people in a manner which shows tact and professionalism
  • The ability to properly handle sensitive and confidential information (including HIPAA and PHI) in accordance with federal and state laws and company policies
  • Ability to rapidly learn new software development technologies and environments
  • Requires superior teamwork skills
  • Strong interpersonal and communication skills a must; ability to read, write, and speak in a professional manner.
  • Excellent analytical and problem-solving skills
  • Ability to effectively multi-task and adapt to changing business priorities
  • Excellent time management and organizational skills
  • Excellent attention to detail Additional Data:
  • 401(k) Retirement Plan with Employer Match
  • Medical, Vision, Prescription, Telehealth, & Dental Plans
  • Life & Disability Insurance
  • Paid Time Off & Extended Illness Days Offered
  • Colleague Referral Bonus Program
  • Tuition Reimbursement
  • Commuter Benefits
  • Dependent Care Spending Account
  • Employee Discounts
Be part of a committed team that\'s growing fast and making a difference. At many locations, you\'ll enjoy a M-F schedule and work with leading edge technologies that continuously advance your knowledge and skills.

Concentra is an Equal Opportunity Employer, including disability/veterans

Concerta

Beware of fraud agents! do not pay money to get a job

MNCJobz.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD4281589
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Addison, TX, United States
  • Education
    Not mentioned