Site Reliability Engineer, Monitoring And Control Engineering

Stamford, CT, United States

Job Description

Company Description
NBCUniversal is one of the world's leading media and entertainment companies. We create world-class content, which we distribute across our portfolio of film, television, and streaming, and bring to life through our theme parks and consumer experiences. We own and operate leading entertainment and news brands, including NBC, NBC News, MSNBC, CNBC, NBC Sports, Telemundo, NBC Local Stations, Bravo, USA Network, and Peacock, our premium ad-supported streaming service. We produce and distribute premier filmed entertainment and programming through Universal Filmed Entertainment Group and Universal Studio Group, and have world-renowned theme parks and attractions through Universal Destinations & Experiences. NBCUniversal is a subsidiary of Comcast Corporation.
Our impact is rooted in improving the communities where our employees, customers, and audiences live and work. We have a rich tradition of giving back and ensuring our employees have the opportunity to serve their communities. We champion an inclusive culture and strive to attract and develop a talented workforce to create and deliver a wide range of content reflecting our world.
Comcast NBCUniversal has announced its intent to create a new publicly traded company ('Versant') comprised of most of NBCUniversal's cable television networks, including USA Network, CNBC, MSNBC, Oxygen, E!, SYFY and Golf Channel along with complementary digital assets Fandango, Rotten Tomatoes, GolfNow, GolfPass, and SportsEngine. The well-capitalized company will have significant scale as a pure-play set of assets anchored by leading news, sports and entertainment content. The spin-off is expected to be completed during 2025.

NBCU is looking for creative engineers willing to learn from the current process but are not afraid to think outside of the box. This role is responsible for the engineering, operations, support, deployment and maintenance of core Distribution Engineering Monitoring and Control systems, both on-premises and cloud.
Utilize scripting and automation to develop, customize and enhance monitoring/alerting tools for "on-air" environments
Interact with automated monitoring infrastructure to ensure healthy environments
Create system dashboards that improve system availability and reliability
Query data stores to quantify the scope of reported issues
Create new metrics and identify monitoring deliverables to improve site reliability
Act as a Level 2 resource, drive and own investigations related to Broadcast issues and report back findings in a timely manner to leadership and operations.
This role requires on-call 24/7 support on a rotating shift schedule
Follow up with team members & 3rd party vendors if issues found cannot be solved and drive vendors for root cause and solutions if possible.
Create comprehensive documentation outlining the intricacies of encountered issue, elucidating the root cause and steps for effective issue resolution.
Administer monitoring and control systems within the "on-air" environments
Develop proof of concept deployments for evaluation of products and architectures
Utilize modern frameworks and scripting languages to develop products and services for NBCU's IP video distribution environment
Qualifications
REQUIREMENTS:
Bachelor's degree in computer science or related degree
Experience with IP video and broadcast technologies
3-5+ yrs experience with monitoring and alerting tools i.e. Grafana, Splunk, ELK Stack, Dataminer
Ability to develop end-to-end monitoring dashboards, alerts and reports for enterprise level environments
3-5 years of SRE experience in the technology sector supporting and maintaining production-quality software or software-defined infrastructure in a high traffic environment run in a cloud environments (AWS preferred)
Ability to collect data from various systems using COTS APIs
Experience with scripting languages and tools i.e C#, Python, Bash
Experience with modern frontend technologies like Vite, React, NodeJS, Typescript
Experience with configuration management technology i.e. Ansible, Salt, and/or Chef
Experience with public cloud platforms such as AWS, GCP or Azure
Experience with networking and cloud-based network environments
Experience with containerization Docker & Kubernetes
Experience with CI/CD build (Github Actions), deployment practices, and Infrastructure as Code (Terraform)
Experience in administrating Linux and Windows environments
Ability to use Agile process for project management, development & tracking
Comfortable working in a fast-paced agile environment. Requirements change quickly and our team needs to adapt to moving targets.
PREFERRED QUALIFICATIONS:
Experience with a variety of software and hardware operating environments
Experience in troubleshooting complex technical issues
Experience with SMPTE standards and implementation
Experience with PTP implementation
Good communicator and able to clearly articulate complex issues and technologies
Great design and problem-solving skills
Willing to take ownership of problems and see them through to resolution
Experience with DevSecOps principles
Ability to create user interface designs based on client workflows
Ability to intake project requirements from Operational partners and work with vendors to meet their needs
Fully Remote: This position has been designated as fully remote, meaning that the position is expected to contribute from a non-NBCUniversal worksite, most commonly an employee's residence.
This position is eligible for company-sponsored benefits, including medical, dental, and vision insurance, 401(k), paid leave, tuition reimbursement, and various other discounts and perks. For a comprehensive overview of the benefits offered by NBCUniversal, please visit the on the Careers website.
Salary Range: $110,000 - $145,000
We are accepting applications on an ongoing basis.
#LI-remote
Additional Information
As part of our selection process, external candidates may be required to attend an in-person interview with an NBCUniversal employee at one of our locations prior to a hiring decision. NBCUniversal's policy is to provide equal employment opportunities to all applicants and employees without regard to race, color, religion, creed, gender, gender identity or expression, age, national origin or ancestry, citizenship, disability, sexual orientation, marital status, pregnancy, veteran status, membership in the uniformed services, genetic information, or any other basis protected by applicable law.
If you are a qualified individual with a disability or a disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access nbcunicareers.com as a result of your disability. You can request reasonable accommodations by emailing .
For LA County and City Residents Only: NBCUniversal will consider for employment qualified applicants with criminal histories, or arrest or conviction records, in a manner consistent with relevant legal requirements, including the City of Los Angeles' Fair Chance Initiative For Hiring Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, where applicable.

Skills Required

Beware of fraud agents! do not pay money to get a job

MNCJobz.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD6267010
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    $110,000-145,000 per year
  • Employment Status
    Permanent
  • Job Location
    Stamford, CT, United States
  • Education
    Not mentioned