We partner with the world’s most valuable brands to build digital solutions that transform businesses. As a digital native, we bring a 28-year track record of accelerating business impact through complete and scalable digital solutions. With a global presence of 6,500+ professionals in strategy, research, data science, design, and engineering, we unlock top-line growth, improve customer experience, and drive operational efficiency.
Developer / SRE (Site Reliability Engineer)
Position Overview: As a Developer / SRE within a Level 3 Production Support Team, you will bring your development skills and reliability engineering expertise to ensure the stability, scalability, and performance of multiple digital applications. Your role involves investigating and debugging incidents, automating repetitive tasks, configuring proactive monitoring, and collaborating with development teams to implement robust solutions. You will be an integral part of a high-impact team supporting various leading-edge digital products within a renowned automotive brand in the United States. Your contributions will directly influence the success and evolution of these innovative solutions, shaping the forefront of technology and customer experience in the automotive industry.
Incident Investigation and Resolution:
- Investigate and debug incidents escalated from Levels 1 and 2 support, providing efficient and effective solutions.
- Collaborate with development teams to understand and address the root causes of incidents.
- Develop and implement fixes and improvements to prevent the recurrence of issues.
- Write documentation and assist in maintaining a knowledge base of Standard Operating Procedures (SOPs), resolutions, and best practices.
Automation:
- Automate repetitive tasks to streamline support workflows and increase operational efficiency.
- Implement test automations for the most critical scenarios to ensure they are robust and reliable.
Proactive Monitoring:
- Implement alerts and other proactive strategies using existing tools for prompt incident response.
- Contribute to fostering a proactive monitoring culture by implementing best practices within the team.
Collaboration with other workstreams:
- Rapidly acquire knowledge of new products or features through knowledge transfer sessions with product teams.
- Offer constructive critiques of implementations, aiming to enhance the performance and stability of applications.
Qualifications:
- Mid-level developer with experience in site reliability engineering.
- Strong programming and scripting skills (React, Node.js, GraphQL).
- Experience within a production support team.
- Familiarity with incident response and troubleshooting methodologies.
- Expertise in proactive monitoring on GCP, ensuring system stability and performance.
- A good level of conversational English, enabling effortless communication with other members of the team.
**CI&T is an equal-opportunity employer. We celebrate and appreciate the diversity of our CI&Ters’ identities and lived experiences. We are committed to building, promoting, and retaining a diverse, inclusive, and equitable company and culture focused on creating a better tomorrow.**
At CI&T, we recognize that innovation and transformation only happen in diverse, inclusive, and safe work environments. Our teams are most impactful when people from all backgrounds and experiences collaborate to share, create, and hear ideas. Before applying for our opportunities, take a look at our [Conflict of Interest Policy](https://s28.q4cdn.com/106679464/files/doc_downloads/governance/2023/09/conflict-of-interest-policy-sept-23-v-3-0-docx-1.pdf).
We strongly encourage candidates from diverse and underrepresented communities to apply for our vacancies.