DevOps Manager
Date: Nov 9, 2024
Location: India, IN
Company: Responsive
About Responsive
Responsive (formerly RFPIO) is the global leader in strategic response management software, transforming how organizations share and exchange critical information. The AI-powered Responsive Platform is purpose-built to manage responses at scale, empowering companies across the world to accelerate growth, mitigate risk and improve employee experiences. Nearly 2,000 customers have standardized on Responsive to respond to RFPs, RFIs, DDQs, ESGs, security questionnaires, ad hoc information requests and more. Responsive is headquartered in Portland, OR, with additional offices in Kansas City, MO and Coimbatore, India. Learn more at responsive.io.
About the Role
The DevOps Manager is responsible for leading and managing the DevOps team, driving the adoption of DevOps practices, and ensuring the seamless integration of development and operations processes. This role involves strategic planning, team leadership, and hands-on involvement in implementing and optimizing continuous integration, continuous delivery, and infrastructure as code practices.
Essential Responsibilities
Team Leadership:
- Lead and manage the DevOps team, providing guidance, mentorship, and professional development opportunities.
- Foster a culture of collaboration, innovation, and continuous improvement within the team.
DevOps Strategy:
- Develop and implement the organization's DevOps strategy, ensuring alignment with overall business objectives.
- Stay current with industry trends and emerging technologies to enhance the DevOps practice.
Continuous Integration and Continuous Delivery (CI/CD):
- Design, implement, and optimize CI/CD pipelines to enable efficient and automated software delivery.
- Ensure the timely and high-quality release of software to production environments.
Infrastructure as Code (IaC):
- Implement and manage Infrastructure as Code practices using tools such as Terraform, Ansible, or similar.
- Ensure that infrastructure provisioning and configuration are automated, versioned, and scalable.
Cloud Management:
- Oversee the organization's cloud strategy, including the management of resources on platforms such as AWS, Azure, or Google Cloud.
- Optimize cloud infrastructure for performance, cost, and scalability.
Monitoring and Alerting:
- Implement and manage monitoring and alerting systems to proactively identify and address operational issues.
- Define key performance indicators and ensure systems meet or exceed defined SLAs.
Security and Compliance:
- Collaborate with security teams to integrate security practices into the DevOps pipeline.
- Implement and enforce security policies and compliance standards.
Collaboration and Communication:
- Facilitate communication and collaboration between development, operations, and other stakeholders.
- Act as a liaison between technical and non-technical teams to ensure alignment on DevOps goals.
Incident Response and Recovery:
- Develop and maintain incident response plans and participate in incident response activities.
- Conduct post-incident reviews and implement improvements to prevent future incidents.
Budget Management:
- Manage the DevOps budget, including tools, resources, and training.
- Identify cost-saving opportunities without compromising performance or security.
Education
- BS or MS in Computer Science or equivalent industry experience
Experience
- At least 12 to 15 years in a Site Reliability Engineering, DevOps, SecOps or Infrastructure focused role
- Experience supporting internet-facing production services and distributed systems
- Ability to implement and coordinate telemetry using monitoring and observability tools such as Splunk, Grafana or Prometheus
- Coding experience using a high-level programming languages like: Java, or Python
- Automation advocate - you truly believe in removing operational load via software
- Strong leadership and team management skills.
- Experience managing, scaling, and troubleshooting Java applications
- Familiarity with cloud infrastructure concepts (zones, regions, VPCs, etc)
- An understanding of a variety of software service deployment packaging, strategies, and tooling
- Working understanding of common authentication schemes, certificates, and securely managing secrets
- Capable of designing and implementing automated configuration management processes for repeatable and consistent service deployment
- Knowledge of cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).
Knowledge, Ability & Skills
- Prior experience as an SRE, software engineer, DevOps Engineer, or system administrator
- Experience in system automation technology, such as Ansible
- Relevant certifications (e.g., AWS/GCP/Azure Certified DevOps Engineer, Docker Certified) are a plus.
- Understanding of security practices and compliance standards.