Nikhil Gupta

I'm

Resume

Results-driven SRE Manager with 15 years of experience in leading high-performance teams and ensuring the reliability, scalability, and performance of mission-critical systems. Adept at implementing best practices in incident management, infrastructure design, and automation to optimize system reliability and uptime. Proven track record of successfully driving projects, managing stakeholders, and fostering a culture of collaboration and innovation.

Summary

Nikhil Gupta

Experienced and hands-on SRE/DevOps Engineering Leader with a proven track record in managing complex microservices environments serving millions of users. Expertise in centralizing tooling and platform development to enhance developers' productivity, improve product quality, and ensure scalability and resiliency. Strong advocate for DevOps practices, zero-touch CI/CD pipelines, continuous deployment, and modernization of tech stacks. Skilled in defining SRE best practices, implementing monitoring and alerting systems, and driving incident management and root cause analysis. Collaborative leader, adept at managing teams, gathering requirements, and driving automation initiatives to optimize operational efficiency.

  • Whitefield, bangalore, India
  • (0091) 7829-4537-68
  • nikhil.vinod.gupta@gmail.com

Education

Bachelors of Engineering

2005 - 2009

SGSITS, Indore (M.P.), India (No. #1 state college of MP)

Achieved the 70th rank in the state-level engineering entrance exam, securing admission to the premier engineering college in Madhya Pradesh.

Higher Secondary (12th Standards)

2003 - 2004

SVHSS, Ashoknagar(MP)

Completed 12th grade in 2004 with 84% and scored 1st rank in the school, emphasizing major subjects in Physics, Chemistry, and Mathematics.

Professional Experience

Senior Engineering Manager (SRE/DevOps)

Dec-2021 - Present

Walmart Global Tech, Bangalore, India

Staff Software Engineer/Architect (SRE/DevOps)

Jan-2020 - Dec-2021

Walmart Global Tech, Bangalore, India

Senior Software Engineer (SRE/DevOps)

Oct-2018 - Jan-2020

Walmart Global Tech, Bangalore, India

Senior DevOps Engineer

Jan-2017 - Oct-2018

Intuit, Bangalore, India

Senior DevOps & Infrastructure Engineer

Jun-2016 - Dec-2016

DataTorrent, Bangalore, India

Lead DevOps Engineer

Sep-2014 - Jun-2016

Amadeus, Bangalore, India

Senior Build & Release Engineer

May-2014 to Sep-2014

Altisource, Bangalore, India

Senior Software Engineer (Build & Release)

Dec-2009 to May-2014

Accenture, Bangalore, India

About

Experienced and hands-on SRE/DevOps Engineering Leader with a proven track record in managing complex microservices environments serving millions of users. Expertise in centralizing tooling and platform development to enhance developers' productivity, improve product quality, and ensure scalability and resiliency. Strong advocate for DevOps practices, zero-touch CI/CD pipelines, continuous deployment, and modernization of tech stacks. Skilled in defining SRE best practices, implementing monitoring and alerting systems, and driving incident management and root cause analysis. Collaborative leader, adept at managing teams, gathering requirements, and driving automation initiatives to optimize operational efficiency.

Experience Summary:

  • Led a team of SREs in a complex microservices environment serving millions of users.
  • Developed and implemented centralized tooling and platform solutions, enhancing developers' productivity and enabling focus on feature development, quality improvement, and scalability.
  • Defined and enforced SRE and DevOps best practices across teams, resulting in increased service quality and uptime.
  • Established and optimized zero-touch CI/CD pipelines with integrated code quality checks, unit tests, integration tests, end-to-end tests, performance tests, approval processes, and logging.
  • Advocated for continuous deployment and delivery, working with developers to achieve higher deployment frequency.
  • Designed and implemented robust alerting, monitoring, and logging practices, tools, and dashboards.
  • Orchestrated tech modernization efforts, successfully migrating from legacy to new tech stacks.
  • Developed deployment architectures to ensure efficient and scalable service deployments.
  • Implemented SLIs, SLOs, and SLAs for microservices, creating SLO dashboards and proactive alerting systems.
  • Designed and integrated dashboards for CICD, service availability, infrastructure availability, cost analysis, and network latency into a centralized SRE dashboard.
  • Implemented HA/DR strategies and automation to achieve defined business goals.
  • Introduced branching strategies to increase deployment frequency while maintaining high-quality standards.
  • Led incident response efforts, conducting thorough RCAs and implementing improvements to minimize mean time to resolution (MTTR).
  • Provided cost optimization suggestions without compromising quality, availability, and resiliency.
  • Implemented infrastructure and configuration as code principles to ensure consistency and reproducibility.
  • Facilitated sprint planning, retrospectives, and Jira dashboard management for the SRE team.
  • Collaborated with Dev Managers and Architects to gather requirements, provide feedback, and address pain points.
  • Managed 24x7 SRE on-call rotations and ensured timely incident response and resolution.
  • Proactively identified and automated repetitive tasks to improve operational efficiency.
  • Led migration initiatives from on-premises infrastructure to cloud platforms.
  • Improved SDLC for legacy products, implementing modern DevOps practices.
  • Organized and conducted DevOps and SRE tech sessions to foster knowledge sharing and skill development.
  • Established and led the DevOps and SRE forum, facilitating collaboration and the sharing of tools, best practices, and pain points across multiple teams.

Facts

Below Numbers describes about myself

15

Years of Experience

4

Certifications

6

Organisations

7

Awards

Certifications

Below is the list of the certifications which I have passed. You can check my LinkedIn Profile for more details

CKA (Certified Kubernetes Administrator)
Kubernetes: Service Mesh with Istio
Python 3
Liquibase Fundamentals Certification

Tech Skills

I have worked on many tools and technologies. I am always open to learn any new tool or tech based on the requirement. Below is my current tech skill set

Scripting (Python, Shell)
Web (Flask, HTML, CSS, JS)
CI/CD (Jenkins, Bamboo)
Config Management (Ansible, Chef)
Cloud (Azure, AWS)
APM (Dynatrace)
Continuous Monitoring & Alerting (Prometheus, Grafana)
Messaging (Kafka)
Containers and Orchestration (Docker, Kubernetes, Docker Swarm)
Source Code Management (GitHub, BitBucket, Perforce, SVN)
Slack Chatbots
OS - Linux(Mainly), Windows
Build Tools (Maven, Ant, Gradle)
Databases (Liquibase, MySQL, MS SQL, DB2 LUW, Azure SQL)
Logging (Splunk, ELK)
Service Mesh (Istio)

Management Skills

As an SRE (Site Reliability Engineering) Manager, effective leadership skills are crucial for successfully managing and leading a team. I have gained below leadership skill during my tenure as SRE Manager

Coaching and Mentoring
Team Building
Stakeholder Management
Conflict Resolution
Decision Making
Collaborated with cross-functional teams
Strategic Thinking
Prioritization

Testimonials

Below testimonials are from my linkedIn Profile.

Contact

You can reach out to me :

LinkedIn:

nikhil-vinod-gupta

Call:

+91 78 2945 3768