Summary
- Senior Site Reliability Engineer with deep expertise in managing online services, complex database systems, and large-scale distributed systems within Azure cloud environments. Experienced in database architecture, administration, and end-to-end service reliability engineering, including designing, developing, and implementing scalable software systems for mission-critical services.
- Currently leading a software development team while transitioning into an SRE/Engineering Manager role. Demonstrated success in setting strategic direction, mentoring team members, and driving measurable results through automation, system design, and operational excellence. Implementing modern DevOps practices and CI/CD workflows to improve deployment reliability.
- A dedicated, hands-on leader who thrives in fast-paced, cross-functional environments. Strong collaborator with a focus on quality, performance, and customer impact. Passionate about building high-performing teams, bridging DevOps and development, and aligning engineering efforts with business objectives.
Experience
Senior SRE Manager
Customer Reliability Engineering (C+E) | Microsoft
Oct 2020 - Present
- Lead and manage cross-functional SRE teams following Microsoft's core priorities: model, coach, and care
- Led collaboration among software architects, product managers, and developers to define and implement KPI-driven alerting; built and deployed an automated Alerting Module that notifies the team when any KPI value turns red, improving data quality monitoring and incident response
- Manage ACOM and Tenant Management online services with dedicated sub-teams, serving 100M+ users
- Spearhead hiring and development initiatives for SRE teams across the CRE organization
- Contribute to the Monitoring Guardrails project, which reduced live site incidents by 50%
- Design solutions for assessing service parity between production and sovereign clouds
- Develop Geneva Actions that reduce human intervention in compliance services by 80%
Senior Engineering Service Engineer - Live site management, SLI/SLO definition, and data pipeline management
Business Application Group & Consumer Sales Marketing | Microsoft
Jun 2017 - Oct 2020
- Focused on live site management and data pipeline optimization for customer-facing services
- Defined and implemented SLO/SLI metrics, improving service reliability to 99.9%
- Developed and managed Azure Data Factory pipelines processing 10+ TB of data daily
- Led testing and execution of business continuity and disaster recovery plans
- Established comprehensive monitoring using Power BI, Lens, and Geneva tools
Senior Engineering Service Engineer - Telemetry migration and Azure POC development
Visual Studio Online | Microsoft
Jun 2014 - Jun 2017
- Led transformation of service monitoring from legacy tools to Geneva (MDM/MDS)
- Served as SME for Geneva and Kusto onboarding across multiple product teams
- Enhanced HADR measures, achieving 99.95% uptime for critical services
- Developed POCs for migrating telemetry data to Big Data solutions (HDInsight, MongoDB)
- Implemented anomaly detection, reducing false-positive alerts by 60%
Senior Engineering Service Engineer - SQL Systems Redesign
Sales and Marketing IT (SMIT) | Microsoft
Apr 2012 - Jun 2014
- Collaborated with Dev, Test, and PM teams to redesign www.Microsoft.com and Content Management SQL systems
- Participated in architectural discussions and deployed backend services for SMIT
- Supported Azure onboarding by migrating on-premises SQL systems to Azure IaaS and PaaS
- Led technical adoption engagements with SQL product teams during Azure migration efforts
- Ensured SQL consistency across all team instances through structured governance
- Designed and implemented SQL high-availability solutions to improve system robustness
Engineering Service Engineer II - Database Administrator managing SQL systems
EPX Product Group | Microsoft
Jun 2007 - Apr 2012
- Group DBA and Database Architect for MSDN and TechNet serving 50M+ developers
- Point of escalation for database issues across the product group
- Collaborated on v-next architecture and implementation with product teams
- Managed complex database systems with high availability requirements (99.9% uptime)
Project Lead / Senior DBA
WIPRO & Smart Software Technology
Mar 2000 - Jun 2007
- Provided Tier 3 support for SQL servers hosting mission-critical Microsoft applications
- Managed SQL and Windows clusters for load balancing and high availability
- Developed backup and recovery plans for financial trading systems
- Designed and implemented SQL security policies for highly regulated environments
Strengths
Problem Solving
- Complex system troubleshooting
- Root cause analysis
- Performance optimization
- Incident resolution
Collaboration
- Cross-functional team leadership
- Stakeholder management
- Technical mentoring
- Knowledge transfer
Attention to Detail
- Monitoring and alerting
- SLO/SLI implementation
- Documentation standards
- Quality assurance
Skills
- Cloud: Azure services, Azure Key Vault, Azure AD, Azure SQL Database, Azure Data Lake, Azure Data Factory, Azure Data Explorer, Azure Logic Apps, Azure Cosmos DB
- Databases: SQL Server, Performance Tuning, HADR, Azure Cosmos DB (NoSQL, Multi-API), MongoDB
- Monitoring: Geneva MDM/MDS, Application Insights
- Languages: PowerShell, Kusto (KQL), C#, Python
- Tools: Azure DevOps, Git, Power BI, Grafana
- Containers: Docker, Kubernetes
- Automation: CI/CD, Infrastructure as Code
Projects
Monitoring Guardrails Initiative
Led an enterprise-wide monitoring standardization project, reducing live site incidents by 50% and improving MTTR by 40%.
Geneva Actions Automation
Developed automated compliance workflows that reduced manual intervention by 80% and cut response time from hours to minutes.
Service Maturity Framework
Designed and implemented an SLO/SLI framework for 50+ services, establishing industry-standard reliability metrics.
Education
Bachelor's Degree, Mechanical Engineering
Indian Institution of Mechanical Engineering
1999 - 2002
Recognition
Key Talent Award Winner
Recognized in multiple years for outstanding performance
GOLD STAR Recipient
SQL Server TAP program contributions
Multiple Ship-It Awards
Technical excellence and collaboration