Summary

Experience

Senior SRE Manager
Customer Reliability Engineering (C+E) | Microsoft
Oct 2020 - Present
  • Lead and manage cross-functional SRE teams following Microsoft's core priorities: model, coach, and care
  • Led cross-functional collaboration between software architects, product managers, and a team of developers to define and implement KPI-driven alerting—built and deployed an automated Alerting Module that proactively notifies the team when any KPI value turns red, elevating data quality monitoring and incident response
  • Manage ACOM and Tenant Management online services with dedicated sub-teams, serving 100M+ users
  • Spearhead hiring and development initiatives for SRE teams across CRE organization
  • Contribute to Monitoring Guardrails project, reducing live site incidents by 50%
  • Design solutions for production and Sovereign cloud service parity assessment
  • Develop Geneva Actions reducing human intervention in compliance services by 80%
Senior Engineering Service Engineer - Focused on the live site management, Defining SLI/SLO and managing the data pipelines.
Business Application Group & Consumer Sales Marketing | Microsoft
Jun 2017 - Oct 2020
  • Focused on live site management and data pipeline optimization for customer-facing services
  • Defined and implemented SLO/SLI metrics improving service reliability to 99.9%
  • Developed and managed Azure Data Factory pipelines processing 10TB+ daily data
  • Led testing and execution of business continuity and disaster recovery plans
  • Established comprehensive monitoring using Power BI, Lens, and Geneva tools
Senior Engineering Service Engineer - Focused on Telemetry migration, developing POCs in Azure
Visual Studio Online | Microsoft
Jun 2014 - Jun 2017
  • Led transformation of service monitoring from legacy tools to Geneva (MDM/MDS)
  • Served as SME for Geneva and Kusto onboarding across multiple product teams
  • Enhanced HADR measures achieving 99.95% uptime for critical services
  • Developed POCs for migrating telemetry data to Big Data solutions (HDInsight, MongoDB)
  • Implemented abnormality detection reducing false positive alerts by 60%
Senior Engineering Service Engineer – SQL Systems Redesign
Sales and Marketing IT (SMIT) | Microsoft
Apr 2012 - Jun 2014
  • Collaborated with Dev, Test, and PM teams to redesign www.Microsoft.com and Content Management SQL systems
  • Participated in architectural discussions and deployed backend services for SMIT
  • Supported Azure onboarding through migrating on-premises SQL to Azure IaaS and PaaS
  • Led technical adoption engagements with SQL product teams during Azure migration efforts
  • Ensured SQL consistency across all team instances through structured governance
  • Designed and implemented SQL Availability solutions to boost system robustness
Engineering Service Engineer II - Database Administrator - managing the SQL systems.
EPX Product Group | Microsoft
Jun 2007 - Apr 2012
  • Group DBA and Database Architect for MSDN and TechNet serving 50M+ developers
  • Point of escalation for database issues across the product group
  • Collaborated on v-next architecture and implementation with product teams
  • Managed complex database systems with high availability requirements (99.9% uptime)
Project Lead / Senior DBA
WIPRO & Smart Software Technology
Mar 2000 - Jun 2007
  • Provided Tier 3 support for SQL servers hosting mission-critical Microsoft applications
  • Managed SQL and Windows clusters for load balancing and high availability
  • Developed backup and recovery plans for financial trading systems
  • Designed and implemented SQL Security policies for highly regulated environments

Strengths

Problem Solving

  • Complex system troubleshooting
  • Root cause analysis
  • Performance optimization
  • Incident resolution

Collaboration

  • Cross-functional team leadership
  • Stakeholder management
  • Technical mentoring
  • Knowledge transfer

Attention to Detail

  • Monitoring and alerting
  • SLO/SLI implementation
  • Documentation standards
  • Quality assurance

Skills

  • Cloud: Azure services, Azure Key Vault, Azure AD, Azure SQL database, Azure Data Lake, Azure Data Factory, Azure Data Explorer, Azure Logic Apps, Azure Cosmos DB
  • Databases: SQL Server, Performance Tuning, HADR, Azure Cosmos DB (NoSQL, Multi-API), MongoDB
  • Monitoring: Geneva MDM/MDS, Application Insights
  • Languages: PowerShell, Kusto (KQL), C#, Python
  • Tools: Azure DevOps, Git, Power BI, Grafana
  • Containers: Docker, Kubernetes
  • Automation: CI/CD, Infrastructure as Code

Projects

Monitoring Guardrails Initiative
Led enterprise-wide monitoring standardization project, reducing live site incidents by 50% and improving MTTR by 40%.
Geneva Actions Automation
Developed automated compliance workflows reducing manual intervention by 80% and improving response time from hours to minutes.
Service Maturity Framework
Designed and implemented SLO/SLI framework for 50+ services, establishing industry-standard reliability metrics.

Education

Bachelor's Degree, Mechanical Engineering
Indian Institution of Mechanical Engineering
1999 - 2002

Recognition

Key Talent Award Winner
Multiple years recognition for outstanding performance
GOLD STAR Recipient
SQL Server TAP program contributions
Multiple Ship-It Awards
Technical excellence and collaboration