Summary
Overview
Skills
Timeline
Work History
Education
Projects
BusinessAnalyst
Hariharakumar Narasimhakumar

Hariharakumar Narasimhakumar

Summary

Highly motivated Site Reliability Engineer with 4+ years of experience in the network automation space. Proficient in identifying problems and delivering long lasting solutions aligned with customer requirements. Adept at ensuring service reliability, managing production incidents, and minimizing operational overhead in the long run.

Overview

9
9
years of professional experience

Skills

  • Programming: Python, Go lang, Bash/Shell scripting, ReactJS, Git, Mercurial, MySQL, Microsoft SQL Server
  • Change Management/Monitoring: Ansible, Puppet, Netconf, Grafana, Prometheus, InfluxDB, Gnocchi
  • Virtualization Technologies: OpenStack, Docker, Kubernetes
  • Softwares: Microsoft Visual Studio, Putty, Cisco Packet Tracer, Wireshark, Oracle VM VirtualBox, Matlab
  • Platforms: Cisco, Arista, Junos, Open Vswitch, Windows, Linux (CentOS, Ubuntu)
  • Networking Protocols: TCP/IP, UDP, OSPF, BGP, IPv6, DHCP, DNS, OpenFlow
  • Certifications: CCNA: CSCO12943208

Timeline

Production Engineer

Meta Inc.
07.2019 - Current

Senior Systems Engineer

Infosys Ltd.
06.2014 - 06.2017

Masters in Telecommunication Engineering -

University of Colorado Boulder

Bachelor of Engineering (B.E) in Electronics and Telecommunication -

University of Mumbai

Work History

Production Engineer

Meta Inc.
07.2019 - Current

Role and Responsibilities:

  • Authored and reviewed code for highly scalable and reliable services that build and maintain the Meta network infrastructure
  • Ensured high reliability of core networking services as a first responder in periodic on call rotations
  • Leading XFN collaboration and involved in technical discussions with partners that define long-term direction for Tier 0 services
  • Mitigated large-scale production incidents, authored incident reports, and created follow up tasks around future detection, resolution, and prevention of similar incidents
  • Demonstrated initiative and took ownership as SME of critical services, improving their reliability, and driving projects to completion


Projects Overview:

  • Revamped the Zero Touch Provisioning (ZTP) pipeline and raised provisioning failures directly to end users, reducing the framework team's operational load by 90%
  • Improved service health visibility of the Network traffic draining service by adding alerting around failures. Upheld reliability of the service by fixing deep-seated, chronic issues in the codebase and owning follow ups for incidents
  • Measured oncall load by building statistical dashboards around alert flakiness, duration, and frequency. Improved operational efficiency by reducing frequently firing alerts fired by 30% and decreasing oncall task backlog by 82%
  • Implemented a solution across multiple systems to enable end users to run more granular health checks relevant to the upgrade being performed, improving deployment success rate by 15%
  • Designed a tool in Go to simplify developer testing of new or modified provisioning workflows for network devices
  • Designed and implemented a continuous testing framework to run provisioning workflows on lab devices using Python asyncio and escalate failures to device owners.

Senior Systems Engineer

Infosys Ltd.
06.2014 - 06.2017
  • Supported a high traffic web application used in daily operations for a global logistics company
  • Provided solutions for technical issues involving complex logic and communicated resolution to customers in business terms
  • Implemented code enhancements in C#, Visual Basic and ASP.NET framework to fix functionality and operational issues.

Education

Masters in Telecommunication Engineering -

University of Colorado Boulder
USA
05.2019

Bachelor of Engineering (B.E) in Electronics and Telecommunication -

University of Mumbai
India
05.2014

Projects

Capstone Project (Mentored by Equinix)

  • Analyzed the performance characteristics and networking overhead of Virtual Network Functions (VNFs) from different vendors in OpenStack using Prometheus, Grafana, and InfluxDB.
  • Co-authored and published the findings in the journal Information Technology in Industry (ITII).

System Design

  • Implemented a Distributed File System (DFS) using UDP Socket programming with GET, PUT, RENAME, LIST and EXIT functionality.
  • Implemented client-server communication using UDP to transfer files reliably with acknowledgment and data encryption.
  • Built a HTTP-based web server that handles multiple simultaneous requests with caching and prefetch functionality.
  • Designed a Content Delivery Network with fault-tolerant applications using Docker as containers, and Kubernetes for orchestration.

Network Design and Management

  • Designed an IPv4 and IPv6 Service Provider backbone network and configured policy-based routing using BGP path attributes.
  • Built a Web UI app using Flask to automate rerouting of traffic using dynamic network configuration.
Hariharakumar Narasimhakumar