
ARJUN KANNAN

Site Reliability Engineer
Chennai

Summary

Professional engineer with a strong foundation in system reliability and optimization. Known for delivering robust solutions that improve system performance and reduce downtime. Collaborative team player focused on achieving results and adapting to changing environments. Skilled in automation, incident management, and continuous improvement.

Experienced in maintaining and improving system reliability through automation and proactive monitoring. Applies problem-solving skills to troubleshoot and resolve complex issues efficiently. Knowledgeable in infrastructure optimization and works collaboratively to enhance system performance and reliability.

Overview

5 years of professional experience
1 Certification

Work History

Site Reliability Engineer

Fidelity Investments
05.2022 - Current
  • Managed and supported production environments using Kubernetes, ensuring high availability, scalability, and reliability of containerized applications across multiple clusters, as well as on-prem applications.
  • Monitored system performance and health through observability tools such as Datadog, Grafana, and Splunk, setting up real-time alerts to proactively address issues and minimize downtime.
  • Implemented and maintained Kubernetes clusters, including configuration, deployment, and scaling of microservices, improving operational efficiency and system resilience.
  • Automated infrastructure provisioning using Ansible, reducing manual intervention and enhancing infrastructure consistency and scalability.
  • Led incident management processes, troubleshooting and resolving production issues in real-time, conducting root cause analysis, and implementing preventive measures.
  • Collaborated with development and operations teams to ensure smooth CI/CD pipeline integration, optimizing deployment processes and minimizing production disruptions.
  • Established and maintained observability frameworks, providing detailed monitoring dashboards and log aggregation to track system performance and infrastructure health.
  • Contributed to the implementation of Site Reliability Engineering (SRE) practices, improving system reliability and reducing operational overhead through automation and proactive monitoring.
  • Followed ITIL best practices for incident, change, and problem management to ensure effective and efficient resolution of production issues while adhering to service level agreements (SLAs).
  • Provided ongoing support for cloud-based infrastructure (AWS), managing scaling, provisioning, and configuration tasks to ensure optimal system performance.
  • Implemented cost-saving measures by optimizing resource utilization across cloud-based infrastructure environments.
  • Conducted root-cause analyses after major incidents to identify areas for process improvement or technical enhancement opportunities.

Associate Software Engineer

DXC Technologies
07.2020 - 05.2022
  • Teamed with business analysts to deliver high-availability solutions for mission-critical applications.
  • Performed system analysis, documentation, testing, implementation and user support for platform transitions.
  • Handled change management and incident management under the ITIL framework.
  • Developed a good understanding of infrastructure and expertise in analyzing and resolving production-related issues.
  • Resolved or escalated problem tickets to address user issues.
  • Supported application delivery, deployment, and scheduling, and provided training and mentoring.
  • Monitored and escalated production issues and handled bridge calls.
  • Checked configuration files and logs to uncover root causes of problems.
  • Served as change manager, reviewing and approving changes.
  • Handled incident management and provided L1 and L2 support for other teams.
  • Monitored email 24/7 to address or escalate client issues.
  • Sent outage notifications to all stakeholders.

Education

Bachelor's - Computer Science and Engineering

M. Kumarasamy College of Engineering
Karur, India
01.2020

High School Diploma

Mount Zion Matric Higher Secondary School
Pudukkottai, India
01.2016

Skills

Production Support


Certification

Certified Kubernetes Administrator

Timeline

Certified Kubernetes Administrator

07-2025

Site Reliability Engineer

Fidelity Investments
05.2022 - Current

Associate Software Engineer

DXC Technologies
07.2020 - 05.2022

Bachelor's - Computer Science and Engineering

M. Kumarasamy College of Engineering

High School Diploma

Mount Zion Matric Higher Secondary School