Roshan Kakarla

DevSecOps Engineer • SRE • Cloud (AWS/Azure) • Kubernetes

I build secure, reliable cloud platforms with Kubernetes, IaC, CI/CD, and observability. My experience spans AWS and Azure, DevSecOps controls, incident response, and production operations.

United States

Quick Snapshot

  • DevSecOps / SRE with cloud + Kubernetes focus
  • Terraform, CI/CD, Observability, Incident Response
  • AWS + Azure production platform operations

Experience

TransUnion

Jan 2025 – Present

DevSecOps Engineer

AWS Enterprise Platforms – Security & Reliability

  • Designed and operated secure AWS infrastructure (EKS, EC2, IAM, VPC, ALB, CloudWatch) for production workloads.
  • Implemented Infrastructure as Code with Terraform modules and security guardrails to reduce misconfigurations.
  • Embedded DevSecOps controls into CI/CD using Jenkins and GitHub Actions with secure secrets practices.
  • Managed Kubernetes security posture on AWS EKS using RBAC, isolation controls, and secure deployment patterns.
  • Implemented monitoring and observability using CloudWatch, Prometheus, and Grafana for production services.
  • Supported on-call rotations, incident response, root cause analysis, and remediation for reliability/security issues.
  • Enforced least-privilege access and environment isolation across AWS accounts, Kubernetes, and CI/CD pipelines.
  • Authored runbooks, operational docs, and audit-ready standards for compliance and production operations.

Dollar General

Jun 2023 – Dec 2024

DevOps Cloud Engineer

Azure Cloud Platforms – Retail & Supply Chain Systems

  • Built and managed Azure infrastructure using AKS, VNets, Load Balancers, Storage Accounts, and Azure Monitor.
  • Provisioned and standardized cloud resources using Terraform and automation to improve reliability and consistency.
  • Designed and maintained CI/CD pipelines using Azure DevOps and GitHub Actions for automated deployments.
  • Supported Kubernetes on Azure, ensuring cluster stability, deployment readiness, and capacity planning.
  • Implemented monitoring/alerting using Azure Monitor, Log Analytics, and Grafana for faster detection/response.
  • Partnered with application teams to streamline deployments and reduce manual operations.
  • Supported cloud cost optimization via right-sizing, usage analysis, and cleanup of unused resources.
  • Created runbooks and operational documentation for production support and on-call execution.

Procter & Gamble (P&G)

Jan 2022 – May 2023

Site Reliability Engineer (SRE)

On-Prem & Hybrid Enterprise Systems

  • Supported large-scale on-prem and hybrid production systems across Linux and Windows Server with strict SLOs.
  • Automated provisioning and operational workflows using Ansible, Bash, and Python for consistent execution.
  • Built monitoring dashboards using Prometheus and Grafana to improve real-time visibility and alerting.
  • Participated in on-call rotations and led incident response to restore services and reduce downtime.
  • Performed RCA and implemented long-term corrective actions to prevent repeat incidents and improve reliability.
  • Created and maintained runbooks and operational procedures to reduce MTTR and improve response quality.
  • Collaborated with engineering teams on capacity planning, performance tuning, and resilience improvements.
  • Introduced DevOps automation practices into traditional operations to improve stability and speed of change.

Cosmicvent Software Pvt. Ltd.

Jan 2019 – Jun 2021

DevOps Engineer

AWS-Hosted SaaS Platforms

  • Designed and managed AWS infrastructure using EC2, S3, RDS, IAM, and VPC for SaaS environments.
  • Built CI/CD pipelines using Jenkins and Git to automate builds, testing, and deployments across environments.
  • Containerized services using Docker and supported early Kubernetes adoption for consistent deployments.
  • Implemented monitoring and logging using CloudWatch and open-source tools for operational visibility.
  • Supported release management and production deployments with structured change practices.
  • Collaborated with developers to streamline release workflows and promote DevOps best practices.
  • Assisted with infrastructure security, access controls, and environment hardening practices.
  • Maintained documentation including infrastructure diagrams, runbooks, and deployment guides.

Projects

FAITH CHURCH APP

Community events, live sermon streaming, and donations platform using React.js, Firebase, and Flask.

React.js • Firebase • Flask • Cloud Functions

SELF CHECK-IN SYSTEM

Automated check-in and payment system with QR validation and secure payments.

FastAPI • React.js • Stripe • PostgreSQL • AWS

PARKING LOT SLOT RESERVATION

Slot booking with live availability and RBAC for admins/users.

Node.js • Express • React.js • MySQL • JWT

Tech Stack

Cloud Platforms

AWS (EKS, EC2, IAM, VPC, ALB, CloudWatch) • Azure (AKS, VNets, Monitor)

Containers & Orchestration

Kubernetes (EKS/AKS) • Docker • Helm

IaC & Automation

Terraform • Ansible • Bash • Python

CI/CD

Jenkins • GitHub Actions • Azure DevOps

Observability

Prometheus • Grafana • CloudWatch • Azure Monitor • Log Analytics

Security

IAM • RBAC • Secrets Management • Policy Guardrails

Selected Impact

Cloud Reliability

Improved production stability with monitoring, alerting, and incident response practices.

Delivery Automation

Automated delivery with CI/CD and Infrastructure as Code to reduce manual deployments and configuration drift.

Security & Compliance

Implemented least-privilege access and DevSecOps controls across pipelines and clusters.