Buffalo, New York · US Citizen
I build cloud infrastructure
that stays up.
I build and operate AWS infrastructure, Kubernetes platforms, and CI/CD systems that stay up in production. I work across AWS, Amazon EKS, Terraform, and GitOps delivery, and I care about cost, recovery time, and leaving a clean trail behind me.
- AWS
- Kubernetes
- Amazon EKS
- Docker
- Terraform
- GitLab CI
- GitHub Actions
- ArgoCD
- Linux
Close to the workloads, not the slideware
Seven years spent where uptime, cost, and recovery time are things people actually notice.

AWS SAA · CKA
Certified
I am a Cloud DevOps Engineer with 7 years of experience building and running AWS infrastructure, Kubernetes platforms, and CI/CD systems in production. Most of that time has been spent close to real workloads, where uptime, cost, and recovery time are things people actually notice.
Right now I work at VistaJet, operating and supporting global aviation workloads on Amazon EKS. I owned the cutover plan for a zero-downtime EKS migration, cut AWS spend by 18 percent, and brought mean time to recovery down by 40 percent by pulling monitoring and alerting into one place.
I like the parts of the job that are unglamorous but matter: reusable Terraform modules, GitOps workflows that leave an audit trail, runbooks that a tired on-call engineer can follow at 3 a.m. When I want to go deeper on the fundamentals, I build them from scratch in a bare-metal Kubernetes homelab.
Cloud infrastructure
AWS across compute, storage, networking, and IAM, with Terraform describing it so it can be reviewed, versioned, and rebuilt.
Kubernetes platforms
Operating workloads on Amazon EKS at work and building a cluster by hand with kubeadm at home to understand every layer.
Delivery and GitOps
CI/CD on GitLab and GitHub Actions, with ArgoCD delivering Helm releases through dev, staging, and production.
Reliability
Observability on Prometheus, Grafana, and Loki, paged through PagerDuty, with runbooks that shorten recovery.
7 yrs
Cloud & DevOps experience
18%
AWS spend reduced at VistaJet
40%
Mean time to recovery reduced
2
Certifications (AWS SAA, CKA)
Certifications: AWS Certified Solutions Architect - Associate (SAA-C03), Certified Kubernetes Administrator (CKA). Every figure above is drawn from the resume.
The stack, and how I actually use it
Hover, tap, or tab through any tool to see how it shows up in my work. Nothing here is decorative.
Cloud
Where the workloads actually run.Containers & Orchestration
Packaging and scheduling.CI/CD & GitOps
How code reaches production.Infrastructure as Code
Infrastructure you can review.Observability
Knowing what is happening.Security & Platform
Guardrails and access.AWS
Primary cloud across EKS, EC2, S3, RDS, Lambda, VPC, IAM, Route 53, CloudWatch, ALB/NLB, and Secrets Manager.
Descriptions map to specific roles and projects on this page.
Where I have done the work
Three roles, one throughline: making infrastructure repeatable, observable, and cheaper to run.
Cloud DevOps Engineer · VistaJet
Fort Lauderdale, FL (Remote)
Oct 2022 - Present
- Owned the cutover plan for a zero-downtime migration of production workloads from a self-managed Kubernetes platform to Amazon EKS, using blue-green deployments and rollback validation, with no customer-facing downtime.
- Cut AWS spend 18 percent across EC2 and EBS by right-sizing instances, cleaning up orphaned volumes, and rolling out tag-based resource governance.
- Reduced mean time to recovery 40 percent by unifying observability on CloudWatch, Prometheus, Grafana, and Loki with PagerDuty alerting.
- Built GitLab CI/CD pipelines covering automated testing, container builds, Trivy scanning, and progressive deploys to dev, staging, and production.
- Implemented ArgoCD GitOps workflows that deliver Helm releases across environments with version-controlled, auditable change history.
- Built reusable Terraform modules for EKS, VPC, IAM, RDS, and Secrets Manager, with remote state in S3 and DynamoDB and CI-based plan workflows.
- Supported RDS upgrades and disaster recovery for business-critical databases, focused on cutover planning, replication checks, and rollback readiness.
- Partnered with security to remediate Wiz findings and supported SOC 2 audit evidence collection, and authored on-call runbooks and standardized Helm charts.
DevOps Engineer · HCHK Technologies, Inc.
New York, NY
Oct 2021 - Sep 2022
- Cut deployment time 60 percent by building CI/CD pipelines on Docker, Jenkins, and GitHub Actions.
- Automated configuration across 50+ Linux servers using Ansible Tower with role-based playbooks.
- Built and supported AWS infrastructure for data warehouse workloads across VPCs, NAT gateways, and load balancers, contributing to 99.99 percent application uptime.
- Built reusable CloudFormation templates to standardize network and compute provisioning across environments.
- Set up centralized logging and alerting with the ELK Stack integrated to CloudWatch.
- Deployed OpenVPN with LDAP integration for secure remote access and automated S3 snapshot lifecycle for backups.
DevOps Engineer · Chrysler Motors Corporation
Auburn Hills, MI
2018 - Sep 2021
- Managed 200+ AWS servers across development, staging, and production environments.
- Migrated 300+ legacy Jenkins jobs to pipeline-as-code, improving auditability and team handoff.
- Reduced manual operational work 30 percent through Python and Bash automation of routine tasks.
- Maintained CI/CD on Jenkins, Bamboo, Bitbucket, and Nexus, with Chef cookbooks for Tomcat and JBoss provisioning.
- Built optimized Docker images that shrank build times and deployment payloads.
- Contributed to AWS migration and Azure hybrid cloud connectivity, with monitoring on Splunk and Nagios.
Case studies, not screenshots
Three pieces of work I can talk through end to end: what the problem was, what I built, and what it changed.
Homelab v2.5 - Bare-Metal Kubernetes Platform
A 22-phase, production-style Kubernetes platform built by hand on bare-metal Proxmox to understand cluster lifecycle from first principles.
- Kubernetes
- kubeadm
- Calico
- MetalLB
- Longhorn
- cert-manager
- GitLab CE
- Trivy
Zero-Downtime EKS Migration
Moved production workloads from a self-managed Kubernetes platform to Amazon EKS with no customer-facing downtime.
- Amazon EKS
- Kubernetes
- Blue-Green
- Terraform
- Helm
- AWS
GitOps CI/CD and Observability Platform
CI/CD on GitLab with ArgoCD GitOps delivery and a unified observability stack that cut mean time to recovery by 40 percent.
- GitLab CI
- ArgoCD
- Helm
- Trivy
- Prometheus
- Grafana
- Loki
- PagerDuty
A delivery pipeline you can click through
This is the shape of how I ship and run software. Select any stage to see what happens there and how I use it in real work.
swipe to see the full pipeline
Stage 01 / 10
Git (GitLab / GitHub)
Everything starts in version control. Application code, Helm charts, and Terraform all live in Git so changes are reviewed and auditable. At work delivery runs on GitLab and GitHub Actions, and in the homelab I run a self-hosted GitLab CE with a private container registry.
The short version
The highlights a recruiter needs first. The full resume is one click away, and every project on this page backs it up.
Strengths
Hands-on execution
Comfortable owning work end to end, from a Terraform module to a production cutover to the runbook that supports it.
Cost and reliability
Cut AWS spend 18 percent and reduced mean time to recovery 40 percent by making real, measured changes.
Automation first
CI/CD, GitOps, and infrastructure as code so changes are repeatable, reviewable, and auditable.
Production troubleshooting
Unified observability and on-call runbooks that shorten the path from alert to root cause.
Documentation
Runbooks and standardized charts so the next engineer can pick up the work without a handoff meeting.
Continuous learning
Building a bare-metal Kubernetes platform by hand to keep the fundamentals sharp.
Certifications
- AWS Certified Solutions Architect - Associate (SAA-C03)
- Certified Kubernetes Administrator (CKA)
Education
Bachelor of Science in Computer Science
IBAIS University, Dhaka, Bangladesh
Want the full picture?
For the person deciding who to interview
Straight talk, no filler. Here is what you get, and each point maps to something concrete on this page.
I own outcomes, not tickets
I took a zero-downtime EKS migration from plan to cutover to rollback readiness on workloads where downtime is not acceptable.
I make numbers move
18 percent less AWS spend and 40 percent lower mean time to recovery are the kinds of results I look for, and I can explain exactly how each happened.
I automate the boring parts
Reusable Terraform modules, GitLab and GitHub Actions pipelines, ArgoCD GitOps, and Ansible across 50+ servers.
I know the fundamentals
AWS Solutions Architect Associate and Certified Kubernetes Administrator, plus a bare-metal cluster I built by hand to prove it to myself.
I leave a trail
On-call runbooks, standardized Helm charts, and GitOps history so the team is not dependent on one person's memory.
I work well with security
Remediated Wiz findings, wired Trivy into CI, and supported SOC 2 audit evidence collection.
Let's talk about your infrastructure
If you are hiring for a Cloud, DevOps, Platform, or SRE role, I would be glad to walk through my work. The fastest way to reach me is email.
Buffalo, New York · US Citizen
Tuhin Zaman · Cloud DevOps Engineer · tuhinzaman.com