Skip to content
View Tthomas63's full-sized avatar
  • Post Falls, ID | Remote US

Block or report Tthomas63

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Tthomas63/README.md

Tyler Thomas

Senior infrastructure/platform engineer focused on AWS, Kubernetes, Terraform, CI/CD, observability, and production reliability.

I work on cloud infrastructure, deployment automation, distributed worker systems, and operational reliability for production workloads. My strongest experience is in AWS/EKS/ECS, Terraform-managed environments, CI/CD pipelines, Kubernetes operations, observability, and production incident recovery.

Focus Areas

  • AWS infrastructure: ECS, EKS, EC2, IAM, S3, SQS, VPC, ALB, Route53, CloudWatch
  • Kubernetes operations, Helm, IAM/auth troubleshooting, cluster access, and deployment workflows
  • Terraform-managed multi-environment infrastructure
  • CI/CD automation with GitHub Actions and Azure Pipelines
  • Observability with OpenTelemetry, Prometheus, and Grafana
  • Distributed task processing with RabbitMQ, Celery, Redis, PostgreSQL, and worker-based systems
  • Production troubleshooting, incident response, and recovery-oriented engineering

Selected Work Themes

  • Re-architected Celery/Redis queue-processing workflows toward RabbitMQ-backed durable task processing and safer recovery behavior.
  • Led ECS Fargate to ECS EC2 migration, reducing deployment times by ~40% and improving deployment control.
  • Implemented Kubernetes observability tooling using OpenTelemetry, Prometheus, and Grafana.
  • Helped recover production database systems after accidental deletion by coordinating Azure snapshot/backup discovery and restoration.
  • Operated distributed ML competition infrastructure supporting high-volume submissions and long-running worker workloads.

How I Think About Engineering

I like the unglamorous parts of engineering: deployments that can be trusted, queues that recover cleanly, dashboards that answer real questions, and infrastructure that another engineer can safely operate at 2 AM.

Current Target Roles

Senior Platform Engineer · Infrastructure Engineer · Site Reliability Engineer · Cloud Infrastructure Engineer · Senior DevOps Engineer

Contact

Email: mr.tyler.thomas@gmail.com

Pinned Loading

  1. codalab/codalab-competitions codalab/codalab-competitions Public

    CodaLab Competitions

    Python 536 129

  2. gpy_site gpy_site Public

    A Dockerized Django project integrating Steam for authentication and implementation and providing forums, remote rcon access to source servers, and other planned features. Aiming to be an all in on…

    JavaScript

  3. codalab/chalab codalab/chalab Public

    JavaScript 1 4

  4. codalab/codabench codalab/codabench Public

    Codabench is a flexible, easy-to-use and reproducible benchmarking platform. Check our paper: https://hubs.li/Q01fwRWB0

    Python 154 59