Resume / CV


Professional Background

Aaron Cupp

Summary

Results-driven Infrastructure & Site Reliability Engineering Manager with a diverse background in Cloud Computing, DevOps Practices, and Incident Management and Agile Practices. With a decade of experience working remotely and a strong ability to self-manage, I am poised to excel in a leadership role within the Infrastructure and Site Reliability Engineering fields. I am passionate about optimizing system performance, driving efficiency through automation, as well as building and managing high-performing remote teams. My dedication to continuous learning and my technical expertise makes me an asset in driving reliability, scalability, and operational excellence.

Experience

SRE Manager, Sight Machine - San Francisco, CA (Remote Position) 2021- 2023

  • Promoted to Manager of SRE in November 2021.
  • Member of Company Security Response Team and Engineering Technology Leads Team.
  • Built and managed a team of 6 SRE professionals, ensuring 24/7 coverage across multiple time zones.
  • Advocated for Infra-As-Code and GitOps practices, driving the adoption of related technologies within the team.
  • Coached team members on Terraform, cloud environments, scripting, documentation changes, and on-call processes.
  • Developed and implemented standardized processes for SRE onboarding and on-call responsibilities.
  • Implemented processes that resulted in achieving 99.98+% uptime in production environments.
  • Recognized with the Golden Gear patch for saving the company $5000/month on infrastructure bills.

Senior Infrastructure Engineer, Sight Machine - San Francisco, CA (Remote Position) 2021

  • Hired as Senior Infrastructure Engineer in August 2021 and promoted to Lead SRE in September 2021.
  • Sole full-time employee responsible for day-to-day operations of Sight Machine’s technology stack in a multi-cloud environment.
  • Collaborated with the Company Security Response Team and Engineering Technology Leads Team.
  • Developed and implemented a standardized process for SRE on-call duties and onboarding new team members.

DevOps Engineer, JUUL Labs - San Francisco, CA (Remote Position) 2017-2019

  • Provided 24/7 on-call support for all e-commerce systems and primary support for Windows systems used for the Acumatica ERP system.
  • Implemented Log Shipping-based clustering for SQL Server Enterprise to support expansion into multiple regions.
  • Developed Infrastructure-as-Code using Terraform modules.
  • Successfully migrated legacy e-commerce systems from PAX Labs to JUUL Labs post-company split.

DevOps Engineer, PAX Labs - San Francisco, CA (Remote Position) 2016-2017

  • Principal DevOps hire responsible for implementing a streamlined deployment process using Fabric and Jenkins.
  • Implemented automated paging for system and application-related issues through PagerDuty.
  • Scaled AWS presence from 4 to over 100 systems to accommodate company growth.
  • Managed system patching for security and maintenance and created documentation for new processes.

Systems Engineer, study.com - Mountain View, CA 2014-2016

  • Led the migration of cloud systems from Rackspace to AWS, scaling the infrastructure from 10 to over 300 systems.
  • Provided 24/7 on-call support for all company systems.
  • Mentored Junior Systems Engineers on Team
  • Contributed to the development of a Disaster Recovery plan for the company’s data warehouse project.
  • Implemented Lifecycle Management using Puppet, PuppetDB, and Foreman for all AWS systems.
  • Created comprehensive documentation for new systems.

Senior Linux Administrator, Meta Interfaces LLC – San Francisco, CA 2012-2014

  • Managed day-to-day operations of the datacenter for streaming video systems.
  • Implemented monitoring solutions using PagerDuty and Nagios.
  • Implemented virtualization using KVM and libvirt, replacing aging physical infrastructure.
  • Managed CISCO and F5 BIGIP infrastructure for WAN and LAN networking.
  • Pioneered the company’s initial AWS presence as a proof-of-concept for future cloud migration.

Computer Technician II, Elk Grove USD - Elk Grove, CA 2005-2012

  • Managed datacenter operations and wrote documentation for various secure document production tasks.
  • Assisted with the implementation of district virtualization using VMWare.

Systems Administrator, Groundworks Environmental - Folsom, CA 2004

  • Set up and maintained client-server network, Microsoft AD, IIS, and various applications.
  • Developed critical “Grainsized Log” map plotting software for company geologists using VBA and AutoCAD.

Education

  • AWS Summit, San Francisco, CA - AWS Operations Bootcamp - 2013
  • Modesto Junior College, Modesto, CA - Associates of Science, Computer Science - 2004
  • Modesto Junior College, Modesto, CA - Computer Programming Certification - 2003
  • Grace M. Davis High, Modesto, CA - Diploma with Honors – 1999

(Last modified on September 28, 2023)