Hello, I am
Matthew Butch
IT Director

I have over 10 years of directing infrastructure and automation teams for large and small enterprises in multiple sectors, including teams with leaders and managers. My experience includes leading troublesome teams to success in multiple instances, saving companies hundreds of thousands of dollars per year by implementing automation and improving processes, and implementing complex cross-functional projects to improve operating efficiency.

In addition, my technical knowledge and skills cover the entire technology stack, including server engineering, network engineering, and software development. This includes Ansible, Linux, Windows, Kubernetes, Ruckus, VMware, ServiceNow, Pure, Nginx, MySQL, PHP, Bash, Python, Git/GitHub/GitLab.

I live in Coatesville, PA, 19320 within easy commuting distance to Philadelphia & Wilmington, and their surrounding areas, and I have worked in-office, hybrid, and remote jobs, including ones with travel requirements.

Personal Info

  • nameMatthew Butch
  • phone717-314-4789
  • locationCoatesville, PA 19320

Projects

Go to next/previous page

 

This page lists a selection of major projects I've lead, with a short summary for each. More information is available on request.

  • New Automated Server Deployment
    2024-2025

    When I joined there was an ongoing project to use Ansible Tower (now called AAP) for server deployment, configuration management, and lifecycle management. It has already been going on for 3 years without any end in sight. When I took over I split the project up into phases and started delivering production value. Within 6 months Phase 1 was complete. Phase 2 took another 6 months, and Phase 3 just 3 months, where we started deploying virtual machines with the new process. Phase 4 is now complete, covering Physicals. The project is now considered successful after much delay.

  • CentOS Upgrade
    2024

    We were running 3000 CentOS servers, which was going end of life in June 2024. I lead and managed the project to upgrade. We used ServiceNow Audit Tests and our own framework called User Actions to have every server manager and/or owner declare what they wanted to do with them. For some groups of our business, we took a white glove approach to ensure smooth upgrades, for others we just managed the actions they declared (upgrade, decommission, or temporary exception). I took the opportunity to marry our new Automated Server Deployment process By the end of 2024, 2500 servers were upgraded, with the rest finishing up in early 2025.

  • Department Reoganization
    2023

    Originally the Linux Group team was composed of 3 "teams", the Transformation Team of 3 members (dedicated to automation primarily), the Operations Team of 10 members, and the DQ team of 6 contractors (primarly charged with manual server upgrades). This was inefficient, so I reorganized them into 4 generally equal teams and moved our team lead from the Tranformation Team to a Global team reporting directly to me. The one Supervisor took the Transformation Team, still of 4 members with 2 moved from the Operations Team, split the Operations Team into the Operations & Automation Europe/Asia (EA) Team and the Operations & Automation Team US, segmented by region, and charged with both day to day operations (incidents and requests) as well as automating their work. One member of the DQ team was moved to the Operations & Automation EA team as well, and the DQ team otherwise remained the same, though more now focused on running automations and fixing incidents.

    The teams, and the department as a whole, is now operating more smoothly and efficiently.

  • GitLab CI/CD Implementation
    2023

    We had tests of our application, but while we could run them all at once, they were not automated. I took the tast of using GitLab's CI/CD infrastructure coupled with Ansible for the actual running of the tests. The pipeline would spin up a new DigitalOcean server, deploy our OS configuration and Application, run the tests, and send the passes and fails back to GitLab. Now every commit and merge request made could be fully tested and ensure we were deploying safe code.

  • New Switch Infrastructure
    2022

    We were running old Netgear switches without any HA so I planned a replacement using proper enterprise switches. I evaluated various vendors and went with Ruckus, which had a good price and also great Ansible integration - a key requirement. These switches had 100G backbone connections, and 25G server connections, which was great future proof for our infrastructure. We purchased 2 core switches and 8 top of rack switches, along with the cables match. After purchasing, we planned the installation at our datacenter. This involved where to install the switches in the rack, what cables/adapters we'd need, and how we would do the migration. From there, we followed the plan, and slowly migrated servers. The migration was a complete success, and the additional bandwidth

  • Server Upgrades
    2021-2022

    We were running some pretty old SuperMicro servers with SATA HDDs. It was time to look to modernize our server fleet. We researched a bunch of different options, and settled on new SuperMicro Servers running AMD EPYC processors, that provided lots of PCIe lanes to enable running 24 NVMe SSD drives. We ordered one as a test to see how it would perform with our application, and after successful tests, we ordered enough to replace half hour server fleet. We then began the task of putting them into production, which involved racking and stacking, automating physical deployment of our OS and Application stack, and then switch our application to use them. The whole process was a complete success and the performance of our application increased 10fold.

  • New Production Infrastructure
    2020

    We have been planning a migration of our major internal web application (used by everybody in the business from sales agents in the field to accounting) from virtual machines to Kubernetes pods. We have some small applications and all development environments running on two Kubernetes clusters in two self-hosted datacenters, and we were ready to move the major web application. This had a deadline for that of December 31st, 2020, which meant the infrastructure had to be read by October 31st, 2020. We began our project early July 2020.

    With this charge, I proceeded to design a brand new production infrastructure. This required deciding on and purchasing new servers and confirming we had enough networking and electrical resources. This became three subprojects involving three teams.

    We first evaluated server venders and selected the final brand (Dell) with the server count (22 total, 11 per datacenter). Then we planned the network infrastructure as we had older Cisco switches and would not have enough ports to use all the new servers. We again evaluated switch vendors and selected the final brand (Cisco) and setup (2 core switches with 2 top of rack switches). With the hardware selected, we evaluated the current electrical capacity and the required electrical capacity. We determined we had enough main and generator power, but not enough UPS power. So we selected a new bottom of rack UPS to match the requirements.

    Next I presented the entire plan to the VP and after his approval, the CIO. The plan was approved, and we made final revisions with our vendors and placed the order. With everything ordered, I planned the installation for all three subprojects. This involved coordinating delivery of servers, switches, cables, SFPs, racks, PDUs, and UPS's. I also had to coordinate the electrical circuit for the new UPS with both the UPS vendor and our electrical contractor.

    I assigned tasks to different teams and different members of those teams.I kept track of delivery of all physical items, as well as installation and configuration of all the devices. Once everything was installed and set up (easier said than down - the teams did a lot of work, and I pitched in to connect cables and install servers), the final set was configuration of servers, switches, UPS's, and PDUs. I supervised this work and helped the teams overcome any obstacles, either by making decisions to move the work along, or jumping in to configure settings and troubleshooting issues.

    This took about four months from start to finish and we hit our deadline with two weeks to spare. The new infrastructure was a complete success.

  • Department Reoganization
    2020

    Our company is divided up into Service Groups (mine: Business Technology), then Divisions (mine: Delivery), then Departments (mine: SRE), then teams (mine: Networking/Security & Server/Storage).

    Originally the Server/Storage team was composed of 6 members, but that size was leading to ineffiencies and slowdowns in coordiations and discussions. So a decision was made to split that team up into a Support Server/Storage team and an Architect Server/Storag team.

    This involved selecting the members for each team from the original team, planning out the scope of duties for each team, and communicating to the teams and the rest of the Service Group.

    The teams, and the department as a whole, is now operating more smoothly and efficiently.

  • SAN Addition
    2019

    We wanted to add fast storage, as our existing solution was slow and support was lacking.

    I lead the department that created an evaluation process, evaluated two onsite solutions, and made the proposal to upper management on which solution to select. We selected a Pure Storage FlashArray. This is a highly performant all-flash iSCSI storage solution with an amazing API.

    Once the decision was approved, I lead the department that implemented FlashArray. This involved cross team coordination, as it involved both physically setting up the devices at two datacenters, connecting them to our network, and connecting VMware and Kubernetes servers to the devices.

    The project was a success and the new storage has opened up a lot of options for our company.

  • New Infrastructure
    2018

    Our old ecommerce infrastructure was hosted on Rackspace. This was not a very performant environment, nor was it cost effective. I took a detailed survey of alternative solutions and decided due to cost, control, and performance that self-hosting was the best solution.

    I proceeded to design a brand new infrastructure. This required securing a data center and internet service providers. We hosted with Netrality in Philadelphia, and set up connections with Cogent and Hurricane Electric.

    Next was to design the network infrastructure. We went with pfSense appliances for internet routing and firewalling. There would be 10Gb connections (all HA'd) to the network switches and servers. There were two sets of switches, one for internal networking and one for iSCSI traffic.

    After the network was decided, I planned the server infrastructure. There would be three hosts for virtualization and four database servers, along with a SAN.

    I planned the configuration, with Ansible, and the migration. We switched from our old infrastructure to our new infrastructure with zero issues. The new infrastructure was five times as fast as the old one. It also saved us $200,000 per year.

    This took about four months from start to finish. The migration was a complete success.

  • Site Merger
    2017 - 2018

    Our company merged with another company, and this required merging our ecom sites. Because the other company outsourced theirs, but used the same platform as ours (Magento), we decided to pull their content into ours.

    This required a detailed project plan involving many people. I personally set up the server configurations, and managed the developers who integrated the code and the content (products, inventory, photos, and site UI).

    This took about four months from start to finish. The integration was a complete success, with very minor issues.

  • Website Rescue
    2015-2017

    I took over running a major retail ecommerce website. The site could barely handle a few hundred users due to the poor configuration of the servers, and it was lucky to stay up most days. It was also insecure. I evaluated all parts of the stack: firewall, load balancer, web servers, database servers, and caching servers.

    I optimized and secured each different piece, and added configuration management to the setup. In a month the site was stabilized, and I continued to optimize the configuration and added additional servers as needed. By six months the site rarely went down even under high load. After one year of changes, the site could handle thousands of users during product releases. I also saved hundreds of thousands of dollars and countless human hours. And with the improved security, the site was much less vulnerable.

  • Firewall Upgrades
    2015

    I upgraded four SonicWall firewalls in two locations. This required taking the website offline, which is a major deal for an ecommerce website. I configured all four SonicWalls (two each site for HA) to our specification. In one location I removed the old ones, installed the new ones, and ensured everything worked. For the other location which was many miles away, I managed a contractor to do the physical removing and installing, and then I ensured it all worked. I lead this project from start to finish.

  • Network Reconfiguration
    2014

    I reconfigured our network for PCI compliance and created 4 different networks in 4 locations (Development, Testing, Production Datacenter 1, Production Datacenter 2. This required reconfiguring Force10 switches, EqualLogic SANs, ESXi hosts, vCenter, and SonicWall firewalls. I lead this project from start to finish.

  • Switch and SAN Upgrade
    2014

    I upgraded two Force10 switches (a 1 gigabit and a 10 gigabit one) and an EqualLogic SAN for our Development and Testing Environment. This required configuring both switches differently for the two environments and purposes (iSCSI and VM traffic). I lead this project from start to finish.

  • Upgrading vSphere from 4.1 to 5.1
    2013

    I upgraded two clusters from 4.1 to 5.1. This also included creating two new vCenter VMs and migrating the database.

  • SRM Installation and Configuration
    2013

    After upgrading and creating two new vCenter VMs, I had to recreate the Site Recovery Manager setup. This involved configuring two EqualLogic pools, setting up SRM on both vCenters, and testing out the recovery plan.

  • Upgrading all wireless access points from 3COM to Ruckus
    2012 - 2013

    All four buildings had a limited 3COM wireless deployment. We upgraded the over the span of a few years to Ruckus. This involved a new controller, configuration of new and old switches, contracting out and supervising the physical access point install, configuration of the controller and APs, and finally testing.

  • Upgrading all Windows Server 2003 and Exchange 2007 to Windows Server 2008R2 and Exchange 2010
    2012

    We had physical and VM servers, and I planned a migration of all services (Exchange, AD, File, Print, Financial, Lync). I proceeded to make new VMS and migrate the services, all with minimum downtime and impact to our users.

  • Reconfiguring entire network
    2011 - 2012

    I created a plan to move, upgrade, and retire switches, as well as rewire all the network closets. This was a time intensive task. I removed 43 switches that we no longer needed to due wireless coverage. I moved switches around for better reliability. I upgraded switches for better performance. I reconfigured the entire network for better reliability and performance.

Resume

Go to next/previous page

Skills

  • People Management
  • Project Management
  • Server Management
  • Network Management
  • Client Management
  • Customer Service

Experience

  • Director of Linux Infrastructure - Synopsys | Sunnyvale, CA (Remote)
    May 2023 - present

    Operating in the software simulation industry, a publicly held organization employing 30,000 people. Recently acquired Ansys, which is where this position started.

    Responsibilities

    • Managing a global (US, Europe, and Asia) team of two supervisors with 13 full time and 5 contract system engineers organized in four teams, who are responsible for all aspects of 3,000+ virtual and physical Linux servers
    • Planning and implementing infrastructure and automation improvements in multiple global datacenters, allowing thousands of internal developers to request and use Linux servers matching their specifications
    • Managing and leading multiple simultaneous cross functional projects and initiatives, ensuring successful implementation

    Accomplishments

    • Took troubled and long delayed project to implement Ansible Tower for automated server deployment and configuration managed and after reorganzing the project, started delivering incremental value after 6 months and full production after a year
    • Lead and managed the upgrade of over 3,000 CentOS servers using ServiceNow and Ansible Tower and ensured minimal disruption during the process, with 95% of the servers upgraded in 6 months
    • Reorganized the Linux Group from an unorganized team to to 4 equal teams, with one dedicated to Automation Architecture, two dedicated to Operations & Automation , and one to Server Deployment, increasing efficiency across the Group
    • Technologies used: Linux, Bash, Ansible, AAP, Git, ServiceNow
  • Manager of Infrastructure - Fastmail | Philadelphia, PA (Hybrid)
    January 2021 - May 2023

    Operating in the SaaS industry, a privately held organization employing 50 people.

    Responsibilities

    • Managed a US/Australian team of 4 full time system engineers responsible for all aspects of the company infrastructure, including routing, switching, servers, & storage
    • Planned and implemented infrastructure and automation improvements in multiple US datacenters, while ensuring no data loss and downtime for customers
    • Managing and leading cross functional projects and initiatives, ensuring successful implementation

    Accomplishments

    • Designed and implemented GitLab CI/CD for our application tests, which created a DigitalOcean cloud server, ran the tests, and reported the results back to the Merge Requests, which increased our developer efficiency and reduced bugs
    • Designed and implemented a new switch infrastructure in our primary datacenter, moving from Netgear small business switches to enterprise grade Ruckus switches with a 100G backbone and 25G server connections, which improved speeds, reliability, and management efficiency
    • Designed and lead the implementation of a new server standard, moving from ones running SATA drives to ones running NVMe across all datacenters, which provided vastly more space but also vastly more performance while incurring no downtime or data loss for our customers
    • Technologies used: Linux, Bash, Ansible, Git, NGINX, MySQL
  • SRE Manager - Power Home Remodeling | Chester, PA (In-office)
    July 2019 - January 2021

    Operating in the construction industry, a privately held organization employing 2000 people.

    Responsibilities

    • Managed a US/Brazilian team of 9 full time system engineers and network engineers organized into three teams, who were responsible for all aspects of the company infrastructure, including routing, switching, servers, and storage
    • Planned and implemented infrastructure and automation improvements in multiple US datacenters, while ensuring no data loss and downtime for users
    • Managing and leading cross functional projects and initiatives, ensuring successful implementation

    Accomplishments

    • Designed and lead the implementation of a new application infrastructure, moving from VMware VMs to Kubernetes pods running on new Dell servers with new core & edge switches to handle the extra network ports needed, which after migrating the application lead to faster and more reliable deployments, and substantial better performance
    • Reorganized the department from a flat group of nine engineers to three teams of three engineers each (operations SRE, architectural SRE, and network/security SRE), which improved operating efficiency and team morale
    • Researched and lead the implementation of a new SAN array using Pure Storage FlashArray in our datacenters for our primary storage device, which lead to better performance and management
    • Technologies used: Linux, Bash, Ansible, Git, NGINX, MySQL, Kubernetes, VMWare
  • Director of Development and Infrastructure - DTLR | Philadelphia, PA (On-site)
    June 2017 - July 2019

    Operating in the retail industry, a privately held organization employing 5000 people. Acquired Villa during the tenure of the Manager position, and was promoted up.

    Responsibilities

    • Managed a team of software engineers, system engineers, and project managers who were responsible for all aspects of the company e-commerce site infrastructure, including development, integration, routing, switching, servers, and storage
    • Planned and implemented infrastructure and automation improvements in on-prem and cloud environments while ensuring no downtime and high performance for our e-commerce customers
    • Managing and leading cross functional projects and initiatives, ensuring successful implementation

    Accomplishments

    • Planned and lead the implementation of a new hosting infrastructure (including firewalls, routing, switching, servers and storage), moving from Rackspace to an colocated datacenter with Azure for overflow, which vastly improved management and performance, while saving around $200,000 per year
    • Collaborated and lead the integration of our e-commerce site from Villa to DTLR fulfillment system, which lead to a successful integration of the companies
    • Lead the implementation of converting our e-commerce site from Magento 1 to Magento 2, including a re-write of all our custom code for the new version, which lead to a better site experience for customers and internal users
    • Technologies used: Linux, Bash, Ansible, Git, NGINX, MySQL, PHP, pfSense, Dell
  • Manager of System Engineering - DTLR | Philadelphia, PA (On-site)
    November 2016 - June 2017

    Worked under Villa until Acquired by DTLR.

    Responsibilities

    • Managed a team of software engineers who were responsible for developing the company e-commerce site
    • Planned and implemented infrastructure and automation improvements in a cloud environment while ensuring no downtime and high performance for our e-commerce customers
    • Managing and leading cross functional projects and initiatives, ensuring successful implementation

    Accomplishments

    • Overhauled our development process, which included a development site for testing, implementing code reviews, and automation for pushing to production, which greatly improved developer productivity and efficiency
    • Upgraded all servers to the recent version of Ubuntu, which required migrating site data, databases, and DNS entries to avoid downtime, and provided for a better and more secure base for our e-commerce site
    • Technologies used: Linux, Bash, Ansible, Git, NGINX, MySQL, PHP, Rackspace Cloud
  • Senior System Engineer - DTLR | Philadelphia, PA (On-site)
    November 2015 - November 2016

    Worked under Villa until Acquired by DTLR.

    Responsibilities

    • Planned and implemented development and infrastructure improvements in a cloud environment while ensuring no downtime and high performance for our e-commerce customers
    • Automated server configurations for web, mail, database, and other services
    • Collaborated with other teams in day to day operation

    Accomplishments

    • Rescued the e-commerce website as the first project, which could barely handle a few hundred users (despite having thousands trying to access) and was hand maintained, and instead used Ansible to configure, and also optimized the configuration, so that after a month the site was stable and after six months could handle the standard thousands of users, all while saving $200,000 per month on hosting costs
    • Automated the spinning up of additional servers to handle overflow, whereas before it was hand created and configured, which allowed handling of massive increases of traffic flow
    • Technologies used: Linux, Bash, Ansible, Git, NGINX, MySQL, PHP, Rackspace Cloud
  • Senior System Administrator - Webstaurant Store | Lititz, PA (On-site)
    October 2014 - November 2015

    Operating in the retail industry, a privately held organization employing 1000 people.

    Responsibilities

    • Lead team of four web system administrators which was responsible for the company’s e-commerce sites in two US datacenters
    • Planned and implemented development and infrastructure improvements in a cloud environment while ensuring no downtime and high performance for our e-commerce customers
    • Automated server configurations for web, mail, database, and other services
    • Configured and maintained SANs, firewalls, switches, and routers
    • Wrote internal applications such as code deployment services

    Accomplishments

    • Configured and replaced the firewall in two datacenters, one on site and one with remote hands, ensuring no downtime for our customers, which allowed higher performance for visitors
    • Planned and implemented a network recofiguration to meat PCI compliance standards, which included separating production, testing, and development networks
    • Technologies used: Linux, Bash, Ansible, Git, Apache, NGINX, VMWare, SonicWall, EqualLogic, Force10, MySQL, Windows, IIS, Active Directory, PHP, PowerShell
  • Senior System Administrator - Custom Computer | Ephrata, PA (On-site)
    April 2014 - October 2014
  • System Engineer - Pequea Valley School District | Kinzers, PA (On-site)
    August 2011 - April 2014
  • Technology Support Specialist - Penn Manor School District | Lancaster, PA (On-site)
    May 2008 - August 2011
  • Onsite Hardware Repair Technician - MacOutfitters | Doylestown, PA (On-site)
    August 2005 - May 2008
  • Student Desktop Technician - Millersville University | Millersville, PA (On-site)
    August 2000 - August 2005
  • å

Professional Organzations

  • League of Professional System Administrators
    2021 - present

    Treasurer

    2015 - present

    Member

Education

  • Millersville University
    2000 - 2005

    B.S in Computer Science and B.A. in Government and Political Affairs

  • Harrisburg University
    2009 - 2013

    M.S in Learning Technologies

Detailed Skills

  • People Management

    Managing, leading, and leveling-up mangers, leads, and individual contributors, including both high and low performers.

    Managing, leading, and leveling-up teams and departments, especially poor performing ones

  • Project Management

    Spearheading, planning, and implementing, large and small cross-functional technology projects.

  • Server Management

    Setting up, installing, configuring, and troubleshooting servers.

    Technologies used:

    Linux (Ubuntu, Debian, Red Hat, Rocky, SUSE), Macs, Windows
    Nginx, Apache, MySQL, Redis, Solr, Memcache, Elasticsearch, PHP, Git
    Ansible, Bash, Python, Objective-C
    Magento, Laravel
    Kubernetes, VMWare vSphere with HA and SRM, Azure, AWS
    Active Directory, Exchange, MSSQL, IIS
  • Network Management

    Setting up, installing, configuring, and troubleshooting networking and SANs

    Technologies used:

    Cisco, Brocade, Aruba, Meru, Ruckus, Pure
    IPv4, IPv6, iSCSI, VLANs, layer 3 routing, BGP, DNS
  • Client Management

    Setting up, installing, configuring, and troubleshooting clients including Mac, Windows, Linux, iPads.

  • Customer Service

    Communicating, assisting, and instructing customers with a positive and productive attitude.

Contact Me

Go to next/previous page

Contact info

I'm always open for discussion about any topic. Use the below information to get ahold of me. Email is best.

Send me a message

Thanks for sending your message! We'll get back to you shortly.

There was a problem sending your message. Please try again.

Please complete all the fields in the form before sending.