Infrastructure Engineer & Cloud Architect
Specializing in Azure, AWS, High Availability Systems & Enterprise Infrastructure
San Antonio, TX β’ Open to Remote & Relocation
Building a unified monitoring platform following company mergers and acquisitions. The platform consolidates infrastructure, cloud, security, and FinOps observability into a single pane of glass β serving internal IT teams, stakeholders, leadership, and public-facing dashboards.
This is a complex initiative requiring standardization across multiple legacy systems: unifying naming conventions, tagging strategies, alerting thresholds, Customer Support integrations, and executive reporting. The goal is consistent observability and actionable insights across all acquired entities.
I'm an infrastructure engineer with 30 years of hands-on experience building and maintaining production systems at scale. My background spans the full stack of infrastructureβfrom racking servers in data centers to architecting multi-region cloud deployments on Azure and AWS.
Currently at Comply365, I manage the cloud infrastructure that powers operational software solutions for airlines, defense forces, and rail organizations worldwide. This includes managing hundreds of cloud servers, containers, and storage systems handling tens of millions of requests monthly from customers using DocuNet (operational content management), SafetyNet (safety management), and our other SaaS platforms.
My responsibilities include Azure App Services configuration, implementing Akamai WAF for application security, migrating monitoring from New Relic to Datadog, and managing our annual ISO 27001 compliance audits.
I'm actively leveraging AI tools (Claude for documentation/web development, Gemini CLI and CodeX for infrastructure automation) to enhance operations β from monitoring deployments and Azure/AWS configurations to unifying documentation in Confluence and improving Jenkins automation workflows. I maintain detailed .md files (like GEMINI.md) that serve as both context for AI tools and living documentation of infrastructure changes, ensuring continuity across sessions and providing a comprehensive audit trail.
Senior infrastructure engineering role managing cloud infrastructure across Azure, AWS, and Akamai CDN/WAF for high-availability SaaS applications serving airlines, defense forces, and rail organizations. Focus on site reliability engineering (SRE), application performance management (APM), and real-time monitoring with Datadog.
Served as the customer's trusted Tableau advisor to Strategic Premium Support customers, acting as primary technical point of contact coordinating with Product Management, Sales, Technical Support, and Engineering.
Member of the new technical onboarding team in Customer Success, building processes from the ground up to establish and deliver value to customers faster. Worked cross-functionally to align onboarding initiatives across Professional Services, Product Management, and the training department.
Technical advisor engaging with customers to identify technical needs, roadblocks, and IT/business requirements for both public and private clouds. Worked with support operations, account teams, and product teams to find solutions best suited for customer requirements, while also training non-technical sales roles on products and solutions.
Managed customer base of over $1,000,000/month in account revenue, providing partnership and acting as an extension of their business. Worked with the most complex configurations in a 79,000+ server environment.
Deployed, supported, and maintained servers and infrastructure for customers ranging from medium-sized businesses to global enterprise operations hosted at Rackspace data centers worldwide. Provided support to critical systems configured for optimal uptime using high availability and disaster recovery technologies on both software and hardware levels. Worked independently or collaboratively to resolve alerts and customer tickets while communicating technical issues to both technical and non-technical audiences.
Led customer onboarding and equipment deployment coordination, working with the team to develop the new master ticket process that became standard policy for all deployments company-wide. Gathered client requirements, ensured deployments met their needs, and resolved questions and issues throughout the deployment lifecycle.
Led datacenter operations team through a critical period, successfully migrating all servers from SAT datacenters to DFW for the SAT Datacenter closures. Managed technicians, ensured adequate shift coverage, conducted performance reviews, and motivated peers to increase their technical and business skills. Promoted to oversee both SAT1 and SAT2 facilities simultaneously.
Built, quality-controlled, and racked new and existing customer managed hosting solutions in a fast-paced datacenter environment. Handled server and network device upgrades and maintenances while collaborating with fellow Rackers to resolve issues and bring customers online quickly and professionally.
Built a strong technical foundation through diverse roles in systems administration, technical support, and IT management across multiple industries.
Production-grade homelab running 30+ containerized services across multiple Docker hosts. Features Traefik reverse proxy, AdGuard Home for DNS filtering, comprehensive monitoring with Uptime Kuma, media automation stack (Sonarr, Radarr, Prowlarr), and self-hosted productivity tools.
Custom React-based web applications for comparing Bicep configurations between Production and DR environments. Implements "presence over parity" logic to identify critical missing configurations that could cause DR failures. Helps ensure our disaster recovery infrastructure stays in sync. Built with Claude for Bicep analysis logic, React component design, and UI implementation.
Comprehensive container management interface with RSS feeds, quick commands, and service categorization. Provides at-a-glance status of all homelab services with one-click access to common operations. Built with Claude for UI/UX design and responsive layout implementation.
Open to discussing infrastructure architecture, cloud migrations, or engineering opportunities.
Connect on LinkedIn