Summary
Computer Engineer with hands-on experience in DevOps, Platform Engineering, Site Reliability Engineering (SRE), and Software Engineering, focused on building and operating cloud infrastructure, cloud-native platforms, distributed systems, and production environments. Strong background with AWS, Kubernetes, Infrastructure as Code (IaC), CI/CD automation, observability, Linux systems, containers, and cloud operations.
Experienced in designing scalable infrastructure, automating deployments, improving reliability, supporting production systems, and integrating complex platforms involving identity management, APIs, microservices, databases, message brokers, and high-volume workloads. Combines infrastructure ownership with software engineering skills to bridge application, platform, and operations teams.
Core Specializations
Cloud Infrastructure • Platform Engineering • DevOps • Site Reliability Engineering (SRE) • Infrastructure as Code (IaC) • CI/CD Automation • Kubernetes & Container Orchestration • Observability • Linux Systems Administration • Production Operations • Distributed Systems • High-Performance Computing (HPC) • Identity & Access Management • Incident Response • Cloud Cost Optimization
Technical Skills
- Cloud & Infrastructure: AWS, EC2, ECS, EKS, Lambda, RDS, DynamoDB, S3, CloudFront, Load Balancers, IAM, VPC, Route 53, CloudFormation, Terraform, AWS CDK, eksctl, Cloudflare
- Containers & Orchestration: Kubernetes, Docker, Docker Compose, Helm, Argo CD, MicroK8s, LXD, Istio, Kubernetes Operators, containerized workloads
- CI/CD & Automation: GitHub Actions, CI/CD pipelines, deployment automation, Infrastructure as Code (IaC), Ansible, Bash scripting, release workflows, multi-environment deployments
- Observability & Operations: Prometheus, Grafana, Loki, monitoring, alerting, centralized logging, incident response, disaster recovery, backups, post-mortems, production support, cloud cost optimization
- Systems & Networking: Linux, Nginx, reverse proxies, TLS/SSL, DNS, CDN, WAF, VPN, SSSD, LDAP, Active Directory, IAM, OAuth2, OIDC, SAML, Keycloak, Vault
- HPC & Data Platforms: Slurm, JupyterHub, Kubeflow, HPC clusters, batch workloads, distributed workloads, job scheduling, Kafka, RabbitMQ, VerneMQ, MQTT, gRPC, IPFS
- Programming & Development: Python, TypeScript, JavaScript, Node.js, NestJS, Next.js, React, REST APIs, GraphQL, microservices, backend APIs, frontend applications
- Databases & Storage: PostgreSQL, MySQL, MongoDB, MongoDB Atlas, MongoDB Time Series Collections, DynamoDB, Amazon RDS, S3, database performance tuning
Experience
DevOps Engineer — Vantage Compute
June 2022 – Present
- Contributed to the architecture and implementation of a cloud-native multi-tenant compute platform, supporting the evolution from a single-tenant Kubernetes model to a shared platform architecture with isolated organization-level databases.
- Helped design and implement Slurm-based HPC cluster provisioning on AWS using CloudFormation, EC2, Linux automation, and Infrastructure as Code, enabling dynamic compute provisioning for distributed and batch workloads.
- Led and supported key identity and access management integrations involving Keycloak, LDAP, Active Directory, SSSD, Vault, and Linux user mapping across platform and cluster environments.
- Implemented and supported JupyterHub integration with Slurm, allowing notebook sessions to run as scheduled Slurm workloads and consume cluster resources through the platform.
- Contributed to on-demand Kubernetes cluster environments using LXD, MicroK8s, Kubernetes Operators, Vault, Istio, and internal CLI workflows to support deployable application stacks such as Slurm, JupyterHub, Kubeflow, Grafana, remote desktops, and cloud shells.
- Managed CI/CD workflows across development, QA, staging, and production using GitHub Actions, Helm, containers, Argo CD, CloudFormation, eksctl, and early-stage Terraform migration.
- Operated and maintained production infrastructure for a major enterprise customer, focusing on reliability, observability, deployment integrity, maintenance, and cloud cost control.
- Built Slurm job templates, validation workflows, demos, and technical documentation to support platform testing, customer presentations, and internal operational knowledge sharing.
- Contributed to an internal SOS/support platform for managing and troubleshooting multi-tenant infrastructure and improving operational visibility.
DevOps Engineer — Pollum
Aug 2020 – June 2022
- Acted as the primary DevOps Engineer across multiple blockchain and Web3 product teams, owning cloud infrastructure, deployments, monitoring, incident response, backups, disaster recovery, and production reliability for several internal and third-party applications.
- Designed and operated globally distributed infrastructure for a Syscoin RPC service using AWS ECS, Load Balancers, CloudFront, Terraform, and custom Nginx-based proxy layers, supporting low-latency access and scaling to millions of requests per day at peak traffic.
- Implemented multi-region and multi-zone deployment strategies across global AWS regions, including Asia, to improve RPC availability, latency, and resilience for blockchain infrastructure workloads.
- Built and maintained custom security and traffic-control proxy mechanisms using Nginx, Lua scripts, and JavaScript, supporting contract and wallet allowlists/blocklists, traffic filtering, and access-control rules for RPC usage.
- Deployed and maintained several blockchain ecosystem applications, including Blockscout explorers, Graph Node with IPFS, Pegasys, Rollux Layer 2 infrastructure, Safe dApp components, Discourse forums, wikis, and supporting services.
- Managed complex application stacks using Ansible, Docker Compose, Nginx, PostgreSQL, and cloud infrastructure automation, supporting both in-house products and third-party infrastructure deployments.
- Operated infrastructure for Safe dApp-related services, including transaction services, client gateway services, configuration services, multiple databases, backend components, and frontend deployments.
- Supported the Luxy NFT marketplace infrastructure, including APIs, microservices, RabbitMQ, Kafka, MongoDB Atlas, AWS-hosted services, and a Next.js frontend deployed on Vercel.
- Participated in database performance analysis and tuning for large NFT collection queries in MongoDB, helping improve collection retrieval performance and application responsiveness under growing data volume.
- Managed infrastructure cost tracking, reporting, and optimization across multiple applications and customer-facing services, reducing overall infrastructure costs by at least 50% through architecture improvements and operational adjustments.
- Implemented and maintained observability, monitoring, alerting, backup routines, recovery procedures, incident response workflows, and post-mortem practices across production blockchain applications.
- Used Terraform extensively to provision and manage AWS infrastructure, improving deployment repeatability, infrastructure consistency, and operational control across multiple environments and applications.
- Managed Cloudflare DNS, security rules, access controls, caching, and traffic protection configurations for production Web3 applications and blockchain infrastructure, improving domain management, edge security, and operational resilience.
Independent Software & Platform Engineer — Process Engenharia
Apr 2019 - Sep 2021
- Designed, developed, and deployed an end-to-end industrial IoT supervisory platform for manufacturing environments, enabling real-time machine monitoring, operational visibility, and faster maintenance decision-making for electromechanical systems.
- Acted as the sole engineer responsible for the full product architecture, backend APIs, frontend application, microservices, infrastructure design, deployment strategy, and production implementation.
- Built the platform using Next.js, NestJS, TypeScript microservices, MongoDB, VerneMQ, Kafka, gRPC, Kubernetes, and Docker Compose, with a cloud-agnostic architecture designed to support both cloud and on-premise industrial deployments.
- Designed a flexible deployment model capable of running on any Kubernetes environment or local Docker Compose setups, addressing industrial customers with strict on-premise, low-cloud, or cloud-agnostic infrastructure requirements.
- Selected and implemented VerneMQ as the MQTT broker due to MQTT v5 support, shared subscriptions, Kubernetes scalability, and MIT licensing, enabling scalable message distribution across microservices processing factory-floor telemetry.
- Designed MQTT-based data ingestion pipelines for factory-floor sensors and machine data, using VerneMQ for device communication and Kafka/VerneMQ-based flows for notifications and internal event processing.
- Implemented gRPC-based communication for high-volume data flows, improving efficiency when transferring large telemetry payloads between backend services and processing components.
- Solved large-scale data ingestion and visualization challenges involving sensor readings with approximately 15,000 data points per collection cycle, optimizing backend processing, MongoDB storage, and frontend rendering performance.
- Used MongoDB time series collections to store high-volume industrial telemetry data, addressing challenges related to document size, query performance, and time-based data retrieval.
- Built a high-performance supervisory visualization interface using Konva.js and Canvas, enabling customizable industrial dashboards with standard widgets, SVG-based components, and custom JavaScript-driven behavior.
- Optimized frontend charting and visualization workflows to prevent UI freezes and improve responsiveness when rendering large industrial datasets and real-time machine telemetry.
- Designed the MVP to scale from minimal infrastructure to hundreds of sensors sending telemetry data in test scenarios, validating the platform’s ability to support industrial monitoring workloads.
- Added AI-based data processing pipelines to extract actionable insights, operational signals, and maintenance-oriented recommendations from factory telemetry data.
Freelance Software Engineer — PDA Soluções
Oct 2019 – Aug 2020
- Developed lightweight Android applications and API integrations for logistics operations, building reliable tools used by field teams to streamline workflows and improve operational productivity.
- Designed mobile-first workflows focused on performance, usability, and low-friction execution in real operational environments, contributing to productivity improvements of approximately 60% or more.
- Integrated applications with logistics systems and back-office APIs, supporting data synchronization, operational visibility, and process automation.
- Worked independently across the full software delivery lifecycle, from requirements gathering and solution design to development, testing, deployment support, and iterative improvements.
Full-Stack Software Engineering Intern — MedCloud
Nov 2019 – Aug 2020
- Developed full-stack features across frontend and backend systems using AWS serverless technologies, including Lambda functions, MySQL on Amazon RDS, DynamoDB, S3, and AWS Amplify.
- Contributed to backend services and cloud-based application components supporting healthcare-oriented workflows and internal business processes.
- Participated in the design and implementation of an extensible integration layer for Brazilian municipal electronic invoice systems, addressing variations across city-level tax authority implementations.
- Helped build a flexible invoice integration architecture capable of supporting new municipalities on demand with reduced implementation effort, despite differences in optional fields, schemas, and local requirements.
- Worked across frontend development, backend APIs, cloud services, data persistence, and third-party system integrations in an AWS-based serverless environment.
Selected Projects
Cloud-Native HPC & Multi-Tenant Compute Platform
Contributed to the architecture and implementation of a cloud-native compute platform designed to support HPC workloads, on-demand Kubernetes environments, and multi-tenant infrastructure. Helped migrate the platform from an initial single-tenant Kubernetes model, where each organization required its own namespace, APIs, databases, and Keycloak instance, to a shared multi-tenant architecture with isolated organization-level databases. Worked across Slurm-based AWS clusters, dynamic compute provisioning, JupyterHub with Slurm spawner, Keycloak organization support, LDAP/Active Directory integration, SSSD user mapping, Vault, Istio service routing, internal CLI tooling, MicroK8s, LXD, GitHub Actions, Helm, Argo CD, CloudFormation, eksctl, and Terraform migration initiatives. Core technologies: AWS, Kubernetes, Slurm, JupyterHub, Keycloak, LDAP, Active Directory, SSSD, Vault, Istio, GitHub Actions, Helm, Argo CD, CloudFormation, Terraform.
Globally Distributed Blockchain RPC Infrastructure
Designed and operated a globally distributed Syscoin RPC service infrastructure focused on scalability, low latency, high availability, and production reliability. The platform used AWS ECS, Load Balancers, CloudFront, Terraform, and custom Nginx-based proxy layers to support multi-region deployments and traffic scaling to millions of requests per day at peak usage. Built traffic-control and security mechanisms using Nginx, Lua, and JavaScript to support wallet and contract allowlists/blocklists, request filtering, and controlled RPC access. Also maintained related blockchain ecosystem applications such as Blockscout, Graph Node with IPFS, Rollux Layer 2 services, Safe dApp components, Discourse, wikis, and supporting infrastructure. Core technologies: AWS ECS, CloudFront, Load Balancers, Terraform, Nginx, Lua, JavaScript, Docker Compose, Ansible, PostgreSQL, IPFS, Blockchain RPC, Monitoring.
Industrial IoT Supervisory Platform
Architected and developed a cloud-agnostic industrial IoT supervisory platform for real-time factory-floor monitoring, machine telemetry, operational visibility, and maintenance decision support. The platform was designed to run both in Kubernetes-based environments and local Docker Compose deployments for industrial customers with on-premise requirements. Implemented MQTT-based telemetry ingestion with VerneMQ, leveraging MQTT v5 shared subscriptions for scalable message distribution across processing microservices. Built high-volume data processing flows using gRPC, Kafka, VerneMQ, and MongoDB time series collections, including support for sensor readings with approximately 15,000 data points per collection cycle. Developed real-time supervisory dashboards using Konva.js, Canvas, SVG-based widgets, and custom JavaScript logic, optimizing frontend rendering to handle large telemetry datasets without freezing the user interface. Later added AI-based data processing pipelines to extract actionable insights and maintenance-oriented signals from factory data. Core technologies: Next.js, NestJS, TypeScript, VerneMQ, MQTT v5, Kafka, gRPC, MongoDB Time Series, Kubernetes, Docker Compose, Konva.js, Canvas, AI pipelines.
Personal Projects
- Homelab Platform Manager: Designing a self-hosted platform management system for tracking services, deployment targets, health checks, domains, TLS status, infrastructure metadata, and operational incidents across Docker, virtual machines, and Kubernetes environments. Focused on platform engineering concepts such as service catalog management, automation, observability, operational workflows, and infrastructure lifecycle visibility.
- Self-Hosted Infrastructure Lab: Maintaining a personal infrastructure environment for experimenting with Linux administration, containers, reverse proxies, VPN access, TLS automation, monitoring, backups, and deployment workflows. Used to validate production-like patterns for networking, observability, automation, and service operations.
- Local AI Infrastructure Lab: Building and testing local AI workflows with ComfyUI, Ollama, local LLMs, GPU-based inference, and image generation pipelines. Focused on offline AI tooling, dependency management, model execution, workflow automation, and infrastructure constraints for running AI workloads locally.
Education
Bachelor’s Degree in Computer Engineering — Ponta Grossa State University
January 2017 – June 2022
- Developed applied AI, Machine Learning, Computer Vision, image processing, and data analysis projects across agriculture and healthcare research contexts.
- Worked on agricultural technology projects involving field imaging, image-derived feature extraction, wheat productivity monitoring, productivity prediction, and tree counting.
- Built image classification workflows for healthcare research, including cervical cancer cell classification using machine learning techniques.
Leadership & Volunteering
Co-Founder / Developer — Junior Enterprise
- Helped found and operate a university junior enterprise, contributing to client acquisition, software development, project delivery, and internal team development.
- Trained new members in development practices, delivery workflows, and client-oriented project execution.
Finance Coordinator — Computer Engineering Student Association (2019)
- Supported fiscal and administrative responsibilities for the student association, contributing to financial organization, accountability, and internal operations.