Datacenter Infrastructure Engineer

🌍 Remote, USA 🚀 Full-time 🕐 Posted Recently

Job Description

Job Title: Datacenter Infrastructure Engineer

Duration: 6 Months (Possibility of Extension)

Location Preference: Dallas, TX area (Preferred)

Overview

We are looking for an Infrastructure Development Engineer to design, operate, and scale foundational datacenter services that power bare-metal, virtualization, and cloud-adjacent platforms.

This role owns the automation to boot and manage critical services such as corporate IPAM/DDI, CMDB, and datacenter bootstrapping systems.

The engineer will work across hardware, networking, and platform teams to ensure infrastructure is discoverable, automated, reliable, and ready for self-service consumption.

Key Responsibilities

    Automation & Development
  • Build automation and tools using Python
  • Develop Python-based tools and services for provisioning, configuration, monitoring, and self-service workflows
  • Automate operational tasks such as imaging, deployments, health checks, and remediation
  • Integrate internal and external APIs to orchestrate infrastructure workflows across compute, storage, network, and cloud
  • Developers with experience in C++ or Java will also be considered
    Software Defined Network Services (IPAM, DDI & CMDB)
  • Own and operate corporate IP Address Management (IPAM) and DDI (DNS, DHCP, IPAM) platforms
  • Design scalable IP allocation, DNS, and DHCP strategies across datacenters
  • Integrate IPAM/DDI systems with provisioning, bootstrapping, and CMDB workflows
  • Act as a steward of the CMDB ensuring accuracy and automation-driven updates
  • Define standards for asset discovery, lifecycle state, and dependency mapping
    Monitoring, Observability & Reliability
  • Implement monitoring, alerting, and dashboards for infrastructure health using tools such as Prometheus, Grafana, ELK, Nagios
  • Track key metrics such as availability, latency, capacity, and error rates
  • Participate in incident response and root cause analysis
  • Implement long-term fixes and operational runbooks

Required Skills & Experience

    Core Technical Skills
  • Experience with bare-metal provisioning and hypervisor deployment
  • Hands-on experience with OpenStack, VMware, KubeVirt, or similar virtualization platforms
  • Deep understanding of IPAM, DNS, and DHCP
  • Experience with CMDB systems
  • Knowledge of datacenter networking concepts including Fibre Channel
  • Proficiency with Linux systems and troubleshooting
    Automation & Systems Thinking
  • Experience building infrastructure automation and onboarding pipelines
  • Familiarity with API-driven integrations and workflow orchestration
  • Ability to understand infrastructure as a platform
    Collaboration & Ownership
  • Experience working with hardware, network, storage, and SRE teams
  • Strong operational mindset focused on reliability and supportability
  • Ability to solve complex problems with automated solutions
    Nice to Have
  • Experience with large-scale internal platforms or infrastructure as a product
  • Background in Site Reliability Engineering (SRE)
  • Experience with self-service infrastructure platforms
  • Experience in multi-datacenter or hybrid environments
    Server Bootstrapping & Provisioning Automation
  • Experience with datacenter bootstrapping services including PXE, imaging, and OS/hypervisor provisioning
  • Ensure seamless transition from hardware arrival to production-ready infrastructure
  • Improve time-to-serve metrics for new racks, clusters, and testbeds

Apply tot his job

Apply To this Job

Ready to Apply?

Don't miss out on this amazing opportunity!

🚀 Apply Now

Similar Jobs

Recent Jobs

You May Also Like