← Back to jobs
Milpitas, CA, USA
No related jobs found
Key Responsibilities
Troubleshoot GPU/CPU servers, compute clusters, and networking (InfiniBand)
Diagnose hardware issues (cabling, components, GPUs, servers)
Rack/stack initially limited (systems already built), but may increase if extended
Replace/install server components within racks
Use Linux command line extensively for diagnostics and system validation
Manage lab space and hardware inventory (re-procurement access provided)
Must-Have Skills (Non-Negotiable)
Strong hardware troubleshooting experience (servers, GPUs, compute systems)
Solid understanding of computer/compute architecture
Strong Linux skills for system bring-up and troubleshooting
Experience with GPUs and high-performance compute environments
Ability to independently diagnose and resolve hardware/system issues
Preferred / Nice-to-Have
Prior data center or HPC/compute cluster experience (plus, not mandatory)
Scripting experience (Bash, Python) – expected if candidate has done similar roles
Familiarity with GPU technologies (cutting-edge R&D GPUs; Tesla, etc.)
Candidates who’ve built systems themselves (gaming PCs, lab servers, small data centers)
Experience & Education
Minimum: 3–4 years of relevant experience (not pure sysadmin only)
Bachelor’s degree preferred, but experience matters more than degree
Bachelor's degree
No related jobs found
← Back to jobs