Senior GPU Security Validation Lead
Advanced Micro Devices
- Pulau Pinang
- Permanent
- Full-time
- Own Datacenter GPU SoC post-silicon security and firmware validation, spanning silicon, VBIOS, system firmware, drivers, and OS layers
- Drive pre-silicon validation using emulation and simulation environments and transition coverage to post-silicon platforms
- Develop and execute feature enablement and validation test plans for SoC- and system-level security, virtualization, RAS, and fuse features
- Eagerness and ability to quickly learn new concepts
- Lead post-silicon debug efforts, performing system-level root cause analysis across HW/FW/SW boundaries
- Build and maintain validation infrastructure, including software tools, automation, scripts, and lab setups
- Validate interactions between multiple GPU SoC features and subsystems in complex datacenter configurations
- Collaborate with silicon, firmware, driver, platform, and PSO teams to improve validation strategy, methodology, and coverage
- Drive technical innovation in security and RAS validation through tools, scripts, and process improvements
- Support customer platforms and engagements in collaboration with customer support and program teams
- Provide clear execution status, risk assessment, and issue updates to program management
- Mentor junior engineers and lead technical initiatives within the validation team
- 7+ years of experience in SoC validation, silicon bring-up, or system-level debug
- Strong background in post-silicon validation and debug methodologies at SoC and system level
- Experience with security IP design/validation, fuse programming, and security feature enablement
- Strong programming and scripting skills (C/C++, Python; Perl a plus)
- Solid understanding of computer hardware architecture (GPU/CPU, x86, PCIe, memory, interconnects)
- Working knowledge of firmware, BIOS, drivers, and OS interactions
- Experience with Linux and/or Windows Server environments
- Hands-on experience with lab equipment (logic analyzers, protocol analyzers, oscilloscopes, etc.)
- Strong communication skills and ability to operate effectively in cross-functional, cross-site teams
- Demonstrated experience in platform security, device security microarchitecture, or security-oriented definition of CPUs, SoCs, or other hardware
- Strong understanding of and experience with CPU confidential computing technologies (e.g., AMD SEV, Intel TDX, Arm CCA) and trusted execution environments (TEEs)
- Familiarity with confidential computing concepts such as remote attestation and sealing
- Expertise in operating systems and virtualization technologies, especially Linux kernel- and driver-level development, hypervisors and virtual machines, and containers; experience with SaaS, PaaS, IaaS, and container systems such as Kubernetes is a plus
- Familiarity with typical software/hardware interfaces and driver techniques
- Experience designing and/or implementing secure systems, including concepts such as threat modeling, security architecture, protocol design, network security, and operating system security
- Familiarity with and experience in datacenter interconnect standards and their underlying technologies and protocols
- Experience with confidential computing and isolation technologies (e.g., GPU/CPU secure modes, TEEs, attestation)
- Knowledge of GPU and PCIe virtualization technologies (SR-IOV, SIOV, vGPU, MIG)
- Familiarity with RAS features, error injection, and reliability validation
- Experience with security threat modeling, vulnerability analysis, or SDL practices
- Understanding of cryptographic primitives and secure boot / root of trust concepts
- Exposure to PCIe and CXL standards and associated security considerations
- Experience supporting customer platforms and field issue debug
- Prior experience mentoring engineers or leading technical initiatives
- Bachelor’s or Master’s degree in Electrical Engineering, Computer Engineering, Computer Science, or a related field