AI Chip Architect - Reliability Focus

International Full-time RMB1,300,000 - RMB1,800,000 per year
  • Key position in a pre-IPO semiconductor company
  • Strong investment from the company to drive R&D

About Our Client

Our client is a fast-growing AI chip company that plans to go public in early 2025.

Job Description

1. Design, develop, and optimize the reliability architecture of our AI supercomputing systems to meet high-availability and performance objectives.
2. Collaborate with multi-disciplinary teams to understand system requirements and devise strategies to meet these goals.
3. Drive root cause analysis of reliability issues and devise plans to improve system robustness and uptime.
4. Develop reliability prediction models and carry out regular system risk assessments.
5. Analyze failure modes, predict future failures, and develop strategies to minimize downtime.
6. Support the creation and execution of test strategies to verify system performance and reliability.
7. Stay abreast of advancements in AI, supercomputing, and reliability engineering to help inform future system design.

The Successful Applicant

1. A master's degree in Computer Engineering, Electrical Engineering, or a related field. A higher degree would be a plus.
2. Proven experience in system architecture, with a focus on AI supercomputing and reliability engineering.
3. Strong knowledge of GPU architecture, high performance computing (HPC), and deep learning applications.
4. Familiarity with hardware testing, fault detection, and fault-tolerant systems.
5. Strong analytical and problem-solving skills.
6. Excellent communication skills to articulate complex technical issues to diverse teams.

What's on Offer

Opportunity to make a big impact on the AI chip market

Contact
Marcus Zhu
Job reference
JN-082023-6138668
Phone number
+86 21 6035 3505

Job Summary

Job function
Engineering & Manufacturing
Sub-sector
IC Design / Semiconductor
Industry
Semiconductor
Location
International
Job type
Full-time

Diversity & Inclusion at Michael Page

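At Michael Page, we do not just accept difference, we take pride in it. We encourage applicants from all backgrounds to apply for this role, and we are committed to building an inclusive, diverse workplace where every employee can be themselves and reach their full potential. If you need any support or reasonable adjustments during the recruitment process, please let us know.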