We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results

AI System Research and Development Engineer - Frameworks

Snowflake
$195,000 - $287,500
parental leave, paid time off, paid holidays, 401(k), retirement plan
United States, California, Menlo Park
Mar 29, 2025

Build the future of the AI Data Cloud. Join the Snowflake team.

We are looking for talented System Developers and Researchers to join the Snowflake AI Research team and contribute to LLM inference and training system development, optimizations, and agentic systems. Our mission is to build the most efficient and scalable generative AI systems.

Recent releases from our team include SwiftKV, an advanced inference optimization, and Arctic LLM, one of the largest open-source MoE foundation models. This is an exciting opportunity to collaborate with a world-class team, including founding members of DeepSpeed, vLLM, and TensorFlow. Together, we will push the boundaries of deep learning systems and drive cutting-edge innovations in AI.

Responsibilities:
  • Solve large-scale challenges in data preprocessing, model training, and model evaluation.

  • Develop and deploy state of the art tooling and open-source technologies to enhance the efficiency and effectiveness of AI solutions.

  • Apply advanced optimization techniques to reduce resource requirements while maintaining model performance and ensuring usability for researchers, developers and customers.

  • Stay updated with the latest advancements in LLM training and inference optimizations.

  • Open-source and publish innovations, optimizations, and engineering practices in technical blogs, top-tier conferences and journals.

Requirements:
  • 5 or more years of experience in deep learning frameworks, distributed systems, or high-performance computing (HPC).

  • Bachelor's degree in Computer Science, Electrical Engineering, or a related field. A Master's degree or PhD is preferred.

  • Expertise in distributed training frameworks (e.g., DeepSpeed, PyTorch DDP, FSDP, Megatron-LM).

  • Strong understanding of modern parallelism techniques such as data, tensor, sequence, ZeRO-based parallelism.

  • Programming language proficiency in Python and C++ or CUDA.

  • Solid problem-solving skills and ability to debug complex performance issues.

  • Excellent communication skills and ability to work effectively in a cross-functional team environment.

Join us in optimizing deep learning systems and pushing the boundaries of AI efficiency. Apply now to be part of our dynamic and pioneering team!

Snowflake is growing fast, and we're scaling our team to help enable and accelerate our growth. We are looking for people who share our values, challenge ordinary thinking, and push the pace of innovation while building a future for themselves and Snowflake.

How do you want to make your impact?

For jobs located in the United States, please visit the job posting on the Snowflake Careers Site for salary and benefits information: careers.snowflake.com

The following represents the expected range of compensation for this role:

  • The estimated base salary range for this role is $195,000 - $287,500.
  • Additionally, this role is eligible to participate in Snowflake's bonus and equity plan.

The successful candidate's starting salary will be determined based on permissible, non-discriminatory factors such as skills, experience, and geographic location. This role is also eligible for a competitive benefits package that includes: medical, dental, vision, life, and disability insurance; 401(k) retirement plan; flexible spending & health savings account; at least 12 paid holidays; paid time off; parental leave; employee assistance program; and other company benefits.

Applied = 0

(web-6468d597d4-98p82)