Senior Research Engineer - Training Efficiency

Company: Luma AI
Location: Palo Alto
Posted on: March 15, 2025

Job Description:

Luma's mission is to build multimodal AI to expand human imagination and capabilities. We believe that multimodality is critical for intelligence. To go beyond language models and build more aware, capable and useful systems, the next step function change will come from vision. So, we are working on training and scaling up multimodal foundation models for systems that can see and understand, show and explain, and eventually interact with our world to effect change. We are looking for engineers with significant experience solving hard problems in PyTorch, CUDA and distributed systems. You will work alongside the rest of the research team to build & train cutting edge foundation models on thousands of GPUs that are built to scale from the ground up.Responsibilities

Ensure efficient implementation of models & systems with a focus on large-scale training.
Identify and implement optimization techniques for massively parallel and distributed systems, including the underlying communication layer.
Identify and remedy efficiency bottlenecks (memory, speed, utilization, communication) by profiling and implementing high-performance PyTorch code, deferring to Triton, CUDA, and lower levels as necessary.
Work closely together with the rest of the research team to ensure systems are planned to be as efficient as possible from start to finish.
Conduct research & experiments on state-of-the-art large-scale generative AI models with the goal to improve latency & throughput for training and inference.Must have experience
- Experience training large models using Python & Pytorch, including practical experience working with the full development pipeline from data processing, preparation & dataloading to training and inference.
- Experience profiling GPU & CPU code in Pytorch for optimal device utilization (examples: torch profiler, NVIDIA Nsight systems/compute, memory profilers, trace viewers, custom tooling).
- Experience writing & improving highly parallel & distributed Pytorch code of large generative models, with familiarity in FSDP, Tensor Parallel, Sequence/Context Parallel, Pipeline Parallel etc.
- Experience working with transformer models and attention implementations.Good to have experience
  - Experience with high-performance Triton/CUDA and writing custom PyTorch kernels and ops. Top candidates will be able to write fused kernels for common hot paths, understand when to make use of lower level features like tensor cores or warp intrinsics, and will understand where these tools can be most impactful.
  - Experience writing high-performance parallel C++. Bonus if done within an ML context with Pytorch, like for data loading, data processing, inference code.
  - Experience building inference / demo prototype code (incl. Gradio, Docker etc.).
    #J-18808-Ljbffr

Keywords: Luma AI, Palo Alto , Senior Research Engineer - Training Efficiency, Engineering , Palo Alto, California

Click here to apply!

Didn't find what you're looking for? Search again!

Let Palo Alto recruiters find you. Post your resume for free!

Get Palo Alto Engineering jobs via email.

View more Palo Alto Engineering jobs

Other Engineering Jobs

Travel Cath Lab Technologist - $3,500 per week
Description: Aureus Medical Group - Imaging is seeking a travel Cath Lab Technologist for a travel job in Palo Alto, California.Job Description Requirements ul li Specialty: Cath Lab Technologist li Discipline: (more...)
Company: Aureus Medical Group - Imaging
Location: Palo Alto
Posted on: 03/5/2025

Motion Planning Engineer, Optimus
Description: Tesla is on a path to build humanoid robots at scale to automate repetitive and boring tasks. Core to the Optimus, the motion planning stack presents a unique opportunity to work on state-of-the-art motion (more...)
Company: Tesla, Inc.
Location: Palo Alto
Posted on: 03/5/2025

Travel Interventional Radiology Technologist - $2,424 per week
Description: Medical Solutions is seeking a travel Interventional Radiology Technologist for a travel job in Palo Alto, California.Job Description Requirements ul li Specialty: Interventional Radiology Technologist (more...)
Company: Medical Solutions
Location: Palo Alto
Posted on: 03/5/2025

Salary in Palo Alto, California Area | More details for Palo Alto, California Jobs |Salary

Travel Nuclear Medicine Technologist - $3,440 per week
Description: Aureus Medical Group - Imaging is seeking a travel Nuclear Medicine Technologist for a travel job in Palo Alto, California.Job Description Requirements ul li Specialty: Nuclear Medicine Technologist (more...)
Company: Aureus Medical Group - Imaging
Location: Palo Alto
Posted on: 03/5/2025

Registered Nurse (RN) - Case Manager, Oncology - $73-96 per hour
Description: Stanford Health Care is seeking a Registered Nurse RN Case Manager, Oncology for a nursing job in Palo Alto, California. Job Description Requirements ul li Specialty: Oncology li Discipline: (more...)
Company: Stanford Health Care
Location: Palo Alto
Posted on: 03/5/2025

Sr Analyst, Commercial IT Systems (Palo Alto, CA)
Description: Title: Sr Analyst, Commercial IT Systems Palo Alto , - CA br Terms of Hire: Full Time. br Salary: 95,000- 120,000/ YR Benefits. br - Job description: br Responsibilities br br Client (more...)
Company: Cedent Consulting Inc
Location: Palo Alto
Posted on: 03/5/2025

Travel Ultrasound Technologist - $2,430 per week
Description: Job DescriptionAnders Group is seeking a travel Ultrasound Technologist for a travel job in Palo Alto, California.Job Description Requirements ul li Specialty: Ultrasound Technologist li Discipline: (more...)
Company: Anders Group
Location: Palo Alto
Posted on: 03/5/2025

Stanford Health Care Head and Neck Medical Oncology(NP)
Description: Job Description Requirements br Stanford Health Care Head and Neck Medical Oncology APP NP br StartDate: ASAP Pay Rate: 199000.00 - 225000.00 br br Stanford Health Care is looking for two (more...)
Company: AMN Healthcare
Location: Palo Alto
Posted on: 03/5/2025

Accounts Payable Manager
Description: BitGo is the leading infrastructure provider of digital asset solutions, offering custody, wallets, staking, trading, financing and settlement out of regulated cold storage. Founded in 2013, BitGo is (more...)
Company: NEAR
Location: Palo Alto
Posted on: 03/5/2025

Travel Nurse RN - PACU - Post Anesthesia Care - $3,188 per week
Description: Aureus Medical Group - Nursing is seeking a travel nurse RN PACU - Post Anesthesia Care for a travel nursing job in Palo Alto, California. Job Description Requirements ul li Specialty: PACU - Post (more...)
Company: Aureus Medical Group - Nursing
Location: Palo Alto
Posted on: 03/5/2025

Loading more jobs...

Senior Research Engineer - Training Efficiency

Didn't find what you're looking for? Search again!

Other Engineering Jobs

Log In or Create An Account