AI Inference Researcher

Job ID: 8276
Job Type: Direct Hire
Salary Range: $150K - $175K
Arlington, Virginia
Posted:

To Apply for this Job Click Here

Position: AI Inference Researcher

Location– DMV preferred but open to remote

Our client is looking for a performance-obsessed AI researcher to join their growing team.

Your mission is to turn whitepapers, profiling traces, and raw intuition into working optimizations that make real workloads faster. You’ll work across compiler frameworks, GPU kernels, graph IRs, memory strategies, and quantization techniques — with the freedom to test hypotheses quickly and the responsibility to ship production-worthy gains.

This is a research role with an emphasis on delivering production-quality code. You’ll sit at the boundary between compiler experimentation and systems engineering, helping make Ignite one of the fastest inference engines in the world.

Responsibilities-

– Profile and optimize transformer-based inference pipelines

– Research and experiment with graph-level and kernel-level optimizations

– Design experiments, measure speedups, and continuously shave off latency across varying batch sizes, sequence lengths, and architectures

– Collaborate closely with the engineering teams to ensure results translate into real-world improvements

– Stay ahead of the optimization literature — and bring in ideas before they’re stale

Qualifications-

– Strong programming experience in C++ and Cuda

– Hands-on experience with performance profiling and tuning

– Existing understanding or interest in learning GPU architecture

– Comfort working with compiler toolchains and intermediate representations

– Curiosity and intensity — you enjoy tuning workloads at breakneck speed, not just shipping the baseline

Nice to have: experience with quantization-aware training, kernel autotuning, or specialized LLM serving runtimes

 

What The Client Offers-

– Competitive salary with meaningful equity

– A chance to help define a new category of AI infrastructure

– Greenfield architecture — build the product you’ve always wanted to use

– High trust and autonomy, with deep impact on platform direction

– Remote-first culture with the option to collaborate in person as we scale

 

To Apply for this Job Click Here

Share This Job

Refer A Candidate

Recommend a candidate and receive a referral bonus as a thank-you for helping us find top talent.

Upload Your Resume

Share your resume, and we’ll match you with opportunities that fit your skills and goals.

Related Jobs