AI Applications Runtime Lead (Senior Software Engineer)

Enchargeai in Santa Clara, CA or Princeton, NJ

Role/Responsibilities:

EnCharge is looking for a senior software engineer (and technical lead) who can help us build out Tensor Virtual Machine (TVM) based end-to-end software runtimes for the next generation of Edge AI Hardware. You must have a track record of building runtimes & compiler stacks for AI Inference hardware, experience with TVM, as well as strong cross-functional contributions to AI architectures & software stacks.

Enabling TVM support for EnCharge AI hardware, compiler, neural network operations (ops), compiler optimizations and runtime APIs.
Working closely with the compiler teams to define the APIs and graph exchange formats needed for TVM to connect with the EnCharge Compiler stack (to build optimized binaries).
Optimizing end-to-end performance of the TVM runtime on EnCharge hardware platforms.
Enabling TVM backends for MLPerf benchmarking.
Mentor / lead junior engineers across the company.

Qualifications/Required Skills:

Masters/Ph.D. in EE/CS with >5 years of experience in AI applications, compilers & hardware.
Proficiency in C++ & Python.
Deep experience with modifying the TVM code base to support new hardware backends.
At least 3-5 years of experience with Tensorflow, PyTorch.
>2 years of experience with Deep Learning compilers and strong experience with AI compiler optimizations.
Solid understanding with state-of-the-art neural network topologies in various application domains (and especially in the computer vision space).
Excellent verbal and communication skills.

Preferred/Beneficial Skills:

Knowledge of industry-standard (and advanced) tools, graph, and intermediate-representation (IR) formats and methodologies including LLVM, MLIR etc.
Open-source experience.

Apply