About the Role
<table class="Table">
<tbody>
<tr>
<td valign="top"><span><span><span><b><span>Description: </span></b><span>Join us in building the next generation of AI infrastructure that will power innovation across the customer organization. <span>We’re seeking a full-stack software engineer to support our AI infrastructure team. In this role, you’ll help build and maintain the platform that provides the foundation for the customer’s AI capabilities, with a focus on inference services while supporting a broader ecosystem of AI-enabled applications.</span> </span></span></span></span><br />
<span><span><span><b><span>Responsibilities: </span></b></span></span></span>
<ul>
<li><span><span><span><span>Implement and support infrastructure for AI model inference under the guidance of senior engineers.</span></span></span></span></li>
<li><span><span><span><span>Contribute to the development and maintenance of production AI services and applications, including retrieval augmented generation (RAG) and autonomous agents.</span></span></span></span></li>
<li><span><span><span><span>Participate in implementing monitoring, logging, and observability for AI services. </span></span></span></span></li>
<li><span><span><span><span>Assist with automating infrastructure provisioning and configuration using IaC principles. </span></span></span></span></li>
<li><span><span><span><span>Help ensure availability, reliability, and performance of AI platform components. </span></span></span></span></li>
<li><span><span><span><span>Follow established security best practices for AI systems and data. </span></span></span></span></li>
<li><span><span><span><span>Work within ambiguous problem spaces while learning to define structured solutions. </span></span></span></span></li>
<li><span><span><span><span>Collaborate with cross-functional teams and contribute to shared engineering standards. </span></span></span></span></li>
</ul>
<span><span><span><b><span>Skills Requirements: </span></b></span></span></span>
<ul>
<li><span><span><span><span>Experience contributing to production systems.</span></span></span></span></li>
<li><span><span><span><span>Familiarity with high-volume web application architectures. </span></span></span></span></li>
<li><span><span><span><span>Exposure to cloud engineering, preferably AWS. </span></span></span></span></li>
<li><span><span><span><span>Working knowledge of Kubernetes concepts and containerized deployments. </span></span></span></span></li>
<li><span><span><span><span>Proficiency in Python. </span></span></span></span></li>
<li><span><span><span><span>Familiarity with CI/CD pipelines and DevOps practices. </span></span></span></span></li>
<li><span><span><span><span>Ability to learn unfamiliar technologies quickly. </span></span></span></span></li>
<li><span><span><span><span>Strong communication skills and willingness to ask questions.</span></span></span></span></li>
</ul>
<span><span><span><b><span>Nice to Haves: </span></b></span></span></span>
<ul>
<li><span><span><span><span>Exposure to AI inference serving technologies (vLLM, LiteLLM, etc.).</span></span></span></span></li>
<li><span><span><span><span>Familiarity with agentic frameworks (LangChain). </span></span></span></span></li>
<li><span><span><span><span>Awareness of vector databases and embedding systems. </span></span></span></span></li>
<li><span><span><span><span>Interest in distributed systems or performance engineering.</span></span></span></span></li>
</ul>
</td>
</tr>
</tbody>
</table>