About the Role
As a technical lead and manager of the core infrastructure team, you will be responsible for leading the team in building the Moveworks AI infrastructure. As Moveworks grows fast, the infrastructure team is tasked with designing and operating reliable and resilient foundational services as well as frameworks that allow our product to scale seamlessly and our engineering teams to build customer facing features very rapidly.
We’re building a team that indexes on moving fast, solving challenging product/engineering problems and providing value to our customers. To be successful, you'll be partnering with machine learning, search, product, data, and full stack teams to identify, define and build elegant solutions. This is an opportunity to play an integral role at the fastest-growing AI startup in its space.
Oversee and evolve Moveworks infrastructure architecture and foundation services such with extremely high reliability, scalability, performance, and security.
Closely work with machine learning, search, product, data, and frontend teams, understand their infrastructure needs, influence infrastructure roadmap, and lead as well as execute on various projects.
Architect and build out the core infrastructure and foundational functionalities such as distributed key value store, schema-less data store, authentication and authorization, event streaming, distributed configuration management and hot reloading, rate limiting and circuit breaking, system for feature flags and A/B testing, traffic capture and replay, search infrastructure, etc.
Define methodologies and metrics to measure performance of Moveworks infrastructure services, identify and fix bottlenecks, as well as improve overall performance and scalability of Moveworks stack.
Hire and mentor engineers in the team and foster a culture of technical excellence, collaboration and continuous improvement.
What You Bring To The Table
Hands on technical leader with a strong sense of ownership - proactively leading complex platform and infrastructure initiatives from conception through delivery and measurement.
3+ years of experience growing and managing team with 5+ engineers including senior engineers.
7+ years of hands on experience designing, building, shipping, and maintaining highly scalable and fault tolerant backend distributed systems.
Familiarity with Python/Golang/Java/C++.
Experience with containers/Docker, and with cloud infrastructure like AWS/GCP/Azure.
Experience working with and operating open source technologies like Kafka, Elasticsearch or Opensearch, Istio, Vector DB, etc. at scale.
Experience with search and ML infrastructure is plus.
Excellent verbal and written communication skills, with the ability to influence all levels of the organization and translate complex platform concepts into clear business value.
Desire to work at a startup pace in a small company with a high degree of ownership.
Experience solving for performance, optimization, scalability, latency, and monitoring.
BS+ in computer science or a related field.
High level of curiosity about engineering outside of your immediate discipline and an incessant desire to learn.