At ASAPP, we are on a mission to build transformative machine learning-powered products that push the boundaries of artificial intelligence and customer experience. We focus on solving complex, data-rich problems — the kind where there are huge systemic inefficiencies and where a real solution will have a significant economic impact. Our CX performance platform uses machine learning across both voice and digital engagement channels to augment and automate human work, radically increasing productivity and improving the efficiency and effectiveness of customer experience teams.

Site Reliability Engineers (SREs) are responsible for the overall performance and reliability of ASAPP's infrastructure and products. SREs design and implement the tools that automate building reliable and performant systems. We emphasize building tools over manual processes. We implement, not administer. We’re obsessed with automation, not repetition. Our job is to focus on building reliable infrastructure and tools for our product teams so that they can solve customer problems and deliver new features, not reinvent platforms.

ASAPP is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, disability, age, or veteran status. If you have a disability and need assistance with our employment application process, please email us at careers@asapp.com to obtain assistance. #LI-MK1 #LI-Hybrid

Work with product engineering teams on service architecture and implementation
Deliver configuration as code and automate everything
Direct and implement monitoring and alerting systems to support rapid problem diagnosis
Perform Root Cause Analysis and design and deliver resolutions
Work on our Kubernetes / AWS infrastructure to support our product engineers
Write software to enable secure and performant communication in our production systems

+4 years of relevant experience bringing software to production at high scale
Participation in on-call rotation, triaging and addressing production issues
Obsession with automation and instrumentation
Understanding of complex systems and failure scenarios
Excellent communication skills
Knowledge of AWS services, containers and container management frameworks
Familiarity with Message Bus based systems and distributed architectures
Proficiency in Python and/or Go

BS or MS degree in the Computer Science field, or equivalent hands-on experience.
Experience in product oriented environments
Scalable distributed applications experience

Competitive compensation with stock options
Comprehensive medical, vision, and dental insurance
401k matching
Fitness and wellness stipend
Mobile phone reimbursement
Mental well-being benefits
Professional learning and development stipend
Parental leave, including adoptive and foster parents
3 weeks paid time off (increases with tenure) and unlimited sick leave

Apply

Site Reliability Engineer

Asapp-2 in New York