Speech Scientist, Language Technology

ASAPP in New York

At ASAPP, we’re on a mission to build transformative machine learning-powered products that push the boundaries of artificial intelligence and customer experience. We focus on solving complex, data-rich problems — the kind where there are huge systemic inefficiencies and where a real solution will have a significant economic impact. Our CX performance platform uses machine learning across both voice and digital engagement channels to augment and automate human work, radically increasing productivity and improving the efficiency and effectiveness of customer experience teams.

If you’re interested in our mission and in being at the forefront of the speech recognition industry, we encourage you to apply and join our Language Technology group. As an ASAPP Speech Scientist on the Speech Modeling Team, you’ll build and improve our state-of-the-art speech recognition system for conversational speech. You’ll collaborate closely with a talented team of Researchers, Scientists and Engineers. Together you will advance the state of Speech Science, publish your work and continue to establish ASAPP as a first-class research institution. We’ve recently had three papers accepted to ICASSP 2023!

This position can be located in Mountain View or NYC.
ASAPP is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, disability, age, or veteran status. If you have a disability and need assistance with our employment application process, please email us at careers@asapp.com to obtain assistance.
    • Develop and optimize speech modeling procedures using large-scale audio data from production systems
    • Work closely with other speech modeling researchers on implementing algorithms that power state-of-the-art machine learning systems for real-time speech applications
    • Build integrated speech application systems and perform evaluations
    • Follow and lead new research trends related to end-to-end speech modeling
    • M.S. or Ph.D. in Computer Science
    • Research and/or work experience in machine learning, deep learning, and/or speech applications, including ASR, speaker/emotion recognition, and TTS
    • First-authored publications at workshops or conferences such as NeurIPS, Interspeech, ICASSP, ASRU and SLT
    • Experience in C/C++, Python or scripting languages
    • Familiarity with at least one deep learning toolkit, such as PyTorch, TensorFlow, or Kaldi/k2
    • Strong analytical and problem-solving skills
    • A collaborative spirit with excellent communication skills
    • Proven track record of achieving results as demonstrated by grants, fellowships and patents
    • Experience with end-to-end deep neural network research
    • Strong software engineering experience
    • Ability to thrive in an ever-changing, dynamic environment
    • Competitive compensation with stock options
    • Comprehensive medical, vision, and dental insurance
    • 401k matching
    • Fitness and wellness stipend
    • Mobile phone reimbursement
    • Mental well-being benefits
    • Professional learning and development stipend
    • Parental leave, including adoptive and foster parents
    • 3 weeks paid time off (increases with tenure) and unlimited sick leave