Our mission is to build the next generation of AI: ubiquitous, interactive intelligence that runs wherever you are. Today, not even the best models can continuously process and reason over a year-long stream of audio, video and text—1B text tokens, 10B audio tokens and 1T video tokens—let alone do this on-device.
We’re pioneering the model architectures that will make this possible. Our founding team met as PhDs at the Stanford AI Lab, where we invented State Space Models or SSMs, a new primitive for training efficient, large-scale foundation models. Our team combines deep expertise in model innovation and systems engineering paired with a design-minded product engineering team to build and ship cutting edge models and experiences.
We’re funded by leading investors at Index Ventures and Lightspeed Venture Partners, along with Factory, Conviction, A Star, General Catalyst, SV Angel, Databricks and others. We’re fortunate to have the support of many amazing advisors, and 90+ angels across many industries, including the world’s foremost experts in AI.
We’re seeking an exceptional Product Manager to drive model quality and behavior excellence for our text-to-speech and speech-to-text products at Cartesia. As our Model Behavior PM, you’ll be the bridge between our customers’ needs and our model development teams, defining what world-class TTS and STT models should sound like, perform like, and feel like. This role combines deep analytical rigor with customer empathy to continuously elevate our model quality and establish Cartesia as the gold standard in voice AI.
Define and evolve comprehensive evaluation frameworks for TTS and STT model behavior, establishing clear metrics for naturalness, accuracy, prosody, emotion, latency, and user satisfaction across diverse use cases
Conduct systematic competitive analysis by deeply using our products alongside competitors’ offerings, identifying quality gaps, behavioral differences, and opportunities for differentiation
Partner closely with data teams to design data collection strategies, labeling guidelines, and dataset curation approaches that directly improve model behavior and performance
Collaborate with evaluation teams to build rigorous testing methodologies, automated evaluation pipelines, and human evaluation protocols that catch edge cases and quality regressions
Engage directly with customers across industries to understand their voice AI requirements, gather qualitative feedback on model behavior, and translate insights into actionable product improvements
Drive cross-functional alignment between research, engineering, data, and GTM teams to prioritize and execute on model behavior improvements that deliver maximum customer impact
Build a deep intuition for what makes TTS and STT models truly great—from subtle pronunciation nuances to handling of edge cases—and champion quality standards across the organization
Create frameworks, documentation, and best practices that help internal teams and customers understand model capabilities, limitations, and optimal usage patterns
6+ years of product management experience with technical products, preferably in AI/ML, audio, or speech technologies
Strong analytical mindset with experience designing evaluation frameworks, defining success metrics, and making data-driven quality decisions
Deep customer empathy with proven ability to conduct user research, synthesize qualitative feedback, and translate needs into product requirements
Technical fluency to work effectively with ML researchers, data scientists, and engineers—understanding model behavior at a detailed level
Exceptional attention to detail and quality standards, with the ability to notice subtle differences in model outputs and articulate what makes one better than another
Experience working cross-functionally with data teams, engineering teams, and evaluation/testing teams
Strong communication skills to advocate for quality and influence technical teams toward customer-centric decisions
Direct experience with speech technologies (TTS, STT, voice cloning, or conversational AI)
Background in linguistics, audio engineering or speech sciences
Experience with ML model evaluation, A/B testing methodologies, or human evaluation design
Familiarity with audio quality metrics (MOS, WER, CER, prosody analysis)
Prior experience at a company known for exceptional product quality and attention to detail
🍽 Lunch, dinner and snacks at the office
🏥 Fully covered medical, dental, and vision insurance for employees
🏦 401(k)
✈️ Relocation and immigration support
🦖 Your own personal Yoshi
🏢 We’re an in-person team based out of San Francisco. We love being in the office, hanging out together and learning from each other everyday.
🚢 We ship fast. All of our work is novel and cutting edge, and execution speed is paramount. We have a high bar, and we don’t sacrifice quality and design along the way.
🤝 We support each other. We have an open and inclusive culture that’s focused on giving everyone the resources they need to succeed.
#J-18808-Ljbffr...diverse array of programs and platforms, including major military prime contractors such as Lockheed Martin, Northrop Grumman, and Raytheon. SCOPE: The Procurement Manager for R&D is responsible for managing a team of buyers to source, estimate,...
...intercepts or other digital media in English and in foreign languages and provide an immediate oral summary in English, followed by a hand-written or typed (as instructed by the Linguistic Supervisor) summary in English in a format specified by our government customer....
Job Title: Production Assistant Chicago, ILSalary: $37,000 - $49,000 per yearJob Type: Full-timeWork Type: In-person (strictly on-site)About UsPattern Promotions is a fast-growing marketing and promotions company dedicated to creating memorable brand experiences...
...through the winter months. Get paid within 24 hours!!!!! As a Snow & Ice Management Equipment Operator, you will play a vital role... ...that existing landscaping and property are not damaged during snow removal operations. Care for and maintain company-provided equipment,...
...interpreters in North Carolina or other states for VRI/OPI opportunities #127891; Qualifications Fully bilingual, fluent in Portuguese and English Technologically proficient (familiar with Google Meet, Microsoft Teams , etc.) High school diploma required;...