Data Scientist - Generative AI, 3+ years of AI-experience Babylon Voice | Manan AI | New York

Babylon Voice (Manan AI Inc) New York logo Babylon Voice (Manan AI Inc) New York

Job Description

Data Scientist - Generative AI, AI Voice ID, Multimedia 3+ years of AI-experience

Employee • New York, United States • Remote 

We are looking for a full-time Data Scientist who has 3 years of AI-experience. Generative AI focuses on creating new and original content, voice, chat responses, designs, synthetic data, deepfakes recognition. The role involves working on a Super App with AI VOICE ID -based Media Wallet, providing secure and convenient access to games, entertainment, and enabling interactions with brands, influencers, content, and payments. Our goal at Babylon Voice AI is to empower individuals and corporations to shape the future of voice-driven experiences while ensuring digital identity protection in a dynamic digital world. The team has several US-professors as advisors with expertise in ML, stochastic control who provide mentorship, research in real world NLU/P, conversational AI, voice/dialog systems. Your contributions will drive content discovery and personalization through voice/video interactions across apps, devices (e.g. Alexa, Google Home, etc.), and automotive products.

Company Overview:

Babylon Voice, a cutting-edge technology company, is at the forefront of revolutionizing the digital identity landscape through AI Voice ID. Our top-10 NYC startup team has earned awards, grants, and recognition in hackathons fromAmazon, Spotify, Google, Open AI (Microsoft), Deloitte, and Polygon, Unstoppable Domains. Our team comprises exceptional AI scientists, cryptography devs, CS and Ph.D. from Stanford, MIT, Google, Discord, World of Tanks and Telegram. Our technology has made its mark on Minecraft, Fortnite, Bloomberg, Disney, Republic, JP Morgan, and Roblox. Guiding our journey is a former partner from Andreessen Horowitz (a16z), who has joined as a founder, having previously sold their gaming startup to Disney. Leading our charge is our CEO, a Female Founder armed with a Ph.D. in Mathematics, a background as an MTV host, and selling her startup to Sony Pictures. Babylon Voice is on a mission to redefine the concept of digital identity using advanced AI technologies. We employ cutting-edge ML/AI technologies like STT, TTS, STS, NLP, CVPR, Style transfer using Cycle-GAN and Recycle-GAN, summary search, Speech recognition, Language translation, and Synthetic Media Generation. Our target markets include B2B, B2C, Entertainment, Customer Service, Advertising, Compliance, Security, and Privacy for Non-Pornographic DeepFake. Our vision centers on the convergence of AI and deep tech, welcoming the next billion digital users to an era where AI superpowers augment human intelligence. At the core of our innovation is "VoicePrint," our AI Digital identity standard, infusing biometric security and synthetic voice capabilities. Babylon Voice envisions earning royalties for each authenticated human voice, a testament to our transformative impact.

Why Join Us:

  • We are a fully distributed team with a New York HQ, offering flexible work and schedules;**
  • You will have the opportunity to work on turning bleeding-edge research into commercial products, focusing on digital ID and voice AI
  • We support a growth mindset and provide paper publications, mentorship, and internships from top researchers;
  • We welcome candidates internationally and foster a no-micromanagement environment for highly self-sufficient individuals.
  • Our tech stack includes PyTorch wrapped in Flask and running in a Kubernetes cluster, AWS, and a range of great libraries and frameworks such as React, NLTK, PyBrain, NumPy, SciPy, Pandas, Keras, Airflow, Docker, Fastapi, Flutter, Node.js, and TypeScript. We leverage 48+ AI/ML networks, including DALL-E and Stable Diffusion AI technology, for 3D avatars in Unity and Unreal Engine 5.

Responsibilities and what we are looking for:

  • Developing cutting-edge ML for automatic text summarization/keyword search, semantic search voice/speech recognition/language translation/text generation/NLP
  • Design criteria for text/voice performance evaluation and enhance existing methodologies
  • Research, design, experiment with, and build ML-systems, particularly related to text/voice and search products
  • R&D in text summarization/semantic search/NER. Read, understand and implement research papers. Assemble prototypes and MVP. Compress models and optimize inference
  • Prototype New Features. This means rapidly building prototypes end-to-end, including storage, business logic, UI/UX
  • Initial work could be done remotely with daily Zoom standups with full team and in person meetings Preferably you would be located and work in our New York, NY office 

About You:

  • Advanced STEM degree: M.S. or PhD with extensive relevant AI/NLP experience (Computer Science, Math, Statistics, Physics, Economics, Computational Linguistics, Neuroscience, Engineering 
  • Extensive experience utilizing deep learning & NLP-methodologies, building data pipelines, exploratory data analysis, 
  • Experience with cutting edge NLP techniques - BERT, XLM, XLnet (e.g., word2vec, RNNs, transformers). Experience with libraries ML-frameworks (e.g., PyTorch, Keras, Vowpal Wabbit, scikit-learn)
  • Familiarity with tools such as Python, R, Julia or MATLAB - Familiarity with AWS or another cloud infrastructure provider (GCP, Azure, etc), Technologies: Kafka, Airflow, Composer Production experience implementing machine learning pipelines and models at scale in Python, Java, Scala, or similar languages
  • Proficiency with distributed processing and warehousing frameworks (e.g., Spark, Hadoop, Hive, Tez, etc.). Experience with the research and development workflow/life-cycle for large-scale batch and streaming machine learning systems
  • Excellent written and verbal communication skills, ability to collaborate effectively with non-tech team members and stakeholders Self-motivated, growth-oriented, and driven to pursue solutions to challenging problems
  • A big "Plus" would be experience working in the advertising or media industry 

Our Tech Stack:

Includes PyTorch wrapped in Flask and running in a Kubernetes cluster, AWS, and a range of great libraries and frameworks such as React, NLTK, PyBrain, NumPy, SciPy, Pandas, Keras, Airflow, Docker, Fastapi, Flutter, Node.js, and TypeScript. We leverage 48+ AI/ML networks, including DALL-E and Stable Diffusion AI technology, for 3D avatars in Unity and Unreal Engine 5.

Register to Apply

Please let Babylon Voice (Manan AI Inc) New York know that you found this job role on

Similar Jobs

Babylon Voice (Manan AI Inc)| New York logo

Data Scientist for AI VOICE ID, 2+ years of Multimedia AI/ML | Babylon Voice (Manan AI Inc)| New York at Babylon Voice (Manan AI Inc)| New York

$95,000 - $150,000
AI Voice ID Crypto Web3 ID ZKP ML
41 days ago
Babylon Voice (Manan AI Inc)|New York logo

TON (Telegram) ZK Cryptography Engineer for AI VOICE ID | Babylon Voice (Manan AI Inc)|New York at Babylon Voice (Manan AI Inc)|New York

$80 - $150
TON ZKP Cryptography Telegram Zero-Knowledge Proof Blockchain Smart contract
41 days ago
Babylon Voice (Manan AI Inc) logo

Zero-Knowledge Proof Engineer, snarkVM on Aleo for AI VOICE ID | Babylon Voice (Manan AI Inc) New York at Babylon Voice (Manan AI Inc)

$80 - $150
ZKP AI snarkVM Aleo Zero-Knowledge Proof
41 days ago
ChainRecorder logo

Senior Bitcoin/Lightning Network Developer at ChainRecorder

bitcoin lightning network
55 days ago
Spend IT logo

Full Stack Developer at Spend IT

$80,000 - $100,000
node typescript api solidity
119 days ago
Glassnode logo

Senior Backend Engineer (Golang) - Greenfield Project (m/f/d). Remote at Glassnode

Backend Engineer Golang SQL Kubernetes Helm
201 days ago
ConsenSys logo

QA Engineer (Confirmations System) at ConsenSys

$139,000 - $175,000
QA Engineer Confirmations Selenium UXUI
203 days ago
Gemini logo

Senior Software Engineer, Fraud at Gemini

$152,000 - $213,000
Scala C++ Typescript Software Engineer
203 days ago
ConsenSys logo

Solidity Engineer at ConsenSys

$187,000 - $235,000
Solidity EVM English< Engineer
204 days ago
Coinbase logo

Staff Smart Contract Engineer - Developer at Coinbase

$201,450 - $237,000
Smart Contracts Engineer Solidity Ethereum
205 days ago