Nuvepro - Task Intelligence for the Enterprise
xAI· Human Data· Remote

AI Tutor - Swahili

Classified Tasks (15)

Automate 0%Augment 53%Human-Only 47%

Augment (8)

AI assists, human decides

Use proprietary software to label multilingual audio clips, voice recordings, speech samples, and auditory elements.

technical

Provide inputs and feedback on audio projects to improve speech recognition and voice interactions.

communication

Curate and deliver high-quality audio datasets ensuring clear, natural spoken output and professional audio standards.

operational

Transcribe audio with high accuracy across accents and varying audio quality.

technical

Identify and mark audio quality issues such as noise, distortion, or poor vocal delivery in recordings.

analytical

Collaborate with technical staff to design tasks that improve AI handling of speech modulation, accent variation, and multilingual processing.

technical

Work with technical staff to improve annotation tools and optimize audio workflow efficiency.

technical

Provide feedback on AI outputs to help refine Grok's handling of multilingual audio nuances.

analytical

Human-Only (7)

Requires human judgment

Annotate audio data for linguistic and prosodic features including intonation, rhythm, and accent.

analytical

Record high-quality voice samples in multiple languages for training purposes.

operational

Evaluate speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.

analytical

Make independent judgments on ambiguous or varied audio material, including noisy or accented speech.

analytical

Ensure accurate representation of linguistic details and prosody in annotations to enhance natural spoken interactions.

analytical

Support bridging language barriers by improving speech processing across diverse languages, accents, and cultural contexts.

communication

Contribute hands-on annotated audio assets and recordings for model training and refinement.

operational

Job description

ABOUT xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates. ABOUT THE ROLE: As an AI Tutor specialized in multilingual audio capabilities, you will contribute to xAI's mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts. Your work will focus on curating and annotating high-quality audio data to enhance Grok's global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI's handling of multilingual audio nuances. RESPONSIBILITIES: Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages. Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards. Collaborate with technical staff to develop tasks that improve AI's ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing. Work with technical staff to improve annotation tools for efficient audio workflows. BASIC QUALIFICATIONS: Native proficiency in Swahili with exposure to diverse accents, dialects, or regional variations. Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes. Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages. Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form. Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality. Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages. Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech. Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively. Commitment to developing AI that masters sophisticated multilingual audio capabilities. PREFERRED SKILLS AND EXPERIENCE: Demonstration of exceptional attention to linguistic nuan
Source: xAI careers · scraped 2026-05-22
Apply at xAI