Speech Algorithm Engineer

Tencent · Technology, Internet Services

Posted

2 days

01

About the role

Business Unit

The Technology Engineering Group (TEG) supports the company and its business groups with technology and operational platforms, and builds and operates R&D management systems and data centers. TEG also provides users with a full range of customer services. As the operator of the largest network, device, and data center infrastructure in Asia, TEG leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, building new platforms and supporting business innovation.

What the Role Entails

Research and develop speech/audio large models, including but not limited to models for speech dialogue (speech interaction/audio-video dialogue), audio understanding (ASR/audio captioning), and audio generation (TTS/video dubbing).

Be responsible for data and algorithm work related to the pre-training, post-training, and reinforcement learning (for both text and audio) of speech/audio large models.

Oversee the open-sourcing of speech dialogue/audio understanding/audio generation models and their productization. This includes end-to-end optimization of the full pipeline for speech dialogue products, optimizing audio understanding in scenarios involving noise/accent/far-field/sound effects/music, and enhancing speech synthesis for applications like broadcasting, casual conversation, gaming, and social interaction.

Who We Look For

Prior experience in speech dialogue, speech synthesis, speech recognition, audio-video multimodality, or large language models (pre-training, fine-tuning, reinforcement learning) is preferred.

Strong coding skills and a solid foundation in data structures and algorithms. Proficiency in Python or C/C++ is required, along with familiarity with model training frameworks like PyTorch, Megatron, or DeepSpeed. Prior awards in competitions such as ACM/ICPC, NOI/IOI, Topcoder, or Kaggle are advantageous.

Publications in top-tier conferences or journals such as NeurIPS, ICLR, ICML, ACL, CVPR, ICASSP, or INTERSPEECH are preferred.

A solid background in mathematics and signal processing, the ability to read English technical literature, strong motivation, curiosity, and team spirit, excellent problem-solving skills, and a passion for technological innovation.

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

02

Aplyr's read

Tencent is a tech giant shaping digital landscapes with a diverse portfolio, attracting talent in gaming, AI, and internet services.

Synthesized from recent postings & public sources

What's promising

  • Tencent's vast digital ecosystem offers diverse career paths in gaming, AI, and cloud services.
  • Strong global presence provides opportunities for international career growth and collaboration.
  • Investment in cutting-edge technology fosters innovation and skill development.

What to watch

  • Regulatory scrutiny in China poses challenges for business operations and strategy.
  • High competition in the tech sector may limit rapid career advancement.
  • Complex organizational structure can lead to bureaucratic decision-making processes.

Why Tencent

  • Tencent's WeChat platform integrates social, payment, and service functionalities uniquely.
  • Pioneering investments in AI and gaming set it apart in tech innovation.
  • Extensive partnerships and investments in global tech companies enhance its influence.

Aplyr’s read is generated by AI from public sources.

03

About Tencent

Tencent is a Chinese multinational conglomerate holding company with subsidiaries in various internet-related services and products, entertainment, AI, and technology.
