***Employment contracts can be signed in Hong Kong, but you must be willing to work in Shenzhen full-time or most of the time!
Duties:
1. Research speech source separation, ASR, audio processing, multi-speaker recognition, and high-precision speech recognition modules to achieve high-quality, low-latency speech perception.
2. Research high-quality TTS and controllable speech synthesis, train TTS models, and implement a smooth, low-latency, emotionally expressive speech output module.
Qualifications:
1. Master's or doctoral degree in a related field such as artificial intelligence, computer science, electronic engineering, automation, signal processing, or physics.
2. At least two years of research or project experience in at least one of the following areas: speech processing, speech recognition, speaker recognition, speech synthesis.
3. Familiarity with mainstream audio processing, ASR, or TTS algorithms, and relevant model training experience.
4. Good English reading and writing skills, excellent communication skills, and the ability to implement new algorithms from papers.
5. Priority will be given to candidates with high-quality publications (such as Interspeech, ICASSP, or IEEE Transactions on Audio, Speech, and Language Processing), strong results in academic competitions, or significant influence in open-source communities.
6. Solid theoretical foundation, innovative spirit, and the ability to think in depth.