Data Engineer (Financial Data Cleaning and Model Fine-Tuning Direction)

Posted 3 months ago
1 to 3 years
Bachelor
Salary negotiable
Job Highlights
本科及以上学历,计算机等相关专业优先
熟练掌握Python及其数据处理相关库
有调用大模型API并进行数据生成与过滤的经验者优先
Job Description
<p>Duties include:</p><ul><li><p>Responsible for processing a large amount of financial raw data in a server environment, using Python for table merge, join, etc., to ensure efficient data integration</p></li><li><p>Perform data cleaning, including handling missing values, expanding json fields, identifying and correcting duplicates and outliers, and improving data quality</p></li><li><p>According to the semantic needs of the model prompt, build and maintain a data dictionary, and implement prompt-level data processing and null value filtering</p></li><li><p>Based on prompt, call open-source large model API (such as LLM), generate high-quality data, and combine rejection sampling and other methods for secondary processing to ensure that the data can be used for downstream model fine-tuning</p></li></ul><p></p><p>Job Requirements</p><ul><li><p>Bachelor's degree or above, computer science or related fields are preferred</p></li><li><p>Proficient in Python and its data processing related libraries (such as pandas, numpy), with large-scale data processing experience</p></li><li><p>Have data cleaning, feature engineering, ETL, etc. related experience, and be able to independently complete the whole data preprocessing process</p></li><li><p>Familiar with json, data dictionary, prompt engineering, etc., data structures and related processing methods</p></li><li><p>Applicants with experience in calling large model APIs (such as OpenAI, LLaMA, etc.) and generating and filtering data are preferred.</p></li><li><p>Have good business understanding, communication and team collaboration skills, and be able to withstand a certain amount of work pressure</p></li></ul>
View more
Data Cleaning
Data Structures
Python (Programming Language)
English
Cantonese
Mandarin
HR Chen
Hong Kong Generative AI Development Center Co., Ltd.·HR
Company Overview
The Hong Kong Generative AI Research &amp; Development Center (HKGAI) was established in October 2023, focusing on the research and development of generative artificial intelligence technologies. It is one of the research centers under the InnoHK program, a key initiative of the Hong Kong SAR Government. Led by Prof. Yike Guo, the Provost of The Hong Kong University of Science and Technology, HKGAI collaborates with four local top-tier institutions: The University of Hong Kong, The Chinese University of Hong Kong, The Hong Kong Polytechnic University, City University of Hong Kong, as well as the internationally renowned National University of Singapore. HKGAI focuses on developing a series of Multimodal, Multilingual Foundation Models, vertical Foundation Models, and also the tailor-made applications for Hong Kong society. In addition, HKGAI conducts research on ethics, security, and governance in generative AI technologies and applications, providing consultation and recommendations to the HKSAR Government. HKGAI will strive to enhance the role of Hong Kong's innovation and technology industry in promoting economic progress in the Greater Bay Area, cultivating AI talent and ecosystem in Hong Kong, and increasing Hong Kong's global influence in the fields of AI research and application. Learn more about our product, HKChat https://chat.hkchat.app/download.html?lang=tc&amp;from=iam
1/2
2/2
Similar jobs
Leapin Hong Kong
Experience with data visualization tools (e.g., Tableau, Power BI, or Looker)
Bachelor's degree in Data Science, Statistics, Mathematics, Computer Science, or related
Proficiency in analytical tools and programming languages such as SQL, Python, R
$25K-38K/Mth
香港智感傳媒
3年以上游戏行业数据分析经验
熟悉MYSQL等主流数据库
英语可作为工作语言
$25K-38K/Mth
Quick reply
香港生成式人工智能研發中心有限公司
参与构建下一代语音大模型
与AI技术前沿紧密相关
需Python编程及数据处理能力
Quick reply
香港生成式人工智能研發中心有限公司
本科及以上学历,计算机相关专业优先
熟练掌握Python及数据处理库
有大规模数据处理经验
Quick reply
香港生成式人工智能研發中心有限公司
深度参与大模型评测体系设计
多业务场景评测指标构建
计算机、AI相关专业本科以上
Be careful
Don’t provide your bank or credit card details when applying for jobs.
Save