Algorithm Engineer, Large Language Model (Campus Recruitment 2026)
Malaysia
Tetap
Sepenuh masa
1 hari lepas
Job Description: The application exploration and implementation of large language models in the fields of e-commerce, gaming, and payments include building multi-language agents with technologies such as function calling, tool usage, RAG, and code interpreters. These technologies are utilized and applied in business areas such as customer service, shopping guidance, video, and search recommendations. The research and implementation of pre-training and alignment algorithms include ultra-large-scale multilingual pre-training technology, Mixture-of-Experts model training, Instruction Pretraining, SFT, and RLHF. From a multilingual perspective, these efforts aim to reduce the hallucination problem of models, enhance safety capabilities, and improve long-text comprehension and Q&A performance. Building a service framework and platform for large models, as well as accelerating inference, involves creating online service architectures, data, and evaluation platforms. This also includes exploring and implementing inference acceleration strategies such as Medusa, Speculative Decoding, multi-LoRA inference, and Pruning. Establishing a comprehensive evaluation system for Southeast Asian multilingual large models involves providing standardized model evaluation capabilities and creating comprehensive evaluation datasets. This system will drive and refine improvements in large language models through model evaluation, addressing issues encountered in practical business scenarios and during technical iteration and optimization processes. The data mining and optimization of large model algorithms involve constructing data systems for multilingual pre-training, instruction fine-tuning, and human preference behaviors. This includes establishing a comprehensive engineering system for model fine-tuning and dataset preparation to effectively improve the quality and delivery capability of datasets. Requirements: Bachelor's degree or above in Computer Science or related fields. Excellent coding skills, data structure and basic algorithm skills, proficiency in Python/Pytorch coding, and harness the Hands-on ability. Familiar with NLP and CV related algorithms and technologies, and those who are familiar with large model training and RL algorithms. Familiar with the basic principles and training methods of industry-leading LLM (such as GPT, LLaMA), or familiar with the basic principles and training methods of mainstream multi-modal large models (such as Flamingo, LLaVA), and have research experience in text generation or dialogue systems, etc. Excellent problem analysis and solving skills, able to deeply solve problems in large model training and application. Good communication and collaboration skills, able to explore new technologies with the team and promote technological progress.