PAIR Public Forum for Research and Innovation: Prof. YANG Hongxia of PolyU delivers "DeepSeek and Beyond"
PAIR Public Forum for Research and Innovation

-
Date
11 Mar 2025
-
Organiser
PolyU Academy for Interdisciplinary Research
-
Time
15:00 - 16:30
-
Venue
Jockey Club Auditorium (JCA), PolyU Campus
Speaker
Prof. YANG Hongxia
Enquiry
PolyU Academy for Interdisciplinary Research info.pair@polyu.edu.hk
Summary
Abstract
DeepSeek’s recently released model has demonstrated that very strong AI capabilities can be achieved without the need of particularly large models. Their AI models perform in a manner comparable to top AI models in the United States, but require much fewer computing resources. More importantly, they have made this technology sharable through open sourcing. This breakthrough has changed the way how people think about AI development, and has also triggered concerns over privacy security, technological competition and other issues. Our Co-Gen AI project aims to build on this idea to help enhance Hong Kong’s AI competitiveness. Our platform has three innovative features. The first one is Domain-Adaptive Continual Pretraining (DACP), which allows AI models to become more specialised through learning data from specific domains (e.g., industry, scientific research) and demonstrates better training outcomes that open-source models and ChatGPT. The second one is Advanced Model Fusion Infrastructure, which comprises a “Model Fusion” over existing specialised models, significantly reducing the use of computing resources—training a 7B model requires only 64–128 GPU cards, and a 100B large model requires just 512–1,024 cards, thus bringing over 90% resource saving. The third one is Resource-Efficient Architecture, which allows us to leverage ordinary computing resources in Cyberport, Science Park, Pengcheng Lab and other places to develop AI, thus enabling the efficient training of large models by integrating small models, and avoiding the problems in traditional approaches that require large clusters of identical high-end GPUs for training. Through this innovative approach, AI development can be made more accessible and less dependent on massive centralised computational resources. This project expects to help Hong Kong occupy an important position in the global generative AI development landscape.

Prof. YANG Hongxia
Professor, Department of Computing
The Hong Kong Polytechnic University
Prof. YANG Hongxia is a distinguished AI scientist with over 15 years of experience, specialising in large-scale machine learning, data mining, and deep learning. Throughout her illustrious career, Prof. Yang has developed ten significant algorithmic systems that have enhanced the operations of various enterprises. Her research focuses on pre-trained models, big data analytics, and the practical deployment of large language model systems in real-world settings.
Prof. Yang has an impressive academic record, having published over 100 top-tier papers, which have garnered approximately 12,000 citations and an H-index of 46. She also holds more than 50 patents. Her contributions to the field have been recognised with several prestigious awards, including the 2019 Super AI Leader Award at the World Artificial Intelligence Conference and the 2020 National Science and Technology Progress Award.
Prof. Yang was named one of Forbes China’s Top 50 Women in Tech for 2022, and she received the AI 2000 Most Influential Scholar Award for 2023–2024. She founded foundational modelling teams at both ByteDance and Alibaba and has held prominent roles at Yahoo! Inc. and the IBM T.J. Watson Research Center.
Prof. Yang earned her Ph.D. from Duke University and her B.S. from Nankai University. She is recognised globally as a pioneer in the field of Generative AI.