
PAIR Public Forum for Research and Innovation: Prof. YANG Hongxia of PolyU delivers "DeepSeek and Beyond"

PAIR Public Forum for Research and Innovation

  • Date

    11 Mar 2025

  • Organiser

    PolyU Academy for Interdisciplinary Research

  • Time

    15:00 - 16:30

  • Venue

    Jockey Club Auditorium (JCA), PolyU Campus  

Speaker

Prof. YANG Hongxia

Enquiry

PolyU Academy for Interdisciplinary Research info.pair@polyu.edu.hk

Summary

Abstract

DeepSeek’s recently released model has demonstrated that very strong AI capabilities can be achieved without particularly large models. Its AI models perform comparably to top AI models in the United States but require far fewer computing resources. More importantly, DeepSeek has made this technology shareable through open sourcing. This breakthrough has changed how people think about AI development, and has also triggered concerns over privacy, security, technological competition and other issues. Our Co-Gen AI project aims to build on this idea to help enhance Hong Kong’s AI competitiveness. Our platform has three innovative features. The first is Domain-Adaptive Continual Pretraining (DACP), which allows AI models to become more specialised by learning from data in specific domains (e.g., industry, scientific research) and demonstrates better training outcomes than open-source models and ChatGPT. The second is Advanced Model Fusion Infrastructure, which performs “Model Fusion” over existing specialised models, significantly reducing the use of computing resources: training a 7B model requires only 64–128 GPU cards, and a 100B large model requires just 512–1,024 cards, a resource saving of over 90%. The third is Resource-Efficient Architecture, which allows us to leverage ordinary computing resources in Cyberport, Science Park, Pengcheng Lab and other places to develop AI. This enables the efficient training of large models by integrating small models and avoids the problem in traditional approaches of requiring large clusters of identical high-end GPUs for training. Through this innovative approach, AI development can be made more accessible and less dependent on massive centralised computational resources. The project is expected to help Hong Kong occupy an important position in the global generative AI development landscape.


Prof. YANG Hongxia

Associate Dean (Global Engagement), Faculty of Computer and Mathematical Sciences
Professor, Department of Computing
The Hong Kong Polytechnic University

 

Prof. YANG Hongxia is a distinguished AI scientist with over 15 years of experience, specialising in large-scale machine learning, data mining, and deep learning. Throughout her illustrious career, Prof. Yang has developed ten significant algorithmic systems that have enhanced the operations of various enterprises. Her research focuses on pre-trained models, big data analytics, and the practical deployment of large language model systems in real-world settings.

Prof. Yang has an impressive academic record, having published over 100 top-tier papers, with approximately 12,000 citations and an h-index of 46. She also holds more than 50 patents. Her contributions to the field have been recognised with several prestigious awards, including the 2019 Super AI Leader Award at the World Artificial Intelligence Conference and the 2020 National Science and Technology Progress Award.

Prof. Yang was named one of Forbes China’s Top 50 Women in Tech for 2022, and she received the AI 2000 Most Influential Scholar Award for 2023–2024. She founded foundational modelling teams at both ByteDance and Alibaba and has held prominent roles at Yahoo! Inc. and the IBM T.J. Watson Research Center.

Prof. Yang earned her Ph.D. from Duke University and her B.S. from Nankai University. She is recognised globally as a pioneer in the field of Generative AI.
