About Me

2023.5 at Xiamen

  • Hello, I'm Wei Liu (刘维). Here are my Email, GitHub, and Google Scholar.
    • 2014-2018: Bachelor of Communication Engineering at BUPT
    • 2018-2021: Master of Computer Engineering at CIST Lab@BUPT
    • 2021-2023: NLP Researcher at Tencent
    • 2023.8: I will be working as an RA at THUNLP, in collaboration with Prof. Zhiyuan Liu
    • 🙋🏻‍♂ Now actively seeking Ph.D. opportunities (2024 Spring/Fall) in NLP/LLMs!

Research Interests

  • I focus on exploring the intelligence that emerges in deep natural language processing models when they compress and summarize language. I believe that knowledge and intelligence are born in the process of language compression.
  • I have also started to explore the next stage of intelligence in natural language by studying the behavior of LLM-based multi-agent systems in complex environments or societies. This is still an early exploration, and I am interested in several potential research directions, including:
    • letting LLMs build and use complicated tools through multi-agent collaboration
    • self-organization and automatic job differentiation in multi-agent collaboration
    • alignment in multi-agent activities
    • improving communication efficiency among agents
    • using multi-agent activities to improve LLMs

Research Details

  • More Comprehensive and Factual Summarization:
    • Introduce Determinantal Point Processes to address attention degeneration in summarization[1].
    • Discover subjective bias in public summarization datasets, which leads to text degeneration[2].
    • Design a fine-tuning scheme based on mutual information that minimizes hallucination in summarization[3].
    • Improve sentiment consistency in abstractive summarization through a memory-based approach[4].
    • Scientific Paper Summarization, Multi-lingual Lay Summarization, Long Document Summarization[5,6,7].
  • More Accurate and Controllable Keyphrase Prediction:
    • Frame keyphrase ranking as a machine reading comprehension (MRC) task to better leverage pre-trained language models (PLMs) and improve performance. (Patent-only)
    • Develop a unified present/absent Keyphrase Prediction method[8].
    • Explore Controllable Keyphrase Generation as an early attempt at prompt engineering[9].
  • Multi-Agent Systems Powered by LLMs:
    • Release a repo for building complicated tools within an LLM multi-agent virtual environment: ChatDev[10]

Industrial Experience

  • At Tencent, I aim to improve the performance of News Feed Recommendations and Advertising.
    • Improve the NLU ability for News Feed Recommendation by more accurate and controllable keyphrase prediction.
    • Introduce non-commercial behaviors into advertising modeling through graph modeling.
    • Explore stable and end-to-end feature quantization methods for Advertising models.
    • Model diverse user interests with a diffusion-based approach [In Submission].
    • Achieve better tradeoffs between single-tower and two-tower models during the recall/pre-rank stage in Advertising [In Submission].


  • My Google Scholar
[1] In Conclusion Not Repetition: Comprehensive Abstractive Summarization with Diversified Attention Based on Determinantal Point Processes
CoNLL 2019 Long Paper
Lei Li, Wei Liu, Marina Litvak, Natalia Vanetik, Zuying Huang
[2] Subjective Bias in Abstractive Summarization
arXiv Preprint
Lei Li, Wei Liu, Marina Litvak, Natalia Vanetik, Jiacheng Pei, Yinan Liu, Siya Qi
[3] CO2Sum: Contrastive Learning for Factual-Consistent Abstractive Summarization
arXiv Preprint
Wei Liu, Huanqin Wu, Wenjing Mu, Zhen Li, Tao Chen, Dan Nie
[4] A Multi-View Abstractive Summarization Model Jointly Considering Semantics and Sentiment
CCIS 2018 Long Paper
Moye Chen, Lei Li, Wei Liu
[5] CIST@CLSciSumm-19: Automatic Scientific Paper Summarization with Citances and Facets
SIGIR 2019 Shared Task
Lei Li, Yingqi Zhu, Yang Xie, Zuying Huang, Wei Liu, Xingyuan Li, Yinan Liu
[6] Multi-lingual Wikipedia Summarization and Title Generation On Low Resource Corpus
RANLP 2019 Shared Task
Wei Liu, Lei Li, Zuying Huang, Yinan Liu
[7] CIST@CL-SciSumm 2020, LongSumm 2020: Automatic Scientific Document Summarization
EMNLP 2020 Shared Task
Lei Li, Yang Xie, Wei Liu, Yinan Liu, Yafei Jiang, Siya Qi, Xingyuan Li
[8] UniKeyphrase: A Unified Extraction and Generation Framework for Keyphrase Prediction
ACL 2021 Findings Long Paper
Huanqin Wu, Wei Liu, Lei Li, Dan Nie, Tao Chen, Feng Zhang, Di Wang
[9] Fast and Constrained Absent Keyphrase Generation by Prompt-Based Learning
AAAI 2022 Long Paper
Huanqin Wu, Baijiaxin Ma, Wei Liu, Tao Chen, Dan Nie
[10] Communicative Agents for Software Development
Chen Qian, Xin Cong, Wei Liu, Cheng Yang, Weize Chen, Yusheng Su, Yufan Dang, Jiahao Li, Juyuan Xu, Dahai Li, Zhiyuan Liu, Maosong Sun