Personal Portfolio

About me

I am a Research Assistant working with professor Bryan Hooi at National University of Singapore, where I recently graduated with a bachelor's degree in Computer Science and Mathematics. I am also currently working with professor Hannaneh Hajishirzi and her student Zeqiu Wu at University of Washington.

My research is concentrated on Natural Language Processing (NLP). A more specific area that I am keen to explore is extending the usage of language models to more general settings.

Previously, I was advised by professor Tat Seng Chua and professor Lizi Liao at NExT++ Research Center. I worked on Conversation Disentanglement and Multi-party Response Generation to enable chit-chat agents to converse properly in multi-party dialogues that involve more than 2 speakers. Solving these tasks is crucial for the eventual application of chatbots in group chat and forums, where different topics may be discussed in a simultaneous and interleaved manner.

My current research directions include
• Applying LMs to multi-party settings: In multi-party settings, potentially with preset goals, different agents can assume different personas and roles, and the relations between the agents can be collaborative, adversarial, or competitive. There have been exciting works released recently that make use of multiple dialogue agents for settings such as problem-solving and text adventures. Exploration is needed for more complicated scenarios.

• Reasoning: I am exploring various ways to enhance the reasoning ability of LMs, including (1) collaboration between agents to make the reasoning path more logical and (2) augmentation of external knowledge to increase truthfulness and provide attribution.

Aside from these, I am excited to explore other NLP topics too.

Previous Research Topics

CV

Please see the attached document for more details.

Publications

Chengyu Huang, Zheng Zhang, Hao Fei, and Lizi Liao

EMNLP
Dec 07, 2022

Conversation Disentanglement with Bi-Level Contrastive Learning

We propose to formulate Conversation Disentanglement as a global clustering task, in contrast to how it is traditionally solved as a pointing task. We use contrastive learning to pull embeddings of utterances from the same session closer and pull away those from different sessions. K means is then used to cluster the utterances.

[Paper] [Slides]
Chengyu Huang, Zeqiu Wu, Wenya Wang, Yushi Hu

ACL, submitted
2024

Helping Large Language Models Generate Better Citations with Fine-Grained Rewards

We propose to use fine-grained rewards to train LLMs to generate both supportive and relevant citations in the response, while preserving other qualities, such as answering correctness in the Question Anwering task.

[Paper]
Yuexin Li, Chengyu Huang, and Bryan Hooi

USENIX, submitted
2024

Unleashing the Power of Reference-Based Phishing Detectors with Multimodal Knowledge Graphs

We propose to construct an industry-based, multimodal knowledge base construction pipeline that are robust to temporal shift as well as a multimodal phishing detector that leverages HTML text to detect brand and credential-taking intentions.

[Paper]

Projects

Helping Large Language Models Generate Better Citations with Fine-Grained Rewards

Supervisor: Professor Hannaneh Hajishirzi, Professor Wenya Wang, Dr Zeqiu Wu

We observed that most existing LLMs do not output attributions for the claims they generated, while attributions can effectively reduce hallucination and increase model responses' explanability.

We propose to use fine-grained rewards to train LLMs to both generate supportive and relevant citations, while preserving other qualities, such as answering correctness in the Question Anwering task.

Toolstack: Python, PyTorch

[Paper]
Unleashing the Power of Reference-Based Phishing Detectors with Multimodal Knowledge Graphs

Supervisor: Professor Bryan Hooi

We observed that existing phishing detection methods are vulnerable to the temporal distributional shift of phishing webpages, and cannot generalize well when webpages shift their phishing intention from logos to HTML text.

We propose to construct an industry-based, multimodal knowledge base construction pipeline that are robust to temporal shift as well as a multimodal phishing detector that leverages HTML text to detect brand and credential-taking intentions.

Toolstack: Python, PyTorch

[Paper]
Multi-party Response Generation with Conversation Disentanglement

Supervisor: Professor Tat Seng Chua, Professor Lizi Liao

This work aims to provide a standard solution for multi-party response generation, and eventually enable dialogue agents to converse properly in multi-party conversations. There has been initial exploration on this task. However, the reliance of existing models on dialogue structure labels restricts their usefulness.

A natural and more robust solution is to first disentangle the conversation, so that the context can be filtered and reorganized for response generation. While there has been substantial work that focuses on conversation disentanglement and response generation as standalone tasks, very few has explored the option of combining them together.

To achieve the goal, we consider 2 integration options.

• Integration with hard disentanglement: Directly filter the dialogue context with a disentanglement model and then input the resulting dialogue into a response generation system, pointing to this project and this project
• Integration with soft disentanglement: Use a soft Relational Disentanglement to infer a relation graph and get the distilled context embedding, and then input the embedding into a decoder for response generation, pointing to this project and this project

We evaluate both types of pipelines against the existing methods, and demonstrate their effectivness.

Toolstack: Python, PyTorch

[Honour Year Project Thesis]
Disentangle Multi-party Conversations as Reordering with Curriculum [Hard Disentanglement]

Supervisor: Professor Tat Seng Chua, Professor Lizi Liao

We observed existing conversation disentanglement models' heavy reliance on handcrafted features and their dependence on Ubuntu IRC. This necessaites the development of a more generalizable techniques that can make the model disentangle as effectively in other settings.

We propose to use reordering as a generalizable strategy to make conversation disentanglement more effective. Reordering has 2 roles. In training, it is used to generate new dialogues and augment the dataset. In inference, it is used recursively to correct the model's prediction.

Toolstack: Python, PyTorch

[Paper]
Multi-party Response Generation with Relation Disentanglement [Soft Disentanglement]

Supervisor: Professor Tat Seng Chua, Professor Lizi Liao

While there have been works on multi-party response generation, they mostly require the dialogue structure ground truth to be given in advance, which undermines their usefulness when such ground truth is not present.

As a continuation of the previous work, in this project, we investigate how to automatically infer the relations via relational thinking on subtle clues inside the conversation context without any human label, and leverage these relations to guide the neural response generation.

Toolstack: Python, PyTorch

[Paper]
Conversation Disentanglement with Bi-Level Contrastive Learning [Hard Disentanglement]

Supervisor: Professor Tat Seng Chua, Professor Lizi Liao

We observed that many existing conversation disentanglement models do not pay sufficient attention to the dialogue context, while context is crucial in cases where the query utterance is ambiguous but the surrounding utterances indicate its belonging.

We propose to formulate Conversation Disentanglement as a global clustering task, in contrast to how it is traditionally solved as a pointing task. We use contrastive learning to pull embeddings of utterances from the same session closer and pull away those from different sessions. K means is then used to cluster the utterances.

Toolstack: Python, PyTorch

[Paper] [Slides]
Relational Thinking-based Response Generation [Soft Disentanglement]

Supervisor: Professor Tat Seng Chua, Professor Lizi Liao

Relational Thinking is a process where people spontaneously perceive complex relationships among objects or concepts. It was originated and developed in cognitive neuroscience and has remained under-explored in the NLP community.

In this project, we work on the task of multi-party response generation, and we make use of Relational Thinking to infer a reply-to relation graph to filter the context in a multi-party conversation. This context is then used by the decoder for effective response generation.

Toolstack: Python, PyTorch

[Paper]
Bank Code Center

I developed a service to store bank records for the Global Payment team in ByteDance. Its functionalities include

• Submission of bank record CRUD requests through the user interface for internal enginners, capable of batch submission
• Audition and execution of the CRUD requests
• Cached reading of bank records from real time transcations

Toolstack: Go, Kite, MongoDB
RepoSense

RepoSense is an open source contribution analysis tool for Git repositories. It is particularly useful for those who want a chronological visualisation of the code contributions and commits made to a group of repositories.

As a project developer and mentor, I personally

• Implemented highlighting of selected single entry or merged group on the summary panel
• Implemented binary file support in authorship contribution panel
• Wrote developer tutorial and release guide for RepoSense
• Reviewed close to 100 pull requests and made 5 product releases

Toolstack: JavaScript, Vue, Java

[Product Site] [Repository]
Transcation monitoring bot

I developed a monitoring bot to monitor the customer transcations for the Global Payment team in ByteDance. It is capable of

• Receiving alerts from an internal platform that monitors a Time Series Database (TSDB) storeing metrics emitted by the payment system
• Sending messages to engineers in charge and help them handle the alerts
• CRUD of alert trigger rules that are used by the internal monitoring platform

Toolstack: Go, Ginex, Kite, MongoDB, Lark Open Platform
UWave Mobile Application

UWave is a mobile application that has more than 10000 users on campus. It provides various utilities to help people on campus connect with each other, learn about campus life, and manage their studies.

I collaborated with other members in the development team of UWave and

• Implemented part of the module calendar page
• Implemented features on several other pages such as the but timing page, library access page, community page, and settings page
• Supported its user growth especially in the NUS community

Toolstack: Dart, Flutter

[Android] [IOS]

Teaching

Teaching Activities

CS3230, Design and Analysis of Algorithms, NUS
Aug 2021 — Nov 2021
Conduct weekly tutorials and consultation sessions; Serve as proctor; Mark the assignments and submissions
CS2100, Computer Organization, NUS
Jan 2021 — April 2021
Conduct weekly lab sessions; Mark the lab reports
CS1231S, Discrete Structures, NUS
Aug 2020 - Nov 2020
Conduct weekly tutorials and consultation sessions; Serve as proctor; Mark the assignments and submissions
CS1101S, Programming Methodology, NUS
Aug 2020 - Nov 2020
Conduct weekly tutorials and consultation sessions; Mark the assignments

Awards

Honors and Awards

IMDA Exellence in Computing Price
5 June 2023
A cash prize of $3500 awarded to the Most Outstanding Graduate in the Bachelor of Computing programme from the Infocomm Media Development Authority in Singapore; Awarded to 1/759 student in School of Computing
Dean's List, School of Computing
Jan - May 2023; Jan - May 2021; Aug - Dec 2020
Awarded to the top 5 percent of students in the cohort in School of Computing in the corresponding semester
Senior Middle 2 Scholarship
Aug 2018 - May 2023
A 5-year scholarship that covers the tuition and living costs of the 1-year bridging course and 4-year undergraduate study in Singapore

Previous Research Topics

Conversation Disentanglement

Response Generation

Conversation Disentanglement with Bi-Level Contrastive Learning

Helping Large Language Models Generate Better Citations with Fine-Grained Rewards

Unleashing the Power of Reference-Based Phishing Detectors with Multimodal Knowledge Graphs

Helping Large Language Models Generate Better Citations with Fine-Grained Rewards

Unleashing the Power of Reference-Based Phishing Detectors with Multimodal Knowledge Graphs

Multi-party Response Generation with Conversation Disentanglement

Disentangle Multi-party Conversations as Reordering with Curriculum [Hard Disentanglement]

Multi-party Response Generation with Relation Disentanglement [Soft Disentanglement]

Conversation Disentanglement with Bi-Level Contrastive Learning [Hard Disentanglement]

Relational Thinking-based Response Generation [Soft Disentanglement]

Bank Code Center

RepoSense

Transcation monitoring bot

UWave Mobile Application

Teaching Activities

CS3230, Design and Analysis of Algorithms, NUS

CS2100, Computer Organization, NUS

CS1231S, Discrete Structures, NUS

CS1101S, Programming Methodology, NUS

Honors and Awards

IMDA Exellence in Computing Price

Dean's List, School of Computing

Senior Middle 2 Scholarship