🏄‍♀️ About Me

I am a MEng student studying Computer Engineering with some expertise in Software Development and Deep Learning.
Currently, I am working with Prof. Jianbo Jiao and Prof. Yunchao Wei.
Before that, I have worked closely with Dr. Erick Purwanto and Prof. Jie Zhang during my undergraduate study.
Fluent in English, Mandarin, and Japanese (Learned through an unsupervised manner).
You are more than welcome to look at my CV on the navigation bar or here.

ΟΣΟΝ ΖΗΣ ΦΑΙΝΟΥ / ΜΗΔΕΝ ΟΛΩΣ ΣΥ / ΛΥΠΟΥ ΠΡΟΣ ΟΛΙ / ΓΟΝ ΕΣΤΙ ΤΟ ΖΗΝ / ΤΟ ΤΕΛΟΣ Ο ΧΡΟ / ΝΟΣ ΑΠΑΙΤΕΙ —-ΣΕΙΚΙΛΟΣ ΕΥΤΕΡ

🔥 News

2024.12 🎉🎉 Glad to join WEI Lab@BJTU as a research intern.
2023.12: 🎉🎉 Glad to join MIx group as a research intern.
2023.08: 🎉🎉 Started my Master degree in UIUC ECE.
2023.07: 🎉🎉 Got my distinction bachelor degree at University of Liverpool and XJTLU.

📖 Education

2023.08 - now, MEng in Electrical and Computer Engineering, University of Illinois, Urbana-Champaign, GPA:3.94/4 [Transcript]
2019.09 - 2023.07, BSc in Information and Computing Science, University of Liverpool, Distinction: [Transcript]

📝 Publications

Under Review

Audio-Visual Separation with Hierarchical Fusion and Representation Alignment, Han Hu^*, Dongheng Lin^*, Qiming Huang, Yuqi Hou, Hyung Jin Chang, Jianbo Jiao^†

We proposed Hierarchical Fusion Strategy that integrates middle and late fusion to enhance separation across diverse sound characteristics.
We introduced a new loss term for the task that aligns U-Net’s latent audio features with pre-trained audio embeddings (CLAP) to improve semantic quality and reduce modality gap.
It achieves superior results on MUSIC and MUSIC-21 datasets, surpassing previous self-supervised methods.

Under Review

What Time Tells Us? Learning time-Awareness from Static Images, Dongheng Lin^*, Han Hu^*, Jianbo Jiao^†

A new paradigm to learn visual cues for timestamps leading to time-aware understanding on random visual inputs.
Built a new benchmark dataset with 130k accurate samples paired with metadata.
Tested the learned representations on various downstream tasks (Image Retrieval, Video Scene Recognition and Time-conditioned Image Editing).

IJCNN 2024

SPSS: A Salience-based Poisoning Selection Strategy for Selecting Backdoor Attack Victims, Zihan Lyu¹, Dongheng Lin¹, Jie Zhang^†, Ruiming Zhang²

Designed an algorithm uses Salience Metric to evaluate sample feature significance towards backdoor learning process.
With its assistance, we managed to realize a more data-efficient backdoor attack to DNN models achieved the same attack success rate to vanilla backdoor attack with only 38.44% of poisoned samples.

CyberC 2022 (IEEE)

Conditional Metadata Embedding Data Preprocessing Method for Semantic Segmentation

Juntuo Wang¹, Qiaochu Zhao¹, Dongheng Lin¹, Erick Purwanto^†, Ka Lok Man²

In this paper, we propose a conditional data preprocessing strategy, i.e., Conditional Metadata Embedding (CME) data preprocessing strategy. The CME data preprocessing method will embed conditional information to the training data, which can assist the model to better overcome the differences in the datasets and extract useful feature information in the images.

🎖 Honors and Awards

2022.10 XJTLU 2022 Summer Undergraduate Research Fellowship Poster Group Winner
2022.07 University Academic Excellence Award — Scholarship for top 5% students

💻 Internships

2023.11 - Now, Research Intern at MiX group at UofB, UK (Remote).
2021.06 - 2021.10, Backend Developer Intern at Pactera, China.

🖨 Projects

Arxiv Explorer: An efficient paper recommendation system we developed on a full dataset of all available Arxiv Papers (until Feb 2024).

All papers are compressed into paper embeddings using category, abstract, and title, stored in a FAISS indexing system.
We modeled a coauthorship map to conduct authority/hub based HITS reranking of the paper recommendation results.

MapleJuice: A light-weight counterpart of Hadoop supported with SQL-like query

The distributed system is built upon a self-implemented file system similar to GFS with corresponding NameNode and DataNode.
We also implemented an efficient Gossip-style failure detection protocol to maintain all the node status using UDP packets, a Bully-algorithm based re-election ensures new leader will be available in case of any failures on master nodes.
The task scheduling mechanism is similar to MapReduce, ensures the parallelism among nodes. We tested the system against Hadoop within a cluster with 10 VMs. The MapleJuice is generally 25% faster than Hadoop when dealing with small clusters.

My undergraduate thesis: StyleDiffuser: Cartoon-Style Image Creation with Diffusion Model and GAN Fusion

In this work, I have introduced a novel approach that fuses Generative Adversarial Networks (GANs) with the Stable Diffusion Model for creating cartoon-style images.
Utilizes StyleGAN2 generated feature maps and corresponding metadata to constrain the Stable Diffusion Network.
Reduces the number of diffusion steps required for the model to converge to a final image, streamlining the image generation process.
The method reduces the reliance on verbose prompts for controlling the output, making the generation process more straightforward.

📎Homework

Apart from these projects, here are few interesting written homeworks to refer: Homework Samples.

🎪 Miscellaneous

A typical Otaku.
- My favorite manga: 臨海センチメント (Nostalgic Ocean Landscape).
- Favorite Light Novel (Single Episode): 『続・終物語』
I also enjoy photography and music.

FUJICHROME Velvia 100, taken at my hometown.

✍Erdős Number

My Erdős number is 4, calculated via two paths:

Dongheng Lin → Jianbo Jiao → Ana I. L. Namburete → Israel Koren → Paul Erdős
Dongheng Lin → Ka Lok Man → Prudence W. H. Wong → Shmuel Zaks → Paul Erdős
A few friends of mine: Youheng Zhu, Jiawei (Kyle) Zhang

📝Service

Invited Reviewer: BMVC, IJCNN

Dongheng Lin