Publications - Zhen Qin

SeMi: When Imbalanced Semi-Supervised Learning Meets Mining Hard Examples

Yin Wang, Zixuan Wang, Hao Lu, Zhen Qin^*, Hailiang Zhao, Guanjie Cheng, Ge Su, Li Kuang, Mengchu Zhou, Shuiguang Deng^* (^* corresponding author)

ACM International Conference on Multimedia (ACM MM) 2025

Semi-Supervised Learning (SSL) can leverage abundant unlabeled data to boost model performance. However, the class-imbalanced data distribution in real-world scenarios poses great challenges to SSL, resulting in performance degradation. Existing class-imbalanced semi-supervised learning (CISSL) methods mainly focus on rebalancing datasets but ignore the potential of using hard examples to enhance performance, making it difficult to fully harness the power of unlabeled data even with sophisticated algorithms. To address this issue, we propose a method that enhances the performance of Imbalanced Semi-Supervised Learning by Mining Hard Examples (SeMi). This method distinguishes the entropy differences among logits of hard and easy examples, thereby identifying hard examples and increasing the utility of unlabeled data, better addressing the imbalance problem in CISSL. In addition, we maintain a class-balanced memory bank with confidence decay for storing high-confidence embeddings to enhance the pseudo-labels' reliability. Although our method is simple, it is effective and seamlessly integrates with existing approaches. We perform comprehensive experiments on standard CISSL benchmarks and experimentally demonstrate that our proposed SeMi outperforms existing state-of-the-art methods on multiple benchmarks, especially in reversed scenarios, where our best result shows approximately a 54.8\% improvement over the baseline methods.

2025

SeMi: When Imbalanced Semi-Supervised Learning Meets Mining Hard Examples

SeMi: When Imbalanced Semi-Supervised Learning Meets Mining Hard Examples

The Synergy Between Data and Multi-Modal Large Language Models: A Survey From Co-Development Perspective

The Synergy Between Data and Multi-Modal Large Language Models: A Survey From Co-Development Perspective

Federated Knowledge Distillation using Hierarchical Reinforcement Learning in Resource-Constrained IoT Edge-Cloud Computing Environments

Federated Knowledge Distillation using Hierarchical Reinforcement Learning in Resource-Constrained IoT Edge-Cloud Computing Environments

Federated Data-Efficient Instruction Tuning for Large Language Models

Federated Data-Efficient Instruction Tuning for Large Language Models

ExploraCoder: Advancing Code Generation for Multiple Unseen APIs via Planning and Chained Exploration

ExploraCoder: Advancing Code Generation for Multiple Unseen APIs via Planning and Chained Exploration

Vertical Federated Learning in Practice: The Good, the Bad, and the Ugly

Vertical Federated Learning in Practice: The Good, the Bad, and the Ugly

2024

Personalized Federated Fine-Tuning for LLMs via Data-Driven Heterogeneous Model Architectures

Personalized Federated Fine-Tuning for LLMs via Data-Driven Heterogeneous Model Architectures

Federated Full-Parameter Tuning of Billion-Sized Language Models with Communication Cost under 18 Kilobytes

Federated Full-Parameter Tuning of Billion-Sized Language Models with Communication Cost under 18 Kilobytes

Adaptive Scheduling of High-Availability Drone Swarms for Congestion Alleviation in Connected Automated Vehicles

Adaptive Scheduling of High-Availability Drone Swarms for Congestion Alleviation in Connected Automated Vehicles

LARA: A Light and Anti-overfitting Retraining Approach for Unsupervised Time Series Anomaly Detection

LARA: A Light and Anti-overfitting Retraining Approach for Unsupervised Time Series Anomaly Detection

BlockDFL: A Blockchain-based Fully Decentralized Peer-to-Peer Federated Learning Framework

BlockDFL: A Blockchain-based Fully Decentralized Peer-to-Peer Federated Learning Framework

Learning Multi-Pattern Normalities in the Frequency Domain for Efficient Time Series Anomaly Detection

Learning Multi-Pattern Normalities in the Frequency Domain for Efficient Time Series Anomaly Detection

Resisting Backdoor Attacks in Federated Learning via Bidirectional Elections and Individual Perspective

Resisting Backdoor Attacks in Federated Learning via Bidirectional Elections and Individual Perspective

2023

FedAPEN: Personalized Cross-silo Federated Learning with Adaptability to Statistical Heterogeneity

FedAPEN: Personalized Cross-silo Federated Learning with Adaptability to Statistical Heterogeneity

6G Data Plane: A Novel Architecture Enabling Data Collaboration with Arbitrary Topology

6G Data Plane: A Novel Architecture Enabling Data Collaboration with Arbitrary Topology

ST-EUA: Spatio-Temporal Edge User Allocation With Task Decomposition

ST-EUA: Spatio-Temporal Edge User Allocation With Task Decomposition

2022

DeepWSC: Clustering Web Services via Integrating Service Composability into Deep Semantic Features

DeepWSC: Clustering Web Services via Integrating Service Composability into Deep Semantic Features

2021

Towards the optimality of service instance selection in mobile edge computing

Towards the optimality of service instance selection in mobile edge computing

2020

TD-EUA: Task-Decomposable Edge User Allocation with QoE Optimization

TD-EUA: Task-Decomposable Edge User Allocation with QoE Optimization

2019

DeepWSC: A Novel Framework with Deep Neural Network for Web Service Clustering

DeepWSC: A Novel Framework with Deep Neural Network for Web Service Clustering