Low-light image enhancement through deep learning can improve noise reduction and visibility. However, existing methods often lack the ability to perform semantic-level, quantitative brightness adjustments, limiting their capacity for personalized lighting control. To address these limitations, we propose a novel framework that utilizes a Large Language Model (LLM) capable of interpreting natural language prompts to identify target objects and specify brightness modifications. The framework then employs a Retinex-based Reasoning Segment (RRS) module to generate accurate target localization masks. Concurrently, a Text-based Brightness Controllable (TBC) module applies precise brightness adjustments based on the natural language input. To integrate these components seamlessly, we introduce an Adaptive Contextual Compensation (ACC) module, which synthesizes multi-source input conditions and guides a conditional diffusion model to perform accurate lighting adjustments while maintaining overall image coherence. Experimental results on benchmark datasets demonstrate the system's superior performance in enhancing visibility, maintaining natural color balance, and amplifying fine details without introducing artifacts. Our framework also exhibits strong generalization, enabling complex, semantic-level, personalized lighting adjustments through natural language interaction across diverse scenarios.
Results of five tasks, labeled A through E: Tasks A and B aim to decrease brightness, while the others aim to increase it, covering a range of scenes such as stage performances, everyday environments, and medical images. Each task modifies the brightness of a masked object or area (main character, lady, blackboard, side representing evil, left lung) by a percentage ranging from 10% to 40%.
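The per-task operation described above, scaling the brightness of a masked region by a signed percentage, can be sketched with plain NumPy. This is a minimal illustration of the target behavior only, not the authors' diffusion-based method; the function name and interface are our own.

```python
import numpy as np

def adjust_masked_brightness(image, mask, percent):
    """Scale the brightness of masked pixels by a signed percentage.

    image:   float array in [0, 1], shape (H, W, C)
    mask:    boolean array, shape (H, W), True where the target lies
    percent: e.g. +40 brightens the region by 40%, -10 darkens it by 10%
    """
    out = image.copy()
    scale = 1.0 + percent / 100.0
    # Boolean (H, W) mask selects the masked pixels across all channels.
    out[mask] = np.clip(out[mask] * scale, 0.0, 1.0)
    return out

# Example: brighten a 2x2 patch of a uniform gray image by 40%.
img = np.full((4, 4, 3), 0.5)
m = np.zeros((4, 4), dtype=bool)
m[:2, :2] = True
res = adjust_masked_brightness(img, m, 40)
# Masked pixels become 0.7; unmasked pixels stay at 0.5.
```

A learned approach such as the framework described here additionally keeps the edited region consistent with the surrounding scene, which a naive per-pixel scale like this cannot do.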
The application of natural language processing enables complex lighting adjustments in images. In each task, natural language instructions drive the lighting adjustments for both the target and the background.
Face detection performance in low-light conditions across different methods. The figure presents visual comparisons of face detection using various enhancement techniques combined with DSFD. "Ours + DSFD" delivers the clearest and most accurate results compared to the raw input, EnlightenGAN, KinD++, LLflow, and ZeroDCE.