E193 - Institut für Visual Computing and Human-Centered Technology
-
Date (published):
2025
-
Number of Pages:
149
-
Keywords:
WiFi Sensing; Wireless Sensing; Through-Wall Sensing; Channel State Information; Person-Centric Sensing; Human Activity Recognition; Cross-Domain Generalization; Image Synthesis; Deep Learning; Computer Vision
de
WiFi Sensing; Wireless Sensing; Through-Wall Sensing; Channel State Information; Person-Centric Sensing; Human Activity Recognition; Cross-Domain Generalization; Image Synthesis; Deep Learning; Computer Vision
en
Abstract:
WiFi-based Person-Centric Sensing (PCS) offers a visual privacy-preserving alternative to optical methods by enabling passive, contactless monitoring through existing wireless infrastructure. Its low cost, unobtrusive nature, and wall-penetrating capability make it well suited for large-scale indoor monitoring applications. However, practical deployment remains constrained by data scarcity, limited sensing range of consumer off-the-shelf (COTS) hardware, computational inefficiencies, poor cross-domain generalization, and the abstract, non-intuitive nature of WiFi signals. To address the data scarcity, five publicly available WiFi Channel State Information (CSI)-based PCS datasets are contributed: TOA, Wallhack1.8k, HALOC, 3DO, and WiFiCam. Each dataset targets distinct challenges in long-range and through-wall sensing, domain generalization, and crossmodal translation. Building on this foundation, it is demonstrated that directional sensing with low-cost COTS WiFi systems enables long-range through-wall PCS. Experiments confirm robust presence detection, activity recognition, and localization up to 20 meters and across multiple rooms with a single-link setup, validating the effectiveness of the proposed directional sensing approach in complex indoor environments. To support real-time inference under resource constraints, WiFlexFormer, a lightweight Transformer architecture tailored to the temporal and spectral characteristics of WiFi CSI, is introduced. With only ≈ 50k parameters, it achieves inference latencies of ≈ 10 ms on embedded hardware while matching or surpassing the performance of significantly larger generic vision and RF-specific architectures. To improve robustness across domains, data augmentation and preprocessing strategies that enhance generalization without target-domain access are investigated. Building on these insights, the Domain-Adversarial Test-Time Adaptation (DATTA) framework is proposed. DATTA leverages domain-adversarial training, test-time adaptation, random weight resetting, and data augmentation to enable robust, real-time adaptation to domain shifts, achieving state-of-the-art cross-domain generalization performance. Lastly, the thesis presents the first approach to synthesize RGB images from WiFi CSI in through-wall scenarios. The WiFiCam architecture, based on a multimodal variational autoencoder, reconstructs coherent, semantically meaningful images, enabling camera-free visual monitoring and improving the interpretability of CSI for downstream tasks. These contributions address core limitations in data, hardware, efficiency, generalization, and interpretability, advancing WiFi-based PCS toward scalable real-world deployment.
en
Additional information:
Arbeit an der Bibliothek noch nicht eingelangt - Daten nicht geprüft