NSF RINGS: Scalable and Resilient Networked Learning Systems

Investigators: Gustavo de Veciana (ECE, UT Austin), and Haris Vikalo (ECE, UT Austin)
Students and Participants :


Support: This material is based upon work supported by the National Science Foundation under Grant No. 2148224 and is supported in part by funds from OUSD R&E, NIST, and industry partners as specified in the Resilient & Intelligent NextG Systems (RINGS) program.

Goal: Next-generation learning systems enabling applications in, e.g., healthcare, energy, banking, AR/VR design and car/robot navigation, will be privacy-driven, distributed and large-scale, resulting in substantially increased exposure to network congestion/failures. This research proposal centers on developing new, as well as expanding traditional, engineering principles for the design of resilient and scalable networked learning systems. To explore these challenges, we specifically leverage Federated Learning (FL) based systems as a model learning framework.

The proposed research centers on four interrelated themes wherein we combine the development of theoretical underpinnings, architecture, applications and protocol design.

Selected Publications to date

Federated learning under intermittent client availability and time-varying capacity constraints.
M. Ribero, H. Vikalo, and G. de Veciana.   IEEE Journal of Selected Topics in Signal Processing, 17(1):98-111, January 2023.


Network Adaptive Federated Learning: Congestion and Lossy Compression
P. Hegde, G. de Veciana and A. Moktari.   Proceedings of IEEE INFOCOM, May 2023, pp: 1-10. Extended version is
here.


Mohawk: Mobility and heterogeneity-aware dynamic community selection for hierarchical federated learning.
A.-J. Farcas, M. Lee, R. Kompella, H. Latapie, G. de Veciana and R.Marculescu.   In Proc. 8th ACM/IEEE Conference on Internet of Things Design and Implementation, pages 1--12, May 2023.


Federated learning at scale: Addressing client intermittency and resource constraints .
M. Ribero, H. Vikalo, and G. de Veciana.   IEEE Journal of Selected Topics in Signal Processing pages 1-14, July 2024.


Clustered federated learning via gradient partitioning.
Heasung Kim, Hyeji Kim, and G. de Veciana   In Proc. ICML, pages 1-11, July 2024.


Optimal aggregation via overlay trees: Delay-MSE tradeoffs under failures.
Parikshit Hegde and Gustavo de Veciana,   Proc. ACM Meas. Anal. Comput. Syst.,(POMACS) 8(3):1-37, December 2024.