Towards Robust and Fair Vision Learning in Open-World Environments

  • 2024-12-12 16:50:52
  • Thanh-Dat Truong
  • 0

Abstract

The dissertation presents four key contributions toward fairness androbustness in vision learning. First, to address the problem of large-scaledata requirements, the dissertation presents a novel Fairness Domain Adaptationapproach derived from two major novel research findings of Bijective MaximumLikelihood and Fairness Adaptation Learning. Second, to enable the capabilityof open-world modeling of vision learning, this dissertation presents a novelOpen-world Fairness Continual Learning Framework. The success of this researchdirection is the result of two research lines, i.e., Fairness ContinualLearning and Open-world Continual Learning. Third, since visual data are oftencaptured from multiple camera views, robust vision learning methods should becapable of modeling invariant features across views. To achieve this desiredgoal, the research in this thesis will present a novel Geometry-basedCross-view Adaptation framework to learn robust feature representations acrossviews. Finally, with the recent increase in large-scale videos and multimodaldata, understanding the feature representations and improving the robustness oflarge-scale visual foundation models is critical. Therefore, this thesis willpresent novel Transformer-based approaches to improve the robust featurerepresentations against multimodal and temporal data. Then, a novel DomainGeneralization Approach will be presented to improve the robustness of visualfoundation models. The research's theoretical analysis and experimental resultshave shown the effectiveness of the proposed approaches, demonstrating theirsuperior performance compared to prior studies. The contributions in thisdissertation have advanced the fairness and robustness of machine visionlearning.