AIpparel: A Large Multimodal Generative Model for Digital Garments

  • 2024-12-13 06:15:54
  • Kiyohiro Nakayama, Jan Ackermann, Timur Levent Kesdogan, Yang Zheng, Maria Korosteleva, Olga Sorkine-Hornung, Leonidas J. Guibas, Guandao Yang, Gordon Wetzstein
  • 0

Abstract

Apparel is essential to human life, offering protection, mirroring culturalidentities, and showcasing personal style. Yet, the creation of garmentsremains a time-consuming process, largely due to the manual work involved indesigning them. To simplify this process, we introduce AIpparel, a largemultimodal model for generating and editing sewing patterns. Our modelfine-tunes state-of-the-art large multimodal models (LMMs) on a custom-curatedlarge-scale dataset of over 120,000 unique garments, each with multimodalannotations including text, images, and sewing patterns. Additionally, wepropose a novel tokenization scheme that concisely encodes these complex sewingpatterns so that LLMs can learn to predict them efficiently. AIpparelachievesstate-of-the-art performance in single-modal tasks, including text-to-garmentand image-to-garment prediction, and enables novel multimodal garmentgeneration applications such as interactive garment editing. The projectwebsite is at georgenakayama.github.io/AIpparel/.