IMPROVING PORTRAIT STYLE TRANSFER

Authors

  • Pavlo Kopka National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute", Ukraine
  • Artem Volokyta National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute", Ukraine https://orcid.org/0000-0001-9069-5544

Keywords:

style transfer, portrait images, transformer models, self-attention, cross-attention, facial segmentation

Abstract

The paper presents an approach to improving artistic style transfer for portrait images by explicitly accounting for facial geometry and semantic heterogeneity of face regions. The proposed GeometricBiasAttention and SemanticGatedCrossAttention modules are integrated into the transformer-based architectures StyTr2 and S2WAT. Experiments on FFHQ and WikiArt Faces demonstrate improved identity preservation and more controllable stylization. The modified models also show that a smaller configuration can outperform a larger baseline one across several quality metrics, including FID, SSIM, and LPIPS.

References

Gatys L. A., Ecker A. S., Bethge M. A neural algorithm of artistic style. arXiv preprint arXiv:1508.06576, 2016.

Deng Y. et al. StyTr2: Image style transfer with transformers. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022.

Fan C., Chen J., Xu X. S2WAT: Image style transfer via hierarchical vision transformer using strips window attention. Proceedings of the AAAI Conference on Artificial Intelligence, 2024.

Vaswani A. et al. Attention is all you need. Advances in Neural Information Processing Systems, 2017.

Simonyan K., Zisserman A. Very deep convolutional networks for large-scale image recognition. ICLR, 2015.

Johnson J., Alahi A., Fei-Fei L. Perceptual losses for real-time style transfer and super-resolution. ECCV, 2016.

Huang X., Belongie S. Arbitrary style transfer in real time with adaptive instance normalization. ICCV, 2017.

Yu C. et al. BiSeNet: Bilateral segmentation network for real-time semantic segmentation. ECCV, 2018.

Deng J. et al. ArcFace: Additive angular margin loss for deep face recognition. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.

Ushio A. WikiArt Face: Face image dataset from portraits, 2024.

Published

2026-05-09

Issue

Section

Machine learning, Big Data (AI)