hzwer/MM2022-ViCoPerceptualHeadGenerationPublic

NotificationsYou must be signed in to change notification settings
Fork1
Star55

MM2022 Workshop-Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer

License

MIT license

55 stars 1 fork Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
images		images
LICENSE		LICENSE
README.md		README.md
audio_to_face.py		audio_to_face.py
inference.py		inference.py
speaker_generator.py		speaker_generator.py
u2net.py		u2net.py

Repository files navigation

Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer

Zhewei Huang,Ailin Huang,Shuchang Zhou

YouTube |arXiv

Introduction

This project is the implement ofPerceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer. We ranked first place in the listening head generation track and second place in the talking head generation track in the official ranking ofMM2022-ViCo Conversational Head Generation Challenge. Our team name is Megvii_goodjuice.

The whole pipeline of challenge can be found onvico challenge baseline. We currently provide our major modification of the baseline.

Contributors:@P2Oileen,@hzwer

Modification

Architecture & Pipeline

Clonevico challenge baseline, replacevico_challenge_baseline/vico/networks/audio_to_face.py,vico_challenge_baseline/vico/networks/speaker_generator.py,vico_challenge_baseline/PIRender/inference.py, and moveu2net.py tovico_challenge_baseline/PIRender/u2net.py.

Fusion

UseU2Net to segment the backgrounds and fuse them to remain the background unchanged. You should also download U2Net weightsu2net_human_seg.pth fromGoogle Drive and save invico_challenge_baseline/PIRender/u2net_human_seg.pth.

Image Boundary Inpainting

Change the padding mode of grid_sample inPIRenderer from "zeros" to "border".

Citation

@InProceedings{huang2022perceptual,  title={Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer},  author={Huang, Ailin and Huang, Zhewei and Zhou, Shuchang},  booktitle={Proceedings of the 30th ACM International Conference on Multimedia (MM'22)},  year={2022}}The ViCo baseline method: @InProceedings{zhou2022responsive,    title={Responsive Listening Head Generation: A Benchmark Dataset and Baseline},    author={Zhou, Mohan and Bai, Yalong and Zhang, Wei and Yao, Ting and Zhao, Tiejun and Mei, Tao},    booktitle={Proceedings of the European conference on computer vision (ECCV)},    year={2022}}

About

MM2022 Workshop-Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer

Languages

Python100.0%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer

YouTube |arXiv

Introduction

Modification

Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Contributors3

Uh oh!

Languages

Movatterモバイル変換

License

hzwer/MM2022-ViCoPerceptualHeadGeneration

Folders and files

Latest commit

History

Repository files navigation

Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer

YouTube |arXiv

Introduction

Modification

Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Contributors3

Uh oh!

Languages

Packages