This repository contains the source code for the paper "Text2Head: CLIP-Guided Latent Code Optimization".
We propose Text2Head, a novel method for generating neural parametric 3D head models driven by text descriptions. Our approach takes a textual prompt describing a person and outputs latent codes for geometry and appearance, which are then used to generate a textured 3D head with a pre-trained Monocular Neural Parametric Head Model (MonoNPHM). In contrast to existing approaches, we do not require the prior generation of ground-truth pairs of text prompts and latent codes, which can be limited in quality and availability. Instead, our method directly optimizes the latent codes with a CLIP loss. It faithfully generates 3D head models for a variety of applications.
A full setup is not possible at the moment, as the project is heavily based on MonoNPHM, which is not yet public. For everything else, follow the steps below.
Create an environment via

```bash
conda create -n "Text2Head" python=3.9
conda activate Text2Head
```

and install everything required with

```bash
pip install -r requirements.txt
```
Once MonoNPHM is public, follow their instructions to download the pre-trained weights and install all necessary packages.
notebooks/usage.ipynb provides an example script for generating textured 3D head models from text prompts; the core mesh-extraction step it relies on is sketched below.
The figure below shows generated results for a set of input text prompts.

This work was developed with the TUM Visual Computing Group led by Prof. Matthias Niessner. We thank him for his great support and supervision. Our work builds on MonoNPHM by Simon Giebenhain.
