arxiv:2402.00867

AToM: Amortized Text-to-Mesh using 2D Diffusion

Published on Feb 1

Submitted by akhaliq on Feb 2

Authors:

Guocheng Qian, Junli Cao, Aliaksandr Siarohin, Yash Kant, Chaoyang Wang, Michael Vasilkovsky, Hsin-Ying Lee, Ivan Skorokhodov, Peiye Zhuang, Igor Gilitschenski, Jian Ren

Abstract

We introduce Amortized Text-to-Mesh (AToM), a feed-forward text-to-mesh framework optimized across multiple text prompts simultaneously. In contrast to existing text-to-3D methods, which often entail time-consuming per-prompt optimization and commonly output representations other than polygonal meshes, AToM directly generates high-quality textured meshes in less than 1 second with around a 10-fold reduction in training cost, and generalizes to unseen prompts. Our key idea is a novel triplane-based text-to-mesh architecture with a two-stage amortized optimization strategy that ensures stable training and enables scalability. Through extensive experiments on various prompt benchmarks, AToM significantly outperforms state-of-the-art amortized approaches, with over 4 times higher accuracy (on the DF415 dataset), and produces more distinguishable and higher-quality 3D outputs. AToM demonstrates strong generalizability, offering fine-grained 3D assets for unseen interpolated prompts without further optimization during inference, unlike per-prompt solutions.
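For readers unfamiliar with the term "triplane", the following is a minimal sketch of how a generic triplane representation is typically queried: a 3D point is projected onto three axis-aligned feature planes (XY, XZ, YZ), each plane is sampled with bilinear interpolation, and the three feature vectors are combined. This is an illustration under stated assumptions, not the AToM implementation (whose code is not yet released): the function names, the nested-list plane layout, and the choice to sum rather than concatenate the sampled features are all hypothetical.

```python
from typing import List, Sequence

# Hypothetical layout: a plane is a [H][W][C] grid of feature vectors.
Plane = List[List[List[float]]]


def bilinear(plane: Plane, u: float, v: float) -> List[float]:
    """Bilinearly interpolate a feature vector at (u, v) in [-1, 1]^2."""
    h, w = len(plane), len(plane[0])
    # Clamp to [-1, 1], then map to continuous pixel coordinates.
    x = (min(max(u, -1.0), 1.0) + 1.0) * 0.5 * (w - 1)
    y = (min(max(v, -1.0), 1.0) + 1.0) * 0.5 * (h - 1)
    x0, y0 = int(x), int(y)
    x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
    tx, ty = x - x0, y - y0
    # Blend the four neighbouring feature vectors channel by channel.
    return [
        (1 - ty) * ((1 - tx) * a + tx * b) + ty * ((1 - tx) * c + tx * d)
        for a, b, c, d in zip(
            plane[y0][x0], plane[y0][x1], plane[y1][x0], plane[y1][x1]
        )
    ]


def query_triplane(planes: Sequence[Plane], point: Sequence[float]) -> List[float]:
    """Sum the features sampled from the XY, XZ and YZ planes at a 3D point."""
    x, y, z = point
    xy, xz, yz = planes
    samples = [bilinear(xy, x, y), bilinear(xz, x, z), bilinear(yz, y, z)]
    # Summation is an assumption here; concatenation is another common choice.
    return [sum(channel) for channel in zip(*samples)]


def constant_plane(value: float, size: int = 4, channels: int = 2) -> Plane:
    """Build a toy plane whose every feature channel equals `value`."""
    return [[[value] * channels for _ in range(size)] for _ in range(size)]
```

As a usage example, with three constant toy planes holding 1.0, 2.0 and 3.0, `query_triplane([constant_plane(1.0), constant_plane(2.0), constant_plane(3.0)], (0.0, 0.0, 0.0))` returns `[6.0, 6.0]`. In a text-to-mesh setting the queried feature would typically be decoded by small networks into geometry and texture values, but that part is outside what the abstract specifies.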

Community

Xillerate

Feb 23

Are there any links to the relevant datasets or the code used to get the results the paper shows? The GitHub link in the paper just goes to a GitHub-hosted synopsis of the paper.

guochengqian

Paper author Mar 3

Hi, thanks for your interest. Our code will be released upon acceptance.
