{"id":71779,"date":"2022-12-21T17:31:34","date_gmt":"2022-12-21T15:31:34","guid":{"rendered":"https:\/\/forklog.com\/en\/?p=71779"},"modified":"2025-09-08T11:04:48","modified_gmt":"2025-09-08T08:04:48","slug":"openai-unveils-point-e-a-3d-generation-model","status":"publish","type":"post","link":"https:\/\/forklog.com\/en\/openai-unveils-point-e-a-3d-generation-model\/","title":{"rendered":"OpenAI unveils POINT-E, a 3D-generation model"},"content":{"rendered":"<p>OpenAI has released a new algorithm for generating three-dimensional images from text prompts, POINT-E.<\/p>\n<p>According to the study, the models require a single Nvidia V100 GPU and about two minutes to create an image.<\/p>\n<p>The algorithm does not generate 3D objects in the traditional sense. It creates &#8216;point clouds&#8217;, discrete sets of data points in space that represent a three-dimensional form.<\/p>\n<p>Researchers noted that such data are easier to synthesise computationally. However, they do not capture the object&#8217;s detailed structure, shape, or texture.<\/p>\n<figure class=\\\"wp-block-image size-full\\\"><img loading=\\\"lazy\\\" decoding=\\\"async\\\" width=\\\"680\\\" height=\\\"170\\\" src=\\\"https:\/\/forklog.com\/wp-content\/uploads\/E Com-maker.gif\\\" alt=\\\"E com-maker\\\" class=\\\"wp-image-193975\\\"\/><figcaption>Three-dimensional objects created with POINT-E. Data: OpenAI.<\/figcaption><\/figure>\n<p>To overcome this limitation, the OpenAI team trained an additional AI system to convert POINT-E point clouds into meshes.<\/p>\n<p>POINT-E itself consists of two parts:<\/p>\n<ul class=\\\"wp-block-list\\\">\n<li>a text-to-image model;<\/li>\n<li>an image-to-3D model.<\/li>\n<\/ul>\n<p>The text-to-image model works similarly to DALL-E 2. 
It was trained on labelled images so that the algorithm understands associations between words and visual concepts.<\/p>\n<p>The image-to-3D model was trained on pairs of images and corresponding three-dimensional objects.<\/p>\n<p>For example, if you enter the text prompt &#8216;A cat eats a burrito&#8217;, POINT-E will first generate a synthetic image consistent with the prompt. The second model will then synthesise a rough &#8216;cloud&#8217; of 1024 points and refine it to 4096 points.<\/p>\n<figure class=\\\"wp-block-image size-large\\\"><img loading=\\\"lazy\\\" decoding=\\\"async\\\" width=\\\"750\\\" height=\\\"1024\\\" src=\\\"https:\/\/forklog.com\/wp-content\/uploads\/Screenshot-17-1-750x1024.png\\\" alt=\\\"Screenshot-17-1\\\" class=\\\"wp-image-193976\\\" srcset=\\\"https:\/\/forklog.com\/wp-content\/uploads\/Screenshot-17-1-750x1024.png 750w, https:\/\/forklog.com\/wp-content\/uploads\/Screenshot-17-1-220x300.png 220w, https:\/\/forklog.com\/wp-content\/uploads\/Screenshot-17-1.png 760w\\\" sizes=\\\"auto, (max-width: 750px) 100vw, 750px\\\" \/><figcaption>Turning a 2D image into 3D. Data: OpenAI.<\/figcaption><\/figure>\n<p>According to the researchers, after training the models on a dataset of &#8216;several million&#8217; 3D objects and associated metadata, POINT-E can generate coloured point clouds that correspond to textual prompts. They acknowledged the model&#8217;s imperfect performance, but noted the speed of generation.<\/p>\n<blockquote class=\\\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\\\">\n<p>&#8220;Although our method yields worse results in this evaluation than the most advanced methods, it provides samples in a small fraction of the time. 
This could make it more practical for certain applications or enable the discovery of higher-quality 3D objects,&#8221; the developers said.<\/p>\n<\/blockquote>\n<p>OpenAI released the open-source code for the project on <a href=\\\"https:\/\/github.com\/openai\/point-e\\\" target=\\\"_blank\\\" rel=\\\"noopener nofollow\\\" title=\\\"\\\">GitHub<\/a>.<\/p>\n<p>In December, the company <a href=\"https:\/\/forklog.com\/en\/news\/openai-unveils-chatgpt-a-chatbot-designed-for-dialogue\">introduced the ChatGPT chatbot<\/a>, built on a large language model.<\/p>\n<p>In April, OpenAI <a href=\"https:\/\/forklog.com\/en\/news\/openai-updates-dall-e-the-text-to-image-generator\">released the second version of its image generator<\/a>, the text-to-image model DALL-E.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>OpenAI has released a new algorithm for generating three-dimensional images from text prompts, 
POINT-E.<\/p>\n","protected":false},"author":1,"featured_media":71780,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"select":"1","news_style_id":"1","cryptorium_level":"","_short_excerpt_text":"","creation_source":"","_metatest_mainpost_news_update":false,"footnotes":""},"categories":[3],"tags":[438,1190],"class_list":["post-71779","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news-and-analysis","tag-artificial-intelligence","tag-openai"],"aioseo_notices":[],"amp_enabled":true,"views":"20","promo_type":"1","layout_type":"1","short_excerpt":"","is_update":"","_links":{"self":[{"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/posts\/71779","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/comments?post=71779"}],"version-history":[{"count":1,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/posts\/71779\/revisions"}],"predecessor-version":[{"id":71781,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/posts\/71779\/revisions\/71781"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/media\/71780"}],"wp:attachment":[{"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/media?parent=71779"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/categories?post=71779"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/tags?post=71779"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}