AI Turns Text Into Videos And 3D Models

Text-to-video and text-to-3D model services are likely to sweep the web in the coming months, but the breakthrough raises important issues of bias, accountability and transparency.

By The Physics arXiv Blog

Oct 11, 2022 1:30 PM

(Credit: metamorworks/Shutterstock)

Text-to-image generators have swept the web in recent months. These AI systems turn a written description into an image. Enter “an astronaut riding a white horse,” and the system produces an image of, well, an astronaut riding a white horse.

One of the first of these services, DALL-E, developed by OpenAI, appeared early last year and produced reasonably well-rendered images. But advances since then have been striking. DALL-E 2, launched earlier this year, produces higher-resolution images of surprising realism. Other systems look equally impressive.

Nevertheless, this technology has generated controversy because of its biases and potential for abuse. For example, ask DALL-E 2 to produce an image of a doctor and it will show you a man in a white coat. Ask it for an image of a nurse and it will invariably produce an image of a woman.

But the approaches for tackling bias and preventing abuse are advancing slowly compared to the technology itself. And that raises questions about the challenges that more advanced AI systems are likely to pose.
